AU2011311659B2 - Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC) - Google Patents

Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC) Download PDF

Info

Publication number
AU2011311659B2
AU2011311659B2 AU2011311659A AU2011311659A AU2011311659B2 AU 2011311659 B2 AU2011311659 B2 AU 2011311659B2 AU 2011311659 A AU2011311659 A AU 2011311659A AU 2011311659 A AU2011311659 A AU 2011311659A AU 2011311659 B2 AU2011311659 B2 AU 2011311659B2
Authority
AU
Australia
Prior art keywords
audio signal
samples
configurable
ratio
configurable number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
AU2011311659A
Other languages
English (en)
Other versions
AU2011311659A1 (en
Inventor
Bruno Bessette
Guillaume Fuchs
Philippe Gournay
Bernhard Grill
Roch Lefebvre
Markus Multrus
Max Neuendorf
Nikolaus Rettelbach
Stephan Wilde
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VoiceAge Corp
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Original Assignee
VoiceAge Corp
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by VoiceAge Corp, Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV filed Critical VoiceAge Corp
Publication of AU2011311659A1 publication Critical patent/AU2011311659A1/en
Assigned to FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V., VOICEAGE CORPORATION reassignment FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V. Amend patent request/document other than specification (104) Assignors: FRAUNHOFER-GESELLSCHAFT ZUR FORDERUNG DER ANGEWANDTEN FORSCHUNG E.V., VOICEAGE CORPORATION
Application granted granted Critical
Publication of AU2011311659B2 publication Critical patent/AU2011311659B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0012Smoothing of parameters of the decoder interpolation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Laminated Bodies (AREA)
AU2011311659A 2010-10-06 2011-10-04 Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC) Active AU2011311659B2 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US39026710P 2010-10-06 2010-10-06
US61/390,267 2010-10-06
PCT/EP2011/067318 WO2012045744A1 (en) 2010-10-06 2011-10-04 Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)

Publications (2)

Publication Number Publication Date
AU2011311659A1 AU2011311659A1 (en) 2013-05-02
AU2011311659B2 true AU2011311659B2 (en) 2015-07-30

Family

ID=44759689

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2011311659A Active AU2011311659B2 (en) 2010-10-06 2011-10-04 Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC)

Country Status (17)

Country Link
US (1) US9552822B2 (enExample)
EP (1) EP2625688B1 (enExample)
JP (1) JP6100164B2 (enExample)
KR (1) KR101407120B1 (enExample)
CN (1) CN103403799B (enExample)
AR (2) AR083303A1 (enExample)
AU (1) AU2011311659B2 (enExample)
BR (1) BR112013008463B8 (enExample)
CA (1) CA2813859C (enExample)
ES (1) ES2530957T3 (enExample)
MX (1) MX2013003782A (enExample)
MY (1) MY155997A (enExample)
PL (1) PL2625688T3 (enExample)
RU (1) RU2562384C2 (enExample)
SG (1) SG189277A1 (enExample)
TW (1) TWI486950B (enExample)
WO (1) WO2012045744A1 (enExample)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2013003782A (es) * 2010-10-06 2013-10-03 Fraunhofer Ges Forschung Aparato y metodo para procesar una señal de audio y para otorgar una mayor granularidad temporal para un codificador-decodificador combinado y unificado de voz y audio (usac).
EP2777042B1 (en) * 2011-11-11 2019-08-14 Dolby International AB Upsampling using oversampled sbr
TWI557727B (zh) * 2013-04-05 2016-11-11 杜比國際公司 音訊處理系統、多媒體處理系統、處理音訊位元流的方法以及電腦程式產品
AU2014204540B1 (en) * 2014-07-21 2015-08-20 Matthew Brown Audio Signal Processing Methods and Systems
EP2980795A1 (en) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
EP2980794A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder using a frequency domain processor and a time domain processor
EP3107096A1 (en) * 2015-06-16 2016-12-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Downscaled decoding
EP3182411A1 (en) * 2015-12-14 2017-06-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing an encoded audio signal
RU2711513C1 (ru) * 2016-01-22 2020-01-17 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Устройство и способ оценивания межканальной разницы во времени
CN109328382B (zh) * 2016-06-22 2023-06-16 杜比国际公司 用于将数字音频信号从第一频域变换到第二频域的音频解码器及方法
US10249307B2 (en) * 2016-06-27 2019-04-02 Qualcomm Incorporated Audio decoding using intermediate sampling rate
TWI812658B (zh) 2017-12-19 2023-08-21 瑞典商都比國際公司 用於統一語音及音訊之解碼及編碼去關聯濾波器之改良之方法、裝置及系統
CN115668365B (zh) * 2020-05-20 2025-11-18 杜比国际公司 用于统一语音和音频解码改进的方法和装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6208276B1 (en) * 1998-12-30 2001-03-27 At&T Corporation Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
EP1204095A1 (en) * 1999-06-11 2002-05-08 NEC Corporation Sound switching device
US20100153122A1 (en) * 2008-12-15 2010-06-17 Tandberg Television Inc. Multi-staging recursive audio frame-based resampling and time mapping

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03286698A (ja) 1990-04-02 1991-12-17 Onkyo Corp ソフトドーム振動板
KR970011728B1 (ko) 1994-12-21 1997-07-14 김광호 음향신호의 에러은닉방법 및 그 장치
IT1281001B1 (it) * 1995-10-27 1998-02-11 Cselt Centro Studi Lab Telecom Procedimento e apparecchiatura per codificare, manipolare e decodificare segnali audio.
US6006108A (en) * 1996-01-31 1999-12-21 Qualcomm Incorporated Digital audio processing in a dual-mode telephone
DE19742655C2 (de) * 1997-09-26 1999-08-05 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Codieren eines zeitdiskreten Stereosignals
US6208671B1 (en) * 1998-01-20 2001-03-27 Cirrus Logic, Inc. Asynchronous sample rate converter
ES2247741T3 (es) * 1998-01-22 2006-03-01 Deutsche Telekom Ag Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio.
US6275836B1 (en) * 1998-06-12 2001-08-14 Oak Technology, Inc. Interpolation filter and method for switching between integer and fractional interpolation rates
EP1295390B1 (en) * 2000-06-23 2007-02-14 STMicroelectronics Asia Pacific Pte Ltd. Universal sampling rate converter for digital audio frequencies
CA2392640A1 (en) 2002-07-05 2004-01-05 Voiceage Corporation A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
JP2004120182A (ja) * 2002-09-25 2004-04-15 Sanyo Electric Co Ltd デシメーションフィルタおよびインターポレーションフィルタ
JP4369946B2 (ja) * 2002-11-21 2009-11-25 日本電信電話株式会社 ディジタル信号処理方法、そのプログラム、及びそのプログラムを格納した記録媒体
US7336208B2 (en) * 2003-03-31 2008-02-26 Nxp B.V. Up and down sample rate converter
EP2270774B1 (en) 2004-03-25 2016-07-27 DTS, Inc. Lossless multi-channel audio codec
DE102004043521A1 (de) 2004-09-08 2006-03-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes
ATE521143T1 (de) * 2005-02-23 2011-09-15 Ericsson Telefon Ab L M Adaptive bitzuweisung für die mehrkanal- audiokodierung
US7830921B2 (en) 2005-07-11 2010-11-09 Lg Electronics Inc. Apparatus and method of encoding and decoding audio signal
US7528745B2 (en) * 2006-02-15 2009-05-05 Qualcomm Incorporated Digital domain sampling rate converter
US7610195B2 (en) * 2006-06-01 2009-10-27 Nokia Corporation Decoding of predictively coded data using buffer adaptation
US9009032B2 (en) * 2006-11-09 2015-04-14 Broadcom Corporation Method and system for performing sample rate conversion
US7912728B2 (en) * 2006-11-30 2011-03-22 Broadcom Corporation Method and system for handling the processing of bluetooth data during multi-path multi-rate audio processing
CA2730196C (en) * 2008-07-11 2014-10-21 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Method and discriminator for classifying different segments of a signal
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
MX2011000372A (es) 2008-07-11 2011-05-19 Fraunhofer Ges Forschung Sintetizador de señales de audio y codificador de señales de audio.
CA2966469C (en) * 2009-01-28 2020-05-05 Dolby International Ab Improved harmonic transposition
KR101622950B1 (ko) * 2009-01-28 2016-05-23 삼성전자주식회사 오디오 신호의 부호화 및 복호화 방법 및 그 장치
US20110087494A1 (en) * 2009-10-09 2011-04-14 Samsung Electronics Co., Ltd. Apparatus and method of encoding audio signal by switching frequency domain transformation scheme and time domain transformation scheme
KR101137652B1 (ko) * 2009-10-14 2012-04-23 광운대학교 산학협력단 천이 구간에 기초하여 윈도우의 오버랩 영역을 조절하는 통합 음성/오디오 부호화/복호화 장치 및 방법
PL2491556T3 (pl) * 2009-10-20 2024-08-26 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Dekoder sygnału audio, odpowiadający mu sposób oraz program komputerowy
US8886523B2 (en) * 2010-04-14 2014-11-11 Huawei Technologies Co., Ltd. Audio decoding based on audio class with control code for post-processing modes
MX2013003782A (es) * 2010-10-06 2013-10-03 Fraunhofer Ges Forschung Aparato y metodo para procesar una señal de audio y para otorgar una mayor granularidad temporal para un codificador-decodificador combinado y unificado de voz y audio (usac).
MY167957A (en) * 2011-03-18 2018-10-08 Dolby Int Ab Frame Element Length Transmission in Audio Coding
CN104509119A (zh) * 2012-04-24 2015-04-08 Vid拓展公司 用于mpeg/3gpp-dash中平滑流切换的方法和装置

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6208276B1 (en) * 1998-12-30 2001-03-27 At&T Corporation Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
EP1204095A1 (en) * 1999-06-11 2002-05-08 NEC Corporation Sound switching device
US20100153122A1 (en) * 2008-12-15 2010-06-17 Tandberg Television Inc. Multi-staging recursive audio frame-based resampling and time mapping

Also Published As

Publication number Publication date
EP2625688B1 (en) 2014-12-03
KR101407120B1 (ko) 2014-06-13
HK1190223A1 (en) 2014-06-27
PL2625688T3 (pl) 2015-05-29
AR083303A1 (es) 2013-02-13
CN103403799B (zh) 2015-09-16
CN103403799A (zh) 2013-11-20
MX2013003782A (es) 2013-10-03
MY155997A (en) 2015-12-31
WO2012045744A1 (en) 2012-04-12
KR20130069821A (ko) 2013-06-26
JP2013543600A (ja) 2013-12-05
US20130226570A1 (en) 2013-08-29
US9552822B2 (en) 2017-01-24
AR101853A2 (es) 2017-01-18
BR112013008463B8 (pt) 2022-04-05
BR112013008463B1 (pt) 2021-06-01
AU2011311659A1 (en) 2013-05-02
RU2562384C2 (ru) 2015-09-10
TW201222532A (en) 2012-06-01
BR112013008463A2 (pt) 2016-08-09
RU2013120320A (ru) 2014-11-20
EP2625688A1 (en) 2013-08-14
CA2813859C (en) 2016-07-12
SG189277A1 (en) 2013-05-31
TWI486950B (zh) 2015-06-01
JP6100164B2 (ja) 2017-03-22
ES2530957T3 (es) 2015-03-09
CA2813859A1 (en) 2012-04-12

Similar Documents

Publication Publication Date Title
AU2011311659B2 (en) Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (USAC)
RU2680195C1 (ru) Аудиокодер для кодирования многоканального сигнала и аудиодекодер для декодирования кодированного аудиосигнала
US10600429B2 (en) Stereo audio encoder and decoder
CN113963706B (zh) 使用频域处理器以及时域处理器的音频编码器和解码器
EP2950308B1 (en) Bandwidth expansion parameter-generator, encoder, decoder, bandwidth expansion parameter-generating method, encoding method, and decoding method
EP2849180B1 (en) Hybrid audio signal encoder, hybrid audio signal decoder, method for encoding audio signal, and method for decoding audio signal
EP2997572B1 (en) Audio object separation from mixture signal using object-specific time/frequency resolutions
KR102560473B1 (ko) 후처리 지연을 저감시킨 고주파 재구성 기술의 통합
JP2021522543A (ja) 後処理遅延低減との高周波再構成技術の統合
HK1190223B (en) Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)
Riera-Palou et al. A bit stream scalable Speech/Audio coder combining enhanced regular pulse excitation and parametric coding

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)