CN103403799B - 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 - Google Patents
用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 Download PDFInfo
- Publication number
- CN103403799B CN103403799B CN201180058880.2A CN201180058880A CN103403799B CN 103403799 B CN103403799 B CN 103403799B CN 201180058880 A CN201180058880 A CN 201180058880A CN 103403799 B CN103403799 B CN 103403799B
- Authority
- CN
- China
- Prior art keywords
- samples
- configurable
- applicable
- sound signal
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/04—Time compression or expansion
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0012—Smoothing of parameters of the decoder interpolation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
- Laminated Bodies (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US39026710P | 2010-10-06 | 2010-10-06 | |
| US61/390,267 | 2010-10-06 | ||
| PCT/EP2011/067318 WO2012045744A1 (en) | 2010-10-06 | 2011-10-04 | Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac) |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103403799A CN103403799A (zh) | 2013-11-20 |
| CN103403799B true CN103403799B (zh) | 2015-09-16 |
Family
ID=44759689
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201180058880.2A Active CN103403799B (zh) | 2010-10-06 | 2011-10-04 | 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 |
Country Status (17)
| Country | Link |
|---|---|
| US (1) | US9552822B2 (enExample) |
| EP (1) | EP2625688B1 (enExample) |
| JP (1) | JP6100164B2 (enExample) |
| KR (1) | KR101407120B1 (enExample) |
| CN (1) | CN103403799B (enExample) |
| AR (2) | AR083303A1 (enExample) |
| AU (1) | AU2011311659B2 (enExample) |
| BR (1) | BR112013008463B8 (enExample) |
| CA (1) | CA2813859C (enExample) |
| ES (1) | ES2530957T3 (enExample) |
| MX (1) | MX2013003782A (enExample) |
| MY (1) | MY155997A (enExample) |
| PL (1) | PL2625688T3 (enExample) |
| RU (1) | RU2562384C2 (enExample) |
| SG (1) | SG189277A1 (enExample) |
| TW (1) | TWI486950B (enExample) |
| WO (1) | WO2012045744A1 (enExample) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MX2013003782A (es) * | 2010-10-06 | 2013-10-03 | Fraunhofer Ges Forschung | Aparato y metodo para procesar una señal de audio y para otorgar una mayor granularidad temporal para un codificador-decodificador combinado y unificado de voz y audio (usac). |
| EP2777042B1 (en) * | 2011-11-11 | 2019-08-14 | Dolby International AB | Upsampling using oversampled sbr |
| TWI557727B (zh) * | 2013-04-05 | 2016-11-11 | 杜比國際公司 | 音訊處理系統、多媒體處理系統、處理音訊位元流的方法以及電腦程式產品 |
| AU2014204540B1 (en) * | 2014-07-21 | 2015-08-20 | Matthew Brown | Audio Signal Processing Methods and Systems |
| EP2980795A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor |
| EP2980794A1 (en) | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder using a frequency domain processor and a time domain processor |
| EP3107096A1 (en) * | 2015-06-16 | 2016-12-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Downscaled decoding |
| EP3182411A1 (en) * | 2015-12-14 | 2017-06-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an encoded audio signal |
| RU2711513C1 (ru) * | 2016-01-22 | 2020-01-17 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Устройство и способ оценивания межканальной разницы во времени |
| CN109328382B (zh) * | 2016-06-22 | 2023-06-16 | 杜比国际公司 | 用于将数字音频信号从第一频域变换到第二频域的音频解码器及方法 |
| US10249307B2 (en) * | 2016-06-27 | 2019-04-02 | Qualcomm Incorporated | Audio decoding using intermediate sampling rate |
| TWI812658B (zh) | 2017-12-19 | 2023-08-21 | 瑞典商都比國際公司 | 用於統一語音及音訊之解碼及編碼去關聯濾波器之改良之方法、裝置及系統 |
| CN115668365B (zh) * | 2020-05-20 | 2025-11-18 | 杜比国际公司 | 用于统一语音和音频解码改进的方法和装置 |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6208276B1 (en) * | 1998-12-30 | 2001-03-27 | At&T Corporation | Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding |
| EP1204095A1 (en) * | 1999-06-11 | 2002-05-08 | NEC Corporation | Sound switching device |
| CN101218630A (zh) * | 2005-07-11 | 2008-07-09 | Lg电子株式会社 | 处理音频信号的装置和方法 |
| US20100153122A1 (en) * | 2008-12-15 | 2010-06-17 | Tandberg Television Inc. | Multi-staging recursive audio frame-based resampling and time mapping |
Family Cites Families (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH03286698A (ja) | 1990-04-02 | 1991-12-17 | Onkyo Corp | ソフトドーム振動板 |
| KR970011728B1 (ko) | 1994-12-21 | 1997-07-14 | 김광호 | 음향신호의 에러은닉방법 및 그 장치 |
| IT1281001B1 (it) * | 1995-10-27 | 1998-02-11 | Cselt Centro Studi Lab Telecom | Procedimento e apparecchiatura per codificare, manipolare e decodificare segnali audio. |
| US6006108A (en) * | 1996-01-31 | 1999-12-21 | Qualcomm Incorporated | Digital audio processing in a dual-mode telephone |
| DE19742655C2 (de) * | 1997-09-26 | 1999-08-05 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Codieren eines zeitdiskreten Stereosignals |
| US6208671B1 (en) * | 1998-01-20 | 2001-03-27 | Cirrus Logic, Inc. | Asynchronous sample rate converter |
| ES2247741T3 (es) * | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio. |
| US6275836B1 (en) * | 1998-06-12 | 2001-08-14 | Oak Technology, Inc. | Interpolation filter and method for switching between integer and fractional interpolation rates |
| EP1295390B1 (en) * | 2000-06-23 | 2007-02-14 | STMicroelectronics Asia Pacific Pte Ltd. | Universal sampling rate converter for digital audio frequencies |
| CA2392640A1 (en) | 2002-07-05 | 2004-01-05 | Voiceage Corporation | A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems |
| JP2004120182A (ja) * | 2002-09-25 | 2004-04-15 | Sanyo Electric Co Ltd | デシメーションフィルタおよびインターポレーションフィルタ |
| JP4369946B2 (ja) * | 2002-11-21 | 2009-11-25 | 日本電信電話株式会社 | ディジタル信号処理方法、そのプログラム、及びそのプログラムを格納した記録媒体 |
| US7336208B2 (en) * | 2003-03-31 | 2008-02-26 | Nxp B.V. | Up and down sample rate converter |
| EP2270774B1 (en) | 2004-03-25 | 2016-07-27 | DTS, Inc. | Lossless multi-channel audio codec |
| DE102004043521A1 (de) | 2004-09-08 | 2006-03-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes |
| ATE521143T1 (de) * | 2005-02-23 | 2011-09-15 | Ericsson Telefon Ab L M | Adaptive bitzuweisung für die mehrkanal- audiokodierung |
| US7528745B2 (en) * | 2006-02-15 | 2009-05-05 | Qualcomm Incorporated | Digital domain sampling rate converter |
| US7610195B2 (en) * | 2006-06-01 | 2009-10-27 | Nokia Corporation | Decoding of predictively coded data using buffer adaptation |
| US9009032B2 (en) * | 2006-11-09 | 2015-04-14 | Broadcom Corporation | Method and system for performing sample rate conversion |
| US7912728B2 (en) * | 2006-11-30 | 2011-03-22 | Broadcom Corporation | Method and system for handling the processing of bluetooth data during multi-path multi-rate audio processing |
| CA2730196C (en) * | 2008-07-11 | 2014-10-21 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Method and discriminator for classifying different segments of a signal |
| EP2144230A1 (en) * | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
| MX2011000372A (es) | 2008-07-11 | 2011-05-19 | Fraunhofer Ges Forschung | Sintetizador de señales de audio y codificador de señales de audio. |
| CA2966469C (en) * | 2009-01-28 | 2020-05-05 | Dolby International Ab | Improved harmonic transposition |
| KR101622950B1 (ko) * | 2009-01-28 | 2016-05-23 | 삼성전자주식회사 | 오디오 신호의 부호화 및 복호화 방법 및 그 장치 |
| US20110087494A1 (en) * | 2009-10-09 | 2011-04-14 | Samsung Electronics Co., Ltd. | Apparatus and method of encoding audio signal by switching frequency domain transformation scheme and time domain transformation scheme |
| KR101137652B1 (ko) * | 2009-10-14 | 2012-04-23 | 광운대학교 산학협력단 | 천이 구간에 기초하여 윈도우의 오버랩 영역을 조절하는 통합 음성/오디오 부호화/복호화 장치 및 방법 |
| PL2491556T3 (pl) * | 2009-10-20 | 2024-08-26 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Dekoder sygnału audio, odpowiadający mu sposób oraz program komputerowy |
| US8886523B2 (en) * | 2010-04-14 | 2014-11-11 | Huawei Technologies Co., Ltd. | Audio decoding based on audio class with control code for post-processing modes |
| MX2013003782A (es) * | 2010-10-06 | 2013-10-03 | Fraunhofer Ges Forschung | Aparato y metodo para procesar una señal de audio y para otorgar una mayor granularidad temporal para un codificador-decodificador combinado y unificado de voz y audio (usac). |
| MY167957A (en) * | 2011-03-18 | 2018-10-08 | Dolby Int Ab | Frame Element Length Transmission in Audio Coding |
| CN104509119A (zh) * | 2012-04-24 | 2015-04-08 | Vid拓展公司 | 用于mpeg/3gpp-dash中平滑流切换的方法和装置 |
-
2011
- 2011-10-04 MX MX2013003782A patent/MX2013003782A/es active IP Right Grant
- 2011-10-04 MY MYPI2013001206A patent/MY155997A/en unknown
- 2011-10-04 RU RU2013120320/08A patent/RU2562384C2/ru active
- 2011-10-04 PL PL11764739T patent/PL2625688T3/pl unknown
- 2011-10-04 JP JP2013532172A patent/JP6100164B2/ja active Active
- 2011-10-04 WO PCT/EP2011/067318 patent/WO2012045744A1/en not_active Ceased
- 2011-10-04 EP EP11764739.6A patent/EP2625688B1/en active Active
- 2011-10-04 ES ES11764739T patent/ES2530957T3/es active Active
- 2011-10-04 SG SG2013025382A patent/SG189277A1/en unknown
- 2011-10-04 KR KR1020137010454A patent/KR101407120B1/ko active Active
- 2011-10-04 CN CN201180058880.2A patent/CN103403799B/zh active Active
- 2011-10-04 CA CA2813859A patent/CA2813859C/en active Active
- 2011-10-04 AR ARP110103684A patent/AR083303A1/es active IP Right Grant
- 2011-10-04 AU AU2011311659A patent/AU2011311659B2/en active Active
- 2011-10-04 BR BR112013008463A patent/BR112013008463B8/pt active IP Right Grant
- 2011-10-05 TW TW100136050A patent/TWI486950B/zh active
-
2013
- 2013-04-03 US US13/855,889 patent/US9552822B2/en active Active
-
2015
- 2015-09-14 AR ARP150102919A patent/AR101853A2/es active IP Right Grant
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6208276B1 (en) * | 1998-12-30 | 2001-03-27 | At&T Corporation | Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding |
| EP1204095A1 (en) * | 1999-06-11 | 2002-05-08 | NEC Corporation | Sound switching device |
| CN101218630A (zh) * | 2005-07-11 | 2008-07-09 | Lg电子株式会社 | 处理音频信号的装置和方法 |
| US20100153122A1 (en) * | 2008-12-15 | 2010-06-17 | Tandberg Television Inc. | Multi-staging recursive audio frame-based resampling and time mapping |
Also Published As
| Publication number | Publication date |
|---|---|
| EP2625688B1 (en) | 2014-12-03 |
| KR101407120B1 (ko) | 2014-06-13 |
| HK1190223A1 (en) | 2014-06-27 |
| PL2625688T3 (pl) | 2015-05-29 |
| AR083303A1 (es) | 2013-02-13 |
| CN103403799A (zh) | 2013-11-20 |
| MX2013003782A (es) | 2013-10-03 |
| MY155997A (en) | 2015-12-31 |
| WO2012045744A1 (en) | 2012-04-12 |
| KR20130069821A (ko) | 2013-06-26 |
| JP2013543600A (ja) | 2013-12-05 |
| US20130226570A1 (en) | 2013-08-29 |
| US9552822B2 (en) | 2017-01-24 |
| AR101853A2 (es) | 2017-01-18 |
| BR112013008463B8 (pt) | 2022-04-05 |
| BR112013008463B1 (pt) | 2021-06-01 |
| AU2011311659A1 (en) | 2013-05-02 |
| RU2562384C2 (ru) | 2015-09-10 |
| AU2011311659B2 (en) | 2015-07-30 |
| TW201222532A (en) | 2012-06-01 |
| BR112013008463A2 (pt) | 2016-08-09 |
| RU2013120320A (ru) | 2014-11-20 |
| EP2625688A1 (en) | 2013-08-14 |
| CA2813859C (en) | 2016-07-12 |
| SG189277A1 (en) | 2013-05-31 |
| TWI486950B (zh) | 2015-06-01 |
| JP6100164B2 (ja) | 2017-03-22 |
| ES2530957T3 (es) | 2015-03-09 |
| CA2813859A1 (en) | 2012-04-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN103403799B (zh) | 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 | |
| JP7228607B2 (ja) | 全帯域ギャップ充填を備えた周波数ドメインプロセッサと時間ドメインプロセッサとを使用するオーディオ符号器及び復号器 | |
| CN102177426B (zh) | 多分辨率切换音频编码/解码方案 | |
| CN103400583B (zh) | 多声道下混对象编码的增强编码和参数表示 | |
| EP2849180B1 (en) | Hybrid audio signal encoder, hybrid audio signal decoder, method for encoding audio signal, and method for decoding audio signal | |
| CN102934163A (zh) | 用于宽带语音编码的系统、方法、设备和计算机程序产品 | |
| MX2011000373A (es) | Aparato y metodo para la codificacion/decodificacion de una señal de audio utilizando un esquema de conmutacion de generacion de señal ajena. | |
| CN101553865A (zh) | 用于处理音频信号的方法和装置 | |
| CN104123946A (zh) | 用于在与语音信号相关联的包中包含识别符的系统及方法 | |
| CA3162807A1 (en) | Cross product enhanced harmonic transposition | |
| CN102396024A (zh) | 使用自适应正弦波脉冲编码的用于音频信号的编码/解码方法及其设备 | |
| TW200931397A (en) | An encoder | |
| WO2009059631A1 (en) | Audio coding apparatus and method thereof | |
| US20100121632A1 (en) | Stereo audio encoding device, stereo audio decoding device, and their method | |
| CN103155035B (zh) | 基于celp的语音编码器中的音频信号带宽扩展 | |
| CN105280189B (zh) | 带宽扩展编码和解码中高频生成的方法和装置 | |
| Herre et al. | Perceptual Audio Coding: A 40-Year Historical Perspective | |
| Britanak et al. | Audio coding standards,(Proprietary) audio compression algorithms, and broadcasting/speech/data communication codecs: overview of adopted filter banks | |
| HK1190223B (en) | Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac) |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| C56 | Change in the name or address of the patentee | ||
| CP01 | Change in the name or title of a patent holder |
Address after: Munich, Germany Patentee after: Fraunhofer Application and Research Promotion Association Patentee after: Voiceage Corp Address before: Munich, Germany Patentee before: Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Patentee before: Voiceage Corp |