CN103403799B - 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 - Google Patents

用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 Download PDF

Info

Publication number
CN103403799B
CN103403799B CN201180058880.2A CN201180058880A CN103403799B CN 103403799 B CN103403799 B CN 103403799B CN 201180058880 A CN201180058880 A CN 201180058880A CN 103403799 B CN103403799 B CN 103403799B
Authority
CN
China
Prior art keywords
samples
configurable
applicable
sound signal
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201180058880.2A
Other languages
English (en)
Chinese (zh)
Other versions
CN103403799A (zh
Inventor
马库斯·穆赖特鲁斯
伯恩哈德·格里
马克思·纽恩多夫
尼古劳斯·雷特尔巴赫
纪尧姆·福奇斯
菲利普·古尔纳伊
罗什·勒菲弗
布鲁诺·贝塞特
斯特凡·维尔德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VoiceAge Corp
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Original Assignee
Sound Generation Co ltd
Franhofer Transportation Applied Research Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sound Generation Co ltd, Franhofer Transportation Applied Research Co filed Critical Sound Generation Co ltd
Publication of CN103403799A publication Critical patent/CN103403799A/zh
Application granted granted Critical
Publication of CN103403799B publication Critical patent/CN103403799B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0012Smoothing of parameters of the decoder interpolation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Laminated Bodies (AREA)
CN201180058880.2A 2010-10-06 2011-10-04 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法 Active CN103403799B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US39026710P 2010-10-06 2010-10-06
US61/390,267 2010-10-06
PCT/EP2011/067318 WO2012045744A1 (en) 2010-10-06 2011-10-04 Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)

Publications (2)

Publication Number Publication Date
CN103403799A CN103403799A (zh) 2013-11-20
CN103403799B true CN103403799B (zh) 2015-09-16

Family

ID=44759689

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201180058880.2A Active CN103403799B (zh) 2010-10-06 2011-10-04 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法

Country Status (17)

Country Link
US (1) US9552822B2 (https=)
EP (1) EP2625688B1 (https=)
JP (1) JP6100164B2 (https=)
KR (1) KR101407120B1 (https=)
CN (1) CN103403799B (https=)
AR (2) AR083303A1 (https=)
AU (1) AU2011311659B2 (https=)
BR (1) BR112013008463B8 (https=)
CA (1) CA2813859C (https=)
ES (1) ES2530957T3 (https=)
MX (1) MX2013003782A (https=)
MY (1) MY155997A (https=)
PL (1) PL2625688T3 (https=)
RU (1) RU2562384C2 (https=)
SG (1) SG189277A1 (https=)
TW (1) TWI486950B (https=)
WO (1) WO2012045744A1 (https=)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BR112013008463B8 (pt) * 2010-10-06 2022-04-05 Fraunhofer Ges Zur Foerderung Der Angewandten Forschubg E V Aparelho e método para processar um sinal de áudio e para prover uma granularidade temporal maior para um codec de fala e áudio unificado combinado (usac)
EP2777042B1 (en) * 2011-11-11 2019-08-14 Dolby International AB Upsampling using oversampled sbr
TWI557727B (zh) 2013-04-05 2016-11-11 杜比國際公司 音訊處理系統、多媒體處理系統、處理音訊位元流的方法以及電腦程式產品
AU2014204540B1 (en) * 2014-07-21 2015-08-20 Matthew Brown Audio Signal Processing Methods and Systems
EP2980794A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder using a frequency domain processor and a time domain processor
EP2980795A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
EP3107096A1 (en) * 2015-06-16 2016-12-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Downscaled decoding
EP3182411A1 (en) 2015-12-14 2017-06-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for processing an encoded audio signal
MY196436A (en) 2016-01-22 2023-04-11 Fraunhofer Ges Forschung Apparatus and Method for Encoding or Decoding a Multi-Channel Signal Using Frame Control Synchronization
CN109328382B (zh) * 2016-06-22 2023-06-16 杜比国际公司 用于将数字音频信号从第一频域变换到第二频域的音频解码器及方法
US10249307B2 (en) * 2016-06-27 2019-04-02 Qualcomm Incorporated Audio decoding using intermediate sampling rate
TWI812658B (zh) 2017-12-19 2023-08-21 瑞典商都比國際公司 用於統一語音及音訊之解碼及編碼去關聯濾波器之改良之方法、裝置及系統
EP4154249B1 (en) * 2020-05-20 2024-01-24 Dolby International AB Methods and apparatus for unified speech and audio decoding improvements

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6208276B1 (en) * 1998-12-30 2001-03-27 At&T Corporation Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
EP1204095A1 (en) * 1999-06-11 2002-05-08 NEC Corporation Sound switching device
CN101218630A (zh) * 2005-07-11 2008-07-09 Lg电子株式会社 处理音频信号的装置和方法
US20100153122A1 (en) * 2008-12-15 2010-06-17 Tandberg Television Inc. Multi-staging recursive audio frame-based resampling and time mapping

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03286698A (ja) 1990-04-02 1991-12-17 Onkyo Corp ソフトドーム振動板
KR970011728B1 (ko) * 1994-12-21 1997-07-14 김광호 음향신호의 에러은닉방법 및 그 장치
IT1281001B1 (it) * 1995-10-27 1998-02-11 Cselt Centro Studi Lab Telecom Procedimento e apparecchiatura per codificare, manipolare e decodificare segnali audio.
US6006108A (en) * 1996-01-31 1999-12-21 Qualcomm Incorporated Digital audio processing in a dual-mode telephone
DE19742655C2 (de) * 1997-09-26 1999-08-05 Fraunhofer Ges Forschung Verfahren und Vorrichtung zum Codieren eines zeitdiskreten Stereosignals
US6208671B1 (en) * 1998-01-20 2001-03-27 Cirrus Logic, Inc. Asynchronous sample rate converter
ATE302991T1 (de) * 1998-01-22 2005-09-15 Deutsche Telekom Ag Verfahren zur signalgesteuerten schaltung zwischen verschiedenen audiokodierungssystemen
US6275836B1 (en) * 1998-06-12 2001-08-14 Oak Technology, Inc. Interpolation filter and method for switching between integer and fractional interpolation rates
US7177812B1 (en) * 2000-06-23 2007-02-13 Stmicroelectronics Asia Pacific Pte Ltd Universal sampling rate converter for digital audio frequencies
CA2392640A1 (en) * 2002-07-05 2004-01-05 Voiceage Corporation A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
JP2004120182A (ja) * 2002-09-25 2004-04-15 Sanyo Electric Co Ltd デシメーションフィルタおよびインターポレーションフィルタ
JP4369946B2 (ja) * 2002-11-21 2009-11-25 日本電信電話株式会社 ディジタル信号処理方法、そのプログラム、及びそのプログラムを格納した記録媒体
CN1768476B (zh) * 2003-03-31 2010-06-09 Nxp股份有限公司 采样率转换器及方法,包括采样率转换器的设备
KR101237559B1 (ko) 2004-03-25 2013-02-26 디티에스, 인코포레이티드 스케일러블 무손실 비트스트림의 인코딩 방법
DE102004043521A1 (de) 2004-09-08 2006-03-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes
JP4809370B2 (ja) * 2005-02-23 2011-11-09 テレフオンアクチーボラゲット エル エム エリクソン(パブル) マルチチャネル音声符号化における適応ビット割り当て
US7528745B2 (en) 2006-02-15 2009-05-05 Qualcomm Incorporated Digital domain sampling rate converter
US7610195B2 (en) * 2006-06-01 2009-10-27 Nokia Corporation Decoding of predictively coded data using buffer adaptation
US9009032B2 (en) * 2006-11-09 2015-04-14 Broadcom Corporation Method and system for performing sample rate conversion
US7912728B2 (en) * 2006-11-30 2011-03-22 Broadcom Corporation Method and system for handling the processing of bluetooth data during multi-path multi-rate audio processing
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
CN102089803B (zh) 2008-07-11 2013-02-27 弗劳恩霍夫应用研究促进协会 用以将信号的不同段分类的方法与鉴别器
JP5244971B2 (ja) 2008-07-11 2013-07-24 フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン オーディオ信号合成器及びオーディオ信号符号器
KR101622950B1 (ko) * 2009-01-28 2016-05-23 삼성전자주식회사 오디오 신호의 부호화 및 복호화 방법 및 그 장치
ES2639716T3 (es) * 2009-01-28 2017-10-30 Dolby International Ab Transposición armónica mejorada
US20110087494A1 (en) * 2009-10-09 2011-04-14 Samsung Electronics Co., Ltd. Apparatus and method of encoding audio signal by switching frequency domain transformation scheme and time domain transformation scheme
KR101137652B1 (ko) * 2009-10-14 2012-04-23 광운대학교 산학협력단 천이 구간에 기초하여 윈도우의 오버랩 영역을 조절하는 통합 음성/오디오 부호화/복호화 장치 및 방법
KR101411759B1 (ko) * 2009-10-20 2014-06-25 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 오디오 신호 인코더, 오디오 신호 디코더, 앨리어싱-소거를 이용하여 오디오 신호를 인코딩 또는 디코딩하는 방법
US8886523B2 (en) * 2010-04-14 2014-11-11 Huawei Technologies Co., Ltd. Audio decoding based on audio class with control code for post-processing modes
BR112013008463B8 (pt) * 2010-10-06 2022-04-05 Fraunhofer Ges Zur Foerderung Der Angewandten Forschubg E V Aparelho e método para processar um sinal de áudio e para prover uma granularidade temporal maior para um codec de fala e áudio unificado combinado (usac)
JP5805796B2 (ja) * 2011-03-18 2015-11-10 フラウンホーファーゲゼルシャフトツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. 柔軟なコンフィギュレーション機能性を有するオーディオエンコーダおよびデコーダ
WO2013163224A1 (en) * 2012-04-24 2013-10-31 Vid Scale, Inc. Method and apparatus for smooth stream switching in mpeg/3gpp-dash

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6208276B1 (en) * 1998-12-30 2001-03-27 At&T Corporation Method and apparatus for sample rate pre- and post-processing to achieve maximal coding gain for transform-based audio encoding and decoding
EP1204095A1 (en) * 1999-06-11 2002-05-08 NEC Corporation Sound switching device
CN101218630A (zh) * 2005-07-11 2008-07-09 Lg电子株式会社 处理音频信号的装置和方法
US20100153122A1 (en) * 2008-12-15 2010-06-17 Tandberg Television Inc. Multi-staging recursive audio frame-based resampling and time mapping

Also Published As

Publication number Publication date
EP2625688B1 (en) 2014-12-03
HK1190223A1 (en) 2014-06-27
EP2625688A1 (en) 2013-08-14
RU2013120320A (ru) 2014-11-20
SG189277A1 (en) 2013-05-31
TW201222532A (en) 2012-06-01
KR101407120B1 (ko) 2014-06-13
JP6100164B2 (ja) 2017-03-22
BR112013008463B8 (pt) 2022-04-05
MX2013003782A (es) 2013-10-03
BR112013008463A2 (pt) 2016-08-09
CA2813859A1 (en) 2012-04-12
US9552822B2 (en) 2017-01-24
ES2530957T3 (es) 2015-03-09
BR112013008463B1 (pt) 2021-06-01
KR20130069821A (ko) 2013-06-26
TWI486950B (zh) 2015-06-01
CN103403799A (zh) 2013-11-20
AU2011311659A1 (en) 2013-05-02
CA2813859C (en) 2016-07-12
WO2012045744A1 (en) 2012-04-12
US20130226570A1 (en) 2013-08-29
AR101853A2 (es) 2017-01-18
AU2011311659B2 (en) 2015-07-30
RU2562384C2 (ru) 2015-09-10
MY155997A (en) 2015-12-31
PL2625688T3 (pl) 2015-05-29
JP2013543600A (ja) 2013-12-05
AR083303A1 (es) 2013-02-13

Similar Documents

Publication Publication Date Title
CN103403799B (zh) 用于针对合成统一语音和音频编解码器(usac)处理音频信号和提供较高时间粒度的设备和方法
JP7228607B2 (ja) 全帯域ギャップ充填を備えた周波数ドメインプロセッサと時間ドメインプロセッサとを使用するオーディオ符号器及び復号器
CN101553865B (zh) 用于处理音频信号的方法和装置
CA3124108C (en) Cross product enhanced harmonic transposition
JP5520967B2 (ja) 適応的正弦波コーディングを用いるオーディオ信号の符号化及び復号化方法及び装置
JP2021099497A (ja) 周波数ドメインプロセッサ、時間ドメインプロセッサ及び連続的な初期化のためのクロスプロセッサを使用するオーディオ符号器及び復号器
CN102177426A (zh) 多分辨率切换音频编码/解码方案
MX2011000373A (es) Aparato y metodo para la codificacion/decodificacion de una señal de audio utilizando un esquema de conmutacion de generacion de señal ajena.
CN104123946A (zh) 用于在与语音信号相关联的包中包含识别符的系统及方法
WO2009059631A1 (en) Audio coding apparatus and method thereof
WO2013168414A1 (ja) 音信号ハイブリッドエンコーダ、音信号ハイブリッドデコーダ、音信号符号化方法、及び音信号復号方法
US20100121632A1 (en) Stereo audio encoding device, stereo audio decoding device, and their method
CN105280189B (zh) 带宽扩展编码和解码中高频生成的方法和装置
HK1190223B (en) Apparatus and method for processing an audio signal and for providing a higher temporal granularity for a combined unified speech and audio codec (usac)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee
CP01 Change in the name or title of a patent holder

Address after: Munich, Germany

Patentee after: Fraunhofer Application and Research Promotion Association

Patentee after: Voiceage Corp

Address before: Munich, Germany

Patentee before: Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V.

Patentee before: Voiceage Corp