CN106910509B - 用于修正通用音频合成的设备及其方法 - Google Patents

用于修正通用音频合成的设备及其方法 Download PDF

Info

Publication number
CN106910509B
CN106910509B CN201710020311.8A CN201710020311A CN106910509B CN 106910509 B CN106910509 B CN 106910509B CN 201710020311 A CN201710020311 A CN 201710020311A CN 106910509 B CN106910509 B CN 106910509B
Authority
CN
China
Prior art keywords
frequency
domain excitation
normalized
bin
modified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710020311.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN106910509A (zh
Inventor
T.瓦兰考特
M.杰里尼克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengdai EVs Limited
Original Assignee
Voisage
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=48191141&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CN106910509(B) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Voisage filed Critical Voisage
Publication of CN106910509A publication Critical patent/CN106910509A/zh
Application granted granted Critical
Publication of CN106910509B publication Critical patent/CN106910509B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers
    • H03G3/20Automatic control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Magnetic Resonance Imaging Apparatus (AREA)
  • Ultra Sonic Daignosis Equipment (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
CN201710020311.8A 2011-11-03 2012-11-01 用于修正通用音频合成的设备及其方法 Active CN106910509B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201161555246P 2011-11-03 2011-11-03
US61/555,246 2011-11-03
CN201280065936.1A CN104040624B (zh) 2011-11-03 2012-11-01 改善低速率码激励线性预测解码器的非语音内容

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201280065936.1A Division CN104040624B (zh) 2011-11-03 2012-11-01 改善低速率码激励线性预测解码器的非语音内容

Publications (2)

Publication Number Publication Date
CN106910509A CN106910509A (zh) 2017-06-30
CN106910509B true CN106910509B (zh) 2020-08-18

Family

ID=48191141

Family Applications (3)

Application Number Title Priority Date Filing Date
CN201710020311.8A Active CN106910509B (zh) 2011-11-03 2012-11-01 用于修正通用音频合成的设备及其方法
CN201280065936.1A Active CN104040624B (zh) 2011-11-03 2012-11-01 改善低速率码激励线性预测解码器的非语音内容
CN201710019918.4A Active CN107068158B (zh) 2011-11-03 2012-11-01 用于改善低速率码激励线性预测解码器的非语音内容的方法及其设备

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN201280065936.1A Active CN104040624B (zh) 2011-11-03 2012-11-01 改善低速率码激励线性预测解码器的非语音内容
CN201710019918.4A Active CN107068158B (zh) 2011-11-03 2012-11-01 用于改善低速率码激励线性预测解码器的非语音内容的方法及其设备

Country Status (15)

Country Link
US (1) US9252728B2 (enExample)
EP (3) EP4488997A3 (enExample)
JP (5) JP6239521B2 (enExample)
KR (1) KR102105044B1 (enExample)
CN (3) CN106910509B (enExample)
CA (1) CA2851370C (enExample)
DK (2) DK3709298T3 (enExample)
ES (2) ES2805308T3 (enExample)
FI (1) FI3709298T3 (enExample)
HR (2) HRP20201070T1 (enExample)
HU (2) HUE050600T2 (enExample)
IN (1) IN2014DN03022A (enExample)
LT (2) LT2774145T (enExample)
SI (2) SI2774145T1 (enExample)
WO (1) WO2013063688A1 (enExample)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6239521B2 (ja) * 2011-11-03 2017-11-29 ヴォイスエイジ・コーポレーション 低レートcelpデコーダに関する非音声コンテンツの向上
EP4246516B1 (en) * 2013-03-04 2025-07-23 VoiceAge EVS LLC Device and method for reducing quantization noise in a time-domain decoder
US9418671B2 (en) * 2013-08-15 2016-08-16 Huawei Technologies Co., Ltd. Adaptive high-pass post-filter
CN111312277B (zh) * 2014-03-03 2023-08-15 三星电子株式会社 用于带宽扩展的高频解码的方法及设备
CN110097892B (zh) 2014-06-03 2022-05-10 华为技术有限公司 一种语音频信号的处理方法和装置
JP6401521B2 (ja) * 2014-07-04 2018-10-10 クラリオン株式会社 信号処理装置及び信号処理方法
US10049684B2 (en) * 2015-04-05 2018-08-14 Qualcomm Incorporated Audio bandwidth selection
US9972334B2 (en) * 2015-09-10 2018-05-15 Qualcomm Incorporated Decoder audio classification
US10373608B2 (en) 2015-10-22 2019-08-06 Texas Instruments Incorporated Time-based frequency tuning of analog-to-information feature extraction
WO2019056107A1 (en) * 2017-09-20 2019-03-28 Voiceage Corporation METHOD AND DEVICE FOR ALLOCATING A BINARY BUDGET BETWEEN SUB-FRAMES IN A CELP CODEC
TWI790705B (zh) * 2021-08-06 2023-01-21 宏正自動科技股份有限公司 語速調整方法及其系統
CN115857614B (zh) * 2022-11-17 2023-12-29 弘正储能(上海)能源科技有限公司 多路光伏mppt交错式boost控制方法及其系统

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040002856A1 (en) * 2002-03-08 2004-01-01 Udaya Bhaskar Multi-rate frequency domain interpolative speech CODEC system

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58220199A (ja) * 1982-06-17 1983-12-21 日本電気株式会社 帯域分割型ボコ−ダ
JP3088121B2 (ja) * 1991-04-12 2000-09-18 沖電気工業株式会社 統計励振コードベクトルの最適化方法
JP2606006B2 (ja) * 1991-05-24 1997-04-30 ヤマハ株式会社 ノイズ音発生装置
JP3328080B2 (ja) * 1994-11-22 2002-09-24 沖電気工業株式会社 コード励振線形予測復号器
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
JP3451998B2 (ja) * 1999-05-31 2003-09-29 日本電気株式会社 無音声符号化を含む音声符号化・復号装置、復号化方法及びプログラムを記録した記録媒体
US7272553B1 (en) * 1999-09-08 2007-09-18 8X8, Inc. Varying pulse amplitude multi-pulse analysis speech processor and method
US7139700B1 (en) * 1999-09-22 2006-11-21 Texas Instruments Incorporated Hybrid speech coding and system
JP3478209B2 (ja) * 1999-11-01 2003-12-15 日本電気株式会社 音声信号復号方法及び装置と音声信号符号化復号方法及び装置と記録媒体
US6704711B2 (en) * 2000-01-28 2004-03-09 Telefonaktiebolaget Lm Ericsson (Publ) System and method for modifying speech signals
JP3462464B2 (ja) * 2000-10-20 2003-11-05 株式会社東芝 音声符号化方法、音声復号化方法及び電子装置
JP2003110429A (ja) * 2001-09-28 2003-04-11 Sony Corp 符号化方法及び装置、復号方法及び装置、伝送方法及び装置、並びに記録媒体
CA2388439A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for efficient frame erasure concealment in linear predictive based speech codecs
JP3861770B2 (ja) * 2002-08-21 2006-12-20 ソニー株式会社 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体
WO2004084182A1 (en) * 2003-03-15 2004-09-30 Mindspeed Technologies, Inc. Decomposition of voiced speech for celp speech coding
WO2004090870A1 (ja) * 2003-04-04 2004-10-21 Kabushiki Kaisha Toshiba 広帯域音声を符号化または復号化するための方法及び装置
UA93677C2 (ru) * 2005-04-01 2011-03-10 Квелкомм Инкорпорейтед Способы и устройства кодирования и декодирования части речевого сигнала диапазона высоких частот
US7707034B2 (en) * 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
US7630882B2 (en) * 2005-07-15 2009-12-08 Microsoft Corporation Frequency segmentation to obtain bands for efficient coding of digital media
KR20080047443A (ko) * 2005-10-14 2008-05-28 마츠시타 덴끼 산교 가부시키가이샤 변환 부호화 장치 및 변환 부호화 방법
US7490036B2 (en) * 2005-10-20 2009-02-10 Motorola, Inc. Adaptive equalizer for a coded speech signal
US8255207B2 (en) * 2005-12-28 2012-08-28 Voiceage Corporation Method and device for efficient frame erasure concealment in speech codecs
TWI333643B (en) * 2006-01-18 2010-11-21 Lg Electronics Inc Apparatus and method for encoding and decoding signal
EP1993320B1 (en) * 2006-03-03 2015-01-07 Nippon Telegraph And Telephone Corporation Reverberation removal device, reverberation removal method, reverberation removal program, and recording medium
US7590523B2 (en) * 2006-03-20 2009-09-15 Mindspeed Technologies, Inc. Speech post-processing using MDCT coefficients
CN101086845B (zh) * 2006-06-08 2011-06-01 北京天籁传音数字技术有限公司 声音编码装置及方法以及声音解码装置及方法
US7873511B2 (en) * 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
CN101140759B (zh) * 2006-09-08 2010-05-12 华为技术有限公司 语音或音频信号的带宽扩展方法及系统
CN101025918B (zh) * 2007-01-19 2011-06-29 清华大学 一种语音/音乐双模编解码无缝切换方法
AU2008221657B2 (en) * 2007-03-05 2010-12-02 Telefonaktiebolaget Lm Ericsson (Publ) Method and arrangement for smoothing of stationary background noise
CN101388214B (zh) * 2007-09-14 2012-07-04 向为 一种变速率的声码器及其编码方法
CN100585699C (zh) * 2007-11-02 2010-01-27 华为技术有限公司 一种音频解码的方法和装置
KR101221919B1 (ko) * 2008-03-03 2013-01-15 연세대학교 산학협력단 오디오 신호 처리 방법 및 장치
ES2464722T3 (es) * 2008-03-04 2014-06-03 Lg Electronics Inc. Método y aparato para procesar una señal de audio
CN101620854B (zh) * 2008-06-30 2012-04-04 华为技术有限公司 频带扩展的方法、系统和设备
EP2144229A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Efficient use of phase information in audio encoding and decoding
PL2311034T3 (pl) * 2008-07-11 2016-04-29 Fraunhofer Ges Forschung Koder i dekoder audio do kodowania ramek próbkowanego sygnału audio
ES2592416T3 (es) * 2008-07-17 2016-11-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Esquema de codificación/decodificación de audio que tiene una derivación conmutable
EP3640941A1 (en) * 2008-10-08 2020-04-22 Fraunhofer Gesellschaft zur Förderung der Angewand Multi-resolution switched audio encoding/decoding scheme
KR101622950B1 (ko) * 2009-01-28 2016-05-23 삼성전자주식회사 오디오 신호의 부호화 및 복호화 방법 및 그 장치
RU2591661C2 (ru) * 2009-10-08 2016-07-20 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. Многорежимный декодировщик аудио сигнала, многорежимный кодировщик аудио сигналов, способы и компьютерные программы с использованием кодирования с линейным предсказанием на основе ограничения шума
WO2011086923A1 (ja) * 2010-01-14 2011-07-21 パナソニック株式会社 符号化装置、復号装置、スペクトル変動量算出方法及びスペクトル振幅調整方法
US8958572B1 (en) * 2010-04-19 2015-02-17 Audience, Inc. Adaptive noise cancellation for multi-microphone systems
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
CN102074245B (zh) * 2011-01-05 2012-10-10 瑞声声学科技(深圳)有限公司 基于双麦克风语音增强装置及语音增强方法
JP6239521B2 (ja) * 2011-11-03 2017-11-29 ヴォイスエイジ・コーポレーション 低レートcelpデコーダに関する非音声コンテンツの向上
DE102014101462B3 (de) 2014-02-06 2015-03-05 Sartorius Lab Instruments Gmbh & Co. Kg Verfahren zur Funktionsprüfung eines Messgerätes

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040002856A1 (en) * 2002-03-08 2004-01-01 Udaya Bhaskar Multi-rate frequency domain interpolative speech CODEC system

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
《A 5.85 kbits CELP algorithm for cellular applications》;W.B. Kleijn et al.;《1993 IEEE International Conference on Acoustics, Speech, and Signal Processing》;19931231;全文 *
《A new model of LPC excitation for producing natural-sounding speech at low bit rates》;B. Atal et al.;《ICASSP "82. IEEE International Conference on Acoustics, Speech, and Signal Processing》;19821231;全文 *
《Retrieving Sparse Patterns Using a Compressed Sensing Framework: Applications to Speech Coding Based on Sparse Linear Prediction》;Daniele Giacobello et al.;《IEEE Signal Processing Letters ( Volume: 17 , Issue: 1 , Jan. 2010 )》;20101231;全文 *
《基于子带清浊音模式的声码器增益参数抗误码算法》;洪侃 等;《清华大学学报(自然科学版)》;20081231;全文 *

Also Published As

Publication number Publication date
CA2851370A1 (en) 2013-05-10
JP6532926B2 (ja) 2019-06-19
ES2805308T3 (es) 2021-02-11
CN106910509A (zh) 2017-06-30
ES3012033T3 (en) 2025-04-08
WO2013063688A1 (en) 2013-05-10
US20130121508A1 (en) 2013-05-16
EP3709298A1 (en) 2020-09-16
JP2018045243A (ja) 2018-03-22
FI3709298T3 (fi) 2025-02-21
EP2774145A4 (en) 2015-10-21
US9252728B2 (en) 2016-02-02
EP2774145A1 (en) 2014-09-10
LT2774145T (lt) 2020-09-25
KR20140090214A (ko) 2014-07-16
JP2019152878A (ja) 2019-09-12
SI2774145T1 (sl) 2020-10-30
JP7237127B2 (ja) 2023-03-10
KR102105044B1 (ko) 2020-04-27
JP6513769B2 (ja) 2019-05-15
HUE070390T2 (hu) 2025-06-28
EP3709298B1 (en) 2024-11-20
LT3709298T (lt) 2025-02-25
JP2015501452A (ja) 2015-01-15
DK3709298T3 (da) 2025-01-13
HUE050600T2 (hu) 2021-01-28
JP6239521B2 (ja) 2017-11-29
HRP20201070T1 (hr) 2020-10-30
JP2022022247A (ja) 2022-02-03
IN2014DN03022A (enExample) 2015-05-08
JP2018045244A (ja) 2018-03-22
EP4488997A2 (en) 2025-01-08
DK2774145T3 (da) 2020-07-20
CN104040624B (zh) 2017-03-01
HK1198265A1 (en) 2015-03-20
SI3709298T1 (sl) 2025-05-30
CN107068158B (zh) 2020-08-21
CN104040624A (zh) 2014-09-10
CA2851370C (en) 2019-12-03
CN107068158A (zh) 2017-08-18
HRP20241659T1 (hr) 2025-02-28
EP4488997A3 (en) 2025-01-22
EP2774145B1 (en) 2020-06-17

Similar Documents

Publication Publication Date Title
CN106910509B (zh) 用于修正通用音频合成的设备及其方法
JP7427752B2 (ja) 時間領域デコーダにおける量子化雑音を低減するためのデバイスおよび方法
JP5247826B2 (ja) 復号化音調音響信号を増強するためのシステムおよび方法
US10672411B2 (en) Method for adaptively encoding an audio signal in dependence on noise information for higher encoding accuracy
HK40035914A (en) Improving non-speech content for low rate celp decoder
HK40117447A (en) Improving non-speech content for low rate celp decoder
HK40035914B (en) Improving non-speech content for low rate celp decoder
HK1198265B (en) Improving non-speech content for low rate celp decoder
HK40045960B (en) Device and method for reducing quantization noise in a time-domain decoder
HK1212088B (zh) 用於降低時域解碼器中的量化噪聲的裝置和方法
HK40029446A (en) Device and method for reducing quantization noise in a time-domain decoder

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20200908

Address after: California, USA

Patentee after: Shengdai EVs Limited

Address before: Kaisan ohokkatsu

Patentee before: Voisage