ZA200302468B - Apparatus for bandwidth expansion of a speech signal. - Google Patents
Apparatus for bandwidth expansion of a speech signal. Download PDFInfo
- Publication number
- ZA200302468B ZA200302468B ZA200302468A ZA200302468A ZA200302468B ZA 200302468 B ZA200302468 B ZA 200302468B ZA 200302468 A ZA200302468 A ZA 200302468A ZA 200302468 A ZA200302468 A ZA 200302468A ZA 200302468 B ZA200302468 B ZA 200302468B
- Authority
- ZA
- South Africa
- Prior art keywords
- speech
- signal
- scaling factor
- input signal
- periods
- Prior art date
Links
- 238000000034 method Methods 0.000 claims abstract description 41
- 238000001914 filtration Methods 0.000 claims abstract description 13
- 206010019133 Hangover Diseases 0.000 claims description 18
- 230000008569 process Effects 0.000 claims description 18
- 238000003786 synthesis reaction Methods 0.000 claims description 16
- 230000015572 biosynthetic process Effects 0.000 claims description 13
- 238000013139 quantization Methods 0.000 claims description 10
- 230000003595 spectral effect Effects 0.000 claims description 9
- 230000007246 mechanism Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 claims description 5
- 230000002194 synthesizing effect Effects 0.000 claims description 5
- 238000012544 monitoring process Methods 0.000 claims 2
- 238000007789 sealing Methods 0.000 claims 1
- 238000004040 coloring Methods 0.000 abstract description 5
- 238000012805 post-processing Methods 0.000 description 23
- 230000006978 adaptation Effects 0.000 description 19
- 230000005284 excitation Effects 0.000 description 15
- 238000005070 sampling Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 8
- 230000001052 transient effect Effects 0.000 description 5
- 238000007781 pre-processing Methods 0.000 description 4
- 230000001755 vocal effect Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- HJMIIBXYFPJZBP-UHFFFAOYSA-N 10-(2,3,4,5-tetrahydroxypentyl)-1h-pyrimido[4,5-b]quinoline-2,4,8-trione Chemical compound N1C(=O)NC(=O)C2=C1N(CC(O)C(O)C(O)CO)C1=CC(=O)C=CC1=C2 HJMIIBXYFPJZBP-UHFFFAOYSA-N 0.000 description 1
- LYPFDBRUNKHDGX-SOGSVHMOSA-N N1C2=CC=C1\C(=C1\C=CC(=N1)\C(=C1\C=C/C(/N1)=C(/C1=N/C(/CC1)=C2/C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1)C1=CC(O)=CC=C1 Chemical compound N1C2=CC=C1\C(=C1\C=CC(=N1)\C(=C1\C=C/C(/N1)=C(/C1=N/C(/CC1)=C2/C1=CC(O)=CC=C1)C1=CC(O)=CC=C1)\C1=CC(O)=CC=C1)C1=CC(O)=CC=C1 LYPFDBRUNKHDGX-SOGSVHMOSA-N 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 229960002197 temoporfin Drugs 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/691,440 US6615169B1 (en) | 2000-10-18 | 2000-10-18 | High frequency enhancement layer coding in wideband speech codec |
Publications (1)
Publication Number | Publication Date |
---|---|
ZA200302468B true ZA200302468B (en) | 2004-03-29 |
Family
ID=24776540
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ZA200302468A ZA200302468B (en) | 2000-10-18 | 2003-03-28 | Apparatus for bandwidth expansion of a speech signal. |
Country Status (14)
Country | Link |
---|---|
US (1) | US6615169B1 (zh) |
EP (1) | EP1328928B1 (zh) |
JP (1) | JP2004512562A (zh) |
KR (1) | KR100547235B1 (zh) |
CN (1) | CN1244907C (zh) |
AT (1) | ATE330311T1 (zh) |
AU (1) | AU2001294125A1 (zh) |
BR (1) | BR0114669A (zh) |
CA (1) | CA2425926C (zh) |
DE (1) | DE60120734T2 (zh) |
ES (1) | ES2265442T3 (zh) |
PT (1) | PT1328928E (zh) |
WO (1) | WO2002033697A2 (zh) |
ZA (1) | ZA200302468B (zh) |
Families Citing this family (53)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7113522B2 (en) * | 2001-01-24 | 2006-09-26 | Qualcomm, Incorporated | Enhanced conversion of wideband signals to narrowband signals |
US7522586B2 (en) * | 2002-05-22 | 2009-04-21 | Broadcom Corporation | Method and system for tunneling wideband telephony through the PSTN |
GB2389217A (en) * | 2002-05-27 | 2003-12-03 | Canon Kk | Speech recognition system |
ATE428167T1 (de) * | 2002-07-19 | 2009-04-15 | Nec Corp | Audiodekodierungseinrichtung, dekodierungsverfahren und programm |
DE10252070B4 (de) * | 2002-11-08 | 2010-07-15 | Palm, Inc. (n.d.Ges. d. Staates Delaware), Sunnyvale | Kommunikationsendgerät mit parametrierter Bandbreitenerweiterung und Verfahren zur Bandbreitenerweiterung dafür |
US7406096B2 (en) * | 2002-12-06 | 2008-07-29 | Qualcomm Incorporated | Tandem-free intersystem voice communication |
FR2867649A1 (fr) * | 2003-12-10 | 2005-09-16 | France Telecom | Procede de codage multiple optimise |
KR100587953B1 (ko) | 2003-12-26 | 2006-06-08 | 한국전자통신연구원 | 대역-분할 광대역 음성 코덱에서의 고대역 오류 은닉 장치 및 그를 이용한 비트스트림 복호화 시스템 |
FI118834B (fi) * | 2004-02-23 | 2008-03-31 | Nokia Corp | Audiosignaalien luokittelu |
JP4529492B2 (ja) * | 2004-03-11 | 2010-08-25 | 株式会社デンソー | 音声抽出方法、音声抽出装置、音声認識装置、及び、プログラム |
FI119533B (fi) * | 2004-04-15 | 2008-12-15 | Nokia Corp | Audiosignaalien koodaus |
EP1742202B1 (en) * | 2004-05-19 | 2008-05-07 | Matsushita Electric Industrial Co., Ltd. | Encoding device, decoding device, and method thereof |
US7921007B2 (en) * | 2004-08-17 | 2011-04-05 | Koninklijke Philips Electronics N.V. | Scalable audio coding |
JP4771674B2 (ja) * | 2004-09-02 | 2011-09-14 | パナソニック株式会社 | 音声符号化装置、音声復号化装置及びこれらの方法 |
JP4859670B2 (ja) * | 2004-10-27 | 2012-01-25 | パナソニック株式会社 | 音声符号化装置および音声符号化方法 |
US7386445B2 (en) * | 2005-01-18 | 2008-06-10 | Nokia Corporation | Compensation of transient effects in transform coding |
ES2351935T3 (es) * | 2005-04-01 | 2011-02-14 | Qualcomm Incorporated | Procedimiento y aparato para la cuantificación vectorial de una representación de envolvente espectral. |
US8086451B2 (en) * | 2005-04-20 | 2011-12-27 | Qnx Software Systems Co. | System for improving speech intelligibility through high frequency compression |
US7813931B2 (en) * | 2005-04-20 | 2010-10-12 | QNX Software Systems, Co. | System for improving speech quality and intelligibility with bandwidth compression/expansion |
US8249861B2 (en) * | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
US8311840B2 (en) * | 2005-06-28 | 2012-11-13 | Qnx Software Systems Limited | Frequency extension of harmonic signals |
WO2007043643A1 (ja) * | 2005-10-14 | 2007-04-19 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置、音声復号装置、音声符号化方法、及び音声復号化方法 |
US7546237B2 (en) * | 2005-12-23 | 2009-06-09 | Qnx Software Systems (Wavemakers), Inc. | Bandwidth extension of narrowband speech |
EP2063418A4 (en) * | 2006-09-15 | 2010-12-15 | Panasonic Corp | AUDIO CODING DEVICE AND AUDIO CODING METHOD |
US20100017197A1 (en) * | 2006-11-02 | 2010-01-21 | Panasonic Corporation | Voice coding device, voice decoding device and their methods |
WO2008066071A1 (en) * | 2006-11-29 | 2008-06-05 | Panasonic Corporation | Decoding apparatus and audio decoding method |
CN101246688B (zh) * | 2007-02-14 | 2011-01-12 | 华为技术有限公司 | 一种对背景噪声信号进行编解码的方法、系统和装置 |
US7912729B2 (en) * | 2007-02-23 | 2011-03-22 | Qnx Software Systems Co. | High-frequency bandwidth extension in the time domain |
US8195454B2 (en) | 2007-02-26 | 2012-06-05 | Dolby Laboratories Licensing Corporation | Speech enhancement in entertainment audio |
US20080208575A1 (en) * | 2007-02-27 | 2008-08-28 | Nokia Corporation | Split-band encoding and decoding of an audio signal |
CN101790756B (zh) | 2007-08-27 | 2012-09-05 | 爱立信电话股份有限公司 | 瞬态检测器以及用于支持音频信号的编码的方法 |
CN101483495B (zh) | 2008-03-20 | 2012-02-15 | 华为技术有限公司 | 一种背景噪声生成方法以及噪声处理装置 |
JP5010743B2 (ja) * | 2008-07-11 | 2012-08-29 | フラウンホーファー−ゲゼルシャフト・ツール・フェルデルング・デル・アンゲヴァンテン・フォルシュング・アインゲトラーゲネル・フェライン | スペクトル傾斜で制御されたフレーミングを使用して帯域拡張データを計算するための装置及び方法 |
CN101751926B (zh) * | 2008-12-10 | 2012-07-04 | 华为技术有限公司 | 信号编码、解码方法及装置、编解码系统 |
US9838784B2 (en) | 2009-12-02 | 2017-12-05 | Knowles Electronics, Llc | Directional audio capture |
US8798290B1 (en) * | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
ES2683648T3 (es) * | 2010-07-02 | 2018-09-27 | Dolby International Ab | Descodificación de audio con pos-filtración selectiva |
JP5552988B2 (ja) * | 2010-09-27 | 2014-07-16 | 富士通株式会社 | 音声帯域拡張装置および音声帯域拡張方法 |
WO2012121637A1 (en) | 2011-03-04 | 2012-09-13 | Telefonaktiebolaget L M Ericsson (Publ) | Post-quantization gain correction in audio coding |
JP5596618B2 (ja) * | 2011-05-17 | 2014-09-24 | 日本電信電話株式会社 | 擬似広帯域音声信号生成装置、擬似広帯域音声信号生成方法、及びそのプログラム |
CN102800317B (zh) * | 2011-05-25 | 2014-09-17 | 华为技术有限公司 | 信号分类方法及设备、编解码方法及设备 |
CN103187065B (zh) | 2011-12-30 | 2015-12-16 | 华为技术有限公司 | 音频数据的处理方法、装置和系统 |
US9460729B2 (en) | 2012-09-21 | 2016-10-04 | Dolby Laboratories Licensing Corporation | Layered approach to spatial audio coding |
RU2633107C2 (ru) | 2012-12-21 | 2017-10-11 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Добавление комфортного шума для моделирования фонового шума при низких скоростях передачи данных |
EP2936487B1 (en) * | 2012-12-21 | 2016-06-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Generation of a comfort noise with high spectro-temporal resolution in discontinuous transmission of audio signals |
CN103928029B (zh) * | 2013-01-11 | 2017-02-08 | 华为技术有限公司 | 音频信号编码和解码方法、音频信号编码和解码装置 |
US9336789B2 (en) * | 2013-02-21 | 2016-05-10 | Qualcomm Incorporated | Systems and methods for determining an interpolation factor set for synthesizing a speech signal |
WO2014173446A1 (en) * | 2013-04-25 | 2014-10-30 | Nokia Solutions And Networks Oy | Speech transcoding in packet networks |
US9570093B2 (en) | 2013-09-09 | 2017-02-14 | Huawei Technologies Co., Ltd. | Unvoiced/voiced decision for speech processing |
AU2014336357B2 (en) * | 2013-10-18 | 2017-04-13 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Concept for encoding an audio signal and decoding an audio signal using deterministic and noise like information |
EP3058568B1 (en) * | 2013-10-18 | 2021-01-13 | Fraunhofer Gesellschaft zur Förderung der angewandten Forschung E.V. | Concept for encoding an audio signal and decoding an audio signal using speech related spectral shaping information |
EP2980790A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for comfort noise generation mode selection |
CN107210824A (zh) | 2015-01-30 | 2017-09-26 | 美商楼氏电子有限公司 | 麦克风的环境切换 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6011360B2 (ja) * | 1981-12-15 | 1985-03-25 | ケイディディ株式会社 | 音声符号化方式 |
JP2779886B2 (ja) * | 1992-10-05 | 1998-07-23 | 日本電信電話株式会社 | 広帯域音声信号復元方法 |
EP0732687B2 (en) * | 1995-03-13 | 2005-10-12 | Matsushita Electric Industrial Co., Ltd. | Apparatus for expanding speech bandwidth |
CA2185745C (en) * | 1995-09-19 | 2001-02-13 | Juin-Hwey Chen | Synthesis of speech signals in the absence of coded parameters |
KR20000047944A (ko) | 1998-12-11 | 2000-07-25 | 이데이 노부유끼 | 수신장치 및 방법과 통신장치 및 방법 |
-
2000
- 2000-10-18 US US09/691,440 patent/US6615169B1/en not_active Expired - Lifetime
-
2001
- 2001-10-17 DE DE60120734T patent/DE60120734T2/de not_active Expired - Lifetime
- 2001-10-17 JP JP2002537004A patent/JP2004512562A/ja active Pending
- 2001-10-17 EP EP01974612A patent/EP1328928B1/en not_active Expired - Lifetime
- 2001-10-17 CN CNB018175996A patent/CN1244907C/zh not_active Expired - Lifetime
- 2001-10-17 WO PCT/IB2001/001947 patent/WO2002033697A2/en active IP Right Grant
- 2001-10-17 CA CA002425926A patent/CA2425926C/en not_active Expired - Lifetime
- 2001-10-17 PT PT01974612T patent/PT1328928E/pt unknown
- 2001-10-17 BR BR0114669-6A patent/BR0114669A/pt active IP Right Grant
- 2001-10-17 ES ES01974612T patent/ES2265442T3/es not_active Expired - Lifetime
- 2001-10-17 KR KR1020037005299A patent/KR100547235B1/ko active IP Right Grant
- 2001-10-17 AU AU2001294125A patent/AU2001294125A1/en not_active Abandoned
- 2001-10-17 AT AT01974612T patent/ATE330311T1/de not_active IP Right Cessation
-
2003
- 2003-03-28 ZA ZA200302468A patent/ZA200302468B/en unknown
Also Published As
Publication number | Publication date |
---|---|
ES2265442T3 (es) | 2007-02-16 |
PT1328928E (pt) | 2006-09-29 |
WO2002033697A2 (en) | 2002-04-25 |
DE60120734T2 (de) | 2007-06-14 |
EP1328928B1 (en) | 2006-06-14 |
CA2425926A1 (en) | 2002-04-25 |
CA2425926C (en) | 2009-01-27 |
JP2004512562A (ja) | 2004-04-22 |
EP1328928A2 (en) | 2003-07-23 |
ATE330311T1 (de) | 2006-07-15 |
KR100547235B1 (ko) | 2006-01-26 |
CN1244907C (zh) | 2006-03-08 |
DE60120734D1 (de) | 2006-07-27 |
AU2001294125A1 (en) | 2002-04-29 |
WO2002033697A3 (en) | 2002-07-11 |
BR0114669A (pt) | 2004-02-17 |
KR20030046510A (ko) | 2003-06-12 |
US6615169B1 (en) | 2003-09-02 |
CN1470052A (zh) | 2004-01-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6615169B1 (en) | High frequency enhancement layer coding in wideband speech codec | |
KR100574031B1 (ko) | 음성합성방법및장치그리고음성대역확장방법및장치 | |
AU2005234181B2 (en) | Coding of audio signals | |
US6691085B1 (en) | Method and system for estimating artificial high band signal in speech codec using voice activity information | |
JP4927257B2 (ja) | 可変レートスピーチ符号化 | |
JP4390803B2 (ja) | 可変ビットレート広帯域通話符号化におけるゲイン量子化方法および装置 | |
JP4870313B2 (ja) | 可変レート音声符号器におけるフレーム消去補償方法 | |
JPH09503874A (ja) | 減少レート、可変レートの音声分析合成を実行する方法及び装置 | |
JP2009503559A (ja) | レートスケーラブル及び帯域幅スケーラブルオーディオ復号化のレートの切り替えのための方法 | |
KR20150060897A (ko) | 오디오 신호를 인코딩하기 위한 방법 및 장치 | |
US20060235685A1 (en) | Framework for voice conversion | |
JPH10207498A (ja) | マルチモード符号励振線形予測により音声入力を符号化する方法及びその符号器 | |
CN1262577A (zh) | 无线语音信道上发送数据的方法 | |
US7089180B2 (en) | Method and device for coding speech in analysis-by-synthesis speech coders | |
US6856961B2 (en) | Speech coding system with input signal transformation | |
Choudhary et al. | Study and performance of amr codecs for gsm | |
Sun et al. | Speech compression | |
JP4230550B2 (ja) | 音声符号化方法及び装置、並びに音声復号化方法及び装置 | |
BRPI0114669B1 (pt) | A method of encoding a voice, a receiver system and a transmitter of the speech signal to an encoder and decoding the input signal, an encoder, a decoder, a mobile station and a network element |