CA2235455A1 - Method and apparatus for speech enhancement in a speech communication system - Google Patents

Method and apparatus for speech enhancement in a speech communication system Download PDF

Info

Publication number
CA2235455A1
CA2235455A1 CA002235455A CA2235455A CA2235455A1 CA 2235455 A1 CA2235455 A1 CA 2235455A1 CA 002235455 A CA002235455 A CA 002235455A CA 2235455 A CA2235455 A CA 2235455A CA 2235455 A1 CA2235455 A1 CA 2235455A1
Authority
CA
Canada
Prior art keywords
speech
frequency
output
amplitude
altering
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002235455A
Other languages
English (en)
French (fr)
Inventor
Robert James Chance
Ian Vince Mcloughlin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Simoco International Ltd
Original Assignee
Simoco International Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Simoco International Ltd filed Critical Simoco International Ltd
Publication of CA2235455A1 publication Critical patent/CA2235455A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)
CA002235455A 1997-07-02 1998-04-21 Method and apparatus for speech enhancement in a speech communication system Abandoned CA2235455A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB9714001.6A GB9714001D0 (en) 1997-07-02 1997-07-02 Method and apparatus for speech enhancement in a speech communication system
GB9714001.6 1997-07-02

Publications (1)

Publication Number Publication Date
CA2235455A1 true CA2235455A1 (en) 1999-01-02

Family

ID=10815285

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002235455A Abandoned CA2235455A1 (en) 1997-07-02 1998-04-21 Method and apparatus for speech enhancement in a speech communication system

Country Status (12)

Country Link
EP (1) EP0993670B1 (ko)
JP (1) JP2002507291A (ko)
KR (1) KR20010014352A (ko)
CN (1) CN1265217A (ko)
AT (1) ATE214832T1 (ko)
AU (1) AU8227798A (ko)
CA (1) CA2235455A1 (ko)
DE (1) DE69804310D1 (ko)
GB (2) GB9714001D0 (ko)
PL (1) PL337717A1 (ko)
WO (1) WO1999001863A1 (ko)
ZA (1) ZA985607B (ko)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9903553D0 (sv) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing percepptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (fr) * 1999-05-27 2001-06-22 Sagem Procede de suppression de bruit
EP1210765B1 (en) 1999-07-28 2007-03-07 Clear Audio Ltd. Filter banked gain control of audio in a noisy environment
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
DE10124189A1 (de) * 2001-05-17 2002-11-21 Siemens Ag Verfahren zum Signalempfang
JP2003255993A (ja) * 2002-03-04 2003-09-10 Ntt Docomo Inc 音声認識システム、音声認識方法、音声認識プログラム、音声合成システム、音声合成方法、音声合成プログラム
JP2005530213A (ja) * 2002-06-19 2005-10-06 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 音声信号処理装置
WO2004068467A1 (en) * 2003-01-31 2004-08-12 Oticon A/S Sound system improving speech intelligibility
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 포만트 대역을 이용한 다이얼로그 인핸싱 방법 및 장치
WO2006026812A2 (en) * 2004-09-07 2006-03-16 Sensear Pty Ltd Apparatus and method for sound enhancement
US8280730B2 (en) 2005-05-25 2012-10-02 Motorola Mobility Llc Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) 2005-12-29 2008-05-21 Motorola Inc Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (de) 2006-01-13 2007-07-19 Robert Bosch Gmbh Beschallungsanlage, Verfahren zur Verbesserung der Sprachqualität und/oder Verständlichkeit von Sprachdurchsagen sowie Computerprogramm
EP1814109A1 (en) * 2006-01-27 2007-08-01 Texas Instruments Incorporated Voice amplification apparatus for modelling the Lombard effect
JP2007295347A (ja) * 2006-04-26 2007-11-08 Mitsubishi Electric Corp 音声処理装置
WO2018127263A2 (en) * 2017-01-03 2018-07-12 Lizn Aps Speech intelligibility enhancing system
KR101414233B1 (ko) 2007-01-05 2014-07-02 삼성전자 주식회사 음성 신호의 명료도를 향상시키는 장치 및 방법
JP4926005B2 (ja) 2007-11-13 2012-05-09 ソニー・エリクソン・モバイルコミュニケーションズ株式会社 音声信号処理装置及び音声信号処理方法、通信端末
WO2009086174A1 (en) 2007-12-21 2009-07-09 Srs Labs, Inc. System for adjusting perceived loudness of audio signals
JP5453740B2 (ja) * 2008-07-02 2014-03-26 富士通株式会社 音声強調装置
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2372700A1 (en) * 2010-03-11 2011-10-05 Oticon A/S A speech intelligibility predictor and applications thereof
PL2737479T3 (pl) 2011-07-29 2017-07-31 Dts Llc Adaptacyjna poprawa zrozumiałości głosu
CN103002105A (zh) * 2011-09-16 2013-03-27 宏碁股份有限公司 可增加通讯内容清晰度的移动通讯方法
CN103297896B (zh) * 2012-02-27 2016-07-06 联想(北京)有限公司 一种音频输出方法及电子设备
US9015044B2 (en) 2012-03-05 2015-04-21 Malaspina Labs (Barbados) Inc. Formant based speech reconstruction from noisy signals
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
EP3010017A1 (en) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in audio communication
JP6565206B2 (ja) * 2015-02-20 2019-08-28 ヤマハ株式会社 音声処理装置および音声処理方法
EP3107097B1 (en) 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligilibility
US9847093B2 (en) 2015-06-19 2017-12-19 Samsung Electronics Co., Ltd. Method and apparatus for processing speech signal
JP6790732B2 (ja) * 2016-11-02 2020-11-25 ヤマハ株式会社 信号処理方法、および信号処理装置
CN108369805B (zh) * 2017-12-27 2019-08-13 深圳前海达闼云端智能科技有限公司 一种语音交互方法、装置和智能终端
CN109346058B (zh) * 2018-11-29 2024-06-28 西安交通大学 一种语音声学特征扩大系统
US11817114B2 (en) * 2019-12-09 2023-11-14 Dolby Laboratories Licensing Corporation Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5870292A (ja) * 1981-10-22 1983-04-26 日産自動車株式会社 車両用音声認識装置
US4538295A (en) * 1982-08-16 1985-08-27 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
WO1987000366A1 (en) * 1985-07-01 1987-01-15 Motorola, Inc. Noise supression system
GB8801014D0 (en) * 1988-01-18 1988-02-17 British Telecomm Noise reduction
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (en) * 1991-03-27 1997-02-04 Arnold I. Klayman Public address intelligibility system
FI102337B1 (fi) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Menetelmä ja piirijärjestely audiosignaalin käsittelemiseksi
GB2306086A (en) * 1995-10-06 1997-04-23 Richard Morris Trim Improved adaptive audio systems

Also Published As

Publication number Publication date
GB9714001D0 (en) 1997-09-10
CN1265217A (zh) 2000-08-30
ZA985607B (en) 2000-06-01
GB2327835A (en) 1999-02-03
GB2327835B (en) 2000-04-19
PL337717A1 (en) 2000-08-28
WO1999001863A1 (en) 1999-01-14
GB9814279D0 (en) 1998-09-02
EP0993670A1 (en) 2000-04-19
JP2002507291A (ja) 2002-03-05
KR20010014352A (ko) 2001-02-26
EP0993670B1 (en) 2002-03-20
ATE214832T1 (de) 2002-04-15
AU8227798A (en) 1999-01-25
DE69804310D1 (de) 2002-04-25

Similar Documents

Publication Publication Date Title
EP0993670B1 (en) Method and apparatus for speech enhancement in a speech communication system
US10885926B2 (en) Classification between time-domain coding and frequency domain coding for high bit rates
US8265940B2 (en) Method and device for the artificial extension of the bandwidth of speech signals
KR100574031B1 (ko) 음성합성방법및장치그리고음성대역확장방법및장치
KR102105044B1 (ko) 낮은 레이트의 씨이엘피 디코더의 비 음성 콘텐츠의 개선
JP4040126B2 (ja) 音声復号化方法および装置
US5706392A (en) Perceptual speech coder and method
KR100216018B1 (ko) 배경음을 엔코딩 및 디코딩하는 방법 및 장치
JP2010520503A (ja) 通信ネットワークにおける方法及び装置
GB2343822A (en) Using LSP to alter frequency characteristics of speech
Vicente-Peña et al. Band-pass filtering of the time sequences of spectral parameters for robust wireless speech recognition
Sun et al. Speech compression
Motlicek et al. Wide-band audio coding based on frequency-domain linear prediction
Ekeroth Improvements of the voice activity detector in AMR-WB
McLoughlin CELP and speech enhancement
Kwon Improved Excitation Modeling for Low-Rate CELP Speech Coding
Hennix Decoder based noise suppression
Chen Adaptive variable bit-rate speech coder for wireless applications
Mermelstein et al. INR

Legal Events

Date Code Title Description
EEER Examination request
FZDE Discontinued