ZA985607B - Method and apparatus for speech enhancement in a speech communication system. - Google Patents

Method and apparatus for speech enhancement in a speech communication system.

Info

Publication number
ZA985607B
ZA985607B ZA9805607A ZA985607A ZA985607B ZA 985607 B ZA985607 B ZA 985607B ZA 9805607 A ZA9805607 A ZA 9805607A ZA 985607 A ZA985607 A ZA 985607A ZA 985607 B ZA985607 B ZA 985607B
Authority
ZA
South Africa
Prior art keywords
speech
unit
determines
intelligible
listener
Prior art date
Application number
ZA9805607A
Other languages
English (en)
Inventor
Robert James Chance
Ian Vince Mcloughlin
Original Assignee
Simoco Int Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Simoco Int Ltd filed Critical Simoco Int Ltd
Publication of ZA985607B publication Critical patent/ZA985607B/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • G10L2021/0135Voice conversion or morphing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)
ZA9805607A 1997-07-02 1998-06-26 Method and apparatus for speech enhancement in a speech communication system. ZA985607B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB9714001.6A GB9714001D0 (en) 1997-07-02 1997-07-02 Method and apparatus for speech enhancement in a speech communication system

Publications (1)

Publication Number Publication Date
ZA985607B true ZA985607B (en) 2000-06-01

Family

ID=10815285

Family Applications (1)

Application Number Title Priority Date Filing Date
ZA9805607A ZA985607B (en) 1997-07-02 1998-06-26 Method and apparatus for speech enhancement in a speech communication system.

Country Status (12)

Country Link
EP (1) EP0993670B1 (ko)
JP (1) JP2002507291A (ko)
KR (1) KR20010014352A (ko)
CN (1) CN1265217A (ko)
AT (1) ATE214832T1 (ko)
AU (1) AU8227798A (ko)
CA (1) CA2235455A1 (ko)
DE (1) DE69804310D1 (ko)
GB (2) GB9714001D0 (ko)
PL (1) PL337717A1 (ko)
WO (1) WO1999001863A1 (ko)
ZA (1) ZA985607B (ko)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9903553D0 (sv) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing percepptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (fr) * 1999-05-27 2001-06-22 Sagem Procede de suppression de bruit
EP1210765B1 (en) 1999-07-28 2007-03-07 Clear Audio Ltd. Filter banked gain control of audio in a noisy environment
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
DE10124189A1 (de) * 2001-05-17 2002-11-21 Siemens Ag Verfahren zum Signalempfang
JP2003255993A (ja) * 2002-03-04 2003-09-10 Ntt Docomo Inc 音声認識システム、音声認識方法、音声認識プログラム、音声合成システム、音声合成方法、音声合成プログラム
JP2005530213A (ja) * 2002-06-19 2005-10-06 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 音声信号処理装置
EP1609134A1 (en) * 2003-01-31 2005-12-28 Oticon A/S Sound system improving speech intelligibility
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 포만트 대역을 이용한 다이얼로그 인핸싱 방법 및 장치
EA011361B1 (ru) * 2004-09-07 2009-02-27 Сенсир Пти Лтд. Аппарат и способ усиления звука
US8280730B2 (en) 2005-05-25 2012-10-02 Motorola Mobility Llc Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) 2005-12-29 2008-05-21 Motorola Inc Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (de) 2006-01-13 2007-07-19 Robert Bosch Gmbh Beschallungsanlage, Verfahren zur Verbesserung der Sprachqualität und/oder Verständlichkeit von Sprachdurchsagen sowie Computerprogramm
EP1814109A1 (en) * 2006-01-27 2007-08-01 Texas Instruments Incorporated Voice amplification apparatus for modelling the Lombard effect
JP2007295347A (ja) * 2006-04-26 2007-11-08 Mitsubishi Electric Corp 音声処理装置
KR101414233B1 (ko) 2007-01-05 2014-07-02 삼성전자 주식회사 음성 신호의 명료도를 향상시키는 장치 및 방법
JP4926005B2 (ja) * 2007-11-13 2012-05-09 ソニー・エリクソン・モバイルコミュニケーションズ株式会社 音声信号処理装置及び音声信号処理方法、通信端末
CN102017402B (zh) 2007-12-21 2015-01-07 Dts有限责任公司 用于调节音频信号的感知响度的系统
JP5453740B2 (ja) * 2008-07-02 2014-03-26 富士通株式会社 音声強調装置
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2372700A1 (en) * 2010-03-11 2011-10-05 Oticon A/S A speech intelligibility predictor and applications thereof
JP6147744B2 (ja) * 2011-07-29 2017-06-14 ディーティーエス・エルエルシーDts Llc 適応音声了解度処理システムおよび方法
CN103002105A (zh) * 2011-09-16 2013-03-27 宏碁股份有限公司 可增加通讯内容清晰度的移动通讯方法
CN103297896B (zh) * 2012-02-27 2016-07-06 联想(北京)有限公司 一种音频输出方法及电子设备
US9020818B2 (en) 2012-03-05 2015-04-28 Malaspina Labs (Barbados) Inc. Format based speech reconstruction from noisy signals
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
EP3010017A1 (en) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in audio communication
JP6565206B2 (ja) * 2015-02-20 2019-08-28 ヤマハ株式会社 音声処理装置および音声処理方法
EP3107097B1 (en) 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligilibility
US9847093B2 (en) 2015-06-19 2017-12-19 Samsung Electronics Co., Ltd. Method and apparatus for processing speech signal
JP6790732B2 (ja) * 2016-11-02 2020-11-25 ヤマハ株式会社 信号処理方法、および信号処理装置
DK3566469T3 (da) * 2017-01-03 2020-06-29 Lizn Aps Taleforståelighedsforstærkende system
CN108369805B (zh) * 2017-12-27 2019-08-13 深圳前海达闼云端智能科技有限公司 一种语音交互方法、装置和智能终端
CN109346058B (zh) * 2018-11-29 2024-06-28 西安交通大学 一种语音声学特征扩大系统
US11817114B2 (en) * 2019-12-09 2023-11-14 Dolby Laboratories Licensing Corporation Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5870292A (ja) * 1981-10-22 1983-04-26 日産自動車株式会社 車両用音声認識装置
US4538295A (en) * 1982-08-16 1985-08-27 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
EP0226613B1 (en) * 1985-07-01 1993-09-15 Motorola, Inc. Noise supression system
GB8801014D0 (en) * 1988-01-18 1988-02-17 British Telecomm Noise reduction
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (en) * 1991-03-27 1997-02-04 Arnold I. Klayman Public address intelligibility system
FI102337B1 (fi) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Menetelmä ja piirijärjestely audiosignaalin käsittelemiseksi
GB2306086A (en) * 1995-10-06 1997-04-23 Richard Morris Trim Improved adaptive audio systems

Also Published As

Publication number Publication date
CN1265217A (zh) 2000-08-30
CA2235455A1 (en) 1999-01-02
GB9814279D0 (en) 1998-09-02
PL337717A1 (en) 2000-08-28
GB2327835B (en) 2000-04-19
WO1999001863A1 (en) 1999-01-14
GB2327835A (en) 1999-02-03
DE69804310D1 (de) 2002-04-25
KR20010014352A (ko) 2001-02-26
EP0993670A1 (en) 2000-04-19
JP2002507291A (ja) 2002-03-05
EP0993670B1 (en) 2002-03-20
AU8227798A (en) 1999-01-25
ATE214832T1 (de) 2002-04-15
GB9714001D0 (en) 1997-09-10

Similar Documents

Publication Publication Date Title
GB2327835B (en) Method and apparatus for speech enhancement in a speech communication system
HK1003399A1 (en) Method and apparatus for detection and bypass of tandem vocoding
RU2146394C1 (ru) Способ и устройство вокодирования переменной скорости при пониженной скорости кодирования
EP0785541B1 (en) Usage of voice activity detection for efficient coding of speech
WO1998057436A3 (en) Source coding enhancement using spectral-band replication
SE9500321L (sv) Förfarande för bullerundertryckning genom spektral subtraktion
MX9600920A (es) Metodo y aparato para seleccionar una proporcion de codificacion en un vocodificador de proporcion variable.
MX9602391A (es) Metodo y aparato para reproducir señales de conversacion y metodo para transmitirlas.
JPH1097296A (ja) 音声符号化方法および装置、音声復号化方法および装置
JPH0748695B2 (ja) 音声符号化方式
JPH0556007A (ja) 混合音声信号伝送方式
AU1324592A (en) Method and apparatus for the teaching of languages
Paul A 500-800 bps adaptive vector quantization vocoder using a perceptually motivated distance measure
US7643991B2 (en) Speech enhancement for electronic voiced messages
Whitmal et al. Wavelet-based noise reduction
GB2343822A (en) Using LSP to alter frequency characteristics of speech
JP3166797B2 (ja) 音声符号化法及び音声復号化法並びに音声符復号化装置
Cox Current methods of speech coding
Brandenburg et al. Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time
Gan et al. Implementation of silence compression scheme for G. 723.1 speech coder using TI TMS320C51 DSP chip
Riedhammer et al. A software kit for automatic voice descrambling
Bertrand Secure narrowband digital conferencing
JPS5853349B2 (ja) 音声分析合成方法
Bunnell et al. Speech processing program
Okazaki et al. Implementation of a 4.8 kbps voice codec based on pitch-synchronous DFT coding