ATE214832T1 - Method and apparatus for speech enhancement in a speech communication system

Method and apparatus for speech enhancement in a speech communication system (Verfahren und Vorrichtung zur Sprachverbesserung in einem Sprachübertragungssystem)

Info

Publication number
ATE214832T1
Authority
AT
Austria
Prior art keywords
speech
unit
determines
intelligible
voice
Prior art date
Application number
AT98932337T
Other languages
German (de)
English (en)
Inventor
Robert James Chance
Ian Vince Mcloughlin
Original Assignee
Simoco Int Ltd
Priority date
1997-07-02
Filing date
1998-07-01
Publication date
2002-04-15
Application filed by Simoco Int Ltd
Application granted
Publication of ATE214832T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G10L21/007 Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013 Adapting to target pitch
    • G10L2021/0135 Voice conversion or morphing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00 Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43 Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Telephone Function (AREA)
AT98932337T 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system ATE214832T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB9714001.6A GB9714001D0 (en) 1997-07-02 1997-07-02 Method and apparatus for speech enhancement in a speech communication system
PCT/GB1998/001936 WO1999001863A1 (en) 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system

Publications (1)

Publication Number Publication Date
ATE214832T1 (de) 2002-04-15

Family

ID=10815285

Family Applications (1)

Application Number Title Priority Date Filing Date
AT98932337T ATE214832T1 (de) 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system

Country Status (12)

Country Link
EP (1) EP0993670B1 (en)
JP (1) JP2002507291A (ja)
KR (1) KR20010014352A (ko)
CN (1) CN1265217A (zh)
AT (1) ATE214832T1 (de)
AU (1) AU8227798A (en)
CA (1) CA2235455A1 (en)
DE (1) DE69804310D1 (de)
GB (2) GB9714001D0 (en)
PL (1) PL337717A1 (en)
WO (1) WO1999001863A1 (en)
ZA (1) ZA985607B (en)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9903553D0 (sv) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing perceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (fr) * 1999-05-27 2001-06-22 Sagem Noise suppression method
US7120579B1 (en) 1999-07-28 2006-10-10 Clear Audio Ltd. Filter banked gain control of audio in a noisy environment
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
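DE10124189A1 (de) * 2001-05-17 2002-11-21 Siemens Ag Method for signal reception
JP2003255993A (ja) * 2002-03-04 2003-09-10 Ntt Docomo Inc Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program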
WO2004002028A2 (en) * 2002-06-19 2003-12-31 Koninklijke Philips Electronics N.V. Audio signal processing apparatus and method
US20060126859A1 (en) * 2003-01-31 2006-06-15 Claus Elberling Sound system improving speech intelligibility
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 Dialog enhancing method and apparatus using formant bands
KR101215944B1 (ko) * 2004-09-07 2012-12-27 센시어 피티와이 엘티디 Hearing protector and sound enhancement method
US8280730B2 (en) 2005-05-25 2012-10-02 Motorola Mobility Llc Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) 2005-12-29 2008-05-21 Motorola Inc Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (de) 2006-01-13 2007-07-19 Robert Bosch Gmbh Public address system, method for improving the speech quality and/or intelligibility of spoken announcements, and computer program
EP1814109A1 (en) * 2006-01-27 2007-08-01 Texas Instruments Incorporated Voice amplification apparatus for modelling the Lombard effect
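JP2007295347A (ja) * 2006-04-26 2007-11-08 Mitsubishi Electric Corp Speech processing device
KR101414233B1 (ko) 2007-01-05 2014-07-02 삼성전자 주식회사 Apparatus and method for improving the intelligibility of a speech signal
JP4926005B2 (ja) * 2007-11-13 2012-05-09 ソニー・エリクソン・モバイルコミュニケーションズ株式会社 Audio signal processing device, audio signal processing method, and communication terminal
CN102017402B (zh) 2007-12-21 2015-01-07 Dts有限责任公司 System for adjusting the perceived loudness of audio signals
JP5453740B2 (ja) * 2008-07-02 2014-03-26 富士通株式会社 Speech enhancement device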
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2372700A1 (en) * 2010-03-11 2011-10-05 Oticon A/S A speech intelligibility predictor and applications thereof
US9117455B2 (en) 2011-07-29 2015-08-25 Dts Llc Adaptive voice intelligibility processor
CN103002105A (zh) * 2011-09-16 2013-03-27 宏碁股份有限公司 Mobile communication method capable of increasing the clarity of communication content
CN103297896B (zh) * 2012-02-27 2016-07-06 联想(北京)有限公司 Audio output method and electronic device
US9015044B2 (en) 2012-03-05 2015-04-21 Malaspina Labs (Barbados) Inc. Formant based speech reconstruction from noisy signals
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
EP3010017A1 (en) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in audio communication
JP6565206B2 (ja) * 2015-02-20 2019-08-28 ヤマハ株式会社 Speech processing device and speech processing method
EP3107097B1 (en) 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligibility
US9847093B2 (en) 2015-06-19 2017-12-19 Samsung Electronics Co., Ltd. Method and apparatus for processing speech signal
JP6790732B2 (ja) * 2016-11-02 2020-11-25 ヤマハ株式会社 Signal processing method and signal processing device
EP3566469B1 (en) 2017-01-03 2020-04-01 Lizn APS Speech intelligibility enhancing system
WO2019127112A1 (zh) * 2017-12-27 2019-07-04 深圳前海达闼云端智能科技有限公司 Voice interaction method, apparatus, and intelligent terminal
CN109346058B (zh) * 2018-11-29 2024-06-28 西安交通大学 Speech acoustic feature amplification system
US11817114B2 (en) * 2019-12-09 2023-11-14 Dolby Laboratories Licensing Corporation Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5870292A (ja) * 1981-10-22 1983-04-26 日産自動車株式会社 Speech recognition device for vehicles
US4538295A (en) * 1982-08-16 1985-08-27 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
KR940009391B1 (ko) * 1985-07-01 1994-10-07 모토로라 인코포레이티드 Noise suppression system
GB8801014D0 (en) * 1988-01-18 1988-02-17 British Telecomm Noise reduction
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (en) * 1991-03-27 1997-02-04 Arnold I. Klayman Public address intelligibility system
FI102337B (fi) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Method and circuit arrangement for processing an audio signal
GB2306086A (en) * 1995-10-06 1997-04-23 Richard Morris Trim Improved adaptive audio systems

Also Published As

Publication number Publication date
GB9814279D0 (en) 1998-09-02
JP2002507291A (ja) 2002-03-05
CA2235455A1 (en) 1999-01-02
DE69804310D1 (de) 2002-04-25
EP0993670B1 (en) 2002-03-20
ZA985607B (en) 2000-06-01
GB9714001D0 (en) 1997-09-10
GB2327835B (en) 2000-04-19
PL337717A1 (en) 2000-08-28
CN1265217A (zh) 2000-08-30
GB2327835A (en) 1999-02-03
WO1999001863A1 (en) 1999-01-14
KR20010014352A (ko) 2001-02-26
EP0993670A1 (en) 2000-04-19
AU8227798A (en) 1999-01-25

Similar Documents

Publication Publication Date Title
ATE214832T1 (de) Method and apparatus for speech enhancement in a speech communication system
ATE216173T1 (de) Method and apparatus for detecting and bypassing tandem speech coding
MX9600920A (es) Method and apparatus for selecting an encoding rate in a variable rate vocoder.
SE9500321L (sv) Method for noise suppression by spectral subtraction
JP2002014689A (ja) Method and apparatus for improving the intelligibility of digitally compressed speech
AU2001277647A1 (en) Method for noise robust classification in speech coding
Espy-Wilson et al. Enhancement of alaryngeal speech by adaptive filtering
JPH0556007A (ja) Mixed speech signal transmission system
EP1010170A4 (en) METHOD AND SYSTEM FOR AUTOMATIC EVALUATION OF INDEPENDENT TEXT PRONUNCIATION FOR LANGUAGE LEARNING
BR9204112A (pt) Method and apparatus for language teaching
GB2343822A (en) Using LSP to alter frequency characteristics of speech
El-Maleh Classification-based Techniques for Digital Coding of Speech-plus-noise
Patwardhan et al. Effect of voice quality on frequency-warped modeling of vowel spectra
JP3166797B2 (ja) Speech encoding method, speech decoding method, and speech encoding/decoding apparatus
Cox Current methods of speech coding
SU1674226A1 (ru) Method for detecting speech signals and their boundaries, and device for implementing it
Riedhammer et al. A software kit for automatic voice descrambling
Brandenburg et al. Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time
Liu Audio watermarking through parametric synthesis models
Patwardhan et al. Effect of voice quality on frequency-warped modeling
Suzuki et al. 8 kbps voice transmission by SPAC
Bunnell et al. Speech processing program
McGahan et al. Modelling listeners’ identification of concurrent vowels using a Kohonen net
JPS5853349B2 (ja) Speech analysis and synthesis method
Bertrand Secure narrowband digital conferencing

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties