DE69804310D1 - Method and apparatus for speech enhancement in a speech communication system - Google Patents

Method and apparatus for speech enhancement in a speech communication system

Publication number
DE69804310D1
Authority
DE
Germany
Prior art keywords
speech
unit
determines
language
intelligible
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69804310T
Other languages
German (de)
English (en)
Inventor
Robert James Chance
Ian Vince Mcloughlin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Simoco International Ltd
Original Assignee
Simoco International Ltd
Application filed by Simoco International Ltd
Application granted
Publication of DE69804310D1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G10L21/007 Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013 Adapting to target pitch
    • G10L2021/0135 Voice conversion or morphing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00 Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43 Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)
DE69804310T 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system Expired - Lifetime DE69804310D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GBGB9714001.6A GB9714001D0 (en) 1997-07-02 1997-07-02 Method and apparatus for speech enhancement in a speech communication system
PCT/GB1998/001936 WO1999001863A1 (fr) 1998-07-01 Method and apparatus for improving speech quality in a speech communication system

Publications (1)

Publication Number Publication Date
DE69804310D1 (de) 2002-04-25

Family

ID=10815285

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69804310T Expired - Lifetime DE69804310D1 (de) 1997-07-02 1998-07-01 Method and apparatus for speech enhancement in a speech communication system

Country Status (12)

Country Link
EP (1) EP0993670B1 (fr)
JP (1) JP2002507291A (fr)
KR (1) KR20010014352A (fr)
CN (1) CN1265217A (fr)
AT (1) ATE214832T1 (fr)
AU (1) AU8227798A (fr)
CA (1) CA2235455A1 (fr)
DE (1) DE69804310D1 (fr)
GB (2) GB9714001D0 (fr)
PL (1) PL337717A1 (fr)
WO (1) WO1999001863A1 (fr)
ZA (1) ZA985607B (fr)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9903553D0 (sv) * 1999-01-27 1999-10-01 Lars Liljeryd Enhancing perceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (fr) * 1999-05-27 2001-06-22 Sagem Noise suppression method
ATE356469T1 (de) 1999-07-28 2007-03-15 Clear Audio Ltd Gain control of audio signals in noisy environments using a filter bank
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
DE10124189A1 (de) * 2001-05-17 2002-11-21 Siemens Ag Method for signal reception
JP2003255993A (ja) * 2002-03-04 2003-09-10 Ntt Docomo Inc Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, and speech synthesis program
EP1518224A2 (fr) * 2002-06-19 2005-03-30 Koninklijke Philips Electronics N.V. Audio signal processor
US20060126859A1 (en) * 2003-01-31 2006-06-15 Claus Elberling Sound system improving speech intelligibility
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 Method and apparatus for dialog enhancement using formant bands
CN101091412B (zh) * 2004-09-07 2012-12-26 森塞尔有限公司 Apparatus and method for sound enhancement
US8280730B2 (en) 2005-05-25 2012-10-02 Motorola Mobility Llc Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) 2005-12-29 2008-05-21 Motorola Inc Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (de) 2006-01-13 2007-07-19 Robert Bosch Gmbh Public address system, method for improving the speech quality and/or intelligibility of voice announcements, and computer program
EP1814109A1 (fr) * 2006-01-27 2007-08-01 Texas Instruments Incorporated Amplification of a speech signal taking the Lombard effect into account
JP2007295347A (ja) * 2006-04-26 2007-11-08 Mitsubishi Electric Corp Speech processing device
WO2018127263A2 (fr) * 2017-01-03 2018-07-12 Lizn Aps Speech intelligibility enhancement system
KR101414233B1 (ko) 2007-01-05 2014-07-02 삼성전자 주식회사 Apparatus and method for improving the intelligibility of a speech signal
JP4926005B2 (ja) 2007-11-13 2012-05-09 ソニー・エリクソン・モバイルコミュニケーションズ株式会社 Audio signal processing device, audio signal processing method, and communication terminal
PL2232700T3 (pl) 2007-12-21 2015-01-30 Dts Llc System for adjusting the perceived loudness of audio signals
JP5453740B2 (ja) * 2008-07-02 2014-03-26 富士通株式会社 Speech enhancement device
US8538042B2 (en) 2009-08-11 2013-09-17 Dts Llc System for increasing perceived loudness of speakers
EP2372700A1 (fr) * 2010-03-11 2011-10-05 Oticon A/S Speech intelligibility predictor and associated applications
EP2737479B1 (fr) * 2011-07-29 2017-01-18 Dts Llc Adaptive enhancement of voice intelligibility
CN103002105A (zh) * 2011-09-16 2013-03-27 宏碁股份有限公司 Mobile communication method capable of increasing the clarity of communication content
CN103297896B (zh) * 2012-02-27 2016-07-06 联想(北京)有限公司 Audio output method and electronic device
US9015044B2 (en) * 2012-03-05 2015-04-21 Malaspina Labs (Barbados) Inc. Formant based speech reconstruction from noisy signals
US9312829B2 (en) 2012-04-12 2016-04-12 Dts Llc System for adjusting loudness of audio signals in real time
EP3010017A1 (fr) * 2014-10-14 2016-04-20 Thomson Licensing Method and apparatus for separating speech data from background data in an audio communication
JP6565206B2 (ja) * 2015-02-20 2019-08-28 ヤマハ株式会社 Audio processing device and audio processing method
EP3107097B1 (fr) 2015-06-17 2017-11-15 Nxp B.V. Improved speech intelligibility
US9847093B2 (en) 2015-06-19 2017-12-19 Samsung Electronics Co., Ltd. Method and apparatus for processing speech signal
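JP6790732B2 (ja) * 2016-11-02 2020-11-25 ヤマハ株式会社 Signal processing method and signal processing device
CN108369805B (zh) * 2017-12-27 2019-08-13 深圳前海达闼云端智能科技有限公司 Voice interaction method, apparatus, and intelligent terminal
CN109346058A (zh) * 2018-11-29 2019-02-15 西安交通大学 Speech acoustic feature expansion system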
US11817114B2 (en) * 2019-12-09 2023-11-14 Dolby Laboratories Licensing Corporation Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5870292A (ja) * 1981-10-22 1983-04-26 日産自動車株式会社 Speech recognition device for vehicles
US4538295A (en) * 1982-08-16 1985-08-27 Nissan Motor Company, Limited Speech recognition system for an automotive vehicle
DE3689035T2 (de) * 1985-07-01 1994-01-20 Motorola Inc Noise reduction system.
GB8801014D0 (en) * 1988-01-18 1988-02-17 British Telecomm Noise reduction
US5235669A (en) * 1990-06-29 1993-08-10 At&T Laboratories Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (fr) * 1991-03-27 1997-02-04 Arnold I. Klayman Device for improving intelligibility in public address systems
FI102337B1 (fi) * 1995-09-13 1998-11-13 Nokia Mobile Phones Ltd Method and circuit arrangement for processing an audio signal
GB2306086A (en) * 1995-10-06 1997-04-23 Richard Morris Trim Improved adaptive audio systems

Also Published As

Publication number Publication date
GB2327835A (en) 1999-02-03
EP0993670B1 (fr) 2002-03-20
AU8227798A (en) 1999-01-25
CA2235455A1 (fr) 1999-01-02
CN1265217A (zh) 2000-08-30
GB9814279D0 (en) 1998-09-02
JP2002507291A (ja) 2002-03-05
ATE214832T1 (de) 2002-04-15
KR20010014352A (ko) 2001-02-26
GB2327835B (en) 2000-04-19
EP0993670A1 (fr) 2000-04-19
GB9714001D0 (en) 1997-09-10
WO1999001863A1 (fr) 1999-01-14
PL337717A1 (en) 2000-08-28
ZA985607B (en) 2000-06-01

Similar Documents

Publication Publication Date Title
DE69804310D1 (de) Method and apparatus for speech enhancement in a speech communication system
DE69620585T2 (de) Method and apparatus for detecting and bypassing tandem speech coding
JP2002014689A (ja) Method and apparatus for improving the intelligibility of digitally compressed speech
AU2001277647A1 (en) Method for noise robust classification in speech coding
EP1010170A4 (fr) Method and system for automatic text-independent pronunciation assessment for language learning
BR9204112A (pt) Method and apparatus for language teaching
JPH0556007A (ja) Mixed speech signal transmission system
Tchorz et al. Estimation of the signal-to-noise ratio with amplitude modulation spectrograms
GB2343822A (en) Using LSP to alter frequency characteristics of speech
El-Maleh Classification-based Techniques for Digital Coding of Speech-plus-noise
ATE336778T1 (de) Method and apparatus for mitigating transmission errors in a distributed speech recognition method and system
JP3166797B2 (ja) Speech encoding method, speech decoding method, and speech codec apparatus
Riedhammer et al. A software kit for automatic voice descrambling
CN117746831A (zh) Method and system for emotion-controllable speech synthesis for a specific person under few-shot conditions
Brandenburg et al. Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time
Bunnell et al. Speech processing program
Patwardhan et al. Effect of voice quality on frequency-warped modeling
Bunnell et al. Speech processing program
Patwardhan et al. Frequency Warped All-Pole Modeling of Vowel Spectra: Dependence on Voice and Vowel Quality
JPS5853349B2 (ja) Speech analysis and synthesis method
JPS61296399A (ja) Speech analysis and synthesis system

Legal Events

Date Code Title Description
8332 No legal effect for de