ATE214832T1 - Method and apparatus for speech enhancement in a speech communication system

Info

Publication number: ATE214832T1
Authority: AT (Austria)
Prior art keywords: speech, unit, determines, intelligible, voice
Application number: AT98932337T
Other languages: German (de), English (en)
Inventors: Robert James Chance, Ian Vince Mcloughlin
Original Assignee: Simoco Int Ltd
Priority date: 1997-07-02 (an assumption, not a legal conclusion)
Filing date: 1998-07-01
Publication date: 2002-04-15
Application filed by: Simoco Int Ltd

Classifications

    • G PHYSICS
      • G10 MUSICAL INSTRUMENTS; ACOUSTICS
        • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
          • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
            • G10L21/003 Changing voice quality, e.g. pitch or formants
              • G10L21/007 Changing voice quality, e.g. pitch or formants characterised by the process used
                • G10L21/013 Adapting to target pitch
                  • G10L2021/0135 Voice conversion or morphing
            • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
              • G10L21/0208 Noise filtering
              • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
                • G10L21/0364 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
          • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
            • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
              • G10L25/15 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being formant information
              • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
    • H ELECTRICITY
      • H04 ELECTRIC COMMUNICATION TECHNIQUE
        • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
          • H04R2225/00 Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
            • H04R2225/43 Signal processing in hearing aids to enhance the speech intelligibility

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Interconnected Communication Systems, Intercoms, And Interphones (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)
  • Mobile Radio Communication Systems (AREA)
Application AT98932337T (priority date 1997-07-02, filing date 1998-07-01): Method and apparatus for speech enhancement in a speech communication system, published as ATE214832T1 (de)

Applications Claiming Priority (2)

Application Number | Publication | Priority Date | Filing Date | Title
GBGB9714001.6A | GB9714001D0 (en) | 1997-07-02 | 1997-07-02 | Method and apparatus for speech enhancement in a speech communication system
PCT/GB1998/001936 | WO1999001863A1 (fr) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system

Publications (1)

Publication Number | Publication Date
ATE214832T1 (de) | 2002-04-15

Family

ID=10815285

Family Applications (1)

Application Number | Publication | Priority Date | Filing Date | Title
AT98932337T | ATE214832T1 (de) | 1997-07-02 | 1998-07-01 | Method and apparatus for speech enhancement in a speech communication system

Country Status (12)

Country | Link
EP (1) | EP0993670B1 (fr)
JP (1) | JP2002507291A (fr)
KR (1) | KR20010014352A (fr)
CN (1) | CN1265217A (fr)
AT (1) | ATE214832T1 (fr)
AU (1) | AU8227798A (fr)
CA (1) | CA2235455A1 (fr)
DE (1) | DE69804310D1 (fr)
GB (2) | GB9714001D0 (fr)
PL (1) | PL337717A1 (fr)
WO (1) | WO1999001863A1 (fr)
ZA (1) | ZA985607B (fr)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
SE9903553D0 (sv) * | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing perceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL)
FR2794322B1 (fr) * | 1999-05-27 | 2001-06-22 | Sagem | Noise suppression method
ATE356469T1 (de) | 1999-07-28 | 2007-03-15 | Clear Audio Ltd | Gain control of audio signals in a noisy environment using a filter bank
US6876968B2 (en) * | 2001-03-08 | 2005-04-05 | Matsushita Electric Industrial Co., Ltd. | Run time synthesizer adaptation to improve intelligibility of synthesized speech
DE10124189A1 (de) * | 2001-05-17 | 2002-11-21 | Siemens Ag | Method for signal reception
JP2003255993A (ja) * | 2002-03-04 | 2003-09-10 | Ntt Docomo Inc | Speech recognition system, speech recognition method, speech recognition program, speech synthesis system, speech synthesis method, speech synthesis program
EP1518224A2 (fr) * | 2002-06-19 | 2005-03-30 | Koninklijke Philips Electronics N.V. | Audio signal processor
US20060126859A1 (en) * | 2003-01-31 | 2006-06-15 | Claus Elberling | Sound system improving speech intelligibility
KR20050049103A (ko) * | 2003-11-21 | 2005-05-25 | 삼성전자주식회사 | Method and apparatus for dialogue enhancement using formant bands
CN101091412B (zh) * | 2004-09-07 | 2012-12-26 | 森塞尔有限公司 | Apparatus and method for sound enhancement
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments
GB2433849B (en) | 2005-12-29 | 2008-05-21 | Motorola Inc | Telecommunications terminal and method of operation of the terminal
DE102006001730A1 (de) | 2006-01-13 | 2007-07-19 | Robert Bosch Gmbh | Public address system, method for improving the speech quality and/or intelligibility of voice announcements, and computer program
EP1814109A1 (fr) * | 2006-01-27 | 2007-08-01 | Texas Instruments Incorporated | Amplification of a speech signal taking the Lombard effect into account
JP2007295347A (ja) * | 2006-04-26 | 2007-11-08 | Mitsubishi Electric Corp | Speech processing device
WO2018127263A2 (fr) * | 2017-01-03 | 2018-07-12 | Lizn Aps | Speech intelligibility enhancement system
KR101414233B1 (ko) | 2007-01-05 | 2014-07-02 | 삼성전자 주식회사 | Apparatus and method for improving the intelligibility of a speech signal
JP4926005B2 (ja) | 2007-11-13 | 2012-05-09 | ソニー・エリクソン・モバイルコミュニケーションズ株式会社 | Speech signal processing device, speech signal processing method, and communication terminal
PL2232700T3 (pl) | 2007-12-21 | 2015-01-30 | Dts Llc | System for adjusting the perceived loudness of audio signals
JP5453740B2 (ja) * | 2008-07-02 | 2014-03-26 | 富士通株式会社 | Speech enhancement device
US8538042B2 (en) | 2009-08-11 | 2013-09-17 | Dts Llc | System for increasing perceived loudness of speakers
EP2372700A1 (fr) * | 2010-03-11 | 2011-10-05 | Oticon A/S | Speech intelligibility predictor and associated applications
EP2737479B1 (fr) * | 2011-07-29 | 2017-01-18 | Dts Llc | Adaptive enhancement of speech intelligibility
CN103002105A (zh) * | 2011-09-16 | 2013-03-27 | 宏碁股份有限公司 | Mobile communication method capable of increasing the clarity of communication content
CN103297896B (zh) * | 2012-02-27 | 2016-07-06 | 联想(北京)有限公司 | Audio output method and electronic device
US9015044B2 (en) * | 2012-03-05 | 2015-04-21 | Malaspina Labs (Barbados) Inc. | Formant based speech reconstruction from noisy signals
US9312829B2 (en) | 2012-04-12 | 2016-04-12 | Dts Llc | System for adjusting loudness of audio signals in real time
EP3010017A1 (fr) * | 2014-10-14 | 2016-04-20 | Thomson Licensing | Method and apparatus for separating speech data from background data in an audio communication
JP6565206B2 (ja) * | 2015-02-20 | 2019-08-28 | ヤマハ株式会社 | Speech processing device and speech processing method
EP3107097B1 (fr) | 2015-06-17 | 2017-11-15 | Nxp B.V. | Improved speech intelligibility
US9847093B2 (en) | 2015-06-19 | 2017-12-19 | Samsung Electronics Co., Ltd. | Method and apparatus for processing speech signal
JP6790732B2 (ja) * | 2016-11-02 | 2020-11-25 | ヤマハ株式会社 | Signal processing method and signal processing device
CN108369805B (zh) * | 2017-12-27 | 2019-08-13 | 深圳前海达闼云端智能科技有限公司 | Voice interaction method, apparatus, and intelligent terminal
CN109346058A (zh) * | 2018-11-29 | 2019-02-15 | 西安交通大学 | Speech acoustic feature amplification system
US11817114B2 (en) * | 2019-12-09 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Content and environmentally aware environmental noise compensation

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JPS5870292A (ja) * | 1981-10-22 | 1983-04-26 | 日産自動車株式会社 | Speech recognition device for vehicles
US4538295A (en) * | 1982-08-16 | 1985-08-27 | Nissan Motor Company, Limited | Speech recognition system for an automotive vehicle
DE3689035T2 (de) * | 1985-07-01 | 1994-01-20 | Motorola Inc | Noise reduction system
GB8801014D0 (en) * | 1988-01-18 | 1988-02-17 | British Telecomm | Noise reduction
US5235669A (en) * | 1990-06-29 | 1993-08-10 | At&T Laboratories | Low-delay code-excited linear-predictive coding of wideband speech at 32 kbits/sec
CA2056110C (fr) * | 1991-03-27 | 1997-02-04 | Arnold I. Klayman | Device for improving intelligibility in public address systems
FI102337B1 (fi) * | 1995-09-13 | 1998-11-13 | Nokia Mobile Phones Ltd | Method and circuit arrangement for processing an audio signal
GB2306086A (en) * | 1995-10-06 | 1997-04-23 | Richard Morris Trim | Improved adaptive audio systems

Also Published As

Publication number | Publication date
GB2327835A (en) | 1999-02-03
EP0993670B1 (fr) | 2002-03-20
AU8227798A (en) | 1999-01-25
CA2235455A1 (fr) | 1999-01-02
CN1265217A (zh) | 2000-08-30
GB9814279D0 (en) | 1998-09-02
JP2002507291A (ja) | 2002-03-05
KR20010014352A (ko) | 2001-02-26
GB2327835B (en) | 2000-04-19
DE69804310D1 (de) | 2002-04-25
EP0993670A1 (fr) | 2000-04-19
GB9714001D0 (en) | 1997-09-10
WO1999001863A1 (fr) | 1999-01-14
PL337717A1 (en) | 2000-08-28
ZA985607B (en) | 2000-06-01

Similar Documents

Publication | Title
ATE214832T1 (de) | Method and apparatus for speech enhancement in a speech communication system
DE69620585D1 (de) | Method and apparatus for detecting and bypassing tandem speech coding
Servetti et al. | Perception-based partial encryption of compressed speech
SE9500321L (sv) | Method for noise suppression by spectral subtraction
JP2002014689A (ja) | Method and apparatus for improving the intelligibility of digitally compressed speech
AU2001277647A1 (en) | Method for noise robust classification in speech coding
JPH0556007A (ja) | Mixed speech signal transmission system
EP1010170A4 (fr) | Method and system for automatic text-independent pronunciation evaluation for language learning
BR9204112A (pt) | Method and apparatus for teaching languages
GB2343822A (en) | Using LSP to alter frequency characteristics of speech
El-Maleh | Classification-based Techniques for Digital Coding of Speech-plus-noise
Patwardhan et al. | Effect of voice quality on frequency-warped modeling of vowel spectra
JP3166797B2 (ja) | Speech coding method, speech decoding method, and speech coding/decoding device
Riedhammer et al. | A software kit for automatic voice descrambling
SU1674226A1 (ru) | Method for detecting speech signals and their boundaries, and device for implementing it
Brandenburg et al. | Fast signal processor encodes 48 kHz/16-bit audio into 3-bit in real time
Bunnell et al. | Speech processing program
Patwardhan et al. | Effect of voice quality on frequency-warped modeling
Suzuki et al. | 8 kbps voice transmission by SPAC
Bunnell et al. | Speech processing program
McGahan et al. | Modelling listeners' identification of concurrent vowels using a Kohonen net
JPS5853349B2 (ja) | Speech analysis and synthesis method
O'Brien et al. | Preliminary study of multilevel peak-clipped and time-quantized speech

Legal Events

Code | Title
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties