ATE282235T1 - Robust features for the recognition of noisy speech signals - Google Patents
Robust features for the recognition of noisy speech signals
Info
- Publication number
- ATE282235T1
- Authority
- AT
- Austria
- Prior art keywords
- vectors
- voice signals
- speech
- robust features
- detecting noise
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Machine Translation (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Noise Elimination (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP00870094A EP1152399A1 (de) | 2000-05-04 | 2000-05-04 | Subband speech processing with neural networks |
| PCT/BE2001/000072 WO2001084537A1 (fr) | 2000-05-04 | 2001-04-25 | Robust parameters for noisy speech recognition |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ATE282235T1 (de) | 2004-11-15 |
Family
ID=8175744
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AT01925226T ATE282235T1 (de) | 2000-05-04 | 2001-04-25 | Robust features for the recognition of noisy speech signals |
Country Status (8)
| Country | Link |
|---|---|
| US (1) | US7212965B2 (de) |
| EP (2) | EP1152399A1 (de) |
| JP (1) | JP2003532162A (de) |
| AT (1) | ATE282235T1 (de) |
| AU (1) | AU776919B2 (de) |
| CA (1) | CA2404441C (de) |
| DE (1) | DE60107072T2 (de) |
| WO (1) | WO2001084537A1 (de) |
Families Citing this family (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP1416472A1 (de) * | 2002-10-30 | 2004-05-06 | Swisscom AG | Bandwidth-dependent speech recognition system |
| US7620546B2 (en) * | 2004-03-23 | 2009-11-17 | Qnx Software Systems (Wavemakers), Inc. | Isolating speech signals utilizing neural networks |
| US20060206320A1 (en) * | 2005-03-14 | 2006-09-14 | Li Qi P | Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers |
| US20070239444A1 (en) * | 2006-03-29 | 2007-10-11 | Motorola, Inc. | Voice signal perturbation for speech recognition |
| US8386125B2 (en) * | 2006-11-22 | 2013-02-26 | General Motors Llc | Adaptive communication between a vehicle telematics unit and a call center based on acoustic conditions |
| CN101996628A (zh) * | 2009-08-21 | 2011-03-30 | 索尼株式会社 | Method and apparatus for extracting prosodic features of a speech signal |
| US8972256B2 (en) | 2011-10-17 | 2015-03-03 | Nuance Communications, Inc. | System and method for dynamic noise adaptation for robust automatic speech recognition |
| US9934780B2 (en) | 2012-01-17 | 2018-04-03 | GM Global Technology Operations LLC | Method and system for using sound related vehicle information to enhance spoken dialogue by modifying dialogue's prompt pitch |
| US9263040B2 (en) | 2012-01-17 | 2016-02-16 | GM Global Technology Operations LLC | Method and system for using sound related vehicle information to enhance speech recognition |
| US9418674B2 (en) * | 2012-01-17 | 2016-08-16 | GM Global Technology Operations LLC | Method and system for using vehicle sound information to enhance audio prompting |
| US8571871B1 (en) * | 2012-10-02 | 2013-10-29 | Google Inc. | Methods and systems for adaptation of synthetic speech in an environment |
| US9280968B2 (en) | 2013-10-04 | 2016-03-08 | At&T Intellectual Property I, L.P. | System and method of using neural transforms of robust audio features for speech processing |
| US10720165B2 (en) * | 2017-01-23 | 2020-07-21 | Qualcomm Incorporated | Keyword voice authentication |
| US10283140B1 (en) | 2018-01-12 | 2019-05-07 | Alibaba Group Holding Limited | Enhancing audio signals using sub-band deep neural networks |
| US10997967B2 (en) * | 2019-04-18 | 2021-05-04 | Honeywell International Inc. | Methods and systems for cockpit speech recognition acoustic model training with multi-level corpus data augmentation |
| CN110047468B (zh) * | 2019-05-20 | 2022-01-25 | 北京达佳互联信息技术有限公司 | Speech recognition method, apparatus, and storage medium |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2776848B2 (ja) * | 1988-12-14 | 1998-07-16 | 株式会社日立製作所 | Noise removal method and training method for the neural network used therein |
| JP3084721B2 (ja) * | 1990-02-23 | 2000-09-04 | ソニー株式会社 | Noise removal circuit |
| JPH0566795A (ja) * | 1991-09-06 | 1993-03-19 | Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho | Noise suppression device and adjustment device therefor |
| US5381512A (en) * | 1992-06-24 | 1995-01-10 | Moscom Corporation | Method and apparatus for speech feature recognition based on models of auditory signal processing |
| US6070140A (en) * | 1995-06-05 | 2000-05-30 | Tran; Bao Q. | Speech recognizer |
| US5806025A (en) * | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
| US5963899A (en) * | 1996-08-07 | 1999-10-05 | U S West, Inc. | Method and system for region based filtering of speech |
| US6035048A (en) * | 1997-06-18 | 2000-03-07 | Lucent Technologies Inc. | Method and apparatus for reducing noise in speech and audio signals |
| FR2765715B1 (fr) * | 1997-07-04 | 1999-09-17 | Sextant Avionique | Method for searching for a noise model in noisy sound signals |
| US6230122B1 (en) * | 1998-09-09 | 2001-05-08 | Sony Corporation | Speech detection with noise suppression based on principal components analysis |
| US6173258B1 (en) * | 1998-09-09 | 2001-01-09 | Sony Corporation | Method for reducing noise distortions in a speech recognition system |
| US6347297B1 (en) * | 1998-10-05 | 2002-02-12 | Legerity, Inc. | Matrix quantization with vector quantization error compensation and neural network postprocessing for robust speech recognition |
2000
- 2000-05-04: EP application EP00870094A, published as EP1152399A1 (de); not active, Withdrawn
2001
- 2001-04-25: DE application DE60107072T, published as DE60107072T2 (de); not active, Expired - Lifetime
- 2001-04-25: CA application CA002404441A, published as CA2404441C (fr); not active, Expired - Fee Related
- 2001-04-25: WO application PCT/BE2001/000072, published as WO2001084537A1 (fr); not active, Ceased
- 2001-04-25: AT application AT01925226T, published as ATE282235T1 (de); not active, IP Right Cessation
- 2001-04-25: JP application JP2001581270A, published as JP2003532162A (ja); active, Pending
- 2001-04-25: AU application AU52051/01A, published as AU776919B2 (en); not active, Ceased
- 2001-04-25: US application US10/275,451, published as US7212965B2 (en); not active, Expired - Fee Related
- 2001-04-25: EP application EP01925226A, published as EP1279166B1 (de); not active, Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| DE60107072D1 (de) | 2004-12-16 |
| US7212965B2 (en) | 2007-05-01 |
| EP1152399A1 (de) | 2001-11-07 |
| CA2404441A1 (fr) | 2001-11-08 |
| AU5205101A (en) | 2001-11-12 |
| CA2404441C (fr) | 2009-07-14 |
| EP1279166A1 (de) | 2003-01-29 |
| JP2003532162A (ja) | 2003-10-28 |
| US20030182114A1 (en) | 2003-09-25 |
| EP1279166B1 (de) | 2004-11-10 |
| DE60107072T2 (de) | 2005-10-27 |
| AU776919B2 (en) | 2004-09-23 |
| WO2001084537A1 (fr) | 2001-11-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ATE282235T1 (de) | | Robust features for the recognition of noisy speech signals |
| ATE352836T1 (de) | | Detection of emotions in speech signals by analyzing a plurality of speech signal parameters |
| DK215690A (da) | | Voice activity detector |
| EP0788091A3 (de) | | Method and device for speech coding and decoding |
| IL154397A0 (en) | | Voice enhancement system |
| EP0785419A3 (de) | | Voice activity detection |
| WO1998034216A3 (en) | | System and method for detecting a recorded voice |
| EP1168306A3 (de) | | Method and device for improving the intelligibility of a digitally compressed speech signal |
| EP0764937A3 (de) | | Method for speech detection in high ambient noise |
| IL125649A0 (en) | | A method and device for recognizing a sampled sound signal in noise |
| WO1994022131A3 (en) | | Speech recognition with pause detection |
| EP1158664A3 (de) | | Method for analyzing an ECG signal |
| EP1861846A4 (de) | | Adaptive voice mode extension for a voice activity detector |
| WO2000031720A3 (en) | | Complex signal activity detection for improved speech/noise classification of an audio signal |
| WO2006007290B1 (en) | | Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system |
| CA2228948A1 (en) | | Pattern recognition |
| ATE421139T1 (de) | | Method for operating a speech recognition system |
| EP1093112A3 (de) | | Method for generating speech feature signals and device for carrying it out |
| FI955345A0 (fi) | | Coarse pitch estimation method and device for telephone conversation |
| EP1096475A3 (de) | | Frequency warping for speech recognition |
| DE59809897D1 (de) | | Voice activity detection |
| CA2076606A1 (en) | | Method for detecting voice presence on a communication line |
| DE69012446D1 (de) | | Detector for low-frequency alternating-current signals for a telephone trunk line |
| ATE282879T1 (de) | | Signal processing method for analyzing speech signal transients |
| EP1489597A8 (de) | | Device for speech detection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties | |