ATE282235T1 - Robuste merkmale für die erkennung von verrauschten sprachsignalen - Google Patents
Robuste merkmale für die erkennung von verrauschten sprachsignalenInfo
- Publication number
- ATE282235T1 ATE282235T1 AT01925226T AT01925226T ATE282235T1 AT E282235 T1 ATE282235 T1 AT E282235T1 AT 01925226 T AT01925226 T AT 01925226T AT 01925226 T AT01925226 T AT 01925226T AT E282235 T1 ATE282235 T1 AT E282235T1
- Authority
- AT
- Austria
- Prior art keywords
- vectors
- voice signals
- speech
- robust features
- detecting noise
- Prior art date
Links
- 239000013598 vector Substances 0.000 abstract 3
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Machine Translation (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Noise Elimination (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00870094A EP1152399A1 (de) | 2000-05-04 | 2000-05-04 | Teilband-Sprachverarbeitung mit neuronalen Netzwerken |
PCT/BE2001/000072 WO2001084537A1 (fr) | 2000-05-04 | 2001-04-25 | Parametres robustes pour la reconnaissance de parole bruitee |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE282235T1 true ATE282235T1 (de) | 2004-11-15 |
Family
ID=8175744
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT01925226T ATE282235T1 (de) | 2000-05-04 | 2001-04-25 | Robuste merkmale für die erkennung von verrauschten sprachsignalen |
Country Status (8)
Country | Link |
---|---|
US (1) | US7212965B2 (de) |
EP (2) | EP1152399A1 (de) |
JP (1) | JP2003532162A (de) |
AT (1) | ATE282235T1 (de) |
AU (1) | AU776919B2 (de) |
CA (1) | CA2404441C (de) |
DE (1) | DE60107072T2 (de) |
WO (1) | WO2001084537A1 (de) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1416472A1 (de) * | 2002-10-30 | 2004-05-06 | Swisscom AG | Bandbreitenabhängiges Spracherkennungssystem |
US7620546B2 (en) * | 2004-03-23 | 2009-11-17 | Qnx Software Systems (Wavemakers), Inc. | Isolating speech signals utilizing neural networks |
US20060206320A1 (en) * | 2005-03-14 | 2006-09-14 | Li Qi P | Apparatus and method for noise reduction and speech enhancement with microphones and loudspeakers |
US20070239444A1 (en) * | 2006-03-29 | 2007-10-11 | Motorola, Inc. | Voice signal perturbation for speech recognition |
US8386125B2 (en) * | 2006-11-22 | 2013-02-26 | General Motors Llc | Adaptive communication between a vehicle telematics unit and a call center based on acoustic conditions |
CN101996628A (zh) * | 2009-08-21 | 2011-03-30 | 索尼株式会社 | 提取语音信号的韵律特征的方法和装置 |
US8972256B2 (en) | 2011-10-17 | 2015-03-03 | Nuance Communications, Inc. | System and method for dynamic noise adaptation for robust automatic speech recognition |
US9418674B2 (en) * | 2012-01-17 | 2016-08-16 | GM Global Technology Operations LLC | Method and system for using vehicle sound information to enhance audio prompting |
US9934780B2 (en) | 2012-01-17 | 2018-04-03 | GM Global Technology Operations LLC | Method and system for using sound related vehicle information to enhance spoken dialogue by modifying dialogue's prompt pitch |
US9263040B2 (en) | 2012-01-17 | 2016-02-16 | GM Global Technology Operations LLC | Method and system for using sound related vehicle information to enhance speech recognition |
US8571871B1 (en) * | 2012-10-02 | 2013-10-29 | Google Inc. | Methods and systems for adaptation of synthetic speech in an environment |
US9280968B2 (en) | 2013-10-04 | 2016-03-08 | At&T Intellectual Property I, L.P. | System and method of using neural transforms of robust audio features for speech processing |
US10720165B2 (en) * | 2017-01-23 | 2020-07-21 | Qualcomm Incorporated | Keyword voice authentication |
US10283140B1 (en) * | 2018-01-12 | 2019-05-07 | Alibaba Group Holding Limited | Enhancing audio signals using sub-band deep neural networks |
US10997967B2 (en) * | 2019-04-18 | 2021-05-04 | Honeywell International Inc. | Methods and systems for cockpit speech recognition acoustic model training with multi-level corpus data augmentation |
CN110047468B (zh) * | 2019-05-20 | 2022-01-25 | 北京达佳互联信息技术有限公司 | 语音识别方法、装置及存储介质 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2776848B2 (ja) * | 1988-12-14 | 1998-07-16 | 株式会社日立製作所 | 雑音除去方法、それに用いるニューラルネットワークの学習方法 |
JP3084721B2 (ja) * | 1990-02-23 | 2000-09-04 | ソニー株式会社 | 雑音除去回路 |
JPH0566795A (ja) * | 1991-09-06 | 1993-03-19 | Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho | 雑音抑圧装置とその調整装置 |
US5381512A (en) * | 1992-06-24 | 1995-01-10 | Moscom Corporation | Method and apparatus for speech feature recognition based on models of auditory signal processing |
US6070140A (en) * | 1995-06-05 | 2000-05-30 | Tran; Bao Q. | Speech recognizer |
US5963899A (en) * | 1996-08-07 | 1999-10-05 | U S West, Inc. | Method and system for region based filtering of speech |
US5806025A (en) * | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
US6035048A (en) * | 1997-06-18 | 2000-03-07 | Lucent Technologies Inc. | Method and apparatus for reducing noise in speech and audio signals |
FR2765715B1 (fr) * | 1997-07-04 | 1999-09-17 | Sextant Avionique | Procede de recherche d'un modele de bruit dans des signaux sonores bruites |
US6230122B1 (en) * | 1998-09-09 | 2001-05-08 | Sony Corporation | Speech detection with noise suppression based on principal components analysis |
US6173258B1 (en) * | 1998-09-09 | 2001-01-09 | Sony Corporation | Method for reducing noise distortions in a speech recognition system |
US6347297B1 (en) * | 1998-10-05 | 2002-02-12 | Legerity, Inc. | Matrix quantization with vector quantization error compensation and neural network postprocessing for robust speech recognition |
-
2000
- 2000-05-04 EP EP00870094A patent/EP1152399A1/de not_active Withdrawn
-
2001
- 2001-04-25 CA CA002404441A patent/CA2404441C/fr not_active Expired - Fee Related
- 2001-04-25 JP JP2001581270A patent/JP2003532162A/ja active Pending
- 2001-04-25 AT AT01925226T patent/ATE282235T1/de not_active IP Right Cessation
- 2001-04-25 AU AU52051/01A patent/AU776919B2/en not_active Ceased
- 2001-04-25 DE DE60107072T patent/DE60107072T2/de not_active Expired - Lifetime
- 2001-04-25 US US10/275,451 patent/US7212965B2/en not_active Expired - Fee Related
- 2001-04-25 WO PCT/BE2001/000072 patent/WO2001084537A1/fr active IP Right Grant
- 2001-04-25 EP EP01925226A patent/EP1279166B1/de not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
EP1152399A1 (de) | 2001-11-07 |
CA2404441C (fr) | 2009-07-14 |
CA2404441A1 (fr) | 2001-11-08 |
EP1279166B1 (de) | 2004-11-10 |
WO2001084537A1 (fr) | 2001-11-08 |
DE60107072T2 (de) | 2005-10-27 |
US7212965B2 (en) | 2007-05-01 |
EP1279166A1 (de) | 2003-01-29 |
AU776919B2 (en) | 2004-09-23 |
US20030182114A1 (en) | 2003-09-25 |
DE60107072D1 (de) | 2004-12-16 |
AU5205101A (en) | 2001-11-12 |
JP2003532162A (ja) | 2003-10-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE282235T1 (de) | Robuste merkmale für die erkennung von verrauschten sprachsignalen | |
ATE352836T1 (de) | Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern | |
DE68910859D1 (de) | Detektion für die Anwesenheit eines Sprachsignals. | |
EP0788091A3 (de) | Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung | |
IL154397A0 (en) | Voice enhancement system | |
EP1067800A4 (de) | Verfahren zur signalverarbeitung und vorrichtung zur verarbeitung von bild/ton | |
MXPA03005619A (es) | Metodo y arreglo para determinar una senal de ruido de una fuente de ruido. | |
WO2002029780A3 (en) | Speech detection with source separation | |
EP0785419A3 (de) | Sprachaktivitätserkennung | |
WO1998034216A3 (en) | System and method for detecting a recorded voice | |
EP0764937A3 (de) | Verfahren zur Sprachdetektion bei starken Umgebungsgeräuschen | |
EP0674306A3 (de) | Korrektur von Signalverzerrungen für robuste Spracherkennung über Telefon. | |
EP1286328A3 (de) | Verfahren zur Verbesserung der nahen Sprachaktivitätsdetektion in einem System zur Sprecherlokalisierung mit Hilfe von Strahlbildung | |
EP0950239A4 (de) | Verfahren und gerät zum erkennen von geräuschsignalproben aus einem geräusch | |
WO1996008992A3 (en) | Apparatus and method for time dependent power spectrum analysis of physiological signals | |
CA2162407A1 (en) | A robust pitch estimation method and device for telephone speech | |
EP1093112A3 (de) | Verfahren zur Erzeugung von Sprachmerkmalsignalen und Vorrichtung zu seiner Durchführung | |
KR910020644A (ko) | 음성잡음분리장치 | |
EP1096475A3 (de) | Verziehung der Frequenzen für Spracherkennung | |
DE59809897D1 (de) | Sprachaktivitätserkennung | |
DE69012446D1 (de) | Detektor für Niederfrequenz-Wechselstromsignale für eine Telefonverbindungsleitung. | |
ATE282879T1 (de) | Signalverarbeitungsverfahren zur analyse von sprachsignal-transienten | |
JP2564821B2 (ja) | 音声判定検出装置 | |
KR920009957B1 (ko) | 과대음성 검출장치 | |
AU3452397A (en) | Speech synthesis system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |