ES2144031T3 - Speech recognition apparatus. - Google Patents
Speech recognition apparatus.
- Publication number
- ES2144031T3
- Authority
- ES
- Spain
- Prior art keywords
- entry
- noise
- spectrum
- reference model
- speaking
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
Abstract
A speech recognition apparatus according to the present invention includes a mean vector calculation portion (5), a compensation portion (6), and a matching portion (8). The mean vector calculation portion (5) calculates a mean vector for each of the noise regions and the speech regions of the input speech and of a reference model received from a spectrum conversion portion (4), in accordance with matching information received from a preliminary matching portion (2). The compensation portion (6) uses the mean vectors calculated by the mean vector calculation portion (5) to compensate at least one of the spectrum time sequence of the input speech and the spectrum time sequence of the reference model, so that the mean vector of the spectrum time sequence of the noise region of the input speech matches that of the noise region of the reference model, and the mean vector of the spectrum time sequence of the speech region of the input speech matches that of the speech region of the reference model. The matching portion (8) then performs the final matching between the reference model and the input speech and outputs a recognition result. Because the additive noise and channel distortion conditions of the input speech to be recognized are quickly matched to those of the reference model, the apparatus can recognize speech accurately without being affected by environmental noise, even if the additive noise and the microphone and transmission channel through which the input speech is captured are unknown at training time, and even if the additive noise and noise conditions vary for each input utterance.
The apparatus according to the present invention can therefore overcome the drawbacks of conventional apparatuses.
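The compensation step in the abstract can be illustrated with a small sketch. The abstract only requires that the noise-region and speech-region mean vectors of the two spectrum sequences end up equal; it does not give a formula. The per-channel affine mapping below is one simple way to satisfy both constraints and is an assumption made for illustration, as are the function names `mean_vector`, `region_means`, and `compensate_reference`. The speech/noise frame labels are assumed to come from the preliminary matching stage, and both regions are assumed to be non-empty.

```python
from statistics import fmean


def mean_vector(frames):
    """Per-channel mean of a list of spectral frames (each a list of floats)."""
    return [fmean(channel) for channel in zip(*frames)]


def region_means(frames, is_speech):
    """Return (noise_mean, speech_mean) given one boolean label per frame."""
    noise = [f for f, s in zip(frames, is_speech) if not s]
    speech = [f for f, s in zip(frames, is_speech) if s]
    return mean_vector(noise), mean_vector(speech)


def compensate_reference(ref_frames, ref_is_speech, in_frames, in_is_speech, eps=1e-12):
    """Map the reference spectra so that their region means match the input's.

    Hypothetical affine form (not taken from the abstract), per channel:
        w' = (s_v - n_v) / (s_w - n_w) * (w - n_w) + n_v
    where n/s are noise/speech means, v is the input and w the reference.
    """
    n_w, s_w = region_means(ref_frames, ref_is_speech)
    n_v, s_v = region_means(in_frames, in_is_speech)
    # Fall back to unit gain where the reference speech and noise means coincide.
    gain = [
        (sv - nv) / (sw - nw) if abs(sw - nw) > eps else 1.0
        for sv, nv, sw, nw in zip(s_v, n_v, s_w, n_w)
    ]
    return [
        [g * (w - nw) + nv for w, g, nw, nv in zip(frame, gain, n_w, n_v)]
        for frame in ref_frames
    ]
```

Compensating the reference rather than the input is an arbitrary choice here; the abstract allows either sequence, or both, to be adjusted. After the mapping, the noise-region mean of the compensated reference equals the input's noise mean and likewise for the speech region, which is the condition the final matching stage relies on.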
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP5331478A JP2737624B2 (ja) | 1993-12-27 | 1993-12-27 | Speech recognition apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2144031T3 (es) | 2000-06-01 |
Family
ID=18244101
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
ES94120541T Expired - Lifetime ES2144031T3 (es) | 1993-12-27 | 1994-12-23 | Speech recognition apparatus. |
Country Status (5)
Country | Link |
---|---|
US (1) | US5655057A (es) |
EP (1) | EP0660300B1 (es) |
JP (1) | JP2737624B2 (es) |
DE (1) | DE69423588T2 (es) |
ES (1) | ES2144031T3 (es) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
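JP3254994B2 (ja) * | 1995-03-01 | 2002-02-12 | セイコーエプソン株式会社 | Speech recognition dialogue device and speech recognition dialogue processing method |
JP2780676B2 (ja) * | 1995-06-23 | 1998-07-30 | 日本電気株式会社 | Speech recognition device and speech recognition method |
JPH0981183A (ja) * | 1995-09-14 | 1997-03-28 | Pioneer Electron Corp | Method for creating a speech model and speech recognition device using the same |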
TW347503B (en) * | 1995-11-15 | 1998-12-11 | Hitachi Ltd | Character recognition translation system and voice recognition translation system |
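JP3452443B2 (ja) * | 1996-03-25 | 2003-09-29 | 三菱電機株式会社 | Speech recognition device for noisy conditions and speech recognition method for noisy conditions |
JPH10257583A (ja) * | 1997-03-06 | 1998-09-25 | Asahi Chem Ind Co Ltd | Speech processing device and speech processing method thereof |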
GB9706174D0 (en) * | 1997-03-25 | 1997-11-19 | Secr Defence | Recognition system |
GB2336929A (en) * | 1997-03-25 | 1999-11-03 | The Secretary Of State For Defence | Recognition system |
DE29718636U1 (de) * | 1997-10-21 | 1998-02-12 | Rosenbaum, Lothar, 56727 Mayen | Phonetic control, input and communication device with acoustic feedback, in particular for woodworking machines |
GB2349259B (en) | 1999-04-23 | 2003-11-12 | Canon Kk | Speech processing apparatus and method |
DE10005609C1 (de) * | 2000-02-09 | 2001-08-09 | Siemens Ag | Method for speech recognition |
TW521266B (en) * | 2000-07-13 | 2003-02-21 | Verbaltek Inc | Perceptual phonetic feature speech recognition system and method |
EP1229516A1 (en) * | 2001-01-26 | 2002-08-07 | Telefonaktiebolaget L M Ericsson (Publ) | Method, device, terminal and system for the automatic recognition of distorted speech data |
US6957183B2 (en) * | 2002-03-20 | 2005-10-18 | Qualcomm Inc. | Method for robust voice recognition by analyzing redundant features of source signal |
DE10253868B3 (de) * | 2002-11-15 | 2004-07-29 | Digital Design Gmbh | Method and arrangement for synchronizing test and reference patterns, and a corresponding computer program product and a corresponding computer-readable storage medium |
US20050216266A1 (en) * | 2004-03-29 | 2005-09-29 | Yifan Gong | Incremental adjustment of state-dependent bias parameters for adaptive speech recognition |
US20060100866A1 (en) * | 2004-10-28 | 2006-05-11 | International Business Machines Corporation | Influencing automatic speech recognition signal-to-noise levels |
US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
US8566086B2 (en) * | 2005-06-28 | 2013-10-22 | Qnx Software Systems Limited | System for adaptive enhancement of speech signals |
JP4765461B2 (ja) * | 2005-07-27 | 2011-09-07 | 日本電気株式会社 | Noise suppression system, method, and program |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US7778831B2 (en) * | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8615397B2 (en) * | 2008-04-04 | 2013-12-24 | Intuit Inc. | Identifying audio content using distorted target patterns |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
US10595083B2 (en) | 2018-04-20 | 2020-03-17 | The Nielsen Company (Us), Llc | Methods and apparatus to determine audio source impact on an audience of media |
EP3950236A4 (en) * | 2019-03-29 | 2022-07-06 | Sony Group Corporation | INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND PROGRAM |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5569880A (en) * | 1978-11-22 | 1980-05-26 | Nec Corp | Pattern recognition unit |
JPS5722295A (en) * | 1980-07-15 | 1982-02-05 | Nippon Electric Co | Speaker recognizing system |
JPS58130396A (ja) * | 1982-01-29 | 1983-08-03 | 株式会社東芝 | Speech recognition apparatus |
US5359695A (en) * | 1984-01-30 | 1994-10-25 | Canon Kabushiki Kaisha | Speech perception apparatus |
US4852181A (en) * | 1985-09-26 | 1989-07-25 | Oki Electric Industry Co., Ltd. | Speech recognition for recognizing the catagory of an input speech pattern |
US5189727A (en) * | 1989-07-28 | 1993-02-23 | Electronic Warfare Associates, Inc. | Method and apparatus for language and speaker recognition |
JPH03120598A (ja) * | 1989-10-03 | 1991-05-22 | Canon Inc | Speech recognition method and apparatus |
CA2042926C (en) * | 1990-05-22 | 1997-02-25 | Ryuhei Fujiwara | Speech recognition method with noise reduction and a system therefor |
US5276766A (en) * | 1991-07-16 | 1994-01-04 | International Business Machines Corporation | Fast algorithm for deriving acoustic prototypes for automatic speech recognition |
JPH05134694A (ja) * | 1991-11-15 | 1993-05-28 | Sony Corp | Speech recognition apparatus |
-
1993
- 1993-12-27 JP JP5331478A patent/JP2737624B2/ja not_active Expired - Lifetime
-
1994
- 1994-12-22 US US08/361,567 patent/US5655057A/en not_active Expired - Lifetime
- 1994-12-23 EP EP94120541A patent/EP0660300B1/en not_active Expired - Lifetime
- 1994-12-23 DE DE69423588T patent/DE69423588T2/de not_active Expired - Lifetime
- 1994-12-23 ES ES94120541T patent/ES2144031T3/es not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
DE69423588D1 (de) | 2000-04-27 |
EP0660300B1 (en) | 2000-03-22 |
JP2737624B2 (ja) | 1998-04-08 |
JPH07191689A (ja) | 1995-07-28 |
EP0660300A1 (en) | 1995-06-28 |
DE69423588T2 (de) | 2000-11-16 |
US5655057A (en) | 1997-08-05 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
ES2144031T3 (es) | Speech recognition apparatus. | |
EP1195744A2 (en) | Noise robust voice recognition | |
JP5288723B2 (ja) | Multi-channel echo compensation | |
EP0398574A3 (en) | Speech recognition employing key word modeling and non-key word modeling | |
US20080281588A1 (en) | Speech processing method and apparatus, storage medium, and speech system | |
ES2107635T3 (es) | Method for reducing acoustic noise in a speech signal. | |
US4179586A (en) | System of encoded speech transmission and reception | |
MX9306601A (es) | Apparatus for eliminating or reducing background noise, for use with a handset. | |
HK1047817A1 (en) | Spectral magnitude quantization for a speech coder. | |
JPH0723009A (ja) | Redundancy reduction method | |
JP2002217793A (ja) | Echo suppressor | |
US5515445A (en) | Long-time balancing of omni microphones | |
KR970068222A (ko) | DS-CDMA multi-user serial interference canceller apparatus and method for transmitting its interference replica signal | |
JPS5672499A (en) | Pretreatment for voice identifier | |
NO981334D0 (no) | Method and system for rapidly generating and transmitting a character sequence using voice frequencies | |
JP3240908B2 (ja) | Voice quality conversion method | |
US10848879B2 (en) | Method for improving the spatial hearing perception of a binaural hearing aid | |
JPS63502146A (ja) | Method and apparatus for synthesizing speech from speech recognition templates | |
DK0489023T3 (da) | In-the-ear hearing aid with sound compensation channel | |
ATE242873T1 (de) | Microphone arrangement for speech recognition under variable spatial conditions | |
KR830003980A (ko) | Equalizer | |
JP2000252891A (ja) | Signal processing device | |
JP2961916B2 (ja) | Speech recognition apparatus | |
JPS6128128A (ja) | Electronic interpreting device | |
KR100322299B1 (ko) | Frequency-domain NLMS adaptive equalizer using a smart antenna |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | FG2A | Definitive protection | Ref document number: 660300; Country of ref document: ES |