ES2144031T3 - Aparato de reconocimiento de voz. - Google Patents

Aparato de reconocimiento de voz.

Info

Publication number
ES2144031T3
ES2144031T3 ES94120541T ES94120541T ES2144031T3 ES 2144031 T3 ES2144031 T3 ES 2144031T3 ES 94120541 T ES94120541 T ES 94120541T ES 94120541 T ES94120541 T ES 94120541T ES 2144031 T3 ES2144031 T3 ES 2144031T3
Authority
ES
Spain
Prior art keywords
entry
noise
spectrum
reference model
speaking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
ES94120541T
Other languages
English (en)
Inventor
Takagi Keizaburo
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Application granted granted Critical
Publication of ES2144031T3 publication Critical patent/ES2144031T3/es
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

UN APARATO DE RECONOCIMIENTO DE LA PALABRA HABLADA DE ACUERDO A LA PRESENTE INVENCION INCLUYE UNA PORCION (5) DE CALCULO DE VECTOR MEDIO, UNA PORCION (6) DE COMPENSACION, Y UNA PORCION (8) DE EMPAREJAMIENTO. LA PORCION (5) DE CALCULO DE VECTOR MEDIO CALCULA UN VECTOR MEDIO PARA CADA UNA DE LAS REGIONES DE RUIDO Y LAS REGIONES DE HABLA DEL DISCURSO DE ENTRADA Y UN MODELO DE REFERENCIA RECIBIDO DESDE UNA PORCION (4) DE CONVERSION DE ESPECTRO QUE CORRESPONDE A INFORMACION EMPAREJADA RECIBIDA DESDE UNA PORCION (2) PRELIMINARMENTE EMPAREJADA. LA PORCION (6) DE COMPENSACION COMPENSA LOS VECTORES MEDIOS CALCULADOS POR LA PORCION (5) DE CALCULO DE VECTOR MEDIO PARA AL MENOS UNA DE LAS SECUENCIAS DEL ESPECTRO DEL DISCURSO DE ENTRADA Y LA SECUENCIA DE TIEMPO DEL ESPECTRO DEL MODELO DE REFERENCIA DE MODO QUE EL VECTOR MEDIO DE LA SECUENCIA DE TIEMPO DEL ESPECTRO DE LA REGION DE RUIDO DEL HABLA DE ENTRADA SE ADAPTA CON EL VECTOR MEDIO DE LA SECUENCIA DE TIEMPO DEL ESPECTRO DE LA REGION DE RUIDO DEL MODELO DE REFERENCIA Y QUE EL VECTOR MEDIO DE LA SECUENCIA DE TIEMPO DEL ESPECTRO DE LA REGION DE HABLA DE ENTRADA ENCAJA CON EL VECTOR MEDIO DE LA SECUENCIA DE TIEMPO DEL ESPECTRO DE LA REGION DE HABLA DEL MODELO DE REFERENCIA. LA PORCION (8) DE EMPAREJAMIENTO FINALMENTE EMPAREJA EL MODELO DE REFERENCIA CON EL DISCURSO DE ENTRADA Y PRODUCE UN RESULTADO DE RECONOCIMIENTO. PUESTO QUE RUIDO ADICIONAL Y CONDICIONES DE RUIDO DE L A DISTORSION DEL CANAL DEL DISCURSO DE ENTRADA A SER RECONOCIDA SON RAPIDAMENTE EMPAREJADOS CON LOS DE UN MODELO DE REFERENCIA, INCLUSO SI EL RUIDO ADICIONAL Y MICROFONO Y CANAL DE TRANSMISION A TRAVES DE LOS CUALES ES CAPTADO EL DISCURSO DE ENTRADA SON DESCONOCIDOS, CUANDO LA PALABRA HABLADA DE ENTRADA ES PREPARADA Y EL RUIDO AÑADIDO Y LAS CONDICIONES DE RUIDO VARIAN PARA CADA DISCURSO DE ENTRADA, EL APARATO DE RECONOCIMIENTO DE VOZ PUEDE RECONOCER PRECISAMENTE VOZ NO INFLUIDA POR RUIDO AMBIENTAL. POR ELLO, EL APARATO DE ACUERDO A LA PRESENTE INVENCION PUEDE RESOLVER LOSINCONVENIENTES QUE LOS APARATOS CONVENCIONALES HAN TENIDO.
ES94120541T 1993-12-27 1994-12-23 Aparato de reconocimiento de voz. Expired - Lifetime ES2144031T3 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5331478A JP2737624B2 (ja) 1993-12-27 1993-12-27 音声認識装置

Publications (1)

Publication Number Publication Date
ES2144031T3 true ES2144031T3 (es) 2000-06-01

Family

ID=18244101

Family Applications (1)

Application Number Title Priority Date Filing Date
ES94120541T Expired - Lifetime ES2144031T3 (es) 1993-12-27 1994-12-23 Aparato de reconocimiento de voz.

Country Status (5)

Country Link
US (1) US5655057A (es)
EP (1) EP0660300B1 (es)
JP (1) JP2737624B2 (es)
DE (1) DE69423588T2 (es)
ES (1) ES2144031T3 (es)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3254994B2 (ja) * 1995-03-01 2002-02-12 セイコーエプソン株式会社 音声認識対話装置および音声認識対話処理方法
JP2780676B2 (ja) * 1995-06-23 1998-07-30 日本電気株式会社 音声認識装置及び音声認識方法
JPH0981183A (ja) * 1995-09-14 1997-03-28 Pioneer Electron Corp 音声モデルの作成方法およびこれを用いた音声認識装置
TW347503B (en) * 1995-11-15 1998-12-11 Hitachi Ltd Character recognition translation system and voice recognition translation system
JP3452443B2 (ja) * 1996-03-25 2003-09-29 三菱電機株式会社 騒音下音声認識装置及び騒音下音声認識方法
JPH10257583A (ja) * 1997-03-06 1998-09-25 Asahi Chem Ind Co Ltd 音声処理装置およびその音声処理方法
GB9706174D0 (en) * 1997-03-25 1997-11-19 Secr Defence Recognition system
GB2336929A (en) * 1997-03-25 1999-11-03 The Secretary Of State For Defence Recognition system
DE29718636U1 (de) * 1997-10-21 1998-02-12 Rosenbaum, Lothar, 56727 Mayen Phonetische Steuer-, Eingabe- und Kommunikationseinrichtung mit akustischer Rückmeldung, insbesondere für Holzbearbeitungsmaschinen
GB2349259B (en) 1999-04-23 2003-11-12 Canon Kk Speech processing apparatus and method
DE10005609C1 (de) * 2000-02-09 2001-08-09 Siemens Ag Verfahren zur Spracherkennung
TW521266B (en) * 2000-07-13 2003-02-21 Verbaltek Inc Perceptual phonetic feature speech recognition system and method
EP1229516A1 (en) * 2001-01-26 2002-08-07 Telefonaktiebolaget L M Ericsson (Publ) Method, device, terminal and system for the automatic recognition of distorted speech data
US6957183B2 (en) * 2002-03-20 2005-10-18 Qualcomm Inc. Method for robust voice recognition by analyzing redundant features of source signal
DE10253868B3 (de) * 2002-11-15 2004-07-29 Digital Design Gmbh Verfahren und Anordnung zur Synchronisation von Test- und Referenzmustern sowie ein entsprechendes Computerprogramm-Erzeugnis und ein entsprechendes computerlesbares Speichermedium
US20050216266A1 (en) * 2004-03-29 2005-09-29 Yifan Gong Incremental adjustment of state-dependent bias parameters for adaptive speech recognition
US20060100866A1 (en) * 2004-10-28 2006-05-11 International Business Machines Corporation Influencing automatic speech recognition signal-to-noise levels
US8219391B2 (en) * 2005-02-15 2012-07-10 Raytheon Bbn Technologies Corp. Speech analyzing system with speech codebook
US8566086B2 (en) * 2005-06-28 2013-10-22 Qnx Software Systems Limited System for adaptive enhancement of speech signals
JP4765461B2 (ja) * 2005-07-27 2011-09-07 日本電気株式会社 雑音抑圧システムと方法及びプログラム
US7970613B2 (en) 2005-11-12 2011-06-28 Sony Computer Entertainment Inc. Method and system for Gaussian probability data bit reduction and computation
US8010358B2 (en) * 2006-02-21 2011-08-30 Sony Computer Entertainment Inc. Voice recognition with parallel gender and age normalization
US7778831B2 (en) * 2006-02-21 2010-08-17 Sony Computer Entertainment Inc. Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
US8615397B2 (en) * 2008-04-04 2013-12-24 Intuit Inc. Identifying audio content using distorted target patterns
US8788256B2 (en) * 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8442829B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US8442833B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US9153235B2 (en) 2012-04-09 2015-10-06 Sony Computer Entertainment Inc. Text dependent speaker recognition with long-term feature based on functional data analysis
US10595083B2 (en) 2018-04-20 2020-03-17 The Nielsen Company (Us), Llc Methods and apparatus to determine audio source impact on an audience of media
EP3950236A4 (en) * 2019-03-29 2022-07-06 Sony Group Corporation INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND PROGRAM

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5569880A (en) * 1978-11-22 1980-05-26 Nec Corp Pattern recognition unit
JPS5722295A (en) * 1980-07-15 1982-02-05 Nippon Electric Co Speaker recognizing system
JPS58130396A (ja) * 1982-01-29 1983-08-03 株式会社東芝 音声認識装置
US5359695A (en) * 1984-01-30 1994-10-25 Canon Kabushiki Kaisha Speech perception apparatus
US4852181A (en) * 1985-09-26 1989-07-25 Oki Electric Industry Co., Ltd. Speech recognition for recognizing the catagory of an input speech pattern
US5189727A (en) * 1989-07-28 1993-02-23 Electronic Warfare Associates, Inc. Method and apparatus for language and speaker recognition
JPH03120598A (ja) * 1989-10-03 1991-05-22 Canon Inc 音声認識方法及び装置
CA2042926C (en) * 1990-05-22 1997-02-25 Ryuhei Fujiwara Speech recognition method with noise reduction and a system therefor
US5276766A (en) * 1991-07-16 1994-01-04 International Business Machines Corporation Fast algorithm for deriving acoustic prototypes for automatic speech recognition
JPH05134694A (ja) * 1991-11-15 1993-05-28 Sony Corp 音声認識装置

Also Published As

Publication number Publication date
DE69423588D1 (de) 2000-04-27
EP0660300B1 (en) 2000-03-22
JP2737624B2 (ja) 1998-04-08
JPH07191689A (ja) 1995-07-28
EP0660300A1 (en) 1995-06-28
DE69423588T2 (de) 2000-11-16
US5655057A (en) 1997-08-05

Similar Documents

Publication Publication Date Title
ES2144031T3 (es) Aparato de reconocimiento de voz.
EP1195744A2 (en) Noise robust voice recognition
JP5288723B2 (ja) マルチチャネルの反響補償
EP0398574A3 (en) Speech recognition employing key word modeling and non-key word modeling
US20080281588A1 (en) Speech processing method and apparatus, storage medium, and speech system
ES2107635T3 (es) Procedimiento de reduccion de ruido acustico en una señal de voz.
US4179586A (en) System of encoded speech transmission and reception
MX9306601A (es) Aparato para la eliminacion o reduccion del ruido de fondo, para usarse con un microtelefono.
HK1047817A1 (en) Spectral magnitude quantization for a speech coder.
JPH0723009A (ja) 冗長性低減方法
JP2002217793A (ja) エコー抑圧装置
US5515445A (en) Long-time balancing of omni microphones
KR970068222A (ko) Ds-cdma 다중 이용자 직렬 간섭 캔설러 장치 및 그 간섭 반복 신호 전송 방법
JPS5672499A (en) Pretreatment for voice identifier
NO981334D0 (no) Fremgangsmåte og system for hurtig å generere og sende en tegnsekvens ved hjelp av talefrekvenser
JP3240908B2 (ja) 声質変換方法
US10848879B2 (en) Method for improving the spatial hearing perception of a binaural hearing aid
JPS63502146A (ja) 音声認識テンプレ−トから音声を合成する方法および装置
DK0489023T3 (da) I-øret høreapparat med lydkompensationskanal
ATE242873T1 (de) Mikrophonanordnung für die spracherkennung unter variablen räumlichen bedingungen
KR830003980A (ko) 이퀄라이저(equalizer)
JP2000252891A (ja) 信号処理装置
JP2961916B2 (ja) 音声認識装置
JPS6128128A (ja) 電子通訳装置
KR100322299B1 (ko) 스마트 안테나를 사용한 주파수 영역 nlms 적응 등화기

Legal Events

Date Code Title Description
FG2A Definitive protection

Ref document number: 660300

Country of ref document: ES