DE69519453D1 - Spracherkennung mit Sprecheradaptierung mittels Berechnung von Mittelwerten akustischer Kategorien - Google Patents

Spracherkennung mit Sprecheradaptierung mittels Berechnung von Mittelwerten akustischer Kategorien

Info

Publication number
DE69519453D1
DE69519453D1 DE69519453T DE69519453T DE69519453D1 DE 69519453 D1 DE69519453 D1 DE 69519453D1 DE 69519453 T DE69519453 T DE 69519453T DE 69519453 T DE69519453 T DE 69519453T DE 69519453 D1 DE69519453 D1 DE 69519453D1
Authority
DE
Germany
Prior art keywords
speech recognition
mean values
speaker adaptation
calculating mean
acoustic categories
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69519453T
Other languages
English (en)
Other versions
DE69519453T2 (de
Inventor
Keizaburo Takagi
Hiroaki Hattori
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of DE69519453D1 publication Critical patent/DE69519453D1/de
Application granted granted Critical
Publication of DE69519453T2 publication Critical patent/DE69519453T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Complex Calculations (AREA)
DE69519453T 1994-06-07 1995-06-06 Spracherkennung mit Sprecheradaptierung mittels Berechnung von Mittelwerten akustischer Kategorien Expired - Fee Related DE69519453T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP6125528A JP2692581B2 (ja) 1994-06-07 1994-06-07 音響カテゴリ平均値計算装置及び適応化装置

Publications (2)

Publication Number Publication Date
DE69519453D1 true DE69519453D1 (de) 2000-12-28
DE69519453T2 DE69519453T2 (de) 2001-03-29

Family

ID=14912415

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69519453T Expired - Fee Related DE69519453T2 (de) 1994-06-07 1995-06-06 Spracherkennung mit Sprecheradaptierung mittels Berechnung von Mittelwerten akustischer Kategorien

Country Status (4)

Country Link
US (1) US5651094A (de)
EP (1) EP0686965B1 (de)
JP (1) JP2692581B2 (de)
DE (1) DE69519453T2 (de)

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2738403B2 (ja) * 1995-05-12 1998-04-08 日本電気株式会社 音声認識装置
GB9602691D0 (en) * 1996-02-09 1996-04-10 Canon Kk Word model generation
JP4339931B2 (ja) * 1996-09-27 2009-10-07 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 発話を認識する方法及びシステム
JP3061114B2 (ja) * 1996-11-25 2000-07-10 日本電気株式会社 音声認識装置
US6654955B1 (en) * 1996-12-19 2003-11-25 International Business Machines Corporation Adding speech recognition libraries to an existing program at runtime
AU744678B2 (en) * 1997-10-15 2002-02-28 British Telecommunications Public Limited Company Pattern recognition using multiple reference models
US6343267B1 (en) 1998-04-30 2002-01-29 Matsushita Electric Industrial Co., Ltd. Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques
US6263309B1 (en) 1998-04-30 2001-07-17 Matsushita Electric Industrial Co., Ltd. Maximum likelihood method for finding an adapted speaker model in eigenvoice space
JP2000259198A (ja) * 1999-03-04 2000-09-22 Sony Corp パターン認識装置および方法、並びに提供媒体
US6571208B1 (en) 1999-11-29 2003-05-27 Matsushita Electric Industrial Co., Ltd. Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
US6526379B1 (en) 1999-11-29 2003-02-25 Matsushita Electric Industrial Co., Ltd. Discriminative clustering methods for automatic speech recognition
AU5205700A (en) * 2000-06-15 2002-01-08 Intel Corporation Speaker adaptation using weighted feedback
US6917918B2 (en) * 2000-12-22 2005-07-12 Microsoft Corporation Method and system for frame alignment and unsupervised adaptation of acoustic models
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
US7509257B2 (en) * 2002-12-24 2009-03-24 Marvell International Ltd. Method and apparatus for adapting reference templates
US7756709B2 (en) * 2004-02-02 2010-07-13 Applied Voice & Speech Technologies, Inc. Detection of voice inactivity within a sound stream
US8229751B2 (en) * 2004-02-26 2012-07-24 Mediaguide, Inc. Method and apparatus for automatic detection and identification of unidentified Broadcast audio or video signals
KR20060135794A (ko) * 2004-02-26 2006-12-29 미디어 가이드, 인코포레이티드 방송 오디오 또는 비디오 프로그래밍 신호의 자동 검출 및식별 방법, 및 장치
GB2418764B (en) * 2004-09-30 2008-04-09 Fluency Voice Technology Ltd Improving pattern recognition accuracy with distortions
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7865362B2 (en) * 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
US7895039B2 (en) * 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US7827032B2 (en) * 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7949533B2 (en) * 2005-02-04 2011-05-24 Vococollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US20090006337A1 (en) * 2005-12-30 2009-01-01 Mediaguide, Inc. Method and apparatus for automatic detection and identification of unidentified video signals
CN101390156B (zh) * 2006-02-27 2011-12-07 日本电气株式会社 标准模式适应装置、标准模式适应方法
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU7529981A (en) * 1980-09-19 1982-03-25 Hitachi Limited Language analysis by pattern recognition
JPS5885499A (ja) * 1981-11-18 1983-05-21 株式会社デンソー 連続音声認識装置
US4720802A (en) * 1983-07-26 1988-01-19 Lear Siegler Noise compensation arrangement
JPH0792673B2 (ja) * 1984-10-02 1995-10-09 株式会社東芝 認識用辞書学習方法
JPS61145599A (ja) * 1984-12-19 1986-07-03 日本電気株式会社 連続音声認識装置
JPH0638199B2 (ja) * 1985-09-02 1994-05-18 日本電気株式会社 音声認識装置
US5315689A (en) * 1988-05-27 1994-05-24 Kabushiki Kaisha Toshiba Speech recognition system having word-based and phoneme-based recognition means
US5159637A (en) * 1988-07-27 1992-10-27 Fujitsu Limited Speech word recognizing apparatus using information indicative of the relative significance of speech features
JP2852298B2 (ja) * 1990-07-31 1999-01-27 日本電気株式会社 標準パターン適応化方式

Also Published As

Publication number Publication date
JP2692581B2 (ja) 1997-12-17
EP0686965B1 (de) 2000-11-22
EP0686965A3 (de) 1997-10-29
JPH07334184A (ja) 1995-12-22
EP0686965A2 (de) 1995-12-13
DE69519453T2 (de) 2001-03-29
US5651094A (en) 1997-07-22

Similar Documents

Publication Publication Date Title
DE69519453T2 (de) Spracherkennung mit Sprecheradaptierung mittels Berechnung von Mittelwerten akustischer Kategorien
DE60233763D1 (de) Spracherkennungsystem mittels impliziter Sprecheradaptation
DE69632517D1 (de) Erkennung kontinuierlicher Sprache
DE69914839D1 (de) Sprecherverifikation und -erkennung mittels Eigenstimmen
DE69635325D1 (de) Verbesserungen zur Spracherkennung
DE69615667T2 (de) Spracherkennung
FI971822A0 (fi) Puheentunnistus
DE69807765D1 (de) Kombination von Frequenzverzerrung und spektraler Formung in einem HMM - basierten Spracherkenner
DE69827586D1 (de) Technik zur Adaptation von Hidden Markov Modellen für die Spracherkennung
DE68910139T2 (de) Mikrophon mit akustischer Frequenzanhebung.
DE69621393T2 (de) Quantisierung von Sprachsignalen in prädiktiven Kodiersystemen unter Verwendung von Modellen menschlichen Hörens
DE69819951D1 (de) Spracherkenner mit Rauschadaptierung
DE68912397T2 (de) Spracherkennung mit Sprecheranpassung durch Lernprozess.
DK0749109T3 (da) Talegenkendelse for tonesprog
DE69413912D1 (de) Sprachumsetzungsverfahren
AU2727697A (en) Method and recognizer for recognizing tonal acoustic sound signals
ES8708266A1 (es) Procedimiento y aparato para reconocer a una persona
DE60017880D1 (de) Adaptive postfiltertechnik auf basis eines yule-walkerfilters
DE69635485D1 (de) Tonwiedergabegerät oder mikrofon
RU2002129029A (ru) Способ дикторонезависимого распознавания звуков речи
FI935378A (fi) Menetelmä akustisen puhesignaalin äänen korkeuden arvioimiseksi sekä menetelmää hyödyntävä puheen tunnistusjärjestelmä
SE9502654D0 (sv) Anordning för taligenkänning med två eller fler mikrofoner
JPS62179800U (de)
KR960029930U (ko) 음향기기의 마이크 수납장치
KR970046157U (ko) 자동차의 음성인식 오디오 선국 제어장치

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee