DE69607913D1 - Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle - Google Patents

Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle

Info

Publication number
DE69607913D1
DE69607913D1 DE69607913T DE69607913T DE69607913D1 DE 69607913 D1 DE69607913 D1 DE 69607913D1 DE 69607913 T DE69607913 T DE 69607913T DE 69607913 T DE69607913 T DE 69607913T DE 69607913 D1 DE69607913 D1 DE 69607913D1
Authority
DE
Germany
Prior art keywords
basis
voice recognition
new word
word models
models
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE69607913T
Other languages
English (en)
Other versions
DE69607913T2 (de
Inventor
Reinhold Haeb-Umbach
Peter Beyerlein
Eric Thelen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Philips Intellectual Property and Standards GmbH
Koninklijke Philips NV
Original Assignee
Philips Corporate Intellectual Property GmbH
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Corporate Intellectual Property GmbH, Koninklijke Philips Electronics NV filed Critical Philips Corporate Intellectual Property GmbH
Application granted granted Critical
Publication of DE69607913D1 publication Critical patent/DE69607913D1/de
Publication of DE69607913T2 publication Critical patent/DE69607913T2/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Telephonic Communication Services (AREA)
DE69607913T 1995-05-03 1996-05-02 Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle Expired - Fee Related DE69607913T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP95201139 1995-05-03
PCT/IB1996/000396 WO1996035207A1 (en) 1995-05-03 1996-05-02 Speech recognition methods and apparatus on the basis of the modelling of new words

Publications (2)

Publication Number Publication Date
DE69607913D1 true DE69607913D1 (de) 2000-05-31
DE69607913T2 DE69607913T2 (de) 2000-10-05

Family

ID=8220249

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69607913T Expired - Fee Related DE69607913T2 (de) 1995-05-03 1996-05-02 Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle

Country Status (6)

Country Link
US (1) US5873061A (de)
EP (1) EP0769184B1 (de)
JP (1) JPH10503033A (de)
CN (1) CN1130688C (de)
DE (1) DE69607913T2 (de)
WO (1) WO1996035207A1 (de)

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100397435B1 (ko) * 1996-07-20 2003-12-24 엘지전자 주식회사 음성인식시스템에서새로운등록단어처리가가능한클래식를이용한언어학적모델처리방법
DE19751123C1 (de) * 1997-11-19 1999-06-17 Deutsche Telekom Ag Vorrichtung und Verfahren zur sprecherunabhängigen Sprachnamenwahl für Telekommunikations-Endeinrichtungen
US5927988A (en) * 1997-12-17 1999-07-27 Jenkins; William M. Method and apparatus for training of sensory and perceptual systems in LLI subjects
JP2002539528A (ja) * 1999-03-05 2002-11-19 キヤノン株式会社 データベース注釈付け及び検索
DE60026637T2 (de) * 1999-06-30 2006-10-05 International Business Machines Corp. Verfahren zur Erweiterung des Wortschatzes eines Spracherkennungssystems
CN1329861C (zh) * 1999-10-28 2007-08-01 佳能株式会社 模式匹配方法和装置
US7310600B1 (en) 1999-10-28 2007-12-18 Canon Kabushiki Kaisha Language recognition using a similarity measure
US6434547B1 (en) 1999-10-28 2002-08-13 Qenm.Com Data capture and verification system
US6882970B1 (en) 1999-10-28 2005-04-19 Canon Kabushiki Kaisha Language recognition using sequence frequency
DE19952049A1 (de) * 1999-10-28 2001-05-10 Siemens Ag Verfahren und Anordnung zur Verifikation eines Sprechers anhand eines Rechners
GB0011798D0 (en) * 2000-05-16 2000-07-05 Canon Kk Database annotation and retrieval
GB0015233D0 (en) 2000-06-21 2000-08-16 Canon Kk Indexing method and apparatus
US6961703B1 (en) * 2000-09-13 2005-11-01 Itt Manufacturing Enterprises, Inc. Method for speech processing involving whole-utterance modeling
GB0023930D0 (en) 2000-09-29 2000-11-15 Canon Kk Database annotation and retrieval
GB0027178D0 (en) * 2000-11-07 2000-12-27 Canon Kk Speech processing system
GB0028277D0 (en) 2000-11-20 2001-01-03 Canon Kk Speech processing system
US6973427B2 (en) * 2000-12-26 2005-12-06 Microsoft Corporation Method for adding phonetic descriptions to a speech recognition lexicon
GB0204474D0 (en) * 2002-02-26 2002-04-10 Canon Kk Speech recognition system
US20080208578A1 (en) * 2004-09-23 2008-08-28 Koninklijke Philips Electronics, N.V. Robust Speaker-Dependent Speech Recognition System
DE102005002474A1 (de) 2005-01-19 2006-07-27 Obstfelder, Sigrid Handy und Verfahren zur Spracheingabe in ein solches sowie Spracheingabebaustein und Verfahren zur Spracheingabe in einen solchen
WO2007097390A1 (ja) * 2006-02-23 2007-08-30 Nec Corporation 音声認識システム、音声認識結果出力方法、及び音声認識結果出力プログラム
DE102012202391A1 (de) * 2012-02-16 2013-08-22 Continental Automotive Gmbh Verfahren und Einrichtung zur Phonetisierung von textenthaltenden Datensätzen
US9570069B2 (en) * 2014-09-09 2017-02-14 Disney Enterprises, Inc. Sectioned memory networks for online word-spotting in continuous speech
KR102413067B1 (ko) * 2015-07-28 2022-06-24 삼성전자주식회사 문법 모델을 갱신하고, 문법 모델에 기초하여 음성 인식을 수행하는 방법 및 디바이스
CN106548787B (zh) * 2016-11-01 2019-07-09 云知声(上海)智能科技有限公司 优化生词的评测方法及评测系统
EP3698358A1 (de) 2017-10-18 2020-08-26 Soapbox Labs Ltd. Verfahren und systeme zur verarbeitung von audiosignalen, die sprachdaten enthalten

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5165007A (en) * 1985-02-01 1992-11-17 International Business Machines Corporation Feneme-based Markov models for words
US4819271A (en) * 1985-05-29 1989-04-04 International Business Machines Corporation Constructing Markov model word baseforms from multiple utterances by concatenating model sequences for word segments
JP2955297B2 (ja) * 1988-05-27 1999-10-04 株式会社東芝 音声認識システム
US5315689A (en) * 1988-05-27 1994-05-24 Kabushiki Kaisha Toshiba Speech recognition system having word-based and phoneme-based recognition means
DE3931638A1 (de) * 1989-09-22 1991-04-04 Standard Elektrik Lorenz Ag Verfahren zur sprecheradaptiven erkennung von sprache
US5129001A (en) * 1990-04-25 1992-07-07 International Business Machines Corporation Method and apparatus for modeling words with multi-arc markov models
US5454062A (en) * 1991-03-27 1995-09-26 Audio Navigation Systems, Inc. Method for recognizing spoken words
DE4130632A1 (de) * 1991-09-14 1993-03-18 Philips Patentverwaltung Verfahren zum erkennen der gesprochenen woerter in einem sprachsignal
US5390278A (en) * 1991-10-08 1995-02-14 Bell Canada Phoneme based speech recognition
EP0562138A1 (de) * 1992-03-25 1993-09-29 International Business Machines Corporation Methode und Einrichtung zur automatischen Erzeugung von Markov-Modellen von neuen Wörtern zur Aufnahme in einem Wortschatz zur Spracherkennung
US5502774A (en) * 1992-06-09 1996-03-26 International Business Machines Corporation Automatic recognition of a consistent message using multiple complimentary sources of information
JPH0772840B2 (ja) * 1992-09-29 1995-08-02 日本アイ・ビー・エム株式会社 音声モデルの構成方法、音声認識方法、音声認識装置及び音声モデルの訓練方法
US5528728A (en) * 1993-07-12 1996-06-18 Kabushiki Kaisha Meidensha Speaker independent speech recognition system and method using neural network and DTW matching technique
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
US5429513A (en) * 1994-02-10 1995-07-04 Diaz-Plaza; Ruth R. Interactive teaching apparatus and method for teaching graphemes, grapheme names, phonemes, and phonetics
US5638487A (en) * 1994-12-30 1997-06-10 Purespeech, Inc. Automatic speech recognition

Also Published As

Publication number Publication date
WO1996035207A1 (en) 1996-11-07
EP0769184A1 (de) 1997-04-23
EP0769184B1 (de) 2000-04-26
CN1153567A (zh) 1997-07-02
US5873061A (en) 1999-02-16
CN1130688C (zh) 2003-12-10
DE69607913T2 (de) 2000-10-05
JPH10503033A (ja) 1998-03-17

Similar Documents

Publication Publication Date Title
DE69607913D1 (de) Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle
DE69518705D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69324629D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69717899D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69828141D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69732769D1 (de) Einrichtung und verfahren zur verminderung der undurchschaubarkeit eines spracherkennungswortverzeichnisses und zur dynamischen selektion von akustischen modellen
DE69806557D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69625950D1 (de) Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem
DE69726235D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69632901D1 (de) Vorrichtung und Verfahren zur Sprachsynthese
DE59707384D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69433254D1 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69517705D1 (de) Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem
DE69923253D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69324428D1 (de) Verfahren zur Sprachformung und Gerät zur Spracherkennung
DE69519840D1 (de) Einrichtung und Verfahren zur Spracherkennung
DE69631728D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69420400D1 (de) Verfahren und gerät zur sprechererkennung
DE69817844D1 (de) Verfahren und vorrichtung zur spracherkennungscomputereingabe
DE69629763D1 (de) Verfahren und Vorrichtung zur Ermittlung von Triphone Hidden Markov Modellen (HMM)
DE69332459D1 (de) Verfahren und Vorrichtung zur Zeichenerkennung
DE69519820D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69830017D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69031284D1 (de) Verfahren und Einrichtung zur Spracherkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: PHILIPS INTELLECTUAL PROPERTY & STANDARDS GMBH, 20

Owner name: KONINKLIJKE PHILIPS ELECTRONICS N.V., EINDHOVEN, N

8339 Ceased/non-payment of the annual fee