DE3685435D1 - Verfahren und einrichtung zur spracherkennung mit wirksamer speicherung und schnellem zusammenfuegen von phonologischen darstellungen. - Google Patents

Verfahren und einrichtung zur spracherkennung mit wirksamer speicherung und schnellem zusammenfuegen von phonologischen darstellungen.

Info

Publication number
DE3685435D1
DE3685435D1 DE8686104219T DE3685435T DE3685435D1 DE 3685435 D1 DE3685435 D1 DE 3685435D1 DE 8686104219 T DE8686104219 T DE 8686104219T DE 3685435 T DE3685435 T DE 3685435T DE 3685435 D1 DE3685435 D1 DE 3685435D1
Authority
DE
Germany
Prior art keywords
subgraph
confluent
boundary
word
phonological
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
DE8686104219T
Other languages
English (en)
Inventor
Lalit Rai Bahl
Paul Sheldon Cohen
Robert Leroy Mercer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Application granted granted Critical
Publication of DE3685435D1 publication Critical patent/DE3685435D1/de
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Probability & Statistics with Applications (AREA)
  • Artificial Intelligence (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Character Discrimination (AREA)
DE8686104219T 1985-05-09 1986-03-27 Verfahren und einrichtung zur spracherkennung mit wirksamer speicherung und schnellem zusammenfuegen von phonologischen darstellungen. Expired - Fee Related DE3685435D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US06/732,472 US4980918A (en) 1985-05-09 1985-05-09 Speech recognition system with efficient storage and rapid assembly of phonological graphs
EP86104219A EP0238692B1 (de) 1985-05-09 1986-03-27 Verfahren und Einrichtung zur Spracherkennung mit wirksamer Speicherung und schnellem Zusammenfügen von phonologischen Darstellungen

Publications (1)

Publication Number Publication Date
DE3685435D1 true DE3685435D1 (de) 1992-06-25

Family

ID=24943640

Family Applications (1)

Application Number Title Priority Date Filing Date
DE8686104219T Expired - Fee Related DE3685435D1 (de) 1985-05-09 1986-03-27 Verfahren und einrichtung zur spracherkennung mit wirksamer speicherung und schnellem zusammenfuegen von phonologischen darstellungen.

Country Status (4)

Country Link
US (1) US4980918A (de)
EP (1) EP0238692B1 (de)
CA (1) CA1242028A (de)
DE (1) DE3685435D1 (de)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5268990A (en) * 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5388183A (en) * 1991-09-30 1995-02-07 Kurzwell Applied Intelligence, Inc. Speech recognition providing multiple outputs
US5280562A (en) * 1991-10-03 1994-01-18 International Business Machines Corporation Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer
US5222146A (en) * 1991-10-23 1993-06-22 International Business Machines Corporation Speech recognition apparatus having a speech coder outputting acoustic prototype ranks
US5278942A (en) * 1991-12-05 1994-01-11 International Business Machines Corporation Speech coding apparatus having speaker dependent prototypes generated from nonuser reference data
US5267345A (en) * 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
CA2088080C (en) * 1992-04-02 1997-10-07 Enrico Luigi Bocchieri Automatic speech recognizer
US5233681A (en) * 1992-04-24 1993-08-03 International Business Machines Corporation Context-dependent speech recognizer using estimated next word context
US5293584A (en) * 1992-05-21 1994-03-08 International Business Machines Corporation Speech recognition system for natural language translation
US5333236A (en) * 1992-09-10 1994-07-26 International Business Machines Corporation Speech recognizer having a speech coder for an acoustic match based on context-dependent speech-transition acoustic models
US5497447A (en) * 1993-03-08 1996-03-05 International Business Machines Corporation Speech coding apparatus having acoustic prototype vectors generated by tying to elementary models and clustering around reference vectors
US6230128B1 (en) 1993-03-31 2001-05-08 British Telecommunications Public Limited Company Path link passing speech recognition with vocabulary node being capable of simultaneously processing plural path links
US5819222A (en) * 1993-03-31 1998-10-06 British Telecommunications Public Limited Company Task-constrained connected speech recognition of propagation of tokens only if valid propagation path is present
CN1196104C (zh) * 1993-03-31 2005-04-06 英国电讯有限公司 语音处理
CA2115210C (en) * 1993-04-21 1997-09-23 Joseph C. Andreshak Interactive computer system recognizing spoken commands
US5465317A (en) * 1993-05-18 1995-11-07 International Business Machines Corporation Speech recognition system with improved rejection of words and sounds not in the system vocabulary
US5544277A (en) * 1993-07-28 1996-08-06 International Business Machines Corporation Speech coding apparatus and method for generating acoustic feature vector component values by combining values of the same features for multiple time intervals
US5522011A (en) * 1993-09-27 1996-05-28 International Business Machines Corporation Speech coding apparatus and method using classification rules
GB2290684A (en) * 1994-06-22 1996-01-03 Ibm Speech synthesis using hidden Markov model to determine speech unit durations
JP3741156B2 (ja) * 1995-04-07 2006-02-01 ソニー株式会社 音声認識装置および音声認識方法並びに音声翻訳装置
CA2220004A1 (en) * 1995-05-26 1996-11-28 John N. Nguyen Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system
US5677991A (en) * 1995-06-30 1997-10-14 Kurzweil Applied Intelligence, Inc. Speech recognition system using arbitration between continuous speech and isolated word modules
US5794196A (en) * 1995-06-30 1998-08-11 Kurzweil Applied Intelligence, Inc. Speech recognition system distinguishing dictation from commands by arbitration between continuous speech and isolated word modules
US5799279A (en) * 1995-11-13 1998-08-25 Dragon Systems, Inc. Continuous speech recognition of text and commands
US5875426A (en) * 1996-06-12 1999-02-23 International Business Machines Corporation Recognizing speech having word liaisons by adding a phoneme to reference word models
US5864793A (en) * 1996-08-06 1999-01-26 Cirrus Logic, Inc. Persistence and dynamic threshold based intermittent signal detector
DE19639844A1 (de) * 1996-09-27 1998-04-02 Philips Patentverwaltung Verfahren zum Ableiten wenigstens einer Folge von Wörtern aus einem Sprachsignal
US6151575A (en) * 1996-10-28 2000-11-21 Dragon Systems, Inc. Rapid adaptation of speech models
JP3061114B2 (ja) * 1996-11-25 2000-07-10 日本電気株式会社 音声認識装置
US6224636B1 (en) 1997-02-28 2001-05-01 Dragon Systems, Inc. Speech recognition using nonparametric speech models
US6097776A (en) * 1998-02-12 2000-08-01 Cirrus Logic, Inc. Maximum likelihood estimation of symbol offset
EP1133766B1 (de) * 1998-11-25 2004-01-21 Entropic Limited Netzwerk- und sprachmodelle zur verwendung in einem spracherkennungssystem
CN1343337B (zh) * 1999-03-05 2013-03-20 佳能株式会社 用于产生包括音素数据和解码的字的注释数据的方法和设备
WO2001031627A2 (en) * 1999-10-28 2001-05-03 Canon Kabushiki Kaisha Pattern matching method and apparatus
US7310600B1 (en) 1999-10-28 2007-12-18 Canon Kabushiki Kaisha Language recognition using a similarity measure
US6882970B1 (en) 1999-10-28 2005-04-19 Canon Kabushiki Kaisha Language recognition using sequence frequency
GB0011798D0 (en) * 2000-05-16 2000-07-05 Canon Kk Database annotation and retrieval
GB0015233D0 (en) 2000-06-21 2000-08-16 Canon Kk Indexing method and apparatus
US7216077B1 (en) * 2000-09-26 2007-05-08 International Business Machines Corporation Lattice-based unsupervised maximum likelihood linear regression for speaker adaptation
GB0023930D0 (en) 2000-09-29 2000-11-15 Canon Kk Database annotation and retrieval
JP2002149187A (ja) * 2000-11-07 2002-05-24 Sony Corp 音声認識装置および音声認識方法、並びに記録媒体
GB0027178D0 (en) 2000-11-07 2000-12-27 Canon Kk Speech processing system
GB0028277D0 (en) 2000-11-20 2001-01-03 Canon Kk Speech processing system
US7027987B1 (en) 2001-02-07 2006-04-11 Google Inc. Voice interface for a search engine
US20030009331A1 (en) * 2001-07-05 2003-01-09 Johan Schalkwyk Grammars for speech recognition
US20030009335A1 (en) * 2001-07-05 2003-01-09 Johan Schalkwyk Speech recognition with dynamic grammars
KR100406307B1 (ko) * 2001-08-09 2003-11-19 삼성전자주식회사 음성등록방법 및 음성등록시스템과 이에 기초한음성인식방법 및 음성인식시스템
US20030110040A1 (en) * 2001-12-07 2003-06-12 Creative Logic Solutions Inc. System and method for dynamically changing software programs by voice commands
GB2391679B (en) * 2002-02-04 2004-03-24 Zentian Ltd Speech recognition circuit using parallel processors
US8959019B2 (en) 2002-10-31 2015-02-17 Promptu Systems Corporation Efficient empirical determination, computation, and use of acoustic confusability measures
US7149688B2 (en) * 2002-11-04 2006-12-12 Speechworks International, Inc. Multi-lingual speech recognition with cross-language context modeling
US8535236B2 (en) * 2004-03-19 2013-09-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for analyzing a sound signal using a physiological ear model
US20060031071A1 (en) * 2004-08-03 2006-02-09 Sony Corporation System and method for automatically implementing a finite state automaton for speech recognition
DE102005030326B4 (de) * 2005-06-29 2016-02-25 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und Computerprogramm zur Analyse eines Audiosignals
US7996212B2 (en) * 2005-06-29 2011-08-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Device, method and computer program for analyzing an audio signal
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
CN102176310B (zh) 2005-12-08 2013-08-21 纽昂斯奥地利通讯有限公司 具有巨大词汇量的语音识别系统
DE102006006296B3 (de) * 2006-02-10 2007-10-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Verfahren, Vorrichtung und Computerprogramm zum Erzeugen eines Ansteuersignals für ein Cochlea-Implantat basierend auf einem Audiosignal
US20110224982A1 (en) * 2010-03-12 2011-09-15 c/o Microsoft Corporation Automatic speech recognition based upon information retrieval methods
US8375061B2 (en) 2010-06-08 2013-02-12 International Business Machines Corporation Graphical models for representing text documents for computer analysis
GB201322377D0 (en) * 2013-12-18 2014-02-05 Isis Innovation Method and apparatus for automatic speech recognition
US9607613B2 (en) 2014-04-23 2017-03-28 Google Inc. Speech endpointing based on word comparisons
US9721564B2 (en) 2014-07-31 2017-08-01 Rovi Guides, Inc. Systems and methods for performing ASR in the presence of heterographs
CN104267922B (zh) * 2014-09-16 2019-05-31 联想(北京)有限公司 一种信息处理方法及电子设备
US9830321B2 (en) 2014-09-30 2017-11-28 Rovi Guides, Inc. Systems and methods for searching for a media asset
KR20160098910A (ko) * 2015-02-11 2016-08-19 한국전자통신연구원 음성 인식 데이터 베이스 확장 방법 및 장치
EP3577645B1 (de) 2017-06-06 2022-08-03 Google LLC Erkennung des endes einer abfrage
US10929754B2 (en) 2017-06-06 2021-02-23 Google Llc Unified endpointer using multitask and multidomain learning

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3679830A (en) * 1970-05-11 1972-07-25 Malcolm R Uffelman Cohesive zone boundary detector
US4032710A (en) * 1975-03-10 1977-06-28 Threshold Technology, Inc. Word boundary detector for speech recognition equipment
US4059725A (en) * 1975-03-12 1977-11-22 Nippon Electric Company, Ltd. Automatic continuous speech recognition system employing dynamic programming
US4092493A (en) * 1976-11-30 1978-05-30 Bell Telephone Laboratories, Incorporated Speech recognition system
US4107460A (en) * 1976-12-06 1978-08-15 Threshold Technology, Inc. Apparatus for recognizing words from among continuous speech
US4181821A (en) * 1978-10-31 1980-01-01 Bell Telephone Laboratories, Incorporated Multiple template speech recognition system
US4400788A (en) * 1981-03-27 1983-08-23 Bell Telephone Laboratories, Incorporated Continuous speech pattern recognizer
JPS57178295A (en) * 1981-04-27 1982-11-02 Nippon Electric Co Continuous word recognition apparatus
US4481593A (en) * 1981-10-05 1984-11-06 Exxon Corporation Continuous speech recognition
US4587670A (en) * 1982-10-15 1986-05-06 At&T Bell Laboratories Hidden Markov model speech recognition arrangement
US4723290A (en) * 1983-05-16 1988-02-02 Kabushiki Kaisha Toshiba Speech recognition apparatus
US4741036A (en) * 1985-01-31 1988-04-26 International Business Machines Corporation Determination of phone weights for markov models in a speech recognition system

Also Published As

Publication number Publication date
EP0238692B1 (de) 1992-05-20
US4980918A (en) 1990-12-25
EP0238692A1 (de) 1987-09-30
CA1242028A (en) 1988-09-13

Similar Documents

Publication Publication Date Title
DE3685435D1 (de) Verfahren und einrichtung zur spracherkennung mit wirksamer speicherung und schnellem zusammenfuegen von phonologischen darstellungen.
ATE374421T1 (de) Segmentierungsverfahren zur erweiterung des aktiven vokabulars von spracherkennern
CA2089786A1 (en) Context-dependent speech recognizer using estimated next word context
DE59809609D1 (de) Verfahren zur Spracherkennung mit Sprachmodellanpassung
DE59801560D1 (de) Verfahren zur Spracherkennung mit Sprachmodellanpassung
DE69602444D1 (de) System und verfahren zum einschränken des suchumfangs in einem lexikon
DE60004862D1 (de) Automatische bestimmung der genauigkeit eines aussprachewörterbuchs in einem spracherkennungssystem
CA2198306A1 (en) Method and apparatus for an improved language recognition system
ES2153021T3 (es) Procedimiento y disposicion para la conversion del habla a texto.
DE68912397D1 (de) Spracherkennung mit Sprecheranpassung durch Lernprozess.
DE60111329D1 (de) Anpassung des phonetischen Kontextes zur Verbesserung der Spracherkennung
DE3779170D1 (de) Erzeugung von wortgrundstrukturen zur spracherkennung.
ATE258332T1 (de) Netzwerk- und sprachmodelle zur verwendung in einem spracherkennungssystem
ATE398323T1 (de) Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache
BR9913524A (pt) Reconhecedor de voz, e, processo de reconhecimento de voz
DE68904330D1 (de) Hirudin-varianten, deren verwendung und verfahren zu deren herstellung.
AT371132B (de) Verfahren zur herstellung von modifizierten, waesserigen kunststoff-dispersionen
WO1996000962A3 (en) Method and device for adapting a speech recognition equipment for dialectal variations in a language
GB2196460B (en) Methods for comparing an input voice pattern with a registered voice pattern and voice recognition systems
DE3673857D1 (de) Auf einem erworbenen wissensgut basierte einrichtung und verfahren zur automatischen spracherkennung.
Angelini et al. Automatic segmentation and labeling of English and Italian speech databases.
ATE48486T1 (de) Schluesselworterkennungssystem unter anwendung eines sprachmusterverkettungsmodels.
DE59802584D1 (de) Vefahren zur spracherkennung unter verwendung von einer grammatik
ATE241196T1 (de) Erweiterung eines spracherkennungswortschatzes unter verwendung von abgeleiteten wörtern
DE3681155D1 (de) Verfahren und einrichtung zur ermittlung einer wahrscheinlichen woerterfolge aus durch einen akustischen prozessor erzeugten kennsaetzen.

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8339 Ceased/non-payment of the annual fee