DE60022291D1 - Unüberwachte anpassung eines automatischen spracherkenners mit grossem wortschatz - Google Patents

Unüberwachte anpassung eines automatischen spracherkenners mit grossem wortschatz

Info

Publication number
DE60022291D1
DE60022291D1 DE60022291T DE60022291T DE60022291D1 DE 60022291 D1 DE60022291 D1 DE 60022291D1 DE 60022291 T DE60022291 T DE 60022291T DE 60022291 T DE60022291 T DE 60022291T DE 60022291 D1 DE60022291 D1 DE 60022291D1
Authority
DE
Germany
Prior art keywords
vocabulary
unsaturated
great
adjustment
automatic language
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60022291T
Other languages
English (en)
Other versions
DE60022291T2 (de
Inventor
S Zimmerman
N Tajchman
S Boardman
W Rahmel
B Schalk
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Inc
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=23206426&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=DE60022291(D1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Application granted granted Critical
Publication of DE60022291D1 publication Critical patent/DE60022291D1/de
Publication of DE60022291T2 publication Critical patent/DE60022291T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
DE60022291T 1999-05-13 2000-05-10 Unüberwachte anpassung eines automatischen spracherkenners mit grossem wortschatz Expired - Lifetime DE60022291T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US09/311,333 US7505905B1 (en) 1999-05-13 1999-05-13 In-the-field adaptation of a large vocabulary automatic speech recognizer (ASR)
US311333 1999-05-13
PCT/EP2000/004246 WO2000070603A1 (en) 1999-05-13 2000-05-10 Unsupervised adaptation of a large vocabulary automatic speech recognizer

Publications (2)

Publication Number Publication Date
DE60022291D1 true DE60022291D1 (de) 2005-10-06
DE60022291T2 DE60022291T2 (de) 2006-06-29

Family

ID=23206426

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60022291T Expired - Lifetime DE60022291T2 (de) 1999-05-13 2000-05-10 Unüberwachte anpassung eines automatischen spracherkenners mit grossem wortschatz

Country Status (6)

Country Link
US (1) US7505905B1 (de)
EP (1) EP1097446B1 (de)
JP (1) JP2003526117A (de)
KR (1) KR20010053521A (de)
DE (1) DE60022291T2 (de)
WO (1) WO2000070603A1 (de)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10127559A1 (de) * 2001-06-06 2002-12-12 Philips Corp Intellectual Pty Benutzergruppenspezifisches Musterverarbeitungssystem
JP2003099086A (ja) * 2001-09-25 2003-04-04 Nippon Hoso Kyokai <Nhk> 言語・音響モデル作成方法および言語・音響モデル作成装置ならびに言語・音響モデル作成プログラム
US7424421B2 (en) * 2004-03-03 2008-09-09 Microsoft Corporation Word collection method and system for use in word-breaking
TWI342010B (en) * 2006-12-13 2011-05-11 Delta Electronics Inc Speech recognition method and system with intelligent classification and adjustment
US8583415B2 (en) 2007-06-29 2013-11-12 Microsoft Corporation Phonetic search using normalized string
US9224384B2 (en) * 2012-06-06 2015-12-29 Cypress Semiconductor Corporation Histogram based pre-pruning scheme for active HMMS
US9502029B1 (en) * 2012-06-25 2016-11-22 Amazon Technologies, Inc. Context-aware speech processing
JP5966689B2 (ja) * 2012-07-04 2016-08-10 日本電気株式会社 音響モデル適応装置、音響モデル適応方法および音響モデル適応プログラム
KR102073102B1 (ko) * 2013-03-21 2020-02-04 삼성전자 주식회사 언어인식을 위한 언어모델 db, 언어인식장치와 언어인식방법, 및 언어인식시스템
US9489943B2 (en) * 2013-10-16 2016-11-08 Interactive Intelligence Group, Inc. System and method for learning alternate pronunciations for speech recognition
EP3958255A1 (de) 2015-01-16 2022-02-23 Samsung Electronics Co., Ltd. Verfahren und vorrichtung zur durchführung von spracherkennung
US10147428B1 (en) 2018-05-30 2018-12-04 Green Key Technologies Llc Computer systems exhibiting improved computer speed and transcription accuracy of automatic speech transcription (AST) based on a multiple speech-to-text engines and methods of use thereof
KR102515914B1 (ko) * 2022-12-21 2023-03-30 주식회사 액션파워 Stt 모델을 활용하는 발음 전사 방법

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6461799A (en) * 1987-09-01 1989-03-08 Nec Corp Fast voice recognition equipment
US5199077A (en) * 1991-09-19 1993-03-30 Xerox Corporation Wordspotting for voice editing and indexing
JP3216565B2 (ja) * 1996-08-02 2001-10-09 日本電信電話株式会社 音声モデルの話者適応化方法及びその方法を用いた音声認識方法及びその方法を記録した記録媒体
US5835890A (en) 1996-08-02 1998-11-10 Nippon Telegraph And Telephone Corporation Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon
JPH1185184A (ja) * 1997-09-04 1999-03-30 Atr Onsei Honyaku Tsushin Kenkyusho:Kk 音声認識装置
US6208964B1 (en) * 1998-08-31 2001-03-27 Nortel Networks Limited Method and apparatus for providing unsupervised adaptation of transcriptions

Also Published As

Publication number Publication date
US7505905B1 (en) 2009-03-17
DE60022291T2 (de) 2006-06-29
EP1097446A1 (de) 2001-05-09
WO2000070603A1 (en) 2000-11-23
JP2003526117A (ja) 2003-09-02
KR20010053521A (ko) 2001-06-25
EP1097446B1 (de) 2005-08-31

Similar Documents

Publication Publication Date Title
DE60022291D1 (de) Unüberwachte anpassung eines automatischen spracherkenners mit grossem wortschatz
DE69315374D1 (de) Spracherkennungssystem zur naturgetreuen Sprachübersetzung
AU2001263138A1 (en) Automated voice-based dialogue with a voice mail system by imitation of the human voice
AU4019801A (en) Language translation using a constrained grammar in the form of structured sentences
AU2002218916A1 (en) Hierarchical language models for speech recognition
AU7358900A (en) Computer-assisted language translation
AU2002236034A1 (en) Spoken language interface
GB0204056D0 (en) Voice activated language translation
DE60022441D1 (de) Vorrichtung zur automatischen regelung der medikamentenabgabe
DE69823954D1 (de) Quellen-normalisierendes Training zur Sprachmodellierung
AU3573100A (en) Natural language interface for searching database
EP1020789B8 (de) Gerät mit sprachgesteuerter oder handbedienter Benutzerschnittstelle und Lernhilfeverfahren zum Lernen der Sprachbefehle eines sochen Geräts
DE69617319D1 (de) Dünnfilmübertrager zur Unterschwingungsreduktion
DE60111481D1 (de) Handhabung benutzerspezifischer Wortschatzteile in Sprachendienstleistungssystemen
AU2181601A (en) Method and device for speech recognition with disjoint language models
EP1251489A3 (de) Training von Parametern eines Spracherkennungssystems zur Erkennung von Aussprachevarianten
IT1267251B1 (it) Dispositivo per il deposito asimmetrico delle spire
DE69506667D1 (de) Optoelektronische Vorrichtung zur Steuerungshilfe eines Flugzeuges
AU2001262407A1 (en) Dynamic language models for speech recognition
DE69836081D1 (de) Transmitter mit verbessertem harmonischen sprachkodierer
ITUD950080A0 (it) Procedimento di regolazione automatica dei rulli di guida laminato e relativo dispositivo
AU1767600A (en) Speech recognizer with a lexical tree based n-gram language model
GB0007877D0 (en) Elecronic dictionary with vocabulary learning function
AU2002231046A1 (en) Context-responsive spoken language instruction
DE50014521D1 (de) Verfahren zum Training eines automatischen Spracherkenners

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: NUANCE COMMUNICATIONS, INC. (N.D.GES.D. STAATE, US

8328 Change in the person/name/address of the agent

Representative=s name: WITTE, WELLER & PARTNER, 70173 STUTTGART