DE60022291D1 - Unsupervised adaptation of a large vocabulary automatic speech recognizer - Google Patents
Info
- Publication number
- DE60022291D1
- Authority
- DE
- Germany
- Prior art keywords
- vocabulary
- unsupervised
- large
- adaptation
- automatic speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/311,333 US7505905B1 (en) | 1999-05-13 | 1999-05-13 | In-the-field adaptation of a large vocabulary automatic speech recognizer (ASR) |
US311333 | 1999-05-13 | | |
PCT/EP2000/004246 WO2000070603A1 (en) | 1999-05-13 | 2000-05-10 | Unsupervised adaptation of a large vocabulary automatic speech recognizer |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60022291D1 (de) | 2005-10-06 |
DE60022291T2 (de) | 2006-06-29 |
Family
ID=23206426
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60022291T (Expired - Lifetime) DE60022291T2 (de) | 1999-05-13 | 2000-05-10 | Unsupervised adaptation of a large vocabulary automatic speech recognizer |
Country Status (6)
Country | Link |
---|---|
US (1) | US7505905B1 (de) |
EP (1) | EP1097446B1 (de) |
JP (1) | JP2003526117A (de) |
KR (1) | KR20010053521A (de) |
DE (1) | DE60022291T2 (de) |
WO (1) | WO2000070603A1 (de) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10127559A1 (de) * | 2001-06-06 | 2002-12-12 | Philips Corp Intellectual Pty | User group-specific pattern processing system |
JP2003099086A (ja) * | 2001-09-25 | 2003-04-04 | Nippon Hoso Kyokai <Nhk> | Language/acoustic model creation method, language/acoustic model creation device, and language/acoustic model creation program |
US7424421B2 (en) * | 2004-03-03 | 2008-09-09 | Microsoft Corporation | Word collection method and system for use in word-breaking |
TWI342010B (en) * | 2006-12-13 | 2011-05-11 | Delta Electronics Inc | Speech recognition method and system with intelligent classification and adjustment |
US8583415B2 (en) | 2007-06-29 | 2013-11-12 | Microsoft Corporation | Phonetic search using normalized string |
US9224384B2 (en) * | 2012-06-06 | 2015-12-29 | Cypress Semiconductor Corporation | Histogram based pre-pruning scheme for active HMMS |
US9502029B1 (en) * | 2012-06-25 | 2016-11-22 | Amazon Technologies, Inc. | Context-aware speech processing |
JP5966689B2 (ja) * | 2012-07-04 | 2016-08-10 | 日本電気株式会社 | Acoustic model adaptation device, acoustic model adaptation method, and acoustic model adaptation program |
KR102073102B1 (ko) * | 2013-03-21 | 2020-02-04 | 삼성전자 주식회사 | Language model DB for language recognition, language recognition device, language recognition method, and language recognition system |
US9489943B2 (en) * | 2013-10-16 | 2016-11-08 | Interactive Intelligence Group, Inc. | System and method for learning alternate pronunciations for speech recognition |
EP3958255A1 (de) | 2015-01-16 | 2022-02-23 | Samsung Electronics Co., Ltd. | Method and device for performing speech recognition |
US10147428B1 (en) | 2018-05-30 | 2018-12-04 | Green Key Technologies Llc | Computer systems exhibiting improved computer speed and transcription accuracy of automatic speech transcription (AST) based on a multiple speech-to-text engines and methods of use thereof |
KR102515914B1 (ko) * | 2022-12-21 | 2023-03-30 | 주식회사 액션파워 | Pronunciation transcription method using an STT model |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6461799A (en) * | 1987-09-01 | 1989-03-08 | Nec Corp | Fast voice recognition equipment |
US5199077A (en) * | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
JP3216565B2 (ja) * | 1996-08-02 | 2001-10-09 | 日本電信電話株式会社 | Speaker adaptation method for speech models, speech recognition method using the method, and recording medium on which the method is recorded |
US5835890A (en) | 1996-08-02 | 1998-11-10 | Nippon Telegraph And Telephone Corporation | Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon |
JPH1185184A (ja) * | 1997-09-04 | 1999-03-30 | Atr Onsei Honyaku Tsushin Kenkyusho:Kk | Speech recognition device |
US6208964B1 (en) * | 1998-08-31 | 2001-03-27 | Nortel Networks Limited | Method and apparatus for providing unsupervised adaptation of transcriptions |
1999
- 1999-05-13 US US09/311,333, patent US7505905B1 (en): not active, Expired - Lifetime
2000
- 2000-05-10 WO PCT/EP2000/004246, patent WO2000070603A1 (en): not active, Application Discontinuation
- 2000-05-10 EP EP00931185A, patent EP1097446B1 (de): not active, Expired - Lifetime
- 2000-05-10 JP JP2000618971A, patent JP2003526117A (ja): active, Pending
- 2000-05-10 DE DE60022291T, patent DE60022291T2 (de): not active, Expired - Lifetime
- 2000-05-10 KR KR1020017000558A, patent KR20010053521A (ko): not active, Application Discontinuation
Also Published As
Publication number | Publication date |
---|---|
US7505905B1 (en) | 2009-03-17 |
DE60022291T2 (de) | 2006-06-29 |
EP1097446A1 (de) | 2001-05-09 |
WO2000070603A1 (en) | 2000-11-23 |
JP2003526117A (ja) | 2003-09-02 |
KR20010053521A (ko) | 2001-06-25 |
EP1097446B1 (de) | 2005-08-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60022291D1 (de) | | Unsupervised adaptation of a large vocabulary automatic speech recognizer |
DE69315374D1 (de) | | Speech recognition system for natural language translation |
AU2001263138A1 (en) | | Automated voice-based dialogue with a voice mail system by imitation of the human voice |
AU4019801A (en) | | Language translation using a constrained grammar in the form of structured sentences |
AU2002218916A1 (en) | | Hierarchical language models for speech recognition |
AU7358900A (en) | | Computer-assisted language translation |
AU2002236034A1 (en) | | Spoken language interface |
GB0204056D0 (en) | | Voice activated language translation |
DE60022441D1 (de) | | Device for automatic control of drug delivery |
DE69823954D1 (de) | | Source-normalizing training for speech modeling |
AU3573100A (en) | | Natural language interface for searching database |
EP1020789B8 (de) | | Device with a voice-controlled or manually operated user interface and learning-aid method for learning the voice commands of such a device |
DE69617319D1 (de) | | Thin-film transducer for undershoot reduction |
DE60111481D1 (de) | | Handling of user-specific vocabulary portions in language service systems |
AU2181601A (en) | | Method and device for speech recognition with disjoint language models |
EP1251489A3 (de) | | Training of parameters of a speech recognition system for recognizing pronunciation variants |
IT1267251B1 (it) | | Device for the asymmetric deposition of coils |
DE69506667D1 (de) | | Optoelectronic device for aircraft piloting assistance |
AU2001262407A1 (en) | | Dynamic language models for speech recognition |
DE69836081D1 (de) | | Transmitter with improved harmonic speech coder |
ITUD950080A0 (it) | | Method for automatic adjustment of rolled-stock guide rollers and related device |
AU1767600A (en) | | Speech recognizer with a lexical tree based n-gram language model |
GB0007877D0 (en) | | Electronic dictionary with vocabulary learning function |
AU2002231046A1 (en) | | Context-responsive spoken language instruction |
DE50014521D1 (de) | | Method for training an automatic speech recognizer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 8364 | No opposition during term of opposition | |
 | 8327 | Change in the person/name/address of the patent owner | Owner name: NUANCE COMMUNICATIONS, INC. (N.D.GES.D. STAATE, US |
 | 8328 | Change in the person/name/address of the agent | Representative's name: WITTE, WELLER & PARTNER, 70173 STUTTGART |