JP4360453B2 - 文法を使用する音声認識のための方法 - Google Patents
文法を使用する音声認識のための方法 Download PDFInfo
- Publication number
- JP4360453B2 JP4360453B2 JP2000524788A JP2000524788A JP4360453B2 JP 4360453 B2 JP4360453 B2 JP 4360453B2 JP 2000524788 A JP2000524788 A JP 2000524788A JP 2000524788 A JP2000524788 A JP 2000524788A JP 4360453 B2 JP4360453 B2 JP 4360453B2
- Authority
- JP
- Japan
- Prior art keywords
- recognition
- word
- recognition method
- syntax
- words
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 77
- 238000011156 evaluation Methods 0.000 claims abstract description 8
- 238000001514 detection method Methods 0.000 claims 2
- 238000010586 diagram Methods 0.000 abstract 1
- 238000012545 processing Methods 0.000 description 17
- 230000007704 transition Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 2
- 241000252794 Sphinx Species 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000007430 reference method Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE19754957A DE19754957A1 (de) | 1997-12-11 | 1997-12-11 | Verfahren zur Spracherkennung |
| DE19754957.8 | 1997-12-11 | ||
| PCT/DE1998/003536 WO1999030314A1 (de) | 1997-12-11 | 1998-12-02 | Verfahren zur spracherkennung unter verwendung von einer grammatik |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2001526411A JP2001526411A (ja) | 2001-12-18 |
| JP2001526411A5 JP2001526411A5 (https=) | 2009-07-23 |
| JP4360453B2 true JP4360453B2 (ja) | 2009-11-11 |
Family
ID=7851483
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2000524788A Expired - Fee Related JP4360453B2 (ja) | 1997-12-11 | 1998-12-02 | 文法を使用する音声認識のための方法 |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US7020606B1 (https=) |
| EP (1) | EP1038293B1 (https=) |
| JP (1) | JP4360453B2 (https=) |
| AT (1) | ATE211291T1 (https=) |
| DE (2) | DE19754957A1 (https=) |
| ES (1) | ES2169572T3 (https=) |
| WO (1) | WO1999030314A1 (https=) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE102007042971A1 (de) * | 2007-09-10 | 2009-03-12 | Siemens Ag | Spracherkennungsverfahren und Spracherkennungsvorrichtung |
| US20090245646A1 (en) * | 2008-03-28 | 2009-10-01 | Microsoft Corporation | Online Handwriting Expression Recognition |
| US20100166314A1 (en) * | 2008-12-30 | 2010-07-01 | Microsoft Corporation | Segment Sequence-Based Handwritten Expression Recognition |
| CN103971686B (zh) * | 2013-01-30 | 2015-06-10 | 腾讯科技(深圳)有限公司 | 自动语音识别方法和系统 |
| WO2014189399A1 (en) | 2013-05-22 | 2014-11-27 | Axon Doo | A mixed-structure n-gram language model |
| LU101763B1 (en) * | 2020-05-04 | 2021-11-05 | Microsoft Technology Licensing Llc | Microsegment secure speech transcription |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4718094A (en) * | 1984-11-19 | 1988-01-05 | International Business Machines Corp. | Speech recognition system |
| DE3711348A1 (de) * | 1987-04-03 | 1988-10-20 | Philips Patentverwaltung | Verfahren zum erkennen kontinuierlich gesprochener woerter |
| JPH01177600A (ja) * | 1988-01-06 | 1989-07-13 | Nec Corp | 音声認識誤り訂正装置 |
| JP3004023B2 (ja) * | 1989-11-28 | 2000-01-31 | 株式会社東芝 | 音声認識装置 |
| TW323364B (https=) * | 1993-11-24 | 1997-12-21 | At & T Corp | |
| DE19501599C1 (de) * | 1995-01-20 | 1996-05-02 | Daimler Benz Ag | Verfahren zur Spracherkennung |
| JP3180655B2 (ja) * | 1995-06-19 | 2001-06-25 | 日本電信電話株式会社 | パターンマッチングによる単語音声認識方法及びその方法を実施する装置 |
| DE69517705T2 (de) * | 1995-11-04 | 2000-11-23 | International Business Machines Corp., Armonk | Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem |
| EP0849723A3 (en) * | 1996-12-20 | 1998-12-30 | ATR Interpreting Telecommunications Research Laboratories | Speech recognition apparatus equipped with means for removing erroneous candidate of speech recognition |
| DE69937176T2 (de) * | 1998-08-28 | 2008-07-10 | International Business Machines Corp. | Segmentierungsverfahren zur Erweiterung des aktiven Vokabulars von Spracherkennern |
-
1997
- 1997-12-11 DE DE19754957A patent/DE19754957A1/de not_active Withdrawn
-
1998
- 1998-12-02 AT AT98965097T patent/ATE211291T1/de not_active IP Right Cessation
- 1998-12-02 WO PCT/DE1998/003536 patent/WO1999030314A1/de not_active Ceased
- 1998-12-02 DE DE59802584T patent/DE59802584D1/de not_active Expired - Lifetime
- 1998-12-02 EP EP98965097A patent/EP1038293B1/de not_active Expired - Lifetime
- 1998-12-02 ES ES98965097T patent/ES2169572T3/es not_active Expired - Lifetime
- 1998-12-02 JP JP2000524788A patent/JP4360453B2/ja not_active Expired - Fee Related
- 1998-12-02 US US09/581,408 patent/US7020606B1/en not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| EP1038293B1 (de) | 2001-12-19 |
| DE59802584D1 (de) | 2002-01-31 |
| WO1999030314A1 (de) | 1999-06-17 |
| ATE211291T1 (de) | 2002-01-15 |
| JP2001526411A (ja) | 2001-12-18 |
| US7020606B1 (en) | 2006-03-28 |
| EP1038293A1 (de) | 2000-09-27 |
| ES2169572T3 (es) | 2002-07-01 |
| DE19754957A1 (de) | 1999-06-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5758024A (en) | Method and system for encoding pronunciation prefix trees | |
| US6983239B1 (en) | Method and apparatus for embedding grammars in a natural language understanding (NLU) statistical parser | |
| US7072837B2 (en) | Method for processing initially recognized speech in a speech recognition session | |
| US7120582B1 (en) | Expanding an effective vocabulary of a speech recognition system | |
| Ward et al. | Recent improvements in the CMU spoken language understanding system | |
| Hirsimaki et al. | Importance of high-order n-gram models in morph-based speech recognition | |
| US6574597B1 (en) | Fully expanded context-dependent networks for speech recognition | |
| US20040220809A1 (en) | System with composite statistical and rules-based grammar model for speech recognition and natural language understanding | |
| US5875426A (en) | Recognizing speech having word liaisons by adding a phoneme to reference word models | |
| JPH0583918B2 (https=) | ||
| Meteer et al. | Statistical language modeling combining n-gram and context-free grammars | |
| KR100726875B1 (ko) | 구두 대화에서의 전형적인 실수에 대한 보완적인 언어모델을 갖는 음성 인식 디바이스 | |
| JP4360453B2 (ja) | 文法を使用する音声認識のための方法 | |
| EP1111587B1 (en) | Speech recognition device implementing a syntactic permutation rule | |
| Seneff et al. | ANGIE: A new framework for speech analysis based on morpho-phonological modelling | |
| JP2001242885A (ja) | 音声認識装置および音声認識方法、並びに記録媒体 | |
| Vu et al. | Vietnamese automatic speech recognition: The flavor approach | |
| Hanazawa et al. | An efficient search method for large-vocabulary continuous-speech recognition. | |
| Chung et al. | Integrating speech with keypad input for automatic entry of spelling and pronunciation of new words. | |
| Hori et al. | Spoken interactive odqa system: Spiqa | |
| Rotovnik et al. | A comparison of HTK, ISIP and julius in slovenian large vocabulary continuous speech recognition | |
| Seneff | The use of subword linguistic modeling for multiple tasks in speech recognition | |
| JPH09281989A (ja) | 音声認識装置および方法 | |
| KR20010077042A (ko) | 트리 구조의 단어사전을 갖는 연속음성 인식 장치 | |
| KR20010077041A (ko) | 트리구조의 언어모델을 갖는 연속 음성 인식 장치 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20051201 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090206 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20090507 |
|
| A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20090514 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090603 |
|
| A524 | Written submission of copy of amendment under article 19 pct |
Free format text: JAPANESE INTERMEDIATE CODE: A524 Effective date: 20090603 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20090708 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20090804 |
|
| R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20120821 Year of fee payment: 3 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130821 Year of fee payment: 4 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| LAPS | Cancellation because of no payment of annual fees |