JP4515054B2 - 音声認識の方法および音声信号を復号化する方法 - Google Patents
音声認識の方法および音声信号を復号化する方法 Download PDFInfo
- Publication number
- JP4515054B2 JP4515054B2 JP2003278640A JP2003278640A JP4515054B2 JP 4515054 B2 JP4515054 B2 JP 4515054B2 JP 2003278640 A JP2003278640 A JP 2003278640A JP 2003278640 A JP2003278640 A JP 2003278640A JP 4515054 B2 JP4515054 B2 JP 4515054B2
- Authority
- JP
- Japan
- Prior art keywords
- value
- sound
- state
- generation
- trajectory
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/12—Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
- Image Analysis (AREA)
- Machine Translation (AREA)
- Noise Elimination (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US39816602P | 2002-07-23 | 2002-07-23 | |
| US40597102P | 2002-08-26 | 2002-08-26 | |
| US10/267,522 US7050975B2 (en) | 2002-07-23 | 2002-10-09 | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2004054298A JP2004054298A (ja) | 2004-02-19 |
| JP2004054298A5 JP2004054298A5 (enExample) | 2006-08-31 |
| JP4515054B2 true JP4515054B2 (ja) | 2010-07-28 |
Family
ID=30003734
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2003278640A Expired - Fee Related JP4515054B2 (ja) | 2002-07-23 | 2003-07-23 | 音声認識の方法および音声信号を復号化する方法 |
Country Status (5)
| Country | Link |
|---|---|
| US (2) | US7050975B2 (enExample) |
| EP (1) | EP1385147B1 (enExample) |
| JP (1) | JP4515054B2 (enExample) |
| AT (1) | ATE394773T1 (enExample) |
| DE (1) | DE60320719D1 (enExample) |
Families Citing this family (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7209881B2 (en) * | 2001-12-20 | 2007-04-24 | Matsushita Electric Industrial Co., Ltd. | Preparing acoustic models by sufficient statistics and noise-superimposed speech data |
| US7174292B2 (en) * | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
| US7103540B2 (en) * | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
| US7050975B2 (en) * | 2002-07-23 | 2006-05-23 | Microsoft Corporation | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes |
| FR2846458B1 (fr) * | 2002-10-25 | 2005-02-25 | France Telecom | Procede de traitement automatique d'un signal de parole. |
| US9117460B2 (en) * | 2004-05-12 | 2015-08-25 | Core Wireless Licensing S.A.R.L. | Detection of end of utterance in speech recognition system |
| US7409346B2 (en) * | 2004-11-05 | 2008-08-05 | Microsoft Corporation | Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction |
| US7565284B2 (en) * | 2004-11-05 | 2009-07-21 | Microsoft Corporation | Acoustic models with structured hidden dynamics with integration over many possible hidden trajectories |
| US7519531B2 (en) * | 2005-03-30 | 2009-04-14 | Microsoft Corporation | Speaker adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation |
| US7805301B2 (en) * | 2005-07-01 | 2010-09-28 | Microsoft Corporation | Covariance estimation for pattern recognition |
| US7653535B2 (en) | 2005-12-15 | 2010-01-26 | Microsoft Corporation | Learning statistically characterized resonance targets in a hidden trajectory model |
| US8010356B2 (en) * | 2006-02-17 | 2011-08-30 | Microsoft Corporation | Parameter learning in a hidden trajectory model |
| US7877256B2 (en) * | 2006-02-17 | 2011-01-25 | Microsoft Corporation | Time synchronous decoding for long-span hidden trajectory model |
| US7805308B2 (en) * | 2007-01-19 | 2010-09-28 | Microsoft Corporation | Hidden trajectory modeling with differential cepstra for speech recognition |
| US9020816B2 (en) * | 2008-08-14 | 2015-04-28 | 21Ct, Inc. | Hidden markov model for speech processing with training method |
| US9009039B2 (en) * | 2009-06-12 | 2015-04-14 | Microsoft Technology Licensing, Llc | Noise adaptive training for speech recognition |
| EP2539888B1 (en) | 2010-02-22 | 2015-05-20 | Nuance Communications, Inc. | Online maximum-likelihood mean and variance normalization for speech recognition |
| TWI442384B (zh) * | 2011-07-26 | 2014-06-21 | Ind Tech Res Inst | 以麥克風陣列為基礎之語音辨識系統與方法 |
| JP6301664B2 (ja) | 2014-01-31 | 2018-03-28 | 株式会社東芝 | 変換装置、パターン認識システム、変換方法およびプログラム |
| US9953646B2 (en) | 2014-09-02 | 2018-04-24 | Belleau Technologies | Method and system for dynamic speech recognition and tracking of prewritten script |
| US10354642B2 (en) * | 2017-03-03 | 2019-07-16 | Microsoft Technology Licensing, Llc | Hyperarticulation detection in repetitive voice queries using pairwise comparison for improved speech recognition |
| JP6599914B2 (ja) * | 2017-03-09 | 2019-10-30 | 株式会社東芝 | 音声認識装置、音声認識方法およびプログラム |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4980917A (en) * | 1987-11-18 | 1990-12-25 | Emerson & Stern Associates, Inc. | Method and apparatus for determining articulatory parameters from speech data |
| JP2986345B2 (ja) * | 1993-10-18 | 1999-12-06 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声記録指標化装置及び方法 |
| GB2290684A (en) * | 1994-06-22 | 1996-01-03 | Ibm | Speech synthesis using hidden Markov model to determine speech unit durations |
| JPH0895592A (ja) * | 1994-09-21 | 1996-04-12 | Nippon Telegr & Teleph Corp <Ntt> | パターン認識方法 |
| JPH0822296A (ja) * | 1994-07-07 | 1996-01-23 | Nippon Telegr & Teleph Corp <Ntt> | パターン認識方法 |
| US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
| US7050975B2 (en) * | 2002-07-23 | 2006-05-23 | Microsoft Corporation | Method of speech recognition using time-dependent interpolation and hidden dynamic value classes |
-
2002
- 2002-10-09 US US10/267,522 patent/US7050975B2/en not_active Expired - Fee Related
-
2003
- 2003-06-30 DE DE60320719T patent/DE60320719D1/de not_active Expired - Lifetime
- 2003-06-30 EP EP03014848A patent/EP1385147B1/en not_active Expired - Lifetime
- 2003-06-30 AT AT03014848T patent/ATE394773T1/de not_active IP Right Cessation
- 2003-07-23 JP JP2003278640A patent/JP4515054B2/ja not_active Expired - Fee Related
-
2005
- 2005-12-06 US US11/294,858 patent/US7206741B2/en not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| US20060085191A1 (en) | 2006-04-20 |
| JP2004054298A (ja) | 2004-02-19 |
| US7206741B2 (en) | 2007-04-17 |
| EP1385147B1 (en) | 2008-05-07 |
| ATE394773T1 (de) | 2008-05-15 |
| US7050975B2 (en) | 2006-05-23 |
| EP1385147A2 (en) | 2004-01-28 |
| EP1385147A3 (en) | 2005-04-20 |
| DE60320719D1 (de) | 2008-06-19 |
| US20040019483A1 (en) | 2004-01-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP4515054B2 (ja) | 音声認識の方法および音声信号を復号化する方法 | |
| US8280733B2 (en) | Automatic speech recognition learning using categorization and selective incorporation of user-initiated corrections | |
| EP1465154B1 (en) | Method of speech recognition using variational inference with switching state space models | |
| US20060009965A1 (en) | Method and apparatus for distribution-based language model adaptation | |
| US7617104B2 (en) | Method of speech recognition using hidden trajectory Hidden Markov Models | |
| JP5072206B2 (ja) | 音声分類および音声認識のための隠れ条件付確率場モデル | |
| JP6031316B2 (ja) | 音声認識装置、誤り修正モデル学習方法、及びプログラム | |
| JP2001092496A (ja) | 連続音声認識装置および記録媒体 | |
| US7480615B2 (en) | Method of speech recognition using multimodal variational inference with switching state space models | |
| US20060200351A1 (en) | Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction | |
| US7565284B2 (en) | Acoustic models with structured hidden dynamics with integration over many possible hidden trajectories | |
| US7346510B2 (en) | Method of speech recognition using variables representing dynamic aspects of speech |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20060719 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20060719 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20091215 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20100312 |
|
| A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20100317 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20100415 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20100507 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20100512 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 4515054 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130521 Year of fee payment: 3 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130521 Year of fee payment: 3 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
|
| R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| LAPS | Cancellation because of no payment of annual fees |