JP4515054B2 - Method of speech recognition and method of decoding a speech signal - Google Patents

Method of speech recognition and method of decoding a speech signal

Info

Publication number
JP4515054B2
JP4515054B2 JP2003278640A JP2003278640A JP4515054B2 JP 4515054 B2 JP4515054 B2 JP 4515054B2 JP 2003278640 A JP2003278640 A JP 2003278640A JP 2003278640 A JP2003278640 A JP 2003278640A JP 4515054 B2 JP4515054 B2 JP 4515054B2
Authority
JP
Japan
Prior art keywords
value
sound
state
generation
trajectory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2003278640A
Other languages
English (en)
Japanese (ja)
Other versions
JP2004054298A (ja)
JP2004054298A5
Inventor
Li Deng
Jian-Lai Zhou
Frank Torsten Bernd Seide
Asela J. R. Gunawardena
Hagai Attias
Alejandro Acero
Xuedong Huang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Publication of JP2004054298A
Publication of JP2004054298A5
Application granted
Publication of JP4515054B2
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/12 Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025 Phonemes, fenemes or fenones being the recognition units

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Image Analysis (AREA)
  • Machine Translation (AREA)
  • Noise Elimination (AREA)
  • Complex Calculations (AREA)
JP2003278640A 2002-07-23 2003-07-23 Method of speech recognition and method of decoding a speech signal Expired - Fee Related JP4515054B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US39816602P 2002-07-23 2002-07-23
US40597102P 2002-08-26 2002-08-26
US10/267,522 US7050975B2 (en) 2002-07-23 2002-10-09 Method of speech recognition using time-dependent interpolation and hidden dynamic value classes

Publications (3)

Publication Number Publication Date
JP2004054298A (ja) 2004-02-19
JP2004054298A5 2006-08-31
JP4515054B2 (ja) 2010-07-28

Family

ID=30003734

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2003278640A Expired - Fee Related JP4515054B2 (ja) 2002-07-23 2003-07-23 Method of speech recognition and method of decoding a speech signal

Country Status (5)

Country Link
US (2) US7050975B2
EP (1) EP1385147B1
JP (1) JP4515054B2
AT (1) ATE394773T1
DE (1) DE60320719D1

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7209881B2 (en) * 2001-12-20 2007-04-24 Matsushita Electric Industrial Co., Ltd. Preparing acoustic models by sufficient statistics and noise-superimposed speech data
US7174292B2 (en) * 2002-05-20 2007-02-06 Microsoft Corporation Method of determining uncertainty associated with acoustic distortion-based noise reduction
US7103540B2 (en) * 2002-05-20 2006-09-05 Microsoft Corporation Method of pattern recognition using noise reduction uncertainty
US7050975B2 (en) * 2002-07-23 2006-05-23 Microsoft Corporation Method of speech recognition using time-dependent interpolation and hidden dynamic value classes
FR2846458B1 (fr) * 2002-10-25 2005-02-25 France Telecom Method for automatic processing of a speech signal
US9117460B2 (en) * 2004-05-12 2015-08-25 Core Wireless Licensing S.A.R.L. Detection of end of utterance in speech recognition system
US7409346B2 (en) * 2004-11-05 2008-08-05 Microsoft Corporation Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction
US7565284B2 (en) * 2004-11-05 2009-07-21 Microsoft Corporation Acoustic models with structured hidden dynamics with integration over many possible hidden trajectories
US7519531B2 (en) * 2005-03-30 2009-04-14 Microsoft Corporation Speaker adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation
US7805301B2 (en) * 2005-07-01 2010-09-28 Microsoft Corporation Covariance estimation for pattern recognition
US7653535B2 (en) 2005-12-15 2010-01-26 Microsoft Corporation Learning statistically characterized resonance targets in a hidden trajectory model
US8010356B2 (en) * 2006-02-17 2011-08-30 Microsoft Corporation Parameter learning in a hidden trajectory model
US7877256B2 (en) * 2006-02-17 2011-01-25 Microsoft Corporation Time synchronous decoding for long-span hidden trajectory model
US7805308B2 (en) * 2007-01-19 2010-09-28 Microsoft Corporation Hidden trajectory modeling with differential cepstra for speech recognition
US9020816B2 (en) * 2008-08-14 2015-04-28 21Ct, Inc. Hidden markov model for speech processing with training method
US9009039B2 (en) * 2009-06-12 2015-04-14 Microsoft Technology Licensing, Llc Noise adaptive training for speech recognition
EP2539888B1 (en) 2010-02-22 2015-05-20 Nuance Communications, Inc. Online maximum-likelihood mean and variance normalization for speech recognition
TWI442384B (zh) * 2011-07-26 2014-06-21 Ind Tech Res Inst Microphone-array-based speech recognition system and method
JP6301664B2 (ja) 2014-01-31 2018-03-28 Toshiba Corp Conversion device, pattern recognition system, conversion method, and program
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
US10354642B2 (en) * 2017-03-03 2019-07-16 Microsoft Technology Licensing, Llc Hyperarticulation detection in repetitive voice queries using pairwise comparison for improved speech recognition
JP6599914B2 (ja) * 2017-03-09 2019-10-30 Toshiba Corp Speech recognition device, speech recognition method, and program

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4980917A (en) * 1987-11-18 1990-12-25 Emerson & Stern Associates, Inc. Method and apparatus for determining articulatory parameters from speech data
JP2986345B2 (ja) * 1993-10-18 1999-12-06 International Business Machines Corporation Audio recording indexing apparatus and method
GB2290684A (en) * 1994-06-22 1996-01-03 Ibm Speech synthesis using hidden Markov model to determine speech unit durations
JPH0895592A (ja) * 1994-09-21 1996-04-12 Nippon Telegr & Teleph Corp <Ntt> Pattern recognition method
JPH0822296A (ja) * 1994-07-07 1996-01-23 Nippon Telegr & Teleph Corp <Ntt> Pattern recognition method
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US7050975B2 (en) * 2002-07-23 2006-05-23 Microsoft Corporation Method of speech recognition using time-dependent interpolation and hidden dynamic value classes

Also Published As

Publication number Publication date
US20060085191A1 (en) 2006-04-20
JP2004054298A (ja) 2004-02-19
US7206741B2 (en) 2007-04-17
EP1385147B1 (en) 2008-05-07
ATE394773T1 (de) 2008-05-15
US7050975B2 (en) 2006-05-23
EP1385147A2 (en) 2004-01-28
EP1385147A3 (en) 2005-04-20
DE60320719D1 (de) 2008-06-19
US20040019483A1 (en) 2004-01-29

Similar Documents

Publication Publication Date Title
JP4515054B2 (ja) Method of speech recognition and method of decoding a speech signal
US8280733B2 (en) Automatic speech recognition learning using categorization and selective incorporation of user-initiated corrections
EP1465154B1 (en) Method of speech recognition using variational inference with switching state space models
US20060009965A1 (en) Method and apparatus for distribution-based language model adaptation
US7617104B2 (en) Method of speech recognition using hidden trajectory Hidden Markov Models
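JP5072206B2 (ja) Hidden conditional random field models for speech classification and speech recognition
JP6031316B2 (ja) Speech recognition device, error correction model learning method, and program
JP2001092496A (ja) Continuous speech recognition device and recording medium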
US7480615B2 (en) Method of speech recognition using multimodal variational inference with switching state space models
US20060200351A1 (en) Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction
US7565284B2 (en) Acoustic models with structured hidden dynamics with integration over many possible hidden trajectories
US7346510B2 (en) Method of speech recognition using variables representing dynamic aspects of speech

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20060719

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20060719

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20091215

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20100312

A602 Written permission of extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A602

Effective date: 20100317

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100415

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20100507

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20100512

R150 Certificate of patent or registration of utility model

Ref document number: 4515054

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130521

Year of fee payment: 3

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees