JP5072206B2 - Hidden conditional random field models for phonetic classification and speech recognition - Google Patents

Hidden conditional random field models for phonetic classification and speech recognition

Info

Publication number
JP5072206B2
Authority
JP
Japan
Prior art keywords
state
score
trellis
computer
values
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2005268550A
Other languages
English (en)
Japanese (ja)
Other versions
JP2006113570A (ja)
JP2006113570A5
Inventor
Alejandro Acero
Asela J. Gunawardana
Milind V. Mahajan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp
Publication of JP2006113570A
Publication of JP2006113570A5
Application granted
Publication of JP5072206B2
Anticipated expiration
Current legal status: Expired - Fee Related

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/14: Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Document Processing Apparatus (AREA)
JP2005268550A 2004-10-15 2005-09-15 Hidden conditional random field models for phonetic classification and speech recognition Expired - Fee Related JP5072206B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/966,047 2004-10-15
US10/966,047 US7627473B2 (en) 2004-10-15 2004-10-15 Hidden conditional random field models for phonetic classification and speech recognition

Publications (3)

Publication Number Publication Date
JP2006113570A (ja) 2006-04-27
JP2006113570A5 2008-10-30
JP5072206B2 (ja) 2012-11-14

Family

ID=35520793

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005268550A Expired - Fee Related JP5072206B2 (ja) 2004-10-15 2005-09-15 Hidden conditional random field models for phonetic classification and speech recognition

Country Status (7)

Country Link
US (1) US7627473B2
EP (1) EP1647970B1
JP (1) JP5072206B2
KR (1) KR101153078B1
CN (1) CN1760974B
AT (1) ATE487212T1
DE (1) DE602005024497D1

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8751226B2 (en) 2006-06-29 2014-06-10 Nec Corporation Learning a verification model for speech recognition based on extracted recognition and language feature information
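KR100774800B1 (ko) * 2006-09-06 2007-11-07 Information and Communications University Industry-Academic Cooperation Foundation Method and apparatus for segment-level speech/non-speech classification using a Poisson polling technique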
WO2008105263A1 (ja) * 2007-02-28 2008-09-04 Nec Corporation Weight coefficient learning system and speech recognition system
US7509163B1 (en) * 2007-09-28 2009-03-24 International Business Machines Corporation Method and system for subject-adaptive real-time sleep stage classification
KR101230183B1 (ko) * 2008-07-14 2013-02-15 Kwangwoon University Industry-Academic Collaboration Foundation Apparatus for determining the state of an audio signal
US20100076978A1 (en) * 2008-09-09 2010-03-25 Microsoft Corporation Summarizing online forums into question-context-answer triples
US8140328B2 (en) * 2008-12-01 2012-03-20 At&T Intellectual Property I, L.P. User intention based on N-best list of recognition hypotheses for utterances in a dialog
US8306806B2 (en) * 2008-12-02 2012-11-06 Microsoft Corporation Adaptive web mining of bilingual lexicon
US8473430B2 (en) * 2010-01-29 2013-06-25 Microsoft Corporation Deep-structured conditional random fields for sequential labeling and classification
US9355683B2 (en) 2010-07-30 2016-05-31 Samsung Electronics Co., Ltd. Audio playing method and apparatus
US9031844B2 (en) 2010-09-21 2015-05-12 Microsoft Technology Licensing, Llc Full-sequence training of deep structures for speech recognition
US9164983B2 (en) 2011-05-27 2015-10-20 Robert Bosch Gmbh Broad-coverage normalization system for social media language
CN104933048B (zh) * 2014-03-17 2018-08-31 Lenovo (Beijing) Co., Ltd. Voice information processing method, apparatus, and electronic device
US9785891B2 (en) * 2014-12-09 2017-10-10 Conduent Business Services, Llc Multi-task conditional random field models for sequence labeling
CN104700833A (zh) * 2014-12-29 2015-06-10 Wuhu Leruisi Information Consulting Co., Ltd. Big data speech classification method
US9875736B2 (en) 2015-02-19 2018-01-23 Microsoft Technology Licensing, Llc Pre-training and/or transfer learning for sequence taggers
US11030407B2 (en) * 2016-01-28 2021-06-08 Rakuten, Inc. Computer system, method and program for performing multilingual named entity recognition model transfer
US10311863B2 (en) * 2016-09-02 2019-06-04 Disney Enterprises, Inc. Classifying segments of speech based on acoustic features and context
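CN109829164B (zh) * 2019-02-01 2020-05-22 Beijing ByteDance Network Technology Co., Ltd. Method and apparatus for generating text
CN110826320B (zh) * 2019-11-28 2023-10-13 Shanghai Guan'an Information Technology Co., Ltd. Sensitive data discovery method and system based on text recognition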

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3285047B2 (ja) * 1992-09-04 2002-05-27 Nippon Telegraph and Telephone Corporation Speaker-independent speech recognition apparatus
JPH06266389A (ja) * 1993-03-10 1994-09-22 N T T Data Tsushin Kk Phoneme labeling apparatus
JPH0990975A (ja) * 1995-09-22 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> Model training method for pattern recognition
US6505152B1 (en) * 1999-09-03 2003-01-07 Microsoft Corporation Method and apparatus for using formant models in speech systems

Also Published As

Publication number Publication date
ATE487212T1 (de) 2010-11-15
EP1647970B1 (en) 2010-11-03
KR101153078B1 (ko) 2012-06-04
CN1760974B (zh) 2012-04-18
EP1647970A1 (en) 2006-04-19
KR20060050361A (ko) 2006-05-19
DE602005024497D1 (de) 2010-12-16
US20060085190A1 (en) 2006-04-20
JP2006113570A (ja) 2006-04-27
CN1760974A (zh) 2006-04-19
US7627473B2 (en) 2009-12-01

Similar Documents

Publication Publication Date Title
JP5072206B2 (ja) Hidden conditional random field models for phonetic classification and speech recognition
US11664020B2 (en) Speech recognition method and apparatus
US8280733B2 (en) Automatic speech recognition learning using categorization and selective incorporation of user-initiated corrections
JP4195428B2 (ja) Speech recognition using multiple speech features
CN106463113B (zh) Predicting pronunciation in speech recognition
EP1199708B1 (en) Noise robust pattern recognition
JP4528535B2 (ja) Method and apparatus for predicting word error rate from text
EP1575030A1 (en) New-word pronunciation learning using a pronunciation graph
JP4515054B2 (ja) Method of speech recognition and method of decoding a speech signal
US7617104B2 (en) Method of speech recognition using hidden trajectory Hidden Markov Models
JP2004310098A (ja) Method of speech recognition using variational inference with switching state space models
WO2019126881A1 (en) System and method for tone recognition in spoken languages
KR20080018622A (ko) Speech recognition system for a portable terminal
EP1557823A2 (en) Method of setting posterior probability parameters for a switching state space model and method of speech recognition
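CN111816164A (zh) Method and device for speech recognition
CN116994570A (zh) Training method and apparatus for a speech recognition model, and speech recognition method and apparatus
JP2008129527A (ja) Acoustic model generation apparatus, method, program, and recording medium therefor
JP4950600B2 (ja) Acoustic model creation apparatus, speech recognition apparatus using the same, methods therefor, programs therefor, and recording media therefor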

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20080916

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20080916

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20110422

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20110721

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20120210

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20120510

A602 Written permission of extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A602

Effective date: 20120515

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20120608

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20120817

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20120821

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20150831

Year of fee payment: 3

S111 Request for change of ownership or part of ownership

Free format text: JAPANESE INTERMEDIATE CODE: R313113

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

LAPS Cancellation because of no payment of annual fees