DE602005024497D1 - Verstekte bedingte Zufallfeldermodelle für phonetische Klassifizierung und Spracherkennung - Google Patents

Verstekte bedingte Zufallfeldermodelle für phonetische Klassifizierung und Spracherkennung

Info

Publication number
DE602005024497D1
DE602005024497D1 DE602005024497T DE602005024497T DE602005024497D1 DE 602005024497 D1 DE602005024497 D1 DE 602005024497D1 DE 602005024497 T DE602005024497 T DE 602005024497T DE 602005024497 T DE602005024497 T DE 602005024497T DE 602005024497 D1 DE602005024497 D1 DE 602005024497D1
Authority
DE
Germany
Prior art keywords
random field
conditional random
speech recognition
field models
phonetic classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
DE602005024497T
Other languages
German (de)
English (en)
Inventor
Alejandro Acero
Asela J Gunawardana
Milind V Mahajan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of DE602005024497D1 publication Critical patent/DE602005024497D1/de
Anticipated expiration legal-status Critical
Active legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Document Processing Apparatus (AREA)
DE602005024497T 2004-10-15 2005-09-27 Verstekte bedingte Zufallfeldermodelle für phonetische Klassifizierung und Spracherkennung Active DE602005024497D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/966,047 US7627473B2 (en) 2004-10-15 2004-10-15 Hidden conditional random field models for phonetic classification and speech recognition

Publications (1)

Publication Number Publication Date
DE602005024497D1 true DE602005024497D1 (de) 2010-12-16

Family

ID=35520793

Family Applications (1)

Application Number Title Priority Date Filing Date
DE602005024497T Active DE602005024497D1 (de) 2004-10-15 2005-09-27 Verstekte bedingte Zufallfeldermodelle für phonetische Klassifizierung und Spracherkennung

Country Status (7)

Country Link
US (1) US7627473B2 (enExample)
EP (1) EP1647970B1 (enExample)
JP (1) JP5072206B2 (enExample)
KR (1) KR101153078B1 (enExample)
CN (1) CN1760974B (enExample)
AT (1) ATE487212T1 (enExample)
DE (1) DE602005024497D1 (enExample)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5223673B2 (ja) 2006-06-29 2013-06-26 日本電気株式会社 音声処理装置およびプログラム、並びに、音声処理方法
KR100774800B1 (ko) * 2006-09-06 2007-11-07 한국정보통신대학교 산학협력단 포아송 폴링 기법을 이용한 세그먼트 단위의 음성/비음성분류 방법 및 장치
EP2133868A4 (en) * 2007-02-28 2013-01-16 Nec Corp WEIGHT COEFFICIENT LEARNING SYSTEM AND AUDIO RECOGNITION SYSTEM
US7509163B1 (en) * 2007-09-28 2009-03-24 International Business Machines Corporation Method and system for subject-adaptive real-time sleep stage classification
KR101230183B1 (ko) * 2008-07-14 2013-02-15 광운대학교 산학협력단 오디오 신호의 상태결정 장치
US20100076978A1 (en) * 2008-09-09 2010-03-25 Microsoft Corporation Summarizing online forums into question-context-answer triples
US8140328B2 (en) * 2008-12-01 2012-03-20 At&T Intellectual Property I, L.P. User intention based on N-best list of recognition hypotheses for utterances in a dialog
US8306806B2 (en) * 2008-12-02 2012-11-06 Microsoft Corporation Adaptive web mining of bilingual lexicon
US8473430B2 (en) * 2010-01-29 2013-06-25 Microsoft Corporation Deep-structured conditional random fields for sequential labeling and classification
US9355683B2 (en) 2010-07-30 2016-05-31 Samsung Electronics Co., Ltd. Audio playing method and apparatus
US9031844B2 (en) 2010-09-21 2015-05-12 Microsoft Technology Licensing, Llc Full-sequence training of deep structures for speech recognition
US9164983B2 (en) 2011-05-27 2015-10-20 Robert Bosch Gmbh Broad-coverage normalization system for social media language
CN104933048B (zh) * 2014-03-17 2018-08-31 联想(北京)有限公司 一种语音信息处理方法、装置和电子设备
US9785891B2 (en) * 2014-12-09 2017-10-10 Conduent Business Services, Llc Multi-task conditional random field models for sequence labeling
CN104700833A (zh) * 2014-12-29 2015-06-10 芜湖乐锐思信息咨询有限公司 一种大数据语音分类方法
US9875736B2 (en) 2015-02-19 2018-01-23 Microsoft Technology Licensing, Llc Pre-training and/or transfer learning for sequence taggers
WO2017130434A1 (ja) * 2016-01-28 2017-08-03 楽天株式会社 多言語の固有表現認識モデルの転移を行うコンピュータシステム、方法、およびプログラム
US10311863B2 (en) * 2016-09-02 2019-06-04 Disney Enterprises, Inc. Classifying segments of speech based on acoustic features and context
CN109829164B (zh) * 2019-02-01 2020-05-22 北京字节跳动网络技术有限公司 用于生成文本的方法和装置
CN110826320B (zh) * 2019-11-28 2023-10-13 上海观安信息技术股份有限公司 一种基于文本识别的敏感数据发现方法及系统

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3285047B2 (ja) * 1992-09-04 2002-05-27 日本電信電話株式会社 不特定話者用音声認識装置
JPH06266389A (ja) * 1993-03-10 1994-09-22 N T T Data Tsushin Kk 音素ラベリング装置
JPH0990975A (ja) * 1995-09-22 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> パターン認識のためのモデル学習方法
US6505152B1 (en) * 1999-09-03 2003-01-07 Microsoft Corporation Method and apparatus for using formant models in speech systems

Also Published As

Publication number Publication date
JP5072206B2 (ja) 2012-11-14
ATE487212T1 (de) 2010-11-15
EP1647970B1 (en) 2010-11-03
EP1647970A1 (en) 2006-04-19
JP2006113570A (ja) 2006-04-27
CN1760974B (zh) 2012-04-18
US20060085190A1 (en) 2006-04-20
CN1760974A (zh) 2006-04-19
KR20060050361A (ko) 2006-05-19
US7627473B2 (en) 2009-12-01
KR101153078B1 (ko) 2012-06-04

Similar Documents

Publication Publication Date Title
DE602005024497D1 (de) Verstekte bedingte Zufallfeldermodelle für phonetische Klassifizierung und Spracherkennung
ATE297588T1 (de) Anpassung des phonetischen kontextes zur verbesserung der spracherkennung
DE602005018552D1 (de) Verfahren zum anpassen eines neuronalen netzwerks einer automatischen spracherkennungseinrichtung
ATE417346T1 (de) Spracherkennungs- und korrektursystem, korrekturvorrichtung und verfahren zur erstellung eines lexikons von alternativen
EP1696421A3 (en) Learning in automatic speech recognition
DE602005027770D1 (de) Generierung von grossen Graphonem-Einheiten mit Kriterium gegenseitiger Information für die Sprachsynthese
WO2004100638A3 (en) Source-dependent text-to-speech system
WO2007056344A3 (en) Techiques for model optimization for statistical pattern recognition
WO2009114499A3 (en) Methods and devices for language skill development
WO2008073850A3 (en) Method and apparatus for reading education
EP1705645A3 (en) Apparatus and method for analysis of language model changes
DE602004030635D1 (de) Regelbasierte Grammatik für Slots und statistisches Modell für Preterminale in einem System zum Verstehen natürlicher Sprache
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
WO2007051106A3 (en) Semantic processor for recognition of cause-effect relations in natural language documents
ATE406073T1 (de) Verfahren zum nachtrainieren und betreiben eines hörgeräts und entsprechendes hörgerät
WO2009025356A1 (ja) 音声認識装置および音声認識方法
WO2011133766A3 (en) Methods and systems for training dictation-based speech-to-text systems using recorded samples
GB2443753A (en) Spoken language proficiency assessment by computer
WO2015057907A3 (en) System and method for learning alternate pronunciations for speech recognition
WO2008087934A1 (ja) 拡張認識辞書学習装置と音声認識システム
ATE457510T1 (de) Spracherkennungssystem mit riesigem vokabular
ATE531031T1 (de) Segmentbasierte tonale modellierung für tonale sprachen
ATE401644T1 (de) Verfahren zur spracherkennung
FI20010792A7 (fi) Käyttäjäriippumattoman puheentunnistuksen järjestäminen
ATE445896T1 (de) Spracherkennungsverfahren das variationsinferenz mit veränderlichen zustandsraummodellen benuzt