ATE398324T1 - Voice recognition based on the contextual modelling of voice units

Info

Publication number
ATE398324T1
Authority
AT
Austria
Prior art keywords
units
language
acoustic
voice units
states
Prior art date
Application number
AT04742550T
Other languages
English (en)
Inventor
Ronaldo Messina
Denis Jouvet
Original Assignee
France Telecom
Priority date
2004-04-20
Filing date
2004-04-20
Publication date
2008-07-15
Application filed by France Telecom
Application granted
Publication of ATE398324T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/022 Demisyllables, biphones or triphones being the recognition units

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
AT04742550T 2004-04-20 2004-04-20 Voice recognition based on the contextual modelling of voice units ATE398324T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/FR2004/000972 WO2005112000A1 (fr) 2004-04-20 2004-04-20 Voice recognition method and system based on the contextual modelling of voice units

Publications (1)

Publication Number Publication Date
ATE398324T1 (de) 2008-07-15

Family

ID=34958050

Family Applications (1)

Application Number Priority Date Filing Date Title
AT04742550T ATE398324T1 (de) 2004-04-20 2004-04-20 Voice recognition based on the contextual modelling of voice units

Country Status (5)

Country Link
US (1) US7818172B2 (de)
EP (1) EP1741092B1 (de)
AT (1) ATE398324T1 (de)
DE (1) DE602004014416D1 (de)
WO (1) WO2005112000A1 (de)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8160878B2 (en) * 2008-09-16 2012-04-17 Microsoft Corporation Piecewise-based variable-parameter Hidden Markov Models and the training thereof
US8145488B2 (en) * 2008-09-16 2012-03-27 Microsoft Corporation Parameter clustering and sharing for variable-parameter hidden markov models
KR101625304B1 (ko) * 2014-11-18 2016-05-27 경희대학교 산학협력단 Method for recognizing multiple user activities based on acoustic information

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0782348B2 (ja) * 1992-03-21 1995-09-06 株式会社エイ・ティ・アール自動翻訳電話研究所 Subword model generation method for speech recognition
GB9223066D0 (en) * 1992-11-04 1992-12-16 Secr Defence Children's speech training aid
US5737490A (en) * 1993-09-30 1998-04-07 Apple Computer, Inc. Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continuous fenonic models
US5794197A (en) * 1994-01-21 1998-08-11 Microsoft Corporation Senone tree representation and evaluation
JP3581401B2 (ja) * 1994-10-07 2004-10-27 キヤノン株式会社 Speech recognition method
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Ind Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US20060074664A1 (en) * 2000-01-10 2006-04-06 Lam Kwok L System and method for utterance verification of chinese long and short keywords
JP2002366187A (ja) * 2001-06-08 2002-12-20 Sony Corp Speech recognition apparatus, speech recognition method, program, and recording medium

Also Published As

Publication number Publication date
EP1741092B1 (de) 2008-06-11
US7818172B2 (en) 2010-10-19
US20070271096A1 (en) 2007-11-22
DE602004014416D1 (de) 2008-07-24
EP1741092A1 (de) 2007-01-10
WO2005112000A1 (fr) 2005-11-24

Similar Documents

Publication Publication Date Title
US10074363B2 (en) Method and apparatus for keyword speech recognition
CN104036774B (zh) Tibetan dialect recognition method and system
Tao et al. Exploring deep learning architectures for automatically grading non-native spontaneous speech
Bell et al. Prosodic adaptation in human-computer interaction
JP7070894B2 (ja) Time-series information learning system, method, and neural network model
ATE419616T1 (de) Method, device and computer program for speech recognition
RU2432623C2 (ru) Method and device for natural-language recognition of a spoken utterance
ATE457510T1 (de) Speech recognition system with a huge vocabulary
EP1233406A1 (de) Speech recognition adapted for foreign speakers
CN104575490A (zh) Spoken pronunciation evaluation method based on a deep neural network posterior probability algorithm
CN105869644A (zh) Voiceprint authentication method and device based on deep learning
CN105096941A (zh) Speech recognition method and device
CN104978963A (zh) Speech recognition device, method, and electronic equipment
WO2008087934A1 (ja) Extended recognition dictionary learning device and speech recognition system
CN105590625A (zh) Acoustic model adaptation method and system
ATE403213T1 (de) System and method for automatic speech recognition
CN107871496B (zh) Speech recognition method and device
EP1507255A3 (de) Bubble splitting for compact acoustic models
DE60134395D1 (de) Discriminative training of hidden Markov models for continuous speech recognition
US20180308501A1 (en) Multi speaker attribution using personal grammar detection
ATE349750T1 (de) Method for accelerating the execution of speech recognition with neural networks, and corresponding device
CN107274890A (zh) Voiceprint spectrum extraction method and device
ATE353156T1 (de) Tracking vocal tract resonances using a target-guided constraint
ATE441918T1 (de) Voice dialogue method and system
ATE398324T1 (de) Voice recognition based on the contextual modelling of voice units

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties