DE602004014416D1 - Spracherkennung durch kontextuelle modellierung der spracheinheiten - Google Patents
Spracherkennung durch kontextuelle modellierung der spracheinheitenInfo
- Publication number
- DE602004014416D1 DE602004014416D1 DE602004014416T DE602004014416T DE602004014416D1 DE 602004014416 D1 DE602004014416 D1 DE 602004014416D1 DE 602004014416 T DE602004014416 T DE 602004014416T DE 602004014416 T DE602004014416 T DE 602004014416T DE 602004014416 D1 DE602004014416 D1 DE 602004014416D1
- Authority
- DE
- Germany
- Prior art keywords
- units
- language
- acoustic
- voice units
- states
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000001419 dependent effect Effects 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/022—Demisyllables, biphones or triphones being the recognition units
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/FR2004/000972 WO2005112000A1 (fr) | 2004-04-20 | 2004-04-20 | Procede et systeme de reconnaissance vocale par modelisation contextuelle d’unites vocales |
Publications (1)
Publication Number | Publication Date |
---|---|
DE602004014416D1 true DE602004014416D1 (de) | 2008-07-24 |
Family
ID=34958050
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE602004014416T Expired - Lifetime DE602004014416D1 (de) | 2004-04-20 | 2004-04-20 | Spracherkennung durch kontextuelle modellierung der spracheinheiten |
Country Status (5)
Country | Link |
---|---|
US (1) | US7818172B2 (de) |
EP (1) | EP1741092B1 (de) |
AT (1) | ATE398324T1 (de) |
DE (1) | DE602004014416D1 (de) |
WO (1) | WO2005112000A1 (de) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8160878B2 (en) * | 2008-09-16 | 2012-04-17 | Microsoft Corporation | Piecewise-based variable-parameter Hidden Markov Models and the training thereof |
US8145488B2 (en) * | 2008-09-16 | 2012-03-27 | Microsoft Corporation | Parameter clustering and sharing for variable-parameter hidden markov models |
KR101625304B1 (ko) * | 2014-11-18 | 2016-05-27 | 경희대학교 산학협력단 | 음향 정보에 기초한 사용자 다수 행위 인식 방법 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0782348B2 (ja) * | 1992-03-21 | 1995-09-06 | 株式会社エイ・ティ・アール自動翻訳電話研究所 | 音声認識用サブワードモデル生成方法 |
GB9223066D0 (en) * | 1992-11-04 | 1992-12-16 | Secr Defence | Children's speech training aid |
US5737490A (en) * | 1993-09-30 | 1998-04-07 | Apple Computer, Inc. | Method and apparatus for constructing continuous parameter fenonic hidden markov models by replacing phonetic models with continous fenonic models |
US5794197A (en) * | 1994-01-21 | 1998-08-11 | Micrsoft Corporation | Senone tree representation and evaluation |
JP3581401B2 (ja) * | 1994-10-07 | 2004-10-27 | キヤノン株式会社 | 音声認識方法 |
US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
US5806030A (en) * | 1996-05-06 | 1998-09-08 | Matsushita Electric Ind Co Ltd | Low complexity, high accuracy clustering method for speech recognizer |
US20060074664A1 (en) * | 2000-01-10 | 2006-04-06 | Lam Kwok L | System and method for utterance verification of chinese long and short keywords |
JP2002366187A (ja) * | 2001-06-08 | 2002-12-20 | Sony Corp | 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体 |
-
2004
- 2004-04-20 US US11/587,136 patent/US7818172B2/en not_active Expired - Fee Related
- 2004-04-20 EP EP04742550A patent/EP1741092B1/de not_active Expired - Lifetime
- 2004-04-20 AT AT04742550T patent/ATE398324T1/de not_active IP Right Cessation
- 2004-04-20 DE DE602004014416T patent/DE602004014416D1/de not_active Expired - Lifetime
- 2004-04-20 WO PCT/FR2004/000972 patent/WO2005112000A1/fr active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
EP1741092B1 (de) | 2008-06-11 |
EP1741092A1 (de) | 2007-01-10 |
US20070271096A1 (en) | 2007-11-22 |
WO2005112000A1 (fr) | 2005-11-24 |
ATE398324T1 (de) | 2008-07-15 |
US7818172B2 (en) | 2010-10-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10074363B2 (en) | Method and apparatus for keyword speech recognition | |
CN104036774B (zh) | 藏语方言识别方法及系统 | |
Tao et al. | Exploring deep learning architectures for automatically grading non-native spontaneous speech | |
Bell et al. | Prosodic adaptation in human-computer interaction | |
ATE419616T1 (de) | Verfahren, einrichtung und computerprogramm zur spracherkennung | |
JP7070894B2 (ja) | 時系列情報の学習システム、方法およびニューラルネットワークモデル | |
RU2432623C2 (ru) | Способ и устройство для естественно-речевого распознавания речевого высказывания | |
ATE457510T1 (de) | Spracherkennungssystem mit riesigem vokabular | |
EP1233406A1 (de) | Angepasste Spracherkennung für ausländische Sprecher | |
CN104575490A (zh) | 基于深度神经网络后验概率算法的口语发音评测方法 | |
CN107195296A (zh) | 一种语音识别方法、装置、终端及系统 | |
CN105869644A (zh) | 基于深度学习的声纹认证方法和装置 | |
WO2008087934A1 (ja) | 拡張認識辞書学習装置と音声認識システム | |
CN105096941A (zh) | 语音识别方法以及装置 | |
CN104978963A (zh) | 语音识别装置、方法以及电子设备 | |
CN105590625A (zh) | 声学模型自适应方法及系统 | |
ATE403213T1 (de) | System und verfahren zur automatischen spracherkennung | |
CN107871496B (zh) | 语音识别方法和装置 | |
EP1507255A3 (de) | Blasteilung für kompakte akustische Modelle | |
DE60134395D1 (de) | Diskriminatives Trainieren von Hidden Markov Modellen für die Erkennung fliessender Sprache | |
JP2019219574A (ja) | 話者モデル作成システム、認識システム、プログラムおよび制御装置 | |
US20180308501A1 (en) | Multi speaker attribution using personal grammar detection | |
ATE349750T1 (de) | Verfahren zur beschleunigung der durchführung von spracherkennung mit neuralen netzwerken, sowie entsprechende vorrichtung | |
KR20220090171A (ko) | 음성 인식 장치, 프로그램 및 그것의 학습 제어 방법 | |
ATE353156T1 (de) | Verfolgen von vokaltraktresonanzen unter verwendung einer zielgeführten einschränkung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |