HK1058428A1 - Combining dtw and hmm in speaker dependent and independent modes for speech recognition - Google Patents
Combining dtw and hmm in speaker dependent and independent modes for speech recognitionInfo
- Publication number
- HK1058428A1 HK1058428A1 HK04101178A HK04101178A HK1058428A1 HK 1058428 A1 HK1058428 A1 HK 1058428A1 HK 04101178 A HK04101178 A HK 04101178A HK 04101178 A HK04101178 A HK 04101178A HK 1058428 A1 HK1058428 A1 HK 1058428A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- dtw
- hmm
- combining
- speech recognition
- engines
- Prior art date
Links
- 230000001419 dependent effect Effects 0.000 title abstract 2
- 238000013507 mapping Methods 0.000 abstract 1
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/32—Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/12—Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Electrically Operated Instructional Devices (AREA)
- Image Analysis (AREA)
- Selective Calling Equipment (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Electric Clocks (AREA)
- Toys (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/657,760 US6754629B1 (en) | 2000-09-08 | 2000-09-08 | System and method for automatic voice recognition using mapping |
PCT/US2001/027625 WO2002021513A1 (fr) | 2000-09-08 | 2001-09-05 | Systeme et procede de reconnaissance automatique de la voix par mappage |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1058428A1 true HK1058428A1 (en) | 2004-05-14 |
Family
ID=24638560
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK04101178A HK1058428A1 (en) | 2000-09-08 | 2004-02-19 | Combining dtw and hmm in speaker dependent and independent modes for speech recognition |
Country Status (13)
Country | Link |
---|---|
US (1) | US6754629B1 (fr) |
EP (1) | EP1316086B1 (fr) |
JP (1) | JP2004518155A (fr) |
KR (1) | KR100901092B1 (fr) |
CN (1) | CN1238836C (fr) |
AT (1) | ATE344959T1 (fr) |
AU (1) | AU2001288808A1 (fr) |
BR (1) | BR0113725A (fr) |
DE (1) | DE60124408T2 (fr) |
ES (1) | ES2273885T3 (fr) |
HK (1) | HK1058428A1 (fr) |
TW (1) | TW548630B (fr) |
WO (1) | WO2002021513A1 (fr) |
Families Citing this family (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE328345T1 (de) * | 2000-09-19 | 2006-06-15 | Thomson Licensing | Sprachsteuerung von elektronischen geräten |
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system |
US20020143540A1 (en) * | 2001-03-28 | 2002-10-03 | Narendranath Malayath | Voice recognition system using implicit speaker adaptation |
US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
US7203643B2 (en) * | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems |
US20040138885A1 (en) * | 2003-01-09 | 2004-07-15 | Xiaofan Lin | Commercial automatic speech recognition engine combinations |
DE10334400A1 (de) | 2003-07-28 | 2005-02-24 | Siemens Ag | Verfahren zur Spracherkennung und Kommunikationsgerät |
KR100571574B1 (ko) * | 2004-07-26 | 2006-04-17 | 한양대학교 산학협력단 | 비선형 분석을 이용한 유사화자 인식방법 및 그 시스템 |
KR100693284B1 (ko) * | 2005-04-14 | 2007-03-13 | 학교법인 포항공과대학교 | 음성 인식 장치 |
US20070225970A1 (en) * | 2006-03-21 | 2007-09-27 | Kady Mark A | Multi-context voice recognition system for long item list searches |
US8532984B2 (en) | 2006-07-31 | 2013-09-10 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
GB0616070D0 (en) * | 2006-08-12 | 2006-09-20 | Ibm | Speech Recognition Feedback |
US8239190B2 (en) | 2006-08-22 | 2012-08-07 | Qualcomm Incorporated | Time-warping frames of wideband vocoder |
US7881928B2 (en) * | 2006-09-01 | 2011-02-01 | International Business Machines Corporation | Enhanced linguistic transformation |
CN101256769B (zh) * | 2008-03-21 | 2011-06-15 | 深圳市汉音科技有限公司 | 语音识别装置及其方法 |
US9659559B2 (en) * | 2009-06-25 | 2017-05-23 | Adacel Systems, Inc. | Phonetic distance measurement system and related methods |
US9192773B2 (en) * | 2009-07-17 | 2015-11-24 | Peter Forsell | System for voice control of a medical implant |
KR101066472B1 (ko) * | 2009-09-15 | 2011-09-21 | 국민대학교산학협력단 | 초성 기반 음성인식장치 및 음성인식방법 |
CN102651218A (zh) * | 2011-02-25 | 2012-08-29 | 株式会社东芝 | 用于创建语音标签的方法以及设备 |
KR101255141B1 (ko) * | 2011-08-11 | 2013-04-22 | 주식회사 씨에스 | 거절율을 확보하고 오인식을 줄이는 실시간 음성 인식 방법 |
US9767793B2 (en) | 2012-06-08 | 2017-09-19 | Nvoq Incorporated | Apparatus and methods using a pattern matching speech recognition engine to train a natural language speech recognition engine |
WO2014068788A1 (fr) * | 2012-11-05 | 2014-05-08 | 三菱電機株式会社 | Dispositif de reconnaissance de parole |
CN103065627B (zh) * | 2012-12-17 | 2015-07-29 | 中南大学 | 基于dtw与hmm证据融合的特种车鸣笛声识别方法 |
CN105027198B (zh) * | 2013-02-25 | 2018-11-20 | 三菱电机株式会社 | 语音识别系统以及语音识别装置 |
CN104143330A (zh) * | 2013-05-07 | 2014-11-12 | 佳能株式会社 | 语音识别方法和语音识别系统 |
US9390708B1 (en) * | 2013-05-28 | 2016-07-12 | Amazon Technologies, Inc. | Low latency and memory efficient keywork spotting |
TWI506458B (zh) * | 2013-12-24 | 2015-11-01 | Ind Tech Res Inst | 辨識網路產生裝置及其方法 |
CN104103272B (zh) * | 2014-07-15 | 2017-10-10 | 无锡中感微电子股份有限公司 | 语音识别方法、装置和蓝牙耳机 |
EP3065131B1 (fr) | 2015-03-06 | 2020-05-20 | ZETES Industries S.A. | Méthode et système de post-traitement d'un résultat de reconnaissance vocale |
EP3065133A1 (fr) | 2015-03-06 | 2016-09-07 | ZETES Industries S.A. | Méthode et système pour générer une solution optimisée en reconnaissance vocale |
EP3065132A1 (fr) | 2015-03-06 | 2016-09-07 | ZETES Industries S.A. | Méthode et système de détermination de validité d'un élément d'un résultat de reconnaissance vocale |
US10170110B2 (en) * | 2016-11-17 | 2019-01-01 | Robert Bosch Gmbh | System and method for ranking of hybrid speech recognition results with neural networks |
US10360914B2 (en) | 2017-01-26 | 2019-07-23 | Essence, Inc | Speech recognition based on context and multiple recognition engines |
US10861450B2 (en) | 2017-02-10 | 2020-12-08 | Samsung Electronics Co., Ltd. | Method and apparatus for managing voice-based interaction in internet of things network system |
CN107039037A (zh) * | 2017-04-21 | 2017-08-11 | 南京邮电大学 | 一种基于dtw的孤立词语音识别方法 |
CN109767758B (zh) * | 2019-01-11 | 2021-06-08 | 中山大学 | 车载语音分析方法、系统、存储介质以及设备 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4831551A (en) * | 1983-01-28 | 1989-05-16 | Texas Instruments Incorporated | Speaker-dependent connected speech word recognizer |
US4763278A (en) * | 1983-04-13 | 1988-08-09 | Texas Instruments Incorporated | Speaker-independent word recognizer |
US4783804A (en) * | 1985-03-21 | 1988-11-08 | American Telephone And Telegraph Company, At&T Bell Laboratories | Hidden Markov model speech recognition arrangement |
US5073939A (en) * | 1989-06-08 | 1991-12-17 | Itt Corporation | Dynamic time warping (DTW) apparatus for use in speech recognition systems |
WO1996008005A1 (fr) | 1994-09-07 | 1996-03-14 | Motorola Inc. | Systeme pour reconnaitre des sons prononces dans un discours continu et procede pour sa mise en ×uvre |
US5717826A (en) * | 1995-08-11 | 1998-02-10 | Lucent Technologies Inc. | Utterance verification using word based minimum verification error training for recognizing a keyboard string |
US5754978A (en) * | 1995-10-27 | 1998-05-19 | Speech Systems Of Colorado, Inc. | Speech recognition system |
US6272455B1 (en) * | 1997-10-22 | 2001-08-07 | Lucent Technologies, Inc. | Method and apparatus for understanding natural language |
US6125341A (en) * | 1997-12-19 | 2000-09-26 | Nortel Networks Corporation | Speech recognition system and method |
US6321195B1 (en) * | 1998-04-28 | 2001-11-20 | Lg Electronics Inc. | Speech recognition method |
ITTO980383A1 (it) | 1998-05-07 | 1999-11-07 | Cselt Centro Studi Lab Telecom | Procedimento e dispositivo di riconoscimento vocale con doppio passo di riconoscimento neurale e markoviano. |
US6275800B1 (en) * | 1999-02-23 | 2001-08-14 | Motorola, Inc. | Voice recognition system and method |
US6526380B1 (en) * | 1999-03-26 | 2003-02-25 | Koninklijke Philips Electronics N.V. | Speech recognition system having parallel large vocabulary recognition engines |
US6671669B1 (en) | 2000-07-18 | 2003-12-30 | Qualcomm Incorporated | combined engine system and method for voice recognition |
-
2000
- 2000-09-08 US US09/657,760 patent/US6754629B1/en not_active Expired - Lifetime
-
2001
- 2001-09-05 AU AU2001288808A patent/AU2001288808A1/en not_active Abandoned
- 2001-09-05 EP EP01968568A patent/EP1316086B1/fr not_active Expired - Lifetime
- 2001-09-05 BR BR0113725-5A patent/BR0113725A/pt not_active IP Right Cessation
- 2001-09-05 WO PCT/US2001/027625 patent/WO2002021513A1/fr active IP Right Grant
- 2001-09-05 AT AT01968568T patent/ATE344959T1/de not_active IP Right Cessation
- 2001-09-05 CN CNB018153631A patent/CN1238836C/zh not_active Expired - Fee Related
- 2001-09-05 DE DE60124408T patent/DE60124408T2/de not_active Expired - Lifetime
- 2001-09-05 ES ES01968568T patent/ES2273885T3/es not_active Expired - Lifetime
- 2001-09-05 JP JP2002525645A patent/JP2004518155A/ja active Pending
- 2001-09-05 KR KR1020037003316A patent/KR100901092B1/ko not_active IP Right Cessation
- 2001-09-07 TW TW090122242A patent/TW548630B/zh active
-
2004
- 2004-02-19 HK HK04101178A patent/HK1058428A1/xx not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
DE60124408T2 (de) | 2007-09-06 |
KR20030061797A (ko) | 2003-07-22 |
EP1316086A1 (fr) | 2003-06-04 |
EP1316086B1 (fr) | 2006-11-08 |
ES2273885T3 (es) | 2007-05-16 |
BR0113725A (pt) | 2004-08-17 |
AU2001288808A1 (en) | 2002-03-22 |
US6754629B1 (en) | 2004-06-22 |
KR100901092B1 (ko) | 2009-06-08 |
CN1454381A (zh) | 2003-11-05 |
ATE344959T1 (de) | 2006-11-15 |
CN1238836C (zh) | 2006-01-25 |
TW548630B (en) | 2003-08-21 |
JP2004518155A (ja) | 2004-06-17 |
WO2002021513A8 (fr) | 2002-06-20 |
WO2002021513A1 (fr) | 2002-03-14 |
DE60124408D1 (de) | 2006-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1058428A1 (en) | Combining dtw and hmm in speaker dependent and independent modes for speech recognition | |
AU2001275991A1 (en) | System and method for voice recognition with a plurality of voice recognition engines | |
MX9703138A (es) | Reconocimiento de lenguaje. | |
EP1901282A3 (fr) | Système de communication vocale pour véhicule et procédé de fonctionnement d'un système de communication vocale pour véhicule | |
DE60325826D1 (de) | Audiovisuelle sprachaktivitätsdetektion für ein spracherkennungssystem | |
HK1062738A1 (en) | Apparation and method for performing voice recognition using acoustic feature vector modification | |
WO2004100638A3 (fr) | Systeme de synthese vocale a partir du texte, dependant de la source | |
WO2006023631A3 (fr) | Adaptation d'un systeme de transcription de documents | |
IL146985A0 (en) | Automatic dynamic speech recognition vocabulary based on external sources of information | |
TW200601263A (en) | Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition | |
EP1629464A4 (fr) | Systeme et procede de reconnaissance vocale fondes sur la phonetique | |
WO2002097590A3 (fr) | Systeme de gestion des informations a commande vocale et independant du langage | |
WO2003036617A1 (fr) | Appareil de reconnaissance vocale et procede de reconnaissance de la parole | |
GB0207343D0 (en) | Signal processing system | |
DE60004331D1 (de) | Sprecher-erkennung | |
GB2366434A (en) | Selective speaker adaption for an in-vehicle speech recognition system | |
WO2007117814A3 (fr) | Perturbation de signaux vocaux à des fins de reconnaissance vocale | |
DE59904741D1 (de) | Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner | |
EP0949606A3 (fr) | Procédé et dispositif de reconnaissance de la parole utilisant des transcriptions phonétiques | |
ATE261607T1 (de) | Sprachgesteuertes tragbares endgerät | |
WO2003098373A3 (fr) | Authentification vocale | |
Heracleous et al. | Unvoiced speech recognition using tissue-conductive acoustic sensor | |
KR20020049061A (ko) | 음성 변환 방법 | |
WO2006034152A3 (fr) | Entrainement discriminatif d'un systeme de transcription de documents | |
AU2000276394A1 (en) | Method and system for generating and searching an optimal maximum likelihood decision tree for hidden markov model (hmm) based speech recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) |
Effective date: 20110905 |