DE69726235D1 - Method and apparatus for speech recognition (Verfahren und Vorrichtung zur Spracherkennung) - Google Patents
Method and apparatus for speech recognition
- Publication number
- DE69726235D1
- Authority
- DE
- Germany
- Prior art keywords
- speech recognition
- speech
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Probability & Statistics with Applications (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP8249972A JPH1097276A (ja) | 1996-09-20 | 1996-09-20 | Speech recognition method and apparatus, and storage medium |
JP24997296 | 1996-09-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69726235D1 (de) | 2003-12-24 |
DE69726235T2 (de) | 2004-08-19 |
Family
ID=17200934
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE69726235T Expired - Lifetime DE69726235T2 (de) | 1996-09-20 | 1997-09-18 | Method and apparatus for speech recognition |
Country Status (4)
Country | Link |
---|---|
US (1) | US6108628A (de) |
EP (1) | EP0831456B1 (de) |
JP (1) | JPH1097276A (de) |
DE (1) | DE69726235T2 (de) |
Families Citing this family (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5895447A (en) * | 1996-02-02 | 1999-04-20 | International Business Machines Corporation | Speech recognition using thresholded speaker class model selection or model adaptation |
US6684186B2 (en) * | 1999-01-26 | 2004-01-27 | International Business Machines Corporation | Speaker recognition using a hierarchical speaker model tree |
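JP2001075964A (ja) * | 1999-08-31 | 2001-03-23 | Sony Corp | Information processing apparatus, information processing method, and recording medium |
JP3969908B2 (ja) | 1999-09-14 | 2007-09-05 | Canon Kabushiki Kaisha | Speech input terminal, speech recognition apparatus, speech communication system, and speech communication method |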
US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
US7689416B1 (en) | 1999-09-29 | 2010-03-30 | Poirier Darrell A | System for transferring personalize matter from one computer to another |
US6526379B1 (en) * | 1999-11-29 | 2003-02-25 | Matsushita Electric Industrial Co., Ltd. | Discriminative clustering methods for automatic speech recognition |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | Canon Kabushiki Kaisha | Speech synthesis method and apparatus |
US7039588B2 (en) * | 2000-03-31 | 2006-05-02 | Canon Kabushiki Kaisha | Synthesis unit selection apparatus and method, and storage medium |
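JP4632384B2 (ja) * | 2000-03-31 | 2011-02-16 | Canon Kabushiki Kaisha | Speech information processing apparatus and method, and storage medium |
JP2001282278A (ja) * | 2000-03-31 | 2001-10-12 | Canon Inc | Speech information processing apparatus and method, and storage medium |
JP3728177B2 (ja) | 2000-05-24 | 2005-12-21 | Canon Kabushiki Kaisha | Speech processing system, apparatus, method, and storage medium |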
US7047192B2 (en) * | 2000-06-28 | 2006-05-16 | Poirier Darrell A | Simultaneous multi-user real-time speech recognition system |
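JP2002073072A (ja) * | 2000-08-31 | 2002-03-12 | Sony Corp | Model adaptation apparatus, model adaptation method, recording medium, and pattern recognition apparatus |
JP3774698B2 (ja) * | 2000-10-11 | 2006-05-17 | Canon Kabushiki Kaisha | Information processing apparatus, information processing method, and storage medium |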
US7529666B1 (en) * | 2000-10-30 | 2009-05-05 | International Business Machines Corporation | Minimum bayes error feature selection in speech recognition |
EP1207517B1 (de) * | 2000-11-16 | 2007-01-03 | Sony Deutschland GmbH | Method for speech recognition |
JP2002268681A (ja) * | 2001-03-08 | 2002-09-20 | Canon Inc | Speech recognition system and method, and information processing apparatus and method used in the system |
US7038690B2 (en) * | 2001-03-23 | 2006-05-02 | Microsoft Corporation | Methods and systems for displaying animated graphics on a computing device |
US7239324B2 (en) * | 2001-03-23 | 2007-07-03 | Microsoft Corporation | Methods and systems for merging graphics for display on a computing device |
US20020143540A1 (en) * | 2001-03-28 | 2002-10-03 | Narendranath Malayath | Voice recognition system using implicit speaker adaptation |
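JP3542578B2 (ja) * | 2001-11-22 | 2004-07-14 | Canon Kabushiki Kaisha | Speech recognition apparatus, method therefor, and program |
JP2004012698A (ja) * | 2002-06-05 | 2004-01-15 | Canon Inc | Information processing apparatus and information processing method |
JP4280505B2 (ja) * | 2003-01-20 | 2009-06-17 | Canon Kabushiki Kaisha | Information processing apparatus and information processing method |
JP4587160B2 (ja) * | 2004-03-26 | 2010-11-24 | Canon Kabushiki Kaisha | Signal processing apparatus and method |
JP4541781B2 (ja) * | 2004-06-29 | 2010-09-08 | Canon Kabushiki Kaisha | Speech recognition apparatus and method |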
US20070124148A1 (en) * | 2005-11-28 | 2007-05-31 | Canon Kabushiki Kaisha | Speech processing apparatus and speech processing method |
JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | Honda Motor Co., Ltd. | Speech recognition apparatus, speech recognition method, and speech recognition program |
US8548807B2 (en) * | 2009-06-09 | 2013-10-01 | At&T Intellectual Property I, L.P. | System and method for adapting automatic speech recognition pronunciation by acoustic model restructuring |
US8392189B2 (en) * | 2009-09-28 | 2013-03-05 | Broadcom Corporation | Speech recognition using speech characteristic probabilities |
CN102074236B (zh) * | 2010-11-29 | 2012-06-06 | Tsinghua University | Speaker clustering method for distributed microphones |
US9311914B2 (en) * | 2012-09-03 | 2016-04-12 | Nice-Systems Ltd | Method and apparatus for enhanced phonetic indexing and search |
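CN104143326B (zh) * | 2013-12-03 | 2016-11-02 | Tencent Technology (Shenzhen) Co., Ltd. | Voice command recognition method and apparatus |
CN111613219B (zh) * | 2020-05-15 | 2023-10-27 | Shenzhen Qianhai WeBank Co., Ltd. | Speech data recognition method, device, and medium |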
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4837831A (en) * | 1986-10-15 | 1989-06-06 | Dragon Systems, Inc. | Method for creating and using multiple-word sound models in speech recognition |
US4914703A (en) * | 1986-12-05 | 1990-04-03 | Dragon Systems, Inc. | Method for deriving acoustic models for use in speech recognition |
DE69028072T2 (de) * | 1989-11-06 | 1997-01-09 | Canon Kk | Method and device for speech synthesis |
JPH03150599A (ja) * | 1989-11-07 | 1991-06-26 | Canon Inc | Encoding scheme for Japanese syllables |
US5271088A (en) * | 1991-05-13 | 1993-12-14 | Itt Corporation | Automated sorting of voice messages through speaker spotting |
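JPH04362698A (ja) * | 1991-06-11 | 1992-12-15 | Canon Inc | Speech recognition method and apparatus |
JP3066920B2 (ja) * | 1991-06-11 | 2000-07-17 | Canon Kabushiki Kaisha | Speech recognition method and apparatus |
JP2795058B2 (ja) * | 1992-06-03 | 1998-09-10 | Matsushita Electric Industrial Co., Ltd. | Time-series signal processing apparatus |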
US5598507A (en) * | 1994-04-12 | 1997-01-28 | Xerox Corporation | Method of speaker clustering for unknown speakers in conversational audio data |
US5606643A (en) * | 1994-04-12 | 1997-02-25 | Xerox Corporation | Real-time audio recording system for automatic speaker indexing |
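JP3745403B2 (ja) * | 1994-04-12 | 2006-02-15 | Xerox Corporation | Method for clustering audio data segments |
JP2871561B2 (ja) * | 1995-11-30 | 1999-03-17 | ATR Interpreting Telecommunications Research Laboratories | Speaker-independent model generation apparatus and speech recognition apparatus |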
US5787394A (en) * | 1995-12-13 | 1998-07-28 | International Business Machines Corporation | State-dependent speaker clustering for speaker adaptation |
1996
- 1996-09-20 JP JP8249972A (JPH1097276A), not active: Withdrawn
1997
- 1997-09-16 US US08/931,527 (US6108628A), not active: Expired - Lifetime
- 1997-09-18 DE DE69726235T (DE69726235T2), not active: Expired - Lifetime
- 1997-09-18 EP EP97307276A (EP0831456B1), not active: Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
JPH1097276A (ja) | 1998-04-14 |
EP0831456A2 (de) | 1998-03-25 |
US6108628A (en) | 2000-08-22 |
EP0831456A3 (de) | 1998-10-14 |
DE69726235T2 (de) | 2004-08-19 |
EP0831456B1 (de) | 2003-11-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69717899D1 (de) | | Method and apparatus for speech recognition |
DE69726235D1 (de) | | Method and apparatus for speech recognition |
DE69828141D1 (de) | | Method and apparatus for speech recognition |
DE69518705D1 (de) | | Method and apparatus for speech recognition |
DE69524829D1 (de) | | Method and apparatus for speech recognition |
DE69806557D1 (de) | | Method and apparatus for speech recognition |
DE59707384D1 (de) | | Method and apparatus for speech recognition |
DE69923253D1 (de) | | Method and apparatus for speech recognition |
DE69625950D1 (de) | | Method and apparatus for speech recognition, and translation system |
DE69830017D1 (de) | | Method and apparatus for speech recognition |
DE69324629D1 (de) | | Method and apparatus for speech recognition |
DE69730930D1 (de) | | Method and device for character recognition |
DE69727895D1 (de) | | Method and apparatus for speech coding |
DE69831991D1 (de) | | Method and apparatus for speech detection |
DE69631728D1 (de) | | Method and apparatus for speech coding |
DE69519840D1 (de) | | Device and method for speech recognition |
DE69725106D1 (de) | | Method and apparatus for speech recognition with noise adaptation |
DE69732156D1 (de) | | Method and device for character recognition |
DE69707876D1 (de) | | Method and apparatus for dynamically adjusted training for speech recognition |
DE69420400D1 (de) | | Method and device for speaker recognition |
DE69432943D1 (de) | | Method and apparatus for speech detection |
DE69428475D1 (de) | | Method and device for automatic speech recognition |
DE69632901D1 (de) | | Apparatus and method for speech synthesis |
DE69519820D1 (de) | | Method and apparatus for speech synthesis |
DE69726685D1 (de) | | Method for speech analysis, and method and apparatus for speech coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |