JP4541781B2 - 音声認識装置および方法 - Google Patents
音声認識装置および方法 Download PDFInfo
- Publication number
- JP4541781B2 JP4541781B2 JP2004191460A JP2004191460A JP4541781B2 JP 4541781 B2 JP4541781 B2 JP 4541781B2 JP 2004191460 A JP2004191460 A JP 2004191460A JP 2004191460 A JP2004191460 A JP 2004191460A JP 4541781 B2 JP4541781 B2 JP 4541781B2
- Authority
- JP
- Japan
- Prior art keywords
- acoustic model
- likelihood
- state
- speech
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004191460A JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | 音声認識装置および方法 |
| US11/165,167 US7565290B2 (en) | 2004-06-29 | 2005-06-24 | Speech recognition method and apparatus |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004191460A JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | 音声認識装置および方法 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2006011257A JP2006011257A (ja) | 2006-01-12 |
| JP2006011257A5 JP2006011257A5 (https=) | 2010-02-25 |
| JP4541781B2 true JP4541781B2 (ja) | 2010-09-08 |
Family
ID=35507163
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2004191460A Expired - Fee Related JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | 音声認識装置および方法 |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US7565290B2 (https=) |
| JP (1) | JP4541781B2 (https=) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2007142840A (ja) * | 2005-11-18 | 2007-06-07 | Canon Inc | 情報処理装置及び情報処理方法 |
| JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | 本田技研工業株式会社 | 音声認識装置、音声認識方法、及び音声認識プログラム |
| JP4758919B2 (ja) * | 2007-01-22 | 2011-08-31 | 日本放送協会 | 音声認識装置及び音声認識プログラム |
| JP2008225254A (ja) * | 2007-03-14 | 2008-09-25 | Canon Inc | 音声合成装置及び方法並びにプログラム |
| US8275615B2 (en) | 2007-07-13 | 2012-09-25 | International Business Machines Corporation | Model weighting, selection and hypotheses combination for automatic speech recognition and machine translation |
| JP5273844B2 (ja) * | 2008-03-31 | 2013-08-28 | Kddi株式会社 | 字幕ずれ推定装置、字幕ずれ補正装置、再生装置および放送装置 |
| CN102027534B (zh) * | 2008-05-16 | 2013-07-31 | 日本电气株式会社 | 语言模型得分前瞻值赋值方法及设备 |
| JP5246948B2 (ja) * | 2009-03-27 | 2013-07-24 | Kddi株式会社 | 字幕ずれ補正装置、再生装置および放送装置 |
| CN103971685B (zh) * | 2013-01-30 | 2015-06-10 | 腾讯科技(深圳)有限公司 | 语音命令识别方法和系统 |
| CN105869624B (zh) * | 2016-03-29 | 2019-05-10 | 腾讯科技(深圳)有限公司 | 数字语音识别中语音解码网络的构建方法及装置 |
| JP6585112B2 (ja) * | 2017-03-17 | 2019-10-02 | 株式会社東芝 | 音声キーワード検出装置および音声キーワード検出方法 |
| CN112242144A (zh) * | 2019-07-17 | 2021-01-19 | 百度在线网络技术(北京)有限公司 | 基于流式注意力模型的语音识别解码方法、装置、设备以及计算机可读存储介质 |
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2712856B2 (ja) * | 1991-03-08 | 1998-02-16 | 三菱電機株式会社 | 音声認識装置 |
| JP3397372B2 (ja) * | 1993-06-16 | 2003-04-14 | キヤノン株式会社 | 音声認識方法及び装置 |
| US5621859A (en) * | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
| JP3450411B2 (ja) * | 1994-03-22 | 2003-09-22 | キヤノン株式会社 | 音声情報処理方法及び装置 |
| JP3581401B2 (ja) * | 1994-10-07 | 2004-10-27 | キヤノン株式会社 | 音声認識方法 |
| JP3453456B2 (ja) * | 1995-06-19 | 2003-10-06 | キヤノン株式会社 | 状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置 |
| JP3459712B2 (ja) * | 1995-11-01 | 2003-10-27 | キヤノン株式会社 | 音声認識方法及び装置及びコンピュータ制御装置 |
| JPH1097276A (ja) * | 1996-09-20 | 1998-04-14 | Canon Inc | 音声認識方法及び装置並びに記憶媒体 |
| US6076056A (en) * | 1997-09-19 | 2000-06-13 | Microsoft Corporation | Speech recognition system for recognizing continuous and isolated speech |
| US6018628A (en) * | 1998-06-16 | 2000-01-25 | Sun Microsystems, Inc. | Method of implementing parameterized types to be compatible with existing unparameterized libraries |
| US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
| JP4543294B2 (ja) * | 2000-03-14 | 2010-09-15 | ソニー株式会社 | 音声認識装置および音声認識方法、並びに記録媒体 |
| JP2001312293A (ja) * | 2000-04-28 | 2001-11-09 | Matsushita Electric Ind Co Ltd | 音声認識方法およびその装置、並びにコンピュータ読み取り可能な記憶媒体 |
| JP3728177B2 (ja) * | 2000-05-24 | 2005-12-21 | キヤノン株式会社 | 音声処理システム、装置、方法及び記憶媒体 |
| AU2000276400A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Search method based on single triphone tree for large vocabulary continuous speech recognizer |
| JP2002149187A (ja) * | 2000-11-07 | 2002-05-24 | Sony Corp | 音声認識装置および音声認識方法、並びに記録媒体 |
| JP2002189494A (ja) * | 2000-12-19 | 2002-07-05 | Atr Onsei Gengo Tsushin Kenkyusho:Kk | 音声認識システム |
| JP2002215187A (ja) | 2001-01-23 | 2002-07-31 | Matsushita Electric Ind Co Ltd | 音声認識方法及びその装置 |
| JP3885002B2 (ja) * | 2002-06-28 | 2007-02-21 | キヤノン株式会社 | 情報処理装置およびその方法 |
-
2004
- 2004-06-29 JP JP2004191460A patent/JP4541781B2/ja not_active Expired - Fee Related
-
2005
- 2005-06-24 US US11/165,167 patent/US7565290B2/en not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| JP2006011257A (ja) | 2006-01-12 |
| US20050288929A1 (en) | 2005-12-29 |
| US7565290B2 (en) | 2009-07-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11664020B2 (en) | Speech recognition method and apparatus | |
| JP4322815B2 (ja) | 音声認識システム及び方法 | |
| US9697827B1 (en) | Error reduction in speech processing | |
| Hwang et al. | Predicting unseen triphones with senones | |
| US7496512B2 (en) | Refining of segmental boundaries in speech waveforms using contextual-dependent models | |
| CN105336322B (zh) | 多音字模型训练方法、语音合成方法及装置 | |
| US10319373B2 (en) | Information processing device, information processing method, computer program product, and recognition system | |
| US20080077404A1 (en) | Speech recognition device, speech recognition method, and computer program product | |
| JP4541781B2 (ja) | 音声認識装置および方法 | |
| KR20140028174A (ko) | 음성 인식 방법 및 이를 적용한 전자 장치 | |
| US20100100379A1 (en) | Voice recognition correlation rule learning system, voice recognition correlation rule learning program, and voice recognition correlation rule learning method | |
| US20050228666A1 (en) | Method, apparatus, and system for building context dependent models for a large vocabulary continuous speech recognition (lvcsr) system | |
| CN106847259B (zh) | 一种音频关键词模板的筛选和优化方法 | |
| Duchateau et al. | Fast and accurate acoustic modelling with semi-continuous HMMs | |
| US20020040296A1 (en) | Phoneme assigning method | |
| KR102199445B1 (ko) | 클래스 기반 음향 모델의 변별 학습 방법 및 장치, 그리고 이를 이용한 음성 인식 장치 | |
| KR102299269B1 (ko) | 음성 및 스크립트를 정렬하여 음성 데이터베이스를 구축하는 방법 및 장치 | |
| US9355636B1 (en) | Selective speech recognition scoring using articulatory features | |
| JP6350935B2 (ja) | 音響モデル生成装置、音響モデルの生産方法、およびプログラム | |
| WO2012076895A1 (en) | Pattern recognition | |
| Ko et al. | Eigentriphones for context-dependent acoustic modeling | |
| Paul | New results with the Lincoln tied-mixture HMM CSR system | |
| JP3439700B2 (ja) | 音響モデル学習装置、音響モデル変換装置及び音声認識装置 | |
| JP4379050B2 (ja) | 音声認識装置、音声認識高速化方法、および、プログラム | |
| Wiggers et al. | Automatic Speech Recognition USING Hidden Markov Models |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20070627 |
|
| RD03 | Notification of appointment of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7423 Effective date: 20070627 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20100106 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20100406 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20100412 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20100531 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20100618 |
|
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20100624 |
|
| R150 | Certificate of patent or registration of utility model |
Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130702 Year of fee payment: 3 |
|
| LAPS | Cancellation because of no payment of annual fees |