JP2006011257A5 - Google Patents
- Publication number
- JP2006011257A5
- Authority
- JP
- Japan
- Prior art keywords
- acoustic model
- likelihood
- speech
- state
- calculating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004191460A JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | Speech recognition apparatus and method |
| US11/165,167 US7565290B2 (en) | 2004-06-29 | 2005-06-24 | Speech recognition method and apparatus |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004191460A JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | Speech recognition apparatus and method |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2006011257A (ja) | 2006-01-12 |
| JP2006011257A5 | 2010-02-25 |
| JP4541781B2 (ja) | 2010-09-08 |
Family
ID=35507163
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2004191460A (Expired - Fee Related) JP4541781B2 (ja) | 2004-06-29 | 2004-06-29 | Speech recognition apparatus and method |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US7565290B2 |
| JP (1) | JP4541781B2 |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
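| JP2007142840A (ja) * | 2005-11-18 | 2007-06-07 | Canon Inc | Information processing apparatus and information processing method |
| JP4188989B2 (ja) * | 2006-09-15 | 2008-12-03 | Honda Motor Co., Ltd. | Speech recognition apparatus, speech recognition method, and speech recognition program |
| JP4758919B2 (ja) * | 2007-01-22 | 2011-08-31 | Japan Broadcasting Corporation (NHK) | Speech recognition apparatus and speech recognition program |
| JP2008225254A (ja) * | 2007-03-14 | 2008-09-25 | Canon Inc | Speech synthesis apparatus, method, and program |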
| US8275615B2 (en) | 2007-07-13 | 2012-09-25 | International Business Machines Corporation | Model weighting, selection and hypotheses combination for automatic speech recognition and machine translation |
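| JP5273844B2 (ja) * | 2008-03-31 | 2013-08-28 | KDDI Corporation | Subtitle shift estimation apparatus, subtitle shift correction apparatus, playback apparatus, and broadcasting apparatus |
| CN102027534B (zh) * | 2008-05-16 | 2013-07-31 | NEC Corporation | Method and device for assigning language model score look-ahead values |
| JP5246948B2 (ja) * | 2009-03-27 | 2013-07-24 | KDDI Corporation | Subtitle shift correction apparatus, playback apparatus, and broadcasting apparatus |
| CN103971685B (zh) * | 2013-01-30 | 2015-06-10 | Tencent Technology (Shenzhen) Co., Ltd. | Voice command recognition method and system |
| CN105869624B (zh) * | 2016-03-29 | 2019-05-10 | Tencent Technology (Shenzhen) Co., Ltd. | Method and device for constructing a speech decoding network in digit speech recognition |
| JP6585112B2 (ja) * | 2017-03-17 | 2019-10-02 | Toshiba Corporation | Spoken keyword detection apparatus and spoken keyword detection method |
| CN112242144A (zh) * | 2019-07-17 | 2021-01-19 | Baidu Online Network Technology (Beijing) Co., Ltd. | Speech recognition decoding method, apparatus, device, and computer-readable storage medium based on a streaming attention model |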
Family Cites Families (19)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2712856B2 (ja) * | 1991-03-08 | 1998-02-16 | Mitsubishi Electric Corporation | Speech recognition apparatus |
| JP3397372B2 (ja) * | 1993-06-16 | 2003-04-14 | Canon Inc | Speech recognition method and apparatus |
| US5621859A (en) * | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
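| JP3450411B2 (ja) * | 1994-03-22 | 2003-09-22 | Canon Inc | Speech information processing method and apparatus |
| JP3581401B2 (ja) * | 1994-10-07 | 2004-10-27 | Canon Inc | Speech recognition method |
| JP3453456B2 (ja) * | 1995-06-19 | 2003-10-06 | Canon Inc | Method and apparatus for designing a state-sharing model, and speech recognition method and apparatus using the state-sharing model |
| JP3459712B2 (ja) * | 1995-11-01 | 2003-10-27 | Canon Inc | Speech recognition method and apparatus, and computer control apparatus |
| JPH1097276A (ja) * | 1996-09-20 | 1998-04-14 | Canon Inc | Speech recognition method and apparatus, and storage medium |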
| US6076056A (en) * | 1997-09-19 | 2000-06-13 | Microsoft Corporation | Speech recognition system for recognizing continuous and isolated speech |
| US6018628A (en) * | 1998-06-16 | 2000-01-25 | Sun Microsystems, Inc. | Method of implementing parameterized types to be compatible with existing unparameterized libraries |
| US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
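| JP4543294B2 (ja) * | 2000-03-14 | 2010-09-15 | Sony Corporation | Speech recognition apparatus, speech recognition method, and recording medium |
| JP2001312293A (ja) * | 2000-04-28 | 2001-11-09 | Matsushita Electric Ind Co Ltd | Speech recognition method and apparatus, and computer-readable storage medium |
| JP3728177B2 (ja) * | 2000-05-24 | 2005-12-21 | Canon Inc | Speech processing system, apparatus, method, and storage medium |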
| AU2000276400A1 (en) * | 2000-09-30 | 2002-04-15 | Intel Corporation | Search method based on single triphone tree for large vocabulary continuous speech recognizer |
| JP2002149187A (ja) * | 2000-11-07 | 2002-05-24 | Sony Corp | Speech recognition apparatus, speech recognition method, and recording medium |
| JP2002189494A (ja) * | 2000-12-19 | 2002-07-05 | Atr Onsei Gengo Tsushin Kenkyusho:Kk | Speech recognition system |
| JP2002215187A (ja) | 2001-01-23 | 2002-07-31 | Matsushita Electric Ind Co Ltd | Speech recognition method and apparatus |
| JP3885002B2 (ja) * | 2002-06-28 | 2007-02-21 | Canon Inc | Information processing apparatus and method |
2004
- 2004-06-29: JP application JP2004191460A, patent JP4541781B2 (ja), not active (Expired - Fee Related)

2005
- 2005-06-24: US application US11/165,167, patent US7565290B2 (en), not active (Expired - Fee Related)
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Koller et al. | Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers | |
| JP2006011257A5 | | |
| US9721561B2 (en) | Method and apparatus for speech recognition using neural networks with speaker adaptation | |
| WO2016188558A1 (en) | Select one of plurality of neural networks | |
| KR100406604B1 | Speech recognition method and apparatus | |
| US20160322042A1 (en) | Fast deep neural network feature transformation via optimized memory bandwidth utilization | |
| JP2017525993A5 | | |
| US10515312B1 (en) | Neural network model compaction using selective unit removal | |
| WO2016139670A8 (en) | System and method for generating accurate speech transcription from natural speech audio signals | |
| US9886948B1 (en) | Neural network processing of multiple feature streams using max pooling and restricted connectivity | |
| CN112578419B | GPS data reconstruction method based on a GRU network and Kalman filtering | |
| JP2022515048A5 | | |
| WO2007118030A3 (en) | Methods and systems for optimizing model adaptation for a speech recognition system | |
| US9177552B2 (en) | Method and apparatus for setting selected recognition parameters to minimize an application cost function | |
| CN105989839B | Speech recognition method and device | |
| JP4541781B2 | Speech recognition apparatus and method | |
| Chen et al. | A novel keyword+ LVCSR-filler based grammar network representation for spoken keyword search | |
| BenZeghiba et al. | Phonotactic Language Recognition Using MLP Features. | |
| Saon et al. | A comparison of two optimization techniques for sequence discriminative training of deep neural networks | |
| Nolden et al. | Extended search space pruning in LVCSR | |
| WO2017159207A1 | Processing execution device, control method for processing execution device, and control program | |
| Tian et al. | Deep neural networks based speaker modeling at different levels of phonetic granularity | |
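| KR100612843B1 | Method for compensating the probability density function for a hidden Markov model, and speech recognition method and apparatus using the same | |
| CN105931636A | Multilingual speech recognition apparatus and method | |
| RU2698773C2 | Apparatus and method for speech recognition | |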