JP4541781B2 - 音声認識装置および方法 - Google Patents

音声認識装置および方法 Download PDF

Info

Publication number
JP4541781B2
JP4541781B2 JP2004191460A JP2004191460A JP4541781B2 JP 4541781 B2 JP4541781 B2 JP 4541781B2 JP 2004191460 A JP2004191460 A JP 2004191460A JP 2004191460 A JP2004191460 A JP 2004191460A JP 4541781 B2 JP4541781 B2 JP 4541781B2
Authority
JP
Japan
Prior art keywords
acoustic model
likelihood
state
speech
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP2004191460A
Other languages
English (en)
Japanese (ja)
Other versions
JP2006011257A (ja
JP2006011257A5 (https=
Inventor
英生 久保山
俊明 深田
康弘 小森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to JP2004191460A priority Critical patent/JP4541781B2/ja
Priority to US11/165,167 priority patent/US7565290B2/en
Publication of JP2006011257A publication Critical patent/JP2006011257A/ja
Publication of JP2006011257A5 publication Critical patent/JP2006011257A5/ja
Application granted granted Critical
Publication of JP4541781B2 publication Critical patent/JP4541781B2/ja
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
JP2004191460A 2004-06-29 2004-06-29 音声認識装置および方法 Expired - Fee Related JP4541781B2 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2004191460A JP4541781B2 (ja) 2004-06-29 2004-06-29 音声認識装置および方法
US11/165,167 US7565290B2 (en) 2004-06-29 2005-06-24 Speech recognition method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2004191460A JP4541781B2 (ja) 2004-06-29 2004-06-29 音声認識装置および方法

Publications (3)

Publication Number Publication Date
JP2006011257A JP2006011257A (ja) 2006-01-12
JP2006011257A5 JP2006011257A5 (https=) 2010-02-25
JP4541781B2 true JP4541781B2 (ja) 2010-09-08

Family

ID=35507163

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2004191460A Expired - Fee Related JP4541781B2 (ja) 2004-06-29 2004-06-29 音声認識装置および方法

Country Status (2)

Country Link
US (1) US7565290B2 (https=)
JP (1) JP4541781B2 (https=)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007142840A (ja) * 2005-11-18 2007-06-07 Canon Inc 情報処理装置及び情報処理方法
JP4188989B2 (ja) * 2006-09-15 2008-12-03 本田技研工業株式会社 音声認識装置、音声認識方法、及び音声認識プログラム
JP4758919B2 (ja) * 2007-01-22 2011-08-31 日本放送協会 音声認識装置及び音声認識プログラム
JP2008225254A (ja) * 2007-03-14 2008-09-25 Canon Inc 音声合成装置及び方法並びにプログラム
US8275615B2 (en) 2007-07-13 2012-09-25 International Business Machines Corporation Model weighting, selection and hypotheses combination for automatic speech recognition and machine translation
JP5273844B2 (ja) * 2008-03-31 2013-08-28 Kddi株式会社 字幕ずれ推定装置、字幕ずれ補正装置、再生装置および放送装置
CN102027534B (zh) * 2008-05-16 2013-07-31 日本电气株式会社 语言模型得分前瞻值赋值方法及设备
JP5246948B2 (ja) * 2009-03-27 2013-07-24 Kddi株式会社 字幕ずれ補正装置、再生装置および放送装置
CN103971685B (zh) * 2013-01-30 2015-06-10 腾讯科技(深圳)有限公司 语音命令识别方法和系统
CN105869624B (zh) * 2016-03-29 2019-05-10 腾讯科技(深圳)有限公司 数字语音识别中语音解码网络的构建方法及装置
JP6585112B2 (ja) * 2017-03-17 2019-10-02 株式会社東芝 音声キーワード検出装置および音声キーワード検出方法
CN112242144A (zh) * 2019-07-17 2021-01-19 百度在线网络技术(北京)有限公司 基于流式注意力模型的语音识别解码方法、装置、设备以及计算机可读存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2712856B2 (ja) * 1991-03-08 1998-02-16 三菱電機株式会社 音声認識装置
JP3397372B2 (ja) * 1993-06-16 2003-04-14 キヤノン株式会社 音声認識方法及び装置
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
JP3450411B2 (ja) * 1994-03-22 2003-09-22 キヤノン株式会社 音声情報処理方法及び装置
JP3581401B2 (ja) * 1994-10-07 2004-10-27 キヤノン株式会社 音声認識方法
JP3453456B2 (ja) * 1995-06-19 2003-10-06 キヤノン株式会社 状態共有モデルの設計方法及び装置ならびにその状態共有モデルを用いた音声認識方法および装置
JP3459712B2 (ja) * 1995-11-01 2003-10-27 キヤノン株式会社 音声認識方法及び装置及びコンピュータ制御装置
JPH1097276A (ja) * 1996-09-20 1998-04-14 Canon Inc 音声認識方法及び装置並びに記憶媒体
US6076056A (en) * 1997-09-19 2000-06-13 Microsoft Corporation Speech recognition system for recognizing continuous and isolated speech
US6018628A (en) * 1998-06-16 2000-01-25 Sun Microsystems, Inc. Method of implementing parameterized types to be compatible with existing unparameterized libraries
US6542866B1 (en) * 1999-09-22 2003-04-01 Microsoft Corporation Speech recognition method and apparatus utilizing multiple feature streams
JP4543294B2 (ja) * 2000-03-14 2010-09-15 ソニー株式会社 音声認識装置および音声認識方法、並びに記録媒体
JP2001312293A (ja) * 2000-04-28 2001-11-09 Matsushita Electric Ind Co Ltd 音声認識方法およびその装置、並びにコンピュータ読み取り可能な記憶媒体
JP3728177B2 (ja) * 2000-05-24 2005-12-21 キヤノン株式会社 音声処理システム、装置、方法及び記憶媒体
AU2000276400A1 (en) * 2000-09-30 2002-04-15 Intel Corporation Search method based on single triphone tree for large vocabulary continuous speech recognizer
JP2002149187A (ja) * 2000-11-07 2002-05-24 Sony Corp 音声認識装置および音声認識方法、並びに記録媒体
JP2002189494A (ja) * 2000-12-19 2002-07-05 Atr Onsei Gengo Tsushin Kenkyusho:Kk 音声認識システム
JP2002215187A (ja) 2001-01-23 2002-07-31 Matsushita Electric Ind Co Ltd 音声認識方法及びその装置
JP3885002B2 (ja) * 2002-06-28 2007-02-21 キヤノン株式会社 情報処理装置およびその方法

Also Published As

Publication number Publication date
JP2006011257A (ja) 2006-01-12
US20050288929A1 (en) 2005-12-29
US7565290B2 (en) 2009-07-21

Similar Documents

Publication Publication Date Title
US11664020B2 (en) Speech recognition method and apparatus
JP4322815B2 (ja) 音声認識システム及び方法
US9697827B1 (en) Error reduction in speech processing
Hwang et al. Predicting unseen triphones with senones
US7496512B2 (en) Refining of segmental boundaries in speech waveforms using contextual-dependent models
CN105336322B (zh) 多音字模型训练方法、语音合成方法及装置
US10319373B2 (en) Information processing device, information processing method, computer program product, and recognition system
US20080077404A1 (en) Speech recognition device, speech recognition method, and computer program product
JP4541781B2 (ja) 音声認識装置および方法
KR20140028174A (ko) 음성 인식 방법 및 이를 적용한 전자 장치
US20100100379A1 (en) Voice recognition correlation rule learning system, voice recognition correlation rule learning program, and voice recognition correlation rule learning method
US20050228666A1 (en) Method, apparatus, and system for building context dependent models for a large vocabulary continuous speech recognition (lvcsr) system
CN106847259B (zh) 一种音频关键词模板的筛选和优化方法
Duchateau et al. Fast and accurate acoustic modelling with semi-continuous HMMs
US20020040296A1 (en) Phoneme assigning method
KR102199445B1 (ko) 클래스 기반 음향 모델의 변별 학습 방법 및 장치, 그리고 이를 이용한 음성 인식 장치
KR102299269B1 (ko) 음성 및 스크립트를 정렬하여 음성 데이터베이스를 구축하는 방법 및 장치
US9355636B1 (en) Selective speech recognition scoring using articulatory features
JP6350935B2 (ja) 音響モデル生成装置、音響モデルの生産方法、およびプログラム
WO2012076895A1 (en) Pattern recognition
Ko et al. Eigentriphones for context-dependent acoustic modeling
Paul New results with the Lincoln tied-mixture HMM CSR system
JP3439700B2 (ja) 音響モデル学習装置、音響モデル変換装置及び音声認識装置
JP4379050B2 (ja) 音声認識装置、音声認識高速化方法、および、プログラム
Wiggers et al. Automatic Speech Recognition USING Hidden Markov Models

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20070627

RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20070627

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100106

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20100406

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20100412

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20100531

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20100618

A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20100624

R150 Certificate of patent or registration of utility model

Free format text: JAPANESE INTERMEDIATE CODE: R150

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20130702

Year of fee payment: 3

LAPS Cancellation because of no payment of annual fees