WO2009004750A1 - 音声認識装置 - Google Patents

音声認識装置 Download PDF

Info

Publication number
WO2009004750A1
WO2009004750A1 PCT/JP2008/000772 JP2008000772W WO2009004750A1 WO 2009004750 A1 WO2009004750 A1 WO 2009004750A1 JP 2008000772 W JP2008000772 W JP 2008000772W WO 2009004750 A1 WO2009004750 A1 WO 2009004750A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice
utterance
audio signal
system response
contents
Prior art date
Application number
PCT/JP2008/000772
Other languages
English (en)
French (fr)
Inventor
Yuzuru Inoue
Tadashi Suzuki
Fumitaka Sato
Takayoshi Chikuri
Original Assignee
Mitsubishi Electric Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corporation filed Critical Mitsubishi Electric Corporation
Priority to JP2009521505A priority Critical patent/JP4859982B2/ja
Priority to DE112008001334.9T priority patent/DE112008001334B4/de
Priority to US12/599,217 priority patent/US8407051B2/en
Priority to CN2008800222921A priority patent/CN101689366B/zh
Publication of WO2009004750A1 publication Critical patent/WO2009004750A1/ja

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Navigation (AREA)
  • User Interface Of Digital Computer (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)

Abstract

音声認識の開始を指示する音声開始指示部3と、発話された音声を入力して音声信号に変換する音声入力部1と、音声信号に基づき音声を認識する音声認識部2と、音声開始指示部による指示がなされてから、音声入力部から音声信号が送られてくるまでの時間を検出する発話開始時間検出部4と、発話開始時間検出部で検出された時間と所定の閾値とを比較して発話開始の早遅を表す発話タイミングを判定する発話タイミング判定部5と、判定された発話タイミングに応じて、音声認識部における認識結果を提示する際の提示内容を決定する対話制御部6と、決定された提示内容に基づきシステム応答を生成するシステム応答生成部7と、生成されたシステム応答を出力する出力部8、9とを備えている。
PCT/JP2008/000772 2007-07-02 2008-03-27 音声認識装置 WO2009004750A1 (ja)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2009521505A JP4859982B2 (ja) 2007-07-02 2008-03-27 音声認識装置
DE112008001334.9T DE112008001334B4 (de) 2007-07-02 2008-03-27 Spracherkennungsvorrichtung
US12/599,217 US8407051B2 (en) 2007-07-02 2008-03-27 Speech recognizing apparatus
CN2008800222921A CN101689366B (zh) 2007-07-02 2008-03-27 声音识别装置

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2007-174386 2007-07-02
JP2007174386 2007-07-02

Publications (1)

Publication Number Publication Date
WO2009004750A1 true WO2009004750A1 (ja) 2009-01-08

Family

ID=40225818

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2008/000772 WO2009004750A1 (ja) 2007-07-02 2008-03-27 音声認識装置

Country Status (5)

Country Link
US (1) US8407051B2 (ja)
JP (1) JP4859982B2 (ja)
CN (1) CN101689366B (ja)
DE (1) DE112008001334B4 (ja)
WO (1) WO2009004750A1 (ja)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011039222A (ja) * 2009-08-10 2011-02-24 Nec Corp 音声認識システム、音声認識方法および音声認識プログラム
US20110276329A1 (en) * 2009-01-20 2011-11-10 Masaaki Ayabe Speech dialogue apparatus, dialogue control method, and dialogue control program
JP2018045123A (ja) * 2016-09-15 2018-03-22 東芝テック株式会社 音声認識装置、音声認識方法及び音声認識プログラム
WO2022215104A1 (ja) * 2021-04-05 2022-10-13 三菱電機株式会社 音声対話装置および音声対話方法

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5834449B2 (ja) * 2010-04-22 2015-12-24 富士通株式会社 発話状態検出装置、発話状態検出プログラムおよび発話状態検出方法
CN103038818B (zh) * 2010-06-24 2016-10-12 本田技研工业株式会社 在车载语音识别系统与车外语音识别系统之间的通信系统和方法
KR20140089871A (ko) 2013-01-07 2014-07-16 삼성전자주식회사 대화형 서버, 그 제어 방법 및 대화형 시스템
JP6389171B2 (ja) * 2013-06-19 2018-09-12 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America 音声対話方法、及び機器
US9953644B2 (en) 2014-12-01 2018-04-24 At&T Intellectual Property I, L.P. Targeted clarification questions in speech recognition with concept presence score and concept correctness score
KR102420450B1 (ko) 2015-09-23 2022-07-14 삼성전자주식회사 음성인식장치, 음성인식방법 및 컴퓨터 판독가능 기록매체
CN106027588A (zh) * 2015-12-09 2016-10-12 展视网(北京)科技有限公司 一种语音识别车载终端控制方法
US10475447B2 (en) * 2016-01-25 2019-11-12 Ford Global Technologies, Llc Acoustic and domain based speech recognition for vehicles
JP2019200393A (ja) * 2018-05-18 2019-11-21 シャープ株式会社 判定装置、電子機器、応答システム、判定装置の制御方法、および制御プログラム
JP6936772B2 (ja) * 2018-06-04 2021-09-22 株式会社ホンダアクセス 情報提供装置
RU2744063C1 (ru) 2018-12-18 2021-03-02 Общество С Ограниченной Ответственностью "Яндекс" Способ и система определения говорящего пользователя управляемого голосом устройства
DE102022112743B4 (de) 2022-05-20 2024-02-01 Audi Aktiengesellschaft Verfahren zur Verbesserung der Qualität einer Audio- und/oder Videoaufzeichnung sowie Steuervorrichtung für ein mobiles Endgerät

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0518118B2 (ja) * 1984-05-24 1993-03-11 Tokyo Shibaura Electric Co
JP2002149191A (ja) * 2000-11-09 2002-05-24 Toyota Central Res & Dev Lab Inc 音声入力装置
JP2003029778A (ja) * 2001-07-16 2003-01-31 Fujitsu Ten Ltd ナビゲーションシステムにおける音声対話インターフェース処理方法
JP2006313261A (ja) * 2005-05-09 2006-11-16 Mitsubishi Electric Corp 音声認識装置並びに音声認識プログラム及び音声認識プログラムを記録したコンピュータ読み取り可能な記録媒体
JP2007004054A (ja) * 2005-06-27 2007-01-11 Nissan Motor Co Ltd 音声対話装置及び音声理解結果生成方法

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5918222A (en) * 1995-03-17 1999-06-29 Kabushiki Kaisha Toshiba Information disclosing apparatus and multi-modal information input/output system
US6012027A (en) * 1997-05-27 2000-01-04 Ameritech Corporation Criteria for usable repetitions of an utterance during speech reference enrollment
DE19941227A1 (de) 1999-08-30 2001-03-08 Philips Corp Intellectual Pty Verfahren und Anordnung zur Spracherkennung
DE19956747C1 (de) 1999-11-25 2001-01-11 Siemens Ag Verfahren und Vorrichtung zur Spracherkennung sowie ein Telekommunikationssystem
JP2002149187A (ja) * 2000-11-07 2002-05-24 Sony Corp 音声認識装置および音声認識方法、並びに記録媒体
JP2003091299A (ja) 2001-07-13 2003-03-28 Honda Motor Co Ltd 車載用音声認識装置
GB0224806D0 (en) 2002-10-24 2002-12-04 Ibm Method and apparatus for a interactive voice response system
JP2004239963A (ja) * 2003-02-03 2004-08-26 Mitsubishi Electric Corp 車載制御装置
JP2004333543A (ja) 2003-04-30 2004-11-25 Matsushita Electric Ind Co Ltd 音声対話システム及び音声対話方法
US7724889B2 (en) 2004-11-29 2010-05-25 At&T Intellectual Property I, L.P. System and method for utilizing confidence levels in automated call routing
US8090582B2 (en) 2005-12-14 2012-01-03 Mitsubishi Electric Corporation Voice recognition apparatus
JP5018118B2 (ja) 2007-02-15 2012-09-05 コニカミノルタビジネステクノロジーズ株式会社 文書管理装置、文書管理方法及び文書管理プログラム
JP2008203559A (ja) * 2007-02-20 2008-09-04 Toshiba Corp 対話装置及び方法

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0518118B2 (ja) * 1984-05-24 1993-03-11 Tokyo Shibaura Electric Co
JP2002149191A (ja) * 2000-11-09 2002-05-24 Toyota Central Res & Dev Lab Inc 音声入力装置
JP2003029778A (ja) * 2001-07-16 2003-01-31 Fujitsu Ten Ltd ナビゲーションシステムにおける音声対話インターフェース処理方法
JP2006313261A (ja) * 2005-05-09 2006-11-16 Mitsubishi Electric Corp 音声認識装置並びに音声認識プログラム及び音声認識プログラムを記録したコンピュータ読み取り可能な記録媒体
JP2007004054A (ja) * 2005-06-27 2007-01-11 Nissan Motor Co Ltd 音声対話装置及び音声理解結果生成方法

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110276329A1 (en) * 2009-01-20 2011-11-10 Masaaki Ayabe Speech dialogue apparatus, dialogue control method, and dialogue control program
JP2011039222A (ja) * 2009-08-10 2011-02-24 Nec Corp 音声認識システム、音声認識方法および音声認識プログラム
JP2018045123A (ja) * 2016-09-15 2018-03-22 東芝テック株式会社 音声認識装置、音声認識方法及び音声認識プログラム
WO2022215104A1 (ja) * 2021-04-05 2022-10-13 三菱電機株式会社 音声対話装置および音声対話方法

Also Published As

Publication number Publication date
JPWO2009004750A1 (ja) 2010-08-26
US20110208525A1 (en) 2011-08-25
CN101689366B (zh) 2011-12-07
US8407051B2 (en) 2013-03-26
DE112008001334B4 (de) 2016-12-15
CN101689366A (zh) 2010-03-31
DE112008001334T5 (de) 2010-05-12
JP4859982B2 (ja) 2012-01-25

Similar Documents

Publication Publication Date Title
WO2009004750A1 (ja) 音声認識装置
JP6436400B2 (ja) 音声コマンド入力装置および音声コマンド入力方法
US8170875B2 (en) Speech end-pointer
ATE390684T1 (de) Verbesserung der verständlichkeit von sprache enthaltenden audiosignalen
EP4235648A3 (en) Language model biasing
TW201640493A (zh) 具有兩個麥克風的音訊緩衝追趕設備及方法
WO2019161193A3 (en) System and method for adaptive detection of spoken language via multiple speech models
TW200715146A (en) System and method for detecting the recognizability of inputted speech signals
WO2015057907A3 (en) System and method for learning alternate pronunciations for speech recognition
GB0207343D0 (en) Signal processing system
JP2017535809A5 (ja)
ATE441175T1 (de) Verteiltes spracherkennungsverfahren
WO2008064358A3 (en) Recognition of speech in editable audio streams
WO2009145508A3 (ko) 실시간 호출명령어 인식을 이용한 잡음환경에서의 음성구간검출과 연속음성인식 시스템
TW200729706A (en) Method and audio system for controlling a gain of a voice signal
AU2003274432A1 (en) Method and system for speech recognition
ATE491201T1 (de) Sprachdialogsystem mit an den benutzer angepasster sprachausgabe
AU2003269418A1 (en) Method for operating a speech recognition system
US10672395B2 (en) Voice control system and method for voice selection, and smart robot using the same
WO2015012680A3 (en) A method for speech watermarking in speaker verification
AU2003216582A1 (en) Method and system for speech recognition of symbol sequences
WO2009066401A1 (ja) オーディオ機器用音声認識装置
KR101811716B1 (ko) 음성 인식 방법 및 그에 따른 음성 인식 장치
WO2008108239A1 (ja) 音声認識システム、方法およびプログラム
BRPI0803090A2 (pt) método e dispositivo para reconhecer pulsos

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880022292.1

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08738440

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2009521505

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 12599217

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 1120080013349

Country of ref document: DE

RET De translation (de og part 6b)

Ref document number: 112008001334

Country of ref document: DE

Date of ref document: 20100512

Kind code of ref document: P

122 Ep: pct application non-entry in european phase

Ref document number: 08738440

Country of ref document: EP

Kind code of ref document: A1