WO2003036617A1 - Appareil de reconnaissance vocale et procede de reconnaissance de la parole - Google Patents

Appareil de reconnaissance vocale et procede de reconnaissance de la parole Download PDF

Info

Publication number
WO2003036617A1
WO2003036617A1 PCT/JP2002/010868 JP0210868W WO03036617A1 WO 2003036617 A1 WO2003036617 A1 WO 2003036617A1 JP 0210868 W JP0210868 W JP 0210868W WO 03036617 A1 WO03036617 A1 WO 03036617A1
Authority
WO
WIPO (PCT)
Prior art keywords
speech recognition
distance
acoustic model
recognition unit
speech
Prior art date
Application number
PCT/JP2002/010868
Other languages
English (en)
French (fr)
Inventor
Yasuharu Asano
Original Assignee
Sony Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corporation filed Critical Sony Corporation
Priority to DE60234530T priority Critical patent/DE60234530D1/de
Priority to EP02802031A priority patent/EP1441328B1/en
Priority to US10/451,285 priority patent/US7031917B2/en
Publication of WO2003036617A1 publication Critical patent/WO2003036617A1/ja
Priority to US11/362,331 priority patent/US7321853B2/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Toys (AREA)
  • Manipulator (AREA)
  • Traffic Control Systems (AREA)
  • Measurement Of Optical Distance (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
  • Length Measuring Devices By Optical Means (AREA)
PCT/JP2002/010868 2001-10-22 2002-10-21 Appareil de reconnaissance vocale et procede de reconnaissance de la parole WO2003036617A1 (fr)

Priority Applications (4)

Application Number Priority Date Filing Date Title
DE60234530T DE60234530D1 (de) 2001-10-22 2002-10-21 Vorrichtung und verfahren zur spracherkennung
EP02802031A EP1441328B1 (en) 2001-10-22 2002-10-21 Speech recognition apparatus and speech recognition method
US10/451,285 US7031917B2 (en) 2001-10-22 2002-10-21 Speech recognition apparatus using distance based acoustic models
US11/362,331 US7321853B2 (en) 2001-10-22 2006-02-24 Speech recognition apparatus and speech recognition method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001-323012 2001-10-22
JP2001323012A JP2003131683A (ja) 2001-10-22 2001-10-22 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US10451285 A-371-Of-International 2002-10-21
US11/362,331 Continuation US7321853B2 (en) 2001-10-22 2006-02-24 Speech recognition apparatus and speech recognition method

Publications (1)

Publication Number Publication Date
WO2003036617A1 true WO2003036617A1 (fr) 2003-05-01

Family

ID=19139964

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2002/010868 WO2003036617A1 (fr) 2001-10-22 2002-10-21 Appareil de reconnaissance vocale et procede de reconnaissance de la parole

Country Status (6)

Country Link
US (2) US7031917B2 (ja)
EP (1) EP1441328B1 (ja)
JP (1) JP2003131683A (ja)
CN (1) CN1488134A (ja)
DE (1) DE60234530D1 (ja)
WO (1) WO2003036617A1 (ja)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106328141A (zh) * 2016-09-05 2017-01-11 南京大学 一种面向移动终端的超声波唇读识别装置及方法
CN109377991A (zh) * 2018-09-30 2019-02-22 珠海格力电器股份有限公司 一种智能设备控制方法及装置
CN112151080A (zh) * 2020-10-28 2020-12-29 成都启英泰伦科技有限公司 一种录制和处理训练语料的方法

Families Citing this family (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090030552A1 (en) * 2002-12-17 2009-01-29 Japan Science And Technology Agency Robotics visual and auditory system
WO2004071102A1 (ja) * 2003-01-20 2004-08-19 Sanyo Electric Co,. Ltd. 立体視用映像提供方法及び立体映像表示装置
US8103873B2 (en) * 2003-09-05 2012-01-24 Emc Corporation Method and system for processing auditory communications
US8209185B2 (en) 2003-09-05 2012-06-26 Emc Corporation Interface for management of auditory communications
JP4516527B2 (ja) * 2003-11-12 2010-08-04 本田技研工業株式会社 音声認識装置
US20060004818A1 (en) * 2004-07-01 2006-01-05 Claudatos Christopher H Efficient information management
US8180743B2 (en) * 2004-07-01 2012-05-15 Emc Corporation Information management
US9268780B2 (en) 2004-07-01 2016-02-23 Emc Corporation Content-driven information lifecycle management
US8229904B2 (en) 2004-07-01 2012-07-24 Emc Corporation Storage pools for information management
US8180742B2 (en) * 2004-07-01 2012-05-15 Emc Corporation Policy-based information management
US8244542B2 (en) * 2004-07-01 2012-08-14 Emc Corporation Video surveillance
JP4600736B2 (ja) * 2004-07-22 2010-12-15 ソニー株式会社 ロボット制御装置および方法、記録媒体、並びにプログラム
US8626514B2 (en) 2004-08-31 2014-01-07 Emc Corporation Interface for management of multiple auditory communications
JP4204541B2 (ja) * 2004-12-24 2009-01-07 株式会社東芝 対話型ロボット、対話型ロボットの音声認識方法および対話型ロボットの音声認識プログラム
KR20080046199A (ko) * 2005-09-21 2008-05-26 코닌클리케 필립스 일렉트로닉스 엔.브이. 원거리에 위치한 마이크로폰을 사용한 음성 작동 제어를가진 초음파 이미징 시스템
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
WO2008001486A1 (fr) * 2006-06-29 2008-01-03 Nec Corporation Dispositif et programme de traitement vocal, et procédé de traitement vocal
JP4469880B2 (ja) * 2007-08-09 2010-06-02 株式会社東芝 音声処理装置及び方法
CN101411946B (zh) * 2007-10-19 2012-03-28 鸿富锦精密工业(深圳)有限公司 玩具恐龙
JP5075664B2 (ja) * 2008-02-15 2012-11-21 株式会社東芝 音声対話装置及び支援方法
TW200937348A (en) * 2008-02-19 2009-09-01 Univ Nat Chiao Tung Calibration method for image capturing device
US20090287489A1 (en) * 2008-05-15 2009-11-19 Palm, Inc. Speech processing for plurality of users
CN101610360A (zh) * 2008-06-19 2009-12-23 鸿富锦精密工业(深圳)有限公司 自动追踪声源的摄像装置
JP5617083B2 (ja) * 2009-09-03 2014-11-05 本田技研工業株式会社 コマンド認識装置、コマンド認識方法、及びコマンド認識ロボット
US8676581B2 (en) * 2010-01-22 2014-03-18 Microsoft Corporation Speech recognition analysis via identification information
JP5393544B2 (ja) * 2010-03-12 2014-01-22 本田技研工業株式会社 ロボット、ロボット制御方法およびプログラム
US9105053B2 (en) * 2010-03-23 2015-08-11 Nokia Technologies Oy Method and apparatus for determining a user age range
US9274744B2 (en) 2010-09-10 2016-03-01 Amazon Technologies, Inc. Relative position-inclusive device interfaces
US8700392B1 (en) * 2010-09-10 2014-04-15 Amazon Technologies, Inc. Speech-inclusive device interfaces
US8886532B2 (en) * 2010-10-27 2014-11-11 Microsoft Corporation Leveraging interaction context to improve recognition confidence scores
US20120143611A1 (en) * 2010-12-07 2012-06-07 Microsoft Corporation Trajectory Tiling Approach for Text-to-Speech
KR101791907B1 (ko) 2011-01-04 2017-11-02 삼성전자주식회사 위치 기반의 음향 처리 장치 및 방법
US9223415B1 (en) 2012-01-17 2015-12-29 Amazon Technologies, Inc. Managing resource usage for task performance
JP5862349B2 (ja) * 2012-02-16 2016-02-16 株式会社Jvcケンウッド ノイズ低減装置、音声入力装置、無線通信装置、およびノイズ低減方法
US8831957B2 (en) * 2012-08-01 2014-09-09 Google Inc. Speech recognition models based on location indicia
US9208777B2 (en) * 2013-01-25 2015-12-08 Microsoft Technology Licensing, Llc Feature space transformation for personalization using generalized i-vector clustering
CA2914677A1 (en) * 2013-06-04 2014-12-11 Ims Solutions Inc. Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning
JP6169910B2 (ja) * 2013-07-08 2017-07-26 本田技研工業株式会社 音声処理装置
US9310800B1 (en) * 2013-07-30 2016-04-12 The Boeing Company Robotic platform evaluation system
US9847082B2 (en) * 2013-08-23 2017-12-19 Honeywell International Inc. System for modifying speech recognition and beamforming using a depth image
US11199906B1 (en) 2013-09-04 2021-12-14 Amazon Technologies, Inc. Global user input management
US9367203B1 (en) 2013-10-04 2016-06-14 Amazon Technologies, Inc. User interface techniques for simulating three-dimensional depth
CN104715753B (zh) * 2013-12-12 2018-08-31 联想(北京)有限公司 一种数据处理的方法及电子设备
US9472186B1 (en) * 2014-01-28 2016-10-18 Nvoq Incorporated Automated training of a user audio profile using transcribed medical record recordings
CN103928025B (zh) 2014-04-08 2017-06-27 华为技术有限公司 一种语音识别的方法及移动终端
CN104267920B (zh) * 2014-09-29 2017-10-27 北京奇艺世纪科技有限公司 用户识别方法、装置、系统及显示模式切换方法、装置
EP3958255A1 (en) * 2015-01-16 2022-02-23 Samsung Electronics Co., Ltd. Method and device for performing voice recognition
JP6703460B2 (ja) * 2016-08-25 2020-06-03 本田技研工業株式会社 音声処理装置、音声処理方法及び音声処理プログラム
CN106356064A (zh) * 2016-08-30 2017-01-25 合肥前瞻智能科技有限公司 一种定向声控开关语音识别系统
US10140987B2 (en) * 2016-09-16 2018-11-27 International Business Machines Corporation Aerial drone companion device and a method of operating an aerial drone companion device
KR20180037543A (ko) * 2016-10-04 2018-04-12 삼성전자주식회사 음성 인식 전자 장치
US20180158458A1 (en) * 2016-10-21 2018-06-07 Shenetics, Inc. Conversational voice interface of connected devices, including toys, cars, avionics, mobile, iot and home appliances
JP6705410B2 (ja) * 2017-03-27 2020-06-03 カシオ計算機株式会社 音声認識装置、音声認識方法、プログラム及びロボット
KR102012968B1 (ko) * 2018-08-07 2019-08-27 주식회사 서큘러스 인터렉션 로봇의 제어 방법 및 제어 서버
CN109637540B (zh) * 2019-02-28 2021-02-26 北京百度网讯科技有限公司 智能语音设备的蓝牙评测方法、装置、设备及介质
CN110515449B (zh) * 2019-08-30 2021-06-04 北京安云世纪科技有限公司 唤醒智能设备的方法及装置
JP7395446B2 (ja) * 2020-09-08 2023-12-11 株式会社東芝 音声認識装置、方法およびプログラム
CN113628621A (zh) * 2021-08-18 2021-11-09 北京声智科技有限公司 一种实现设备就近唤醒的方法、系统及装置
CN114464184B (zh) * 2022-04-11 2022-09-02 北京荣耀终端有限公司 语音识别的方法、设备和存储介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02230896A (ja) * 1989-03-03 1990-09-13 Nippon Telegr & Teleph Corp <Ntt> 音響信号入力装置
JPH06236196A (ja) 1993-02-08 1994-08-23 Nippon Telegr & Teleph Corp <Ntt> 音声認識方法および装置
JPH0713591A (ja) * 1993-06-22 1995-01-17 Hitachi Ltd 音声認識装置および音声認識方法
JPH0788791A (ja) * 1993-09-20 1995-04-04 Mitsubishi Electric Corp ロボット装置およびその周辺装置
JPH1113507A (ja) * 1997-06-27 1999-01-19 Mitsubishi Motors Corp 自動追従走行システム

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5946427A (ja) * 1982-09-10 1984-03-15 Matsushita Electric Ind Co Ltd 加熱装置
JPS63121098A (ja) * 1986-11-10 1988-05-25 松下電器産業株式会社 電話用音声認識装置
JPS63121097A (ja) * 1986-11-10 1988-05-25 松下電器産業株式会社 電話用音声認識装置
JPS63248218A (ja) * 1987-04-03 1988-10-14 Oki Electric Ind Co Ltd 適応制御フイルタ
JPH02132499A (ja) * 1988-11-14 1990-05-21 Toshiba Corp 音声入力装置
US5008941A (en) * 1989-03-31 1991-04-16 Kurzweil Applied Intelligence, Inc. Method and apparatus for automatically updating estimates of undesirable components of the speech signal in a speech recognition system
JPH05227531A (ja) * 1992-02-17 1993-09-03 Sanyo Electric Co Ltd カメラ監視システム
US5307405A (en) * 1992-09-25 1994-04-26 Qualcomm Incorporated Network echo canceller
JP3714706B2 (ja) * 1995-02-17 2005-11-09 株式会社竹中工務店 音抽出装置
US5905773A (en) * 1996-03-28 1999-05-18 Northern Telecom Limited Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models
JPH11237897A (ja) * 1998-02-23 1999-08-31 Kenwood Corp 音響装置
JPH11296192A (ja) * 1998-04-10 1999-10-29 Pioneer Electron Corp 音声認識における音声特徴量の補正方法、音声認識方法、音声認識装置及び音声認識プログラムを記録した記録媒体
JP3919337B2 (ja) * 1998-06-19 2007-05-23 株式会社東海理化電機製作所 車両用音声認識装置
US6904405B2 (en) * 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
US6752498B2 (en) * 2001-05-14 2004-06-22 Eastman Kodak Company Adaptive autostereoscopic display system
AU2002311452B2 (en) * 2001-06-19 2008-06-19 Speech Sentinel Limited Speaker recognition system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02230896A (ja) * 1989-03-03 1990-09-13 Nippon Telegr & Teleph Corp <Ntt> 音響信号入力装置
JPH06236196A (ja) 1993-02-08 1994-08-23 Nippon Telegr & Teleph Corp <Ntt> 音声認識方法および装置
JPH0713591A (ja) * 1993-06-22 1995-01-17 Hitachi Ltd 音声認識装置および音声認識方法
JPH0788791A (ja) * 1993-09-20 1995-04-04 Mitsubishi Electric Corp ロボット装置およびその周辺装置
JPH1113507A (ja) * 1997-06-27 1999-01-19 Mitsubishi Motors Corp 自動追従走行システム

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1441328A4

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106328141A (zh) * 2016-09-05 2017-01-11 南京大学 一种面向移动终端的超声波唇读识别装置及方法
CN109377991A (zh) * 2018-09-30 2019-02-22 珠海格力电器股份有限公司 一种智能设备控制方法及装置
CN112151080A (zh) * 2020-10-28 2020-12-29 成都启英泰伦科技有限公司 一种录制和处理训练语料的方法
CN112151080B (zh) * 2020-10-28 2021-08-03 成都启英泰伦科技有限公司 一种录制和处理训练语料的方法

Also Published As

Publication number Publication date
US7031917B2 (en) 2006-04-18
US20060143006A1 (en) 2006-06-29
EP1441328A4 (en) 2005-11-23
US20040054531A1 (en) 2004-03-18
DE60234530D1 (de) 2010-01-07
CN1488134A (zh) 2004-04-07
EP1441328A1 (en) 2004-07-28
EP1441328B1 (en) 2009-11-25
JP2003131683A (ja) 2003-05-09
US7321853B2 (en) 2008-01-22

Similar Documents

Publication Publication Date Title
WO2003036617A1 (fr) Appareil de reconnaissance vocale et procede de reconnaissance de la parole
EP1635327B1 (en) Information transmission device
Wand et al. Session-independent EMG-based Speech Recognition.
KR100933108B1 (ko) 함축적인 화자 적응을 사용하는 음성 인식 시스템
WO2019214047A1 (zh) 建立声纹模型的方法、装置、计算机设备和存储介质
WO2002097590A3 (en) Language independent and voice operated information management system
AU2003218398A1 (en) Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition
EP1507255A3 (en) Bubble splitting for compact acoustic modeling
CN101627427A (zh) 声音强调装置及声音强调方法
WO2006023631A3 (en) Document transcription system training
Aggarwal et al. Performance evaluation of sequentially combined heterogeneous feature streams for Hindi speech recognition system
US20170084266A1 (en) Voice synthesis apparatus and method for synthesizing voice
EP1241662A3 (en) Method of speech recognition with compensation for both channel distortion and background noise
JP2004198831A (ja) 音声認識装置および方法、プログラム、並びに記録媒体
EP1471501A3 (en) Speech recognition apparatus, speech recognition method, and recording medium on which speech recognition program is computer-readable recorded
EP1378885A3 (en) Word-spotting apparatus, word-spotting method, and word-spotting program
WO2004068893A3 (en) Method and apparatus for noise suppression within a distributed speech recognition system
DE60008893D1 (de) Sprachgesteuertes tragbares Endgerät
US7353173B2 (en) System and method for Mandarin Chinese speech recognition using an optimized phone set
KR20040038419A (ko) 음성을 이용한 감정인식 시스템 및 감정인식 방법
Nguyen et al. Vietnamese voice recognition for home automation using MFCC and DTW techniques
Tolba et al. Speech recognition by intelligent machines
Okuno et al. Computational auditory scene analysis and its application to robot audition: Five years experience
JP2004170756A (ja) ロボット制御装置および方法、記録媒体、並びにプログラム
JP2001188783A (ja) 情報処理装置および方法、並びに記録媒体

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CN US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR

WWE Wipo information: entry into national phase

Ref document number: 2002802031

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10451285

Country of ref document: US

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 028040511

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2002802031

Country of ref document: EP