WO2003036617A1 - Appareil de reconnaissance vocale et procede de reconnaissance de la parole - Google Patents
Appareil de reconnaissance vocale et procede de reconnaissance de la parole Download PDFInfo
- Publication number
- WO2003036617A1 WO2003036617A1 PCT/JP2002/010868 JP0210868W WO03036617A1 WO 2003036617 A1 WO2003036617 A1 WO 2003036617A1 JP 0210868 W JP0210868 W JP 0210868W WO 03036617 A1 WO03036617 A1 WO 03036617A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech recognition
- distance
- acoustic model
- recognition unit
- speech
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Toys (AREA)
- Manipulator (AREA)
- Traffic Control Systems (AREA)
- Measurement Of Optical Distance (AREA)
- Closed-Circuit Television Systems (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
- Length Measuring Devices By Optical Means (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60234530T DE60234530D1 (de) | 2001-10-22 | 2002-10-21 | Vorrichtung und verfahren zur spracherkennung |
EP02802031A EP1441328B1 (en) | 2001-10-22 | 2002-10-21 | Speech recognition apparatus and speech recognition method |
US10/451,285 US7031917B2 (en) | 2001-10-22 | 2002-10-21 | Speech recognition apparatus using distance based acoustic models |
US11/362,331 US7321853B2 (en) | 2001-10-22 | 2006-02-24 | Speech recognition apparatus and speech recognition method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2001-323012 | 2001-10-22 | ||
JP2001323012A JP2003131683A (ja) | 2001-10-22 | 2001-10-22 | 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10451285 A-371-Of-International | 2002-10-21 | ||
US11/362,331 Continuation US7321853B2 (en) | 2001-10-22 | 2006-02-24 | Speech recognition apparatus and speech recognition method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2003036617A1 true WO2003036617A1 (fr) | 2003-05-01 |
Family
ID=19139964
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2002/010868 WO2003036617A1 (fr) | 2001-10-22 | 2002-10-21 | Appareil de reconnaissance vocale et procede de reconnaissance de la parole |
Country Status (6)
Country | Link |
---|---|
US (2) | US7031917B2 (ja) |
EP (1) | EP1441328B1 (ja) |
JP (1) | JP2003131683A (ja) |
CN (1) | CN1488134A (ja) |
DE (1) | DE60234530D1 (ja) |
WO (1) | WO2003036617A1 (ja) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106328141A (zh) * | 2016-09-05 | 2017-01-11 | 南京大学 | 一种面向移动终端的超声波唇读识别装置及方法 |
CN109377991A (zh) * | 2018-09-30 | 2019-02-22 | 珠海格力电器股份有限公司 | 一种智能设备控制方法及装置 |
CN112151080A (zh) * | 2020-10-28 | 2020-12-29 | 成都启英泰伦科技有限公司 | 一种录制和处理训练语料的方法 |
Families Citing this family (59)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090030552A1 (en) * | 2002-12-17 | 2009-01-29 | Japan Science And Technology Agency | Robotics visual and auditory system |
WO2004071102A1 (ja) * | 2003-01-20 | 2004-08-19 | Sanyo Electric Co,. Ltd. | 立体視用映像提供方法及び立体映像表示装置 |
US8103873B2 (en) * | 2003-09-05 | 2012-01-24 | Emc Corporation | Method and system for processing auditory communications |
US8209185B2 (en) | 2003-09-05 | 2012-06-26 | Emc Corporation | Interface for management of auditory communications |
JP4516527B2 (ja) * | 2003-11-12 | 2010-08-04 | 本田技研工業株式会社 | 音声認識装置 |
US20060004818A1 (en) * | 2004-07-01 | 2006-01-05 | Claudatos Christopher H | Efficient information management |
US8180743B2 (en) * | 2004-07-01 | 2012-05-15 | Emc Corporation | Information management |
US9268780B2 (en) | 2004-07-01 | 2016-02-23 | Emc Corporation | Content-driven information lifecycle management |
US8229904B2 (en) | 2004-07-01 | 2012-07-24 | Emc Corporation | Storage pools for information management |
US8180742B2 (en) * | 2004-07-01 | 2012-05-15 | Emc Corporation | Policy-based information management |
US8244542B2 (en) * | 2004-07-01 | 2012-08-14 | Emc Corporation | Video surveillance |
JP4600736B2 (ja) * | 2004-07-22 | 2010-12-15 | ソニー株式会社 | ロボット制御装置および方法、記録媒体、並びにプログラム |
US8626514B2 (en) | 2004-08-31 | 2014-01-07 | Emc Corporation | Interface for management of multiple auditory communications |
JP4204541B2 (ja) * | 2004-12-24 | 2009-01-07 | 株式会社東芝 | 対話型ロボット、対話型ロボットの音声認識方法および対話型ロボットの音声認識プログラム |
KR20080046199A (ko) * | 2005-09-21 | 2008-05-26 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 원거리에 위치한 마이크로폰을 사용한 음성 작동 제어를가진 초음파 이미징 시스템 |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
WO2008001486A1 (fr) * | 2006-06-29 | 2008-01-03 | Nec Corporation | Dispositif et programme de traitement vocal, et procédé de traitement vocal |
JP4469880B2 (ja) * | 2007-08-09 | 2010-06-02 | 株式会社東芝 | 音声処理装置及び方法 |
CN101411946B (zh) * | 2007-10-19 | 2012-03-28 | 鸿富锦精密工业(深圳)有限公司 | 玩具恐龙 |
JP5075664B2 (ja) * | 2008-02-15 | 2012-11-21 | 株式会社東芝 | 音声対話装置及び支援方法 |
TW200937348A (en) * | 2008-02-19 | 2009-09-01 | Univ Nat Chiao Tung | Calibration method for image capturing device |
US20090287489A1 (en) * | 2008-05-15 | 2009-11-19 | Palm, Inc. | Speech processing for plurality of users |
CN101610360A (zh) * | 2008-06-19 | 2009-12-23 | 鸿富锦精密工业(深圳)有限公司 | 自动追踪声源的摄像装置 |
JP5617083B2 (ja) * | 2009-09-03 | 2014-11-05 | 本田技研工業株式会社 | コマンド認識装置、コマンド認識方法、及びコマンド認識ロボット |
US8676581B2 (en) * | 2010-01-22 | 2014-03-18 | Microsoft Corporation | Speech recognition analysis via identification information |
JP5393544B2 (ja) * | 2010-03-12 | 2014-01-22 | 本田技研工業株式会社 | ロボット、ロボット制御方法およびプログラム |
US9105053B2 (en) * | 2010-03-23 | 2015-08-11 | Nokia Technologies Oy | Method and apparatus for determining a user age range |
US9274744B2 (en) | 2010-09-10 | 2016-03-01 | Amazon Technologies, Inc. | Relative position-inclusive device interfaces |
US8700392B1 (en) * | 2010-09-10 | 2014-04-15 | Amazon Technologies, Inc. | Speech-inclusive device interfaces |
US8886532B2 (en) * | 2010-10-27 | 2014-11-11 | Microsoft Corporation | Leveraging interaction context to improve recognition confidence scores |
US20120143611A1 (en) * | 2010-12-07 | 2012-06-07 | Microsoft Corporation | Trajectory Tiling Approach for Text-to-Speech |
KR101791907B1 (ko) | 2011-01-04 | 2017-11-02 | 삼성전자주식회사 | 위치 기반의 음향 처리 장치 및 방법 |
US9223415B1 (en) | 2012-01-17 | 2015-12-29 | Amazon Technologies, Inc. | Managing resource usage for task performance |
JP5862349B2 (ja) * | 2012-02-16 | 2016-02-16 | 株式会社Jvcケンウッド | ノイズ低減装置、音声入力装置、無線通信装置、およびノイズ低減方法 |
US8831957B2 (en) * | 2012-08-01 | 2014-09-09 | Google Inc. | Speech recognition models based on location indicia |
US9208777B2 (en) * | 2013-01-25 | 2015-12-08 | Microsoft Technology Licensing, Llc | Feature space transformation for personalization using generalized i-vector clustering |
CA2914677A1 (en) * | 2013-06-04 | 2014-12-11 | Ims Solutions Inc. | Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning |
JP6169910B2 (ja) * | 2013-07-08 | 2017-07-26 | 本田技研工業株式会社 | 音声処理装置 |
US9310800B1 (en) * | 2013-07-30 | 2016-04-12 | The Boeing Company | Robotic platform evaluation system |
US9847082B2 (en) * | 2013-08-23 | 2017-12-19 | Honeywell International Inc. | System for modifying speech recognition and beamforming using a depth image |
US11199906B1 (en) | 2013-09-04 | 2021-12-14 | Amazon Technologies, Inc. | Global user input management |
US9367203B1 (en) | 2013-10-04 | 2016-06-14 | Amazon Technologies, Inc. | User interface techniques for simulating three-dimensional depth |
CN104715753B (zh) * | 2013-12-12 | 2018-08-31 | 联想(北京)有限公司 | 一种数据处理的方法及电子设备 |
US9472186B1 (en) * | 2014-01-28 | 2016-10-18 | Nvoq Incorporated | Automated training of a user audio profile using transcribed medical record recordings |
CN103928025B (zh) | 2014-04-08 | 2017-06-27 | 华为技术有限公司 | 一种语音识别的方法及移动终端 |
CN104267920B (zh) * | 2014-09-29 | 2017-10-27 | 北京奇艺世纪科技有限公司 | 用户识别方法、装置、系统及显示模式切换方法、装置 |
EP3958255A1 (en) * | 2015-01-16 | 2022-02-23 | Samsung Electronics Co., Ltd. | Method and device for performing voice recognition |
JP6703460B2 (ja) * | 2016-08-25 | 2020-06-03 | 本田技研工業株式会社 | 音声処理装置、音声処理方法及び音声処理プログラム |
CN106356064A (zh) * | 2016-08-30 | 2017-01-25 | 合肥前瞻智能科技有限公司 | 一种定向声控开关语音识别系统 |
US10140987B2 (en) * | 2016-09-16 | 2018-11-27 | International Business Machines Corporation | Aerial drone companion device and a method of operating an aerial drone companion device |
KR20180037543A (ko) * | 2016-10-04 | 2018-04-12 | 삼성전자주식회사 | 음성 인식 전자 장치 |
US20180158458A1 (en) * | 2016-10-21 | 2018-06-07 | Shenetics, Inc. | Conversational voice interface of connected devices, including toys, cars, avionics, mobile, iot and home appliances |
JP6705410B2 (ja) * | 2017-03-27 | 2020-06-03 | カシオ計算機株式会社 | 音声認識装置、音声認識方法、プログラム及びロボット |
KR102012968B1 (ko) * | 2018-08-07 | 2019-08-27 | 주식회사 서큘러스 | 인터렉션 로봇의 제어 방법 및 제어 서버 |
CN109637540B (zh) * | 2019-02-28 | 2021-02-26 | 北京百度网讯科技有限公司 | 智能语音设备的蓝牙评测方法、装置、设备及介质 |
CN110515449B (zh) * | 2019-08-30 | 2021-06-04 | 北京安云世纪科技有限公司 | 唤醒智能设备的方法及装置 |
JP7395446B2 (ja) * | 2020-09-08 | 2023-12-11 | 株式会社東芝 | 音声認識装置、方法およびプログラム |
CN113628621A (zh) * | 2021-08-18 | 2021-11-09 | 北京声智科技有限公司 | 一种实现设备就近唤醒的方法、系统及装置 |
CN114464184B (zh) * | 2022-04-11 | 2022-09-02 | 北京荣耀终端有限公司 | 语音识别的方法、设备和存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH02230896A (ja) * | 1989-03-03 | 1990-09-13 | Nippon Telegr & Teleph Corp <Ntt> | 音響信号入力装置 |
JPH06236196A (ja) | 1993-02-08 | 1994-08-23 | Nippon Telegr & Teleph Corp <Ntt> | 音声認識方法および装置 |
JPH0713591A (ja) * | 1993-06-22 | 1995-01-17 | Hitachi Ltd | 音声認識装置および音声認識方法 |
JPH0788791A (ja) * | 1993-09-20 | 1995-04-04 | Mitsubishi Electric Corp | ロボット装置およびその周辺装置 |
JPH1113507A (ja) * | 1997-06-27 | 1999-01-19 | Mitsubishi Motors Corp | 自動追従走行システム |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5946427A (ja) * | 1982-09-10 | 1984-03-15 | Matsushita Electric Ind Co Ltd | 加熱装置 |
JPS63121098A (ja) * | 1986-11-10 | 1988-05-25 | 松下電器産業株式会社 | 電話用音声認識装置 |
JPS63121097A (ja) * | 1986-11-10 | 1988-05-25 | 松下電器産業株式会社 | 電話用音声認識装置 |
JPS63248218A (ja) * | 1987-04-03 | 1988-10-14 | Oki Electric Ind Co Ltd | 適応制御フイルタ |
JPH02132499A (ja) * | 1988-11-14 | 1990-05-21 | Toshiba Corp | 音声入力装置 |
US5008941A (en) * | 1989-03-31 | 1991-04-16 | Kurzweil Applied Intelligence, Inc. | Method and apparatus for automatically updating estimates of undesirable components of the speech signal in a speech recognition system |
JPH05227531A (ja) * | 1992-02-17 | 1993-09-03 | Sanyo Electric Co Ltd | カメラ監視システム |
US5307405A (en) * | 1992-09-25 | 1994-04-26 | Qualcomm Incorporated | Network echo canceller |
JP3714706B2 (ja) * | 1995-02-17 | 2005-11-09 | 株式会社竹中工務店 | 音抽出装置 |
US5905773A (en) * | 1996-03-28 | 1999-05-18 | Northern Telecom Limited | Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models |
JPH11237897A (ja) * | 1998-02-23 | 1999-08-31 | Kenwood Corp | 音響装置 |
JPH11296192A (ja) * | 1998-04-10 | 1999-10-29 | Pioneer Electron Corp | 音声認識における音声特徴量の補正方法、音声認識方法、音声認識装置及び音声認識プログラムを記録した記録媒体 |
JP3919337B2 (ja) * | 1998-06-19 | 2007-05-23 | 株式会社東海理化電機製作所 | 車両用音声認識装置 |
US6904405B2 (en) * | 1999-07-17 | 2005-06-07 | Edwin A. Suominen | Message recognition using shared language model |
US6752498B2 (en) * | 2001-05-14 | 2004-06-22 | Eastman Kodak Company | Adaptive autostereoscopic display system |
AU2002311452B2 (en) * | 2001-06-19 | 2008-06-19 | Speech Sentinel Limited | Speaker recognition system |
-
2001
- 2001-10-22 JP JP2001323012A patent/JP2003131683A/ja active Pending
-
2002
- 2002-10-21 EP EP02802031A patent/EP1441328B1/en not_active Expired - Fee Related
- 2002-10-21 WO PCT/JP2002/010868 patent/WO2003036617A1/ja active Application Filing
- 2002-10-21 DE DE60234530T patent/DE60234530D1/de not_active Expired - Lifetime
- 2002-10-21 CN CNA028040511A patent/CN1488134A/zh active Pending
- 2002-10-21 US US10/451,285 patent/US7031917B2/en not_active Expired - Fee Related
-
2006
- 2006-02-24 US US11/362,331 patent/US7321853B2/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH02230896A (ja) * | 1989-03-03 | 1990-09-13 | Nippon Telegr & Teleph Corp <Ntt> | 音響信号入力装置 |
JPH06236196A (ja) | 1993-02-08 | 1994-08-23 | Nippon Telegr & Teleph Corp <Ntt> | 音声認識方法および装置 |
JPH0713591A (ja) * | 1993-06-22 | 1995-01-17 | Hitachi Ltd | 音声認識装置および音声認識方法 |
JPH0788791A (ja) * | 1993-09-20 | 1995-04-04 | Mitsubishi Electric Corp | ロボット装置およびその周辺装置 |
JPH1113507A (ja) * | 1997-06-27 | 1999-01-19 | Mitsubishi Motors Corp | 自動追従走行システム |
Non-Patent Citations (1)
Title |
---|
See also references of EP1441328A4 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106328141A (zh) * | 2016-09-05 | 2017-01-11 | 南京大学 | 一种面向移动终端的超声波唇读识别装置及方法 |
CN109377991A (zh) * | 2018-09-30 | 2019-02-22 | 珠海格力电器股份有限公司 | 一种智能设备控制方法及装置 |
CN112151080A (zh) * | 2020-10-28 | 2020-12-29 | 成都启英泰伦科技有限公司 | 一种录制和处理训练语料的方法 |
CN112151080B (zh) * | 2020-10-28 | 2021-08-03 | 成都启英泰伦科技有限公司 | 一种录制和处理训练语料的方法 |
Also Published As
Publication number | Publication date |
---|---|
US7031917B2 (en) | 2006-04-18 |
US20060143006A1 (en) | 2006-06-29 |
EP1441328A4 (en) | 2005-11-23 |
US20040054531A1 (en) | 2004-03-18 |
DE60234530D1 (de) | 2010-01-07 |
CN1488134A (zh) | 2004-04-07 |
EP1441328A1 (en) | 2004-07-28 |
EP1441328B1 (en) | 2009-11-25 |
JP2003131683A (ja) | 2003-05-09 |
US7321853B2 (en) | 2008-01-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2003036617A1 (fr) | Appareil de reconnaissance vocale et procede de reconnaissance de la parole | |
EP1635327B1 (en) | Information transmission device | |
Wand et al. | Session-independent EMG-based Speech Recognition. | |
KR100933108B1 (ko) | 함축적인 화자 적응을 사용하는 음성 인식 시스템 | |
WO2019214047A1 (zh) | 建立声纹模型的方法、装置、计算机设备和存储介质 | |
WO2002097590A3 (en) | Language independent and voice operated information management system | |
AU2003218398A1 (en) | Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition | |
EP1507255A3 (en) | Bubble splitting for compact acoustic modeling | |
CN101627427A (zh) | 声音强调装置及声音强调方法 | |
WO2006023631A3 (en) | Document transcription system training | |
Aggarwal et al. | Performance evaluation of sequentially combined heterogeneous feature streams for Hindi speech recognition system | |
US20170084266A1 (en) | Voice synthesis apparatus and method for synthesizing voice | |
EP1241662A3 (en) | Method of speech recognition with compensation for both channel distortion and background noise | |
JP2004198831A (ja) | 音声認識装置および方法、プログラム、並びに記録媒体 | |
EP1471501A3 (en) | Speech recognition apparatus, speech recognition method, and recording medium on which speech recognition program is computer-readable recorded | |
EP1378885A3 (en) | Word-spotting apparatus, word-spotting method, and word-spotting program | |
WO2004068893A3 (en) | Method and apparatus for noise suppression within a distributed speech recognition system | |
DE60008893D1 (de) | Sprachgesteuertes tragbares Endgerät | |
US7353173B2 (en) | System and method for Mandarin Chinese speech recognition using an optimized phone set | |
KR20040038419A (ko) | 음성을 이용한 감정인식 시스템 및 감정인식 방법 | |
Nguyen et al. | Vietnamese voice recognition for home automation using MFCC and DTW techniques | |
Tolba et al. | Speech recognition by intelligent machines | |
Okuno et al. | Computational auditory scene analysis and its application to robot audition: Five years experience | |
JP2004170756A (ja) | ロボット制御装置および方法、記録媒体、並びにプログラム | |
JP2001188783A (ja) | 情報処理装置および方法、並びに記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): CN US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002802031 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10451285 Country of ref document: US |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 028040511 Country of ref document: CN |
|
WWP | Wipo information: published in national office |
Ref document number: 2002802031 Country of ref document: EP |