DE60234530D1 - Vorrichtung und verfahren zur spracherkennung - Google Patents

Vorrichtung und verfahren zur spracherkennung

Info

Publication number
DE60234530D1
DE60234530D1 DE60234530T DE60234530T DE60234530D1 DE 60234530 D1 DE60234530 D1 DE 60234530D1 DE 60234530 T DE60234530 T DE 60234530T DE 60234530 T DE60234530 T DE 60234530T DE 60234530 D1 DE60234530 D1 DE 60234530D1
Authority
DE
Germany
Prior art keywords
language recognition
language
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60234530T
Other languages
English (en)
Inventor
Yasuharu Asano
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Application granted granted Critical
Publication of DE60234530D1 publication Critical patent/DE60234530D1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Toys (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
  • Measurement Of Optical Distance (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Manipulator (AREA)
  • Length Measuring Devices By Optical Means (AREA)
  • Traffic Control Systems (AREA)
DE60234530T 2001-10-22 2002-10-21 Vorrichtung und verfahren zur spracherkennung Expired - Lifetime DE60234530D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001323012A JP2003131683A (ja) 2001-10-22 2001-10-22 音声認識装置および音声認識方法、並びにプログラムおよび記録媒体
PCT/JP2002/010868 WO2003036617A1 (fr) 2001-10-22 2002-10-21 Appareil de reconnaissance vocale et procede de reconnaissance de la parole

Publications (1)

Publication Number Publication Date
DE60234530D1 true DE60234530D1 (de) 2010-01-07

Family

ID=19139964

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60234530T Expired - Lifetime DE60234530D1 (de) 2001-10-22 2002-10-21 Vorrichtung und verfahren zur spracherkennung

Country Status (6)

Country Link
US (2) US7031917B2 (de)
EP (1) EP1441328B1 (de)
JP (1) JP2003131683A (de)
CN (1) CN1488134A (de)
DE (1) DE60234530D1 (de)
WO (1) WO2003036617A1 (de)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113628621A (zh) * 2021-08-18 2021-11-09 北京声智科技有限公司 一种实现设备就近唤醒的方法、系统及装置

Families Citing this family (61)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090030552A1 (en) * 2002-12-17 2009-01-29 Japan Science And Technology Agency Robotics visual and auditory system
EP1587329B1 (de) * 2003-01-20 2015-04-15 Sanyo Electric Co., Ltd. Bereitstellungsverfahren für dreidimensionales video und anzeigeeinrichtung für dreidimensionales video
US8209185B2 (en) 2003-09-05 2012-06-26 Emc Corporation Interface for management of auditory communications
US8103873B2 (en) * 2003-09-05 2012-01-24 Emc Corporation Method and system for processing auditory communications
DE602004021716D1 (de) * 2003-11-12 2009-08-06 Honda Motor Co Ltd Spracherkennungssystem
US8180743B2 (en) * 2004-07-01 2012-05-15 Emc Corporation Information management
US8244542B2 (en) * 2004-07-01 2012-08-14 Emc Corporation Video surveillance
US20060004818A1 (en) * 2004-07-01 2006-01-05 Claudatos Christopher H Efficient information management
US8180742B2 (en) * 2004-07-01 2012-05-15 Emc Corporation Policy-based information management
US9268780B2 (en) 2004-07-01 2016-02-23 Emc Corporation Content-driven information lifecycle management
US8229904B2 (en) 2004-07-01 2012-07-24 Emc Corporation Storage pools for information management
JP4600736B2 (ja) * 2004-07-22 2010-12-15 ソニー株式会社 ロボット制御装置および方法、記録媒体、並びにプログラム
US8626514B2 (en) 2004-08-31 2014-01-07 Emc Corporation Interface for management of multiple auditory communications
JP4204541B2 (ja) * 2004-12-24 2009-01-07 株式会社東芝 対話型ロボット、対話型ロボットの音声認識方法および対話型ロボットの音声認識プログラム
CN101427154A (zh) * 2005-09-21 2009-05-06 皇家飞利浦电子股份有限公司 使用远程位置麦克风进行语音激活控制的超声成像系统
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
JP5223673B2 (ja) * 2006-06-29 2013-06-26 日本電気株式会社 音声処理装置およびプログラム、並びに、音声処理方法
JP4469880B2 (ja) * 2007-08-09 2010-06-02 株式会社東芝 音声処理装置及び方法
CN101411946B (zh) * 2007-10-19 2012-03-28 鸿富锦精密工业(深圳)有限公司 玩具恐龙
JP5075664B2 (ja) * 2008-02-15 2012-11-21 株式会社東芝 音声対話装置及び支援方法
TW200937348A (en) * 2008-02-19 2009-09-01 Univ Nat Chiao Tung Calibration method for image capturing device
US20090287489A1 (en) * 2008-05-15 2009-11-19 Palm, Inc. Speech processing for plurality of users
CN101610360A (zh) * 2008-06-19 2009-12-23 鸿富锦精密工业(深圳)有限公司 自动追踪声源的摄像装置
US8532989B2 (en) * 2009-09-03 2013-09-10 Honda Motor Co., Ltd. Command recognition device, command recognition method, and command recognition robot
US8676581B2 (en) * 2010-01-22 2014-03-18 Microsoft Corporation Speech recognition analysis via identification information
JP5393544B2 (ja) * 2010-03-12 2014-01-22 本田技研工業株式会社 ロボット、ロボット制御方法およびプログラム
EP2550614A4 (de) * 2010-03-23 2013-09-18 Nokia Corp Verfahren und vorrichtung zur bestimmung der altersspanne eines benutzers
US8700392B1 (en) * 2010-09-10 2014-04-15 Amazon Technologies, Inc. Speech-inclusive device interfaces
US9274744B2 (en) 2010-09-10 2016-03-01 Amazon Technologies, Inc. Relative position-inclusive device interfaces
US8886532B2 (en) * 2010-10-27 2014-11-11 Microsoft Corporation Leveraging interaction context to improve recognition confidence scores
US20120143611A1 (en) * 2010-12-07 2012-06-07 Microsoft Corporation Trajectory Tiling Approach for Text-to-Speech
KR101791907B1 (ko) 2011-01-04 2017-11-02 삼성전자주식회사 위치 기반의 음향 처리 장치 및 방법
US9223415B1 (en) 2012-01-17 2015-12-29 Amazon Technologies, Inc. Managing resource usage for task performance
JP5862349B2 (ja) * 2012-02-16 2016-02-16 株式会社Jvcケンウッド ノイズ低減装置、音声入力装置、無線通信装置、およびノイズ低減方法
US8831957B2 (en) * 2012-08-01 2014-09-09 Google Inc. Speech recognition models based on location indicia
US9208777B2 (en) * 2013-01-25 2015-12-08 Microsoft Technology Licensing, Llc Feature space transformation for personalization using generalized i-vector clustering
US20150206539A1 (en) * 2013-06-04 2015-07-23 Ims Solutions, Inc. Enhanced human machine interface through hybrid word recognition and dynamic speech synthesis tuning
JP6169910B2 (ja) * 2013-07-08 2017-07-26 本田技研工業株式会社 音声処理装置
US9310800B1 (en) * 2013-07-30 2016-04-12 The Boeing Company Robotic platform evaluation system
US9847082B2 (en) 2013-08-23 2017-12-19 Honeywell International Inc. System for modifying speech recognition and beamforming using a depth image
US11199906B1 (en) 2013-09-04 2021-12-14 Amazon Technologies, Inc. Global user input management
US9367203B1 (en) 2013-10-04 2016-06-14 Amazon Technologies, Inc. User interface techniques for simulating three-dimensional depth
CN104715753B (zh) * 2013-12-12 2018-08-31 联想(北京)有限公司 一种数据处理的方法及电子设备
US9472186B1 (en) * 2014-01-28 2016-10-18 Nvoq Incorporated Automated training of a user audio profile using transcribed medical record recordings
CN103928025B (zh) * 2014-04-08 2017-06-27 华为技术有限公司 一种语音识别的方法及移动终端
CN104267920B (zh) * 2014-09-29 2017-10-27 北京奇艺世纪科技有限公司 用户识别方法、装置、系统及显示模式切换方法、装置
US10403267B2 (en) * 2015-01-16 2019-09-03 Samsung Electronics Co., Ltd Method and device for performing voice recognition using grammar model
JP6703460B2 (ja) * 2016-08-25 2020-06-03 本田技研工業株式会社 音声処理装置、音声処理方法及び音声処理プログラム
CN106356064A (zh) * 2016-08-30 2017-01-25 合肥前瞻智能科技有限公司 一种定向声控开关语音识别系统
CN106328141B (zh) * 2016-09-05 2019-06-14 南京大学 一种面向移动终端的超声波唇读识别装置及方法
US10140987B2 (en) * 2016-09-16 2018-11-27 International Business Machines Corporation Aerial drone companion device and a method of operating an aerial drone companion device
KR20180037543A (ko) * 2016-10-04 2018-04-12 삼성전자주식회사 음성 인식 전자 장치
US20180158458A1 (en) * 2016-10-21 2018-06-07 Shenetics, Inc. Conversational voice interface of connected devices, including toys, cars, avionics, mobile, iot and home appliances
JP6705410B2 (ja) * 2017-03-27 2020-06-03 カシオ計算機株式会社 音声認識装置、音声認識方法、プログラム及びロボット
KR102012968B1 (ko) * 2018-08-07 2019-08-27 주식회사 서큘러스 인터렉션 로봇의 제어 방법 및 제어 서버
CN109377991B (zh) * 2018-09-30 2021-07-23 珠海格力电器股份有限公司 一种智能设备控制方法及装置
CN109637540B (zh) * 2019-02-28 2021-02-26 北京百度网讯科技有限公司 智能语音设备的蓝牙评测方法、装置、设备及介质
CN110515449B (zh) * 2019-08-30 2021-06-04 北京安云世纪科技有限公司 唤醒智能设备的方法及装置
JP7395446B2 (ja) * 2020-09-08 2023-12-11 株式会社東芝 音声認識装置、方法およびプログラム
CN112151080B (zh) * 2020-10-28 2021-08-03 成都启英泰伦科技有限公司 一种录制和处理训练语料的方法
CN114464184B (zh) * 2022-04-11 2022-09-02 北京荣耀终端有限公司 语音识别的方法、设备和存储介质

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5946427A (ja) * 1982-09-10 1984-03-15 Matsushita Electric Ind Co Ltd 加熱装置
JPS63121097A (ja) * 1986-11-10 1988-05-25 松下電器産業株式会社 電話用音声認識装置
JPS63121098A (ja) * 1986-11-10 1988-05-25 松下電器産業株式会社 電話用音声認識装置
JPS63248218A (ja) * 1987-04-03 1988-10-14 Oki Electric Ind Co Ltd 適応制御フイルタ
JPH02132499A (ja) * 1988-11-14 1990-05-21 Toshiba Corp 音声入力装置
JPH02230896A (ja) * 1989-03-03 1990-09-13 Nippon Telegr & Teleph Corp <Ntt> 音響信号入力装置
US5008941A (en) 1989-03-31 1991-04-16 Kurzweil Applied Intelligence, Inc. Method and apparatus for automatically updating estimates of undesirable components of the speech signal in a speech recognition system
JPH05227531A (ja) * 1992-02-17 1993-09-03 Sanyo Electric Co Ltd カメラ監視システム
US5307405A (en) 1992-09-25 1994-04-26 Qualcomm Incorporated Network echo canceller
JPH06236196A (ja) * 1993-02-08 1994-08-23 Nippon Telegr & Teleph Corp <Ntt> 音声認識方法および装置
JPH0713591A (ja) 1993-06-22 1995-01-17 Hitachi Ltd 音声認識装置および音声認識方法
JP3426002B2 (ja) * 1993-09-20 2003-07-14 三菱電機株式会社 物体認識装置
JP3714706B2 (ja) * 1995-02-17 2005-11-09 株式会社竹中工務店 音抽出装置
US5905773A (en) * 1996-03-28 1999-05-18 Northern Telecom Limited Apparatus and method for reducing speech recognition vocabulary perplexity and dynamically selecting acoustic models
JP3480484B2 (ja) * 1997-06-27 2003-12-22 三菱ふそうトラック・バス株式会社 自動追従走行システム
JPH11237897A (ja) * 1998-02-23 1999-08-31 Kenwood Corp 音響装置
JPH11296192A (ja) * 1998-04-10 1999-10-29 Pioneer Electron Corp 音声認識における音声特徴量の補正方法、音声認識方法、音声認識装置及び音声認識プログラムを記録した記録媒体
JP3919337B2 (ja) * 1998-06-19 2007-05-23 株式会社東海理化電機製作所 車両用音声認識装置
US6904405B2 (en) 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
US6752498B2 (en) 2001-05-14 2004-06-22 Eastman Kodak Company Adaptive autostereoscopic display system
WO2002103680A2 (en) 2001-06-19 2002-12-27 Securivox Ltd Speaker recognition system ____________________________________

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113628621A (zh) * 2021-08-18 2021-11-09 北京声智科技有限公司 一种实现设备就近唤醒的方法、系统及装置

Also Published As

Publication number Publication date
JP2003131683A (ja) 2003-05-09
CN1488134A (zh) 2004-04-07
WO2003036617A1 (fr) 2003-05-01
US7321853B2 (en) 2008-01-22
EP1441328A4 (de) 2005-11-23
EP1441328B1 (de) 2009-11-25
US20060143006A1 (en) 2006-06-29
US7031917B2 (en) 2006-04-18
EP1441328A1 (de) 2004-07-28
US20040054531A1 (en) 2004-03-18

Similar Documents

Publication Publication Date Title
DE60234530D1 (de) Vorrichtung und verfahren zur spracherkennung
DE60207863D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE60309822D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE60124559D1 (de) Einrichtung und verfahren zur spracherkennung
DE60317025D1 (de) Vorrichtung und Verfahren zur Gesichtserkennung
DE60213490D1 (de) Gerät und Verfahren zur Fingerabdruckerkennung
DE69923253D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE60237007D1 (de) Verfahren und vorrichtung zur kurzfristigen inspekrobustheit
ATE299060T1 (de) Verfahren und vorrichtung zur drehbearbeitung
DE60217597D1 (de) Gerät und Verfahren zur Personenerkennung
DE60310785D1 (de) Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache
DE60135686D1 (de) Vorrichtung und Verfahren zur Personenidentifizierung
DE602004023364D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE60236693D1 (de) Verfahren und Vorrichtung zur Bildverarbeitung
DE60228013D1 (de) Vorrichtung und Verfahren zur Fahrzeugsteuerung
DE60218252D1 (de) Verfahren und Vorrichtung zur Sprachtranskodierung
DE60132586D1 (de) Verfahren und vorrichtung zur abtastratenwandlung
DE60124225D1 (de) Verfahren und Vorrichtung zur Erkennung von Emotionen
DE69828141D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69806557D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE602004014675D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE50204204D1 (de) Verfahren und vorrichtung zur reihenbildung von packgütern
DE60229315D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE60023736D1 (de) Verfahren und vorrichtung zur spracherkennung mit verschiedenen sprachmodellen
DE50109323D1 (de) Verfahren und vorrichtung zur spracherkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition