CN108701455A - Information processing device, information processing method, and program - Google Patents

Information processing device, information processing method, and program

Info

Publication number
CN108701455A
Authority
CN
China
Prior art keywords
user
action
information processing
distance
case
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN201680081621.4A
Other languages
English (en)
Chinese (zh)
Inventor
桐原丽子
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp
Publication of CN108701455A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/04 Segmentation; Word boundary detection
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G10L15/07 Adaptation to the speaker
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/22 Interactive procedures; Man-machine interfaces
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225 Feedback of the input speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226 Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • User Interface Of Digital Computer (AREA)
  • Manipulator (AREA)
CN201680081621.4A 2016-02-18 2016-12-13 Information processing device, information processing method, and program Withdrawn CN108701455A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2016-028899 2016-02-18
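JP2016028899A JP2017144521A (ja) 2016-02-18 2016-02-18 Information processing device, information processing method, and program
PCT/JP2016/087096 WO2017141530A1 (ja) 2016-02-18 2016-12-13 Information processing device, information processing method, and program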

Publications (1)

Publication Number Publication Date
CN108701455A (zh) 2018-10-23

Family

ID=59625697

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201680081621.4A Withdrawn CN108701455A (zh) 2016-02-18 2016-12-13 Information processing device, information processing method, and program

Country Status (6)

Country Link
US (1) US11237794B2 (en)
EP (1) EP3419020B1 (en)
JP (1) JP2017144521A (en)
KR (1) KR20180113503A (en)
CN (1) CN108701455A (en)
WO (1) WO2017141530A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019072787A (ja) * 2017-10-13 2019-05-16 Sharp Corp Control device, robot, control method, and control program
JP2019079083A (ja) * 2017-10-19 2019-05-23 Aisin Seiki Co Ltd Driving assistance device
JP2019185389A (ja) * 2018-04-10 2019-10-24 Nippon Telegraph And Telephone Corp Information processing device, information processing method, and information processing program
US10878279B2 (en) * 2018-05-04 2020-12-29 Google Llc Generating and/or adapting automated assistant content according to a distance between user(s) and an automated assistant interface
KR102523982B1 (ko) 2018-08-21 2023-04-20 Google Llc Dynamic and/or context-specific hot words to invoke automated assistant
WO2020040745A1 (en) * 2018-08-21 2020-02-27 Google Llc Dynamic and/or context-specific hot words to invoke automated assistant
KR102228866B1 (ko) * 2018-10-18 2021-03-17 LG Electronics Inc Robot and control method thereof
JP6887035B1 (ja) * 2020-02-26 2021-06-16 CyberAgent Inc Control system, control device, control method, and computer program
JP2022118998A (ja) * 2021-02-03 2022-08-16 Denso Ten Ltd Voice recognition response device and method, and in-vehicle device
CN120036016A (zh) * 2023-09-19 2025-05-23 Panasonic Intellectual Property Management Co Ltd Notification system, notification method, server device, and equipment

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2878712B2 (ja) * 1989-06-08 1999-04-05 Toshiba Corp Speech recognition device
JP3273620B2 (ja) * 1991-09-30 2002-04-08 Toshiba Corp Voice dialogue device
JP2007160440A (ja) 2005-12-12 2007-06-28 Honda Motor Co Ltd Control device for legged mobile robot
JP4751192B2 (ja) 2005-12-12 2011-08-17 Honda Motor Co Ltd Mobile robot
JP2007199552A (ja) * 2006-01-30 2007-08-09 Toyota Motor Corp Speech recognition device and speech recognition method
JP4976903B2 (ja) 2007-04-05 2012-07-18 Honda Motor Co Ltd Robot
KR101644421B1 (ko) * 2008-12-23 2016-08-03 Samsung Electronics Co Ltd Apparatus and method for providing content based on user's degree of interest
JP2010204260A (ja) * 2009-03-02 2010-09-16 Brother Ind Ltd Dialogue device
KR101581883B1 (ko) * 2009-04-30 2016-01-11 Samsung Electronics Co Ltd Apparatus and method for detecting voice using motion information
JP5834941B2 (ja) * 2012-01-19 2015-12-24 Fujitsu Ltd Attention target identification device, attention target identification method, and program
CN104488025A (zh) 2012-03-16 2015-04-01 Nuance Communications Inc User-dedicated automatic speech recognition
JP6171353B2 (ja) * 2013-01-18 2017-08-02 Ricoh Co Ltd Information processing device, system, information processing method, and program
US20170017501A1 (en) 2013-12-16 2017-01-19 Nuance Communications, Inc. Systems and methods for providing a virtual assistant
EP2911149B1 (en) 2014-02-19 2019-04-17 Nokia Technologies OY Determination of an operational directive based at least in part on a spatial audio property
US9338761B2 (en) 2014-02-26 2016-05-10 Empire Technology Development Llc Presence-based device mode modification
EP2933070A1 (en) * 2014-04-17 2015-10-21 Aldebaran Robotics Methods and systems of handling a dialog with a robot
JP2017138476A (ja) 2016-02-03 2017-08-10 Sony Corp Information processing device, information processing method, and program

Also Published As

Publication number Publication date
EP3419020A1 (en) 2018-12-26
EP3419020A4 (en) 2019-02-27
US11237794B2 (en) 2022-02-01
KR20180113503A (ko) 2018-10-16
EP3419020B1 (en) 2021-09-22
WO2017141530A1 (ja) 2017-08-24
US20190042188A1 (en) 2019-02-07
JP2017144521A (ja) 2017-08-24

Similar Documents

Publication Publication Date Title
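CN108701455A (zh) Information processing device, information processing method, and program
JP7250887B2 (ja) Generating IoT-based notifications and providing commands to cause automatic rendering of the IoT-based notifications by an automated assistant client of a client device
EP3300074B1 (en) Information processing apparatus
CN107408027B (zh) Information processing apparatus, control method, and program
JP6516585B2 (ja) Control device, method thereof, and program
JP6739907B2 (ja) Device identification method, device identification apparatus, and program
EP3279791A1 (en) Information processing device, control method, and program
JP2018190413A (ja) User command processing method and system for adjusting device operation and content provision range by understanding the manner of expression of a user utterance
CN112136102B (zh) Information processing device, information processing method, and information processing system
CN106462641A (zh) Methods, systems, and media for presenting content based on user preferences of multiple users in the presence of a media presentation device
WO2019107145A1 (ja) Information processing device and information processing method
CN110100257A (zh) Information processing device, information processing method, and program
JP6973380B2 (ja) Information processing device and information processing method
US11936718B2 (en) Information processing device and information processing method
WO2019017033A1 (ja) Information processing device, information processing method, and program
WO2018139050A1 (ja) Information processing device, information processing method, and program
CN110637296A (zh) Information processing device, information processing method, and program
CN110741330B (zh) Information processing device, information processing method, and program
WO2022107447A1 (ja) Information processing device, information processing method, and program
CN118900300A (zh) Data processing method, apparatus, electronic device, and storage medium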

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 2018-10-23