CN104781782A - 信息处理设备、信息处理方法和程序 - Google Patents

信息处理设备、信息处理方法和程序 Download PDF

Info

Publication number
CN104781782A
CN104781782A CN201380057286.0A CN201380057286A CN104781782A CN 104781782 A CN104781782 A CN 104781782A CN 201380057286 A CN201380057286 A CN 201380057286A CN 104781782 A CN104781782 A CN 104781782A
Authority
CN
China
Prior art keywords
image
user
information processing
voice
display
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201380057286.0A
Other languages
English (en)
Chinese (zh)
Inventor
大村淳己
河野道成
池田卓郎
冈田宪一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of CN104781782A publication Critical patent/CN104781782A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition
    • G06V40/28Recognition of hand or arm movements, e.g. recognition of deaf sign language
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223Cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/02Recognising information on displays, dials, clocks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • User Interface Of Digital Computer (AREA)
CN201380057286.0A 2012-11-08 2013-10-01 信息处理设备、信息处理方法和程序 Pending CN104781782A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2012246118A JP5998861B2 (ja) 2012-11-08 2012-11-08 情報処理装置、情報処理方法及びプログラム
JP2012-246118 2012-11-08
PCT/JP2013/005859 WO2014073149A1 (en) 2012-11-08 2013-10-01 Information processing apparatus, information processing method, and program

Publications (1)

Publication Number Publication Date
CN104781782A true CN104781782A (zh) 2015-07-15

Family

ID=49510468

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380057286.0A Pending CN104781782A (zh) 2012-11-08 2013-10-01 信息处理设备、信息处理方法和程序

Country Status (5)

Country Link
US (1) US10438058B2 (enExample)
EP (1) EP2917824B1 (enExample)
JP (1) JP5998861B2 (enExample)
CN (1) CN104781782A (enExample)
WO (1) WO2014073149A1 (enExample)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869639A (zh) * 2016-03-21 2016-08-17 广东小天才科技有限公司 一种语音识别的方法及系统
CN108647002A (zh) * 2018-03-30 2018-10-12 联想(北京)有限公司 信息处理方法及电子设备
CN111033606A (zh) * 2017-08-31 2020-04-17 索尼公司 信息处理装置、信息处理方法和程序
WO2020244410A1 (zh) * 2019-06-03 2020-12-10 清华大学 基于捂嘴动作识别的语音交互唤醒电子设备、方法和介质

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102210433B1 (ko) * 2014-01-21 2021-02-01 삼성전자주식회사 전자 장치 및 이의 음성 인식 방법
JP6418820B2 (ja) * 2014-07-07 2018-11-07 キヤノン株式会社 情報処理装置、表示制御方法、及びコンピュータプログラム
JP6772839B2 (ja) * 2014-12-25 2020-10-21 ソニー株式会社 情報処理装置、情報処理方法およびプログラム
EP3276618A4 (en) * 2015-03-23 2018-11-07 Sony Corporation Information processing system and information processing method
USD777784S1 (en) * 2015-08-26 2017-01-31 Google Inc. Display screen with icon
JP6547551B2 (ja) * 2015-09-28 2019-07-24 ブラザー工業株式会社 カラオケ装置、プログラム
US20180018965A1 (en) * 2016-07-12 2018-01-18 Bose Corporation Combining Gesture and Voice User Interfaces
US20180039478A1 (en) * 2016-08-02 2018-02-08 Google Inc. Voice interaction services
KR102591413B1 (ko) * 2016-11-16 2023-10-19 엘지전자 주식회사 이동단말기 및 그 제어방법
EP3343483A1 (en) 2016-12-30 2018-07-04 Spotify AB System and method for providing a video with lyrics overlay for use in a social messaging environment
US10146501B1 (en) * 2017-06-01 2018-12-04 Qualcomm Incorporated Sound control by various hand gestures
CN109391884A (zh) * 2017-08-08 2019-02-26 惠州超声音响有限公司 扬声器系统及操控扬声器的方法
WO2019070242A1 (en) * 2017-10-03 2019-04-11 Google Llc DATA TRANSFERS FROM A MEMORY TO MANAGE GRAPHIC OUTPUT LATENCY
US11195525B2 (en) * 2018-06-13 2021-12-07 Panasonic Intellectual Property Corporation Of America Operation terminal, voice inputting method, and computer-readable recording medium
EP3848788A4 (en) * 2018-09-07 2021-11-10 Sony Group Corporation TERMINAL DEVICE AS WELL AS THE MANUFACTURING PROCESS OF THE SAME, AND RECORDING MEDIA
US11157086B2 (en) * 2020-01-28 2021-10-26 Pison Technology, Inc. Determining a geographical location based on human gestures
JP7491147B2 (ja) * 2020-08-31 2024-05-28 セイコーエプソン株式会社 表示システムの制御方法、表示システム、及び、表示装置の制御方法
US20240281194A1 (en) * 2021-06-08 2024-08-22 Hewlett-Packard Development Company, L.P. Gestures for switching audio endpoints
WO2024257321A1 (ja) * 2023-06-16 2024-12-19 三菱電機モビリティ株式会社 ジェスチャ検出装置、乗員監視システム、及びジェスチャ検出方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002108390A (ja) * 2000-09-27 2002-04-10 Sharp Corp 音声認識装置及びコンピュータ読み取り可能な記録媒体
WO2008109299A2 (en) * 2007-03-01 2008-09-12 Sony Computer Entertainment America Inc. System and method for communicating with a virtual world
CN100520912C (zh) * 2003-02-03 2009-07-29 三菱电机株式会社 车载控制装置

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1372660A (zh) * 2000-03-09 2002-10-02 皇家菲利浦电子有限公司 与消费电子系统进行交互的方法
JP3705735B2 (ja) * 2000-08-29 2005-10-12 シャープ株式会社 オンデマンド・インタフェース装置とそのウィンドウ表示装置
JP4689548B2 (ja) * 2006-07-19 2011-05-25 株式会社ソニー・コンピュータエンタテインメント 画像処理装置、画像処理方法、記録媒体、コンピュータプログラム、半導体デバイス
GB0703974D0 (en) * 2007-03-01 2007-04-11 Sony Comp Entertainment Europe Entertainment device
KR101502003B1 (ko) * 2008-07-08 2015-03-12 엘지전자 주식회사 이동 단말기 및 그 텍스트 입력 방법
KR20100088094A (ko) * 2009-01-29 2010-08-06 삼성전자주식회사 다중 입력 소스를 이용한 오브젝트 조작 장치
KR101623007B1 (ko) 2009-11-11 2016-05-20 엘지전자 주식회사 디스플레이 장치 및 그 제어방법
US9205706B2 (en) * 2010-01-27 2015-12-08 Bridgestone Americas Tire Operations, Llc Tire with noise-reducing tread pattern
KR101184460B1 (ko) * 2010-02-05 2012-09-19 연세대학교 산학협력단 마우스 포인터 제어 장치 및 방법
JP2012058838A (ja) 2010-09-06 2012-03-22 Sony Corp 画像処理装置、プログラム及び画像処理方法
US20120259638A1 (en) 2011-04-08 2012-10-11 Sony Computer Entertainment Inc. Apparatus and method for determining relevance of input speech
JP2013080015A (ja) * 2011-09-30 2013-05-02 Toshiba Corp 音声認識装置および音声認識方法
US9423870B2 (en) * 2012-05-08 2016-08-23 Google Inc. Input determination method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002108390A (ja) * 2000-09-27 2002-04-10 Sharp Corp 音声認識装置及びコンピュータ読み取り可能な記録媒体
CN100520912C (zh) * 2003-02-03 2009-07-29 三菱电机株式会社 车载控制装置
WO2008109299A2 (en) * 2007-03-01 2008-09-12 Sony Computer Entertainment America Inc. System and method for communicating with a virtual world

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105869639A (zh) * 2016-03-21 2016-08-17 广东小天才科技有限公司 一种语音识别的方法及系统
CN111033606A (zh) * 2017-08-31 2020-04-17 索尼公司 信息处理装置、信息处理方法和程序
US11460994B2 (en) 2017-08-31 2022-10-04 Sony Corporation Information processing apparatus and information processing method
CN108647002A (zh) * 2018-03-30 2018-10-12 联想(北京)有限公司 信息处理方法及电子设备
WO2020244410A1 (zh) * 2019-06-03 2020-12-10 清华大学 基于捂嘴动作识别的语音交互唤醒电子设备、方法和介质
US12112756B2 (en) 2019-06-03 2024-10-08 Tsinghua University Voice interaction wakeup electronic device, method and medium based on mouth-covering action recognition

Also Published As

Publication number Publication date
WO2014073149A1 (en) 2014-05-15
EP2917824B1 (en) 2018-07-25
JP5998861B2 (ja) 2016-09-28
JP2014095766A (ja) 2014-05-22
US10438058B2 (en) 2019-10-08
US20150262005A1 (en) 2015-09-17
EP2917824A1 (en) 2015-09-16

Similar Documents

Publication Publication Date Title
CN104781782A (zh) 信息处理设备、信息处理方法和程序
US10771845B2 (en) Information processing apparatus and method for estimating attribute of a user based on a voice input
US12003804B2 (en) Information processing device, information processing method, and computer program
US9965039B2 (en) Device and method for displaying user interface of virtual input device based on motion recognition
US10678563B2 (en) Display apparatus and method for controlling display apparatus
US11308694B2 (en) Image processing apparatus and image processing method
US11449307B2 (en) Remote controller for controlling an external device using voice recognition and method thereof
US11972761B2 (en) Electronic device for sharing user-specific voice command and method for controlling same
TW201344597A (zh) 顯示裝置控制方法、控制器及多媒體系統
KR20160014297A (ko) 전자 장치 및 이의 제어 방법
CN108881986A (zh) 影像显示装置,及其设定变更方法,设定变更程序
KR102328121B1 (ko) 전자 장치 및 그 동작 방법
KR102274317B1 (ko) 디바이스 정보에 기초하여 음성 인식을 수행하는 방법 및 장치
KR102464907B1 (ko) 전자 장치 및 그 동작 방법
WO2016152200A1 (ja) 情報処理システムおよび情報処理方法
CN112188249A (zh) 一种基于电子说明书的播放方法及显示设备
CN112256232B (zh) 显示设备与自然语言生成后处理方法
KR101992193B1 (ko) 적어도 하나 이상의 네트워크 인터페이스로 연결된 멀티미디어 디바이스 및 그 제어 방법
KR102455067B1 (ko) 전자 장치 및 그 제어 방법
KR102278213B1 (ko) 휴대 장치 및 휴대 장치의 화면 제어방법
CN117041645A (zh) 基于数字人的视频播放方法、装置、电子设备及存储介质
CN117809645A (zh) 显示设备、服务器及其各自对应的语音键盘输入识别方法
CN119166254A (zh) 一种服务器、显示设备及数字人交互方法

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20150715