CN104838339B - Portable terminal device and information processing system - Google Patents
Portable terminal device and information processing system
- Publication number
- CN104838339B
- Authority
- CN
- China
- Prior art keywords
- lip
- operator
- voice
- data
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/002—Specific input/output arrangements not covered by G06F3/01 - G06F3/16
- G06F3/005—Input arrangements through a video camera
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/012—Head tracking input arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/0482—Interaction with lists of selectable items, e.g. menus
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
- G10L15/25—Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/038—Indexing scheme relating to G06F3/038
- G06F2203/0381—Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2250/00—Details of telephonic subscriber devices
- H04M2250/52—Details of telephonic subscriber devices including functional features of a camera
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2250/00—Details of telephonic subscriber devices
- H04M2250/74—Details of telephonic subscriber devices with voice recognition means
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- User Interface Of Digital Computer (AREA)
- Telephone Function (AREA)
Claims (10)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013000297A JP5902632B2 (ja) | 2013-01-07 | 2013-01-07 | Portable terminal device and information processing system |
JP2013-000297 | 2013-01-07 | ||
PCT/JP2013/083815 WO2014106927A1 (ja) | 2013-01-07 | 2013-12-18 | Portable terminal device and information processing system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104838339A (zh) | 2015-08-12 |
CN104838339B (zh) | 2018-03-13 |
Family
ID=51062249
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201380064683.0A Active CN104838339B (zh) | 2013-01-07 | 2013-12-18 | Portable terminal device and information processing system |
Country Status (4)
Country | Link |
---|---|
US (4) | US10303433B2 (zh) |
JP (1) | JP5902632B2 (zh) |
CN (1) | CN104838339B (zh) |
WO (1) | WO2014106927A1 (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6518134B2 (ja) | 2015-05-27 | 2019-05-22 | 株式会社ソニー・インタラクティブエンタテインメント | Display device worn in front of the eyes |
CN106919891B (zh) * | 2015-12-26 | 2019-08-23 | 腾讯科技(深圳)有限公司 | Image processing method and apparatus |
US10360441B2 (en) | 2015-11-25 | 2019-07-23 | Tencent Technology (Shenzhen) Company Limited | Image processing method and apparatus |
CN105632497A (zh) * | 2016-01-06 | 2016-06-01 | 昆山龙腾光电有限公司 | Voice output method and voice output system |
JP6532085B2 (ja) * | 2016-03-07 | 2019-06-19 | セイコーソリューションズ株式会社 | Order management system |
JP2018091954A (ja) * | 2016-12-01 | 2018-06-14 | オリンパス株式会社 | Speech recognition device and speech recognition method |
CN107679449B (zh) * | 2017-08-17 | 2018-08-03 | 平安科技(深圳)有限公司 | Lip movement capture method, device, and storage medium |
EP3450372A1 (en) * | 2017-08-28 | 2019-03-06 | Otis Elevator Company | Spoken command interface |
KR102417524B1 (ko) * | 2017-10-13 | 2022-07-07 | 현대자동차주식회사 | Vehicle control method based on speech recognition |
JP7081164B2 (ja) * | 2018-01-17 | 2022-06-07 | 株式会社Jvcケンウッド | Display control device, communication device, display control method, and communication method |
JP7010012B2 (ja) * | 2018-01-17 | 2022-01-26 | 株式会社Jvcケンウッド | Audio output control device, electronic apparatus, audio output control method, and program |
CN108521516A (zh) * | 2018-03-30 | 2018-09-11 | 百度在线网络技术(北京)有限公司 | Control method and apparatus for a terminal device |
CN108538291A (zh) * | 2018-04-11 | 2018-09-14 | 百度在线网络技术(北京)有限公司 | Voice control method, terminal device, cloud server, and system |
WO2019219968A1 (en) * | 2018-05-18 | 2019-11-21 | Deepmind Technologies Limited | Visual speech recognition by phoneme prediction |
CN111049664A (zh) * | 2018-10-11 | 2020-04-21 | 中兴通讯股份有限公司 | Network alarm processing method, device, and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
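CN101111886A (zh) * | 2005-01-28 | 2008-01-23 | 京瓷株式会社 | Utterance content recognition device and utterance content recognition method |
CN102104651A (zh) * | 2009-12-22 | 2011-06-22 | 康佳集团股份有限公司 | Method for playing a reserved voice message when a mobile terminal receives an incoming call, and the mobile terminal |
CN102117115A (zh) * | 2009-12-31 | 2011-07-06 | 上海量科电子科技有限公司 | System for text input selection using lip reading, and implementation method thereof |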
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3710205B2 (ja) * | 1996-06-05 | 2005-10-26 | 沖電気工業株式会社 | Speech recognition device |
US6014625A (en) * | 1996-12-30 | 2000-01-11 | Daewoo Electronics Co., Ltd | Method and apparatus for producing lip-movement parameters in a three-dimensional-lip-model |
JP2000068882A (ja) * | 1998-08-17 | 2000-03-03 | Matsushita Electric Ind Co Ltd | Wireless communication device |
AU780674B2 (en) * | 1999-10-27 | 2005-04-07 | Keyless Systems Ltd. | Integrated keypad system |
JP2001358828A (ja) | 2000-06-10 | 2001-12-26 | Masahiko Okuno | Mobile device, fingerprint authentication method for a mobile device, and recording medium storing a fingerprint authentication program for a mobile device |
JP2002118623A (ja) * | 2000-10-06 | 2002-04-19 | Matsushita Electric Ind Co Ltd | Mobile communication device |
JP4222742B2 (ja) * | 2001-05-07 | 2009-02-12 | 株式会社リコー | Mobile wireless terminal |
JP2002368870A (ja) | 2001-06-04 | 2002-12-20 | Nec Corp | Mobile communication terminal device |
JP2004246095A (ja) * | 2003-02-14 | 2004-09-02 | Nec Saitama Ltd | Mobile phone device and remote control method |
JP2005184485A (ja) | 2003-12-19 | 2005-07-07 | Casio Comput Co Ltd | Imaging device, operation control method for an imaging device, and program |
JP2007041089A (ja) | 2005-08-01 | 2007-02-15 | Hitachi Ltd | Information terminal and speech recognition program |
US20070048695A1 (en) * | 2005-08-31 | 2007-03-01 | Wen-Chen Huang | Interactive scoring system for learning language |
JP2009009170A (ja) * | 2005-10-24 | 2009-01-15 | Advanced Media Inc | Information retrieval system and server device |
JP2007280179A (ja) * | 2006-04-10 | 2007-10-25 | Mitsubishi Electric Corp | Mobile terminal |
KR101502003B1 (ko) * | 2008-07-08 | 2015-03-12 | 엘지전자 주식회사 | Mobile terminal and text input method thereof |
JP2010026731A (ja) | 2008-07-17 | 2010-02-04 | Nec Saitama Ltd | Character input device, character input method, character input system, character input server, and terminal |
JP2010272077A (ja) * | 2009-05-25 | 2010-12-02 | Toshiba Corp | Information reproduction method and information reproduction device |
JP5341678B2 (ja) | 2009-08-27 | 2013-11-13 | 京セラ株式会社 | Communication system |
KR101092820B1 (ko) * | 2009-09-22 | 2011-12-12 | 현대자동차주식회사 | Multimodal interface system integrating lip reading and speech recognition |
JP2011071937A (ja) * | 2009-09-28 | 2011-04-07 | Kyocera Corp | Electronic apparatus |
JP2011186994A (ja) * | 2010-03-11 | 2011-09-22 | Fujitsu Ltd | Character input device and character input method |
US8635066B2 (en) * | 2010-04-14 | 2014-01-21 | T-Mobile Usa, Inc. | Camera-assisted noise cancellation and speech recognition |
US8700392B1 (en) * | 2010-09-10 | 2014-04-15 | Amazon Technologies, Inc. | Speech-inclusive device interfaces |
WO2013097075A1 (en) * | 2011-12-26 | 2013-07-04 | Intel Corporation | Vehicle based determination of occupant audio and visual input |
KR101891259B1 (ko) * | 2012-04-04 | 2018-09-28 | 삼성전자주식회사 | Method for supporting intelligent event information output, and terminal |
TW201342278A (zh) * | 2012-04-06 | 2013-10-16 | Wei-Yen Yeh | Information integration interactive system and method thereof |
-
2013
- 2013-01-07 JP JP2013000297A patent/JP5902632B2/ja active Active
- 2013-12-18 CN CN201380064683.0A patent/CN104838339B/zh active Active
- 2013-12-18 WO PCT/JP2013/083815 patent/WO2014106927A1/ja active Application Filing
- 2013-12-18 US US14/651,002 patent/US10303433B2/en active Active
-
2019
- 2019-04-29 US US16/396,985 patent/US11487502B2/en active Active
-
2022
- 2022-10-20 US US17/969,868 patent/US11861264B2/en active Active
-
2023
- 2023-10-12 US US18/379,239 patent/US20240036815A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
US11861264B2 (en) | 2024-01-02 |
US11487502B2 (en) | 2022-11-01 |
WO2014106927A1 (ja) | 2014-07-10 |
JP5902632B2 (ja) | 2016-04-13 |
JP2014132396A (ja) | 2014-07-17 |
US10303433B2 (en) | 2019-05-28 |
US20190250884A1 (en) | 2019-08-15 |
US20230039067A1 (en) | 2023-02-09 |
US20240036815A1 (en) | 2024-02-01 |
CN104838339A (zh) | 2015-08-12 |
US20150324168A1 (en) | 2015-11-12 |
Similar Documents
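Publication | Publication Date | Title |
---|---|---|
CN104838339B (zh) | | Portable terminal device and information processing system |
US11605229B2 (en) | | Inmate tracking system in a controlled environment |
CN107169430B (zh) | | Reading-environment sound effect enhancement system and method based on image processing and semantic analysis |
CN107395352B (zh) | | Voiceprint-based identity recognition method and apparatus |
CN100470540C (zh) | | Storing and retrieving multimedia data and associated annotation data in a mobile telephone system |
CN110434853B (zh) | | Robot control method, device, and storage medium |
CN106104569A (zh) | | Method and apparatus for establishing a connection between electronic devices |
CN103794214A (zh) | | Information processing method, apparatus, and electronic device |
JP2006190296A (ja) | | Context extraction in a multimedia communication system, and information providing apparatus and method using the same |
JP2009540414A (ja) | | Media identification |
CN104484037A (zh) | | Method for intelligent control via a wearable device, and the wearable device |
CN106104575A (zh) | | Fingerprint template generation method and apparatus |
CN102905233A (zh) | | Method and apparatus for recommending terminal functions |
CN110073673A (zh) | | Facial recognition system |
CN105354284B (zh) | | Template processing method and apparatus, and short message recognition method and apparatus |
CN107766820A (zh) | | Image classification method and apparatus |
CN105550235A (zh) | | Information acquisition method and apparatus |
CN109727342A (zh) | | Recognition method and apparatus for an access control system, access control system, and storage medium |
CN106027801A (zh) | | Communication message processing method and apparatus, and mobile device |
CN107690038A (zh) | | Service voice navigation method and apparatus |
CN104010060A (zh) | | Method for identifying the identity of an incoming caller, and electronic device |
CN107977187B (zh) | | Reverberation adjustment method and electronic device |
CN105843401A (zh) | | Camera-based instruction input method and apparatus for screen-reading applications |
JP2003044497A (ja) | | Mobile illustrated reference book |
CN109784267B (zh) | | Mobile-terminal multi-source fusion image semantic content generation system and method |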
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
EXSB | Decision made by SIPO to initiate substantive examination | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 2017-12-13. Address after: Kyoto, Japan. Applicant after: MAXELL, Ltd. Address before: Osaka. Applicant before: Hitachi Maxell, Ltd. |
GR01 | Patent grant | |
CP01 | Change in the name or title of a patent holder | Address after: Kyoto, Japan. Patentee after: MAXELL, Ltd. Address before: Kyoto, Japan. Patentee before: MAXELL HOLDINGS, Ltd. |
TR01 | Transfer of patent right | Effective date of registration: 2022-06-10. Address after: Kyoto, Japan. Patentee after: MAXELL HOLDINGS, Ltd. Address before: Kyoto, Japan. Patentee before: MAXELL, Ltd. |