CN105957521B - Voice and image composite interaction execution method and system for a robot - Google Patents
Voice and image composite interaction execution method and system for a robot
- Publication number
- CN105957521B (application number CN201610107985.7A)
- Authority
- CN
- China
- Prior art keywords
- sound source
- voice
- command
- robot
- human
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
- G10L15/25—Speech recognition using non-acoustical features using position of the lips, movement of the lips or face analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Abstract
Description
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610107985.7A CN105957521B (zh) | 2016-02-29 | 2016-02-29 | Voice and image composite interaction execution method and system for a robot |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610107985.7A CN105957521B (zh) | 2016-02-29 | 2016-02-29 | Voice and image composite interaction execution method and system for a robot |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105957521A (zh) | 2016-09-21 |
CN105957521B (zh) | 2020-07-10 |
Family
ID=56917242
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610107985.7A Active CN105957521B (zh) | Voice and image composite interaction execution method and system for a robot | 2016-02-29 | 2016-02-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105957521B (zh) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106599866B (zh) * | 2016-12-22 | 2020-06-02 | 上海百芝龙网络科技有限公司 | A multi-dimensional user identity recognition method |
CN106653041B (zh) * | 2017-01-17 | 2020-02-14 | 北京地平线信息技术有限公司 | Audio signal processing device, method, and electronic device |
US11178280B2 (en) * | 2017-06-20 | 2021-11-16 | Lenovo (Singapore) Pte. Ltd. | Input during conversational session |
CN107297745B (zh) * | 2017-06-28 | 2019-08-13 | 上海木木机器人技术有限公司 | Voice interaction method, voice interaction apparatus, and robot |
CN109493871A (zh) * | 2017-09-11 | 2019-03-19 | 上海博泰悦臻网络技术服务有限公司 | Multi-screen voice interaction method and apparatus for an in-vehicle system, storage medium, and head unit |
WO2019118089A1 (en) | 2017-12-11 | 2019-06-20 | Analog Devices, Inc. | Multi-modal far field user interfaces and vision-assisted audio processing |
CN109981970B (zh) * | 2017-12-28 | 2021-07-27 | 深圳市优必选科技有限公司 | Method, apparatus, and robot for determining a shooting scene |
CN108322855B (zh) * | 2018-02-11 | 2020-11-17 | 北京百度网讯科技有限公司 | Method and apparatus for acquiring audio information |
US11195525B2 (en) * | 2018-06-13 | 2021-12-07 | Panasonic Intellectual Property Corporation Of America | Operation terminal, voice inputting method, and computer-readable recording medium |
CN110889315B (zh) * | 2018-09-10 | 2023-04-28 | 北京市商汤科技开发有限公司 | Image processing method, apparatus, electronic device, and system |
CN109147813A (zh) * | 2018-09-21 | 2019-01-04 | 神思电子技术股份有限公司 | A noise reduction method for service robots based on audio-visual localization technology |
JP2020089947A (ja) * | 2018-12-06 | 2020-06-11 | ソニー株式会社 | Information processing apparatus, information processing method, and program |
CN110799913A (zh) * | 2018-12-29 | 2020-02-14 | 深圳市大疆创新科技有限公司 | Control method and apparatus for a ground remote-controlled robot |
CN109506568B (zh) * | 2018-12-29 | 2021-06-18 | 思必驰科技股份有限公司 | Sound source localization method and apparatus based on image recognition and speech recognition |
EP3712787B1 (en) * | 2019-03-18 | 2021-12-29 | Siemens Aktiengesellschaft | A method for generating a semantic description of a composite interaction |
CN110051289B (zh) * | 2019-04-03 | 2022-03-29 | 北京石头世纪科技股份有限公司 | Voice control method and apparatus for a sweeping robot, robot, and medium |
CN110390300A (zh) * | 2019-07-24 | 2019-10-29 | 北京洛必德科技有限公司 | Target following method and apparatus for a robot |
CN110524559B (zh) * | 2019-08-30 | 2022-06-10 | 成都未至科技有限公司 | Intelligent human-machine interaction system and method based on personnel behavior data |
CN111048113B (zh) * | 2019-12-18 | 2023-07-28 | 腾讯科技(深圳)有限公司 | Sound direction localization processing method, apparatus, system, computer device, and storage medium |
WO2022000174A1 (zh) * | 2020-06-29 | 2022-01-06 | 深圳市大疆创新科技有限公司 | Audio processing method, audio processing apparatus, and electronic device |
CN115862668B (zh) * | 2022-11-28 | 2023-10-24 | 之江实验室 | Method and system for a robot to determine an interaction object based on sound source localization |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100505837C (zh) * | 2007-05-10 | 2009-06-24 | 华为技术有限公司 | System and method for controlling an image acquisition device to perform target localization |
US9092394B2 (en) * | 2012-06-15 | 2015-07-28 | Honda Motor Co., Ltd. | Depth based context identification |
CN104269172A (zh) * | 2014-07-31 | 2015-01-07 | 广东美的制冷设备有限公司 | Voice control method and system based on video localization |
CN105234945A (zh) * | 2015-09-29 | 2016-01-13 | 塔米智能科技(北京)有限公司 | A reception robot based on network voice dialogue and somatosensory interaction |
- 2016-02-29: CN application CN201610107985.7A, patent CN105957521B (zh), status Active
Also Published As
Publication number | Publication date |
---|---|
CN105957521A (zh) | 2016-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105957521B (zh) | | Voice and image composite interaction execution method and system for a robot |
CN103353935B (zh) | | A 3D dynamic gesture recognition method for a smart home system |
US8837780B2 (en) | | Gesture based human interfaces |
US10043064B2 (en) | | Method and apparatus of detecting object using event-based sensor |
Barzelay et al. | | Harmony in motion |
KR102133728B1 (ko) | | Multimodal emotion recognition apparatus, method, and storage medium using artificial intelligence |
US20060104454A1 (en) | | Method for selectively picking up a sound signal |
CN110362210B (zh) | | Human-computer interaction method and apparatus fusing eye tracking and gesture recognition in virtual assembly |
US20110158476A1 (en) | | Robot and method for recognizing human faces and gestures thereof |
EP2584531A1 (en) | | Gesture recognition device, gesture recognition method, and program |
US8965068B2 (en) | | Apparatus and method for discriminating disguised face |
CN107894836B (zh) | | Human-computer interaction method for remote sensing image processing and display based on gesture and speech recognition |
US10013070B2 (en) | | System and method for recognizing hand gesture |
CN111048113A (zh) | | Sound direction localization processing method, apparatus, system, computer device, and storage medium |
US11790900B2 (en) | | System and method for audio-visual multi-speaker speech separation with location-based selection |
KR102290186B1 (ko) | | Emotion recognition method that processes images to determine a person's emotional state |
KR20120072009A (ko) | | Apparatus and method for recognizing multi-user interaction |
WO2007138503A1 (en) | | Method of driving a speech recognition system |
US20140321750A1 (en) | | Dynamic gesture recognition process and authoring system |
Joslin et al. | | Dynamic gesture recognition |
KR101553484B1 (ko) | | Hand gesture recognition apparatus and method thereof |
Brueckmann et al. | | Adaptive noise reduction and voice activity detection for improved verbal human-robot interaction using binaural data |
KR101158016B1 (ko) | | Apparatus and method for detecting upper body posture and hand shape |
US20190377938A1 (en) | | Device and method for recognizing gesture |
Jacob et al. | | Real time static and dynamic hand gestures cognizance for human computer interaction |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2022-08-03. Address after: No. 6 Fenhe Road, Jiaozhou Economic and Technological Development Zone, Qingdao, Shandong Province 266000. Patentee after: Qingdao Kelu Intelligent Technology Co.,Ltd. Address before: 266300 east of Shangde Avenue and south of Fenhe Road, Jiaozhou Economic Development Zone, Qingdao, Shandong. Patentee before: QINGDAO KRUND ROBOT Co.,Ltd. |
TR01 | Transfer of patent right | Effective date of registration: 2023-09-18. Address after: No. 6 Fenhe Road, Jiaozhou Economic and Technological Development Zone, Qingdao, Shandong Province 266000. Patentee after: Qingdao Luteng Intelligent Equipment Technology Co.,Ltd. Address before: No. 6 Fenhe Road, Jiaozhou Economic and Technological Development Zone, Qingdao, Shandong Province 266000. Patentee before: Qingdao Kelu Intelligent Technology Co.,Ltd. |