CN111522524B - 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 - Google Patents
一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 Download PDFInfo
- Publication number
- CN111522524B CN111522524B CN202010198293.4A CN202010198293A CN111522524B CN 111522524 B CN111522524 B CN 111522524B CN 202010198293 A CN202010198293 A CN 202010198293A CN 111522524 B CN111522524 B CN 111522524B
- Authority
- CN
- China
- Prior art keywords
- gesture
- recognition
- attention
- area
- control instruction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 80
- 238000013527 convolutional neural network Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 6
- 230000000007 visual effect Effects 0.000 claims description 6
- 230000006870 function Effects 0.000 description 28
- 238000004891 communication Methods 0.000 description 10
- 230000009286 beneficial effect Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 230000004927 fusion Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 238000003491 array Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/438—Presentation of query results
- G06F16/4387—Presentation of query results by the use of playlists
- G06F16/4393—Multimedia presentations, e.g. slide shows, multimedia albums
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0484—Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/28—Recognition of hand or arm movements, e.g. recognition of deaf sign language
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010198293.4A CN111522524B (zh) | 2020-03-19 | 2020-03-19 | 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010198293.4A CN111522524B (zh) | 2020-03-19 | 2020-03-19 | 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111522524A CN111522524A (zh) | 2020-08-11 |
CN111522524B true CN111522524B (zh) | 2023-01-03 |
Family
ID=71901784
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010198293.4A Active CN111522524B (zh) | 2020-03-19 | 2020-03-19 | 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111522524B (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114664295B (zh) * | 2020-12-07 | 2024-08-13 | 北京小米移动软件有限公司 | 用于机器人的语音识别方法、装置及机器人 |
CN112750437A (zh) * | 2021-01-04 | 2021-05-04 | 欧普照明股份有限公司 | 控制方法、控制装置及电子设备 |
CN113425079A (zh) * | 2021-06-15 | 2021-09-24 | 同济大学 | 一种智能演讲台机器人 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102520793A (zh) * | 2011-11-30 | 2012-06-27 | 苏州奇可思信息科技有限公司 | 基于手势识别的会议演示交互方法 |
CN108536302A (zh) * | 2018-04-17 | 2018-09-14 | 中国矿业大学 | 一种基于人体手势和语音的教学方法及系统 |
CN108920128A (zh) * | 2018-07-12 | 2018-11-30 | 苏州思必驰信息科技有限公司 | 演示文稿的操作方法及系统 |
CN208905094U (zh) * | 2018-09-30 | 2019-05-24 | 海南小青桔网络科技有限公司 | 一种基于kinect的会议内容控制系统 |
-
2020
- 2020-03-19 CN CN202010198293.4A patent/CN111522524B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102520793A (zh) * | 2011-11-30 | 2012-06-27 | 苏州奇可思信息科技有限公司 | 基于手势识别的会议演示交互方法 |
CN108536302A (zh) * | 2018-04-17 | 2018-09-14 | 中国矿业大学 | 一种基于人体手势和语音的教学方法及系统 |
CN108920128A (zh) * | 2018-07-12 | 2018-11-30 | 苏州思必驰信息科技有限公司 | 演示文稿的操作方法及系统 |
CN208905094U (zh) * | 2018-09-30 | 2019-05-24 | 海南小青桔网络科技有限公司 | 一种基于kinect的会议内容控制系统 |
Also Published As
Publication number | Publication date |
---|---|
CN111522524A (zh) | 2020-08-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102270394B1 (ko) | 이미지를 인식하기 위한 방법, 단말, 및 저장 매체 | |
US10922530B2 (en) | Display device and operating method thereof with adjustments related to an image display according to bending motion of the display device | |
CN110674719B (zh) | 目标对象匹配方法及装置、电子设备和存储介质 | |
CN111522524B (zh) | 一种基于会议机器人的演示文稿控制方法、装置、存储介质及终端 | |
JP2021526698A (ja) | 画像生成方法および装置、電子機器、並びに記憶媒体 | |
KR101756042B1 (ko) | 입력 처리 방법, 장치 및 설비 | |
KR102193029B1 (ko) | 디스플레이 장치 및 그의 화상 통화 수행 방법 | |
CN109145970B (zh) | 基于图像的问答处理方法和装置、电子设备及存储介质 | |
CN113065591B (zh) | 目标检测方法及装置、电子设备和存储介质 | |
EP2597623A2 (en) | Apparatus and method for providing augmented reality service for mobile terminal | |
EP4287068A1 (en) | Model training method, scene recognition method, and related device | |
CN109495616B (zh) | 一种拍照方法及终端设备 | |
CN108922531B (zh) | 槽位识别方法、装置、电子设备及存储介质 | |
CN111242303A (zh) | 网络训练方法及装置、图像处理方法及装置 | |
CN110135349A (zh) | 识别方法、装置、设备及存储介质 | |
CN111382748A (zh) | 图像翻译方法、装置及存储介质 | |
CN113727021A (zh) | 拍摄方法、装置及电子设备 | |
CN110633715B (zh) | 图像处理方法、网络训练方法及装置、和电子设备 | |
CN108055461B (zh) | 自拍角度的推荐方法、装置、终端设备及存储介质 | |
CN111626922B (zh) | 图片生成方法、装置、电子设备及计算机可读存储介质 | |
CN112700783A (zh) | 通讯的变声方法、终端设备和存储介质 | |
CN110377914B (zh) | 字符识别方法、装置及存储介质 | |
KR102567003B1 (ko) | 전자 장치 및 그 동작방법 | |
KR20190108977A (ko) | 화면 제어 방법 및 이를 지원하는 전자 장치 | |
CN111367492B (zh) | 网页页面展示方法及装置、存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200826 Address after: Room 101, building 1, block C, Qianjiang Century Park, ningwei street, Xiaoshan District, Hangzhou City, Zhejiang Province Applicant after: Hangzhou Weiming Information Technology Co.,Ltd. Applicant after: Institute of Information Technology, Zhejiang Peking University Address before: Room 288-1, 857 Xinbei Road, Ningwei Town, Xiaoshan District, Hangzhou City, Zhejiang Province Applicant before: Institute of Information Technology, Zhejiang Peking University Applicant before: Hangzhou Weiming Information Technology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20200811 Assignee: Zhejiang smart video security Innovation Center Co.,Ltd. Assignor: Institute of Information Technology, Zhejiang Peking University Contract record no.: X2022330000930 Denomination of invention: A presentation control method, device, storage medium and terminal based on conference robot License type: Common License Record date: 20221229 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20200811 Assignee: Zhejiang Visual Intelligence Innovation Center Co.,Ltd. Assignor: Institute of Information Technology, Zhejiang Peking University|Hangzhou Weiming Information Technology Co.,Ltd. Contract record no.: X2023330000927 Denomination of invention: A presentation control method, device, storage medium, and terminal based on conference robots Granted publication date: 20230103 License type: Common License Record date: 20231219 |