CN108391057A - 摄像头拍摄控制方法、装置、智能设备及计算机存储介质 - Google Patents
摄像头拍摄控制方法、装置、智能设备及计算机存储介质 Download PDFInfo
- Publication number
- CN108391057A CN108391057A CN201810300875.1A CN201810300875A CN108391057A CN 108391057 A CN108391057 A CN 108391057A CN 201810300875 A CN201810300875 A CN 201810300875A CN 108391057 A CN108391057 A CN 108391057A
- Authority
- CN
- China
- Prior art keywords
- voice data
- scene
- voice
- camera
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 58
- 238000000605 extraction Methods 0.000 claims abstract description 30
- 238000004590 computer program Methods 0.000 claims description 19
- 238000003066 decision tree Methods 0.000 claims description 18
- 238000010801 machine learning Methods 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 13
- 239000000284 extract Substances 0.000 claims description 12
- 238000003384 imaging method Methods 0.000 claims description 7
- 238000005070 sampling Methods 0.000 claims description 4
- 238000001514 detection method Methods 0.000 claims description 2
- 238000004891 communication Methods 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 abstract description 4
- 230000006870 function Effects 0.000 description 11
- 238000004422 calculation algorithm Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 241000406668 Loxodonta cyclotis Species 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 241001633942 Dais Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 238000007477 logistic regression Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000007637 random forest analysis Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/667—Camera operation mode switching, e.g. between still and video, sport and normal or high- and low-resolution modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/695—Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Studio Devices (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (13)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810300875.1A CN108391057B (zh) | 2018-04-04 | 2018-04-04 | 摄像头拍摄控制方法、装置、智能设备及计算机存储介质 |
JP2019067344A JP6759406B2 (ja) | 2018-04-04 | 2019-03-29 | カメラ撮影制御方法、装置、インテリジェント装置およびコンピュータ記憶媒体 |
EP19167327.6A EP3550828B1 (en) | 2018-04-04 | 2019-04-04 | Method and device for controlling camera shooting, smart device and computer storage medium |
US16/375,399 US11445145B2 (en) | 2018-04-04 | 2019-04-04 | Method and device for controlling camera shooting, smart device and computer storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810300875.1A CN108391057B (zh) | 2018-04-04 | 2018-04-04 | 摄像头拍摄控制方法、装置、智能设备及计算机存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108391057A true CN108391057A (zh) | 2018-08-10 |
CN108391057B CN108391057B (zh) | 2020-10-16 |
Family
ID=63073605
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810300875.1A Active CN108391057B (zh) | 2018-04-04 | 2018-04-04 | 摄像头拍摄控制方法、装置、智能设备及计算机存储介质 |
Country Status (4)
Country | Link |
---|---|
US (1) | US11445145B2 (zh) |
EP (1) | EP3550828B1 (zh) |
JP (1) | JP6759406B2 (zh) |
CN (1) | CN108391057B (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111756986A (zh) * | 2019-03-27 | 2020-10-09 | 上海博泰悦臻电子设备制造有限公司 | 一种摄像头控制方法、存储介质、装置及具有其的电子设备 |
CN113377329A (zh) * | 2021-07-01 | 2021-09-10 | 安徽文香科技有限公司 | 一种虚拟音频设备、音频数据处理方法及装置 |
CN113724704A (zh) * | 2021-08-30 | 2021-11-30 | 深圳创维-Rgb电子有限公司 | 一种语音获取方法、装置、终端及存储介质 |
CN116801102A (zh) * | 2023-08-22 | 2023-09-22 | 瑞芯微电子股份有限公司 | 控制摄像头的方法、视频会议系统、电子设备和存储介质 |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3838127A1 (en) * | 2019-12-18 | 2021-06-23 | Koninklijke Philips N.V. | Device, system and method for monitoring of a subject |
WO2021145442A1 (ja) | 2020-01-16 | 2021-07-22 | 日本製鉄株式会社 | ホットスタンプ成形体 |
MX2022008472A (es) | 2020-01-16 | 2022-08-02 | Nippon Steel Corp | Carroceria estampada en caliente. |
CN113556499B (zh) * | 2020-04-07 | 2023-05-09 | 上海汽车集团股份有限公司 | 一种车载视频通话方法及车载系统 |
US11178357B1 (en) | 2020-09-22 | 2021-11-16 | Roku, Inc. | Streaming a video chat from a mobile device to a display device using a rotating base |
US11743570B1 (en) * | 2022-05-18 | 2023-08-29 | Motorola Solutions, Inc. | Camera parameter adjustment based on frequency shift |
CN114900644B (zh) * | 2022-07-13 | 2022-10-21 | 杭州全能数字科技有限公司 | 一种视频会议中云台相机的预置位远程操作方法及系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102256098A (zh) * | 2010-05-18 | 2011-11-23 | 宝利通公司 | 具有多个语音跟踪摄像机的视频会议端点 |
CN104618456A (zh) * | 2015-01-13 | 2015-05-13 | 小米科技有限责任公司 | 信息发布方法及装置 |
CN107452372A (zh) * | 2017-09-22 | 2017-12-08 | 百度在线网络技术(北京)有限公司 | 远场语音识别模型的训练方法和装置 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7518631B2 (en) * | 2005-06-28 | 2009-04-14 | Microsoft Corporation | Audio-visual control system |
US11317057B2 (en) * | 2006-03-18 | 2022-04-26 | Steve H MCNELLEY | Advanced telepresence environments |
US8289363B2 (en) * | 2006-12-28 | 2012-10-16 | Mark Buckler | Video conferencing |
US8358328B2 (en) * | 2008-11-20 | 2013-01-22 | Cisco Technology, Inc. | Multiple video camera processing for teleconferencing |
CN101534413B (zh) * | 2009-04-14 | 2012-07-04 | 华为终端有限公司 | 一种远程呈现的系统、装置和方法 |
JP2013046217A (ja) * | 2011-08-24 | 2013-03-04 | Nec Casio Mobile Communications Ltd | 撮影装置、撮影画像処理方法、およびプログラム |
US8773498B2 (en) * | 2011-09-30 | 2014-07-08 | Polycom, Inc. | Background compression and resolution enhancement technique for video telephony and video conferencing |
US10248862B2 (en) * | 2014-07-23 | 2019-04-02 | Ebay Inc. | Use of camera metadata for recommendations |
US9398258B1 (en) * | 2015-03-26 | 2016-07-19 | Cisco Technology, Inc. | Method and system for video conferencing units |
EP3335204B1 (en) * | 2015-05-18 | 2020-01-15 | Booz Allen Hamilton Inc. | Portable aerial reconnaissance targeting intelligence device |
JP2017034312A (ja) * | 2015-07-28 | 2017-02-09 | 株式会社リコー | 通信装置、通信システム、およびプログラム |
CN106888361A (zh) * | 2015-12-11 | 2017-06-23 | 深圳市轻生活科技有限公司 | 视频交互控制方法和装置 |
JP6766086B2 (ja) * | 2017-09-28 | 2020-10-07 | キヤノン株式会社 | 撮像装置およびその制御方法 |
JP2019117375A (ja) * | 2017-12-26 | 2019-07-18 | キヤノン株式会社 | 撮像装置及びその制御方法及びプログラム |
CN108737719A (zh) * | 2018-04-04 | 2018-11-02 | 深圳市冠旭电子股份有限公司 | 摄像头拍摄控制方法、装置、智能设备及存储介质 |
-
2018
- 2018-04-04 CN CN201810300875.1A patent/CN108391057B/zh active Active
-
2019
- 2019-03-29 JP JP2019067344A patent/JP6759406B2/ja active Active
- 2019-04-04 EP EP19167327.6A patent/EP3550828B1/en active Active
- 2019-04-04 US US16/375,399 patent/US11445145B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102256098A (zh) * | 2010-05-18 | 2011-11-23 | 宝利通公司 | 具有多个语音跟踪摄像机的视频会议端点 |
CN104618456A (zh) * | 2015-01-13 | 2015-05-13 | 小米科技有限责任公司 | 信息发布方法及装置 |
CN107452372A (zh) * | 2017-09-22 | 2017-12-08 | 百度在线网络技术(北京)有限公司 | 远场语音识别模型的训练方法和装置 |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111756986A (zh) * | 2019-03-27 | 2020-10-09 | 上海博泰悦臻电子设备制造有限公司 | 一种摄像头控制方法、存储介质、装置及具有其的电子设备 |
CN113377329A (zh) * | 2021-07-01 | 2021-09-10 | 安徽文香科技有限公司 | 一种虚拟音频设备、音频数据处理方法及装置 |
CN113377329B (zh) * | 2021-07-01 | 2024-04-26 | 安徽文香科技股份有限公司 | 一种虚拟音频设备、音频数据处理方法及装置 |
CN113724704A (zh) * | 2021-08-30 | 2021-11-30 | 深圳创维-Rgb电子有限公司 | 一种语音获取方法、装置、终端及存储介质 |
CN113724704B (zh) * | 2021-08-30 | 2024-07-02 | 深圳创维-Rgb电子有限公司 | 一种语音获取方法、装置、终端及存储介质 |
CN116801102A (zh) * | 2023-08-22 | 2023-09-22 | 瑞芯微电子股份有限公司 | 控制摄像头的方法、视频会议系统、电子设备和存储介质 |
CN116801102B (zh) * | 2023-08-22 | 2024-02-09 | 瑞芯微电子股份有限公司 | 控制摄像头的方法、视频会议系统、电子设备和存储介质 |
Also Published As
Publication number | Publication date |
---|---|
US11445145B2 (en) | 2022-09-13 |
EP3550828A1 (en) | 2019-10-09 |
JP2019186931A (ja) | 2019-10-24 |
EP3550828B1 (en) | 2022-02-23 |
JP6759406B2 (ja) | 2020-09-23 |
CN108391057B (zh) | 2020-10-16 |
US20190313057A1 (en) | 2019-10-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108391057A (zh) | 摄像头拍摄控制方法、装置、智能设备及计算机存储介质 | |
CN107911644B (zh) | 基于虚拟人脸表情进行视频通话的方法及装置 | |
JP4219682B2 (ja) | 自動マルチカメラ映像合成 | |
CN104731880B (zh) | 图片排序方法和装置 | |
CN105376515B (zh) | 用于视频通讯的通讯信息的呈现方法、装置及系统 | |
CN106331504A (zh) | 拍摄方法及装置 | |
CN108600632A (zh) | 拍照提示方法、智能眼镜及计算机可读存储介质 | |
CN106791235A (zh) | 一种选择服务座席的方法、装置及系统 | |
CN101682696A (zh) | 移动终端、移动终端的控制方法、移动终端的控制程序和记录了该控制程序的计算机可读记录介质 | |
CN108600633A (zh) | 一种拍摄角度确定方法、装置、终端及可读存储介质 | |
CN106375872A (zh) | 一种视频剪辑方法及装置 | |
CN108337471B (zh) | 视频画面的处理方法及装置 | |
CN107632814A (zh) | 音频信息的播放方法、装置和系统、存储介质、处理器 | |
CN110572570B (zh) | 一种多人场景的智能识别拍摄的方法、系统及存储介质 | |
CN111081257A (zh) | 一种语音采集方法、装置、设备及存储介质 | |
CN110636315B (zh) | 一种多人虚拟直播方法、装置、电子设备及存储介质 | |
CN108898591A (zh) | 图像质量的评分方法及装置、电子设备、可读存储介质 | |
CN110298296A (zh) | 应用于边缘计算设备的人脸识别方法 | |
CN112637490A (zh) | 视频制作方法、装置、电子设备及存储介质 | |
CN109427038A (zh) | 一种手机照片显示方法和系统 | |
CN110297929A (zh) | 图像匹配方法、装置、电子设备及存储介质 | |
CN104869283B (zh) | 一种拍摄方法及电子设备 | |
CN104469092B (zh) | 一种图像采集方法及电子设备 | |
CN110072055A (zh) | 基于人工智能的视频制作方法及系统 | |
CN110191280A (zh) | 基于盖板显示的拍照方法及相关产品 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Camera shooting control method, device, intelligent device, and computer storage medium Effective date of registration: 20230824 Granted publication date: 20201016 Pledgee: Shenzhen Rural Commercial Bank Co.,Ltd. Pingdi Sub branch Pledgor: SHENZHEN GRANDSUN ELECTRONIC Co.,Ltd. Registration number: Y2023980053775 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Granted publication date: 20201016 Pledgee: Shenzhen Rural Commercial Bank Co.,Ltd. Pingdi Sub branch Pledgor: SHENZHEN GRANDSUN ELECTRONIC Co.,Ltd. Registration number: Y2023980053775 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Camera shooting control method, device, intelligent equipment, and computer storage medium Granted publication date: 20201016 Pledgee: Shenzhen Rural Commercial Bank Co.,Ltd. Pingdi Sub branch Pledgor: SHENZHEN GRANDSUN ELECTRONIC Co.,Ltd. Registration number: Y2024980034910 |