WO2018145650A1 - Aircraft and control method therefor - Google Patents

Aircraft and control method therefor Download PDF

Info

Publication number
WO2018145650A1
WO2018145650A1 PCT/CN2018/075877 CN2018075877W WO2018145650A1 WO 2018145650 A1 WO2018145650 A1 WO 2018145650A1 CN 2018075877 W CN2018075877 W CN 2018075877W WO 2018145650 A1 WO2018145650 A1 WO 2018145650A1
Authority
WO
WIPO (PCT)
Prior art keywords
aircraft
control
image
gesture
module
Prior art date
Application number
PCT/CN2018/075877
Other languages
French (fr)
Chinese (zh)
Inventor
胡华智
孙海洋
Original Assignee
亿航智能设备(广州)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 亿航智能设备(广州)有限公司 filed Critical 亿航智能设备(广州)有限公司
Publication of WO2018145650A1 publication Critical patent/WO2018145650A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/10Simultaneous control of position or course in three dimensions
    • G05D1/101Simultaneous control of position or course in three dimensions specially adapted for aircraft
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107Static hand or arm
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/107Static hand or arm
    • G06V40/113Recognition of static hand signs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • G06V40/166Detection; Localisation; Normalisation using acquisition arrangements

Definitions

  • the present invention relates to the field of aircraft technology, and in particular, to an aircraft and a control method thereof.
  • UAV aircraft Because UAV aircraft have the advantages of good maneuverability, low cost and convenient use, UAV aircraft have been used in many industries, such as aerial photography, agricultural plant protection, surveying and mapping, etc.
  • the inventors have found that the prior art has the following problems: the existing drones are controlled by a remote control device or a mobile device such as a mobile phone.
  • the advantage is that it can realize accurate and real-time control of the drone, and the reliability is high; the disadvantage is that the additional cost is added and it is not conducive to carrying, and the operation is complicated, the problem is difficult to get started, and there are certain requirements for the novice.
  • the main object of the present invention is to provide an aircraft and a control method thereof, which aim to solve the problems of the prior art.
  • a first aspect of an embodiment of the present invention provides a method for controlling an aircraft, the method comprising the steps of:
  • the aircraft is controlled within the safe distance in accordance with a control command of the aircraft.
  • the recognizing the gesture of the person in the depth image, and acquiring the control instruction of the aircraft according to the gesture of the recognized person includes:
  • a control instruction of the aircraft is obtained according to the identified gesture category.
  • the gesture category includes a static gesture and a dynamic gesture.
  • the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
  • control of the specific function includes control of a camera camera function and control of a camera recording function.
  • the recognizing the gesture of the person in the depth image before acquiring the control instruction of the aircraft according to the gesture of the recognized person, further includes the steps of:
  • Adjusting the aircraft or pan/tilt angle according to the position of the face in the fused image within the safe distance ensures that the face position is located in the fused image center area.
  • a second aspect of the embodiments of the present invention provides an aircraft, where the aircraft includes a binocular camera, a depth image processing module, a safety distance processing module, a gesture recognition module, and a control module;
  • the binocular camera is configured to acquire an image
  • the depth image processing module is configured to perform depth processing on the image captured by the binocular camera to obtain a depth image
  • the safety distance processing module is configured to control a safety distance between the aircraft and a person according to the depth image and parameters of the binocular camera;
  • the gesture recognition module is configured to identify a gesture of a person in the depth image, and acquire a control instruction of the aircraft according to the gesture of the recognized person;
  • the control module is configured to control the aircraft according to a control instruction of the aircraft within the safety distance.
  • the gesture recognition module includes a segmentation processing module, a gesture category recognition module, and an acquisition module;
  • the segmentation processing module is configured to perform segmentation processing on the depth image to obtain a human hand image
  • the gesture category identifying module is configured to perform gesture category recognition by using a template matching or a cascade classifier on the human hand image;
  • the acquiring module is configured to acquire a control instruction of the aircraft according to the identified gesture category.
  • the gesture category includes a static gesture and a dynamic gesture.
  • the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
  • control of the specific function includes control of a camera camera function and control of a camera recording function.
  • the aircraft further includes an image fusion processing module, a face recognition tracking module, and a face location processing module;
  • the image fusion processing module is configured to perform fusion processing on the image captured by the binocular camera to obtain a fused image
  • the face recognition tracking module is configured to identify and track a face in the fused image, and determine a face position in the fused image;
  • the face position processing module is configured to adjust an aircraft or a pan/tilt angle according to a face position in the fused image within the safety distance to ensure that the face position is located in the fused image center area.
  • the aircraft and the control method thereof after performing deep processing on the image captured by the binocular camera, identify the gesture of the person in the depth image, and obtain the control instruction of the aircraft according to the recognized gesture of the person, and at a safe distance
  • the aircraft is controlled according to the control instructions of the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
  • FIG. 1 is a schematic flow chart of an aircraft control method according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a gesture recognition process of an aircraft control method according to an embodiment of the present invention
  • FIG. 3 is another schematic flowchart of an aircraft control method according to an embodiment of the present invention.
  • FIG. 4 is a schematic structural view of an aircraft according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of a gesture recognition module of an aircraft according to an embodiment of the present invention.
  • FIG. 6 is another schematic structural view of an aircraft according to an embodiment of the present invention.
  • a first embodiment of the present invention provides a method for controlling an aircraft, the method comprising the steps of:
  • the aircraft is provided with two cameras to form a set of binocular cameras.
  • the two cameras can be set side by side or side by side. Of course, it can also be staggered, that is, the two cameras are not on the same horizontal line or vertical line.
  • the two cameras are separated by a certain distance. In theory, the larger the separation distance, the better.
  • images are acquired simultaneously (synchronously) by the binocular camera, and the acquired images may be photos or video streams.
  • the two images collected by the two cameras are spliced side by side.
  • the image captured by the left camera can be spliced to the left
  • the image captured by the right camera is spliced to the right
  • a stereo image of the left and right format is obtained
  • the binocular camera is arranged side by side
  • the two cameras are The collected images are spliced together side by side.
  • the image captured by the upper camera can be spliced to the upper side
  • the image captured by the lower camera is spliced to the lower side, and finally a stereoscopic image of the top and bottom format is obtained.
  • the aircraft before performing image splicing, the aircraft first performs resolution reduction processing on the original image, and then splicing the reduced resolution image to reduce the final stereoscopic image size, thereby avoiding excessive bandwidth consumption in subsequent transmission. Resources, thereby increasing the transmission speed and improving the real-time performance of image transmission.
  • the aircraft first samples the two video streams collected by the two cameras into two preset resolution video streams, and then splicing the two preset resolution video streams into one stereo.
  • Video stream where the preset resolution is lower than the original resolution.
  • the two cameras of the aircraft each capture a 4K resolution video stream, and the aircraft first samples two 4K resolution video streams into two 720P format video streams, and then streams the two 720P format video streams (or Up and down) stitching into a stereo video stream with a resolution of 2560*720 in the left and right format (or top and bottom format).
  • the aircraft can also save the original images captured by the two cameras in a local storage space. Further, before the saving, the original image is also compressed to save storage space, such as compressing the video stream into a H.265 format video file.
  • the safety distance between the aircraft and the person is controlled according to the depth image and the parameters of the binocular camera.
  • Parameters can include focal length, accuracy, distortion factor, and more.
  • the step may include:
  • the gesture categories include static gestures and dynamic gestures.
  • the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
  • the control of specific functions includes the control of the camera function and the control of the camera recording function.
  • the six degrees of freedom refer to the forward, backward, leftward, rightward, upward and downward flight of the aircraft.
  • the recognized gesture when the recognized gesture is “index finger facing down”, the corresponding control aircraft may be landed; when the recognized gesture is “Victory” (ie, the index finger and the middle finger), the camera camera function of the aircraft may be correspondingly controlled; When the recognized gesture is "palm”, the camera recording function of the aircraft can be controlled correspondingly; when the recognized gesture is "thumbs up”, the corresponding camera of the controlling aircraft can stop the recording function; when the recognized gesture is "fist” "When the aircraft is controlled, it can fly in six degrees of freedom.
  • the method may further include the following steps:
  • the aircraft control method after performing deep processing on the image captured by the binocular camera, identifies a gesture of a person in the depth image, acquires a control instruction of the aircraft according to the recognized gesture of the person, and is within a safe distance according to the method.
  • the aircraft's control commands control the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
  • FIG. 4 is an aircraft according to a second embodiment of the present invention.
  • the aircraft includes: a binocular camera 20, a depth image processing module 21, a safety distance processing module 22, a gesture recognition module 23, and a control module 24;
  • the binocular camera 20 is used to acquire images.
  • the aircraft is provided with two cameras to form a set of binocular cameras.
  • the two cameras can be set side by side or side by side. Of course, it can also be staggered, that is, the two cameras are not on the same horizontal line or vertical line.
  • the two cameras are separated by a certain distance. In theory, the larger the separation distance, the better.
  • the images are acquired simultaneously (synchronously) by the binocular camera, and the acquired images may be photos or video streams.
  • the depth image processing module 21 is configured to perform depth processing on the image captured by the binocular camera 20 to obtain a depth image.
  • the two images collected by the two cameras are spliced side by side.
  • the image captured by the left camera can be spliced to the left
  • the image captured by the right camera is spliced to the right
  • a stereo image of the left and right format is obtained
  • the binocular camera is arranged side by side
  • the two cameras are The collected images are spliced together side by side.
  • the image captured by the upper camera can be spliced to the upper side
  • the image captured by the lower camera is spliced to the lower side, and finally a stereoscopic image of the top and bottom format is obtained.
  • the aircraft before performing image splicing, the aircraft first performs resolution reduction processing on the original image, and then splicing the reduced resolution image to reduce the final stereoscopic image size, thereby avoiding excessive bandwidth consumption in subsequent transmission. Resources, thereby increasing the transmission speed and improving the real-time performance of image transmission.
  • the aircraft first samples the two video streams collected by the two cameras into two preset resolution video streams, and then splicing the two preset resolution video streams into one stereo.
  • Video stream where the preset resolution is lower than the original resolution.
  • the two cameras of the aircraft each capture a 4K resolution video stream, and the aircraft first samples two 4K resolution video streams into two 720P format video streams, and then streams the two 720P format video streams (or Up and down) stitching into a stereo video stream with a resolution of 2560*720 in the left and right format (or top and bottom format).
  • the aircraft can also save the original images captured by the two cameras in a local storage space. Further, before the saving, the original image is also compressed to save storage space, such as compressing the video stream into a H.265 format video file.
  • the safety distance processing module 22 is configured to control the safety distance between the aircraft and the person according to the depth image and the parameters of the binocular camera.
  • the safety distance between the aircraft and the person is controlled according to the depth image and the parameters of the binocular camera.
  • Parameters can include focal length, accuracy, distortion factor, and more.
  • the gesture recognition module 23 is configured to identify a gesture of a person in the depth image, and acquire a control instruction of the aircraft according to the gesture of the recognized person.
  • the gesture recognition module 23 may include a segmentation processing module 231, a gesture category identification module 232, and an acquisition module 233;
  • the segmentation processing module 231 is configured to perform segmentation processing on the depth image to obtain a human hand image.
  • the gesture category identification module 232 is configured to perform gesture category recognition on the human hand image by using a template matching or a cascade classifier.
  • the obtaining module 233 is configured to acquire a control instruction of the aircraft according to the identified gesture category.
  • the gesture categories include static gestures and dynamic gestures.
  • the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
  • the control of specific functions includes the control of the camera function and the control of the camera recording function.
  • the six degrees of freedom refer to the forward, backward, leftward, rightward, upward and downward flight of the aircraft.
  • the recognized gesture when the recognized gesture is “index finger facing down”, the corresponding control aircraft may be landed; when the recognized gesture is “Victory” (ie, the index finger and the middle finger), the camera camera function of the aircraft may be correspondingly controlled; When the recognized gesture is "palm”, the camera recording function of the aircraft can be controlled correspondingly; when the recognized gesture is "thumbs up”, the corresponding camera of the controlling aircraft can stop the recording function; when the recognized gesture is "fist” "When the aircraft is controlled, it can fly in six degrees of freedom.
  • the control module 24 is configured to control the aircraft according to a control instruction of the aircraft within a safe distance.
  • the aircraft further includes an image fusion processing module 221, a face recognition tracking module 222, and a face location processing module 223;
  • the image fusion processing module 221 is configured to perform fusion processing on the image captured by the binocular camera to obtain the fused image;
  • the face recognition tracking module 222 is configured to identify and track a face in the fused image, and determine a face position in the fused image;
  • the face position processing module 223 is configured to adjust an aircraft or a pan/tilt angle according to a face position in the fused image within a safe distance to ensure that the face position is located in the fused image center area.
  • the aircraft after performing deep processing on the image captured by the binocular camera, identifies a gesture of a person in the depth image, acquires a control instruction of the aircraft according to the recognized gesture of the person, and is within a safe distance according to the aircraft Control commands to control the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
  • the aircraft and the control method thereof after performing deep processing on the image captured by the binocular camera, identify the gesture of the person in the depth image, and obtain the control instruction of the aircraft according to the recognized gesture of the person, and at a safe distance
  • the aircraft is controlled according to the control instructions of the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft. Therefore, it has industrial applicability.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • General Health & Medical Sciences (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Automation & Control Theory (AREA)
  • Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Image Analysis (AREA)

Abstract

An aircraft and a control method therefor, the method comprising the following steps: acquiring an image by means of a binocular camera (10); carrying out depth processing on the image acquired by the binocular camera to acquire a depth image (20); controlling the safe distance between an aircraft and a person according to the depth image and parameters of the binocular camera (30); recognizing a gesture of the person in the depth image, and acquiring a control instruction of the aircraft according to the recognized gesture of the person (40); and controlling the aircraft within the safe distance according to the control instruction of the aircraft (50). With the aircraft and the control method therefor, it is possible to control the aircraft by means of human gestures, thereby facilitating use by a user, reducing the purchase cost of the user, increasing the amusement in controlling the aircraft and enhancing the interaction between the user and the aircraft.

Description

一种飞行器及其控制方法Aircraft and control method thereof 技术领域Technical field
本发明涉及飞行器技术领域,尤其涉及一种飞行器及其控制方法。The present invention relates to the field of aircraft technology, and in particular, to an aircraft and a control method thereof.
背景技术Background technique
由于无人机飞行器具有可操控性好,低成本,使用方便等优点,因此,无人机飞行器已经被应用于许多行业,如在航拍、农业植保、测绘等多个领域已经得到广泛应用。Because UAV aircraft have the advantages of good maneuverability, low cost and convenient use, UAV aircraft have been used in many industries, such as aerial photography, agricultural plant protection, surveying and mapping, etc.
技术问题technical problem
在实现本发明的过程中,发明人发现现有技术存在以下问题:现有的无人机都是通过遥控装置或者移动装置,例如:手机,对其进行控制。优点是能够对无人机实现精准、实时控制,可靠性较高;缺点是增加额外成本且不利于携带,存在操作复杂,上手难度大的问题,而且对新手有一定的要求。In the process of implementing the present invention, the inventors have found that the prior art has the following problems: the existing drones are controlled by a remote control device or a mobile device such as a mobile phone. The advantage is that it can realize accurate and real-time control of the drone, and the reliability is high; the disadvantage is that the additional cost is added and it is not conducive to carrying, and the operation is complicated, the problem is difficult to get started, and there are certain requirements for the novice.
技术解决方案Technical solution
本发明的主要目的在于提出一种飞行器及其控制方法,旨在解决现有技术存在的问题。The main object of the present invention is to provide an aircraft and a control method thereof, which aim to solve the problems of the prior art.
为实现上述目的,本发明实施例第一方面提供一种飞行器的控制方法,所述方法包括步骤:To achieve the above objective, a first aspect of an embodiment of the present invention provides a method for controlling an aircraft, the method comprising the steps of:
通过双目摄像头采集图像;Acquiring images through a binocular camera;
对所述双目摄像头采集的图像进行深度处理,获取到深度图像;Performing depth processing on the image captured by the binocular camera to obtain a depth image;
根据所述深度图像和所述双目摄像头的参数,控制所述飞行器与人的安全距离;Controlling a safe distance between the aircraft and a person according to the depth image and parameters of the binocular camera;
识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令;Identifying a gesture of a person in the depth image, and acquiring a control instruction of the aircraft according to the gesture of the recognized person;
在所述安全距离内根据所述飞行器的控制指令,对所述飞行器进行控制。The aircraft is controlled within the safe distance in accordance with a control command of the aircraft.
进一步地,所述识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令包括:Further, the recognizing the gesture of the person in the depth image, and acquiring the control instruction of the aircraft according to the gesture of the recognized person includes:
对所述深度图像进行分割处理,获取到人手图像;Performing a segmentation process on the depth image to obtain a human hand image;
对所述人手图像采用模板匹配或者级联分类器进行手势类别识别;Using the template matching or the cascade classifier to perform gesture category recognition on the human hand image;
根据识别的手势类别,获取所述飞行器的控制指令。A control instruction of the aircraft is obtained according to the identified gesture category.
进一步地,所述手势类别包括静态手势和动态手势。Further, the gesture category includes a static gesture and a dynamic gesture.
进一步地,所述飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。Further, the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
进一步地,所述特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。Further, the control of the specific function includes control of a camera camera function and control of a camera recording function.
进一步地,所述识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令之前还包括步骤:Further, the recognizing the gesture of the person in the depth image, before acquiring the control instruction of the aircraft according to the gesture of the recognized person, further includes the steps of:
对所述双目摄像头采集的图像进行融合处理,获取到融合图像;Performing fusion processing on the image acquired by the binocular camera to obtain a fused image;
对所述融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置;Identifying and tracking a face in the fused image to determine a face position in the fused image;
在所述安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。Adjusting the aircraft or pan/tilt angle according to the position of the face in the fused image within the safe distance ensures that the face position is located in the fused image center area.
此外,为实现上述目的,本发明实施例第二方面提供一种飞行器,所述飞行器包括双目摄像头、深度图像处理模块、安全距离处理模块、手势识别模块以及控制模块;In addition, in order to achieve the above object, a second aspect of the embodiments of the present invention provides an aircraft, where the aircraft includes a binocular camera, a depth image processing module, a safety distance processing module, a gesture recognition module, and a control module;
所述双目摄像头,用于采集图像;The binocular camera is configured to acquire an image;
所述深度图像处理模块,用于对所述双目摄像头采集的图像进行深度处理,获取到深度图像;The depth image processing module is configured to perform depth processing on the image captured by the binocular camera to obtain a depth image;
所述安全距离处理模块,用于根据所述深度图像和所述双目摄像头的参数,控制所述飞行器与人的安全距离;The safety distance processing module is configured to control a safety distance between the aircraft and a person according to the depth image and parameters of the binocular camera;
所述手势识别模块,用于识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令;The gesture recognition module is configured to identify a gesture of a person in the depth image, and acquire a control instruction of the aircraft according to the gesture of the recognized person;
所述控制模块,用于在所述安全距离内根据所述飞行器的控制指令,对所述飞行器进行控制。The control module is configured to control the aircraft according to a control instruction of the aircraft within the safety distance.
进一步地,所述手势识别模块包括分割处理模块、手势类别识别模块以及获取模块;Further, the gesture recognition module includes a segmentation processing module, a gesture category recognition module, and an acquisition module;
所述分割处理模块,用于对所述深度图像进行分割处理,获取到人手图像;The segmentation processing module is configured to perform segmentation processing on the depth image to obtain a human hand image;
所述手势类别识别模块,用于对所述人手图像采用模板匹配或者级联分类器进行手势类别识别;The gesture category identifying module is configured to perform gesture category recognition by using a template matching or a cascade classifier on the human hand image;
所述获取模块,用于根据识别的手势类别,获取所述飞行器的控制指令。The acquiring module is configured to acquire a control instruction of the aircraft according to the identified gesture category.
进一步地,所述手势类别包括静态手势和动态手势。Further, the gesture category includes a static gesture and a dynamic gesture.
进一步地,所述飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。Further, the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions.
进一步地,所述特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。Further, the control of the specific function includes control of a camera camera function and control of a camera recording function.
进一步地,所述飞行器还包括图像融合处理模块、人脸识别跟踪模块以及人脸位置处理模块;Further, the aircraft further includes an image fusion processing module, a face recognition tracking module, and a face location processing module;
所述图像融合处理模块,用于对所述双目摄像头采集的图像进行融合处理,获取到融合图像;The image fusion processing module is configured to perform fusion processing on the image captured by the binocular camera to obtain a fused image;
所述人脸识别跟踪模块,用于对所述融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置;The face recognition tracking module is configured to identify and track a face in the fused image, and determine a face position in the fused image;
所述人脸位置处理模块,用于在所述安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。The face position processing module is configured to adjust an aircraft or a pan/tilt angle according to a face position in the fused image within the safety distance to ensure that the face position is located in the fused image center area.
有益效果Beneficial effect
本发明实施例提供的飞行器及其控制方法,通过对双目摄像头采集的图像进行深度处理后,识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令,并在安全距离内根据飞行器的控制指令,对飞行器进行控制。从而通过人的手势对飞行器进行控制,方便了用户的使用,降低了用户的购置成本,还可增加飞行器的操控乐趣,增强人与飞行器的互动。The aircraft and the control method thereof according to the embodiments of the present invention, after performing deep processing on the image captured by the binocular camera, identify the gesture of the person in the depth image, and obtain the control instruction of the aircraft according to the recognized gesture of the person, and at a safe distance The aircraft is controlled according to the control instructions of the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
附图说明DRAWINGS
图1为本发明实施例的飞行器控制方法流程示意图;1 is a schematic flow chart of an aircraft control method according to an embodiment of the present invention;
图2为本发明实施例的飞行器控制方法的手势识别流程示意图;2 is a schematic diagram of a gesture recognition process of an aircraft control method according to an embodiment of the present invention;
图3为本发明实施例的飞行器控制方法另一流程示意图;3 is another schematic flowchart of an aircraft control method according to an embodiment of the present invention;
图4为本发明实施例的飞行器结构示意图;4 is a schematic structural view of an aircraft according to an embodiment of the present invention;
图5为本发明实施例的飞行器的手势识别模块结构示意图;FIG. 5 is a schematic structural diagram of a gesture recognition module of an aircraft according to an embodiment of the present invention; FIG.
图6为本发明实施例的飞行器另一结构示意图。FIG. 6 is another schematic structural view of an aircraft according to an embodiment of the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。。The implementation, functional features, and advantages of the present invention will be further described in conjunction with the embodiments. .
本发明的实施方式Embodiments of the invention
应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
现在将参考附图描述实现本发明各个实施例的移动终端。在后续的描述中,使用用于表示元件的诸如“模块”、“部件”或“单元”的后缀仅为了有利于本发明的说明,其本身并没有特定的意义。因此,"模块"与"部件"可以混合地使用。A mobile terminal embodying various embodiments of the present invention will now be described with reference to the accompanying drawings. In the following description, the use of suffixes such as "module", "component" or "unit" for indicating an element is merely an explanation for facilitating the present invention, and does not have a specific meaning per se. Therefore, "module" and "component" can be used in combination.
如图1所示,本发明第一实施例提出一种飞行器的控制方法,该方法包括步骤:As shown in FIG. 1, a first embodiment of the present invention provides a method for controlling an aircraft, the method comprising the steps of:
10、通过双目摄像头采集图像。10. Capture images through a binocular camera.
本发明实施例中,飞行器设置有两个摄像头,组成一组双目摄像头。两个摄像头可以左右并排设置,也可以上下并列设置。当然,也可以交错设置,即两个摄像头不在同一条水平线或垂直线上。两个摄像头之间间隔一定距离,理论上间隔距离越大越好。In the embodiment of the invention, the aircraft is provided with two cameras to form a set of binocular cameras. The two cameras can be set side by side or side by side. Of course, it can also be staggered, that is, the two cameras are not on the same horizontal line or vertical line. The two cameras are separated by a certain distance. In theory, the larger the separation distance, the better.
本步骤中,通过双目摄像头同时(同步)采集图像,采集的图像可以是照片或者视频流。In this step, images are acquired simultaneously (synchronously) by the binocular camera, and the acquired images may be photos or video streams.
11、对双目摄像头采集的图像进行深度处理,获取到深度图像。11. Perform deep processing on the image captured by the binocular camera to obtain a depth image.
具体的,当双目摄像头左右并排设置时,将两个摄像头采集的两个图像左右并排拼接在一起。在一种实施方式中,可将左边摄像头采集的图像拼接在左边,将右边摄像头采集的图像拼接在右边,最终获得一个左右格式的立体图像;当双目摄像头上下并列设置时,将两个摄像头采集的两个图像上下并列拼接在一起,在一种实施方式中,可将上边摄像头采集的图像拼接在上边,将下边摄像头采集的图像拼接在下边,最终获得一个上下格式的立体图像。Specifically, when the binocular cameras are arranged side by side, the two images collected by the two cameras are spliced side by side. In one embodiment, the image captured by the left camera can be spliced to the left, the image captured by the right camera is spliced to the right, and finally a stereo image of the left and right format is obtained; when the binocular camera is arranged side by side, the two cameras are The collected images are spliced together side by side. In one embodiment, the image captured by the upper camera can be spliced to the upper side, and the image captured by the lower camera is spliced to the lower side, and finally a stereoscopic image of the top and bottom format is obtained.
进一步地,在进行图像拼接之前,飞行器先对原始图像进行分辨率降低处理,然后对降低了分辨率的图像进行拼接处理,以减小最终的立体图像尺寸,避免后续传输时消耗太多的带宽资源,从而提高传输速度,提高图像传输的实时性。Further, before performing image splicing, the aircraft first performs resolution reduction processing on the original image, and then splicing the reduced resolution image to reduce the final stereoscopic image size, thereby avoiding excessive bandwidth consumption in subsequent transmission. Resources, thereby increasing the transmission speed and improving the real-time performance of image transmission.
以采集的图像为视频流为例,飞行器首先将两个摄像头采集的两个视频流分别采样为两个预设分辨率的视频流,然后将两个预设分辨率的视频流拼接为一个立体视频流,其中,预设分辨率低于原始分辨率。例如,飞行器的两个摄像头各自拍摄4K分辨率的视频流,飞行器首先将两个4K分辨率的视频流采样成为两个720P格式的的视频流,然后将两个720P格式的视频流左右(或上下)拼接成为一个分辨率为2560*720的左右格式(或上下格式)的立体视频流。Taking the captured image as a video stream as an example, the aircraft first samples the two video streams collected by the two cameras into two preset resolution video streams, and then splicing the two preset resolution video streams into one stereo. Video stream, where the preset resolution is lower than the original resolution. For example, the two cameras of the aircraft each capture a 4K resolution video stream, and the aircraft first samples two 4K resolution video streams into two 720P format video streams, and then streams the two 720P format video streams (or Up and down) stitching into a stereo video stream with a resolution of 2560*720 in the left and right format (or top and bottom format).
此外,飞行器还可以将两个摄像头拍摄的原始图像保存于本地的存储空间内。进一步地,在保存前,还对原始图像进行压缩处理,以节省存储空间,如将视频流压缩为H.265格式的视频文件。In addition, the aircraft can also save the original images captured by the two cameras in a local storage space. Further, before the saving, the original image is also compressed to save storage space, such as compressing the video stream into a H.265 format video file.
12、根据深度图像和双目摄像头的参数,控制飞行器与人的安全距离。12. Control the safety distance between the aircraft and the person according to the depth image and the parameters of the binocular camera.
在本实施例中,为了保证飞行器与人的安全距离,根据深度图像和双目摄像头的参数,控制飞行器与人的安全距离。参数可包括焦距,精度,畸变系数等等。In this embodiment, in order to ensure a safe distance between the aircraft and the person, the safety distance between the aircraft and the person is controlled according to the depth image and the parameters of the binocular camera. Parameters can include focal length, accuracy, distortion factor, and more.
13、识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令。13. Identifying a gesture of a person in the depth image, and acquiring a control instruction of the aircraft according to the gesture of the recognized person.
请参考图2所示,在本实施例中,该步骤可包括:Please refer to FIG. 2, in this embodiment, the step may include:
131、对深度图像进行分割处理,获取到人手图像。131. Perform a segmentation process on the depth image to obtain a human hand image.
132、对人手图像采用模板匹配或者级联分类器进行手势类别识别。132. Using a template matching or a cascade classifier to perform gesture category recognition on the human hand image.
133、根据识别的手势类别,获取飞行器的控制指令。133. Acquire a control instruction of the aircraft according to the identified gesture category.
在本实施例中,手势类别包括静态手势和动态手势。In this embodiment, the gesture categories include static gestures and dynamic gestures.
在本实施例中,飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。In the present embodiment, the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions. The control of specific functions includes the control of the camera function and the control of the camera recording function.
需要说明的是,六个自由度指的是飞行器的向前、向后、向左、向右、向上及向下飞行。It should be noted that the six degrees of freedom refer to the forward, backward, leftward, rightward, upward and downward flight of the aircraft.
作为一示例地,当识别的手势为“食指朝下”时,可对应的控制飞行器降落;当识别的手势为“Victory”时(即食指和中指),可对应的控制飞行器的摄像头照相功能;当识别的手势为“手掌”时,可对应的控制飞行器的摄像头录像功能;当识别的手势为“拇指朝上”时,可对应的控制飞行器的摄像头停止录像功能;当识别的手势为“握拳”时,可对应的控制飞行器六个自由度飞行。As an example, when the recognized gesture is “index finger facing down”, the corresponding control aircraft may be landed; when the recognized gesture is “Victory” (ie, the index finger and the middle finger), the camera camera function of the aircraft may be correspondingly controlled; When the recognized gesture is "palm", the camera recording function of the aircraft can be controlled correspondingly; when the recognized gesture is "thumbs up", the corresponding camera of the controlling aircraft can stop the recording function; when the recognized gesture is "fist" "When the aircraft is controlled, it can fly in six degrees of freedom.
14、在安全距离内根据飞行器的控制指令,对飞行器进行控制。14. Control the aircraft within a safe distance according to the aircraft's control commands.
请参考图3所示,进一步地,在本实施例中,步骤13之前还可包括步骤:Please refer to FIG. 3, further, in this embodiment, before step 13, the method may further include the following steps:
121、对双目摄像头采集的图像进行融合处理,获取到融合图像。121. Perform fusion processing on the image captured by the binocular camera to obtain the fused image.
122、对融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置。122. Identify and track a face in the fused image to determine a face position in the fused image.
123、在安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。123. Adjust the position of the aircraft or the pan/tilt according to the position of the face in the fused image within the safe distance to ensure that the position of the face is located in the central area of the fused image.
本发明实施例提供的飞行器控制方法,通过对双目摄像头采集的图像进行深度处理后,识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令,并在安全距离内根据飞行器的控制指令,对飞行器进行控制。从而通过人的手势对飞行器进行控制,方便了用户的使用,降低了用户的购置成本,还可增加飞行器的操控乐趣,增强人与飞行器的互动。The aircraft control method provided by the embodiment of the present invention, after performing deep processing on the image captured by the binocular camera, identifies a gesture of a person in the depth image, acquires a control instruction of the aircraft according to the recognized gesture of the person, and is within a safe distance according to the method. The aircraft's control commands control the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
参照图4,图4为本发明第二实施例提出的一种飞行器,该飞行器包括:双目摄像头20、深度图像处理模块21、安全距离处理模块22、手势识别模块23以及控制模块24;Referring to FIG. 4, FIG. 4 is an aircraft according to a second embodiment of the present invention. The aircraft includes: a binocular camera 20, a depth image processing module 21, a safety distance processing module 22, a gesture recognition module 23, and a control module 24;
双目摄像头20,用于采集图像。The binocular camera 20 is used to acquire images.
本发明实施例中,飞行器设置有两个摄像头,组成一组双目摄像头。两个摄像头可以左右并排设置,也可以上下并列设置。当然,也可以交错设置,即两个摄像头不在同一条水平线或垂直线上。两个摄像头之间间隔一定距离,理论上间隔距离越大越好。In the embodiment of the invention, the aircraft is provided with two cameras to form a set of binocular cameras. The two cameras can be set side by side or side by side. Of course, it can also be staggered, that is, the two cameras are not on the same horizontal line or vertical line. The two cameras are separated by a certain distance. In theory, the larger the separation distance, the better.
通过双目摄像头同时(同步)采集图像,采集的图像可以是照片或者视频流。The images are acquired simultaneously (synchronously) by the binocular camera, and the acquired images may be photos or video streams.
深度图像处理模块21,用于对双目摄像头20采集的图像进行深度处理,获取到深度图像。The depth image processing module 21 is configured to perform depth processing on the image captured by the binocular camera 20 to obtain a depth image.
具体的,当双目摄像头左右并排设置时,将两个摄像头采集的两个图像左右并排拼接在一起。在一种实施方式中,可将左边摄像头采集的图像拼接在左边,将右边摄像头采集的图像拼接在右边,最终获得一个左右格式的立体图像;当双目摄像头上下并列设置时,将两个摄像头采集的两个图像上下并列拼接在一起,在一种实施方式中,可将上边摄像头采集的图像拼接在上边,将下边摄像头采集的图像拼接在下边,最终获得一个上下格式的立体图像。Specifically, when the binocular cameras are arranged side by side, the two images collected by the two cameras are spliced side by side. In one embodiment, the image captured by the left camera can be spliced to the left, the image captured by the right camera is spliced to the right, and finally a stereo image of the left and right format is obtained; when the binocular camera is arranged side by side, the two cameras are The collected images are spliced together side by side. In one embodiment, the image captured by the upper camera can be spliced to the upper side, and the image captured by the lower camera is spliced to the lower side, and finally a stereoscopic image of the top and bottom format is obtained.
进一步地,在进行图像拼接之前,飞行器先对原始图像进行分辨率降低处理,然后对降低了分辨率的图像进行拼接处理,以减小最终的立体图像尺寸,避免后续传输时消耗太多的带宽资源,从而提高传输速度,提高图像传输的实时性。Further, before performing image splicing, the aircraft first performs resolution reduction processing on the original image, and then splicing the reduced resolution image to reduce the final stereoscopic image size, thereby avoiding excessive bandwidth consumption in subsequent transmission. Resources, thereby increasing the transmission speed and improving the real-time performance of image transmission.
以采集的图像为视频流为例,飞行器首先将两个摄像头采集的两个视频流分别采样为两个预设分辨率的视频流,然后将两个预设分辨率的视频流拼接为一个立体视频流,其中,预设分辨率低于原始分辨率。例如,飞行器的两个摄像头各自拍摄4K分辨率的视频流,飞行器首先将两个4K分辨率的视频流采样成为两个720P格式的的视频流,然后将两个720P格式的视频流左右(或上下)拼接成为一个分辨率为2560*720的左右格式(或上下格式)的立体视频流。Taking the captured image as a video stream as an example, the aircraft first samples the two video streams collected by the two cameras into two preset resolution video streams, and then splicing the two preset resolution video streams into one stereo. Video stream, where the preset resolution is lower than the original resolution. For example, the two cameras of the aircraft each capture a 4K resolution video stream, and the aircraft first samples two 4K resolution video streams into two 720P format video streams, and then streams the two 720P format video streams (or Up and down) stitching into a stereo video stream with a resolution of 2560*720 in the left and right format (or top and bottom format).
此外,飞行器还可以将两个摄像头拍摄的原始图像保存于本地的存储空间内。进一步地,在保存前,还对原始图像进行压缩处理,以节省存储空间,如将视频流压缩为H.265格式的视频文件。In addition, the aircraft can also save the original images captured by the two cameras in a local storage space. Further, before the saving, the original image is also compressed to save storage space, such as compressing the video stream into a H.265 format video file.
安全距离处理模块22,用于根据深度图像和双目摄像头的参数,控制飞行器与人的安全距离。The safety distance processing module 22 is configured to control the safety distance between the aircraft and the person according to the depth image and the parameters of the binocular camera.
在本实施例中,为了保证飞行器与人的安全距离,根据深度图像和双目摄像头的参数,控制飞行器与人的安全距离。参数可包括焦距,精度,畸变系数等等。In this embodiment, in order to ensure a safe distance between the aircraft and the person, the safety distance between the aircraft and the person is controlled according to the depth image and the parameters of the binocular camera. Parameters can include focal length, accuracy, distortion factor, and more.
手势识别模块23,用于识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令。The gesture recognition module 23 is configured to identify a gesture of a person in the depth image, and acquire a control instruction of the aircraft according to the gesture of the recognized person.
请参考图5所示,在本实施例中,手势识别模块23可包括分割处理模块231、手势类别识别模块232以及获取模块233;Please refer to FIG. 5, in this embodiment, the gesture recognition module 23 may include a segmentation processing module 231, a gesture category identification module 232, and an acquisition module 233;
分割处理模块231,用于对深度图像进行分割处理,获取到人手图像。The segmentation processing module 231 is configured to perform segmentation processing on the depth image to obtain a human hand image.
手势类别识别模块232,用于对人手图像采用模板匹配或者级联分类器进行手势类别识别。The gesture category identification module 232 is configured to perform gesture category recognition on the human hand image by using a template matching or a cascade classifier.
获取模块233,用于根据识别的手势类别,获取飞行器的控制指令。The obtaining module 233 is configured to acquire a control instruction of the aircraft according to the identified gesture category.
在本实施例中,手势类别包括静态手势和动态手势。In this embodiment, the gesture categories include static gestures and dynamic gestures.
在本实施例中,飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。In the present embodiment, the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start and stop landing control, and control of specific functions. The control of specific functions includes the control of the camera function and the control of the camera recording function.
需要说明的是,六个自由度指的是飞行器的向前、向后、向左、向右、向上及向下飞行。It should be noted that the six degrees of freedom refer to the forward, backward, leftward, rightward, upward and downward flight of the aircraft.
作为一示例地,当识别的手势为“食指朝下”时,可对应的控制飞行器降落;当识别的手势为“Victory”时(即食指和中指),可对应的控制飞行器的摄像头照相功能;当识别的手势为“手掌”时,可对应的控制飞行器的摄像头录像功能;当识别的手势为“拇指朝上”时,可对应的控制飞行器的摄像头停止录像功能;当识别的手势为“握拳”时,可对应的控制飞行器六个自由度飞行。As an example, when the recognized gesture is “index finger facing down”, the corresponding control aircraft may be landed; when the recognized gesture is “Victory” (ie, the index finger and the middle finger), the camera camera function of the aircraft may be correspondingly controlled; When the recognized gesture is "palm", the camera recording function of the aircraft can be controlled correspondingly; when the recognized gesture is "thumbs up", the corresponding camera of the controlling aircraft can stop the recording function; when the recognized gesture is "fist" "When the aircraft is controlled, it can fly in six degrees of freedom.
控制模块24,用于在安全距离内根据飞行器的控制指令,对飞行器进行控制。The control module 24 is configured to control the aircraft according to a control instruction of the aircraft within a safe distance.
请参考图6所示,进一步地,在本实施例中,飞行器还包括图像融合处理模块221、人脸识别跟踪模块222以及人脸位置处理模块223;Please refer to FIG. 6, further, in this embodiment, the aircraft further includes an image fusion processing module 221, a face recognition tracking module 222, and a face location processing module 223;
图像融合处理模块221,用于对双目摄像头采集的图像进行融合处理,获取到融合图像;The image fusion processing module 221 is configured to perform fusion processing on the image captured by the binocular camera to obtain the fused image;
人脸识别跟踪模块222,用于对融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置;The face recognition tracking module 222 is configured to identify and track a face in the fused image, and determine a face position in the fused image;
人脸位置处理模块223,用于在安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。The face position processing module 223 is configured to adjust an aircraft or a pan/tilt angle according to a face position in the fused image within a safe distance to ensure that the face position is located in the fused image center area.
本发明实施例提供的飞行器,通过对双目摄像头采集的图像进行深度处理后,识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令,并在安全距离内根据飞行器的控制指令,对飞行器进行控制。从而通过人的手势对飞行器进行控制,方便了用户的使用,降低了用户的购置成本,还可增加飞行器的操控乐趣,增强人与飞行器的互动。The aircraft provided by the embodiment of the present invention, after performing deep processing on the image captured by the binocular camera, identifies a gesture of a person in the depth image, acquires a control instruction of the aircraft according to the recognized gesture of the person, and is within a safe distance according to the aircraft Control commands to control the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device comprising a series of elements includes those elements. It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, article, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only the preferred embodiments of the present invention, and are not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the present invention and the drawings are directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of the present invention.
工业实用性Industrial applicability
本发明实施例提供的飞行器及其控制方法,通过对双目摄像头采集的图像进行深度处理后,识别深度图像中人的手势,根据识别出的人的手势获取飞行器的控制指令,并在安全距离内根据飞行器的控制指令,对飞行器进行控制。从而通过人的手势对飞行器进行控制,方便了用户的使用,降低了用户的购置成本,还可增加飞行器的操控乐趣,增强人与飞行器的互动。因此,具有工业实用性。The aircraft and the control method thereof according to the embodiments of the present invention, after performing deep processing on the image captured by the binocular camera, identify the gesture of the person in the depth image, and obtain the control instruction of the aircraft according to the recognized gesture of the person, and at a safe distance The aircraft is controlled according to the control instructions of the aircraft. Therefore, the aircraft is controlled by human gestures, which is convenient for the user to use, reduces the user's purchase cost, increases the control fun of the aircraft, and enhances the interaction between the person and the aircraft. Therefore, it has industrial applicability.

Claims (2)

  1. 一种飞行器的控制方法,所述方法包括步骤:A method of controlling an aircraft, the method comprising the steps of:
    通过双目摄像头采集图像;Acquiring images through a binocular camera;
    对所述双目摄像头采集的图像进行深度处理,获取到深度图像;Performing depth processing on the image captured by the binocular camera to obtain a depth image;
    根据所述深度图像和所述双目摄像头的参数,控制所述飞行器与人的安全距离;Controlling a safe distance between the aircraft and a person according to the depth image and parameters of the binocular camera;
    识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令;Identifying a gesture of a person in the depth image, and acquiring a control instruction of the aircraft according to the gesture of the recognized person;
    在所述安全距离内根据所述飞行器的控制指令,对所述飞行器进行控制。The aircraft is controlled within the safe distance in accordance with a control command of the aircraft.
  2. 根据权利要求1所述的一种飞行器的控制方法,其中,所述识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令包括:The control method of an aircraft according to claim 1, wherein the recognizing a gesture of a person in the depth image, and acquiring a control instruction of the aircraft according to the gesture of the recognized person comprises:
    对所述深度图像进行分割处理,获取到人手图像;Performing a segmentation process on the depth image to obtain a human hand image;
    对所述人手图像采用模板匹配或者级联分类器进行手势类别识别;Using the template matching or the cascade classifier to perform gesture category recognition on the human hand image;
    根据识别的手势类别,获取所述飞行器的控制指令。A control instruction of the aircraft is obtained according to the identified gesture category.
    3. 根据权利要求2所述的一种飞行器的控制方法,其中,所述手势类别包括静态手势和动态手势。3. The control method of an aircraft according to claim 2, wherein the gesture category comprises a static gesture and a dynamic gesture.
    4. 根据权利要求1-3任一所述的一种飞行器的控制方法,其中,所述飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。4. The control method of an aircraft according to any one of claims 1 to 3, wherein the control command of the aircraft comprises six degrees of freedom control of the aircraft, aircraft start/stop landing control, and control of a specific function.
    5. 根据权利要求4所述的一种飞行器的控制方法,其中,所述特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。5. The control method of an aircraft according to claim 4, wherein the control of the specific function comprises control of a camera camera function and control of a camera recording function.
    6. 根据权利要求5所述的一种飞行器的控制方法,其中,所述识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令之前还包括步骤:The method for controlling an aircraft according to claim 5, wherein the step of identifying a gesture of a person in the depth image and obtaining a control instruction of the aircraft according to the gesture of the recognized person further comprises the steps of:
    对所述双目摄像头采集的图像进行融合处理,获取到融合图像;Performing fusion processing on the image acquired by the binocular camera to obtain a fused image;
    对所述融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置;Identifying and tracking a face in the fused image to determine a face position in the fused image;
    在所述安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。Adjusting the aircraft or pan/tilt angle according to the position of the face in the fused image within the safe distance ensures that the face position is located in the fused image center area.
    7. 一种飞行器,所述飞行器包括双目摄像头、深度图像处理模块、安全距离处理模块、手势识别模块以及控制模块;7. An aircraft comprising a binocular camera, a depth image processing module, a safety distance processing module, a gesture recognition module, and a control module;
    所述双目摄像头,用于采集图像;The binocular camera is configured to acquire an image;
    所述深度图像处理模块,用于对所述双目摄像头采集的图像进行深度处理,获取到深度图像;The depth image processing module is configured to perform depth processing on the image captured by the binocular camera to obtain a depth image;
    所述安全距离处理模块,用于根据所述深度图像和所述双目摄像头的参数,控制所述飞行器与人的安全距离;The safety distance processing module is configured to control a safety distance between the aircraft and a person according to the depth image and parameters of the binocular camera;
    所述手势识别模块,用于识别所述深度图像中人的手势,根据识别出的人的手势获取所述飞行器的控制指令;The gesture recognition module is configured to identify a gesture of a person in the depth image, and acquire a control instruction of the aircraft according to the gesture of the recognized person;
    所述控制模块,用于在所述安全距离内根据所述飞行器的控制指令,对所述飞行器进行控制。The control module is configured to control the aircraft according to a control instruction of the aircraft within the safety distance.
    8. 根据权利要求7所述的一种飞行器,其中,所述手势识别模块包括分割处理模块、手势类别识别模块以及获取模块;8. The aircraft of claim 7, wherein the gesture recognition module comprises a segmentation processing module, a gesture category identification module, and an acquisition module;
    所述分割处理模块,用于对所述深度图像进行分割处理,获取到人手图像;The segmentation processing module is configured to perform segmentation processing on the depth image to obtain a human hand image;
    所述手势类别识别模块,用于对所述人手图像采用模板匹配或者级联分类器进行手势类别识别;The gesture category identifying module is configured to perform gesture category recognition by using a template matching or a cascade classifier on the human hand image;
    所述获取模块,用于根据识别的手势类别,获取所述飞行器的控制指令。The acquiring module is configured to acquire a control instruction of the aircraft according to the identified gesture category.
    9. 根据权利要求8所述的一种飞行器,其中,所述手势类别包括静态手势和动态手势。9. An aircraft according to claim 8, wherein the gesture categories comprise static gestures and dynamic gestures.
    10. 根据权利要求7-9任一所述的一种飞行器,其中,所述飞行器的控制指令包括飞行器六个自由度的控制、飞行器启停降落控制以及特定功能的控制。10. An aircraft according to any of claims 7-9, wherein the aircraft's control commands include six degrees of freedom control of the aircraft, aircraft start/stop landing control, and control of specific functions.
    11. 根据权利要求10所述的一种飞行器,其中,所述特定功能的控制包括摄像头照相功能的控制、摄像头录像功能的控制。11. An aircraft according to claim 10, wherein the control of the particular function comprises control of a camera camera function, control of a camera recording function.
    12. 根据权利要求11所述的一种飞行器,其中,所述飞行器还包括图像融合处理模块、人脸识别跟踪模块以及人脸位置处理模块;12. The aircraft of claim 11, wherein the aircraft further comprises an image fusion processing module, a face recognition tracking module, and a face location processing module;
    所述图像融合处理模块,用于对所述双目摄像头采集的图像进行融合处理,获取到融合图像;The image fusion processing module is configured to perform fusion processing on the image captured by the binocular camera to obtain a fused image;
    所述人脸识别跟踪模块,用于对所述融合图像中的人脸进行识别和跟踪,确定融合图像中的人脸位置;The face recognition tracking module is configured to identify and track a face in the fused image, and determine a face position in the fused image;
    所述人脸位置处理模块,用于在所述安全距离内根据融合图像中的人脸位置,调整飞行器或者云台角度确保人脸位置位于融合图像中心区域。The face position processing module is configured to adjust an aircraft or a pan/tilt angle according to a face position in the fused image within the safety distance to ensure that the face position is located in the fused image center area.
PCT/CN2018/075877 2017-02-08 2018-02-08 Aircraft and control method therefor WO2018145650A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710068629.3A CN106774947A (en) 2017-02-08 2017-02-08 A kind of aircraft and its control method
CN201710068629.3 2017-02-08

Publications (1)

Publication Number Publication Date
WO2018145650A1 true WO2018145650A1 (en) 2018-08-16

Family

ID=58956213

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/075877 WO2018145650A1 (en) 2017-02-08 2018-02-08 Aircraft and control method therefor

Country Status (2)

Country Link
CN (1) CN106774947A (en)
WO (1) WO2018145650A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112712462A (en) * 2019-10-24 2021-04-27 上海宗保科技有限公司 Unmanned aerial vehicle image acquisition system based on image splicing

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106774947A (en) * 2017-02-08 2017-05-31 亿航智能设备(广州)有限公司 A kind of aircraft and its control method
CN107831791B (en) * 2017-11-17 2020-12-15 深圳意动航空科技有限公司 Unmanned aerial vehicle control method and device, control equipment and storage medium
WO2019144291A1 (en) * 2018-01-23 2019-08-01 深圳市大疆创新科技有限公司 Flight control method, apparatus, and machine-readable storage medium
WO2019144295A1 (en) * 2018-01-23 2019-08-01 深圳市大疆创新科技有限公司 Flight control method and device, and aircraft, system and storage medium
CN108459617A (en) * 2018-03-15 2018-08-28 深圳市乐升科技有限公司 A kind of UAV Flight Control System based on projection mobile phone
CN109343565A (en) * 2018-10-29 2019-02-15 中国航空无线电电子研究所 A kind of UAV Intelligent ground control control method based on gesture perception identification
CN109254595A (en) * 2018-11-28 2019-01-22 广东海洋大学 Man-machine flight control method and the system of dancing together of aircraft
CN109521785B (en) * 2018-12-29 2021-07-27 西安电子科技大学 Intelligent rotor craft system capable of being shot with oneself
CN109948423B (en) * 2019-01-18 2020-09-11 特斯联(北京)科技有限公司 Unmanned aerial vehicle travel accompanying service method applying face and posture recognition and unmanned aerial vehicle
CN111316325B (en) * 2019-03-08 2021-07-30 深圳市大疆创新科技有限公司 Shooting device parameter calibration method, equipment and storage medium
CN111988493B (en) * 2019-05-21 2021-11-30 北京小米移动软件有限公司 Interaction processing method, device, equipment and storage medium
CN111123959B (en) * 2019-11-18 2023-05-30 亿航智能设备(广州)有限公司 Unmanned aerial vehicle control method based on gesture recognition and unmanned aerial vehicle adopting method
CN111176323A (en) * 2019-12-30 2020-05-19 湖南华诺星空电子技术有限公司 Radar and infrared integrated unmanned aerial vehicle landing control method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130253733A1 (en) * 2012-03-26 2013-09-26 Hon Hai Precision Industry Co., Ltd. Computing device and method for controlling unmanned aerial vehicle in flight space
CN105787447A (en) * 2016-02-26 2016-07-20 深圳市道通智能航空技术有限公司 Method and system of unmanned plane omnibearing obstacle avoidance based on binocular vision
CN106227231A (en) * 2016-07-15 2016-12-14 深圳奥比中光科技有限公司 The control method of unmanned plane, body feeling interaction device and unmanned plane
CN106774947A (en) * 2017-02-08 2017-05-31 亿航智能设备(广州)有限公司 A kind of aircraft and its control method

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104834249A (en) * 2015-03-16 2015-08-12 张时勉 Wearable remote controller
CN104808799A (en) * 2015-05-20 2015-07-29 成都通甲优博科技有限责任公司 Unmanned aerial vehicle capable of indentifying gesture and identifying method thereof
US20160349845A1 (en) * 2015-05-28 2016-12-01 Google Inc. Gesture Detection Haptics and Virtual Tools
CN105589466A (en) * 2016-02-24 2016-05-18 谭圆圆 Flight control device of unmanned aircraft and flight control method thereof
CN105892474A (en) * 2016-03-31 2016-08-24 深圳奥比中光科技有限公司 Unmanned plane and control method of unmanned plane
CN105847684A (en) * 2016-03-31 2016-08-10 深圳奥比中光科技有限公司 Unmanned aerial vehicle

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130253733A1 (en) * 2012-03-26 2013-09-26 Hon Hai Precision Industry Co., Ltd. Computing device and method for controlling unmanned aerial vehicle in flight space
CN105787447A (en) * 2016-02-26 2016-07-20 深圳市道通智能航空技术有限公司 Method and system of unmanned plane omnibearing obstacle avoidance based on binocular vision
CN106227231A (en) * 2016-07-15 2016-12-14 深圳奥比中光科技有限公司 The control method of unmanned plane, body feeling interaction device and unmanned plane
CN106774947A (en) * 2017-02-08 2017-05-31 亿航智能设备(广州)有限公司 A kind of aircraft and its control method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112712462A (en) * 2019-10-24 2021-04-27 上海宗保科技有限公司 Unmanned aerial vehicle image acquisition system based on image splicing

Also Published As

Publication number Publication date
CN106774947A (en) 2017-05-31

Similar Documents

Publication Publication Date Title
WO2018145650A1 (en) Aircraft and control method therefor
US11914370B2 (en) System and method for providing easy-to-use release and auto-positioning for drone applications
US11340606B2 (en) System and method for controller-free user drone interaction
US11688034B2 (en) Virtual lens simulation for video and photo cropping
WO2018214078A1 (en) Photographing control method and device
CN104410883B (en) The mobile wearable contactless interactive system of one kind and method
WO2018133589A1 (en) Aerial photography method, device, and unmanned aerial vehicle
WO2016187985A1 (en) Photographing device, tracking photographing method and system, and computer storage medium
CN206181190U (en) Automatic cloud platform camera system that follows
CN107831791B (en) Unmanned aerial vehicle control method and device, control equipment and storage medium
US20170374276A1 (en) Controlling capturing of a multimedia stream with user physical responses
WO2020227996A1 (en) Photography control method and apparatus, control device and photography device
CN204287973U (en) flight camera
WO2019061466A1 (en) Flight control method, remote control device, and remote control system
WO2022082440A1 (en) Method, apparatus and system for determining target following strategy, and device and storage medium
WO2021203368A1 (en) Image processing method and apparatus, electronic device and storage medium
WO2021243566A1 (en) Imaging method, imaging apparatus, and computer readable storage medium
KR20180000110A (en) Drone and method for controlling the same
CN209297326U (en) Two axle The Cloud Terraces iris face acquisition device
WO2023123254A1 (en) Control method and device for unmanned aerial vehicle, unmanned aerial vehicle, and storage medium
WO2022016334A1 (en) Image processing method and apparatus, racing drone, image optimization system and storage medium
WO2022178724A1 (en) Image photographing method, terminal device, photographing apparatus, and storage medium
KR20210001080U (en) Drone device for aerial photography or video
WO2018214075A1 (en) Video image generation method and device
CN108858213A (en) Robot, traveling method of controlling operation and robot working method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18751233

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 08/01/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18751233

Country of ref document: EP

Kind code of ref document: A1