WO2021169704A1 - Method, device and apparatus for determining depth of gesture, and storage medium - Google Patents

Method, device and apparatus for determining depth of gesture, and storage medium Download PDF

Info

Publication number
WO2021169704A1
WO2021169704A1 PCT/CN2021/073715 CN2021073715W WO2021169704A1 WO 2021169704 A1 WO2021169704 A1 WO 2021169704A1 CN 2021073715 W CN2021073715 W CN 2021073715W WO 2021169704 A1 WO2021169704 A1 WO 2021169704A1
Authority
WO
WIPO (PCT)
Prior art keywords
depth
feature point
point
preset
value
Prior art date
Application number
PCT/CN2021/073715
Other languages
French (fr)
Chinese (zh)
Inventor
黄少光
许秋子
Original Assignee
深圳市瑞立视多媒体科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市瑞立视多媒体科技有限公司 filed Critical 深圳市瑞立视多媒体科技有限公司
Publication of WO2021169704A1 publication Critical patent/WO2021169704A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person

Definitions

  • the present invention relates to the field of image processing technology, and in particular to a method, device, device and storage medium for determining the depth of a gesture.
  • depth cameras have gradually been applied to application scenarios such as object recognition and scene modeling.
  • object recognition and scene modeling Unlike ordinary color cameras that can only take 2D images of objects, depth cameras are a kind of 3D cameras.
  • the depth images they take include not only color information, but also depth information, that is, the distance between the actual object feature points in the image and the camera. , Through the depth image, the three-dimensional coordinates of the object can be obtained, so as to restore the real scene and realize applications such as scene modeling.
  • the main purpose of the present invention is to provide a gesture depth determination method, device, equipment and storage medium, aiming to improve the accuracy of measuring the depth information of the hand feature points based on the depth camera.
  • the first aspect of the present invention provides a method for determining the depth of a gesture, and the method for determining the depth of a gesture includes:
  • the preset wrist feature point is a noise point
  • the target finger feature point is not a noise point, determining whether pixels other than the target finger feature point are all noise points on the line connecting the preset wrist feature point and the target finger feature point;
  • the preset wrist feature points and the target finger are calculated
  • the median depth of all normal points on the line of the characteristic points is taken as the depth of the preset wrist characteristic points.
  • the step of judging whether the preset wrist feature point is a noise according to the depth value includes:
  • the preset wrist feature point is a noise point.
  • the step of judging whether the feature point of the target finger is a noise point includes:
  • the target finger feature point is not a noise point.
  • the method further includes:
  • the target finger feature point is a noise point
  • obtain the depth of the normal point closest to the preset wrist feature point and use the depth of the normal point closest to the preset wrist feature point as the preset wrist feature point depth.
  • pixels other than the target finger feature point are determined on the line connecting the preset wrist feature point and the target finger feature point After the steps of whether the points are noise points, it also includes:
  • pixel points other than the target finger feature point are noise points
  • obtain the normal point closest to the preset wrist feature point Depth the depth of the normal point closest to the preset wrist feature point is taken as the depth of the preset wrist feature point.
  • a second aspect of the present invention provides a gesture depth determining device, the gesture depth determining device includes:
  • An acquisition module configured to acquire a hand depth image through a depth camera, and acquire preset wrist feature points contained in the hand depth image and the depth value of the preset wrist feature points;
  • the first judgment module is configured to judge whether the preset wrist feature point is a noise point according to the depth value
  • An acquiring module configured to acquire a preset finger feature point closest to the preset wrist feature point as the target finger feature point when the preset wrist feature point is a noise point;
  • the second judgment module is used to judge whether the feature point of the target finger is a noise point
  • the third judgment module is used for judging the pixel points other than the target finger feature points on the line between the preset wrist feature points and the target finger feature points when the target finger feature points are not noise points Are they all noisy;
  • a calculation module for calculating the preset wrist feature point when the pixel points other than the target finger feature point on the line connecting the preset wrist feature point and the target finger feature point are not all noise points The median depth of all normal points on the line with the feature point of the target finger is taken as the depth of the preset wrist feature point.
  • a third aspect of the present invention provides a gesture depth determining device.
  • the gesture depth determining device includes a memory and at least one processor, the memory stores instructions, and the memory and the at least one processor communicate with each other through a wire. Connect; the at least one processor calls the instructions in the memory, so that the gesture depth determination device executes the above gesture depth determination method.
  • a fourth aspect of the present invention provides a storage medium in which instructions are stored, which when run on a computer, cause the computer to execute the aforementioned gesture depth determination method.
  • the present invention collects the hand depth image through a depth camera, obtains the preset wrist feature points and the depth value of the preset wrist feature points contained in the hand depth image; judges whether the preset wrist feature points are noise points according to the depth value; when preset When the wrist feature point is a noise point, obtain the preset finger feature point closest to the preset wrist feature point as the target finger feature point; determine whether the target finger feature point is a noise point; when the target finger feature point is not a noise point, determine the preset On the connection line between the wrist feature point and the target finger feature point, whether the pixel points other than the target finger feature point are all noise; when the preset wrist feature point and the target finger feature point are on the line, except for the target finger feature point When the pixel points are not all noise points, the median depth of all normal points on the line connecting the preset wrist feature points and the target finger feature points is calculated as the depth of the preset wrist feature points.
  • the preset wrist feature points are noisy points
  • the median depth of the normal points to replace the preset wrist feature points depth
  • more stable and accurate depth information of the hand feature points can be obtained, thereby improving the
  • the depth camera measures the accuracy of the depth information of the feature points of the hand.
  • FIG. 1 is a schematic flowchart of an embodiment of a method for determining a gesture depth according to the present invention
  • FIG. 2 is a schematic diagram of preset hand feature points included in a hand depth image according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of modules of an embodiment of a gesture depth determining device of the present invention.
  • Fig. 4 is a schematic structural diagram of a gesture depth determination device provided by an embodiment of the present invention.
  • the embodiment of the present invention provides a method, device, device and storage medium for determining the depth of a gesture.
  • the depth of the preset wrist feature point can be obtained by finding the median depth of the normal point instead of the preset wrist feature point. To obtain more stable and accurate depth information of the hand feature points, thereby improving the accuracy of measuring the depth information of the hand feature points based on the depth camera.
  • Fig. 1 is a schematic flowchart of an embodiment of a method for determining a gesture depth according to the present invention. The method includes:
  • Step 101 Collect a hand depth image through a depth camera, and obtain preset wrist feature points included in the hand depth image and depth values of the preset wrist feature points;
  • the execution subject of the present invention may be a gesture depth determining device, and may also be a terminal or a server, which is not specifically limited here.
  • the embodiment of the present invention is described by taking the server as the execution subject as an example.
  • the server is in communication connection with a depth camera.
  • the depth camera has a depth image shooting function, and its specific model can be flexibly selected. For example, it can be kinectV1 (first generation kinect) or kinectV2 (second generation kinect).
  • FIG. 2 is a schematic diagram of preset hand feature points included in a hand depth image according to an embodiment of the present invention.
  • the hand depth image includes 21 preset hand feature points, specifically including 20 on the fingers
  • the preset finger feature points thumb T0, T1, T2, T3, index finger I0, I1, I2, I3, middle finger M0, M1, M2, M3, ring finger R0, R1, R2, R3, little finger L0, L1, L2 L3, and a preset wrist feature point W located on the wrist.
  • the server obtains the depth value of the preset wrist feature point included in the collected hand depth image, and the depth value represents the distance of the hand feature point from the depth camera.
  • the server may first collect the hand RGB image and the hand depth image through the depth camera, and align the two images through the alignment algorithm to locate the position of the hand and the position of the feature points of the hand in the RGB image, and then The RGB image and the hand depth image are coordinate mapped to obtain the depth information of the entire hand contained in the hand depth image, and obtain the two-dimensional information of the wrist feature points and the finger feature points through the deep learning model, and at the same time obtain The depth information of the entire hand contained in the hand depth image, and then the depth value of the preset wrist feature point is obtained from the depth information of the entire hand.
  • Step 102 Determine whether the preset wrist feature point is a noise point according to the depth value
  • This step 102 may specifically include: identifying non-noise points in the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points; calculating all the obtained non-noise points And calculate the absolute value of the difference between the depth value of the preset wrist feature point and the median value; determine the absolute value of the difference between the depth value of the preset wrist feature point and the median value Whether the value is greater than or equal to a preset threshold; when the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to the preset threshold, it is determined that the preset wrist feature point is a noise point.
  • the working depth range of the kinectV2 camera is 500 ⁇ 4500mm. If the object is too close or too far away from the camera, the depth information measurement will be inaccurate. For this reason, the server can obtain the depth information of the hand area in the hand depth image.
  • the pixels with a depth value of 500 to 4500 mm are regarded as non-noise points, so as to realize the recognition of non-noise points in the hand area in the hand depth image.
  • the server After identifying the non-noise points of the hand region in the hand depth image, calculate the median value of the depth values of all non-noise points, that is, the median. Specifically, the server obtains the depth value of each non-noise point and arranges the depth values from small to large to obtain a series of numbers. When the number of items in the series is odd, the depth value in the middle position is the median. When the number of items in the sequence is even, the median is the average of the two depth values in the middle.
  • the server calculates the absolute value of the difference between the depth value of the preset wrist feature points contained in the hand depth image and the median value, and determines whether the absolute value is greater than or equal to the preset threshold, and if the absolute value is greater than or equal to the preset threshold If the threshold is set, the server determines that the preset wrist feature point is noisy. On the contrary, if the absolute value is less than the preset threshold, it means that the preset wrist feature point is non-noise. At this time, the depth value of the preset wrist feature point is not used. Do any processing.
  • the foregoing preset threshold may be determined according to the length of a human palm.
  • the length of the feature point W to M3 in FIG. 2 may be used as the preset threshold. Because the palm can be moved when taking a photo, if the arm is not moving, the maximum forward and backward distance of the palm is the depth threshold of the palm space, and points beyond the depth threshold are regarded as abnormal.
  • Step 103 When the preset wrist feature point is a noise point, and when the preset wrist feature point is a noise point, acquire the preset finger feature point closest to the preset wrist feature point as the target finger feature point;
  • the server determines that the preset wrist feature point is a noise point
  • the preset finger feature point closest to the preset wrist feature point is acquired as the target finger feature point.
  • Step 104 Judge whether the feature point of the target finger is a noise point
  • This step 104 may specifically include: obtaining the depth value of the feature point of the target finger, calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value; judging the depth value of the feature point of the target finger and Whether the absolute value of the difference between the median value is greater than or equal to the preset threshold; when the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold, it is determined The target finger feature point is a noise point; when the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold, it is determined that the target finger feature point is not a noise point.
  • the server When judging whether the target finger feature point is a noise point, the server first obtains the depth value of the target finger feature point, calculates the absolute value of the difference between the depth value of the target finger feature point and the above median value, and then determines whether the absolute value is greater than or Equal to the aforementioned preset threshold. If the absolute value is greater than or equal to the preset threshold, the server determines that the target finger feature point is a noise; otherwise, if the absolute value is less than the preset threshold, then it determines that the target finger feature point is not a noise.
  • Step 105 When the target finger feature point is not a noise point, determine whether the pixel points other than the target finger feature point on the line connecting the preset wrist feature point and the target finger feature point are all noise points ;
  • the server further determines whether the pixel points other than the target hand feature point are noise points on the connection line between the preset wrist feature point and the target hand feature point .
  • the specific judgment method can refer to the above method of judging whether the target hand feature point is a noise point, which will not be repeated here.
  • Step 106 When the pixel points other than the target finger feature points are not all noise points on the line connecting the preset wrist feature points and the target finger feature points, calculate the preset wrist feature points and all the points. The median depth of all normal points on the line connecting the feature points of the target finger is used as the depth of the preset wrist feature points.
  • the server obtains all normal points on the line (ie The median value of the depth value of the non-noise point, that is, the median depth, the median depth is taken as the depth of the preset wrist feature point.
  • the wrist feature point W when it is determined to be a noise point, obtain the finger feature point T0 closest to W, use T0 as the target finger feature point and determine whether T0 is a noise point. If T0 is not a noise, then determine whether the pixels on the line between W and T0 are all noise, if not, then calculate the median depth of all normal points on the line between W and T0, and use the median depth As the depth of the characteristic point W of the wrist.
  • step 104 it may further include: when the target finger feature point is a noisy point, acquiring the depth of the normal point closest to the preset wrist feature point, and setting it to the preset wrist feature point The depth of the nearest normal point is used as the depth of the preset wrist feature point.
  • the server may search for the normal point closest to the preset wrist feature point on the finger where the preset wrist feature point is located, centering on the preset wrist feature point, that is, For pixel points whose depth value is within a preset range, then the depth of the normal point is used as the depth of the preset wrist feature point.
  • the normal point closest to W is obtained, and the depth of the normal point can be used as the depth of the wrist feature point W.
  • step 105 it may further include: when the pixel points other than the target finger feature point are noise points on the line connecting the preset wrist feature point and the target finger feature point, obtaining the distance
  • the depth of the normal point closest to the preset wrist feature point is the depth of the normal point closest to the preset wrist feature point as the depth of the preset wrist feature point.
  • the server may place the finger on the finger where the preset wrist feature point is located.
  • the preset wrist feature point is the center, and the normal point closest to the preset wrist feature point is retrieved, that is, the pixel point whose depth value is within the preset range, and then the depth of the normal point is taken as the depth of the preset wrist feature point .
  • the wrist feature point W Take the wrist feature point W in Figure 2 as an example.
  • the wrist feature point W when it is determined to be a noise point, obtain the finger feature point T0 closest to W, use T0 as the target finger feature point and determine whether T0 is a noise point. If T0 is not a noise point, the normal point closest to W can be retrieved with W as the center, and the depth of the normal point can be used as the depth of the wrist feature point W.
  • the pixel points other than the target hand feature point are all noise points.
  • the depth of the normal point closest to the preset wrist feature point is used as the depth of the preset wrist feature point, which further improves the accuracy of measuring the depth information of the hand feature point based on the depth camera.
  • the embodiment of the present invention also provides a gesture depth determination device.
  • FIG. 3 is a schematic diagram of modules of an embodiment of a gesture depth determining device of the present invention.
  • the device for determining the gesture depth includes:
  • the acquisition module 301 is configured to acquire a hand depth image through a depth camera, and acquire preset wrist feature points contained in the hand depth image and the depth value of the preset wrist feature points;
  • the first determining module 302 is configured to determine whether the preset wrist feature point is a noise point according to the depth value
  • the acquiring module 303 is configured to acquire the preset finger feature point closest to the preset wrist feature point as the target finger feature point when the preset wrist feature point is a noise point;
  • the second judgment module 304 is configured to judge whether the feature point of the target finger is a noise point
  • the third judging module 305 is used for judging pixels other than the target finger feature points on the line connecting the preset wrist feature points and the target finger feature points when the target finger feature points are not noise points Whether the points are all noise points;
  • the calculation module 306 is configured to calculate the preset wrist feature when the pixel points other than the target finger feature point are not all noise points on the connection line between the preset wrist feature point and the target finger feature point.
  • the median depth of all normal points on the line connecting the point and the feature point of the target finger is taken as the depth of the preset wrist feature point.
  • the first judgment module 302 is further configured to:
  • the preset wrist feature point is a noise point.
  • the second judgment module 304 is further configured to:
  • the target finger feature point is not a noise point.
  • the device for determining the gesture depth further includes:
  • the first processing module is configured to obtain the depth of the normal point closest to the preset wrist feature point when the target finger feature point is a noise point, and use the depth of the normal point closest to the preset wrist feature point as The depth of the preset wrist feature point.
  • the device for determining the gesture depth further includes:
  • the second processing module is configured to obtain a distance from the preset wrist when the pixel points other than the target finger feature points are noise points on the connection line between the preset wrist feature point and the target finger feature point.
  • the depth of the normal point closest to the feature point is the depth of the normal point closest to the preset wrist feature point as the depth of the preset wrist feature point.
  • the device for determining the gesture depth in the embodiment of the present invention is described in detail above from the perspective of a modular functional entity, and the device for determining the gesture depth in the embodiment of the present invention is described in detail below from the perspective of hardware processing.
  • FIG. 4 is a schematic structural diagram of a gesture depth determination device provided by an embodiment of the present invention.
  • the gesture depth determination device 400 may have relatively large differences due to different configurations or performances, and may include one or more processors (central processing units, CPU) 410 (for example, one or more processors) and a memory 420. Or more than one storage medium 430 (for example, one or one storage device with a large amount of storage) storing application programs 433 or data 432. Among them, the memory 420 and the storage medium 430 may be short-term storage or persistent storage.
  • the program stored in the storage medium 430 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations in the device 400 for determining the gesture depth.
  • the processor 410 may be configured to communicate with the storage medium 430, and execute a series of instruction operations in the storage medium 430 on the gesture depth determining device 400.
  • the gesture depth determination device 400 may also include one or more power supplies 440, one or more wired or wireless network interfaces 450, one or more input and output interfaces 460, and/or one or more operating systems 431, such as Windows Serve , Mac OS X, Unix, Linux, FreeBSD, etc.
  • FIG. 4 does not constitute a limitation on the gesture depth determination device, and may include more or less components than shown in the figure, or combine certain components, or different The layout of the components.
  • the present invention also provides a storage medium.
  • the storage medium may be a non-volatile storage medium or a volatile storage medium.
  • the storage medium stores a gesture depth determination program, and the gesture depth determination program is processed. When the device is executed, the steps of the method for determining the depth of the gesture as described above are implemented.
  • the method and beneficial effects achieved when the gesture depth determination program running on the processor is executed can refer to the various embodiments of the gesture depth determination method of the present invention, which will not be repeated here.
  • the technical solution of the present invention essentially or the part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium.
  • a computer device which can be a personal computer, a server, or a network device, etc.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disks or optical disks and other media that can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Provided are a method, device and apparatus for determining the depth of a gesture, and a storage medium capable of improving accuracy of depth information of a hand feature point measured by a depth camera. The method comprises: capturing a depth image of a hand, and acquiring a preset wrist feature point and a depth value thereof contained in the depth image of the hand (101); determining, according to the depth value, whether the preset wrist feature point is a noise point (102); if the preset wrist feature point is a noise point, acquiring a preset finger feature point closest to the preset wrist feature point, and using same as a target finger feature point (103); determining whether the target finger feature point is a noise point (104); if the target finger feature point is not a noise point, determining whether all of pixel points on a connection line between the preset wrist feature point and the target finger feature point are noise points (105); and if not all of the pixel points are noise points, calculating a mean depth of all normal points on the connection line between the preset wrist feature point and the target finger feature point, and using same as the depth of the preset wrist feature point (106).

Description

手势深度确定方法、装置、设备及存储介质Method, device, equipment and storage medium for determining gesture depth 技术领域Technical field
本发明涉及图像处理技术领域,尤其涉及手势深度确定方法、装置、设备及存储介质。The present invention relates to the field of image processing technology, and in particular to a method, device, device and storage medium for determining the depth of a gesture.
背景技术Background technique
近年来,随着人机交互、机器视觉等技术的发展,深度相机逐渐运用到物体识别、场景建模等应用场景。与普通的彩色相机仅能拍摄物体的2D图像不同,深度相机是一种3D摄像机,其所拍摄的深度图像不仅包括颜色信息,还包括深度信息,即图像中的实际物体特征点距离相机的距离,通过深度图像,可以得到物体的三维坐标,从而还原真实场景,实现场景建模等应用。In recent years, with the development of technologies such as human-computer interaction and machine vision, depth cameras have gradually been applied to application scenarios such as object recognition and scene modeling. Unlike ordinary color cameras that can only take 2D images of objects, depth cameras are a kind of 3D cameras. The depth images they take include not only color information, but also depth information, that is, the distance between the actual object feature points in the image and the camera. , Through the depth image, the three-dimensional coordinates of the object can be obtained, so as to restore the real scene and realize applications such as scene modeling.
目前,在通过深度相机采集用户的手部3D图像时,由于环境光线、手部晃动等原因,很容易导致采集到的手部深度信息存在噪声,比如对于手指尖的点,由于手指尖是手的末端,晃动幅度最大,因此很有可能误测它处的深度信息作为手指尖的深度信息。因此,现有的基于深度相机测量手部特征点深度信息的准确性还有待提高。At present, when using a depth camera to collect 3D images of a user’s hand, due to ambient light, hand shaking, etc., it is easy to cause noise in the collected hand depth information. For example, for the point of the fingertip, because the fingertip is a hand The end of the shook has the largest amplitude, so it is very likely that the depth information at it may be mistakenly detected as the depth information of the fingertip. Therefore, the accuracy of the existing depth information based on the depth camera to measure the depth information of the hand feature points needs to be improved.
发明内容Summary of the invention
本发明的主要目的在于提出一种手势深度确定方法、装置、设备及存储介质,旨在提高基于深度相机测量手部特征点深度信息的准确性。The main purpose of the present invention is to provide a gesture depth determination method, device, equipment and storage medium, aiming to improve the accuracy of measuring the depth information of the hand feature points based on the depth camera.
本发明第一方面提供了一种手势深度确定方法,所述手势深度确定方法包括:The first aspect of the present invention provides a method for determining the depth of a gesture, and the method for determining the depth of a gesture includes:
通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;Acquiring a hand depth image by a depth camera, and acquiring preset wrist feature points included in the hand depth image and depth values of the preset wrist feature points;
根据所述深度值判断所述预设手腕特征点是否为噪点;Judging whether the preset wrist feature point is a noise point according to the depth value;
当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特征点;When the preset wrist feature point is a noise point, acquiring the preset finger feature point closest to the preset wrist feature point as the target finger feature point;
判断所述目标手指特征点是否为噪点;Judging whether the feature point of the target finger is a noise point;
当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目 标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;When the target finger feature point is not a noise point, determining whether pixels other than the target finger feature point are all noise points on the line connecting the preset wrist feature point and the target finger feature point;
当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。When the pixel points other than the target finger feature points on the line connecting the preset wrist feature points and the target finger feature points are not all noise points, the preset wrist feature points and the target finger are calculated The median depth of all normal points on the line of the characteristic points is taken as the depth of the preset wrist characteristic points.
可选的,在本发明第一方面的第一种实现方式中,所述根据所述深度值判断所述预设手腕特征点是否为噪点的步骤包括:Optionally, in the first implementation of the first aspect of the present invention, the step of judging whether the preset wrist feature point is a noise according to the depth value includes:
根据所述深度相机的工作深度范围,识别所述手部深度图像中的手部区域的非噪点,并获取所述非噪点的深度值;Identifying the non-noise points of the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points;
计算获取到的所有非噪点的深度值的中值,并计算所述预设手腕特征点的深度值与所述中值之差的绝对值;Calculating the median value of the acquired depth values of all non-noise points, and calculating the absolute value of the difference between the depth value of the preset wrist feature point and the median value;
判断所述预设手腕特征点的深度值与所述中值之差的绝对值是否大于或等于预设阈值;Judging whether the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold;
当所述预设手腕特征点的深度值与所述中值之差的绝对值大于或等于预设阈值时,判定所述预设手腕特征点为噪点。When the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold value, it is determined that the preset wrist feature point is a noise point.
可选的,在本发明第一方面的第二种实现方式中,所述判断所述目标手指特征点是否为噪点的步骤包括:Optionally, in the second implementation manner of the first aspect of the present invention, the step of judging whether the feature point of the target finger is a noise point includes:
获取所述目标手指特征点的深度值,计算所述目标手指特征点的深度值与所述中值之差的绝对值;Acquiring the depth value of the feature point of the target finger, and calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value;
判断所述目标手指特征点的深度值与所述中值之差的绝对值是否大于或等于所述预设阈值;Judging whether the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold;
当所述目标手指特征点的深度值与所述中值之差的绝对值大于或等于所述预设阈值时,判定所述目标手指特征点为噪点;When the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold value, determining that the target finger feature point is a noise point;
当所述目标手指特征点的深度值与所述中值之差的绝对值小于所述预设阈值时,判定所述目标手指特征点不为噪点。When the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold value, it is determined that the target finger feature point is not a noise point.
可选的,在本发明第一方面的第三种实现方式中,所述判断所述目标手指特征点是否为噪点的步骤之后,还包括:Optionally, in the third implementation manner of the first aspect of the present invention, after the step of determining whether the target finger feature point is a noise point, the method further includes:
当所述目标手指特征点为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。When the target finger feature point is a noise point, obtain the depth of the normal point closest to the preset wrist feature point, and use the depth of the normal point closest to the preset wrist feature point as the preset wrist feature point depth.
可选的,在本发明第一方面的第四种实现方式中,所述判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点的步骤之后,还包括:Optionally, in the fourth implementation manner of the first aspect of the present invention, pixels other than the target finger feature point are determined on the line connecting the preset wrist feature point and the target finger feature point After the steps of whether the points are noise points, it also includes:
当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点均为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。When on the line connecting the preset wrist feature point and the target finger feature point, pixel points other than the target finger feature point are noise points, obtain the normal point closest to the preset wrist feature point Depth, the depth of the normal point closest to the preset wrist feature point is taken as the depth of the preset wrist feature point.
本发明第二方面提供了一种手势深度确定装置,所述手势深度确定装置包括:A second aspect of the present invention provides a gesture depth determining device, the gesture depth determining device includes:
采集模块,用于通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;An acquisition module, configured to acquire a hand depth image through a depth camera, and acquire preset wrist feature points contained in the hand depth image and the depth value of the preset wrist feature points;
第一判断模块,用于根据所述深度值判断所述预设手腕特征点是否为噪点;The first judgment module is configured to judge whether the preset wrist feature point is a noise point according to the depth value;
获取模块,用于当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特征点;An acquiring module, configured to acquire a preset finger feature point closest to the preset wrist feature point as the target finger feature point when the preset wrist feature point is a noise point;
第二判断模块,用于判断所述目标手指特征点是否为噪点;The second judgment module is used to judge whether the feature point of the target finger is a noise point;
第三判断模块,用于当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;The third judgment module is used for judging the pixel points other than the target finger feature points on the line between the preset wrist feature points and the target finger feature points when the target finger feature points are not noise points Are they all noisy;
计算模块,用于当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。A calculation module for calculating the preset wrist feature point when the pixel points other than the target finger feature point on the line connecting the preset wrist feature point and the target finger feature point are not all noise points The median depth of all normal points on the line with the feature point of the target finger is taken as the depth of the preset wrist feature point.
本发明第三方面提供了一种手势深度确定设备,所述手势深度确定设备包括:存储器和至少一个处理器,所述存储器中存储有指令,所述存储器和所述至少一个处理器通过线路互连;所述至少一个处理器调用所述存储器中的所述指令,以使得所述手势深度确定设备执行上述的手势深度确定方法。A third aspect of the present invention provides a gesture depth determining device. The gesture depth determining device includes a memory and at least one processor, the memory stores instructions, and the memory and the at least one processor communicate with each other through a wire. Connect; the at least one processor calls the instructions in the memory, so that the gesture depth determination device executes the above gesture depth determination method.
本发明的第四方面提供了一种存储介质,所述存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述的手势深度确定方法。A fourth aspect of the present invention provides a storage medium in which instructions are stored, which when run on a computer, cause the computer to execute the aforementioned gesture depth determination method.
本发明通过深度相机采集手部深度图像,获取手部深度图像中包含的预设手腕特征点和预设手腕特征点的深度值;根据深度值判断预设手腕特征点是否为噪点;当预设手腕特征点为噪点时,获取离预设手腕特征点最近的预设手指特征点,作为目标手指特征点;判断目标手指特征点是否为噪点;当目标手指特征点不为噪点时,判断预设手腕特征点与目标手指特征点的连线上,除目标手指特征点以外的像素点是否均为噪点;当预设手腕特征点与目标手指特征点的连线上,除目标手指特征点以外的像素点不均为噪点时,计算预设手腕特征点与目标手指特征点的连线上的所有正常点的中值深度,作为预设手腕特征点的深度。这种方式当预设手腕特征点为噪点时,通过找正常点的中值深度取代该预设手腕特征点的深度,能够获取到更加稳定、准确的手部特征点深度信息,从而提高了基于深度相机测量手部特征点深度信息的准确性。The present invention collects the hand depth image through a depth camera, obtains the preset wrist feature points and the depth value of the preset wrist feature points contained in the hand depth image; judges whether the preset wrist feature points are noise points according to the depth value; when preset When the wrist feature point is a noise point, obtain the preset finger feature point closest to the preset wrist feature point as the target finger feature point; determine whether the target finger feature point is a noise point; when the target finger feature point is not a noise point, determine the preset On the connection line between the wrist feature point and the target finger feature point, whether the pixel points other than the target finger feature point are all noise; when the preset wrist feature point and the target finger feature point are on the line, except for the target finger feature point When the pixel points are not all noise points, the median depth of all normal points on the line connecting the preset wrist feature points and the target finger feature points is calculated as the depth of the preset wrist feature points. In this way, when the preset wrist feature points are noisy points, by finding the median depth of the normal points to replace the preset wrist feature points depth, more stable and accurate depth information of the hand feature points can be obtained, thereby improving the The depth camera measures the accuracy of the depth information of the feature points of the hand.
附图说明Description of the drawings
图1为本发明手势深度确定方法的一个实施例的流程示意图;FIG. 1 is a schematic flowchart of an embodiment of a method for determining a gesture depth according to the present invention;
图2为本发明实施例手部深度图像中包含的预设手部特征点的示意图;2 is a schematic diagram of preset hand feature points included in a hand depth image according to an embodiment of the present invention;
图3为本发明手势深度确定装置的一个实施例的模块示意图;FIG. 3 is a schematic diagram of modules of an embodiment of a gesture depth determining device of the present invention;
图4为本发明实施例提供的手势深度确定设备的结构示意图。Fig. 4 is a schematic structural diagram of a gesture depth determination device provided by an embodiment of the present invention.
具体实施方式Detailed ways
本发明实施例提供了一种手势深度确定方法、装置、设备及存储介质,当预设手腕特征点为噪点时,通过找正常点的中值深度取代该预设手腕特征点的深度,能够获取到更加稳定、准确的手部特征点深度信息,从而提高了基于深度相机测量手部特征点深度信息的准确性。The embodiment of the present invention provides a method, device, device and storage medium for determining the depth of a gesture. When the preset wrist feature point is a noisy point, the depth of the preset wrist feature point can be obtained by finding the median depth of the normal point instead of the preset wrist feature point. To obtain more stable and accurate depth information of the hand feature points, thereby improving the accuracy of measuring the depth information of the hand feature points based on the depth camera.
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”、“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这 里描述的实施例能够以除了在这里图示或描述的内容以外的顺序实施。此外,术语“包括”或“具有”及其任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, without having to use To describe a specific order or sequence. It should be understood that the data used in this way can be interchanged under appropriate circumstances, so that the embodiments described herein can be implemented in a sequence other than the content illustrated or described herein. In addition, the terms "including" or "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not necessarily limited to those clearly listed. Steps or units, but may include other steps or units that are not clearly listed or are inherent to these processes, methods, products, or equipment.
为便于理解,下面对本发明手势深度确定方法实施例的具体流程进行描述。For ease of understanding, the specific process of the embodiment of the method for determining the depth of the gesture of the present invention will be described below.
参照图1,图1为本发明手势深度确定方法的一个实施例的流程示意图,该方法包括:Referring to Fig. 1, Fig. 1 is a schematic flowchart of an embodiment of a method for determining a gesture depth according to the present invention. The method includes:
步骤101,通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;Step 101: Collect a hand depth image through a depth camera, and obtain preset wrist feature points included in the hand depth image and depth values of the preset wrist feature points;
可以理解的是,本发明的执行主体可以为手势深度确定装置,还可以是终端或者服务器,具体此处不做限定。本发明实施例以服务器为执行主体为例进行说明。It can be understood that the execution subject of the present invention may be a gesture depth determining device, and may also be a terminal or a server, which is not specifically limited here. The embodiment of the present invention is described by taking the server as the execution subject as an example.
在本实施例中,服务器与深度相机通信连接,该深度相机具备深度图像的拍摄功能,其具体型号可以灵活选择,比如可以是kinectV1(第一代kinect)或kinectV2(第二代kinect)等。In this embodiment, the server is in communication connection with a depth camera. The depth camera has a depth image shooting function, and its specific model can be flexibly selected. For example, it can be kinectV1 (first generation kinect) or kinectV2 (second generation kinect).
首先,服务器通过深度相机采集用户的手部深度图像,并获取该手部深度图像中包含的预设手指特征点。参照图2,图2为本发明实施例手部深度图像中包含的预设手部特征点的示意图,该手部深度图像中包含21个预设手部特征点,具体包括20个位于手指上的预设手指特征点:拇指T0、T1、T2、T3,食指I0、I1、I2、I3,中指M0、M1、M2、M3,无名指R0、R1、R2、R3,小指L0、L1、L2、L3,以及1个位于手腕上的预设手腕特征点W。First, the server collects a user's hand depth image through a depth camera, and obtains preset finger feature points contained in the hand depth image. Referring to FIG. 2, FIG. 2 is a schematic diagram of preset hand feature points included in a hand depth image according to an embodiment of the present invention. The hand depth image includes 21 preset hand feature points, specifically including 20 on the fingers The preset finger feature points: thumb T0, T1, T2, T3, index finger I0, I1, I2, I3, middle finger M0, M1, M2, M3, ring finger R0, R1, R2, R3, little finger L0, L1, L2 L3, and a preset wrist feature point W located on the wrist.
服务器获取采集到的手部深度图像中包含的预设手腕特征点的深度值,该深度值表示手部特征点距离深度相机的距离。具体地,服务器可以首先通过深度相机采集手部RGB图像和手部深度图像,通过对齐算法将两个图像进行对齐,从而定位手的位置和手的特征点在RGB图像中的位置,然后再将该RGB图像与手部深度图像进行坐标映射,从而获取到该手部深度图像中包含的整个手的深度信息,并通过深度学习模型获取手腕特征点和手指特征点的 二维信息,同时获取到该手部深度图像中包含的整个手的深度信息,再从整个手的深度信息中获取预设手腕特征点的深度值。The server obtains the depth value of the preset wrist feature point included in the collected hand depth image, and the depth value represents the distance of the hand feature point from the depth camera. Specifically, the server may first collect the hand RGB image and the hand depth image through the depth camera, and align the two images through the alignment algorithm to locate the position of the hand and the position of the feature points of the hand in the RGB image, and then The RGB image and the hand depth image are coordinate mapped to obtain the depth information of the entire hand contained in the hand depth image, and obtain the two-dimensional information of the wrist feature points and the finger feature points through the deep learning model, and at the same time obtain The depth information of the entire hand contained in the hand depth image, and then the depth value of the preset wrist feature point is obtained from the depth information of the entire hand.
步骤102,根据所述深度值判断所述预设手腕特征点是否为噪点;Step 102: Determine whether the preset wrist feature point is a noise point according to the depth value;
该步骤102具体可以包括:根据所述深度相机的工作深度范围,识别所述手部深度图像中的手部区域的非噪点,并获取所述非噪点的深度值;计算获取到的所有非噪点的深度值的中值,并计算所述预设手腕特征点的深度值与所述中值之差的绝对值;判断所述预设手腕特征点的深度值与所述中值之差的绝对值是否大于或等于预设阈值;当所述预设手腕特征点的深度值与所述中值之差的绝对值大于或等于预设阈值时,判定所述预设手腕特征点为噪点。This step 102 may specifically include: identifying non-noise points in the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points; calculating all the obtained non-noise points And calculate the absolute value of the difference between the depth value of the preset wrist feature point and the median value; determine the absolute value of the difference between the depth value of the preset wrist feature point and the median value Whether the value is greater than or equal to a preset threshold; when the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to the preset threshold, it is determined that the preset wrist feature point is a noise point.
以kinectV2相机为例,kinectV2相机的工作深度范围是500~4500mm,物体距离相机太近或太远都会导致深度信息测量不准确,为此,服务器可以获取该手部深度图像中的手部区域的所有像素点的深度值,将深度值处于500~4500mm的像素点作为非噪点,从而实现对手部深度图像中的手部区域的非噪点的识别。Take the kinectV2 camera as an example. The working depth range of the kinectV2 camera is 500~4500mm. If the object is too close or too far away from the camera, the depth information measurement will be inaccurate. For this reason, the server can obtain the depth information of the hand area in the hand depth image. For the depth values of all pixels, the pixels with a depth value of 500 to 4500 mm are regarded as non-noise points, so as to realize the recognition of non-noise points in the hand area in the hand depth image.
在识别到手部深度图像中的手部区域的非噪点后,计算所有非噪点的深度值的中值,即中位数。具体地,服务器获取每个非噪点的深度值并将该深度值由小到大进行排列,得到一个数列,当该数列的项数为奇数时,处于中间位置的深度值即为中位数,当该数列的项数为偶数时,中位数则为处于中间位置的2个深度值的平均数。之后,服务器计算手部深度图像中包含的预设手腕特征点的深度值与该中值之差的绝对值,并判断该绝对值是否大于或等于预设阈值,若该绝对值大于或等于预设阈值,则服务器判定该预设手腕特征点为噪点,反之,若该绝对值小于预设阈值,则说明该预设手腕特征点为非噪点,此时不对该预设手腕特征点的深度值做任何处理。After identifying the non-noise points of the hand region in the hand depth image, calculate the median value of the depth values of all non-noise points, that is, the median. Specifically, the server obtains the depth value of each non-noise point and arranges the depth values from small to large to obtain a series of numbers. When the number of items in the series is odd, the depth value in the middle position is the median. When the number of items in the sequence is even, the median is the average of the two depth values in the middle. After that, the server calculates the absolute value of the difference between the depth value of the preset wrist feature points contained in the hand depth image and the median value, and determines whether the absolute value is greater than or equal to the preset threshold, and if the absolute value is greater than or equal to the preset threshold If the threshold is set, the server determines that the preset wrist feature point is noisy. On the contrary, if the absolute value is less than the preset threshold, it means that the preset wrist feature point is non-noise. At this time, the depth value of the preset wrist feature point is not used. Do any processing.
需要说明的是,上述预设阈值可以根据人手掌的长度确定,比如可以将图2中特征点W到M3的长度作为预设阈值。因为拍照时手掌是可以活动的,如果手臂不动,手掌向前向后最大的距离就是手掌活动空间的深度阈值,超出该深度阈值的点均视为不正常。It should be noted that the foregoing preset threshold may be determined according to the length of a human palm. For example, the length of the feature point W to M3 in FIG. 2 may be used as the preset threshold. Because the palm can be moved when taking a photo, if the arm is not moving, the maximum forward and backward distance of the palm is the depth threshold of the palm space, and points beyond the depth threshold are regarded as abnormal.
步骤103,当所述预设手腕特征点为噪点时,当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特 征点;Step 103: When the preset wrist feature point is a noise point, and when the preset wrist feature point is a noise point, acquire the preset finger feature point closest to the preset wrist feature point as the target finger feature point;
该步骤中,当服务器判定预设手腕特征点为噪点时,获取离该预设手腕特征点最近的预设手指特征点,作为目标手指特征点。In this step, when the server determines that the preset wrist feature point is a noise point, the preset finger feature point closest to the preset wrist feature point is acquired as the target finger feature point.
步骤104,判断所述目标手指特征点是否为噪点;Step 104: Judge whether the feature point of the target finger is a noise point;
该步骤104具体可以包括:获取所述目标手指特征点的深度值,计算所述目标手指特征点的深度值与所述中值之差的绝对值;判断所述目标手指特征点的深度值与所述中值之差的绝对值是否大于或等于所述预设阈值;当所述目标手指特征点的深度值与所述中值之差的绝对值大于或等于所述预设阈值时,判定所述目标手指特征点为噪点;当所述目标手指特征点的深度值与所述中值之差的绝对值小于所述预设阈值时,判定所述目标手指特征点不为噪点。This step 104 may specifically include: obtaining the depth value of the feature point of the target finger, calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value; judging the depth value of the feature point of the target finger and Whether the absolute value of the difference between the median value is greater than or equal to the preset threshold; when the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold, it is determined The target finger feature point is a noise point; when the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold, it is determined that the target finger feature point is not a noise point.
在判断目标手指特征点是否为噪点时,服务器首先获取该目标手指特征点的深度值,计算该目标手指特征点的深度值与上述中值之差的绝对值,然后判断该绝对值是否大于或等于上述预设阈值,若该绝对值大于或等于预设阈值,则服务器判定该目标手指特征点为噪点,反之,若该绝对值小于预设阈值,则判定该目标手指特征点不为噪点。When judging whether the target finger feature point is a noise point, the server first obtains the depth value of the target finger feature point, calculates the absolute value of the difference between the depth value of the target finger feature point and the above median value, and then determines whether the absolute value is greater than or Equal to the aforementioned preset threshold. If the absolute value is greater than or equal to the preset threshold, the server determines that the target finger feature point is a noise; otherwise, if the absolute value is less than the preset threshold, then it determines that the target finger feature point is not a noise.
步骤105,当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;Step 105: When the target finger feature point is not a noise point, determine whether the pixel points other than the target finger feature point on the line connecting the preset wrist feature point and the target finger feature point are all noise points ;
该步骤中,当目标手部特征点不为噪点时,服务器进一步判断预设手腕特征点与该目标手部特征点的连线上,除该目标手部特征点以外的像素点是否均为噪点,具体判断方式可以参照上述判断目标手部特征点是否为噪点的方式,此处不做赘述。In this step, when the target hand feature point is not a noise point, the server further determines whether the pixel points other than the target hand feature point are noise points on the connection line between the preset wrist feature point and the target hand feature point , The specific judgment method can refer to the above method of judging whether the target hand feature point is a noise point, which will not be repeated here.
步骤106,当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。Step 106: When the pixel points other than the target finger feature points are not all noise points on the line connecting the preset wrist feature points and the target finger feature points, calculate the preset wrist feature points and all the points. The median depth of all normal points on the line connecting the feature points of the target finger is used as the depth of the preset wrist feature points.
该步骤中,当预设手腕特征点与目标手部特征点的连线上,除该目标手部特征点以外的像素点不均为噪点时,服务器获取该连线上的所有正常点(即非噪点)的深度值的中值,即中值深度,将该中值深度作为该预设手腕特征 点的深度。In this step, when the preset wrist feature point and the target hand feature point are on the line, and the pixel points other than the target hand feature point are not all noise points, the server obtains all normal points on the line (ie The median value of the depth value of the non-noise point, that is, the median depth, the median depth is taken as the depth of the preset wrist feature point.
以图2中的手腕特征点W为例,对于手腕特征点W,当判定其为噪点时,则获取离W最近的手指特征点T0,将T0作为目标手指特征点并判断T0是否为噪点,如果T0不为噪点,则判断W与T0连线上的像素点是否均为噪点,若不均为噪点,则计算W与T0连线上的所有正常点的中值深度,将该中值深度作为手腕特征点W的深度。Take the wrist feature point W in Figure 2 as an example. For the wrist feature point W, when it is determined to be a noise point, obtain the finger feature point T0 closest to W, use T0 as the target finger feature point and determine whether T0 is a noise point. If T0 is not a noise, then determine whether the pixels on the line between W and T0 are all noise, if not, then calculate the median depth of all normal points on the line between W and T0, and use the median depth As the depth of the characteristic point W of the wrist.
本实施例通过上述方式,当预设手腕特征点为噪点时,通过找正常点的中值深度取代该预设手腕特征点的深度,能够获取到更加稳定、准确的手部特征点深度信息,从而提高了基于深度相机测量手部特征点深度信息的准确性。In this embodiment, through the above method, when the preset wrist feature points are noisy points, by finding the median depth of the normal points to replace the preset wrist feature points depth, more stable and accurate hand feature point depth information can be obtained. Thereby, the accuracy of measuring the depth information of the feature points of the hand based on the depth camera is improved.
进一步地,基于本发明手势深度确定方法第一实施例,提出本发明手势深度确定方法第二实施例。Further, based on the first embodiment of the gesture depth determination method of the present invention, a second embodiment of the gesture depth determination method of the present invention is proposed.
在本实施例中,上述步骤104之后,还可以包括:当所述目标手指特征点为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。In this embodiment, after the above step 104, it may further include: when the target finger feature point is a noisy point, acquiring the depth of the normal point closest to the preset wrist feature point, and setting it to the preset wrist feature point The depth of the nearest normal point is used as the depth of the preset wrist feature point.
具体地,当目标手部特征点为噪点时,服务器可以在预设手腕特征点所在的手指上,以该预设手腕特征点为中心,检索离该预设手腕特征点最近的正常点,即深度值处于预设范围内的像素点,然后将该正常点的深度作为该预设手腕特征点的深度。Specifically, when the target hand feature point is a noisy point, the server may search for the normal point closest to the preset wrist feature point on the finger where the preset wrist feature point is located, centering on the preset wrist feature point, that is, For pixel points whose depth value is within a preset range, then the depth of the normal point is used as the depth of the preset wrist feature point.
以图2中的手腕特征点W为例,对于手腕特征点W,当判定其为噪点时,则获取离W最近的正常点,该正常点的深度即可作为手腕特征点W的深度。Taking the wrist feature point W in FIG. 2 as an example, for the wrist feature point W, when it is determined to be a noisy point, the normal point closest to W is obtained, and the depth of the normal point can be used as the depth of the wrist feature point W.
进一步地,上述步骤105之后,还可以包括:当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点均为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。Further, after the above step 105, it may further include: when the pixel points other than the target finger feature point are noise points on the line connecting the preset wrist feature point and the target finger feature point, obtaining the distance The depth of the normal point closest to the preset wrist feature point is the depth of the normal point closest to the preset wrist feature point as the depth of the preset wrist feature point.
具体地,当预设手腕特征点与目标手部特征点的连线上,除该目标手部特征点以外的像素点均为噪点时,服务器可以在预设手腕特征点所在的手指上,以该预设手腕特征点为中心,检索离该预设手腕特征点最近的正常点,即深度值处于预设范围内的像素点,然后将该正常点的深度作为该预设手腕 特征点的深度。Specifically, when the preset wrist feature point and the target hand feature point are on the line, and the pixel points other than the target hand feature point are all noise points, the server may place the finger on the finger where the preset wrist feature point is located. The preset wrist feature point is the center, and the normal point closest to the preset wrist feature point is retrieved, that is, the pixel point whose depth value is within the preset range, and then the depth of the normal point is taken as the depth of the preset wrist feature point .
以图2中的手腕特征点W为例,对于手腕特征点W,当判定其为噪点时,则获取离W最近的手指特征点T0,将T0作为目标手指特征点并判断T0是否为噪点,如果T0不为噪点,则可以以W为中心检索离W最近的正常点,该正常点的深度即可作为手腕特征点W的深度。Take the wrist feature point W in Figure 2 as an example. For the wrist feature point W, when it is determined to be a noise point, obtain the finger feature point T0 closest to W, use T0 as the target finger feature point and determine whether T0 is a noise point. If T0 is not a noise point, the normal point closest to W can be retrieved with W as the center, and the depth of the normal point can be used as the depth of the wrist feature point W.
在本实施例中,当目标手部特征点为噪点,或者当预设手腕特征点与目标手部特征点的连线上,除目标手部特征点以外的像素点均为噪点时,通过将离该预设手腕特征点最近的正常点的深度作为该预设手腕特征点的深度,进一步提高了基于深度相机测量手部特征点深度信息的准确性。In this embodiment, when the target hand feature point is a noise point, or when the preset wrist feature point and the target hand feature point are connected to the target hand feature point, the pixel points other than the target hand feature point are all noise points. The depth of the normal point closest to the preset wrist feature point is used as the depth of the preset wrist feature point, which further improves the accuracy of measuring the depth information of the hand feature point based on the depth camera.
本发明实施例还提供一种手势深度确定装置。The embodiment of the present invention also provides a gesture depth determination device.
参照图3,图3为本发明手势深度确定装置的一个实施例的模块示意图。本实施例中,所述手势深度确定装置包括:Referring to FIG. 3, FIG. 3 is a schematic diagram of modules of an embodiment of a gesture depth determining device of the present invention. In this embodiment, the device for determining the gesture depth includes:
采集模块301,用于通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;The acquisition module 301 is configured to acquire a hand depth image through a depth camera, and acquire preset wrist feature points contained in the hand depth image and the depth value of the preset wrist feature points;
第一判断模块302,用于根据所述深度值判断所述预设手腕特征点是否为噪点;The first determining module 302 is configured to determine whether the preset wrist feature point is a noise point according to the depth value;
获取模块303,用于当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特征点;The acquiring module 303 is configured to acquire the preset finger feature point closest to the preset wrist feature point as the target finger feature point when the preset wrist feature point is a noise point;
第二判断模块304,用于判断所述目标手指特征点是否为噪点;The second judgment module 304 is configured to judge whether the feature point of the target finger is a noise point;
第三判断模块305,用于当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;The third judging module 305 is used for judging pixels other than the target finger feature points on the line connecting the preset wrist feature points and the target finger feature points when the target finger feature points are not noise points Whether the points are all noise points;
计算模块306,用于当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。The calculation module 306 is configured to calculate the preset wrist feature when the pixel points other than the target finger feature point are not all noise points on the connection line between the preset wrist feature point and the target finger feature point The median depth of all normal points on the line connecting the point and the feature point of the target finger is taken as the depth of the preset wrist feature point.
可选的,所述第一判断模块302还用于:Optionally, the first judgment module 302 is further configured to:
根据所述深度相机的工作深度范围,识别所述手部深度图像中的手部区域的非噪点,并获取所述非噪点的深度值;Identifying the non-noise points of the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points;
计算获取到的所有非噪点的深度值的中值,并计算所述预设手腕特征点的深度值与所述中值之差的绝对值;Calculating the median value of the acquired depth values of all non-noise points, and calculating the absolute value of the difference between the depth value of the preset wrist feature point and the median value;
判断所述预设手腕特征点的深度值与所述中值之差的绝对值是否大于或等于预设阈值;Judging whether the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold;
当所述预设手腕特征点的深度值与所述中值之差的绝对值大于或等于预设阈值时,判定所述预设手腕特征点为噪点。When the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold value, it is determined that the preset wrist feature point is a noise point.
可选的,所述第二判断模块304还用于:Optionally, the second judgment module 304 is further configured to:
获取所述目标手指特征点的深度值,计算所述目标手指特征点的深度值与所述中值之差的绝对值;Acquiring the depth value of the feature point of the target finger, and calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value;
判断所述目标手指特征点的深度值与所述中值之差的绝对值是否大于或等于所述预设阈值;Judging whether the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold;
当所述目标手指特征点的深度值与所述中值之差的绝对值大于或等于所述预设阈值时,判定所述目标手指特征点为噪点;When the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold value, determining that the target finger feature point is a noise point;
当所述目标手指特征点的深度值与所述中值之差的绝对值小于所述预设阈值时,判定所述目标手指特征点不为噪点。When the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold value, it is determined that the target finger feature point is not a noise point.
可选的,所述手势深度确定装置还包括:Optionally, the device for determining the gesture depth further includes:
第一处理模块,用于当所述目标手指特征点为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。The first processing module is configured to obtain the depth of the normal point closest to the preset wrist feature point when the target finger feature point is a noise point, and use the depth of the normal point closest to the preset wrist feature point as The depth of the preset wrist feature point.
可选的,所述手势深度确定装置还包括:Optionally, the device for determining the gesture depth further includes:
第二处理模块,用于当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点均为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。The second processing module is configured to obtain a distance from the preset wrist when the pixel points other than the target finger feature points are noise points on the connection line between the preset wrist feature point and the target finger feature point. The depth of the normal point closest to the feature point is the depth of the normal point closest to the preset wrist feature point as the depth of the preset wrist feature point.
上述手势深度确定装置中各个模块的功能实现及有益效果与上述手势深度确定方法实施例中各步骤相对应,此处不再赘述。The functional implementation and beneficial effects of each module in the aforementioned gesture depth determination device correspond to the steps in the aforementioned gesture depth determination method embodiment, and will not be repeated here.
上面从模块化功能实体的角度对本发明实施例中的手势深度确定装置进行了详细描述,下面从硬件处理的角度对本发明实施例中手势深度确定设备进行详细描述。The device for determining the gesture depth in the embodiment of the present invention is described in detail above from the perspective of a modular functional entity, and the device for determining the gesture depth in the embodiment of the present invention is described in detail below from the perspective of hardware processing.
参照图4,图4为本发明实施例提供的手势深度确定设备的结构示意图。该手势深度确定设备400可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上处理器(central processing units,CPU)410(例如,一个或一个以上处理器)和存储器420,一个或一个以上存储应用程序433或数据432的存储介质430(例如一个或一个以上海量存储设备)。其中,存储器420和存储介质430可以是短暂存储或持久存储。存储在存储介质430的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括对手势深度确定设备400中的一系列指令操作。更进一步地,处理器410可以设置为与存储介质430通信,在手势深度确定设备400上执行存储介质430中的一系列指令操作。Referring to FIG. 4, FIG. 4 is a schematic structural diagram of a gesture depth determination device provided by an embodiment of the present invention. The gesture depth determination device 400 may have relatively large differences due to different configurations or performances, and may include one or more processors (central processing units, CPU) 410 (for example, one or more processors) and a memory 420. Or more than one storage medium 430 (for example, one or one storage device with a large amount of storage) storing application programs 433 or data 432. Among them, the memory 420 and the storage medium 430 may be short-term storage or persistent storage. The program stored in the storage medium 430 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations in the device 400 for determining the gesture depth. Furthermore, the processor 410 may be configured to communicate with the storage medium 430, and execute a series of instruction operations in the storage medium 430 on the gesture depth determining device 400.
手势深度确定设备400还可以包括一个或一个以上电源440,一个或一个以上有线或无线网络接口450,一个或一个以上输入输出接口460,和/或,一个或一个以上操作系统431,例如Windows Serve,Mac OS X,Unix,Linux,FreeBSD等等。本领域技术人员可以理解,图4示出的手势深度确定设备结构并不构成对基于手势深度确定设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。The gesture depth determination device 400 may also include one or more power supplies 440, one or more wired or wireless network interfaces 450, one or more input and output interfaces 460, and/or one or more operating systems 431, such as Windows Serve , Mac OS X, Unix, Linux, FreeBSD, etc. Those skilled in the art can understand that the structure of the gesture depth determination device shown in FIG. 4 does not constitute a limitation on the gesture depth determination device, and may include more or less components than shown in the figure, or combine certain components, or different The layout of the components.
本发明还提供一种存储介质,该存储介质可以为非易失性存储介质,也可以为易失性存储介质,所述存储介质中存储有手势深度确定程序,所述手势深度确定程序被处理器执行时实现如上所述的手势深度确定方法的步骤。The present invention also provides a storage medium. The storage medium may be a non-volatile storage medium or a volatile storage medium. The storage medium stores a gesture depth determination program, and the gesture depth determination program is processed. When the device is executed, the steps of the method for determining the depth of the gesture as described above are implemented.
其中,在所述处理器上运行的手势深度确定程序被执行时所实现的方法及有益效果可参照本发明手势深度确定方法的各个实施例,此处不再赘述。The method and beneficial effects achieved when the gesture depth determination program running on the processor is executed can refer to the various embodiments of the gesture depth determination method of the present invention, which will not be repeated here.
本领域技术人员可以理解,上述集成的模块或单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储 器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。Those skilled in the art can understand that if the above integrated modules or units are implemented in the form of software functional units and sold or used as independent products, they can be stored in a storage medium. Based on this understanding, the technical solution of the present invention essentially or the part that contributes to the existing technology or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. , Including several instructions to make a computer device (which can be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disks or optical disks and other media that can store program codes. .
以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。The above embodiments are only used to illustrate the technical solutions of the present invention, not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that: The recorded technical solutions are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

  1. 一种手势深度确定方法,其特征在于,所述手势深度确定方法包括如下步骤:A method for determining the depth of a gesture, characterized in that the method for determining the depth of a gesture includes the following steps:
    通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;Acquiring a hand depth image by a depth camera, and acquiring preset wrist feature points included in the hand depth image and depth values of the preset wrist feature points;
    根据所述深度值判断所述预设手腕特征点是否为噪点;Judging whether the preset wrist feature point is a noise point according to the depth value;
    当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特征点;When the preset wrist feature point is a noise point, acquiring the preset finger feature point closest to the preset wrist feature point as the target finger feature point;
    判断所述目标手指特征点是否为噪点;Judging whether the feature point of the target finger is a noise point;
    当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;When the target finger feature point is not a noise point, determining whether pixels other than the target finger feature point are all noise points on the line connecting the preset wrist feature point and the target finger feature point;
    当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。When the pixel points other than the target finger feature points on the line connecting the preset wrist feature points and the target finger feature points are not all noise points, the preset wrist feature points and the target finger are calculated The median depth of all normal points on the line of the characteristic points is taken as the depth of the preset wrist characteristic points.
  2. 如权利要求1所述的手势深度确定方法,其特征在于,所述根据所述深度值判断所述预设手腕特征点是否为噪点的步骤包括:The method for determining the depth of a gesture according to claim 1, wherein the step of judging whether the preset wrist feature point is a noise according to the depth value comprises:
    根据所述深度相机的工作深度范围,识别所述手部深度图像中的手部区域的非噪点,并获取所述非噪点的深度值;Identifying the non-noise points of the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points;
    计算获取到的所有非噪点的深度值的中值,并计算所述预设手腕特征点的深度值与所述中值之差的绝对值;Calculating the median value of the acquired depth values of all non-noise points, and calculating the absolute value of the difference between the depth value of the preset wrist feature point and the median value;
    判断所述预设手腕特征点的深度值与所述中值之差的绝对值是否大于或等于预设阈值;Judging whether the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold;
    当所述预设手腕特征点的深度值与所述中值之差的绝对值大于或等于预设阈值时,判定所述预设手腕特征点为噪点。When the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold value, it is determined that the preset wrist feature point is a noise point.
  3. 如权利要求2所述的手势深度确定方法,其特征在于,所述判断所述目标手指特征点是否为噪点的步骤包括:The method for determining the depth of a gesture according to claim 2, wherein the step of judging whether the feature point of the target finger is a noise point comprises:
    获取所述目标手指特征点的深度值,计算所述目标手指特征点的深度值 与所述中值之差的绝对值;Acquiring the depth value of the feature point of the target finger, and calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value;
    判断所述目标手指特征点的深度值与所述中值之差的绝对值是否大于或等于所述预设阈值;Judging whether the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold;
    当所述目标手指特征点的深度值与所述中值之差的绝对值大于或等于所述预设阈值时,判定所述目标手指特征点为噪点;When the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold value, determining that the target finger feature point is a noise point;
    当所述目标手指特征点的深度值与所述中值之差的绝对值小于所述预设阈值时,判定所述目标手指特征点不为噪点。When the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold value, it is determined that the target finger feature point is not a noise point.
  4. 如权利要求1-3中任一项所述的手势深度确定方法,其特征在于,所述判断所述目标手指特征点是否为噪点的步骤之后,还包括:The method for determining the depth of a gesture according to any one of claims 1 to 3, wherein after the step of determining whether the feature point of the target finger is a noise point, the method further comprises:
    当所述目标手指特征点为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。When the target finger feature point is a noise point, obtain the depth of the normal point closest to the preset wrist feature point, and use the depth of the normal point closest to the preset wrist feature point as the preset wrist feature point depth.
  5. 如权利要求1-3中任一项所述的手势深度确定方法,其特征在于,所述判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点的步骤之后,还包括:The method for determining the depth of a gesture according to any one of claims 1 to 3, wherein the line connecting the predetermined wrist feature point and the target finger feature point is determined, except for the target finger feature point. After the steps of whether the pixels other than the dots are all noises, it also includes:
    当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点均为噪点时,获取离所述预设手腕特征点最近的正常点的深度,将离所述预设手腕特征点最近的正常点的深度作为所述预设手腕特征点的深度。When on the line connecting the preset wrist feature point and the target finger feature point, pixel points other than the target finger feature point are noise points, obtain the normal point closest to the preset wrist feature point Depth, the depth of the normal point closest to the preset wrist feature point is taken as the depth of the preset wrist feature point.
  6. 一种手势深度确定装置,其特征在于,所述手势深度确定装置包括:A gesture depth determining device, characterized in that the gesture depth determining device includes:
    采集模块,用于通过深度相机采集手部深度图像,获取所述手部深度图像中包含的预设手腕特征点和所述预设手腕特征点的深度值;An acquisition module, configured to acquire a hand depth image through a depth camera, and acquire preset wrist feature points contained in the hand depth image and the depth value of the preset wrist feature points;
    第一判断模块,用于根据所述深度值判断所述预设手腕特征点是否为噪点;The first judgment module is configured to judge whether the preset wrist feature point is a noise point according to the depth value;
    获取模块,用于当所述预设手腕特征点为噪点时,获取离所述预设手腕特征点最近的预设手指特征点,作为目标手指特征点;An acquiring module, configured to acquire a preset finger feature point closest to the preset wrist feature point as the target finger feature point when the preset wrist feature point is a noise point;
    第二判断模块,用于判断所述目标手指特征点是否为噪点;The second judgment module is used to judge whether the feature point of the target finger is a noise point;
    第三判断模块,用于当所述目标手指特征点不为噪点时,判断所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点是否均为噪点;The third judgment module is used for judging the pixel points other than the target finger feature points on the line between the preset wrist feature points and the target finger feature points when the target finger feature points are not noise points Are they all noisy;
    计算模块,用于当所述预设手腕特征点与所述目标手指特征点的连线上,除所述目标手指特征点以外的像素点不均为噪点时,计算所述预设手腕特征点与所述目标手指特征点的连线上的所有正常点的中值深度,作为所述预设手腕特征点的深度。A calculation module for calculating the preset wrist feature point when the pixel points other than the target finger feature point on the line connecting the preset wrist feature point and the target finger feature point are not all noise points The median depth of all normal points on the line with the feature point of the target finger is taken as the depth of the preset wrist feature point.
  7. 如权利要求6所述的手势深度确定装置,其特征在于,所述第一判断模块还用于:7. The gesture depth determining device according to claim 6, wherein the first judgment module is further configured to:
    根据所述深度相机的工作深度范围,识别所述手部深度图像中的手部区域的非噪点,并获取所述非噪点的深度值;Identifying the non-noise points of the hand region in the hand depth image according to the working depth range of the depth camera, and obtaining the depth value of the non-noise points;
    计算获取到的所有非噪点的深度值的中值,并计算所述预设手腕特征点的深度值与所述中值之差的绝对值;Calculating the median value of the acquired depth values of all non-noise points, and calculating the absolute value of the difference between the depth value of the preset wrist feature point and the median value;
    判断所述预设手腕特征点的深度值与所述中值之差的绝对值是否大于或等于预设阈值;Judging whether the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold;
    当所述预设手腕特征点的深度值与所述中值之差的绝对值大于或等于预设阈值时,判定所述预设手腕特征点为噪点。When the absolute value of the difference between the depth value of the preset wrist feature point and the median value is greater than or equal to a preset threshold value, it is determined that the preset wrist feature point is a noise point.
  8. 如权利要求7所述的手势深度确定装置,其特征在于,所述第二判断模块还用于:8. The gesture depth determining device according to claim 7, wherein the second judgment module is further configured to:
    获取所述目标手指特征点的深度值,计算所述目标手指特征点的深度值与所述中值之差的绝对值;Acquiring the depth value of the feature point of the target finger, and calculating the absolute value of the difference between the depth value of the feature point of the target finger and the median value;
    判断所述目标手指特征点的深度值与所述中值之差的绝对值是否大于或等于所述预设阈值;Judging whether the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold;
    当所述目标手指特征点的深度值与所述中值之差的绝对值大于或等于所述预设阈值时,判定所述目标手指特征点为噪点;When the absolute value of the difference between the depth value of the target finger feature point and the median value is greater than or equal to the preset threshold value, determining that the target finger feature point is a noise point;
    当所述目标手指特征点的深度值与所述中值之差的绝对值小于所述预设阈值时,判定所述目标手指特征点不为噪点。When the absolute value of the difference between the depth value of the target finger feature point and the median value is less than the preset threshold value, it is determined that the target finger feature point is not a noise point.
  9. 一种手势深度确定设备,其特征在于,所述手势深度确定设备包括:存储器和至少一个处理器,所述存储器中存储有指令,所述存储器和所述至少一个处理器通过线路互连;A gesture depth determination device, wherein the gesture depth determination device includes a memory and at least one processor, the memory stores instructions, and the memory and the at least one processor are interconnected by wires;
    所述至少一个处理器调用所述存储器中的所述指令,以使得所述手势深度确定设备执行如权利要求1-7中任一项所述的手势深度确定方法。The at least one processor invokes the instructions in the memory, so that the gesture depth determination device executes the gesture depth determination method according to any one of claims 1-7.
  10. 一种存储介质,所述存储介质上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1-7中任一项所述的手势深度确定方法。A storage medium with a computer program stored on the storage medium, wherein the computer program implements the gesture depth determination method according to any one of claims 1-7 when the computer program is executed by a processor.
PCT/CN2021/073715 2020-02-26 2021-01-26 Method, device and apparatus for determining depth of gesture, and storage medium WO2021169704A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010119178.3A CN111354029B (en) 2020-02-26 2020-02-26 Gesture depth determination method, device, equipment and storage medium
CN202010119178.3 2020-02-26

Publications (1)

Publication Number Publication Date
WO2021169704A1 true WO2021169704A1 (en) 2021-09-02

Family

ID=71197283

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/073715 WO2021169704A1 (en) 2020-02-26 2021-01-26 Method, device and apparatus for determining depth of gesture, and storage medium

Country Status (2)

Country Link
CN (1) CN111354029B (en)
WO (1) WO2021169704A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110728748B (en) * 2019-09-30 2023-05-05 中国科学院国家天文台南京天文光学技术研究所 Rendering method based on hemispherical orthogonal function
CN111368675B (en) * 2020-02-26 2023-06-20 深圳市瑞立视多媒体科技有限公司 Gesture depth information processing method, device, equipment and storage medium
CN111354029B (en) * 2020-02-26 2023-05-05 深圳市瑞立视多媒体科技有限公司 Gesture depth determination method, device, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899591A (en) * 2015-06-17 2015-09-09 吉林纪元时空动漫游戏科技股份有限公司 Wrist point and arm point extraction method based on depth camera
CN106055091A (en) * 2016-05-16 2016-10-26 电子科技大学 Hand posture estimation method based on depth information and calibration method
US20170045952A1 (en) * 2012-06-18 2017-02-16 Microsoft Technology Licensing, Llc Dynamic Hand Gesture Recognition Using Depth Data
CN107885327A (en) * 2017-10-27 2018-04-06 长春理工大学 A kind of Fingertip Detection based on Kinect depth information
CN111354029A (en) * 2020-02-26 2020-06-30 深圳市瑞立视多媒体科技有限公司 Gesture depth determination method, device, equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107818290B (en) * 2016-09-14 2021-03-16 京东方科技集团股份有限公司 Heuristic finger detection method based on depth map
CN106529480A (en) * 2016-11-14 2017-03-22 江汉大学 Finger tip detection and gesture identification method and system based on depth information
CN107688779A (en) * 2017-08-18 2018-02-13 北京航空航天大学 A kind of robot gesture interaction method and apparatus based on RGBD camera depth images
CN108647597B (en) * 2018-04-27 2021-02-02 京东方科技集团股份有限公司 Wrist identification method, gesture identification method and device and electronic equipment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170045952A1 (en) * 2012-06-18 2017-02-16 Microsoft Technology Licensing, Llc Dynamic Hand Gesture Recognition Using Depth Data
CN104899591A (en) * 2015-06-17 2015-09-09 吉林纪元时空动漫游戏科技股份有限公司 Wrist point and arm point extraction method based on depth camera
CN106055091A (en) * 2016-05-16 2016-10-26 电子科技大学 Hand posture estimation method based on depth information and calibration method
CN107885327A (en) * 2017-10-27 2018-04-06 长春理工大学 A kind of Fingertip Detection based on Kinect depth information
CN111354029A (en) * 2020-02-26 2020-06-30 深圳市瑞立视多媒体科技有限公司 Gesture depth determination method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN111354029B (en) 2023-05-05
CN111354029A (en) 2020-06-30

Similar Documents

Publication Publication Date Title
WO2021169704A1 (en) Method, device and apparatus for determining depth of gesture, and storage medium
US11749025B2 (en) Eye pose identification using eye features
CN110689584B (en) Active rigid body pose positioning method in multi-camera environment and related equipment
KR102334139B1 (en) Eye gaze tracking based upon adaptive homography mapping
WO2018177337A1 (en) Method and apparatus for determining three-dimensional hand data, and electronic device
CN111480164B (en) Head pose and distraction estimation
US11847796B2 (en) Calibrating cameras using human skeleton
JP5924862B2 (en) Information processing apparatus, information processing method, and program
KR20170065573A (en) Improved calibration for eye tracking systems
WO2021136386A1 (en) Data processing method, terminal, and server
US10991124B2 (en) Determination apparatus and method for gaze angle
KR20120048370A (en) Object pose recognition apparatus and method using the same
US9727776B2 (en) Object orientation estimation
CN113474816A (en) Elastic dynamic projection mapping system and method
JP6817742B2 (en) Information processing device and its control method
TW202242716A (en) Methods, apparatuses, devices and storage media for object matching
WO2021169705A1 (en) Method, apparatus and device for processing gesture depth information, and storage medium
US9786030B1 (en) Providing focal length adjustments
JP2004046464A (en) Apparatus and method for estimating three-dimensional position of mobile object, program, and recording medium thereof
WO2022174603A1 (en) Pose prediction method, pose prediction apparatus, and robot
CN113140031B (en) Three-dimensional image modeling system and method and oral cavity scanning equipment applying same
CN114742967A (en) Visual positioning method and device based on building digital twin semantic graph
RU2694140C1 (en) Method of human identification in a mode of simultaneous operation of a group of video cameras
CN114494857A (en) Indoor target object identification and distance measurement method based on machine vision
JP2017227687A (en) Camera assembly, finger shape detection system using camera assembly, finger shape detection method using camera assembly, program implementing detection method, and recording medium of program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21761088

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 06.02.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21761088

Country of ref document: EP

Kind code of ref document: A1