WO2021077721A1 - Method, apparatus and system for reconstructing three-dimensional model of human body, and readable storage medium - Google Patents

Method, apparatus and system for reconstructing three-dimensional model of human body, and readable storage medium Download PDF

Info

Publication number
WO2021077721A1
WO2021077721A1 PCT/CN2020/089885 CN2020089885W WO2021077721A1 WO 2021077721 A1 WO2021077721 A1 WO 2021077721A1 CN 2020089885 W CN2020089885 W CN 2020089885W WO 2021077721 A1 WO2021077721 A1 WO 2021077721A1
Authority
WO
WIPO (PCT)
Prior art keywords
image sequence
human body
depth
server
depth image
Prior art date
Application number
PCT/CN2020/089885
Other languages
French (fr)
Chinese (zh)
Inventor
张吉
张遥
李竹
王琳
Original Assignee
深圳奥比中光科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳奥比中光科技有限公司 filed Critical 深圳奥比中光科技有限公司
Publication of WO2021077721A1 publication Critical patent/WO2021077721A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/521Depth or shape recovery from laser ranging, e.g. using interferometry; from the projection of structured light
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10048Infrared image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person

Definitions

  • This application relates to the field of computer vision technology, and in particular to a method, device, system and readable storage medium for reconstructing a three-dimensional human body model.
  • Three-dimensional reconstruction is the future core basic technology for the development of computer vision.
  • the current development and application is aimed at groups with specific shapes and characteristics, such as the human body, in film and television entertainment and life applications.
  • the embodiments of the present application provide a method, device, system, and readable storage medium for reconstructing a three-dimensional human body model, and provide an efficient solution for reconstructing a three-dimensional human body model.
  • an embodiment of the present application provides a method for reconstructing a three-dimensional human body model, including:
  • the depth camera converts the infrared image sequence Disparity image sequence or depth image sequence, so that the server does not need to directly process the infrared image sequence, reduces the amount of data calculation of the server, reduces the system resource occupation, and greatly improves the efficiency of human body 3D model reconstruction; on the other hand, the depth camera will By uploading the compressed data after compression and encoding to the server, the data transmission efficiency is improved, and the efficiency of the reconstruction of the three-dimensional human body model is further improved.
  • an apparatus for reconstructing a three-dimensional human body model including:
  • the image acquisition unit is used to acquire multiple frames of infrared image sequences including various parts of the human body;
  • An image processing unit configured to process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence
  • a compression coding unit configured to perform compression coding on the disparity image sequence and the first parameter, or perform compression coding on the depth image sequence and the second parameter;
  • the data uploading unit is configured to upload compressed data that has been compressed and encoded to a server, and the compressed data is used to instruct the server to decompress and decode the received compressed data to reconstruct a real three-dimensional human body model.
  • an embodiment of the present application provides a depth camera, including a memory, a processor, and a computer program stored in the memory and running on the processor.
  • the processor executes the computer program, Implement the method as described in the first aspect.
  • an embodiment of the present application provides a depth camera, including the device as described in the second aspect.
  • an embodiment of the present application provides a system for reconstructing a three-dimensional human body model, including a server, and the depth camera according to the third aspect or the fourth aspect, and the server is configured to compare the received compressed data After decompression and decoding, a real three-dimensional model of the human body is reconstructed.
  • an embodiment of the present application provides a computer-readable storage medium that stores a computer program that implements the method described in the first aspect when the computer program is executed by a processor.
  • embodiments of the present application provide a computer program product, which when the computer program product runs on an electronic device, causes the electronic device to execute the method described in the first aspect.
  • Fig. 1 is a schematic diagram of a system for reconstructing a three-dimensional human body model provided by an embodiment of the present application
  • FIG. 2 is a schematic diagram of a distribution network of a system for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
  • FIG. 3 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application
  • FIG. 4 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application
  • Fig. 5 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application
  • Fig. 6 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application
  • FIG. 7 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application.
  • FIG. 1 shows a system for reconstructing a three-dimensional human body model provided by the present application, including a depth camera 101 connected in two-to-two communication, a client (mobile phone shown in FIG. 1) 102, and a server 103.
  • the measurement principle of the system is: the client 102 initiates a measurement instruction to the depth camera 101, and after receiving the measurement instruction, the depth camera 101 takes a picture of the human body to collect multiple frames of parallax image sequences or depth image sequences including various parts of the human body and upload them To the server 103, the server 103 performs real three-dimensional (3D) reconstruction of the human body according to the received parallax image sequence or depth image sequence, and selects key parts on the 3D model for measurement, so as to obtain the corresponding data of the measured human body. After the measurement is completed, The final three-dimensional data is transmitted to the client 102 that initiated the measurement instruction.
  • 3D three-dimensional
  • FIG. 2 is an implementation diagram of the distribution network process in one of the embodiments of this application.
  • the specific process is as follows: the client 102 starts the network configuration and searches for connectable Bluetooth devices. When it finds the Deepin Camera 101, it connects to it with Bluetooth. After the connection is successful, the Deepin Camera 101 will scan the nearby WiFi QR code and generate it. The WiFi list is transmitted to the client 102 through a Bluetooth unit (not shown). The client 102 selects a WiFi and enters the WiFi password. If the connection is successful, the network configuration is completed.
  • the Deepin Camera can use its WiFi unit (not shown) ) Access the server 103.
  • WiFi unit not shown
  • FIG. 2 the WiFi distribution network is taken as an example for description, and the basis is only an exemplary description, and cannot be construed as a specific limitation to the present application.
  • the depth camera may be a depth camera based on structured light, binocular, and time of flight (TOF) technology.
  • the depth camera may also be a depth camera including a color camera module, such as a depth camera including an RGB camera module. In this way, both depth images containing depth information and color images containing rich texture information can be obtained.
  • the client may also be a tablet computer, a wearable device, a vehicle-mounted device, an augmented reality (AR)/virtual reality (VR) device, a notebook computer, a super mobile personal computer (
  • AR augmented reality
  • VR virtual reality
  • UMPC ultra-mobile personal computer
  • PDA personal digital assistant
  • the server may also be an independent server, a server cluster, or a distributed server, etc.
  • the embodiments of the present application do not impose any restriction on the specific type of the server.
  • FIG. 3 shows an implementation flowchart of a method for reconstructing a three-dimensional human body model provided by an embodiment of the present application.
  • the method includes steps S110 to S130. This method is suitable for situations where the human body needs to be reconstructed in three dimensions. This method can be applied to the depth camera shown in Figure 1.
  • the specific implementation principle of each step is as follows.
  • S110 Collect multiple frames of infrared image sequences including various parts of the human body.
  • the depth camera collects infrared images of the human body from different angles to form an infrared image sequence including various parts of the human body.
  • the image acquisition unit of the depth camera 101 includes a binocular IR camera module, wherein the baseline distance of the left and right IR cameras is 150 mm.
  • the depth camera 101 When taking pictures of the human body, place the depth camera 101 vertically and stick it on a vertical wall, about 0.8m to 1.2m from the ground. The subject keeps standing at a preset posture, distance and position and at a preset angle Rotate, for example, the subject’s hands hang down at a certain angle in A-pose, and rotate at a distance of 1m to 2m from the depth camera.
  • the depth camera uses the image acquisition unit to continuously take pictures to obtain multiple frames (eg 300 frames) Sequence of infrared images from different angles.
  • the image acquisition unit further includes a color camera module (not shown in FIG. 1), such as an RGB camera module.
  • a color camera module such as an RGB camera module.
  • the image acquisition unit further includes a laser projection module (not shown in Figure 1).
  • the laser projection module emits a laser with a wavelength of 825 nm.
  • the image acquisition unit collects infrared speckles. image. Therefore, when the laser projection module is in the working state, it should also detect whether there is an object within the dangerous distance range. Once an object intrusion is detected, the laser projection module should be turned off.
  • the depth camera further includes a distance measurement unit. Specifically, when the image acquisition unit is turned on, the distance measurement unit is simultaneously turned on, and when an object is detected within 1 m, the laser module is turned off.
  • the distance measuring unit may be a distance sensor or a proximity sensor or the like.
  • a photo-diode can be placed near the DOE, for example, it can be placed obliquely 45 degrees above the top angle of the DOE to detect the amount of light (luminous intensity), where the amount of light is equal to that of the PD.
  • the voltage across the PD is proportional. When the voltage across the PD exceeds the threshold, it is judged that the DOE is destroyed, and the laser projection module needs to be turned off at this time.
  • S120 Process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence.
  • the depth camera processes the infrared image sequence to obtain the corresponding parallax image sequence or depth image sequence.
  • the image processing unit of the depth camera processes the infrared image sequence to obtain the corresponding parallax image sequence or depth image sequence.
  • the image processing unit includes a parallax image acquisition unit and a depth image acquisition unit.
  • the parallax image acquisition unit is used to process the above-mentioned infrared image sequence to obtain the parallax image sequence, for example, the deviation of the spatial point in the two infrared images can be calculated according to the stereo matching algorithm to obtain a series of parallax images, or to calculate the reference speckle The deviation between the image and the acquired infrared speckle image to obtain a series of parallax images.
  • the depth image acquiring unit is configured to process the disparity image sequence to acquire the depth image sequence, for example, the disparity image sequence may be further converted into a depth image sequence according to the mapping relationship between the disparity and the depth.
  • the depth camera judges that the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence. If it is determined that the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence, the parallax image sequence or the depth image sequence is sent to the server, and the server performs the reconstruction of the human body 3D model according to the parallax image sequence or the depth image sequence sent by the depth camera.
  • the depth camera includes a position detection unit and a data upload unit.
  • the pose detection unit judges whether the human body is in a preset posture and distance based on the first frame or the first few depth images, so that when the human body is in the preset posture and distance, the data upload unit will
  • the parallax image sequence or the depth image sequence is sent to a server, and the server receives the parallax image sequence or the depth image sent by a depth camera to reconstruct a three-dimensional human body model.
  • the position detection unit mainly detects whether the following two conditions meet the requirements: 1) Whether the human body is in the central area of the depth image and occupies more than 80% of the screen; 2) Whether the human body is hanging down at a certain angle with both hands Standing in a posture.
  • the position detection unit can use the image segmentation algorithm to segment the target area (including the area of the human body) and the background area, and calculate the distance between the center of the target area and the geometric center of the depth image, when the distance is less than the preset value , It is judged that the human body is in the central area of the depth image, and at the same time, the proportion value of the target area in the entire depth image is calculated, and it is judged whether it is greater than 80%; for condition 2), the position detection unit can process the first frame or Perform key point detection (including but not limited to head, waist, hand, elbow, shoulder joint points and soles of feet, etc.) in the first few frames of depth images to extract human bone data and calculate the angle between the arm and the torso. Within a preset range, for example, 15 to 30 degrees, condition 2) is satisfied. When the above two conditions are met, it can be determined that the human body is in the preset posture and distance. At this time, the image acquisition unit can continue to image the human body.
  • key point detection including but not limited
  • the depth camera further includes a reminder unit.
  • the reminder unit sends out an adjustment reminder to adjust the posture and distance of the human body until it continues to collect data.
  • the sequence of parallax images or the sequence of depth images determines that the human body is in a preset posture and distance.
  • the reminder unit when the human body is not in the preset posture and distance, the reminder unit will issue a related reminder according to the current posture and/or distance of the human body, for example, a broadcast: "Please step forward/back/left/right” Or "open your arms and keep your posture", the subjects can perform corresponding operations according to the broadcast content.
  • a broadcast "Please step forward/back/left/right” Or "open your arms and keep your posture"
  • the subjects can perform corresponding operations according to the broadcast content.
  • the reminding unit when the distance between the standing position of the human body and the depth camera (or laser projection module) is within the dangerous distance range, the reminding unit will remind the subject to move backward through a broadcast.
  • the image acquisition unit will continuously collect infrared images of the subject to judge the rationality of the current subject’s standing position.
  • the reminding unit may be a speaker.
  • the parallax image sequence is determined according to the parallax image sequence or depth image sequence when the human body is in a preset posture and distance.
  • the depth image sequence is sent to the server for human body 3D model reconstruction.
  • the depth camera converts the infrared image sequence into a parallax image sequence or a depth image sequence, so that the server does not need to directly process the infrared image sequence, reducing the amount of data calculation on the server. The system resource occupation is reduced, and the efficiency of human body 3D model reconstruction is greatly improved.
  • the depth camera sends the parallax image sequence or depth image sequence that determines that the human body is in the preset posture and distance to the server for human body 3D model reconstruction.
  • the accuracy and completeness of data collection are improved, and the accuracy of the reconstruction of the three-dimensional model of the human body is further improved.
  • the method further includes: performing multi-distance calibration on the depth data in the depth image sequence.
  • the depth camera further includes a multi-distance calibration unit, which is used to perform multi-distance calibration on the depth data in the depth image sequence to reduce the systematic error of the measurement.
  • the depth camera in addition to the multi-distance calibration unit, also includes a valid frame detection unit.
  • the valid frame detection unit is used to screen the calibrated depth image sequence to remove redundant frames and further reduce The amount of data for subsequent 3D reconstruction.
  • the method further includes: masking the selected depth image sequence Process to obtain a sequence of deep human mask images.
  • the depth camera further includes a depth human body mask image acquisition unit for performing mask processing on the above-mentioned filtered depth image sequence to acquire a depth human body mask image sequence.
  • a pre-made sensory image sequence may be used.
  • the interest region mask is multiplied by the above-mentioned depth image sequence to remove the background area to obtain the depth human body mask image sequence.
  • the method further includes: calculating the depth human body mask image sequence to obtain the parallax human body mask image sequence.
  • the above-mentioned depth camera further includes a parallax human body mask image acquisition unit, configured to calculate the above-mentioned depth human body mask image to obtain a parallax human body mask image sequence.
  • a parallax human body mask image acquisition unit configured to calculate the above-mentioned depth human body mask image to obtain a parallax human body mask image sequence.
  • the method further includes: calculating the screened depth image sequence to obtain skeleton information of the human body.
  • the depth camera further includes a skeleton acquisition unit for calculating the above-mentioned filtered depth image sequence to obtain human body skeleton information, which is mainly used for subsequent three-dimensional reconstruction.
  • the depth camera may upload the acquired data to the server through compression coding.
  • the depth camera also includes a compression encoding unit for compressing and encoding the depth human mask image sequence, the second parameters (including the internal parameters of the depth camera), and the human skeleton information, and compressing them to 10% of the original data size.
  • % Is then uploaded to the server through the data upload unit, which shortens the data transmission time, thereby further improving the efficiency of the three-dimensional human body model. It should be noted that uploading the depth image data does not need to upload multiple parameters at the same time.
  • the system design is relatively simple, but the dynamic range is relatively large, and the adjacent pixels change greatly at a long distance, which is not conducive to compression coding.
  • the compression encoding unit compresses and encodes the parallax human body mask image sequence, the first parameters (including the internal parameters of the depth camera, the parallax conversion depth parameter, and the multi-distance calibration parameter), and the human skeleton information, and It is compressed to 10% of the original data size and then uploaded to the server through the data upload unit.
  • the dynamic range of parallax image data is relatively small (each pixel can be expressed in 12bit or less), the change between neighboring pixels is small, and lower bit rates can be obtained, but multi-distance calibration needs to be uploaded Parameters and other additional parameters.
  • the server When the depth camera uploads the acquired data to the server through compression and encoding, the server first decodes and decompresses the received compressed data to obtain the parallax human body mask image sequence, the first parameter, and the human body skeleton information; or the deep human body
  • the mask image sequence, the second parameter, and the human body skeleton information are used to perform three-dimensional reconstruction of the human body through the data obtained after the above-mentioned decoding and decompression.
  • the server includes a decoding and decompression unit and a three-dimensional reconstruction unit.
  • the server receives the compressed data
  • the received compressed and encoded data is decoded and decompressed by the decoding and decompression unit to obtain the parallax human body mask image sequence, the first parameters, and the human body skeleton information; or the depth human body mask image sequence,
  • the second parameter and the human skeleton information, and the data obtained after the above-mentioned decoding and decompression are subjected to the three-dimensional reconstruction of the human body by the three-dimensional reconstruction unit.
  • the server when the server receives a parallax human mask image sequence, it also needs to convert the parallax human mask image sequence into a depth human mask image sequence according to the parallax conversion depth parameter, and then according to the depth camera internal parameters and multi-distance
  • the calibration parameters correct the depth data in the depth human body mask image sequence to reduce the system measurement error.
  • the server further includes a data measuring unit, which is used to measure the dimensions of the required body part and push the measurement result to the client.
  • the measurement locations include, but are not limited to: chest circumference, waist circumference, hip circumference, upper arm circumference, lower arm circumference, thigh circumference, calf circumference, and the like.
  • FIG. 4 shows a structural block diagram of a device for reconstructing a three-dimensional human body model provided by an embodiment of the present application, and the device for reconstructing a three-dimensional human body model is configured in a depth camera.
  • the device for reconstructing a three-dimensional human body model is configured in a depth camera.
  • the device includes:
  • the image acquisition unit 41 is used to acquire multiple frames of infrared image sequences including various parts of the human body;
  • the image processing unit 42 is configured to process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
  • the data uploading unit 43 is configured to send the parallax image sequence or the depth image sequence to the server if it is determined that the human body is at a preset posture and distance according to the parallax image sequence or the depth image sequence.
  • the image sequence or the depth image sequence is used to instruct the server to reconstruct a three-dimensional model of a real human body.
  • the device further includes:
  • the position detection unit 44 is configured to determine whether the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence;
  • the reminding unit 45 is configured to send an adjustment reminder if it is determined according to the parallax image sequence or the depth image sequence that the human body is not in the preset posture and distance, until according to the continuously collected parallax image sequence or the depth The image sequence determines that the human body is in a preset posture and distance.
  • the device further includes:
  • the multi-distance calibration unit 46 is configured to perform multi-distance calibration on the depth data in the depth image sequence.
  • the device further includes:
  • the multi-distance calibration unit 46 is configured to perform multi-distance calibration on the depth data in the depth image sequence
  • the effective frame detection unit 47 is configured to screen the calibrated depth image sequence to obtain the screened depth image sequence.
  • the device further includes:
  • the skeleton obtaining unit 48 is configured to calculate the depth image sequence after screening to obtain skeleton information of the human body.
  • the depth human body mask image sequence acquiring unit 49 is configured to perform mask processing on the screened depth image sequence to acquire the depth human body mask image sequence.
  • the parallax human body mask image sequence acquiring unit 50 is configured to calculate the depth human body mask image sequence to obtain the parallax human body mask image sequence.
  • the compression encoding unit 51 is used for compressing and encoding the deep human body mask image sequence, the second parameter, and the human body skeleton information; or for compressing and encoding the parallax human body mask image sequence, the first parameter, and the human body skeleton information, Get compressed coded data.
  • the data uploading unit 43 is configured to upload the compressed coded data to the server, and the compressed coded data is used to instruct the server to reconstruct the three-dimensional model of the human body.
  • the embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps in each of the foregoing method embodiments can be realized.
  • the embodiments of the present application provide a computer program product.
  • the computer program product runs on an electronic device, the electronic device can realize the steps in the foregoing method embodiments when the electronic device is executed.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium.
  • the computer program can be stored in a computer-readable storage medium. When executed by the processor, the steps of the foregoing method embodiments can be implemented.
  • the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms.
  • the computer-readable medium may at least include: any entity or device capable of carrying computer program code to the photographing device/terminal device, recording medium, computer memory, read-only memory (ROM), random access memory (Random Access Memory, RAM), electric carrier signal, telecommunications signal, and software distribution medium.
  • ROM read-only memory
  • RAM random access memory
  • electric carrier signal telecommunications signal
  • software distribution medium Such as U disk, mobile hard disk, floppy disk or CD-ROM, etc.
  • computer-readable media cannot be electrical carrier signals and telecommunication signals.
  • the disclosed apparatus/network equipment and method may be implemented in other ways.
  • the device/network device embodiments described above are only illustrative.
  • the division of the modules or units is only a logical function division, and there may be other divisions in actual implementation, such as multiple units.
  • components can be combined or integrated into another system, or some features can be omitted or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Optics & Photonics (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Processing (AREA)

Abstract

The present application is applicable to the technical field of computer vision. Provided is a method for reconstructing a three-dimensional model of a human body, which method is applied to a depth camera. The method comprises: collecting a sequence of multiple frames of infrared images which comprise various parts of a human body; processing the sequence of infrared images to acquire a corresponding parallax image sequence or a depth image sequence; compressing and encoding the parallax image sequence and a first parameter, or compressing and encoding the depth image sequence and a second parameter; and uploading compressed data which has been compressed and encoded to a server, wherein the compressed data is used for instructing the server to decompress and decode the received compressed data, and then to reconstruct a real three-dimensional model of a real human body. By means of the present application, the efficient reconstruction of a three-dimensional model of a human body is realized.

Description

重建人体三维模型的方法、装置、系统和可读存储介质Method, device, system and readable storage medium for reconstructing three-dimensional model of human body 技术领域Technical field
本申请涉及计算机视觉技术领域,尤其涉及一种重建人体三维模型的方法、装置、系统和可读存储介质。This application relates to the field of computer vision technology, and in particular to a method, device, system and readable storage medium for reconstructing a three-dimensional human body model.
背景技术Background technique
三维重建是计算机视觉发展的未来核心基础技术,目前在开发应用的是针对诸如人体这种有特定形貌和特征的群体,在影视娱乐和生活方面的应用。Three-dimensional reconstruction is the future core basic technology for the development of computer vision. The current development and application is aimed at groups with specific shapes and characteristics, such as the human body, in film and television entertainment and life applications.
现有的人体三维重建技术大多基于待测人体的大量深度数据进行,因而人体三维模型重建效率不高。Most of the existing human body 3D reconstruction technologies are based on a large amount of depth data of the human body to be measured, so the reconstruction efficiency of the human body 3D model is not high.
发明内容Summary of the invention
本申请实施例提供了一种重建人体三维模型的方法、装置、系统和可读存储介质,提供了一种高效的重建人体三维模型的方案。The embodiments of the present application provide a method, device, system, and readable storage medium for reconstructing a three-dimensional human body model, and provide an efficient solution for reconstructing a three-dimensional human body model.
第一方面,本申请实施例提供了一种重建人体三维模型的方法,包括:In the first aspect, an embodiment of the present application provides a method for reconstructing a three-dimensional human body model, including:
采集多帧包括人体各个局部的红外图像序列;Collect multiple frames of infrared image sequences including various parts of the human body;
对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列;Processing the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码;Performing compression coding on the disparity image sequence and the first parameter, or performing compression coding on the depth image sequence and the second parameter;
将进行压缩编码后的压缩数据上传至服务器,所述压缩数据用于指示所述服务器对接收到的所述压缩数据进行解压解码后重建真实人体三维模型。Upload the compressed data that has been compressed and encoded to a server, where the compressed data is used to instruct the server to decompress and decode the received compressed data to reconstruct a real three-dimensional human body model.
通过先对红外图像序列进行处理,获取相应的视差图像序列或深度图像序列,将视差图像序列或深度图像序列压缩编码后上传至服务器进行人体三维模型重建,一方面,深度相机将红外图像序列转换成视差图像序列或深度图像序 列,使得服务器不需要直接处理红外图像序列,减少了服务器的数据计算量,减少了系统资源占用,大大提高了人体三维模型重建的效率;另一方面,深度相机将通过压缩编码后的压缩数据上传至服务器,提高了数据传输效率,进一步提高了人体三维模型重建的效率。By first processing the infrared image sequence, the corresponding parallax image sequence or depth image sequence is obtained, and the parallax image sequence or depth image sequence is compressed and encoded and then uploaded to the server to reconstruct the human body 3D model. On the one hand, the depth camera converts the infrared image sequence Disparity image sequence or depth image sequence, so that the server does not need to directly process the infrared image sequence, reduces the amount of data calculation of the server, reduces the system resource occupation, and greatly improves the efficiency of human body 3D model reconstruction; on the other hand, the depth camera will By uploading the compressed data after compression and encoding to the server, the data transmission efficiency is improved, and the efficiency of the reconstruction of the three-dimensional human body model is further improved.
第二方面,本申请实施例提供了一种重建人体三维模型的装置,包括:In the second aspect, an embodiment of the present application provides an apparatus for reconstructing a three-dimensional human body model, including:
图像采集单元,用于采集多帧包括人体各个局部的红外图像序列;The image acquisition unit is used to acquire multiple frames of infrared image sequences including various parts of the human body;
图像处理单元,用于对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列;An image processing unit, configured to process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
压缩编码单元,用于对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码;A compression coding unit, configured to perform compression coding on the disparity image sequence and the first parameter, or perform compression coding on the depth image sequence and the second parameter;
数据上传单元,用于将进行压缩编码后的压缩数据上传至服务器,所述压缩数据用于指示所述服务器对接收到的所述压缩数据进行解压解码后重建真实人体三维模型。The data uploading unit is configured to upload compressed data that has been compressed and encoded to a server, and the compressed data is used to instruct the server to decompress and decode the received compressed data to reconstruct a real three-dimensional human body model.
第三方面,本申请实施例提供了一种深度相机,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如第一方面所述的方法。In the third aspect, an embodiment of the present application provides a depth camera, including a memory, a processor, and a computer program stored in the memory and running on the processor. When the processor executes the computer program, Implement the method as described in the first aspect.
第四方面,本申请实施例提供了一种深度相机,包括如第二方面所述的装置。In a fourth aspect, an embodiment of the present application provides a depth camera, including the device as described in the second aspect.
第五方面,本申请实施例提供了一种重建人体三维模型的系统,包括服务器,和如第三方面或第四方面所述的深度相机,所述服务器用于对接收到的所述压缩数据进行解压解码后重建真实人体三维模型。In a fifth aspect, an embodiment of the present application provides a system for reconstructing a three-dimensional human body model, including a server, and the depth camera according to the third aspect or the fourth aspect, and the server is configured to compare the received compressed data After decompression and decoding, a real three-dimensional model of the human body is reconstructed.
第六方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如第一方面所述的方法。In a sixth aspect, an embodiment of the present application provides a computer-readable storage medium that stores a computer program that implements the method described in the first aspect when the computer program is executed by a processor.
第七方面,本申请实施例提供了一种计算机程序产品,当计算机程序产品在电子设备上运行时,使得电子设备执行上述第一方面所述的方法。In a seventh aspect, embodiments of the present application provide a computer program product, which when the computer program product runs on an electronic device, causes the electronic device to execute the method described in the first aspect.
可以理解的是,上述第二方面至第七方面的有益效果可以参见上述第一方面中的相关描述,在此不再赘述。It can be understood that the beneficial effects of the above second aspect to the seventh aspect can be referred to the related description in the above first aspect, which will not be repeated here.
附图说明Description of the drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only of the present application. For some embodiments, those of ordinary skill in the art can obtain other drawings based on these drawings without creative labor.
图1是本申请一实施例提供的重建人体三维模型的系统的示意图;Fig. 1 is a schematic diagram of a system for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图2是本申请一实施例提供的重建人体三维模型的系统的配网示意图;2 is a schematic diagram of a distribution network of a system for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图3是本申请一实施例提供的重建人体三维模型的装置的结构示意图;3 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图4是本申请一实施例提供的重建人体三维模型的装置的结构示意图;4 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图5是本申请一实施例提供的重建人体三维模型的装置的结构示意图;Fig. 5 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图6是本申请一实施例提供的重建人体三维模型的装置的结构示意图;Fig. 6 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图7是本申请一实施例提供的重建人体三维模型的装置的结构示意图;FIG. 7 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application;
图8是本申请一实施例提供的重建人体三维模型的装置的结构示意图。FIG. 8 is a schematic structural diagram of an apparatus for reconstructing a three-dimensional human body model provided by an embodiment of the present application.
具体实施方式Detailed ways
为了说明本申请所述的技术方案,下面将参考附图并结合实施例来进行说明。In order to illustrate the technical solution described in the present application, the following description will be made with reference to the drawings and in conjunction with the embodiments.
为了使本技术领域的人员更好地理解本申请方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚,完整地描述,显然,所描述的实施例仅仅是本申请一部分的实施例,而不是全部的实施例。基于本申请中的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,所获得的所有其他实施例,都应当属于本申请保护的范围。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。In order to enable those skilled in the art to better understand the solution of the application, the technical solutions in the embodiments of the application will be clearly and completely described below in conjunction with the drawings in the embodiments of the application. Obviously, the described embodiments are only It is a part of the embodiments of this application, not all the embodiments. Based on the embodiments in this application, for those of ordinary skill in the art, all other embodiments obtained without creative labor should fall within the protection scope of this application. It should be noted that the embodiments in this application and the features in the embodiments can be combined with each other if there is no conflict.
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本申请实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本申请。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本申请的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as a specific system structure and technology are proposed for a thorough understanding of the embodiments of the present application. However, it should be clear to those skilled in the art that the present application can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted to avoid unnecessary details from obstructing the description of this application.
需要说明的是,当元件被称为“固定于”或“设置于”另一个元件,它可以直接在另一个元件上或者间接在该另一个元件上。当一个元件被称为是“连接于”另一个元件,它可以是有线连接到另一个元件或无线连接至该另一个元件上,连接用于数据传输作用。It should be noted that when an element is referred to as being "fixed to" or "disposed on" another element, it can be directly on the other element or indirectly on the other element. When an element is said to be "connected" to another element, it can be wired to the other element or wirelessly connected to the other element, and the connection is used for data transmission.
此外,在本申请的说明书、权利要求书及附图中的术语中涉及“第一”或“第二”等的描述仅用于区别类似的对象,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量,也就是说,这些描述不必用于描述特定的顺序或先后次序。此外,应该理解这些描述在适当情况下可以互换,以便描述本申请的实施例。In addition, the descriptions involving "first" or "second" in the terms in the specification, claims, and drawings of this application are only used to distinguish similar objects, and cannot be understood as indicating or implying their relative importance. Or implicitly specify the number of the indicated technical features, that is, these descriptions do not have to be used to describe a specific order or sequence. In addition, it should be understood that these descriptions can be interchanged under appropriate circumstances in order to describe the embodiments of the present application.
请参考图1,图1示出了本申请提供的一种重建人体三维模型的系统,包括两两通信连接的深度相机101,客户端(图1所示为手机)102、及服务器103。系统的测量原理为:客户端102向深度相机101发起测量指令,深度相机101接收该测量指令之后,对人体进行拍照以采集多帧包括人体各个局部的视差图像序列或深度图像序列并将其上传给服务器103,服务器103根据接收的视差图像序列或深度图像序列进行真实的三维(3D)人体重建,并在3D模型上选取关键部位进行测量,从而得到被测人体的对应数据,测量结束后,将最终的三维数据传送给发起测量指令的客户端102。Please refer to FIG. 1, which shows a system for reconstructing a three-dimensional human body model provided by the present application, including a depth camera 101 connected in two-to-two communication, a client (mobile phone shown in FIG. 1) 102, and a server 103. The measurement principle of the system is: the client 102 initiates a measurement instruction to the depth camera 101, and after receiving the measurement instruction, the depth camera 101 takes a picture of the human body to collect multiple frames of parallax image sequences or depth image sequences including various parts of the human body and upload them To the server 103, the server 103 performs real three-dimensional (3D) reconstruction of the human body according to the received parallax image sequence or depth image sequence, and selects key parts on the 3D model for measurement, so as to obtain the corresponding data of the measured human body. After the measurement is completed, The final three-dimensional data is transmitted to the client 102 that initiated the measurement instruction.
需要说明的是,深度相机101、客户端102、和服务器103两两之间都要通过网络进行数据传输,因此客户端102在发送测量指令给深度相机101之前还需要对三者进行配网。请参考图2,图2为本申请其中一实施例中配网流程的实现图。具体流程如下:由客户端102启动配网,寻找可连接的蓝牙设备,当 找到深度相机101之后与其进行蓝牙连接,连接成功之后,深度相机101会扫描附近可用的WiFi二维码并将生成的WiFi列表通过蓝牙单元(未示出)传送给客户端102,客户端102选择某个WiFi并输入WiFi密码,若连接成功,则完成配网,此时深度相机可通过其WiFi单元(未示出)访问服务器103。需要说明的是,在图2中以WiFi配网为例进行说明,基础仅为示例性描述,不能解释为对本申请的具体限定。It should be noted that the depth camera 101, the client 102, and the server 103 must transmit data through the network in each pair. Therefore, the client 102 needs to configure the network for the three before sending the measurement instruction to the depth camera 101. Please refer to FIG. 2, which is an implementation diagram of the distribution network process in one of the embodiments of this application. The specific process is as follows: the client 102 starts the network configuration and searches for connectable Bluetooth devices. When it finds the Deepin Camera 101, it connects to it with Bluetooth. After the connection is successful, the Deepin Camera 101 will scan the nearby WiFi QR code and generate it. The WiFi list is transmitted to the client 102 through a Bluetooth unit (not shown). The client 102 selects a WiFi and enters the WiFi password. If the connection is successful, the network configuration is completed. At this time, the Deepin Camera can use its WiFi unit (not shown) ) Access the server 103. It should be noted that, in FIG. 2, the WiFi distribution network is taken as an example for description, and the basis is only an exemplary description, and cannot be construed as a specific limitation to the present application.
在本申请一些实施例中,深度相机可以为基于结构光、双目、时间飞行算法(time of flight,TOF)技术的深度相机。此外,深度相机还可以为包括彩色相机模组的深度相机,例如包括RGB相机模组的深度相机,如此,既可以获取包含深度信息的深度图像,又可获取包含丰富纹理信息的彩色图像。In some embodiments of the present application, the depth camera may be a depth camera based on structured light, binocular, and time of flight (TOF) technology. In addition, the depth camera may also be a depth camera including a color camera module, such as a depth camera including an RGB camera module. In this way, both depth images containing depth information and color images containing rich texture information can be obtained.
在本申请一些实施例中,客户端还可以为平板电脑、可穿戴设备、车载设备、增强现实(augmented reality,AR)/虚拟现实(virtual reality,VR)设备、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本、个人数字助理(personal digital assistant,PDA)等终端设备,本申请实施例对电子设备的具体类型不作任何限制。In some embodiments of the present application, the client may also be a tablet computer, a wearable device, a vehicle-mounted device, an augmented reality (AR)/virtual reality (VR) device, a notebook computer, a super mobile personal computer ( For terminal devices such as ultra-mobile personal computer (UMPC), netbooks, and personal digital assistant (personal digital assistant, PDA), the embodiments of this application do not impose any restrictions on the specific types of electronic devices.
在本申请一些实施例中,服务器还可以为:独立的服务器、服务器集群或分布式服务器等,本申请实施例对服务器的具体类型不作任何限制。In some embodiments of the present application, the server may also be an independent server, a server cluster, or a distributed server, etc. The embodiments of the present application do not impose any restriction on the specific type of the server.
可以理解地,本领域技术人员可以根据实际需要进行部署,本申请实施例中的图示和与图示对应的阐述不构成对其具体部署形式的限定。It is understandable that those skilled in the art can deploy according to actual needs, and the illustrations in the embodiments of the present application and the explanations corresponding to the illustrations do not constitute a limitation on the specific deployment form thereof.
图3示出了本申请一实施例提供的一种重建人体三维模型的方法的实现流程图,该方法包括步骤S110至S130。该方法适用于需要对人体进行三维重建的情形。该方法可以应用于图1所示的深度相机。各步骤的具体实现原理如下。FIG. 3 shows an implementation flowchart of a method for reconstructing a three-dimensional human body model provided by an embodiment of the present application. The method includes steps S110 to S130. This method is suitable for situations where the human body needs to be reconstructed in three dimensions. This method can be applied to the depth camera shown in Figure 1. The specific implementation principle of each step is as follows.
S110,采集多帧包括人体各个局部的红外图像序列。S110: Collect multiple frames of infrared image sequences including various parts of the human body.
深度相机采集人体不同角度的红外图像,从而组成包括人体各个局部的红外图像序列。The depth camera collects infrared images of the human body from different angles to form an infrared image sequence including various parts of the human body.
继续参见图1所示,在一个实施例中,深度相机101的图像采集单元包括 双目IR相机模组,其中左右两个IR相机的基线距离为150mm。在对人体进行拍照时,将深度相机101纵向摆放,贴于垂直的墙面上,距离地面约0.8m至1.2m,受测者保持以预设姿势及距离及位置站立并以预设角度进行自转,例如受测者双手呈一定角度向下垂呈A-pose在距离深度相机1m至2m处自转,自转期间,深度相机通过图像采集单元不断对其进行拍照以获取多帧(例如300帧)不同角度的红外图像序列。Continuing to refer to FIG. 1, in one embodiment, the image acquisition unit of the depth camera 101 includes a binocular IR camera module, wherein the baseline distance of the left and right IR cameras is 150 mm. When taking pictures of the human body, place the depth camera 101 vertically and stick it on a vertical wall, about 0.8m to 1.2m from the ground. The subject keeps standing at a preset posture, distance and position and at a preset angle Rotate, for example, the subject’s hands hang down at a certain angle in A-pose, and rotate at a distance of 1m to 2m from the depth camera. During the rotation, the depth camera uses the image acquisition unit to continuously take pictures to obtain multiple frames (eg 300 frames) Sequence of infrared images from different angles.
在一个实施例中,图像采集单元还包括彩色相机模组(图1未示出),例如RGB相机模组,如此,既可以获取包含深度信息的深度图像,又可获取包含丰富纹理信息的彩色图像。可以理解的是,为了采集比较精确的包含人体各个部位的深度图像,上述图像采集过程优选为在室内进行,以避免环境光及红外强光的干扰,受测者最好身穿紧身衣、或者赤裸身体,站在预设位置按照规定动作完成测量过程。In one embodiment, the image acquisition unit further includes a color camera module (not shown in FIG. 1), such as an RGB camera module. In this way, both depth images containing depth information and color images containing rich texture information can be obtained. image. It is understandable that, in order to collect more accurate depth images containing various parts of the human body, the above-mentioned image collection process is preferably carried out indoors to avoid interference from ambient light and strong infrared light. It is best for the subject to wear tights, or Naked body, stand in the preset position and follow the prescribed actions to complete the measurement process.
在一个实施例中,图像采集单元中还包括激光投影模组(图1未示出),举例说明,激光投影模组发射的激光波长为825nm,此时图像采集单元采集得到的就是红外散斑图像。因此,当激光投影模组处于工作状态时,还应该在危险距离范围内探测是否有物体,一旦检测到有物体入侵,则应该关闭激光投影模组。在一个实施例中,深度相机还包括距离测量单元,具体地,在打开图像采集单元的同时,同步打开距离测量单元,当其在1m范围内检测到了物体,则关闭激光模组。在一个实施例中,距离测量单元可以为距离传感器或接近度传感器等。In one embodiment, the image acquisition unit further includes a laser projection module (not shown in Figure 1). For example, the laser projection module emits a laser with a wavelength of 825 nm. At this time, the image acquisition unit collects infrared speckles. image. Therefore, when the laser projection module is in the working state, it should also detect whether there is an object within the dangerous distance range. Once an object intrusion is detected, the laser projection module should be turned off. In one embodiment, the depth camera further includes a distance measurement unit. Specifically, when the image acquisition unit is turned on, the distance measurement unit is simultaneously turned on, and when an object is detected within 1 m, the laser module is turned off. In an embodiment, the distance measuring unit may be a distance sensor or a proximity sensor or the like.
可以理解的是,不仅是当人体与激光投影模组相距较近时,会对人体造成伤害,当激光投影模组发射的激光功率较大时,也有可能对人体产生伤害。其中,造成激光投影模组功率较大的原因有可能是由于它的衍射光学元件(Diffractive Optical Elements,DOE)被破坏,未对零级光束进行有效衍射,因此在对人体进行图像采集前,还应该对DOE的完整性进行检测。在一个实施例中,可将光电二极管(Photo-Diode,PD)设置在DOE附近,例如可倾斜置 于DOE顶角上方45度处,以检测发光量(发光强度),其中发光量与PD两端的电压成正比,当PD两端电压超过阈值时,则判断DOE被破坏,此时需要关闭激光投影模组。It is understandable that not only when the human body is close to the laser projection module, it will cause harm to the human body, but also when the laser power emitted by the laser projection module is high, it may also cause harm to the human body. Among them, the reason for the high power of the laser projection module may be that its diffractive optical elements (DOE) are damaged and the zero-order beam is not effectively diffracted. Therefore, before the image collection of the human body, the The integrity of the DOE should be tested. In one embodiment, a photo-diode (PD) can be placed near the DOE, for example, it can be placed obliquely 45 degrees above the top angle of the DOE to detect the amount of light (luminous intensity), where the amount of light is equal to that of the PD. The voltage across the PD is proportional. When the voltage across the PD exceeds the threshold, it is judged that the DOE is destroyed, and the laser projection module needs to be turned off at this time.
S120,对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列。S120: Process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence.
深度相机对红外图像序列进行处理,获取相应的视差图像序列或深度图像序列。The depth camera processes the infrared image sequence to obtain the corresponding parallax image sequence or depth image sequence.
具体地,深度相机的图像处理单元对红外图像序列进行处理,获取相应的视差图像序列或深度图像序列。Specifically, the image processing unit of the depth camera processes the infrared image sequence to obtain the corresponding parallax image sequence or depth image sequence.
在一个实施例中,图像处理单元包括视差图像获取单元和深度图像获取单元。In one embodiment, the image processing unit includes a parallax image acquisition unit and a depth image acquisition unit.
视差图像获取单元用于对所上述红外图像序列进行处理以获取视差图像序列,例如可根据立体匹配算法计算空间点在两幅红外图像中的偏差以得到一系列的视差图像,或者计算参考散斑图像与采集得到的红外散斑图像的偏差以得到一系列的视差图像。The parallax image acquisition unit is used to process the above-mentioned infrared image sequence to obtain the parallax image sequence, for example, the deviation of the spatial point in the two infrared images can be calculated according to the stereo matching algorithm to obtain a series of parallax images, or to calculate the reference speckle The deviation between the image and the acquired infrared speckle image to obtain a series of parallax images.
深度图像获取单元,用于对视差图像序列进行处理以获取深度图像序列,例如可根据视差与深度之间的映射关系将视差图像序列进一步转换成深度图像序列。The depth image acquiring unit is configured to process the disparity image sequence to acquire the depth image sequence, for example, the disparity image sequence may be further converted into a depth image sequence according to the mapping relationship between the disparity and the depth.
S130,若根据所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离,则发送所述视差图像序列或所述深度图像序列至服务器,所述视差图像序列或所述深度图像序列用于指示所述服务器重建真实人体三维模型。S130: If it is determined that the human body is at a preset posture and distance according to the parallax image sequence or the depth image sequence, send the parallax image sequence or the depth image sequence to a server, and the parallax image sequence or the depth image sequence The depth image sequence is used to instruct the server to reconstruct a three-dimensional model of a real human body.
深度相机根据视差图像序列或深度图像序列判断人体处于预设姿势及距离。若根据视差图像序列或深度图像序列确定人体处于预设姿势及距离,则发送视差图像序列或深度图像序列至服务器,服务器根据深度相机发送的视差图像序列或深度图像序列,进行人体三维模型重建。The depth camera judges that the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence. If it is determined that the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence, the parallax image sequence or the depth image sequence is sent to the server, and the server performs the reconstruction of the human body 3D model according to the parallax image sequence or the depth image sequence sent by the depth camera.
深度相机包括位置检测单元和数据上传单元,位姿检测单元根据首帧或前 几帧深度图像判断人体是否处于预设姿势及距离,以当人体处于预设姿势及距离时,数据上传单元将所述视差图像序列或所述深度图像序列发送给服务器,服务器接收深度相机发送的所述视差图像序列或所述深度图像进行人体三维模型重建。The depth camera includes a position detection unit and a data upload unit. The pose detection unit judges whether the human body is in a preset posture and distance based on the first frame or the first few depth images, so that when the human body is in the preset posture and distance, the data upload unit will The parallax image sequence or the depth image sequence is sent to a server, and the server receives the parallax image sequence or the depth image sent by a depth camera to reconstruct a three-dimensional human body model.
在一个实例中,位置检测单元主要检测以下两个条件是否满足要求:1)人体是否在深度图像中处于中心区域,且占据画面80%以上;2)人体是否以两手成一定角度向下下垂的姿态站立。对于条件1),位置检测单元可利用图像分割算法将目标区域(包括人体的区域)与背景区域进行分割,并计算出目标区域的中心与深度图像几何中心的距离,当距离小于预设值时,则判断人体处于深度图像的中心区域,与此同时计算出目标区域占整个深度图像的比例值,并判断其是否大于80%;对于条件2),位置检测单元可对处理得到的首帧或前几帧深度图像进行关键点检测(包括但不限于头、腰、手、肘、肩关节点和双脚底等)以提取人体骨骼数据并计算手臂与躯干的夹角,当上述夹角大小在预设范围之内,例如15至30度时,条件2)满足。当上述2个条件都满足时,才可确定人体处于预设姿势及距离,此时图像采集单元还可以继续对人体进行图像采集。In an example, the position detection unit mainly detects whether the following two conditions meet the requirements: 1) Whether the human body is in the central area of the depth image and occupies more than 80% of the screen; 2) Whether the human body is hanging down at a certain angle with both hands Standing in a posture. For condition 1), the position detection unit can use the image segmentation algorithm to segment the target area (including the area of the human body) and the background area, and calculate the distance between the center of the target area and the geometric center of the depth image, when the distance is less than the preset value , It is judged that the human body is in the central area of the depth image, and at the same time, the proportion value of the target area in the entire depth image is calculated, and it is judged whether it is greater than 80%; for condition 2), the position detection unit can process the first frame or Perform key point detection (including but not limited to head, waist, hand, elbow, shoulder joint points and soles of feet, etc.) in the first few frames of depth images to extract human bone data and calculate the angle between the arm and the torso. Within a preset range, for example, 15 to 30 degrees, condition 2) is satisfied. When the above two conditions are met, it can be determined that the human body is in the preset posture and distance. At this time, the image acquisition unit can continue to image the human body.
在其他一些实施例中,深度相机还包括提醒单元,当位姿检测单元确定人体未处于预设姿势及距离时,提醒单元发出调整提醒以对人体的姿势及距离进行调整,直至根据继续采集的所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离。In some other embodiments, the depth camera further includes a reminder unit. When the posture detection unit determines that the human body is not in the preset posture and distance, the reminder unit sends out an adjustment reminder to adjust the posture and distance of the human body until it continues to collect data. The sequence of parallax images or the sequence of depth images determines that the human body is in a preset posture and distance.
在一个实施例中,当人体未处于预设姿势及距离时,提醒单元会根据当前人体的姿态和/或距离发出相关的提醒,例如发出广播:“请向前/后/左/右一步”或“张开双臂保持姿势”,受测者即可根据广播内容进行对应的操作。例如,当人体站立位置与深度相机(或激光投影模组)的距离在危险距离范围内时,提醒单元会通过广播提醒受测者向后移动。在此过程中,图像采集单元会不断采集受测者的红外图像,以对当前受测者所站立位置的合理性进行判断。具体 地,从而基于继续采集的视差图像序列或深度图像序列,在一个实施例中,提醒单元可以是扬声器。In one embodiment, when the human body is not in the preset posture and distance, the reminder unit will issue a related reminder according to the current posture and/or distance of the human body, for example, a broadcast: "Please step forward/back/left/right" Or "open your arms and keep your posture", the subjects can perform corresponding operations according to the broadcast content. For example, when the distance between the standing position of the human body and the depth camera (or laser projection module) is within the dangerous distance range, the reminding unit will remind the subject to move backward through a broadcast. In this process, the image acquisition unit will continuously collect infrared images of the subject to judge the rationality of the current subject’s standing position. Specifically, based on the continuously collected parallax image sequence or depth image sequence, in one embodiment, the reminding unit may be a speaker.
在本申请实施例中,通过先对红外图像序列进行处理,获取相应的视差图像序列或深度图像序列,再根据视差图像序列或深度图像序列确定人体处于预设姿态和距离时,将视差图像序列或深度图像序列发送给服务器进行人体三维模型重建,一方面,深度相机将红外图像序列转换成视差图像序列或深度图像序列,使得服务器不需要直接处理红外图像序列,减少了服务器的数据计算量,减少了系统资源占用,大大提高了人体三维模型重建的效率;另一方面,深度相机将确定了人体处于预设姿态和距离的视差图像序列或深度图像序列发送给服务器进行人体三维模型重建,由于提高了数据采集的准确性和完整性,进一步提高了人体三维模型重建的精度。In the embodiment of this application, by first processing the infrared image sequence, the corresponding parallax image sequence or depth image sequence is obtained, and then the parallax image sequence is determined according to the parallax image sequence or depth image sequence when the human body is in a preset posture and distance. Or the depth image sequence is sent to the server for human body 3D model reconstruction. On the one hand, the depth camera converts the infrared image sequence into a parallax image sequence or a depth image sequence, so that the server does not need to directly process the infrared image sequence, reducing the amount of data calculation on the server. The system resource occupation is reduced, and the efficiency of human body 3D model reconstruction is greatly improved. On the other hand, the depth camera sends the parallax image sequence or depth image sequence that determines that the human body is in the preset posture and distance to the server for human body 3D model reconstruction. The accuracy and completeness of data collection are improved, and the accuracy of the reconstruction of the three-dimensional model of the human body is further improved.
可以理解的是,在深度相机元器件制造以及组装过程中不可避免地会产生误差,这也会给测量的深度值带来系统性的误差,因此在上述图2所示方法实施例的基础上,在发送视差图像序列或深度图像序列给服务器之前,还包括:对所述深度图像序列中的深度数据进行多距离标定。It is understandable that errors will inevitably occur during the manufacturing and assembly of depth camera components, which will also bring systematic errors to the measured depth value. Therefore, based on the method embodiment shown in FIG. 2 above Before sending the disparity image sequence or the depth image sequence to the server, the method further includes: performing multi-distance calibration on the depth data in the depth image sequence.
在一个实施例中,深度相机还包括多距离标定单元,用于对深度图像序列中的深度数据进行多距离标定,以减小测量的系统误差。In an embodiment, the depth camera further includes a multi-distance calibration unit, which is used to perform multi-distance calibration on the depth data in the depth image sequence to reduce the systematic error of the measurement.
需要说明的是,并不是所有采集的图像帧都适合三维人体重建,因此,在图2所示方法实施例的基础上,在发送视差图像序列或深度图像序列给服务器之前,还包括:对所述深度图像序列中的深度数据进行多距离标定;对经过标定后的所述深度图像序列进行筛选,得到筛选后的所述深度图像序列。It should be noted that not all collected image frames are suitable for three-dimensional human body reconstruction. Therefore, based on the method embodiment shown in FIG. 2, before sending the parallax image sequence or the depth image sequence to the server, it also includes: Multi-distance calibration is performed on the depth data in the depth image sequence; the calibrated depth image sequence is screened to obtain the screened depth image sequence.
在一个实施例中,深度相机除了包括多距离标定单元,还包括有效帧检测单元,有效帧检测单元用于对经标定后的深度图像序列进行筛选以舍弃以去除冗余帧,且进一步减小后续三维重建的数据量。In one embodiment, in addition to the multi-distance calibration unit, the depth camera also includes a valid frame detection unit. The valid frame detection unit is used to screen the calibrated depth image sequence to remove redundant frames and further reduce The amount of data for subsequent 3D reconstruction.
为了进一步减少后续三维重建计算的数据量以及提升计算速度,在前述方法实施例的基础上,得到筛选后的所述深度图像序列之后,还包括:对筛选后 的所述深度图像序列进行掩膜处理,获取深度人体掩膜图像序列。In order to further reduce the amount of data for subsequent 3D reconstruction calculations and increase the calculation speed, on the basis of the foregoing method embodiment, after obtaining the selected depth image sequence, the method further includes: masking the selected depth image sequence Process to obtain a sequence of deep human mask images.
在一个实施例中,深度相机还包括深度人体掩膜图像获取单元,用于对上述经筛选后的深度图像序列进行掩膜处理以获取深度人体掩膜图像序列,具体地,可用预先制作的感兴趣区掩膜与上述深度图像序列相乘,以去除背景区域得到深度人体掩膜图像序列。In one embodiment, the depth camera further includes a depth human body mask image acquisition unit for performing mask processing on the above-mentioned filtered depth image sequence to acquire a depth human body mask image sequence. Specifically, a pre-made sensory image sequence may be used. The interest region mask is multiplied by the above-mentioned depth image sequence to remove the background area to obtain the depth human body mask image sequence.
在前述方法实施例的基础上,获取深度人体掩膜图像序列之后,还包括:对所述深度人体掩膜图像序列进行计算,获取视差人体掩膜图像序列。On the basis of the foregoing method embodiment, after acquiring the depth human body mask image sequence, the method further includes: calculating the depth human body mask image sequence to obtain the parallax human body mask image sequence.
在一个实施例中,上述深度相机还包括视差人体掩膜图像获取单元,用于对上述深度人体掩膜图像进行计算以获取视差人体掩膜图像序列。与深度图像数据相比,视差图像数据动态范围比较小(每个像素可以用12bit表达或者更少),邻域像素间的变化很小,可以获得更低的码率,可以进一步提升传输速度。In one embodiment, the above-mentioned depth camera further includes a parallax human body mask image acquisition unit, configured to calculate the above-mentioned depth human body mask image to obtain a parallax human body mask image sequence. Compared with the depth image data, the dynamic range of the parallax image data is relatively small (each pixel can be expressed in 12 bits or less), the change between neighboring pixels is small, a lower bit rate can be obtained, and the transmission speed can be further improved.
在前述方法实施例的基础上,得到筛选后的所述深度图像序列之后,还包括:对筛选后的所述深度图像序列进行计算,获取所述人体的骨架信息。On the basis of the foregoing method embodiment, after obtaining the screened depth image sequence, the method further includes: calculating the screened depth image sequence to obtain skeleton information of the human body.
在一个实施例中,深度相机还包括骨架获取单元,用于对上述经筛选后的深度图像序列进行计算以获取人体骨架信息,人体骨架信息主要用于后续的三维重建。In an embodiment, the depth camera further includes a skeleton acquisition unit for calculating the above-mentioned filtered depth image sequence to obtain human body skeleton information, which is mainly used for subsequent three-dimensional reconstruction.
在其他一些实施例中,为了缩短了传输数据时间,从而进一步提高人体三维模型的效率,深度相机可以通过压缩编码的方式将获取的数据上传给服务器。In some other embodiments, in order to shorten the data transmission time and further improve the efficiency of the three-dimensional human body model, the depth camera may upload the acquired data to the server through compression coding.
具体地,深度相机还包括压缩编码单元,用于将深度人体掩膜图像序列、第二参数(包括深度相机的内参)、以及人体骨架信息进行压缩编码,将其进行压缩至原始数据大小的10%后再通过数据上传单元上传至所述服务器,缩短了传输数据时间,从而进一步提高人体三维模型的效率。需要说明的是,上传深度图像数据,无需同时上传多个参数,系统设计比较简单,但是动态范围比较大,而且相邻像素在距离较远处的变化较大,不利于压缩编码。Specifically, the depth camera also includes a compression encoding unit for compressing and encoding the depth human mask image sequence, the second parameters (including the internal parameters of the depth camera), and the human skeleton information, and compressing them to 10% of the original data size. % Is then uploaded to the server through the data upload unit, which shortens the data transmission time, thereby further improving the efficiency of the three-dimensional human body model. It should be noted that uploading the depth image data does not need to upload multiple parameters at the same time. The system design is relatively simple, but the dynamic range is relatively large, and the adjacent pixels change greatly at a long distance, which is not conducive to compression coding.
因此在其他一些实施例中,压缩编码单元将视差人体掩膜图像序列、第一参数(包括深度相机的内参、视差转深度参数、和多距离标定参数)、以及人 体骨架信息进行压缩编码,将其进行压缩至原始数据大小的10%后通过数据上传单元上传至所述服务器。与深度图像数据相比,视差图像数据动态范围比较小(每个像素可以用12bit表达或者更少),邻域像素间的变化很小,可以获得更低的码率,但是需要上传多距离标定参数等额外的参数。Therefore, in some other embodiments, the compression encoding unit compresses and encodes the parallax human body mask image sequence, the first parameters (including the internal parameters of the depth camera, the parallax conversion depth parameter, and the multi-distance calibration parameter), and the human skeleton information, and It is compressed to 10% of the original data size and then uploaded to the server through the data upload unit. Compared with depth image data, the dynamic range of parallax image data is relatively small (each pixel can be expressed in 12bit or less), the change between neighboring pixels is small, and lower bit rates can be obtained, but multi-distance calibration needs to be uploaded Parameters and other additional parameters.
当深度相机通过压缩编码的方式将获取的数据上传给服务器时,服务器先将接收到的压缩数据进行解码解压,以获取视差人体掩膜图像序列、第一参数、以及人体骨架信息;或深度人体掩膜图像序列、第二参数、以及人体骨架信息,并通过上述解码解压后得到的数据进行人体三维重建。When the depth camera uploads the acquired data to the server through compression and encoding, the server first decodes and decompresses the received compressed data to obtain the parallax human body mask image sequence, the first parameter, and the human body skeleton information; or the deep human body The mask image sequence, the second parameter, and the human body skeleton information are used to perform three-dimensional reconstruction of the human body through the data obtained after the above-mentioned decoding and decompression.
在一个实施例中,服务器包括解码解压单元和三维重建单元。服务器接收到压缩数据之后,通过解码解压单元将接收的所述经压缩编码后的数据进行解码解压以获取视差人体掩膜图像序列、第一参数以及人体骨架信息;或深度人体掩膜图像序列、第二参数以及人体骨架信息,并通过上述解码解压后得到的数据由三维重建单元进行人体三维重建。可以理解的是,当服务器接收的为视差人体掩膜图像序列时,还需要根据视差转深度参数将视差人体掩膜图像序列时转换成深度人体掩膜图像序列,接着根据深度相机内参和多距离标定参数对深度人体掩膜图像序列中的深度数据进行修正,以减小系统测量误差。In one embodiment, the server includes a decoding and decompression unit and a three-dimensional reconstruction unit. After the server receives the compressed data, the received compressed and encoded data is decoded and decompressed by the decoding and decompression unit to obtain the parallax human body mask image sequence, the first parameters, and the human body skeleton information; or the depth human body mask image sequence, The second parameter and the human skeleton information, and the data obtained after the above-mentioned decoding and decompression are subjected to the three-dimensional reconstruction of the human body by the three-dimensional reconstruction unit. It is understandable that when the server receives a parallax human mask image sequence, it also needs to convert the parallax human mask image sequence into a depth human mask image sequence according to the parallax conversion depth parameter, and then according to the depth camera internal parameters and multi-distance The calibration parameters correct the depth data in the depth human body mask image sequence to reduce the system measurement error.
在本申请其他实施例中,服务器还包括数据测量单元,用于测量所需人体部位的维度,并将测量结果推送给客户端。在一个实施例中,测量部位包括但不限于:胸围、腰围、臀围、上臂围、下臂围、大腿围和小腿围等。In other embodiments of the present application, the server further includes a data measuring unit, which is used to measure the dimensions of the required body part and push the measurement result to the client. In one embodiment, the measurement locations include, but are not limited to: chest circumference, waist circumference, hip circumference, upper arm circumference, lower arm circumference, thigh circumference, calf circumference, and the like.
如此,在整个系统中,数据采集、数据处理、数据显示分别在三个不同的设备中进行,可提升三维数据测量的速度与精度。In this way, in the entire system, data collection, data processing, and data display are carried out in three different devices, which can improve the speed and accuracy of three-dimensional data measurement.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution. The execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application.
对应于上文实施例所述的重建人体三维模型的方法,图4示出了本申请实施例提供的重建人体三维模型的装置的结构框图,该重建人体三维模型的装置 配置于深度相机。为了便于说明,仅示出了与本申请实施例相关的部分。Corresponding to the method for reconstructing a three-dimensional human body model described in the above embodiment, FIG. 4 shows a structural block diagram of a device for reconstructing a three-dimensional human body model provided by an embodiment of the present application, and the device for reconstructing a three-dimensional human body model is configured in a depth camera. For ease of description, only the parts related to the embodiments of the present application are shown.
参照图4,该装置包括:Referring to Figure 4, the device includes:
图像采集单元41,用于采集多帧包括人体各个局部的红外图像序列;The image acquisition unit 41 is used to acquire multiple frames of infrared image sequences including various parts of the human body;
图像处理单元42,用于对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列;The image processing unit 42 is configured to process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
数据上传单元43,用于若根据所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离,则发送所述视差图像序列或所述深度图像序列至服务器,所述视差图像序列或所述深度图像序列用于指示所述服务器重建真实人体三维模型。The data uploading unit 43 is configured to send the parallax image sequence or the depth image sequence to the server if it is determined that the human body is at a preset posture and distance according to the parallax image sequence or the depth image sequence. The image sequence or the depth image sequence is used to instruct the server to reconstruct a three-dimensional model of a real human body.
可选地,在图4所示实施例的基础上,如图5所示,该装置还包括:Optionally, on the basis of the embodiment shown in FIG. 4, as shown in FIG. 5, the device further includes:
位置检测单元44,用于根据所述视差图像序列或所述深度图像序列判断所述人体是否处于预设姿势及距离;The position detection unit 44 is configured to determine whether the human body is in a preset posture and distance according to the parallax image sequence or the depth image sequence;
提醒单元45,用于若根据所述视差图像序列或所述深度图像序列确定所述人体未处于预设姿势及距离,则发出调整提醒,直至根据继续采集的所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离。The reminding unit 45 is configured to send an adjustment reminder if it is determined according to the parallax image sequence or the depth image sequence that the human body is not in the preset posture and distance, until according to the continuously collected parallax image sequence or the depth The image sequence determines that the human body is in a preset posture and distance.
可选地,在图5所示实施例的基础上,如图6所示,该装置还包括:Optionally, on the basis of the embodiment shown in FIG. 5, as shown in FIG. 6, the device further includes:
多距离标定单元46,用于对所述深度图像序列中的深度数据进行多距离标定。The multi-distance calibration unit 46 is configured to perform multi-distance calibration on the depth data in the depth image sequence.
可选地,在图5所示实施例的基础上,如图7所示,该装置还包括:Optionally, on the basis of the embodiment shown in FIG. 5, as shown in FIG. 7, the device further includes:
多距离标定单元46,用于对所述深度图像序列中的深度数据进行多距离标定;The multi-distance calibration unit 46 is configured to perform multi-distance calibration on the depth data in the depth image sequence;
有效帧检测单元47,用于对经过标定后的所述深度图像序列进行筛选,得到筛选后的所述深度图像序列。The effective frame detection unit 47 is configured to screen the calibrated depth image sequence to obtain the screened depth image sequence.
可选地,在图7所示实施例的基础上,如图8所示,该装置还包括:Optionally, on the basis of the embodiment shown in FIG. 7, as shown in FIG. 8, the device further includes:
骨架获取单元48,用于对筛选后的所述深度图像序列进行计算,获取所述人体的骨架信息。The skeleton obtaining unit 48 is configured to calculate the depth image sequence after screening to obtain skeleton information of the human body.
深度人体掩膜图像序列获取单元49,用于对筛选后的所述深度图像序列进行掩膜处理,获取深度人体掩膜图像序列。The depth human body mask image sequence acquiring unit 49 is configured to perform mask processing on the screened depth image sequence to acquire the depth human body mask image sequence.
视差人体掩膜图像序列获取单元50,用于对所述深度人体掩膜图像序列进行计算,获取视差人体掩膜图像序列。The parallax human body mask image sequence acquiring unit 50 is configured to calculate the depth human body mask image sequence to obtain the parallax human body mask image sequence.
压缩编码单元51,用于将深度人体掩膜图像序列、第二参数、以及人体骨架信息进行压缩编码;或用于将视差人体掩膜图像序列、第一参数、以及人体骨架信息进行压缩编码,得到压缩编码数据。The compression encoding unit 51 is used for compressing and encoding the deep human body mask image sequence, the second parameter, and the human body skeleton information; or for compressing and encoding the parallax human body mask image sequence, the first parameter, and the human body skeleton information, Get compressed coded data.
相应的,数据上传单元43,用于将所述压缩编码数据上传至服务器,所述压缩编码数据用于指示服务器重建人体三维模型。Correspondingly, the data uploading unit 43 is configured to upload the compressed coded data to the server, and the compressed coded data is used to instruct the server to reconstruct the three-dimensional model of the human body.
需要说明的是,上述单元之间的信息交互、执行过程等内容,由于与本申请方法实施例基于同一构思,其具体功能及带来的技术效果,具体可参见方法实施例部分,此处不再赘述。It should be noted that the information exchange and execution process between the above-mentioned units are based on the same idea as the method embodiment of this application, and its specific functions and technical effects can be found in the method embodiment section. Go into details again.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In practical applications, the above functions can be allocated to different functional units and modules as needed. Module completion, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments can be integrated into one processing unit, or each unit can exist alone physically, or two or more units can be integrated into one unit. The above-mentioned integrated units can be hardware-based Formal realization can also be realized in the form of software functional units. In addition, the specific names of the functional units and modules are only used to facilitate distinguishing from each other, and are not used to limit the protection scope of the present application. For the specific working process of the units and modules in the foregoing system, reference may be made to the corresponding process in the foregoing method embodiment, which will not be repeated here.
需要说明的是,上述装置/单元之间的信息交互、执行过程等内容,由于与本申请方法实施例基于同一构思,其具体功能及带来的技术效果,具体可参见方法实施例部分,此处不再赘述。It should be noted that the information interaction and execution process between the above-mentioned devices/units are based on the same concept as the method embodiment of this application, and its specific functions and technical effects can be found in the method embodiment section. I won't repeat it here.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上 述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In practical applications, the above functions can be allocated to different functional units and modules as required. Module completion, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments can be integrated into one processing unit, or each unit can exist alone physically, or two or more units can be integrated into one unit. The above-mentioned integrated units can be hardware-based Formal realization can also be realized in the form of software functional units. In addition, the specific names of the functional units and modules are only used to facilitate distinguishing from each other, and are not used to limit the protection scope of the present application. For the specific working process of the units and modules in the foregoing system, reference may be made to the corresponding process in the foregoing method embodiment, which will not be repeated here.
本申请实施例还提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现可实现上述各个方法实施例中的步骤。The embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the steps in each of the foregoing method embodiments can be realized.
本申请实施例提供了一种计算机程序产品,当计算机程序产品在电子设备上运行时,使得电子设备执行时实现可实现上述各个方法实施例中的步骤。The embodiments of the present application provide a computer program product. When the computer program product runs on an electronic device, the electronic device can realize the steps in the foregoing method embodiments when the electronic device is executed.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实现上述实施例方法中的全部或部分流程,可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质至少可以包括:能够将计算机程序代码携带到拍照装置/终端设备的任何实体或装置、记录介质、计算机存储器、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、电载波信号、电信信号以及软件分发介质。例如U盘、移动硬盘、磁碟或者光盘等。在某些司法管辖区,根据立法和专利实践,计算机可读介质不可以是电载波信号和电信信号。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable storage medium. Based on this understanding, the implementation of all or part of the processes in the above-mentioned embodiment methods in this application can be accomplished by instructing relevant hardware through a computer program. The computer program can be stored in a computer-readable storage medium. When executed by the processor, the steps of the foregoing method embodiments can be implemented. Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file, or some intermediate forms. The computer-readable medium may at least include: any entity or device capable of carrying computer program code to the photographing device/terminal device, recording medium, computer memory, read-only memory (ROM), random access memory (Random Access Memory, RAM), electric carrier signal, telecommunications signal, and software distribution medium. Such as U disk, mobile hard disk, floppy disk or CD-ROM, etc. In some jurisdictions, in accordance with legislation and patent practices, computer-readable media cannot be electrical carrier signals and telecommunication signals.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail or recorded in an embodiment, reference may be made to related descriptions of other embodiments.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。A person of ordinary skill in the art may realize that the units and algorithm steps of the examples described in combination with the embodiments disclosed herein can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered beyond the scope of this application.
在本申请所提供的实施例中,应该理解到,所揭露的装置/网络设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/网络设备实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided in this application, it should be understood that the disclosed apparatus/network equipment and method may be implemented in other ways. For example, the device/network device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division, and there may be other divisions in actual implementation, such as multiple units. Or components can be combined or integrated into another system, or some features can be omitted or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that it can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application.

Claims (10)

  1. 一种重建人体三维模型的方法,其特征在于,包括:A method for reconstructing a three-dimensional human body model, which is characterized in that it includes:
    采集多帧包括人体各个局部的红外图像序列;Collect multiple frames of infrared image sequences including various parts of the human body;
    对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列;Processing the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
    对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码;Performing compression coding on the disparity image sequence and the first parameter, or performing compression coding on the depth image sequence and the second parameter;
    将进行压缩编码后的压缩数据上传至服务器,所述压缩数据用于指示所述服务器对接收到的所述压缩数据进行解压解码后重建真实人体三维模型。Upload the compressed data that has been compressed and encoded to a server, where the compressed data is used to instruct the server to decompress and decode the received compressed data to reconstruct a real human three-dimensional model.
  2. 如权利要求1所述的方法,其特征在于,若根据所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离,则执行所述对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码的步骤。The method according to claim 1, wherein if it is determined that the human body is at a preset posture and distance according to the parallax image sequence or the depth image sequence, then the pairing of the parallax image sequence and the first The parameters are compressed and encoded, or the depth image sequence and the second parameter are compressed and encoded.
  3. 如权利要求1或2所述的方法,其特征在于,所述获取相应的视差图像序列或深度图像序列之后,还包括:The method according to claim 1 or 2, characterized in that, after acquiring the corresponding disparity image sequence or depth image sequence, the method further comprises:
    若根据所述视差图像序列或所述深度图像序列确定所述人体未处于预设姿势及距离,则发出调整提醒,直至根据继续采集的所述视差图像序列或所述深度图像序列确定所述人体处于预设姿势及距离。If it is determined according to the parallax image sequence or the depth image sequence that the human body is not in the preset posture and distance, an adjustment reminder is issued until the human body is determined according to the continuously collected parallax image sequence or the depth image sequence In a preset position and distance.
  4. 如权利要求1或2所述的方法,其特征在于,所述压缩数据还用于指示所述服务器在重建真实人体三维模型后,对所述真实人体三维模型进行测量,获取所述人体的三维数据。The method according to claim 1 or 2, wherein the compressed data is further used to instruct the server to measure the three-dimensional model of the real human body after reconstructing the three-dimensional model of the real human body to obtain the three-dimensional model of the human body. data.
  5. 如权利要求1或2所述的方法,其特征在于,所述对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码之前,还包括:The method according to claim 1 or 2, characterized in that, before compressing and encoding the disparity image sequence and the first parameter, or compressing and encoding the depth image sequence and the second parameter, the method further comprises:
    对所述深度图像序列中的深度数据进行多距离标定;Performing multi-distance calibration on the depth data in the depth image sequence;
    or
    对所述深度图像序列中的深度数据进行多距离标定;对经过标定后的所述深度图像序列进行筛选,得到筛选后的所述深度图像序列。Multi-distance calibration is performed on the depth data in the depth image sequence; the calibrated depth image sequence is screened to obtain the screened depth image sequence.
  6. 如权利要求5所述的方法,其特征在于,所述得到筛选后的所述深度图像序列之后,还包括:The method according to claim 5, characterized in that, after obtaining the selected depth image sequence, the method further comprises:
    对筛选后的所述深度图像序列进行计算,获取所述人体的骨架信息;Calculating the selected depth image sequence to obtain skeleton information of the human body;
    or
    对筛选后的所述深度图像序列进行掩膜处理,获取深度人体掩膜图像序列;Performing mask processing on the selected depth image sequence to obtain a depth human body mask image sequence;
    or
    对筛选后的所述深度图像序列进行掩膜处理,获取深度人体掩膜图像序列;对所述深度人体掩膜图像序列进行计算,获取视差人体掩膜图像序列。Mask processing is performed on the selected depth image sequence to obtain a depth human body mask image sequence; the depth human body mask image sequence is calculated to obtain a parallax human body mask image sequence.
  7. 如权利要求1或2所述的方法,其特征在于,所述第一参数包括:所述深度相机的内参、视差转深度参数以及多距离标定参数;所述第二参数包括所述深度相机的内参。The method according to claim 1 or 2, wherein the first parameter includes: internal parameters of the depth camera, parallax to depth parameters, and multi-distance calibration parameters; the second parameter includes the depth camera Internal reference.
  8. 一种重建人体三维模型的装置,其特征在于,包括:A device for reconstructing a three-dimensional human body model, characterized in that it comprises:
    图像采集单元,用于采集多帧包括人体各个局部的红外图像序列;The image acquisition unit is used to acquire multiple frames of infrared image sequences including various parts of the human body;
    图像处理单元,用于对所述红外图像序列进行处理,获取相应的视差图像序列或深度图像序列;An image processing unit, configured to process the infrared image sequence to obtain a corresponding parallax image sequence or depth image sequence;
    压缩编码单元,用于对所述视差图像序列和第一参数进行压缩编码,或对所述深度图像序列和第二参数进行压缩编码;A compression coding unit, configured to perform compression coding on the disparity image sequence and the first parameter, or perform compression coding on the depth image sequence and the second parameter;
    数据上传单元,用于将进行压缩编码后的压缩数据上传至服务器,所述压缩数据用于指示所述服务器对接收到的所述压缩数据进行解压解码后重建真实人体三维模型。The data uploading unit is configured to upload compressed data that has been compressed and encoded to a server, and the compressed data is used to instruct the server to decompress and decode the received compressed data to reconstruct a real three-dimensional human body model.
  9. 一种重建人体三维模型的系统,其特征在于,包括服务器和深度相机,所述服务器用于对接收到的所述压缩数据进行解压解码后重建真实人体三维模型,所述深度相机包括如权利要求8所述的装置。A system for reconstructing a three-dimensional human body model, comprising a server and a depth camera, the server is used to decompress and decode the received compressed data to reconstruct a real three-dimensional human body model, and the depth camera includes 8 said device.
  10. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程 序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至7任一项所述的方法。A computer-readable storage medium storing a computer program, wherein the computer program implements the method according to any one of claims 1 to 7 when the computer program is executed by a processor.
PCT/CN2020/089885 2019-10-25 2020-05-12 Method, apparatus and system for reconstructing three-dimensional model of human body, and readable storage medium WO2021077721A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911025195.4A CN110874851A (en) 2019-10-25 2019-10-25 Method, device, system and readable storage medium for reconstructing three-dimensional model of human body
CN201911025195.4 2019-10-25

Publications (1)

Publication Number Publication Date
WO2021077721A1 true WO2021077721A1 (en) 2021-04-29

Family

ID=69717909

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/089885 WO2021077721A1 (en) 2019-10-25 2020-05-12 Method, apparatus and system for reconstructing three-dimensional model of human body, and readable storage medium

Country Status (2)

Country Link
CN (1) CN110874851A (en)
WO (1) WO2021077721A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110874851A (en) * 2019-10-25 2020-03-10 深圳奥比中光科技有限公司 Method, device, system and readable storage medium for reconstructing three-dimensional model of human body
CN113192185B (en) * 2021-05-18 2022-05-17 清华大学 Dynamic light field reconstruction method, device and equipment
CN113727105B (en) * 2021-09-08 2022-04-26 北京医百科技有限公司 Depth map compression method, device, system and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110148868A1 (en) * 2009-12-21 2011-06-23 Electronics And Telecommunications Research Institute Apparatus and method for reconstructing three-dimensional face avatar through stereo vision and face detection
US20130034264A1 (en) * 2011-08-04 2013-02-07 National Taiwan University Locomotion analysis method and locomotion analysis apparatus
CN104008571A (en) * 2014-06-12 2014-08-27 深圳奥比中光科技有限公司 Human body model obtaining method and network virtual fitting system based on depth camera
CN108876836A (en) * 2018-03-29 2018-11-23 北京旷视科技有限公司 A kind of depth estimation method, device, system and computer readable storage medium
CN109903379A (en) * 2019-03-05 2019-06-18 电子科技大学 A kind of three-dimensional rebuilding method based on spots cloud optimization sampling
CN110874851A (en) * 2019-10-25 2020-03-10 深圳奥比中光科技有限公司 Method, device, system and readable storage medium for reconstructing three-dimensional model of human body

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100263935B1 (en) * 1997-12-30 2000-08-16 윤종용 Apparatus and method for converting 2-d video sequence to 3-d video using conversion of motional disparity to vertical disparity
CN102178530A (en) * 2011-01-24 2011-09-14 天津大学 Method for automatically measuring human body dimensions on basis of three-dimensional point cloud data
CN105678842A (en) * 2016-01-11 2016-06-15 湖南拓视觉信息技术有限公司 Manufacturing method and device for three-dimensional map of indoor environment
CN105847777B (en) * 2016-03-24 2018-04-17 湖南拓视觉信息技术有限公司 A kind of method and device for transmitting three dimensional depth image
CN106997614B (en) * 2017-03-17 2021-07-20 浙江光珀智能科技有限公司 Large-scale scene 3D modeling method and device based on depth camera
CN106981091B (en) * 2017-04-27 2020-04-17 深圳奥比中光科技有限公司 Human body three-dimensional modeling data processing method and device
CN108053476B (en) * 2017-11-22 2021-06-04 上海大学 Human body parameter measuring system and method based on segmented three-dimensional reconstruction
CN108154551B (en) * 2017-11-29 2021-04-30 奥比中光科技集团股份有限公司 Method and system for dynamically reconstructing three-dimensional human body model in real time
CN108335353B (en) * 2018-02-23 2020-12-22 清华-伯克利深圳学院筹备办公室 Three-dimensional reconstruction method, device and system of dynamic scene, server and medium
CN108510573B (en) * 2018-04-03 2021-07-30 南京大学 Multi-view face three-dimensional model reconstruction method based on deep learning
CN108876897B (en) * 2018-04-20 2022-10-18 杭州电子科技大学 Scene three-dimensional reconstruction method under rapid motion
CN108596128B (en) * 2018-04-28 2020-06-26 京东方科技集团股份有限公司 Object recognition method, device and storage medium
CN108769646B (en) * 2018-05-06 2020-02-18 Oppo广东移动通信有限公司 Three-dimensional video communication method and system, electronic device, server and readable storage medium
CN108596923B (en) * 2018-05-14 2020-10-16 江苏哈工智能机器人股份有限公司 Three-dimensional data acquisition method and device and electronic equipment
CN109344706A (en) * 2018-08-28 2019-02-15 杭州电子科技大学 It is a kind of can one man operation human body specific positions photo acquisition methods
CN109413152B (en) * 2018-09-25 2021-02-26 上海瑾盛通信科技有限公司 Image processing method, image processing device, storage medium and electronic equipment
CN109410318B (en) * 2018-09-30 2020-09-08 先临三维科技股份有限公司 Three-dimensional model generation method, device, equipment and storage medium
CN109544677B (en) * 2018-10-30 2020-12-25 山东大学 Indoor scene main structure reconstruction method and system based on depth image key frame
CN109584358A (en) * 2018-11-28 2019-04-05 深圳市商汤科技有限公司 A kind of three-dimensional facial reconstruction method and device, equipment and storage medium
CN110276831B (en) * 2019-06-28 2022-03-18 Oppo广东移动通信有限公司 Method and device for constructing three-dimensional model, equipment and computer-readable storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110148868A1 (en) * 2009-12-21 2011-06-23 Electronics And Telecommunications Research Institute Apparatus and method for reconstructing three-dimensional face avatar through stereo vision and face detection
US20130034264A1 (en) * 2011-08-04 2013-02-07 National Taiwan University Locomotion analysis method and locomotion analysis apparatus
CN104008571A (en) * 2014-06-12 2014-08-27 深圳奥比中光科技有限公司 Human body model obtaining method and network virtual fitting system based on depth camera
CN108876836A (en) * 2018-03-29 2018-11-23 北京旷视科技有限公司 A kind of depth estimation method, device, system and computer readable storage medium
CN109903379A (en) * 2019-03-05 2019-06-18 电子科技大学 A kind of three-dimensional rebuilding method based on spots cloud optimization sampling
CN110874851A (en) * 2019-10-25 2020-03-10 深圳奥比中光科技有限公司 Method, device, system and readable storage medium for reconstructing three-dimensional model of human body

Also Published As

Publication number Publication date
CN110874851A (en) 2020-03-10

Similar Documents

Publication Publication Date Title
WO2021077721A1 (en) Method, apparatus and system for reconstructing three-dimensional model of human body, and readable storage medium
CN110874864B (en) Method, device, electronic equipment and system for obtaining three-dimensional model of object
CN110853133A (en) Method, device, system and readable storage medium for reconstructing three-dimensional model of human body
CN111060023B (en) High-precision 3D information acquisition equipment and method
CN103971408B (en) Three-dimensional facial model generating system and method
US11504055B2 (en) Systems and methods for analyzing cutaneous conditions
CN110335343B (en) Human body three-dimensional reconstruction method and device based on RGBD single-view-angle image
JP6429772B2 (en) 3D scanning and positioning system
US20060104503A1 (en) Apparatus and method for rapidly measuring 3-Dimensional foot sizes from multi-images
US8917317B1 (en) System and method for camera calibration
WO2024032670A1 (en) Three-dimensional reconstruction method, apparatus and system
US11722652B2 (en) Method and system for multi-wavelength depth encoding for three- dimensional range geometry compression
Reichinger et al. Evaluation of methods for optical 3-D scanning of human pinnas
CN111060008B (en) 3D intelligent vision equipment
CN114152217B (en) Binocular phase expansion method based on supervised learning
KR20130081037A (en) Apparatus for measuring body size and method thereof
CN107493452B (en) Video picture processing method and device and terminal
KR20210068409A (en) Information processing devices, information processing methods and information processing programs
CN111064949B (en) Intelligent 3D acquisition module for mobile terminal
WO2019091010A1 (en) Three-dimensional scanning method, apparatus and system, storage medium and processor
US20230306636A1 (en) Object three-dimensional localizations in images or videos
JP2015156540A (en) Image processing device, image processing method, and image processing program
CN117372657A (en) Training method and device for key point rotation model, electronic equipment and storage medium
CN104899916B (en) Human cervical spine three-dimensional reconstruction method and device based on optical grating projection
CN107507272A (en) Establish the method, apparatus and terminal device of human 3d model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20878480

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20878480

Country of ref document: EP

Kind code of ref document: A1