CN112949506A - Low-cost real-time bone key point identification method and device - Google Patents

Low-cost real-time bone key point identification method and device Download PDF

Info

Publication number
CN112949506A
CN112949506A CN202110246577.0A CN202110246577A CN112949506A CN 112949506 A CN112949506 A CN 112949506A CN 202110246577 A CN202110246577 A CN 202110246577A CN 112949506 A CN112949506 A CN 112949506A
Authority
CN
China
Prior art keywords
network
image
key point
neural network
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110246577.0A
Other languages
Chinese (zh)
Inventor
程煜钧
张哲为
丁博文
顾友良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Ziweiyun Technology Co ltd
Original Assignee
Guangzhou Ziweiyun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Ziweiyun Technology Co ltd filed Critical Guangzhou Ziweiyun Technology Co ltd
Priority to CN202110246577.0A priority Critical patent/CN112949506A/en
Publication of CN112949506A publication Critical patent/CN112949506A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Human Computer Interaction (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a low-cost real-time skeleton key point identification device, which comprises an image acquisition module, a core calculation unit, a lightweight neural network algorithm module, a neural network acceleration engine and a skeleton key point output module, wherein the image acquisition module acquires an image and sends acquired image information to the core calculation unit; the core computing unit is used for processing the acquired image, the lightweight neural network algorithm module adopts an improved SqueezeNet as a basic backbone network, multi-scale feature extraction is carried out on the image by combining a feature pyramid network, the neural network acceleration module is used for accelerating the network, finally, the skeleton key point output module is used for outputting the skeleton key points, the image acquisition module adopts any monocular camera, and the core computing unit adopts a mobile terminal CPU.

Description

Low-cost real-time bone key point identification method and device
Technical Field
The invention relates to the technical field of computer vision, in particular to a low-cost real-time bone key point identification method and device.
Background
The identification technology of the key points of the skeleton is one of the basic technologies of computer vision. The technology detects joints and five sense organs of a human body in image/video data through a sensor (a camera, infrared rays and other equipment), and describes human skeleton information through key points.
However, most of the existing bone key point recognition algorithms based on deep learning are difficult to run in real time on a low-cost hardware platform, and high-cost hardware (such as a GPU or a high-end camera) needs to be matched to achieve real time. The method is based on a series of software and hardware optimization technologies, and can be realized on a low-cost hardware platform to complete real-time identification of the skeletal key points. On the premise of ensuring the accuracy, compared with the identification time delay of about 50 milliseconds introduced by the mainstream method, the identification time delay of 3 milliseconds at the lowest can be achieved by the technology of the invention.
The traditional skeleton key point algorithm is carried out on the basis of geometric prior based on the idea of template matching, and the accuracy is poor. Due to the limitation of hardware performance, the existing bone key point identification algorithm based on deep learning has a low identification speed on a low-cost hardware platform (such as a mobile terminal mobile phone and a tablet), and the linkage application of the algorithm can cause the situations of application blocking, frame loss and the like, so that the user experience is greatly influenced.
Disclosure of Invention
Based on the defects, the invention mainly aims at the identification of the bone key points of the mobile terminal/embedded equipment, adopts a lightweight deep learning algorithm and applies a neural network acceleration technology for optimization, hardware only needs to adopt a CPU and a monocular camera, and can finish low-cost real-time identification of the bone key points without a GPU or a high-end camera (kinect).
The present invention is directed to solving at least one of the problems of the prior art. Therefore, the invention discloses a low-cost real-time bone key point identification device which comprises an image acquisition module, a core calculation unit, a lightweight neural network algorithm module, a neural network acceleration engine and a bone key point output module, wherein the image acquisition module acquires an image and sends acquired image information to the core calculation unit; the core computing unit is used for processing the acquired image, the lightweight neural network algorithm module adopts an improved SqueezeNet as a basic backbone network, multi-scale feature extraction is carried out on the image by combining a feature pyramid network, the neural network acceleration module is used for accelerating the network, finally, the skeleton key point output module is used for outputting the skeleton key points, the image acquisition module adopts any monocular camera, and the core computing unit adopts a mobile terminal CPU.
Still further, the neural network acceleration module further comprises: the input image firstly enters a modified Squeezenet backbone network for calculation, wherein the Squeezenet backbone network consists of two convolution layers, eight Fire layers and three pooling layers, wherein the convolution layers conv1 pass through 96 groups of convolution kernels of 7x7, and the convolution layers conv10 pass through 1000 groups of convolution kernels of 1x 1; the sizes of the pooling layers maxpool1, maxpool2 and maxpool3 are all 3x 3; the fire layer structure is unified, wherein the Squeeze part is composed of continuous 1x1 convolutions, and the Expand part is composed of continuous 1x1 convolutions and 3x3 convolution links.
Further, if the number of channels of the Squeeze portion is denoted as S1x1, and the number of convolutions of the expanded portion 1x1 and the number of convolutions of the expanded portion 3x3 are denoted as e1x1 and e3x3, respectively, the relationship between the above three satisfies the following equation: s1 × 1< e1 × 1+ e3 × 3.
Furthermore, the neural network acceleration module adds a network shortcut (short-cut) among the fire3, the fire5, the fire7 and the fire9 layers to improve the robustness during training and prevent overfitting, and the improved Squeezenet backbone network outputs a series of convolution characteristic graphs to a characteristic pyramid network for further calculation.
Furthermore, four scales of fire2, fire6, fire5 and conv10 in the backbone network squeezet are extracted to construct a feature pyramid network, and after the feature pyramid network is processed, a prediction result is output on each scale respectively.
Still further, the feature pyramid network further comprises: the predicted result contains two parts: the heat map (heatmap) and the vector field (paf) are combined to predict results of heatmap and paf in multiple scales, and finally the model obtains output results of the bone key points. If n-scale predictors are obtained, the final predictors Fheatmap and Fpaf of heatmap and paf are expressed as follows:
Figure BDA0002964092140000021
Figure BDA0002964092140000022
where vh _ i and vp _ i are the heatmap and paf values output for each scale, respectively.
Still further, the neural network acceleration module further comprises: the method comprises the steps of quantizing and accelerating an engine, wherein the quantizing is that data in an FP32 format are converted into an INT8 format in an inference process for data in an FP32 format adopted in a model training process, and the accelerating engine accelerates in the model inference process based on an OpenCL framework, so that the real-time performance of a bone key point identification process is further improved.
The application also discloses a low-cost real-time bone key point identification method, the device is adopted to collect image data, image processing is carried out on the obtained image information, multi-scale feature extraction is carried out on the image by adopting the improved SqueezeNet as a basic backbone network and combining a feature pyramid network, after the multi-scale feature extraction is carried out on the image by the feature pyramid network, a prediction result is respectively output on each scale, and finally the bone key point information of the collected image is output in real time by quantizing and accelerating the data in the reasoning process.
Compared with the prior art, the invention has the beneficial effects that: the invention mainly aims at the identification of the bone key points of mobile terminal/embedded equipment, adopts a lightweight deep learning algorithm and applies a neural network acceleration technology for optimization, hardware only needs to adopt a CPU and a monocular camera to finish low-cost real-time identification of the bone key points, a GPU or a high-end camera is not needed, an acceleration engine of a quantification and reasoning network is adopted, and on the premise of ensuring accuracy, compared with the identification time delay of about 50 milliseconds introduced by a mainstream method, the identification time delay of 3 milliseconds at least can be achieved by the technology of the invention. The real-time performance of the recognition speed is guaranteed through the technology while the hardware cost is reduced.
Drawings
The invention will be further understood from the following description in conjunction with the accompanying drawings. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the embodiments. In the drawings, like reference numerals designate corresponding parts throughout the different views.
FIG. 1 is a block diagram of the core modules of the real-time bone identification method and apparatus of the present invention;
FIG. 2 is a block diagram of the overall architecture of a lightweight network in one embodiment of the invention;
FIG. 3 is a block diagram of a SqueezeNet backbone network in accordance with an embodiment of the present invention;
FIG. 4 is a block diagram of a fire layer structure in one embodiment of the invention;
fig. 5 is a structural diagram of a feature pyramid network structure in an embodiment of the present invention.
Detailed Description
Example one
The core module of the low-cost real-time bone identification method and device provided by the invention is shown in fig. 1, and comprises an image acquisition module, a core calculation unit, a lightweight neural network algorithm module, a neural network acceleration engine and a bone key point output module. The image acquisition module adopts any monocular camera, and the core computing unit adopts a mobile end CPU. The core design of the invention is a lightweight neural network algorithm module and a neural network acceleration module, and the two modules are adopted to ensure the real-time performance of the system on low-cost hardware.
Wherein, the lightweight neural network algorithm module:
the lightweight neural network algorithm module adopts an improved SqueezeNet as a basic backbone network and combines a Feature Pyramid Network (FPN) to extract multi-scale features so as to improve the precision and speed. The overall structure of the lightweight network is shown in fig. 2.
The input image is first computed in a modified squeezet backbone network consisting of two convolutional layers, eight Fire layers and three pooling layers, as shown in fig. 3. Wherein the convolutional layer conv1 layers pass through 96 sets of convolution kernels of 7x7 (step size of 2), and convolutional layer conv10 passes through 1000 sets of convolution kernels of 1x1 (step size of 1); the sizes of the pooling layers maxpool1, maxpool2 and maxpool3 are all 3x3, and the step length is 2; the fire layer structure is unified, and the structure is shown in the figure four, wherein the Squeeze part is composed of continuous 1x1 convolution, and the expanded part is composed of continuous 1x1 convolution and 3x3 convolution connection. If the number of channels of the Squeeze part is recorded as S1x1And the number of convolutions of 1x1 and the number of convolutions of 3x3 of Expand part are denoted as e, respectively1x1And e3x3Then, the relationship of the above three must satisfy the following formula:
S1×1<e1×1+e3×3
in addition, the improved SqueezeNet backbone network adds a network shortcut (short-cut) among the fire3, fire5, fire7 and fire9 layers to improve robustness during training and prevent overfitting. Finally, the backbone network outputs a series of convolution characteristic graphs to the characteristic pyramid network for the next calculation.
The feature pyramid network structure is shown in fig. five, and four scales of fire2, fire6, fire5 and conv10 in the backbone network squeezet are extracted in the method to construct the feature pyramid network. And after the characteristic pyramid network processing, respectively outputting a predicted result on each scale. The predicted result contains two parts: heat map (heatmap) and vector field (paf). And synthesizing the prediction results of heatmap and paf in multiple scales, and finally obtaining the output result of the skeleton key point by the model. If n-scale predictors are in the end, the final predictors F of heatmap and pafheatmapAnd FpafRespectively expressed by the following formulas:
Figure BDA0002964092140000041
Figure BDA0002964092140000042
where vh _ i and vp _ i are the heatmap and paf values output for each scale, respectively.
Wherein, the neural network acceleration module:
in the identification method, a neural network acceleration module is adopted to accelerate the network. The module is divided into two parts: quantization and use of an acceleration engine. The quantification means that data in the FP32 format is adopted in the model training process, and the data in the FP32 format is converted into the INT8 format in the reasoning process, so that the reasoning speed of the model is greatly increased on the premise of basically ensuring the model accuracy, and better real-time performance is obtained. The adoption of the acceleration engine means that the model reasoning process of the method is accelerated based on an OpenCL framework, so that the real-time performance of the bone key point identification process is further improved.
Example two
The embodiment provides a low-cost real-time bone key point identification device, which comprises an image acquisition module, a core calculation unit, a lightweight neural network algorithm module, a neural network acceleration engine and a bone key point output module, wherein the image acquisition module acquires an image and sends acquired image information to the core calculation unit; the core computing unit is used for processing the acquired image, the lightweight neural network algorithm module adopts an improved SqueezeNet as a basic backbone network, multi-scale feature extraction is carried out on the image by combining a feature pyramid network, the neural network acceleration module is used for accelerating the network, finally, the skeleton key point output module is used for outputting the skeleton key points, the image acquisition module adopts any monocular camera, and the core computing unit adopts a mobile terminal CPU.
Still further, the neural network acceleration module further comprises: the input image firstly enters a modified Squeezenet backbone network for calculation, wherein the Squeezenet backbone network consists of two convolution layers, eight Fire layers and three pooling layers, wherein the convolution layers conv1 pass through 96 groups of convolution kernels of 7x7, and the convolution layers conv10 pass through 1000 groups of convolution kernels of 1x 1; the sizes of the pooling layers maxpool1, maxpool2 and maxpool3 are all 3x 3; the fire layer structure is unified, wherein the Squeeze part is composed of continuous 1x1 convolutions, and the Expand part is composed of continuous 1x1 convolutions and 3x3 convolution links.
Further, if the number of channels of the Squeeze portion is denoted as S1x1, and the number of convolutions of the expanded portion 1x1 and the number of convolutions of the expanded portion 3x3 are denoted as e1x1 and e3x3, respectively, the relationship between the above three satisfies the following equation: s1 × 1< e1 × 1+ e3 × 3.
Furthermore, the neural network acceleration module adds a network shortcut (short-cut) among the fire3, the fire5, the fire7 and the fire9 layers to improve the robustness during training and prevent overfitting, and the improved Squeezenet backbone network outputs a series of convolution characteristic graphs to a characteristic pyramid network for further calculation.
Furthermore, four scales of fire2, fire6, fire5 and conv10 in the backbone network squeezet are extracted to construct a feature pyramid network, and after the feature pyramid network is processed, a prediction result is output on each scale respectively.
Still further, the feature pyramid network further comprises: the predicted result contains two parts: the heat map (heatmap) and the vector field (paf) are combined to predict results of heatmap and paf in multiple scales, and finally the model obtains output results of the bone key points. If n-scale predictors are obtained, the final predictors Fheatmap and Fpaf of heatmap and paf are expressed as follows:
Figure BDA0002964092140000061
Figure BDA0002964092140000062
where vh _ i and vp _ i are the heatmap and paf values output for each scale, respectively.
Still further, the neural network acceleration module further comprises: the method comprises the steps of quantizing and accelerating an engine, wherein the quantizing is that data in an FP32 format are converted into an INT8 format in an inference process for data in an FP32 format adopted in a model training process, and the accelerating engine accelerates in the model inference process based on an OpenCL framework, so that the real-time performance of a bone key point identification process is further improved.
The application also discloses a low-cost real-time bone key point identification method, the device is adopted to collect image data, image processing is carried out on the obtained image information, multi-scale feature extraction is carried out on the image by adopting the improved SqueezeNet as a basic backbone network and combining a feature pyramid network, after the multi-scale feature extraction is carried out on the image by the feature pyramid network, a prediction result is respectively output on each scale, and finally the bone key point information of the collected image is output in real time by quantizing and accelerating the data in the reasoning process.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Although the invention has been described above with reference to various embodiments, it should be understood that many changes and modifications may be made without departing from the scope of the invention. It is therefore intended that the foregoing detailed description be regarded as illustrative rather than limiting, and that it be understood that it is the following claims, including all equivalents, that are intended to define the spirit and scope of this invention. The above examples are to be construed as merely illustrative and not limitative of the remainder of the disclosure. After reading the description of the invention, the skilled person can make various changes or modifications to the invention, and these equivalent changes and modifications also fall into the scope of the invention defined by the claims.

Claims (8)

1. A low-cost real-time bone key point identification device is characterized by comprising an image acquisition module, a core calculation unit, a lightweight neural network algorithm module, a neural network acceleration engine and a bone key point output module, wherein the image acquisition module acquires an image and sends acquired image information to the core calculation unit; the core computing unit is used for processing the acquired image, the lightweight neural network algorithm module adopts an improved SqueezeNet as a basic backbone network, multi-scale feature extraction is carried out on the image by combining a feature pyramid network, the neural network acceleration module is used for accelerating the network, finally, the skeleton key point output module is used for outputting the skeleton key points, the image acquisition module adopts any monocular camera, and the core computing unit adopts a mobile terminal CPU.
2. The low-cost real-time bone keypoint identification device of claim 1, wherein said neural network acceleration module further comprises: the input image firstly enters a modified Squeezenet backbone network for calculation, wherein the Squeezenet backbone network consists of two convolution layers, eight Fire layers and three pooling layers, wherein the convolution layers conv1 pass through 96 groups of convolution kernels of 7x7, and the convolution layers conv10 pass through 1000 groups of convolution kernels of 1x 1; the sizes of the pooling layers maxpool1, maxpool2 and maxpool3 are all 3x 3; the fire layer structure is unified, wherein the Squeeze part is composed of continuous 1x1 convolutions, and the Expand part is composed of continuous 1x1 convolutions and 3x3 convolution links.
3. The low-cost real-time bone key point identification device of claim 2, wherein if the number of channels of the Squeeze part is denoted as S1x1 and the number of convolutions of the expanded part 1x1 and the number of convolutions of the expanded part 3x3 are denoted as e1x1 and e3x3, respectively, the relationship of the above three is satisfied by the following formula: s1 × 1< e1 × 1+ e3 × 3.
4. The device of claim 3, wherein the neural network accelerator module further adds a short-cut between each of the fire3, fire5, fire7 and fire9 layers to improve robustness during training and prevent overfitting, and the improved Squeezenet backbone network outputs a series of convolution signatures to the signature pyramid network for further computation.
5. The device as claimed in claim 3, wherein four dimensions of fire2, fire6, fire5 and conv10 in the backbone network squeezet are extracted to construct a feature pyramid network, and after the feature pyramid network processing, the predicted results are output at each dimension respectively.
6. The low-cost, real-time bone keypoint identification device of claim 4, wherein said feature pyramid network further comprises: the predicted result contains two parts: the heat map (heatmap) and the vector field (paf) are combined to predict results of heatmap and paf in multiple scales, and finally the model obtains output results of the bone key points. If n-scale predictors are obtained, the final predictors Fheatmap and Fpaf of heatmap and paf are expressed as follows:
Figure FDA0002964092130000021
Figure FDA0002964092130000022
where vh _ i and vp _ i are the heatmap and paf values output for each scale, respectively.
7. The low-cost real-time bone keypoint identification device of claim 1, wherein said neural network acceleration module further comprises: the method comprises the steps of quantizing and accelerating an engine, wherein the quantizing is that data in an FP32 format are converted into an INT8 format in an inference process for data in an FP32 format adopted in a model training process, and the accelerating engine accelerates in the model inference process based on an OpenCL framework, so that the real-time performance of a bone key point identification process is further improved.
8. A low-cost real-time bone key point identification method, which utilizes the device as claimed in claims 1-7, characterized in that image data is collected, the obtained image information is processed, the image is extracted by using improved SqueezeNet as basic backbone network and combining with characteristic pyramid network, after processing by characteristic pyramid network, the predicted result is output on each scale, and finally the bone key point information of the collected image is output in real time by quantizing and accelerating the data in the reasoning process.
CN202110246577.0A 2021-03-05 2021-03-05 Low-cost real-time bone key point identification method and device Pending CN112949506A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110246577.0A CN112949506A (en) 2021-03-05 2021-03-05 Low-cost real-time bone key point identification method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110246577.0A CN112949506A (en) 2021-03-05 2021-03-05 Low-cost real-time bone key point identification method and device

Publications (1)

Publication Number Publication Date
CN112949506A true CN112949506A (en) 2021-06-11

Family

ID=76247938

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110246577.0A Pending CN112949506A (en) 2021-03-05 2021-03-05 Low-cost real-time bone key point identification method and device

Country Status (1)

Country Link
CN (1) CN112949506A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115083016A (en) * 2022-06-09 2022-09-20 广州紫为云科技有限公司 Monocular camera-based small-target-oriented hand space interaction method and device
CN115171149A (en) * 2022-06-09 2022-10-11 广州紫为云科技有限公司 Monocular RGB image regression-based real-time human body 2D/3D bone key point identification method
CN115955603A (en) * 2022-12-06 2023-04-11 广州紫为云科技有限公司 Intelligent camera device based on somatosensory interaction of intelligent screen and implementation method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115083016A (en) * 2022-06-09 2022-09-20 广州紫为云科技有限公司 Monocular camera-based small-target-oriented hand space interaction method and device
CN115171149A (en) * 2022-06-09 2022-10-11 广州紫为云科技有限公司 Monocular RGB image regression-based real-time human body 2D/3D bone key point identification method
CN115171149B (en) * 2022-06-09 2023-12-05 广州紫为云科技有限公司 Real-time human body 2D/3D skeleton key point identification method based on monocular RGB image regression
CN115955603A (en) * 2022-12-06 2023-04-11 广州紫为云科技有限公司 Intelligent camera device based on somatosensory interaction of intelligent screen and implementation method
CN115955603B (en) * 2022-12-06 2024-05-03 广州紫为云科技有限公司 Intelligent camera device based on intelligent screen somatosensory interaction and implementation method

Similar Documents

Publication Publication Date Title
CN110473141B (en) Image processing method, device, storage medium and electronic equipment
CN112949506A (en) Low-cost real-time bone key point identification method and device
CN106203376B (en) Face key point positioning method and device
CN110188768B (en) Real-time image semantic segmentation method and system
CN111160375B (en) Three-dimensional key point prediction and deep learning model training method, device and equipment
JP7038867B2 (en) Face recognition method and equipment
CN108875487B (en) Training of pedestrian re-recognition network and pedestrian re-recognition based on training
CN110222718B (en) Image processing method and device
Santhalingam et al. Sign language recognition analysis using multimodal data
CN115171149B (en) Real-time human body 2D/3D skeleton key point identification method based on monocular RGB image regression
CN113869282B (en) Face recognition method, hyper-resolution model training method and related equipment
CN113449573A (en) Dynamic gesture recognition method and device
CN114519868A (en) Real-time bone key point identification method and system based on coordinate system regression
CN113569672A (en) Lightweight target detection and fault identification method, device and system
CN113435520A (en) Neural network training method, device, equipment and computer readable storage medium
CN114529982A (en) Lightweight human body posture estimation method and system based on stream attention
CN111950700A (en) Neural network optimization method and related equipment
CN110929558A (en) Pedestrian re-identification method based on deep learning
CN111797705A (en) Action recognition method based on character relation modeling
CN109635706B (en) Gesture recognition method, device, storage medium and device based on neural network
CN112883964B (en) Method for detecting characters in natural scene
CN113627163A (en) Attention model, feature extraction method and related device
CN110807380B (en) Human body key point detection method and device
CN111652349A (en) Neural network processing method and related equipment
WO2022001364A1 (en) Method for extracting data features, and related apparatus

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination