WO2020207038A1 - People counting method, apparatus, and device based on facial recognition, and storage medium - Google Patents

People counting method, apparatus, and device based on facial recognition, and storage medium Download PDF

Info

Publication number
WO2020207038A1
WO2020207038A1 PCT/CN2019/122079 CN2019122079W WO2020207038A1 WO 2020207038 A1 WO2020207038 A1 WO 2020207038A1 CN 2019122079 W CN2019122079 W CN 2019122079W WO 2020207038 A1 WO2020207038 A1 WO 2020207038A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
portrait
result
segmented
counting
Prior art date
Application number
PCT/CN2019/122079
Other languages
French (fr)
Chinese (zh)
Inventor
王金燕
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2020207038A1 publication Critical patent/WO2020207038A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/40Image enhancement or restoration using histogram techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30242Counting objects in image

Definitions

  • This application relates to the field of artificial intelligence technology, and in particular to a method, device, device, and storage medium for counting people based on face recognition.
  • This application provides a method, device, equipment and storage medium for counting people based on face recognition, aiming to improve the efficiency and accuracy of counting people.
  • this application provides a method for counting people based on face recognition, and the method includes:
  • the step of extracting image LBP features of a plurality of the segmented images includes:
  • the step of inputting the LBP features of the image into a pre-trained portrait recognition model, and then recognizing the LBP features of the image by the portrait recognition model, before the step of outputting the recognition result further includes:
  • the sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  • the segmented image includes a first segmented image and a second segmented image
  • the step of segmenting the video image into a picture to obtain multiple segmented images includes:
  • the statistical recognition result is the number of segmented images of the portrait, and after the step of obtaining the first population statistical result, the method further includes:
  • the method before the step of reporting the first population count result to the server according to the preset result reporting interface, the method further includes:
  • the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
  • the portrait recognition model records the portrait coordinates of the portrait in the segmented image whose recognition result is a portrait
  • the step of judging whether an abnormal portrait is included in the first population count result according to the crest counting method includes:
  • the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait
  • the portrait corresponding to the image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  • an embodiment of the present application further provides a device for counting people based on face recognition, and the device for counting people based on face recognition includes:
  • the obtaining module is used to obtain video images from the video stream when receiving the people counting instruction;
  • An extraction module configured to perform picture segmentation on the video image to obtain multiple segmented images, and extract image LBP features of the multiple segmented images
  • a recognition module configured to input LBP features of the image into a pre-trained portrait recognition model, the portrait recognition model recognizes the LBP features of the image, and outputs a recognition result;
  • a statistics module configured to count the number of the segmented images whose recognition result is a portrait, to obtain a first population statistics result
  • the extraction module is also used for:
  • an embodiment of the present application further provides a device for counting people based on face recognition.
  • the device for counting people based on face recognition includes a processor, a memory, and a face recognition-based device stored in the memory.
  • People counting computer readable instructions when the computer readable instructions for counting people based on face recognition are executed by the processor, implement the steps of the method for counting people based on face recognition as described above.
  • an embodiment of the present application further provides a computer storage medium, the computer storage medium stores a computer readable instruction for counting people based on face recognition, and the computer readable instruction for counting people based on face recognition
  • the steps of the method for counting people based on face recognition as described above are realized when the processor is running.
  • the present application proposes a method, device, device, and storage medium for counting people based on face recognition.
  • the method includes: obtaining a video image from a video stream when receiving a people counting instruction; The video image is segmented to obtain multiple segmented images, and the image LBP features of the multiple segmented images are extracted; the image LBP features are input to a pre-trained portrait recognition model, and the portrait recognition model The LBP feature of the image is recognized, and the recognition result is output; the recognition result is the number of segmented images of the portrait, and the first people count result is obtained.
  • This application is based on artificial intelligence and uses image processing technology to count the number of people in the video, thereby greatly improving the efficiency and accuracy of people counting.
  • FIG. 1 is a schematic diagram of the hardware structure of a people counting device based on face recognition according to various embodiments of the present application;
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for counting people based on face recognition in this application;
  • FIG. 3 is a schematic flowchart of a second embodiment of a method for counting people based on face recognition in this application;
  • Fig. 4 is a schematic diagram of functional modules of a first embodiment of a device for counting people based on face recognition according to the present application.
  • the person counting device based on face recognition mainly involved in the embodiments of the present application refers to a network connection device that can realize network connection.
  • the person counting device based on face recognition may be a server, a cloud platform, and the like.
  • FIG. 1 is a schematic diagram of the hardware structure of a people counting device based on face recognition according to various embodiments of the present application.
  • the device for counting people based on face recognition may include a processor 1001 (for example, a central processor Processing Unit, CPU), communication bus 1002, input port 1003, output port 1004, memory 1005.
  • the communication bus 1002 is used to realize the connection and communication between these components; the input port 1003 is used for data input; the output port 1004 is used for data output.
  • the memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory). memory), such as a disk memory.
  • the memory 1005 may optionally be a storage device independent of the aforementioned processor 1001.
  • the hardware structure shown in FIG. 1 does not constitute a limitation to the present application, and may include more or less components than those shown in the figure, or combine certain components, or different component arrangements.
  • the memory 1005 as a readable storage medium in FIG. 1 may include an operating system, a network communication module, an application program module, and computer readable instructions for counting people based on face recognition.
  • the network communication module is mainly used to connect to the server and perform data communication with the server; and the processor 1001 can call the computer-readable instructions for counting people based on face recognition stored in the memory 1005, and execute the instructions provided in the embodiments of the present application. A method of counting people based on face recognition.
  • the embodiment of the present application provides a method for counting people based on face recognition.
  • FIG. 2 is a schematic flowchart of a first embodiment of a method for counting people based on face recognition in this application.
  • the method for counting people based on face recognition is applied to a device for counting people based on face recognition, and the method includes:
  • Step S101 when receiving a number counting instruction, obtain a video image from a video stream
  • a camera is installed in the people counting area where the people counting is needed in advance, and the people counting area is captured by the camera to obtain and save the video stream in real time. For example, install a camera at a certain location in the conference room to capture information such as scenes and people in the conference room, and save the current video stream of the conference room.
  • the video image is obtained from the video stream.
  • the people counting instruction includes a time point, and the time point may be the current time, the historical time, and the scheduled future time.
  • the video stream has a time stamp, and the video image corresponding to the time point is obtained according to the time stamp.
  • Step S102 Perform picture segmentation on the video image to obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
  • the video image needs to be segmented twice to obtain the first segmented image and the second segmented image.
  • the step of performing picture segmentation on the video image to obtain multiple segmented images includes:
  • Step S102-1a compressing the video image into a compressed video image of 512 ⁇ 512 pixels
  • the video image is compressed to obtain the compressed video image of 512 ⁇ 512 pixels. Understandably, in other embodiments, the video image may be compressed according to other pixels.
  • step S102-1b the compressed video image is segmented into pictures by 64 ⁇ 64 pixels to obtain multiple first segmented images
  • Performing a first segmentation on the compressed video image segmenting the compressed video image by 64 ⁇ 64 pixels to obtain multiple first segmented images.
  • Step S102-1c Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
  • the overlapping area of adjacent segmented images in the first segmented image is segmented twice with 64 pixels as the starting point to obtain the second segmented image.
  • the segmented image includes the first segmented image and the second segmented image.
  • LBP Local Binary Patterns (local binary mode) is an operator used to describe the characteristics of the local texture of an image, which has the characteristic of gray invariance.
  • the step of extracting image LBP features of a plurality of the segmented images includes:
  • Step S102-2a Divide the segmented image into multiple regions
  • the segmented image is divided into a plurality of regions of a preset size, for example, into a plurality of regions of 16 ⁇ 16.
  • Step S102-2b comparing the central gray value of each pixel in each region with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
  • the position of the adjacent pixel is marked as 1; if the adjacent gray value is less than or equal to the central gray value , The position of the adjacent pixel is marked as 0; in this way, comparing with 8 points in the neighborhood of 3*3 can generate an 8-bit binary number (usually convertible to a decimal number, that is, the LBP feature, the LBP The value is an integer between 1 and 256), thereby obtaining the LBP feature of the pixel.
  • Step S102-2c Obtain a histogram of each area based on the LBP feature of the pixel
  • Step S102-2d Perform normalization processing on the histogram of each area to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
  • the normalized image can better reflect the texture of each typical area, while at the same time diminishing the smoothness.
  • the histogram of each region is normalized to obtain a statistical histogram, and the image LBP feature of the segmented image is obtained based on the statistical histogram.
  • the binary system is rotated.
  • the initial LBP feature obtained at the beginning is 10010000, then after the initial feature is rotated clockwise, it can be converted to the minimum value of 00001001.
  • Value form so that the decimal value of the minimum form is the smallest, that is, the LBP is the smallest. No matter how the segmented image is rotated, the LBP is the smallest, which can ensure that the LBP has rotation invariance.
  • Step S103 Input the LBP feature of the image into a pre-trained portrait recognition model, and the LBP feature of the image is recognized by the portrait recognition model, and the recognition result is output;
  • Step S103a Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
  • the sample image includes a portrait sample image and a non-portrait sample image
  • the portrait sample image includes a human face sample image and a human upper body sample image.
  • 100,000 pieces of the face sample images are collected, 50,000 pieces of the upper body sample images of the person are collected, and labels are set for the 100,000 pieces of the face sample images and 50,000 pieces of the upper body sample images of the person As a portrait.
  • the upper body image of the person when the face in the video image is occluded, the number of people can be counted according to the characteristics of the upper body image, which can prevent inaccurate statistical results and lack of people. .
  • using non-personal images as training samples can make the trained person-recognition model recognize non-personal images and make the statistical results more accurate.
  • Step S103b compress the sample image into 128 ⁇ 128 pixels, and then perform grayscale processing and random incomplete processing to obtain a processed sample image;
  • the sample image is first compressed into 128 ⁇ 128 pixels to obtain a compressed sample image. Then, the compressed sample image is gray-scale processed by one of image inversion and logarithmic transformation to obtain a gray-scale sample image. Then, the grayscale sample image is subjected to random incomplete processing using an image repair method to obtain the processed sample image.
  • Step S103c extract the sample LBP feature of the processed sample image, and obtain the sample LBP feature
  • the processed sample image is divided into a plurality of sample regions, and the sample center gray value of each sample pixel in each sample region is divided into the gray value of the 8 sample adjacent pixels adjacent to the sample pixel.
  • the degree value is compared to obtain the sample LBP feature of the sample pixel; the sample histogram of each sample area is obtained based on the LBP feature of the sample pixel; the sample histogram of each sample area is normalized
  • a statistical sample histogram is obtained by transformation processing, and a sample image LBP feature of the sample image is obtained based on the sample statistical histogram.
  • Step S103d input the sample LBP features into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  • the TensorFlow is an open source machine learning framework, and TensorFlow is widely used in programming implementation of various machine learning algorithms. Using TensorFlow can help developers build models in extreme codes and make the products they need based on the models.
  • the sample LBP features are input into a neural network created based on TensorFlow for training. After repeated training for millions of times, the sample LBP features can be accurately classified according to the labels of the corresponding sample images, thus
  • the portrait recognition model is obtained, and the recognition result output by the portrait recognition model is a portrait or a non-portrait, that is, the recognition result of a sample image labeled as a portrait in the sample image is output as a portrait, and the label in the sample image is The recognition result of the non-personal sample image is output as a non-personal image.
  • Step S104 Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result.
  • the recognition result output by the portrait recognition model the number of the segmented images whose recognition result is the portrait is counted, and the number of the segmented images is taken as the first person counting result.
  • This embodiment uses the above solution to obtain a video image from a video stream when receiving a people counting instruction; segment the video image to obtain multiple segmented images, and extract the number of segmented images.
  • Image LBP features input the image LBP features into a pre-trained portrait recognition model, the portrait recognition model recognizes the image LBP features, and outputs the recognition result; the statistical recognition result is the number of segmented images of the portrait, Obtain the first number of people counting results. Therefore, based on artificial intelligence, image processing technology is used to count the number of people in the video, which greatly improves the efficiency and accuracy of people counting.
  • the second embodiment of the present application proposes a method for counting people based on face recognition.
  • the statistical recognition result is the number of segmented images of portraits.
  • the step of obtaining the first number of people counting result it also includes:
  • Step S106 According to a preset result reporting interface, the first number of people counting results are reported to the server.
  • a reporting interface is preset, and the reporting interface is used for network communication with the server. Understandably, the reporting interface may also report camera information, area information, time information, etc., corresponding to the first people counting result to the server.
  • the step S106 before the step of reporting the first population count result to the server according to the preset result reporting interface, the method further includes:
  • the video images in the video stream change in real time, the video images obtained from the video stream may not be stable enough, and may cause the first portrait statistics result due to people walking, posture changes, etc. Not accurate enough.
  • the video image is extracted from the video stream according to a preset duration, and the preset duration may be 100ms, 200ms, etc.
  • the video image is segmented to obtain multiple segmented images, and the image LBP features of the multiple segmented images are extracted; the image LBP features are input to a pre-trained portrait recognition model, and the The portrait recognition model recognizes the LBP features of the image, and outputs the portrait coordinates of the portrait in the segmented image.
  • the preset time may be 1 minute, and the number of times The threshold may be 4 times, 10 times, etc. For example, if the number of times the portrait coordinate appears in 1 minute is 10 times, it means that the portrait corresponding to the image coordinate is not an abnormal portrait. If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  • the step is performed: according to the preset result reporting interface, the first people counting result is reported to the server; if the first people counting result includes all the If the abnormal portrait is described, after removing the number of the abnormal portrait from the first population statistics result, a second population statistics result is obtained, and the second population statistics result is reported to the server. If there are two abnormal figures, the second statistical result is obtained after subtracting 2 from the first statistical result.
  • This embodiment uses the above solution to obtain a video image from a video stream when receiving a people counting instruction; segment the video image to obtain multiple images, and extract multiple images of the segmented images Image LBP features; input the image LBP features into a pre-trained portrait recognition model, the portrait recognition model recognizes the image LBP features, and outputs the recognition result; the statistical recognition result is the number of segmented images of the portrait, Obtain the first population statistics result; report the first population statistics result to the server according to the preset result reporting interface. Therefore, based on artificial intelligence, image processing technology is used to count the number of people in the video, which greatly improves the efficiency and accuracy of people counting.
  • this embodiment also provides a device for counting people based on face recognition.
  • Fig. 4 is a schematic diagram of the functional modules of the first embodiment of the device for counting people based on face recognition in this application.
  • the device for counting people based on face recognition is a virtual device, which is stored in the memory 1005 of the device for counting people based on face recognition shown in FIG. 1 to realize the computer readable people counting device based on face recognition. All functions of the instruction: when receiving the people counting instruction, obtain the video image from the video stream; use to segment the video image to obtain multiple segmented images, and extract multiple segments The image LBP features of the image; used to input the image LBP features into a pre-trained portrait recognition model, and the portrait recognition model recognizes the image LBP features, and outputs the recognition results; used to count the recognition results for all the portraits State the number of segmented images, and obtain the first person counting result.
  • the device for counting people based on face recognition in this embodiment includes:
  • the obtaining module 10 is configured to obtain a video image from a video stream when a number counting instruction is received;
  • the extraction module 20 is configured to perform picture segmentation on the video image, obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
  • the recognition module 30 is configured to input the LBP features of the image into a pre-trained portrait recognition model, and the portrait recognition model recognizes the LBP features of the image, and outputs a recognition result;
  • the statistics module 40 is configured to count the number of the segmented images whose recognition result is a portrait, and obtain a first population statistics result.
  • identification module is also used for:
  • the sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  • extraction module is also used for:
  • extraction module is also used for:
  • the statistics module is also used for:
  • the statistics module is also used for:
  • the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
  • the statistics module is also used for:
  • the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait
  • the portrait corresponding to the image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  • the present application also provides a computer storage medium that stores a computer readable instruction for counting people based on face recognition.
  • a computer readable instruction for counting people based on face recognition is run by a processor The steps for realizing the method for counting people based on face recognition as described above will not be repeated here.
  • the computer-readable storage medium may be a non-volatile readable storage medium.
  • the present application proposes a method, device, device, and storage medium for counting people based on face recognition.
  • the method includes: obtaining a video image from a video stream when receiving a people counting instruction; The video image is segmented to obtain multiple segmented images, and the image LBP features of the segmented images are extracted; the image LBP features are input into a pre-trained portrait recognition model, and the portrait recognition model The LBP feature of the image is recognized, and the recognition result is output; the recognition result is the number of segmented images of the portrait, and the first people count result is obtained.
  • This application is based on artificial intelligence and uses image processing technology to count the number of people in the video, thereby greatly improving the efficiency and accuracy of people counting.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

A people counting method, apparatus, and device based on facial recognition, and a storage medium. The method comprises: when a people counting instruction is received, obtaining a video image from a video stream (S101); performing image segmentation on the video image to obtain multiple segmented images, and extracting image LBP features of the multiple segmented images (S102); inputting the image LBP features into a pre-trained human image recognition model, performing recognition on the image LBP features by the human image recognition model, and outputting recognition results (S103); and counting the number of segmented images with the recognition result being a human image to obtain a first people counting result (S104). According to the method, people in a video are counted by means of image processing technology on the basis of artificial intelligence, thereby greatly improving the efficiency and accuracy of people counting.

Description

基于人脸识别的人数统计方法、装置、设备及存储介质 People counting method, device, equipment and storage medium based on face recognition To
本申请要求于2019年4月12日提交中国专利局、申请号为201910297454.2、发明名称为“基于人脸识别的人数统计方法、装置、设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在申请中This application requires the priority of a Chinese patent application filed with the Chinese Patent Office on April 12, 2019, the application number is 201910297454.2, and the invention title is "People counting methods, devices, equipment and computer-readable storage media based on face recognition" , The entire contents of which are incorporated in the application by reference
技术领域Technical field
本申请涉及人工智能技术领域,尤其涉及一种基于人脸识别的人数统计方法、装置、设备及存储介质。This application relates to the field of artificial intelligence technology, and in particular to a method, device, device, and storage medium for counting people based on face recognition.
背景技术Background technique
目前当需要对会议室、车站、商场等区域进行人数统计时,一般需要人工清点人数,或通过其它方法间接获得人数统计结果,进而导致人数统计效率低下,并且统计结果不够准确。At present, when it is necessary to count people in conference rooms, stations, shopping malls and other areas, it is generally necessary to count the people manually or obtain the result of the people counting indirectly through other methods, which leads to low efficiency of people counting and inaccurate statistics.
发明内容Summary of the invention
本申请提供一种基于人脸识别的人数统计方法、装置、设备及存储介质,旨在提高人数统计的效率和准确性。This application provides a method, device, equipment and storage medium for counting people based on face recognition, aiming to improve the efficiency and accuracy of counting people.
为实现上述目的,本申请提供一种基于人脸识别的人数统计方法,所述方法包括:In order to achieve the above objective, this application provides a method for counting people based on face recognition, and the method includes:
当接收到人数统计指令时,从视频流中获取视频图像;When receiving the people counting instruction, obtain the video image from the video stream;
将所述视频图像进行图片切分,获得多个切分图像,并提取多个所述切分图像的图像LBP特征;Performing picture segmentation on the video image to obtain multiple segmented images, and extracting image LBP features of the multiple segmented images;
将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;Input the LBP features of the image into a pre-trained portrait recognition model, and the LBP features of the image are recognized by the portrait recognition model, and the recognition result is output;
统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result;
所述提取多个所述切分图像的图像LBP特征的步骤包括:The step of extracting image LBP features of a plurality of the segmented images includes:
将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
优选地,所述将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果的步骤之前还包括:Preferably, the step of inputting the LBP features of the image into a pre-trained portrait recognition model, and then recognizing the LBP features of the image by the portrait recognition model, before the step of outputting the recognition result further includes:
收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
优选地,所述切分图像包括第一切分图像和第二切分图像,所述将所述视频图像进行图片切分,获得多个切分图像的步骤包括:Preferably, the segmented image includes a first segmented image and a second segmented image, and the step of segmenting the video image into a picture to obtain multiple segmented images includes:
将所述视频图像压缩成512×512像素的压缩视频图像;Compressing the video image into a compressed video image of 512×512 pixels;
将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;Segmenting the compressed video image according to 64×64 pixels to obtain multiple first segmented images;
将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
优选地,所述统计识别结果为人像的切分图像的个数,获得第一人数统计结果的步骤之后还包括:Preferably, the statistical recognition result is the number of segmented images of the portrait, and after the step of obtaining the first population statistical result, the method further includes:
根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Report the first population count result to the server according to the preset result report interface.
优选地,所述根据预设的结果上报接口将所述第一人数统计结果上报至服务器的步骤之前还包括:Preferably, before the step of reporting the first population count result to the server according to the preset result reporting interface, the method further includes:
根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;If the first person counting result does not include the abnormal portrait, perform the step: reporting the first person counting result to the server according to a preset result reporting interface;
若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。If the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
优选地,所述人像识别模型记录识别结果为人像的所述切分图像中人像的人像坐标,所述根据波峰计数法判断所述第一人数统计结果中是否包括异常人像的步骤包括:Preferably, the portrait recognition model records the portrait coordinates of the portrait in the segmented image whose recognition result is a portrait, and the step of judging whether an abnormal portrait is included in the first population count result according to the crest counting method includes:
获取所述人像坐标在预设时间内出现的次数;Obtaining the number of times the portrait coordinates appear within a preset time;
若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait;
若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
为实现上述目的,本申请实施例还提供一种基于人脸识别的人数统计装置,所述基于人脸识别的人数统计装置包括:To achieve the foregoing objective, an embodiment of the present application further provides a device for counting people based on face recognition, and the device for counting people based on face recognition includes:
获取模块,用于当接收到人数统计指令时,从视频流中获取视频图像;The obtaining module is used to obtain video images from the video stream when receiving the people counting instruction;
提取模块,用于将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;An extraction module, configured to perform picture segmentation on the video image to obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
识别模块,用于将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;A recognition module, configured to input LBP features of the image into a pre-trained portrait recognition model, the portrait recognition model recognizes the LBP features of the image, and outputs a recognition result;
统计模块,用于统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;A statistics module, configured to count the number of the segmented images whose recognition result is a portrait, to obtain a first population statistics result;
所述提取模块还用于:The extraction module is also used for:
将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
为实现上述目的,本申请实施例还提供一种基于人脸识别的人数统计设备,所述基于人脸识别的人数统计设备包括处理器,存储器以及存储在所述存储器中的基于人脸识别的人数统计计算机可读指令,所述基于人脸识别的人数统计计算机可读指令被所述处理器运行时,实现如上所述的基于人脸识别的人数统计方法的步骤。In order to achieve the foregoing objective, an embodiment of the present application further provides a device for counting people based on face recognition. The device for counting people based on face recognition includes a processor, a memory, and a face recognition-based device stored in the memory. People counting computer readable instructions, when the computer readable instructions for counting people based on face recognition are executed by the processor, implement the steps of the method for counting people based on face recognition as described above.
为实现上述目的,本申请实施例还提供一种计算机存储介质,所述计算机存储介质上存储有基于人脸识别的人数统计计算机可读指令,所述基于人脸识别的人数统计计算机可读指令被处理器运行时实现如上所述基于人脸识别的人数统计方法的步骤。In order to achieve the above objective, an embodiment of the present application further provides a computer storage medium, the computer storage medium stores a computer readable instruction for counting people based on face recognition, and the computer readable instruction for counting people based on face recognition The steps of the method for counting people based on face recognition as described above are realized when the processor is running.
相比现有技术,本申请提出的一种基于人脸识别的人数统计方法、装置、设备及存储介质,该方法包括:当接收到人数统计指令时,从视频流中获取视频图像;将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;统计识别结果为人像的切分图像的个数,获得第一人数统计结果。本申请基于人工智能,利用图像处理技术对视频中的人数进行统计,由此极大提高了人数统计的效率和准确性。Compared with the prior art, the present application proposes a method, device, device, and storage medium for counting people based on face recognition. The method includes: obtaining a video image from a video stream when receiving a people counting instruction; The video image is segmented to obtain multiple segmented images, and the image LBP features of the multiple segmented images are extracted; the image LBP features are input to a pre-trained portrait recognition model, and the portrait recognition model The LBP feature of the image is recognized, and the recognition result is output; the recognition result is the number of segmented images of the portrait, and the first people count result is obtained. This application is based on artificial intelligence and uses image processing technology to count the number of people in the video, thereby greatly improving the efficiency and accuracy of people counting.
附图说明Description of the drawings
图1是本申请各实施例涉及的基于人脸识别的人数统计设备的硬件结构示意图;FIG. 1 is a schematic diagram of the hardware structure of a people counting device based on face recognition according to various embodiments of the present application;
图2是本申请基于人脸识别的人数统计方法第一实施例的流程示意图;2 is a schematic flowchart of a first embodiment of a method for counting people based on face recognition in this application;
图3是本申请基于人脸识别的人数统计方法第二实施例的流程示意图;3 is a schematic flowchart of a second embodiment of a method for counting people based on face recognition in this application;
图4是本申请基于人脸识别的人数统计装置第一实施例的功能模块示意图。Fig. 4 is a schematic diagram of functional modules of a first embodiment of a device for counting people based on face recognition according to the present application.
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics, and advantages of the purpose of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
具体实施方式detailed description
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the application, and are not used to limit the application.
本申请实施例主要涉及的基于人脸识别的人数统计设备是指能够实现网络连接的网络连接设备,所述基于人脸识别的人数统计设备可以是服务器、云平台等。The person counting device based on face recognition mainly involved in the embodiments of the present application refers to a network connection device that can realize network connection. The person counting device based on face recognition may be a server, a cloud platform, and the like.
参照图1,图1是本申请各实施例涉及的基于人脸识别的人数统计设备的硬件结构示意图。本申请实施例中,基于人脸识别的人数统计设备可以包括处理器1001(例如中央处理器Central Processing Unit、CPU),通信总线1002,输入端口1003,输出端口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信;输入端口1003用于数据输入;输出端口1004用于数据输出,存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器,存储器1005可选的还可以是独立于前述处理器1001的存储装置。本领域技术人员可以理解,图1中示出的硬件结构并不构成对本申请的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Referring to FIG. 1, FIG. 1 is a schematic diagram of the hardware structure of a people counting device based on face recognition according to various embodiments of the present application. In the embodiment of the present application, the device for counting people based on face recognition may include a processor 1001 (for example, a central processor Processing Unit, CPU), communication bus 1002, input port 1003, output port 1004, memory 1005. Among them, the communication bus 1002 is used to realize the connection and communication between these components; the input port 1003 is used for data input; the output port 1004 is used for data output. The memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory). memory), such as a disk memory. The memory 1005 may optionally be a storage device independent of the aforementioned processor 1001. Those skilled in the art can understand that the hardware structure shown in FIG. 1 does not constitute a limitation to the present application, and may include more or less components than those shown in the figure, or combine certain components, or different component arrangements.
继续参照图1,图1中作为一种可读存储介质的存储器1005可以包括操作系统、网络通信模块、应用程序模块以及基于人脸识别的人数统计计算机可读指令。在图1中,网络通信模块主要用于连接服务器,与服务器进行数据通信;而处理器1001可以调用存储器1005中存储的基于人脸识别的人数统计计算机可读指令,并执行本申请实施例提供的基于人脸识别的人数统计方法。Continuing to refer to FIG. 1, the memory 1005 as a readable storage medium in FIG. 1 may include an operating system, a network communication module, an application program module, and computer readable instructions for counting people based on face recognition. In FIG. 1, the network communication module is mainly used to connect to the server and perform data communication with the server; and the processor 1001 can call the computer-readable instructions for counting people based on face recognition stored in the memory 1005, and execute the instructions provided in the embodiments of the present application. A method of counting people based on face recognition.
本申请实施例提供了一种基于人脸识别的人数统计方法。The embodiment of the present application provides a method for counting people based on face recognition.
参照图2,图2是本申请基于人脸识别的人数统计方法第一实施例的流程示意图。Referring to FIG. 2, FIG. 2 is a schematic flowchart of a first embodiment of a method for counting people based on face recognition in this application.
本实施例中,所述基于人脸识别的人数统计方法应用于基于人脸识别的人数统计设备,所述方法包括:In this embodiment, the method for counting people based on face recognition is applied to a device for counting people based on face recognition, and the method includes:
步骤S101,当接收到人数统计指令时,从视频流中获取视频图像;Step S101, when receiving a number counting instruction, obtain a video image from a video stream;
本实施例中,预先在需要进行人数统计的人数统计区域安装摄像头,通过所述摄像头对所述人数统计区域进行摄像,获得实时获取并保存所述视频流。例如在会议室内某个位置安装一个摄像头,拍摄会议室内的情景、人员等信息,并保存当前会议室视频流。In this embodiment, a camera is installed in the people counting area where the people counting is needed in advance, and the people counting area is captured by the camera to obtain and save the video stream in real time. For example, install a camera at a certain location in the conference room to capture information such as scenes and people in the conference room, and save the current video stream of the conference room.
当接收到用户通过语音或触控操作发生的人数统计指令时,则从视频流中获取视频图像。可以理解地,所述人数统计指令包括时间点,所述时间点可以是当前时间、历史时间以及预约的将来时间。一般的,所述视频流具有时间戳,根据所述时间戳获取与所述时间点对应的视频图像。When receiving the user's voice or touch operation command for counting the number of people, the video image is obtained from the video stream. Understandably, the people counting instruction includes a time point, and the time point may be the current time, the historical time, and the scheduled future time. Generally, the video stream has a time stamp, and the video image corresponding to the time point is obtained according to the time stamp.
步骤S102,将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;Step S102: Perform picture segmentation on the video image to obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
本实施例中,需要对所述视频图像进行二次切分,获得第一切分图像和第二切分图像。具体地,所述将所述视频图像进行图片切分,获得多个切分图像的步骤包括:In this embodiment, the video image needs to be segmented twice to obtain the first segmented image and the second segmented image. Specifically, the step of performing picture segmentation on the video image to obtain multiple segmented images includes:
步骤S102-1a,将所述视频图像压缩成512×512像素的压缩视频图像;Step S102-1a, compressing the video image into a compressed video image of 512×512 pixels;
对所述视频图像进行压缩,获得512×512像素的所述压缩视频图像。可以理解地,在其它实施例中,可以将所述视频图像按其它像素进行压缩。The video image is compressed to obtain the compressed video image of 512×512 pixels. Understandably, in other embodiments, the video image may be compressed according to other pixels.
步骤S102-1b,将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;In step S102-1b, the compressed video image is segmented into pictures by 64×64 pixels to obtain multiple first segmented images;
对所述压缩视频图像进行第一次切分:将所述压缩视频图像按64×64像素进行切分,获得多个第一切分图像。Performing a first segmentation on the compressed video image: segmenting the compressed video image by 64×64 pixels to obtain multiple first segmented images.
步骤S102-1c,将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Step S102-1c: Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
不可避免地,图像切分后会有重叠区域,故再将第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Inevitably, there will be overlapping areas after image segmentation. Therefore, the overlapping area of adjacent segmented images in the first segmented image is segmented twice with 64 pixels as the starting point to obtain the second segmented image.
本实施例中,所述切分图像包括所述第一切分图像和所述第二切分图像。In this embodiment, the segmented image includes the first segmented image and the second segmented image.
本实施例中,LBP(Local Binary Patterns,局部二值模式)是一种用于描述图像局部纹理的特征的算子,具有灰度不变性的特点。In this embodiment, LBP (Local Binary Patterns (local binary mode) is an operator used to describe the characteristics of the local texture of an image, which has the characteristic of gray invariance.
进一步地,所述提取多个所述切分图像的图像LBP特征的步骤包括:Further, the step of extracting image LBP features of a plurality of the segmented images includes:
步骤S102-2a:将所述切分图像划分为多个区域;Step S102-2a: Divide the segmented image into multiple regions;
将所述切分图像划分为预设大小的多个区域,例如划分为16×16的多个区域。The segmented image is divided into a plurality of regions of a preset size, for example, into a plurality of regions of 16×16.
步骤S102-2b:将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Step S102-2b: comparing the central gray value of each pixel in each region with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
具体地,若所述相邻灰度值大于所述中心灰度值,则将所述相邻像素点的位置标记为1;若所述相邻灰度值小于或等于所述中心灰度值,则将所述相邻像素点的位置标记为0;这样与3*3的邻域内的8个点进行比较可产生8位二进制数(通常可转换为十进制数,即LBP特征,所述LBP值的为1-256之间的整数),由此获得所述像素点的LBP特征。Specifically, if the adjacent gray value is greater than the central gray value, the position of the adjacent pixel is marked as 1; if the adjacent gray value is less than or equal to the central gray value , The position of the adjacent pixel is marked as 0; in this way, comparing with 8 points in the neighborhood of 3*3 can generate an 8-bit binary number (usually convertible to a decimal number, that is, the LBP feature, the LBP The value is an integer between 1 and 256), thereby obtaining the LBP feature of the pixel.
步骤S102-2c:基于所述像素点的LBP特征,获得每个区域的直方图;Step S102-2c: Obtain a histogram of each area based on the LBP feature of the pixel;
获得所述区域内各个像素点的LBP特征后,将各个所述像素点的LBP特征进行统计即可获得所述每个区域的直方图。After obtaining the LBP feature of each pixel in the area, statistics of the LBP feature of each pixel can obtain the histogram of each area.
步骤S102-2d:对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Step S102-2d: Perform normalization processing on the histogram of each area to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
一般地,使用LBP表达图像纹理时,只关注Uniform模式,而将其他的所有模式归至同一类中,由此,归一化后的图像更能体现各个典型区域的纹理,同时又淡化了平滑区域的特征。本实施例中,对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Generally, when using LBP to express image texture, only focus on the Uniform mode, and group all other modes into the same category. As a result, the normalized image can better reflect the texture of each typical area, while at the same time diminishing the smoothness. Regional characteristics. In this embodiment, the histogram of each region is normalized to obtain a statistical histogram, and the image LBP feature of the segmented image is obtained based on the statistical histogram.
进一地,为了让所述LBP特征具有旋转不变性,将二进制进行旋转,例如一开始得到的初始LBP特征为10010000,那么将所述初始特征按顺时针进行旋转后,可以转换为00001001的最小值形式,这样所述最小值形式的十进制值最小,也即LBP最小。无论所述切分图像会如何旋转,所述LBP都最小,由此可以保证LBP具有旋转不变性。Further, in order to make the LBP feature rotation-invariant, the binary system is rotated. For example, the initial LBP feature obtained at the beginning is 10010000, then after the initial feature is rotated clockwise, it can be converted to the minimum value of 00001001. Value form, so that the decimal value of the minimum form is the smallest, that is, the LBP is the smallest. No matter how the segmented image is rotated, the LBP is the smallest, which can ensure that the LBP has rotation invariance.
步骤S103,将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;Step S103: Input the LBP feature of the image into a pre-trained portrait recognition model, and the LBP feature of the image is recognized by the portrait recognition model, and the recognition result is output;
本实施例中,所述步骤S103:将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果的步骤之前还包括:In this embodiment, the step S103: input the LBP features of the image into a pre-trained portrait recognition model, the LBP features of the image are recognized by the portrait recognition model, and the step of outputting the recognition result also includes:
步骤S103a,收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Step S103a: Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
所述样本图像包括人像样本图像和非人像样本图像,并且所述人像样本图像包括人脸样本图像和人上半身样本图像。The sample image includes a portrait sample image and a non-portrait sample image, and the portrait sample image includes a human face sample image and a human upper body sample image.
本实施例中,收集10万张所述人脸样本图像,收集5万张所述人上半身样本图像,将10万张所述人脸样本图像和5万张所述人上半身样本图像的标签设置为人像。收集1万张所述非人像图像,并将1万张所述非人像图像的标签设置为非人像。In this embodiment, 100,000 pieces of the face sample images are collected, 50,000 pieces of the upper body sample images of the person are collected, and labels are set for the 100,000 pieces of the face sample images and 50,000 pieces of the upper body sample images of the person As a portrait. Collect 10,000 non-portrait images, and set the label of the 10,000 non-portrait images as non-portrait.
可以理解地,将所述人上半身图像作为训练样本,可以在所述视频图像中人脸被遮挡的时候,根据人上半身图像的特征进行人数统计,可以防止统计结果不准确,缺少人数事情的发生。并且,将非人像图像也做为训练样本,则可以使训练后的人像识别模型识别出非人像,使统计结果更加准确。Understandably, using the upper body image of the person as a training sample, when the face in the video image is occluded, the number of people can be counted according to the characteristics of the upper body image, which can prevent inaccurate statistical results and lack of people. . Moreover, using non-personal images as training samples can make the trained person-recognition model recognize non-personal images and make the statistical results more accurate.
步骤S103b,将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;Step S103b, compress the sample image into 128×128 pixels, and then perform grayscale processing and random incomplete processing to obtain a processed sample image;
本实施例中,首先将所述样本图像压缩成128×128像素,获得压缩样本图像。然后将所述压缩样本图像通过图像反转、对数变换中的一种方法进行灰度处理,获得灰度样本图像。再将所述灰度样本图像利用图像修复方法进行随机残缺处理,获得处理后的所述样本图像。In this embodiment, the sample image is first compressed into 128×128 pixels to obtain a compressed sample image. Then, the compressed sample image is gray-scale processed by one of image inversion and logarithmic transformation to obtain a gray-scale sample image. Then, the grayscale sample image is subjected to random incomplete processing using an image repair method to obtain the processed sample image.
步骤S103c,提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Step S103c, extract the sample LBP feature of the processed sample image, and obtain the sample LBP feature;
将所述处理后样本图像划分成多个样本区域,将每个样本区域中的每一个样本像素点的样本中心灰度值与所述样本像素点相邻的8个样本相邻像素点的灰度值进行比较,获得所述样本像素点的样本LBP特征;基于所述样本像素点的LBP特征,获得每个样本区域的样本直方图;对所述每个样本区域的样本直方图进行归一化处理获得统计样本直方图,基于所述样本统计直方图获得所述样本图像的样本图像LBP特征。The processed sample image is divided into a plurality of sample regions, and the sample center gray value of each sample pixel in each sample region is divided into the gray value of the 8 sample adjacent pixels adjacent to the sample pixel. The degree value is compared to obtain the sample LBP feature of the sample pixel; the sample histogram of each sample area is obtained based on the LBP feature of the sample pixel; the sample histogram of each sample area is normalized A statistical sample histogram is obtained by transformation processing, and a sample image LBP feature of the sample image is obtained based on the sample statistical histogram.
步骤S103d,将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。Step S103d, input the sample LBP features into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
所述TensorFlow是一种开放源代码机器学习框架,TensorFlow被广泛应用于各类机器学习算法的编程实现。利用TensorFlow可以帮助开发者在极端的代码中构建模型,并应该模型做出所需要的产品。The TensorFlow is an open source machine learning framework, and TensorFlow is widely used in programming implementation of various machine learning algorithms. Using TensorFlow can help developers build models in extreme codes and make the products they need based on the models.
本实施例中,将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,当反复训练上百万次后,所述样本LBP特征可以根据其对应样本图像的标签进行准确分类,由此获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像,也即将所述样本图像中标签为人像的样本图像的识别结果输出为人像,将所述样本图像中标签为非人像的样本图像的识别结果输出为非人像。In this embodiment, the sample LBP features are input into a neural network created based on TensorFlow for training. After repeated training for millions of times, the sample LBP features can be accurately classified according to the labels of the corresponding sample images, thus The portrait recognition model is obtained, and the recognition result output by the portrait recognition model is a portrait or a non-portrait, that is, the recognition result of a sample image labeled as a portrait in the sample image is output as a portrait, and the label in the sample image is The recognition result of the non-personal sample image is output as a non-personal image.
步骤S104,统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果。Step S104: Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result.
根据所述人像识别模型输出的识别结果,统计识别结果为人像的所述切分图像的个数,将所述切分图像的个数作为第一人数统计结果。According to the recognition result output by the portrait recognition model, the number of the segmented images whose recognition result is the portrait is counted, and the number of the segmented images is taken as the first person counting result.
本实施例通过上述方案,当接收到人数统计指令时,从视频流中获取视频图像;将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;统计识别结果为人像的切分图像的个数,获得第一人数统计结果。由此,基于人工智能,利用图像处理技术对视频中的人数进行统计,极大提高了人数统计的效率和准确性。This embodiment uses the above solution to obtain a video image from a video stream when receiving a people counting instruction; segment the video image to obtain multiple segmented images, and extract the number of segmented images. Image LBP features; input the image LBP features into a pre-trained portrait recognition model, the portrait recognition model recognizes the image LBP features, and outputs the recognition result; the statistical recognition result is the number of segmented images of the portrait, Obtain the first number of people counting results. Therefore, based on artificial intelligence, image processing technology is used to count the number of people in the video, which greatly improves the efficiency and accuracy of people counting.
如图3所示,本申请第二实施例提出一种基于人脸识别的人数统计方法,基于上述图2所示的第一实施例,所述统计识别结果为人像的切分图像的个数,获得第一人数统计结果的步骤之后还包括:As shown in FIG. 3, the second embodiment of the present application proposes a method for counting people based on face recognition. Based on the first embodiment shown in FIG. 2, the statistical recognition result is the number of segmented images of portraits. , After the step of obtaining the first number of people counting result, it also includes:
步骤S106:根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Step S106: According to a preset result reporting interface, the first number of people counting results are reported to the server.
具体地,预先设置上报接口,所述上报接口用于与所述服务器网络通讯。可以理解地,上报接口还可以将所述第一人数统计结果对应的摄像头信息、区域信息、时间信息等上报至所述服务器。Specifically, a reporting interface is preset, and the reporting interface is used for network communication with the server. Understandably, the reporting interface may also report camera information, area information, time information, etc., corresponding to the first people counting result to the server.
所述步骤S106:根据预设的结果上报接口将所述第一人数统计结果上报至服务器的步骤之前还包括:The step S106: before the step of reporting the first population count result to the server according to the preset result reporting interface, the method further includes:
根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
可以理解地,由于视频流中的视频图像是实时变化的,故从所述视频流中获取的视频图像可能不够稳定,并且可能由于人员走动、姿势变化等原因导致所述第一人像统计结果不够准确。Understandably, since the video images in the video stream change in real time, the video images obtained from the video stream may not be stable enough, and may cause the first portrait statistics result due to people walking, posture changes, etc. Not accurate enough.
具体地,获取所述人像坐标在预设时间内出现的次数;Specifically, acquiring the number of times the portrait coordinates appear within a preset time;
按预设时长从所述视频流中提取所述视频图像,所述预设时长可以是100ms, 200ms等,将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,并输出所述切分图像中人像的人像坐标。The video image is extracted from the video stream according to a preset duration, and the preset duration may be 100ms, 200ms, etc., the video image is segmented to obtain multiple segmented images, and the image LBP features of the multiple segmented images are extracted; the image LBP features are input to a pre-trained portrait recognition model, and the The portrait recognition model recognizes the LBP features of the image, and outputs the portrait coordinates of the portrait in the segmented image.
若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;本实施例中,所述预设时间可以是1分钟,所述次数阈值可以是4次、10次等,例如若1分钟内所述人像坐标出现的次数是10次,则说明所述图像坐标对应的人像不是异常人像。若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。若存在2个异常人像,则将所述第一统计结果减去2之后则获得所述第二人数统计结果。If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the threshold of times, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait; in this embodiment, the preset time may be 1 minute, and the number of times The threshold may be 4 times, 10 times, etc. For example, if the number of times the portrait coordinate appears in 1 minute is 10 times, it means that the portrait corresponding to the image coordinate is not an abnormal portrait. If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait. If the first people counting result does not include the abnormal portrait, then the step is performed: according to the preset result reporting interface, the first people counting result is reported to the server; if the first people counting result includes all the If the abnormal portrait is described, after removing the number of the abnormal portrait from the first population statistics result, a second population statistics result is obtained, and the second population statistics result is reported to the server. If there are two abnormal figures, the second statistical result is obtained after subtracting 2 from the first statistical result.
本实施例通过上述方案,当接收到人数统计指令时,从视频流中获取视频图像;将所述视频图像进行图片切分,获得切分多个图像,并提取多个所述切分图像的图像LBP特征;将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;统计识别结果为人像的切分图像的个数,获得第一人数统计结果;根据预设的结果上报接口将所述第一人数统计结果上报至服务器。由此,基于人工智能,利用图像处理技术对视频中的人数进行统计,极大提高了人数统计的效率和准确性。This embodiment uses the above solution to obtain a video image from a video stream when receiving a people counting instruction; segment the video image to obtain multiple images, and extract multiple images of the segmented images Image LBP features; input the image LBP features into a pre-trained portrait recognition model, the portrait recognition model recognizes the image LBP features, and outputs the recognition result; the statistical recognition result is the number of segmented images of the portrait, Obtain the first population statistics result; report the first population statistics result to the server according to the preset result reporting interface. Therefore, based on artificial intelligence, image processing technology is used to count the number of people in the video, which greatly improves the efficiency and accuracy of people counting.
此外,本实施例还提供一种基于人脸识别的人数统计装置。参照图4,图4为本申请基于人脸识别的人数统计装置第一实施例的功能模块示意图。In addition, this embodiment also provides a device for counting people based on face recognition. Referring to Fig. 4, Fig. 4 is a schematic diagram of the functional modules of the first embodiment of the device for counting people based on face recognition in this application.
本申请提供的基于人脸识别的人数统计装置是一种虚拟装置,存储于图1所示的基于人脸识别的人数统计设备的存储器1005中,以实现基于人脸识别的人数统计计算机可读指令的所有功能:用于当接收到人数统计指令时,从视频流中获取视频图像;用于将所述视频图像进行图片切分,获得多个切分图像,并提取多个所述切分图像的图像LBP特征;用于将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;用于统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果。The device for counting people based on face recognition provided in the present application is a virtual device, which is stored in the memory 1005 of the device for counting people based on face recognition shown in FIG. 1 to realize the computer readable people counting device based on face recognition. All functions of the instruction: when receiving the people counting instruction, obtain the video image from the video stream; use to segment the video image to obtain multiple segmented images, and extract multiple segments The image LBP features of the image; used to input the image LBP features into a pre-trained portrait recognition model, and the portrait recognition model recognizes the image LBP features, and outputs the recognition results; used to count the recognition results for all the portraits State the number of segmented images, and obtain the first person counting result.
具体地,本实施例中所述基于人脸识别的人数统计装置包括:Specifically, the device for counting people based on face recognition in this embodiment includes:
获取模块10,用于当接收到人数统计指令时,从视频流中获取视频图像;The obtaining module 10 is configured to obtain a video image from a video stream when a number counting instruction is received;
提取模块20,用于将所述视频图像进行图片切分,获得多个切分图像,并提取多个所述切分图像的图像LBP特征;The extraction module 20 is configured to perform picture segmentation on the video image, obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
识别模块30,用于将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;The recognition module 30 is configured to input the LBP features of the image into a pre-trained portrait recognition model, and the portrait recognition model recognizes the LBP features of the image, and outputs a recognition result;
统计模块40,用于统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果。The statistics module 40 is configured to count the number of the segmented images whose recognition result is a portrait, and obtain a first population statistics result.
进一地,所述识别模块还用于:Further, the identification module is also used for:
收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
进一地,所述提取模块还用于:Further, the extraction module is also used for:
将所述视频图像压缩成512×512像素的压缩视频图像;Compressing the video image into a compressed video image of 512×512 pixels;
将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;Segmenting the compressed video image according to 64×64 pixels to obtain multiple first segmented images;
将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
进一地,所述提取模块还用于:Further, the extraction module is also used for:
将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
进一地,所述统计模块还用于:Furthermore, the statistics module is also used for:
根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Report the first population count result to the server according to the preset result report interface.
进一地,所述统计模块还用于:Furthermore, the statistics module is also used for:
根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;If the first person counting result does not include the abnormal portrait, perform the step: reporting the first person counting result to the server according to a preset result reporting interface;
若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。If the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
进一地,所述统计模块还用于:Furthermore, the statistics module is also used for:
获取所述人像坐标在预设时间内出现的次数;Obtaining the number of times the portrait coordinates appear within a preset time;
若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait;
若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
此外,本申请还提供一种计算机存储介质,所述计算机存储介质上存储有基于人脸识别的人数统计计算机可读指令,所述基于人脸识别的人数统计计算机可读指令被处理器运行时实现如上所述基于人脸识别的人数统计方法的步骤,在此不再赘述。所述计算机可读存储介质可以为非易失性可读存储介质。In addition, the present application also provides a computer storage medium that stores a computer readable instruction for counting people based on face recognition. When the computer readable instruction for counting people based on face recognition is run by a processor The steps for realizing the method for counting people based on face recognition as described above will not be repeated here. The computer-readable storage medium may be a non-volatile readable storage medium.
相比现有技术,本申请提出的一种基于人脸识别的人数统计方法、装置、设备及存储介质,该方法包括:当接收到人数统计指令时,从视频流中获取视频图像;将所述视频图像进行图片切分,获得多个切分图像,并提取多个所述切分图像的图像LBP特征;将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;统计识别结果为人像的切分图像的个数,获得第一人数统计结果。本申请基于人工智能,利用图像处理技术对视频中的人数进行统计,由此极大提高了人数统计的效率和准确性。Compared with the prior art, the present application proposes a method, device, device, and storage medium for counting people based on face recognition. The method includes: obtaining a video image from a video stream when receiving a people counting instruction; The video image is segmented to obtain multiple segmented images, and the image LBP features of the segmented images are extracted; the image LBP features are input into a pre-trained portrait recognition model, and the portrait recognition model The LBP feature of the image is recognized, and the recognition result is output; the recognition result is the number of segmented images of the portrait, and the first people count result is obtained. This application is based on artificial intelligence and uses image processing technology to count the number of people in the video, thereby greatly improving the efficiency and accuracy of people counting.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者系统不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者系统所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者系统中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or system including a series of elements not only includes those elements, It also includes other elements that are not explicitly listed, or elements inherent to the process, method, article, or system. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, method, article or system that includes the element.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the foregoing embodiments of the present application are only for description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在如上所述的一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备执行本申请各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiments can be implemented by means of software plus the necessary general hardware platform. Of course, it can also be implemented by hardware, but in many cases the former is better.的实施方式。 Based on this understanding, the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM) as described above. , Magnetic disk, optical disk), including a number of instructions to make a terminal device execute the method described in each embodiment of this application.
以上所述仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或流程变换,或直接或间接运用在其它相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only preferred embodiments of this application, and do not limit the scope of this application. Any equivalent structure or process transformation made using the content of the description and drawings of this application, or directly or indirectly applied to other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims (20)

  1. 一种基于人脸识别的人数统计方法,其中,所述方法包括: A method for counting people based on face recognition, wherein the method includes:
    当接收到人数统计指令时,从视频流中获取视频图像;When receiving the people counting instruction, obtain the video image from the video stream;
    将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像局部二值模式LBP特征;Performing picture segmentation on the video image to obtain multiple segmented images, and extracting image local binary mode LBP features of the multiple segmented images;
    将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;Input the LBP features of the image into a pre-trained portrait recognition model, and the LBP features of the image are recognized by the portrait recognition model, and the recognition result is output;
    统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result;
    所述提取多个所述切分图像的图像LBP特征的步骤包括:The step of extracting image LBP features of a plurality of the segmented images includes:
    将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
    将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
    基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
    对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
  2. 根据权利要求1所述的方法,其中,所述将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果的步骤之前还包括:The method according to claim 1, wherein the step of inputting LBP features of the image into a pre-trained portrait recognition model, and recognizing the LBP features of the image by the portrait recognition model, the step of outputting the recognition result further comprises :
    收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
    将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
    提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
    将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  3. 根据权利要求1所述的方法,其中,所述切分图像包括第一切分图像和第二切分图像,所述将所述视频图像进行图片切分,获得多个切分图像的步骤包括:The method according to claim 1, wherein the segmented image includes a first segmented image and a second segmented image, and the step of dividing the video image into a picture to obtain a plurality of segmented images comprises :
    将所述视频图像压缩成512×512像素的压缩视频图像;Compressing the video image into a compressed video image of 512×512 pixels;
    将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;Segmenting the compressed video image according to 64×64 pixels to obtain multiple first segmented images;
    将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
  4. 根据权利要求1所述的方法,其中,所述统计识别结果为人像的切分图像的个数,获得第一人数统计结果的步骤之后还包括:The method according to claim 1, wherein the statistical recognition result is the number of segmented images of the portrait, and after the step of obtaining the first population statistical result, the method further comprises:
    根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Report the first population count result to the server according to the preset result report interface.
  5. 根据权利要求4所述的方法,其中,所述根据预设的结果上报接口将所述第一人数统计结果上报至服务器的步骤之前还包括:The method according to claim 4, wherein before the step of reporting the first people counting result to the server according to the preset result reporting interface, the method further comprises:
    根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
    若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;If the first person counting result does not include the abnormal portrait, perform the step: reporting the first person counting result to the server according to a preset result reporting interface;
    若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。If the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
  6. 根据权利要求5中所述的方法,其中,所述人像识别模型记录识别结果为人像的所述切分图像中人像的人像坐标,所述根据波峰计数法判断所述第一人数统计结果中是否包括异常人像的步骤包括:The method according to claim 5, wherein the portrait recognition model records the portrait coordinates of the portrait in the segmented image whose recognition result is a portrait, and the method of crest counting determines whether the first population count result is The steps to include unusual figures include:
    获取所述人像坐标在预设时间内出现的次数;Obtaining the number of times the portrait coordinates appear within a preset time;
    若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait;
    若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  7. 一种基于人脸识别的人数统计装置,其中,所述基于人脸识别的人数统计装置包括:A device for counting people based on face recognition, wherein the device for counting people based on face recognition includes:
    获取模块,用于当接收到人数统计指令时,从视频流中获取视频图像;The obtaining module is used to obtain video images from the video stream when receiving the people counting instruction;
    提取模块,用于将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像LBP特征;An extraction module, configured to perform picture segmentation on the video image to obtain multiple segmented images, and extract image LBP features of the multiple segmented images;
    识别模块,用于将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;A recognition module, configured to input LBP features of the image into a pre-trained portrait recognition model, the portrait recognition model recognizes the LBP features of the image, and outputs a recognition result;
    统计模块,用于统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;A statistics module, configured to count the number of the segmented images whose recognition result is a portrait, to obtain a first population statistics result;
    所述提取模块还用于:The extraction module is also used for:
    将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
    将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
    基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
    对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
  8. 根据权利要求7所述的装置,其中,所述识别模块还用于:The device according to claim 7, wherein the identification module is further used for:
    收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
    将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
    提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
    将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  9. 根据权利要求7所述的装置,其中,所述提取模块还用于;The device according to claim 7, wherein the extraction module is also used for;
    将所述视频图像压缩成512×512像素的压缩视频图像;Compressing the video image into a compressed video image of 512×512 pixels;
    将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;Segmenting the compressed video image according to 64×64 pixels to obtain multiple first segmented images;
    将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
  10. 根据权利要求7所述的装置,其中,所述统计模块还用于The device according to claim 7, wherein the statistics module is also used for
    根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Report the first population count result to the server according to the preset result report interface.
  11. 根据权利要求10所述的装置,其中,所述统计模块还用于:The device according to claim 10, wherein the statistics module is further used for:
    根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
    若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;If the first person counting result does not include the abnormal portrait, perform the step: reporting the first person counting result to the server according to a preset result reporting interface;
    若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。If the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
  12. 根据权利要求11所述的装置,其中,所述统计模块还用于:The device according to claim 11, wherein the statistics module is further used for:
    获取所述人像坐标在预设时间内出现的次数;Obtaining the number of times the portrait coordinates appear within a preset time;
    若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait;
    若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  13. 一种基于人脸识别的人数统计设备,其中,所述基于人脸识别的人数统计设备包括处理器,存储器以及存储在所述存储器中的基于人脸识别的人数统计计算机可读指令,所述基于人脸识别的人数统计计算机可读指令被所述处理器运行时,实现如下步骤:A device for counting people based on face recognition, wherein the device for counting people based on face recognition includes a processor, a memory, and computer-readable instructions for counting people based on face recognition stored in the memory, and When the computer-readable instructions for counting people based on face recognition are executed by the processor, the following steps are implemented:
    当接收到人数统计指令时,从视频流中获取视频图像;When receiving the people counting instruction, obtain the video image from the video stream;
    将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像局部二值模式LBP特征;Performing picture segmentation on the video image to obtain multiple segmented images, and extracting image local binary mode LBP features of the multiple segmented images;
    将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;Input the LBP features of the image into a pre-trained portrait recognition model, and the LBP features of the image are recognized by the portrait recognition model, and the recognition result is output;
    统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result;
    所述提取多个所述切分图像的图像LBP特征的步骤包括:The step of extracting image LBP features of a plurality of the segmented images includes:
    将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
    将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
    基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
    对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
  14. 根据权利要求13所述的设备,其中,所述将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果的步骤之前还包括:The device according to claim 13, wherein the step of inputting LBP features of the image into a pre-trained portrait recognition model, and recognizing the LBP features of the image by the portrait recognition model, the step of outputting the recognition result further comprises :
    收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
    将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
    提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
    将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait.
  15. 根据权利要求13所述的装置,其中,所述切分图像包括第一切分图像和第二切分图像,所述将所述视频图像进行图片切分,获得多个切分图像的步骤包括:The apparatus according to claim 13, wherein the segmented image includes a first segmented image and a second segmented image, and the step of dividing the video image into a picture to obtain a plurality of segmented images comprises :
    将所述视频图像压缩成512×512像素的压缩视频图像;Compressing the video image into a compressed video image of 512×512 pixels;
    将所述压缩视频图像按64×64像素进行图片切分,获得多个第一切分图像;Segmenting the compressed video image according to 64×64 pixels to obtain multiple first segmented images;
    将所述第一切分图像中相邻切分图像的重叠区域以64像素为起点进行二次图片切分,获得第二切分图像。Perform secondary image segmentation on the overlapping area of adjacent segmented images in the first segmented image with 64 pixels as a starting point to obtain a second segmented image.
  16. 根据权利要求13所述的设备,其中,所述统计识别结果为人像的切分图像的个数,获得第一人数统计结果的步骤之后还包括:The device according to claim 13, wherein the statistical recognition result is the number of segmented images of the portrait, and after the step of obtaining the first number of people statistical result, the method further comprises:
    根据预设的结果上报接口将所述第一人数统计结果上报至服务器。Report the first population count result to the server according to the preset result report interface.
  17. 根据权利要求16所述的设备,其中,所述根据预设的结果上报接口将所述第一人数统计结果上报至服务器的步骤之前还包括:The device according to claim 16, wherein before the step of reporting the first population count result to the server according to the preset result reporting interface, the method further comprises:
    根据波峰计数法判断所述第一人数统计结果中是否包括异常人像;Judging whether an abnormal human figure is included in the first population count result according to the peak counting method;
    若所述第一人数统计结果中不包括所述异常人像,则执行步骤:根据预设的结果上报接口将所述第一人数统计结果上报至服务器;If the first person counting result does not include the abnormal portrait, perform the step: reporting the first person counting result to the server according to a preset result reporting interface;
    若所述第一人数统计结果中包括所述异常人像,则在所述第一人数统计结果中去除所述异常人像的个数后,获得第二人数统计结果,将所述第二人数统计结果上报至所述服务器。If the first person counting result includes the abnormal person, after removing the number of the abnormal person from the first person counting result, a second person counting result is obtained, and the second person counting result is Report to the server.
  18. 根据权利要求17所述的设备,其中,所述人像识别模型记录识别结果为人像的所述切分图像中人像的人像坐标,所述根据波峰计数法判断所述第一人数统计结果中是否包括异常人像的步骤包括:18. The device according to claim 17, wherein the portrait recognition model records the portrait coordinates of the portrait in the segmented image whose recognition result is a portrait, and the method of crest counting determines whether the first population count result includes The steps for unusual portraits include:
    获取所述人像坐标在预设时间内出现的次数;Obtaining the number of times the portrait coordinates appear within a preset time;
    若所述人像坐标在预设时间内出现的次数大于或等于次数阈值,则判定所述图像坐标对应的人像不是异常人像;If the number of times that the portrait coordinates appear within the preset time is greater than or equal to the number threshold, it is determined that the portrait corresponding to the image coordinates is not an abnormal portrait;
    若所述人像图像坐标在预设时间内出现的次数小于次数阈值,则判定所述图像坐标对应的人像是异常人像,则将所述图像坐标对应的人像标记为异常人像。If the number of occurrences of the portrait image coordinates within the preset time is less than the number threshold, it is determined that the portrait corresponding to the image coordinates is an abnormal portrait, and the portrait corresponding to the image coordinates is marked as an abnormal portrait.
  19. 一种计算机存储介质,其中,所述计算机存储介质上存储有基于人脸识别的人数统计计算机可读指令,所述基于人脸识别的人数统计计算机可读指令被处理器运行时实现如下的步骤:A computer storage medium, wherein the computer storage medium stores a computer readable instruction for counting people based on face recognition, and the following steps are implemented when the computer readable instruction for counting people based on face recognition is executed by a processor :
    当接收到人数统计指令时,从视频流中获取视频图像;When receiving the people counting instruction, obtain the video image from the video stream;
    将所述视频图像进行图片切分,获得多个切分图像,并提取所述多个切分图像的图像局部二值模式LBP特征;Performing picture segmentation on the video image to obtain multiple segmented images, and extracting image local binary mode LBP features of the multiple segmented images;
    将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果;Input the LBP features of the image into a pre-trained portrait recognition model, and the LBP features of the image are recognized by the portrait recognition model, and the recognition result is output;
    统计识别结果为人像的所述切分图像的个数,获得第一人数统计结果;Count the number of the segmented images whose recognition result is a portrait, and obtain a first population count result;
    所述提取多个所述切分图像的图像LBP特征的步骤包括:The step of extracting image LBP features of a plurality of the segmented images includes:
    将所述切分图像划分为多个区域;Dividing the segmented image into multiple regions;
    将每个区域中的每一个像素点的中心灰度值与所述像素点相邻的8个相邻像素点的灰度值进行比较,获得所述像素点的LBP特征;Comparing the central gray value of each pixel in each area with the gray values of 8 adjacent pixels adjacent to the pixel to obtain the LBP feature of the pixel;
    基于所述像素点的LBP特征,获得每个区域的直方图;Obtain a histogram of each area based on the LBP feature of the pixel;
    对所述每个区域的直方图进行归一化处理获得统计直方图,基于所述统计直方图获得所述切分图像的图像LBP特征。Perform normalization processing on the histogram of each region to obtain a statistical histogram, and obtain the image LBP feature of the segmented image based on the statistical histogram.
  20. 根据权利要求19所述的计算机存储介质,其中,所述将所述图像LBP特征输入预先训练的人像识别模型,由所述人像识别模型对所述图像LBP特征进行识别,输出识别结果的步骤之前还包括:The computer storage medium according to claim 19, wherein the step of inputting the LBP features of the image into a pre-trained portrait recognition model, and recognizing the LBP features of the image by the portrait recognition model, and outputting the recognition result Also includes:
    收集预设数量的样本图像,将所述样本图像的标签设置为人像或非人像;Collect a preset number of sample images, and set the label of the sample images as portrait or non-portrait;
    将所述样本图像压缩成128×128像素后再进行灰度处理和随机残缺处理,获得处理后样本图像;After compressing the sample image into 128×128 pixels, perform gray-scale processing and random incomplete processing to obtain a processed sample image;
    提取所述处理后样本图像的样本LBP特征,获得样本LBP特征;Extracting the sample LBP feature of the processed sample image to obtain the sample LBP feature;
    将所述样本LBP特征输入基于TensorFlow创建的神经网络中进行训练,获得所述人像识别模型,所述人像识别模型输出的识别结果为人像或非人像。 The sample LBP features are input into a neural network created based on TensorFlow for training to obtain the portrait recognition model, and the recognition result output by the portrait recognition model is a portrait or a non-portrait. To
PCT/CN2019/122079 2019-04-12 2019-11-29 People counting method, apparatus, and device based on facial recognition, and storage medium WO2020207038A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910297454.2 2019-04-12
CN201910297454.2A CN110163092A (en) 2019-04-12 2019-04-12 Demographic method, device, equipment and storage medium based on recognition of face

Publications (1)

Publication Number Publication Date
WO2020207038A1 true WO2020207038A1 (en) 2020-10-15

Family

ID=67639302

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/122079 WO2020207038A1 (en) 2019-04-12 2019-11-29 People counting method, apparatus, and device based on facial recognition, and storage medium

Country Status (2)

Country Link
CN (1) CN110163092A (en)
WO (1) WO2020207038A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822111A (en) * 2021-01-19 2021-12-21 北京京东振世信息技术有限公司 Crowd detection model training method and device and crowd counting method and device
CN114462653A (en) * 2022-01-24 2022-05-10 广东天地和实业控股集团有限公司 O2O type campus self-service catering management method, system, equipment and medium
CN116128883A (en) * 2023-04-19 2023-05-16 尚特杰电力科技有限公司 Photovoltaic panel quantity counting method and device, electronic equipment and storage medium
CN118379778A (en) * 2024-06-25 2024-07-23 贵州聚控云科技有限公司 Personnel information statistics method and system based on face recognition

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110163092A (en) * 2019-04-12 2019-08-23 深圳壹账通智能科技有限公司 Demographic method, device, equipment and storage medium based on recognition of face
CN111199215A (en) * 2020-01-06 2020-05-26 郑红 People counting method and device based on face recognition

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2012013865A (en) * 2012-11-29 2014-05-28 Vision Holdings Mexico S De R L De C V System and method for paying and counting passengers in transport means.
KR101500496B1 (en) * 2013-12-06 2015-03-10 주식회사 케이티 Apparatus for recognizing face and method thereof
CN105160313A (en) * 2014-09-15 2015-12-16 中国科学院重庆绿色智能技术研究院 Method and apparatus for crowd behavior analysis in video monitoring
CN108563997A (en) * 2018-03-16 2018-09-21 新智认知数据服务有限公司 It is a kind of establish Face datection model, recognition of face method and apparatus
CN108805140A (en) * 2018-05-23 2018-11-13 国政通科技股份有限公司 A kind of feature rapid extracting method and face identification system based on LBP
CN109344765A (en) * 2018-09-28 2019-02-15 广州云从人工智能技术有限公司 A kind of intelligent analysis method entering shop personnel analysis for chain shops
CN110163092A (en) * 2019-04-12 2019-08-23 深圳壹账通智能科技有限公司 Demographic method, device, equipment and storage medium based on recognition of face

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106778512A (en) * 2016-11-25 2017-05-31 南京蓝泰交通设施有限责任公司 Face identification method under the conditions of a kind of unrestricted based on LBP and depth school
CN109359548B (en) * 2018-09-19 2022-07-08 深圳市商汤科技有限公司 Multi-face recognition monitoring method and device, electronic equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MX2012013865A (en) * 2012-11-29 2014-05-28 Vision Holdings Mexico S De R L De C V System and method for paying and counting passengers in transport means.
KR101500496B1 (en) * 2013-12-06 2015-03-10 주식회사 케이티 Apparatus for recognizing face and method thereof
CN105160313A (en) * 2014-09-15 2015-12-16 中国科学院重庆绿色智能技术研究院 Method and apparatus for crowd behavior analysis in video monitoring
CN108563997A (en) * 2018-03-16 2018-09-21 新智认知数据服务有限公司 It is a kind of establish Face datection model, recognition of face method and apparatus
CN108805140A (en) * 2018-05-23 2018-11-13 国政通科技股份有限公司 A kind of feature rapid extracting method and face identification system based on LBP
CN109344765A (en) * 2018-09-28 2019-02-15 广州云从人工智能技术有限公司 A kind of intelligent analysis method entering shop personnel analysis for chain shops
CN110163092A (en) * 2019-04-12 2019-08-23 深圳壹账通智能科技有限公司 Demographic method, device, equipment and storage medium based on recognition of face

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822111A (en) * 2021-01-19 2021-12-21 北京京东振世信息技术有限公司 Crowd detection model training method and device and crowd counting method and device
CN113822111B (en) * 2021-01-19 2024-05-24 北京京东振世信息技术有限公司 Crowd detection model training method and device and crowd counting method and device
CN114462653A (en) * 2022-01-24 2022-05-10 广东天地和实业控股集团有限公司 O2O type campus self-service catering management method, system, equipment and medium
CN114462653B (en) * 2022-01-24 2022-09-30 广东天地和实业控股集团有限公司 O2O type campus self-service catering management method, system, equipment and medium
CN116128883A (en) * 2023-04-19 2023-05-16 尚特杰电力科技有限公司 Photovoltaic panel quantity counting method and device, electronic equipment and storage medium
CN118379778A (en) * 2024-06-25 2024-07-23 贵州聚控云科技有限公司 Personnel information statistics method and system based on face recognition

Also Published As

Publication number Publication date
CN110163092A (en) 2019-08-23

Similar Documents

Publication Publication Date Title
WO2020207038A1 (en) People counting method, apparatus, and device based on facial recognition, and storage medium
WO2020034526A1 (en) Quality inspection method, apparatus, device and computer storage medium for insurance recording
WO2018018771A1 (en) Dual camera-based photography method and system
WO2020073495A1 (en) Artificial intelligence-based reexamination method, apparatus, and device, and storage medium
WO2017010695A1 (en) Three dimensional content generating apparatus and three dimensional content generating method thereof
WO2020082562A1 (en) Symbol identification method, apparatus, device, and storage medium
WO2021012508A1 (en) Ai image recognition method, apparatus and device, and storage medium
WO2020130309A1 (en) Image masking device and image masking method
WO2015137666A1 (en) Object recognition apparatus and control method therefor
WO2018008881A1 (en) Terminal device and service server, method and program for providing diagnostic analysis service performed by same device, and computer-readable recording medium having same program recorded therein
WO2020207035A1 (en) Crank call interception method, apparatus, and device, and storage medium
WO2019203528A1 (en) Electronic apparatus and method for controlling thereof
WO2020253048A1 (en) Image recognition method, apparatus and device based on deep learning, and storage medium
WO2020015060A1 (en) Power consumption anomaly estimation method and apparatus, device, and computer storage medium
WO2018131875A1 (en) Display apparatus and method for providing service thereof
WO2021051558A1 (en) Knowledge graph-based question and answer method and apparatus, and storage medium
WO2018223520A1 (en) Child-oriented learning method and learning device, and storage medium
WO2010041836A2 (en) Method of detecting skin-colored area using variable skin color model
WO2020034534A1 (en) Drive recorder-based evidence production method, apparatus and device, and storage medium
WO2013085278A1 (en) Monitoring device using selective attention model and method for monitoring same
WO2022114731A1 (en) Deep learning-based abnormal behavior detection system and detection method for detecting and recognizing abnormal behavior
WO2020186774A1 (en) Positioning method and apparatus based on image detection, and device and storage medium
WO2020233055A1 (en) Product promotion method, apparatus, and device based on liveness detection, and storage medium
WO2020067725A1 (en) Apparatus and method for model reconstruction using photogrammetry
WO2020080734A1 (en) Face recognition method and face recognition device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19924405

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 02/02/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19924405

Country of ref document: EP

Kind code of ref document: A1