CN111242010A

CN111242010A - Method for judging and identifying identity of litter worker based on edge AI

Info

Publication number: CN111242010A
Application number: CN202010026367.6A
Authority: CN
Inventors: 余齐齐
Original assignee: Xiamen Bohai Zhongtian Information Technology Co ltd
Current assignee: Xiamen Bohai Zhongtian Information Technology Co ltd
Priority date: 2020-01-10
Filing date: 2020-01-10
Publication date: 2020-06-05

Abstract

The invention relates to the technical field of judging litter and identifying the identity of illegal persons, in particular to a method for judging and identifying the identity of a litter person based on an edge AI (artificial intelligence) algorithm, wherein the method for judging and identifying the identity of the litter person based on the edge AI algorithm provided by the invention realizes calculation resources and storage resources for accelerating identification through an edge AI calculation method, can quickly identify whether a litter action exists, adopts a face recognition technology to identify the identity of the litter person when judging the litter action by adopting the edge AI algorithm, helps to make person identity tracing, traces back the litter person to the illegal person through the illegal action, and is favorable for educating the individual who throws the litter and establishing a responsibility system.

Description

Method for judging and identifying identity of litter worker based on edge AI

Technical Field

The invention relates to the technical field of judging litter and identifying illegal personnel identities, in particular to a method for judging and identifying the identity of a litter personnel based on an edge AI.

Background

With the rapid development of social economy in China, the problems of further improving city grade, improving city image, beautifying city environment and the like become important subjects in front of governments, environment improvement, city management optimization, new city image building, city grade improvement and full development of activities of civilized cities are promoted.

With the continuous deepening of the urbanization process, the insufficient urban environment monitoring efficiency becomes an urgent problem to be solved, household garbage is taken as an important influence factor influencing the urban environment, a large amount of household garbage is generated along with the improvement of the urban living standard, the effective control of the garbage becomes a major problem of urban environmental protection work, a large amount of randomly discarded garbage in the past causes harm to water and soil resources of people, and the survival and development of people are directly threatened; information collection, processing and analysis of sanitation work as an important ring has not yet been mature solution, and is usually only discovered and stopped by sanitation workers, so that the efficiency is low. Under the condition, the original environment monitoring mode cannot meet the actual needs of the digital city, and the comprehensive and efficient environment detection system based on artificial intelligence becomes the inevitable appeal of urbanization development.

However, the current irreconcilable phenomenon of garbage throwing is still all the more than ever, and the garbage throwing is difficult to effectively control and depends on the independent consciousness of people. The existing artificial intelligence algorithm model is used for identifying whether a garbage throwing behavior exists or not, and the existing artificial intelligence algorithm model firstly judges whether a person enters a garbage throwing forbidden area or not, and then voice broadcasting is carried out to remind that garbage can not be thrown.

The method only plays a role in reminding, the realization effect depends on the quality of individuals, and the behavior of forbidding throwing garbage randomly cannot be really realized.

Disclosure of Invention

In order to solve the problem that the prior art mentioned in the background art can only remind people who forbid garbage throwing and cannot confirm the identity of garbage throwing personnel, the invention provides a method for judging and identifying the identity of garbage throwing personnel based on edge AI, which comprises the following steps:

s100, performing original data algorithm model learning by using a fastercnnn framework of a deep learning target detection algorithm based on Region Proposal by a cloud algorithm center;

s200, distributing an original data example model obtained by the famtercnn framework learning to each edge computing node by the cloud example center through a cloud unloading mode of cloud computing;

s300, the field camera transmits the collected video and pictures of the personnel throwing garbage to an edge computing node, the edge computing node is combined with an existing example model to carry out computing, whether the personnel throwing garbage has illegal behaviors of throwing garbage or not is judged according to a computing result, if yes, an instruction is sent to a field terminal device, and the field terminal device gives a prompt according to the instruction;

and S400, the edge computing node transmits the judgment result and the collected video and picture of the person throwing garbage to a cloud computing center for storage or viewing.

On the basis of the scheme, further, the raw data example model is obtained through the following steps:

s110, constructing an image database for identifying people and garbage throwing behaviors, and performing model depth model training and labeling on the image to form source data;

and S120, performing AI modeling based on the source data and the fastercnnn algorithm network to generate an original data example model.

On the basis of the above scheme, further, the step S110 includes the following steps:

s111, obtaining pictures containing people and garbage throwing behaviors as data sources;

and S112, carrying out model depth model training and labeling on the data source and forming source data, wherein the labeling comprises the position of the image, the name of the image, the width and the height of the image, the dimension of the image, the name of a labeled object and the xy coordinate value of the Bbox.

On the basis of the above scheme, further, the step S120 includes the following steps:

s121, extracting feature maps of the candidate images from the original images through a multilayer convolutional neural network, wherein the feature maps can be shared for use by a subsequent RPN network and a full connection layer;

s122, generating region prosages through RPN layers by the feature maps, firstly generating a stack of anchors, then cutting and filtering, judging whether the anchors belong to the foreground or the background through softmax, and meanwhile correcting the anchors by utilizing bounding box regression to form more accurate prosages;

s123, obtaining a propofol feature map with a fixed size on a Roi Pooling layer by the propofol;

s124, the generic feature map performs full-connection operation, specific category classification is performed on the full-connection layer and softmax, and a bounding box regression operation is completed by utilizing L1 Loss to obtain the accurate position of the object, so that an original data example model is generated.

On the basis of the above scheme, further, the S122 includes the following steps:

the fastercnnn adopts the SPP layer in the SPP-net, and introduces a multitask loss function for calculating loss of bbox regression and classification.

On the basis of the above scheme, further, the multitask loss function is:

wherein, the p is_iThe probability of being the target is predicted for the anchor,

N_clsis the total number of anchors, t_i＝(t_x,t_y,t_w,t_h) Is a vector representing the anchor, the offset predicted by the RPN training stage,

is and t_iVectors with the same dimension represent anchors, the offset of the RPN training stage relative to gt is actual, gt is a labeled correct target, lambda is a labeled number, and L is_regTo mark the correct number of targets, L_clsTo classify the number, N_regThe number of positive anchors.

On the basis of the above scheme, further, the calculation of the loss of bbox regression and classification is expressed by using a thought vector (x, y, w, h), and the calculation steps are as follows:

1) and (Δ x, Δ y), Δ x ═ P_Wd_x(P),Δy＝P_hd_y(P)；

2) Scaling (S)_W,S_h),S_W＝P_Wd_W(P),S_h＝P_hd_h(P)；

Wherein x, y, w, h respectively represent the central point coordinate and width and height of the image, P_WIs the target position, P_hIs a target height, d_x(P) is the target abscissa, d_y(P) is the object ordinate, d_w(P) is the zoom size, d_h(P) is the zoom height, S_wTo target zoom size, S_hThe target zoom height is obtained.

On the basis of the scheme, the equipment model adopted by the EDGE computing node is VA-TU-DL, and preferably VA-TU-EDGE-B01.

Compared with the prior art, the method for judging and identifying the identity of the litter thrower based on the edge AI has the following technical effects:

the invention can quickly and accurately identify whether the garbage throwing behavior exists in the garbage throwing forbidden area or not, and identify the identity of the garbage throwing personnel, thereby helping to establish a set of complete garbage throwing forbidden system.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.

FIG. 1 is a block diagram of a method for determining and identifying the identity of a person who discards garbage according to an edge AI of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In the description of the present invention, it should be noted that the terms "center", "longitudinal", "lateral", "up", "down", "front", "back", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", etc., indicate orientations or positional relationships based on those shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the referred device or element must have a specific orientation, be constructed and operated in a specific orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.

The invention provides a method for judging and identifying the identity of a litter worker based on an edge AI, which comprises the following steps:

In specific implementation, as shown in fig. 1, the method comprises the following steps:

s300, the field camera transmits the collected video and pictures of the personnel throwing garbage to an edge computing node, the edge computing node is combined with an existing example model to carry out computing, whether the personnel throwing garbage has illegal behaviors of throwing garbage or not is judged according to a computing result, if yes, an instruction is sent to the field terminal device, and the field terminal device gives a prompt according to the instruction.

The edge computing device detects that a person holding garbage enters a monitoring site and starts tracking the person holding garbage and the garbage on hands until the garbage is thrown out, the edge computing device calculates a calculation result value by combining a process video of the garbage thrown out with an example model, judges whether the person throwing the garbage has illegal behaviors of throwing the garbage according to the calculation result, specifically judges whether the illegal behaviors of throwing the garbage exist or not by comparing a numerical value of the result with a set threshold value through the example model, and if the illegal behaviors of throwing the garbage are lower than the threshold value, the garbage is thrown; if the instruction exists, the instruction is sent to the field terminal equipment, and the field terminal equipment gives a prompt according to the instruction, wherein the prompt can be used for playing voice or other related operations, such as broadcasting through a sound-producing device or an alarm bell;

it should be noted that, not only can the field video data be acquired by the camera, but also a person skilled in the art can implement the method by other devices for acquiring image or video data, and redundant description is not repeated here;

The edge computing node transmits the judgment result and the collected video and picture of the personnel throwing garbage to the cloud computing center for storage or checking, so that the identity of the illegal personnel can be identified conveniently, the illegal personnel can be traced back by illegal behaviors, and education and responsibility system establishment for the individual throwing garbage are facilitated.

The method for judging and identifying the identity of the garbage thrower based on the edge AI can quickly and accurately identify whether the garbage thrower exists in the garbage throwing forbidden area or not, identify the identity of the garbage thrower and help establish a complete garbage throwing forbidden system.

Preferably, the raw data example model is obtained by the following steps:

In specific implementation, as a CNN network target detection method, fastercnnn extracts feature maps of an original image by using a convolutional layer + pooling layer mode, and provides support for a subsequent RPN layer and a full connection layer; the RPN network is mainly used for generating region explosals;

firstly, after the picture with any size passes Conv layers, the original image is cut into 1/16, then the Feature Map enters into Intemediate layer filter to generate Anchor box finally, then judge that anchors belong to the foreground or the background, namely judge that the object is or is not, namely generate a matrix with MxN size on the image, firstly, performing convolution with 1x1 to output the image with WxHx18 size,

then whether the object is finally obtained through a softmax function and a resurcope layer, and meanwhile, a bounding box regression correction anchor box forms accurate propofol;

secondly, utilizing the prosals generated by the RPN and the feature map obtained by the last layer of the VGG16 to finally obtain the prosal feature map with fixed size, and entering the back to perform target identification and positioning by utilizing full connection operation;

finally, forming a feature map with a fixed size on a Roi Pooling layer for full connection operation, and classifying specific categories by using Softmax, wherein the feature map is mapped into values (0,1) through a Softmax function, and the cumulative sum of the values is 1 to meet the property of probability, so that the value with the largest probability can be selected as a prediction target, classifying according to the values, then completing bounding box regression operation by using L1 Loss to obtain the accurate position of the object, namely rpn _ Loss _ bbox uses a smooth L1 calculation function, performing prediction marking on the anchor box in rpn-data, and calculating the offset between the anchor box and gt _ boxes; meanwhile, the bounding box regression obtains the position offset bbox _ pred of each region proxy, and regression obtains a more accurate target detection position and finally obtains the accurate position of the object;

finally, an example model of the raw data is generated in the above manner.

Preferably, the step S110 includes the steps of:

In specific implementation, model depth model training and labeling are carried out by the following method:

preferably, the step S120 includes the steps of:

Preferably, the S122 includes the steps of:

Preferably, the multitask penalty function is:

Preferably, the computation of the loss of bbox regression and classification, expressed using a thought vector (x, y, w, h), is performed as follows:

1) and (Δ x, Δ y), Δ x ═ P_Wd_x(P),Δy＝P_hd_y(P)；

2) Scaling (S)_W,S_h),S_W＝P_Wd_W(P),S_h＝P_hd_h(P)；

Preferably, the equipment model adopted by the edge computing node is VA-TU-DL.

In particular, the EDGE equipment model is preferably VA-TU-EDGE-B01.

In order to test the application of the method for judging and identifying the identity of the people who throw garbage based on the edge AI in practice, the following tests are provided:

100000 human face pictures and 100000 garbage throwing pictures are collected by the example center for marking training, and a human face recognition model and a garbage throwing recognition model are established. The edge equipment is provided with a face recognition model and a garbage throwing recognition model, 1500 face pictures and 1000 garbage throwing pictures are collected by a camera of the edge equipment every day, and the model studies and judges the pictures collected by the camera and outputs the studied and judged results. The pictures collected on site are returned to the calculation center for continuous marking training, the accuracy of the face recognition model reaches 98%, and the accuracy of the garbage throwing recognition model reaches 90%.

Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims

1. A method for judging and identifying the identity of a litter worker based on an edge AI is characterized by comprising the following steps:

2. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 1, wherein the raw data example model is obtained by:

3. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 2, wherein the step S110 comprises the steps of:

4. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 2, wherein the step S120 comprises the steps of:

s122, generating region prosages through RPN layers by the feature maps, firstly generating a stack of anchor boxes, then cutting and filtering, judging whether the anchors belong to the foreground or the background through softmax, and meanwhile correcting the anchor boxes by utilizing bounding box regression to form more accurate prosages;

5. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 4, wherein the step S122 comprises the steps of:

6. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 5, wherein the multitasking loss function is:

7. The method for determining and identifying identity of people throwing garbage based on edge AI of claim 5, wherein: the computation of the loss of bbox regression and classification, expressed using thought vectors (x, y, w, h), is performed as follows:

1) and (Δ x, Δ y), Δ x ═ P_Wd_x(P),Δy＝P_hd_y(P)；

2) Scaling (S)_W,S_h),S_W＝P_Wd_W(P),S_h＝P_hd_h(P)；

8. The method for determining and identifying identity of people throwing garbage based on edge AI according to claim 1, wherein: the equipment model adopted by the edge computing node is VA-TU-DL.