CN107832728A

CN107832728A - A kind of judge based on video makes a phone call Activity recognition method

Info

Publication number: CN107832728A
Application number: CN201711168589.6A
Authority: CN
Inventors: 朱健立; 于宏志; 王景彬; 刘永乐
Original assignee: Tianjin Intane Video Technology Co Ltd
Current assignee: Tianjin Intane Video Technology Co Ltd
Priority date: 2017-11-21
Filing date: 2017-11-21
Publication date: 2018-03-23

Abstract

Made a phone call Activity recognition method the invention provides a kind of judge based on video, specifically include the personage's head and shoulder region identified using the textural characteristics of personage's head and shoulder in image in area-of-interest；Personage's head and shoulder region is identified using deep learning disaggregated model, judges present image with the presence or absence of behavior of making a phone call；Energy of making a phone call is calculated according to judged result, and finally judged.The present invention utilizes high definition court's trial video, is automatically found judge position, then carries out making a phone call to identify.Due to having used the thought of energy accumulation, this method has accuracy height, loss low and the characteristic of strong robustness.

Description

A kind of judge based on video makes a phone call Activity recognition method

Technical field

The invention belongs to video detection technology field, know more particularly, to a kind of judge based on video behavior of making a phone call Other method.

Background technology

During opening a court session, it is both that itself professional not strong performance causes to the seriousness of court again that judge, which takes phone, Destroy, it could even be possible to influenceing the fairness of trial by phone.Judge, which should strictly observe, " forbids people's electricity of being combatted smuggling during opening a court session Court's discipline of words ".With the progress of court's electronization upgrading, present most courts record court's trial using picture pick-up device Process.After having high definition court's trial video, carry out making a phone call to be identified as in order to possible for people using computer generation.

The content of the invention

In view of this, a kind of Activity recognition method the present invention is directed to propose judge based on video makes a phone call, to utilize High definition court's trial video, is automatically found judge position, carries out making a phone call to identify.

To reach above-mentioned purpose, the technical proposal of the invention is realized in this way：

A kind of judge based on video makes a phone call Activity recognition method, specifically comprises the following steps：

(1) personage's head and shoulder region in image in area-of-interest is identified using the textural characteristics of personage's head and shoulder；

(2) personage's head and shoulder region is identified using deep learning disaggregated model, judges that present image whether there is and beat Phone behavior；

(3) energy of making a phone call is calculated according to judged result, and finally judged.

Further, specifically included in the step (1), personage's head and shoulder Sample Storehouse is established, according to known personage's head and shoulder sample This HOG features, establish personage's head and shoulder model using DPM algorithms, then use head and shoulder in the area-of-interest of video to be detected Model is detected, and obtains the head and shoulder scope of personage in video.

Further, specifically included in the step (2), foundation is made a phone call and Sample Storehouse of not making a phone call, utilizes Sample Storehouse Disaggregated model training is carried out using deep learning, the head and shoulder range image of personage is identified by deep learning, judges to work as Preceding image is with the presence or absence of behavior of making a phone call.

Further, specifically included in the step (3), energy scores are carried out to human target according to multiple judged result Whether accumulation, then finally making a phone call to judge according to energy scores to personage.

Further, the utilization DPM algorithms are established personage's head and shoulder model and specifically included

(11) image to be checked is carried out piecemeal by DPM algorithms first, then to each piece of extraction HOG feature；

(12) each block of calculating obtains the weight of each block to the distance at center, and the HOG of these Weights is special Sign is trained and detected by SVM cascade grader, finally gives personage's head and shoulder regional location.

Relative to prior art, a kind of judge based on video of the present invention Activity recognition method of making a phone call has Following advantage：The present invention utilizes high definition court's trial video, is automatically found judge position, then carries out making a phone call to identify；The present invention by In the thought for having used energy accumulation, have that accuracy is high, loss is low and the characteristic of strong robustness.

Brief description of the drawings

The accompanying drawing for forming the part of the present invention is used for providing a further understanding of the present invention, schematic reality of the invention Apply example and its illustrate to be used to explain the present invention, do not form inappropriate limitation of the present invention.In the accompanying drawings：

Fig. 1 is that a kind of judge based on video described in the embodiment of the present invention makes a phone call Activity recognition method flow diagram；

Fig. 2 is the HOG feature extraction block diagrams described in the embodiment of the present invention；

Fig. 3 is the division schematic diagram of the rectangle HOG blocks described in the embodiment of the present invention；

Fig. 4 is the DPM flow charts described in the embodiment of the present invention.

Embodiment

It should be noted that in the case where not conflicting, the feature in embodiment and embodiment in the present invention can phase Mutually combination.

Describe the present invention in detail below with reference to the accompanying drawings and in conjunction with the embodiments.

Made a phone call Activity recognition method the invention provides a kind of judge based on video, overall flow figure of the invention As shown in figure 1, key step is as follows：

First, personage's head and shoulder region is found in the picture, carries out the training of personage's head and shoulder model in the present invention using DPM algorithms And detection.

Image to be checked is carried out piecemeal by 1.DPM algorithms first, then to each piece of extraction HOG feature, gradient direction Nogata Figure (Histogram of Oriented Gradient, HOG) is characterized in a kind of normal in computer vision and image processing field Character description method, it is as shown in Figure 2 that it extracts characteristic procedure：

(1) by image normalization：

Wherein max and min is respectively the minimum and maximum pixel value of image, and x is normalization preceding pixel value, and x* is normalization Pixel value afterwards.

(2) gradient of normalized image is calculated using first differential.

I, j are the coordinate of current pixel,Represent gradient, each corresponding 8 Grad of point.

(3) the direction weight projection based on gradient magnitude.As shown in figure 3, one block (Block) of the division of rectangle HOG blocks It is made up of 4 units (Cell), a unit is made up of 64 pixels.The distribution of 0~180 degree is divided into 9 deflections Degree, each orientation angle scope can correspond to a Bin, and the gradient obtained in step (2) is independently done into ladder in each Cell Directional statistics are spent, obtain a 9 dimensional vector v (x1, x2 ..., x9), wherein x1 correspondingly projects to terraced in 0~20 degree of direction bin The number of degree, for each Block, obtain 36 dimensional characteristics vector vs (x1, x2 ..., x36).

(4) HOG characteristic vectors normalize.

Wherein ε is the constant value of a very little, and it is 0 to avoid denominator.

(5) characteristic vector generation final HOG.Finally we can obtain an a*b*c data by above step The vector of composition, wherein a represent the number of Bin in each Cell, and b represents the number of Cell in each Block, and c represents block Middle Block number.

2. each block is calculated again to the distance at center to obtain the weight of each block, the HOG of these Weights is special Sign is trained and detected by SVM cascade grader.DPM algorithm flows are as shown in Figure 4.Finally give personage's head and shoulder region position Put.

2nd, personage's head and shoulder area image is passed into deep learning disaggregated model to be identified, and carries out energy accumulation, obtained Final result.Main the following steps：

1. personage's head and shoulder area image is passed into deep learning disaggregated model to be identified.

Three kinds are shared to each frame recognition result：Left hand is made a phone call CALL_LEFT), the right hand makes a phone call (CALL_ RIGHT), do not make a phone call (NO_CALL).The recognition result of confidence score (Conf) is obtained by deep learning model, in order that Result it is more credible, we add threshold value (Thresh), work as Conf>It is considered that result is effective when Thresh.

2. the result obtained in upper step is constantly accumulated into acquisition energy scores, and final alarm knot is obtained according to energy scores Fruit.

(1) two field picture is often handled, score (Score) fixation of target region (TargetRegion) subtracts one The result of individual value (MinusValue), this operation and deep learning is unrelated, and limitation Score minimum values are 0, i.e. Score=Max (Score-MinusValue, 0).

(2) two field picture is often handled, when result is CALL_LEFT or CALL_RIGHT, target region (TargetRegion) score (Score) increases a value (AddValue), and limitation Score maximums are 100, i.e. Score =Min (Score+AddValue, 100).

(3) final result whether alarm mainly according to judge energy scores (Score) whether exceed setting score threshold (AlarmThresh).Each frame can all judge whether Score is more than AlarmThresh, work as Score>AlarmThresh is During Ture, it is believed that when the region where Area Objects has behavior of making a phone call and starts to alarm.Once Score>AlarmThresh is False, alarm eliminate.

The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all essences in the present invention God any modification, equivalent substitution and improvements made etc., should be included in the scope of the protection with principle.

Claims

A kind of Activity recognition method 1. judge based on video makes a phone call, it is characterised in that：Specifically comprise the following steps：

(1) personage's head and shoulder region in image in area-of-interest is identified using the textural characteristics of personage's head and shoulder；

(2) personage's head and shoulder region is identified using deep learning disaggregated model, judges that present image whether there is and make a phone call Behavior；

(3) energy of making a phone call is calculated according to judged result, and finally judged.
The Activity recognition method 2. a kind of judge based on video according to claim 1 makes a phone call, it is characterised in that：Institute State and specifically included in step (1), establish personage's head and shoulder Sample Storehouse, according to the HOG features of known personage's head and shoulder sample, utilize DPM Algorithm establishes personage's head and shoulder model, is then detected in the area-of-interest of video to be detected using head and shoulder model, depending on The head and shoulder scope of personage in frequency.
The Activity recognition method 3. a kind of judge based on video according to claim 2 makes a phone call, it is characterised in that：Institute State and specifically included in step (2), foundation is made a phone call and Sample Storehouse of not making a phone call, is classified using Sample Storehouse using deep learning Model training, the head and shoulder range image of personage is identified by deep learning, judges that present image whether there is and make a phone call Behavior.
The Activity recognition method 4. a kind of judge based on video according to claim 3 makes a phone call, it is characterised in that：Institute State and specifically included in step (3), energy scores accumulation is carried out to human target according to multiple judged result, then according to can measure Divide and finally whether personage is being made a phone call to judge.
The Activity recognition method 5. a kind of judge based on video according to claim 2 makes a phone call, it is characterised in that：Institute State and establish personage's head and shoulder model using DPM algorithms and specifically include

(11) image to be checked is carried out piecemeal by DPM algorithms first, then to each piece of extraction HOG feature；

(12) each block of calculating obtains the weight of each block to the distance at center, and the HOG features of these Weights are led to Cross SVM cascade grader to be trained and detect, finally give personage's head and shoulder regional location.