CN110263728B - Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network - Google Patents

Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network Download PDF

Info

Publication number
CN110263728B
CN110263728B CN201910548528.5A CN201910548528A CN110263728B CN 110263728 B CN110263728 B CN 110263728B CN 201910548528 A CN201910548528 A CN 201910548528A CN 110263728 B CN110263728 B CN 110263728B
Authority
CN
China
Prior art keywords
video
abnormal
neural network
packet
positive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910548528.5A
Other languages
Chinese (zh)
Other versions
CN110263728A (en
Inventor
卢博文
郭文波
朱松豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Posts and Telecommunications filed Critical Nanjing University of Posts and Telecommunications
Priority to CN201910548528.5A priority Critical patent/CN110263728B/en
Publication of CN110263728A publication Critical patent/CN110263728A/en
Application granted granted Critical
Publication of CN110263728B publication Critical patent/CN110263728B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses an abnormal behavior detection method based on an improved pseudo-three-dimensional residual error neural network, which comprises the following steps: firstly, dividing each video in a training set into a plurality of video segments; secondly, inputting all video segments of a video into an improved pseudo-three-dimensional residual error neural network respectively to obtain the characteristics of the video segments; then, taking the average value of the feature vectors of all frames in each segment, and then carrying out L2 norm normalization on the feature average value to obtain the feature vector of the video segment; finally, inputting the feature vector of the video clip into a 3-layer fully-connected neural network, and outputting the abnormal score of the video clip. Experimental results show that compared with the existing method, the method further improves the accuracy of abnormal behavior detection and is more suitable for practical application.

Description

Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network
Technical Field
The invention relates to an abnormal behavior detection method in a surveillance video scene, in particular to an abnormal behavior detection method based on multi-example learning and improved pseudo three-dimensional residual error neural network, and belongs to the technical field of video analysis.
Background
The traditional video monitoring mainly depends on artificially monitoring abnormal behaviors in a scene, which not only needs extremely high labor cost, but also is easy to generate visual fatigue, and even can cause some abnormal behaviors not to be observed in time; the abnormal behavior detection and analysis aims to automatically detect abnormal behaviors in a monitoring scene through algorithms such as video signal processing and machine learning, and therefore people are helped to take corresponding measures in time; therefore, the abnormal behavior detection of the monitoring scene has very important research significance.
Early research work on abnormal behavior detection utilized low-level trajectory features to describe normal patterns, however, these methods were not robust in complex or crowded scenes with multiple occlusions due to the difficulty in obtaining reliable trajectories; in consideration of the shortcomings of the trajectory feature and the low-level spatio-temporal feature, Histogram of Oriented Gradient (HOG), Histogram of Optical Flow (HOF), and histogram of boundary (MBH) are widely used, on which basis a markov random field Model (MRF), a Social Force Model (SFM), a multi-scale histogram of optical flow (MHOF), a hybrid dynamic texture (MDT) are proposed one after another; these methods model normal behavior according to training data of the normal behavior, detect a low probability pattern as an anomaly, however, these artificially designed features hardly reflect behavior characteristics efficiently, and are computationally complex.
With the success of sparse representation and dictionary learning methods in some computer vision problems, researchers began to learn dictionaries of normal behaviors using sparse representations, and in the testing process, patterns with large reconstruction errors were considered to be abnormal behaviors; recently, researchers have learned a normal behavior model using a self-encoder based on deep learning, and detected abnormalities using reconstruction loss; the method is based on the assumption that any behavior deviating from the learned normal behavior pattern will be considered abnormal; however, this assumption may not hold because both normal and abnormal behavior have a complex diversity and the boundaries between them are sometimes ambiguous.
It is not appropriate to learn the normal behavior dictionary only from the training data of the normal behavior and detect the abnormal behavior based on the reconstruction error, and it is reasonable to utilize both the normal behavior and the abnormal behavior video data and should be performed with as little tag information as possible.
Disclosure of Invention
The invention aims to overcome the defects of an abnormal behavior detection method in the existing monitoring video scene, and provides an abnormal behavior detection method based on an improved Pseudo-three-dimensional Residual Neural Network (P3D-ResNet), which is used for improving the P3D-ResNet and learning the characteristics of a video.
The abnormal behavior detection method based on the improved pseudo-three-dimensional residual error neural network comprises the following steps:
based on a multi-example learning method, a training set only with a coarse granularity label (namely a video level label) is adopted, data preprocessing is carried out on each video in the training set, and each video is divided into a plurality of video segments;
step two, improving the structure of the P3D-ResNet network, and respectively inputting all video segments of a video into the improved P3D-ResNet to obtain a feature vector of each video frame in each video segment;
calculating the average value of the feature vectors of all frames in each video segment, and then performing L2 norm normalization on the feature average value to obtain the feature vectors of the video segment;
step four, inputting the feature vector of the video clip obtained in the step three into a 3-layer fully connected neural network (FC neural network for short), and outputting the abnormal score (i.e. the probability of belonging to the abnormality) of the video clip;
step five, drawing a Receiver Operating Characteristic Curve (ROC for short), calculating a corresponding Area Under the Curve (AUC for short), and evaluating the abnormal segments of the input video.
Further, in the first step, each video in the training set is regarded as a packet, the video with abnormal behavior is marked as a positive packet, the normal video without abnormal behavior is marked as a negative packet, the size and frame rate of each frame of the video are adjusted, and the data preprocessing method comprises the following steps: each video is divided into non-overlapping video segments having the same number of frames, each as an example in a positive or negative packet.
In a further step two, the P3D-ResNet network structure is improved in that, on the basis of the pseudo three-dimensional convolution residual neural network framework, a 3 × 3 × 3 average pooling operation is added to the shortcut connection shortcuts part, and a Batch Normalization (BN) operation is added after each convolution operation.
Further, the classifier 3-layer fully-connected neural network used in the fourth step is trained by adopting an objective function.
Further, in step four, the design step of the objective function is:
1) in a multi-example learning algorithm, the training set with video level labels can be represented as { (x) 1 ,y 1 ),…,(x i ,y i ),…,(x N ,y N ) In which x is i Is the ith in the training setBag, y i The label of the packet, N is the total number of packets in the training set; the ith packet can be represented as
Figure GDA0003745762630000031
Wherein x ik Representing a package x i The kth example of (1), n i Is a positive packet x i Total number of examples in (1). Suppose there is N in the training set a A positive packet, then x i (i∈[1,N a ]) Is a positive packet, and y i 1 ═ 1; with simultaneous training centered on N-N a A negative bag, then x j (j∈[N a +1,N]) Is a negative bag, and y j =-1;
In a standard supervised classification problem using a Support Vector Machine (SVM), the optimization objectives are:
Figure GDA0003745762630000032
wherein, the first is a risk term, the second is a regularization term, the C is a regularization constant, the m is the total number of training examples, and the y is i For each exemplary label, φ (x) i ) The feature vector of the image block or video segment, w is the weight parameter of the model, and b is the bias parameter.
2) Since the abnormal score is the probability of being abnormal, it is reasonable that the abnormal video is higher than the abnormal score of the normal video, in order to improve the accuracy of detection, it is desirable that the abnormal video segment has a higher abnormal score than the normal video segment, a sorting loss can be used, which enables the abnormal video segment to obtain a higher score than the normal segment, however, in the case of no video segment level annotation (whether the normal video segment exists in the abnormal video is unknown), a multi-example sorting loss function is needed:
Figure GDA0003745762630000033
wherein x ik And x jl Representing abnormal video segments and normal video segments, respectively, f (x) ik ) And f (x) jl ) Respectively represent corresponding abnormal scores, and the value range is [0,1 ]]。
3) The example with the highest anomaly score in the positive packet is most likely a true positive example (anomalous segment), and the example with the highest anomaly score in the negative packet is actually a normal example, but the negative example is likely to be erroneously detected, and is therefore referred to as a difficult example in anomaly detection. To solve the above problem, we do not order every example in a package, but force to order only the two examples with the highest anomaly score in positive and negative packages, in order to make the positive example with the highest anomaly score and the negative example with the highest anomaly score differ greatly in anomaly score, an ordering loss function in the form of change-loss is used:
Figure GDA0003745762630000041
4) the above ordering penalty function alone is not sufficient and also takes into account the temporal structure of the video. First, since the segment sequence is continuous, the outlier score between two adjacent video segments should be relatively smooth. In this regard, we minimize the outlier score difference of neighboring video segments by adding a timing smoothness constraint. Second, since the time range in which abnormal behavior occurs in a video is usually relatively small, the score of the positive example (video clip) should be sparse; in this regard, we make the exceptional score of the video segment sparse by adding sparsity constraint; combining the smoothness constraint and the sparsity constraint to obtain a complete objective function as follows:
Figure GDA0003745762630000042
wherein, the first one is a time sequence smooth constraint term, the second one is a sparse constraint term, the third one is a regularization term,
Figure GDA0003745762630000043
representing the model weight, λ 1 、λ 2 And λ 3 Respectively representAnd the time sequence smooth constraint term coefficient, the sparse constraint term coefficient and the regularization term coefficient, and other variables are the same as the formula (2). In MIL ordering loss, the error is propagated backwards from the video segment with the highest score among the positive and negative packets.
Further, in the fifth step, the classifier is used to obtain the abnormal probability of each video clip, and then an ROC curve is drawn, so that the performance of the abnormal behavior detection method can be evaluated, and the specific method is as follows:
1) sorting the video clips in a descending order according to the abnormal probability, and meanwhile, taking the maximum abnormal probability as a threshold, classifying the examples which are greater than or equal to the threshold into a positive class, otherwise, classifying the examples into a negative class;
2) calculating false positive rate and true positive rate according to the classification result, and respectively taking the false positive rate and the true positive rate as an abscissa and an ordinate to obtain a point of a coordinate axis;
3) setting the classification threshold values to the predicted values of other examples in sequence, and obtaining a series of points on coordinate axes according to the method;
4) and connecting all coordinate points into an ROC curve, wherein the corresponding area under the ROC curve is AUC.
The invention has the beneficial effects that: the method only uses the weakly labeled training data, not only utilizes the video data of the normal behavior, but also utilizes the video data of the abnormal behavior, improves the accuracy rate of the abnormal behavior detection, and is more suitable for practical application.
Drawings
In order that the manner in which the present invention is attained and can be understood in detail, a more particular description of the invention briefly summarized above may be had by reference to the embodiments thereof which are illustrated in the appended drawings.
FIG. 1 is a flow chart of a method for detecting abnormal behavior as set forth herein;
FIG. 2 is a diagram of the structure of 3 pseudo three-dimensional residual blocks in P3D-ResNet;
FIG. 3 is a structural diagram of a modified P3D-ResNet box;
FIG. 4 is an ROC curve of the present application and two other prior art three-dimensional neural network framework based methods and reference methods; wherein, the method is expressed by a method, a method based on 3D CNN is expressed by a method, a method based on 3D ResNet-34 is expressed by a method, and a method based on a two-classification support vector machine is expressed by a method.
Detailed Description
The invention aims to provide an abnormal behavior detection method in a monitoring video scene, in particular to an abnormal behavior detection method based on multi-example learning and an improved pseudo three-dimensional residual error neural network, so as to enhance the monitoring capability and improve the public safety.
As shown in FIG. 1, the method comprises the following steps:
firstly, dividing each video in a training set into a plurality of video segments, and inputting improved P3D-ResNet to obtain the characteristics of the video segments; then, taking the feature average value of all frames in each video segment, and then carrying out L2 norm normalization on the feature average value so as to obtain the features of the video; finally, inputting the characteristics into a full-connection neural network of 3 layers, the abnormal score of the video clip is output.
As shown in FIG. 2, the performance of a pseudo three-dimensional residual neural network (P3D-ResNet) formed by sequentially mixing three block structures of a pseudo three-dimensional serial shortcut residual block (P3D-A), a pseudo three-dimensional parallel shortcut residual block (P3D-B) and a pseudo three-dimensional serial parallel shortcut residual block (P3D-C) is superior to that of three variant networks P3D-A ResNet, P3D-BResNet and P3D-C ResNet formed by a single block structure, so that the pseudo three-dimensional convolutional residual neural network with diversified block structures is obtained by sequentially replacing 2-dimensional block structures of ResNet-50 with 3 pseudo three-dimensional residual block structures; on the basis of a pseudo three-dimensional residual neural network framework, 3 × 3 × 3 average pooling operation is added to the shortcut connection shortcuts part, batch normalization operation is added after each convolution operation, and a schematic diagram of improved P3D-ResNet is shown in FIG. 3.
The method specifically comprises the following steps:
1. preparation of Experimental data
The abnormal behaviors in the past training sets are single, and some training sets are formed by performing and recording people at a certain position and cannot reflect the situation of a real scene monitored by a video.
Due to the limitations of the previous training set, our method was evaluated using a new large-scale training set of video-level labels UCF-analog-Detection-Dataset. The training set consists of uncut monitoring videos, the total duration is long, 1900 videos are provided, and the number of the normal videos and the number of the abnormal videos are 950. The training set covers abnormal events of 13 real scenes, including human or animals abuse, suspect arrest, fire, blow, traffic accidents, burglary, explosion, fighting, robbery, gunshot events, stealing, shoplifting, property destruction and the like.
The training set is divided into two parts: the training set comprises 1610 videos (800 normal videos and 810 abnormal videos), and the test set comprises 290 videos (150 normal videos and 140 abnormal videos)
2. Experimental details settings
All video frame pixels are uniformly adjusted to 240 multiplied by 320, the frame rate is uniformly adjusted to 30 frames per second, in order to extract pseudo three-dimensional characteristics, each video is divided into a plurality of video segments with the length of 16 frames according to the video length, and 8 frames are overlapped between every two continuous segments.
Firstly, dividing input video data into a plurality of video segments; secondly, inputting the video clips into an improved pseudo three-dimensional residual error neural network to obtain characteristics; then, taking the feature average value of all frames in each video clip; then, performing L2 norm normalization on the feature mean value to obtain feature representation of the input video data; and finally, inputting the obtained characteristics into a 3-layer fully-connected neural network to realize the detection of abnormal segments of the input video data.
The FC layer (full connection layer) of the first layer has 512 units, the FC layers of the second layer and the third layer have 32 units and 1 unit respectively, and Dropout regularization of 50% is used among the FC layers. In this embodiment, an ELU activation function and a Swish activation function are used for the first and last fully-connected layers, respectively, and an initial learning rate of 0.001, β is used 1 =0.9,β 2 =0.999,ε=1×10 -8 The Adam optimizer of (1); to obtain the best performance, in the ordering loss of multiple instance learning, the timing sequence is setSetting the coefficients of the smooth constraint term, the sparse constraint term and the regularization term as lambda 1 =λ 2 =8×10 -5 And λ 3 =0.01。
3. Selection of evaluation index
The common evaluation index in abnormal behavior detection is the characteristic curve (ROC) of the operation of a subject and the corresponding Area (AUC) under the curve.
The horizontal axis of the ROC curve is False Positive Rate (FPR), and the vertical axis is True Positive Rate (TPR):
Figure GDA0003745762630000071
wherein TP, FN, FP, TN respectively represent the number of samples corresponding to four cases of true positive (true positive), false positive (false positive), true negative (true negative), and false negative (false negative). The closer the ROC curve is to the upper part, the larger the area AUC under the curve is, the higher the detection accuracy is, and otherwise, the lower the detection accuracy is.
And after the classifier is utilized to obtain the abnormal probability of all video segments, an ROC curve is drawn and the AUC is calculated, so that the performance of the abnormal behavior detection method can be evaluated. The specific method comprises the following steps: 1) sorting the video clips in a descending order according to the abnormal probability, and meanwhile, taking the maximum abnormal probability as a threshold, classifying the examples which are greater than or equal to the threshold into a positive class, otherwise, classifying the examples into a negative class; 2) calculating false positive rate and true positive rate according to the classification result, and respectively taking the false positive rate and the true positive rate as an abscissa and an ordinate to obtain a point of a coordinate axis; 3) setting the classification threshold values as predicted values of other examples in a descending order, and obtaining points on a series of coordinate axes according to the method; 4) and connecting all coordinate points into an ROC curve, and calculating the corresponding area AUC under the ROC curve.
The abnormal behavior detection is performed on the UCF data set together with the other two existing three-dimensional neural network-based methods and the reference method, and an ROC curve is drawn for performance evaluation, as shown in FIG. 4. In fig. 4, the method based on the improved pseudo three-dimensional residual error neural network proposed by the present application is: based on video-level label data, a behavior mode is learned through a deep multi-instance method, an improved pseudo three-dimensional residual neural network is used as a feature extractor, a three-layer fully-connected neural network which is regularized by 50% Dropout is used as a classifier, and therefore abnormal behavior detection is achieved;
and the method is a three-dimensional convolution-based fully-connected neural network (3D CNN): based on video level label data, a 3-dimensional convolutional neural network is used as a feature extractor, and a 3-layer fully-connected neural network which is normalized by using 50% of Dropout is used as a classifier, so that abnormal behaviors are detected;
③ is a three-dimensional residual error neural network (3D ResNet-34) method based on 34 layers: 3DRESNet-34 is used as a feature extractor, and a linear Support Vector Machine (SVM) is used as a classifier, so that the detection of abnormal behaviors is realized;
and fourthly, directly using a binary support vector machine (binary SVM) classifier to detect abnormal behaviors, and using the method as a reference method.
The application carries out quantitative comparison of the abnormal behavior detection effect with the existing method based on the three-dimensional neural network framework, and the following table 1 shows:
TABLE 1
Method Acc. AUC
①P3D-ResNet(ours) 89.2 94.6
②C3D+FC 76.6 86.5
③3D ResNet-34 68.9 72.7
④Binary classifier 50.0 55.5
As can be seen from the experimental results of fig. 4 and table 1, the method of the present application has the best abnormal behavior detection effect compared to the other three methods.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and all equivalent variations made by using the contents of the present specification and the drawings are within the protection scope of the present invention.

Claims (4)

1. The abnormal behavior detection method based on the improved pseudo-three-dimensional residual error neural network is characterized by comprising the following steps of:
based on a multi-example learning method, adopting a video level label training set only with a coarse granularity label to perform data preprocessing on each video in the training set, and dividing each video into a plurality of video segments;
step two, improving the structure of the P3D-ResNet network, and respectively inputting all video segments of a video into the improved P3D-ResNet to obtain the feature vector of each video frame in each video segment;
the improvement of the P3D-ResNet network structure is that on the basis of a pseudo three-dimensional convolution residual neural network framework, 3 multiplied by 3 average pooling operation is added to a shortcut connection short part, and batch normalization operation is added after each convolution operation;
calculating the average value of the feature vectors of all frames in each video segment, and then performing L2 norm normalization on the feature average value to obtain the feature vectors of the video segment;
step four, inputting the feature vector of the video clip obtained in the step three into a 3-layer fully-connected neural network, and outputting the abnormal score of the video clip;
in the fourth step, the design steps of the objective function are as follows:
1) in a multi-example learning algorithm, a training set with video level labels is represented as { (x) 1 ,y 1 ),…,(x i ,y i ),…,(x N ,y N ) In which x is i Is the ith packet, y, in the training set i For the label of the packet, N is the total number of packets in the training set, and the ith packet can be expressed as
Figure FDA0003745762620000011
Wherein x ik Representing a package x i The kth example of (1), n i Is a positive packet x i Total number of examples in (1); suppose there is N in the training set a A positive packet, then x i ,i∈[1,N a ]Is a positive packet, and y i 1 ═ 1; with simultaneous training centered on N-N a A negative bag, then x j ,j∈[N a +1,N]Is a negative bag, and y j =-1;
In the standard supervised classification problem using the support vector machine SVM, the optimization objectives that can be obtained are:
Figure FDA0003745762620000012
wherein, the risk term, the regularization constant, the total number of training examples, and the regularization constant are respectively defined as i For each exemplary label, φ (x) i ) The characteristic vector of the image block or the video clip, w is the weight parameter of the model, and b is the offset parameter;
2) using the ordering loss, abnormal video segments get a higher abnormal score than normal segments, considering that without video segment level annotation, a multi-example ordering loss function is used:
Figure FDA0003745762620000021
wherein x ik And x jl Representing abnormal video segments and normal video segments, respectively, f (x) ik ) And f (x) jl ) Respectively represent corresponding abnormal scores, and the value range is [0,1 ]];
3) Instead of ordering each instance in a packet, it is forced to order only the two instances with the highest anomaly score in the positive and negative packets, and in order for the positive instance with the highest anomaly score to differ greatly in anomaly score from the negative instance with the highest anomaly score, an ordering penalty function in the form of change-loss is used:
Figure FDA0003745762620000022
4) the time sequence structure of the video is also considered; firstly, since the segment sequence is continuous, the abnormal score between two adjacent video segments is smooth, and the abnormal score difference of the adjacent video segments is minimized by adding a timing smoothness constraint; secondly, as the time range of abnormal behaviors in the video is small, the scores of examples in the positive bag are sparse, and the abnormal scores of video segments have sparsity by adding sparsity constraint; and combining the smoothness constraint and the sparsity constraint to obtain a complete objective function:
Figure FDA0003745762620000023
wherein, the first one is a time sequence smooth constraint term, the second one is a sparse constraint term, the third one is a regularization term,
Figure FDA0003745762620000024
representing the model weight, λ 1 、λ 2 And λ 3 Respectively representing a time sequence smooth constraint term coefficient, a sparse constraint term coefficient and a regularization term coefficient;
and step five, drawing an operation characteristic curve of the subject, calculating a corresponding area under the curve, and evaluating an abnormal fragment of the input video.
2. The improved detection method for abnormal behaviors based on pseudo-three-dimensional residual error neural network of claim 1, wherein in step one, each video in the training set is regarded as a packet, the video with abnormal behaviors is marked as a positive packet, the normal video without abnormal behaviors is marked as a negative packet, the size and frame rate of each frame of the video are adjusted, and the data preprocessing method comprises: each video is divided into video segments with the same number of frames with overlap between neighbors, each as an example in a positive or negative packet.
3. The improved method for detecting abnormal behavior based on pseudo-three-dimensional residual error neural network as claimed in claim 1, wherein the 3-layer fully-connected neural network used in step four is trained with an objective function.
4. The improved abnormal behavior detection method based on the pseudo-three-dimensional residual error neural network according to claim 1, wherein in step five, the classifier is used to obtain the abnormal probability of each video segment, and an ROC curve is drawn, and the specific method is as follows:
1) sorting the video clips in a descending order according to the abnormal probability, and meanwhile, taking the maximum abnormal probability as a threshold, classifying the examples which are greater than or equal to the threshold into a positive class, otherwise, classifying the examples into a negative class;
2) calculating false positive rate and true positive rate according to the classification result, and respectively taking the false positive rate and the true positive rate as an abscissa and an ordinate to obtain a point of a coordinate axis;
3) sequentially setting the classification threshold values to the predicted values of other examples, and obtaining a series of points on coordinate axes according to the method;
4) and connecting all coordinate points into an ROC curve, and calculating the corresponding area AUC under the ROC curve.
CN201910548528.5A 2019-06-24 2019-06-24 Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network Active CN110263728B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910548528.5A CN110263728B (en) 2019-06-24 2019-06-24 Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910548528.5A CN110263728B (en) 2019-06-24 2019-06-24 Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network

Publications (2)

Publication Number Publication Date
CN110263728A CN110263728A (en) 2019-09-20
CN110263728B true CN110263728B (en) 2022-08-19

Family

ID=67920872

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910548528.5A Active CN110263728B (en) 2019-06-24 2019-06-24 Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network

Country Status (1)

Country Link
CN (1) CN110263728B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111160117A (en) * 2019-12-11 2020-05-15 青岛联合创智科技有限公司 Abnormal behavior detection method based on multi-example learning modeling
CN111159961B (en) * 2019-12-13 2024-02-02 杭州广立微电子股份有限公司 Anomaly detection method and system based on curve data
CN111291699B (en) * 2020-02-19 2022-06-03 山东大学 Substation personnel behavior identification method based on monitoring video time sequence action positioning and abnormity detection
CN111178344B (en) * 2020-04-15 2020-07-17 中国人民解放军国防科技大学 Multi-scale time sequence behavior identification method
CN111814548B (en) * 2020-06-03 2022-12-09 中铁第四勘察设计院集团有限公司 Abnormal behavior detection method and device
CN111739492B (en) * 2020-06-18 2023-07-11 南京邮电大学 Music melody generation method based on pitch contour curve
CN111814653B (en) * 2020-07-02 2024-04-05 苏州交驰人工智能研究院有限公司 Method, device, equipment and storage medium for detecting abnormal behavior in video
CN112257545A (en) * 2020-10-19 2021-01-22 安徽领云物联科技有限公司 Violation real-time monitoring and analyzing method and device and storage medium
CN113312968B (en) * 2021-04-23 2024-03-12 上海海事大学 Real abnormality detection method in monitoring video
CN113627386A (en) * 2021-08-30 2021-11-09 山东新一代信息产业技术研究院有限公司 Visual video abnormity detection method
CN114724061A (en) * 2022-03-24 2022-07-08 南京邮电大学 Anomaly detection method based on visual language bidirectional code converter and sequencing loss function

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108734095A (en) * 2018-04-10 2018-11-02 南京航空航天大学 A kind of motion detection method based on 3D convolutional neural networks
CN109117774A (en) * 2018-08-01 2019-01-01 广东工业大学 A kind of multi-angle video method for detecting abnormality based on sparse coding
CN109635790A (en) * 2019-01-28 2019-04-16 杭州电子科技大学 A kind of pedestrian's abnormal behaviour recognition methods based on 3D convolution
CN109670446A (en) * 2018-12-20 2019-04-23 泉州装备制造研究所 Anomaly detection method based on linear dynamic system and depth network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108734095A (en) * 2018-04-10 2018-11-02 南京航空航天大学 A kind of motion detection method based on 3D convolutional neural networks
CN109117774A (en) * 2018-08-01 2019-01-01 广东工业大学 A kind of multi-angle video method for detecting abnormality based on sparse coding
CN109670446A (en) * 2018-12-20 2019-04-23 泉州装备制造研究所 Anomaly detection method based on linear dynamic system and depth network
CN109635790A (en) * 2019-01-28 2019-04-16 杭州电子科技大学 A kind of pedestrian's abnormal behaviour recognition methods based on 3D convolution

Also Published As

Publication number Publication date
CN110263728A (en) 2019-09-20

Similar Documents

Publication Publication Date Title
CN110263728B (en) Abnormal behavior detection method based on improved pseudo-three-dimensional residual error neural network
Gong et al. A Real‐Time Fire Detection Method from Video with Multifeature Fusion
CN103839065B (en) Extraction method for dynamic crowd gathering characteristics
CN111079674B (en) Target detection method based on global and local information fusion
CN107909027B (en) Rapid human body target detection method with shielding treatment
Li et al. Attention-based anomaly detection in multi-view surveillance videos
CN104063719A (en) Method and device for pedestrian detection based on depth convolutional network
CN111062278B (en) Abnormal behavior identification method based on improved residual error network
CN109902612B (en) Monitoring video abnormity detection method based on unsupervised learning
CN115527269B (en) Intelligent human body posture image recognition method and system
CN113011322A (en) Detection model training method and detection method for specific abnormal behaviors of monitoring video
Ansari et al. An expert video surveillance system to identify and mitigate shoplifting in megastores
Altowairqi et al. A Review of the Recent Progress on Crowd Anomaly Detection
CN117409347A (en) ESNN-based early fire detection method
Yu et al. Tca-vad: temporal context alignment network for weakly supervised video anomly detection
CN113312968B (en) Real abnormality detection method in monitoring video
JP7006724B2 (en) Classification device, classification method, and program
CN114677638A (en) Detection method based on deep learning and abnormal clustering of clustered people
CN115457652A (en) Pedestrian re-identification method and device based on semi-supervised learning and storage medium
Cheng Research on Application of the Feature Transfer Method Based on Fast R‐CNN in Smoke Image Recognition
Xu et al. An intra-frame classification network for video anomaly detection and localization
Luo et al. Graph convolutional neural network for skeleton-based video abnormal behavior detection
Jin et al. Optical flow and spatio-temporal gradient based abnormal behavior detection
Bao et al. Multiobjects Association and Abnormal Behavior Detection for Massive Data Analysis in Multisensor Monitoring Network
Xu et al. Abnormal behavior detection in crowd scenes

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant