CN111950363B - Video anomaly detection method based on open data filtering and domain adaptation - Google Patents

Video anomaly detection method based on open data filtering and domain adaptation

Info

Publication number
CN111950363B
Authority
CN
China
Prior art keywords
data
open
visible
training
test
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010646631.6A
Other languages
Chinese (zh)
Other versions
CN111950363A (en)
Inventor
张晨 (Chen Zhang)
李国荣 (Guorong Li)
苏荔 (Li Su)
黄庆明 (Qingming Huang)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Chinese Academy of Sciences
Original Assignee
University of Chinese Academy of Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Chinese Academy of Sciences filed Critical University of Chinese Academy of Sciences
Priority to CN202010646631.6A priority Critical patent/CN111950363B/en
Publication of CN111950363A publication Critical patent/CN111950363A/en
Application granted granted Critical
Publication of CN111950363B publication Critical patent/CN111950363B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G06V20/41 Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/23213 Non-hierarchical clustering techniques using statistics or function optimisation, e.g. modelling of probability density functions, with fixed number of clusters, e.g. K-means clustering
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to the technical field of computer vision applications, in particular to a video anomaly detection method based on open data filtering and domain adaptation. The method explicitly detects open normal events and open abnormal events in the data set and processes open data and visible data with different classifiers, thereby reducing the influence of open data on anomaly detection and markedly improving anomaly detection performance. First, the method trains a classifier for the visible data in the test data directly on the training videos. Then, in the testing phase, an open data filtering module divides the test data into visible data and open data. Finally, for the visible data, the visible data classifier directly generates anomaly scores; for the open test data, a domain adaptation method is used to reduce the distribution difference between the open test data and the training data, and a classifier is then retrained to generate anomaly scores.

Description

Video anomaly detection method based on open data filtering and domain adaptation
Technical Field
The invention relates to the technical field of computer vision, in particular to a video anomaly detection method based on open data filtering and domain adaptation.
Background
Anomaly detection is a key technology for intelligent video surveillance and can automatically detect abnormal events in videos. Existing video anomaly detection methods can be divided into three categories according to the supervision mode: semi-supervised, unsupervised and supervised. Semi-supervised anomaly detection methods train a detection model using only normal data and, at test time, determine events that do not conform to the characteristics of the normal data to be abnormal. Unsupervised anomaly detection methods detect anomalies by estimating the distinguishability of each frame with reference only to the video context, without using known normal data; if a frame can easily be distinguished from the other frames in the same video, it is marked as an abnormal frame. Supervised anomaly detection methods train an anomaly detection model with a large amount of normal data and a small amount of abnormal data, treating anomaly detection as an imbalanced binary classification problem.
Existing methods typically use the difference between an abnormal event and a normal event to measure whether a frame in a video contains an abnormal event. However, due to the diversity and openness of events in video, it is not realistic to collect all possible normal events, and normal event types in the test set that were not seen during training may also differ considerably from the known normal events in the training set. Such unseen normal events will not satisfy the characteristics of the known normal events and are therefore highly likely to be misjudged as abnormal events, resulting in a high false positive rate.
Disclosure of Invention
In order to solve the above technical problem, the invention provides a supervised video anomaly detection method based on open data filtering and domain adaptation, which measures whether anomalies exist in video frames and improves video anomaly detection performance.
The invention discloses a video anomaly detection method based on open data filtering and domain adaptation, which comprises the following steps:
extracting appearance features of all frames of the training and test videos using a vgg-f model trained on ImageNet, and training a classifier for the visible data with the features of the training videos;
designing an open data filter for dividing the test data into visible data and open data; the filter first divides the known training data into several classes by k-means clustering, then takes the minimum distance between a test sample and the class centers of the known training data as its open probability, and finally, after the open probabilities of all frames of a test video are obtained, determines the optimal filtering ratio by a U-test to obtain the final visible data and open data;
meanwhile, introducing a Joint Distribution Adaptation (JDA) method to reduce the distribution difference between the open data and the training data and obtain new feature representations of both, and training a classifier for the open data with the new training data feature representation;
generating anomaly scores for the visible data in the test set with the visible data classifier, generating anomaly scores for the open data in the test set with the open data classifier, and combining the anomaly scores to obtain the detection result for the whole test video.
The beneficial effects of the invention are as follows: when performing anomaly detection on a video, the method considers not only the difference between abnormal events and known normal events, but also the difference between open normal events and known normal events, so the problem of unseen normal events being misjudged as abnormal events can be alleviated and the anomaly detection performance improved; the open data filter effectively divides the test data into visible data and open data, and different classifiers are used to obtain the anomaly scores of the two kinds of data, which effectively reduces the influence of open events in the test set on the anomaly detection performance.
Drawings
FIG. 1 is a schematic of the present invention.
Detailed Description
The following detailed description of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the present invention, but are not intended to limit the scope of the present invention.
Constructing the training set and the test set: a certain proportion of abnormal frames is selected from the test set and added to the training set, so that a training set containing both normal and abnormal events is obtained, while the test set also contains a certain proportion of open events;
in the training stage, the pre-training neural network is used to extract the features of the training video, and the visible data classifier is trained by using the features.
In the testing stage, after the features of the test videos are obtained with the pre-trained neural network, the open data filter divides them into visible data and open data. The filter first divides the known training data into several classes by k-means clustering, and then takes the minimum distance between a test sample and the class centers of the known training data as its open probability:
p_open(x_i^j) = min_n || x_i^j - C_n ||
where x_i^j denotes the j-th frame of the i-th video, X_train denotes the entire training video set, and C_n denotes the n-th class center of the training data X_train.
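A minimal sketch of this open-probability computation, assuming features are stored as numpy arrays with one row per frame; the number of k-means clusters is not specified in the patent and is an assumed hyperparameter here.
```python
import numpy as np
from sklearn.cluster import KMeans

def open_probability(X_train, X_test, n_clusters=10):
    """Open probability of each test frame = distance to its nearest k-means center
    of the training features (n_clusters is an assumed hyperparameter)."""
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0).fit(X_train)
    # km.transform returns the distance of each sample to every class center;
    # the minimum over centers is p_open(x_i^j)
    return km.transform(X_test).min(axis=1)
```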
After the open probabilities of all the frames of the test video are obtained, the optimal filtering ratio is determined by a U-test:
r* = argmax_r ( P_nor(r) + P_abnor(r) )
where P_nor(r) denotes the P value between the mean of the visible data in the test data and the mean of the normal data in the training data at filtering ratio r, and P_abnor(r) denotes the P value between the mean of the visible data in the test data and the mean of the abnormal data in the training data at filtering ratio r. The larger the sum of the two P values, the smaller the difference between the training data and the visible data obtained after filtering the open data out of the test data, so the optimal split into visible data and open data is obtained.
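The ratio search could look like the sketch below, which uses the Mann-Whitney U test from scipy as the U-test. Reducing each frame's feature vector to its mean so that the two groups can be compared as one-dimensional samples, and the candidate ratio grid, are assumptions made only for illustration.
```python
import numpy as np
from scipy.stats import mannwhitneyu

def best_filter_ratio(p_open, X_test, X_train_normal, X_train_abnormal,
                      ratios=np.arange(0.05, 0.55, 0.05)):
    """Pick the filtering ratio r maximizing P_nor(r) + P_abnor(r).
    Frames with the highest open probability are treated as open data."""
    best_r, best_sum = ratios[0], -1.0
    for r in ratios:
        # keep the (1 - r) fraction of test frames with the lowest open probability
        keep = np.argsort(p_open)[: int(len(p_open) * (1 - r))]
        vis = X_test[keep].mean(axis=1)              # one scalar per visible frame
        p_nor = mannwhitneyu(vis, X_train_normal.mean(axis=1)).pvalue
        p_abnor = mannwhitneyu(vis, X_train_abnormal.mean(axis=1)).pvalue
        if p_nor + p_abnor > best_sum:
            best_r, best_sum = r, p_nor + p_abnor
    return best_r
```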
Since the sigmoid function smoothly maps real numbers to [0,1], its output can be used as the probability that a test frame is anomalous. For the visible data, the probability that a test frame belongs to the abnormal class, as output by the visible data classifier, is taken directly as the anomaly score:
s_vis(x_i^j) = sigmoid( g_s(x_i^j; θ_s) )
where g_s denotes the visible data classifier, θ_s denotes the parameters of the visible data classifier, and x_i^j denotes the j-th frame of the i-th video among the visible test data.
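With the class-weighted logistic regression stand-in from the earlier sketch, this scoring step reduces to reading off the abnormal-class probability (predict_proba already applies the sigmoid); column 1 being the abnormal class is an assumption of that stand-in.
```python
def visible_scores(clf, X_visible):
    """Anomaly scores for visible test frames: probability of the abnormal class
    from the visible-data classifier returned by train_visible_classifier."""
    return clf.predict_proba(X_visible)[:, 1]
```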
For the open data, a joint distribution adaptation method is first adopted to reduce the distribution difference between the open data and the training data and to obtain new feature representations of both:
X_train_new = A^T X_s
X_open_new = A^T X_t
where A denotes the optimal transformation matrix obtained by the joint distribution adaptation method, X_s is the feature representation of the source training data, and X_t is the feature representation of the open data.
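The transformation matrix A is the output of the JDA optimisation. The sketch below learns a simplified, marginal-distribution-only variant (closer to TCA than to full JDA): it minimises the MMD between source and open features in the projected space via a generalized eigenproblem. Full JDA additionally iterates with pseudo-labels on the target data to align class-conditional distributions; that refinement is omitted here, and the hyperparameters dim and lam are assumptions.
```python
import numpy as np
import scipy.linalg

def adaptation_matrix(Xs, Xt, dim=64, lam=1.0):
    """Simplified marginal-only adaptation: learn A that minimises the MMD between
    source (Xs) and open/target (Xt) features after projection."""
    X = np.vstack([Xs, Xt]).T                       # d x (ns + nt), columns are samples
    ns, nt = len(Xs), len(Xt)
    n, d = ns + nt, X.shape[0]
    e = np.vstack([np.full((ns, 1), 1.0 / ns), np.full((nt, 1), -1.0 / nt)])
    M = e @ e.T                                     # marginal MMD matrix
    H = np.eye(n) - np.ones((n, n)) / n             # centering matrix
    a = X @ M @ X.T + lam * np.eye(d)
    b = X @ H @ X.T + 1e-6 * np.eye(d)              # small ridge keeps b positive definite
    w, V = scipy.linalg.eigh(a, b)                  # generalized eigenproblem, ascending w
    return V[:, :dim]                               # smallest eigenvalues give the projection

# With samples stored as rows, X_train_new = Xs @ A and X_open_new = Xt @ A,
# which is the row-vector form of X_train_new = A^T X_s and X_open_new = A^T X_t above.
```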
The open data classifier is then trained on the new feature representation of the training data and used to generate anomaly scores for the open data:
s_open(x̃_i^j) = sigmoid( g_n(x̃_i^j; θ_n) )
where g_n denotes the open data classifier, θ_n denotes the parameters of the open data classifier, and x̃_i^j is the new feature representation of the j-th frame of the i-th video.
Finally, the anomaly scores of the visible data and the open data are combined to obtain the anomaly detection result for the whole test video.
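The combination step is not spelled out beyond merging the two score sets; a minimal sketch that simply writes each score back to its original frame position could look like this (the index bookkeeping is an assumption):
```python
import numpy as np

def full_video_scores(n_frames, vis_idx, vis_scores, open_idx, open_scores):
    """Merge visible-data and open-data anomaly scores back into the original
    frame order of the test video."""
    scores = np.empty(n_frames, dtype=float)
    scores[vis_idx] = vis_scores
    scores[open_idx] = open_scores
    return scores
```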
The above description is only a preferred embodiment of the present invention. It should be noted that those skilled in the art can make various improvements and modifications without departing from the technical principle of the present invention, and such improvements and modifications should also be considered to fall within the protection scope of the present invention.

Claims (5)

1. A video anomaly detection method based on open data filtering and domain adaptation is characterized by comprising the following steps:
(1) Extracting data features: extracting appearance features of the training data and the test data with a pre-trained neural network, and training a visible data classifier with the features of the training data;
(2) Open data filtering: analyzing the features of the test data with an open data filter and dividing them into visible data and open data;
the filter first divides the known training data into several classes by k-means clustering, and then takes the minimum distance between a test sample and the class centers of the known training data as its open probability:
p_open(x_i^j) = min_n || x_i^j - C_n ||
wherein x_i^j represents the j-th frame of the i-th video, X_train represents the entire training video, and C_n represents the n-th class center of the training data;
after obtaining the open probability of all the frames of the test video, determining the optimal filtering proportion by a U-test method:
r* = argmax_r ( P_nor(r) + P_abnor(r) )
wherein P_nor(r) represents the P value between the mean of the visible data in the test data and the mean of the normal data in the training data at filtering ratio r, and P_abnor(r) represents the P value between the mean of the visible data in the test data and the mean of the abnormal data in the training data at filtering ratio r; the larger the sum of the two P values, the smaller the difference between the training data and the visible data obtained after filtering the open data out of the test data, so that the visible data and the open data with the optimal ratio are obtained;
(3) Open data domain adaptation: using Joint Distribution Adaptation (JDA) to reduce the distribution difference between the open data and the training data and obtain new feature representations of the open data and the training data:
X_train_new = A^T X_s
X_open_new = A^T X_t
wherein A represents the optimal transformation matrix obtained by the joint distribution adaptation method, X_s is the feature representation of the source training data, and X_t is the feature representation of the open data;
then, the open data classifier is trained with the new feature representation of the training data, and the classifier is used to generate anomaly scores for the open data:
s_open(x̃_i^j) = sigmoid( g_n(x̃_i^j; θ_n) )
wherein g_n represents the open data classifier, θ_n represents the parameters of the open data classifier, and x̃_i^j is the new feature representation of the j-th frame of the i-th video;
(4) Anomaly scoring: for the visible data in the test set, obtaining the probability of belonging to the abnormal class with the visible data classifier and taking it as the anomaly score of the visible data; and for the open data in the test set, obtaining the probability of belonging to the abnormal class with the open data classifier and taking it as the anomaly score of the open data.
2. The method for detecting video abnormality based on open data filtering and domain adaptation of claim 1, wherein the pre-trained neural network is a vgg-f model trained on ImageNet.
3. The method for detecting video anomaly based on open data filtering and domain adaptation according to claim 1, wherein the visible data classifier and the open data classifier are both focal loss based unbalanced classifiers.
4. The method for detecting video anomaly based on open data filtering and domain adaptation as claimed in claim 1, wherein said open data filter is implemented based on k-means method and U-test method.
5. The method for detecting video anomaly based on open data filtering and domain adaptation as claimed in claim 1, wherein said data is a frame of a video.
CN202010646631.6A 2020-07-07 2020-07-07 Video anomaly detection method based on open data filtering and domain adaptation Active CN111950363B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010646631.6A CN111950363B (en) 2020-07-07 2020-07-07 Video anomaly detection method based on open data filtering and domain adaptation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010646631.6A CN111950363B (en) 2020-07-07 2020-07-07 Video anomaly detection method based on open data filtering and domain adaptation

Publications (2)

Publication Number Publication Date
CN111950363A CN111950363A (en) 2020-11-17
CN111950363B (en) 2022-11-29

Family

ID=73340363

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010646631.6A Active CN111950363B (en) 2020-07-07 2020-07-07 Video anomaly detection method based on open data filtering and domain adaptation

Country Status (1)

Country Link
CN (1) CN111950363B (en)


Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107133654A (en) * 2017-05-25 2017-09-05 Dalian University of Technology A method for abnormal event detection in surveillance video
WO2020135392A1 (en) * 2018-12-24 2020-07-02 Hangzhou Hikvision Digital Technology Co., Ltd. Method and device for detecting abnormal behavior
CN111079539A (en) * 2019-11-19 2020-04-28 South China University of Technology Video abnormal behavior detection method based on abnormal tracking

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Rate-Distortion Optimization for Video Coding Incorporating Visual Perception Characteristics; Wang Qian et al.; Journal of Computer-Aided Design & Computer Graphics; 2015-10-15; Vol. 27, No. 10; full text *
Spatiotemporal Representation Learning for Video Anomaly Detection; Zhaoyan Li; IEEE Access; 2020-01-30; Vol. 8; full text *
No-Reference Video Quality Assessment Method Based on Spatiotemporal Fusion of Convolutional Neural Networks; Wang Chunfeng et al.; Journal of University of Chinese Academy of Sciences; 2018-07-15; Vol. 35, No. 4; full text *

Also Published As

Publication number Publication date
CN111950363A (en) 2020-11-17

Similar Documents

Publication Publication Date Title
CN113554089B (en) Image classification countermeasure sample defense method and system and data processing terminal
CN110132598B (en) Fault noise diagnosis algorithm for rolling bearing of rotating equipment
CN111680614B (en) Abnormal behavior detection method based on video monitoring
CN112087442B (en) Time sequence related network intrusion detection method based on attention mechanism
CN107368854A (en) A kind of circuit breaker failure diagnostic method based on improvement evidence theory
CN111783534B (en) Sleep stage method based on deep learning
CN111507371A (en) Method and apparatus
CN109543760A (en) Confrontation sample testing method based on image filters algorithm
CN111652290A (en) Detection method and device for confrontation sample
CN111414969B (en) Smoke detection method in foggy environment
CN116192500A (en) Malicious flow detection device and method for resisting tag noise
CN111563468B (en) Driver abnormal behavior detection method based on attention of neural network
CN111310719B (en) Unknown radiation source individual identification and detection method
CN109117810A (en) Fatigue driving behavioral value method, apparatus, computer equipment and storage medium
CN115758083A (en) Motor bearing fault diagnosis method based on time domain and time-frequency domain fusion
CN113076980B (en) Method for detecting images outside distribution based on attention enhancement and input disturbance
CN111431937A (en) Method and system for detecting abnormal flow of industrial network
CN111950363B (en) Video anomaly detection method based on open data filtering and domain adaptation
CN113343123A (en) Training method and detection method for generating confrontation multiple relation graph network
CN113343926A (en) Driver fatigue detection method based on convolutional neural network
CN117118718A (en) Intrusion detection method and system based on multi-generator GAN data enhancement
CN117596011A (en) Power grid flow anomaly detection method and system based on countermeasure convolutional neural network
CN116385935A (en) Abnormal event detection algorithm based on unsupervised domain self-adaption
CN116070137A (en) Open set identification device and method for malicious traffic detection
CN112668655A (en) Method for detecting out-of-distribution image based on generation of confrontation network uncertainty attention enhancement

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant