CN110708567A

CN110708567A - Distributed self-optimization video real-time analysis framework based on active learning

Info

Publication number: CN110708567A
Application number: CN201910303275.5A
Authority: CN
Inventors: 赵宏伟; 张卫山; 任鹏程
Original assignee: China University of Petroleum East China
Current assignee: China University of Petroleum East China
Priority date: 2019-04-15
Filing date: 2019-04-15
Publication date: 2020-01-17

Abstract

The invention provides a distributed self-optimization video real-time analysis framework based on active learning, wherein a Storm framework performs parallel training on distributed stored video data, a deep learning model is fused to form the distributed parallel video analysis framework, and the optimal distribution of computing resources is realized under the Storm framework by matching with a multi-terminal parallel service mechanism. And performing iterative training on the original model after correction by using the data of the error analysis, and updating the deep learning model in the video analysis frame, so that self-optimization of the frame is realized, and the accuracy and the robustness of the model are improved.

Description

Distributed self-optimization video real-time analysis framework based on active learning

Technical Field

The invention relates to the field of massive video analysis, data distributed computation and storage and deep learning, in particular to a distributed self-optimization video real-time analysis framework based on active learning.

Background

The rapid development of the deep learning technology provides a new idea for solving a plurality of problems in production. The convolutional neural network is powerful in that the multilayer structure of the convolutional neural network can automatically learn features, and can learn features of multiple layers: the sensing domain of the shallower convolutional layer is smaller, and the characteristics of some local regions are learned; deeper convolutional layers have larger perceptual domains and can learn more abstract features. These abstract features are less sensitive to the size, position, orientation, etc. of the object, thereby contributing to an improvement in recognition performance. The method has strong adaptability to factors such as geometric transformation, deformation and illumination of the target, and effectively overcomes the recognition resistance caused by variable target appearance. The method can automatically extract and analyze the features according to the data input into the network, and has higher universality generalization capability. The closest techniques to the present invention are:

(1) and deep learning: deep learning provides a method for enabling a computer to automatically learn mode characteristics, and the characteristic learning is integrated into the process of establishing a model, so that incompleteness caused by artificial design characteristics is reduced. Some machine learning applications taking deep learning as a core reach recognition or classification performance exceeding that of the existing algorithm under the application scene meeting specific conditions. However, in an application scenario where a limited amount of data is provided, the deep learning algorithm cannot perform an unbiased estimation on the regularity of the data. To achieve good accuracy, large data supports are required.

(2) Storm calculation framework: storm is an open-source distributed real-time computing system, can simply and reliably process a large number of data streams, and has a plurality of use scenes: such as real-time analysis, online machine learning, continuous computing, distributed RPC, ETL, and the like. Storm supports horizontal expansion, has high fault tolerance, ensures that each message can be processed, has high processing speed, and can process millions of messages per second by each node in a small cluster. Storm deployment and operation and maintenance are convenient, and more importantly, any programming language can be used for developing application.

In the background of massive video data, the conventional deep learning video analysis is difficult to achieve the effect of real-time analysis and bear the load of massive video data. In order to realize real-time analysis of videos under the condition of parallel input of massive multi-source video data and actively optimize a video analysis model by utilizing the input videos, the invention provides a distributed self-optimization video real-time analysis framework based on active learning.

Disclosure of Invention

In order to overcome the defects in the prior art, the invention provides a distributed self-optimization video real-time analysis framework based on active learning, which is combined with a Storm stream processing framework and a deep learning image analysis model, realizes real-time analysis of videos under the condition of parallel input of mass multi-source video data, and realizes self-optimization of the deep learning model by using verified analysis results.

The technical scheme of the invention is as follows:

the method comprises the following steps that (1) distributed training of image data is achieved by adopting a Storm framework based on a Hadoop distributed storage system;

performing model fusion on the obtained multiple deep learning models, deploying the models to a distributed parallel video analysis framework, and realizing optimal allocation of computing resources by matching with a multi-terminal parallel service mechanism;

step (3), aiming at multi-source data input, selecting an optimal computing node in a distributed parallel video analysis frame autonomously;

step (4), the result of the video analysis is transmitted to the front end for displaying, and the result and the video image file are cached to a temporary storage server;

step (5), correcting the data in the temporary storage server, particularly the data with analysis errors, and then performing iterative training;

and (6) updating the neural network model in a video analysis frame to realize one-time free iteration.

The invention has the beneficial effects that:

(1) the method utilizes the advantage of Storm streaming processing to realize the distributed parallel training of the deep learning model and improve the efficiency of the deep learning model training;

(2) the real-time video analysis under the condition of parallel input of mass multi-source video data is realized;

(3) aiming at the difficult problem of large updating calculation amount of the deep learning model, the self-optimization of the video analysis model is realized by correcting the error analysis result on the premise of the existing deep learning model.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

FIG. 1 is a schematic diagram of a distributed self-optimizing video real-time parsing framework based on active learning according to the present invention;

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As shown in fig. 1, a distributed self-optimization video real-time analysis framework based on active learning combines the efficient stream computation performance and the deep learning target detection capability of the Storm framework, and realizes real-time video analysis under the condition of parallel input of massive multi-source video data.

The following describes in detail a specific process of a distributed self-optimization video real-time parsing framework based on active learning:

According to the distributed self-optimization video real-time analysis framework based on active learning, a Storm framework conducts parallel training on distributed stored video data, deep learning models are fused to form the distributed parallel video analysis framework, and optimal distribution of computing resources is achieved under the Storm framework in cooperation with a multi-terminal parallel service mechanism. And performing iterative training on the original model after correction by using the data of the error analysis, and updating the deep learning model in the video analysis frame, so that self-optimization of the frame is realized, and the accuracy and the robustness of the model are improved.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. A distributed self-optimization video real-time analysis frame based on active learning is combined with a Storm stream processing frame and a deep learning image analysis model, real-time analysis of videos is achieved under the condition that massive multi-source video data are input in parallel, and self-optimization of the deep learning model is achieved by means of verified analysis results, and the distributed self-optimization video real-time analysis frame comprises the following steps: