CN110659561A

CN110659561A - Optimization method and device of internet riot and terrorist video identification model

Info

Publication number: CN110659561A
Application number: CN201910725251.9A
Authority: CN
Inventors: 李扬曦; 佟玲玲; 井雅琪; 缪亚男; 段运强; 胡燕林; 任博雅
Original assignee: National Computer Network and Information Security Management Center
Current assignee: National Computer Network and Information Security Management Center
Priority date: 2019-08-07
Filing date: 2019-08-07
Publication date: 2020-01-07

Abstract

The invention discloses an optimization method and device of an internet riot and terrorist video identification model, wherein the method comprises the following steps: acquiring parameters of each channel of a convolution kernel of each layer in a convolution neural network of an internet riot and terrorist video identification model; and judging whether to discard the corresponding channel according to the parameters of each channel, and if so, executing channel discarding operation. The invention greatly improves the data volume audited by using the riot model every day. The method fills the blank of the auditing technology and products of the mass internet video riot and terrorist videos, and can greatly improve the product competitiveness of the auditing of the riot and terrorist images.

Description

Optimization method and device of internet riot and terrorist video identification model

Technical Field

The invention relates to the technical field of computers, in particular to an optimization method and device of an internet riot and terrorist video identification model.

Background

Nowadays, the internet brings great convenience to the life of people, but simultaneously, many lawbreakers use the internet to publish some videos which publicize extreme, terrorism or have violent terrorism pictures, which causes great harm to the internet surfing experience of common people, meanwhile, the videos are easy to arouse the bad emotion of individuals to cause terrorism activities, and further, the videos form great challenges to peace, stability and public security of the country. Therefore, the identification of the riot and terrorist videos from a large number of videos uploaded to the internet by netizens and the timely blocking and killing and the later filing of some potential lawless persons are of great significance, but at the same time, the work consumes manpower and material resources because the data volume uploaded to the network every day is huge, however, the various defects of manual audit before are greatly relieved by the automatic machine identification of the riot and terrorist videos by applying deep learning at present, and the acceleration and compression of the riot and terrorist identification model are of great significance because the data volume needing to be audited every day is huge.

The existing riot and terrorist identification model is an image identification classifier, a video is firstly extracted into frames to form pictures, and then the picture riot and terrorist identification model is divided into two categories of riot and terrorist and normal. Specifically, the image classifier is a deep convolutional neural network, the input of the model is an image, the model outputs scores of two categories of an explosion category and a normal category, and the score of the high-ranked person is obtained as the judgment category of the image. However, in order to make the recognition capability of the model strong, the model is relatively complex, the number of parameters is large, and the calculation speed is slow each time.

In summary, the disadvantages of the prior art are mainly as follows: the existing riot and terrorist recognition model is too high in complexity and large in calculation amount, so that too many machine resources are consumed for auditing the whole network video, and the whole network video is difficult to be truly overhauled. It is therefore of great significance to compress the riot identification model and the acceleration of the model without affecting the model identification capability.

Disclosure of Invention

The embodiment of the invention provides an optimization method and device of an internet riot and terrorist video identification model, which are used for solving the problems in the prior art.

The embodiment of the invention provides an optimization method of an internet riot and terrorist video identification model, which comprises the following steps:

acquiring parameters of each channel of a convolution kernel of each layer in a convolution neural network of an internet riot and terrorist video identification model;

and judging whether to discard the corresponding channel according to the parameters of each channel, and if so, executing channel discarding operation.

Preferably, whether to discard the corresponding channel is determined according to the parameter of each channel, and if yes, the performing the channel discarding operation specifically includes:

and calculating the sum of absolute values of the parameters of the same channel, judging whether the sum of the absolute values is less than or equal to a preset parameter threshold value, and if so, discarding the corresponding channel.

Preferably, the method further comprises:

and converting the data and parameters of the Internet riot and terrorist video identification model into an 8-bit integer for processing by a computer.

Preferably, the step of converting the data and parameters of the internet riot and terrorist video identification model into an 8-bit integer for computer processing specifically comprises the following steps:

inputting data and parameters of a floating point number type, and carrying out primary int8 quantization on the data and the parameters to obtain quantized data, the maximum value and the minimum value of the data, quantization parameters and the maximum value and the minimum value of the parameters;

carrying out convolution operation on the quantized data, the maximum value and the minimum value of the data, the quantization parameter and the maximum value and the minimum value of the parameter, and outputting a 32-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result;

carrying out complex quantization on the 32-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result to obtain an 8-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result;

and restoring the 8-bit integer into data and parameters of a floating point number type according to the quantized convolution result and the maximum value and the minimum value of the quantized result.

The embodiment of the invention also provides an optimization device of the internet riot and terrorist video identification model, which comprises the following steps:

the acquiring module is used for acquiring parameters of each channel of a convolution kernel of each layer in a convolution neural network of the Internet riot and terrorist video identification model;

and the judging module is used for judging whether to discard the corresponding channel according to the parameters of each channel, and if so, executing channel discarding operation.

Preferably, the determining module is specifically configured to:

Preferably, the apparatus further comprises:

and the Int8 quantization module is used for converting data and parameters of the internet riot and terrorist video identification model into an 8-bit integer for computer processing.

Preferably, the Int8 quantization module is specifically configured to:

The embodiment of the invention also provides an optimization device of the internet riot and terrorist video identification model, which comprises the following steps: the internet riot video identification model optimization method comprises the following steps of a memorizer, a processor and a computer program which is stored on the memorizer and can run on the processor, wherein the computer program realizes the steps of the internet riot video identification model optimization method when being executed by the processor.

The embodiment of the invention also provides a computer-readable storage medium, wherein an implementation program for information transmission is stored on the computer-readable storage medium, and when the implementation program is executed by a processor, the steps of the optimization method for the internet riot and terrorist video identification model are implemented.

By adopting the embodiment of the invention, aiming at the problems of low speed and large model of the existing riot and terrorist identification model, the problem is solved by pruning the convolutional neural network of the internet riot and terrorist video identification model, so that the data volume audited by using the riot and terrorist model every day is greatly improved. The method fills the blank of the auditing technology and products of the mass internet video riot and terrorist videos, and can greatly improve the product competitiveness of the auditing of the riot and terrorist images.

The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.

Drawings

Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:

FIG. 1 is a flow chart of a method for optimizing an Internet riot video identification model according to an embodiment of the method of the present invention;

FIG. 2 is a schematic diagram of an apparatus for optimizing an Internet riot video recognition model according to a first embodiment of the present invention;

fig. 3 is a schematic diagram of an apparatus for optimizing an internet riot video recognition model according to a second embodiment of the present invention.

Detailed Description

Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

Method embodiment

According to an embodiment of the present invention, an optimization method for an internet riot and terrorist video recognition model is provided, fig. 1 is a flowchart of the optimization method for the internet riot and terrorist video recognition model according to the embodiment of the present invention, as shown in fig. 1, the optimization method for the internet riot and terrorist video recognition model according to the embodiment of the present invention specifically includes:

step 101, acquiring parameters of each channel of a convolution kernel of each layer in a convolution neural network of an internet riot and terrorist video identification model;

and 102, judging whether to discard the corresponding channel according to the parameters of each channel, and if so, executing channel discarding operation. Step 102 specifically includes: and calculating the sum of absolute values of the parameters of the same channel, judging whether the sum of the absolute values is less than or equal to a preset parameter threshold value, and if so, discarding the corresponding channel.

That is to say, for a convolutional neural network, many parameters in the network are usually redundant, and do not contribute to an actual model result, and each layer of the network usually has convolution kernels of many channels, and if the sum of absolute values of parameters of a certain channel of a convolution kernel is small, that is, each parameter of the channel is small, it is conceivable that the effect of the channel actually playing in the convolution process is very small, so that we can discard the whole channel, thereby achieving the effects of reducing parameters, reducing calculation amount, and accelerating the model, that is, pruning the convolutional neural network model, and the pruning operation generally affects the recognition capability of the model within an acceptable range, and can greatly reduce the complexity of the model and greatly accelerate the model prediction speed.

In order to further optimize the convolutional neural network of the internet riot and terrorist video identification model, preferably, the embodiment of the invention can also convert the data and parameters of the internet riot and terrorist video identification model into 8-bit integers for processing by a computer. The method specifically comprises the following steps:

step 1, inputting data and parameters of a floating point number type, and carrying out primary int8 quantization on the data and the parameters to obtain quantized data, the maximum value and the minimum value of the data, quantized parameters and the maximum value and the minimum value of the parameters;

step 2, carrying out convolution operation on the quantized data, the maximum value and the minimum value of the data, the quantization parameter and the maximum value and the minimum value of the parameter, and outputting a 32-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result;

step 3, carrying out complex quantization on the 32-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result to obtain an 8-bit integer quantization convolution result and the maximum value and the minimum value of the quantization result;

and 4, restoring the 8-bit integer into data and parameters of the floating point number type according to the quantized convolution result and the maximum value and the minimum value of the quantized result.

That is, the parameters of the model are usually stored in the form of float32 in the computer, and if the parameters are changed to 8-bit integers, the computation speed of the CPU processing the integers is faster than that of the floating point numbers, so the network can be compressed and accelerated without changing the number of connections and the number of parameters.

In the embodiment of the present invention, the input data is a floating point type, and first, there are three data quantized by int 8: quantized data, maximum, minimum. Namely, the original floating point number can be recovered through the three data; then, performing quantized convolution operation, wherein 6 quantized data, the maximum and minimum values of the data, the quantization parameters and the maximum and minimum values of the parameters are input, and 3 quantized convolution results (32-bit integers), the maximum values and the minimum values of the quantization results are output; thirdly, the quantization result and the range input thereof are subjected to complex quantization, and the aim is to re-quantize the 32-bit integer into an 8-bit integer; fourth, the 8-bit integer is reduced to a floating point type. In practice the input and output of such a unit are still floating point numbers.

By adopting the embodiment of the invention, aiming at the problems of low speed and large model of the existing riot and terrorist identification model, the problem is solved by pruning the convolutional neural network of the internet riot and terrorist video identification model and carrying out int8 quantization, so that the data volume audited by using the riot and terrorist model every day is greatly improved. The method fills the blank of the auditing technology and products of the mass internet video riot and terrorist videos, and can greatly improve the product competitiveness of the auditing of the riot and terrorist images.

Apparatus embodiment one

According to an embodiment of the present invention, an optimization apparatus for an internet riot and terrorist video recognition model is provided, fig. 2 is a schematic diagram of the optimization apparatus for an internet riot and terrorist video recognition model according to an embodiment of the present invention, as shown in fig. 2, the optimization apparatus for an internet riot and terrorist video recognition model according to an embodiment of the present invention specifically includes:

the acquiring module 20 is configured to acquire parameters of each channel of a convolution kernel of each layer in a convolution neural network of the internet riot and terrorist video identification model;

and the judging module 22 is configured to judge whether to discard the corresponding channel according to the parameter of each channel, and if yes, execute a channel discarding operation. The determining module 22 is specifically configured to:

In order to further optimize the convolutional neural network of the internet riot video recognition model, preferably, the apparatus of the embodiment of the present invention further includes:

and the Int8 quantization module is used for converting data and parameters of the internet riot and terrorist video identification model into an 8-bit integer for computer processing. Int8 quantization module is specifically used for:

Device embodiment II

The embodiment of the invention provides an optimizing device of an internet riot and terrorist video identification model, as shown in fig. 3, comprising: a memory 30, a processor 32 and a computer program stored on the memory 30 and executable on the processor 32, which computer program, when executed by the processor 32, carries out the following method steps:

Device embodiment III

The embodiment of the present invention provides a computer-readable storage medium, on which an implementation program for information transmission is stored, and when being executed by a processor 32, the implementation program implements the following method steps:

The computer-readable storage medium of this embodiment includes, but is not limited to: ROM, RAM, magnetic or optical disks, and the like.

It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and alternatively, they may be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, and in some cases, the steps shown or described may be performed in an order different than that described herein, or they may be separately fabricated into individual integrated circuit modules, or multiple ones of them may be fabricated into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.

The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. An optimization method for an internet riot and terrorist video identification model is characterized by comprising the following steps:

2. The method of claim 1, wherein whether to discard the corresponding channel is determined according to the parameter of each channel, and if yes, the performing the channel discarding operation specifically includes:

3. The method of claim 1, wherein the method further comprises:

4. The method of claim 3, wherein converting data and parameters of the internet riot video recognition model into 8-bit integers for computer processing specifically comprises:

5. An optimization device for an internet riot and terrorist video recognition model is characterized by comprising:

6. The apparatus of claim 5, wherein the determining module is specifically configured to:

7. The apparatus of claim 5, wherein the apparatus further comprises:

8. The apparatus of claim 7, wherein the Int8 quantization module is specifically configured to:

9. An optimization device for an internet riot and terrorist video recognition model is characterized by comprising: memory, processor and computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the method of optimizing an internet riot video recognition model according to any one of claims 1 to 4.

10. A computer-readable storage medium, on which an information transfer implementing program is stored, which, when executed by a processor, implements the steps of the method for optimizing an internet riot video recognition model according to any one of claims 1 to 4.