CN112036387A

CN112036387A - News picture shooting angle identification method based on gated convolutional neural network

Info

Publication number: CN112036387A
Application number: CN202011226536.7A
Authority: CN
Inventors: 温序铭; 李�杰; 宋恩宏; 颜刚
Original assignee: Chengdu Sobey Digital Technology Co Ltd
Current assignee: Chengdu Sobey Digital Technology Co Ltd
Priority date: 2020-11-06
Filing date: 2020-11-06
Publication date: 2020-12-04
Anticipated expiration: 2040-11-06
Also published as: CN112036387B

Abstract

The invention discloses a news picture shooting angle identification method based on a gated convolutional neural network, which comprises the following steps of: marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; constructing a dual-branch gate control convolution neural network algorithm model based on the trained target detection algorithm model and the image classification algorithm model, and acquiring news picture shooting angles and the like corresponding to the identification images based on the dual-branch gate control convolution neural network algorithm model; the method can identify whether the corresponding news picture is obtained by overlook, look-up or head-up shooting, consumes less resources and consumes less time in operation, and can complete practical engineering application.

Description

News picture shooting angle identification method based on gated convolutional neural network

Technical Field

The invention relates to the technical field of computer vision, in particular to a method for identifying a news picture shooting angle based on a gated convolutional neural network.

Background

Video is an important news information transmission means, and compared with a news information transmission mode which uses newspaper, radio station, broadcast and the like as transmission carriers, the video often has the characteristic of more intuitive and more realistic description of news events. When the news video is shot, different shooting angles can be adopted for shooting according to different news types. After the news is manufactured, the shooting angle information of the news picture can be marked, and the repeated utilization rate of the news material can be improved. The traditional shooting angle identification method is mainly based on a three-dimensional reconstruction method, and the method has the problems of huge resource consumption and long time consumption, and cannot meet the actual application scene.

Disclosure of Invention

The invention aims to overcome the defects of the prior art and provides a news image shooting angle identification method based on a gated convolutional neural network, which can identify the shooting angle information of a news image, specifically can identify whether the corresponding news image is obtained by overlook, look-up or look-up shooting, has less resource consumption and short calculation time consumption, and can complete practical engineering application.

The purpose of the invention is realized by the following scheme:

a news picture shooting angle identification method based on a gated convolutional neural network comprises the following steps:

marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; and constructing a double-branch gate control convolution neural network algorithm model based on the trained saliency target detection algorithm model and the trained image classification algorithm model, and acquiring a news picture shooting angle corresponding to the identification image based on the double-branch gate control convolution neural network algorithm model.

Further, labeling the salient target area and the shooting angle classification information of the image to be recognized comprises the following steps:

s101, collecting news videos, decoding the news videos into images, and collecting the images with set quantity so as to perform subsequent labeling steps;

s102, according to a data set making standard of image saliency target detection, using the image data set obtained in the step S101 to mark saliency target information in each image;

s103, according to the data set creating standard of the image classification, the data set marked in the step S102 is marked with the lens angle type information.

Further, the salient object detection algorithm model is based on the construction process steps of a readable storage medium:

s201, constructing a deep convolutional neural network structure of a coder and a decoder on the basis of a convolutional residual neural network of 101 layers, wherein the coder and the decoder adopt symmetrical characteristic pyramids, and the network structure is used as a network structure of a training significance target detection model and is marked as the network structureNs；

S202, updating the back propagation error by using a batch gradient descent methodNsAnd (5) repeatedly iterating model parameters until the model converges.

Further, the image classification algorithm model is based on the construction process steps of a readable storage medium:

s301, constructing an image classification algorithm network structure based on the convolution residual error neural network of the 101 layers, wherein the network structure is used as a network structure for training an image classification algorithm and is marked as the network structureNc；

S302, updating the back propagation error by adopting a batch gradient descent methodNcAnd (5) repeatedly iterating the model parameters until the classification model converges.

Further, the method comprises the following steps of constructing a dual-branch gate-controlled convolutional neural network algorithm model based on a readable storage medium:

s401, mixingNsDeleting the layer after the last convolutional layer in the network, and reserving the remaining network structure, which is recorded asFns(ii) a Deleting the layers behind the Nc network classification layer, reserving the rest network structure and recording asFnc；

S402, constructing a double-branch gate control convolution neural network algorithm modelDPThe algorithm model adopts a double-branch network pairImage recognition, wherein the network structure of the first branch usesFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.

Further, acquiring a news picture shooting angle corresponding to the identification image based on the two-branch gate convolution neural network algorithm model, and the method comprises the following steps:

s501, updating the back propagation error by adopting a batch gradient descent methodDPModel parameters, training of this step only updatesGAnd ClsStopping training after the model is converged;

s502, continuing to train the model updated in the step S501, and updating the model in the step SDPThe updating mode adopts a batch gradient descent method.

The invention has the beneficial effects that:

(1) the invention provides a news picture shooting angle identification method of a gate-control convolutional neural network aiming at the defects of a three-dimensional reconstruction method, which is mainly realized by using the image significance detection and image classification technology of the current convolutional neural network in computer vision.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.

FIG. 1 is a flow chart of the method steps of the present invention;

FIG. 2 is a schematic diagram of an algorithm network structure for constructing a two-branch gated convolutional neural network model according to the present invention.

Detailed Description

All of the features disclosed in the specification for all of the embodiments (including any accompanying claims, abstract and drawings), or all of the steps of a method or process so disclosed, may be combined and/or expanded, or substituted, in any way, except for mutually exclusive features and/or steps.

As shown in fig. 1 and 2, a method for identifying a news image shooting angle based on a gated convolutional neural network includes the steps of:

s201, constructing a deep convolution nerve of a coder and a decoder based on a convolution residual error nerve network of 101 layersA network structure, in which the codec uses a symmetric feature pyramid, and the network structure is used as a network structure for training a saliency target detection model and is recorded asNs；

S402, constructing a gate-controlled convolutional neural network algorithm of double branchesDPThe algorithm adopts a double-branch network to identify the image, wherein the network structure of the branch I usesFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.

s501, updating the back propagation error by adopting a batch gradient descent methodDPModel parameters, training of this step only updatesGAnd ClsParameter of (2), convergence of modelThen, stopping training;

In another embodiment of the present invention, as shown in fig. 1, a method for identifying a shot angle of a news image based on a gated convolutional neural network according to an embodiment of the present invention includes the following steps:

the method comprises the following steps: labeling the classification information of the image saliency target area and the shooting angle;

in step one, through the discussion of professional news related practitioners and computer vision related practitioners, the saliency target of the image and the annotation standard of the image capturing angle are defined.

The first step further comprises the following substeps:

step 101: in this embodiment, a plurality of original video materials can be obtained by crawling news columns of television stations in each province and city such as beijing, anhui, guangdong and the like on the internet by using a crawler technology, videos are decoded into images by using a video decoding tool, some obvious interference images such as images synthesized by a computer are removed, and a data set to be marked is formed.

Step 102: and marking each image of the data set to be marked according to the data set making rule of the saliency target detection, and marking the shooting angle category information of each image according to the marking rule of image classification.

Step two: and designing a saliency target detection algorithm, and training a saliency target detection algorithm model by using the data set obtained in the step one.

In this embodiment, the implementation of the image saliency target detection algorithm model based on the encoder includes the following sub-steps:

step 201: constructing a convolutional neural network structure of a coder and a decoder based on a convolutional residual neural network with 101 layers, wherein the coder and the decoder are in a symmetrical network structure;

step 202: updating Ns model parameters by using a batch gradient descent method for back propagation errors, and repeatedly iterating until the model converges;

step three: design and train shooting angle classification model

In this embodiment, the image classification algorithm is mainly used to identify the shooting angle information of the image, wherein the following sub-steps are included, for example, a 101-layer convolutional neural network which is general for image classification is adopted, the classification number of the network is 3, wherein 0 represents an overhead view, 1 represents an overhead view, 2 represents an eye view, and a training mode adopts a common deep learning image classification strategy.

Step four: the training for constructing the dual-branch gated convolutional neural network algorithm comprises the following specific implementation steps:

step 401: deleting the layer after the last convolution layer in the network structure of the significance detection algorithm designed in the step two, reserving the rest network structure as one branch in the double branches, deleting the layer after the network classification layer of the image classification algorithm, reserving the rest network structure as the other branch in the double branches, and constructing the double branches.

Step 402: a gating module can be designed, which uses a spatial channel attention mechanism commonly used in computer vision, as shown in fig. 2, the module first combines features of two branches, then learns the weight of the input features through the designed spatial attention mechanism, and then performs point multiplication with the fused features to obtain features S beneficial to the task. And the S is followed by a classification layer, and the class information of the shooting angle is output.

Step five: training of dual-branch gated convolutional neural network algorithm

Step 501: initializing a double-branch gate-control convolution neural network algorithm model by using the model trained in the second step and the third step, updating the parameters of the algorithm model in the figure 2 by adopting a batch gradient descent method for back propagation errors, only updating the parameters of a gate-control convolution module in the training of the step, and stopping the training after the model is converged.

Step 502: the model updated in step 501 is trained, which needs to update the parameters of all modules in fig. 2, and the updating method adopts a batch gradient descent method.

The functionality of the present invention, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.

Other embodiments than the above examples may be devised by those skilled in the art based on the foregoing disclosure, or by adapting and using knowledge or techniques of the relevant art, and features of various embodiments may be interchanged or substituted and such modifications and variations that may be made by those skilled in the art without departing from the spirit and scope of the present invention are intended to be within the scope of the following claims.

Claims

1. A news picture shooting angle identification method based on a gated convolutional neural network is characterized by comprising the following steps:

2. The news image shooting angle recognition method based on the gated convolutional neural network as claimed in claim 1, wherein labeling the salient target region and the shooting angle classification information of the image to be recognized comprises the steps of:

3. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 1, wherein the salient object detection algorithm model is based on the construction process steps of a readable storage medium:

4. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in any one of claims 1 or 3, wherein the image classification algorithm model is based on the steps of a readable storage medium construction process:

5. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 4, wherein the algorithm model of the gated convolutional neural network with two branches is based on the construction process steps of a readable storage medium:

S402, constructing a double-branch gate control convolution neural network algorithm modelDPThe algorithm model adopts a double-branch network to identify the image, wherein the network structure of the branch I is usedFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.

6. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 1, wherein the method for acquiring the shooting angle of the news picture corresponding to the identification image based on the two-branch gated convolutional neural network algorithm model comprises the following steps: