CN112036387A - News picture shooting angle identification method based on gated convolutional neural network - Google Patents

News picture shooting angle identification method based on gated convolutional neural network Download PDF

Info

Publication number
CN112036387A
CN112036387A CN202011226536.7A CN202011226536A CN112036387A CN 112036387 A CN112036387 A CN 112036387A CN 202011226536 A CN202011226536 A CN 202011226536A CN 112036387 A CN112036387 A CN 112036387A
Authority
CN
China
Prior art keywords
neural network
image
shooting angle
algorithm model
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011226536.7A
Other languages
Chinese (zh)
Other versions
CN112036387B (en
Inventor
温序铭
李�杰
宋恩宏
颜刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Sobey Digital Technology Co Ltd
Original Assignee
Chengdu Sobey Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Sobey Digital Technology Co Ltd filed Critical Chengdu Sobey Digital Technology Co Ltd
Priority to CN202011226536.7A priority Critical patent/CN112036387B/en
Publication of CN112036387A publication Critical patent/CN112036387A/en
Application granted granted Critical
Publication of CN112036387B publication Critical patent/CN112036387B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a news picture shooting angle identification method based on a gated convolutional neural network, which comprises the following steps of: marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; constructing a dual-branch gate control convolution neural network algorithm model based on the trained target detection algorithm model and the image classification algorithm model, and acquiring news picture shooting angles and the like corresponding to the identification images based on the dual-branch gate control convolution neural network algorithm model; the method can identify whether the corresponding news picture is obtained by overlook, look-up or head-up shooting, consumes less resources and consumes less time in operation, and can complete practical engineering application.

Description

News picture shooting angle identification method based on gated convolutional neural network
Technical Field
The invention relates to the technical field of computer vision, in particular to a method for identifying a news picture shooting angle based on a gated convolutional neural network.
Background
Video is an important news information transmission means, and compared with a news information transmission mode which uses newspaper, radio station, broadcast and the like as transmission carriers, the video often has the characteristic of more intuitive and more realistic description of news events. When the news video is shot, different shooting angles can be adopted for shooting according to different news types. After the news is manufactured, the shooting angle information of the news picture can be marked, and the repeated utilization rate of the news material can be improved. The traditional shooting angle identification method is mainly based on a three-dimensional reconstruction method, and the method has the problems of huge resource consumption and long time consumption, and cannot meet the actual application scene.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a news image shooting angle identification method based on a gated convolutional neural network, which can identify the shooting angle information of a news image, specifically can identify whether the corresponding news image is obtained by overlook, look-up or look-up shooting, has less resource consumption and short calculation time consumption, and can complete practical engineering application.
The purpose of the invention is realized by the following scheme:
a news picture shooting angle identification method based on a gated convolutional neural network comprises the following steps:
marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; and constructing a double-branch gate control convolution neural network algorithm model based on the trained saliency target detection algorithm model and the trained image classification algorithm model, and acquiring a news picture shooting angle corresponding to the identification image based on the double-branch gate control convolution neural network algorithm model.
Further, labeling the salient target area and the shooting angle classification information of the image to be recognized comprises the following steps:
s101, collecting news videos, decoding the news videos into images, and collecting the images with set quantity so as to perform subsequent labeling steps;
s102, according to a data set making standard of image saliency target detection, using the image data set obtained in the step S101 to mark saliency target information in each image;
s103, according to the data set creating standard of the image classification, the data set marked in the step S102 is marked with the lens angle type information.
Further, the salient object detection algorithm model is based on the construction process steps of a readable storage medium:
s201, constructing a deep convolutional neural network structure of a coder and a decoder on the basis of a convolutional residual neural network of 101 layers, wherein the coder and the decoder adopt symmetrical characteristic pyramids, and the network structure is used as a network structure of a training significance target detection model and is marked as the network structureNs
S202, updating the back propagation error by using a batch gradient descent methodNsAnd (5) repeatedly iterating model parameters until the model converges.
Further, the image classification algorithm model is based on the construction process steps of a readable storage medium:
s301, constructing an image classification algorithm network structure based on the convolution residual error neural network of the 101 layers, wherein the network structure is used as a network structure for training an image classification algorithm and is marked as the network structureNc
S302, updating the back propagation error by adopting a batch gradient descent methodNcAnd (5) repeatedly iterating the model parameters until the classification model converges.
Further, the method comprises the following steps of constructing a dual-branch gate-controlled convolutional neural network algorithm model based on a readable storage medium:
s401, mixingNsDeleting the layer after the last convolutional layer in the network, and reserving the remaining network structure, which is recorded asFns(ii) a Deleting the layers behind the Nc network classification layer, reserving the rest network structure and recording asFnc
S402, constructing a double-branch gate control convolution neural network algorithm modelDPThe algorithm model adopts a double-branch network pairImage recognition, wherein the network structure of the first branch usesFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.
Further, acquiring a news picture shooting angle corresponding to the identification image based on the two-branch gate convolution neural network algorithm model, and the method comprises the following steps:
s501, updating the back propagation error by adopting a batch gradient descent methodDPModel parameters, training of this step only updatesGAnd ClsStopping training after the model is converged;
s502, continuing to train the model updated in the step S501, and updating the model in the step SDPThe updating mode adopts a batch gradient descent method.
The invention has the beneficial effects that:
(1) the invention provides a news picture shooting angle identification method of a gate-control convolutional neural network aiming at the defects of a three-dimensional reconstruction method, which is mainly realized by using the image significance detection and image classification technology of the current convolutional neural network in computer vision.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flow chart of the method steps of the present invention;
FIG. 2 is a schematic diagram of an algorithm network structure for constructing a two-branch gated convolutional neural network model according to the present invention.
Detailed Description
All of the features disclosed in the specification for all of the embodiments (including any accompanying claims, abstract and drawings), or all of the steps of a method or process so disclosed, may be combined and/or expanded, or substituted, in any way, except for mutually exclusive features and/or steps.
As shown in fig. 1 and 2, a method for identifying a news image shooting angle based on a gated convolutional neural network includes the steps of:
marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; and constructing a double-branch gate control convolution neural network algorithm model based on the trained saliency target detection algorithm model and the trained image classification algorithm model, and acquiring a news picture shooting angle corresponding to the identification image based on the double-branch gate control convolution neural network algorithm model.
Further, labeling the salient target area and the shooting angle classification information of the image to be recognized comprises the following steps:
s101, collecting news videos, decoding the news videos into images, and collecting the images with set quantity so as to perform subsequent labeling steps;
s102, according to a data set making standard of image saliency target detection, using the image data set obtained in the step S101 to mark saliency target information in each image;
s103, according to the data set creating standard of the image classification, the data set marked in the step S102 is marked with the lens angle type information.
Further, the salient object detection algorithm model is based on the construction process steps of a readable storage medium:
s201, constructing a deep convolution nerve of a coder and a decoder based on a convolution residual error nerve network of 101 layersA network structure, in which the codec uses a symmetric feature pyramid, and the network structure is used as a network structure for training a saliency target detection model and is recorded asNs
S202, updating the back propagation error by using a batch gradient descent methodNsAnd (5) repeatedly iterating model parameters until the model converges.
Further, the image classification algorithm model is based on the construction process steps of a readable storage medium:
s301, constructing an image classification algorithm network structure based on the convolution residual error neural network of the 101 layers, wherein the network structure is used as a network structure for training an image classification algorithm and is marked as the network structureNc
S302, updating the back propagation error by adopting a batch gradient descent methodNcAnd (5) repeatedly iterating the model parameters until the classification model converges.
Further, the method comprises the following steps of constructing a dual-branch gate-controlled convolutional neural network algorithm model based on a readable storage medium:
s401, mixingNsDeleting the layer after the last convolutional layer in the network, and reserving the remaining network structure, which is recorded asFns(ii) a Deleting the layers behind the Nc network classification layer, reserving the rest network structure and recording asFnc
S402, constructing a gate-controlled convolutional neural network algorithm of double branchesDPThe algorithm adopts a double-branch network to identify the image, wherein the network structure of the branch I usesFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.
Further, acquiring a news picture shooting angle corresponding to the identification image based on the two-branch gate convolution neural network algorithm model, and the method comprises the following steps:
s501, updating the back propagation error by adopting a batch gradient descent methodDPModel parameters, training of this step only updatesGAnd ClsParameter of (2), convergence of modelThen, stopping training;
s502, continuing to train the model updated in the step S501, and updating the model in the step SDPThe updating mode adopts a batch gradient descent method.
In another embodiment of the present invention, as shown in fig. 1, a method for identifying a shot angle of a news image based on a gated convolutional neural network according to an embodiment of the present invention includes the following steps:
the method comprises the following steps: labeling the classification information of the image saliency target area and the shooting angle;
in step one, through the discussion of professional news related practitioners and computer vision related practitioners, the saliency target of the image and the annotation standard of the image capturing angle are defined.
The first step further comprises the following substeps:
step 101: in this embodiment, a plurality of original video materials can be obtained by crawling news columns of television stations in each province and city such as beijing, anhui, guangdong and the like on the internet by using a crawler technology, videos are decoded into images by using a video decoding tool, some obvious interference images such as images synthesized by a computer are removed, and a data set to be marked is formed.
Step 102: and marking each image of the data set to be marked according to the data set making rule of the saliency target detection, and marking the shooting angle category information of each image according to the marking rule of image classification.
Step two: and designing a saliency target detection algorithm, and training a saliency target detection algorithm model by using the data set obtained in the step one.
In this embodiment, the implementation of the image saliency target detection algorithm model based on the encoder includes the following sub-steps:
step 201: constructing a convolutional neural network structure of a coder and a decoder based on a convolutional residual neural network with 101 layers, wherein the coder and the decoder are in a symmetrical network structure;
step 202: updating Ns model parameters by using a batch gradient descent method for back propagation errors, and repeatedly iterating until the model converges;
step three: design and train shooting angle classification model
In this embodiment, the image classification algorithm is mainly used to identify the shooting angle information of the image, wherein the following sub-steps are included, for example, a 101-layer convolutional neural network which is general for image classification is adopted, the classification number of the network is 3, wherein 0 represents an overhead view, 1 represents an overhead view, 2 represents an eye view, and a training mode adopts a common deep learning image classification strategy.
Step four: the training for constructing the dual-branch gated convolutional neural network algorithm comprises the following specific implementation steps:
step 401: deleting the layer after the last convolution layer in the network structure of the significance detection algorithm designed in the step two, reserving the rest network structure as one branch in the double branches, deleting the layer after the network classification layer of the image classification algorithm, reserving the rest network structure as the other branch in the double branches, and constructing the double branches.
Step 402: a gating module can be designed, which uses a spatial channel attention mechanism commonly used in computer vision, as shown in fig. 2, the module first combines features of two branches, then learns the weight of the input features through the designed spatial attention mechanism, and then performs point multiplication with the fused features to obtain features S beneficial to the task. And the S is followed by a classification layer, and the class information of the shooting angle is output.
Step five: training of dual-branch gated convolutional neural network algorithm
Step 501: initializing a double-branch gate-control convolution neural network algorithm model by using the model trained in the second step and the third step, updating the parameters of the algorithm model in the figure 2 by adopting a batch gradient descent method for back propagation errors, only updating the parameters of a gate-control convolution module in the training of the step, and stopping the training after the model is converged.
Step 502: the model updated in step 501 is trained, which needs to update the parameters of all modules in fig. 2, and the updating method adopts a batch gradient descent method.
The functionality of the present invention, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
Other embodiments than the above examples may be devised by those skilled in the art based on the foregoing disclosure, or by adapting and using knowledge or techniques of the relevant art, and features of various embodiments may be interchanged or substituted and such modifications and variations that may be made by those skilled in the art without departing from the spirit and scope of the present invention are intended to be within the scope of the following claims.

Claims (6)

1. A news picture shooting angle identification method based on a gated convolutional neural network is characterized by comprising the following steps:
marking a salient target area and shooting angle classification information of an image to be identified; training a constructed significance target detection algorithm model and an image classification algorithm model by using the labeled image data set; and constructing a double-branch gate control convolution neural network algorithm model based on the trained saliency target detection algorithm model and the trained image classification algorithm model, and acquiring a news picture shooting angle corresponding to the identification image based on the double-branch gate control convolution neural network algorithm model.
2. The news image shooting angle recognition method based on the gated convolutional neural network as claimed in claim 1, wherein labeling the salient target region and the shooting angle classification information of the image to be recognized comprises the steps of:
s101, collecting news videos, decoding the news videos into images, and collecting the images with set quantity so as to perform subsequent labeling steps;
s102, according to a data set making standard of image saliency target detection, using the image data set obtained in the step S101 to mark saliency target information in each image;
s103, according to the data set creating standard of the image classification, the data set marked in the step S102 is marked with the lens angle type information.
3. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 1, wherein the salient object detection algorithm model is based on the construction process steps of a readable storage medium:
s201, constructing a deep convolutional neural network structure of a coder and a decoder on the basis of a convolutional residual neural network of 101 layers, wherein the coder and the decoder adopt symmetrical characteristic pyramids, and the network structure is used as a network structure of a training significance target detection model and is marked as the network structureNs
S202, updating the back propagation error by using a batch gradient descent methodNsAnd (5) repeatedly iterating model parameters until the model converges.
4. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in any one of claims 1 or 3, wherein the image classification algorithm model is based on the steps of a readable storage medium construction process:
s301, constructing an image classification algorithm network structure based on the convolution residual error neural network of the 101 layers, wherein the network structure is used as a network structure for training an image classification algorithm and is marked as the network structureNc
S302, updating the back propagation error by adopting a batch gradient descent methodNcAnd (5) repeatedly iterating the model parameters until the classification model converges.
5. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 4, wherein the algorithm model of the gated convolutional neural network with two branches is based on the construction process steps of a readable storage medium:
s401, mixingNsDeleting the layer after the last convolutional layer in the network, and reserving the remaining network structure, which is recorded asFns(ii) a Deleting the layers behind the Nc network classification layer, reserving the rest network structure and recording asFnc
S402, constructing a double-branch gate control convolution neural network algorithm modelDPThe algorithm model adopts a double-branch network to identify the image, wherein the network structure of the branch I is usedFnsThe other branch being usedFncThe gate control module uses the spatial channel attention mechanism commonly used by computer vision, and is recorded asGA classification module is added behind the gate control moduleClsAnd the function of the method is to further optimize the output characteristics of the gating module and identify the shooting angle category of the image.
6. The method for identifying the shooting angle of the news picture based on the gated convolutional neural network as claimed in claim 1, wherein the method for acquiring the shooting angle of the news picture corresponding to the identification image based on the two-branch gated convolutional neural network algorithm model comprises the following steps:
s501, updating the back propagation error by adopting a batch gradient descent methodDPModel parameters, training of this step only updatesGAnd ClsStopping training after the model is converged;
s502, continuing to train the model updated in the step S501, and updating the model in the step SDPThe updating mode adopts a batch gradient descent method.
CN202011226536.7A 2020-11-06 2020-11-06 News picture shooting angle identification method based on gated convolutional neural network Active CN112036387B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011226536.7A CN112036387B (en) 2020-11-06 2020-11-06 News picture shooting angle identification method based on gated convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011226536.7A CN112036387B (en) 2020-11-06 2020-11-06 News picture shooting angle identification method based on gated convolutional neural network

Publications (2)

Publication Number Publication Date
CN112036387A true CN112036387A (en) 2020-12-04
CN112036387B CN112036387B (en) 2021-02-09

Family

ID=73572748

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011226536.7A Active CN112036387B (en) 2020-11-06 2020-11-06 News picture shooting angle identification method based on gated convolutional neural network

Country Status (1)

Country Link
CN (1) CN112036387B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113255628A (en) * 2021-07-15 2021-08-13 成都索贝数码科技股份有限公司 Scene identification recognition method for news scene
CN113793286A (en) * 2021-11-18 2021-12-14 成都索贝数码科技股份有限公司 Media image watermark removing method based on multi-order attention neural network

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102216941A (en) * 2008-08-19 2011-10-12 数字标记公司 Methods and systems for content processing
CN103425757A (en) * 2013-07-31 2013-12-04 复旦大学 Cross-medial personage news searching method and system capable of fusing multi-mode information
CN104765801A (en) * 2011-03-07 2015-07-08 科宝2股份有限公司 Systems and methods for analytic data gathering from image providers at event or geographic location
CN109801621A (en) * 2019-03-15 2019-05-24 三峡大学 A kind of audio recognition method based on residual error gating cycle unit
CN110580450A (en) * 2019-08-12 2019-12-17 西安理工大学 traffic sign identification method based on convolutional neural network
CN110889428A (en) * 2019-10-21 2020-03-17 浙江大搜车软件技术有限公司 Image recognition method and device, computer equipment and storage medium
CN111242110A (en) * 2020-04-28 2020-06-05 成都索贝数码科技股份有限公司 Training method of self-adaptive conditional random field algorithm for automatically breaking news items
CN111552300A (en) * 2020-06-09 2020-08-18 南开大学 Crop picking system based on instance segmentation and path planning
CN111860472A (en) * 2020-09-24 2020-10-30 成都索贝数码科技股份有限公司 Television station caption detection method, system, computer equipment and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102216941A (en) * 2008-08-19 2011-10-12 数字标记公司 Methods and systems for content processing
CN104765801A (en) * 2011-03-07 2015-07-08 科宝2股份有限公司 Systems and methods for analytic data gathering from image providers at event or geographic location
CN103425757A (en) * 2013-07-31 2013-12-04 复旦大学 Cross-medial personage news searching method and system capable of fusing multi-mode information
CN109801621A (en) * 2019-03-15 2019-05-24 三峡大学 A kind of audio recognition method based on residual error gating cycle unit
CN110580450A (en) * 2019-08-12 2019-12-17 西安理工大学 traffic sign identification method based on convolutional neural network
CN110889428A (en) * 2019-10-21 2020-03-17 浙江大搜车软件技术有限公司 Image recognition method and device, computer equipment and storage medium
CN111242110A (en) * 2020-04-28 2020-06-05 成都索贝数码科技股份有限公司 Training method of self-adaptive conditional random field algorithm for automatically breaking news items
CN111552300A (en) * 2020-06-09 2020-08-18 南开大学 Crop picking system based on instance segmentation and path planning
CN111860472A (en) * 2020-09-24 2020-10-30 成都索贝数码科技股份有限公司 Television station caption detection method, system, computer equipment and storage medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
YUEJUAN WANG等: "Human Detection Based on Improved Mask R-CNN", 《JOURNAL OF PHYSICS: CONFERENCE SERIES》 *
YULANG CHEN等: "R-CNN-Based Satellite Components Detection in Optical Images", 《INTERNATIONAL JOURNAL OF AEROSPACE ENGINEERING》 *
柳杨等: "基于门控卷积机制与层次注意力机制的多语义词向量计算方法", 《中文信息学报》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113255628A (en) * 2021-07-15 2021-08-13 成都索贝数码科技股份有限公司 Scene identification recognition method for news scene
CN113793286A (en) * 2021-11-18 2021-12-14 成都索贝数码科技股份有限公司 Media image watermark removing method based on multi-order attention neural network

Also Published As

Publication number Publication date
CN112036387B (en) 2021-02-09

Similar Documents

Publication Publication Date Title
CN106650789B (en) Image description generation method based on depth LSTM network
CN109919209B (en) Domain self-adaptive deep learning method and readable storage medium
CN112036387B (en) News picture shooting angle identification method based on gated convolutional neural network
CN111079532A (en) Video content description method based on text self-encoder
CN105825191A (en) Face multi-attribute information-based gender recognition method and system and shooting terminal
CN109213896B (en) Underwater video abstract generation method based on long-short term memory network reinforcement learning
WO2022105608A1 (en) Rapid face density prediction and face detection method and apparatus, electronic device, and storage medium
CN107680053A (en) A kind of fuzzy core Optimized Iterative initial value method of estimation based on deep learning classification
CN109961095B (en) Image labeling system and method based on unsupervised deep learning
CN110796251A (en) Image compression optimization method based on convolutional neural network
CN113392864B (en) Model generation method, video screening method, related device and storage medium
CN113627479B (en) Graph data anomaly detection method based on semi-supervised learning
CN114782355B (en) Gastric cancer digital pathological section detection method based on improved VGG16 network
WO2023207389A1 (en) Data processing method and apparatus, program product, computer device, and medium
CN114842180B (en) Point cloud completion method, device, equipment and medium
CN112132232A (en) Medical image classification labeling method and system and server
CN116580267A (en) Defect sample generation method and device, electronic equipment and storage medium
CN108647295B (en) Image labeling method based on depth collaborative hash
CN111260659A (en) Image interactive segmentation method based on initial annotation point guidance
CN103888498A (en) Information pushing method and apparatus, terminal and server
CN113095319B (en) Multidirectional scene character detection method and device based on full convolution angular point correction network
CN114120447A (en) Behavior recognition method and system based on prototype comparison learning and storage medium
Zhang et al. Reinforcement learning cropping method based on comprehensive feature and aesthetics assessment
Lin Virtual reality and its application for producing TV programs
Oh et al. Dynamic programming approach to visual place recognition in changing environments

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant