CN106488313A - A kind of TV station symbol recognition method and system - Google Patents


Publication number
CN106488313A
Authority
CN
China
Prior art keywords
convolutional neural
neural networks
station symbol
video
array
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610931021.4A
Other languages
Chinese (zh)
Inventor
王妙辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TCL Corp
Original Assignee
TCL Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TCL Corp filed Critical TCL Corp
Priority to CN201610931021.4A priority Critical patent/CN106488313A/en
Publication of CN106488313A publication Critical patent/CN106488313A/en
Pending legal-status Critical Current

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435 Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a TV station logo recognition method and system. The method includes: after a video frame is obtained, cropping an image block group from the video frame; feeding the image block group as input parameters to the input layer of a preset convolutional neural network array; and recognizing the image block group with the convolutional neural network array to obtain the television channel corresponding to the TV station logo contained in the video frame. By recognizing the image block group with a convolutional neural network array, the invention makes training results reusable, avoids the training complexity of adding new recognition categories, and improves both network capacity and recognition accuracy.

Description

A TV station logo recognition method and system
Technical field
The present invention relates to the field of television technology, and in particular to a TV station logo recognition method and system.
Background
A television station's logo is an important mark for distinguishing one station from another. Station logo recognition is a technique for identifying the station logo in a television image, and it supplies important data for applications such as video analysis, retrieval, and statistics on user viewing behavior. Existing methods that can be used for station logo recognition fall into two classes. The first class recognizes logos from hand-designed features, such as the color, shape, or edges of the logo. These methods are easily affected by the video content and the logo shape, and their accuracy and robustness are weak. The second class is based on statistical machine learning: images containing a station logo are fed into a neural network module, and the neural network learns the feature representation automatically. These methods suffer from poor extensibility of the network model and low accuracy.
The prior art therefore leaves room for improvement.
Content of the invention
The technical problem to be solved in the present invention is, for the deficiencies in the prior art, provide a kind of TV station symbol recognition method and System is low to solve the problems, such as network model's autgmentability difference that existing TV station symbol recognition method exists and accuracy rate.
In order to solve above-mentioned technical problem, the technical solution adopted in the present invention is as follows:
A kind of TV station symbol recognition method, it includes:
After getting frame of video, intercept image block group from described frame of video;
Described image block group is input to default convolutional neural networks array input layer, wherein, described convolution as |input paramete The identification array that neutral net array is made up of with cascade structure some independent convolutional neural networks;
First order convolutional neural networks identification using described convolutional neural networks array is identified to described image block group;
If recognition result is TV station symbol, export the corresponding television channel of described TV station symbol;
If recognition result is other, using next stage convolutional neural networks, described image block group is identified, executes successively To afterbody convolutional neural networks;
If afterbody convolutional neural networks recognition result is other, judge described convolutional neural networks array None- identified institute State frame of video, and export unknown result.
In the TV station logo recognition method, cropping an image block group from the video frame after the video frame is obtained specifically includes:
receiving and parsing the video stream in the communication transmission to obtain a video frame image from the video stream;
preprocessing the video frame image, and cropping an image block at each of several preset positions of the preprocessed video frame image to form the image block group.
In the TV station logo recognition method, the convolutional neural network array being a recognition array composed of several independent convolutional neural networks in a cascade structure specifically means:
the convolutional neural network array includes several independent convolutional neural networks, each for recognizing a different class of station logo set, denoted respectively the first-stage convolutional neural network, the second-stage convolutional neural network, ..., and the n-th-stage convolutional neural network;
the independent convolutional neural networks are connected in cascade.
In the TV station logo recognition method, after determining that the convolutional neural network array cannot recognize the video frame and outputting an unknown result when the recognition result of the last-stage convolutional neural network is "other", the method may further include:
independently training, from the video frame, a new convolutional neural network for recognizing the station logo corresponding to the video frame;
cascading the new convolutional neural network after the convolutional neural network array to form a new convolutional neural network array.
In the TV station logo recognition method, the convolutional neural network includes a main convolutional neural network for recognizing station logo features and several branch convolutional neural networks for recognizing digit/text features; the branch convolutional neural networks are bridged to convolutional layers of the main convolutional neural network.
In the TV station logo recognition method, the front end and middle of the convolutional neural network consist of several convolutional layers, spatial batch normalization layers, rectified linear unit layers, and max pooling layers; its back end consists of several fully connected layers, batch normalization layers, and rectified linear unit layers.
In the TV station logo recognition method, each convolutional layer of the convolutional neural network is followed by a spatial batch normalization layer and a rectified linear unit layer.
A TV station logo recognition system, comprising:
a cropping module, configured to crop an image block group from a video frame after the video frame is obtained;
an input module, configured to feed the image block group as input parameters to the input layer of a preset convolutional neural network array, wherein the convolutional neural network array is a recognition array composed of several independent convolutional neural networks in a cascade structure;
a recognition module, configured to recognize the image block group with the first-stage convolutional neural network of the convolutional neural network array;
a first execution module, configured to output the television channel corresponding to the TV station logo when the recognition result is a TV station logo;
a second execution module, configured to, when the recognition result is "other", recognize the image block group with the next-stage convolutional neural network, repeating the recognition module, the first execution module, and the second execution module up to the last-stage convolutional neural network;
a third execution module, configured to, when the recognition result of the last-stage convolutional neural network is "other", determine that the convolutional neural network array cannot recognize the video frame, and output an unknown result.
In the TV station logo recognition system, the convolutional neural network includes a main convolutional neural network for recognizing station logo features and several branch convolutional neural networks for recognizing digit/text features; the branch convolutional neural networks are bridged to convolutional layers of the main convolutional neural network.
In the TV station logo recognition system, the front end and middle of the convolutional neural network consist of several convolutional layers, spatial batch normalization layers, rectified linear unit layers, and max pooling layers; its back end consists of several fully connected layers, batch normalization layers, and rectified linear unit layers.
Beneficial effects: compared with the prior art, the invention provides a TV station logo recognition method and system. The method includes: after a video frame is obtained, cropping an image block group from the video frame; feeding the image block group as input parameters to the input layer of a preset convolutional neural network array; and recognizing the image block group with the convolutional neural network array to obtain the television channel corresponding to the TV station logo contained in the video frame. By recognizing the image block group with a convolutional neural network array, the invention makes training results reusable, avoids the training complexity of adding new recognition categories, and improves both network capacity and recognition accuracy.
Brief description of the drawings
Fig. 1 is a flowchart of a preferred embodiment of the TV station logo recognition method provided by the present invention.
Fig. 2 is a flowchart of the convolutional neural network array recognition process in the TV station logo recognition method provided by the present invention.
Fig. 3 is a structural schematic diagram of the TV station logo recognition system provided by the present invention.
Fig. 4 is a structural schematic diagram of another embodiment of the TV station logo recognition system provided by the present invention.
Detailed description
The present invention provides a TV station logo recognition method and system. To make the purpose, technical solution, and effect of the invention clearer and more definite, the invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the invention and are not intended to limit it.
In the present invention, suffixes such as "module", "part", or "unit" used to denote elements serve only to facilitate the description of the invention and have no specific meaning in themselves. Therefore, "module", "part", and "unit" may be used interchangeably.
Terminal devices may be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, smartphones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable media players), and navigation devices, as well as fixed terminals such as digital TVs and desktop computers. However, those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the configuration according to embodiments of the present invention can also be applied to fixed terminals.
The content of the invention is further described below through the description of embodiments, with reference to the accompanying drawings.
Referring to Fig. 1 and Fig. 2, Fig. 1 is a flowchart of a preferred embodiment of the TV station logo recognition method provided by the present invention, and Fig. 2 is a flowchart of the convolutional neural network array recognition process in that method. The method includes:
S100: after a video frame is obtained, crop an image block group from the video frame;
S200: feed the image block group as input parameters to the input layer of a preset convolutional neural network array, wherein the convolutional neural network array is a recognition array composed of several independent convolutional neural networks in a cascade structure;
S300: recognize the image block group with the first-stage convolutional neural network of the convolutional neural network array;
S400: if the recognition result is a TV station logo, output the television channel corresponding to that station logo;
S500: if the recognition result is "other", recognize the image block group with the next-stage convolutional neural network, repeating steps S300 to S500 up to the last-stage convolutional neural network;
S600: if the recognition result of the last-stage convolutional neural network is "other", determine that the convolutional neural network array cannot recognize the video frame, and output an unknown result.
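The control flow of steps S300 to S600 can be sketched as a loop over the stages. This is a minimal illustration, not the patented implementation: each stage is modeled as a plain function, and the names `recognize`, `make_stub_stage`, `OTHER`, and `UNKNOWN`, as well as the string stand-ins for image blocks, are assumptions introduced here.

```python
from typing import Callable, Dict, List, Sequence

OTHER = "other"      # label a stage returns when it does not know the logo
UNKNOWN = "unknown"  # result when every stage has returned OTHER

# A stage maps an image block group to a channel name or to OTHER.
Stage = Callable[[Sequence[str]], str]


def recognize(stages: List[Stage], block_group: Sequence[str]) -> str:
    """Try each stage in cascade order and stop at the first real answer."""
    for stage in stages:
        result = stage(block_group)
        if result != OTHER:
            return result  # S400: a station logo was recognized
    return UNKNOWN  # S600: the last stage also answered OTHER


def make_stub_stage(known: Dict[str, str]) -> Stage:
    """Hypothetical stand-in for a trained CNN: a lookup on block labels."""
    def stage(block_group: Sequence[str]) -> str:
        for block in block_group:
            if block in known:
                return known[block]
        return OTHER
    return stage


cnn_a = make_stub_stage({"logo_js_sports": "Jiangsu Sports Channel"})
cnn_b = make_stub_stage({"logo_sz_ent": "Shenzhen Entertainment Channel"})
print(recognize([cnn_a, cnn_b], ["background", "logo_sz_ent"]))
```

In a real system each stub would be replaced by a trained network's forward pass; the loop itself would stay the same.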
Specifically, in step S100, the video frame is a single-frame image from the video stream received by the smart TV. The single-frame image is a two-dimensional image; that is, the video frame is one displayed image received by the smart TV. In practice, before the video frame is obtained, the method may also monitor the smart TV for channel-change instructions and, after a channel-change instruction is received, take the first video frame received as the video frame. Accordingly, the following may be included before the image block group is cropped from the video frame:
S01: while the smart TV is powered on, monitor in real time the channel-change operations the user performs on the smart TV;
S02: after a channel-change operation is detected, control the smart TV to change channel according to that operation, and obtain the first video frame received.
In this embodiment, after the video frame is obtained, an image block group must also be obtained from it. The image block group includes multiple image blocks of equal size, taken from preset positions in the video frame. That is, the range of the video image from which blocks are taken, and the positions of the image blocks within that range, are set in advance. The image range can be determined by the size of the smart TV's display screen; for example, the image range may be the same as the display screen range.
Several positions are preset within the image range, and the image block group is obtained by cropping the image at those positions. The positions have the same shape and size, so the cropped image blocks have the same shape and size. The crop positions may be anywhere in the video frame image, for example one corner, the four corners, or the center. In practice, to obtain image blocks containing the station logo quickly and accurately, the preset positions are set to the four corners and the center.
For example, cropping an image block group from the video frame after the video frame is obtained may specifically include:
S101: receive and parse the video stream in the communication transmission to obtain a video frame image from the video stream;
S102: preprocess the video frame image, and crop an image block at each of several preset positions of the preprocessed video frame image to form the image block group.
Specifically, preprocessing the video frame image means scaling, denoising, and normalizing the image. Multiple image blocks are cropped from the preprocessed video frame image, and together they form the image block group. The image blocks are sub-images cropped from multiple preset positions of the video frame image, all with the same shape and size. Each image block contains part of the content of the video frame image.
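The cropping in step S102 can be illustrated with the four corners and the center as the preset positions. The sketch below assumes the frame is already preprocessed and is represented as a list of pixel rows; the square block shape and the function names `crop` and `crop_block_group` are assumptions, not taken from the patent.

```python
def crop(frame, top, left, size):
    """Return the size x size sub-image whose top-left pixel is (top, left)."""
    return [row[left:left + size] for row in frame[top:top + size]]


def crop_block_group(frame, size):
    """Crop equal-sized blocks at the four corners and the center."""
    height, width = len(frame), len(frame[0])
    if size > height or size > width:
        raise ValueError("block size exceeds frame size")
    positions = [
        (0, 0),                                       # top-left corner
        (0, width - size),                            # top-right corner
        (height - size, 0),                           # bottom-left corner
        (height - size, width - size),                # bottom-right corner
        ((height - size) // 2, (width - size) // 2),  # center
    ]
    return [crop(frame, top, left, size) for top, left in positions]


# Toy 6 x 8 "frame" whose pixel value encodes its row and column.
frame = [[r * 10 + c for c in range(8)] for r in range(6)]
blocks = crop_block_group(frame, 2)
```

Because every block has the same shape, the group can be passed to the network input layer as one batch.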
In step S200, the convolutional neural network array may contain only one convolutional neural network, or multiple independent convolutional neural networks. When the array contains multiple independent network models, they are connected in cascade. That is, the convolutional neural network array includes several independent convolutional neural networks, each for recognizing a different class of station logo set, denoted respectively the first-stage convolutional neural network, the second-stage convolutional neural network, ..., and the n-th-stage convolutional neural network; the independent convolutional neural networks are connected in cascade.
In this embodiment, the cascade of independent network models in the convolutional neural network array can be extended according to what it needs to recognize. That is, when the array cannot recognize the current image block group, a new convolutional neural network model is trained independently and cascaded onto the existing array as the last-stage convolutional neural network. New targets can then be recognized accurately without retraining the models that have already been trained, which also solves the problem that a neural network guarantees accuracy only for a limited number of targets.
For example, suppose the preset convolutional neural network array contains only one convolutional neural network, denoted the first-stage convolutional neural network, which recognizes a first station logo set (for example, one containing multiple TV station logo types) and is denoted the class-A TV station logo model (CNN_A). When a new station logo category set needs to be recognized, a new convolutional neural network model can be trained independently on that category set and denoted the class-B TV station logo model (CNN_B). The new model CNN_B is then cascaded after the original network CNN_A. Finally, the new convolutional neural network array (containing, for example, CNN_A and CNN_B) can be used jointly for TV station logo recognition.
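The CNN_A / CNN_B example amounts to appending a stage to an ordered list while leaving earlier stages untouched. The following sketch shows that idea only; the class `CascadeArray`, the helper `lookup_stage`, and the channel names are hypothetical, and the lookup tables stand in for independently trained networks.

```python
OTHER = "other"


class CascadeArray:
    """Ordered list of independently trained stages (illustrative only)."""

    def __init__(self):
        self.stages = []  # (name, classify) pairs in cascade order

    def append_stage(self, name, classify):
        # Existing stages are not modified or retrained.
        self.stages.append((name, classify))

    def recognize(self, block_group):
        for name, classify in self.stages:
            result = classify(block_group)
            if result != OTHER:
                return name, result
        return None, "unknown"


def lookup_stage(table):
    """Hypothetical stand-in for a trained CNN: a lookup on block labels."""
    def classify(block_group):
        for block in block_group:
            if block in table:
                return table[block]
        return OTHER
    return classify


array = CascadeArray()
array.append_stage("CNN_A", lookup_stage({"logo_a1": "Channel A1"}))
before = array.recognize(["logo_b1"])  # CNN_A alone does not know this logo
array.append_stage("CNN_B", lookup_stage({"logo_b1": "Channel B1"}))
after = array.recognize(["logo_b1"])   # the appended stage recognizes it
```

Inputs that CNN_A already handled still stop at CNN_A after the extension, which is the reuse property the description emphasizes.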
In this embodiment, the front end and middle of the convolutional neural network can be any number of convolutional layers, spatial batch normalization layers, rectified linear unit layers, and max pooling layers in combination. The back end consists of any number of fully connected layers, batch normalization layers, and rectified linear unit layers.
The convolutional network learns its weight parameters by stochastic gradient descent, and the neural network is built from those weights. The initial learning rate is 0.0001 and is halved every 30 rounds of iteration. The data set is formed by random augmentation: added codec distortion of 5% to 30%, random rotation of -5 to 5 degrees, random size variation of 50% to 150%, and random left and right rotation. The convolutional network is trained on this data set. The convolutional network can of course also be trained in other ways, which are not described one by one here.
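The stated schedule and augmentation ranges can be written down directly. This is a sketch under assumptions: the halving is taken to happen at rounds 30, 60, and so on, the augmentation parameters are drawn uniformly from the stated ranges, and the function and key names are introduced here for illustration.

```python
import random


def learning_rate(round_index, initial=1e-4, halve_every=30):
    """Step schedule: start at `initial` and halve every `halve_every` rounds."""
    return initial * 0.5 ** (round_index // halve_every)


def sample_augmentation(rng):
    """Draw one set of augmentation parameters (uniform sampling is assumed)."""
    return {
        "codec_distortion": rng.uniform(0.05, 0.30),  # 5% to 30%
        "rotation_degrees": rng.uniform(-5.0, 5.0),   # -5 to 5 degrees
        "scale": rng.uniform(0.50, 1.50),             # 50% to 150% of size
    }


rng = random.Random(0)  # seeded so the sketch is repeatable
params = sample_augmentation(rng)
rates = [learning_rate(r) for r in (0, 29, 30, 60)]
```

Applying the sampled parameters to images (re-encoding, rotating, resizing) is left out, since the description does not specify how each distortion is produced.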
Further, the convolutional neural network contains one main-branch convolutional network and two sub-branch convolutional networks, denoted sub-branch convolutional neural network I and sub-branch convolutional neural network II. The two sub-branch convolutional networks can be bridged at any convolutional layer of the main network. The main-branch convolutional neural network is responsible for recognizing the features of the station logo body, and the two sub-branch convolutional neural networks are responsible for recognizing digit features and/or text features; for example, sub-branch convolutional neural network I recognizes digit features and sub-branch convolutional neural network II recognizes text. During station logo recognition, the main-branch network and the two sub-branch networks each produce a recognition result, and the three results are then combined to select the output. Because the main network and the sub-networks share the parameters of the low-level feature space, the size and complexity of the network model are reduced.
In this embodiment, the convolutional neural network can include:
Layer 1: convolutional layer, 3x3 kernel, stride 2;
Layer 2: convolutional layer, 3x3 kernel, stride 1;
Layer 3: max pooling layer, pooling interval 2;
Layer 4: convolutional layer, 3x3 kernel, stride 1;
Sub-network structure I:
Layer 4: convolutional layer, 3x3 kernel, stride 1;
Layer 5: fully connected layer;
Layer 6: fully connected layer (end of sub-network I).
Layer 5: max pooling layer, pooling interval 2;
Layer 6: convolutional layer, 3x3 kernel, stride 1;
Sub-network structure II:
Layer 6: convolutional layer, 3x3 kernel, stride 1;
Layer 7: fully connected layer (end of sub-network II).
Layer 7: max pooling layer, pooling interval 2;
Layer 8: convolutional layer, 3x3 kernel, stride 1;
Layer 9: max pooling layer, pooling interval 2;
Layer 10: convolutional layer, 3x3 kernel, stride 1;
Layer 11: max pooling layer, pooling interval 2;
Layer 12: fully connected layer;
Layer 13: fully connected layer (end of main network).
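To see how the main network's layers shrink the feature map, the listed convolution and pooling layers can be traced with the usual output-size formulas. The description gives no padding, pooling stride, or input size, so the padding of 1, the pooling stride equal to the pooling interval, and the 128-pixel input below are all assumptions made for illustration.

```python
# Main-network layers 1 to 11 as (kind, stride or pooling interval).
MAIN_TRUNK = [
    ("conv", 2), ("conv", 1), ("pool", 2),
    ("conv", 1), ("pool", 2),
    ("conv", 1), ("pool", 2),
    ("conv", 1), ("pool", 2),
    ("conv", 1), ("pool", 2),
]


def conv_out(size, stride, kernel=3, padding=1):
    """Spatial output size of a convolution (padding of 1 is assumed)."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, window):
    """Output size of non-overlapping max pooling (stride == window assumed)."""
    return size // window


def trace_sizes(input_size, layers=MAIN_TRUNK):
    """Return the feature-map side length before and after every layer."""
    sizes = [input_size]
    for kind, value in layers:
        current = sizes[-1]
        if kind == "conv":
            sizes.append(conv_out(current, value))
        else:
            sizes.append(pool_out(current, value))
    return sizes


sizes = trace_sizes(128)  # hypothetical 128 x 128 input block
```

Under these assumptions the stride-2 convolution and the five pooling layers together reduce each side by a factor of 64 before the fully connected layers.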
In the convolutional network, each convolutional layer is followed by a spatial batch normalization layer and a rectified linear unit layer. Likewise, each fully connected layer is followed by a normalization layer and a rectified linear unit layer. A convolutional layer transforms the input image or image features, through a series of linear transformations, into new image features. Spatial batch normalization normalizes the data distribution and thereby speeds up training. The rectified linear unit transforms its input in a way that approximates the response of human vision. The pooling layer maps multiple input values to one output value. The upper layers of the convolutional neural network have network structures and feature-space parameters specific to the attributes of the targets being handled, which helps ensure the accuracy of the network model and improves the capacity and recognition rate of the network.
In step S300, recognizing the image block group with the convolutional neural network array means that the array recognizes and detects the image block group and, when it recognizes a station logo, determines the channel information corresponding to that logo. The channel information includes the channel name, for example BTV or Shenzhen Sports.
To further illustrate how the convolutional neural network array recognizes images, several specific embodiments are described below.
Embodiment one
The convolutional neural network array is a cascade of convolutional neural network groups, in which each group consists of just one multi-output, multi-task convolutional neural network. The first-stage convolutional neural network group recognizes an initial group of station logos. The second-stage neural network recognizes only the newly added station logo group, and so on. The recognition process is as follows. Recognition is first performed with the first-stage convolutional neural network group. If the result is "other", the process moves to the next-stage convolutional neural network group; if the result is a station logo, the corresponding television channel is output. At the second stage, if the result is "other", "other" is output; if the result is a station logo, the corresponding television channel is output. At each stage there is only one convolutional neural network, and its result is a television channel name or "other"; this result is used directly as the recognition result of that convolutional neural network group. For example, "Jiangsu Sports Channel" is one of the television channels in the first batch of training data, and "Shenzhen Entertainment Channel" is one of the television channels in the second batch of training data. A test image containing the Jiangsu Sports Channel logo is first recognized with the first-stage convolutional neural network group. Its convolutional neural network returns "Jiangsu Sports Channel", so "Jiangsu Sports Channel" is output. A test image containing the Shenzhen Entertainment Channel logo is first recognized with the first-stage convolutional neural network group, whose convolutional neural network returns "other". It is then recognized with the second-stage convolutional neural network, which returns "Shenzhen Entertainment Channel", so "Shenzhen Entertainment Channel" is output.
Embodiment two
Television channel identification is carried out with cascaded convolutional neural network groups. Each level of the cascade is a group of two independent single-task, multi-output convolutional neural networks: the first network identifies the main station symbol part, and the second identifies the sub station symbol part. The first-level group identifies the initially specified set of station symbols; the second-level group identifies only the set of station symbols newly included afterwards, and so on. The identification process is as follows. Identification is first performed with the first-level group. If the recognition result is "other", the image is passed to the next-level group; if the result is a station symbol, the corresponding television channel is output. At the second level, a result of "other" is output as "other", and a station symbol result yields the corresponding television channel. Within a group, the first network outputs a television station name or "other". If it outputs "other", the recognition result of the whole group is "other"; if it outputs a television station, the second network is run, and its recognition result is a channel description, a channel number, or "other". Finally, the results of the two networks are combined to obtain the full television channel name.
For example, "Jiangsu sports channel" is one of the television channels in the first batch of training data, and "Shenzhen entertainment channel" is one of the channels in the second batch of training data. A test image containing the Jiangsu sports channel station symbol is first identified with the first-level group. Its first network returns "Jiangsu TV" and its second network returns "sports". Combining the two gives the first-level result "Jiangsu sports channel", which is then output. A test image containing the Shenzhen entertainment channel station symbol is also first identified with the first-level group. There the first network returns "other", so the image is passed to the second-level group. In the second-level group, the first network returns "Shenzhen TV" and the second network returns "entertainment". Combining the two gives the second-level result "Shenzhen entertainment channel".
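The two-level walk-through above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions, not the implementation described by the patent: the trained networks are replaced by hypothetical dictionary lookups keyed by a test image name, and `CHANNEL_NAMES`, `make_net`, `recognize_group`, and `recognize_cascade` are names invented for this sketch. How a sub-network result of "other" should be handled is not stated in the text, so the sketch simply treats it as "other".

```python
OTHER = "other"

# Hypothetical table that combines a main and a sub station symbol result
# into a full channel name (the text does not specify this format).
CHANNEL_NAMES = {
    ("Jiangsu TV", "sports"): "Jiangsu sports channel",
    ("Shenzhen TV", "entertainment"): "Shenzhen entertainment channel",
}


def make_net(known):
    """Stand-in for a trained network: look up a label, else OTHER."""
    return lambda image: known.get(image, OTHER)


def recognize_group(main_net, sub_net, image):
    """Run one network group: main station symbol first, then sub symbol."""
    station = main_net(image)
    if station == OTHER:
        return OTHER  # the sub network is not consulted
    sub = sub_net(image)
    # Assumption: an unknown (station, sub) pair is reported as OTHER.
    return CHANNEL_NAMES.get((station, sub), OTHER)


def recognize_cascade(groups, image):
    """Try each level in order; fall through to the next level on OTHER."""
    for main_net, sub_net in groups:
        result = recognize_group(main_net, sub_net, image)
        if result != OTHER:
            return result
    return OTHER


# Level 1 knows the first batch of station symbols, level 2 the second batch.
level1 = (make_net({"jiangsu.png": "Jiangsu TV"}),
          make_net({"jiangsu.png": "sports"}))
level2 = (make_net({"shenzhen.png": "Shenzhen TV"}),
          make_net({"shenzhen.png": "entertainment"}))
groups = [level1, level2]

print(recognize_cascade(groups, "jiangsu.png"))
print(recognize_cascade(groups, "shenzhen.png"))
```

Because each level is consulted only when every earlier level returns "other", a later level can be trained on new station symbols without touching the earlier ones.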
Embodiment three
The convolutional neural network array comprises two independent single-task, multi-output convolutional neural networks: the first network identifies the main station symbol part, and the second identifies the sub station symbol part. When newly included station symbol data arrive, the old and new station symbols are pooled and the networks are retrained. For example, "Jiangsu sports channel" is one of the television channels in the first batch of training data, and "Shenzhen entertainment channel" is one of the channels in the second batch of training data. "Jiangsu sports channel" is identified with the convolutional neural network group: the first network returns "Jiangsu TV" and the second network returns "sports". Combining the two gives the group result "Jiangsu sports channel", which is then output. "Shenzhen entertainment channel" is identified with the same group: the first network returns "Shenzhen TV" and the second network returns "entertainment". Combining the two gives the group result "Shenzhen entertainment channel".
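The data side of this retraining approach can be illustrated with a short Python sketch. It only shows how old and new samples might be pooled and the label sets of the two networks rebuilt before retraining; the sample format (file name, main label, sub label), the file names, and the function name `merge_training_sets` are assumptions made for this illustration, and the training step itself is left out.

```python
def merge_training_sets(old_samples, new_samples):
    """Pool old and new samples and rebuild both label vocabularies.

    Each sample is a hypothetical (image_name, main_label, sub_label) tuple.
    """
    samples = list(old_samples) + list(new_samples)
    # Output classes of the main station symbol network.
    main_labels = sorted({main for _, main, _ in samples})
    # Output classes of the sub station symbol network.
    sub_labels = sorted({sub for _, _, sub in samples})
    return samples, main_labels, sub_labels


first_batch = [("jiangsu_001.png", "Jiangsu TV", "sports")]
second_batch = [("shenzhen_001.png", "Shenzhen TV", "entertainment")]

samples, main_labels, sub_labels = merge_training_sets(first_batch, second_batch)
print(main_labels)
print(sub_labels)
```

In contrast to the cascade, both networks here see every station symbol, so their output layers grow whenever new labels are added and both must be retrained.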
The present invention also provides a TV station symbol recognition system. As shown in Fig. 3, it includes:
Interception module 100, configured to intercept an image block group from a video frame after the video frame is obtained;
Input module 200, configured to feed the image block group, as the input parameter, to the input layer of a preset convolutional neural network array, wherein the convolutional neural network array is a recognition array formed by several independent convolutional neural networks in a cascade structure;
Identification module 300, configured to identify the image block group using the first-level convolutional neural network of the convolutional neural network array;
First performing module 400, configured to output the television channel corresponding to the TV station symbol when the recognition result is a TV station symbol;
Second performing module 500, configured to, when the recognition result is "other", identify the image block group using the next-level convolutional neural network, repeating the identification module, the first performing module, and the second performing module until the last-level convolutional neural network is reached;
Third performing module 600, configured to, when the recognition result of the last-level convolutional neural network is "other", determine that the convolutional neural network array cannot identify the video frame, and output an unknown result.
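The control flow shared by modules 300 to 600 can be sketched as a small Python class. This is a hedged illustration rather than the patented implementation: `CascadeArray`, `append_stage`, and the constants are hypothetical names, each stage is any callable that returns a channel name or "other", and `append_stage` reflects the idea, stated elsewhere in the document, that a newly trained network is cascaded after the existing array.

```python
OTHER = "other"
UNKNOWN = "unknown"


class CascadeArray:
    """Ordered list of independent recognizers tried one after another."""

    def __init__(self, stages=None):
        self.stages = list(stages or [])

    def append_stage(self, stage):
        # A newly trained network is cascaded after the existing ones,
        # so the earlier stages are left untouched.
        self.stages.append(stage)

    def recognize(self, blocks):
        """Return (channel, level) or (UNKNOWN, None) if every stage fails."""
        for level, stage in enumerate(self.stages, start=1):
            result = stage(blocks)
            if result != OTHER:
                return result, level  # first/second performing module
        return UNKNOWN, None  # third performing module


# Hypothetical stand-ins for trained stages.
def first_stage(blocks):
    return "Jiangsu sports channel" if blocks == "A" else OTHER


def second_stage(blocks):
    return "Shenzhen entertainment channel" if blocks == "B" else OTHER


array = CascadeArray([first_stage])
print(array.recognize("B"))   # not yet recognizable
array.append_stage(second_stage)
print(array.recognize("B"))   # recognized by the appended stage
```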
In the TV station symbol recognition system, each convolutional neural network includes a main convolutional neural network for identifying station symbol features and several branch convolutional neural networks for identifying digit/character features; the branch convolutional neural networks are bridged to a convolutional layer of the main convolutional neural network.
In the TV station symbol recognition system, the front end and middle of each convolutional neural network consist of several convolutional layers, spatial batch normalization layers, rectified linear unit (ReLU) layers, and max pooling layers; the back end consists of several fully connected layers, batch normalization layers, and rectified linear unit layers.
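The layer ordering described above can be written down as a simple layer plan. The following Python sketch only enumerates layer types in order; it assumes one spatial batch normalization layer and one ReLU layer after every convolutional layer, and the block counts, the `pool_every` parameter, and the layer names are hypothetical choices made for this illustration, since the text gives no concrete sizes.

```python
def build_layer_plan(num_conv_blocks, num_fc_blocks, pool_every=1):
    """Return the ordered list of layer types for one network.

    Front end and middle: conv -> spatial batch norm -> ReLU, with a max
    pooling layer after every `pool_every` convolutional blocks.
    Back end: fully connected -> batch norm -> ReLU.
    """
    plan = []
    for i in range(1, num_conv_blocks + 1):
        plan += ["conv", "spatial_batch_norm", "relu"]
        if i % pool_every == 0:
            plan.append("max_pool")
    for _ in range(num_fc_blocks):
        plan += ["fully_connected", "batch_norm", "relu"]
    return plan


for layer in build_layer_plan(num_conv_blocks=2, num_fc_blocks=1, pool_every=2):
    print(layer)
```

A real model would follow this plan with an output layer sized to the number of station symbol classes plus the "other" class; that detail is not specified in the text and is omitted here.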
In another embodiment of the present invention, as shown in Fig. 4, the system includes a smart TV 1000 and a server 2000;
The smart TV 1000 includes an interception module 1001 and a sending module 1002;
The interception module 1001 is configured to intercept an image block group from a video frame after the video frame is obtained;
The sending module 1002 is configured to send the image block group to the server;
The server 2000 includes a receiving module 2001, an input module 2002, and an identification module 2003;
The receiving module 2001 is configured to receive the image block group sent by the smart TV;
The input module 2002 is configured to feed the image block group, as the input parameter, to the input layer of the preset convolutional neural network array;
The identification module 2003 is configured to identify the image block group using the convolutional neural network array, so as to obtain the television channel corresponding to the TV station symbol contained in the video frame.
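The division of work between the smart TV and the server can be sketched as two functions that exchange a serialized message. This is an assumption-laden illustration: the text does not specify a transport or message format, so JSON is used here purely for convenience, and `tv_build_request`, `server_handle_request`, and the field names are hypothetical. The `recognize` argument stands in for the convolutional neural network array on the server.

```python
import json


def tv_build_request(frame_id, blocks):
    """Smart TV side: package the cropped image block group for sending."""
    return json.dumps({"frame_id": frame_id, "blocks": blocks})


def server_handle_request(payload, recognize):
    """Server side: receive the blocks, run recognition, return the channel."""
    request = json.loads(payload)
    channel = recognize(request["blocks"])
    return json.dumps({"frame_id": request["frame_id"], "channel": channel})


# Hypothetical recognizer used only to exercise the round trip.
def fake_recognize(blocks):
    return "Jiangsu sports channel" if blocks else "unknown"


payload = tv_build_request(7, [[[0, 1], [2, 3]]])
print(server_handle_request(payload, fake_recognize))
```

Keeping only the cropping step on the smart TV means that only the small image block group, not the full frame, has to be sent, and the networks can be updated on the server without changing the TV software.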
In the TV station symbol recognition system, the interception module specifically includes:
Acquiring unit, configured to receive and parse the video stream being transmitted, and obtain the video frame images in the video stream;
Interception unit, configured to preprocess the video frame image and intercept image blocks at several preset positions of the preprocessed video frame image to form the image block group.
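The interception unit can be illustrated with a small pure-Python sketch that treats a frame as a list of pixel rows. The text does not say which preprocessing is applied or where the preset positions are, so two assumptions are made and labeled in the code: preprocessing is reduced to scaling 8-bit values to the range 0 to 1, and the preset positions are the four corners of the frame. The function names are hypothetical.

```python
def preprocess(frame):
    """Assumed preprocessing: scale 8-bit pixel values to [0, 1]."""
    return [[pixel / 255.0 for pixel in row] for row in frame]


def crop(frame, top, left, height, width):
    """Cut a height x width block whose top-left corner is (top, left)."""
    return [row[left:left + width] for row in frame[top:top + height]]


def intercept_block_group(frame, block_h, block_w):
    """Crop one block at each preset position (assumed: the four corners)."""
    rows, cols = len(frame), len(frame[0])
    positions = [
        (0, 0),                                # top-left
        (0, cols - block_w),                   # top-right
        (rows - block_h, 0),                   # bottom-left
        (rows - block_h, cols - block_w),      # bottom-right
    ]
    return [crop(frame, top, left, block_h, block_w) for top, left in positions]


# A tiny 4x4 "frame" whose pixel value encodes its position.
frame = [[r * 4 + c for c in range(4)] for r in range(4)]
block_group = intercept_block_group(preprocess(frame), 2, 2)
print(len(block_group))
```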
In the TV station symbol recognition system, the preset convolutional neural network array is a recognition array formed by several independent convolutional neural networks in a cascade structure.
In the TV station symbol recognition system, the identification module specifically includes:
First recognition unit, configured to identify the image block group using the first-level convolutional neural network;
First output unit, configured to output the television channel corresponding to the TV station symbol when the recognition result is a TV station symbol;
Second recognition unit, configured to, when the recognition result is "other", identify the image block group using the next-level convolutional neural network, proceeding level by level to the last-level convolutional neural network;
Second output unit, configured to output an unknown result when the recognition result of the last-level convolutional neural network is "other".
In the TV station symbol recognition system, each convolutional neural network includes a main-branch convolutional neural network for identifying station symbol features and several sub-branch convolutional neural networks for identifying digit/character features; the sub-branch convolutional neural networks are bridged to a convolutional layer of the main-branch convolutional neural network.
Each module of the above TV station symbol recognition system has been described in detail in the method above and is not described again here.
It should be understood that the system and method disclosed in the embodiments provided by the present invention may be implemented in other ways. For example, the device embodiments described above are merely illustrative; the division into modules is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
An integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to execute some of the steps of the methods described in the embodiments of the present invention. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments, or make equivalent replacements for some of the technical features; such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A TV station symbol recognition method, characterized in that it includes:
After a video frame is obtained, intercepting an image block group from the video frame;
Feeding the image block group, as the input parameter, to the input layer of a preset convolutional neural network array, wherein the convolutional neural network array is a recognition array formed by several independent convolutional neural networks in a cascade structure;
Identifying the image block group using the first-level convolutional neural network of the convolutional neural network array;
If the recognition result is a TV station symbol, outputting the television channel corresponding to the TV station symbol;
If the recognition result is "other", identifying the image block group using the next-level convolutional neural network, proceeding level by level to the last-level convolutional neural network;
If the recognition result of the last-level convolutional neural network is "other", determining that the convolutional neural network array cannot identify the video frame, and outputting an unknown result.
2. The TV station symbol recognition method according to claim 1, characterized in that intercepting an image block group from the video frame after the video frame is obtained specifically includes:
Receiving and parsing the video stream being transmitted, and obtaining the video frame images in the video stream;
Preprocessing the video frame image, and intercepting image blocks at several preset positions of the preprocessed video frame image to form the image block group.
3. The TV station symbol recognition method according to claim 1, characterized in that the convolutional neural network array being a recognition array formed by several independent convolutional neural networks in a cascade structure is specifically:
The convolutional neural network array includes several independent convolutional neural networks for identifying different sets of station symbols, denoted respectively as the first-level convolutional neural network, the second-level convolutional neural network, ..., the n-th-level convolutional neural network;
The several independent convolutional neural networks are connected in cascade.
4. The TV station symbol recognition method according to claim 1, characterized in that, after determining that the convolutional neural network array cannot identify the video frame and outputting an unknown result if the recognition result of the last-level convolutional neural network is "other", the method may further include:
Independently training, according to the video frame, a new convolutional neural network for identifying the station symbol corresponding to the video frame;
Cascading the new convolutional neural network after the convolutional neural network array to form a new convolutional neural network array.
5. The TV station symbol recognition method according to any one of claims 1-4, characterized in that each convolutional neural network includes a main convolutional neural network for identifying station symbol features and several branch convolutional neural networks for identifying digit/character features; the branch convolutional neural networks are bridged to a convolutional layer of the main convolutional neural network.
6. The TV station symbol recognition method according to claim 5, characterized in that the front end and middle of each convolutional neural network consist of several convolutional layers, spatial batch normalization layers, rectified linear unit (ReLU) layers, and max pooling layers; the back end consists of several fully connected layers, batch normalization layers, and rectified linear unit layers.
7. The TV station symbol recognition method according to claim 6, characterized in that each convolutional layer of the convolutional neural network is followed by a spatial batch normalization layer and a rectified linear unit layer.
8. A TV station symbol recognition system, characterized in that it includes:
Interception module, configured to intercept an image block group from a video frame after the video frame is obtained;
Input module, configured to feed the image block group, as the input parameter, to the input layer of a preset convolutional neural network array, wherein the convolutional neural network array is a recognition array formed by several independent convolutional neural networks in a cascade structure;
Identification module, configured to identify the image block group using the first-level convolutional neural network of the convolutional neural network array;
First performing module, configured to output the television channel corresponding to the TV station symbol when the recognition result is a TV station symbol;
Second performing module, configured to, when the recognition result is "other", identify the image block group using the next-level convolutional neural network, repeating the identification module, the first performing module, and the second performing module until the last-level convolutional neural network is reached;
Third performing module, configured to, when the recognition result of the last-level convolutional neural network is "other", determine that the convolutional neural network array cannot identify the video frame, and output an unknown result.
9. The TV station symbol recognition system according to claim 8, characterized in that each convolutional neural network includes a main convolutional neural network for identifying station symbol features and several branch convolutional neural networks for identifying digit/character features; the branch convolutional neural networks are bridged to a convolutional layer of the main convolutional neural network.
10. The TV station symbol recognition system according to claim 9, characterized in that the front end and middle of each convolutional neural network consist of several convolutional layers, spatial batch normalization layers, rectified linear unit (ReLU) layers, and max pooling layers; the back end consists of several fully connected layers, batch normalization layers, and rectified linear unit layers.
CN201610931021.4A 2016-10-31 2016-10-31 A kind of TV station symbol recognition method and system Pending CN106488313A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610931021.4A CN106488313A (en) 2016-10-31 2016-10-31 A kind of TV station symbol recognition method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610931021.4A CN106488313A (en) 2016-10-31 2016-10-31 A kind of TV station symbol recognition method and system

Publications (1)

Publication Number Publication Date
CN106488313A true CN106488313A (en) 2017-03-08

Family

ID=58272753

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610931021.4A Pending CN106488313A (en) 2016-10-31 2016-10-31 A kind of TV station symbol recognition method and system

Country Status (1)

Country Link
CN (1) CN106488313A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107563434A (en) * 2017-08-30 2018-01-09 山东大学 A kind of brain MRI image sorting technique based on Three dimensional convolution neutral net, device
CN108009637A (en) * 2017-11-20 2018-05-08 天津大学 The station symbol dividing method of Pixel-level TV station symbol recognition network based on cross-layer feature extraction
CN108509953A (en) * 2018-04-12 2018-09-07 南京烽火天地通信科技有限公司 A kind of TV station symbol detection recognition method
CN108734667A (en) * 2017-04-14 2018-11-02 Tcl集团股份有限公司 A kind of image processing method and system
CN109117768A (en) * 2018-07-30 2019-01-01 上海科江电子信息技术有限公司 A kind of TV station symbol recognition method based on deep learning
CN110147462A (en) * 2019-05-20 2019-08-20 新联智慧信息技术(深圳)有限公司 The verification method and Related product of the short-sighted frequency of religion
CN110188673A (en) * 2019-05-29 2019-08-30 京东方科技集团股份有限公司 Expression recognition method and device
CN110287888A (en) * 2019-06-26 2019-09-27 中科软科技股份有限公司 A kind of TV station symbol recognition method and system
CN110309833A (en) * 2018-03-20 2019-10-08 国家新闻出版广电总局广播电视规划院 One kind knowing method for distinguishing, equipment and computer readable storage medium for image identification
CN110543884A (en) * 2018-05-29 2019-12-06 国际关系学院 network attack organization tracing method based on image
CN110610230A (en) * 2019-07-31 2019-12-24 国家计算机网络与信息安全管理中心 Station caption detection method and device and readable storage medium
CN111008738A (en) * 2019-12-04 2020-04-14 云南锡业集团(控股)有限责任公司研发中心 Sn-Bi alloy elongation and tensile strength prediction method based on multi-mode deep learning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103607603A (en) * 2013-11-28 2014-02-26 Tcl集团股份有限公司 Method and device for identifying station caption
CN103679185A (en) * 2012-08-31 2014-03-26 富士通株式会社 Convolutional neural network classifier system as well as training method, classifying method and application thereof
CN103902987A (en) * 2014-04-17 2014-07-02 福州大学 Station caption identifying method based on convolutional network
CN105956626A (en) * 2016-05-12 2016-09-21 成都新舟锐视科技有限公司 Deep learning based vehicle license plate position insensitive vehicle license plate recognition method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
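Xu Zili et al.: "Station symbol recognition based on progressive convolutional neural networks and its parallelization", 《电视技术》 (Video Engineering) *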

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108734667A (en) * 2017-04-14 2018-11-02 Tcl集团股份有限公司 A kind of image processing method and system
CN107563434A (en) * 2017-08-30 2018-01-09 山东大学 A kind of brain MRI image sorting technique based on Three dimensional convolution neutral net, device
CN108009637A (en) * 2017-11-20 2018-05-08 天津大学 The station symbol dividing method of Pixel-level TV station symbol recognition network based on cross-layer feature extraction
CN108009637B (en) * 2017-11-20 2021-06-25 天津大学 Station caption segmentation method of pixel-level station caption identification network based on cross-layer feature extraction
CN110309833A (en) * 2018-03-20 2019-10-08 国家新闻出版广电总局广播电视规划院 One kind knowing method for distinguishing, equipment and computer readable storage medium for image identification
CN110309833B (en) * 2018-03-20 2021-03-05 国家新闻出版广电总局广播电视规划院 Method, equipment and computer-readable storage medium for image identification recognition
CN108509953A (en) * 2018-04-12 2018-09-07 南京烽火天地通信科技有限公司 A kind of TV station symbol detection recognition method
CN110543884A (en) * 2018-05-29 2019-12-06 国际关系学院 network attack organization tracing method based on image
CN109117768A (en) * 2018-07-30 2019-01-01 上海科江电子信息技术有限公司 A kind of TV station symbol recognition method based on deep learning
CN110147462A (en) * 2019-05-20 2019-08-20 新联智慧信息技术(深圳)有限公司 The verification method and Related product of the short-sighted frequency of religion
CN110188673A (en) * 2019-05-29 2019-08-30 京东方科技集团股份有限公司 Expression recognition method and device
CN110188673B (en) * 2019-05-29 2021-07-30 京东方科技集团股份有限公司 Expression recognition method and device
CN110287888A (en) * 2019-06-26 2019-09-27 中科软科技股份有限公司 A kind of TV station symbol recognition method and system
CN110610230A (en) * 2019-07-31 2019-12-24 国家计算机网络与信息安全管理中心 Station caption detection method and device and readable storage medium
CN111008738A (en) * 2019-12-04 2020-04-14 云南锡业集团(控股)有限责任公司研发中心 Sn-Bi alloy elongation and tensile strength prediction method based on multi-mode deep learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170308

RJ01 Rejection of invention patent application after publication