CN106488313A

CN106488313A - A kind of TV station symbol recognition method and system

Info

Publication number: CN106488313A
Application number: CN201610931021.4A
Authority: CN
Inventors: 王妙辉
Original assignee: TCL Corp
Current assignee: TCL Corp
Priority date: 2016-10-31
Filing date: 2016-10-31
Publication date: 2017-03-08

Abstract

The invention discloses a kind of TV station symbol recognition method and system, methods described includes：After getting frame of video, intercept image block group from described frame of video；Described image block group is input to the input layer of default convolutional neural networks array as |input paramete；Using described convolutional neural networks array, described image block group is identified, to obtain the corresponding television channel of TV station symbol that described frame of video comprises.The present invention is identified the reusability it is achieved that training result by convolutional neural networks array to image block group, it is to avoid training complexity during newly-increased identification species, improves network capacity and recognition accuracy simultaneously.

Description

A kind of TV station symbol recognition method and system

Technical field

The present invention relates to TV technology, particularly to a kind of TV station symbol recognition method and system.

Background technology

The station symbol of television station is to discriminate between the important mark of different television stations, and TV station symbol recognition technology is to the platform in TV image Mark a kind of technology being identified, TV station symbol recognition technology is that the application such as video analysis, retrieval and user watched behavioral statisticses carries Supply important data message.The existing station identification method for distinguishing that can be used for mainly includes two class methods.First kind method is base The method being identified in the feature of engineer, e.g., the characteristic information such as edge of station symbol color, station symbol shape or station symbol. Such method is easily affected by video content and station symbol shape, there is accuracy and robustness is not strong.Equations of The Second Kind method is base In the method for statistical machine learning, the image containing station symbol is sent into neural network module, is automatically learned by neutral net by it Practise feature representation, it has that network model's autgmentability difference and accuracy rate are low.

Thus prior art could be improved and improves.

Content of the invention

The technical problem to be solved in the present invention is, for the deficiencies in the prior art, provide a kind of TV station symbol recognition method and System is low to solve the problems, such as network model's autgmentability difference that existing TV station symbol recognition method exists and accuracy rate.

In order to solve above-mentioned technical problem, the technical solution adopted in the present invention is as follows：

A kind of TV station symbol recognition method, it includes：

After getting frame of video, intercept image block group from described frame of video；

Described image block group is input to default convolutional neural networks array input layer, wherein, described convolution as |input paramete The identification array that neutral net array is made up of with cascade structure some independent convolutional neural networks；

First order convolutional neural networks identification using described convolutional neural networks array is identified to described image block group；

If recognition result is TV station symbol, export the corresponding television channel of described TV station symbol；

If recognition result is other, using next stage convolutional neural networks, described image block group is identified, executes successively To afterbody convolutional neural networks；

If afterbody convolutional neural networks recognition result is other, judge described convolutional neural networks array None- identified institute State frame of video, and export unknown result.

Described TV station symbol recognition method, wherein, described after getting frame of video, from described frame of video intercept image block group Specifically include：

Receive and parse through the video flowing in communications, obtain the video frame images in described video flowing；

Described video frame images are carried out with pretreatment, and some predeterminated positions from pretreatment rear video two field picture intercept figure respectively As block is to form image block group.

Described TV station symbol recognition method, wherein, described convolutional neural networks array by some independent convolutional neural networks with The identification array of cascade structure composition is specially：

Described convolutional neural networks array includes some independent convolutional neural networks for identifying inhomogeneity station symbol collection, remembers respectively For first order convolutional neural networks, second level convolutional neural networks ...., n-th grade of convolutional neural networks；

Connected by cascade system between described some independent convolutional neural networks.

Described TV station symbol recognition method, wherein, if described afterbody convolutional neural networks recognition result is other, judges Frame of video described in described convolutional neural networks array None- identified, and can also include after exporting unknown result：

It is used for identifying the new convolutional neural networks of the corresponding station symbol of described frame of video according to described frame of video stand-alone training；

After described new convolutional neural networks level is coupled to described convolutional neural networks array, to form new convolutional Neural net Network array.

Described TV station symbol recognition method, wherein, described convolutional neural networks include amassing god for the master file identifying station symbol feature Through network and some branch's convolutional neural networks for identifying numeral/character features；Described branch convolutional neural networks bridge It is connected to the convolutional layer of main convolutional neural networks.

Described TV station symbol recognition method, wherein, by some convolutional layers, space is criticized for the front end of described convolutional neural networks and middle-end Amount regularization layer, corrects linear unit layer and maximum pond layer composition；Its rear end by some full articulamentums, batch regularization layer and Correct linear unit layer composition.

Described TV station symbol recognition method, wherein, all includes space batch after each convolutional layer of described convolutional neural networks Regularization layer and correction linear unit layer.

A kind of TV station symbol recognition system, it includes：

Interception module, for, after getting frame of video, intercepting image block group from described frame of video；

Input module, for described image block group is input to default convolutional neural networks array input layer as |input paramete, Wherein, the identification array that described convolutional neural networks array is made up of with cascade structure some independent convolutional neural networks；

Identification module, identifies to described image block for the first order convolutional neural networks using described convolutional neural networks array Group is identified；

First performing module, for being television station's timestamp when recognition result, exports the corresponding television channel of described TV station symbol；

Second performing module, for being other when recognition result, is entered to described image block group using next stage convolutional neural networks Row identification, repeats identification module, the first performing module and the second performing module up to afterbody convolutional neural networks；

3rd performing module, for when afterbody convolutional neural networks recognition result is other, judging described convolution Frame of video described in neutral net array None- identified, and export unknown result.

Described TV station symbol recognition system, wherein, described convolutional neural networks include amassing god for the master file identifying station symbol feature Through network and some branch's convolutional neural networks for identifying numeral/character features；Described branch convolutional neural networks bridge It is connected to the convolutional layer of main convolutional neural networks.

Described TV station symbol recognition system, wherein, by some convolutional layers, space is criticized for the front end of described convolutional neural networks and middle-end Amount regularization layer, corrects linear unit layer and maximum pond layer composition；Its rear end by some full articulamentums, batch regularization layer and Correct linear unit layer composition.

Beneficial effect：Compared with prior art, the invention provides a kind of TV station symbol recognition method and system, methods described bag Include：After getting frame of video, intercept image block group from described frame of video；Described image block group is inputted as |input paramete Input layer to default convolutional neural networks array；Using described convolutional neural networks array, described image block group is known Not, to obtain the corresponding television channel of TV station symbol that described frame of video comprises.The present invention passes through convolutional neural networks array pair Image block group is identified the reusability it is achieved that training result, it is to avoid training complexity during newly-increased identification species, with When improve network capacity and recognition accuracy.

Brief description

The flow chart that Fig. 1 is preferably implemented for the TV station symbol recognition method that the present invention provides.

Convolutional neural networks array identification process figure in the TV station symbol recognition method that Fig. 2 provides for the present invention.

The structure principle chart of the TV station symbol recognition method system that Fig. 3 provides for the present invention.

The structure principle chart of another embodiment of TV station symbol recognition method system that Fig. 4 provides for the present invention.

Specific embodiment

The present invention provides a kind of TV station symbol recognition method and system, for making the purpose of the present invention, technical scheme and effect more Clear, clear and definite, the present invention is described in more detail for the embodiment that develops simultaneously referring to the drawings.It should be appreciated that it is described herein Specific embodiment only in order to explain the present invention, is not intended to limit the present invention.

In the present invention, using such as " module ", " part " or " unit " for representing element suffix only for favourable In the explanation of the present invention, itself does not have specific meaning.Therefore, module ", " part " or " unit " can mixedly make With.

Terminal unit can be implemented in a variety of manners.For example, the terminal described in the present invention can include such as moving Phone, smart phone, notebook computer, digit broadcasting receiver, PDA (personal digital assistant), PAD (panel computer), PMP The mobile terminal of (portable media player), guider etc. and such as numeral TV, desk computer etc. consolidate Determine terminal.However, it will be understood by those skilled in the art that, in addition to being used in particular for the element of mobile purpose, according to this The construction of bright embodiment can also apply to the terminal of fixed type.

Below in conjunction with the accompanying drawings, by the description to embodiment, content of the invention is described further.

Refer to shown in Fig. 1 and Fig. 2, the flow chart of the preferred embodiment of the TV station symbol recognition method that Fig. 1 provides for the present invention, Convolutional neural networks array identification process figure in the TV station symbol recognition method that Fig. 2 provides for the present invention.Methods described includes：

S100, after getting frame of video, from described frame of video intercept image block group；

S200, described image block group is input to default convolutional neural networks array input layer as |input paramete, wherein, described The identification array that convolutional neural networks array is made up of with cascade structure some independent convolutional neural networks；

S300, using described convolutional neural networks array the first order convolutional neural networks identification described image block group is known Not；

If S400 recognition result is TV station symbol, export the corresponding television channel of described TV station symbol；

If S500 recognition result is other, using next stage convolutional neural networks, described image block group is identified, repeats Step S300-S500 is up to afterbody convolutional neural networks；

If S600 afterbody convolutional neural networks recognition result is other, judge described convolutional neural networks array Frame of video described in None- identified, and export unknown result..

Specifically, in described step S100, described frame of video refers to intelligent television and receives in video flowing Single-frame imagess.Described single-frame imagess are two dimensional image.That is, described frame of video shows for one that intelligent television receives Image.In actual applications, described get frame of video before can also include monitor intelligent television zapping instruction, receiving After zapping instruction, first frame of video will be received as described frame of video.Therefore, described after getting frame of video, from institute State and can include before intercepting image block group in frame of video：

S01, when intelligent television is in open state, the operation of zapping that real-time monitoring users are carried out to intelligent television；

S02, after listening to zapping operation, according to described zapping operational control intelligent television zapping and obtain first receiving Individual frame of video.

In the present embodiment, obtain image block group after getting frame of video in addition it is also necessary to from described frame of video.Described Image block group includes multiple equal-sized image blocks, and described image block obtains from the predeterminated position of frame of video.That is, it is pre- First setting video image obtains the scope of image, and some image blocks corresponding position in described image scope.Described figure As scope can determine according to the size of smart television display, for example, described image scope and described display screen scope phase With.

Some positions are pre-set in the range of described image, when intercepting the image block group in image, described by intercepting The image of some positions is to obtain image block group.The shape size of described some positions is identical, i.e. the shape of the image block of sectional drawing Identical with size.The interception position of described image block may be located at any position of video frame images, for example, a corner, four Individual corner and center etc..In actual applications, in order to fast and accurately obtain the image block of protection station symbol, will be described pre- First some position branches are set to：Four corners and center.

Exemplary, described after getting frame of video, intercept image block group from described frame of video and specifically can include：

S101, the video flowing receiving and parsing through in communications, obtain the video frame images in described video flowing；

S102, described video frame images are carried out with pretreatment, and from some predeterminated positions of pretreatment rear video two field picture respectively Intercept image block to form image block group.

Specifically, described video frame images are carried out with the described image that pretreatment refers to zoom in and out, at denoising and normalization Reason.To intercepting multiple images block in the video frame images of pretreatment, the plurality of image block forms image block group.The plurality of figure As block is the multiple subimages being the same from position intercepting from multiple form and dimensions pre-setting of video frame images.Described The parts of images content of video frame images described in image block protection.

In described step S200, described convolutional neural networks array can only comprise convolutional neural networks, also may be used To comprise multiple independent convolutional neural networks.When convolutional neural networks array comprises multiple independent network model, described Connected by cascade system between multiple independent network modeies.That is, described convolutional neural networks array include some For identifying the independent convolutional neural networks of inhomogeneity station symbol collection, it is designated as first order convolutional neural networks, second level convolution respectively Neutral net ...., n-th grade of convolutional neural networks；Connected by cascade system between described some independent convolutional neural networks.

In the present embodiment, the independent network model of described convolutional neural networks array cascade can identify according to it Content and extend.That is, when described convolutional neural networks array None- identified current image block group, stand-alone training one New convolutional neural networks model, and by described new convolutional neural networks Cascade in existing convolutional neural networks array On, as afterbody convolutional neural networks.So can not need to carry out re -training to already present training pattern result To accurately identify fresh target, solve the problems, such as the neutral net only guarantee accuracy rate in the target of finite number simultaneously.

Exemplarily, described default convolutional neural networks array only comprises convolutional neural networks, is designated as first order volume Long-pending neutral net, it is used for identifying first station symbol collection (for example comprising multiple TV station symbol species), is designated as A class TV station symbol mould Type (CNN_A).When assuming to have new station symbol classification collection to need to identify using first order convolutional neural networks, can be to new station symbol The new convolutional neural networks model of classification collection stand-alone training one, is designated as B class TV station symbol (CNN_B)；Then, by new convolution After neural network model CNN_B level is associated in original neural network group CNN_A；Finally, new convolutional neural networks array is (for example Comprise CNN_A and CNN_B) associable for TV station symbol recognition.

In the present embodiment, the front end of described convolutional neural networks and middle-end can be any number of convolutional layers （convolutional layer）, space batch regularization layer（spatial batch normalization layer）, entangle Linear positive elementary layer（rectified linear units layer）, and maximum pond layer（max pooling layer）'s Combination.Rear end is any number of full articulamentum（full connection layer）, batch regularization layer（batch normalization layer）With correction linear unit layer（rectified linear units layer）Composition.

Described convolutional network carries out the study of weight parameter using stochastic gradient descent method, and constructs nerve with described weight Network.Wherein, if initial learn speed 0.0001, every 30 wheel iteration are updated to original 1/2, and carry out at random increasing encoding and decoding Distortion 5%～30%, Random-Rotation -5 spends～5 degree, random 50%～150% size variation disturbance, random left rotation and right rotation, forms number According to collection, using described data set, convolutional network is trained.Certainly, described convolutional network can also be carried out using other modes Training, does not just describe in detail here one by one.

Further, described convolutional neural networks comprise main split's convolutional network and Liang Ge sub-branch convolutional network, Branch is designated as sub-branch convolutional neural networks I and sub-branch convolutional neural networks II.Liang Ge sub-branch convolutional network can be any Bridge at the convolutional layer of master network.Wherein, described main split convolutional neural networks are responsible for identifying station symbol body feature, described two Sub-branch's convolutional neural networks are responsible for identifying numerical characteristic and/or character features, and for example, sub-branch convolutional neural networks I is responsible for Identification numerical characteristic, sub-branch convolutional neural networks II is responsible for recognition character.So pass through to pass through master when carrying out TV station symbol recognition Branch's convolutional network and Liang Ge sub-branch convolutional network identify respectively, finally three web results are comprehensively chosen, output result. So master network and sub-network share the parameter in low-level image feature space, can save size and the complexity of network model.

In the present embodiment, described convolutional neural networks can include：

Ground floor, convolutional layer, convolution kernel is 3x3, and convolution step-length is 2；

The second layer, convolutional layer, convolution kernel is 3x3, and convolution step-length is 1；

Third layer, maximum pond layer, pondization is interval to be 2；

4th layer, convolutional layer, convolution kernel is 3x3, and convolution step-length is 1；

Sub-network structure I：

Layer 5, full articulamentum；

Layer 6, full articulamentum（Sub-network I terminates）.

Layer 5, maximum pond layer, pondization is interval to be 2；

Layer 6, convolutional layer, convolution kernel is 3x3, and convolution step-length is 1；

Sub-network structure II：

Layer 7, full articulamentum（Sub-network II terminates）.

Layer 7, maximum pond layer, pondization is interval to be 2；

8th layer, convolutional layer, convolution kernel is 3x3, and convolution step-length is 1；

9th layer, maximum pond layer, pondization is interval to be 2；

Tenth layer, convolutional layer, convolution kernel is 3x3, and convolution step-length is 1；

Eleventh floor, maximum pond layer, pondization is interval to be 2；

Floor 12, full articulamentum；

13rd layer, full articulamentum（Master network terminates）.

In described convolutional network, there is individual space batch regularization layer after each convolutional layer and correct linear unit Layer.There is individual spatial regularization layer after each same full articulamentum and correct linear unit layer.The image that convolutional layer will input Or characteristics of image is by a series of new characteristics of image of linear transformations output.Space batch regularization can be divided with normalization data Cloth is thus accelerate training process.Correct linear unit the result of input is exported according to the work conversion of approximate people's vision response.Pond Change layer and multiple input numerical value are mapped as an output numerical value.The upper layer network structure of described convolutional neural networks is according to handled Objective attribute target attribute have the parameter of specifically different network structures and feature space it can be ensured that the accuracy of network model.Improve The capacity of network and discrimination.

In described step S300, described using convolutional neural networks array, image block group is identified referring to adopting Convolutional neural networks array is identified to image block group and detects, its identify station symbol when determine that described station symbol is corresponding Channel information, described channel information includes the title of described channel, for example, BTV, Shenzhen physical culture etc..

In order to further illustrate convolutional neural networks array to image recognition process, several specific embodiments are given below It is illustrated.

Embodiment one

Described convolutional neural networks array is the convolutional neural networks group of cascade, and one of neural network group only corresponds to more than one The convolutional neural networks of the multitask of output.The convolutional neural networks group of the first order identifies one group of initial station symbol.Second level god Only identify the new station symbol group newly included through network, by that analogy.Its identification process is specifically as follows：Use the convolution of the first order first Neural network group is identified.If recognition result " other " enters the convolutional neural networks group of next stage, if station symbol is then Export corresponding television channel.When proceeding to the second level, if recognition result " other " then exports " other ", if station symbol is then Export corresponding television channel.When carrying out the identification of every one-level, only one of which convolutional neural networks, recognition result is TV frequency Road title or " other ", this recognition result is directly as the recognition result of this convolutional neural networks group.For example, " Jiangsu physical culture Channel " is one of television channel that first training data is comprised, and " Shenzhen entertainment channel " belongs to what training data was comprised One of television channel.For the test image comprising Jiangsu sports channel station symbol, first entered with the convolutional neural networks group of the first order Row identification.Wherein convolutional neural networks recognition result is " Jiangsu sports channel ", exports " Jiangsu sports channel ".For comprising depth The test image of ditch between fields entertainment channel station symbol, is first identified with the convolutional neural networks group of the first order.Wherein convolutional neural networks Recognition result be " other ".Then it is identified with the convolutional neural networks of the second level.The identification of wherein convolutional neural networks Result is " Shenzhen entertainment channel ".Output " Shenzhen entertainment channel ".

Embodiment two

Convolutional neural networks group using cascade carries out television station channel identification, one-level convolutional neural networks comprise two independent The convolutional neural networks of the single task of multi output.Wherein, first convolutional neural networks is responsible for identifying main station symbol part, second Convolutional neural networks identify sub- station symbol part.One group of station symbol of the initial regulation of convolutional neural networks group identification of the first order.Second Level neural network group only identifies the one group of new station symbol newly included, by that analogy.Concrete identification process can be described as：First with first The convolutional neural networks group of level is identified.If recognition result " other " enters the convolutional neural networks group of next stage, if Then export corresponding television channel for station symbol.When carrying out the second level, if recognition result " other " then exports " other ", if Station symbol then exports corresponding television channel.When being identified with convolutional neural networks group, ground floor convolutional neural networks recognition result For television station's title or " other ", if ground floor recognition result is " other ", the recognition result of this convolutional neural networks group is " its He "；If ground floor recognition result is television station, carry out the identification of second layer convolutional neural networks.Second layer convolutional neural networks Recognition result is channel brief introduction, channel number or " other ".Finally the recognition result of comprehensive two convolutional neural networks obtains Whole television channel name.For example, " Jiangsu sports channel " is one of television channel that first training data is comprised, Shenzhen Recreation table belongs to one of channel that second batch training data is comprised.For the test image comprising Jiangsu satellite TV channel station symbol, First it is identified with the convolutional neural networks group of the first order.Wherein first convolutional neural networks recognition result is " Jiangsu TV Platform ", second convolutional neural networks recognition result is " physical culture ".Comprehensively first order convolutional neural networks group recognition result can be obtained is " Jiangsu sports channel ", then exports recognition result.For the test image comprising Shenzhen entertainment channel station symbol, first use the first order Convolutional neural networks group be identified.The recognition result of wherein first convolutional neural networks is " other ", then with second The convolutional neural networks of level are identified.In the convolutional neural networks kind of the second level, the knowledge of its first convolutional neural networks Other result is " Shenzhen TV Station ", and the recognition result of its second convolutional neural networks is " amusement ".Comprehensively the second level can be obtained The recognition result of convolutional neural networks group is " Shenzhen entertainment channel ".

Embodiment three

Described convolutional neural networks array comprises the convolutional neural networks of the single task of two independent multi output, wherein, first Individual convolutional neural networks are responsible for identifying main station symbol part, and second convolutional neural networks identifies sub- station symbol part.Newly include when having Station logo data when coming in, new and old station symbol is put re -training together.For example, " Jiangsu sports channel " is first training data One of television channel being comprised, Shenzhen recreation table belongs to one of channel that second batch training data is comprised.For " Jiangsu body Ssd channel ", is identified with convolutional neural networks group.Wherein first convolutional neural networks recognition result is " Jiangsu TV Platform ", second convolutional neural networks recognition result is " physical culture ".It is " Jiangsu that convolutional neural networks group recognition result comprehensively can be obtained Sports channel ", then exports recognition result." Shenzhen entertainment channel ", is identified with convolutional neural networks group.Wherein first The recognition result of convolutional neural networks is " Shenzhen TV Station ", and the recognition result of second convolutional neural networks is " amusement ".Comprehensive The recognition result that conjunction can obtain convolutional neural networks group is " Shenzhen entertainment channel ".

Present invention also offers a kind of TV station symbol recognition system, as shown in figure 3, it includes：

A kind of TV station symbol recognition system, it includes：

Interception module 100, for, after getting frame of video, intercepting image block group from described frame of video；

Input module 200, for being input to default convolutional neural networks array input using described image block group as |input paramete Layer, wherein, the identification array that described convolutional neural networks array is made up of with cascade structure some independent convolutional neural networks；

Identification module 300, the first order convolutional neural networks using described convolutional neural networks array identify to described image block Group is identified；

First performing module 400, for being television station's timestamp when recognition result, output described TV station symbol corresponding TV frequency Road；

Second performing module 500, for being other when recognition result, using next stage convolutional neural networks to described image block group It is identified, repeat identification module, the first performing module and the second performing module up to afterbody convolutional Neural net Network；

3rd performing module 600, for when afterbody convolutional neural networks recognition result is other, judging described volume Frame of video described in long-pending neutral net array None- identified, and export unknown result..

In another embodiment of the present invention, as shown in figure 4, described system includes：Intelligent television 1000 server 2000；

Described intelligent television 1000 includes interception module 1001 and sending module 1002；

Described interception module 1001, for, after getting frame of video, intercepting image block group from described frame of video；

Described sending module 1002, for sending described image block group to server；

Described server 2000 includes：Receiver module 2001, input module 2002 and identification module 2003；

Described receiver module 2001, for receiving the image block group of intelligent television transmission；

Described input module 2002, for being input to default convolutional neural networks array using described image block group as |input paramete Input layer；

Described identification module 2003, for being identified to described image block group using described convolutional neural networks array, with The corresponding television channel of TV station symbol comprising to described frame of video.

Described TV station symbol recognition system, wherein, described interception module specifically includes：

Acquiring unit, for receiving and parsing through the video flowing in communications, obtains the video frame images in described video flowing；

Interception unit for carrying out pretreatment and some default from pretreatment rear video two field picture to described video frame images Position intercepts image block respectively to form image block group.

Described TV station symbol recognition system, wherein, described default convolutional neural networks array is by some independent convolutional Neural The identification array that network is formed with cascade structure.

Described TV station symbol recognition system, wherein, described identification module specifically includes：

First recognition unit, for being identified to described image block group using the identification of first order convolutional neural networks；

First output unit, for being television station's timestamp when recognition result, exports the corresponding television channel of described TV station symbol；

Second recognition unit, for when recognition result is for other, being entered to described image block group using next convolutional neural networks Row identification, executes successively to afterbody convolutional neural networks；

Second output unit, for when the recognition result of afterbody convolutional neural networks is other, exporting unknown result.

Described TV station symbol recognition system, wherein, described convolutional neural networks include rolling up for the main split identifying station symbol feature Long-pending neutral net and some sub-branch's convolutional neural networks for identifying numeral/character features；Described sub-branch convolution god Through network bridging in the convolutional layer of main split's convolutional neural networks.

The modules of above-mentioned TV station symbol recognition system are described in detail in the above-mentioned methods, just no longer old one by one here State.

It should be understood that disclosed system and method in embodiment provided by the present invention, can pass through other Mode is realized.For example, device embodiment described above is only schematically, for example, the division of described module, it is only A kind of division of logic function, actual can have other dividing mode when realizing, for example multiple units or assembly can in conjunction with or Person is desirably integrated into another system, or some features can be ignored, or does not execute.Another, shown or discussed is mutual Between coupling or direct-coupling or communication connection can be by some interfaces, the INDIRECT COUPLING of device or unit or communication link Connect, can be electrical, mechanical or other forms.

The described unit illustrating as separating component can be or may not be physically separate, show as unit The part showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.The mesh to realize this embodiment scheme for some or all of unit therein can be selected according to the actual needs 's.

In addition, can be integrated in a processing unit in each functional unit in each embodiment of the present invention it is also possible to It is that unit is individually physically present it is also possible to two or more units are integrated in a unit.Above-mentioned integrated list Unit both can be to be realized in the form of hardware, it would however also be possible to employ the form that hardware adds SFU software functional unit is realized.

The above-mentioned integrated unit realized in the form of SFU software functional unit, can be stored in an embodied on computer readable and deposit In storage media.Above-mentioned SFU software functional unit is stored in a storage medium, including some instructions with so that a computer Equipment (can be personal computer, server, or network equipment etc.) or processor (processor) execution the present invention each The part steps of embodiment methods described.And aforesaid storage medium includes：USB flash disk, portable hard drive, read only memory (Read- Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disc or CD etc. various Can be with the medium of store program codes.

Finally it should be noted that：Above example only in order to technical scheme to be described, is not intended to limit；Although With reference to the foregoing embodiments the present invention is described in detail, it will be understood by those within the art that：It still may be used To modify to the technical scheme described in foregoing embodiments, or equivalent is carried out to wherein some technical characteristics； And these modification or replace, do not make appropriate technical solution essence depart from various embodiments of the present invention technical scheme spirit and Scope.

Claims

1. a kind of TV station symbol recognition method is it is characterised in that it includes：

2. according to claim 1 TV station symbol recognition method it is characterised in that described after getting frame of video, regard from described Intercept image block group in frequency frame to specifically include：

3. according to claim 1 TV station symbol recognition method it is characterised in that described convolutional neural networks array is by some independences Convolutional neural networks be specially with the identification array that cascade structure forms：

If according to claim 1 TV station symbol recognition method it is characterised in that described afterbody convolutional neural networks identification Result is other, then judge frame of video described in described convolutional neural networks array None- identified, and exports after unknown result also Can include：

5. according to the arbitrary described TV station symbol recognition method of claim 1-4 it is characterised in that described convolutional neural networks include for The main convolutional neural networks of identification station symbol feature and some branch's convolutional neural networks for identifying numeral/character features； Described branch convolutional neural networks bridge at the convolutional layer of main convolutional neural networks.

6. according to claim 5 TV station symbol recognition method it is characterised in that the front end of described convolutional neural networks and middle-end by Some convolutional layers, space batch regularization layer, correct linear unit layer and maximum pond layer composition；Its rear end is by some full connections Layer, batch regularization layer and correction linear unit layer composition.

7. according to claim 6 TV station symbol recognition method it is characterised in that described convolutional neural networks each convolutional layer it All include space batch regularization layer afterwards and correct linear unit layer.

8. a kind of TV station symbol recognition system is it is characterised in that it includes：

3rd performing module, for when afterbody convolutional neural networks recognition result is other, judging described convolutional Neural Frame of video described in network array None- identified, and export unknown result.

9. according to claim 8 TV station symbol recognition system it is characterised in that described convolutional neural networks are included for identifying platform The main convolutional neural networks of mark feature and some branch's convolutional neural networks for identifying numeral/character features；Described point Prop up the convolutional layer that convolutional neural networks bridge at main convolutional neural networks.

10. according to claim 9 TV station symbol recognition system it is characterised in that the front end of described convolutional neural networks and middle-end By some convolutional layers, space batch regularization layer, correct linear unit layer and maximum pond layer composition；Its rear end is connected entirely by some Connect layer, batch regularization layer and correction linear unit layer composition.