CN106162330A - A method for extracting a caption area from a video picture - Google Patents

A method for extracting a caption area from a video picture

Info

Publication number
CN106162330A
Authority
CN
China
Prior art keywords
point
seed
corroded
value
corrosion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610597947.4A
Other languages
Chinese (zh)
Other versions
CN106162330B (en)
Inventor
谢超平
吴春中
罗明利
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Sobey Digital Technology Co Ltd
Original Assignee
Chengdu Sobey Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Sobey Digital Technology Co Ltd filed Critical Chengdu Sobey Digital Technology Co Ltd
Priority to CN201610597947.4A
Publication of CN106162330A
Application granted
Publication of CN106162330B
Legal status: Active (current)
Anticipated expiration

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44: Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44008: Processing of video elementary streams involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43: Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435: Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00: Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40: Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47: End-user applications
    • H04N21/488: Data services, e.g. news ticker
    • H04N21/4884: Data services, e.g. news ticker, for displaying subtitles
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04N: PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00: Details of television systems
    • H04N5/222: Studio circuitry; Studio devices; Studio equipment
    • H04N5/262: Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/278: Subtitling

Abstract

The invention discloses a method for extracting a caption area from a video picture, comprising the following steps. S1: noise removal: use the maximum gradient method to obtain gradient image data according to the texture difference between the captions and the background picture; the caption part yields regular data, while the background picture produces irregular noise data; remove the irregular noise data. S2: locate the caption area using a locking-corrosion approach. S3: determine the news headline. The invention improves the efficiency of news splitting by quickly determining suitable titles for the split news items, and at the same time facilitates the classification and fast retrieval of news videos; it provides a method for extracting a caption area from a video picture according to the characteristics of video captions. The locking-corrosion method used to determine the caption area is convenient and highly reliable.

Description

A method for extracting a caption area from a video picture
Technical field
The present invention relates to a method for extracting a caption area from a video picture.
Background technology
For a television station, its own news material is kept in a complete storage management system, so individual news items in a news video are easy to reuse. News videos from other sources, however, must first be split into separate, individual news items before subsequent use. The captions in a news video, especially the main title, concisely convey the gist of a news item. If the captions in the video picture can be recognized, a suitable title can quickly be assigned to each split news item, which greatly benefits video classification, organization, and fast retrieval.
In order to improve the efficiency of news splitting, quickly determine suitable titles for the split news items, and facilitate the classification and fast retrieval of news videos, a method for extracting a caption area from a video picture according to the characteristics of video captions is provided.
Summary of the invention
The object of the present invention is to overcome the deficiencies of the prior art and provide a method for extracting a caption area from a video picture.
The object of the invention is achieved through the following technical solution: a method for extracting a caption area from a video picture, comprising the following steps:
S1: noise removal: use the maximum gradient method to obtain gradient image data according to the texture difference between the captions and the background picture; the caption part yields regular data, while the background picture produces irregular noise data; remove the irregular noise data;
S2: locate the caption area, including the following sub-steps:
S21: seed placement: in the initial state, set the value of the whole region to background; according to the gradient image data, find a suitable place to plant a seed, ensuring that the seed lies on a character;
S22: use the maximum-value method to compute the value of the point to be judged for corrosion (the G point), including the following sub-steps:
S221: compute, for the original image, the maximum absolute differences in the vertical, horizontal and diagonal directions for the gray channel and the R, G, B channels;
S222: compute the maximum of the four values; this maximum is taken as the value of the G point;
S23: locking corrosion, including the following sub-steps:
S231: traverse, in turn, the points in the four directions (left, right, up and down) around each seed in the seed map, and judge whether the point has been corroded (if it has not, it is a G point to be judged): if the point has been corroded, continue to the next point; if it has not been corroded, proceed to the next step;
S232: compare the value of the point with the preset first threshold nGsensitive: if the value of the point is greater than the threshold, the point cannot be corroded and a layer of lock is added to it; if the value of the point is less than the threshold, proceed to the next step and corrode toward the left;
S233: the G point is a point to the left of the seed awaiting the corrosion judgment, and there are 0 to N corroded points to the right of the seed; corrode toward the left, including the following sub-steps:
S2331: judge the number of corroded points to the right of the seed:
(1) if there is no corroded point to the right of the seed, the G point can be corroded;
(2) if there are 1 to N corroded points to the right of the seed, compute the maximum over the gray and R, G, B channels for the G point and the rightmost corroded point; if this maximum is less than the preset second threshold nSensitive, the G point can be corroded; otherwise it cannot be corroded and a layer of lock is added to it;
S2332: the corroded point becomes a seed; return to step S2331 until no further point can be corroded;
S234: monitor whether the caption edge region meets the corrosion criterion; if it does, end the corrosion; otherwise increase the corrosion second threshold nSensitive and repeat the above steps;
S235: extract the caption data according to the corrosion result, then remove noise data according to character features, and finally obtain an accurate caption position region;
S3: determine the news headline, including the following sub-steps:
S31: screen and filter the identical consecutive caption areas in the video according to the continuity characteristic of captions, and choose the best picture as the picture for caption recognition;
S32: convert the caption picture into text, which is the headline of the news item.
The method for extracting a caption area from a video picture further includes a threshold setting sub-step S0: before starting, the first threshold nGsensitive and the second threshold nSensitive are set respectively.
The value of N is 4.
Step S1 further includes the following sub-step: perform horizontal strip identification on the video picture according to the top-to-bottom regularity of news caption contours, and remove noise according to the characteristics of news captions.
The beneficial effects of the invention are as follows:
The present invention improves the efficiency of news splitting by quickly determining suitable titles for the split news items, and at the same time facilitates the classification and fast retrieval of news videos; it provides a method for extracting a caption area from a video picture according to the characteristics of video captions. The locking-corrosion method used to determine the caption area is convenient and highly reliable.
Brief description of the drawings
Fig. 1 is a flow chart of the method of the invention;
Fig. 2 is a schematic diagram of the vertical, horizontal and diagonal directions in the original image;
Fig. 3 is a flow chart of the locking corrosion.
Detailed description of the invention
The technical solution of the invention is described in further detail below with reference to the accompanying drawings.
First, news videos have the following characteristics:
1) Texture difference between captions and background: to make captions easy to read, there is usually a clear dividing line between the captions and the background, so the texture difference between them is large and can be used to determine the caption contour.
2) Characteristics of news captions:
A) Top-to-bottom contour regularity: in terms of contour, the background picture usually consists of irregular pattern information, whereas news captions form regular top-to-bottom patterns.
B) High contour transition density: the background picture seldom contains many irregular contour transitions, whereas the contour transition density of news captions is very high.
C) Cuttability: because there are gaps between characters, each character in a caption is, in terms of contour, an independent unit of similar shape, so the caption area can be cut apart; the background picture can rarely be cut by contour into units of similar shape.
D) Border characteristic: news captions are usually placed on a dedicated caption base plate whose color is constant or continuously graded but changes significantly at the top/bottom or left/right boundaries; program logo data at the side can be removed according to this base-plate feature.
3) Characteristics of video:
A) Continuity: captions persist for a continuous period of time (a few milliseconds to a few seconds for news-content captions, and up to several minutes or even more than ten minutes for headline captions); while the video picture changes, the captions do not. That is, within a continuous run of frames the captions remain unchanged, exhibiting continuity.
B) Position invariance: captions are edited onto a defined region of the caption base plate, and the position of the base plate is usually fixed, so the position of the caption area is also fixed; the background color of the base plate is constant or a gradient.
Therefore, based on the above characteristics, as shown in Fig. 1, a method for extracting a caption area from a video picture includes the following steps:
S1: noise removal:
(1) Use the maximum gradient method to obtain gradient image data according to the texture difference between the captions and the background picture: the caption part yields regular data, while the background picture produces irregular noise data; remove the irregular noise data.
(2) Perform horizontal strip identification on the video picture according to the top-to-bottom regularity of news caption contours (strip identification: scan the picture strip by strip at a fixed width), and remove noise according to the characteristics of news captions; a short sketch of such a strip scan is given below.
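The strip scan in (2) can be illustrated with a minimal sketch. The following Python fragment is not the patented implementation: it assumes the gradient image from (1) is available as a 2-D numpy array, splits it into horizontal strips of a fixed height, and keeps strips whose gradient energy is both strong and evenly spread across the columns, approximating the "regular, top-to-bottom" caption texture described above. The strip height and both thresholds are illustrative assumptions.

```python
import numpy as np

def candidate_caption_strips(grad, strip_h=24, energy_thresh=20.0, cv_thresh=1.5):
    """Return (top, bottom) row ranges of horizontal strips that look like caption bands.

    grad: 2-D array of maximum-gradient values from step (1).
    strip_h, energy_thresh, cv_thresh: illustrative parameters, not taken from the patent.
    """
    h, _ = grad.shape
    bands = []
    for top in range(0, h - strip_h + 1, strip_h):      # scan strip by strip at a fixed width
        strip = grad[top:top + strip_h, :]
        col_energy = strip.mean(axis=0)                 # average gradient per column
        mean_e = col_energy.mean()
        cv = col_energy.std() / (mean_e + 1e-6)         # how unevenly the energy is spread
        # captions: strong gradients spread regularly across the strip;
        # background noise: weak or very unevenly distributed gradients
        if mean_e > energy_thresh and cv < cv_thresh:
            bands.append((top, top + strip_h))
    return bands
```

The retained bands would then serve as the candidate regions handed to the caption localization of step S2.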
S2: locate the caption area:
A) Locate the regions of the picture that may contain captions according to the characteristics of news captions;
B) Captions are always edited onto a caption base plate; since the base-plate position is invariant and its background color is constant or a gradient, the size of the caption base plate can be determined and the caption area located again;
C) Use the locking-corrosion method to further determine the caption area, including the following sub-steps:
S21: seed placement: in the initial state, set the value of the whole region to background (255); according to the gradient image data, find a suitable place to plant a seed, ensuring that the seed lies on a character; the value of a seed is 0;
S22: use the maximum-value method to compute the value of the point to be judged for corrosion (the G point), including the following sub-steps:
S221: compute, for the original image, the maximum absolute differences in the vertical, horizontal and diagonal directions (as shown in Fig. 2) for the gray channel and the R, G, B channels:
G_gray[x] = Max(abs(a1 - a2), abs(b1 - b2), abs(c1 - c2), abs(d1 - d2));
G_R[x] = Max(abs(a1 - a2), abs(b1 - b2), abs(c1 - c2), abs(d1 - d2));
G_G[x] = Max(abs(a1 - a2), abs(b1 - b2), abs(c1 - c2), abs(d1 - d2));
G_B[x] = Max(abs(a1 - a2), abs(b1 - b2), abs(c1 - c2), abs(d1 - d2));
S222: compute the maximum of the four values; this maximum is taken as the value of the G point (a code sketch of this computation follows):
G[x] = Max(G_gray[x], G_R[x], G_G[x], G_B[x])
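The formulas of S221-S222 translate directly into code. The sketch below assumes that a1/a2, b1/b2, c1/c2 and d1/d2 denote the pairs of opposite neighbours of the point x along the vertical, horizontal and two diagonal directions of Fig. 2 (an assumption, since the figure is not reproduced here), and that the gray channel is the mean of R, G and B; under those assumptions it computes G[x] for every interior pixel of an RGB image.

```python
import numpy as np

def g_values(img_rgb):
    """Maximum-gradient value G[x] for every interior pixel.

    img_rgb: H x W x 3 uint8 array. The neighbour pairing (vertical, horizontal,
    two diagonals) and the gray definition are assumptions about Fig. 2.
    """
    img = img_rgb.astype(np.int32)
    gray = img.mean(axis=2).astype(np.int32)                 # simple gray channel
    g = np.zeros(gray.shape, dtype=np.int32)
    for c in (gray, img[..., 0], img[..., 1], img[..., 2]):  # gray, R, G, B
        diffs = np.stack([
            np.abs(c[:-2, 1:-1] - c[2:, 1:-1]),              # vertical:      abs(a1 - a2)
            np.abs(c[1:-1, :-2] - c[1:-1, 2:]),              # horizontal:    abs(b1 - b2)
            np.abs(c[:-2, :-2] - c[2:, 2:]),                 # diagonal:      abs(c1 - c2)
            np.abs(c[:-2, 2:] - c[2:, :-2]),                 # anti-diagonal: abs(d1 - d2)
        ])
        g_c = diffs.max(axis=0)                              # G_gray[x], G_R[x], G_G[x], G_B[x]
        g[1:-1, 1:-1] = np.maximum(g[1:-1, 1:-1], g_c)       # G[x] = max of the four channel values
    return g
```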
S23: locking corrosion, as shown in Fig. 3 (a simplified code sketch is given after step S235), including the following sub-steps:
S231: traverse, in turn, the points in the four directions (left, right, up and down) around each seed in the seed map, and judge whether the point has been corroded (if it has not, it is a G point to be judged): if the value of the point is 0, it has been corroded; continue to the next point; if the value of the point is not 0, proceed to the next step;
S232: compare the value of the point with the preset first threshold nGsensitive: if the value of the point is greater than the threshold, the point cannot be corroded and a layer of lock is added to it; if the value of the point is less than the threshold, proceed to the next step and corrode toward the left;
S233: the G point is a point to the left of the seed awaiting the corrosion judgment, and there are 0 to 4 corroded points to the right of the seed; corrode toward the left, including the following sub-steps:
S2331: judge the number of corroded points to the right of the seed:
(1) if there is no corroded point to the right of the seed, the G point can be corroded;
(2) if there are 1 to 4 corroded points to the right of the seed, compute the maximum over the gray and R, G, B channels for the G point and the rightmost corroded point; if this maximum is less than the preset second threshold nSensitive, the G point can be corroded; otherwise it cannot be corroded and a layer of lock is added to it;
S2332: the corroded point becomes a seed; return to step S2331 until no further point can be corroded;
S234: monitor whether the caption edge region meets the corrosion criterion; if it does, end the corrosion; otherwise increase the corrosion second threshold nSensitive and repeat the above steps;
S235: extract the caption data according to the corrosion result, then remove noise data according to character features, and finally obtain an accurate caption position region.
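A compact sketch of the locking corrosion of steps S231-S2332 is given below, limited to the leftward case spelled out in S233 (the same rule is applied in the other three directions). The conventions follow the text (255 = background, 0 = seed/corroded); the LOCKED marker, the reading of S2331(2) as a channel-wise difference between the G point and the rightmost corroded point, and the parameter names are illustrative assumptions rather than the patented implementation.

```python
import numpy as np
from collections import deque

BACKGROUND, CORRODED, LOCKED = 255, 0, 1    # seed-map values; LOCKED marks "one layer of lock"

def _channel_diff(img, gray, p, q):
    """Max absolute difference over the gray, R, G, B channels between pixels p and q
    (an assumed reading of step S2331(2))."""
    (y1, x1), (y2, x2) = p, q
    diffs = [abs(int(gray[y1, x1]) - int(gray[y2, x2]))]
    diffs += [abs(int(img[y1, x1, c]) - int(img[y2, x2, c])) for c in range(3)]
    return max(diffs)

def lock_corrode_left(seed_map, g, img, n_gsensitive, n_sensitive, n_max=4):
    """Leftward locking corrosion (steps S231-S2332); the same rule would be
    applied in the other three directions.

    seed_map: H x W array initialised to BACKGROUND with the planted seeds set to CORRODED (S21).
    g:        G-value map from step S22.
    img:      original H x W x 3 image; the gray channel is derived from it.
    """
    gray = img.astype(np.int32).mean(axis=2)
    w = seed_map.shape[1]
    queue = deque(zip(*np.nonzero(seed_map == CORRODED)))    # start from the planted seeds
    while queue:
        y, x = queue.popleft()
        gx = x - 1                                           # the G point: left neighbour of the seed
        if gx < 0 or seed_map[y, gx] != BACKGROUND:
            continue                                         # already corroded or locked (S231)
        if g[y, gx] > n_gsensitive:                          # S232: value above the first threshold
            seed_map[y, gx] = LOCKED                         # cannot corrode: add one layer of lock
            continue
        right = []                                           # S2331: corroded points to the right
        for i in range(1, n_max + 1):
            if x + i < w and seed_map[y, x + i] == CORRODED:
                right.append(x + i)
            else:
                break
        if right and _channel_diff(img, gray, (y, gx), (y, right[-1])) >= n_sensitive:
            seed_map[y, gx] = LOCKED                         # too different from the rightmost point
            continue
        seed_map[y, gx] = CORRODED                           # S2332: the corroded point becomes a seed
        queue.append((y, gx))
    return seed_map
```

Step S234 would wrap this routine in an outer loop that raises nSensitive and re-runs the corrosion until the caption edge region meets the corrosion criterion.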
S3: determine the news headline, including the following sub-steps:
S31: screen and filter the identical consecutive caption areas in the video according to the continuity characteristic of captions, and choose the best picture as the picture for caption recognition;
S32: convert the caption picture into text, which is the headline of the news item.
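Steps S31-S32 can be sketched as follows. The fragment assumes the caption region located in S2 has already been cropped from each frame as a grayscale array; it groups consecutive frames whose crops are nearly identical (the continuity characteristic), picks the sharpest frame of the longest run, and returns that frame for OCR. The threshold, the sharpness measure and the mention of pytesseract as an OCR backend are illustrative assumptions; the patent does not prescribe them.

```python
import numpy as np
# import pytesseract  # optional OCR backend, e.g. pytesseract.image_to_string(best, lang='chi_sim')

def pick_headline_frame(caption_crops, diff_thresh=8.0):
    """Choose the caption crop to hand to OCR.

    caption_crops: list of H x W grayscale arrays, one per consecutive frame,
    all cropped from the caption region located in S2. diff_thresh is an
    illustrative parameter.
    """
    runs, start = [], 0
    for i in range(1, len(caption_crops)):
        mean_diff = np.abs(caption_crops[i].astype(float) -
                           caption_crops[i - 1].astype(float)).mean()
        if mean_diff > diff_thresh:                  # caption changed: close the current run
            runs.append((start, i))
            start = i
    runs.append((start, len(caption_crops)))
    s, e = max(runs, key=lambda r: r[1] - r[0])      # longest run = most persistent caption (S31)

    def sharpness(img):                              # pick the sharpest frame of that run
        return np.abs(np.diff(img.astype(float), axis=1)).mean()

    best = max(caption_crops[s:e], key=sharpness)
    return best                                      # S32: feed this crop to an OCR engine
```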

Claims (4)

1. A method for extracting a caption area from a video picture, characterized in that it comprises the following steps:
S1: noise removal: use the maximum gradient method to obtain gradient image data according to the texture difference between the captions and the background picture; the caption part yields regular data, while the background picture produces irregular noise data; remove the irregular noise data;
S2: locate the caption area, including the following sub-steps:
S21: seed placement: in the initial state, set the value of the whole region to background; according to the gradient image data, find a suitable place to plant a seed, ensuring that the seed lies on a character;
S22: use the maximum-value method to compute the value of the point to be judged for corrosion (the G point), including the following sub-steps:
S221: compute, for the original image, the maximum absolute differences in the vertical, horizontal and diagonal directions for the gray channel and the R, G, B channels;
S222: compute the maximum of the four values; this maximum is taken as the value of the G point;
S23: locking corrosion, including the following sub-steps:
S231: traverse, in turn, the points in the four directions (left, right, up and down) around each seed in the seed map, and judge whether the point has been corroded (if it has not, it is a G point to be judged): if the point has been corroded, continue to the next point; if it has not been corroded, proceed to the next step;
S232: compare the value of the point with the preset first threshold nGsensitive: if the value of the point is greater than the threshold, the point cannot be corroded and a layer of lock is added to it; if the value of the point is less than the threshold, proceed to the next step and corrode toward the left;
S233: the G point is a point to the left of the seed awaiting the corrosion judgment, and there are 0 to N corroded points to the right of the seed; corrode toward the left, including the following sub-steps:
S2331: judge the number of corroded points to the right of the seed:
(1) if there is no corroded point to the right of the seed, the G point can be corroded;
(2) if there are 1 to N corroded points to the right of the seed, compute the maximum over the gray and R, G, B channels for the G point and the rightmost corroded point; if this maximum is less than the preset second threshold nSensitive, the G point can be corroded; otherwise it cannot be corroded and a layer of lock is added to it;
S2332: the corroded point becomes a seed; return to step S2331 until no further point can be corroded;
S234: monitor whether the caption edge region meets the corrosion criterion; if it does, end the corrosion; otherwise increase the corrosion second threshold nSensitive and repeat the above steps;
S235: extract the caption data according to the corrosion result, then remove noise data according to character features, and finally obtain an accurate caption position region;
S3: determine the news headline, including the following sub-steps:
S31: screen and filter the identical consecutive caption areas in the video according to the continuity characteristic of captions, and choose the best picture as the picture for caption recognition;
S32: convert the caption picture into text, which is the headline of the news item.
2. The method for extracting a caption area from a video picture according to claim 1, characterized in that it further includes a threshold setting sub-step S0: before starting, the first threshold nGsensitive and the second threshold nSensitive are set respectively.
3. The method for extracting a caption area from a video picture according to claim 1, characterized in that the value of N is 4.
4. The method for extracting a caption area from a video picture according to claim 1, characterized in that step S1 further includes the following sub-step: perform horizontal strip identification on the video picture according to the top-to-bottom regularity of news caption contours, and remove noise according to the characteristics of news captions.
CN201610597947.4A 2016-07-27 2016-07-27 A method of extracting caption area in video pictures Active CN106162330B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610597947.4A CN106162330B (en) 2016-07-27 2016-07-27 A method of extracting caption area in video pictures

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610597947.4A CN106162330B (en) 2016-07-27 2016-07-27 A method of extracting caption area in video pictures

Publications (2)

Publication Number Publication Date
CN106162330A true CN106162330A (en) 2016-11-23
CN106162330B CN106162330B (en) 2019-04-12

Family

ID=58059925

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610597947.4A Active CN106162330B (en) 2016-07-27 2016-07-27 A method of extracting caption area in video pictures

Country Status (1)

Country Link
CN (1) CN106162330B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070019108A1 (en) * 2005-07-25 2007-01-25 Samsung Electronics Co., Ltd. Broadcast receiving device for displaying closed caption data and method thereof
US20070222895A1 (en) * 2006-03-24 2007-09-27 Toshiba America Information Systems, Inc. Subtitle detection apparatus, subtitle detection method and pull-down signal detection apparatus
CN101115151A (en) * 2007-07-10 2008-01-30 北京大学 Method for extracting video subtitling
CN101453575A (en) * 2007-12-05 2009-06-10 中国科学院计算技术研究所 Video subtitle information extracting method
CN102202224A (en) * 2011-06-22 2011-09-28 清华大学 Caption flutter-free method and apparatus used for plane video stereo transition

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108769776A (en) * 2018-05-31 2018-11-06 北京奇艺世纪科技有限公司 Main title detection method, device and electronic equipment
CN108769776B (en) * 2018-05-31 2021-03-19 北京奇艺世纪科技有限公司 Title subtitle detection method and device and electronic equipment
CN109348289A (en) * 2018-11-15 2019-02-15 北京奇艺世纪科技有限公司 The title extracting method and device of news program
CN111401368A (en) * 2020-03-24 2020-07-10 武汉大学 News video title extraction method based on deep learning
CN111931775A (en) * 2020-09-28 2020-11-13 成都索贝数码科技股份有限公司 Method, system, computer device and storage medium for automatically acquiring news headlines
CN111931775B (en) * 2020-09-28 2021-01-08 成都索贝数码科技股份有限公司 Method, system, computer device and storage medium for automatically acquiring news headlines

Also Published As

Publication number Publication date
CN106162330B (en) 2019-04-12

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant