CN105844251A - Cartoon video identification method and device - Google Patents
Cartoon video identification method and device Download PDFInfo
- Publication number
- CN105844251A CN105844251A CN201610201081.0A CN201610201081A CN105844251A CN 105844251 A CN105844251 A CN 105844251A CN 201610201081 A CN201610201081 A CN 201610201081A CN 105844251 A CN105844251 A CN 105844251A
- Authority
- CN
- China
- Prior art keywords
- interval
- video
- key frame
- cartoon
- pixel
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 238000004422 calculation algorithm Methods 0.000 claims abstract description 25
- 239000011159 matrix material Substances 0.000 claims description 36
- 238000004364 calculation method Methods 0.000 claims description 19
- 238000000605 extraction Methods 0.000 claims description 14
- 238000013139 quantization Methods 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000010422 painting Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012706 support-vector machine Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
Abstract
The invention discloses a cartoon video identification method and device, belonging to the mode identification technical field. The method comprises: extracting key frames from a video to be identified, obtaining image characteristics from the key frames, calculating the cartoon image membership of the key frames according to the image characteristics of each key frame, and determining whether the video is a carton video according to the cartoon image membership of all the key frames of the video to be identified. The method and device possess the characteristics of simple algorithm and reasonable image characteristic selection, reach an appropriate balance between identification speed and identification accuracy, and are very suitable for identification scenes of a lot of videos.
Description
Technical field
The present invention relates to mode identification technology, particularly relate to a kind of cartoon video recognition methods and dress
Put.
Background technology
At present, along with network technology and the high speed development of multimedia technology, various multimedia messages are the hugest
Rich, greatly enrich daily life.Meanwhile, video as a kind of common multimedia form,
Daily life with the mankind is closely bound up, is also to access more a kind of resources mode on network.
According to the 34th time, China Internet center China Internet network state of development statistical report, cut-off
In by the end of June, 2014 China held up to 4.39 hundred million internet video user, the user of Internet video
Quantity has accounted for the 1/3 of Chinese population sum.Along with increasing of number of users, user is for Online Video
Demand is the most increasing.In order to sufficiently meet the demand of user, Ge great portal website constantly expands online
Video library, causes sharply increasing of internet video quantity, the number of videos of single portal website to reach
To tens billion of more than.It is reported an investigational data of survey institute ComScore, in October, 2011 is excellent extremely
Net video playback amount reach 4,600,000,000 times, day the amount of uploading be 70,000.
But, owing to kind and the quantity of video grow with each passing day, so how the video of these magnanimity is entered
Row taxonomic revision, enables one to be quickly found out oneself content interested according to classification and becomes for one
Individual important problem.To this end, the automatic analysis system of video just arises at the historic moment, the inspection of automatically classifying of video
Method of determining and calculating also becomes a study hotspot of area of pattern recognition.
It will be seen that common video type mainly has film, electricity from the visual classification label of happy view
Depending on play, physical culture, animation etc. classification, wherein animation i.e. cartoon video.At these video genre
In, cartoon video is a kind of special classification, and it is different from " authenticity " of other videos, but passes through
The video that craft or computer aided painting are made.At present, animation industry has become a weight the most
The cultural industry wanted, therefore the proportion shared by animation video also will be the hugest.Therefore, cartoon video is known
The most just become an important research direction in visual classification field.
One important feature of cartoon video is, cartoon video has more significantly edge feature, with
Time, the color of cartoon video is the abundantest.Based on these features, more existing cartoons in prior art
Video frequency identifying method, it is right such as to be come by the statistics color of video, texture, shape, motion etc. feature
Video genre is identified, and wherein " identifies " and uses grader that precondition is good to a certain the most exactly
Organize concrete characteristics of image to classify.But, owing to the extraction of characteristics of image can not be divided comprehensively
Class device there is also inevitable deviation, and therefore recognition result exists inaccuracy definitely.
Summary of the invention
In view of this, it is an object of the invention to propose a kind of cartoon video recognition methods and device, it is possible to
Improve the accuracy rate of cartoon video identification further.
Based on above-mentioned purpose, the technical scheme that the present invention provides is:
A kind of cartoon video recognition methods, the method comprises:
Key frame is extracted from video to be identified;
Characteristics of image is obtained from key frame;
Use the first sorting algorithm according to the cartoon image of this key frame of box counting algorithm of each key frame
Degree of membership;
The span of cartoon image degree of membership is divided at least three interval, and adds up all key frames
The interval distribution situation of cartoon image degree of membership;
The second sorting algorithm is used to judge whether video to be identified is card intervisibility according to described interval distribution situation
Frequently.
Specifically, characteristics of image can comprise color histogram, edge histogram, high luminance pixels ratio,
Edge pixel ratio and color moment information, wherein color moment information is calculated by color histogram;Edge
Histogram is drawn by gradient direction and the gradient magnitude of statistical pixel point;High luminance pixels ratio refers to HSV
V (Value, lightness) parameter in (Hue-Saturation-Value, tone-saturation degree-lightness) space
Ratio more than the pixel of threshold X;Edge pixel ratio refers to that gradient magnitude is more than the pixel of threshold value Y
The ratio of point.
Specifically, the calculation that edge histogram represents in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude Nogata of m row 1 row
Figure matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of above-mentioned gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents jth
The pixel contribution to each gradient direction interval;
Above-mentioned QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;The jth pixel contribution v to quantized interval SSWith the contribution v to quantized interval TTCount as the following formula
Calculate:
In formula, γSTRepresent what the direction that characterized, midpoint of quantized interval S and the midpoint of quantized interval T were characterized
The angle in direction, θSRepresent what the gradient direction of jth pixel and the midpoint of quantized interval S were characterized
The angle in direction, θTRepresent what the gradient direction of jth pixel and the midpoint of quantized interval T were characterized
The angle in direction.
Specifically, color histogram carries out equal interval quantizing to color in HSV space and draws, color
Square information is made up of the front third moment of color histogram, i.e. first moment (mean value Mean), second moment
(variance Variance) and third moment (degree of bias Skewness).
Specifically, gradient direction interval division in the range of 0~180 degree with it at 180~360 degree of models
Enclose interior interval division specular.
Specifically, key frame is both from effective section of video to be identified, and effective section is to be identified regarding
Frequency removes the remainder behind the beginning part and ending, and the duration of effective section is at least whole waiting to be known
The 50% of other video duration, and the duration of the beginning part and ending all at least whole video to be identified
The 8% of duration.Advantage of this is that: no matter general cartoon video is also that non-cartoon video all can have head
And run-out, and head and run-out be probably captions, this can affect the algorithm recognition result to video, therefore
Preferably the beginning and end of video is removed.
Specifically, characteristics of image is all from the effective coverage of key frame, and the area of effective coverage is at least
For the 25% of whole key frame area, and effective coverage is positioned at the desirable region of key frame;Desirable region
For having the similar figures of the key frame of common geometric center with key frame, and the area in desirable region is crucial
The 64% of frame area.Advantage of this is that: may regard no matter cartoon video is also non-cartoon video
The marginal portion of frequency has black surround and/or captions, in order to avoid black surround and the interference of captions, best during identification
The geometry mid portion of selecting video key frame.
A kind of cartoon video identification device, comprises:
Key-frame extraction module: for extracting key frame from video to be identified;
Image characteristics extraction module: for obtaining characteristics of image from key frame;
First sort module: for using the first sorting algorithm according to the box counting algorithm of each key frame
The cartoon image degree of membership of this key frame;
Degree of membership distribution statistics module: for the span of cartoon image degree of membership is divided at least three
Interval, and add up the interval distribution situation of the cartoon image degree of membership of all key frames;
Second sort module: for using the second sorting algorithm to judge to wait to know according to described interval distribution situation
Whether other video is cartoon video.
Specifically, characteristics of image can comprise color histogram, edge histogram, high luminance pixels ratio,
Edge pixel ratio, and the color moment information obtained by color histogram;Edge histogram is by statistics
Gradient direction and the gradient magnitude of pixel draw;High luminance pixels ratio refers to V parameter in HSV space
Ratio more than the pixel of threshold X;Edge pixel ratio refers to that gradient magnitude is more than the pixel of threshold value Y
The ratio of point.
Specifically, the calculation that edge histogram represents in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude Nogata of m row 1 row
Figure matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of above-mentioned gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents jth
The pixel contribution to each gradient direction interval;
Above-mentioned QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;The jth pixel contribution v to quantized interval SSWith the contribution v to quantized interval TTCount as the following formula
Calculate:
In formula, γSTRepresent the direction that characterized, midpoint of quantized interval S and the midpoint institute table of quantized interval T
The angle in the direction levied, θSRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval S
The angle in the direction levied, θTRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval T
The angle in the direction levied.
Specifically, color histogram carries out equal interval quantizing to color in HSV space and draws, color
Square information is made up of the front third moment of color histogram, i.e. first moment (mean value Mean), second moment
(variance Variance) and third moment (degree of bias Skewness).
Specifically, gradient direction interval division in the range of 0~180 degree with it at 180~360 degree of models
Enclose interior interval division specular.
Specifically, key frame is both from effective section of video to be identified, and effective section is to be identified regarding
Frequency removes the remainder behind the beginning part and ending, and the duration of effective section is at least whole waiting to be known
The 50% of other video duration, and the duration of the beginning part and ending all at least whole video to be identified
The 8% of duration.Advantage of this is that: no matter general cartoon video is also that non-cartoon video all can have head
And run-out, and head and run-out be probably captions, this can affect the algorithm recognition result to video, therefore
Preferably the beginning and end of video is removed.
Specifically, characteristics of image is all from the effective coverage of key frame, and the area of effective coverage is at least
For the 25% of whole key frame area, and effective coverage is positioned at the desirable region of key frame;Desirable region
For having the similar figures of the key frame of common geometric center with key frame, and the area in desirable region is crucial
The 64% of frame area.Advantage of this is that: may regard no matter cartoon video is also non-cartoon video
The marginal portion of frequency has black surround and/or captions, in order to avoid black surround and the interference of captions, best during identification
The geometry mid portion of selecting video key frame.
From the above it can be seen that the beneficial effects of the present invention is:
Cartoon image degree of membership has been divided into multiple interval, and the cartoon figure to all key frames by the present invention
As the distribution situation of degree of membership is added up, then statistics is inputted the second grader and has carried out again
Subseries, thus improve the judgment accuracy of cartoon video, still maintain relatively low algorithm multiple simultaneously
Miscellaneous degree, is a kind of important improvement to prior art.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality
Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that below,
Accompanying drawing in description is only some embodiments of the present invention, for those of ordinary skill in the art,
On the premise of not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is a kind of flow chart of the inventive method embodiment;
Fig. 2 is a kind of structural representation of apparatus of the present invention embodiment;
Fig. 3 is a kind of angular interval dividing mode schematic diagram in the embodiment of the present invention.
Detailed description of the invention
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with being embodied as
Example, and referring to the drawings, the present invention is described in more detail.
It should be noted that the statement of all uses " first " and " second " is equal in the embodiment of the present invention
It is for the parameter of the entity or non-equal distinguishing two same names non-equal, it is seen that " first " " the
Two " only for the convenience of statement, should not be construed as the restriction to the embodiment of the present invention, subsequent embodiment pair
This illustrates the most one by one.
A kind of cartoon video recognition methods, the method comprises:
Extracting key frame from video to be identified, key frame can use ffmpeg etc. software to extract;
Characteristics of image is obtained from key frame;
Use the first sorting algorithm according to the cartoon image of this key frame of box counting algorithm of each key frame
Degree of membership;
The span of cartoon image degree of membership is divided at least three interval, and adds up all key frames
The interval distribution situation of cartoon image degree of membership;
The second sorting algorithm is used to judge whether video to be identified is card intervisibility according to described interval distribution situation
Frequently.
Specifically, characteristics of image comprises color histogram, edge histogram, high luminance pixels ratio, edge
Pixel ratio, and the color moment information obtained by color histogram;Edge histogram passes through statistical pixel
Gradient direction and the gradient magnitude of point draw, specifically can use Sobel operator;High luminance pixels ratio
Refer to V in HSV (Hue-Saturation-Value, tone-saturation degree-lightness) space (Value, bright
Degree) parameter is more than the ratio of pixel of threshold X;Edge pixel ratio refers to that gradient magnitude is more than threshold value
The ratio of the pixel of Y.
Specifically, the calculation that edge histogram represents in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude Nogata of m row 1 row
Figure matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of above-mentioned gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents jth
The pixel contribution to each gradient direction interval;
Above-mentioned QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;The jth pixel contribution v to quantized interval SSWith the contribution v to quantized interval TTCount as the following formula
Calculate:
In formula, γSTRepresent what the direction that characterized, midpoint of quantized interval S and the midpoint of quantized interval T were characterized
The minimum positive angle in direction, θSRepresent the gradient direction of jth pixel and the midpoint institute of quantized interval S
The minimum positive angle in the direction characterized, θTRepresent that the gradient direction of jth pixel is with quantized interval T's
The minimum positive angle in the direction that midpoint is characterized.
Such as, being illustrated in figure 3 a kind of angular interval dividing mode, the most every two adjacent solid lines are one
Individual angular interval, two dotted lines in figure represent that the angle of the first angular interval and the second angular interval is put down respectively
Separated time, the chain-dotted line in figure represents the gradient direction of a certain pixel, and it falls in the second angular interval,
Simultaneously also falling in the angular range of two dotted lines, therefore this pixel is to the first angular interval and second jiao
All there is contribution in degree interval, and the contribution of the first angular interval is by itAnd the tribute to the second angular interval
Offer for
Certainly, the statistics about gradient direction also has simpler mode, the gradient of the most a certain pixel
Direction falls in the second angular interval, then this pixel only has contribution to the second angular interval, and to other
Angular interval is all without contribution.Both statistical methods all can be applicable to all detailed description of the invention of the present invention
In.
Specifically, color histogram carries out equal interval quantizing to color in HSV space and draws, color
Square information is made up of the front third moment of color histogram, i.e. first moment (mean value Mean), second moment
(variance Variance) and third moment (degree of bias Skewness), its calculation is respectively as follows:
First moment:
Second moment:
Third moment:
Wherein, N represents the sum of pixel, and i represents image channel, and (i.e. H in HSV space leads to
Road, channel S or V passage), pijRepresent key frame jth pixel gray value under passage i.
Specifically, gradient direction interval division in the range of 0~180 degree with it at 180~360 degree of models
Enclose interior interval division specular, such as start even circumferential is divided into 8 intervals from 0 degree.
Specifically, key frame is both from effective section of video to be identified, and effective section is to be identified regarding
Frequency removes the remainder behind the beginning part and ending, and the duration of effective section is at least whole waiting to be known
The 50% of other video duration, and the duration of the beginning part and ending all at least whole video to be identified
The 8% of duration.Advantage of this is that: no matter general cartoon video is also that non-cartoon video all can have head
And run-out, and head and run-out be probably captions, this can affect the algorithm recognition result to video, therefore
Preferably the beginning and end of video is removed.
Specifically, characteristics of image is all from the effective coverage of key frame, and the area of effective coverage is at least
For the 25% of whole key frame area, and effective coverage is positioned at the desirable region of key frame;Desirable region
For having the similar figures of the key frame of common geometric center with key frame, and the area in desirable region is crucial
The 64% of frame area.Advantage of this is that: may regard no matter cartoon video is also non-cartoon video
The marginal portion of frequency has black surround and/or captions, in order to avoid black surround and the interference of captions, best during identification
The geometry mid portion of selecting video key frame.
A kind of cartoon video identification device, it comprises:
Key-frame extraction module: for extracting key frame from video to be identified;
Image characteristics extraction module: for obtaining characteristics of image from key frame;
First sort module: for using the first sorting algorithm according to the box counting algorithm of each key frame
The cartoon image degree of membership of this key frame;
Degree of membership distribution statistics module: for the span of cartoon image degree of membership is divided at least three
Interval, and add up the interval distribution situation of the cartoon image degree of membership of all key frames;
Second sort module: use the second sorting algorithm to judge that video to be identified is according to interval distribution situation
No for cartoon video.
Specifically, characteristics of image comprises color histogram, edge histogram, high luminance pixels ratio, edge
Pixel ratio, and the color moment information obtained by color histogram;Edge histogram passes through statistical pixel
Gradient direction and the gradient magnitude of point draw;High luminance pixels ratio refers to that in HSV space, V parameter is more than
The ratio of the pixel of threshold X;Edge pixel ratio refers to that gradient magnitude is more than the pixel of threshold value Y
Ratio.
Specifically, the calculation that edge histogram represents in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude Nogata of m row 1 row
Figure matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of above-mentioned gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents jth
The pixel contribution to each gradient direction interval;
Above-mentioned QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;The jth pixel contribution v to quantized interval SSWith the contribution v to quantized interval TTCount as the following formula
Calculate:
In formula, γSTRepresent what the direction that characterized, midpoint of quantized interval S and the midpoint of quantized interval T were characterized
The minimum positive angle in direction, θSRepresent the gradient direction of jth pixel and the midpoint institute of quantized interval S
The minimum positive angle in the direction characterized, θTRepresent that the gradient direction of jth pixel is with quantized interval T's
The minimum positive angle in the direction that midpoint is characterized.
Certainly, the statistics about gradient direction also has simpler mode, the gradient of the most a certain pixel
Direction falls in the second angular interval, then this pixel only has contribution to the second angular interval, and to other
Angular interval is all without contribution.Both statistical methods all can be applicable to the concrete of all embodiments of the present invention
In practice.
Specifically, color histogram carries out equal interval quantizing to color in HSV space and draws, color
Square information is made up of the front third moment of color histogram, i.e. first moment (mean value Mean), second moment
(variance Variance) and third moment (degree of bias Skewness).
Specifically, gradient direction interval division in the range of 0~180 degree with it at 180~360 degree of models
Enclose interior interval division specular.
Specifically, key frame is both from effective section of video to be identified, and effective section is to be identified regarding
Frequency removes the remainder behind the beginning part and ending, and the duration of effective section is at least whole waiting to be known
The 50% of other video duration, and the duration of the beginning part and ending all at least whole video to be identified
The 8% of duration.Advantage of this is that: no matter general cartoon video is also that non-cartoon video all can have head
And run-out, and head and run-out be probably captions, this can affect the algorithm recognition result to video, therefore
Preferably the beginning and end of video is removed.
Specifically, characteristics of image is all from the effective coverage of key frame, and the area of effective coverage is at least
For the 25% of whole key frame area, and effective coverage is positioned at the desirable region of key frame;Desirable region
For having the similar figures of the key frame of common geometric center with key frame, and the area in desirable region is crucial
The 64% of frame area.Advantage of this is that: may regard no matter cartoon video is also non-cartoon video
The marginal portion of frequency has black surround and/or captions, in order to avoid black surround and the interference of captions, best during identification
The geometry mid portion of selecting video key frame.
As an embodiment of the inventive method, as it is shown in figure 1, a kind of cartoon video recognition methods,
The method comprises the steps of:
Step 101, intercepts the centre 80% duration part of video to be identified;
Step 102, extracts key frame from intercepting part;
Step 103, intercepts the region of 70% in the middle part of the length and width of key frame;
Step 104, is transformed into HSV space by the image intercepting region;
Step 105, in HSV space, statistics intercepts the color histogram in region, and H parameter is drawn equably
Being divided into 8 intervals, S and V parameter is evenly divided into 6 intervals respectively;Statistics intercepts region simultaneously
Edge histogram, wherein edge histogram is the gradient direction by statistical pixel point and gradient magnitude obtains
Going out, gradient direction and gradient magnitude are all evenly divided into 8 intervals;
Step 106, adds up the high luminance pixels ratio intercepting region according to color histogram and edge histogram
With edge pixel ratio;Wherein high luminance pixels ratio is the ratio of the V parameter pixel more than 0.5, limit
Edge pixel ratio is the ratio that gradient magnitude is more than the pixel of threshold value 0.087;
Step 107, uses SVM (Support Vector Machine, support vector that precondition is crossed
Machine) grader to intercept region classify, calculate intercept region cartoon image degree of membership;
Step 108, is uniformly divided into the span of degree of membership 5 intervals, adds up all key frames
Degree of membership is distributed;
Step 109, inputs another SVM classifier through precondition by degree of membership distribution, thus counts
Calculate the probability that this video is cartoon video, if probability is more than 50%, determine that it is cartoon video;
Step 110, exports result of determination.
As an embodiment of apparatus of the present invention, as in figure 2 it is shown, a kind of device, this cartoon video is known
Other device 2 comprises key-frame extraction module 201, image characteristics extraction module the 202, first sort module
203, degree of membership distribution statistics module 204 and the second sort module 205;Degree of membership distribution statistics module
204 for being divided into 5 intervals by the span of cartoon image degree of membership, and add up cartoon image and be subordinate to
The interval distribution situation of degree, uses the second sorting algorithm to classify further according to interval distribution situation, thus
Judge whether video to be identified is cartoon video.During use, by video input key-frame extraction mould to be identified
Block 201, the key frame extracted is passed to image characteristics extraction module by key-frame extraction module 201
202, image characteristics extraction module 202 extracts characteristics of image from key frame, and its extracting method can use
Any of which mentioned in the inventive method, then image characteristics extraction module 202 is by characteristics of image
Passing to the first sort module 203, the first sort module 203 calculates the cartoon image of each key frame
Degree of membership, and pass the result to degree of membership distribution statistics module 204, degree of membership distribution statistics module 204
Statistics passes to the second sort module 205, and the classified calculating through the second sort module 205 is sentenced
Whether disconnected video to be sorted is cartoon video, finally exports video type.
Being readily apparent that, implementing of apparatus of the present invention both can be a kind of special equipment, it is also possible to be
The equipment installing specific software on the smart machines such as computer, mobile phone, flat board and formed.
It should be noted that scope is made that in described above the parameters of restriction, within the range
Choose any endpoint value or median is all desirable, and the different valued combinations of each parameter are also feasible
's.After recognizing specific embodiment of the invention limited range, those skilled in the art need not
Paying any creative work and each parameter can be carried out concrete value, it is obtained
Application effect is all without departing from the scope described in the present invention, and therefore, in order to save length, inventor is not
Again various possible values and possible combination thereof are enumerated.
The device of above-described embodiment is used for the corresponding method in previous embodiment that realizes, and has corresponding
The beneficial effect of embodiment of the method, does not repeats them here.
Those of ordinary skill in the field are it is understood that the discussion of any of the above embodiment is merely illustrative
, it is not intended that hint the scope of the present disclosure (including claim) is limited to these examples;At this
Under bright thinking, between the technical characteristic in above example or different embodiment, group can also be carried out
Closing, step can realize with random order, and there is the many of the different aspect of the present invention as above
Other change, for they not offers in details simple and clear.
It addition, for simplifying explanation and discussing, and in order to obscure the invention, provided
Accompanying drawing in can illustrate or can not illustrate with integrated circuit (IC) chip and other parts known
Power supply/grounding connection.Furthermore, it is possible to illustrate device in block diagram form, in order to avoid making the present invention be difficult to
Understand, and this have also contemplated that following facts, i.e. the details about the embodiment of these block diagram arrangements is
Depend highly on will implement the present invention platform (that is, these details should be completely in this area skill
In the range of the understanding of art personnel).Elaborating that detail (such as, circuit) is to describe showing of the present invention
In the case of example embodiment, it will be apparent to those skilled in the art that can there is no this
In the case of a little details or these details change in the case of implement the present invention.Therefore,
These descriptions are considered as illustrative and not restrictive.
Although invention has been described to have been incorporated with the specific embodiment of the present invention, but according to front
The description in face, a lot of replacements, amendment and the modification of these embodiments are for those of ordinary skills
Will be apparent from.Such as, other memory architecture (such as, dynamic ram (DRAM)) can
The embodiment discussed with use.
Embodiments of the invention be intended to fall within the broad range of claims all so
Replacement, amendment and modification.Therefore, all within the spirit and principles in the present invention, any province done
Summary, amendment, equivalent, improvement etc., should be included within the scope of the present invention.
Claims (14)
1. a cartoon video recognition methods, it is characterised in that comprise:
Key frame is extracted from video to be identified;
Characteristics of image is obtained from key frame;
Use the first sorting algorithm according to the cartoon image of this key frame of box counting algorithm of each key frame
Degree of membership;
The span of described cartoon image degree of membership is divided at least three interval, and adds up all keys
The interval distribution situation of the cartoon image degree of membership of frame;
The second sorting algorithm is used to judge whether video to be identified is card intervisibility according to described interval distribution situation
Frequently.
Cartoon video recognition methods the most according to claim 1, it is characterised in that: described image is special
Levy and comprise color histogram, edge histogram, high luminance pixels ratio, edge pixel ratio and color moment letter
Breath, wherein color moment information is calculated by described color histogram;Described edge histogram is by statistics
Gradient direction and the gradient magnitude of pixel draw;
Described high luminance pixels ratio refers to that in HSV space, V parameter is more than the ratio of the pixel of threshold X;
Described edge pixel ratio refers to that gradient magnitude is more than the ratio of the pixel of threshold value Y.
Cartoon video recognition methods the most according to claim 2, it is characterised in that: described edge is straight
The calculation just scheming to represent in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude of m row 1 row
Histogram matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of described gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents the
J pixel is to the interval contribution of each gradient direction;
QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;The jth pixel contribution v to quantized interval SSWith the contribution v to quantized interval TTCount as the following formula
Calculate:
In formula, γSTRepresent the direction that characterized, midpoint of quantized interval S and the midpoint institute table of quantized interval T
The angle in the direction levied, θSRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval S
The angle in the direction levied, θTRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval T
The angle in the direction levied.
Cartoon video recognition methods the most according to claim 2, it is characterised in that: described color is straight
Side's figure carries out equal interval quantizing to color in HSV space and draws, described color moment information is by described face
The front third moment composition of Color Histogram.
Cartoon video recognition methods the most according to claim 2, it is characterised in that: described gradient side
To the interval division in the range of 0~180 degree and its interval division mirror image in the range of 180~360 degree
Symmetrical.
Cartoon video recognition methods the most according to claim 1, it is characterised in that: described key frame
Both from effective section of described video to be identified, described effective section is that video to be identified removes beginning
Remainder after part and ending, the duration of described effective section is at least whole video to be identified
The 50% of duration, and the duration of described the beginning part and described ending is all at least whole to be identified regard
Frequently the 8% of duration.
Cartoon video recognition methods the most according to claim 1, it is characterised in that: described image is special
Levying the effective coverage all from key frame, the area of described effective coverage is at least whole crucial pattern
Long-pending 25%, and effective coverage is positioned at the desirable region of key frame;Described desirable region is and key frame
There are the similar figures of the key frame of common geometric center, and the area in desirable region is key frame area
64%.
8. a cartoon video identification device, it is characterised in that comprise:
Key-frame extraction module: for extracting key frame from video to be identified;
Image characteristics extraction module: for obtaining characteristics of image from key frame;
First sort module: for using the first sorting algorithm according to the box counting algorithm of each key frame
The cartoon image degree of membership of this key frame;
Degree of membership distribution statistics module: for the span of described cartoon image degree of membership is divided at least
Three intervals, and add up the interval distribution situation of the cartoon image degree of membership of all key frames;
Second sort module: use the second sorting algorithm to judge that video to be identified is according to described interval distribution situation
No for cartoon video.
Cartoon video identification device the most according to claim 8, it is characterised in that: described image is special
Levy and comprise color histogram, edge histogram, high luminance pixels ratio, edge pixel ratio and color moment letter
Breath, wherein color moment information is calculated by color histogram;Described edge histogram passes through statistical pixel
Gradient direction and the gradient magnitude of point draw;Described high luminance pixels ratio refers to that in HSV space, V parameter is big
Ratio in the pixel of threshold X;Described edge pixel ratio refers to that gradient magnitude is more than the picture of threshold value Y
The ratio of vegetarian refreshments.
Cartoon video identification device the most according to claim 9, it is characterised in that: described edge
The calculation that histogram represents in the matrix form is:
Bmn=Lm×Gn,
In formula, BmnRepresent the edge histogram matrix of m row n row, LmRepresent the gradient magnitude of m row 1 row
Histogram matrix, GnRepresent the gradient orientation histogram matrix of 1 row n row;
The calculation of described gradient orientation histogram matrix is:
In formula: N represents by the sum of statistical pixel point;QnjIt is the matrix of 1 row n row, represents the
J pixel is to the interval contribution of each gradient direction;
QnjCalculation be: the gradient direction setting jth pixel falls into quantized interval S, then it is right
Quantized interval S and quantized interval T all has contribution, and to other quantized intervals all without contribution, described amount
Changing interval T is quantization district minimum with the gradient direction angle of jth pixel in addition to quantized interval S
Between;Quantized interval S is contributed vS and the contribution v to quantized interval T by jth pixelTCount as the following formula
Calculate:
In formula, γSTRepresent the direction that characterized, midpoint of quantized interval S and the midpoint institute table of quantized interval T
The angle in the direction levied, θSRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval S
The angle in the direction levied, θTRepresent the gradient direction of jth pixel and the midpoint institute table of quantized interval T
The angle in the direction levied.
11. cartoon video identification devices according to claim 9, it is characterised in that: described color
Histogram carries out equal interval quantizing to color in HSV space and draws, described color moment information is by described
The front third moment composition of color histogram.
12. cartoon video identification devices according to claim 9, it is characterised in that: described gradient
Direction interval division in the range of 0~180 degree and its interval division mirror in the range of 180~360 degree
As symmetrical.
13. cartoon video identification devices according to claim 9, it is characterised in that: described key
Frame is both from effective section of described video to be identified, and described effective section is that video to be identified removes out
Remainder after head part and ending, the duration of described effective section is at least whole to be identified regard
Frequently the 50% of duration, and the duration of described the beginning part and described ending is all at least whole to be identified
The 8% of video duration.
14. cartoon video identification devices according to claim 9, it is characterised in that: described image
Feature is at least whole key frame all from the effective coverage of key frame, the area of described effective coverage
The 25% of area, and effective coverage is positioned at the desirable region of key frame;Described desirable region is and key
Frame has the similar figures of the key frame of common geometric center, and the area in desirable region is key frame area
64%.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610201081.0A CN105844251A (en) | 2016-03-31 | 2016-03-31 | Cartoon video identification method and device |
PCT/CN2016/096153 WO2017166597A1 (en) | 2016-03-31 | 2016-08-22 | Cartoon video recognition method and apparatus, and electronic device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610201081.0A CN105844251A (en) | 2016-03-31 | 2016-03-31 | Cartoon video identification method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105844251A true CN105844251A (en) | 2016-08-10 |
Family
ID=56597759
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610201081.0A Pending CN105844251A (en) | 2016-03-31 | 2016-03-31 | Cartoon video identification method and device |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105844251A (en) |
WO (1) | WO2017166597A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017166597A1 (en) * | 2016-03-31 | 2017-10-05 | 乐视控股(北京)有限公司 | Cartoon video recognition method and apparatus, and electronic device |
CN111797912A (en) * | 2020-06-23 | 2020-10-20 | 山东云缦智能科技有限公司 | System and method for identifying film generation type and construction method of identification model |
CN113222058A (en) * | 2021-05-28 | 2021-08-06 | 新疆爱华盈通信息技术有限公司 | Image classification method and device, electronic equipment and storage medium |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109871827B (en) * | 2019-03-14 | 2022-10-25 | 大连海事大学 | Footprint expression method combining region confidence coefficient and pressure distribution direction intensity |
CN111325181B (en) * | 2020-03-19 | 2023-12-05 | 京东科技信息技术有限公司 | State monitoring method and device, electronic equipment and storage medium |
CN111479130B (en) * | 2020-04-02 | 2023-09-26 | 腾讯科技(深圳)有限公司 | Video positioning method and device, electronic equipment and storage medium |
CN115544473B (en) * | 2022-09-09 | 2023-11-21 | 苏州吉弘能源科技有限公司 | Photovoltaic power station operation and maintenance terminal login control system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6363380B1 (en) * | 1998-01-13 | 2002-03-26 | U.S. Philips Corporation | Multimedia computer system with story segmentation capability and operating program therefor including finite automation video parser |
CN101650728A (en) * | 2009-08-26 | 2010-02-17 | 北京邮电大学 | Video high-level characteristic retrieval system and realization thereof |
CN101977311A (en) * | 2010-11-03 | 2011-02-16 | 上海交通大学 | Multi-characteristic analysis-based CG animation video detecting method |
CN104881675A (en) * | 2015-05-04 | 2015-09-02 | 北京奇艺世纪科技有限公司 | Video scene identification method and apparatus |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU7545700A (en) * | 1999-10-08 | 2001-04-23 | British Telecommunications Public Limited Company | Cartoon recognition |
EP1130546A1 (en) * | 2000-03-02 | 2001-09-05 | BRITISH TELECOMMUNICATIONS public limited company | Cartoon recognition |
US6810144B2 (en) * | 2001-07-20 | 2004-10-26 | Koninklijke Philips Electronics N.V. | Methods of and system for detecting a cartoon in a video data stream |
CN1679027A (en) * | 2002-08-26 | 2005-10-05 | 皇家飞利浦电子股份有限公司 | Unit for and method of detection a content property in a sequence of video images |
CN100541524C (en) * | 2008-04-17 | 2009-09-16 | 上海交通大学 | Content-based method for filtering internet cartoon medium rubbish information |
CN105844251A (en) * | 2016-03-31 | 2016-08-10 | 乐视控股(北京)有限公司 | Cartoon video identification method and device |
-
2016
- 2016-03-31 CN CN201610201081.0A patent/CN105844251A/en active Pending
- 2016-08-22 WO PCT/CN2016/096153 patent/WO2017166597A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6363380B1 (en) * | 1998-01-13 | 2002-03-26 | U.S. Philips Corporation | Multimedia computer system with story segmentation capability and operating program therefor including finite automation video parser |
CN101650728A (en) * | 2009-08-26 | 2010-02-17 | 北京邮电大学 | Video high-level characteristic retrieval system and realization thereof |
CN101977311A (en) * | 2010-11-03 | 2011-02-16 | 上海交通大学 | Multi-characteristic analysis-based CG animation video detecting method |
CN104881675A (en) * | 2015-05-04 | 2015-09-02 | 北京奇艺世纪科技有限公司 | Video scene identification method and apparatus |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017166597A1 (en) * | 2016-03-31 | 2017-10-05 | 乐视控股(北京)有限公司 | Cartoon video recognition method and apparatus, and electronic device |
CN111797912A (en) * | 2020-06-23 | 2020-10-20 | 山东云缦智能科技有限公司 | System and method for identifying film generation type and construction method of identification model |
CN111797912B (en) * | 2020-06-23 | 2023-09-22 | 山东浪潮超高清视频产业有限公司 | System and method for identifying film age type and construction method of identification model |
CN113222058A (en) * | 2021-05-28 | 2021-08-06 | 新疆爱华盈通信息技术有限公司 | Image classification method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
WO2017166597A1 (en) | 2017-10-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105844251A (en) | Cartoon video identification method and device | |
CN105118048B (en) | The recognition methods of reproduction certificate picture and device | |
CN104063712B (en) | A kind of information of vehicles extracting method and system | |
CN102012939B (en) | Method for automatically tagging animation scenes for matching through comprehensively utilizing overall color feature and local invariant features | |
CN104376105B (en) | The Fusion Features system and method for image low-level visual feature and text description information in a kind of Social Media | |
CN104899586B (en) | Method and device is identified to the word content for including in image | |
CN102968637A (en) | Complicated background image and character division method | |
CN104408429A (en) | Method and device for extracting representative frame of video | |
CN108629319B (en) | Image detection method and system | |
CN103605991A (en) | Automatic video advertisement detection method | |
CN107305691A (en) | Foreground segmentation method and device based on images match | |
CN108647595A (en) | Vehicle recognition methods again based on more attribute depth characteristics | |
CN106203454A (en) | The method and device that certificate format is analyzed | |
CN103793717A (en) | Methods for determining image-subject significance and training image-subject significance determining classifier and systems for same | |
CN103092930A (en) | Method of generation of video abstract and device of generation of video abstract | |
CN110827312A (en) | Learning method based on cooperative visual attention neural network | |
CN104050684A (en) | Video moving object classification method and system based on on-line training | |
Sikirić et al. | Classifying traffic scenes using the GIST image descriptor | |
CN103985130A (en) | Image significance analysis method for complex texture images | |
CN106844785A (en) | A kind of CBIR method based on conspicuousness segmentation | |
CN104680189B (en) | Based on the bad image detecting method for improving bag of words | |
CN109213886A (en) | Image search method and system based on image segmentation and Fuzzy Pattern Recognition | |
CN106066887B (en) | A kind of sequence of advertisements image quick-searching and analysis method | |
Ma et al. | Lecture video segmentation and indexing | |
CN101977311B (en) | Multi-characteristic analysis-based CG animation video detecting method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20160810 |