CN101853286A - Intelligent selection method of video thumbnails - Google Patents

Intelligent selection method of video thumbnails Download PDF

Info

Publication number
CN101853286A
CN101853286A CN201010180153A CN201010180153A CN101853286A CN 101853286 A CN101853286 A CN 101853286A CN 201010180153 A CN201010180153 A CN 201010180153A CN 201010180153 A CN201010180153 A CN 201010180153A CN 101853286 A CN101853286 A CN 101853286A
Authority
CN
China
Prior art keywords
score
thumbnail
value
sigma
std
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201010180153A
Other languages
Chinese (zh)
Other versions
CN101853286B (en
Inventor
连惠城
刘子枫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
SHANGHAI TUDOU NETWORK TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI TUDOU NETWORK TECHNOLOGY Co Ltd filed Critical SHANGHAI TUDOU NETWORK TECHNOLOGY Co Ltd
Priority to CN201010180153.0A priority Critical patent/CN101853286B/en
Publication of CN101853286A publication Critical patent/CN101853286A/en
Application granted granted Critical
Publication of CN101853286B publication Critical patent/CN101853286B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses an intelligent selection method of video thumbnails, comprising the following steps: 1) acquiring N numbered thumbnails of a video, wherein N is more than or equal to 2 and less than or equal to 100, and N is a constant; 2) calculating the characteristics of each thumbnail, such as frontal face position and size, profile face position and size, whole body position and size, upper body position and size, lower body position and size, gray level histogram distribution value, gray level histogram variance distribution value, colorful RGB histogram variance distribution value, difference value that a colorful RGB image is changed into a gray level image, the variance value of a gray level image pixel point, the variance value of the colorful RGB image pixel point, the dispersion distance value between the colorful RGB blocks, and the like; 3) calculating the characteristic values of each thumbnail respectively through ambiguity functions; 4) weighting and calculating the characteristic values of each thumbnail to obtain a final fraction; 5) sequencing the N numbered thumbnails according to the fraction of each thumbnail; and 6) selecting the thumbnail with the highest fraction as the thumbnail representing the video.

Description

Intelligent selection method of video thumbnails
Technical field
The present invention relates to the picture choosing method, particularly a kind of intelligent selection method of video thumbnails that is undertaken by Computer Image Processing.
Background technology
In the daily use of video website, it is exactly to extract the representative picture of a two field picture as this video file from video file that an important techniques is arranged, and we claim that this picture is the thumbnail of video file.Our way is to extract eight hypertonic sketch maps from video file at present, and the 4th thumbnail as acquiescence is set.This employing fixed position method is given tacit consent to and the thumbnail that obtains is not the most representative thumbnail in the eight hypertonic sketch maps usually.We have adopted certain methods to address this problem, and can determine the thumbnail of which pictures as acquiescence by self-defining method such as (1) user, and (2) are given the website staff and selected the right of which pictures as the acquiescence thumbnail.Though said method can address this problem, but reality still can't meet the demands, this is by data analysis because of (1), we know the user usually not can or seldom self-defined thumbnail (2) because to upload to the video file quantity of video website every day very numerous, the workload that defines thumbnail by the website staff is very heavy.Based on above-mentioned two reasons, we invent and a kind ofly can select the method for thumbnail of representative thumbnail as acquiescence automatically by the method for Computer Image Processing.
People such as P.Viola and M.Jones " adopting the stacked method for improving of simple feature to carry out quick object detection " (Rapid Object Detection Using a Boosted Cascade of SimpleFeatures.) method that (Proceedings ofComputer Vision and Pattern Recognition, 2001) propose on the computer vision and pattern-recognition international conference of calendar year 2001 detects.
In view of this, those skilled in the art provide a kind of and can select the intelligent selection method of video thumbnails of representative thumbnail as the thumbnail of acquiescence automatically by the Computer Image Processing method at the problems referred to above.
Summary of the invention
The invention provides a kind of intelligent selection method of video thumbnails, overcome the difficulty of prior art, select the purpose of representative thumbnail automatically as the thumbnail of acquiescence to reach by the Computer Image Processing method.
The present invention adopts following technical scheme:
The invention provides a kind of intelligent selection method of video thumbnails, be used for selecting a thumbnail of representing this video, may further comprise the steps from one section video:
(1) obtain the N hypertonic sketch map of one section video, wherein, 2≤N≤100, and N is a constant;
(2) every hypertonic sketch map is calculated the position and the size of its front face, the position and the size of people from side face, the position of whole body human body and size, the position and the size of the position of upper body and the size and the lower part of the body, grey level histogram distribution value, grey level histogram variance distribution value, colored RGB histogram variance distribution value, colored RGB image becomes the difference value of gray level image, the variance yields of gray level image pixel, the variance yields of colored RGB image slices vegetarian refreshments, discrete distance value between the colored RGB piece or the like feature;
(3) every hypertonic sketch map is tried to achieve the score value of above-mentioned each feature respectively by ambiguity function;
(4) the feature score value to every hypertonic sketch map obtains a final mark by the weighting evaluation;
(5) size according to mark sorts to N hypertonic sketch map;
(6) select for use the highest thumbnail of mark as the thumbnail of representing this video.
Preferably, in the described step (1), adopt the extracted at equal intervals method, the video duration according to obtaining in advance is divided into the N equal portions with it, and each equal portions extracts a frame as thumbnail then, and N=8.
Preferably, may further comprise the steps in the described step (2):
(21) the input thumbnail detects face characteristic, comprises position and the position and the size big or small, people from side face of front face, and calculates fractional value;
(22) input thumbnail, the human body feature comprises position and size, the position of upper body and the position and the size of the size and the lower part of the body of whole body human body, and calculates fractional value;
(23) input thumbnail calculates grey level histogram distribution value;
(24) input thumbnail calculates grey level histogram variance distribution value;
(25) the input thumbnail calculates colored RGB histogram variance distribution value;
(26) the input thumbnail calculates the difference value that colored RGB image becomes gray level image;
(27) input thumbnail, the variance yields of calculating gray level image pixel;
(28) import thumbnail, calculate the variance yields of colored RGB image slices vegetarian refreshments;
(29) the input thumbnail calculates the discrete distance value between the colored RGB piece.
Preferably, in the described step (21):
The score value of front face then
Figure GSA00000126154900021
W wherein iWide for front face, n is people's face number, establishing W is the wide of thumbnail;
The score value of people from side face
Figure GSA00000126154900031
W wherein iWide for side people's face, n is people's face number, establishing W is the wide of thumbnail.
Preferably, establish (x in the described step (22) i, y i, w i, h i), i=1 ..., n, wherein x i, y iRepresentative's face position, w iBehaviour face wide, n is people's face number, establishing W is the wide of thumbnail,
The score value of whole body then
Figure GSA00000126154900032
W wherein iWide for the whole body human body, n is a number, establishing W is the wide of thumbnail;
The score value of upper body
Figure GSA00000126154900033
W wherein iWide for the whole body human upper limb, n is a number, establishing W is the wide of thumbnail;
The score value of the lower part of the body
Figure GSA00000126154900034
W wherein iWide for the whole body human body lower limbs, n is a number, establishing W is the wide of thumbnail.
Preferably, input thumbnail in the described step (23), establishing its grey level histogram is gray_hist=(gh_1, gh_2, .., gh_n), n is histogrammic number, t1, t2 is the integer threshold value, T1 and T2 be histogram and threshold value, base_score is the benchmark mark, then the computing formula of grey level histogram distribution value score_hist_distrib is:
Figure GSA00000126154900035
Wherein, n=10, t1=2, t2=8, T 1=0.7, T 1=0.8, base_score=1.0.
Preferably, input thumbnail in the described step (24), establish its grey level histogram and be gray_hist=(gh_1, gh_2 .., gh_n), n is histogrammic number, Max_num=max (gh_1, gh_2 .., gh_n), base_score is the benchmark mark, the score_gray_hist_std computing formula of then calculating grey level histogram variance distribution value is:
Figure GSA00000126154900041
Wherein, n=64, base_score=1.0.
Preferably, histogram variance distribution value on R plane, G plane, B plane respectively in the described step (25), averaging then obtains colored RGB histogram variance distribution value score_RGB_hist_std;
Figure GSA00000126154900044
score _ RGB _ hist _ distrib = score _ R _ hist _ distrib + score _ G _ hist _ distrib + score _ B _ hist _ distrib 3
Wherein, n=64, base_score=1.0.
Preferably, input thumbnail in the described step (26), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, the pixel value (r of colored RGB image Ij, g Ij, b Ij), i=1 ..., W, j=1 ..., H, wherein W, H is respectively the wide and high of thumbnail, and base_score1 and base_score2 are the benchmark mark, and t1 is a threshold value, establishes then to calculate the computing formula that colored RGB image becomes the difference value score_rgb2gray_diff of gray level image and be:
Figure GSA00000126154900046
Wherein,
Figure GSA00000126154900047
Base_score1=1.0, base_score2=2.0, t 1=-0.9.
Preferably, input thumbnail in the described step (27), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, W wherein, H is respectively the wide and high of thumbnail, and the score_gray_std computing formula of then calculating the variance yields of gray level image pixel is:
score _ gray _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij .
Preferably, in the described step (28) respectively on the R plane, the variance yields of G plane, B plane epigraph pixel, averaging then obtains the variance yields score_RGB_std of colored RGB image slices vegetarian refreshments;
score _ R _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ G _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ B _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ RGB _ std = score _ R _ std + score _ G _ std + score _ B _ std 3 .
Preferably, import thumbnail in the described step (29), establish the pixel value (r of colored RGB image i, g i, b i), i=1 ..., W * H, W wherein, H is respectively the wide and high of thumbnail, and base_score is the benchmark mark, and the computing formula of then calculating the discrete distance value score_rgb_dist between the colored RGB piece is:
Wherein, base_score=0.0.
Preferably, final score value final_score is calculated by following formula in the described step (4):
final_score=w 1×score_face_frontal+w 2×score_face_profile+w 3×score_full_body+w 4×score_upper_body+w 5×score_lower_body+w 6×score_hist_distrib+w 7×score_gray_hist_std+w 8×score_RGB_hist_std+w 9×score_rgb2gray_diff+w 10×score_gray_std+w 11×score_RGB_std;
Wherein, w 1=1.0, w 2=0.5, w 3=0.5, w 4=0.5, w 5=0.5, w 6=0.5, w 7=0.5, w 8=1.2, W 9=1.2, w 10=0.5, w 11=1.2.
Owing to adopted above-mentioned technology, compared with prior art, the present invention has following advantage: the present invention can select the thumbnail of representative thumbnail as acquiescence automatically by the Computer Image Processing method.
Further specify the present invention below in conjunction with drawings and Examples.
Description of drawings
Fig. 1 is the process flow diagram of intelligent selection method of video thumbnails of the present invention.
Embodiment
Introduce a kind of specific embodiment of the present invention below by Fig. 1.
As shown in Figure 1, a kind of intelligent selection method of video thumbnails may further comprise the steps:
(1) obtain the N hypertonic sketch map of a video, the acquisition methods of N hypertonic sketch map is to adopt the extracted at equal intervals method, promptly according to the video duration that obtains in advance, it is divided into the N equal portions, and each equal portions extracts a frame as thumbnail then.Here get N=8, but be not limited to N=8, N can be between 2 to 100.
(2) every hypertonic sketch map is calculated the position and the size of its front face, the position and the size of people from side face, the position of whole body human body and size, the position and the size of the position of upper body and the size and the lower part of the body, grey level histogram distribution value, histogram variance distribution value, colored RGB histogram variance distribution value, colored RGB image becomes the difference value of gray level image, the variance yields of gray level image pixel, the variance yields of colored RGB image slices vegetarian refreshments, discrete distance value between the colored RGB piece or the like feature.
(3) try to achieve the score value of above-mentioned feature respectively by ambiguity function.
(4) obtain a final mark by the weighting evaluation.
(5) size according to mark sorts to N hypertonic sketch map.
(6) thumbnail that score value is the highest is as the thumbnail of acquiescence.
Above-mentioned steps (2), carry out as follows:
(21) the input thumbnail detects face characteristic, includes position and the position and the size big or small, people from side face of front face; Calculate fractional value;
(22) input thumbnail, the human body feature includes the position of whole body human body and size, the position of upper body and the position and the size of the size and the lower part of the body; Calculate fractional value;
(23) input thumbnail calculates grey level histogram distribution value;
(24) input thumbnail calculates grey level histogram variance distribution value;
(25) the input thumbnail calculates colored RGB histogram variance distribution value;
(26) the input thumbnail calculates the difference value that colored RGB image becomes gray level image;
(27) input thumbnail, the variance yields of calculating gray level image pixel;
(28) import thumbnail, calculate the variance yields of colored RGB image slices vegetarian refreshments;
(29) the input thumbnail calculates the discrete distance value between the colored RGB piece.
Embodiments of the present invention are as follows:
Continuation is referring to Fig. 1, and a kind of intelligent selection method of video thumbnails of the present invention is used for selecting a thumbnail of representing this video from one section video, may further comprise the steps:
Step (1) is obtained the N hypertonic sketch map of one section video, wherein, and 2≤N≤100, and N is constant, adopts the extracted at equal intervals method, and the video duration according to obtaining in advance is divided into the N equal portions with it, and each equal portions extracts a frame as thumbnail then, and N=8.
Step (2) is calculated the position and the size of its front face to every hypertonic sketch map, the position and the size of people from side face, the position of whole body human body and size, the position and the size of the position of upper body and the size and the lower part of the body, grey level histogram distribution value, grey level histogram variance distribution value, colored RGB histogram variance distribution value, colored RGB image becomes the difference value of gray level image, the variance yields of gray level image pixel, the variance yields of colored RGB image slices vegetarian refreshments, discrete distance value between the colored RGB piece or the like feature;
Described step may further comprise the steps in (2):
Step (21) input thumbnail detects face characteristic, comprises position and the position and the size big or small, people from side face of front face, and calculates fractional value;
Step (22) input thumbnail, the human body feature comprises position and size, the position of upper body and the position and the size of the size and the lower part of the body of whole body human body, and calculates fractional value;
Step (23) input thumbnail calculates grey level histogram distribution value;
Step (24) input thumbnail calculates grey level histogram variance distribution value;
Step (25) input thumbnail calculates colored RGB histogram variance distribution value;
Step (26) input thumbnail calculates the difference value that colored RGB image becomes gray level image;
Step (27) input thumbnail, the variance yields of calculating gray level image pixel;
Step (28) is imported thumbnail, calculates the variance yields of colored RGB image slices vegetarian refreshments;
Step (29) input thumbnail calculates the discrete distance value between the colored RGB piece.
In the described step (21): the score value of front face then W wherein iWide for front face, n is people's face number, establishing W is the wide of thumbnail; The score value of people from side face
Figure GSA00000126154900081
W wherein iWide for side people's face, n is people's face number, establishing W is the wide of thumbnail.
Described step is established (x in (22) i, y i, w i, h i), i=1 ..., n, wherein x i, y iRepresentative's face position, w iBehaviour face wide, n is people's face number, establishing W is the wide of thumbnail,
The score value of whole body then
Figure GSA00000126154900082
W wherein iWide for the whole body human body, n is a number, establishing W is the wide of thumbnail;
The score value of upper body
W wherein iWide for the whole body human upper limb, n is a number, establishing W is the wide of thumbnail;
The score value of the lower part of the body
W wherein iWide for the whole body human body lower limbs, n is a number, establishing W is the wide of thumbnail.
Input thumbnail in the described step (23), if its grey level histogram is gray_hist=(gh_1, gh_2 .., gh_n), n is histogrammic number, t1, t2 are the integer threshold value, T1 and T2 be histogram and threshold value, base_score is the benchmark mark, and then the computing formula of grey level histogram distribution value score_hist_distrib is:
Figure GSA00000126154900085
Wherein, n=10, t1=2, t2=8, T 1=0.7, T 1=0.8, base_score=1.0.
Input thumbnail in the described step (24), establish its grey level histogram and be gray_hist=(gh_1, gh_2 .., gh_n), n is histogrammic number,
Figure GSA00000126154900086
Max_num=max (gh_1, gh_2 .., gh_n), base_score is the benchmark mark, the score_gray_hist_std computing formula of then calculating grey level histogram variance distribution value is:
Wherein, n=64, base_score=1.0.
Histogram variance distribution value on R plane, G plane, B plane respectively in the described step (25), averaging then obtains colored RGB histogram variance distribution value score_RGB_hist_std;
Figure GSA00000126154900092
Figure GSA00000126154900093
Figure GSA00000126154900094
score _ RGB _ hist _ distrib = score _ R _ hist _ distrib + score _ G _ hist _ distrib + score _ B _ hist _ distrib 3
Wherein, n=64, base_score=1.0.
Input thumbnail in the described step (26), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, the pixel value (r of colored RGB image Ij, g Ij, b Ij), i=1 ..., W, j=1 ..., H, wherein W, H is respectively the wide and high of thumbnail, and base_score1 and base_score2 are the benchmark mark, and t1 is a threshold value, establishes then to calculate the computing formula that colored RGB image becomes the difference value score_rgb2gray_diff of gray level image and be:
Figure GSA00000126154900096
Wherein, Base_score1=1.0, base_score2=2.0, t 1=-0.9.
Input thumbnail in the described step (27), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, W wherein, H is respectively the wide and high of thumbnail, and the score_gray_std computing formula of then calculating the variance yields of gray level image pixel is:
score _ gray _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij .
In the described step (28) respectively on the R plane, the variance yields of G plane, B plane epigraph pixel, averaging then obtains the variance yields score_RGB_std of colored RGB image slices vegetarian refreshments;
score _ R _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ G _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ B _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ RGB _ std = score _ R _ std + score _ G _ std + score _ B _ std 3 .
Import thumbnail in the described step (29), establish the pixel value (r of colored RGB image i, g i, b i), i=1 ..., W * H, W wherein, H is respectively the wide and high of thumbnail, and base_score is the benchmark mark, and the computing formula of then calculating the discrete distance value score_rgb_dist between the colored RGB piece is:
score _ rgb _ dist = - base _ score + Σ i = 1 W × H Σ j = 1 , j ≠ i W × H ( | r i - r j | + | g i - g j | + | b i - b j | ) / ( 3 × W × H × 255 ) , Wherein, base_score=0.0.
Step (3) is tried to achieve the score value of above-mentioned each feature respectively by ambiguity function to every hypertonic sketch map;
Step (4) obtains a final mark to the feature score value of every hypertonic sketch map by the weighting evaluation, and final score value final_score is calculated by following formula:
final_score=w 1×score_face_frontal+w 2×score_face_profile+w 3×score_full_body+w 4×score_upper_body+w 5×score_lower_body+w 6×score_hist_distrib+w 7×score_gray_hist_std+w 8×score_RGB_hist_std+w 9×score_rgb2gray_diff+w 10×score_gray_std+w 11×score_RGB_std;
Wherein, w 1=1.0, w 2=0.5, w 3=0.5, w 4=0.5, w 5=0.5, w 6=0.5, w 7=0.5, w 8=1.2, W 9=1.2, w 10=0.5, w 11=1.2.
Step (5) sorts to N hypertonic sketch map according to the size of mark;
Step (6) selects for use the highest thumbnail of mark as the thumbnail of representing this video.
In summary, owing to adopted above-mentioned technology, the present invention has following advantage: the present invention can select the thumbnail of representative thumbnail as acquiescence automatically by the Computer Image Processing method.
Above-described embodiment only is used to illustrate technological thought of the present invention and characteristics, its purpose is to make those skilled in the art can understand content of the present invention and implements according to this, can not only limit claim of the present invention with present embodiment, be all equal variation or modifications of doing according to disclosed spirit, still drop in the claim of the present invention.

Claims (13)

1. an intelligent selection method of video thumbnails is used for selecting a thumbnail of representing this video from one section video, it is characterized in that: may further comprise the steps:
(1) obtain the N hypertonic sketch map of one section video, wherein, 2≤N≤100, and N is a constant;
(2) every hypertonic sketch map is calculated the position and the size of its front face, the position and the size of people from side face, the position of whole body human body and size, the position and the size of the position of upper body and the size and the lower part of the body, grey level histogram distribution value, grey level histogram variance distribution value, colored RGB histogram variance distribution value, colored RGB image becomes the difference value of gray level image, the variance yields of gray level image pixel, the variance yields of colored RGB image slices vegetarian refreshments, discrete distance value between the colored RGB piece or the like feature;
(3) every hypertonic sketch map is tried to achieve the score value of above-mentioned each feature respectively by ambiguity function;
(4) the feature score value to every hypertonic sketch map obtains a final mark by the weighting evaluation;
(5) size according to mark sorts to N hypertonic sketch map;
(6) select for use the highest thumbnail of mark as the thumbnail of representing this video.
2. intelligent selection method of video thumbnails as claimed in claim 1 is characterized in that: in the described step (1), adopt the extracted at equal intervals method, according to the video duration that obtains in advance, it is divided into the N equal portions, and each equal portions extracts a frame as thumbnail then, and N=8.
3. intelligent selection method of video thumbnails as claimed in claim 2 is characterized in that: described step may further comprise the steps in (2):
(21) the input thumbnail detects face characteristic, comprises position and the position and the size big or small, people from side face of front face, and calculates fractional value;
(22) input thumbnail, the human body feature comprises position and size, the position of upper body and the position and the size of the size and the lower part of the body of whole body human body, and calculates fractional value;
(23) input thumbnail calculates grey level histogram distribution value;
(24) input thumbnail calculates grey level histogram variance distribution value;
(25) the input thumbnail calculates colored RGB histogram variance distribution value;
(26) the input thumbnail calculates the difference value that colored RGB image becomes gray level image;
(27) input thumbnail, the variance yields of calculating gray level image pixel;
(28) import thumbnail, calculate the variance yields of colored RGB image slices vegetarian refreshments;
(29) the input thumbnail calculates the discrete distance value between the colored RGB piece.
4. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: in the described step (21):
The score value of front face then
Figure FSA00000126154800021
W wherein iWide for front face, n is people's face number, establishing W is the wide of thumbnail;
The score value of people from side face
Figure FSA00000126154800022
W wherein iWide for side people's face, n is people's face number, establishing W is the wide of thumbnail.
5. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: described step is established (x in (22) i, y i, w i, h i), i=1 ..., n, wherein x i, y iRepresentative's face position, w iBehaviour face wide, n is people's face number, establishing W is the wide of thumbnail,
The score value of whole body then
Figure FSA00000126154800023
W wherein iWide for the whole body human body, n is a number, establishing W is the wide of thumbnail;
The score value of upper body
Figure FSA00000126154800024
W wherein iWide for the whole body human upper limb, n is a number, establishing W is the wide of thumbnail;
The score value of the lower part of the body
Figure FSA00000126154800025
W wherein iWide for the whole body human body lower limbs, n is a number, establishing W is the wide of thumbnail.
6. intelligent selection method of video thumbnails as claimed in claim 3, it is characterized in that: input thumbnail in the described step (23), establishing its grey level histogram is gray_hist=(gh_1, gh_2, .., gh_n), n is histogrammic number, t1, t2 is the integer threshold value, T1 and T2 be histogram and threshold value, base_score is the benchmark mark, then the computing formula of grey level histogram distribution value score_hist_distrib is:
Figure FSA00000126154800031
Wherein, n=10, t1=2, t2=8, T 1=0.7, T 1=0.8, base_score=1.0.
7. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: input thumbnail in the described step (24), establish its grey level histogram and be gray_hist=(gh_1, gh_2 .., gh_n), n is histogrammic number, Max_num=max (gh_1, gh_2 .., gh_n), base_score is the benchmark mark, the score_gray_hist_std computing formula of then calculating grey level histogram variance distribution value is:
Figure FSA00000126154800033
When max num ≠ 0 wherein, n=64, base_score=1.0.
8. intelligent selection method of video thumbnails as claimed in claim 3, it is characterized in that: histogram variance distribution value on R plane, G plane, B plane respectively in the described step (25), averaging then obtains colored RGB histogram variance distribution value score_RGB_hist_std;
Figure FSA00000126154800034
When max_num ≠ 0;
Figure FSA00000126154800035
When max_num ≠ 0;
Figure FSA00000126154800036
When max_num ≠ 0;
score _ RGB _ hist _ distrib = score _ R _ hist _ distrib + score _ G _ hist _ distrib + score _ B _ hist _ distrib 3 Wherein, n=64, base_score=1.0.
9. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: input thumbnail in the described step (26), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, the pixel value (r of colored RGB image Ij, g Ij, b Ij), i=1 ..., W, j=1 ..., H, wherein W, H is respectively the wide and high of thumbnail, and base_score1 and base_score2 are the benchmark mark, and t1 is a threshold value, establishes then to calculate the computing formula that colored RGB image becomes the difference value score_rgb2gray_diff of gray level image and be:
Figure FSA00000126154800041
Wherein, Base_score1=1.0, base_score2=2.0, t 1=-0.9.
10. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: input thumbnail in the described step (27), the pixel value of establishing gray level image is x Ij, i=1 ..., W, j=1 ..., H, W wherein, H is respectively the wide and high of thumbnail, and the score_gray_std computing formula of then calculating the variance yields of gray level image pixel is:
score _ gray _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij .
11. intelligent selection method of video thumbnails as claimed in claim 3, it is characterized in that: in the described step (28) respectively on the R plane, the variance yields of G plane, B plane epigraph pixel, averaging then obtains the variance yields score_RGB_std of colored RGB image slices vegetarian refreshments;
score _ R _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ G _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ B _ std = - base _ score + Σ i = 1 W Σ j = 1 H ( x ij - x ‾ ) 2 / 255 , Wherein x ‾ = Σ i = 1 W Σ j = 1 H x ij ;
score _ RGB _ std = score _ R _ std + score _ G _ std + score _ B _ std 3 .
12. intelligent selection method of video thumbnails as claimed in claim 3 is characterized in that: import thumbnail in the described step (29), establish the pixel value (r of colored RGB image i, g i, b i), i=1 ..., W * H, W wherein, H is respectively the wide and high of thumbnail, and base_score is the benchmark mark, and the computing formula of then calculating the discrete distance value score_rgb_dist between the colored RGB piece is:
score _ rgb _ dist = - base _ score + Σ i = 1 W × H Σ j = 1 , j ≠ i W × H ( | r i - r j | + | g i - g j | + | b i - b j | ) / ( 3 × W × H × 255 ) , Wherein, base_score=0.0.
13. intelligent selection method of video thumbnails as claimed in claim 2 is characterized in that: score value final_score final in the described step (4) is calculated by following formula:
final_score=w 1×score_face_frontal+w 2×score_face_profile+w 3×score_full_body+w 4×score_upper_body+w 5×score_lower_body+w 6×score_hist_distrib+w 7×score_gray_hist_std+w 8×score_RGB_hist_std+w 9×score_rgb2gray_diff+w 10×score_gray_std+w 11×score_RGB_std;
Wherein, w 1=1.0, w 2=0.5, w 3=0.5, w 4=0.5, w 5=0.5, w 6=0.5, w 7=0.5, w 8=1.2, W 9=1.2, w 10=0.5, w 11=1.2.
CN201010180153.0A 2010-05-20 2010-05-20 Intelligent selection method of video thumbnails Expired - Fee Related CN101853286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010180153.0A CN101853286B (en) 2010-05-20 2010-05-20 Intelligent selection method of video thumbnails

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010180153.0A CN101853286B (en) 2010-05-20 2010-05-20 Intelligent selection method of video thumbnails

Publications (2)

Publication Number Publication Date
CN101853286A true CN101853286A (en) 2010-10-06
CN101853286B CN101853286B (en) 2016-08-10

Family

ID=42804778

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010180153.0A Expired - Fee Related CN101853286B (en) 2010-05-20 2010-05-20 Intelligent selection method of video thumbnails

Country Status (1)

Country Link
CN (1) CN101853286B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063844A (en) * 2013-03-21 2014-09-24 腾讯科技(深圳)有限公司 Reduced graph generation method and system
CN104506968A (en) * 2014-12-31 2015-04-08 北京奇艺世纪科技有限公司 Method and device for determining video abstract figure
CN104581379A (en) * 2014-12-31 2015-04-29 乐视网信息技术(北京)股份有限公司 Video preview image selecting method and device
CN104598921A (en) * 2014-12-31 2015-05-06 乐视网信息技术(北京)股份有限公司 Video preview selecting method and device
CN105704559A (en) * 2016-01-12 2016-06-22 深圳市茁壮网络股份有限公司 Poster generation method and apparatus thereof
CN106101868A (en) * 2016-07-18 2016-11-09 乐视控股(北京)有限公司 Reduced graph generating method and generating means
CN106462744A (en) * 2014-06-12 2017-02-22 微软技术许可有限责任公司 Rule-based video importance analysis
CN108650524A (en) * 2018-05-23 2018-10-12 腾讯科技(深圳)有限公司 Video cover generation method, device, computer equipment and storage medium
CN108733676A (en) * 2017-04-14 2018-11-02 合信息技术(北京)有限公司 The extracting method and device of video thumbnails
CN110147780A (en) * 2019-05-28 2019-08-20 山东大学 The landform recognition methods of real-time field robot and system based on level landform
CN110572723A (en) * 2019-08-30 2019-12-13 华为终端有限公司 Thumbnail generation method and related device
CN112954450A (en) * 2021-02-02 2021-06-11 北京字跳网络技术有限公司 Video processing method and device, electronic equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003333523A (en) * 2002-05-17 2003-11-21 Newsoft Technology Corp Character thumbnail sequence generating system and method thereof
CN1561513A (en) * 2001-10-01 2005-01-05 汤姆森特许公司 System and method for identifying data on a storage medium
US20070294641A1 (en) * 2000-05-31 2007-12-20 Rashkovskiy Oleg B Automatically preparing streaming video programming guides
US20080046406A1 (en) * 2006-08-15 2008-02-21 Microsoft Corporation Audio and video thumbnails
WO2009044351A1 (en) * 2007-10-04 2009-04-09 Koninklijke Philips Electronics N.V. Generation of image data summarizing a sequence of video frames
US20100107126A1 (en) * 2008-10-28 2010-04-29 Hulu Llc Method and apparatus for thumbnail selection and editing
JP4612826B2 (en) * 2004-10-01 2011-01-12 キヤノン株式会社 Storage device, storage method, computer-readable storage medium, and program

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070294641A1 (en) * 2000-05-31 2007-12-20 Rashkovskiy Oleg B Automatically preparing streaming video programming guides
CN1561513A (en) * 2001-10-01 2005-01-05 汤姆森特许公司 System and method for identifying data on a storage medium
JP2003333523A (en) * 2002-05-17 2003-11-21 Newsoft Technology Corp Character thumbnail sequence generating system and method thereof
JP4612826B2 (en) * 2004-10-01 2011-01-12 キヤノン株式会社 Storage device, storage method, computer-readable storage medium, and program
US20080046406A1 (en) * 2006-08-15 2008-02-21 Microsoft Corporation Audio and video thumbnails
WO2009044351A1 (en) * 2007-10-04 2009-04-09 Koninklijke Philips Electronics N.V. Generation of image data summarizing a sequence of video frames
US20100107126A1 (en) * 2008-10-28 2010-04-29 Hulu Llc Method and apparatus for thumbnail selection and editing

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063844A (en) * 2013-03-21 2014-09-24 腾讯科技(深圳)有限公司 Reduced graph generation method and system
CN104063844B (en) * 2013-03-21 2019-01-15 腾讯科技(深圳)有限公司 A kind of reduced graph generating method and system
US10664687B2 (en) 2014-06-12 2020-05-26 Microsoft Technology Licensing, Llc Rule-based video importance analysis
CN106462744A (en) * 2014-06-12 2017-02-22 微软技术许可有限责任公司 Rule-based video importance analysis
CN104506968A (en) * 2014-12-31 2015-04-08 北京奇艺世纪科技有限公司 Method and device for determining video abstract figure
CN104581379A (en) * 2014-12-31 2015-04-29 乐视网信息技术(北京)股份有限公司 Video preview image selecting method and device
CN104598921A (en) * 2014-12-31 2015-05-06 乐视网信息技术(北京)股份有限公司 Video preview selecting method and device
CN104506968B (en) * 2014-12-31 2018-08-07 北京奇艺世纪科技有限公司 A kind of video frequency abstract figure determines method and device
CN105704559A (en) * 2016-01-12 2016-06-22 深圳市茁壮网络股份有限公司 Poster generation method and apparatus thereof
CN106101868A (en) * 2016-07-18 2016-11-09 乐视控股(北京)有限公司 Reduced graph generating method and generating means
CN108733676A (en) * 2017-04-14 2018-11-02 合信息技术(北京)有限公司 The extracting method and device of video thumbnails
CN108650524A (en) * 2018-05-23 2018-10-12 腾讯科技(深圳)有限公司 Video cover generation method, device, computer equipment and storage medium
CN110147780A (en) * 2019-05-28 2019-08-20 山东大学 The landform recognition methods of real-time field robot and system based on level landform
CN110147780B (en) * 2019-05-28 2021-01-01 山东大学 Real-time field robot terrain identification method and system based on hierarchical terrain
CN110572723A (en) * 2019-08-30 2019-12-13 华为终端有限公司 Thumbnail generation method and related device
CN112954450A (en) * 2021-02-02 2021-06-11 北京字跳网络技术有限公司 Video processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN101853286B (en) 2016-08-10

Similar Documents

Publication Publication Date Title
CN101853286A (en) Intelligent selection method of video thumbnails
CN102567731B (en) Extraction method for region of interest
EP3249562A1 (en) Method for obtaining care information, method for sharing care information, and electronic apparatus therefor
CN106446952B (en) A kind of musical score image recognition methods and device
CN104598908B (en) A kind of crops leaf diseases recognition methods
EP2523165B1 (en) Image processing method and image processing device
CN110163198A (en) A kind of Table recognition method for reconstructing, device and storage medium
WO2017084204A1 (en) Method and system for tracking human body skeleton point in two-dimensional video stream
CN102509099B (en) Detection method for image salient region
CN102088597B (en) Method for estimating video visual salience through dynamic and static combination
CN103218619A (en) Image aesthetics evaluating method
CN103248906B (en) Method and system for acquiring depth map of binocular stereo video sequence
CN103996195A (en) Image saliency detection method
CN102881160B (en) Outdoor traffic sign identification method under low-illumination scene
CN109902558A (en) A kind of human health deep learning prediction technique based on CNN-LSTM
CN104951495B (en) Device and method for Management Representative video image
CN103618918A (en) Method and device for controlling display of smart television
CN103632153B (en) Region-based image saliency map extracting method
CN104021544B (en) A kind of greenhouse vegetable disease monitor video extraction method of key frame, that is, extraction system
CN103824284A (en) Key frame extraction method based on visual attention model and system
CN102354388B (en) Method for carrying out adaptive computing on importance weights of low-level features of image
CN106228157A (en) Coloured image word paragraph segmentation based on image recognition technology and recognition methods
CN102779270A (en) Target clothing image extraction method aiming at shopping image search
CN106447654B (en) Quality evaluating method is redirected based on statistics similarity and the image of two-way conspicuousness fidelity
CN103946868A (en) Processing method and system for medical images

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20200518

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Patentee after: Alibaba (China) Co.,Ltd.

Address before: 200003 Shanghai Xuhui District Xietu Road No. 1238 Building No. 6, X2 Creative Park

Patentee before: SHANGHAI TUDOU NETWORK TECHNOLOGY Co.,Ltd.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160810

Termination date: 20200520