Video quality mining system and method
Technical field
The present invention relates to a network video system and method, and in particular to a video quality mining system and method for discovering high-quality videos on a UGC video website.
Background art
UGC stands for User Generated Content. Unlike conventional video websites, videos on a UGC video website have the following characteristics:
1) Most videos are short videos.
2) Videos are uploaded by users, so video quality is uneven.
3) The number of videos is large and videos are updated quickly.
These characteristics make it difficult to automatically discover and mine the high-quality videos on the website.
At present, one simple solution for mining video quality on UGC video websites is manual screening. The advantage of this method is that it can ensure the quality of the selected videos to some extent, but its shortcomings are also obvious:
1) It consumes manpower and material resources. Because videos are updated quickly, editors must constantly keep up with current hot topics.
2) It is highly subjective, in that: a. the quality of the selected videos depends on the experience of the editor; b. personal bias is inevitable when selecting videos.
Another method of mining video quality on UGC video websites relies entirely on statistical data such as video play counts and comment counts. Common examples include a ranking list of the most-played videos of the week and a ranking list of the most-commented videos of the day. To some extent these statistics do reflect how popular a video is: videos with higher play counts are generally the videos that are currently popular. However, the following problems exist:
1) Video play counts are affected by distribution and exposure. For example, if a video appears on the Baidu homepage or is forwarded by a celebrity in the community, it is exposed more easily and therefore receives a large number of plays. The quality of such videos is not necessarily good, however; users may have been "tricked" into clicking through to watch the video rather than genuinely liking it.
For example:
A user wants to find a newly released film on the website, but for copyright reasons the community website does not have the full film, only behind-the-scenes clips or trailers. The user is "tricked" into clicking through by a search engine and does not actually like the video.
2) Video quality is difficult to guarantee. Some clickbait videos often receive a considerable number of clicks, but such videos should not be classified as high-quality videos.
Summary of the invention
To overcome the deficiencies of the prior art described above, the object of the present invention is to provide a video quality mining system and method that classify the videos of different video sources separately and merge the results, obtain a weight for each video according to video attributes, and obtain a ranking list candidate set for each category according to the weights, so that high-quality videos can be mined relatively objectively and the quality of the mined videos can be ensured.
To achieve the above and other objects, the present invention proposes a video quality mining system, comprising at least:
a video source module, comprising a plurality of videos from a plurality of different video sources for video mining;
a classification module, which classifies the videos of each video source in the video source module separately according to video attributes and merges the classification results of the different video sources;
a weight calculation module, which, for each classified video, obtains a plurality of weights according to different video attributes and combines the obtained weights into a final weight for each video according to a weight calculation formula;
a ranking list candidate set generation module, which, for the videos in each category, generates a ranking list candidate set for each category according to the video weights.
Further, the system also comprises a storage module, which stores the ranking list candidate set of each category in a database.
Further, the video source module comprises a video source obtained from real-time play count statistics, a video source recommended by human editors, and a video source uploaded by important VCUs.
Further, for the video source obtained from real-time play count statistics, the classification module classifies the videos in the video source according to the time and the play count among the video attributes.
Further, for the video source recommended by human editors, the classification module classifies the videos in the video source according to the video position among the video attributes.
Further, for the video source uploaded by important VCUs, the classification module classifies the videos in the video source according to the upload time among the video attributes.
Further, the weight calculation formula is:
Rank=w1*R1+w2*R2+w3*R3
where Rank is the weight calculated by the weight calculation module; w1, w2 and w3 are empirical parameters; R1 is the weight of the video derived from its counted play count; R2 is the weight of the video calculated from the position at which it appears in a page fragment; and R3 is the weight given to a video uploaded by a VCU according to the importance of that VCU.
Further, the video quality mining system also comprises a reordering module, which, for the videos in the ranking list candidate set of each category, recalculates the weights according to newly obtained attributes of the videos and their primary attributes, reorders according to the recalculated weights, and forms a new ranking list candidate set for each category.
Further, the reordering module comprises a click feedback module, a duplicate removal module, a blacklist filtering module, a total play count acquisition module and a weight recalculation module. The click feedback module mines how users clicked, on the previous day, on the videos in the recommendation positions of the play page. The duplicate removal module determines whether duplicate videos are present. The blacklist filtering module filters out blacklisted videos and videos whose state stored in the video list is illegal. The total play count acquisition module adjusts video weights by combining video novelty with the total play count. The weight recalculation module uses the click feedback weight obtained by the click feedback module, the penalty term calculated by the duplicate removal module and the attribute weight calculated from the primary attributes of the video to obtain a new weight value for each video by means of a recalculation formula, reorders the videos in the ranking list candidate set of each category according to the new weight values, and forms a new ranking list candidate set for each category.
Further, the recalculation formula is:
V=R+v1+v2+v3
where V is the new weight value; R is the weight value obtained by the weight calculation module; v1 is the weight obtained by the click feedback module from the number of times the video was clicked; v2 is the penalty term calculated by the duplicate removal module for a duplicate video; and v3 is the attribute weight of the video calculated from its primary attributes.
To achieve the above and other objects, the present invention also provides a video quality mining method, comprising the following steps:
Step 1: classify the videos of different video sources separately according to video attributes, and merge the classification results of the different video sources;
Step 2: for each classified video, obtain a plurality of weights according to different video attributes, and obtain the final weight of each video according to a weight calculation formula;
Step 3: for the videos in each category, generate a ranking list candidate set for each category according to the video weights.
Further, after step 3, the method also comprises a step of storing the ranking list candidate set of each category in a database.
Further, the video sources comprise a video source obtained from real-time play count statistics, a video source recommended by human editors, and a video source uploaded by important VCUs.
Further, for the video source obtained from real-time play count statistics, the video attributes on which classification is based are time and play count; for the video source recommended by human editors, the video attribute on which classification is based is video position; for the video source uploaded by important VCUs, the video attribute on which classification is based is upload time.
Further, in step 1, the merged classification result comprises fastest-rising videos, popular videos and selected videos.
Further, step 2 also comprises the following steps:
obtaining the weight R1 of a video according to its counted play count;
calculating the weight R2 of the video according to the position at which it appears;
giving the weight R3 to a video uploaded by a VCU according to the importance of that VCU;
obtaining the final weight of each video using the weight calculation formula Rank=w1*R1+w2*R2+w3*R3, where w1, w2 and w3 are empirical parameters.
Further, in step 2, the weight of each video is adjusted according to the online click-through rate.
Further, the method also comprises the following step after step 3:
for the videos in the ranking list candidate set of each category, recalculating the weights according to newly obtained attributes of the videos and their primary attributes, reordering according to the recalculated weights, and forming a new ranking list candidate set for each category.
Further, the reordering step comprises:
obtaining the number of times the video was clicked from the click feedback module, and deriving the click feedback weight v1 of the video;
calculating the penalty term v2 according to the determination result of the duplicate removal module;
calculating the attribute weight v3 of the video according to its primary attributes;
obtaining the new weight value of each video according to the recalculation formula V=R+v1+v2+v3, where R is the weight value obtained in step 2;
reordering the ranking list candidate set of each category according to the recalculated weights to form a new ranking list candidate set for each category.
Further, the primary attributes comprise video title length, upload time, video duration and video clarity.
Compared with the prior art, the video quality mining system and method of the present invention classify the videos of different video sources separately and merge the results, obtain a weight for each video according to video attributes, and obtain a ranking list candidate set for each category according to the weights, so that high-quality videos can be mined relatively objectively and the quality of the mined videos can be ensured.
Brief description of the drawings
Fig. 1 is a system architecture diagram of the video quality mining system of the present invention;
Fig. 2 is a flow chart of the steps of the video quality mining method of the present invention;
Fig. 3, Fig. 4 and Fig. 5 are schematic diagrams of video quality mining results of the preferred embodiment of the present invention.
Detailed description of the embodiments
The embodiments of the present invention are described below by way of specific examples with reference to the accompanying drawings. Those skilled in the art can readily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention may also be implemented or applied through other different specific examples, and the details in this specification may be modified and changed in various ways based on different viewpoints and applications without departing from the spirit of the present invention.
Fig. 1 is a system architecture diagram of the video quality mining system of the present invention. As shown in Fig. 1, the video quality mining system of the present invention, used for mining high-quality videos on a UGC video website, comprises at least: a video source module 101, a classification module 102, a weight calculation module 103, a ranking list candidate set generation module 104 and a storage module 105.
The video source module 101 comprises a plurality of videos from a plurality of different video sources for video mining. In the preferred embodiment of the present invention, the video source module 101 has the following three video sources: 1. the play count of each video is counted in real time, and videos whose play count exceeds a preset value form the video source obtained from real-time statistics; 2. videos recommended by human editors, including page fragments and columns, that is, the videos of each category and each column arranged by the website editors; the quality of this video source is generally high; 3. videos uploaded by important VCUs (Value Creating User/Unit, that is, users or organizations that create value); the quality of videos uploaded by VCUs that have long been recognized by the website and the community is generally guaranteed, and these videos are a source of high-quality videos.
The classification module 102 classifies the videos of each video source in the video source module 101 separately according to video attributes and merges the classification results of the different video sources. Specifically, for the video source obtained from real-time play count statistics, the classification module 102 classifies the videos according to the time and the play count among the video attributes. In the preferred embodiment of the present invention, the videos are divided into three categories: fastest-rising videos, popular videos and selected videos. For example, the classification module 102 classifies videos with an hourly play count > 20 over the last 5 hours as fastest-rising videos, videos with an hourly play count > 50 over the last 24 hours as popular videos, and videos with a daily play count > 200 over the last 10 days as selected videos. For the videos recommended by human editors, the classification module 102 classifies according to the video position among the video attributes; for example, it classifies current page fragment videos as fastest-rising videos, current column videos as popular videos, and history column videos as selected videos. For the videos uploaded by important VCUs, the classification module 102 classifies according to the upload time among the video attributes; for example, it classifies videos uploaded within the last 1 day as fastest-rising videos, videos uploaded within the last 2 days as popular videos, and videos uploaded within the last 10 days as selected videos. The classification results for the three video sources may be as shown in Table 1. The classification module 102 merges the classification results of the different video sources to form the overall classification result for the videos in the video source module 101.
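Table 1
Video source | Fastest-rising videos | Popular videos | Selected videos
Real-time play count statistics | hourly play count > 20 over the last 5 hours | hourly play count > 50 over the last 24 hours | daily play count > 200 over the last 10 days
Human editor recommendation | current page fragment videos | current column videos | history column videos
Important VCU uploads | uploaded within the last 1 day | uploaded within the last 2 days | uploaded within the last 10 days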
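The per-source classification rules described above can be sketched as follows. This is only a minimal illustration, not the implementation of the classification module 102: the names PlayStats, classify_by_play_count, classify_by_position and classify_by_upload_age are hypothetical, the thresholds are the example values of the preferred embodiment, and it is an assumption of this sketch that an uploaded video is placed in the narrowest time window it matches while a video from the play count source may fall into several categories.

```python
from dataclasses import dataclass

# Category labels corresponding to the three categories in the description.
FASTEST_RISING = "fastest_rising"
POPULAR = "popular"
SELECTED = "selected"


@dataclass
class PlayStats:
    """Hypothetical per-video statistics for the real-time play count source."""
    hourly_plays_last_5h: int    # hourly play count over the last 5 hours
    hourly_plays_last_24h: int   # hourly play count over the last 24 hours
    daily_plays_last_10d: int    # daily play count over the last 10 days


def classify_by_play_count(stats):
    """Apply the example thresholds (> 20, > 50, > 200); may return several categories."""
    categories = set()
    if stats.hourly_plays_last_5h > 20:
        categories.add(FASTEST_RISING)
    if stats.hourly_plays_last_24h > 50:
        categories.add(POPULAR)
    if stats.daily_plays_last_10d > 200:
        categories.add(SELECTED)
    return categories


def classify_by_position(position):
    """Map the position of an editor-recommended video to a category (None if unknown)."""
    mapping = {
        "current_page_fragment": FASTEST_RISING,
        "current_column": POPULAR,
        "history_column": SELECTED,
    }
    return mapping.get(position)


def classify_by_upload_age(age_days):
    """Classify a VCU upload by age in days, using the narrowest matching window."""
    if age_days <= 1:
        return FASTEST_RISING
    if age_days <= 2:
        return POPULAR
    if age_days <= 10:
        return SELECTED
    return None
```

For instance, classify_by_position("current_column") yields the popular category, and an upload that is 30 days old is not placed in any category.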
For each classified video, the weight calculation module 103 obtains a plurality of weights according to different video attributes and obtains the final weight of each video according to the weight calculation formula. Specifically, the weight calculation module 103 obtains the weight R1 of a video according to its counted play count, calculates the weight R2 of the video according to the position at which it appears, such as its position in a page fragment, and gives the weight R3 to a video uploaded by a VCU according to the importance of that VCU. It then obtains the final weight of each video by the weight calculation formula Rank=w1*R1+w2*R2+w3*R3, where w1, w2 and w3 are empirical parameters. Preferably, the weight calculation module 103 may also make a suitable adjustment to the final weight of each video according to online click data. For the videos in each category, the ranking list candidate set generation module 104 generates the ranking list candidate set of each category according to the video weights. The storage module 105 stores the ranking list candidate set of each category in a database; in the preferred embodiment of the present invention, the ranking list candidate set of each category may be stored in Redis, an open-source non-relational (NoSQL) key-value database.
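The weight calculation and candidate set generation described above can be illustrated by the following sketch. The function names final_weight and build_candidate_sets, the default values of w1, w2 and w3, and the top_n cutoff are assumptions made for illustration only; the specification states merely that w1, w2 and w3 are empirical parameters and does not say how R1, R2 and R3 are computed, so they are taken here as given inputs.

```python
def final_weight(r1, r2, r3, w1=0.5, w2=0.3, w3=0.2):
    """Rank = w1*R1 + w2*R2 + w3*R3; the default w values are illustrative only."""
    return w1 * r1 + w2 * r2 + w3 * r3


def build_candidate_sets(videos, top_n=100):
    """Group videos by category and sort each group by descending weight.

    `videos` is an iterable of (video_id, category, r1, r2, r3) tuples.
    Returns {category: [(video_id, weight), ...]}, truncated to top_n entries.
    """
    by_category = {}
    for video_id, category, r1, r2, r3 in videos:
        weight = final_weight(r1, r2, r3)
        by_category.setdefault(category, []).append((video_id, weight))
    return {
        category: sorted(entries, key=lambda e: e[1], reverse=True)[:top_n]
        for category, entries in by_category.items()
    }
```

A linear combination keeps each signal's contribution easy to tune: raising w3, for example, favours videos from important VCUs without changing how R1 and R2 are computed.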
Preferably, the video quality mining system of the present invention also comprises a reordering module, which, for the videos in the ranking list candidate set of each category, recalculates the weights according to newly obtained attributes of the videos and their primary attributes, reorders according to the recalculated weights, and forms a new ranking list candidate set for each category. The reordering module comprises a click feedback module 107, a duplicate removal module 108, a blacklist filtering module 109, a total play count acquisition module 110 and a weight recalculation module 111. The click feedback module 107 mines how users clicked, on the previous day, on the videos in the recommendation positions of the play page. Because a click in a recommendation position indicates relatively clearly how interested the user is in a video, videos that are clicked more should obtain a better weight. The duplicate removal module 108 determines whether duplicate videos are present. Duplicate removal is completed in the following two steps: title duplicate removal; and obtaining, through the video interface (video_cluster interface), the cluster to which each video belongs, where, if two videos belong to the same cluster, only the video with the higher weight value is retained. The blacklist filtering module 109 filters out blacklisted videos and videos whose state (status) stored in the video list (the video table in the MySQL database) is illegal. The total play count acquisition module 110 adjusts video weights by combining video novelty with the total play count. For example, for fastest-rising videos, not only the current hourly popularity of a video should be calculated, but its novelty should also be considered: a video that has been hot recently but whose total play count is already very high should be down-weighted, so that more new videos are discovered. The weight recalculation module 111 obtains the click feedback weight v1 of a video from the click feedback module 107, the penalty term v2 calculated by the duplicate removal module 108, and the attribute weight v3 calculated from the primary attributes of the video, such as upload time, video duration and video clarity. It obtains the new weight value of each video according to the recalculation formula V=R+v1+v2+v3 (where R is the weight calculated by the weight calculation module 103), reorders the videos in the ranking list candidate set of each category according to the new weight values, and forms a new ranking list candidate set for each category.
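The cluster-based duplicate removal, blacklist filtering and novelty adjustment described above can be sketched as follows. This is an illustrative sketch under stated assumptions: the cluster_of mapping stands in for a lookup through the video_cluster interface, whose actual signature is not given in the specification, and the cap and factor parameters of novelty_adjust are hypothetical, since the specification does not say how strongly a heavily played video is down-weighted.

```python
def remove_duplicates(candidates, cluster_of):
    """Keep only the highest-weight video of each cluster.

    `candidates` is a list of (video_id, weight). `cluster_of` maps a video id
    to its cluster id (a stand-in for the video_cluster interface lookup).
    A video without a known cluster is treated as a cluster of its own.
    """
    best = {}
    for video_id, weight in candidates:
        cluster = cluster_of.get(video_id, ("solo", video_id))
        if cluster not in best or weight > best[cluster][1]:
            best[cluster] = (video_id, weight)
    return sorted(best.values(), key=lambda e: e[1], reverse=True)


def filter_blacklist(candidates, blacklist, illegal_ids):
    """Drop blacklisted videos and videos whose stored status is illegal."""
    blocked = set(blacklist) | set(illegal_ids)
    return [(vid, w) for vid, w in candidates if vid not in blocked]


def novelty_adjust(weight, total_plays, cap=100_000, factor=0.5):
    """Down-weight a video whose total play count already exceeds `cap`."""
    return weight * factor if total_plays > cap else weight
```

Applying novelty_adjust only to the fastest-rising category would match the example given in the description, where new videos are to be preferred over videos that are hot but already widely played.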
Fig. 2 is a flow chart of the steps of the video quality mining method of the present invention. As shown in Fig. 2, the video quality mining method of the present invention comprises the following steps:
Step 201: classify the videos of different video sources separately according to video attributes, and merge the classification results of the different video sources. In the preferred embodiment of the present invention, there are three video sources: 1. the play count of each video is counted in real time, and videos whose play count exceeds a preset value form the video source obtained from real-time statistics; 2. videos recommended by human editors, including page fragments and columns, that is, the videos of each category and each column arranged by the website editors; the quality of this video source is generally high; 3. videos uploaded by important VCUs (Value Creating User/Unit, that is, users or organizations that create value). Different video sources are classified by different video attributes. For example, for the first video source, the video attributes on which classification is based may be time and play count: videos with an hourly play count > 20 over the last 5 hours are classified as fastest-rising videos, videos with an hourly play count > 50 over the last 24 hours as popular videos, and videos with a daily play count > 200 over the last 10 days as selected videos. For the second video source, the video attribute on which classification is based may be video position: current page fragment videos are classified as fastest-rising videos, current column videos as popular videos, and history column videos as selected videos. For the third video source, the video attribute on which classification is based may be upload time: for example, videos uploaded within the last 1 day are classified as fastest-rising videos, videos uploaded within the last 2 days as popular videos, and videos uploaded within the last 10 days as selected videos.
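The merging of the per-source classification results in step 201 can be illustrated as follows. The function name merge_classifications and the data layout (one dictionary per video source, mapping a category name to a list of video ids) are assumptions of this sketch; the specification does not prescribe how the merged result is represented or how a video reported by several sources is handled, so here it is simply kept once per category.

```python
def merge_classifications(per_source_results):
    """Merge per-source results into one {category: [video_id, ...]} mapping.

    `per_source_results` is a list of dicts, one per video source, each
    mapping a category name to a list of video ids. A video reported by
    several sources appears once per category, in first-seen order.
    """
    merged = {}
    for result in per_source_results:
        for category, video_ids in result.items():
            bucket = merged.setdefault(category, [])
            for video_id in video_ids:
                if video_id not in bucket:
                    bucket.append(video_id)
    return merged
```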
Step 202: for each classified video, obtain a plurality of weights according to different video attributes, and obtain the final weight of each video according to the weight calculation formula. Specifically, step 202 may further comprise the following steps:
obtaining the weight R1 of a video according to its counted play count;
calculating the weight R2 of the video according to the position at which it appears, such as its position in a page fragment;
giving the weight R3 to a video uploaded by a VCU according to the importance of that VCU;
finally, obtaining the final weight of each video using the weight calculation formula Rank=w1*R1+w2*R2+w3*R3, where w1, w2 and w3 are empirical parameters.
Step 202 may of course also make a suitable adjustment to the weight of each video according to the online click-through rate.
Step 203: for the videos in each category, generate the ranking list candidate set of each category according to the video weights.
Step 204: store the ranking list candidate set of each category in a database. In the preferred embodiment of the present invention, the ranking list candidate set of each category may be stored in Redis, an open-source non-relational (NoSQL) key-value database.
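The storage of step 204 can be sketched with one key per category. The following is a minimal illustration only: InMemoryKeyValueStore is a hypothetical in-memory stand-in for a key-value database and is not a Redis client, and the key prefix "rank:" and the JSON encoding are assumptions of this sketch rather than details given in the specification.

```python
import json


class InMemoryKeyValueStore:
    """Minimal stand-in for a key-value database; not a Redis client."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        return self._data.get(key)


def store_candidate_sets(store, candidate_sets, prefix="rank:"):
    """Write each category's ranked list under its own key as a JSON string."""
    for category, entries in candidate_sets.items():
        store.set(prefix + category, json.dumps(entries))


def load_candidate_set(store, category, prefix="rank:"):
    """Read one category's ranked list back; returns [] if the key is absent."""
    raw = store.get(prefix + category)
    return json.loads(raw) if raw is not None else []
```

Keeping one key per category lets a page that displays a single ranking list read only the entries it needs.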
Preferably, after step 203, the video quality mining method of the present invention may also comprise the following step:
for the videos in the ranking list candidate set of each category, recalculating the weights according to newly obtained attributes of the videos and their primary attributes, reordering according to the recalculated weights, and forming a new ranking list candidate set for each category. Specifically, the reordering step comprises the following steps:
obtaining the number of times the video was clicked from the click feedback module, and deriving the click feedback weight v1 of the video;
according to the determination result of the duplicate removal module, if the video is detected to be a duplicate video, calculating the penalty term v2 (v2 is a negative value);
calculating the attribute weight v3 of the video according to its primary attributes, such as video title length, upload time, video duration and video clarity;
obtaining the new weight value of each video according to the recalculation formula V=R+v1+v2+v3 (where R is the weight calculated in step 202);
reordering the ranking list candidate set of each category according to the recalculated weights to form a new ranking list candidate set for each category.
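The reordering steps above can be sketched as follows. The function names recompute_weight and rerank are hypothetical, and the sketch assumes that v1, v2 and v3 have already been computed and are supplied as per-video lookups, because the specification does not give formulas for them; a video with no entry in a lookup contributes 0 for that term.

```python
def recompute_weight(r, v1, v2, v3):
    """V = R + v1 + v2 + v3, where v2 is zero or a negative penalty."""
    return r + v1 + v2 + v3


def rerank(candidates, click_weight, duplicate_penalty, attribute_weight):
    """Recompute each video's weight and re-sort one candidate set.

    `candidates` is a list of (video_id, R). The three dicts map a video id
    to v1, v2 and v3 respectively; a missing entry contributes 0.
    """
    rescored = [
        (
            vid,
            recompute_weight(
                r,
                click_weight.get(vid, 0.0),
                duplicate_penalty.get(vid, 0.0),
                attribute_weight.get(vid, 0.0),
            ),
        )
        for vid, r in candidates
    ]
    return sorted(rescored, key=lambda e: e[1], reverse=True)
```

Because the terms are additive, a sufficiently large negative v2 moves a duplicate video below its higher-weight counterpart without removing it from the candidate set.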
Fig. 3, Fig. 4 and Fig. 5 are schematic diagrams of video quality mining results of the preferred embodiment of the present invention. As can be seen, the present invention makes it possible to mine high-quality videos more objectively and to provide users with high-quality videos.
In summary, the video quality mining system and method of the present invention classify the videos of different video sources separately and merge the results, obtain a weight for each video according to video attributes, and obtain a ranking list candidate set for each category according to the weights, so that high-quality videos can be mined relatively objectively and the quality of the mined videos can be ensured.
The above embodiments merely illustrate the principles and effects of the present invention and are not intended to limit it. Any person skilled in the art may modify and change the above embodiments without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention shall be as set out in the claims.