CN110309397A - Video screening method and system - Google Patents


Info

Publication number
CN110309397A
Authority
CN
China
Prior art keywords
video
screened
metadata
screening
search condition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810227335.5A
Other languages
Chinese (zh)
Inventor
窦阳超
刘利华
孔令志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Digital Video Software Technology Development Co Ltd
Sumavision Technologies Co Ltd
Original Assignee
Beijing Digital Video Software Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Digital Video Software Technology Development Co Ltd filed Critical Beijing Digital Video Software Technology Development Co Ltd
Priority to CN201810227335.5A priority Critical patent/CN110309397A/en
Publication of CN110309397A publication Critical patent/CN110309397A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566URL specific, e.g. using aliases, detecting broken or misspelled links

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a video screening method and system, relating to the technical field of data retrieval. The method comprises: obtaining metadata of videos to be screened according to a preset search condition; determining a target video from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and saving the metadata of the target video for processing by a user. By automatically collecting and screening video data, the video screening method and system provided by the invention effectively reduce the user's workload and make it easier to discover harmful videos in time, thereby reducing labor cost and improving work efficiency. Because collection and screening are not affected by the working state of staff, the stability of the video screening results is also improved.

Description

Video screening method and system
Technical field
The present invention relates to the technical field of data retrieval, and more particularly to a video screening method and system.
Background art
With the development of science and technology, the network has increasingly become an important tool for people to communicate and obtain information, and watching video is the most intuitive and fastest way to obtain information. However, video is currently easy for criminals to exploit to spread illegal, violent, or gory content, or content that infringes national sovereignty or the privacy of organizations or individuals, so harmful videos need to be screened and managed.
Traditional harmful-video screening usually relies on manual work. A large amount of manpower and time is spent manually browsing audio and video content on the major video websites, at scale and over long periods, to screen out harmful information, so that it is discovered and handled as promptly as possible and the harm caused by harmful videos is minimized.
With the rapid development of Internet technology, on the one hand the volume of video uploaded to video websites is growing significantly: on some mainstream video websites, the total length of video uploaded in one minute reaches several hundred hours. On the other hand, network video spreads quickly. Faced with this situation, traditional harmful-video screening takes the following measures: 1. Adding staff, to make up in numbers for the large increase in workload. 2. Working in shifts 24 hours a day; because upload times are not fixed and videos arrive at any hour of the day, this is also necessary.
It can be seen that, because traditional harmful-video screening is done manually, its labor cost is high and its work efficiency is low. Moreover, because the workload of staff is heavy, long periods of high-intensity work easily lead to some harmful videos being missed, so the stability of the video screening results is poor.
Summary of the invention
In view of this, the purpose of the present invention is to provide a video screening method and system that reduce labor cost and improve work efficiency and the stability of the video screening results.
In a first aspect, an embodiment of the present invention provides a video screening method, comprising:
obtaining metadata of videos to be screened according to a preset search condition;
determining a target video from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and
saving the metadata of the target video for processing by a user.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation of the first aspect, wherein obtaining the metadata of the videos to be screened according to the preset search condition comprises:
generating, by a task dispatcher, a uniform resource locator (URL) of a web page crawling task corresponding to the preset search condition, and sending the URL to a web page crawler, wherein the search condition includes a search address, search keywords, and a retrieval time requirement;
sending, by the web page crawler, the URL to a website server to fetch the web page content corresponding to the web page crawling task;
sending, by the task dispatcher, the web page content to the web page parser corresponding to the video website to which the web page content belongs; and
parsing, by the web page parser, the web page content to extract the metadata of the videos to be screened.
With reference to the first possible implementation of the first aspect, an embodiment of the present invention provides a second possible implementation of the first aspect, wherein, after the web page parser parses the retrieved web page content and extracts the metadata of the videos to be screened, the method further comprises:
judging, by the web page parser, whether the metadata of the videos to be screened is all of the data that satisfies the search condition; and
if not, continuing to execute the step of generating, by the task dispatcher, the URL of a web page crawling task corresponding to the preset search condition and sending the URL to the web page crawler.
With reference to the first aspect, an embodiment of the present invention provides a third possible implementation of the first aspect, wherein the metadata of the videos to be screened includes an upload time and a click count, and the screening rule includes an upload time requirement and a popularity requirement; determining a target video from the videos to be screened according to the preset screening rule and the metadata of the videos to be screened comprises:
judging whether the upload time of a video to be screened satisfies the upload time requirement;
if the upload time requirement is satisfied, judging, according to the click count of the video to be screened, whether the video to be screened satisfies the popularity requirement; and
determining a video to be screened that satisfies the popularity requirement to be a target video.
With reference to the third possible implementation of the first aspect, an embodiment of the present invention provides a fourth possible implementation of the first aspect, wherein the metadata of the videos to be screened further includes an acquisition time, and the popularity requirement includes a popularity threshold; judging, according to the click count of the video to be screened, whether the video to be screened satisfies the popularity requirement comprises:
calculating a popularity value of the video to be screened from its current click count and current acquisition time and its previous click count and previous acquisition time;
judging whether the popularity value is greater than or equal to the popularity threshold; and
if the popularity value is greater than or equal to the popularity threshold, determining that the video to be screened satisfies the popularity requirement.
With reference to the first aspect, an embodiment of the present invention provides a fifth possible implementation of the first aspect, wherein, after saving the metadata of the target video, the method further comprises:
issuing a notification of a new video to be processed to the user.
In a second aspect, an embodiment of the present invention further provides a video screening system, comprising:
a data acquisition module, configured to obtain metadata of videos to be screened according to a preset search condition;
a video screening module, configured to determine a target video from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and
a video saving module, configured to save the metadata of the target video for processing by a user.
With reference to the second aspect, an embodiment of the present invention provides a first possible implementation of the second aspect, wherein the data acquisition module is specifically configured to:
generate, by a task dispatcher, a uniform resource locator (URL) of a web page crawling task corresponding to the preset search condition, and send the URL to a web page crawler, wherein the search condition includes a search address, search keywords, and a retrieval time requirement;
send, by the web page crawler, the URL to a website server to fetch the web page content corresponding to the web page crawling task;
send, by the task dispatcher, the web page content to the web page parser corresponding to the video website to which the web page content belongs; and
parse, by the web page parser, the web page content to extract the metadata of the videos to be screened.
With reference to the first possible implementation of the second aspect, an embodiment of the present invention provides a second possible implementation of the second aspect, wherein the data acquisition module is further specifically configured to:
judge, by the web page parser, whether the metadata of the videos to be screened is all of the data that satisfies the search condition; and
if not, continue to execute the step of generating, by the task dispatcher, the URL of a web page crawling task corresponding to the preset search condition and sending the URL to the web page crawler.
With reference to the second aspect, an embodiment of the present invention provides a third possible implementation of the second aspect, wherein the metadata of the videos to be screened includes an upload time and a click count, and the screening rule includes an upload time requirement and a popularity requirement; the video screening module is specifically configured to:
judge whether the upload time of a video to be screened satisfies the upload time requirement;
if the upload time requirement is satisfied, judge, according to the click count of the video to be screened, whether the video to be screened satisfies the popularity requirement; and
determine a video to be screened that satisfies the popularity requirement to be a target video.
The embodiments of the present invention bring the following beneficial effects:
In the embodiments of the present invention, metadata of videos to be screened is obtained according to a preset search condition; a target video is determined from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and the metadata of the target video is saved for processing by a user. By automatically collecting and screening video data, the video screening method and system provided by the embodiments effectively reduce the user's workload and make it easier to discover harmful videos in time, thereby reducing labor cost and improving work efficiency. Because collection and screening are not affected by the working state of staff, the stability of the video screening results is also improved.
Other features and advantages of the present invention will be set forth in the following description and will in part become apparent from the description or be understood by practicing the invention. The objectives and other advantages of the invention are realized and attained by the structure particularly pointed out in the description, the claims, and the drawings.
To make the above objectives, features, and advantages of the present invention clearer and easier to understand, preferred embodiments are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To explain the specific embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed for describing the specific embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show some embodiments of the present invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a video screening method provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of the overall flow of a video screening process provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a video screening system provided by an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of another video screening system provided by an embodiment of the present invention.
Detailed description of the embodiments
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.
At present, traditional harmful-video screening usually relies on manual work, so its labor cost is high, its work efficiency is low, and the stability of its screening results is poor. In view of this, the video screening method and system provided by the embodiments of the present invention can reduce labor cost and improve work efficiency and the stability of the video screening results.
To facilitate understanding of this embodiment, a video screening method disclosed in an embodiment of the present invention is first described in detail.
Embodiment one:
An embodiment of the present invention provides an intelligent video information extraction and analysis technique, based on computer and Internet technology, for collecting and screening harmful information. Fig. 1 is a schematic flowchart of a video screening method provided by an embodiment of the present invention. As shown in Fig. 1, the method includes the following steps:
Step S102: obtain metadata of videos to be screened according to a preset search condition.
Before the metadata of the videos to be screened is obtained, a search condition must first be set or obtained. The preset search condition includes a search address, search keywords, a retrieval time requirement, and so on. For example: the video websites of interest are selected as search addresses (websites); keywords and phrases commonly used to retrieve videos are collected into a keyword library; retrieval time requirements such as the search interval and the range of video upload times are set; and further conditions, such as whether to download, may also be set. The search keywords are collected from the user's everyday use, for example terms that need to be searched over a long period or recent trending terms. The keyword library is a relatively dynamic dictionary; adding, deleting, and modifying search keywords in it yields more accurate search results.
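The search condition described above can be pictured as a small record holding the websites, the keyword library, and the time requirements. The following is only a minimal sketch under stated assumptions: the class name, the field names, and the default interval and upload window are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field
from datetime import timedelta


@dataclass
class SearchCondition:
    """Hypothetical container for the search condition of one task."""
    task_name: str
    search_addresses: list[str]                        # video websites to search
    keywords: set[str] = field(default_factory=set)    # the keyword library
    search_interval: timedelta = timedelta(hours=1)    # assumed default
    upload_window: timedelta = timedelta(days=30)      # assumed default
    download: bool = False                             # optional "whether to download"

    def add_keyword(self, word: str) -> None:
        # The keyword library is dynamic: terms can be added at any time.
        self.keywords.add(word.strip())

    def remove_keyword(self, word: str) -> None:
        # Removing a term that is not present is silently ignored.
        self.keywords.discard(word.strip())


cond = SearchCondition("demo-task", ["https://video.example.com"])
cond.add_keyword("breaking news")
cond.add_keyword("trending topic")
cond.remove_keyword("breaking news")
print(sorted(cond.keywords))
```

Keeping the keywords in a set makes repeated additions harmless, which suits a library that is edited frequently.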
The user can start the video screening method by creating a blind search task. A newly created blind search task takes effect for all supported websites (the video websites selected in the search condition) at the same time, without websites having to be added one by one. The search condition obtained automatically when a blind search task is created is a general one that does not suit every website, so the user can modify the search condition according to the results returned. Search conditions can be classified by task name and saved row by row in the database in text form; modified parameters take effect in the next blind search task. Accordingly, the method further includes: when a search condition modification request is received, modifying the current search condition according to the request, and saving the modified search condition.
After the blind search task runs, the method provided by this embodiment automatically retrieves, from the above websites, web page content that is relevant to the keyword library and satisfies the retrieval time requirement, and then parses the retrieved web page content to obtain the metadata of the videos to be screened.
Step S104: determine a target video from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened.
The metadata of each video to be screened is compared with the preset screening rule, and a video to be screened that satisfies the screening rule is determined to be a target video.
Step S106: save the metadata of the target video for processing by the user.
The metadata of the target video is saved to a database for the user to retrieve, view, and process.
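Saving the target-video metadata by task name, one row per video, might look like the sketch below. It uses Python's built-in sqlite3 module purely for illustration; the patent does not name a database, and the table name, column names, and sample record are assumptions.

```python
import sqlite3


def save_target_videos(conn, task_name, videos):
    """Store one row per target video, grouped by task name."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS target_video ("
        "task_name TEXT, url TEXT, title TEXT, upload_time TEXT, clicks INTEGER, "
        "PRIMARY KEY (task_name, url))"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO target_video VALUES (?, ?, ?, ?, ?)",
        [(task_name, v["url"], v["title"], v["upload_time"], v["clicks"]) for v in videos],
    )
    conn.commit()


def list_target_videos(conn, task_name):
    """Return the saved videos of one task for the user to review."""
    rows = conn.execute(
        "SELECT url, title, clicks FROM target_video WHERE task_name = ? ORDER BY url",
        (task_name,),
    )
    return rows.fetchall()


conn = sqlite3.connect(":memory:")  # in-memory database, for the example only
save_target_videos(conn, "demo-task", [
    {"url": "https://video.example.com/v/1", "title": "clip one",
     "upload_time": "2018-03-01T08:00:00", "clicks": 1200},
])
saved = list_target_videos(conn, "demo-task")
```

The composite primary key means that a video seen again in a later run of the same task overwrites its earlier row instead of creating a duplicate.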
The user can review newly obtained videos (target videos) and process them further, for example by downloading, archiving, or deleting them, and record the processing result in the database in time. Steps S102 to S106 are executed periodically according to the search interval set in the search condition. The user can review or process the metadata of the target videos stored in the database at any time.
In some possible embodiments, the above step S102 can be implemented by the following process: a task dispatcher generates the uniform resource locator (URL) of a web page crawling task corresponding to the preset search condition and sends the URL to a web page crawler, wherein the search condition includes a search address, search keywords, and a retrieval time requirement; the web page crawler sends the URL to a website server to fetch the web page content corresponding to the web page crawling task; the task dispatcher sends the web page content to the web page parser corresponding to the video website to which the web page content belongs; and the web page parser parses the web page content to extract the metadata of the videos to be screened.
Specifically, after the blind search task runs, the task dispatcher obtains the search condition from the database, generates the URL of a web page crawling task, and hands it to the web page crawler through a task queue to carry out the actual crawling task. Web page crawling tasks are classified by task name and saved row by row in the database.
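As an illustration of how a task dispatcher might turn a search condition into crawling-task URLs and place them on a task queue, consider the sketch below. The "/search" path and the "q" and "page" parameters are placeholders; the patent does not describe any site's URL format, and a real site would need its own template.

```python
from collections import deque
from urllib.parse import urlencode, urljoin


def build_crawl_urls(search_addresses, keywords, page=1):
    """Yield one search URL per (website, keyword) pair.

    The "/search" path and the "q"/"page" parameters are hypothetical.
    """
    for base in search_addresses:
        for word in sorted(keywords):
            query = urlencode({"q": word, "page": page})
            yield urljoin(base, "/search") + "?" + query


# The task queue through which URLs are handed to the crawler.
task_queue = deque(build_crawl_urls(["https://video.example.com"], {"trending topic"}))
```

A plain deque stands in for the task queue here; in a distributed deployment the same URLs would be published to a shared queue instead.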
The web page crawler sends the URL to the website server, fetches the web page content by means such as (but not limited to) the HTTP or HTTPS protocol, and hands the web page content to the web page parser via the task dispatcher for processing. A web page crawling task has the highest running priority and is executed as soon as it is generated. The web page crawler can be deployed in a distributed manner, which gives better response speed and user experience when there are many tasks.
Different video websites have different web page parsers; a web page parser extracts video metadata according to the parsing rules corresponding to its video website. Video metadata may include the video name, thumbnail, duration, upload time, click count, and so on. Video metadata is classified by task name and saved row by row in the database.
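One way to give each video website its own parser is a registry keyed by host name, so the dispatcher can route page content to the matching parser. The sketch below assumes a made-up page markup and a made-up host; neither comes from the patent, and a regular expression is used only to keep the example self-contained.

```python
import re
from urllib.parse import urlparse

PARSERS = {}  # host name -> parser function


def parser_for(host):
    """Decorator that registers a parser for one video website."""
    def register(func):
        PARSERS[host] = func
        return func
    return register


@parser_for("video.example.com")
def parse_example_site(html):
    # Hypothetical markup: <li data-title="..." data-clicks="..." data-uploaded="...">
    pattern = re.compile(
        r'<li data-title="([^"]*)" data-clicks="(\d+)" data-uploaded="([^"]*)">'
    )
    return [
        {"title": title, "clicks": int(clicks), "upload_time": uploaded}
        for title, clicks, uploaded in pattern.findall(html)
    ]


def parse_page(url, html):
    """Route page content to the parser of the website it came from."""
    host = urlparse(url).hostname
    if host not in PARSERS:
        raise LookupError(f"no parser registered for {host}")
    return PARSERS[host](html)


page = '<ul><li data-title="clip one" data-clicks="1200" data-uploaded="2018-03-01"></li></ul>'
videos = parse_page("https://video.example.com/search?q=x", page)
```

Supporting a new website then only requires registering one more parser function; the routing code stays unchanged.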
Further, considering that the metadata of the videos to be screened obtained by the above process may be incomplete, the method further includes: judging, by the web page parser, whether the metadata of the videos to be screened is all of the data that satisfies the search condition; and if not, continuing to execute the step of "generating, by the task dispatcher, the URL of a web page crawling task corresponding to the preset search condition, and sending the URL to the web page crawler". For example, if videos from the past month are required and it is determined that only videos from the past week have been fetched, web page crawling tasks continue to be created and executed.
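The completeness check can be sketched as a loop that keeps requesting result pages until the fetched videos reach back past the required upload-time window. This is an illustrative sketch only: it assumes results arrive newest first, and fetch_page, max_pages, and the fake site are hypothetical.

```python
from datetime import datetime, timedelta


def crawl_until_complete(fetch_page, now, window, max_pages=50):
    """Fetch result pages (newest first) until the required window is covered.

    fetch_page(page_number) is a hypothetical callable returning the upload
    times found on that page, or an empty list when no results remain.
    """
    oldest_required = now - window
    collected = []
    for page in range(1, max_pages + 1):
        upload_times = fetch_page(page)
        if not upload_times:
            break  # the site has nothing more to return
        collected.extend(t for t in upload_times if t >= oldest_required)
        if min(upload_times) <= oldest_required:
            break  # results reach back past the window: data is complete
    return collected


now = datetime(2018, 3, 19)
# Fake site: each page covers one more week of uploads.
fake_pages = {
    n: [now - timedelta(days=7 * n - 1), now - timedelta(days=7 * n)]
    for n in range(1, 9)
}
found = crawl_until_complete(lambda n: fake_pages.get(n, []), now, timedelta(days=30))
```

With a 30-day window, the loop fetches five weekly pages, keeps the eight uploads from the first four, and stops once the fifth page falls entirely outside the window.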
In some possible embodiments, the metadata of the videos to be screened includes an upload time and a click count, and the above screening rule includes an upload time requirement and a popularity requirement. The above step S104 can be implemented by the following process: judging whether the upload time of a video to be screened satisfies the upload time requirement; if the upload time requirement is satisfied, judging, according to the click count of the video to be screened, whether the video satisfies the popularity requirement; and determining a video to be screened that satisfies the popularity requirement to be a target video.
Further, the metadata of the videos to be screened also includes an acquisition time, and the popularity requirement includes a popularity threshold. The above step of "judging whether the video to be screened satisfies the popularity requirement" can be implemented by the following process: calculating a popularity value of the video to be screened from its current click count and current acquisition time and its previous click count and previous acquisition time; judging whether the popularity value is greater than or equal to the popularity threshold; and if the popularity value is greater than or equal to the popularity threshold, determining that the video to be screened satisfies the popularity requirement.
Specifically, the above step S104 can be executed by the content filter. Considering that highly popular videos tend to attract users' interest and to be useful videos, a new-video judgment is made first: whether a video is new is judged from its upload time. A popularity judgment is then made: from the click count n1 and time t1 obtained this time and the click count n0 and time t0 obtained last time, the popularity of the video is calculated as H1 = (n1 - n0)/(t1 - t0). The parsed video metadata is then compared with the above search condition, and the information of videos that satisfy the search condition is classified by task name and saved row by row in the database.
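The two-stage screening and the popularity formula H1 = (n1 - n0)/(t1 - t0) can be illustrated as follows. This is a sketch under stated assumptions: the patent gives no unit for the time difference, so clicks per hour is an arbitrary choice, and the function names, dictionary keys, maximum age, and thresholds are hypothetical. The patent also does not say how a video with no previous sample is handled, so this sketch requires one.

```python
from datetime import datetime, timedelta


def popularity(n1, t1, n0, t0):
    """H1 = (n1 - n0) / (t1 - t0), here in clicks per hour (unit assumed)."""
    hours = (t1 - t0).total_seconds() / 3600
    if hours <= 0:
        raise ValueError("current acquisition time must be later than the previous one")
    return (n1 - n0) / hours


def is_target(video, previous, now, max_age, threshold):
    # Stage 1: new-video judgment based on upload time.
    if now - video["upload_time"] > max_age:
        return False
    # Stage 2: popularity judgment against the threshold.
    h1 = popularity(video["clicks"], video["acquired"],
                    previous["clicks"], previous["acquired"])
    return h1 >= threshold


now = datetime(2018, 3, 19, 12, 0)
previous = {"clicks": 1000, "acquired": now - timedelta(hours=2)}
current = {"clicks": 1600, "acquired": now, "upload_time": now - timedelta(days=1)}
h1 = popularity(current["clicks"], current["acquired"],
                previous["clicks"], previous["acquired"])
```

In this example the video gained 600 clicks in 2 hours, so H1 is 300 clicks per hour; with a threshold of 250 it would be kept, and with a threshold of 301 it would not.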
In the embodiments of the present invention, metadata of videos to be screened is obtained according to a preset search condition; a target video is determined from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and the metadata of the target video is saved for processing by a user. By automatically collecting and screening video data, the video screening method provided by this embodiment effectively reduces the user's workload and makes it easier to discover harmful videos in time, thereby reducing labor cost and improving work efficiency. Because collection and screening are not affected by the working state of staff, the stability of the video screening results is also improved.
To improve the timeliness of video screening, after the metadata of the target video is saved, the method further includes: issuing a notification of a new video to be processed to the user, for example by displaying a notification icon and/or playing an alert sound on the client (mobile phone, computer, etc.). This informs the user that a new video has been retrieved, so that the user can handle it in time.
To make it easy for the user to check how a task is going, task progress can also be displayed. Specifically, the results of the blind search task (such as the target videos obtained) and its progress are shown on a page of the client, including the creation time, the number of videos obtained, the number of videos screened, and so on. The user can view the task progress, for example by checking the target videos obtained, and can also add, modify, delete, or annotate the task progress.
Fig. 2 is a schematic diagram of the overall flow of a video screening process provided by an embodiment of the present invention. As shown in Fig. 2, the video screening process includes the following:
Process 202: set the search condition.
Process 204: retrieve videos from the video websites.
Process 206: analyze and screen the search results. The retrieved web page content is parsed, and the videos obtained from parsing are screened.
When process 206 is reached, if the search condition has been modified, process 202 is executed again.
Process 208: the client displays a new-video notification.
Processes 204 to 208 are then executed periodically according to the search interval set in the search condition.
Process 210: manually review whether the screened-out videos are useful.
Process 212: record and save useful videos, and discard useless videos.
Embodiment two:
Fig. 3 is a schematic structural diagram of a video screening system provided by an embodiment of the present invention. As shown in Fig. 3, the system includes:
a data acquisition module 32, configured to obtain metadata of videos to be screened according to a preset search condition;
a video screening module 34, configured to determine a target video from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and
a video saving module 36, configured to save the metadata of the target video for processing by a user.
In some possible embodiments, the data acquisition module 32 is specifically configured to:
generate, by a task dispatcher, the uniform resource locator (URL) of a web page crawling task corresponding to the preset search condition, and send the URL to a web page crawler, wherein the search condition includes a search address, search keywords, and a retrieval time requirement; send, by the web page crawler, the URL to a website server to fetch the web page content corresponding to the web page crawling task; send, by the task dispatcher, the web page content to the web page parser corresponding to the video website to which the web page content belongs; and parse, by the web page parser, the web page content to extract the metadata of the videos to be screened.
Considering that the metadata of the videos to be screened obtained by the data acquisition module 32 may be incomplete, the data acquisition module 32 is further specifically configured to:
judge, by the web page parser, whether the metadata of the videos to be screened is all of the data that satisfies the search condition; and if not, continue to execute the step of generating, by the task dispatcher, the URL of a web page crawling task corresponding to the preset search condition and sending the URL to the web page crawler.
In some possible embodiments, the metadata of the videos to be screened includes an upload time and a click count, and the screening rule includes an upload time requirement and a popularity requirement; the video screening module 34 is specifically configured to:
judge whether the upload time of a video to be screened satisfies the upload time requirement; if the upload time requirement is satisfied, judge, according to the click count of the video to be screened, whether the video satisfies the popularity requirement; and determine a video to be screened that satisfies the popularity requirement to be a target video.
In the embodiments of the present invention, metadata of videos to be screened is obtained according to a preset search condition; a target video is determined from the videos to be screened according to a preset screening rule and the metadata of the videos to be screened; and the metadata of the target video is saved for processing by a user. By automatically collecting and screening video data, the video screening system provided by this embodiment effectively reduces the user's workload and makes it easier to discover harmful videos in time, thereby reducing labor cost and improving work efficiency. Because collection and screening are not affected by the working state of staff, the stability of the video screening results is also improved.
Fig. 4 is a schematic structural diagram of another video screening system provided by an embodiment of the present invention. As shown in Fig. 4, the system includes subsystems such as a task dispatcher, a web page crawler, a web page parser, and a content filter.
The subsystems are connected by message queues. Except for the task dispatcher, which is deployed as a single node, the web page crawler, web page parser, and content filter can all be deployed in a distributed manner across multiple nodes. The task dispatcher can follow a breadth-first strategy and is responsible for overall scheduling control.
Specifically, as shown in Fig. 4, the task dispatcher initiates and schedules the blind search task, the web page crawler fetches the web page content, and the web page parser parses the web page content and judges whether the parsing result obtained is the complete result for the preset search condition. If it is not, a new link-extraction task is generated (and sent to the task dispatcher), forming a closed loop. Finally, the parsing result is filtered further, and the final result is presented to the user and saved to the database for the user to view.
Wherein, above-mentioned webpage capture device can flexibly parse the page using a variety of libraries, and use frame API (Application Programming Interface, application programming interface) controls next step grasping movement, by setting Put back into regulation parsing movement.
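The closed loop between the subsystems can be sketched in a single process, with in-memory queues standing in for the message queues. This is only an illustration under assumptions: the function run_pipeline and its parameters fetch, parse and keep are hypothetical, the parse callback is assumed to return both extracted items and any follow-up URLs, and a real deployment would run the stages as separate distributed workers rather than in one loop.

```python
import queue


def run_pipeline(seed_urls, fetch, parse, keep, max_tasks=1000):
    """Single-process stand-in for dispatcher -> capture -> parser -> filter."""
    task_q = queue.Queue()   # dispatcher -> capture device (FIFO gives breadth-first order)
    page_q = queue.Queue()   # capture device -> parser
    item_q = queue.Queue()   # parser -> content filter
    seen, results = set(), []

    def schedule(url):
        # Dispatcher: enqueue each URL at most once.
        if url not in seen:
            seen.add(url)
            task_q.put(url)

    for url in seed_urls:
        schedule(url)

    handled = 0
    while not task_q.empty() and handled < max_tasks:
        url = task_q.get()
        handled += 1
        page_q.put((url, fetch(url)))                 # capture stage
        page_url, content = page_q.get()
        items, next_urls = parse(page_url, content)   # parsing callback
        for item in items:
            item_q.put(item)
        for nxt in next_urls:                         # incomplete result: new tasks close the loop
            schedule(nxt)
        while not item_q.empty():                     # content filter stage
            item = item_q.get()
            if keep(item):
                results.append(item)
    return results
```

Passing parse as a parameter mirrors the callback idea in the text: the capture stage does not need to know how a given website's pages are parsed.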
The technical solution provided by the embodiment of the present invention has the following advantages:
1. Reduced cost: once deployed, the system can complete the collection of predetermined (harmful) video information, which reduces the user's workload and lowers labor cost.
2. Good stability: the collection of video information by the system is not affected by the working state of staff, and tasks are executed and completed according to preset targets.
3. Round-the-clock operation: the system can work 24 hours a day.
4. Intelligence: the system structures the collected results, making them easy to view and manage.
5. Data review: the metadata of all videos is saved in a central database, so historical information can be retrieved and viewed by relevance and by time.
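The data review advantage (item 5) can be illustrated with a small metadata store. The sketch below uses SQLite from the Python standard library purely as a stand-in for the central database; the table name video_meta, its columns, and the use of a title keyword match as a crude proxy for relevance are assumptions, since the text does not specify a schema or a relevance measure.

```python
import sqlite3


def open_store(path=":memory:"):
    """Open (or create) the metadata store; an in-memory database by default."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS video_meta ("
        " url TEXT PRIMARY KEY, title TEXT, upload_time TEXT, click_volume INTEGER)"
    )
    return conn


def save(conn, url, title, upload_time, click_volume):
    """Insert or update one video's metadata; upload_time is an ISO 8601 string."""
    conn.execute(
        "INSERT OR REPLACE INTO video_meta VALUES (?, ?, ?, ?)",
        (url, title, upload_time, click_volume),
    )
    conn.commit()


def search(conn, keyword, since):
    """Return videos whose title contains keyword, uploaded at or after since."""
    rows = conn.execute(
        "SELECT url, title, upload_time FROM video_meta"
        " WHERE title LIKE ? AND upload_time >= ? ORDER BY upload_time DESC",
        (f"%{keyword}%", since),
    )
    return rows.fetchall()
```

Storing times as ISO 8601 strings in a single consistent format lets plain string comparison order them chronologically, which keeps the time filter simple.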
The video screening system provided by the embodiment of the present invention has the same technical features as the video screening method provided by the above embodiment, so it can solve the same technical problem and achieve the same technical effect.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working process of the system described above may refer to the corresponding process in the foregoing method embodiment, and is not repeated here.
The flowcharts and block diagrams in the drawings show the possible architecture, functions and operations of systems, methods and computer program products according to multiple embodiments of the present invention. In this regard, each block in a flowchart or block diagram may represent a module, a program segment or a part of code, which contains one or more executable instructions for implementing the specified logical function. It should also be noted that, in some alternative implementations, the functions marked in the blocks may occur in an order different from that marked in the drawings. For example, two consecutive blocks may in fact be executed substantially in parallel, or sometimes in the reverse order, depending on the functions involved. It should further be noted that each block in the block diagrams and/or flowcharts, and combinations of such blocks, may be implemented by a dedicated hardware-based system that performs the specified functions or actions, or by a combination of dedicated hardware and computer instructions.
The computer program product for the video screening method provided by the embodiment of the present invention includes a computer-readable storage medium storing non-volatile program code executable by a processor. The instructions included in the program code can be used to execute the method described in the foregoing method embodiments; for the specific implementation, refer to the method embodiments, which are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed system and method may be implemented in other ways. The system embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and other divisions are possible in actual implementation; for instance, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be indirect coupling or communication connection through some communication interfaces, devices or units, and may be electrical, mechanical or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
If the functions are implemented in the form of software functional units and sold or used as an independent product, they may be stored in a non-volatile computer-readable storage medium executable by a processor. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device or the like) to execute all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disc.
Finally, it should be noted that the embodiments described above are only specific implementations of the present invention, intended to illustrate rather than limit its technical solutions, and the protection scope of the present invention is not limited to them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that anyone familiar with the technical field may, within the technical scope disclosed by the present invention, still modify the technical solutions recorded in the foregoing embodiments, readily conceive of variations, or make equivalent replacements of some of the technical features. Such modifications, variations or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention, and shall all be covered by the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A video screening method, characterized by comprising:
Obtaining metadata of a video to be screened according to a preset search condition;
Determining a target video from the videos to be screened according to a preset screening rule and the metadata of the video to be screened;
Saving the metadata of the target video for the user to process.
2. The method according to claim 1, characterized in that obtaining the metadata of the video to be screened according to the preset search condition comprises:
Generating, by a task dispatcher, a uniform resource locator (URL) of a webpage capture task corresponding to the preset search condition, and sending the URL to a webpage capture device; wherein the search condition includes a search address, a search keyword and a search time requirement;
Sending, by the webpage capture device, the URL to a website server to capture the web page content corresponding to the webpage capture task;
Sending, by the task dispatcher, the web page content to the web-page parser corresponding to the video website to which the web page content belongs;
Parsing the web page content by the web-page parser to extract the metadata of the video to be screened.
3. The method according to claim 2, characterized in that after parsing the captured web page content by the web-page parser to extract the metadata of the video to be screened, the method further comprises:
Judging, by the web-page parser, whether the metadata of the video to be screened is the complete set of data meeting the search condition;
If not, continuing to perform the step of generating, by the task dispatcher, the uniform resource locator (URL) of the webpage capture task corresponding to the preset search condition and sending the URL to the webpage capture device.
4. The method according to claim 1, characterized in that the metadata of the video to be screened includes an upload time and a click volume, and the screening rule includes an upload time requirement and a popularity requirement; determining the target video from the videos to be screened according to the preset screening rule and the metadata of the video to be screened comprises:
Judging whether the upload time of the video to be screened meets the upload time requirement;
If the upload time requirement is met, judging, according to the click volume of the video to be screened, whether the video to be screened meets the popularity requirement;
Determining a video to be screened that meets the popularity requirement as the target video.
5. The method according to claim 4, characterized in that the metadata of the video to be screened further includes an acquisition time, and the popularity requirement includes a popularity threshold; judging, according to the click volume of the video to be screened, whether the video to be screened meets the popularity requirement comprises:
Calculating a popularity value of the video to be screened according to the current click volume and current acquisition time of the video to be screened and the previous click volume and previous acquisition time of the video to be screened;
Judging whether the popularity value is greater than or equal to the popularity threshold;
If the popularity value is greater than or equal to the popularity threshold, determining that the video to be screened meets the popularity requirement.
6. The method according to claim 1, characterized in that after saving the metadata of the target video, the method further comprises:
Issuing to the user a notification that a new video is to be processed.
7. A video screening system, characterized by comprising:
A data acquisition module, configured to obtain metadata of a video to be screened according to a preset search condition;
A video screening module, configured to determine a target video from the videos to be screened according to a preset screening rule and the metadata of the video to be screened;
A video saving module, configured to save the metadata of the target video for the user to process.
8. The system according to claim 7, characterized in that the data acquisition module is specifically configured to:
Generate, by a task dispatcher, a uniform resource locator (URL) of a webpage capture task corresponding to the preset search condition, and send the URL to a webpage capture device; wherein the search condition includes a search address, a search keyword and a search time requirement;
Send, by the webpage capture device, the URL to a website server to capture the web page content corresponding to the webpage capture task;
Send, by the task dispatcher, the web page content to the web-page parser corresponding to the video website to which the web page content belongs;
Parse the web page content by the web-page parser to extract the metadata of the video to be screened.
9. The system according to claim 8, characterized in that the data acquisition module is further specifically configured to:
Judge, by the web-page parser, whether the metadata of the video to be screened is the complete set of data meeting the search condition;
If not, continue to perform the step of generating, by the task dispatcher, the uniform resource locator (URL) of the webpage capture task corresponding to the preset search condition and sending the URL to the webpage capture device.
10. The system according to claim 7, characterized in that the metadata of the video to be screened includes an upload time and a click volume, and the screening rule includes an upload time requirement and a popularity requirement; the video screening module is specifically configured to:
Judge whether the upload time of the video to be screened meets the upload time requirement;
If the upload time requirement is met, judge, according to the click volume of the video to be screened, whether the video to be screened meets the popularity requirement;
Determine a video to be screened that meets the popularity requirement as the target video.
CN201810227335.5A 2018-03-19 2018-03-19 Video screening technique and system Pending CN110309397A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810227335.5A CN110309397A (en) 2018-03-19 2018-03-19 Video screening technique and system

Publications (1)

Publication Number Publication Date
CN110309397A true CN110309397A (en) 2019-10-08

Family

ID=68073277

Country Status (1)

Country Link
CN (1) CN110309397A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191008
