Disclosure of Invention
Aiming at the technical problems in the prior art, the invention provides a material screening method and a material screening system that efficiently screen materials to be shared, respond quickly to the popularity of the materials, and distribute traffic reasonably.
In order to solve the above technical problem, according to an aspect of the present invention, there is provided a material screening method, including:
verifying each material in the first set;
adding materials meeting a preset sharing rate in the first set into the second set;
obtaining the exposure rate of each material in the second set; and
eliminating, from the second set into a material queue, the materials whose position in the exposure rate ranking meets a preset rank, wherein the exposure rate is related to the browsing times of the materials.
Preferably, the step of verifying each material in the first set comprises: acquiring the sharing amount of each material in the first set within a predetermined time period.
Preferably, the predetermined sharing rate is met when the number of shares exceeds a predetermined threshold within the predetermined time period, wherein the predetermined time period is 10-30 minutes, and the predetermined threshold is 2000-6000.
Preferably, the material screening method further comprises: adding the materials meeting the preset sharing rate in the first set into the second set when the time for which the one or more top-ranked materials in the exposure rate ranking of the second set have remained unchanged reaches a timing threshold.
Preferably, the exposure rate is the ratio of the browsing times to the sharing amount of each material in the second set in the current time slice.
Preferably, the material screening method further comprises adding one or more materials with the highest weight in the material queue to the first set.
Preferably, the more times a material in the material queue has been eliminated, the lower the weight of the material.
Preferably, the material screening method further comprises: when the number of materials in the first set is less than a preset number, adding one or more materials with the highest weight in the material queue into the first set.
Preferably, the number of individual materials in the first set remains constant.
Preferably, the number of individual materials in the second set is not less than 5.
According to one aspect of the present invention, there is provided a material screening system comprising:
the material verification module is used for verifying each material in the first set and adding the materials meeting a preset sharing rate in the first set into the second set; and
the material competition module is used for obtaining the exposure rate of each material in the second set, and eliminating, from the second set into a material queue, the materials whose position in the exposure rate ranking meets a preset rank, wherein the exposure rate is related to the browsing times of the materials.
Preferably, the material verification module includes:
the first share quantity counting unit is used for counting the share quantity of each material in the first set within a preset time period;
the first timing unit is used for timing the verification time of each material in the first set;
the comparison unit is used for comparing the sharing amount of each material in the first set in a preset time period with the size of a preset threshold value; and
the second sending unit is used for supplementing a material into the second set after the material passes verification, that is, when the sharing amount of the material in the preset time period is greater than or equal to the preset threshold.
Preferably, the predetermined time period is 10-30 minutes, and the predetermined threshold is 2000-6000 times.
Preferably, the material screening system further includes a material providing module, configured to obtain a weight of each material in the material queue, and add one or more materials with the highest weight in the material queue to the first set.
Preferably, the material verification module further includes:
the material supplementing unit is used for sending a material supplementing request to the material providing module when the number of the materials in the first set is less than the preset number.
Preferably, the material providing module includes:
the weight calculation unit is used for calculating the weight of each material in the material queue;
the first sequencing unit is used for sequencing the materials in the material queue according to the sequence of the weight from high to low; and
the first sending unit is used for supplementing one or more materials with the highest weight in the material queue into the first set.
Preferably, the material competition module includes:
the second sharing amount counting unit is used for counting the sharing amount of each material in the current time slice;
the calculating unit is used for calculating the ratio of the browsing times to the sharing amount to obtain the exposure rate of the material;
the second sorting unit is used for sorting all the materials in the second set according to the exposure rate; and
the third sending unit is used for eliminating, from the second set into the material queue, the materials whose position in the exposure rate ranking meets the preset rank.
Preferably, the material competition module further comprises:
the second timing unit is used for timing how long the one or more top-ranked materials in the exposure rate ranking of the second set remain unchanged, and sending a material updating request to the material verification module when the timing threshold is reached.
The material screening method and the material screening system provided by the invention can meet the operating requirements of high traffic and high quality, and can analyze a large amount of data in time, so that they respond quickly to the popularity of the materials. By continuously comparing and analyzing the materials, an optimal combination of materials is obtained and the traffic is utilized to the maximum extent.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the following detailed description, reference is made to the accompanying drawings that form a part hereof and in which is shown by way of illustration specific embodiments of the application. In the drawings, like numerals describe substantially similar components throughout the different views. Various specific embodiments of the present application are described in sufficient detail below to enable those skilled in the art to practice the teachings of the present application. It is to be understood that other embodiments may be utilized and structural, logical or electrical changes may be made to the embodiments of the present application.
By "material" is meant content that can be shared and browsed. The material may be a picture or a portion of a picture, an article or a portion of an article, a video or a portion of a video, an audio clip or a portion of an audio clip, or the like. The material may be very short, such as a slogan or an animated image; the material may also be long, such as an article or a video. In some embodiments, the material is not itself an advertisement but serves as a carrier for advertisements. In some embodiments, the material itself may also be an advertisement or a part of an advertisement.
In order to screen out the materials with high popularity, the invention divides all the materials into three sets: a material queue A, a first set B and a second set C. All materials available for sharing are placed in the material queue A. The materials in the first set B are the materials with high popularity selected from the material queue A; they need to be verified, and are transferred to the second set C after passing verification. Through ranking, the one or more top-ranked materials are always kept in the second set C; these are the screened-out materials with the highest popularity, which can be recommended to the user. In addition, the method and the system monitor the sharing and browsing of each material in real time. By counting the sharing amount (total number of shares) and the browsing times of a material in a time slice, the exposure rate of the material in that time slice can be obtained. The time slice can be set as desired, for example a 20-minute period or a 5-minute period.
Specifically, Fig. 1 is a flow chart of the material screening method of the present invention, and Figs. 2A to 2C are schematic diagrams of material transfer among the three material sets of the present invention. With reference to Fig. 1 and Figs. 2A to 2C, the material screening method includes the following steps:
In step S1, the weight of each material in the material queue A is obtained. In the invention, all materials available for sharing are placed in the material queue, and each material has its own weight. The weight described here is an elimination weight, i.e., a weight generated after the material was eliminated in the last competitive ranking. The weight of a material in the material queue A is inversely related to its elimination count: each time the material is eliminated, its elimination count is incremented by 1, and the elimination count of a new material has an initial value of 1. The more times a material has been eliminated, the lower its weight. For example, let AssetLength be the number of materials currently in the material queue A, C1 the elimination count of a material, and CA the sum of the elimination counts of all materials in the queue. The weight of the material is calculated with the formula:
Weight = (|C1 - AssetLength| + 1) / CA
For example, with 5 materials, AssetLength is 5; if the elimination counts C1 of the 5 materials are 1, 2, 3, 4 and 5 respectively, then CA is 15, and the weights of these 5 materials are as follows:
(|1-5|+1)/15=0.333333333
(|2-5|+1)/15=0.266666667
(|3-5|+1)/15=0.2
(|4-5|+1)/15=0.133333333
(|5-5|+1)/15=0.066666667
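The worked example above can be reproduced with a minimal Python sketch. The function name `elimination_weights` and the list-based input are assumptions made for illustration; only the formula and the sample values come from the description.

```python
from typing import List


def elimination_weights(elimination_counts: List[int]) -> List[float]:
    """Apply Weight = (|C1 - AssetLength| + 1) / CA to every material in queue A."""
    asset_length = len(elimination_counts)  # AssetLength: number of materials in queue A
    total = sum(elimination_counts)         # CA: sum of all elimination counts
    # New materials start at an elimination count of 1, so CA > 0 for a non-empty queue.
    return [(abs(c1 - asset_length) + 1) / total for c1 in elimination_counts]


weights = elimination_weights([1, 2, 3, 4, 5])
print([round(w, 9) for w in weights])
# [0.333333333, 0.266666667, 0.2, 0.133333333, 0.066666667]
```

The sketch applies the formula exactly as written. The example in the description only covers elimination counts up to AssetLength, so behavior for larger counts is not illustrated by the source.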
In step S2, the material with the highest current weight in the material queue A is supplemented into the first set B, as shown in Fig. 2A. After the weights of the materials in the material queue A are obtained, the materials can be sorted. Material can be supplemented from the material queue A into the first set B in two ways. One is active supplementation at a specified time. For example, the series of operations of calculating the weights, sorting, and supplementing material from the material queue A into the first set B can be performed periodically at a fixed frequency, such as every 20 minutes. When the predetermined 20 minutes arrive, the calculation and sorting are performed and the material is transferred.
The other is passive supplementation on demand, i.e., the weight calculation and sorting of the materials can be performed at a fixed frequency, but the transfer of material is performed in response to a received request. For example, when the number of materials in the first set B is less than a preset number that can be processed, such as 100, the processing device may request the material queue A to supplement material, and material is then passively supplemented into the first set B according to the request.
The number of supplemented materials may be a set number or a requested number. For example, it may be set that the 10 materials with the highest current weight in the material queue A are supplemented into the first set B. As another example, the number of materials in the first set B is a set number that the processing device can handle; if m materials have been sent out of the first set B, the request sent to the material queue A also specifies the requested number m. The m materials with the highest current weight in the material queue A are then supplemented into the first set B according to the requested number.
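On-demand supplementation can be sketched as follows. This is an illustration under assumptions, not the described implementation: the function `replenish_first_set`, the dictionary of weights keyed by material id, and the `capacity` parameter are all hypothetical names, and the requested number m is taken to be the shortfall of the first set B against its set number.

```python
import heapq
from typing import Dict, List


def replenish_first_set(queue_weights: Dict[str, float],
                        first_set: List[str],
                        capacity: int) -> List[str]:
    """Move the m highest-weight materials from queue A into first set B.

    m is the number of free slots in B (capacity minus current size).
    """
    requested = capacity - len(first_set)
    if requested <= 0:
        return []  # B is already full, nothing to request
    # Pick the ids with the highest weights; dict iteration yields the ids.
    chosen = heapq.nlargest(requested, queue_weights, key=queue_weights.get)
    for material_id in chosen:
        del queue_weights[material_id]  # the material leaves queue A
        first_set.append(material_id)   # and joins first set B
    return chosen


queue_a = {"m1": 0.33, "m2": 0.27, "m3": 0.20, "m4": 0.13}
set_b = ["m0"]
moved = replenish_first_set(queue_a, set_b, capacity=3)
print(moved, set_b)
```

With a capacity of 3 and one material already in B, two materials are requested and the two highest-weight ones are transferred.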
In step S3, the respective materials in the first set B are verified. The materials obtained in step S2 have higher weights, but whether they are really popular needs to be tested before they participate in the subsequent competitive ranking. First, the sharing amount of each material within its verification period is obtained; for example, the verification period may be set to 10-30 minutes, and the number of sharing events occurring for the material in this period is monitored. The sharing amount is then compared with a sharing threshold; for example, the sharing threshold is an appropriate value between 2000 and 6000 shares. If the sharing amount of the material in the verification period is greater than or equal to the sharing threshold, that is, the material meets the predetermined sharing rate, the material passes verification. If the sharing amount of the material in the verification period is less than the sharing threshold, the material waits for the next calculation.
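The verification step can be illustrated with a short sketch. The threshold of 4000 is one assumed value within the 2000-6000 range given above, and `verify_first_set` and the share-count dictionary are hypothetical names; counting shares over the verification period is assumed to have happened elsewhere.

```python
from typing import Dict, List, Tuple

SHARE_THRESHOLD = 4000  # assumed value within the 2000-6000 range


def verify_first_set(share_counts: Dict[str, int],
                     threshold: int = SHARE_THRESHOLD) -> Tuple[List[str], List[str]]:
    """Split first set B into materials that pass verification and those that wait.

    share_counts maps a material id to its sharing amount in the verification period.
    """
    passed = [m for m, shares in share_counts.items() if shares >= threshold]
    waiting = [m for m, shares in share_counts.items() if shares < threshold]
    return passed, waiting


passed, waiting = verify_first_set({"m1": 5200, "m2": 4000, "m3": 1800})
print(passed, waiting)
```

Materials in `passed` would be candidates for the second set C, while those in `waiting` remain in the first set B until the next calculation.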
In step S4, the verified material is supplemented into the second set C, as shown in Fig. 2B. The moment at which material is supplemented into the second set C can be chosen in many ways. For example, when the material in the second set C needs to be updated, a material supplement request is sent, and the verified material is supplemented into the second set C according to the request.
In step S5, the materials in the second set C are sorted by exposure rate, and the materials whose rank meets the preset rank are eliminated into the material queue. In one embodiment, the number of materials in the second set C is kept at 10, and materials ranked below the top 10 by exposure rate are eliminated into the material queue A, as shown in Fig. 2C. In a preferred embodiment, in order to increase the stability of the materials, an update frequency is set; for example, the materials in the second set C are updated every 5 minutes. The time for which the second set C has held its current top 10 materials is timed, and verified material is supplemented into the second set C when the timing threshold (i.e., the update time of 5 minutes) is reached. At this time, the materials in the second set C are sorted again, and the materials ranked below the predetermined top 10 are eliminated into the material queue A.
The exposure rate of a material may represent the popularity of the material. The sharing amount and the browsing times of the material in the current time slice are first counted; the exposure rate is equal to the ratio of the browsing times to the sharing amount. The higher the browsing times of the material, the more people like the material, and the more popular the material is.
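The exposure rate calculation and the elimination by rank can be sketched as below. The `SliceStats` record, the function names, and the `keep` parameter are hypothetical; treating a material with zero shares as having an exposure rate of 0 is an assumption, since the description does not address that case.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SliceStats:
    material_id: str
    views: int   # browsing times in the current time slice
    shares: int  # sharing amount in the current time slice


def exposure_rate(stats: SliceStats) -> float:
    """Exposure rate = browsing times / sharing amount."""
    # Assumption: a material with no shares in the slice gets rate 0.
    return stats.views / stats.shares if stats.shares else 0.0


def rank_and_eliminate(second_set: List[SliceStats],
                       keep: int = 10) -> Tuple[List[SliceStats], List[SliceStats]]:
    """Sort second set C by exposure rate; return (kept, eliminated to queue A)."""
    ranked = sorted(second_set, key=exposure_rate, reverse=True)
    return ranked[:keep], ranked[keep:]


stats = [SliceStats("a", 900, 300), SliceStats("b", 1000, 200), SliceStats("c", 100, 100)]
kept, eliminated = rank_and_eliminate(stats, keep=2)
print([s.material_id for s in kept], [s.material_id for s in eliminated])
```

In this example the rates are 3.0, 5.0 and 1.0, so with `keep=2` materials "b" and "a" stay in the second set C and "c" returns to the material queue A.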
In order to implement the method, the present invention further provides a material screening system. As shown in Fig. 3, the system includes a material providing module 1, a material verification module 2 and a material competition module 3.
Fig. 4 is a schematic block diagram of the material providing module 1. The material providing module 1 includes a weight calculating unit 11, a first sorting unit 12, and a first sending unit 13. The weight calculating unit 11 calculates the weight of each material in the material queue. The first sorting unit 12 sorts the materials in the material queue in order of weight from high to low. The first sending unit 13 supplements the material with the highest weight in the material queue into the first set B as material to be verified. As described in the foregoing method embodiment, the transfer of material occurs in two cases: at a timed, fixed frequency, or upon request of the material verification module 2. One or more materials may be supplemented at a time, or the number supplemented may be the requested number.
Fig. 5 is a functional block diagram of the material verification module. The material verification module 2 includes a first sharing amount statistic unit 21, a first timing unit 22, a comparing unit 23, and a second sending unit 24. The first sharing amount statistic unit 21 counts the sharing amount of each material in the verification period. The first timing unit 22 times the verification of the material, and when the time (e.g., 30 minutes) is up, the sharing amount of the material within that time is obtained. The comparing unit 23 compares the sharing amount of each material in the verification period with the sharing threshold (for example, 4000). When the sharing amount of the material in the verification period is greater than or equal to the sharing threshold of 4000, the material passes verification. The second sending unit 24 supplements the verified material into the second set C.
Depending on the optimal processing capacity of the material verification module 2, in one embodiment, setting the number of materials in the first set B to 100 allows the material verification module 2 to work at its optimal processing efficiency; it is therefore desirable to keep 100 materials in the first set B at all times. After verified material is supplemented into the second set C, the number of materials in the first set B decreases, and material needs to be supplemented into the first set B in time. Thus, the material verification module 2 further includes a material supplementing unit 25, which sends a request to the material providing module 1 and includes the requested number of materials in the request. The requested number can be the number of materials supplemented into the second set C, that is, the number of materials sent out equals the number of materials to be supplemented, so that the number of materials in the first set B is always kept at the optimal processing amount of 100.
Fig. 6 is a schematic block diagram of the material competition module. The material competition module 3 includes a second sharing amount statistic unit 31, a calculating unit 32, a second sorting unit 33 and a third sending unit 34. When materials are added into the second set C, the second sharing amount statistic unit 31 counts the sharing amount and browsing times of each material in the second set C in the current time slice. The time slice is the time period over which the material competition module 3 needs to acquire data in order to sort the materials in the second set C. The calculating unit 32 calculates the ratio of the browsing times to the sharing amount of the material in the time slice to obtain the exposure rate of the material. The second sorting unit 33 sorts all the materials in the second set C by exposure rate. Since the second set C now holds more materials than its set number, e.g., 10, the third sending unit 34 sends the materials ranked below 10th back to the material queue A.
In order to keep the materials in the second set C stable, the 10 materials currently in the second set C need to be kept for a certain period of time. Therefore, the material competition module 3 further includes a second timing unit 35, configured to time how long the top 10 materials in the second set C have been kept. For example, timing starts after the second sorting unit 33 finishes sorting, and a material update request is sent to the material verification module 2 when a timing threshold, such as 5 minutes, is reached; correspondingly, the second sending unit 24 in the material verification module 2 supplements the verified material into the second set C. The number supplemented may be one or more, and may be set manually.
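The timed update of the second set C can be sketched as one function. This is a simplified illustration under assumptions: `update_second_set` and its parameters are hypothetical, both dictionaries map a material id to an exposure rate that is assumed to be already computed, and the elapsed time is passed in rather than measured by a real timer.

```python
from typing import Dict, List, Tuple


def update_second_set(second_set: Dict[str, float],
                      verified: Dict[str, float],
                      elapsed_s: float,
                      timing_threshold_s: float = 300.0,
                      keep: int = 10) -> Tuple[Dict[str, float], List[str]]:
    """Return the new second set C and the ids eliminated to queue A.

    Nothing changes until the top materials have been held for the timing threshold.
    """
    if elapsed_s < timing_threshold_s:
        return dict(second_set), []  # keep the current materials stable
    merged = {**second_set, **verified}  # verified material joins set C
    ranked = sorted(merged, key=merged.get, reverse=True)
    kept = {m: merged[m] for m in ranked[:keep]}
    return kept, ranked[keep:]


current = {"a": 5.0, "b": 3.0}
early_set, early_out = update_second_set(current, {"c": 4.0}, elapsed_s=120, keep=2)
late_set, late_out = update_second_set(current, {"c": 4.0}, elapsed_s=300, keep=2)
print(early_set, early_out)
print(late_set, late_out)
```

Before the 5-minute threshold the set is returned unchanged; once the threshold is reached, the verified material "c" enters, the set is re-ranked, and "b" is eliminated.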
In the above description, the terms "first ... unit" and "second ... unit" do not limit the number of units, but are used to distinguish units that perform the same function in different modules. Moreover, the embodiments in the foregoing description are provided for illustration only and do not limit the invention. Those skilled in the art can make various changes and modifications without departing from the scope of the invention; therefore, all equivalent technical solutions also fall within the scope of the disclosure of the invention.