Disclosure of Invention
The invention aims to provide an intelligent pushing system and method for information message content based on big data, so as to solve the problems in the background technology.
In order to solve the technical problems, the invention provides the following technical scheme: the intelligent pushing system comprises a user subscription library, a subscription monitoring module, an identification analysis module and an identification acquisition module, wherein the user subscription library is used for storing subscription contents of users, the subscription monitoring module monitors the contents of the user subscription library, when the presence of update of the subscription contents of the users is monitored, the identification analysis module analyzes the watching information of the subscription contents of the users to determine the identification information of the subscription contents according to the watching information of the subscription contents, the identification acquisition module acquires the identification information of the subscription contents, if the identification information of the subscription contents is a first identification, the updated contents of the subscription contents are pushed to the users, and if the identification information of the subscription contents is a second identification, the updated information of the subscription contents is generated and pushed to the users.
Further, the identification analysis module comprises a preliminary judgment index calculation module, a preliminary judgment index comparison module, a candidate duration analysis module, a reference duration acquisition module, a concern index calculation module and a concern index comparison module, wherein the preliminary judgment index calculation module sets a certain subscription content as a subscription to be analyzed, acquires the historical update content number Uz of the subscription to be analyzed and the update content number Us watched by the subscription to be analyzed, then the preliminary judgment index V=us/Uz of the subscription to be analyzed, the preliminary judgment index comparison module compares the preliminary judgment index of the subscription to be analyzed with a preliminary judgment threshold, if the preliminary judgment index of the subscription to be analyzed is smaller than or equal to the preliminary judgment threshold, then adds second identification information to the subscription to be analyzed, if the preliminary judgment index of the subscription to be analyzed is larger than the preliminary judgment threshold, the concern index calculation module acquires k update information of the subscription to be analyzed, which the user has recently watched, and calculates the concern index of the subscription to be analyzed according to the k update informationWherein H i is the time spent by the user to recently watch the ith updated information of the subscription to be analyzed, H is the sum of the time spent by the user to recently watch the k updated information of the subscription to be analyzed, D i is the reference time length in the process of the user to recently watch the ith updated information of the subscription to be analyzed, the candidate time length analysis module acquires the stay time length of a certain page of the user to watch the updated information to be analyzed when the user watches the certain updated information of the subscription to be analyzed, if the stay time length of a certain page is greater than or equal to a stay threshold value, the speed of page sliding is acquired when the page sliding speed is detected for the first time after the page stays, if the page sliding speed is smaller than the speed threshold value, the stay time length is the candidate time length, the reference time length acquisition module acquires all the candidate time lengths of the update information when the user to watch the certain updated information to be analyzed, the first candidate time length is selected to be the reference time length of the update information according to the sequence from big to small, the attention index comparison module compares the attention index of the subscription to be analyzed with the attention index of the attention index to be analyzed, and if the attention index is greater than the attention index of the first subscription to be analyzed and is added to the first subscription analysis threshold value.
Further, the intelligent push system further includes an association subscription selection module, the association subscription selection module includes an association index calculation module that obtains a situation that a user views user subscription content each time, a certain subscription content is set as a base subscription, subscription content except the base subscription in a user subscription library is set as a candidate subscription, and then an association index g=rs/Rz of the certain candidate subscription and the base subscription is set, wherein Rz is the number of times of historical viewing of the base subscription, rs is the number of times of historical viewing of the base subscription, and the number of times of candidate subscription is also watched in the number of times of historical viewing of the base subscription, and the association index comparison module sets that the candidate subscription is the association subscription of the base subscription when the association index of the certain candidate subscription and the base subscription is greater than an association threshold.
Further, the identification analysis module further comprises a sweep index calculation module and a sweep index comparison module, if the sweep index of the subscription to be analyzed is smaller than the attention threshold, the sweep index calculation module obtains the viewing condition of the associated subscription of the subscription to be analyzed, calculates the sweep index b=qs/Qz of the subscription to be analyzed, wherein Qz is the number of base subscriptions of the subscription to be analyzed, qs is the number of base subscriptions which are watched when the subscription information of each base subscription to be analyzed is updated last time, the sweep index comparison module compares the sweep index of the subscription to be analyzed with the sweep threshold, if the sweep index of the subscription to be analyzed is larger than the sweep threshold, adds the first identification information to the subscription to be analyzed, and if the sweep index of the subscription to be analyzed is smaller than or equal to the sweep threshold, adds the second identification information to the subscription to be analyzed.
An intelligent pushing method for information message content based on big data, comprising the following steps:
Establishing a user subscription library for storing subscription content of users,
Monitoring the content of the user subscription library, when the presence of update of the user subscription content is monitored, acquiring the identification information of the subscription content, wherein, analyzing the viewing information of the user on the subscription content to determine the identification information of the subscription content according to the viewing information,
If the identification information of the subscription content is the first identification, pushing the updated content of the subscription content to the user;
if the identification information of the subscription content is the second identification, generating update information of the subscription content and pushing the update information to the user.
Further, the analyzing the viewing information of the subscription content by the user to determine the identification information of the subscription content according to the viewing information comprises:
setting a certain subscription content as the subscription to be analyzed, obtaining the number Uz of historical updated content of the subscription to be analyzed and the number Us of the updated content watched by the user subscription to be analyzed, then the preliminary exponent V=us/Uz of the subscription to be analyzed,
If the preliminary judgment index of the subscription to be analyzed is smaller than or equal to the preliminary judgment threshold value, adding second identification information to the subscription to be analyzed,
If the initial judgment index of the subscription to be analyzed is larger than the initial judgment threshold value, k pieces of updated information of the subscription to be analyzed, which are recently watched by the user, are acquired, wherein when the user watches a certain piece of updated information of the subscription to be analyzed, the stay time length of the user on a certain page watching the updated information is acquired, if the stay time length of the certain page is larger than or equal to the stay threshold value, the speed of the page sliding when the page sliding is detected for the first time after the certain page stays is acquired, if the page sliding speed is smaller than the speed threshold value, the stay time length is the candidate time length, all the candidate time lengths of the user watching a certain piece of updated information of the subscription to be analyzed are acquired, all the candidate time lengths of the updated information are sequenced in the sequence from large to small, the first candidate time length is selected as the reference time length of the updated information,
Calculating attention index of subscription to be analyzedWherein H i is the time spent by the user recently viewing the ith updated information of the subscription to be analyzed, H is the sum of the time spent by the user recently viewing the k updated information of the subscription to be analyzed, D i is the reference time length during the process of the user recently viewing the ith updated information of the subscription to be analyzed,
And comparing the attention index of the subscription to be analyzed with an attention threshold, and adding first identification information to the subscription to be analyzed if the attention index of the subscription to be analyzed is greater than or equal to the attention threshold.
Further, the intelligent pushing method further comprises the following steps:
acquiring the condition that the user watches the user subscription content each time, setting a certain subscription content as a base subscription, setting the subscription content except the base subscription in the user subscription library as a candidate subscription,
Then the association index g=rs/Rz of a candidate subscription with the base subscription, where Rz is the number of historical viewing base subscriptions, rs is the number of historical viewing base subscriptions for which candidate subscriptions were also viewed,
If there is a candidate subscription with an association index of the base subscription that is greater than the association threshold, then the candidate subscription is the base subscription's association subscription.
Further, the comparing the attention index of the subscription to be analyzed with the attention threshold value further includes:
if the attention index of the subscription to be analyzed is smaller than the attention threshold value, acquiring the viewing condition of the associated subscription of the subscription to be analyzed,
Calculating the sweep index b=qs/Qz of the subscriptions to be analyzed, wherein Qz is the number of base subscriptions of the subscriptions to be analyzed, qs is the number of base subscriptions watched when each base subscription to be analyzed updates subscription information last time,
Comparing the sweep index of the subscription to be analyzed with a sweep threshold,
And if the sweep index of the subscription to be analyzed is larger than the sweep threshold value, adding first identification information to the subscription to be analyzed.
Further, comparing the sweep index of the subscription to be analyzed with the sweep threshold value further includes:
and if the sweep index of the subscription to be analyzed is smaller than or equal to the sweep threshold value, adding second identification information to the subscription to be analyzed.
Compared with the prior art, the invention has the following beneficial effects: when the user subscription content is monitored to be updated, whether the updated content of the subscription content is pushed to the user or the updated information of the subscription content is only generated to be pushed to the user is judged according to the condition that the user watches the updated information of the subscription content in a historical mode, so that the user is not required to watch and track the updated condition of the subscription content, and different modes of pushing different subscription contents are adopted, and the user pertinence is stronger.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, the present invention provides the following technical solutions: the intelligent pushing system comprises a user subscription library, a subscription monitoring module, an identification analysis module and an identification acquisition module, wherein the user subscription library is used for storing subscription contents of users, the subscription monitoring module monitors the contents of the user subscription library, when the presence of update of the subscription contents of the users is monitored, the identification analysis module analyzes the watching information of the subscription contents of the users to determine the identification information of the subscription contents according to the watching information of the subscription contents, the identification acquisition module acquires the identification information of the subscription contents, if the identification information of the subscription contents is a first identification, the updated contents of the subscription contents are pushed to the users, and if the identification information of the subscription contents is a second identification, the updated information of the subscription contents is generated and pushed to the users.
The identification analysis module comprises a preliminary index calculation module, a preliminary index comparison module, a candidate duration analysis module, a reference duration acquisition module, a concern index calculation module and a concern index comparison module, wherein the preliminary index calculation module sets a certain subscription content as a subscription to be analyzed, acquires the historical update content Uz of the subscription to be analyzed and the update content Us watched by the subscription to be analyzed, then the preliminary index V=us/Uz of the subscription to be analyzed, the preliminary index comparison module compares the preliminary index of the subscription to be analyzed with a preliminary threshold, if the preliminary index of the subscription to be analyzed is smaller than or equal to the preliminary threshold, then adds second identification information to the subscription to be analyzed, if the preliminary index of the subscription to be analyzed is larger than the preliminary threshold, the concern index calculation module acquires k update information of the subscription to be analyzed, which is recently watched by a user, and calculates the concern index of the subscription to be analyzed according to the update informationWherein H i is the time spent by the user to recently watch the ith updated information of the subscription to be analyzed, H is the sum of the time spent by the user to recently watch the k updated information of the subscription to be analyzed, D i is the reference time length in the process of the user to recently watch the ith updated information of the subscription to be analyzed, the candidate time length analysis module acquires the stay time length of a certain page of the user to watch the updated information to be analyzed when the user watches the certain updated information of the subscription to be analyzed, if the stay time length of a certain page is greater than or equal to a stay threshold value, the speed of page sliding is acquired when the page sliding speed is detected for the first time after the page stays, if the page sliding speed is smaller than the speed threshold value, the stay time length is the candidate time length, the reference time length acquisition module acquires all the candidate time lengths of the update information when the user to watch the certain updated information to be analyzed, the first candidate time length is selected to be the reference time length of the update information according to the sequence from big to small, the attention index comparison module compares the attention index of the subscription to be analyzed with the attention index of the attention index to be analyzed, and if the attention index is greater than the attention index of the first subscription to be analyzed and is added to the first subscription analysis threshold value.
The intelligent push system further comprises an associated subscription selection module, the associated subscription selection module comprises an associated index calculation module for acquiring the condition that a user watches user subscription content each time, a certain subscription content is taken as a base subscription, subscription content except the base subscription in a user subscription library is taken as a candidate subscription, then the associated index G=Rs/Rz of the certain candidate subscription and the base subscription is taken as the associated index G=Rs/Rz of the base subscription, wherein Rz is the number of times of historical watching the base subscription, rs is the number of times of historical watching the base subscription and also watches the candidate subscription, and the associated index comparison module is used for judging that the candidate subscription is the associated subscription of the base subscription when the associated index of the certain candidate subscription and the base subscription is larger than an associated threshold value.
The identification analysis module further comprises a sweep index calculation module and a sweep index comparison module, if the attention index of the subscription to be analyzed is smaller than the attention threshold, the sweep index calculation module obtains the watching condition of the associated subscription of the subscription to be analyzed, calculates the sweep index b=qs/Qz of the subscription to be analyzed, wherein Qz is the number of base subscriptions of the subscription to be analyzed, qs is the number of base subscriptions watched when the subscription information of each base subscription to be analyzed is updated last time, the sweep index comparison module compares the sweep index of the subscription to be analyzed with the sweep threshold, if the sweep index of the subscription to be analyzed is larger than the sweep threshold, adds first identification information to the subscription to be analyzed, and if the sweep index of the subscription to be analyzed is smaller than or equal to the sweep threshold, adds second identification information to the subscription to be analyzed.
An intelligent pushing method for information message content based on big data, comprising the following steps:
Establishing a user subscription library for storing subscription content of users,
Monitoring the content of the user subscription library, when the presence of update of the user subscription content is monitored, acquiring the identification information of the subscription content, wherein, analyzing the viewing information of the user on the subscription content to determine the identification information of the subscription content according to the viewing information,
Analyzing the viewing information of the user on the subscribed content to determine the identification information of the subscribed content according to the viewing information comprises:
Setting a certain subscription content as a subscription to be analyzed, acquiring the number Uz of historical updated content of the subscription to be analyzed and the number Us of updated content watched by the user subscription to be analyzed, wherein the initial judgment index V=us/Uz of the subscription to be analyzed is shown as a blog in a user subscription library, if Us is record of blogs in the blog watched by the user, uz is total record of blogs in the blog, or in practice Uz can also be selected as total record of blogs updated by storing the blogs in the user subscription library from the user;
If the preliminary judgment index of the subscription to be analyzed is smaller than or equal to the preliminary judgment threshold value, adding second identification information to the subscription to be analyzed, when the preliminary judgment index of the subscription to be analyzed is smaller, indicating that the user has not seen the updated content of the subscription to be analyzed for a long time, having no interest in the updated content of the subscription to be analyzed,
If the preliminary index of the subscription to be analyzed is greater than the preliminary threshold, it indicates that the user may be interested in the updated content of the subscription to be analyzed, or may simply go through, sweep at a glance,
Obtaining k pieces of updated information of a subscription to be analyzed which is recently watched by a user, wherein when the user watches a certain piece of updated information of the subscription to be analyzed, obtaining the stay time of the user on a certain page watching the updated information, if the stay time length of the certain page is larger than or equal to a stay threshold value, obtaining the speed of page sliding when the page sliding is detected for the first time after the certain page stays, if the speed of page sliding is smaller than the speed threshold value, obtaining all the candidate time lengths of the user watching the certain piece of updated information of the subscription to be analyzed, sorting all the candidate time lengths of the updated information according to the sequence from large to small, selecting the first candidate time length as the reference time length of the updated information,
When the user views the updated information of the subscription to be analyzed, the user does not need to carefully watch the updated information in each place, the situation that the user has places to watch the updated information in a fine manner and has places to watch the updated information in a rough manner can occur, and when the user views the updated information of the subscription to be analyzed in a fine manner, the user is more interested in the subscription to be analyzed; according to the method, the device and the system, whether the user wants to see or is interested in the content is always known in the watching process, the user can slowly slide the page to look down in the interested condition, the user can directly and quickly slide the page to find the place of interest in the next place in the uninteresting condition, the candidate duration is selected based on the place, and when the candidate duration is longer, the user is more interested in updating information to be analyzed and subscribed;
calculating attention index of subscription to be analyzed Wherein H i is the time spent by the user recently viewing the ith updated information of the subscription to be analyzed, H is the sum of the time spent by the user recently viewing the k updated information of the subscription to be analyzed, D i is the reference time length during the process of the user recently viewing the ith updated information of the subscription to be analyzed,
Comparing the attention index of the subscription to be analyzed with an attention threshold, and adding first identification information to the subscription to be analyzed if the attention index of the subscription to be analyzed is greater than or equal to the attention threshold, and when the attention index of the subscription to be analyzed is greater, the updated content is carefully watched by a user in the process of watching the updated content of the subscription to be analyzed;
if the attention index of the subscription to be analyzed is smaller than the attention threshold value, acquiring the viewing condition of the associated subscription of the subscription to be analyzed,
Calculating the sweep index b=qs/Qz of the subscriptions to be analyzed, wherein Qz is the number of base subscriptions of the subscriptions to be analyzed, qs is the number of base subscriptions watched when each base subscription to be analyzed updates subscription information last time,
Comparing the sweep index of the subscription to be analyzed with a sweep threshold,
If the spreading index of the subscription to be analyzed is larger than the spreading threshold, when the spreading index of the subscription to be analyzed is larger, the tendency that the user is possibly interested in the subscription is indicated, and first identification information is added to the subscription to be analyzed;
if the sweep index of the subscription to be analyzed is smaller than or equal to the sweep threshold, when the sweep index of the subscription to be analyzed is smaller, the user is less interested in the subscription to be analyzed and the associated subscription, and second identification information is added to the subscription to be analyzed.
The intelligent pushing method further comprises the following steps:
acquiring the condition that the user watches the user subscription content each time, setting a certain subscription content as a base subscription, setting the subscription content except the base subscription in the user subscription library as a candidate subscription,
Then the association index g=rs/Rz of a candidate subscription with the base subscription, where Rz is the number of historical viewing base subscriptions, rs is the number of historical viewing base subscriptions for which candidate subscriptions were also viewed,
If there is a candidate subscription with an association index of the base subscription that is greater than the association threshold, then the candidate subscription is the base subscription's association subscription.
If the identification information of the subscription content is the first identification, the user is stated to pay more attention to the subscription content, so that the updated content of the subscription content is pushed to the user, and the user can conveniently watch the updated information of the subscription content;
If the identification information of the subscription content is the second identification, the user is not interested in watching the subscription content, so that only updated information of the subscription content is generated to push to the user, and the user is reminded. For example, when a user pays attention to a blog, the blogger updates an article on a certain day, if the identification of the blog in the user subscription database is a first identification, the updated article is directly sent to the user, and if the identification of the blog in the user subscription database is a second identification, the information of the updated article of the blog is directly sent to the user.
It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Finally, it should be noted that: the foregoing description is only a preferred embodiment of the present invention, and the present invention is not limited thereto, but it is to be understood that modifications and equivalents of some of the technical features described in the foregoing embodiments may be made by those skilled in the art, although the present invention has been described in detail with reference to the foregoing embodiments. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.