CN107222787A - Video resource popularity prediction method - Google Patents

Video resource popularity prediction method Download PDF

Info

Publication number
CN107222787A
CN107222787A CN201710409198.2A CN201710409198A CN107222787A CN 107222787 A CN107222787 A CN 107222787A CN 201710409198 A CN201710409198 A CN 201710409198A CN 107222787 A CN107222787 A CN 107222787A
Authority
CN
China
Prior art keywords
mrow
msub
data
rating
mover
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710409198.2A
Other languages
Chinese (zh)
Inventor
王子磊
朱策
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology of China USTC filed Critical University of Science and Technology of China USTC
Priority to CN201710409198.2A priority Critical patent/CN107222787A/en
Publication of CN107222787A publication Critical patent/CN107222787A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4667Processing of monitored end-user data, e.g. trend analysis based on the log file of viewer selections
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/738Presentation of query results
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4532Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of video resource Popularity prediction method, including:The viewing-data of certain area in-group user is counted, the rating categorical data and interbehavior data of group of subscribers is obtained, and utilizes the resource popularity of rating categorical data statistics to calculate;Utilize the LDA models for coupling user behavior, traversal rating categorical data and interbehavior data generate correspondence Di Li Cray distributions respectively, the full probability of each behavior pattern is derived by chain rule and the expectation of its Di Li Cray distribution is asked for, behavior pattern matrix is obtained;With reference to neural network model, the resource popularity of institute's statistics and behavior pattern matrix are inputted as neutral net, by training generation forecast model, and then to predict the video resource popularity in future.This method considers the influence that user watched interactive contents data and interbehavior data are predicted resource popularity, and the relation between two class data of research and popularity improves the accuracy predicted resource popularity.

Description

Video resource Popularity prediction method
Technical field
The present invention relates to data mining and machine learning techniques field, a kind of video resource Popularity prediction side is especially designed Method.
Background technology
Video request program is a kind of based on user watched dynamic need, and the service content for transmitting and playing corresponding demand is regarded Frequency play-back technology.User has the initiative to data flow in the selection of rating content and viewing mode, if can ring in time Efficiency using family viewing behavior demand and scheduling of resource is closely related.During scheduling of resource, the popularity of resource information It is the important references standard of dispatching algorithm.
Separately below related research work is introduced from user behavior analysis and resource popularity forecasting research respectively.
In the existing research on user behavior analysis, the definition to user behavior is broadly divided into two classes:What one class referred to It is interactive user operations of the user during program request rating, described interactive operation and the particular problem of research has closely The system of " user behavior analysis device and realization in IPTV system " analysis application of this network technology company of relation, such as Saite, How main research is safeguarded and be run to equipment control model with reference to user behavior, and different user data informations is counted Service provider is fed back to after arrangement, user behavior therein generally refers to user to live, program request, the use feelings reviewed Condition, including viewing duration and viewing number of times etc., user behavior therein are fixed under its company oneself equipment management system framework Justice is simultaneously counted, and the difference between viewing mode (live, program request) is confined to the definition of viewing behavior, without it is real deeply To in demand mode in different user mutual behaviors;Another kind of user behavior refers to the content of user's request, to ask Content represent user behavior and user's request content handled and excavated, such as Wang Pan patent " is regarded based on full-service The IPTV user behavior analysis method of map analysis " matches the customer group square for being adapted to preference according to service attribute centered on business Battle array, user behavior therein generally refers to demand of the user to IPTV value-added services, including information browse, game, visual electricity The value-added services such as words, image space, think to its study limitation to user behavior that use of the user to different business is represented not Same user behavior, is not still deep into the essence of user mutual behavior, it is impossible to react user exactly to rating content Demand.
For resource popularity prediction, " a kind of Resource service system and its resource allocation methods " of CHINAUNICOM mainly will Queued tasks in operation system carry out Clustering according to weight properties, and resource point is carried out to task according to priority marking Match somebody with somebody;Four reach " Proxy Caching for Streaming Media replacement method and the device " of epoch communication network technique Co., Ltd mainly according to principal component The method of analysis and multiple linear regression determines the Popularity prediction value of Streaming Media Object, is determined with reference to the signal to noise ratio of video peak value The comprehensive value of Streaming Media Object is to carry out caching replacement;Zhang Tiankui etc. is " in a kind of information predicted based on content popularit Proposed in heart network-caching method " and clustered each nodes records by similarity analysis, the content in calculate node Popularity simultaneously does cycle statistics, is predicted further according to prediction algorithm pop degree, and carrying out contrast decision with local packets is No renewal nodal cache;Often mark etc. in " a kind of online serialized content Popularity prediction method based on autoregression model " The prediction of online serialized content popularity is solved again, and by crawling the overall playback volume trend of online serialized content, parsing is overall The html source code of the playback volume trend page, utilizes the popularity of the new serialized content of autoregressive model prediction.Come from such scheme See, the prediction of popularity and user mutual behavior are not carried out Conjoint Analysis by existing research so that the forecasting research of popularity The data supporting of user behavior angle is lacked, Popularity prediction precision can not obtain Reliable guarantee.
The content of the invention
It is an object of the invention to provide a kind of video resource Popularity prediction method, user watched interaction content is considered The influence that data and interbehavior data are predicted resource popularity, the relation between two class data of research and popularity is improved The accuracy predicted resource popularity.
The purpose of the present invention is achieved through the following technical solutions:
A kind of video resource Popularity prediction method, including:
The viewing-data of certain area in-group user is counted, the rating categorical data and interbehavior of group of subscribers is obtained Data, and utilize the resource popularity of rating categorical data statistics to calculate;
Using the document subject matter generation model LDA for coupling user behavior, traversal rating categorical data and interbehavior data Correspondence Di Li Cray distributions are generated respectively, and the full probability of each behavior pattern is derived by chain rule and its Di Li Cray is asked for The expectation of distribution, obtains behavior pattern matrix;
It is with reference to neural network model, the resource popularity of institute's statistics and behavior pattern matrix is defeated as neutral net Enter, by training generation forecast model, the video resource popularity in future is predicted according to the forecast model of training.
The viewing-data of the statistics certain area in-group user, obtains rating categorical data and the interaction of group of subscribers Behavioral data includes:
The viewing daily record of all group of subscribers in area is counted, by dividing period and program category, colony is obtained The rating categorical data and the data of interbehavior of different time sections in the certain number of days of user;
Wherein, the division period refers to divide the time of one day, if being divided into 24 periods by one day, often One period was a hour;
Interbehavior refers to broadcast state of the user in demand (telecommunication) service, has 10 kinds:Program collection, first startup are broadcast Put, be again started up broadcasting, F.F. state, fast-rewinding state, halted state, positioning playing, play failure, exit to play and broadcast with memory Put.
The resource popularity using rating categorical data statistics to calculate includes:
By rating categorical data set C={ ci| i=1,2, K } represent, the element c in set CiBy rating Type number, ciI class rating types are represented, accounting of its time being accessed by the user in total rating duration is pi, then piIt is ci Popularity in set C, and P={ pi| i=1,2, K be set C resource popularity set;Wherein, K is receipts Depending on the total quantity of type.
It is described to utilize the document subject matter generation model LDA for coupling user behavior, traversal rating categorical data and interbehavior Data generate correspondence Di Li Cray distributions respectively, derive the full probability of each behavior pattern by chain rule and ask for its Di Li The expectation of Cray distribution, obtaining behavior pattern matrix includes:
Assuming that there is K kind rating types V in rating categorical datawThere is L kind behavior types V in portion's program, interbehavior datal Individual interbehavior;
To there is the interbehavior of a certain type in the program of a certain rating type, then referred to as behavior pattern;M-th of document The behavior pattern multinomial distribution of middle userIt is K × L dimension matrix, behavior pattern distributionIn z-th positional representation Behavior pattern is user to rating type zmn1Program take zmn2Plant interbehavior;Wherein,
zmn2=(zmnmod K)
Using the LDA models for coupling user behavior, traversal rating categorical data and interbehavior data generate correspondence respectively Di Li Crays are distributed, and its process is as follows:
Order: Then for each viewing-data n, n ∈ { 1 ..., Nm};Wherein, NmFor viewing-data quantity in m-th of document,For kth kind The program multinomial distribution of rating type,For the interbehavior multinomial distribution of l kind behavior types, α is that behavior pattern is more Item formula distributionDirichlet Study firsts, β be program multinomial distributionDirichlet Study firsts, γ for interaction Behavior multinomial distributionDirichlet Study firsts;
Calculate the distribution of Di Li Crays Θ, Φ, Ψ:
Wherein,Behavior pattern, program category, the quantity of behavior type for participating in calculating are represented respectively;Respectively program, interbehavior and behavior pattern set;Represent the set of behavior pattern in m-th of document, Φ For program profile multinomialIn all program probability of happening Di Li Crays distribution, Ψ be interbehavior multinomial distributionIn The Di Li Crays distribution of all interbehavior probability of happening, Θ is behavior pattern multinomial distributionIn all behavior patterns occur The Di Li Crays distribution of probability;
Θ, Φ, Ψ are distributed based on Di Li Crays, and derive by chain rule the full probability of each behavior pattern:
Wherein,Respectively program, interbehavior and behavior pattern set, these three set inner elements wmn、 tmn、zmnRepresent respectively the program of the n-th viewing-data in m-th of document, the interbehavior of the n-th viewing-data in m-th of document, The behavior pattern of n-th viewing-data in m-th of document;Represent behavior pattern setMiddle removal zmnSet afterwards;
The full probability for being based ultimately upon Di Li Crays distribution Θ, Φ, Ψ and each behavior pattern solves the distribution of Di Li Crays Expectation, be derived by equation below:
Wherein,User is represented to V in kth kind rating typewThe viewing number of times of portion's program,Distribution is represented in l Plant the V of behavior typelThe number of times of interbehavior,Represent the frequency of i-th of behavior pattern in m-th of document;θmiRespectively single behavior pattern, program, the probability of happening of interbehavior.
The combination neural network model, regard the resource popularity of institute's statistics and behavior pattern matrix as nerve net Network is inputted, and by training generation forecast model, the video resource popularity in future is predicted according to the forecast model of training to be included:
The Nonlinear Mapping to behavior pattern matrix and the resource popularity of statistics is realized using BP neural network, it is defeated Enter layer and output layer neuron number is relevant with input/output argument, the rating type of the behavior pattern matrix program of input has K Kind, the video resource popularity in future is output as, then input layer and output layer node number are respectively K and 1, node in hidden layer It is set to S;
By the behavior vector of the program of different rating types in the resource popularity of statistics, and behavior pattern matrix Read in, then training data and test data are divided into by data are read in;
Start to initialize BP neural network, training method uses steepest descent method, and batch mode trains behavioral pattern data, Then sample is inputted into BP neural network by the way of batch is trained, calculates the error of each sample;Finally judge whether to receive Hold back, if not restraining, weights are adjusted according to steepest descent method, until convergence, so as to obtain forecast model;
The test data of delimitation is inputted into BP neural network, predicts that the video resource in future is popular using forecast model Degree.
As seen from the above technical solution provided by the invention, colony is carried out by program category to user data in area Analysis, effectively weakens the influence that single film breaks out pop degree precision of prediction in short term;By the LDA moulds for coupling user behavior Type carries out Conjoint Analysis to rating content and interbehavior, it is to avoid loss of learning caused by the single data of research, more accurately Ground finds group of subscribers behavior pattern;Using neural network model, according to the behavior pattern Matrix prediction for having merged behavioral data Video resource popularity, improves the Nonlinear Processing ability of prediction, so as to reduce predicated error;In addition, the present invention is provided Method be applied in the scheduling of resource of Cloud Server, can effectively improve request receptance, lift Consumer's Experience.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, being used required in being described below to embodiment Accompanying drawing be briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention, for this For the those of ordinary skill in field, on the premise of not paying creative work, other can also be obtained according to these accompanying drawings Accompanying drawing.
Fig. 1 is a kind of flow chart of video resource Popularity prediction method provided in an embodiment of the present invention;
Fig. 2 is the flow chart that behavior pattern provided in an embodiment of the present invention is found;
Fig. 3 is the schematic diagram of behavior pattern matrix provided in an embodiment of the present invention;
Fig. 4 is the flow chart of the video resource Popularity prediction of fusion user behavior provided in an embodiment of the present invention.
Embodiment
With reference to the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Ground is described, it is clear that described embodiment is only a part of embodiment of the invention, rather than whole embodiments.Based on this The embodiment of invention, the every other implementation that those of ordinary skill in the art are obtained under the premise of creative work is not made Example, belongs to protection scope of the present invention.
The embodiment of the present invention provides a kind of video resource Popularity prediction method, and this method is to user group's data from resource Type dimension is handled, and generates user group's viewing-data;It is not independent handle in rating perhaps for viewing-data analysis Person's interbehavior, but two class data of fusion derive its joint probability distribution, so as to accurately describe the internal relation of two class data; In Forecasting Methodology, joint probability Input matrix to neural network model is predicted to video resource popularity, set up between the two Accurate mapping relations.
As described in Figure 1, it is a kind of flow chart of video resource Popularity prediction method.First, group in statistics certain area The viewing-data of body user, obtains the rating categorical data and interbehavior data of group of subscribers, and utilizes rating categorical data The resource popularity of statistics to calculate;Then, using the document subject matter generation model LDA for coupling user behavior, traversal Rating categorical data and interbehavior data generate correspondence Di Li Cray distributions respectively, and each behavior mould is derived by chain rule The full probability of formula and the expectation for asking for the distribution of its Di Li Cray, obtain behavior pattern matrix;Finally, with reference to neural network model, The resource popularity of institute's statistics and behavior pattern matrix are inputted as neutral net, by training generation forecast model, The video resource popularity in future is predicted according to the forecast model of training.This method considers user watched interaction content number According to the influence predicted with interbehavior data resource popularity, the relation between two class data of research and popularity, raising pair The accuracy of resource popularity prediction, and then it is effectively improved the resource deployment efficiency of Streaming Media Cloud Server, lifting request access With the service quality of response.
It is described in detail below for each step.
First, group of subscribers data processing.
In the embodiment of the present invention, the purpose of group of subscribers data processing is, by counting all group of subscribers in area Viewing daily record, then by dividing period and program category, different time sections in the certain number of days of group of subscribers can be obtained The data of rating categorical data and interbehavior.Meanwhile, the behavior mould of individual user can effectively be ignored to population data processing Formula makes a variation, and behavior pattern variation refers to certain user because with special behavior pattern, such as a F.F. or only sees certain class Program.Can be by the influence reduction of these uncertain variation pop degree predictions to minimum from colony's angle processing.
In the embodiment of the present invention, the division period refers to divide the time of one day, if being divided into 24 by one day Period, then each period is a hour, then the data set sample of 25 days can just be divided into 600 period collection Close.
Rating type can be divided into 15 types, respectively news, finance and economics, variety, physical culture, film, animation, military affairs, TV play, science and education, life, fashion tourism, child-parent education, music, old man's program and juvenile's program.
Interbehavior refers to broadcast state of the user in demand (telecommunication) service, has 10 kinds:Program collection, first startup are broadcast Put, be again started up broadcasting, F.F. state, fast-rewinding state, halted state, positioning playing, play failure, exit to play and broadcast with memory Put.
In addition, the embodiment of the present invention is also using the resource popularity of rating categorical data statistics to calculate, as One input of neutral net.Specifically:By rating categorical data set C={ ci| i=1,2, K } represent, Element c in set CiBy rating type number, ciI class rating types are represented, its time being accessed by the user is in total rating duration In accounting be pi, then piIt is ciPopularity in set C, and P={ pi| i=1,2, K be set C resource Popularity set;Wherein, K is the total quantity of rating type.
2nd, behavior pattern is found.
LDA models can obtain a data and concentrate complicated theme as classical body of text generation model Set, under broadcasting and TV data research background, the present invention is ground using the LDA models of coupling user behavior to group of subscribers data Study carefully.As shown in Fig. 2 choosing the viewing-data in user's one period of past, type demarcation is carried out to each program, program Type is as the theme of document generation, and LDA models can accurately generate the set of different program categories;Different program classes The viewing behavior pattern of user is different in type, and therefore, behavior pattern is distributedIntroduce LDA models to be coupled, by row User watched program category is finely divided for pattern, can be so in the rating to group of subscribers in finer granularity Appearance is analysed in depth, and obtains the Joint Distribution of behavior-program, behavior pattern matrix can be obtained in conjunction with subsequent treatment.
The LDA models of user behavior are coupled as the extension of LDA models, Conjoint Analysis can be carried out to behavioral agent.Knot Close Fig. 3, it is assumed that have K kind rating types V in rating categorical datawPortion's program, then the Joint Distribution Φ ' of type-program is K × Vw Tie up matrix,Represent that K kind ratings type includes VwThe probability of portion's program;Assuming that there is L kind behavior classes in interbehavior data Type VlIndividual interbehavior, then the Joint Distribution Ψ ' of type-behavior is L × VlTie up matrix,Represent in l kind behavior types VlThe probability of individual interbehavior;
In the embodiment of the present invention, there is the interbehavior of a certain type for the program of a certain rating type, referred to as go For pattern;The behavior pattern multinomial distribution of user in m-th of documentIt is K × L dimension matrix, behavior pattern distributionIn The behavior pattern of z-th of positional representation is user to rating type zmn1Program take zmn2Plant interbehavior;Wherein,
zmn2=(zmnmod K)
Using the LDA models for coupling user behavior, traversal rating categorical data and interbehavior data generate correspondence respectively Di Li Crays are distributed, and its process is as follows:
Order: Then for each viewing-data n (including timestamp, programm name, behavior type in rating record), n ∈ { 1 ..., Nm}; Wherein, NmFor viewing-data quantity in m-th of document,For the program multinomial distribution of kth kind rating type,For l kinds The interbehavior multinomial distribution of behavior type, α is behavior pattern multinomial distributionDirichlet Study firsts, β for section Mesh multinomial distributionDirichlet Study firsts, γ be interbehavior multinomial distributionDirichlet Study firsts; These multinomial distributions are made up of multiple elements, for example, program multinomial distributionK-th of element beTwo other many item number The implication of distributed constant is similar.
Calculate Di Li Crays distribution Θ:
In above formula,Represent to consider Study first and behavior time in Di Li Crays distribution pilot process is calculated Several probability.
The distribution of Di Li Crays Φ, Ψ can similarly be obtained:
Wherein,The behavior pattern, rating type, the quantity of behavior type for participating in calculating are represented respectively;Respectively program, interbehavior and behavior pattern set;Represent the set of behavior pattern in m-th of document, Φ For program profile multinomialIn all program probability of happening Di Li Crays distribution, Ψ be interbehavior multinomial distributionIn The Di Li Crays distribution of all interbehavior probability of happening, Θ is behavior pattern multinomial distributionIn all behavior patterns occur The Di Li Crays distribution of probability.
Θ, Φ, Ψ are distributed based on Di Li Crays, and derive by chain rule the full probability of each behavior pattern:
Wherein,Three set inner element wmn、tmn、zmnNth bar rating number in m-th of document is represented respectively According to program, in m-th of document in the interbehavior of nth bar viewing-data, m-th of document nth bar viewing-data behavior mould Formula;Represent behavior pattern setMiddle removal zmnSet afterwards;
The implication of above formula refers to Study first (i.e. α, β, γ) and program, behavior and the behavior being distributed according to existing three class The set of pattern is (i.e.) ask for the full probability that each behavior pattern occurs.The calculating of molecule and denominator on the right of formula It is respectively necessary for using Φ, Ψ, Θ in journey, that is,
In addition,Signal table it is as shown in table 1.
Table 1Signal table
The first row first row θ in upper table11To last row of last column θMIThe probability of happening combination of all behavior patterns ConstituteNamely a behavior pattern matrix.The probability of happening of any one behavior pattern therein is designated as θmi, it is also final herein Want the result asked for.
The full probability for being based ultimately upon Di Li Crays distribution Θ, Φ, Ψ and each behavior pattern solves the distribution of Di Li Crays Expectation, be derived by equation below:
Wherein,User is represented to V in kth kind rating typewThe viewing number of times of portion's program,Distribution is represented in l Plant the V of behavior typelThe number of times of interbehavior,Represent the frequency of i-th of behavior pattern in m-th of document;θmiRespectively single behavior pattern, program, the probability of happening of interbehavior.In the embodiment of the present invention, by Di Li The expectation of Cray distribution is solved, and obtains θmiIt is therein, θmiNamely represent different elements in behavior pattern matrix Probability of happening, be our output.
In the embodiment of the present invention, the implication explanation of parameters is as shown in table 2.
The cLDA Model Parameters of the fusion user behavior of table 2 illustrate table
3rd, the video resource Popularity prediction of user behavior is merged
The video resource Popularity prediction of fusion user behavior refers to, is predicted, will merged using neural network model The behavior pattern Input matrix models of user watched data and interbehavior, makes each behavior pattern matrix be mapped to one Popularity set, according to model prediction future video resource popularity.
In the embodiment of the present invention, realized using BP neural network to behavior pattern matrix and the resource popularity of statistics Nonlinear Mapping, input layer and output layer neuron number be relevant with input/output argument, the behavior pattern matrix section of input Purpose rating type has K kinds, is output as the video resource popularity in future, then input layer and output layer node number are respectively K With 1, node in hidden layer is set to S.
As shown in figure 4, by the program of different rating type in the resource popularity of statistics, and behavior pattern matrix Behavior vector read in, then by read in data be divided into training data and test data;Start to initialize BP neural network, training Method uses steepest descent method, and then batch mode training behavioral pattern data inputs sample by the way of batch is trained BP neural network, calculates the error of each sample, exemplary, and the error margin that can define popularity set is 0.1;Finally Judge whether convergence, if not restraining, weights are adjusted according to steepest descent method, until convergence, so as to obtain forecast model;It will delimit Test data input BP neural network, predicted using forecast model future video resource popularity.Finally, it will measure in advance To video resource popularity and the obtained video popularity of statistics apply respectively on Cloud Server, it is recognised that of the invention There is significant actively impact to performance boost.
Such scheme of the embodiment of the present invention, population analysis, effectively reduction are carried out to user data in area by program category Single film breaks out the influence of pop degree precision of prediction in short term;By coupling the LDA models of user behavior to rating content Conjoint Analysis is carried out with interbehavior, it is to avoid loss of learning caused by research single data, more accurately find that colony uses Family behavior pattern;It is popular according to the behavior pattern Matrix prediction video resource for having merged behavioral data using neural network model Degree, improves the Nonlinear Processing ability of prediction, so as to reduce predicated error;In addition, the method that the present invention is provided is applied to In the scheduling of resource of Cloud Server, request receptance can be effectively improved, Consumer's Experience is lifted.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment can To be realized by software, the mode of necessary general hardware platform can also be added to realize by software.Understood based on such, The technical scheme of above-described embodiment can be embodied in the form of software product, the software product can be stored in one it is non-easily The property lost storage medium (can be CD-ROM, USB flash disk, mobile hard disk etc.) in, including some instructions are to cause a computer to set Standby (can be personal computer, server, or network equipment etc.) performs the method described in each embodiment of the invention.
The foregoing is only a preferred embodiment of the present invention, but protection scope of the present invention be not limited thereto, Any one skilled in the art is in the technical scope of present disclosure, the change or replacement that can be readily occurred in, It should all be included within the scope of the present invention.Therefore, protection scope of the present invention should be with the protection model of claims Enclose and be defined.

Claims (5)

1. a kind of video resource Popularity prediction method, it is characterised in that including:
The viewing-data of certain area in-group user is counted, the rating categorical data and interbehavior number of group of subscribers is obtained According to, and utilize the resource popularity of rating categorical data statistics to calculate;
Using the document subject matter generation model LDA for coupling user behavior, traversal rating categorical data and interbehavior data difference Generation correspondence Di Li Cray distributions, derive the full probability of each behavior pattern by chain rule and ask for the distribution of its Di Li Cray Expectation, obtain behavior pattern matrix;
With reference to neural network model, the resource popularity of institute's statistics and behavior pattern matrix are inputted as neutral net, By training generation forecast model, the video resource popularity in future is predicted according to the forecast model of training.
2. a kind of video resource Popularity prediction method according to claim 1, it is characterised in that the certain area of statistics The viewing-data of domain in-group user, obtaining the rating categorical data and interbehavior data of group of subscribers includes:
The viewing daily record of all group of subscribers in area is counted, by dividing period and program category, group of subscribers is obtained The rating categorical data and the data of interbehavior of different time sections in certain number of days;
Wherein, the division period refers to divide the time of one day, if being divided into 24 periods by one day, per for the moment Between section be a hour;
Interbehavior refers to broadcast state of the user in demand (telecommunication) service, has 10 kinds:Program collection, first startup are played, again Secondary startup broadcasting, F.F. state, fast-rewinding state, halted state, positioning playing, broadcasting fail, exit broadcasting and memory broadcasting.
3. a kind of video resource Popularity prediction method according to claim 1, it is characterised in that the utilization rating class The resource popularity of type data statistics to calculate includes:
By rating categorical data set C={ ci| i=1,2 ..., K } represent, the element c in set CiBy rating type number, ciI class rating types are represented, accounting of its time being accessed by the user in total rating duration is pi, then piIt is ciIn set C Popularity, and P={ pi| i=1,2 ..., K be set C resource popularity set;Wherein, K is the sum of rating type Amount.
4. a kind of video resource Popularity prediction method according to claim 1, it is characterised in that described to be used using coupling The document subject matter generation model LDA of family behavior, traversal rating categorical data and interbehavior data generate correspondence Di Like respectively Thunder is distributed, and derives the full probability of each behavior pattern by chain rule and asks for the expectation of its Di Li Cray distribution, OK Include for mode matrix:
Assuming that there is K kind rating types V in rating categorical datawThere is L kind behavior types V in portion's program, interbehavior datalIt is individual to hand over Mutual behavior;
To there is the interbehavior of a certain type in the program of a certain rating type, then referred to as behavior pattern;Used in m-th of document The behavior pattern multinomial distribution at familyIt is K × L dimension matrix, behavior pattern distributionIn z-th of positional representation behavior Pattern is user to rating type zmn1Program take zmn2Plant interbehavior;Wherein,
<mrow> <msub> <mi>z</mi> <mrow> <mi>m</mi> <mi>n</mi> <mn>1</mn> </mrow> </msub> <mo>=</mo> <mrow> <mo>(</mo> <mfrac> <mrow> <msub> <mi>z</mi> <mrow> <mi>m</mi> <mi>n</mi> </mrow> </msub> <mo>-</mo> <mn>1</mn> </mrow> <mi>L</mi> </mfrac> <mo>+</mo> <mn>1</mn> <mo>)</mo> </mrow> <mo>;</mo> </mrow>
zmn2=(zmnmod K)
Using the LDA models for coupling user behavior, traversal rating categorical data and interbehavior data generate correspondence Di Li respectively Cray is distributed, and its process is as follows:
Order:Then For each viewing-data n, n ∈ { 1 ..., Nm};Wherein, NmFor viewing-data quantity in m-th of document,Received for kth kind Depending on the program multinomial distribution of type,For the interbehavior multinomial distribution of l kind behavior types, α is that behavior pattern is multinomial Formula is distributedDirichlet Study firsts, β be program multinomial distributionDirichlet Study firsts, γ be interaction row For multinomial distributionDirichlet Study firsts;
Calculate the distribution of Di Li Crays Θ, Φ, Ψ:
<mrow> <mi>P</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;theta;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>m</mi> </msub> <mo>|</mo> <msub> <mover> <mi>z</mi> <mo>&amp;RightArrow;</mo> </mover> <mrow> <mi>m</mi> <mo>,</mo> </mrow> </msub> <mi>&amp;alpha;</mi> <mo>)</mo> </mrow> <mo>=</mo> <mi>D</mi> <mi>i</mi> <mi>r</mi> <mi>i</mi> <mi>c</mi> <mi>h</mi> <mi>l</mi> <mi>e</mi> <mi>t</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;theta;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>m</mi> </msub> <mo>|</mo> <msub> <mover> <mi>n</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>m</mi> </msub> <mo>+</mo> <mi>&amp;alpha;</mi> <mo>)</mo> </mrow> <mo>;</mo> </mrow>
<mrow> <mi>P</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;phi;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>k</mi> </msub> <mo>|</mo> <mover> <mi>z</mi> <mo>&amp;RightArrow;</mo> </mover> <mo>,</mo> <mover> <mi>w</mi> <mo>&amp;RightArrow;</mo> </mover> <mo>,</mo> <mi>&amp;beta;</mi> <mo>)</mo> </mrow> <mo>=</mo> <mi>D</mi> <mi>i</mi> <mi>r</mi> <mi>i</mi> <mi>c</mi> <mi>h</mi> <mi>l</mi> <mi>e</mi> <mi>t</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;phi;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>k</mi> </msub> <mo>|</mo> <msub> <mover> <mi>n</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>k</mi> </msub> <mo>+</mo> <mi>&amp;beta;</mi> <mo>)</mo> </mrow> <mo>;</mo> </mrow>
<mrow> <mi>P</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;psi;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>l</mi> </msub> <mo>|</mo> <mover> <mi>z</mi> <mo>&amp;RightArrow;</mo> </mover> <mo>,</mo> <mover> <mi>t</mi> <mo>&amp;RightArrow;</mo> </mover> <mo>,</mo> <mi>&amp;gamma;</mi> <mo>)</mo> </mrow> <mo>=</mo> <mi>D</mi> <mi>i</mi> <mi>r</mi> <mi>i</mi> <mi>c</mi> <mi>h</mi> <mi>l</mi> <mi>e</mi> <mi>t</mi> <mrow> <mo>(</mo> <msub> <mover> <mi>&amp;psi;</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>l</mi> </msub> <mo>|</mo> <msub> <mover> <mi>n</mi> <mo>&amp;RightArrow;</mo> </mover> <mi>l</mi> </msub> <mo>+</mo> <mi>&amp;gamma;</mi> <mo>)</mo> </mrow> <mo>;</mo> </mrow>
Wherein,Behavior pattern, program category, the quantity of behavior type for participating in calculating are represented respectively;Respectively program, interbehavior and behavior pattern set;Represent the set of behavior pattern in m-th of document, Φ For program profile multinomialIn all program probability of happening Di Li Crays distribution, Ψ be interbehavior multinomial distributionIn The Di Li Crays distribution of all interbehavior probability of happening, Θ is behavior pattern multinomial distributionIn all behavior patterns occur The Di Li Crays distribution of probability;
Θ, Φ, Ψ are distributed based on Di Li Crays, and derive by chain rule the full probability of each behavior pattern:
Wherein,Respectively program, interbehavior and behavior pattern set, these three set inner elements wmn、tmn、 zmnRepresent respectively the program of the n-th viewing-data in m-th of document, the interbehavior of the n-th viewing-data in m-th of document, m-th The behavior pattern of n-th viewing-data in document;Represent behavior pattern setMiddle removal zmnSet afterwards;
The full probability for being based ultimately upon Di Li Crays distribution Θ, Φ, Ψ and each behavior pattern solves the phase that Di Li Crays are distributed Hope, be derived by equation below:
<mrow> <msub> <mi>&amp;theta;</mi> <mrow> <mi>m</mi> <mi>i</mi> </mrow> </msub> <mo>=</mo> <mfrac> <mrow> <msubsup> <mi>n</mi> <mi>m</mi> <mi>i</mi> </msubsup> <mo>+</mo> <mi>&amp;alpha;</mi> </mrow> <mrow> <msubsup> <mi>&amp;Sigma;</mi> <mrow> <mi>i</mi> <mo>=</mo> <mn>1</mn> </mrow> <mrow> <mi>K</mi> <mo>&amp;times;</mo> <mi>L</mi> </mrow> </msubsup> <mrow> <mo>(</mo> <msubsup> <mi>n</mi> <mrow> <mi>m</mi> <mo>+</mo> <mi>&amp;alpha;</mi> </mrow> <mi>i</mi> </msubsup> <mo>)</mo> </mrow> </mrow> </mfrac> <mo>;</mo> </mrow>
<mrow> <msub> <mi>&amp;phi;</mi> <mrow> <msub> <mi>kV</mi> <mi>w</mi> </msub> </mrow> </msub> <mo>=</mo> <mfrac> <mrow> <msubsup> <mi>n</mi> <mi>k</mi> <msub> <mi>V</mi> <mi>w</mi> </msub> </msubsup> <mo>+</mo> <mi>&amp;beta;</mi> </mrow> <mrow> <msubsup> <mi>&amp;Sigma;</mi> <mrow> <msub> <mi>V</mi> <mi>w</mi> </msub> <mo>=</mo> <mn>1</mn> </mrow> <msub> <mi>V</mi> <mi>w</mi> </msub> </msubsup> <mrow> <mo>(</mo> <msubsup> <mi>n</mi> <mi>k</mi> <msub> <mi>V</mi> <mi>w</mi> </msub> </msubsup> <mo>+</mo> <mi>&amp;beta;</mi> <mo>)</mo> </mrow> </mrow> </mfrac> <mo>;</mo> </mrow>
<mrow> <msub> <mi>&amp;phi;</mi> <mrow> <msub> <mi>lV</mi> <mi>l</mi> </msub> </mrow> </msub> <mo>=</mo> <mfrac> <mrow> <msubsup> <mi>n</mi> <mi>l</mi> <msub> <mi>V</mi> <mi>l</mi> </msub> </msubsup> <mo>+</mo> <mi>&amp;gamma;</mi> </mrow> <mrow> <msubsup> <mi>&amp;Sigma;</mi> <mrow> <msub> <mi>V</mi> <mi>l</mi> </msub> <mo>=</mo> <mn>1</mn> </mrow> <msub> <mi>V</mi> <mi>l</mi> </msub> </msubsup> <mrow> <mo>(</mo> <msubsup> <mi>n</mi> <mi>l</mi> <msub> <mi>V</mi> <mi>l</mi> </msub> </msubsup> <mo>+</mo> <mi>&amp;gamma;</mi> <mo>)</mo> </mrow> </mrow> </mfrac> <mo>;</mo> </mrow>
Wherein,User is represented to V in kth kind rating typewThe viewing number of times of portion's program,Distribution is represented in l kind rows For the V of typelThe number of times of interbehavior,Represent the frequency of i-th of behavior pattern in m-th of document;θmiRespectively single behavior pattern, program, the probability of happening of interbehavior.
5. a kind of video resource Popularity prediction method according to claim 1, it is characterised in that the combination nerve net Network model, the resource popularity of institute's statistics and behavior pattern matrix are inputted as neutral net, pre- by training generation Model is surveyed, the video resource popularity in future is predicted according to the forecast model of training to be included:
The Nonlinear Mapping to behavior pattern matrix and the resource popularity of statistics, input layer are realized using BP neural network Relevant with input/output argument with output layer neuron number, the rating type of the behavior pattern matrix program of input has K kinds, defeated Go out for following video resource popularity, then input layer and output layer node number are respectively K and 1, and node in hidden layer is set to S It is individual;
The behavior vector of the program of different rating types in the resource popularity of statistics, and behavior pattern matrix is read Enter, then training data and test data are divided into by data are read in;
Start to initialize BP neural network, training method uses steepest descent method, and batch mode trains behavioral pattern data, then Sample is inputted into BP neural network by the way of batch is trained, the error of each sample is calculated;Finally judge whether convergence, if Do not restrain, weights are adjusted according to steepest descent method, until convergence, so as to obtain forecast model;
The test data of delimitation is inputted into BP neural network, the video resource popularity in future is predicted using forecast model.
CN201710409198.2A 2017-06-02 2017-06-02 Video resource popularity prediction method Pending CN107222787A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710409198.2A CN107222787A (en) 2017-06-02 2017-06-02 Video resource popularity prediction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710409198.2A CN107222787A (en) 2017-06-02 2017-06-02 Video resource popularity prediction method

Publications (1)

Publication Number Publication Date
CN107222787A true CN107222787A (en) 2017-09-29

Family

ID=59946918

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710409198.2A Pending CN107222787A (en) 2017-06-02 2017-06-02 Video resource popularity prediction method

Country Status (1)

Country Link
CN (1) CN107222787A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107864405A (en) * 2017-11-14 2018-03-30 国家新闻出版广电总局广播科学研究院 A kind of Forecasting Methodology, device and the computer-readable medium of viewing behavior type
CN107909108A (en) * 2017-11-15 2018-04-13 东南大学 Edge cache system and method based on content popularit prediction
CN108259929A (en) * 2017-12-22 2018-07-06 北京交通大学 A kind of prediction of video active period pattern and caching method
CN109272165A (en) * 2018-09-30 2019-01-25 江苏满运软件科技有限公司 Register probability predictor method, device, storage medium and electronic equipment
CN109409921A (en) * 2018-08-31 2019-03-01 阿里巴巴集团控股有限公司 The recognition methods of the popularity of electronic equipment, device, server
CN109447246A (en) * 2018-10-30 2019-03-08 北京字节跳动网络技术有限公司 Method and apparatus for generating model
CN110139125A (en) * 2019-06-18 2019-08-16 洛阳师范学院 Video sharing method based on demand perception and caching resource under mobile radio network
CN110704614A (en) * 2019-08-30 2020-01-17 中国平安人寿保险股份有限公司 Information processing method and device for predicting user group type in application
US20210049687A1 (en) * 2019-08-14 2021-02-18 Royal Bank Of Canada Systems and methods of generating resource allocation insights based on datasets
CN114785858A (en) * 2022-06-20 2022-07-22 武汉格蓝若智能技术有限公司 Resource active caching method and device applied to mutual inductor online monitoring system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103209342A (en) * 2013-04-01 2013-07-17 电子科技大学 Collaborative filtering recommendation method introducing video popularity and user interest change
CN106028155A (en) * 2016-06-22 2016-10-12 深圳大学 Method and system for carrying out evolution analysis on popularity of network video
CN106570597A (en) * 2016-11-14 2017-04-19 广州大学 Content popularity prediction method based on depth learning under SDN architecture

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103209342A (en) * 2013-04-01 2013-07-17 电子科技大学 Collaborative filtering recommendation method introducing video popularity and user interest change
CN106028155A (en) * 2016-06-22 2016-10-12 深圳大学 Method and system for carrying out evolution analysis on popularity of network video
CN106570597A (en) * 2016-11-14 2017-04-19 广州大学 Content popularity prediction method based on depth learning under SDN architecture

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
TINGYAO WU等: "On the Use of Reservoir Computing in Popularity Prediction", 《2010 2ND INTERNATIONAL CONFERENCE ON EVOLVING INTERNET》 *
YA ZHANG等: "A Time-Topic Coupled LDA Model for IPTV User Behaviors", 《 IEEE TRANSACTIONS ON BROADCASTING》 *
何明一: "《神经计算 原理·语言·设计·应用》", 30 November 1992 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107864405A (en) * 2017-11-14 2018-03-30 国家新闻出版广电总局广播科学研究院 A kind of Forecasting Methodology, device and the computer-readable medium of viewing behavior type
CN107864405B (en) * 2017-11-14 2020-12-01 国家新闻出版广电总局广播科学研究院 Viewing behavior type prediction method, device and computer readable medium
CN107909108A (en) * 2017-11-15 2018-04-13 东南大学 Edge cache system and method based on content popularit prediction
CN107909108B (en) * 2017-11-15 2021-06-11 东南大学 Edge cache system and method based on content popularity prediction
CN108259929B (en) * 2017-12-22 2020-03-06 北京交通大学 Prediction and caching method for video active period mode
CN108259929A (en) * 2017-12-22 2018-07-06 北京交通大学 A kind of prediction of video active period pattern and caching method
CN109409921A (en) * 2018-08-31 2019-03-01 阿里巴巴集团控股有限公司 The recognition methods of the popularity of electronic equipment, device, server
CN109272165A (en) * 2018-09-30 2019-01-25 江苏满运软件科技有限公司 Register probability predictor method, device, storage medium and electronic equipment
CN109447246A (en) * 2018-10-30 2019-03-08 北京字节跳动网络技术有限公司 Method and apparatus for generating model
CN109447246B (en) * 2018-10-30 2021-01-15 北京字节跳动网络技术有限公司 Method and apparatus for generating a model
CN110139125A (en) * 2019-06-18 2019-08-16 洛阳师范学院 Video sharing method based on demand perception and caching resource under mobile radio network
CN110139125B (en) * 2019-06-18 2021-01-22 洛阳师范学院 Video sharing method based on demand perception and resource caching in wireless mobile network
US20210049687A1 (en) * 2019-08-14 2021-02-18 Royal Bank Of Canada Systems and methods of generating resource allocation insights based on datasets
CN110704614A (en) * 2019-08-30 2020-01-17 中国平安人寿保险股份有限公司 Information processing method and device for predicting user group type in application
CN110704614B (en) * 2019-08-30 2023-09-19 中国平安人寿保险股份有限公司 Information processing method and device for predicting user group type in application
CN114785858A (en) * 2022-06-20 2022-07-22 武汉格蓝若智能技术有限公司 Resource active caching method and device applied to mutual inductor online monitoring system

Similar Documents

Publication Publication Date Title
CN107222787A (en) Video resource popularity prediction method
CN103793537B (en) System for recommending individual music based on multi-dimensional time series analysis and achieving method of system
CN109902708A (en) A kind of recommended models training method and relevant apparatus
CN106339507B (en) Streaming Media information push method and device
CN105912729A (en) Three-dimensional time sequence dynamic model based mobile phone game recommendation system
CN106844530A (en) Training method and device of a kind of question and answer to disaggregated model
CN106294830A (en) The recommendation method and device of multimedia resource
CN103198086A (en) Information processing device, information processing method, and program
CN110474944B (en) Network information processing method, device and storage medium
US20220172260A1 (en) Method, apparatus, storage medium, and device for generating user profile
Wang et al. Recurrent recommendation with local coherence
CN110457283A (en) A kind of teaching comment based on data mining and recommender system and its method
CN109902823A (en) A kind of model training method and equipment based on generation confrontation network
CN109460503A (en) Answer input method, device, storage medium and electronic equipment
CN116629937A (en) Marketing strategy recommendation method and device
CN114218488A (en) Information recommendation method and device based on multi-modal feature fusion and processor
CN108647364A (en) A kind of prediction recommendation method based on mobile terminal application data
Mussumeci et al. Reconstructing news spread networks and studying its dynamics
CN109993636A (en) A kind of interpretable recommendation score prediction technique of exercise of fusion aspect information
CN113424204A (en) Method for performing legal license checks of digital content
Tuan et al. Knowledge injection into dialogue generation via language models
CN115034847A (en) Product recommendation method, system, storage medium and equipment based on deep learning
Fan et al. Context-aware ubiquitous web services recommendation based on user location update
CN113761272A (en) Data processing method, data processing equipment and computer readable storage medium
Wang Forecast model of TV show rating based on convolutional neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170929

RJ01 Rejection of invention patent application after publication