CN104133837A - Internet information putting channel optimizing system based on distributed computing - Google Patents

Internet information putting channel optimizing system based on distributed computing Download PDF

Info

Publication number
CN104133837A
CN104133837A CN201410289052.5A CN201410289052A CN104133837A CN 104133837 A CN104133837 A CN 104133837A CN 201410289052 A CN201410289052 A CN 201410289052A CN 104133837 A CN104133837 A CN 104133837A
Authority
CN
China
Prior art keywords
user
channel
sigma
information
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410289052.5A
Other languages
Chinese (zh)
Other versions
CN104133837B (en
Inventor
张娅
魏逸
王宇晨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Media Intelligence Technology Co., Ltd.
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201410289052.5A priority Critical patent/CN104133837B/en
Publication of CN104133837A publication Critical patent/CN104133837A/en
Application granted granted Critical
Publication of CN104133837B publication Critical patent/CN104133837B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Program initiating; Program switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching

Abstract

The invention provides an Internet information putting channel optimizing system based on distributed computing. A data collecting module collects user behaviors; a data preprocessing module carries out data cleaning, integration and reduction, and simplifies and standardizes the collected user behavior information; a training module carries out iterative operation on the data in a training set by a class E-M algorithm, and parameters in a probability cumulative model are obtained; then, data in a test set is substituted into the probability cumulative model, so that the prediction on the contribution of each putting channel and the prediction on whether users are converted or not are completed; and websites or recommending types with a high contribution degree are subjected to information putting, and the information is put to users most possible to generate conversion. Meanwhile, an Hadoop platform is used for distributed computing, and complicated computing consuming a great number of resources is distributed onto multiple nodes for computing, so that the multi-node parallel processing is realized.

Description

A kind of internet information based on Distributed Calculation is thrown in channel optimization systems
Technical field
The present invention relates to Internet technical field, a kind of internet information based on Distributed Calculation is thrown in channel optimization systems specifically.
Background technology
In more than ten years in the past, internet is able to explosive growth, and increasing people selects online social activity, game, shopping, and internet information recommends to also become a very good approach promoting product.This also makes each enterprise can therefrom obtain a large amount of network datas and follows the tracks of recommendation effect and rate of return on investment.
The object that information is thrown in the research of channel contribution degree is exactly to quantize different channels for user, to transform the size of behavioral implications.By quantizing the contribution degree of each channel, value that can more different market channels, these market channels comprise Email, Alliance marketing, demonstration advertisement, search advertisements, Social Media etc., company also can decide following dynamics of investment of throwing in channel for difference according to these data, in the hope of the cost with minimum, obtains the popular attention rate for information maximization.
In the prior art, the system that internet information is thrown in the prediction of channel contribution degree has three kinds conventionally: 1, the internet information based on single source attribution model is thrown in channel contribution degree prognoses system: in this kind of system model used by all contributions all to an event in numerous events, as the system based on rearmost point striking (last-click), system etc. based on putting for the first time striking (first-click).That this system is considered to is very inaccurate, because it has ignored those in fact for the event of impact that transformed behavior generation.2, the internet information based on mark attribution model is thrown in channel contribution degree prognoses system: in this kind of system, model used comprises equal weight, client's credit, three kinds of modes of U-shaped curve.Equal weight is exactly to give all input channels identical weight.Client's credit is exactly according to the effect of throwing in the past, guesses artificially and gives different weights.U-shaped curve is that whole weights are transformed to transforming for the first time with last, does not consider the impact of intermediate information input effect.Obviously, the cogency of this system is also not enough, and in fact they are not good for the Evaluated effect of contribution degree yet.3, the internet information based on probability Distribution Model is thrown in channel contribution degree prognoses system: according to the information that user paid close attention to, user is transformed to the impact of behavior, give these channels different contribution degrees, then the contribution degree of these channels is arranged, sorted, to complete, throw in the assessment of channel contribution degree.The prediction that obvious this system gives is just more accurate, more reasonable.
Summary of the invention
For the deficiencies in the prior art, the object of the present invention is to provide a kind of internet information based on Distributed Calculation to throw in channel optimization systems, by user's the behavior of browsing optimization information, throw in the selection of channel, realize more exactly internet information and recommend, meet consumers' demand.
For achieving the above object, the present invention has adopted following technical scheme:
The invention provides a kind of internet information based on Distributed Calculation and throw in channel optimization systems, this system comprises: data collection module, data preprocessing module, training module, information are thrown in channel contribution degree prediction module and conversion ratio prediction module, wherein:
Data collection module, this module is collected user behavior data by web server: the user behavior of collecting is divided into two parts, part of records certain user whole browse behavior, another part has recorded the access characteristic of the different channels of same information;
Data preprocessing module, this module is that the user behavior data that server is collected is cleared up, integrated, reduction, the user behavior information of collecting is simplified to standardization;
Training module, this module be input as training set, and carry out interative computation with class E-M algorithm, iteration, to the customer impact intensity factor in the cumulative model of probability and this two parameters convergence of the factor of impact decay in time, completes the parameter estimation to these two parameters.
Information is thrown in channel contribution degree prediction module, this module be input as test set, structure information is thrown in channel m contribution degree, then sums up according to affiliated web site or type that each information is thrown in channel m, draws each website and all types of contribution degrees; Finally, according to each website and all types of contribution degrees, sort from high to low, select the forward website of rank or type to carry out information pushing, with this, obtain better input effect;
Conversion ratio prediction module, this module be input as test set, utilize survival function to mark to each user, dope the user that most possible generation transforms behavior, and push internet information to this part user.
Distributed Calculation based on Hadoop platform, in all modules, relate to calculating section above, all at Hadoop platform, carry out, we carry out complicated Computation distribution to a plurality of nodes, realized the parallel processing of multitask, reduced the wait between task, resource has been distributed more reasonable, arithmetic speed is greatly enhanced.
Compared with prior art, the present invention has following beneficial effect:
Internet information based on Distributed Calculation proposed by the invention is thrown in channel optimization systems, can greatly improve the accuracy of throwing in the prediction of channel contribution degree for information, thereby conveniently choose the most effective website or type, carrys out impression information; And selected most probable the user crowd who transforms has occurred, made information recommendation more targeted.Therefore, can exchange with minimum cost best recommendation effect for.In addition, data processing of the present invention, all based on Hadoop platform, has realized the parallel processing of multiple computers, while greatly reducing the large data of processing, for the requirement of computer arithmetic capability and internal memory, meanwhile, has greatly improved arithmetic speed.
Accompanying drawing explanation
By reading the detailed description of non-limiting example being done with reference to the following drawings, it is more obvious that other features, objects and advantages of the present invention will become:
Fig. 1 is the information domestic model figure based on server in one embodiment of the invention;
Fig. 2 is that in one embodiment of the invention, the internet information based on Distributed Calculation is thrown in channel optimization systems;
Fig. 3 is distributed computing framework figure in one embodiment of the invention;
Fig. 4 is the performance comparison diagram of native system and existing system in one embodiment of the invention.
Embodiment
Below in conjunction with specific embodiment, the present invention is described in detail.Following examples will contribute to those skilled in the art further to understand the present invention, but not limit in any form the present invention.It should be pointed out that to those skilled in the art, without departing from the inventive concept of the premise, can also make some distortion and improvement.These all belong to protection scope of the present invention.
As shown in Figure 1, the information domestic model figure based on server in the present invention, has clearly shown user information collection in figure, the formation of User profile, and the recommending module that the present invention builds is to be all stored in server, and processed by server.And user's client computer used is not responsible storage, processing user profile.
As shown in Figure 2, in the present invention, the input of the internet information based on Distributed Calculation channel optimization systems comprises:
Data collection module, is used web server to collect user behavior, the user behavior of collecting is divided into two parts: web page browsing message, Information message.Wherein, web page browsing message accounting certain user whole browse behavior, it can reflect the correlated characteristic of this user's browsing page; Information message accounting the access characteristic of the different channels of same information, click history and the feature of channel are thrown in its reflection for information.
Data preprocessing module, the user behavior data that server is collected carries out data scrubbing, integrated, reduction.
Training module, input training intensive data, based on maximum likelihood estimate, carries out interative computation with class E-M algorithm, thereby completes the parameter estimation to the cumulative model of probability;
Information is thrown in channel contribution degree prediction module and conversion ratio prediction module, calls the parameter obtaining from training set, and test data is brought into, thereby complete, information is thrown in to the prediction of channel contribution degree and the prediction whether user is transformed.
As shown in Figure 3, in the present invention, distributed computing framework figure has shown the Distributed Calculation based on Hadoop platform.Internet information based on Distributed Calculation is thrown in channel optimization systems and is related to calculating section in all modules, all at Hadoop platform, carry out, we carry out complicated Computation distribution to multinode, realized parallel processing, thereby saved a large amount of system resource, and greatly accelerated arithmetic speed.
As shown in Figure 4, the present embodiment provides a kind of internet information based on Distributed Calculation to throw in channel optimization systems, and uses the training of True Data collection and test.The present embodiment is chosen current internet information and is thrown in the system that contribution degree prediction field uses the system based on rearmost point striking the most widely and logic-based to return and compare.Experimental result shows, no matter the present invention is in the accuracy of the contribution degree of the different channels of prediction, or may transform in the accuracy of behavior in predictive user, is all better than two kinds of systems above.Final the present invention can also provide front N user and the most effective information of most probable generation conversion behavior and throw in channel.
The present embodiment is that described method is applied to the optimization that in internet, information is thrown in channel, and this system comprises:
1, data collection module
This module is based on web server, and what adopt that the method for behavior tracking records certain user all browses behavior; Adopt the method for daily record excavation, record the access characteristic of the different channels of same information; Complete the collection for user profile, and user profile is stored in to web server.
2, data preprocessing module
This module is carried out data scrubbing, integrated, reduction.Wherein, the method that data scrubbing is mainly taked to ignore first ancestral and removed redundancy, this is that the data proportion of void value is very little because in the data of collecting; Data integration is mainly the unit of unified collected data; Data stipulations are mainly carried out quantity stipulations, and the click time is converted into model parameter, and the final data set that comprises user ID, information input channel, time and these four territories of click that forms; Again a part for this data centralization is extracted, as training set; The concentrated data of remainder data are as test set.So far, can form the user profile of standard, also convenient next for the application of data.
3, training module
This module is responsible for training by the data in training set, completes the parameter estimation to the cumulative model of probability.
The situation that first training module is thrown according to actual information is made following hypothesis:
(1) each information display can produce an influence power to user's conversion;
(2) each information display is decayed in time to the influence power of user's conversion;
(3) same information is consistent with the rate of decay to all users' influence power;
(4) influence power of the information that different channels are thrown in can linear superposition;
(5) user's instantaneous conversion probability is directly proportional to influence power.
Based on above hypothesis, training module can be set up probability cumulative model, i.e. user behavior condition strength function λ u(t):
&lambda; u ( t ) = &Sigma; t i u < t a a d i u w ad i u exp ( - w ad i u ( t - t i u ) ) , t < T u 0 , otherwise
Wherein: wherein: note user for set 1 ..., U}, information channel for set 1 ..., n}, the user behavior of observing is set { C 1..., C u, the structure of the behavior record of user u is wherein the information that is the i time behavior of user u is thrown in channel id, the time of the i time behavior of user u, x uuser's conversion results (x u=1 represents that user transforms, x u=0 is anti-), l_u is the total degree of user u behavior, if user u has transformed, t urepresent transformation time, otherwise represent window node observing time.α is that the different channels information of throwing in is to customer impact intensity factor, the factor that ω decays in time for impact, k is that information is thrown in channel id, a_k, w_k respectively representative information throw in channel k affect intensity factor and the impact factor of decay in time, Tu represent transformation time or observing time window node.
Then for representing user's conversion ratio, set up survival function S u(t):
S u ( t ) = exp ( - &Integral; 0 t &lambda; u ( v ) dv )
Then by class EM algorithm:
L ( &Theta; ) = &Sigma; X u = 1 ( log ( &Sigma; i a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) ) ) + &Sigma; u ( - &Sigma; i a ad i u ( 1 - exp ( - w ad i u ( T u - t i u ) ) ) ) &GreaterEqual; &Sigma; X u = 1 &Sigma; i p i u log ( a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) p i u ) - &Sigma; u ( &Sigma; i a ad i u ( 1 - exp ( - w ad i u ( T u - t i u ) ) ) ) = Q ( &Theta; | &Theta; ( t ) )
E-step wherein:
p i u = a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) &Sigma; t i u < T u a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) , X u = 1 0 , X u = 0
M-step:
Order &PartialD; Q &PartialD; a k = 0 , &PartialD; Q &PartialD; w k = 0 Can obtain:
a k = &Sigma; X u = 1 &Sigma; ad i u = k p i u &Sigma; u &Sigma; ad i u = k ( 1 - exp ( - w ad i u ( t ) ( T u - t i u ) ) )
w k = &Sigma; X u = 1 &Sigma; ad i u = k p i u &Sigma; u &Sigma; ad i u = k ( p i u ( T u - t i u ) + a k ( t ) ( T u - t i u ) exp ( - w ad i u ( t ) ( T u - t i u ) ) )
Can complete training process.
4, information is thrown in channel contribution degree prediction module
This module is responsible for bringing test set into complete training process the cumulative model of probability, obtains the contribution degree that each different information is thrown in channel.
The contribution degree that information is thrown in channel m can be written as:
contribution ( m ) = &Sigma; X u = 1 &Sigma; ad i u = m a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) &Sigma; t i u < T u a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) / &Sigma; X u = 1 1
According to affiliated web site or the type of each information input channel m, sum up again, draw each website and all types of contribution degrees.Finally, choose website or the type that contribution degree is high and carry out information pushing, with the high efficiency of the propelling movement channel that guarantees to choose.
5, conversion ratio prediction module
Whether this module is responsible for predictive user conversion behavior and can be occurred.This module is utilized 1-S (T u) each user is marked, then user's mark is carried out to sequence from low to high, select the top n user that mark is the highest, think that they are the users that possible transform behavior.Subsequently, these predicted users that conversion behavior can occur are carried out to information pushing, make information recommendation more targeted, thereby improved propelling movement effect.
6, the Distributed Calculation based on Hadoop platform.The data of data centralization are assigned in the middle of a plurality of different mapper by programming, form a collection of intermediate result <key, value>, reducer can process intermediate result, and the item with identical key is merged.Finally, using amalgamation result as output, obtain the result α of this iteration, ω.Using this result as parameter, re-enter in mapper again, realize the interative computation of parameter estimation.Like this, just a complicated task is divided into much more fine-grained subtasks.And these subtasks can be dispatched between idle processing node, make the more tasks of the faster node processing of processing speed, thereby avoid the slow node of processing speed to extend the deadline of whole task, to reach the effect that improves arithmetic speed.Meanwhile, can avoid the wait between task, save system resource.
Implementation result
Technique scheme, use be real data set.
First, the present invention carrys out the quality of evaluating system according to F1 mark.
Wherein, the method for F1 mark is as follows:
F = 2 &times; R &times; P P + R
Wherein, P is accuracy rate, equaling (predicting the outcome and the actual ID number conforming to)/(the total ID number predicting the outcome) R is recall rate, equals (predicting the outcome and the actual ID number that has conversion conforming to)/(the ID sum that has conversion in test set).
By (a) in Fig. 4, can obviously find out that the score of F1 mark of the present invention will be higher than rearmost point striking and logistic regression, this is just explanation also, and the present invention may transform the accuracy of behavior user in predicting for front N, be far away higher than latter two system.
Subsequently, using accuracy rate as horizontal ordinate, recall rate, as ordinate, compares 3 kinds of systems.From Fig. 4, in (b), can find out, the in the situation that of identical recall rate, accuracy rate of the present invention will be higher than all the other two kinds of systems.It will be further appreciated that, when recall rate reaches 0.9 left and right, unusual good of effect of the present invention, that is to say, is covering under the condition of nearly all data, and practicality of the present invention is extremely good.
From above test, can find out, internet information based on Distributed Calculation of the present invention is thrown in channel optimization systems, accuracy and the user that can effectively improve the prediction of different information input channel contribution degree transform prediction accuracy, thereby better represent prediction effect, meet user's demand.Data processing of the present invention is all based on Hadoop platform, has realized the parallel processing of multiple computers, while greatly reducing the large data of processing, for the requirement of computer arithmetic capability and internal memory, meanwhile, has greatly improved arithmetic speed.
Above specific embodiments of the invention are described.It will be appreciated that, the present invention is not limited to above-mentioned specific implementations, and those skilled in the art can make various distortion or modification within the scope of the claims, and this does not affect flesh and blood of the present invention.

Claims (6)

1. the internet information based on Distributed Calculation is thrown in a channel optimization systems, and this system comprises:
Data collection module, this module is collected user behavior data by web server: the user behavior of collecting is divided into two parts, part of records certain user whole browse behavior, another part has recorded the access characteristic of the different channels of same information;
Data preprocessing module, this module is that the user behavior data that web server is collected is processed, and first carries out data scrubbing, the method for taking to ignore first ancestral and removing redundancy; Carry out subsequently data integration, to the processing that unitizes of the unit of collected data; Finally carry out data stipulations, the click time is converted into model parameter, and the final data set that comprises user ID, information input channel, time and these four territories of click that forms; Again a part for this data centralization is extracted, as training set; The concentrated data of remainder data are as test set;
Training module, this module is carried out interative computation with class E-M algorithm to training set, the factor ω convergence of the customer impact intensity factor α that iteration to probability adds up in model and impact decay in time, thus obtain parameter alpha, ω;
Information is thrown in channel contribution degree prediction module, this module be input as test set, adopt different channels that training module trains out on the factor ω of customer impact intensity factor α and impact decay in time as model parameter, build and throw in channel m contribution degree, affiliated web site or the type of according to each, throwing in channel m again sum up, and draw each website and all types of contribution degrees; Finally, according to each website and all types of contribution degrees, sort from high to low, select the forward website of rank or type to carry out information input, with this, optimize internet information and throw in effect;
Conversion ratio prediction module, this module be input as test set, first, set up survival function S u(t), then, utilize 1-S u(t) mark to each user, dope the user of most possible conversion, and to this part user's pushed information.
2. internet information based on Distributed Calculation according to claim 1 is thrown in channel optimization systems, it is characterized in that, described data collection module adopts that the method for behavior tracking records certain user all browses behavior; Adopt the method for daily record excavation, record the access characteristic of the different channels of same information, complete the collection for user profile, and user profile is stored in to web server.
3. the internet information based on Distributed Calculation according to claim 1 is thrown in channel optimization systems, it is characterized in that, described training module is set up probability cumulative model, i.e. user behavior condition strength function λ u(t):
&lambda; u ( t ) = &Sigma; t i u < t a a d i u w ad i u exp ( - w ad i u ( t - t i u ) ) , t < T u 0 , otherwise
Wherein: note user for set 1 ..., U}, information channel for set 1 ..., n}, the user behavior of observing is set { C 1..., C u, the structure of the behavior record of user u is wherein the information that is the i time behavior of user u is thrown in channel id, the time of the i time behavior of user u, x uuser's conversion results, x u=1 represents that user transforms, x u=0 anti-; L_u is the total degree of user u behavior, if user u has transformed, and t urepresent transformation time, otherwise represent window node observing time; α is that the different channels information of throwing in is to customer impact intensity factor, the factor that ω decays in time for impact, k is that information is thrown in channel id, a_k, w_k respectively representative information throw in channel k affect intensity factor and the impact factor of decay in time, Tu represent transformation time or observing time window node;
For representing user's conversion ratio, set up survival function S u(t), wherein:
S u ( t ) = exp ( - &Integral; 0 t &lambda; u ( v ) dv )
Then by class EM algorithm:
L ( &Theta; ) = &Sigma; X u = 1 ( log ( &Sigma; i a a d i u w ad i u exp ( - w ad i u ( T u - t i u ) ) ) ) + &Sigma; u ( - &Sigma; i a ad i u ( 1 - exp ( - w ad i u ( T u - t i u ) ) ) )
At this moment, demand L (θ) maximum;
E-step wherein:
p i u = a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) &Sigma; t i u < T u a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) , X u = 1 0 , X u = 0
M-step:
Order &PartialD; Q &PartialD; a k = 0 , &PartialD; Q &PartialD; w k = 0 :
a k = &Sigma; X u = 1 &Sigma; ad i u = k p i u &Sigma; u &Sigma; ad i u = k ( 1 - exp ( - w ad i u ( t ) ( T u - t i u ) ) )
w k = &Sigma; X u = 1 &Sigma; ad i u = k p i u &Sigma; u &Sigma; ad i u = k ( p i u ( T u - t i u ) + a k ( t ) ( T u - t i u ) exp ( - w ad i u ( t ) ( T u - t i u ) ) )
Complete training process.
4. the internet information based on Distributed Calculation according to claim 2 is thrown in channel optimization systems, it is characterized in that, described information is thrown in channel contribution degree prediction module and is responsible for bringing test set into complete training process the cumulative model of probability, obtain each different contribution degrees of throwing in channel, the contribution degree of throwing in channel m is written as:
contribution ( m ) = &Sigma; X u = 1 &Sigma; ad i u = m a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) &Sigma; t i u < T u a ad i u w ad i u exp ( - w ad i u ( T u - t i u ) ) / &Sigma; X u = 1 1
Affiliated web site or the type of according to each, throwing in channel m again sum up, and draw each website and all types of contribution degrees, last, choose website or the type that contribution degree is high and carry out information input, complete the optimization of internet information being thrown in to channel.
5. the internet information based on Distributed Calculation according to claim 1 is thrown in channel optimization systems, it is characterized in that, and described conversion ratio prediction module, the conversion ratio of user u can be: 1-S (T u), then user's mark is carried out to sequence from low to high, select the top n user that mark is the highest, think that they are users that conversion behavior most possibly occurs.
6. according to the internet information based on Distributed Calculation described in claim 1-5 any one, throw in channel optimization systems, it is characterized in that, the described Distributed Calculation based on Hadoop platform is divided into much more fine-grained subtasks by a complicated task, these subtasks can be dispatched between idle processing node, make the more tasks of the faster node processing of processing speed.
CN201410289052.5A 2014-06-24 2014-06-24 A kind of internet information based on Distributed Calculation delivers channel optimization systems Active CN104133837B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410289052.5A CN104133837B (en) 2014-06-24 2014-06-24 A kind of internet information based on Distributed Calculation delivers channel optimization systems

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410289052.5A CN104133837B (en) 2014-06-24 2014-06-24 A kind of internet information based on Distributed Calculation delivers channel optimization systems

Publications (2)

Publication Number Publication Date
CN104133837A true CN104133837A (en) 2014-11-05
CN104133837B CN104133837B (en) 2017-10-31

Family

ID=51806515

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410289052.5A Active CN104133837B (en) 2014-06-24 2014-06-24 A kind of internet information based on Distributed Calculation delivers channel optimization systems

Country Status (1)

Country Link
CN (1) CN104133837B (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104602182A (en) * 2015-02-12 2015-05-06 中国联合网络通信集团有限公司 Channel value acquiring method and channel value acquiring device
CN104835066A (en) * 2015-05-25 2015-08-12 北京京东尚科信息技术有限公司 Embarking channel selection method and system
CN105005944A (en) * 2015-07-10 2015-10-28 国家电网公司 Method and device for expanding optimal payment channel on the basis of big-data regression analysis algorithm
CN106296305A (en) * 2016-08-23 2017-01-04 上海海事大学 Electric business website real-time recommendation System and method under big data environment
CN106844178A (en) * 2017-01-22 2017-06-13 腾云天宇科技(北京)有限公司 Prediction is presented method, computing device, server and the system of information transferring rate
CN106919692A (en) * 2017-03-07 2017-07-04 广州优视网络科技有限公司 A kind of method and apparatus of PUSH message
CN106933880A (en) * 2015-12-31 2017-07-07 阿里巴巴集团控股有限公司 A kind of label data leaks channel detection method and device
CN107153843A (en) * 2017-05-03 2017-09-12 西安电子科技大学 Surface subsidence forecasting system and method based on SVMs
CN107273521A (en) * 2017-06-22 2017-10-20 百度在线网络技术(北京)有限公司 Feed content qualities evaluation method and device
CN107481062A (en) * 2017-08-21 2017-12-15 小草数语(北京)科技有限公司 The distribution method and device of advertisement putting budget
CN108241472A (en) * 2017-12-01 2018-07-03 北京大学 A kind of big data processing method and system for supporting locality expression function
CN108564398A (en) * 2018-03-29 2018-09-21 北京酷云互动科技有限公司 A kind of appraisal procedure, system and the storage medium of channel ad conversion rates
CN109190669A (en) * 2018-08-01 2019-01-11 新疆玖富万卡信息技术有限公司 A kind of intelligent recommendation method, electronic equipment and computer readable storage medium
CN109242521A (en) * 2017-07-11 2019-01-18 阿里巴巴集团控股有限公司 A kind of distribution method and device of predetermined consumption amount
CN110148014A (en) * 2019-04-24 2019-08-20 深圳市元征科技股份有限公司 Information processing method, device, block chain node device and storage medium
CN111095312A (en) * 2017-06-22 2020-05-01 日昶半导体制造有限公司 Information processing apparatus
CN111126614A (en) * 2018-11-01 2020-05-08 百度在线网络技术(北京)有限公司 Attribution method, attribution device and storage medium
CN111582934A (en) * 2020-05-07 2020-08-25 北京点众科技股份有限公司 Method, terminal and storage medium for determining option contribution degree for purchasing electronic book
CN112200618A (en) * 2020-10-29 2021-01-08 上海优扬新媒信息技术有限公司 Message channel attribution method, device and system
CN112348559A (en) * 2020-09-27 2021-02-09 北京淇瑀信息科技有限公司 Channel resource consumption optimization method and device and electronic equipment
CN112348556A (en) * 2020-09-27 2021-02-09 北京淇瑀信息科技有限公司 Channel resource consumption optimization method and device and electronic equipment
CN112668673A (en) * 2021-03-16 2021-04-16 腾讯科技(深圳)有限公司 Data preprocessing method and device, computer equipment and storage medium
CN113177174A (en) * 2021-05-21 2021-07-27 脸萌有限公司 Feature construction method, content display method and related device
CN116468470A (en) * 2023-04-24 2023-07-21 朴道征信有限公司 Evaluation method and device of user source channel, electronic equipment and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107862547B (en) * 2017-11-03 2021-06-11 网易乐得科技有限公司 Evaluation method, medium and device of delivery channel and computing equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110029636A1 (en) * 2009-07-31 2011-02-03 Barry Smyth Real time information feed processing
CN102567477A (en) * 2011-06-16 2012-07-11 北京亿赞普网络技术有限公司 Website value evaluation method and device
CN102956009A (en) * 2011-08-16 2013-03-06 阿里巴巴集团控股有限公司 Electronic commerce information recommending method and electronic commerce information recommending device on basis of user behaviors
CN103593380A (en) * 2013-08-06 2014-02-19 北京爱真心信息科技有限公司 Attractive user recommending platform of online dating site
CN103678647A (en) * 2013-12-20 2014-03-26 Tcl集团股份有限公司 Method and system for recommending information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110029636A1 (en) * 2009-07-31 2011-02-03 Barry Smyth Real time information feed processing
CN102567477A (en) * 2011-06-16 2012-07-11 北京亿赞普网络技术有限公司 Website value evaluation method and device
CN102956009A (en) * 2011-08-16 2013-03-06 阿里巴巴集团控股有限公司 Electronic commerce information recommending method and electronic commerce information recommending device on basis of user behaviors
CN103593380A (en) * 2013-08-06 2014-02-19 北京爱真心信息科技有限公司 Attractive user recommending platform of online dating site
CN103678647A (en) * 2013-12-20 2014-03-26 Tcl集团股份有限公司 Method and system for recommending information

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YI WEI等: "Learning the Hotness of Information Diffusions with Multi-dimensional Hawkes Processes", 《AGENTS AND DATA MINING INTERACTION》 *

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104602182A (en) * 2015-02-12 2015-05-06 中国联合网络通信集团有限公司 Channel value acquiring method and channel value acquiring device
CN104835066A (en) * 2015-05-25 2015-08-12 北京京东尚科信息技术有限公司 Embarking channel selection method and system
CN105005944A (en) * 2015-07-10 2015-10-28 国家电网公司 Method and device for expanding optimal payment channel on the basis of big-data regression analysis algorithm
US11080427B2 (en) 2015-12-31 2021-08-03 Alibaba Group Holding Limited Method and apparatus for detecting label data leakage channel
US10678946B2 (en) 2015-12-31 2020-06-09 Alibaba Group Holding Limited Method and apparatus for detecting label data leakage channel
CN106933880B (en) * 2015-12-31 2020-08-11 阿里巴巴集团控股有限公司 Label data leakage channel detection method and device
CN106933880A (en) * 2015-12-31 2017-07-07 阿里巴巴集团控股有限公司 A kind of label data leaks channel detection method and device
CN106296305A (en) * 2016-08-23 2017-01-04 上海海事大学 Electric business website real-time recommendation System and method under big data environment
CN106844178A (en) * 2017-01-22 2017-06-13 腾云天宇科技(北京)有限公司 Prediction is presented method, computing device, server and the system of information transferring rate
CN106844178B (en) * 2017-01-22 2019-11-15 腾云天宇科技(北京)有限公司 Prediction is presented the method for information transferring rate, calculates equipment, server and system
CN106919692B (en) * 2017-03-07 2021-02-19 阿里巴巴(中国)有限公司 Method and device for pushing message
CN106919692A (en) * 2017-03-07 2017-07-04 广州优视网络科技有限公司 A kind of method and apparatus of PUSH message
CN107153843A (en) * 2017-05-03 2017-09-12 西安电子科技大学 Surface subsidence forecasting system and method based on SVMs
CN107153843B (en) * 2017-05-03 2020-07-10 西安电子科技大学 Ground settlement prediction system and method based on support vector machine
CN111095312A (en) * 2017-06-22 2020-05-01 日昶半导体制造有限公司 Information processing apparatus
CN107273521B (en) * 2017-06-22 2020-10-09 百度在线网络技术(北京)有限公司 Feed content quality evaluation method and device
CN107273521A (en) * 2017-06-22 2017-10-20 百度在线网络技术(北京)有限公司 Feed content qualities evaluation method and device
CN109242521A (en) * 2017-07-11 2019-01-18 阿里巴巴集团控股有限公司 A kind of distribution method and device of predetermined consumption amount
CN107481062A (en) * 2017-08-21 2017-12-15 小草数语(北京)科技有限公司 The distribution method and device of advertisement putting budget
CN108241472A (en) * 2017-12-01 2018-07-03 北京大学 A kind of big data processing method and system for supporting locality expression function
CN108241472B (en) * 2017-12-01 2021-03-12 北京大学 Big data processing method and system supporting locality expression function
CN108564398A (en) * 2018-03-29 2018-09-21 北京酷云互动科技有限公司 A kind of appraisal procedure, system and the storage medium of channel ad conversion rates
CN109190669A (en) * 2018-08-01 2019-01-11 新疆玖富万卡信息技术有限公司 A kind of intelligent recommendation method, electronic equipment and computer readable storage medium
CN111126614A (en) * 2018-11-01 2020-05-08 百度在线网络技术(北京)有限公司 Attribution method, attribution device and storage medium
CN111126614B (en) * 2018-11-01 2024-01-16 百度在线网络技术(北京)有限公司 Attribution method, attribution device and storage medium
CN110148014A (en) * 2019-04-24 2019-08-20 深圳市元征科技股份有限公司 Information processing method, device, block chain node device and storage medium
CN110148014B (en) * 2019-04-24 2023-12-05 深圳市元征科技股份有限公司 Information processing method, information processing device, blockchain node equipment and storage medium
CN111582934A (en) * 2020-05-07 2020-08-25 北京点众科技股份有限公司 Method, terminal and storage medium for determining option contribution degree for purchasing electronic book
CN112348559A (en) * 2020-09-27 2021-02-09 北京淇瑀信息科技有限公司 Channel resource consumption optimization method and device and electronic equipment
CN112348556A (en) * 2020-09-27 2021-02-09 北京淇瑀信息科技有限公司 Channel resource consumption optimization method and device and electronic equipment
CN112200618A (en) * 2020-10-29 2021-01-08 上海优扬新媒信息技术有限公司 Message channel attribution method, device and system
CN112668673A (en) * 2021-03-16 2021-04-16 腾讯科技(深圳)有限公司 Data preprocessing method and device, computer equipment and storage medium
CN113177174A (en) * 2021-05-21 2021-07-27 脸萌有限公司 Feature construction method, content display method and related device
CN113177174B (en) * 2021-05-21 2024-02-06 脸萌有限公司 Feature construction method, content display method and related device
CN116468470A (en) * 2023-04-24 2023-07-21 朴道征信有限公司 Evaluation method and device of user source channel, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN104133837B (en) 2017-10-31

Similar Documents

Publication Publication Date Title
CN104133837A (en) Internet information putting channel optimizing system based on distributed computing
Liu et al. A convolutional click prediction model
Swathi et al. An optimal deep learning-based LSTM for stock price prediction using twitter sentiment analysis
US11100411B2 (en) Predicting influence in social networks
CN108509551B (en) Microblog network key user mining system and method based on Spark environment
Zhang et al. Sequential click prediction for sponsored search with recurrent neural networks
Chen et al. General functional matrix factorization using gradient boosting
CN104268292B (en) The label Word library updating method of portrait system
Dermouche et al. A joint model for topic-sentiment evolution over time
CN103714084A (en) Method and device for recommending information
CN105893609A (en) Mobile APP recommendation method based on weighted mixing
CN103793537A (en) System for recommending individual music based on multi-dimensional time series analysis and achieving method of system
EP2827294A1 (en) Systems and method for determining influence of entities with respect to contexts
Chen et al. Topic detection and tracking on heterogeneous information
CN104834641A (en) Processing method of network medium information and relevant system
Qiu et al. A framework for exploring organizational structure in dynamic social networks
CN114238573B (en) Text countercheck sample-based information pushing method and device
Li et al. A hybrid model for experts finding in community question answering
Liu et al. Multi-perspective User2Vec: Exploiting re-pin activity for user representation learning in content curation social network
CN101986301B (en) Inverse neighbor analysis-based collaborative filtering recommendation system and method
Jiang et al. BBS opinion leader mining based on an improved PageRank algorithm using MapReduce
CN111221881A (en) User characteristic data synthesis method and device and electronic equipment
She et al. Research on advertising click-through rate prediction based on CNN-FM hybrid model
Zhu et al. Topic correlation and individual influence analysis in online forums
Tang et al. Automatic identification of bottleneck tasks for business process management using fusion-based text clustering

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20181017

Address after: 200240 No. 800, Dongchuan Road, Shanghai, Minhang District

Co-patentee after: Wang Yanfeng

Patentee after: Zhang Ya

Address before: 200240 No. 800, Dongchuan Road, Shanghai, Minhang District

Patentee before: Shanghai Jiao Tong University

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20181116

Address after: Room 387, Building 333, Hongqiao Road, Xuhui District, Shanghai 200030

Patentee after: Shanghai Media Intelligence Technology Co., Ltd.

Address before: 200240 No. 800, Dongchuan Road, Shanghai, Minhang District

Co-patentee before: Wang Yanfeng

Patentee before: Zhang Ya

TR01 Transfer of patent right