CN105243063B - The method and apparatus of information recommendation - Google Patents

The method and apparatus of information recommendation Download PDF

Info

Publication number
CN105243063B
CN105243063B CN201410274081.4A CN201410274081A CN105243063B CN 105243063 B CN105243063 B CN 105243063B CN 201410274081 A CN201410274081 A CN 201410274081A CN 105243063 B CN105243063 B CN 105243063B
Authority
CN
China
Prior art keywords
source data
recommendation
dimension
client
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410274081.4A
Other languages
Chinese (zh)
Other versions
CN105243063A (en
Inventor
邓利利
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Feinno Communication Technology Co Ltd
Original Assignee
Beijing Feinno Communication Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Feinno Communication Technology Co Ltd filed Critical Beijing Feinno Communication Technology Co Ltd
Priority to CN201410274081.4A priority Critical patent/CN105243063B/en
Publication of CN105243063A publication Critical patent/CN105243063A/en
Application granted granted Critical
Publication of CN105243063B publication Critical patent/CN105243063B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a kind of method and apparatus of information recommendation, belong to data processing field.The described method includes: obtaining source data;Determine the dimension and the corresponding MapReduce model of each dimension of the source data;The source data is calculated parallel using determining each MapReduce model;The recommendation information being calculated is stored in Hbase system;When receiving the recommendation request of client, recommendation information related with the user of the client is obtained from the Hbase system and is sent to the client.Described device includes: to obtain module, determining module, computing module, memory module and recommending module.Present invention greatly enhances the speed of data processing, reduce the complexity of calculating, save computing overhead, improve the efficiency and accuracy of information recommendation.

Description

The method and apparatus of information recommendation
Technical field
The present invention relates to data processing field, in particular to a kind of method and apparatus of information recommendation.
Background technique
With the development of information technology, data are more and more important to enterprise, and data volume is also exponentially increasing, data The technologies such as storage and processing becoming hot spot.And valuable content how is extracted from huge data, and do with this Various recommendations have become mainstream business at present.
There is a kind of recommended technology at present, is that data are calculated based on collaborative filtering, and is pushed away what is be calculated It recommends result and is sent to user's progress information recommendation.But the algorithm complexity is high, algorithm is opened for large-scale data set Pin can become very huge, slow so as to cause data processing speed, result is inaccurate, and database corruption is also resulted in when serious.
Summary of the invention
In view of this, the embodiment of the invention provides a kind of method and apparatus of information recommendation, when improving information recommendation Speed and accuracy.The technical solution is as follows:
In a first aspect, providing a kind of method of information recommendation, which comprises
Obtain source data;
Determine the dimension and the corresponding MapReduce model of each dimension of the source data;
The source data is calculated parallel using determining each MapReduce model;
The recommendation information being calculated is stored in Hbase system;
When receiving the recommendation request of client, obtained from the Hbase system related with the user of the client Recommendation information and be sent to the client.
Optionally, the acquisition source data, comprising:
According to preset period timing acquisition source data;And/or
When source data changes, changed source data is obtained in real time.
Further, described that the source data is calculated parallel using determining each MapReduce model, it wraps It includes:
For the source data of timing acquisition, corresponding MapReduce model is utilized simultaneously according to preset static policies timing Row is calculated;
For the source data obtained in real time, according to preset dynamic strategy utilize in real time corresponding MapReduce model into Row calculates.
Optionally, the recommendation information that will be calculated is stored in Hbase system, comprising:
The User ID of client and recommendation list are stored in Hbase system, the User ID of the client is as master Key, the recommendation list include the recommendation information of each dimension of the user.
Further, the recommendation information of each dimension is spliced by designated symbols in the recommendation list, each dimension Recommendation information include: Refer ID, recommend reason and weight.
Second aspect, provides a kind of device of information recommendation, and described device includes:
Module is obtained, for obtaining source data;
Determining module, for determining the dimension and the corresponding MapReduce model of each dimension of the source data;
Computing module, for being calculated parallel the source data using determining each MapReduce model;
Memory module, for the recommendation information being calculated to be stored in Hbase system;
Recommending module, for being obtained and the client from the Hbase system when receiving the recommendation request of client The related recommendation information of the user at end is simultaneously sent to the client.
Optionally, the acquisition module includes:
Timing acquisition unit, for according to preset period timing acquisition source data;And/or
Real-time acquiring unit, for obtaining changed source data in real time when source data changes.
Further, the computing module includes:
Computing unit utilizes corresponding for the source data for timing acquisition according to preset static policies timing MapReduce model is calculated parallel;For the source data obtained in real time, correspondence is utilized in real time according to preset dynamic strategy MapReduce model calculated.
Optionally, the memory module includes:
Storage unit, for the User ID of client and recommendation list to be stored in Hbase system, the client For User ID as major key, the recommendation list includes the recommendation information of each dimension of the user.
Further, the recommendation information of each dimension is spliced by designated symbols in the recommendation list, each dimension Recommendation information include: Refer ID, recommend reason and weight.
Technical solution provided in an embodiment of the present invention has the benefit that acquisition source data;Determine the source data Dimension and the corresponding MapReduce model of each dimension;Using determining each MapReduce model parallel to the source Data are calculated;The recommendation information being calculated is stored in Hbase system;When the recommendation request for receiving client When, recommendation information related with the client is obtained from the Hbase system and is sent to the client, realizes information Recommend, is suitable for the MapReduce model of large-scale data set operation due to using, and use different MapReduce Model carries out parallel processing to the source data of different dimensions, greatly improves the speed of data processing, reduces answering for calculating Miscellaneous degree, saves computing overhead, improves the efficiency and accuracy of information recommendation.In addition, storing recommendation using Hbase system Breath has the advantages such as high reliability, high-performance, scalable, improves the safety of data storage.
Detailed description of the invention
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for For those of ordinary skill in the art, without creative efforts, it can also be obtained according to these attached drawings other Attached drawing.
Fig. 1 is the method flow diagram for the information recommendation that one embodiment of the invention provides;
Fig. 2 be another embodiment of the present invention provides two degree of friend recommendation schematic diagrames;
Fig. 3 be another embodiment of the present invention provides information recommendation method flow diagram;
Fig. 4 be another embodiment of the present invention provides information recommendation interaction schematic diagram;
Fig. 5 be another embodiment of the present invention provides information recommendation structure drawing of device;
Fig. 6 be another embodiment of the present invention provides information recommendation structure drawing of device.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with attached drawing to embodiment party of the present invention Formula is described in further detail.
The present embodiments relate to Hadoop platform, a distributed system infrastructures.Two spies of Hadoop platform Property is, using HDFS (Hadoop Distributed File System, Hadoop distributed file system) storing data, and The operation of big data is carried out based on MapReduce model.MapReduce model is a kind of programming model, by specifying one One group of key-value pair is mapped to one group of new key-value pair by Map function, and guarantees to own by specifying concurrent Reduce function Each of key-value pair of mapping shares identical key group.
Referring to Fig. 1, one embodiment of the invention provides a kind of method of information recommendation, comprising:
101: obtaining source data.
Wherein, source data can store on one or multiple servers.
102: determining the dimension and the corresponding MapReduce model of each dimension of the source data.
The dimension of source data is determined by customer relationship chain.In general, a customer relationship chain is exactly a dimension, including But be not limited to it is following any one: it is common friend, Boss ticket, common IP, common group, address list, stranger's communication, common Grouping etc..
103: the source data being calculated parallel using determining each MapReduce model.
Wherein, each dimension calculates the source data of the dimension using corresponding MapReduce model, multiple The source data of dimension is respectively adopted respective MapReduce model and is calculated parallel, to greatly improve the place of data Reason ability, improves calculating speed and efficiency.
Moreover, new MapReduce model parallel processing can be designed when source data increases new dimension, to existing MapReduce model operation do not impact, do not interfere with each other, have good scalability.
In the present embodiment, the source data of different dimensions corresponds to different MapReduce models, specifically, can be according to dimension The characteristic of degree designs MapReduce model, and the present embodiment is not specifically limited in this embodiment.
For example, source data is " PC and mobile communication for several times+full dose friend relation to ", the format of PC and mobile communication for several times Are as follows: user UID1, user UIDn, the number that user UID1 is communicated on PC with user UIDn, user UID1 is on mobile phone The number communicated with user UIDn.Wherein, user UID1 and user UIDn is the user of instant messaging, user's UIDn generation Several can carry out the user of instant messaging with user UID1 with table, and n be positive integer, user UID1 can in user UIDn Any one carries out mutually instant messaging.The full dose friend relation is to referring to all friend relation pair in instant messaging application, lattice Formula is UIDi, UIDj.It is not the user of good friend each other that purpose, which is for the user's recommendation for carrying out instant messaging,.Recommendation information is close Friend refers to that communication number of days reaches specified and just recommends mutually for X days or more, and above-mentioned communication number is also possible to communicate number of days.Firstly, Determine that the dimension of the source data for communication dimension, designs MapReduce model corresponding with the dimension and pushed away as follows Recommend calculating: described except processing again refers to that each user UIDn remembers firstly, carry out the communication number of PC and mobile phone except again handling Record a communication total degree, specially PC communicates the cumulative of number and mobile communication number.Then, according to full dose friend relation It is right, by the good friend in user UID1 and user UIDn to deletion, obtain the user pair set of non-friend relation.Finally, in the collection Communication user UIDn list is extracted in closing and is sorted from high to low according to communication total degree, and recommendation list is obtained.As user UID1 Client when sending recommendation request, which can directly be recommended to client, or the recommendation can also be arranged The user UIDn that number of days is more than given number of days is communicated in table recommends client as close friend.
For example, source data is " buddy list of the buddy list+D of A ", recommendation information is " two degree of good friends ".Referring to fig. 2, A Buddy list include B and C, the buddy list of D includes B and E, and A and D are good friends.Dimension can be determined according to source data It for buddy list, designs MapReduce model corresponding with the dimension and carries out recommendation calculating as follows: being arranged in the good friend of A The good friend B that D is filtered out in table (B, C), obtains result C, the recommendation information as D.A is filtered out in the buddy list (B, E) of D Good friend B obtain result E, the recommendation information as A.When A requests to recommend, using E as friend recommendation to A, when D requests to recommend When, using C as friend recommendation to D.
For example, source data is " group relation+full dose friend relation to ", target is not to be to the member in same group The progress of good friend is recommended mutually.The dimension for determining source data is " common group ".According to the corresponding of dimension design MapReduce model carries out recommendation calculating as follows: firstly, all members in same group are done cartesian product, obtaining To multiple groups friend couple;It then, will be the friend of good friend to filtering out;Finally, by the friend of the friend's centering obtained after filtering It is friendly to be mutually used as recommendation information.When receiving the recommendation request of some user, carried out all friends of the user as good friend Recommend.
104: the recommendation information being calculated is stored in Hbase system.
Hbase system has the characteristics that high reliability, high performance, is towards column, telescopic distributed memory system, especially It is suitable for the storages of mass data.And the storage based on column mode, there is no any association, the efficiency of inquiry between table and table It is very high.
The recommendation information may include much information, such as instant messaging ID, phone number, group ID, this implementation Example is not specifically limited in this embodiment.
105: when receiving the recommendation request of client, being obtained from the Hbase system related with the user of the client Recommendation information and be sent to the client.
Wherein, recommendation information can be stored according to the ID (Identity, identity recognition number) of user, therefore, received To some client recommendation request when, corresponding recommendation information can be inquired according to the User ID of the client, thus Recommend the user of the client.
The client that the present embodiment is related to can be mobile phone, PC machine, tablet computer etc., and the present embodiment does not do this specifically It limits.
In the present embodiment, optionally, source data is obtained, may include:
According to preset period timing acquisition source data;And/or
When source data changes, changed source data is obtained in real time.
Further, the source data is calculated parallel using determining each MapReduce model, may include:
For the source data of timing acquisition, corresponding MapReduce model is utilized simultaneously according to preset static policies timing Row is calculated;
For the source data obtained in real time, according to preset dynamic strategy utilize in real time corresponding MapReduce model into Row calculates.
In the present embodiment, optionally, the recommendation information being calculated is stored in Hbase system, may include:
The User ID of client and recommendation list are stored in Hbase system, the User ID of the client as major key, The recommendation list includes the recommendation information of each dimension of the user.
Further, the recommendation information of each dimension is spliced by designated symbols in the recommendation list, each dimension Recommendation information include: Refer ID, recommend reason and weight.
The above method provided in this embodiment obtains source data;Determine the dimension and each dimension pair of the source data The MapReduce model answered;The source data is calculated parallel using determining each MapReduce model;It will calculate Obtained recommendation information is stored in Hbase system;When receiving the recommendation request of client, obtained from the Hbase system It takes recommendation information related with the user of the client and is sent to the client, realize information recommendation, due to using It is suitable for the MapReduce model of large-scale data set operation, and using different MapReduce model to different dimensions Source data carry out parallel processing, greatly improve the speed of data processing, reduce the complexity of calculating, save operation Expense improves the efficiency and accuracy of information recommendation.In addition, storing recommendation information using Hbase system, have highly reliable Property, high-performance, the advantages such as scalable, improve the safety of data storage.
Referring to Fig. 3, another embodiment of the present invention provides a kind of methods of information recommendation, comprising:
301: according to preset period timing acquisition source data;And/or when source data changes, hair is obtained in real time The source data for changing.
Wherein it is possible to by realizing that FTP (File Transfer Protocol, File Transfer Protocol) client functionality comes Timing acquisition source data is executed, which can according to need setting, and the present embodiment does not limit specific value.Periodically The source data of acquisition can store in HDFS, which can be an equipment, or be also possible to a group system.
Obtaining source data in real time can be using HTTP (HyperText Transfer Protocol, Hyper text transfer association View) mode or GET (obtaining order) mode realizes, it is preferable that it is realized using HTTP mode and obtains source data in real time, peace Quan Xinggeng high.The source data obtained in real time can store in Hbase system, which may include an equipment, or Person's multiple devices.
Timing acquisition is suitable for the not high recommendation request of timeliness, obtains be more suitable for that timeliness is more demanding to be pushed away in real time Recommend request.It is therefore preferred that timing acquisition is combined application with real-time acquisition, it can adapt to various recommendation requests, apply It is more flexible.
302: determining the dimension and the corresponding MapReduce model of each dimension of the source data.
In the present embodiment, corresponding MapReduce model, and every kind can be designed for every kind of dimension in advance Proposed algorithm in MapReduce model is all different.Depending on algorithm foundation dimension, more targetedly, therefore, it is recommended that meter The result of calculation is more acurrate, realizes intelligent recommendation.
303: for the source data of timing acquisition, utilizing corresponding MapReduce mould according to preset static policies timing Type is calculated parallel.
Wherein, static policies are used for the source data of timing acquisition, since the source data of timing acquisition is usually all lot number According to data volume is larger, but timeliness is of less demanding, it is therefore possible to use the static policies that timing calculates.It such as can be in terms of every month Calculation is once or the calculating of per two weeks is primary etc., and the present embodiment is not specifically limited in this embodiment.
The source data of timing acquisition is usually directed to multiple dimensions, therefore, can carry out to the source data of multiple dimension Parallel processing is calculated parallel using respective MapReduce model.
304: for the source data obtained in real time, utilizing corresponding MapReduce mould in real time according to preset dynamic strategy Type is calculated.
Wherein, dynamic strategy is used for the source data obtained in real time, since what is obtained in real time is all changed data, leads to Regular data amount is smaller, but timeliness is higher, it is therefore possible to use the dynamic strategy calculated in real time.In most cases, it obtains in real time The source data taken is all the data of sole user, in the address list as changed in some mobile phone or some PC machine increase newly i.e. When communication good friend etc., the present embodiment is not specifically limited in this embodiment.
Above-mentioned two step 303 and 304 can carry out simultaneously, to achieve the effect that parallel processing.
In addition, on the basis of using MapReduce model, further, can also be used in the present embodiment Combiner function and Partitioner function etc. carry out optimization algorithm, effectively to improve the performance of algorithm, do not do herein excessive Explanation.
By carrying out real-time calculation processing to the source data obtained in real time, it can guarantee the high-timeliness recommended, realize Real-time recommendation.
305: the User ID of client and recommendation list being stored in Hbase system, the User ID conduct of the client Major key, the recommendation list include the recommendation information of each dimension of the user.
Preferably, the recommendation information of each dimension is spliced by designated symbols in the recommendation list, and each dimension pushes away Information is recommended to include: Refer ID, recommend reason and weight.
In the present embodiment, the User ID of client and recommendation list can be designed to a main table storage.For example, can be with As shown in table 1.
Table 1
Wherein, major key KEY is the user UID that length is 10, cannot be sky.Reasonlist is recommendation list, including The recommendation information of all dimensions.It wherein, is KEY with UID, it is ensured that the speed of storage and inquiry.In general, UID is greater than 8 Long number less than 10, the UID in KEY less than 10 need to mend 0 in front to guarantee length for 10.Build table Afterwards, region division can be carried out to table according to the distribution of KEY, being uniformly distributed for data storage is guaranteed with this, is deposited with reaching The optimization of storage and inquiry velocity.Recommendation list symbolization splices the form storage of the recommendation information of various dimensions, so as to To digitize recommendation information, memory space is more saved than written form storage.The separator of splicing can according to need fixed Justice, the present embodiment are not specifically limited in this embodiment.
As shown in table 1, recommendation list may include a plurality of record, and every record includes: to recommend UID, recommend reason and power Weight;A plurality of record can be sorted from large to small according to recommendation weight.Wherein, the separator between each item record can be " ";It pushes away Recommending UID, recommending the separator between reason and weight can be " # ";Separator between multiple recommendation reasons can be " | "; Recommending reason includes KEY and VALUE, and separator can be ": " etc..For example, a recommendation list comprising two records It is as follows: 289367798#0:1#5 $ 11111111#0:1#4.First is recorded as " 289367798#0:1#5 ", then, by this UID user recommends key;Wherein, the UID of recommended user is " 289367798 ";Recommending reason is " 0:1 ", and " 0 " is KEY value, " 1 " is VALUE value, and different VALUE values can correspond to different meanings, is defined as needed, such as " 0 " corresponding Boss ticket dimension It spends (reason), " 1 " corresponding Boss ticket talk times;Calculating weight according to rule is 5.Article 2 is recorded as " 11111111# 0:1#4 ", then, this UID user is recommended into key;Wherein, the UID of recommended user be " 11111111 ", recommend reason be " 0: 1 ", weight 4.It is noted that when there is the increase of new dimension, only need to after recommendation list additional record, simple side Just, it easily extends.
In addition, some numerical results can also be stored in distributed cache system to further increase inquiry velocity It unites in Memcached, whether when there is client to send recommendation request, inquire from Memcached has corresponding recommendation first Information, if so, the recommendation information is directly then returned to client;It is corresponded to if it is not, being inquired from Hbase system again Recommendation information recommended, thus avoid every time recommend all from Hbase system queries, alleviate the pressure of Hbase system, pole The earth improves inquiry velocity.
Therefore, the data storage in the present embodiment can consist of three parts, HDFS, Hbase and Memcached.Wherein, The source data of timing acquisition is stored in HDFS, is suitable for mass data storage;The source data obtained in real time and parallel meter The result of calculation is stored in Hbase, and data are safer, efficient;Memcahed is used to cache some numerical results, with further Improve query capability.
306: when receiving the recommendation request of client, being obtained from the Hbase system related with the user of the client Recommendation information and be sent to the client.
The above method can be executed by the device of information recommendation.The recommendation request of client can be sent out by server The device is given, returns to recommendation information to client after executing inquiry by the device.The process can be as shown in figure 4, specific stream Journey is as follows: S1, client send recommendation request to server;S2, server, which receive, to be transmitted to information after the recommendation request and pushes away Recommend device;After S3, information recommending apparatus receive the request, the recommendation information of the client is inquired to Hbase;S4, information recommendation The recommendation information that inquiry obtains is sent to server by device, is transmitted to client by server and is recommended.
The above method provided in this embodiment, according to preset period timing acquisition source data;And/or when source data is sent out When changing, changed source data is obtained in real time;Determine that dimension and each dimension of the source data are corresponding MapReduce model;For the source data of timing acquisition, corresponding MapReduce is utilized according to preset static policies timing Model is calculated parallel;For the source data obtained in real time, utilized in real time according to preset dynamic strategy corresponding MapReduce model is calculated;The User ID of client and recommendation list are stored in Hbase system, the client For User ID as major key, which includes the recommendation information of each dimension of the client;When receiving pushing away for client When recommending request, recommendation information related with the user of the client is obtained from the Hbase system and is sent to the client;It realizes Information recommendation is suitable for the MapReduce model of large-scale data set operation due to using, and using different MapReduce model carries out parallel processing to the source data of different dimensions, greatly improves the speed of data processing, reduces The complexity of calculating, saves computing overhead, improves the efficiency and accuracy of information recommendation.In addition, being deposited using Hbase system Recommendation information is stored up, there are the advantages such as high reliability, high-performance, scalable, improves the safety of data storage.
Referring to Fig. 5, another embodiment of the present invention provides a kind of devices of information recommendation, comprising:
Module 501 is obtained, for obtaining source data;
Determining module 502, for determining the dimension and the corresponding MapReduce model of each dimension of the source data;
Computing module 503, for being calculated parallel the source data using determining each MapReduce model;
Memory module 504, for the recommendation information being calculated to be stored in Hbase system;
Recommending module 505, for being obtained and the client from the Hbase system when receiving the recommendation request of client The related recommendation information of the user at end is simultaneously sent to the client.
Referring to Fig. 6, in the present embodiment, optionally, obtaining module 501 may include:
Timing acquisition unit 501a, for according to preset period timing acquisition source data;And/or
Real-time acquiring unit 501b, for obtaining changed source data in real time when source data changes.
In the present embodiment, further, computing module 503 may include:
Computing unit 503a utilizes correspondence according to preset static policies timing for the source data for timing acquisition MapReduce model calculated parallel;For the source data obtained in real time, utilized in real time pair according to preset dynamic strategy The MapReduce model answered is calculated.
In the present embodiment, optionally, memory module 504 may include:
Storage unit, for the User ID of client and recommendation list to be stored in Hbase system, the use of the client For family ID as major key, which includes the recommendation information of each dimension of the user.
In the present embodiment, further, the recommendation information of each dimension is spelled by designated symbols in the recommendation list It connects, the recommendation information of each dimension includes: Refer ID, recommends reason and weight.
In the present embodiment, above-mentioned apparatus can also logically be divided are as follows: first interface layer, data computation layer, data Accumulation layer and second interface layer.Wherein, first interface layer includes above-mentioned acquisition module 501, is responsible for obtaining source data;Data calculate Layer includes above-mentioned determining module 502 and computing module 503, is responsible for being calculated using MapReduce model;Data storage layer packet Above-mentioned memory module 504 is included, storage calculated result is responsible for;Second interface layer includes above-mentioned recommending module 505, responsible and client Interaction, sends recommendation information in client request, to complete to recommend.
Above-mentioned apparatus provided in this embodiment can execute the method that any of the above-described embodiment of the method provides, and be detailed in method reality The description in example is applied, is not repeated herein.
Above-mentioned apparatus provided in this embodiment obtains source data;Determine the dimension and each dimension pair of the source data The MapReduce model answered;The source data is calculated parallel using determining each MapReduce model;It will calculate Obtained recommendation information is stored in Hbase system;When receiving the recommendation request of client, obtained from the Hbase system It takes recommendation information related with the user of the client and is sent to the client, realize information recommendation, due to using It is suitable for the MapReduce model of large-scale data set operation, and using different MapReduce model to different dimensions Source data carry out parallel processing, greatly improve the speed of data processing, reduce the complexity of calculating, save operation Expense improves the efficiency and accuracy of information recommendation.In addition, storing recommendation information using Hbase system, have highly reliable Property, high-performance, the advantages such as scalable, improve the safety of data storage.
Those of ordinary skill in the art will appreciate that realizing that all or part of the steps of above-described embodiment can pass through hardware It completes, relevant hardware can also be instructed to complete by program, the program can store in a kind of computer-readable In storage medium, storage medium mentioned above can be read-only memory, disk or CD etc..
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims (6)

1. a kind of method of information recommendation, which is characterized in that the described method includes:
According to preset period timing acquisition source data;And/or
When source data changes, changed source data is obtained in real time;
If the source data is computer and mobile communication number and full dose friend relation pair, the source data is determined Dimension is to communicate dimension, if the source data is the buddy list of the first user and the buddy list of second user, determines institute The dimension of source data is stated as good friend's List Dimension, if the source data is group relation and full dose friend relation pair, determines institute The dimension for stating source data is common group's dimension, determines the corresponding MapReduce model of each dimension;
For the source data of timing acquisition, using corresponding MapReduce model and advance according to preset static policies timing Row calculates, and for the source data obtained in real time, is carried out in real time using corresponding MapReduce model according to preset dynamic strategy It calculates;
The recommendation information being calculated is stored in Hbase system, the part recommendation information in the Hbase system also stores In distributed cache system Memcached;
When receiving the recommendation request of client, if being stored with the corresponding recommendation of the client in Memcached system The corresponding recommendation information of the client is then sent to the client from Memcached system by information;
If not stored in Memcached system have the corresponding recommendation information of the client, obtained from the Hbase system Recommendation information related with the user of the client is simultaneously sent to the client.
2. the method according to claim 1, wherein the recommendation information that will be calculated is stored in Hbase In system, comprising:
The User ID of client and recommendation list are stored in Hbase system, the User ID of the client is as major key, institute State the recommendation information that recommendation list includes each dimension of the user.
3. according to the method described in claim 2, it is characterized in that, the recommendation information of each dimension is by referring in the recommendation list Determine symbol to be spliced, the recommendation information of each dimension includes: Refer ID, recommends reason and weight.
4. a kind of device of information recommendation, which is characterized in that described device includes:
Timing acquisition unit, for according to preset period timing acquisition source data;And/or
Real-time acquiring unit, for obtaining changed source data in real time when source data changes;
Determining module determines if being computer and mobile communication number and full dose friend relation pair for the source data The dimension of the source data is communication dimension, if the source data is the buddy list of the first user and the good friend of second user List determines that the dimension of the source data is good friend's List Dimension, if the source data is that group relation and full dose good friend are closed System pair determines that the dimension of the source data is common group's dimension, determines the corresponding MapReduce model of each dimension;
Computing unit utilizes corresponding for the source data for timing acquisition according to preset static policies timing MapReduce model is calculated parallel;For the source data obtained in real time, correspondence is utilized in real time according to preset dynamic strategy MapReduce model calculated;
Memory module, the part for the recommendation information being calculated to be stored in Hbase system, in the Hbase system Recommendation information also is stored in distributed cache system Memcached;
Recommending module, for when receiving the recommendation request of client, if being stored with the client pair in Memcached The corresponding recommendation information of the client is then sent to the client from Memcached by the recommendation information answered;If It is not stored in Memcached to have the corresponding recommendation information of the client, then it is obtained and the client from the Hbase system The related recommendation information of user and be sent to the client.
5. device according to claim 4, which is characterized in that the memory module includes:
Storage unit, for the User ID of client and recommendation list to be stored in Hbase system, the user of the client For ID as major key, the recommendation list includes the recommendation information of each dimension of the user.
6. device according to claim 5, which is characterized in that the recommendation information of each dimension is by referring in the recommendation list Determine symbol to be spliced, the recommendation information of each dimension includes: Refer ID, recommends reason and weight.
CN201410274081.4A 2014-06-18 2014-06-18 The method and apparatus of information recommendation Active CN105243063B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410274081.4A CN105243063B (en) 2014-06-18 2014-06-18 The method and apparatus of information recommendation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410274081.4A CN105243063B (en) 2014-06-18 2014-06-18 The method and apparatus of information recommendation

Publications (2)

Publication Number Publication Date
CN105243063A CN105243063A (en) 2016-01-13
CN105243063B true CN105243063B (en) 2019-11-15

Family

ID=55040715

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410274081.4A Active CN105243063B (en) 2014-06-18 2014-06-18 The method and apparatus of information recommendation

Country Status (1)

Country Link
CN (1) CN105243063B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106709009A (en) * 2016-12-26 2017-05-24 镇江恒驰科技有限公司 System for processing big data of electric vehicles on basis of cloud platform
CN113240536A (en) * 2021-05-14 2021-08-10 北京达佳互联信息技术有限公司 Information acquisition method, device, server, medium and product

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663627A (en) * 2012-04-26 2012-09-12 焦点科技股份有限公司 Personalized recommendation method
CN103345514A (en) * 2013-07-09 2013-10-09 焦点科技股份有限公司 Streamed data processing method in big data environment
CN103345698A (en) * 2013-07-09 2013-10-09 焦点科技股份有限公司 Personalized recommendation method based on cloud processing mode and applied in e-business environment
CN103500226A (en) * 2013-10-23 2014-01-08 中国农业银行股份有限公司 Method and device for removing sensitivity of sensitive data

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100011020A1 (en) * 2008-07-11 2010-01-14 Motorola, Inc. Recommender system
CN102663117B (en) * 2012-04-18 2013-11-20 中国人民大学 OLAP (On Line Analytical Processing) inquiry processing method facing database and Hadoop mixing platform
CN103853714B (en) * 2012-11-28 2017-06-20 中国移动通信集团河南有限公司 A kind of data processing method and device
CN103747060B (en) * 2013-12-26 2017-12-08 惠州华阳通用电子有限公司 A kind of distributed monitoring system and method based on streaming media service cluster

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663627A (en) * 2012-04-26 2012-09-12 焦点科技股份有限公司 Personalized recommendation method
CN103345514A (en) * 2013-07-09 2013-10-09 焦点科技股份有限公司 Streamed data processing method in big data environment
CN103345698A (en) * 2013-07-09 2013-10-09 焦点科技股份有限公司 Personalized recommendation method based on cloud processing mode and applied in e-business environment
CN103500226A (en) * 2013-10-23 2014-01-08 中国农业银行股份有限公司 Method and device for removing sensitivity of sensitive data

Also Published As

Publication number Publication date
CN105243063A (en) 2016-01-13

Similar Documents

Publication Publication Date Title
US11093501B2 (en) Searching in a database
CN106980669B (en) A kind of storage of data, acquisition methods and device
CN106897322B (en) A kind of access method and device of database and file system
US9454558B2 (en) Managing an index of a table of a database
US8775425B2 (en) Systems and methods for massive structured data management over cloud aware distributed file system
US8468120B2 (en) Systems and methods for tracking and reporting provenance of data used in a massively distributed analytics cloud
Phaneendra et al. Big Data-solutions for RDBMS problems-A survey
US9756119B2 (en) Apparatus and method for pipelined event processing in a distributed environment
US11176128B2 (en) Multiple access path selection by machine learning
Gürcan et al. Real-time processing of big data streams: Lifecycle, tools, tasks, and challenges
Bhardwaj et al. Big data emerging technologies: A CaseStudy with analyzing twitter data using apache hive
CN108021809A (en) A kind of data processing method and system
CN103761102B (en) A kind of uniform data service platform and its implementation
CN103646051A (en) Big-data parallel processing system and method based on column storage
Gupta et al. Faster as well as early measurements from big data predictive analytics model
Saxena et al. Practical real-time data processing and analytics: distributed computing and event processing using Apache Spark, Flink, Storm, and Kafka
CN109947729A (en) A kind of real-time data analysis method and device
US20180129690A1 (en) Schema-Free In-Graph Indexing
Zobaed et al. Big Data in the Cloud.
CN105243063B (en) The method and apparatus of information recommendation
De Prato et al. The next wave:" big data"?
US20190364109A1 (en) Scale out data storage and query filtering using storage pools
US10467193B1 (en) Real-time ad hoc querying of data records
CN108804502A (en) Big data inquiry system, method, computer equipment and storage medium
US20190362011A1 (en) Scale out data storage and query filtering using data pools

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP02 Change in the address of a patent holder
CP02 Change in the address of a patent holder

Address after: Room 810, 8 / F, 34 Haidian Street, Haidian District, Beijing 100080

Patentee after: BEIJING D-MEDIA COMMUNICATION TECHNOLOGY Co.,Ltd.

Address before: 100089 Beijing city Haidian District wanquanzhuang Road No. 28 Wanliu new building block A room 602

Patentee before: BEIJING D-MEDIA COMMUNICATION TECHNOLOGY Co.,Ltd.