CN104915431B - A data storage method and system - Google Patents

A data storage method and system

Info

Publication number
CN104915431B
Authority
CN
China
Prior art keywords
data
user
dimension
user behavior
operation data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510337071.5A
Other languages
Chinese (zh)
Other versions
CN104915431A (en
Inventor
谢贵明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tencent Computer Systems Co Ltd
Original Assignee
Shenzhen Tencent Computer Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Tencent Computer Systems Co Ltd
Priority to CN201510337071.5A
Publication of CN104915431A
Application granted
Publication of CN104915431B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/283 Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements
    • G06Q30/0251 Targeted advertisements
    • G06Q30/0254 Targeted advertisements based on statistics

Abstract

Embodiments of the invention disclose a data storage method and system in the field of data information processing. When collecting statistics on user behavior, a recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple groups of dimensions, and then store that data in a structure with multiple dimensions and multiple time windows. Because each group of dimensions may require its user operation data to be kept for a different length of time, the method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving as much storage space as possible.

Description

A data storage method and system
Technical field
The present invention relates to the field of data information processing, and in particular to a data storage method and system.
Background
In existing recommendation systems (such as advertising systems or news recommendation systems), a merchant can use the recommendation system to send the data to be recommended (such as advertisement data or news data) to user terminals, in a targeted or untargeted manner, in order to promote a product or convey information. The recommendation system needs to collect, in real time, statistics on the operations that user terminals perform on the recommended data they receive, that is, on user behavior.
In the prior art, when collecting statistics on user behavior, a recommendation system mainly stores each user's behavior data over a fixed time period and under fixed dimensions.
Summary of the invention
Embodiments of the present invention provide a data storage method and system that store user behavior data in a structure with multiple dimensions and multiple time windows.
An embodiment of the present invention provides a data storage method, including:
acquiring user behavior data, where the user behavior data includes user operation data and application-related data;
adjusting the user behavior data according to dimensions, so that the adjusted user behavior data includes user operation data corresponding to each of multiple groups of dimensions, where the dimension information is contained in the application-related data; and
storing the adjusted user behavior data in a multi-time sliding window storage structure, where the storage structure includes the user operation data, within a time window, corresponding to each of the multiple groups of dimensions.
An embodiment of the present invention also provides a data storage system, including:
a data acquisition unit, configured to acquire user behavior data, where the user behavior data includes user operation data and application-related data;
an adjustment unit, configured to adjust, according to dimensions, the user behavior data acquired by the data acquisition unit, so that the adjusted user behavior data includes user operation data corresponding to each of multiple groups of dimensions, where the dimension information is contained in the application-related data; and
a storage unit, configured to store the user behavior data adjusted by the adjustment unit in a multi-time sliding window storage structure, where the storage structure includes the user operation data, within a time window, corresponding to each of the multiple groups of dimensions.
In the embodiments of the present invention, when collecting statistics on user behavior, the recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple groups of dimensions, and then store that data in a structure with multiple dimensions and multiple time windows. Because each group of dimensions may require its user operation data to be kept for a different length of time, the method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving as much storage space as possible.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the accompanying drawings needed to describe the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of a data storage method provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a data storage system provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of another data storage system provided by an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of a recommendation system provided by an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of the advertising system provided in an application embodiment of the present invention;
Fig. 6 is a schematic structural diagram of the multi-time sliding window storage structure in an embodiment of the present invention;
Fig. 7 is a schematic diagram of a multi-time sliding window storage structure in an application embodiment of the present invention;
Fig. 8 is a schematic diagram of another multi-time sliding window storage structure in an application embodiment of the present invention;
Fig. 9 is a schematic diagram of yet another multi-time sliding window storage structure in an application embodiment of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments that a person of ordinary skill in the art obtains from these embodiments without creative effort fall within the scope of protection of the present invention.
The terms "first", "second", "third", "fourth", and so on (if present) in the description, the claims, and the accompanying drawings are used to distinguish similar objects, not to describe a specific order or sequence. Data designated in this way are interchangeable where appropriate, so that the embodiments described here can be implemented in orders other than those illustrated or described. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion: for example, a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
An embodiment of the present invention provides a data storage method that is mainly used to store the behavior data collected from each user terminal when a recommendation system, such as an advertising system or a news recommendation system, collects statistics on user behavior. The method is performed by the recommendation system. Its flow chart is shown in Fig. 1, and it includes:
Step 101: Acquire user behavior data. The user behavior data includes user operation data and application-related data.
User behavior here refers to the operations that the user's terminal performs on the recommended data sent by the recommendation system, such as exposure (that is, choosing to view the recommended data), liking, and collecting. If the recommendation system is an advertising system, the recommended data may be advertisement data for a product; if it is a news recommendation system, the recommended data may be news data. After the recommendation system sends recommended data to a user terminal, it can actively fetch user behavior data from the terminal, periodically or at irregular intervals, or the terminal can actively report user behavior data to the recommendation system.
Specifically, the acquired user behavior data can include two kinds of data: user operation data, which relates to the operation itself, and data related to the specific application. User operation data can include a user behavior identifier (such as a user behavior number), the time T1 at which the user operation occurred, and a quantity, and can also include a user identifier. The application-related data can include the recommendation position (the placement of the recommended data, such as the bottom or top of the user interface), the recommended data identifier (which uniquely identifies one item of recommended data), and the application identifier and application type of the application that the recommended data describes. Each moment can thus correspond to one group of user behavior data.
Step 102: Adjust the user behavior data according to dimensions, so that the adjusted user behavior data includes user operation data corresponding to each of multiple groups of dimensions, where the dimension information is contained in the application-related data. The adjustment rearranges data already present in the user behavior data acquired in step 101, so all data in the adjusted user behavior data is contained in the data acquired in step 101. Any group among the multiple groups of dimensions can include one or more dimensions.
The dimension information can include at least one of the following: recommended data type, recommended data group type, recommendation position, recommended data identifier, and the application type and application identifier of the application that the recommended data describes.
Specifically, if a group of user behavior data acquired at some moment in step 101 includes the information of n dimensions (n is a natural number greater than 1), such as the recommendation position and the recommended data identifier, together with the user operation data for that moment, then the adjustment can turn this group into m groups of sub-data. Each group of sub-data includes the user operation data for that moment and the information of at least one of the n dimensions. Here m is a natural number less than or equal to p, where p is the total number of ways to choose at least one of the n dimensions, that is, p = C(n,1) + C(n,2) + ... + C(n,n) = 2^n - 1. For example, if n is 2, m is at most 3; if n is 3, m is at most 7. The value of m depends on which user behavior statistics the recommendation system actually needs.
For example, suppose a group of user behavior data includes: user identifier, recommendation position, recommended data identifier, user behavior identifier, the time at which the user behavior occurred, and quantity, where the recommendation position and the recommended data identifier are dimension information and the rest is user operation information. This group can be adjusted into three groups of sub-data. The first group includes the user identifier, recommended data identifier, user behavior identifier, time of occurrence, and quantity. The second group includes the user identifier, recommendation position, user behavior identifier, time of occurrence, and quantity. The third group includes the user identifier, recommendation position, recommended data identifier, user behavior identifier, time of occurrence, and quantity.
If a group of user behavior data acquired in step 101 includes the information of only one dimension, that group is left unchanged.
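The adjustment in step 102 can be sketched in Python as follows. This is only an illustration under stated assumptions, not the patented implementation: the function name `split_by_dimension`, the field names (`user_id`, `position`, `item_id`, and so on), and the optional `wanted` filter (which models the case m < p) are all hypothetical.

```python
from itertools import combinations

def split_by_dimension(record, dimension_keys, wanted=None):
    """Split one user-behavior record into sub-records, one per dimension group.

    record         -- dict holding both dimension info and user operation data
    dimension_keys -- tuple naming the fields that are dimension information
    wanted         -- optional collection of dimension tuples to keep (m <= p)
    """
    # Everything that is not dimension information is user operation data.
    operation = {k: v for k, v in record.items() if k not in dimension_keys}
    groups = []
    # Enumerate every non-empty combination of dimensions: 2**n - 1 in total.
    for size in range(1, len(dimension_keys) + 1):
        for combo in combinations(dimension_keys, size):
            if wanted is not None and combo not in wanted:
                continue
            sub = {**operation, **{k: record[k] for k in combo}}
            groups.append((combo, sub))
    return groups

# Hypothetical record with two dimensions (n = 2), so at most 3 sub-records.
record = {
    "user_id": "u1",
    "position": "top",
    "item_id": "ad42",
    "behavior": "like",
    "time": "12:25:04",
    "count": 1,
}
subrecords = split_by_dimension(record, ("item_id", "position"))
for combo, sub in subrecords:
    print(combo, sub)
```

With two dimensions the sketch yields the three groups of the example above: one keyed by the recommended data identifier, one by the recommendation position, and one by both.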
Step 103: Store the adjusted user behavior data obtained in step 102 in a multi-time sliding window storage structure. The storage structure includes the user operation data, within a time window, corresponding to each of the multiple groups of dimensions.
The time windows corresponding to the groups of dimensions can be the same or different. Two time windows are the same when they have the same width and the same start and end times. Two time windows can differ in both width and start and end times; or have the same width but different start and end times (for example, both windows are N days wide, but one covers the most recent N days while the other covers N days from a given point in time); or have the same start time or end time but different widths.
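The difference between a window over the most recent N days and a window of N days from a given point in time can be made concrete with a small sketch. The `TimeWindow` class and its fields are hypothetical names chosen for illustration, and the dates are invented; the patent does not prescribe any particular representation.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional

@dataclass(frozen=True)
class TimeWindow:
    width: timedelta
    # None means a rolling window covering the most recent `width`;
    # otherwise the window is fixed and begins at `start`.
    start: Optional[datetime] = None

    def bounds(self, now):
        if self.start is None:
            return now - self.width, now
        return self.start, self.start + self.width

    def contains(self, moment, now):
        begin, end = self.bounds(now)
        return begin <= moment <= end

# Two windows of the same width (7 days) but different start and end times.
rolling = TimeWindow(width=timedelta(days=7))
fixed = TimeWindow(width=timedelta(days=7), start=datetime(2015, 6, 1))

now = datetime(2015, 6, 17)
event = datetime(2015, 6, 12)
print(rolling.contains(event, now))  # inside the most recent 7 days
print(fixed.contains(event, now))    # outside 1 June to 8 June
```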
It can be seen that, in this embodiment, when collecting statistics on user behavior, the recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple groups of dimensions, and then store that data in a structure with multiple dimensions and multiple time windows. Because each group of dimensions may require its user operation data to be kept for a different length of time, the method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving as much storage space as possible.
Note that, in a specific embodiment, any group of dimensions in the multi-time sliding window storage structure can correspond to the user operation information of at least one kind of user operation, each kind of user operation corresponds to a time window, and the time windows of different kinds of user operation can be the same or different. The different user operations under each group of dimensions can therefore also be stored together according to their respective time windows, meeting the requirements of each kind of operation with as little storage space as possible. For example, if the data for collect operations at some recommendation position must be kept for a longer period, the time window for that operation is longer; if the data for like operations at that position needs to be kept for a shorter period, the time window for that operation is shorter.
In a specific embodiment, for a group of user behavior data acquired by the recommendation system, when performing step 103 the recommendation system uses the user operation data corresponding to each group of dimensions in the adjusted user behavior data from step 102 to replace the user operation data that lies outside the corresponding time window in the already stored multi-time sliding window storage structure.
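One way to read this replacement step is as "drop what has slid out of the window, then add the new data". The sketch below follows that reading for a single group of dimensions. The `WINDOWS` table, the function `store_in_window`, and the dates are assumptions made for illustration; the 7-day and 30-day values are borrowed from the application example later in the description.

```python
from datetime import datetime, timedelta

# Hypothetical per-operation time windows for one group of dimensions.
WINDOWS = {"exposure": timedelta(days=7), "like": timedelta(days=30)}

def store_in_window(slots, behavior, moment, count, now):
    """Store new data and discard data that fell outside the time window.

    slots -- dict: behavior -> {moment: count} for one group of dimensions
    """
    cutoff = now - WINDOWS[behavior]
    # Keep only the stored entries that are still inside the window.
    kept = {t: c for t, c in slots.get(behavior, {}).items() if t >= cutoff}
    # The new entry takes the place freed by the expired entries.
    if moment >= cutoff:
        kept[moment] = kept.get(moment, 0) + count
    slots[behavior] = kept
    return slots

now = datetime(2015, 6, 17)
slots = {"exposure": {datetime(2015, 6, 1): 3, datetime(2015, 6, 15): 2}}
store_in_window(slots, "exposure", now, 1, now)
print(slots["exposure"])  # the 1 June entry is gone, 17 June is added
```

Because each operation looks up its own window, exposure data and like data under the same dimensions can share one structure while expiring on different schedules.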
In another specific embodiment, to further reduce storage space, the recommendation system can first merge the adjusted user behavior data when performing step 103 and then store it. Specifically:
If, in the user behavior data adjusted in step 102, the user operation data corresponding to some group of dimensions includes user operation data for multiple moments, the recommendation system can merge the user operation data for those moments into user operation data for a single moment, where the single moment covers the multiple moments, and then store the merged user operation data in the multi-time sliding window storage structure.
For example, if some user operation data includes user operation data for four moments, 12:25:04, 12:25:20, 12:25:48, and 12:25:50, the recommendation system can merge them into the user operation data for 12:25.
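The four moments in this example collapse to one when each timestamp is truncated to the minute. A minimal sketch, assuming timestamps are Python `datetime` objects; the calendar date is invented and the helper name `truncate_to_minute` is hypothetical:

```python
from datetime import datetime

def truncate_to_minute(moment):
    """Map a second-precision moment to the minute that contains it."""
    return moment.replace(second=0, microsecond=0)

# The four moments from the example: 12:25:04, 12:25:20, 12:25:48, 12:25:50.
moments = [datetime(2015, 6, 17, 12, 25, s) for s in (4, 20, 48, 50)]
merged = {truncate_to_minute(m) for m in moments}
print(merged)  # a single moment, 12:25
```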
Note also that steps 101 to 103 describe how the recommendation system stores user behavior data. In other specific embodiments, after step 103 has been performed, a user can also use another client to actively request the data stored in the system. Specifically, for the recommendation system:
When the recommendation system receives a read request for the user operation data of some user operation, it can obtain, according to the read request, the user operation data related to that user operation from the multi-time sliding window storage structure. Specifically, the result can include, from the user behavior data of each user, the user operation data related to that user operation.
An embodiment of the present invention also provides a data storage system, whose structure is shown in Fig. 2 and which includes:
A data acquisition unit 10, configured to acquire user behavior data, where the user behavior data includes user operation data and application-related data. Specifically, the data acquisition unit 10 can actively fetch user behavior data from user terminals, periodically or at irregular intervals, or receive user behavior data that user terminals actively report to the data storage system.
An adjustment unit 11, configured to adjust, according to dimensions, the user behavior data acquired by the data acquisition unit 10, so that the adjusted user behavior data includes user operation data corresponding to each of multiple groups of dimensions, where the dimension information is contained in the application-related data. The user operation data includes a user behavior identifier, the time at which the user operation occurred, and a quantity, and can also include a user identifier. The dimension information can include at least one of the following: recommended data type, recommended data group type, recommendation position, recommended data identifier, and the application type and application identifier of the application that the recommended data describes.
Specifically, if a group of user behavior data acquired by the data acquisition unit 10 includes the information of n dimensions and user operation data, where n is a natural number greater than 1, the adjustment unit 11 is specifically configured to adjust that group into m groups of sub-data. Each group of sub-data includes the user operation data contained in that group of user behavior data and the information of at least one of the n dimensions, where m is a natural number less than or equal to p, and p is the total number of ways to choose at least one of the n dimensions.
If a group of user behavior data acquired by the data acquisition unit 10 includes the information of only one dimension, the adjustment unit 11 leaves that group unchanged.
A storage unit 12, configured to store the user behavior data adjusted by the adjustment unit 11 in a multi-time sliding window storage structure, where the storage structure includes the user operation data, within a time window, corresponding to each of the multiple groups of dimensions.
Specifically, any group of dimensions in the multi-time sliding window storage structure corresponds to the user operation information of at least one kind of user operation, and each kind of user operation corresponds to a time window.
It can be seen that, when the data storage system of this embodiment collects statistics on user behavior, the adjustment unit 11 can adjust the acquired user behavior data into user operation data corresponding to each of multiple groups of dimensions, and the storage unit 12 then stores that data in a structure with multiple dimensions and multiple time windows. Because each group of dimensions may require its user operation data to be kept for a different length of time, the system allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving as much storage space as possible.
In a specific embodiment, for a group of user behavior data, if the system has already stored user behavior data for that group, the storage unit 12 is specifically configured to use the user operation data corresponding to each group of dimensions in the adjusted user behavior data to replace the user operation data that lies outside the corresponding time window in the already stored multi-time sliding window storage structure.
In another specific embodiment, to further reduce storage space, when the user operation data corresponding to some group of dimensions includes user operation data for multiple moments, the storage unit 12 is specifically configured to merge the user operation data for those moments into user operation data for a single moment, where the single moment covers the multiple moments, and to store the merged user operation data in the multi-time sliding window storage structure.
Referring to Fig. 3, in a specific embodiment the data storage system can include, in addition to the structure shown in Fig. 2, a read request unit 13 and a request acquisition unit 14, where:
The read request unit 13 is configured to receive a read request, where the read request asks for the user operation data of some user operation;
The request acquisition unit 14 is configured to obtain, according to the read request received by the read request unit 13, the user operation data related to that user operation from the multi-time sliding window storage structure.
An embodiment of the present invention also provides a recommendation system, whose structure is shown in Fig. 4. The recommendation system can vary considerably with configuration or performance. It can include one or more central processing units (CPUs) 20 (for example, one or more processors), a memory 21, and one or more storage media 22 (for example, one or more mass storage devices) that store application programs 221 or data 222. The memory 21 and the storage medium 22 can provide transient or persistent storage. A program stored in the storage medium 22 can include one or more modules (not marked in the figure), and each module can include a series of instruction operations for the recommendation system. Further, the central processing unit 20 can be configured to communicate with the storage medium 22 and to execute, on the recommendation system, the series of instruction operations in the storage medium 22.
The recommendation system can also include one or more power supplies 23, one or more wired or wireless network interfaces 24, one or more input/output interfaces 25, and/or one or more operating systems 223, such as Windows Server™, Mac OS X™, Unix™, Linux™, or FreeBSD™.
The steps performed by the recommendation system in the method embodiment above can be based on the recommendation system structure shown in Fig. 4.
The method of the embodiment is illustrated below with a specific application example. In this example the recommendation system to which the data storage method applies is an advertising system, and the recommended data it sends to user terminals is advertisement data. The advertising system mainly delivers advertisement data to WeChat Moments and collects statistics on the behavior of WeChat user terminals toward that data. Specifically, it counts, per user under the advertisement position dimension, behaviors such as advertisement exposure (most recent 7 days) and like and not-interested (most recent 30 days), and it counts the same behaviors per user under the dimension of a particular advertisement: exposure (most recent 7 days) and like and not-interested (most recent 30 days).
In this example the advertising system is implemented with the structure shown in Fig. 5, which includes a log receiving module, multiple data adjustment modules, multiple sliding window statistics modules, and a sliding window storage module. The log receiving module can be a Spout component, the data adjustment module can be a CombineBolt component, and the sliding window storage module can be a WindowBolt component. The advertising system can further include a user-facing interface module, that is, a data application module, where:
1. The log receiving module mainly receives user behavior data and distributes each user's behavior data to the data adjustment modules at random. The data can be sent in the form: [user identifier, advertisement position, advertisement identifier, user behavior number, time T1 at which the user operation occurred, quantity]. The user behavior can be exposure, like, not interested, and so on. The advertisement position here is the recommendation position described above, and the advertisement identifier is the recommended data identifier described above.
2. The data adjustment module adjusts the received user behavior data according to dimensions, so that the adjusted data includes user operation data corresponding to multiple groups of dimensions, then performs a merge and sends the merged data to the sliding window statistics module. The merge reduces the volume of data output to the sliding window statistics module and can reduce the final storage space.
When adjusting the user behavior data, the data adjustment module turns the group of data for each moment into three groups of sub-data: [user identifier, advertisement identifier, user behavior number, time T1, quantity], [user identifier, advertisement position, user behavior number, time T1, quantity], and [user identifier, advertisement position, advertisement identifier, user behavior number, time T1, quantity].
When merging, the data adjustment module mainly merges the user operation data for multiple moments into user operation data for one moment, producing [user identifier, dimension information, user behavior number, time T2, merged quantity], and then sends the merged user behavior data to the sliding window statistics module. The merged quantity is the sum of the quantities for the moments being merged. Time T2 covers the multiple T1 moments: for example, if T1 is precise to the second, T2 is precise to the minute. The precision of T2 depends on the granularity of the time windows of the dimensions; if the time window granularity is 1 day or 1 hour, T2 is precise to the minute.
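The merge performed by the data adjustment module can be sketched as a group-and-sum over records that share the same user, dimension information, behavior, and minute. The tuple layout mirrors the record form given above, but the function name `merge_records`, the sample values, and the fixed one-minute granularity for T2 are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime

def merge_records(records):
    """Merge records whose T1 falls in the same minute and sum their quantities.

    records -- iterable of (user_id, dimension, behavior, t1, quantity),
               where `dimension` is a hashable value such as a tuple
    returns -- list of (user_id, dimension, behavior, t2, merged_quantity)
    """
    totals = defaultdict(int)
    for user_id, dimension, behavior, t1, quantity in records:
        # T2 is T1 at minute precision, so it covers every T1 in that minute.
        t2 = t1.replace(second=0, microsecond=0)
        totals[(user_id, dimension, behavior, t2)] += quantity
    return [key + (total,) for key, total in totals.items()]

position = ("position", "top")  # hypothetical dimension information
records = [
    ("u1", position, "exposure", datetime(2015, 6, 17, 12, 25, 4), 1),
    ("u1", position, "exposure", datetime(2015, 6, 17, 12, 25, 20), 1),
    ("u1", position, "exposure", datetime(2015, 6, 17, 12, 26, 1), 2),
]
for merged_record in merge_records(records):
    print(merged_record)
```

Here the two records within 12:25 become one record with a merged quantity of 2, while the 12:26 record stays separate.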
It should be noted that the data adjustment module mainly processes the user-behavior data corresponding to one user.
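The merge from second-level T1 to minute-level T2 can be sketched as below. This is an illustrative sketch under assumptions: records are plain tuples, the dimension information is a single string, and the function name `merge_to_t2` and the 60-second default bucket are choices made for the example.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

Key = Tuple[str, str, str, int]  # (user_id, dimension_info, behavior, t2)


def merge_to_t2(records: Iterable[Tuple[str, str, str, int, int]],
                bucket_seconds: int = 60) -> Dict[Key, int]:
    """Sum the quantities of records whose T1 falls into the same T2 bucket."""
    merged: Dict[Key, int] = defaultdict(int)
    for user_id, dim, behavior, t1, count in records:
        t2 = t1 - (t1 % bucket_seconds)  # truncate T1 to the start of its bucket
        merged[(user_id, dim, behavior, t2)] += count
    return dict(merged)
```

Because several T1 records collapse into one T2 record, fewer records reach the sliding-window statistics module, which is the stated purpose of the merge.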
3. The sliding-window statistics module mainly reads data from the sliding-window storage module and integrates the data it has read with the data sent by the data adjustment module. Specifically, it uses the user operation data corresponding to each of the multiple groups of dimensions in the adjusted user-behavior data to replace, in the data it has read, the user operation data that lies outside the time window corresponding to each group of dimensions. It then writes the integrated data back to the sliding-window storage module.
In a specific implementation, before integrating the data, the sliding-window statistics module may also merge the data sent by different data adjustment modules, that is, merge the user-behavior data of different users.
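The integration step can be pictured with the following sketch, which folds incoming counts into one stored time slot and drops the time slices that have slid out of the window. The function name `integrate`, the dictionary representation of a time slot, and the use of a single integer time unit are assumptions made for illustration.

```python
from typing import Dict


def integrate(slot: Dict[int, int], incoming: Dict[int, int],
              now: int, window: int, granularity: int) -> Dict[int, int]:
    """Fold incoming counts into a time slot and evict slices outside the window.

    `slot` and `incoming` map a time to a count; `now`, `window`, and
    `granularity` use the same time unit (days in the usage below).
    """
    result = dict(slot)  # leave the data that was read untouched
    for t, count in incoming.items():
        slice_start = t - (t % granularity)
        result[slice_start] = result.get(slice_start, 0) + count
    current_slice = now - (now % granularity)
    oldest_kept = current_slice - window + granularity
    return {t: c for t, c in result.items() if oldest_kept <= t <= current_slice}
```

For example, with a 7-day window and a 1-day granularity, a slot holding days 1 to 7 that receives data for day 8 ends up holding days 2 to 8: the slice for day 1 is outside the window and is replaced by the new slice.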
4. The sliding-window storage module stores the user-behavior data according to the multi-time sliding-window storage structure, specifically as a structure of storage key (key) and storage value (value), where the storage key is the user identifier and the storage value includes the dimension information and the user operation data. In this embodiment, the user-behavior data in each row is therefore the user-behavior data corresponding to one user.
As shown in Fig. 6 and Fig. 7, the sliding-window storage module stores multiple column families, and each column family holds the user-behavior data corresponding to one user. Each column family includes multiple pairs of column keyword and column value, where the column keyword is the dimension information (such as the ad slot and/or ad identifier in Fig. 7). In this way the data of one user under different dimensions can be stored together, which saves storage space. Further, each column value includes multiple pairs of index keyword and time slot, and each index keyword is a user behavior identifier (for example, user exposure and user like in Fig. 7). In this way the user operation information of different user operations under the same dimension can be stored together. The time windows of the data stored in the time slots may be the same or different, which meets the time-window requirements of the user operation information of different user operations: for example, user exposure data only needs to be stored for the most recent 7 days, whereas user like data needs to be stored for 30 days.
Each time slot stores the data within the time window of the corresponding user operation, specifically multiple pairs of time slice and index number. Time slice 1 is the sliding time slice: for example, if the user exposure data of the most recent 7 days is required, the sliding granularity is 1 day, and the current date is the 7th, then the time slot stores the data of the 1st through the 7th, and time slice 1 is the time of the 1st.
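The four-layer structure described above can be modeled with nested dictionaries, as in the sketch below. This is only an in-memory illustration of the layout; the patent does not name a storage engine, and `WINDOW_DAYS`, `new_store`, and `record` are hypothetical names. The 7-day and 30-day windows follow the example in the text.

```python
from collections import defaultdict

# Assumed per-behavior time windows, in days (the 7-day / 30-day example).
WINDOW_DAYS = {"exposure": 7, "like": 30}


def new_store():
    # layer 1: storage key (user identifier)
    # layer 2: column keyword (dimension information)
    # layer 3: index keyword (user behavior identifier)
    # layer 4: time slot, a dict of {time slice (day): index number}
    return defaultdict(lambda: defaultdict(lambda: defaultdict(dict)))


def record(store, user_id: str, dim: str, behavior: str,
           day: int, count: int, today: int) -> None:
    """Add a count to one time slot and drop slices older than that behavior's window."""
    slot = store[user_id][dim][behavior]
    slot[day] = slot.get(day, 0) + count
    oldest = today - WINDOW_DAYS[behavior] + 1
    for d in [d for d in slot if d < oldest]:
        del slot[d]
```

Because each behavior has its own window, exposure and like data for the same user and dimension sit side by side in one column value while being trimmed to different lengths.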
It is understood that in above-described embodiment, what ad system counted is the number of user behavior corresponding to each user According to.In other specific embodiments, if ad system wants the data of user behavior on one advertisement position of statistics, sliding window is deposited It is user behavior corresponding to an advertisement position to store up each row race in multiple row races in module in the storage organization of more time sliding windows Data, now, each keyword for arranging race be advertising site mark, and being worth corresponding to keyword specifically can be as shown in Figure 8:Each In multipair row keyword and train value that row race includes, keyword is each user mark (such as user's mark 1 and 2 in Fig. 8); In multipair index keyword and time slot that each train value includes, index keyword is user behavior mark (such as the use in Fig. 8 Family exposes and user thumbs up), the index number corresponded in time window is stored in time slot.In this case, the information of dimension It can be advertisement position.
Further, the multi-time sliding-window storage structures shown in Fig. 6 to Fig. 8 are four-layer structures. In other specific embodiments, the multi-time sliding-window storage structure may have fewer than four layers. As shown in Fig. 9, if the advertising system wants to count the user-behavior data on an ad slot, each of the multiple column families in the multi-time sliding-window storage structure in the sliding-window storage module holds the user-behavior data corresponding to one ad slot. In this case each column family includes multiple pairs of index keyword and time slot, the index keyword is a user behavior identifier (for example, user exposure and user like in Fig. 9), and the time slot stores the index numbers within the corresponding time window. In this case, the dimension information is the ad slot.
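One way to picture the relationship between the four-layer, per-user layout and the three-layer, per-ad-slot layout is to sum the per-user time slots over all users. The text does not say that one layout is derived from the other, so the sketch below is purely illustrative; `collapse_users` and the type aliases are hypothetical names.

```python
from typing import Dict

# user -> dimension -> behavior -> {time slice: index number}
FourLayer = Dict[str, Dict[str, Dict[str, Dict[int, int]]]]
# dimension -> behavior -> {time slice: index number}
ThreeLayer = Dict[str, Dict[str, Dict[int, int]]]


def collapse_users(store: FourLayer) -> ThreeLayer:
    """Aggregate a per-user layout into a per-dimension layout by summing over users."""
    out: ThreeLayer = {}
    for dims in store.values():
        for dim, behaviors in dims.items():
            for behavior, slot in behaviors.items():
                target = out.setdefault(dim, {}).setdefault(behavior, {})
                for t, count in slot.items():
                    target[t] = target.get(t, 0) + count
    return out
```

In the resulting structure the top-level key is the ad slot, and each entry maps directly from a user behavior identifier to a time slot, which corresponds to the layered description given for Fig. 9.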
The multi-time sliding-window storage structure may also hold content other than that shown in Fig. 7, Fig. 8, and Fig. 9. The specific content is determined mainly by the data that the advertising system counts, and is not described further here.
5. A user may request, through the data application module, to read the data stored in the sliding-window storage module, for example to read the user operation information of a certain user operation.
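A read request of this kind can be sketched as a lookup followed by a sum over the requested part of the time window. The function name `read_count`, its parameters, and the nested-dictionary layout (user, dimension, behavior, then day-to-count time slot) are assumptions made for illustration.

```python
from typing import Dict


def read_count(store: Dict[str, Dict[str, Dict[str, Dict[int, int]]]],
               user_id: str, dim: str, behavior: str,
               today: int, days: int) -> int:
    """Return the total count of one behavior over the most recent `days` days."""
    slot = store.get(user_id, {}).get(dim, {}).get(behavior, {})
    oldest = today - days + 1
    return sum(count for day, count in slot.items() if oldest <= day <= today)
```

A request for a user, dimension, or behavior that has no stored data simply returns 0 in this sketch.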
In this application example, the user-behavior data counted under the time windows of different dimensions can be flexibly stored together in a centralized way, without being stored separately, which can save storage space to the greatest extent.
It should be noted that the above method can also be applied to a recommendation system for news data, or to other applications that recommend data to user terminals, and achieves the same effect there, which is not described further here.
A person of ordinary skill in the art will understand that all or part of the steps of the methods in the above embodiments may be completed by a program instructing the related hardware. The program may be stored in a computer-readable storage medium, and the storage medium may include read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, and the like.
The data storage method and system provided by the embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principle and implementation of the invention, and the description of the above embodiments is intended only to help in understanding the method of the invention and its core idea. Meanwhile, a person of ordinary skill in the art may, following the idea of the invention, make changes to the specific implementation and the scope of application. In summary, the content of this specification should not be construed as limiting the invention.

Claims (14)

  1. A data storage method, characterized by comprising:
    obtaining user-behavior data, the user-behavior data including user operation data and application-related data;
    adjusting the user-behavior data by dimension, so that the adjusted user-behavior data includes the user operation data corresponding to each of multiple groups of dimensions, the dimension information being included in the application-related data;
    saving the adjusted user-behavior data as a multi-time sliding-window storage structure, the multi-time sliding-window storage structure including the user operation data, within a time window, corresponding to each of the multiple groups of dimensions, and the user operation data including a user behavior identifier;
    wherein the multi-time sliding-window storage structure includes multiple column families, each column family includes multiple pairs of keyword and column value, each column value includes multiple pairs of index keyword and time slot, and the data stored in each time slot includes multiple pairs of time slice and index number; the keyword in the column family is the dimension information, or each column family corresponds to one dimension;
    or the multi-time sliding-window storage structure includes multiple column families, each column family includes multiple pairs of index keyword and time slot, each column family corresponds to one dimension, and the data stored in each time slot includes multiple pairs of time slice and index number;
    wherein the index keyword is a user behavior identifier.
  2. The method according to claim 1, characterized in that
    the user operation data includes: a user behavior identifier, the time at which the user operation occurred, and a quantity;
    the dimension information includes at least one of the following: recommended-data type, recommended-data-set type, recommendation position, recommended-data identifier, and the application type and application identifier of the application described by the recommended data.
  3. The method according to claim 1, characterized in that, in the multi-time sliding-window storage structure, any group of dimensions among the multiple groups of dimensions corresponds to the user operation information of at least one user operation, and each of the at least one user operation corresponds to one time window.
  4. The method according to any one of claims 1 to 3, characterized in that, in the user-behavior data, one group of user-behavior data includes the information of n dimensions and user operation data, n being a natural number greater than 1;
    the adjusting of the user-behavior data by dimension, so that the adjusted user-behavior data includes the user operation data corresponding to each of multiple groups of dimensions, specifically comprises: adjusting the one group of user-behavior data into m groups of sub-data, wherein any group of sub-data includes the user operation data included in the one group of user-behavior data and the information of at least one of the n dimensions, m is a natural number less than or equal to p, and p is the sum of the numbers of combinations obtained by taking from the n dimensions, in turn, each natural number of dimensions less than or equal to n.
  5. The method according to any one of claims 1 to 3, characterized in that the saving of the adjusted user-behavior data as a multi-time sliding-window storage structure specifically comprises:
    using the user operation data corresponding to each of the multiple groups of dimensions included in the adjusted user-behavior data to replace, respectively, the user operation data in the stored multi-time sliding-window storage structure that lies outside the time window corresponding to each of the multiple groups of dimensions.
  6. The method according to any one of claims 1 to 3, characterized in that
    the user operation data corresponding to a certain group of dimensions among the multiple groups of dimensions includes the user operation data of multiple moments;
    the saving of the adjusted user-behavior data as a multi-time sliding-window storage structure then specifically comprises:
    merging the user operation data of the multiple moments into the user operation data of one moment, wherein the one moment contains the multiple moments;
    saving the merged user operation data as the multi-time sliding-window storage structure.
  7. The method according to any one of claims 1 to 3, characterized in that the method further comprises:
    receiving a read request, the read request being used to request reading of the user operation data of a certain user operation;
    obtaining, according to the read request, the user operation data related to the certain user operation from the multi-time sliding-window storage structure.
  8. A data storage system, characterized by comprising:
    a data acquisition unit, configured to obtain user-behavior data, the user-behavior data including user operation data and application-related data;
    an adjustment unit, configured to adjust, by dimension, the user-behavior data obtained by the data acquisition unit, so that the adjusted user-behavior data includes the user operation data corresponding to each of multiple groups of dimensions, the dimension information being included in the application-related data;
    a storage unit, configured to save the user-behavior data adjusted by the adjustment unit as a multi-time sliding-window storage structure, the multi-time sliding-window storage structure including the user operation data, within a time window, corresponding to each of the multiple groups of dimensions, and the user operation data including a user behavior identifier;
    wherein the multi-time sliding-window storage structure includes multiple column families, each column family includes multiple pairs of keyword and column value, each column value includes multiple pairs of index keyword and time slot, and the data stored in each time slot includes multiple pairs of time slice and index number; the keyword in the column family is the dimension information, or each column family corresponds to one dimension;
    or the multi-time sliding-window storage structure includes multiple column families, each column family includes multiple pairs of index keyword and time slot, each column family corresponds to one dimension, and the data stored in each time slot includes multiple pairs of time slice and index number;
    wherein the index keyword is a user behavior identifier.
  9. The system according to claim 8, characterized in that
    the user operation data includes: a user behavior identifier, the time at which the user operation occurred, and a quantity;
    the dimension information includes at least one of the following: recommended-data type, recommended-data-set type, recommendation position, recommended-data identifier, and the application type and application identifier of the application described by the recommended data.
  10. The system according to claim 8, characterized in that, in the multi-time sliding-window storage structure, any group of dimensions among the multiple groups of dimensions corresponds to the user operation information of at least one user operation, and each of the at least one user operation corresponds to one time window.
  11. The system according to any one of claims 8 to 10, characterized in that
    the adjustment unit is specifically configured to, when one group of user-behavior data in the user-behavior data includes the information of n dimensions and user operation data, n being a natural number greater than 1, adjust the one group of user-behavior data into m groups of sub-data, wherein any group of sub-data includes the user operation data included in the one group of user-behavior data and the information of at least one of the n dimensions, m is a natural number less than or equal to p, and p is the sum of the numbers of combinations obtained by taking from the n dimensions, in turn, each natural number of dimensions less than or equal to n.
  12. The system according to any one of claims 8 to 10, characterized in that
    the storage unit is specifically configured to use the user operation data corresponding to each of the multiple groups of dimensions included in the adjusted user-behavior data to replace, respectively, the user operation data in the stored multi-time sliding-window storage structure that lies outside the time window corresponding to each of the multiple groups of dimensions.
  13. The system according to any one of claims 8 to 10, characterized in that
    the storage unit is specifically configured to, when the user operation data corresponding to a certain group of dimensions among the multiple groups of dimensions includes the user operation data of multiple moments, merge the user operation data of the multiple moments into the user operation data of one moment, wherein the one moment contains the multiple moments, and store the merged user operation data into the multi-time sliding-window storage structure.
  14. The system according to any one of claims 8 to 10, characterized in that the system further comprises:
    a read request unit, configured to receive a read request, the read request being used to request reading of the user operation data of a certain user operation;
    a request acquisition unit, configured to obtain, according to the read request received by the read request unit, the user operation data related to the certain user operation from the multi-time sliding-window storage structure.
CN201510337071.5A 2015-06-17 2015-06-17 A kind of date storage method and system Active CN104915431B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510337071.5A CN104915431B (en) 2015-06-17 2015-06-17 A kind of date storage method and system

Publications (2)

Publication Number Publication Date
CN104915431A CN104915431A (en) 2015-09-16
CN104915431B true CN104915431B (en) 2018-01-16

Family

ID=54084494

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510337071.5A Active CN104915431B (en) 2015-06-17 2015-06-17 A kind of date storage method and system

Country Status (1)

Country Link
CN (1) CN104915431B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105184620A (en) * 2015-10-29 2015-12-23 北京恒麟信息科技有限公司 Real-time internet advertisement bidding system based on multi-dimensional data
CN107798041B (en) * 2017-06-21 2020-02-14 平安科技(深圳)有限公司 Policy data storage method and device and terminal equipment
CN107704526A (en) * 2017-09-15 2018-02-16 平安科技(深圳)有限公司 Storage method, device, computer equipment and the storage medium of data
CN108268588B (en) * 2017-11-29 2021-07-23 阿里巴巴(中国)有限公司 Advertisement data summarizing and inquiring method and device
CN113077292A (en) * 2021-04-20 2021-07-06 北京沃东天骏信息技术有限公司 User classification method and device, storage medium and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8548991B1 (en) * 2006-09-29 2013-10-01 Google Inc. Personalized browsing activity displays
CN104239351A (en) * 2013-06-20 2014-12-24 阿里巴巴集团控股有限公司 User behavior machine learning model training method and device
CN104572726A (en) * 2013-10-22 2015-04-29 北京品众互动网络营销技术有限公司 Advertisement analysis method
CN104700289A (en) * 2015-03-17 2015-06-10 中国联合网络通信集团有限公司 Advertising method and device

Also Published As

Publication number Publication date
CN104915431A (en) 2015-09-16

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant