CN104915431B - Data storage method and system - Google Patents
Data storage method and system
- Publication number: CN104915431B
- Application number: CN201510337071.5A
- Authority: CN (China)
- Prior art keywords: data, user, dimension, user behavior, operation data
- Legal status: Active
Classifications
- G06F16/22 - Indexing; Data structures therefor; Storage structures
- G06F16/283 - Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
- G06Q30/0254 - Targeted advertisements based on statistics
Abstract
Embodiments of the invention disclose a data storage method and system, applied to the technical field of information data processing. In the embodiments, when collecting statistics on user behavior, the recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple dimension groups, and then store that data in a multi-dimension, multi-time-window structure. Because the retention time required for the user operation data of each dimension group may differ, this method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving storage space to the greatest extent.
Description
Technical field
The present invention relates to the technical field of information data processing, and in particular to a data storage method and system.
Background
In an existing recommendation system (such as an advertising system or a news recommendation system), a merchant can use the recommendation system to send the data to be recommended (such as advertisement data or news data) to user terminals, in a targeted or untargeted manner, in order to promote a product or convey some information. The recommendation system needs to count, in real time, the operations that user terminals perform on the recommended data they receive, that is, user behavior.
In the prior art, when collecting statistics on user behavior, a recommendation system mainly stores the user behavior data of each user over a fixed time period and with fixed dimensions.
Summary of the invention
Embodiments of the invention provide a data storage method and system that store user behavior data in a multi-dimension, multi-time-window structure.
An embodiment of the invention provides a data storage method, including:
Acquiring user behavior data, where the user behavior data includes user operation data and application-related data;
Adjusting the user behavior data according to dimension, so that the adjusted user behavior data includes user operation data corresponding to each of multiple dimension groups, where the dimension information is contained in the application-related data;
Storing the adjusted user behavior data in a multi-time sliding window storage structure, where the multi-time sliding window storage structure includes the user operation data, within a time window, corresponding to each of the multiple dimension groups.
An embodiment of the invention also provides a data storage system, including:
A data acquisition unit, configured to acquire user behavior data, where the user behavior data includes user operation data and application-related data;
An adjustment unit, configured to adjust, according to dimension, the user behavior data acquired by the data acquisition unit, so that the adjusted user behavior data includes user operation data corresponding to each of multiple dimension groups, where the dimension information is contained in the application-related data;
A storage unit, configured to store the user behavior data adjusted by the adjustment unit in a multi-time sliding window storage structure, where the multi-time sliding window storage structure includes the user operation data, within a time window, corresponding to each of the multiple dimension groups.
In the embodiments of the invention, when collecting statistics on user behavior, the recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple dimension groups, and then store that data in a multi-dimension, multi-time-window structure. Because the retention time required for the user operation data of each dimension group may differ, the method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving storage space to the greatest extent.
Brief description of the drawings
To describe the technical solutions in the embodiments of the invention or in the prior art more clearly, the accompanying drawings required for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of a data storage method provided by an embodiment of the invention;
Fig. 2 is a schematic structural diagram of a data storage system provided by an embodiment of the invention;
Fig. 3 is a schematic structural diagram of another data storage system provided by an embodiment of the invention;
Fig. 4 is a schematic structural diagram of a recommendation system provided by an embodiment of the invention;
Fig. 5 is a schematic structural diagram of the advertising system provided in an application example of the invention;
Fig. 6 is a schematic structural diagram of the multi-time sliding window storage structure in an embodiment of the invention;
Fig. 7 is a schematic diagram of one multi-time sliding window storage structure in an application example of the invention;
Fig. 8 is a schematic diagram of another multi-time sliding window storage structure in an application example of the invention;
Fig. 9 is a schematic diagram of yet another multi-time sliding window storage structure in an application example of the invention.
Detailed description
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the invention.
The terms "first", "second", "third", "fourth" and the like (if present) in the description, the claims, and the accompanying drawings are used to distinguish similar objects, not to describe a specific order or sequence. Data described in this way are interchangeable where appropriate, so that the embodiments described here can be implemented in orders other than those illustrated or described. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, and may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
An embodiment of the invention provides a data storage method, mainly used when a recommendation system, such as an advertising system or a news recommendation system, collects statistics on user behavior and stores the user behavior data counted for each user terminal. The method is performed by the recommendation system; its flow chart is shown in Fig. 1 and includes:
Step 101: acquire user behavior data, which includes user operation data and application-related data.
Here, user behavior refers to an operation performed, through the user terminal used by the user, on the recommended data sent by the recommendation system, such as an exposure (that is, choosing to view the recommended data), a like, or a favorite. If the recommendation system is an advertising system, the recommended data can be advertisement data for a product; if it is a news recommendation system, the recommended data can be news data, and so on. After the recommendation system sends recommended data to a user terminal, it can actively fetch the user behavior data from the user terminal, periodically or at irregular intervals, or the user terminal can actively report the user behavior data to the recommendation system.
Specifically, the acquired user behavior data can include two kinds of data: user operation data, which relates to the operation itself, and data related to the specific application. The user operation data can include a user behavior identifier (such as a user behavior number), the moment T1 at which the user operation occurred, and a quantity, and can also include a user identifier. The application-related data can include the recommendation position (which indicates where the recommended data is placed, such as the bottom or top of the user interface), the recommended data identifier (which uniquely identifies one piece of recommended data), and the application identifier and application type of the application described by the recommended data. In this way, each moment can correspond to one group of user behavior data.
Step 102: adjust the user behavior data according to dimension, so that the adjusted user behavior data includes user operation data corresponding to each of multiple dimension groups, where the dimension information is contained in the application-related data. The adjustment rearranges data already present in the user behavior data acquired in step 101, so every item in the adjusted user behavior data is contained in the data acquired in step 101. Any one of the dimension groups can include one or more dimensions.
The dimension information can include at least one of the following: recommended data type, recommended data group type, recommendation position, recommended data identifier, and the application type and application identifier of the application described by the recommended data.
Specifically, if a group of user behavior data acquired in step 101 for a given moment includes information on n dimensions (n is a natural number greater than 1), such as the recommendation position and the recommended data identifier, together with the user operation data of that moment, then the adjustment can turn this group into m groups of sub-data. Each group of sub-data includes the user operation data of that moment and the information of at least one of the n dimensions. Here m is a natural number less than or equal to p, and p is the sum, over every natural number k less than or equal to n, of the number of combinations of k items taken from n, that is, p = 2^n - 1. For example, if n is 2, m is a natural number less than or equal to 3; if n is 3, m is a natural number less than or equal to 7. The value of m depends on which user behavior statistics the recommendation system actually needs.
For example, suppose a group of user behavior data includes: user identifier, recommendation position, recommended data identifier, user behavior identifier, the moment at which the user behavior occurred, and quantity. The recommendation position and the recommended data identifier are dimension information, and the rest is user operation information. This group can then be adjusted into three groups of sub-data. The first group includes the user identifier, recommended data identifier, user behavior identifier, moment of the user behavior, and quantity. The second group includes the user identifier, recommendation position, user behavior identifier, moment of the user behavior, and quantity. The third group includes the user identifier, recommendation position, recommended data identifier, user behavior identifier, moment of the user behavior, and quantity.
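The adjustment in step 102 can be sketched as enumerating the non-empty subsets of the dimension fields. The following Python sketch is only an illustration under stated assumptions: the field names (`position`, `item_id`, and so on), the function `expand_by_dimension`, and the optional `wanted_groups` filter are hypothetical and do not come from the patent.

```python
from itertools import combinations


def expand_by_dimension(record, dimension_keys, wanted_groups=None):
    """Split one behavior record into one sub-record per dimension group."""
    # Everything that is not a dimension is treated as user operation data.
    operation = {k: v for k, v in record.items() if k not in dimension_keys}
    # All non-empty subsets of the n dimensions: at most p = 2**n - 1 groups.
    groups = [
        combo
        for size in range(1, len(dimension_keys) + 1)
        for combo in combinations(dimension_keys, size)
    ]
    if wanted_groups is not None:
        # The system may only need m <= p of the groups (hypothetical filter).
        groups = [g for g in groups if g in wanted_groups]
    return [{**operation, **{k: record[k] for k in group}} for group in groups]


# Hypothetical record with two dimensions: position and item_id.
record = {
    "user_id": "u1",
    "position": "top",
    "item_id": "ad42",
    "behavior": "like",
    "time": "2015-06-17 12:25:04",
    "count": 1,
}
subrecords = expand_by_dimension(record, ("position", "item_id"))
```

With two dimension fields this yields three sub-records, matching the three groups of sub-data in the example above; passing `wanted_groups` would model the case where m is smaller than p.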
If a group of user behavior data acquired in step 101 includes the information of only one dimension, that group is kept unchanged.
Step 103: store the adjusted user behavior data obtained in step 102 in a multi-time sliding window storage structure, which includes the user operation data, within a time window, corresponding to each of the multiple dimension groups.
The time windows corresponding to the dimension groups can be the same or different. Two time windows are the same when both their widths and their start and end times are the same. Two time windows are different when both the width and the start and end times differ; when the width is the same but the start and end times differ (for example, both windows are N days wide, but one covers the most recent N days and the other covers N days counted from some given time); or when the start and end times coincide but the widths differ.
It can be seen that, in this embodiment, when collecting statistics on user behavior, the recommendation system can adjust the acquired user behavior data into user operation data corresponding to each of multiple dimension groups, and then store that data in a multi-dimension, multi-time-window structure. Because the retention time required for the user operation data of each dimension group may differ, the method allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving storage space to the greatest extent.
It should be noted that, in a specific embodiment, any dimension group in the multi-time sliding window storage structure can correspond to the user operation information of at least one kind of user operation. Each kind of user operation corresponds to its own time window, and the time windows of different kinds of user operation can be the same or different. In this way, the different user operations under each dimension group can also be stored together according to their respective time windows, meeting the requirements of each kind of user operation with as little storage space as possible. For example, if the data of favorite operations at a given recommendation position must be kept for a longer period, the time window for that operation is longer; if the data of like operations at the same recommendation position must be kept for a shorter period, the time window for that operation is shorter.
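The idea of a separate time window per dimension group and per kind of user operation can be illustrated with a small lookup table. This is a hedged sketch only: the table `WINDOWS`, its keys, the chosen widths, and the function `in_window` are assumptions made for illustration, and it models only the "most recent N days" kind of window.

```python
from datetime import datetime, timedelta

# Hypothetical retention table: (dimension group, user operation) -> window width.
WINDOWS = {
    ("position", "favorite"): timedelta(days=30),
    ("position", "like"): timedelta(days=7),
}


def in_window(dimension_group, behavior, event_time, now):
    """Return True if the event still lies inside its own 'most recent N days' window."""
    width = WINDOWS[(dimension_group, behavior)]
    return now - width <= event_time <= now


now = datetime(2015, 6, 17, 12, 0, 0)
ten_days_ago = now - timedelta(days=10)
favorite_kept = in_window("position", "favorite", ten_days_ago, now)
like_kept = in_window("position", "like", ten_days_ago, now)
```

An event from ten days ago is still retained for the favorite operation but not for the like operation, even though both belong to the same dimension group.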
In a specific embodiment, for a group of user behavior data acquired by the recommendation system, when performing step 103 the recommendation system uses the user operation data of each dimension group in the adjusted user behavior data obtained in step 102 to replace, in the already stored multi-time sliding window storage structure, the user operation data that falls outside the time window of the corresponding dimension group.
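The replacement of expired data by newly arrived data can be sketched as follows. This is a minimal sketch, assuming that a stored slot maps the start of a time slice to a quantity and that the window is the most recent `window` of time; the function `update_slot` and this layout are hypothetical, not the patent's stated implementation.

```python
from datetime import datetime, timedelta


def update_slot(slot, new_counts, window, now):
    """Return a slot in which expired time slices are replaced by new ones.

    slot and new_counts map a time-slice start (datetime) to a quantity.
    """
    cutoff = now - window
    # Keep only the slices that are still inside the time window ...
    kept = {t: c for t, c in slot.items() if t >= cutoff}
    # ... and let the newly arrived slices take the place of the expired ones.
    for t, c in new_counts.items():
        if t >= cutoff:
            kept[t] = kept.get(t, 0) + c
    return kept


now = datetime(2015, 6, 17, 12, 30)
old_slot = {datetime(2015, 6, 9, 8, 0): 4, datetime(2015, 6, 16, 9, 15): 2}
new_slot = update_slot(
    old_slot, {datetime(2015, 6, 17, 12, 25): 4}, timedelta(days=7), now
)
```

Because expired slices are dropped whenever new data is written, the stored size of each slot stays bounded by the width of its window.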
In another specific embodiment, to further reduce storage space, when performing step 103 the recommendation system can first merge the adjusted user behavior data and then store it. Specifically:
If, in the adjusted user behavior data from step 102, the user operation data of some dimension group includes user operation data at multiple moments, the recommendation system can merge the user operation data at those moments into user operation data at a single moment, where the single moment covers the multiple moments. The merged user operation data is then stored in the multi-time sliding window storage structure.
For example, if some user operation data includes user operation data at four moments, namely 12:25:04, 12:25:20, 12:25:48, and 12:25:50, the recommendation system can merge them into user operation data at 12:25.
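The merge described above can be sketched by truncating each timestamp to the minute and summing the quantities. The date, the per-event quantity of 1, and the function name `merge_to_minute` are assumptions for illustration; only the four times of day come from the example.

```python
from collections import Counter
from datetime import datetime


def merge_to_minute(events):
    """Sum (timestamp, quantity) pairs that fall within the same minute."""
    merged = Counter()
    for moment, quantity in events:
        # Dropping seconds makes every moment in the same minute share one key.
        merged[moment.replace(second=0, microsecond=0)] += quantity
    return dict(merged)


# The four moments from the example; the date and quantities are hypothetical.
events = [
    (datetime(2015, 6, 17, 12, 25, 4), 1),
    (datetime(2015, 6, 17, 12, 25, 20), 1),
    (datetime(2015, 6, 17, 12, 25, 48), 1),
    (datetime(2015, 6, 17, 12, 25, 50), 1),
]
merged = merge_to_minute(events)
```

Four stored entries collapse into one entry keyed by 12:25, which is where the storage saving comes from.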
It should also be noted that steps 101 to 103 describe how the recommendation system stores user behavior data. In other specific embodiments, after step 103 has been performed, a user can use another client to actively request the data stored in the system. Specifically, for the recommendation system:
When the recommendation system receives a read request for the user operation data of a particular user operation, it can obtain, according to the read request, the user operation data related to that user operation from the multi-time sliding window storage structure. The result can include, from the user behavior data of each user, the user operation data related to that user operation.
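Handling such a read request can be sketched as a filter over the stored structure. The nested layout used here (user, then dimension information, then user operation, then slot) and the function `read_operation` are assumptions chosen for illustration; the source does not specify a read interface.

```python
def read_operation(store, behavior):
    """Collect, for each user, the stored data of one kind of user operation."""
    result = {}
    for user_id, dimensions in store.items():
        for dimension_info, operations in dimensions.items():
            if behavior in operations:
                result.setdefault(user_id, {})[dimension_info] = operations[behavior]
    return result


# Hypothetical stored data: user -> dimension info -> operation -> slot.
store = {
    "u1": {
        ("position=top",): {"exposure": {"12:25": 4}, "like": {"12:25": 1}},
        ("item=ad42",): {"exposure": {"12:25": 2}},
    },
    "u2": {("position=top",): {"like": {"12:30": 3}}},
}
likes = read_operation(store, "like")
```

The result keeps the per-user, per-dimension grouping, so a caller can still tell under which dimension group each count was recorded.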
An embodiment of the invention also provides a data storage system, whose schematic structural diagram is shown in Fig. 2, including:
A data acquisition unit 10, configured to acquire user behavior data, where the user behavior data includes user operation data and application-related data. Specifically, the data acquisition unit 10 can actively fetch the user behavior data from user terminals, periodically or at irregular intervals, or receive user behavior data that user terminals actively report to the data storage system.
An adjustment unit 11, configured to adjust, according to dimension, the user behavior data acquired by the data acquisition unit 10, so that the adjusted user behavior data includes user operation data corresponding to each of multiple dimension groups, where the dimension information is contained in the application-related data. The user operation data includes a user behavior identifier, the moment at which the user operation occurred, and a quantity, and can also include a user identifier. The dimension information can include at least one of the following: recommended data type, recommended data group type, recommendation position, recommended data identifier, and the application type and application identifier of the application described by the recommended data.
Specifically, if a group of user behavior data acquired by the data acquisition unit 10 includes information on n dimensions and user operation data, where n is a natural number greater than 1, the adjustment unit 11 is specifically configured to adjust that group into m groups of sub-data. Each group of sub-data includes the user operation data contained in that group of user behavior data and the information of at least one of the n dimensions, where m is a natural number less than or equal to p, and p is the sum, over every natural number k less than or equal to n, of the number of combinations of k items taken from n.
If a group of user behavior data acquired by the data acquisition unit 10 includes the information of only one dimension, the adjustment unit 11 keeps that group unchanged.
A storage unit 12, configured to store the user behavior data adjusted by the adjustment unit 11 in a multi-time sliding window storage structure, where the multi-time sliding window storage structure includes the user operation data, within a time window, corresponding to each of the multiple dimension groups.
Specifically, any dimension group in the multi-time sliding window storage structure corresponds to the user operation information of at least one kind of user operation, and each kind of user operation corresponds to its own time window.
It can be seen that, when the data storage system of this embodiment collects statistics on user behavior, the adjustment unit 11 can adjust the acquired user behavior data into user operation data corresponding to each of multiple dimension groups, and the storage unit 12 then stores that data in a multi-dimension, multi-time-window structure. Because the retention time required for the user operation data of each dimension group may differ, the system allows user behavior data counted under different dimensions and different time windows to be stored together flexibly rather than separately, saving storage space to the greatest extent.
In a specific embodiment, if the system has already stored data for a group of user behavior, the storage unit 12 of the data storage system is specifically configured to use the user operation data of each dimension group in the adjusted user behavior data to replace, in the already stored multi-time sliding window storage structure, the user operation data that falls outside the time window of the corresponding dimension group.
In another specific embodiment, to further reduce storage space, when the user operation data of some dimension group includes user operation data at multiple moments, the storage unit 12 is specifically configured to merge the user operation data at those moments into user operation data at a single moment, where the single moment covers the multiple moments, and to store the merged user operation data in the multi-time sliding window storage structure.
Referring to Fig. 3, in a specific embodiment the data storage system can include, in addition to the structure shown in Fig. 2, a read request unit 13 and a request acquisition unit 14, where:
The read request unit 13 is configured to receive a read request for the user operation data of a particular user operation;
The request acquisition unit 14 is configured to obtain, according to the read request received by the read request unit 13, the user operation data related to that user operation from the multi-time sliding window storage structure.
An embodiment of the invention also provides a recommendation system, whose schematic structural diagram is shown in Fig. 4. The recommendation system can vary considerably depending on configuration or performance, and can include one or more central processing units (CPU) 20 (for example, one or more processors), a memory 21, and one or more storage media 22 (such as one or more mass storage devices) that store application programs 221 or data 222. The memory 21 and the storage medium 22 can provide transient or persistent storage. A program stored in the storage medium 22 can include one or more modules (not marked in the figure), and each module can include a series of instruction operations for the recommendation system. Further, the central processing unit 20 can be configured to communicate with the storage medium 22 and to execute, on the recommendation system, the series of instruction operations in the storage medium 22.
The recommendation system can also include one or more power supplies 23, one or more wired or wireless network interfaces 24, one or more input/output interfaces 25, and/or one or more operating systems 223, such as Windows Server™, Mac OS X™, Unix™, Linux™, or FreeBSD™.
The steps performed by the recommendation system in the method embodiments above can be based on the recommendation system structure shown in Fig. 4.
The method of the embodiments is illustrated below with a specific application example. In this example, the recommendation system to which the data storage method is applied is an advertising system, and the recommended data it sends to user terminals is advertisement data. The advertising system mainly delivers advertisement data to WeChat Moments and counts the user behavior data of WeChat user terminals with respect to the advertisement data. Specifically, it counts, under the advertisement position dimension, user behavior data such as a user's advertisement exposures (last 7 days) and likes and "not interested" operations (last 30 days); and, under the dimension of a particular advertisement, user behavior data such as the user's advertisement exposures (last 7 days) and likes and "not interested" operations (last 30 days).
In this example the advertising system is implemented with the structure shown in Fig. 5, which includes a log receiving module, multiple data adjustment modules, multiple sliding window statistics modules, and a sliding window storage module. The log receiving module can be a Spout component, the data adjustment module can be a CombineBolt component, and the sliding window storage module can be a WindowBolt component. The advertising system can further include a user-facing interface module, that is, a data application module. In detail:
1. The log receiving module mainly receives user behavior data and distributes the user behavior data of each user to the data adjustment modules in a random manner. The transmitted format can be: [user identifier, advertisement position, advertisement identifier, user behavior number, moment T1 at which the user operation occurred, quantity]. The user behavior can be an exposure, a like, a "not interested" operation, and so on. The advertisement position here is the recommendation position described above, and the advertisement identifier is the recommended data identifier described above.
2. The data adjustment module adjusts the received user behavior data according to dimension, so that the adjusted user behavior data includes user operation data corresponding to multiple dimension groups. It then performs one merge and sends the merged data to the sliding window statistics module. The purpose of the merge is to reduce the amount of data output to the sliding window statistics module and to reduce the final storage space.
When adjusting the user behavior data, the data adjustment module turns the group of data for each moment into three groups of sub-data: [user identifier, advertisement identifier, user behavior number, moment T1 at which the user operation occurred, quantity], [user identifier, advertisement position, user behavior number, moment T1 at which the user operation occurred, quantity], and [user identifier, advertisement position, advertisement identifier, user behavior number, moment T1 at which the user operation occurred, quantity].
When merging data, the data adjustment module mainly merges the user operation data at multiple moments into user operation data at a single moment, producing [user identifier, dimension information, user behavior number, moment T2, merged quantity], and then sends the merged user behavior data to the sliding window statistics module. The merged quantity is the sum of the quantities at the multiple moments before merging. Moment T2 covers the multiple T1 moments: for example, if T1 is precise to the second, T2 is precise to the minute. The precision of T2 depends on the granularity of the time window of each dimension; if the time window granularity is 1 day or 1 hour, T2 is precise to the minute.
It should be noted that each data adjustment module mainly processes the user behavior data of a single user.
3. The sliding window statistics module mainly reads data from the sliding window storage module and integrates the data it has read with the data sent by the data adjustment module. Specifically, it uses the user operation data of each dimension group in the adjusted user behavior data to replace, in the data it has read, the user operation data that falls outside the time window of the corresponding dimension group. It then writes the integrated data back to the sliding window storage module.
In a specific implementation, before integrating the data, the sliding window statistics module may also merge the data sent by different data adjustment modules, that is, merge the user behavior data of different users.
4th, the sliding window storage module stores the user behavior data in the storage structure of multiple time sliding windows, specifically as pairs of storage key (key) and storage value (value), where the storage key is the user identifier and the storage value comprises the dimension information and the user operation data. In this embodiment, therefore, each row holds the user behavior data corresponding to one user.
As shown in Figures 6 and 7, the sliding window storage module stores multiple column families, each holding the user behavior data corresponding to one user. Each column family comprises multiple pairs of column key and column value, where the column key is the dimension information (the ad slot and/or ad identifier in Figure 7). The data of one user under different dimensions can thus be stored together, which saves storage space. Further, each column value comprises multiple pairs of index key and time slot, where each index key is a user behavior identifier (for example, user exposure and user like in Figure 7). The user operation information for different user operations under the same dimension can thus be stored together. The time windows of the data stored in different time slots may be the same or different, which meets the differing time window requirements of different user operations: for example, user exposure data only needs to be kept for the most recent 7 days, while user like data needs to be kept for 30 days.
Each time slot stores the data within the time window of the corresponding user operation, specifically multiple pairs of time slice and index value. Time slice 1 is the time slice that slides: for example, if the user exposure data of the most recent 7 days is required, the sliding granularity is 1 day, and the current date is the 7th, then the time slot stores the data of the 1st through the 7th, and time slice 1 is the time of the 1st.
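The four layers described above (column family, column key, index key, time slice) can be pictured as nested mappings. The sketch below is an assumption-laden illustration, not the patent's storage format: the helper names `add_count` and `oldest_day_kept`, the tuple encoding of dimension information, and the `WINDOW_DAYS` table are all hypothetical.

```python
from datetime import date, timedelta

# Hypothetical per-behavior time windows, taken from the example in the text.
WINDOW_DAYS = {"exposure": 7, "like": 30}

def add_count(store, user_id, dimension, behavior, day, quantity):
    """Add a count at store[user][dimension][behavior][day], creating levels as needed."""
    column_family = store.setdefault(user_id, {})            # layer 1: one user
    column_value = column_family.setdefault(dimension, {})   # layer 2: dimension information
    time_slot = column_value.setdefault(behavior, {})        # layer 3: user behavior identifier
    time_slot[day] = time_slot.get(day, 0) + quantity        # layer 4: time slice -> index value

def oldest_day_kept(behavior, today):
    """First day still inside the behavior's window, with a 1-day sliding granularity."""
    return today - timedelta(days=WINDOW_DAYS[behavior] - 1)

store = {}
add_count(store, "user_1", ("slot_A",), "exposure", date(2015, 6, 1), 3)
add_count(store, "user_1", ("slot_A",), "exposure", date(2015, 6, 7), 1)
add_count(store, "user_1", ("slot_A",), "like", date(2015, 5, 20), 1)
add_count(store, "user_1", ("slot_A", "ad_9"), "exposure", date(2015, 6, 7), 1)
```

With the current date on the 7th, `oldest_day_kept("exposure", date(2015, 6, 7))` is the 1st, so time slice 1 of the exposure slot is the 1st, as in the example in the text. The like slot under the same dimension keeps a longer window without needing a separate store.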
It will be understood that in the above embodiment the ad system collects the user behavior data corresponding to each user. In other specific embodiments, if the ad system is to collect the user behavior data on an ad slot, each of the column families in the storage structure of multiple time sliding windows in the sliding window storage module holds the user behavior data corresponding to one ad slot. In this case, the key of each column family is the ad slot identifier, and the value corresponding to the key may be as shown in Figure 8: among the multiple pairs of column key and column value in each column family, the column key is a user identifier (such as user identifiers 1 and 2 in Figure 8); among the multiple pairs of index key and time slot in each column value, the index key is a user behavior identifier (such as user exposure and user like in Figure 8), and the time slot stores the index values within the corresponding time window. In this case, the dimension information may be the ad slot.
Further, the storage structures of multiple time sliding windows shown in Figures 6 to 8 have 4 layers. In other specific embodiments, the storage structure may have fewer than 4 layers. As shown in Figure 9, if the ad system is to collect the user behavior data on an ad slot, each of the column families in the storage structure of multiple time sliding windows in the sliding window storage module holds the user behavior data corresponding to one ad slot. In this case, each column family comprises multiple pairs of index key and time slot, where the index key is a user behavior identifier (for example, user exposure and user like in Figure 9), and the time slot stores the index values within the corresponding time window. In this case, the dimension information is the ad slot.
The storage structure of multiple time sliding windows may also hold content other than that shown in Figures 7, 8 and 9. The specific content is mainly determined by the data that the ad system collects, and is not described further here.
5th, a user can request, through the data application module, to read the data stored in the sliding window storage module, for example to read the user operation information of a certain user operation.
In this application example, the user behavior data collected under different dimensions and different time windows can be flexibly stored together rather than separately, which saves storage space to the greatest extent.
It should be noted that the above method can also be applied to a recommendation system for news data, or to other applications that recommend data to user terminals, and achieves the same effect; this is not described further here.
A person of ordinary skill in the art will understand that all or part of the steps of the methods in the above embodiments may be carried out by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, which may include read-only memory (ROM), random access memory (RAM), a magnetic disk, an optical disc, and the like.
The data storage method and system provided by the embodiments of the present invention have been described in detail above. Specific examples are used herein to explain the principle and embodiments of the invention, and the description of the above embodiments is intended only to aid understanding of the method of the invention and its core idea. Meanwhile, a person of ordinary skill in the art may, following the idea of the invention, make changes to the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the invention.
Claims (14)
- 1. A data storage method, characterized by comprising: obtaining user behavior data, the user behavior data comprising user operation data and application-related data; adjusting the user behavior data according to dimension, so that the adjusted user behavior data comprises user operation data respectively corresponding to multiple groups of dimensions, the information of the dimensions being contained in the application-related data; and saving the adjusted user behavior data into a storage structure of multiple time sliding windows, the storage structure of multiple time sliding windows comprising: user operation data, within a time window, respectively corresponding to the multiple groups of dimensions; the user operation data comprising a user behavior identifier; wherein the storage structure of multiple time sliding windows comprises multiple column families, each column family comprises multiple pairs of key and column value, each column value comprises multiple pairs of index key and time slot, and the data stored in each time slot comprises multiple pairs of time slice and index value; the key in the column family is the information of the dimension, or each column family corresponds to one dimension; or the storage structure of multiple time sliding windows comprises multiple column families, each column family comprises multiple pairs of index key and time slot, each column family corresponds to one dimension, and the data stored in each time slot comprises multiple pairs of time slice and index value; wherein the index key is a user behavior identifier.
- 2. The method according to claim 1, characterized in that the user operation data comprises: a user behavior identifier, the time at which the user operation occurred, and a quantity; and the information of the dimension comprises at least one of the following: recommended data type, recommended data set type, recommendation position, recommended data identifier, and the application type and application identifier of the application described by the recommended data.
- 3. The method according to claim 1, characterized in that, in the storage structure of multiple time sliding windows, any group of dimensions among the multiple groups of dimensions corresponds to the user operation information of at least one user operation, and the at least one user operation respectively corresponds to one time window.
- 4. The method according to any one of claims 1 to 3, characterized in that, in the user behavior data, one group of user behavior data comprises the information of n dimensions and user operation data, n being a natural number greater than 1; and the adjusting of the user behavior data according to dimension, so that the adjusted user behavior data comprises user operation data respectively corresponding to multiple groups of dimensions, specifically comprises: adjusting the one group of user behavior data into m groups of sub-data, wherein any group of sub-data comprises the user operation data contained in the one group of user behavior data and the information of at least one of the n dimensions, wherein m is a natural number less than or equal to p, and p is the sum of the numbers of combinations obtained by selecting from the n dimensions each natural number of dimensions less than or equal to n.
- 5. The method according to any one of claims 1 to 3, characterized in that the saving of the adjusted user behavior data into the storage structure of multiple time sliding windows specifically comprises: using the user operation data respectively corresponding to the multiple groups of dimensions in the adjusted user behavior data to respectively replace the user operation data, in the already stored storage structure of multiple time sliding windows, that falls outside the time window respectively corresponding to the multiple groups of dimensions.
- 6. The method according to any one of claims 1 to 3, characterized in that the user operation data corresponding to a certain group of dimensions among the multiple groups of dimensions comprises user operation data of multiple moments; and the saving of the adjusted user behavior data into the storage structure of multiple time sliding windows specifically comprises: merging the user operation data of the multiple moments into the user operation data of one moment, wherein the one moment contains the multiple moments; and saving the merged user operation data into the storage structure of multiple time sliding windows.
- 7. The method according to any one of claims 1 to 3, characterized in that the method further comprises: receiving a read request, the read request being used to request reading of the user operation data of a certain user operation; and obtaining, according to the read request, the user operation data related to the certain user operation from the storage structure of multiple time sliding windows.
- 8. A data storage system, characterized by comprising: a data acquisition unit, configured to obtain user behavior data, the user behavior data comprising user operation data and application-related data; an adjustment unit, configured to adjust, according to dimension, the user behavior data obtained by the data acquisition unit, so that the adjusted user behavior data comprises user operation data respectively corresponding to multiple groups of dimensions, the information of the dimensions being contained in the application-related data; and a storage unit, configured to save the user behavior data adjusted by the adjustment unit into a storage structure of multiple time sliding windows, the storage structure of multiple time sliding windows comprising: user operation data, within a time window, respectively corresponding to the multiple groups of dimensions, the user operation data comprising a user behavior identifier; wherein the storage structure of multiple time sliding windows comprises multiple column families, each column family comprises multiple pairs of key and column value, each column value comprises multiple pairs of index key and time slot, and the data stored in each time slot comprises multiple pairs of time slice and index value; the key in the column family is the information of the dimension, or each column family corresponds to one dimension; or the storage structure of multiple time sliding windows comprises multiple column families, each column family comprises multiple pairs of index key and time slot, each column family corresponds to one dimension, and the data stored in each time slot comprises multiple pairs of time slice and index value; wherein the index key is a user behavior identifier.
- 9. The system according to claim 8, characterized in that the user operation data comprises: a user behavior identifier, the time at which the user operation occurred, and a quantity; and the information of the dimension comprises at least one of the following: recommended data type, recommended data set type, recommendation position, recommended data identifier, and the application type and application identifier of the application described by the recommended data.
- 10. The system according to claim 8, characterized in that, in the storage structure of multiple time sliding windows, any group of dimensions among the multiple groups of dimensions corresponds to the user operation information of at least one user operation, and the at least one user operation respectively corresponds to one time window.
- 11. The system according to any one of claims 8 to 10, characterized in that the adjustment unit is specifically configured to, when one group of user behavior data in the user behavior data comprises the information of n dimensions and user operation data, n being a natural number greater than 1, adjust the one group of user behavior data into m groups of sub-data, wherein any group of sub-data comprises the user operation data contained in the one group of user behavior data and the information of at least one of the n dimensions, wherein m is a natural number less than or equal to p, and p is the sum of the numbers of combinations obtained by selecting from the n dimensions each natural number of dimensions less than or equal to n.
- 12. The system according to any one of claims 8 to 10, characterized in that the storage unit is specifically configured to use the user operation data respectively corresponding to the multiple groups of dimensions in the adjusted user behavior data to respectively replace the user operation data, in the already stored storage structure of multiple time sliding windows, that falls outside the time window respectively corresponding to the multiple groups of dimensions.
- 13. The system according to any one of claims 8 to 10, characterized in that the storage unit is specifically configured to, when the user operation data corresponding to a certain group of dimensions among the multiple groups of dimensions comprises user operation data of multiple moments, merge the user operation data of the multiple moments into the user operation data of one moment, wherein the one moment contains the multiple moments, and save the merged user operation data into the storage structure of multiple time sliding windows.
- 14. The system according to any one of claims 8 to 10, characterized in that the system further comprises: a read request unit, configured to receive a read request, the read request being used to request reading of the user operation data of a certain user operation; and a request acquisition unit, configured to obtain, according to the read request received by the read request unit, the user operation data related to the certain user operation from the storage structure of multiple time sliding windows.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510337071.5A CN104915431B (en) | 2015-06-17 | 2015-06-17 | A kind of date storage method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104915431A CN104915431A (en) | 2015-09-16 |
CN104915431B true CN104915431B (en) | 2018-01-16 |
Family
ID=54084494
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510337071.5A Active CN104915431B (en) | 2015-06-17 | 2015-06-17 | A kind of date storage method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104915431B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105184620A (en) * | 2015-10-29 | 2015-12-23 | 北京恒麟信息科技有限公司 | Real-time internet advertisement bidding system based on multi-dimensional data |
CN107798041B (en) * | 2017-06-21 | 2020-02-14 | 平安科技(深圳)有限公司 | Policy data storage method and device and terminal equipment |
CN107704526A (en) * | 2017-09-15 | 2018-02-16 | 平安科技(深圳)有限公司 | Storage method, device, computer equipment and the storage medium of data |
CN108268588B (en) * | 2017-11-29 | 2021-07-23 | 阿里巴巴(中国)有限公司 | Advertisement data summarizing and inquiring method and device |
CN113077292A (en) * | 2021-04-20 | 2021-07-06 | 北京沃东天骏信息技术有限公司 | User classification method and device, storage medium and electronic equipment |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8548991B1 (en) * | 2006-09-29 | 2013-10-01 | Google Inc. | Personalized browsing activity displays |
CN104239351A (en) * | 2013-06-20 | 2014-12-24 | 阿里巴巴集团控股有限公司 | User behavior machine learning model training method and device |
CN104572726A (en) * | 2013-10-22 | 2015-04-29 | 北京品众互动网络营销技术有限公司 | Advertisement analysis method |
CN104700289A (en) * | 2015-03-17 | 2015-06-10 | 中国联合网络通信集团有限公司 | Advertising method and device |
Also Published As
Publication number | Publication date |
---|---|
CN104915431A (en) | 2015-09-16 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |