CN107992606A - A kind of storage format of user behavior data - Google Patents
A kind of storage format of user behavior data Download PDFInfo
- Publication number
- CN107992606A CN107992606A CN201711342612.9A CN201711342612A CN107992606A CN 107992606 A CN107992606 A CN 107992606A CN 201711342612 A CN201711342612 A CN 201711342612A CN 107992606 A CN107992606 A CN 107992606A
- Authority
- CN
- China
- Prior art keywords
- user
- storage
- behavior
- user behavior
- format
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2453—Query optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
- G06F16/2291—User-Defined Types; Storage management thereof
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a kind of storage format of user behavior data, more particularly to the data storage aspect of mobile Internet field, storage format proposed by the present invention is to store user behavior by storing the form of " people+time+behavior+object ", pass through JSON structure representation user behaviors, utilize " user ", " timestamp ", " action ", " target " defines the attribute of user respectively, time, behavior and object, the peculiar information of user behavior object is recorded by " carrier ", and the carrier uses the distributed memory system of key value patterns, user behavior can completely be expressed by such a user behavior data storage format, improve search efficiency.
Description
Technical field
The present invention relates to mobile Internet field, especially user behavior collection and field of data storage.
Background technology
As Internet technology develops, most of user behavior such as the communication of present general population, social activity, shopping, study
Moved under from line on line.And for marketing activity, " user's portrait " of one is described i.e. based on " user behavior "
The behavioural habits of user, are of great significance.It is traditional with data to describe " user behavior " when, most of methods for taking are
" user behavior " is stored, does not store object of action.Such a traditional data memory format, merely from a pen data (such as
" Zhang San's search commercial articles ") it can not completely express and understand one complete " user behavior ", increase reading and inquiry " user
The difficulty of behavior ".
The content of the invention
The present invention provides a kind of storage format of user behavior data, passes through " people+time+behavior+object " such a storage
The problems such as form is intactly expressed one " user behavior ", is solved traditional " user behavior " and is isolated storage, and search efficiency is low.
In view of the above-mentioned problems, technical scheme is as follows:
The present invention provides a kind of storage format of user behavior data, by one complete " user behavior " by with " people+
The reference format of time+behavior+object " carries out expressed intact and storage;
Wherein people represents user's id information, and the time represents the time that user behavior occurs, and behavior includes at least the clear of user
It is movable to look at, click on, input, search for, buy and evaluate etc., and object refers to operation object when user carries out behavioral activity.Such as:
" product B " is searched for during Zhang San A, wherein " Zhang San " is " people ", " during A " is " time ", and " search " is " behavior ", and " product B " is " right
As ", have complete " people, time, behavior, object ".
The storing step of the reference format is:
Beginning-storage User ID-storage behavior-storage time of the act-storage object of action-end.
The storing step of wherein reference format further includes extension storage step, judges to store whether object of action needs to extend
Storage, if desired extension storage then stores the load information of object of action, then terminates, and wherein load information is object of action
Particular attribute (such as " product B " is searched for during Zhang San A, wherein " product B " is object of action, " place of production of product B ", quantity, quality
Etc. the particular attribute that information is object of action).
The storage of the reference format follows JSON structures (a kind of text formatting independently of language):
With the attribute of " user " record user,
The time " timestamp " occurred with the behavior " action " of " event " record user and the behavior,
With the object type " type " of " target " record user behavior and the ID " id " of object;
Wherein, " user " represents the people in the reference format, and " action " represents the behavior in the reference format,
" timestamp " represents the time in the reference format, and " target " represents the object in the reference format.Such a storage
Mode can expressed intact user behavior, user behavior information will not be isolated to storage.
Wherein the load information of user behavior object is stored using Key-value patterns (a kind of distributed memory system),
Convenient storage bulk information, and be easy to inquire about.
Alternatively, since the information of user behavior is using text data, any memory, example can be stored in
Such as relevant database, K-V databases.
Beneficial effect
The present invention realizes a kind of storage format of user behavior data, and traditional storage " user is isolated compared to more existing
The form of behavior ", the present invention more intactly express one " user behavior ", and in technical elements, the present invention only need to read one
Pen data can obtain one complete " user behavior ", improve search efficiency.
Brief description of the drawings
Fig. 1 is a kind of storage format schematic diagram of user behavior data;
Fig. 2 stores operation step map for the user behavior data storing format informations.
Embodiment
The specific embodiment of the invention is described further below in conjunction with the accompanying drawings.
As shown in Figure 1, the present invention provides a kind of storage format of user behavior data, the storage format is by user behavior
Stored with the reference format of " people+time+behavior+object ", avoid isolating storage user behavior, enable user behavior complete table
Reach.
The operation step map of user behavior reference format storage as shown in Figure 2, including:
Unique ID of the handy family of " attribute " use " user " field record of user in system is recorded first;
Then " behavior " of user is recorded, is sent out with " behavior " (action) of " event " record user and the behavior
Raw " time " (timestamp);
" object " of user behavior is finally recorded, with the object type (type) and object of " target " record user behavior
ID(id)。
Such a user behavior reference format provides a kind of extension " carrier " (payload) and is used to record the user's row at the same time
For the peculiar information of object, judge whether that needs are extended storage in storage object of action, if desired extension storage then stores
The load information of object of action, then terminates, and wherein load information includes the particular attribute of object of action.And carrier uses key-
Value patterns (a kind of distributed memory system), it is convenient to store bulk information, and inquiry velocity is fast, improves search efficiency.
Illustrate embodiments of the present invention with reference to specific example:
For example, to record a kind of user behavior " Zhang San have purchased one bag of northeast 5 constant virtues rice on December 4th, 2017 ".
First record user " attribute ", with field " user " be " Zhang San " record, Zhang San be user in system only
One ID;
Then " behavior " of user is recorded, is that " purchase " records with field " action ";
" time " of behavior generation is recorded, is to record in " on December 4th, 2017 " with field " timestamp ";
" object " of user behavior is finally recorded, is that " rice " records with field " target ";
Whether have peculiar information, the object " rice " among the present embodiment is " one bag of east if judging " object " of user behavior
Northern 5 constant virtues rice ", therefore there is peculiar information to be extended storage to user behavior object, key-value moulds are used using carrier
Formula is stored, and is finally terminated.
The present embodiment carries out full storage using reference format to " people+time+behavior+object " of user behavior, avoids
Storage is isolated to user behavior, can be by inquiring about any attribute of user behavior, you can occur when needing to inquire about user behavior
Complete user behavior, such as whole user behavior occurs, and " Zhang San was at 2017 12 for query time " on December 4th, 2017 "
The moon have purchased one bag of northeast 5 constant virtues rice on the 4th ", inquiry velocity improves search efficiency soon.
According to the disclosure and teachings of the above specification, those skilled in the art in the invention can also be to above-mentioned embodiment party
Formula make a change and modification.Therefore, the invention is not limited in embodiment disclosed and described above, to the one of invention
A little modifications and changes should also be as falling into the scope of the claims of the present invention.In addition, although used in this specification
Some specific terms, but these terms are merely for convenience of description, do not limit the present invention in any way.
Claims (5)
- A kind of 1. storage format of user behavior data, it is characterised in that:By one complete " user behavior " by with " people+ The reference format of time+behavior+object " carries out expressed intact and storage;Wherein:People represents user's id information, and the time represents the time that user behavior occurs, behavior include at least the browsing of user, Click on, input, searching for, purchase and evaluation, the object are the concrete operations object of user behavior;The storing step of the reference format is:Beginning-storage User ID-storage behavior-storage time of the act-storage behavior pair As-terminate.
- 2. a kind of storage format of user behavior data according to claim 1, is characterized in that:The reference format is deposited Storage step further includes extension storage step:Judge whether that needs are extended storage in storage object of action, if desired extension storage then stores the loading of object of action Information, then terminates, and wherein load information includes the particular attribute of object of action.
- 3. a kind of storage format of user behavior data according to claim 1 or 2, is characterized in that:The reference format Storage follow JSON structures:With the attribute of " user " record user,The time " timestamp " occurred with the behavior " action " of " event " record user and the behavior,With the object type " type " of " target " record user behavior and the ID " id " of object,Wherein, " user " represents the people in the reference format, and " action " represents the behavior in the reference format, " timestamp " represents the time in reference format, and " target " represents the object in reference format.
- A kind of 4. storage format of user behavior data according to claim 1 or 2, it is characterised in that:The reticle Formula stores in memory.
- A kind of 5. storage format of user behavior data according to claim 5, it is characterised in that:The memory uses Relevant database or K-V databases.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711342612.9A CN107992606A (en) | 2017-12-14 | 2017-12-14 | A kind of storage format of user behavior data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711342612.9A CN107992606A (en) | 2017-12-14 | 2017-12-14 | A kind of storage format of user behavior data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107992606A true CN107992606A (en) | 2018-05-04 |
Family
ID=62038609
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711342612.9A Pending CN107992606A (en) | 2017-12-14 | 2017-12-14 | A kind of storage format of user behavior data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107992606A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105515871A (en) * | 2015-12-23 | 2016-04-20 | 小米科技有限责任公司 | Method and device for recording log |
JP2017037381A (en) * | 2015-08-07 | 2017-02-16 | 株式会社日立製作所 | Behavior characteristic analysis system and behavior characteristic analysis method |
CN106650760A (en) * | 2015-10-28 | 2017-05-10 | 华为技术有限公司 | Method and device for recognizing user behavioral object based on flow analysis |
CN106776607A (en) * | 2015-11-19 | 2017-05-31 | 北京国双科技有限公司 | Search engine operation behavior treating method and apparatus |
-
2017
- 2017-12-14 CN CN201711342612.9A patent/CN107992606A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2017037381A (en) * | 2015-08-07 | 2017-02-16 | 株式会社日立製作所 | Behavior characteristic analysis system and behavior characteristic analysis method |
CN106650760A (en) * | 2015-10-28 | 2017-05-10 | 华为技术有限公司 | Method and device for recognizing user behavioral object based on flow analysis |
CN106776607A (en) * | 2015-11-19 | 2017-05-31 | 北京国双科技有限公司 | Search engine operation behavior treating method and apparatus |
CN105515871A (en) * | 2015-12-23 | 2016-04-20 | 小米科技有限责任公司 | Method and device for recording log |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11238120B1 (en) | Selecting, ranking, and/or presenting microsite content | |
US10331631B2 (en) | Apparatus, systems, and methods for analyzing characteristics of entities of interest | |
CN105574093B (en) | A method of index is established in the spark-sql big data processing system based on HDFS | |
TWI522942B (en) | User favorites data processing method and device, user favorite data searching method and device, and user favorite system | |
CN102339315B (en) | Index updating method and system of advertisement data | |
US20120084291A1 (en) | Applying search queries to content sets | |
WO2022156529A1 (en) | Commodity recommendation method and apparatus for enterprise user | |
US20150347420A1 (en) | Performing Application Searches | |
CN101840400B (en) | Multilevel classification retrieval method and system | |
CN102629269B (en) | Searching and storing method for embedded database | |
CN102467521A (en) | Easily-extensible multi-level classification search method and system | |
CN101963966A (en) | Method for sorting search results by adding labels into search results | |
CN108932286B (en) | Data query method and device | |
CN109522271B (en) | Batch insertion and deletion method and device for B + tree nodes | |
CN106415540A (en) | Federated search | |
CN101706790A (en) | Clustering method of WEB objects in search engine | |
CN102521306A (en) | Application method for data storage system | |
CN106970974A (en) | A kind of APP low discharges data update system and method based on version | |
CN106663100A (en) | Multi-domain query completion | |
EP3123360B1 (en) | Partition filtering using smart index in memory | |
CN103123651B (en) | A kind of method, device and mobile device checking multiple same class file | |
CN102254025B (en) | Information memory retrieving method | |
CN102110125B (en) | Retrieval method for quickly searching points of interest according to Chinese characters | |
CN107729330A (en) | The method and apparatus for obtaining data set | |
CN107992606A (en) | A kind of storage format of user behavior data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180504 |
|
RJ01 | Rejection of invention patent application after publication |