CN105721578B - User behavior data acquisition method and system
- Publication number: CN105721578B
- Application number: CN201610089688.4A
- Authority: CN (China)
- Prior art keywords: data, page, acquisition, timestamp, user
- Legal status: Active (the legal status is an assumption and is not a legal conclusion)
Classifications
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/535—Tracking the activity of the user
Abstract
The present invention provides a user behavior data acquisition method and system. The method comprises: recording, by an Apache process, first-type acquisition data associated with a user's access request to a page, the first-type acquisition data including the identification information of the page, a timestamp generated when the page loads, and first acquisition data; acquiring, by a JavaScript script, second-type acquisition data associated with the user's access request to the page, the second-type acquisition data including the identification information of the page, the timestamp generated when the page loads, and second acquisition data; and matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both to obtain the behavior data of the user. The invention extends the ways in which user behavior data can be acquired and makes the acquired data more comprehensive.
Description
Technical field
The present invention relates to the field of data processing and, more particularly, to a user behavior data acquisition method and system.
Background art
With the rapid development of Internet technology, the era of big data has arrived. Many popular websites now receive on the order of ten million user visits per day on average. The data about these visits forms the basic metadata for big data analysis, which makes dynamic data acquisition a vital step.
However, most existing approaches to acquiring page data from a website rely solely on either the Apache log or a JavaScript script, and each approach can acquire different data. The data acquired by existing approaches is therefore limited in kind and neither sufficient nor comprehensive.
Summary of the invention
To solve the above technical problem, the present invention provides a user behavior data acquisition method and system that match the data acquired about a user through Apache with the data acquired through a JavaScript script and use the matched result as the behavior data of the user. This extends the ways in which user behavior data can be acquired and significantly improves how comprehensive the acquired data is.
According to a first aspect of embodiments of the present invention, a user behavior data acquisition method is provided. The method comprises: recording, by an Apache process, first-type acquisition data associated with a user's access request to a page, the first-type acquisition data including the identification information of the page, a timestamp generated when the page loads, and first acquisition data; acquiring, by a JavaScript script, second-type acquisition data associated with the user's access request to the page, the second-type acquisition data including the identification information of the page, the timestamp generated when the page loads, and second acquisition data; and matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both to obtain the behavior data of the user.
In some embodiments of the present invention, the identification information of the page includes a uniform resource locator (URL).
In some embodiments of the present invention, the timestamp generated when the page loads is stored in a cookie of the page.
In some embodiments of the present invention, the first acquisition data includes one or more of the following: HTTP status code, on-site search keywords, browsed products, and products added to the shopping cart.
In some embodiments of the present invention, the second acquisition data includes one or more of the following: session ID, user agent, Flash version, cookie, screen parameters, and page dwell time.
In some embodiments of the present invention, matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both comprises: comparing the page identification information and timestamp in the first-type acquisition data with the page identification information and timestamp in the second-type acquisition data and, if they are consistent, merging the first-type acquisition data and the second-type acquisition data as the behavior data of the user on the page at the moment corresponding to the timestamp.
According to a second aspect of embodiments of the present invention, a user behavior data acquisition system is provided. The system comprises: a first acquisition module for recording, by an Apache process, first-type acquisition data associated with a user's access request to a page, the first-type acquisition data including the identification information of the page, a timestamp generated when the page loads, and first acquisition data; a second acquisition module for acquiring, by a JavaScript script, second-type acquisition data associated with the user's access request to the page, the second-type acquisition data including the identification information of the page, the timestamp generated when the page loads, and second acquisition data; and an integration module for matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both to obtain the behavior data of the user.
In some embodiments of the present invention, the identification information of the page includes a uniform resource locator (URL).
In some embodiments of the present invention, the timestamp generated when the page loads is stored in a cookie of the page.
In some embodiments of the present invention, the first acquisition data includes one or more of the following: HTTP status code, on-site search keywords, browsed products, and products added to the shopping cart.
In some embodiments of the present invention, the second acquisition data includes one or more of the following: session ID, user agent, Flash version, cookie, screen parameters, and page dwell time.
In some embodiments of the present invention, the matching performed by the integration module according to the page identification information and timestamp in the first-type acquisition data and the second-type acquisition data comprises: comparing the page identification information and timestamp in the first-type acquisition data with the page identification information and timestamp in the second-type acquisition data and, if they are consistent, merging the first-type acquisition data and the second-type acquisition data as the behavior data of the user on the page at the moment corresponding to the timestamp.
The user behavior data acquisition method and system provided by embodiments of the present invention extend the ways in which user behavior data can be acquired and make the acquired data more comprehensive.
Brief description of the drawings
Fig. 1 is a flow diagram of a user behavior data acquisition method according to an embodiment of the present invention;
Fig. 2 is a flow diagram of acquiring user-associated data through Apache according to an embodiment of the present invention;
Fig. 3 is a flow diagram of acquiring user-associated data through a JavaScript script according to an embodiment of the present invention;
Fig. 4 is a structural diagram of a user behavior data acquisition system according to an embodiment of the present invention.
Detailed description of embodiments
Various aspects of the present invention are described in detail below with reference to the drawings and specific embodiments. Well-known modules, units, and their connections, links, communications, or operations are not shown or described in detail. The described features, architectures, or functions may be combined in any manner in one or more embodiments. Those skilled in the art should understand that the following embodiments are for illustration only and do not limit the protection scope of the present invention. It is also readily understood that the modules, units, or processing methods of the embodiments described herein and shown in the drawings may be combined and designed in various different configurations.
Some concepts used in the present invention are explained first.
Apache is short for Apache HTTP Server, an open-source web server from the Apache Software Foundation. It runs on most computer operating systems and is a cross-platform web server. In embodiments of the present invention, an Apache process may receive the HyperText Transfer Protocol (HTTP) request that a user initiates to a page through a client browser and record the related log.
A JavaScript script is written in JavaScript, an interpreted scripting language that is dynamically typed, weakly typed, and prototype-based. In embodiments of the present invention, a common data acquisition JavaScript script may be embedded in each page to acquire custom metrics.
The user behavior data acquisition method of the present invention is described below with reference to the drawings.
Fig. 1 is a flow diagram of a user behavior data acquisition method according to an embodiment of the present invention; Fig. 2 is a flow diagram of acquiring user-associated data through Apache according to an embodiment of the present invention; Fig. 3 is a flow diagram of acquiring user-associated data through a JavaScript script according to an embodiment of the present invention.
As shown in Fig. 1, the user behavior data acquisition method of embodiments of the present invention may include steps S11, S12, and S13. In some other embodiments, the method may also include further steps, for example, pre-configuration and script embedding steps before acquisition and a data formatting step after matching.
Each step of the method is described in detail below.
In step S11, an Apache process records first-type acquisition data associated with the user's access request to the page. The first-type acquisition data includes the identification information of the page, the timestamp generated when the page loads, and first acquisition data. This step is executed on the server side of the website the user accesses, on a device where the Apache software is deployed. Before step S11, the method may also include configuring the Apache log format, which may be done, for example, by a system administrator. On the client side, when the user clicks a page of the website, the user's client browser is triggered to initiate an HTTP request to that page. On the server side of the website, the Apache process receives the HTTP request, records the first-type acquisition data associated with the access request, and forwards the recorded data through Syslog to the collector of the local module. Synchronizing the data through Syslog allows the Apache log to be analyzed asynchronously. In some embodiments of the present invention, the Apache log acquisition process may be as shown in Fig. 2.
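As an illustration of step S11, the following is a minimal Python sketch of turning one Apache access log line into a first-type record. It is not taken from the patent: the tab-separated log format, the cookie name `page_ts`, and the names `ServerRecord` and `parse_access_line` are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed (hypothetical) Apache log format, tab-separated:
#   LogFormat "%{page_ts}C\t%U%q\t%>s" behavior
# %{page_ts}C = value of an assumed cookie named page_ts,
# %U%q = URL path plus query string, %>s = final HTTP status code.


@dataclass(frozen=True)
class ServerRecord:
    url: str       # identification information of the page
    page_ts: int   # page-load timestamp read from the cookie
    status: int    # HTTP status code, one kind of first acquisition data


def parse_access_line(line: str) -> Optional[ServerRecord]:
    """Parse one log line; return None when it cannot be used for matching."""
    parts = line.rstrip("\n").split("\t")
    if len(parts) != 3:
        return None
    ts, url, status = parts
    # Apache writes "-" for a missing cookie, which fails the digit check.
    if not ts.isdigit() or not status.isdigit():
        return None
    return ServerRecord(url=url, page_ts=int(ts), status=int(status))
```

Lines without a usable timestamp are skipped in this sketch because they could not be matched against script-side data later.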
In embodiments of the present invention, the first-type acquisition data acquired by the Apache process may include the identification information of the page accessed by the user's access request, the timestamp generated when the page loads, and first acquisition data. The identification information of the accessed page may include a Uniform Resource Locator (URL), or may be another piece of identification information, or a combination of several, that uniquely identifies the page. In some embodiments of the present invention, the identification information of the page is the URL. The timestamp is generated each time a page loads and is stored in a cookie of the page; its precision can reach the 10^-9 second level. In other embodiments, depending on factors such as the precision the matching requires, timestamps of other precisions may be used, for example 10^-1, 10^-2, 10^-3, 10^-4, 10^-5, 10^-6, 10^-7, 10^-8, 10^-10, or 10^-11 second. The first acquisition data varies with the type of website page accessed and the purpose of the data analysis. For an e-commerce website, for example, the first acquisition data may include one or more (for example, two or more) of the following: HTTP status code, on-site search keywords and the activity records for those keywords, and activity records of member operations such as browsing products or adding products to the shopping cart. Note that the website page accessed by the user in embodiments of the present invention may belong not only to e-commerce websites of various types but also to other kinds of websites, for example, news websites.
In step S12, a JavaScript script acquires second-type acquisition data associated with the user's access request to the page. The second-type acquisition data includes the identification information of the page, the timestamp generated when the page loads, and second acquisition data. On the client side, when the user clicks a page of the website, the client browser is triggered to initiate an HTTP request to the page. While the page loads, the JavaScript script previously embedded in the page is triggered and starts acquiring the second-type acquisition data associated with the access request. When acquisition completes, the script sends the acquired data to the corresponding acquisition server so that the data can be analyzed afterwards. In some embodiments of the present invention, the process of acquiring data through the JavaScript script may be as shown in Fig. 3.
In embodiments of the present invention, the second-type acquisition data acquired by the JavaScript script may include the identification information of the page accessed by the user's access request, the timestamp generated when the page loads, and second acquisition data. The identification information of the accessed page may include a URL, or may be another piece of identification information, or a combination of several, that uniquely identifies the page. In some embodiments of the present invention, the identification information of the page is the URL. The timestamp is generated each time a page loads and is stored in a cookie of the page; its precision can reach the 10^-9 second level. In other embodiments, depending on factors such as the precision the matching requires, timestamps of other precisions may be used, for example 10^-1, 10^-2, 10^-3, 10^-4, 10^-5, 10^-6, 10^-7, 10^-8, 10^-10, or 10^-11 second. The second acquisition data varies with the type of website page accessed and the purpose of the data analysis. For an e-commerce website, for example, the second acquisition data may include one or more (for example, two or more) of the following: session ID (sessionID), user agent (UserAgent), Flash version, cookie, screen parameters, and page dwell time.
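The choice of timestamp precision can be illustrated with a short Python sketch. It assumes the timestamp is held as an integer count of nanoseconds and only covers precisions of 10^-9 second or coarser; the function name `reduce_precision` is hypothetical.

```python
def reduce_precision(ts_ns: int, decimals: int) -> int:
    """Truncate a nanosecond timestamp to 10^-decimals second precision."""
    if not 0 <= decimals <= 9:
        raise ValueError("decimals must be between 0 and 9")
    # Dropping the last (9 - decimals) digits keeps only the wanted precision.
    return ts_ns // 10 ** (9 - decimals)
```

Applying the same truncation to both data sources before comparison would let records match at, say, millisecond precision (`decimals=3`), at the cost of more accidental collisions between page loads that fall into the same interval.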
Note that although steps S11 and S12 are described above in a particular order, during data acquisition step S11 may be executed before step S12, step S12 may be executed before step S11, or the two steps may be executed at the same time.
The first-type acquisition data acquired in step S11 may be saved as files distributed across different web servers, and the files may be transferred asynchronously to an analysis server using syslog-ng. The second-type acquisition data acquired in step S12 may also be saved as files and transferred to the corresponding analysis server by the open-source tool Flume.
In step S13, the first-type acquisition data and the second-type acquisition data acquired above are matched according to the page identification information and timestamp in both to obtain the behavior data of the user. Specifically, the matching may include comparing the page identification information and timestamp in the first-type acquisition data with the page identification information and timestamp in the second-type acquisition data. If they are consistent, that is, both the page identification information and the timestamp are the same in the two sets of data, the first-type acquisition data and the second-type acquisition data are merged as the behavior data of the user on the page at the moment corresponding to the timestamp. If they are inconsistent, that is, the page identification information and timestamp in the two sets of data are not both the same, the data is not merged. In other words, the data integration of the present invention relies on the information common to the first-type and second-type acquisition data, namely the page identification information (for example, the URL) and the timestamp generated when the page loads, to match the data acquired in the two different ways. The result is all the data acquired for the user on the page at the moment corresponding to the timestamp, which serves as the behavior data of the user at that moment. After the behavior data of the user at a given moment is obtained, it may also be formatted, for example, extracted and processed into a unified format, to facilitate further statistical analysis.
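The matching in step S13 amounts to an equality join on the pair (page identification information, timestamp). The following Python sketch shows one way to do it, assuming each record is a dictionary with the hypothetical keys `url` and `page_ts`; it is an illustration, not the patent's implementation.

```python
from typing import Dict, Iterable, List, Tuple

Key = Tuple[str, int]  # (page URL, page-load timestamp)


def merge_records(server_records: Iterable[dict],
                  script_records: Iterable[dict]) -> List[dict]:
    """Merge first-type and second-type records whose URL and timestamp agree."""
    # Index the first-type records by key; a later duplicate key overwrites an earlier one.
    by_key: Dict[Key, dict] = {(r["url"], r["page_ts"]): r for r in server_records}
    merged = []
    for script_rec in script_records:
        server_rec = by_key.get((script_rec["url"], script_rec["page_ts"]))
        if server_rec is None:
            continue  # comparison inconsistent: the data is not merged
        merged.append({**server_rec, **script_rec})
    return merged
```

A script-side record with no server-side counterpart is dropped from the result, which mirrors the rule that inconsistent data is not merged.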
Embodiments of the present invention integrate two types of user-associated data acquired in different ways on the basis of the page identification information and the timestamp. Compared with existing schemes that acquire user-associated data in only one way, this extends the ways in which user behavior data can be acquired and makes the acquired data more comprehensive.
The user behavior data acquisition method of the present invention has been described above with reference to the drawings and specific examples. The system corresponding to the method is described below with reference to the drawings and specific examples.
Fig. 4 is a structural diagram of a user behavior data acquisition system according to an embodiment of the present invention.
As shown in Fig. 4, the user behavior data acquisition system 4 may include a first acquisition module 41, a second acquisition module 42, and an integration module 43. These modules may be deployed on the server side of the website, for example, in a server cluster used for acquiring data. The first acquisition module 41 may use the existing acquisition facilities of Apache to perform its data acquisition, and the second acquisition module 42 may likewise use the existing acquisition facilities of JavaScript scripts to perform its data acquisition.
The modules of the user behavior data acquisition system of the present invention are described in detail below.
The first acquisition module 41 records, by an Apache process, first-type acquisition data associated with the user's access request to the page. The first-type acquisition data includes the identification information of the page, the timestamp generated when the page loads, and first acquisition data. In some embodiments of the present invention, the Apache log acquisition process may be as shown in Fig. 2.
In embodiments of the present invention, the first-type acquisition data acquired by the Apache process may include the identification information of the page accessed by the user's access request, the timestamp generated when the page loads, and first acquisition data. The identification information of the accessed page may include a uniform resource locator, or may be another piece of identification information, or a combination of several, that uniquely identifies the page. In some embodiments of the present invention, the identification information of the page is the URL. The timestamp is generated each time a page loads and is stored in a cookie of the page; its precision can reach the 10^-9 second level. In other embodiments, depending on factors such as the precision the matching requires, timestamps of other precisions may be used, for example 10^-1, 10^-2, 10^-3, 10^-4, 10^-5, 10^-6, 10^-7, 10^-8, 10^-10, or 10^-11 second. The first acquisition data varies with the type of website page accessed and the purpose of the data analysis. For an e-commerce website, for example, the first acquisition data may include one or more (for example, two or more) of the following: HTTP status code, on-site search keywords and the activity records for those keywords, and activity records of member operations such as browsing products or adding products to the shopping cart.
The second acquisition module 42 acquires, by a JavaScript script, second-type acquisition data associated with the user's access request to the page. The second-type acquisition data includes the identification information of the page, the timestamp generated when the page loads, and second acquisition data. In some embodiments of the present invention, the process of acquiring data through the JavaScript script may be as shown in Fig. 3.
In embodiments of the present invention, the second-type acquisition data acquired by the JavaScript script may include the identification information of the page accessed by the user's access request, the timestamp generated when the page loads, and second acquisition data. The identification information of the accessed page may include a URL, or may be another piece of identification information, or a combination of several, that uniquely identifies the page. In some embodiments of the present invention, the identification information of the page is the URL. The timestamp is generated each time a page loads and is stored in a cookie of the page; its precision can reach the 10^-9 second level. In other embodiments, depending on factors such as the precision the matching requires, timestamps of other precisions may be used, for example 10^-1, 10^-2, 10^-3, 10^-4, 10^-5, 10^-6, 10^-7, 10^-8, 10^-10, or 10^-11 second. The second acquisition data varies with the type of website page accessed and the purpose of the data analysis. For an e-commerce website, for example, the second acquisition data may include one or more (for example, two or more) of the following: session ID (sessionID), user agent (UserAgent), Flash version, cookie, screen parameters, and page dwell time.
The integration module 43 matches the first-type acquisition data acquired by the first acquisition module 41 and the second-type acquisition data acquired by the second acquisition module 42 according to the page identification information and timestamp in both to obtain the behavior data of the user. Specifically, the matching may include comparing the page identification information and timestamp in the first-type acquisition data with the page identification information and timestamp in the second-type acquisition data. If they are consistent, that is, both the page identification information and the timestamp are the same in the two sets of data, the first-type acquisition data and the second-type acquisition data are merged as the behavior data of the user on the page at the moment corresponding to the timestamp. If they are inconsistent, that is, the page identification information and timestamp in the two sets of data are not both the same, the data is not merged.
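The relationship between modules 41, 42, and 43 can be sketched as three cooperating objects. This Python sketch is only a structural illustration: the class names, the `collect` and `behavior_data` methods, and the in-memory storage are assumptions, whereas in the described system the data arrives through Apache logs and an embedded page script.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Key = Tuple[str, int]  # (page URL, page-load timestamp)


@dataclass
class AcquisitionModule:
    """Stands in for module 41 (Apache log) or module 42 (page script)."""
    records: Dict[Key, dict] = field(default_factory=dict)

    def collect(self, url: str, page_ts: int, **data) -> None:
        # Store the acquired fields under the key used later for matching.
        self.records[(url, page_ts)] = data


@dataclass
class IntegrationModule:
    """Stands in for module 43: merges records that share URL and timestamp."""
    first: AcquisitionModule
    second: AcquisitionModule

    def behavior_data(self) -> List[dict]:
        # Only keys present in both modules are consistent and get merged.
        shared = self.first.records.keys() & self.second.records.keys()
        return [
            {"url": url, "page_ts": ts,
             **self.first.records[(url, ts)], **self.second.records[(url, ts)]}
            for (url, ts) in sorted(shared)
        ]
```

Keeping the two acquisition modules independent of each other reflects the point that either source can be collected first, or both at once, with integration happening afterwards.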
Embodiments of the present invention integrate two types of user-associated data acquired in different ways on the basis of the page identification information and the timestamp. Compared with existing schemes that acquire user-associated data in only one way, this extends the ways in which user behavior data can be acquired and makes the acquired data more comprehensive.
From the above description of the embodiments, those skilled in the art can clearly understand that the present invention may be implemented by software combined with a hardware platform. On this understanding, all or part of the contribution the technical solution of the present invention makes over the background art may be embodied in the form of a software product. The computer software product may be stored in a storage medium, such as ROM/RAM, a magnetic disk, or an optical disc, and includes instructions that cause a computer device (which may be a personal computer, a server, a smartphone, a network device, or the like) to execute the methods described in the embodiments of the present invention or in certain parts of the embodiments.
The terms and wording used in this description are for illustration only and are not intended to be limiting. Those skilled in the art should understand that various changes may be made to the details of the above embodiments without departing from the basic principles of the disclosed embodiments. The scope of the present invention is therefore determined only by the claims, in which, unless otherwise stated, all terms should be understood in their broadest reasonable meaning.
Claims (10)
1. A user behavior data acquisition method, characterized in that the method comprises:
recording, by an Apache process, first-type acquisition data associated with a user's access request to a page, the first-type acquisition data including the identification information of the page, a timestamp generated when the page loads, and first acquisition data;
acquiring, by a JavaScript script, second-type acquisition data associated with the user's access request to the page, the second-type acquisition data including the identification information of the page, the timestamp generated when the page loads, and second acquisition data;
matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both to obtain the behavior data of the user,
wherein the page identification information and timestamp in the first-type acquisition data are compared with the page identification information and timestamp in the second-type acquisition data and, if they are consistent, the first-type acquisition data and the second-type acquisition data are merged as the behavior data of the user on the page at the moment corresponding to the timestamp.
2. The method according to claim 1, characterized in that the identification information of the page includes a uniform resource locator (URL).
3. The method according to claim 1, characterized in that the timestamp generated when the page loads is stored in a cookie of the page.
4. The method according to any one of claims 1 to 3, characterized in that the first acquisition data includes one or more of the following: HTTP status code, on-site search keywords, browsed products, and products added to the shopping cart.
5. The method according to any one of claims 1 to 3, characterized in that the second acquisition data includes one or more of the following: session ID, user agent, Flash version, cookie, screen parameters, and page dwell time.
6. A user behavior data acquisition system, characterized in that the system comprises:
a first acquisition module for recording, by an Apache process, first-type acquisition data associated with a user's access request to a page, the first-type acquisition data including the identification information of the page, a timestamp generated when the page loads, and first acquisition data;
a second acquisition module for acquiring, by a JavaScript script, second-type acquisition data associated with the user's access request to the page, the second-type acquisition data including the identification information of the page, the timestamp generated when the page loads, and second acquisition data;
an integration module for matching the first-type acquisition data and the second-type acquisition data according to the page identification information and timestamp in both to obtain the behavior data of the user, wherein the page identification information and timestamp in the first-type acquisition data are compared with the page identification information and timestamp in the second-type acquisition data and, if they are consistent, the first-type acquisition data and the second-type acquisition data are merged as the behavior data of the user on the page at the moment corresponding to the timestamp.
7. The system according to claim 6, characterized in that the identification information of the page includes a uniform resource locator (URL).
8. The system according to claim 6, characterized in that the timestamp generated when the page loads is stored in a cookie of the page.
9. The system according to any one of claims 6 to 8, characterized in that the first acquisition data include one or more of the following: HTTP status code, on-site search keywords, browsed products, and products added to the shopping cart.
10. The system according to any one of claims 6 to 8, characterized in that the second acquisition data include one or more of the following: session ID, user agent, Flash version, cookie, screen parameters, and page dwell time.
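The matching step recited in the claims (compare page identification and page-load timestamp, merge on a match) can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Record` type, the `merge_records` function, the field names, and the sample URLs and values are all hypothetical, and the sketch assumes both sources report the same timestamp string (for example, one read from the page cookie) so that exact equality is a sufficient comparison.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class Record:
    """Hypothetical collected record: page URL, page-load timestamp, payload."""
    url: str
    load_ts: str
    payload: dict[str, Any] = field(default_factory=dict)


def merge_records(server_records: list[Record],
                  script_records: list[Record]) -> list[dict[str, Any]]:
    """Join server-side and script-side records on (url, load_ts).

    Only pairs whose URL and timestamp are both equal are merged;
    unmatched records are left out of the result.
    """
    # Index the script-side records by the comparison key.
    by_key = {(r.url, r.load_ts): r for r in script_records}
    merged = []
    for s in server_records:
        match = by_key.get((s.url, s.load_ts))
        if match is not None:
            # On a key collision the script-side value wins (a sketch choice).
            merged.append({"url": s.url, "load_ts": s.load_ts,
                           **s.payload, **match.payload})
    return merged


if __name__ == "__main__":
    server = [Record("https://example.com/item/1", "1455700000123",
                     {"http_status": 200})]
    script = [Record("https://example.com/item/1", "1455700000123",
                     {"user_agent": "ExampleUA", "dwell_ms": 5000})]
    print(merge_records(server, script))
```

Indexing one side by the `(url, load_ts)` key keeps the join linear in the number of records. A real deployment would also need to decide how to handle unmatched or duplicate keys, which the claims do not specify.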
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610089688.4A CN105721578B (en) | 2016-02-17 | 2016-02-17 | A kind of user behavior data acquisition method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105721578A (en) | 2016-06-29 |
CN105721578B (en) | 2019-05-24 |
Family
ID=56155846
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610089688.4A (Active; published as CN105721578B (en)) | 2016-02-17 | 2016-02-17 | A kind of user behavior data acquisition method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105721578B (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107886382B (en) * | 2016-09-29 | 2021-11-30 | 北京京东尚科信息技术有限公司 | Method, device and system for analyzing channel drainage effect in website |
CN109144834B (en) * | 2017-06-27 | 2021-11-23 | 深圳市Tcl高新技术开发有限公司 | User behavior data acquisition method and device, android system and terminal equipment |
CN109145194A (en) * | 2017-06-27 | 2019-01-04 | 北京国双科技有限公司 | The acquisition method and device of user behavior data |
CN109558449B (en) * | 2018-10-18 | 2022-02-08 | 北京新唐思创教育科技有限公司 | Data processing platform and data processing method |
CN111245880B (en) * | 2018-11-29 | 2022-10-04 | 中国移动通信集团山东有限公司 | Behavior trajectory reconstruction-based user experience monitoring method and device |
CN111277615B (en) * | 2018-12-04 | 2022-01-11 | 阿里巴巴集团控股有限公司 | User behavior tracking method based on browser, terminal device and server |
CN112199263A (en) * | 2020-09-30 | 2021-01-08 | 北京字节跳动网络技术有限公司 | Method, device, equipment and medium for recording page |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101576933A (en) * | 2009-06-29 | 2009-11-11 | 北京黑米世纪信息技术有限公司 | Fully-automatic grouping method of WEB pages based on title separator |
CN104601408A (en) * | 2015-01-30 | 2015-05-06 | 迈普通信技术股份有限公司 | Website data statistics and analysis method and system used for non-open network environment |
CN104636245A (en) * | 2015-03-09 | 2015-05-20 | 浪潮集团有限公司 | User browsing behavior collection modes based on real-time update |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070271375A1 (en) * | 2004-09-27 | 2007-11-22 | Symphoniq Corporation | Method and apparatus for monitoring real users experience with a website capable of using service providers and network appliances |
2016-02-17: CN application CN201610089688.4A, patent CN105721578B (en), status: Active
Non-Patent Citations (4)
Title |
---|
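Analysis and Research of Web Usage Mining Technology; 朱志国 et al.; 《计算机应用研究》; 20080131; full text |
Research on a Hadoop-Based Web Log Analysis System; 胡光民 et al.; 《电脑知识与技术》; 20100831; full text |
Research on Data Collection Technology for Web Usage Mining Based on User Behavior; 向坚持 et al.; 《计算机与现代化》; 20071231; abstract, sections 0-3, figures 1-2 |
System Design and JavaScript Notes: Data Collection in User Behavior Analysis Research; 夏天的森林; 《https://www.cnblogs.com/sharpxiajun/archive/2012/06/html》; 20120626; full text |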
Also Published As
Publication number | Publication date |
---|---|
CN105721578A (en) | 2016-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105721578B (en) | A kind of user behavior data acquisition method and system | |
CN111522922B (en) | Log information query method and device, storage medium and computer equipment | |
US9659105B2 (en) | Methods and apparatus to track web browsing sessions | |
van Baar et al. | Digital forensics as a service: A game changer | |
US11477298B2 (en) | Offline client replay and sync | |
US8935390B2 (en) | Method and system for efficient and exhaustive URL categorization | |
US9171319B2 (en) | Analysis system and method used to construct social structures based on data collected from monitored web pages | |
CN107846426B (en) | Method and device for tracking user track in page access | |
US10073755B2 (en) | Tracing source code for end user monitoring | |
AU2007243143A1 (en) | Independent actionscript analytics tools and techniques | |
CN105488205B (en) | Page generation method and device | |
CN103309884A (en) | User behavior data collecting method and system | |
CN103415841A (en) | Method and computer program to monitor and correlate user - initiated actions with backend operations | |
US9607081B2 (en) | Ontology based categorization of users | |
US11178160B2 (en) | Detecting and mitigating leaked cloud authorization keys | |
EP2857987A1 (en) | Acquiring method, device and system of user behavior | |
CN107370628B (en) | Log processing method and system based on embedded points | |
CN105721519B (en) | A kind of webpage data acquiring method, apparatus and system | |
JP2011515754A (en) | URL providing method and system capable of new advertisement | |
US9736215B1 (en) | System and method for correlating end-user experience data and backend-performance data | |
CN104317884A (en) | Method and device for acquiring types of source pages of website | |
CN106815248A (en) | Web analytics method and device | |
Suguna et al. | User interest level based preprocessing algorithms using web usage mining | |
CN112860456B (en) | Log processing method and device | |
CN111459577B (en) | Application installation source tracking method, device, equipment and storage medium | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | C06 | Publication | |
 | PB01 | Publication | |
 | C10 | Entry into substantive examination | |
 | SE01 | Entry into force of request for substantive examination | |
 | GR01 | Patent grant | |
 | GR01 | Patent grant | |