CN106656542B - method and device for determining user identification - Google Patents


Info

Publication number
CN106656542B
Authority
CN
China
Prior art keywords
account
party cookie
determining
cookie
access record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510742866.4A
Other languages
Chinese (zh)
Other versions
CN106656542A (en)
Inventor
詹晓强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201510742866.4A priority Critical patent/CN106656542B/en
Publication of CN106656542A publication Critical patent/CN106656542A/en
Application granted granted Critical
Publication of CN106656542B publication Critical patent/CN106656542B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0246Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols
    • H04L41/0253Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols using browsers or web-pages for accessing management information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0246Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols
    • H04L41/0266Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols using meta-data, objects or commands for formatting management information, e.g. using eXtensible markup language [XML]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The application discloses a method and a device for determining a user identifier. The method comprises: acquiring an access log of a target website; extracting cookie information and an account ID from each access record of the access log; and determining the user identifier of each access record according to the cookie information and the account ID. The method and the device solve the technical problem in the prior art that a user identifier cannot be accurately determined from the first-party cookie alone or the third-party cookie alone.

Description

Method and device for determining user identification
Technical Field
The present application relates to the field of computers, and in particular, to a method and an apparatus for determining a user identifier.
Background
At present, when data analysis is performed on a website, user behavior needs to be analyzed, and the user must first be identified so that it is known which user a series of behaviors belongs to. Users are usually identified by cookies. Cookies are divided into first-party cookies and third-party cookies. First-party cookies are widely supported and used, but they cannot identify a user across domains. In that case, only third-party cookies can be used. However, because of browser settings and similar environmental factors, third-party cookies are accepted less often than first-party cookies.
As can be seen from the above, the prior art uses as the user identifier either the first-party cookie, which lacks cross-domain identification, or the third-party cookie, which provides cross-domain identification but has a low acceptance rate and therefore low accuracy.
In view of the above problem, no effective solution has yet been proposed.
Disclosure of Invention
The embodiment of the application provides a method and a device for determining a user identifier, so as to at least solve the technical problem that the user identifier cannot be accurately determined only by using a first party cookie or a third party cookie in the prior art.
According to an aspect of an embodiment of the present application, a method for determining a user identifier is provided, including: acquiring an access log of a target website; extracting cookie information and an account ID from each access record of the access log; and determining the user identification of each access record according to the cookie information and the account ID.
Further, the cookie information comprises a first party cookie and a third party cookie, wherein: extracting cookie information and an account ID from each access record of the access log comprises: extracting the ID of the first party cookie, the ID of the third party cookie and the account ID from each access record; numbering each access record in sequence, and storing each number, the ID of the first party cookie corresponding to the number, the ID of the third party cookie and the account ID; determining the user identifier of each access record according to the cookie information and the account ID comprises: and determining the user identification of the access record corresponding to the number according to the ID of the first party cookie corresponding to each number, the ID of the third party cookie and the account ID.
Further, determining, according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the user identifier of the access record corresponding to the number includes: judging whether data exists in the ID of the third-party cookie corresponding to the number and judging whether data exists in the account ID; and, when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID, determining that the user identifier of the access record corresponding to the number is the account ID.
Further, determining, according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the user identifier of the access record corresponding to the number further includes: when it is judged that data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID, determining that the user identifier of the access record corresponding to the number is the account ID.
Further, determining, according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the user identifier of the access record corresponding to the number further includes: and under the condition that the data exist in the ID of the third-party cookie corresponding to the number and the data do not exist in the account ID, determining that the user identifier of the access record corresponding to the number is the ID of the third-party cookie.
Further, determining, according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the user identifier of the access record corresponding to the number further includes: when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID, determining that the user identifier of the access record corresponding to the number is the ID of the first-party cookie.
Further, after determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the method further includes: storing each number and the user identifier corresponding to the number in a database.
According to another aspect of the embodiments of the present application, there is provided an apparatus for determining a user identifier, including: an acquisition unit, configured to acquire an access log of a target website; an extracting unit, configured to extract cookie information and an account ID from each access record of the access log; and a determining unit, configured to determine the user identifier of each access record according to the cookie information and the account ID.
Further, the cookie information comprises a first party cookie and a third party cookie, wherein: the extraction unit includes: an extracting subunit, configured to extract, from each of the access records, an ID of a first-party cookie, an ID of a third-party cookie, and the account ID; the processing subunit is used for numbering each access record in sequence and storing each number, the ID of the first party cookie corresponding to the number, the ID of the third party cookie and the account ID; the determination unit includes: and the determining subunit is used for determining the user identifier of the access record corresponding to each number according to the ID of the first party cookie corresponding to each number, the ID of the third party cookie and the account ID.
Further, the determining subunit includes: the judging module is used for judging whether data exist in the ID of the third-party cookie corresponding to the number and judging whether data exist in the account ID; and the first determining module is used for determining that the user identifier of the access record corresponding to the number is the account ID when judging that no data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID.
Further, the determining subunit further includes: a second determining module, configured to determine that the user identifier of the access record corresponding to the number is the account ID when it is judged that data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID.
Further, the determining subunit further includes: and the third determining module is used for determining that the user identifier of the access record corresponding to the number is the ID of the third-party cookie when judging that data exists in the ID of the third-party cookie corresponding to the number and data does not exist in the account ID.
Further, the determining subunit further includes: a fourth determining module, configured to determine that the user identifier of the access record corresponding to the number is the ID of the first-party cookie when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID.
Further, the determining subunit further includes: and the storage subunit is used for storing each number and the user identification corresponding to the number in a database after determining the user identification of the access record corresponding to the number according to the ID of the first party cookie corresponding to each number, the ID of the third party cookie and the account ID.
In the embodiment of the application, the access log of the target website is obtained; extracting cookie information and an account ID from each access record of the access log; and determining the user identifier of each access record according to the cookie information and the account ID, so that the purpose of accurately determining the user identifier is achieved, the technical problem that the user identifier cannot be accurately determined only by using the first party cookie or the third party cookie in the prior art is solved, and the technical effect of improving the accuracy of determining the user identifier is achieved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
Fig. 1 is a flowchart of a method for determining a user identifier according to an embodiment of the present application; and
Fig. 2 is a schematic diagram of an apparatus for determining a user identifier according to an embodiment of the present application.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In accordance with an embodiment of the present application, a method embodiment for determining a user identifier is provided. It should be noted that the steps illustrated in the flowchart of the drawings may be performed in a computer system, for example as a set of computer-executable instructions, and that although a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different from that presented herein.
Fig. 1 is a flowchart of a method for determining a user identifier according to an embodiment of the present application. As shown in Fig. 1, the method includes steps S102 to S106, as follows:
Step S102, obtaining an access log of the target website.
Specifically, the access log of the target website may be obtained by deploying a script on the web pages of the target website.
The target website is a website whose user behavior needs to be analyzed, and may be any e-commerce website or video website, such as kyoto, naobao, suning, guicheng, youku, and the like.
Step S104, extracting cookie information and an account ID from each access record of the access log.
Specifically, the cookie information may include a first party cookie and a third party cookie.
The account ID (also known as a strong account) identifies a user very accurately, but it is not easy to obtain.
And step S106, determining the user identification of each access record according to the cookie information and the account ID.
In the embodiment of the application, the user identification is determined jointly according to the cookie information containing the first party cookie and the third party cookie and the account ID, and compared with the prior art that the user identification is determined only according to the first party cookie or the third party cookie, the user identification can be determined more accurately, so that the technical problem that the user identification cannot be determined accurately only by using the first party cookie or the third party cookie in the prior art is solved, and the technical effect of improving the accuracy of determining the user identification is achieved.
Optionally, in an embodiment of the present application, the cookie information includes a first-party cookie and a third-party cookie, where:
Extracting cookie information and an account ID from each access record of the access log includes the following steps S1041 to S1043:
Step S1041, extracting the ID of the first-party cookie, the ID of the third-party cookie, and the account ID from each access record.
Step S1043, numbering each access record in sequence, and storing each number together with the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to that number.
If the access log includes 3 access records, the data shown in table 1 below can be obtained according to the above steps S1041 to S1043.
TABLE 1
Number  cookieID  cookie3rdID  UserID  …
1       xxxx      xxxx         xxxxx   …
2       xxxx      xxxx         xxxxx   …
3       xxxx      xxxx         xxxxx   …
Here, cookieID represents the ID of the first-party cookie, cookie3rdID represents the ID of the third-party cookie, and UserID represents the account ID. It should be noted that information other than the above may also be obtained from each access record by the script and stored in correspondence with the number (for example, the content denoted by "…" in Table 1).
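Steps S1041 to S1043 can be sketched as follows. This is a minimal illustration, not the implementation described in the application. It assumes that each access record is already available as a Python dict; the field names `cookieID`, `cookie3rdID` and `UserID` are borrowed from the Table 1 headers, and the helper `number_records` is hypothetical. A field that is absent from a record is stored as an empty string.

```python
from typing import Dict, Iterable, List

FIELDS = ("cookieID", "cookie3rdID", "UserID")  # column names follow Table 1


def number_records(access_log: Iterable[Dict[str, str]]) -> List[Dict[str, object]]:
    """Extract the three IDs from each record and number the records in sequence."""
    rows = []
    for number, record in enumerate(access_log, start=1):
        row: Dict[str, object] = {"number": number}
        for field in FIELDS:
            # Assumption: an absent or None value is treated as missing data ("").
            row[field] = record.get(field) or ""
        rows.append(row)
    return rows


# Hypothetical access log with three records, mirroring the shape of Table 1.
log = [
    {"cookieID": "c1", "cookie3rdID": "t1", "UserID": "u1"},
    {"cookieID": "c2", "UserID": "u1"},
    {"cookieID": "c3"},
]
rows = number_records(log)
```

Each element of `rows` corresponds to one row of Table 1, with the empty string marking an ID that could not be obtained.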
Determining the user identifier of each access record according to the cookie information and the account ID comprises: determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie and the account ID corresponding to that number.
Referring to table 1, for example, the user identifier of the access record corresponding to number 1 is determined from the ID of the first-party cookie corresponding to number 1, the ID of the third-party cookie, and the account ID.
In the embodiment of the application, the user identifier is determined by combining the first-party cookie, the third-party cookie and the account ID, so that the respective advantages and disadvantages of the first-party cookie and the third-party cookie are both taken into account. This further improves the accuracy of the determined user identifier and supports cross-domain identification of users.
Optionally, determining, according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the user identifier of the access record corresponding to the number includes: judging whether data exists in the ID of the third-party cookie corresponding to the number and whether data exists in the account ID; in other words, judging whether the ID of the third-party cookie corresponding to the number is missing and whether the account ID is missing.
Although the ID of the first-party cookie, the ID of the third-party cookie, and the account ID may all be collected by the deployed script, browser restrictions mean that, in the normal case, only the ID of the first-party cookie is reliably obtained; the ID of the third-party cookie and the account ID may not actually be obtained. That is, the ID of the third-party cookie and the account ID may be missing. The judgment result therefore falls into one of the following cases:
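The judging step can be illustrated with a short sketch. The names `has_data` and `judge` are hypothetical, and treating `None` and whitespace-only strings as missing is an assumption of this sketch; the application only states that the data may be missing.

```python
from typing import Optional, Tuple


def has_data(value: Optional[str]) -> bool:
    """Return True if the field carries data.

    Assumption: None, "" and whitespace-only strings all count as missing.
    """
    return value is not None and value.strip() != ""


def judge(cookie3rd_id: Optional[str], user_id: Optional[str]) -> Tuple[bool, bool]:
    """Judge whether the third-party cookie ID and the account ID are present."""
    return has_data(cookie3rd_id), has_data(user_id)


# Third-party cookie ID missing, account ID present.
flags = judge("", "u1")
```

The pair of flags returned by `judge` selects which of the cases that follow applies to a given access record.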
In the first case, when it is determined that no data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID, the user identifier of the access record corresponding to the number is determined to be the account ID.
Specifically, the determination result may be represented by the following formula:
If cookie3rdID == "" and UserID != "", then uvID = UserID.
Here, "" indicates missing data, cookieID represents the ID of the first-party cookie, cookie3rdID represents the ID of the third-party cookie, UserID represents the account ID, and uvID represents the user identifier.
This embodiment of the application provides a way of determining the user identifier when the cookieIDs of the same user are inconsistent. It should be noted that the cookieIDs of the same user may be inconsistent because, for example, a cookie has been regenerated.
In the second case, when the data exists in the ID of the third-party cookie corresponding to the number and the data exists in the account ID, the user identification of the access record corresponding to the number is determined to be the account ID.
Similarly, the above determination result can be represented by the following formula:
If UserID != "" and cookie3rdID != "", then uvID = UserID.
This embodiment of the application provides a way of determining the user identifier when the same user is identified across domains. It should be noted that when a user logs in to website A and also logs in to website B, and the domain names of website A and website B are different, identifying that user is cross-domain identification.
And in the third case, when the data exists in the ID of the third-party cookie corresponding to the number and the data does not exist in the account ID, determining the user identifier of the access record corresponding to the number as the ID of the third-party cookie.
Similarly, the above determination result can be represented by the following formula:
If UserID == "" and cookie3rdID != "", then uvID = cookie3rdID.
This embodiment of the application provides a way of determining the user identifier in a partially cross-domain scenario.
In the fourth case, when no data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID, the user identifier of the access record corresponding to the number is determined to be the ID of the first-party cookie.
Similarly, the above determination result can be represented by the following formula:
If UserID == "" and cookie3rdID == "", then uvID = cookieID.
As can be seen from the above, the scheme provided by the application combines several ways of determining the user identifier so that each compensates for the shortcomings of the others, supports determining the user identifier across domains, and improves the accuracy of the determined user identifier. It should be noted that determining the user identifier across domains relies on third-party cookie information being obtained.
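The four cases above reduce to a simple priority order: the account ID when it is present, otherwise the ID of the third-party cookie, otherwise the ID of the first-party cookie. The following sketch expresses that rule. The function name `resolve_uv_id` and its parameter names are hypothetical, and the empty string stands for missing data as in the formulas.

```python
def resolve_uv_id(cookie_id: str, cookie3rd_id: str, user_id: str) -> str:
    """Return the user identifier (uvID) for one access record.

    An empty string stands for missing data.
    """
    if user_id != "":
        # Cases one and two: the account ID is used whenever it is present,
        # whether or not the third-party cookie ID is present.
        return user_id
    if cookie3rd_id != "":
        # Case three: no account ID, but a third-party cookie ID exists.
        return cookie3rd_id
    # Case four: neither is present, so fall back to the first-party cookie ID.
    return cookie_id


case_one = resolve_uv_id("c1", "", "u1")
case_two = resolve_uv_id("c1", "t1", "u1")
case_three = resolve_uv_id("c1", "t1", "")
case_four = resolve_uv_id("c1", "", "")
```

Because cases one and two produce the same result, a single check on the account ID covers both of them.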
Optionally, after determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to each number, the method further includes: storing each number and the user identifier corresponding to the number in a database.
Specifically, based on table 1, the data shown in table 2 below can be obtained.
TABLE 2
Number  uvID  …
1       xxxx  …
2       xxxx  …
3       xxxx  …
It should be noted that the other information obtained in the access record is only schematically shown in one column in table 2, and is denoted by "…", and there may be a plurality of columns.
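Storing each number together with its user identifier might look like the following sketch. The application does not name a database, so the use of SQLite through Python's standard `sqlite3` module, the table name `uv`, and the column names are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical results: (number, uvID) pairs shaped like the rows of Table 2.
results = [(1, "u1"), (2, "t2"), (3, "c3")]

conn = sqlite3.connect(":memory:")  # in-memory database, for illustration only
conn.execute("CREATE TABLE uv (number INTEGER PRIMARY KEY, uvid TEXT NOT NULL)")
conn.executemany("INSERT INTO uv (number, uvid) VALUES (?, ?)", results)
conn.commit()

# Read the rows back to confirm what was stored.
stored = conn.execute("SELECT number, uvid FROM uv ORDER BY number").fetchall()
conn.close()
```

Additional columns for the other information obtained from each access record could be added to the same table.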
According to an embodiment of the present application, a device for determining a user identifier is also provided. The device is configured to execute the method for determining a user identifier described above, and is described in detail below:
Fig. 2 is a schematic diagram of a device for determining a user identifier according to an embodiment of the present application. As shown in Fig. 2, the device mainly includes an acquisition unit 21, an extracting unit 23, and a determining unit 25, where:
The acquisition unit 21 is configured to acquire an access log of a target website. Specifically, the access log of the target website may be obtained by deploying a script on a website page of the target website.
The extracting unit 23 is configured to extract cookie information and an account ID from each access record of the access log.
Specifically, the cookie information may include a first party cookie and a third party cookie.
The determining unit 25 is configured to determine the user identifier of each access record according to the cookie information and the account ID.
In the embodiment of the application, the user identification is determined jointly according to the cookie information containing the first party cookie and the third party cookie and the account ID, and compared with the prior art that the user identification is determined only according to the first party cookie or the third party cookie, the user identification can be determined more accurately, so that the technical problem that the user identification cannot be determined accurately only by using the first party cookie or the third party cookie in the prior art is solved, and the technical effect of improving the accuracy of determining the user identification is achieved.
Optionally, in an embodiment of the present application, the cookie information includes a first-party cookie and a third-party cookie, where:
The extraction unit includes: the extraction subunit is used for extracting the ID of the first party cookie, the ID of the third party cookie and the account ID from each access record; and the processing subunit is used for numbering each access record in sequence and storing each number, the ID of the first party cookie corresponding to the number, the ID of the third party cookie and the account ID.
The determination unit includes: and the determining subunit is used for determining the user identifier of the access record corresponding to the number according to the ID of the first party cookie corresponding to each number, the ID of the third party cookie and the account ID.
In the embodiment of the application, the user identification is determined by combining the first party cookie, the third party cookie and the account ID, the advantages and disadvantages of the first party cookie and the third party cookie can be considered, and the technical effects of further improving the accuracy of determining the user identification and supporting the cross-domain identification of the user are achieved.
Optionally, in an embodiment of the present application, the determining subunit includes:
A judging module, configured to judge whether data exists in the ID of the third-party cookie corresponding to the number and whether data exists in the account ID; in other words, to judge whether the ID of the third-party cookie corresponding to the number is missing and whether the account ID is missing.
Although the ID of the first-party cookie, the ID of the third-party cookie, and the account ID may all be collected by the deployed script, browser restrictions mean that, in the normal case, only the ID of the first-party cookie is reliably obtained; the ID of the third-party cookie and the account ID may not actually be obtained. That is, the ID of the third-party cookie and the account ID may be missing. The judgment result of the judging module therefore falls into one of the following cases:
In case one, the determining subunit further comprises a first determining module. The first determining module is used for determining that the user identifier of the access record corresponding to the number is the account ID when the data does not exist in the ID of the third-party cookie corresponding to the number and the data exists in the account ID.
In the embodiment of the application, a mode of determining the user identification when the CookieIDs of the same user are inconsistent is provided. It should be noted that the inconsistency of the cookie ids of the same user may be caused by cookie regeneration, and the like.
In case two, the determining subunit further comprises a second determining module. And the second determining module is used for determining that the user identifier of the access record corresponding to the number is the account ID under the condition that the data exists in the ID of the third-party cookie corresponding to the number and the data exists in the account ID.
In the embodiment of the application, a mode of determining the user identification when the same user is identified in a cross-domain mode is provided. It should be noted that, when a user logs in the website a and also logs in the website B, and domain names of the website a and the website B are different, identifying the user is a cross-domain identifier.
In case three, the determining subunit further comprises a third determining module. The third determining module is used for determining that the user identifier of the access record corresponding to the number is the ID of the third-party cookie when data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID.
In this embodiment of the application, a way of determining the user identifier in the partially cross-domain case is provided.
In case four, the determining subunit further comprises a fourth determining module. The fourth determining module is used for determining that the user identifier of the access record corresponding to the number is the ID of the first-party cookie when no data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID.
As can be seen from the above, the scheme provided by the application combines several methods of determining the user identifier so that each compensates for the shortcomings of the others, supports determining the user identifier across domains, and improves the accuracy of the determined user identifier. It should be noted that determining the user identifier across domains relies on obtaining third-party cookie information.
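The four cases above can be sketched as a single selection function. This is a minimal illustration and not the patented implementation: the function name `resolve_user_identifier`, the helper `_has_data`, and the convention that a missing value is `None` or a blank string are all assumptions made for the example.

```python
from typing import Optional


def _has_data(value: Optional[str]) -> bool:
    """Assumption of this sketch: None or a blank string means 'no data'."""
    return value is not None and value.strip() != ""


def resolve_user_identifier(first_party_id: str,
                            third_party_id: Optional[str],
                            account_id: Optional[str]) -> str:
    """Pick the user identifier for one access record."""
    # Cases one and two: an account ID exists, so it is used whether or not
    # a third-party cookie ID was captured.
    if _has_data(account_id):
        return account_id
    # Case three: no account ID, but a third-party cookie ID exists.
    if _has_data(third_party_id):
        return third_party_id
    # Case four: neither exists, so fall back to the first-party cookie ID.
    return first_party_id
```

Because cases one and two both select the account ID, the check on the account ID can come first, and the third-party cookie ID only matters when the account ID is missing.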
Optionally, the apparatus further comprises a storage subunit, which is used for storing each number and the user identifier corresponding to that number in a database after the user identifier of the access record corresponding to each number has been determined according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to that number.
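The storage step could look like the following sketch. The source does not name a database or a schema, so the use of SQLite, the table name `record_user`, and the column names `record_no` and `user_identifier` are hypothetical choices made only for illustration.

```python
import sqlite3
from typing import Iterable, Tuple


def store_user_identifiers(conn: sqlite3.Connection,
                           rows: Iterable[Tuple[int, str]]) -> None:
    """Persist (record number, user identifier) pairs."""
    # Hypothetical schema: one row per numbered access record.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS record_user ("
        "record_no INTEGER PRIMARY KEY, "
        "user_identifier TEXT NOT NULL)"
    )
    # INSERT OR REPLACE keeps the function idempotent when a log is reprocessed.
    conn.executemany(
        "INSERT OR REPLACE INTO record_user (record_no, user_identifier) "
        "VALUES (?, ?)",
        rows,
    )
    conn.commit()
```

A caller might open an in-memory database with `sqlite3.connect(":memory:")` and pass a list such as `[(1, "alice"), (2, "tp2")]`.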
The device for determining the user identification comprises a processor and a memory, wherein the acquiring unit, the extracting unit, the determining unit and the like are stored in the memory as program units, and the processor executes the program units stored in the memory.
The processor comprises a kernel, and the kernel calls the corresponding program unit from the memory. One or more kernels may be provided. By adjusting the kernel parameters, the technical problem in the prior art that the user identifier cannot be accurately determined from the first-party cookie or the third-party cookie alone is solved, and the technical effect of improving the accuracy of the determined user identifier is achieved.
The memory may include forms of computer-readable media such as volatile memory, random access memory (RAM), and/or non-volatile memory, for example read-only memory (ROM) or flash memory (flash RAM). The memory includes at least one memory chip.
The present application further provides an embodiment of a computer program product which, when executed on a data processing device, is adapted to execute program code initialized with the following method steps: acquiring an access log of a target website; extracting cookie information and an account ID from each access record of the access log; and determining the user identifier of each access record according to the cookie information and the account ID.
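The three method steps can be illustrated end to end with a short sketch. The source does not specify the access log format, so this example assumes one JSON object per line with the hypothetical keys `fp_cookie`, `tp_cookie`, and `account`; the function names are likewise illustrative.

```python
import json
from typing import Dict, Iterable, List, Optional, Tuple

# (number, first-party cookie ID, third-party cookie ID, account ID)
Record = Tuple[int, str, Optional[str], Optional[str]]


def extract_records(log_lines: Iterable[str]) -> List[Record]:
    """Extract the three IDs from each access record and number the records."""
    records: List[Record] = []
    number = 0
    for line in log_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines; they are not access records
        number += 1
        entry = json.loads(line)
        records.append((
            number,
            entry["fp_cookie"],
            entry.get("tp_cookie") or None,  # empty string counts as missing
            entry.get("account") or None,
        ))
    return records


def determine_user_identifiers(records: Iterable[Record]) -> Dict[int, str]:
    """Prefer the account ID, then the third-party cookie ID, then the first-party one."""
    return {
        number: account or third_party or first_party
        for number, first_party, third_party, account in records
    }
```

Given the lines of an access log as `log_lines`, calling `determine_user_identifiers(extract_records(log_lines))` returns a mapping from record number to user identifier.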
The serial numbers of the above embodiments of the present application are for description only and do not indicate the relative merits of the embodiments.
In the above embodiments of the present application, the description of each embodiment has its own emphasis. For parts that are not described in detail in one embodiment, reference may be made to the related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The apparatus embodiments described above are merely illustrative. For example, the division into units may be a logical division, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, units, or modules, and may be electrical or take other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units; they may be located in one place or distributed over a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present application in essence, or the part of it that contributes over the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The software product is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the methods described in the embodiments of the present application. The aforementioned storage medium includes any medium capable of storing program code, such as a USB flash drive, a read-only memory (ROM), a random access memory (RAM), a removable hard disk, a magnetic disk, or an optical disc.
The foregoing is only a preferred embodiment of the present application. It should be noted that those skilled in the art can make several improvements and modifications without departing from the principle of the present application, and these improvements and modifications should also be regarded as falling within the protection scope of the present application.

Claims (8)

1. A method for determining a user identity, comprising:
Acquiring an access log of a target website;
Extracting cookie information and an account ID from each access record of the access log; and
Determining the user identification of each access record according to the cookie information and the account ID;
wherein the cookie information comprises a first-party cookie and a third-party cookie, and wherein extracting cookie information and an account ID from each access record of the access log comprises: extracting the ID of the first-party cookie, the ID of the third-party cookie and the account ID from each access record; and numbering each access record in sequence, and storing each number together with the ID of the first-party cookie, the ID of the third-party cookie and the account ID corresponding to the number;
wherein determining the user identifier of each access record according to the cookie information and the account ID comprises:
determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie and the account ID corresponding to the number.
2. The method of claim 1, wherein determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to the number comprises:
judging whether data exists in the ID of the third-party cookie corresponding to the number and judging whether data exists in the account ID; and
determining that the user identifier of the access record corresponding to the number is the account ID when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID.
3. The method of claim 2, wherein determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to the number further comprises:
determining that the user identifier of the access record corresponding to the number is the account ID when it is judged that data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID.
4. The method of claim 2, wherein determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to the number further comprises:
determining that the user identifier of the access record corresponding to the number is the ID of the third-party cookie when it is judged that data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID.
5. The method of claim 2, wherein determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to the number further comprises:
determining that the user identifier of the access record corresponding to the number is the ID of the first-party cookie when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and no data exists in the account ID.
6. The method of claim 1, wherein after determining the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie, and the account ID corresponding to the number, the method further comprises:
storing each number and the user identifier corresponding to the number in a database.
7. An apparatus for determining a user identity, comprising:
an acquisition unit, configured to acquire an access log of a target website;
an extracting unit, configured to extract cookie information and an account ID from each access record of the access log; and
a determining unit, configured to determine, according to the cookie information and the account ID, a user identifier of each access record;
wherein the cookie information comprises a first-party cookie and a third-party cookie, and wherein the extracting unit includes: an extracting subunit, configured to extract, from each of the access records, the ID of the first-party cookie, the ID of the third-party cookie, and the account ID; and a processing subunit, configured to number each access record in sequence and to store each number together with the ID of the first-party cookie, the ID of the third-party cookie and the account ID corresponding to the number;
wherein the determining unit includes:
a determining subunit, configured to determine the user identifier of the access record corresponding to each number according to the ID of the first-party cookie, the ID of the third-party cookie and the account ID corresponding to the number.
8. The apparatus of claim 7, wherein the determining subunit comprises:
a judging module, configured to judge whether data exists in the ID of the third-party cookie corresponding to the number and to judge whether data exists in the account ID; and
a first determining module, configured to determine that the user identifier of the access record corresponding to the number is the account ID when it is judged that no data exists in the ID of the third-party cookie corresponding to the number and data exists in the account ID.
CN201510742866.4A 2015-11-04 2015-11-04 method and device for determining user identification Active CN106656542B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510742866.4A CN106656542B (en) 2015-11-04 2015-11-04 method and device for determining user identification

Publications (2)

Publication Number Publication Date
CN106656542A CN106656542A (en) 2017-05-10
CN106656542B true CN106656542B (en) 2019-12-10

Family

ID=58850926

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510742866.4A Active CN106656542B (en) 2015-11-04 2015-11-04 method and device for determining user identification

Country Status (1)

Country Link
CN (1) CN106656542B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103297405A (en) * 2012-03-02 2013-09-11 阿里巴巴集团控股有限公司 User identification and information delivery method and device
CN104281717A (en) * 2014-10-31 2015-01-14 晶赞广告(上海)有限公司 Method for establishing massive ID mapping relation

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2534632B1 (en) * 2010-02-08 2017-01-18 Facebook, Inc. Communicating information in a social network system about activities from another domain

Also Published As

Publication number Publication date
CN106656542A (en) 2017-05-10

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

GR01 Patent grant