CN106202117B - Data processing method and device and server - Google Patents

Data processing method and device and server Download PDF

Info

Publication number
CN106202117B
CN106202117B CN201510230290.3A CN201510230290A CN106202117B CN 106202117 B CN106202117 B CN 106202117B CN 201510230290 A CN201510230290 A CN 201510230290A CN 106202117 B CN106202117 B CN 106202117B
Authority
CN
China
Prior art keywords
data
same
user
terminal device
unique identifier
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510230290.3A
Other languages
Chinese (zh)
Other versions
CN106202117A (en
Inventor
陈树根
曾辉
钱波
席晓鸣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510230290.3A priority Critical patent/CN106202117B/en
Publication of CN106202117A publication Critical patent/CN106202117A/en
Application granted granted Critical
Publication of CN106202117B publication Critical patent/CN106202117B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The embodiment of the invention provides a data processing method, a data processing device and a server. The data processing method comprises the following steps: collecting data reported by at least one terminal device, wherein the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device; analyzing the recorded data and/or the equipment attribute information according to a preset data association algorithm, and determining the unique identification of the terminal equipment belonging to the same user; and matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model. The embodiment of the invention improves the efficiency of data mining and the accuracy of user behavior analysis.

Description

Data processing method and device and server
Technical Field
The embodiment of the invention relates to a data mining technology, in particular to a data processing method, a data processing device and a server.
Background
In the era of mobile internet, based on the diversity of terminals, the information that a user can reach is no longer single, the user data generated by the user data is also changed greatly, and the identification of user behaviors by using the traditional recorded data (cookie) is limited. A user watches a friend circle by a mobile phone from the beginning to the end of a day, watches news by a tablet personal computer (pad for short), purchases and socializes by a computer, works by different machines, so that the user can obtain information and make decisions by different terminal devices at different times, but user data scattered on each terminal device cannot completely represent one person, only the user data are collected and spliced, one person can be completely depicted, the person can almost correspond to a certain user in the real world, and therefore the identified user behaviors can be analyzed, and planning which is beneficial to product development is made.
The same user usually uses a plurality of terminal devices, and it is desired to completely delineate which terminal devices belong to the same user from a large number of terminal devices, which requires sufficient data mining of user data in a large number of terminal devices. However, not only are the device identifiers of the current terminal devices inconsistent and independent from each other, but also there are terminal devices such as a emulational machine, a dual-card dual-standby machine, and a terminal that has been swiped through a ROM that do not have a unique identifier and cannot be distinguished, which makes data mining very difficult and inefficient, resulting in deviation of user behavior analysis and even errors.
Disclosure of Invention
The embodiment of the invention provides a data processing method, a data processing device and a server, which are used for improving the efficiency of data mining and the accuracy of user behavior analysis.
In a first aspect, an embodiment of the present invention provides a data processing method, including:
collecting data reported by at least one terminal device, wherein the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device;
analyzing the recorded data and/or the equipment attribute information according to a preset data association algorithm, and determining the unique identification of the terminal equipment belonging to the same user;
and matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model.
Further, the analyzing the recorded data and the device attribute information according to a preset data association algorithm to determine a unique identifier of a terminal device belonging to the same user includes:
determining at least one preset condition according to a preset data association algorithm, and analyzing the recorded data and/or the equipment attribute information according to the preset condition, wherein the preset condition comprises conditions corresponding to at least one type of information in position information, account information and user attributes;
and determining the unique identifier corresponding to the recorded data and/or the equipment attribute information meeting the preset condition as the unique identifier of the terminal equipment belonging to the same user.
Further, the preset conditions comprise that resident positions are the same or similar, and the same account information is adopted for logging in the same application or webpage;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and/or the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the recorded data which satisfies the same or similar resident positions and adopts the same account information for logging in the same application or webpage as the unique identifier of the terminal equipment belonging to the same user.
Further, the preset conditions include that the user genders are the same, the historical records of the browsers are similar, and the model of the terminal device is the same as that of the operating system;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and/or the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the attribute information of the terminal equipment with the same terminal equipment model and the same operating system as the unique identifier of the terminal equipment belonging to the same user.
Further, the preset condition includes that the same cloud disk account information is used for logging in the same cloud disk;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and/or the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the recorded data which satisfies that the same cloud disk is logged in by the same cloud disk and the same cloud disk account information in the recorded data as the unique identifier of the terminal equipment belonging to the same user.
Further, before collecting the data reported by the at least one terminal device, the method further includes:
receiving an account request sent by the at least one terminal device;
and allocating the unique identifier to the at least one terminal device according to the account request, and returning the unique identifier to the at least one terminal device.
Further, before analyzing the recorded data and/or the device attribute information according to a preset data association algorithm and determining a unique identifier of a terminal device belonging to the same user, the method further includes:
and classifying and sorting the data according to the application of the recorded data generated in the terminal equipment and/or the equipment attribute information, and filtering error or incomplete data.
Further, the method further comprises:
verifying whether the determined unique identifier of the terminal equipment belonging to the user is consistent with the registered information according to the information of the terminal equipment of the user registered in advance;
and adjusting the preset data association algorithm according to a verification result, and updating and maintaining the unique identifier of the terminal equipment belonging to the same user.
In a second aspect, an embodiment of the present invention provides a data processing apparatus, including:
the system comprises a collection module, a processing module and a processing module, wherein the collection module is used for collecting data reported by at least one terminal device, and the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device;
the data analysis module is used for analyzing the recorded data and/or the equipment attribute information according to a preset data association algorithm and determining the unique identification of the terminal equipment belonging to the same user;
and the behavior matching module is used for matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model and pushing data according to the user behavior model.
Further, the data analysis module includes:
the analysis unit is used for determining at least one preset condition according to a preset data association algorithm and analyzing the recorded data and/or the equipment attribute information according to the preset condition, wherein the preset condition comprises conditions corresponding to at least one type of information in position information, account information and user attributes;
and the identification determining unit is used for determining the unique identification corresponding to the recorded data and/or the equipment attribute information meeting the preset conditions as the unique identification of the terminal equipment belonging to the same user.
Further, the preset conditions comprise that resident positions are the same or similar, and the same account information is adopted for logging in the same application or webpage;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, a unique identification corresponding to the recorded data that satisfies that resident positions in the recorded data are the same or similar and that logs in the same application or web page using the same account information.
Further, the preset conditions include that the user genders are the same, the historical records of the browsers are similar, and the model of the terminal device is the same as that of the operating system;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, the recorded data that satisfies the condition that the user has the same gender and the historical records of the browser are similar, and the unique identification corresponding to the attribute information of the terminal device having the same terminal device model and operating system.
Further, the preset condition includes that the same cloud disk account information is used for logging in the same cloud disk;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, a unique identification corresponding to the recording data that satisfies that the same cloud disk is logged in by using the same cloud disk account information in the recording data.
Further, the apparatus further comprises:
the receiving and sending module is used for receiving an account request sent by the at least one terminal device;
the identification distribution module is used for distributing the unique identification to the at least one terminal device according to the account request;
the transceiver module is further configured to return the unique identifier to the at least one terminal device.
Further, the apparatus further comprises:
and the data sorting module is used for sorting and sorting the data according to the application of the recorded data generated in the terminal equipment and/or the equipment attribute information and filtering wrong or incomplete data.
Further, the apparatus further comprises:
the verification module is used for verifying whether the determined unique identifier of the terminal equipment belonging to the user is consistent with the registered information or not according to the information of the terminal equipment of the user registered in advance;
and the maintenance module is used for adjusting the preset data association algorithm according to the verification result and updating and maintaining the unique identifier of the terminal equipment belonging to the same user.
In a third aspect, an embodiment of the present invention provides a server, including: a communication device, a processor, and a memory; the communication device is used for carrying out data interaction with at least one terminal device; the processor is configured to perform the method of the first aspect; the memory is to store processor-executable instructions.
According to the data processing method, the data processing device and the server, the terminal devices belonging to the same user are determined by collecting the data with the unique identification sent by the terminal devices and carrying out data association analysis on the data, the data reported by the terminal devices are matched with the pre-established user behavior model, the matched user behavior model is determined for data pushing, and the data mining efficiency and the accuracy of user behavior analysis are improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a schematic diagram of a server according to the present invention;
FIG. 2 is a flow chart of one embodiment of a data processing method of the present invention;
FIG. 3 is a flow chart of another embodiment of a data processing method of the present invention;
FIG. 4 is a schematic diagram of a default condition setting interface of the data processing method of the present invention;
FIG. 5 is a block diagram of a data processing apparatus according to an embodiment of the present invention;
FIG. 6 is a block diagram of another embodiment of a data processing apparatus according to the present invention;
FIG. 7 is a schematic diagram of a data processing apparatus according to a third embodiment of the present invention;
FIG. 8 is a diagram of a data processing apparatus according to a fourth embodiment of the present invention
Fig. 9 is a block diagram showing a partial structure related to a server provided by an embodiment of the present invention.
Detailed Description
In order to solve the above problems, the present invention provides a data processing method. In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The terms "comprises" and "comprising," and any variations thereof, in the description and claims of this invention and the above-described drawings, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of steps or modules is not necessarily limited to those steps or modules explicitly listed, but may include other steps or modules not expressly listed or inherent to such process, method, article, or apparatus.
Fig. 1 is a schematic structural diagram of a server applied in the present invention, and as shown in fig. 1, the server of this embodiment may include: a communication device 11, such as an antenna, a wireless network card, etc., a processor 12, such as a Central Processing Unit (CPU), a memory 13, at least one bus 14, and an operating system 15 running on hardware. The bus 14 is used to enable communication connections between the elements. The Memory 13 may include a Random Access Memory (RAM) Memory, and may further include a NonVolatile Memory (NVM), such as at least one disk Memory, where various programs may be stored in the Memory for completing various processing functions and implementing the method steps of this embodiment.
It will be understood by those skilled in the art that the server shown in fig. 1 may also include various other possible elements, such as input/output modules, databases, etc., and the present invention is not limited thereto. The server in this embodiment may be, for example, various Application (APP) servers, an operator server, and the APP may be an Application that is carried by the terminal device when the terminal device leaves a factory, or an Application that is downloaded and installed on the terminal device from the internet by a user. The relevant program code of the server can be stored in the memory 13 of the server shown in fig. 1.
The server can collect data from at least one terminal device through the communication device 11, the processor 12 obtains the executable instructions in the memory 13, analyzes the data according to the executable instructions to determine the terminal devices belonging to the same user, and builds a complete user image by matching the user behaviors of the data reported by a plurality of terminal devices, so as to realize data push.
The technical solution of the present invention will be described in detail below with specific examples. The following several specific embodiments may be combined with each other, and details of the same or similar concepts or processes may not be repeated in some embodiments.
Next, fig. 2 to 3 illustrate different embodiments of the method for setting a pattern lock according to the present invention.
Fig. 2 is a flowchart of an embodiment of a data processing method of the present invention, as shown in fig. 2, an execution subject of the embodiment may be the server shown in fig. 1, and the server performs data processing by the method of the embodiment, where the method of the embodiment may include:
step 101, collecting data reported by at least one terminal device, wherein the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device;
when a user uses an APP installed on a terminal device or the function of the terminal device itself, recorded data may be generated locally, where the recorded data may be operation records of the user, background records of the APP or the function, account information of the user using the APP or the function, a location tag, and the like, and these data are recorded and sent to a server, so as to facilitate subsequent development of an APP or a terminal device developer. In addition, the terminal stores device attribute information when leaving the factory, where the device attribute information includes a device model, an International Mobile Equipment Identity (IMEI), operating system information, and the like. When the recorded data and the device attribute information are reported to the server, the terminal device marks the data with a pre-allocated unique identifier, so that the server can know which terminal device the collected data is sent by.
102, analyzing the recorded data and/or the equipment attribute information according to a preset data association algorithm, and determining a unique identifier of terminal equipment belonging to the same user;
the server analyzes the collected recording data and/or the equipment attribute information, finds out the association relation among a large amount of data by establishing a data association model and a preset data association algorithm, determines which data are possibly generated by the same user operation, and determines the terminal equipment belonging to the same user by analyzing and mining a large amount of data. For example, the geographic location where the same user frequently comes and goes is relatively fixed, and the terminal devices commonly used by the user, such as a mobile phone, a tablet computer, and the like, are also fixed, so that the positions displayed by the data reported by the terminal devices are the same or similar; for another example, if the same account is used by a mobile game played by the same user, the terminal device that logs in the mobile game using the account is likely to belong to the same user. Therefore, after the server collects the data reported by the terminal devices, a large amount of data is analyzed through a preset data association algorithm, association relations among the terminal devices are considered from multiple dimensions, and the terminal devices are determined to belong to the same user and obtain the unique identification of the terminal devices.
And 103, matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model.
The data reported by a single terminal device may not be able to analyze complete user behaviors, for example, a user is used to carry a mobile phone with him, use the mobile phone to send information, play games, perform online shopping, make calls, etc., but a tablet computer is used only at home, and the tablet computer is used to play games, perform online shopping, watch videos, etc., so that the two devices respectively undertake partial behaviors of the user, and the user behaviors undertaken by the two devices are not completely coincident, and under such a condition, the server cannot acquire complete user behavior information by analyzing the mobile phone or the tablet computer alone, and only can partially know the preference of the user, etc. Therefore, the server can determine a relatively complete user behavior pattern by determining which terminal devices belong to the same user and analyzing the terminal devices, so that information which is interested by some users can be pushed to the terminal devices of the user, for example, some products are collected by the user on a mobile phone through a shopping APP, and then some similar products can be recommended to the user through the shopping APP on a tablet personal computer, or product price comparison information is provided.
In this embodiment, the data with the unique identifier sent by the terminal device is collected and subjected to data association analysis to determine the terminal devices belonging to the same user, and then the data reported by the terminal devices are matched with the pre-established user behavior model to determine the matched user behavior model for data pushing, so that the data mining efficiency and the accuracy of the user behavior analysis are improved.
Further, when the server determines which terminal devices belong to the same user, some verification methods may be adopted, and the specific method may be: verifying whether the determined unique identifier of the terminal equipment belonging to the user is consistent with the registered information according to the information of the terminal equipment of the user registered in advance; and adjusting the preset data association algorithm according to a verification result, and updating and maintaining the unique identifier of the terminal equipment belonging to the same user.
The server can register the information of the terminal equipment of some users in advance, for example, the information that the mobile phone a, the tablet computer b and the game machine c belong to a first user is recorded; and then analyzing the data reported by the plurality of terminal devices according to a preset data association algorithm to determine that the mobile phone a and the tablet computer b belong to the first user, verifying the analysis result and the pre-registered information to know that the determined terminal device really belongs to the first user but lacks the game machine c, so that the preset data association algorithm is adjusted to determine that the mobile phone a, the tablet computer b and the game machine c belong to the first user through data analysis. The preset association algorithm can be continuously adjusted in the data analysis process, the algorithm is continuously trained and perfected, and the unique identification of the terminal equipment belonging to the same user can be updated and maintained along with the perfection process so as to obtain an analysis result closer to the real condition.
Fig. 3 is a flowchart of another embodiment of the data processing method of the present invention, as shown in fig. 3, an execution main body of the present embodiment may be the server shown in fig. 1, and the server performs data processing by the method of the present embodiment, where the method of the present embodiment may include:
step 201, receiving an account request sent by at least one terminal device;
at present, a plurality of terminal devices such as a emulational mobile phone, a dual-card dual-standby mobile phone and a terminal with a Read Only Memory (ROM) are available, and the devices do not have an International Mobile Equipment Identity (IMEI) or carry a pseudo-serial number, so that the devices cannot be identified by a server and cannot acquire recorded data and device attribute information of the devices correctly. In order to realize the unified management of the terminal equipment, the server can distribute a unified unique identifier to the terminal equipment, the terminal equipment can request the unique identifier from the server by setting an identifier management module, and the terminal equipment provided with the identifier management module can request the unique identifier without being influenced by whether the terminal equipment is a terminal equipment such as a emulational machine, a dual-card dual-standby machine, a terminal with a Read Only Memory (ROM) and the like.
Step 202, allocating the unique identifier to the at least one terminal device according to the account request, and returning the unique identifier to the at least one terminal device;
step 203, collecting data reported by the at least one terminal device, where the data includes a unique identifier pre-allocated to the terminal device, recorded data generated by a user operating the terminal device, and device attribute information of the terminal device;
step 203 is similar to step 101 of the above method embodiment, and is not described again here.
Step 204, classifying and sorting the data according to the application of the recorded data generated in the terminal equipment and/or the equipment attribute information, and filtering error or incomplete data;
the data collected by the server may be sent by each APP or terminal function, and each application sending the data may have a different data format, so that the server may classify the recorded data according to the application after acquiring the recorded data, for example, the data of WeChat and QQ, and the server may store the data respectively. In addition, the terminal device itself may have its device type, operating system, and the like as criteria for data classification. When the server arranges the collected data, some data with lost packets and data with error check can be filtered out, and the rest data can be classified, so that the corresponding data can be quickly read when the data are analyzed.
Further, in the process of arranging the data by the server, the terminal devices can be classified according to the device attribute information, which is mainly divided into a emulational machine and a branded machine, if the branded machine server can directly determine the machine type according to the device attribute information, if the emulational machine server can have the problems of camouflage, error and the like of the device number information, the server may also need to analyze according to the recorded data to identify that the branded machine server is not the branded machine type.
Step 205, determining at least one preset condition according to a preset data association algorithm, and analyzing the recorded data and/or the device attribute information according to the preset condition, wherein the preset condition includes a condition corresponding to at least one type of information among location information, account information, and user attributes;
the server analyzes the data, namely, according to at least one preset condition, the data reported by the terminal equipment of the same user always have some association relation, and the overlapped data among the terminal equipment can be screened out according to the preset condition determined by a preset data association algorithm. Generally, the preset condition may include a condition corresponding to at least one of location information (e.g., Global Positioning System (GPS) coordinates), account information (e.g., wechat account, panning account, QQ account, etc.), and user attributes (e.g., user gender, age, frequent location, etc.).
Step 206, determining the unique identifier corresponding to the recorded data and/or the device attribute information meeting the preset condition as the unique identifier of the terminal device belonging to the same user;
the server may determine the unique identifier of the terminal device belonging to the same user in the following specific implementation manners, but is not limited to the following manners, as long as a method that can easily occur to those skilled in the art to determine the unique identifier of the terminal device belonging to the same user can be taken as an embodiment of the present invention, and is not limited specifically herein.
The first method comprises the following steps: the preset conditions comprise that resident positions are the same or similar, and the same account information is adopted for logging in the same application or webpage; and determining the unique identifier corresponding to the recorded data which satisfies the same or similar resident positions and adopts the same account information for logging in the same application or webpage as the unique identifier of the terminal equipment belonging to the same user.
The server presets that terminal devices which reside at the same or similar positions and which log in the same application or webpage and use the same account information are determined to be terminal devices belonging to the same user, for example, the server selects the resident position from a large amount of recorded data as the recorded data at or near the position a, which means that the terminal devices reporting the recorded data often appear around the landmark at the position a, which may be the same user, but may also be a plurality of users often gather around the position a, so further, the server selects the recorded data which log in the same application or webpage and use the same account information from the recorded data, and then often appear at the same or similar positions, and logs in the same application or webpage with the same account, such as QQ, wechat, microblog and the like, and the probability that the terminal devices sending the recorded data belong to the same user is very high, the server may determine the unique identifier corresponding to such recorded data as the unique identifier of the terminal device belonging to the same user.
And the second method comprises the following steps: the preset conditions comprise that the user genders are the same, the historical records of the browsers are similar, and the types of the terminal equipment and the operating system are the same; and determining the unique identifier corresponding to the attribute information of the terminal equipment with the same terminal equipment model and the same operating system as the unique identifier of the terminal equipment belonging to the same user.
The server determines the terminal devices with the same gender and the same browser history as well as the same terminal device model and operating system as the terminal devices belonging to the same user, for example, the server obtains the gender of the user according to the subscription information filled when the user applies for a client Identity Module (SIM), and also obtains the gender of the user according to the webpage information frequently browsed by the user, the installed APP and the like, and the user has own preference, so that the contents of the webpage and the favorite frequently browsed by any terminal device are generally similar, and the user also has certain preference for the terminal model, the operating system and the like, so that the three types of information can be combined to serve as preset conditions for determining the terminal devices belonging to the same user.
And the third is that: the preset conditions comprise that the same cloud disk account information is adopted for logging in the same cloud disk; and determining the unique identifier corresponding to the recorded data which satisfies that the same cloud disk is logged in by the same cloud disk and the same cloud disk account information in the recorded data as the unique identifier of the terminal equipment belonging to the same user.
At present, a cloud disk is a very common information sharing technology, and in order to facilitate sharing of information on multiple devices, a user can be realized through the cloud disk, so that a server can determine terminal devices which use the same account number to log in the same cloud disk as the terminal devices belonging to the same user.
Furthermore, the server can adopt the same account information according to the same age of the user and the same login of the same application or webpage; the resident locations are the same or similar, and the model of the terminal device and the operating system are the same as each other as the preset condition, which is not specifically limited herein.
And step 207, matching the recorded data and the device attribute information reported by the terminal devices belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model.
Step 207 is similar to step 103 of the above method embodiment and is not described here again.
In this embodiment, the data with the unique identifier sent by the terminal device is collected and subjected to data association analysis to determine the terminal devices belonging to the same user, and then the data reported by the terminal devices are matched with the pre-established user behavior model to determine the matched user behavior model for data pushing, so that the data mining efficiency and the accuracy of the user behavior analysis are improved.
Fig. 4 is a schematic diagram of a preset condition setting interface of the data processing method of the present invention, as shown in fig. 4, a server may display a page with preset conditions to a user in a web page manner, the user with the server usage right logs in the web page, the resident positions are selected to be the same or similar in preset condition 1, the login WeChat is selected to use the same account information in preset condition 2, and the two conditions of the preset condition 1 and the preset condition 2 need to be satisfied at the same time, so that "and" exists "between the two conditions. After the preset conditions are set by the user, the server starts to analyze and filter the collected data, wherein the data comprises data accumulated by the server all the time and come from different terminal devices.
Optionally, the user may set one or more preset conditions on the web page, and the data is analyzed through a combination of the preset conditions. In addition, the webpage can also provide a block for inputting characters for the user, for example, the user wants that the preset condition can be more accurate, and the preset condition is not corresponding to the preset option, so the preset condition can be increased by means of manual input.
Fig. 5 is a schematic structural diagram of an embodiment of a data processing apparatus according to the present invention, and as shown in fig. 5, the apparatus of the embodiment may include: the system comprises a collection module 11, a data analysis module 12 and a behavior matching module 13, wherein the collection module 11 is used for collecting data reported by at least one terminal device, and the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device; the data analysis module 12 is configured to analyze the recorded data and/or the device attribute information according to a preset data association algorithm, and determine a unique identifier of a terminal device belonging to the same user; and a behavior matching module 13, configured to match the recorded data and the device attribute information reported by the terminal devices belonging to the same user with a pre-established user behavior model, determine a matched user behavior model, and perform data pushing according to the user behavior model.
The apparatus of this embodiment may be used to implement the technical solution of the method embodiment shown in fig. 2, and the implementation principle and the technical effect are similar, which are not described herein again.
Fig. 6 is a schematic structural diagram of another embodiment of the data processing apparatus of the present invention, as shown in fig. 6, the apparatus of this embodiment is based on the apparatus structure shown in fig. 5, and further, the data analysis module 12 may include: the device comprises an analysis unit 121 and an identifier determination unit 122, wherein the analysis unit 121 is configured to determine at least one preset condition according to a preset data association algorithm, and analyze the recording data and/or the device attribute information according to the preset condition, where the preset condition includes a condition corresponding to at least one type of information among location information, account information, and user attributes; an identifier determining unit 122, configured to determine the unique identifier corresponding to the recording data and/or the device attribute information that meet the preset condition as a unique identifier of the terminal device that belongs to the same user.
The apparatus of this embodiment may be used to implement the technical solution of the method embodiment shown in fig. 2, and the implementation principle and the technical effect are similar, which are not described herein again.
Further, the preset conditions comprise that resident positions are the same or similar, and the same account information is adopted for logging in the same application or webpage; the identifier determining unit 122 is specifically configured to determine, as the unique identifier of the terminal device belonging to the same user, a unique identifier corresponding to the recorded data that satisfies that the resident positions in the recorded data are the same or similar, and that logs in the same application or the same webpage and uses the same account information.
Further, the preset conditions include that the user genders are the same, the historical records of the browsers are similar, and the model of the terminal device is the same as that of the operating system; the identifier determining unit 122 is specifically configured to determine, as the unique identifier of the terminal device belonging to the same user, the recorded data that satisfies that the user has the same gender and the similar historical records of the browser, and the unique identifier corresponding to the attribute information of the terminal device having the same terminal device model and the same operating system.
Further, the preset condition includes that the same cloud disk account information is used for logging in the same cloud disk; the identifier determining unit 122 is specifically configured to determine, as the unique identifier of the terminal device belonging to the same user, a unique identifier corresponding to the record data that satisfies that the same cloud disk is logged in by using the same cloud disk account information in the record data.
Fig. 7 is a schematic structural diagram of a data processing apparatus according to a third embodiment of the present invention, and as shown in fig. 7, the apparatus of this embodiment may further include, on the basis of the apparatus structure shown in fig. 6: the terminal equipment comprises a transceiver module 14, an identifier distribution module 15 and a data sorting module 16, wherein the transceiver module 14 is used for receiving an account request sent by the at least one terminal equipment; the identifier allocating module 15 is configured to allocate the unique identifier to the at least one terminal device according to the account request; the transceiver module 14 is further configured to return the unique identifier to the at least one terminal device. And the data sorting module 16 is configured to sort and sort the data according to the application of the recorded data generated in the terminal device and/or the device attribute information, and filter out erroneous or incomplete data.
The apparatus of this embodiment may be used to implement the technical solutions of the method embodiments shown in fig. 2 or fig. 3, and the implementation principles and technical effects are similar, which are not described herein again.
Fig. 8 is a schematic structural diagram of a fourth embodiment of the data processing apparatus of the present invention, and as shown in fig. 8, the apparatus of this embodiment may further include, on the basis of the apparatus structure shown in fig. 5: the authentication module 17 is used for verifying whether the determined unique identifier of the terminal equipment belonging to the user is consistent with the registered information or not according to the information of the terminal equipment of the user registered in advance; and the maintenance module 18 is configured to adjust the preset data association algorithm according to the verification result, and update and maintain the unique identifier of the terminal device belonging to the same user.
The apparatus of this embodiment may be used to implement the technical solutions of the method embodiments shown in fig. 2 or fig. 3, and the implementation principles and technical effects are similar, which are not described herein again.
The server provided in the embodiment of the present invention may be an application server, an operator server, or the like, fig. 9 is a block diagram illustrating a partial structure related to the server provided in the embodiment of the present invention, and referring to fig. 9, the server includes: radio Frequency (RF) circuit 10, memory 20, input unit 30, display unit 40, processor 50, and power supply 60. Those skilled in the art will appreciate that the server architecture shown in FIG. 9 does not constitute a limitation on the servers, and may include more or fewer components than shown, or some components in combination, or a different arrangement of components.
The following describes each component of the server in detail with reference to fig. 9:
the RF circuit 10 may be used for receiving and transmitting data during data transceiving or communication, and in particular, for processing the received data of the terminal device to the processor 50; in addition, the assigned unique identifier is sent to the terminal device. In general, RF circuit 10 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. Further, the RF circuit 10 can also communicate with a terminal device by wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to global system for Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE)), e-mail, Short message Service (Short Messaging Service: SMS), etc.
The memory 20 may be used to store executable instructions and modules, and the processor 60 performs various functions, applications, and data processing of the server by executing the executable instructions and modules stored in the memory 20. The memory 20 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function, and the like; the storage data area may store data created according to the use of the server, and the like. Further, the memory 20 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
The input unit 30 may be used to receive input numeric or character information. Specifically, the input unit 30 may include a keyboard mouse 31 and other input devices 32.
The display unit 40 may be used to display information input by a user or information provided to the user and various menus of the server. The display unit 40 may include a display panel 41, and optionally, the display panel 41 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.
The processor 50 is a control center of the server, connects various parts of the entire server using various interfaces and lines, performs various functions of the server and processes data by operating or executing executable instructions and/or modules stored in the memory 20 and calling data stored in the memory 20, thereby performing overall monitoring of the server. Alternatively, processor 50 may include one or more processing units; preferably, the processor 50 may integrate an application processor, which mainly handles operating systems, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 50.
The server also includes a power supply 60 for powering the various components, which may preferably be logically connected to the processor 50 via a power management system to manage charging, discharging, and power consumption management functions via the power management system.
In the embodiment of the present invention, the processor 50 included in the terminal further has the following functions: analyzing the recorded data and/or the equipment attribute information according to a preset data association algorithm, and determining the unique identification of the terminal equipment belonging to the same user; and matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model.
For how the server determines the unique identifier of the terminal device belonging to the same user and determines the matched user behavior model, reference may be made to the detailed description of the above data processing method embodiment, which is not described herein again. Those of ordinary skill in the art will understand that: all or a portion of the steps of implementing the above-described method embodiments may be performed by hardware associated with program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (10)

1. A data processing method, comprising:
receiving an account request sent by at least one terminal device;
allocating a unique identifier to the at least one terminal device according to the account request, and returning the unique identifier to the at least one terminal device;
collecting data reported by the at least one terminal device, wherein the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device; the device attribute information includes: equipment model, international identity code of mobile equipment and operating system information;
recording data reported by the at least one terminal device by using a pre-allocated unique identifier;
classifying and sorting the data according to the application of the recorded data generated in the terminal equipment and/or the equipment attribute information, and filtering error or incomplete data;
displaying a page with preset conditions in a webpage mode, and receiving at least one preset condition selected from the page with the preset conditions;
analyzing the recorded data and the equipment attribute information according to the at least one preset condition through a pre-established data association model and a preset data association algorithm, determining the unique identification corresponding to the recorded data and the equipment attribute information meeting the preset condition as terminal equipment belonging to the same user, and acquiring the unique identification of the terminal equipment belonging to the same user;
matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model, and pushing data according to the user behavior model;
wherein the method further comprises:
pre-registering a plurality of terminal devices belonging to a first user, analyzing data reported by the plurality of terminal devices of the first user according to a preset data association algorithm, and verifying whether the determined unique identifier of the terminal device belonging to the first user is consistent with pre-registered information;
and training and adjusting the preset data association algorithm according to a verification result, and updating and maintaining the unique identifier of the terminal equipment belonging to the same user.
2. The method according to claim 1, wherein the preset conditions include that resident locations are the same or similar, and the same account information is adopted for logging in the same application or webpage;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and/or the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the recorded data which satisfies the same or similar resident positions and adopts the same account information for logging in the same application or webpage as the unique identifier of the terminal equipment belonging to the same user.
3. The method according to claim 1, wherein the preset conditions comprise that the user gender is the same, the historical records of the browsers are similar, and the model of the terminal device is the same as the operating system;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the attribute information of the terminal equipment with the same terminal equipment model and the same operating system as the unique identifier of the terminal equipment belonging to the same user.
4. The method according to claim 1, wherein the preset condition includes that the same cloud disk account information is used for logging in the same cloud disk;
the determining, as the unique identifier of the terminal device belonging to the same user, the unique identifier corresponding to the recorded data and the device attribute information that satisfy the preset condition includes:
and determining the unique identifier corresponding to the recorded data which satisfies that the same cloud disk is logged in by the same cloud disk and the same cloud disk account information in the recorded data as the unique identifier of the terminal equipment belonging to the same user.
5. A data processing apparatus, comprising:
the receiving and sending module is used for receiving an account request sent by at least one terminal device;
the identification distribution module is used for distributing a unique identification to the at least one terminal device according to the account request;
the transceiver module is further configured to return the unique identifier to the at least one terminal device;
the collection module is used for collecting data reported by the at least one terminal device, wherein the data comprises a unique identifier pre-allocated to the terminal device, record data generated by a user operating the terminal device and device attribute information of the terminal device; the device attribute information includes: equipment model, international identity code of mobile equipment and operating system information; recording data reported by the at least one terminal device by using a pre-allocated unique identifier; classifying and sorting the data according to the application of the recorded data generated in the terminal equipment and/or the equipment attribute information, and filtering error or incomplete data;
the data analysis module is used for displaying a page with preset conditions in a webpage mode and receiving at least one preset condition selected from the page with the preset conditions; analyzing the recorded data and the equipment attribute information according to the at least one preset condition through a pre-established data association model and a preset data association algorithm, determining the unique identification corresponding to the recorded data and the equipment attribute information meeting the preset condition as terminal equipment belonging to the same user, and acquiring the unique identification of the terminal equipment belonging to the same user;
the behavior matching module is used for matching the recorded data and the equipment attribute information reported by the terminal equipment belonging to the same user with a pre-established user behavior model, determining the matched user behavior model and pushing data according to the user behavior model;
the verification module is used for pre-registering a plurality of terminal devices belonging to a first user, analyzing data reported by the plurality of terminal devices of the first user according to a preset data association algorithm, and verifying whether the determined unique identifier of the terminal device belonging to the first user is consistent with pre-registered information;
and the maintenance module is used for training and adjusting the preset data association algorithm according to the verification result, and updating and maintaining the unique identifier of the terminal equipment belonging to the same user.
6. The device of claim 5, wherein the preset conditions include that resident locations are the same or similar, and the same account information is used for logging in the same application or webpage;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, a unique identification corresponding to the recorded data that satisfies that resident positions in the recorded data are the same or similar and that logs in the same application or web page using the same account information.
7. The apparatus of claim 5, wherein the preset conditions include that the user has the same gender and similar historical records of browsers, and that the model of the terminal device is the same as the operating system;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, the recorded data that satisfies the condition that the user has the same gender and the historical records of the browser are similar, and the unique identification corresponding to the attribute information of the terminal device having the same terminal device model and operating system.
8. The apparatus according to claim 5, wherein the preset condition includes that the same cloud disk account information is used for logging in the same cloud disk;
the identification determining unit is specifically configured to determine, as the unique identification of the terminal device belonging to the same user, a unique identification corresponding to the recording data that satisfies that the same cloud disk is logged in by using the same cloud disk account information in the recording data.
9. A server, comprising: a communication device, a processor, and a memory; the communication device is used for carrying out data interaction with at least one terminal device; the processor is used for executing the method of any one of claims 1-4; the memory is to store processor-executable instructions.
10. A computer-readable storage medium having stored thereon machine-readable instructions for causing at least one processor to perform the method of any one of claims 1-4.
CN201510230290.3A 2015-05-07 2015-05-07 Data processing method and device and server Active CN106202117B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510230290.3A CN106202117B (en) 2015-05-07 2015-05-07 Data processing method and device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510230290.3A CN106202117B (en) 2015-05-07 2015-05-07 Data processing method and device and server

Publications (2)

Publication Number Publication Date
CN106202117A CN106202117A (en) 2016-12-07
CN106202117B true CN106202117B (en) 2020-07-03

Family

ID=57459620

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510230290.3A Active CN106202117B (en) 2015-05-07 2015-05-07 Data processing method and device and server

Country Status (1)

Country Link
CN (1) CN106202117B (en)

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106960143B (en) * 2017-03-23 2020-03-17 网易(杭州)网络有限公司 User account identification method and device, storage medium and electronic equipment
CN109426826B (en) * 2017-08-22 2020-12-29 中国电信股份有限公司 User behavior analysis method and device
CN109587197B (en) * 2017-09-29 2022-02-22 北京京东尚科信息技术有限公司 Method, device and system for associating reported data
CN108287662A (en) * 2017-12-08 2018-07-17 五八有限公司 Obtain method, equipment and the computer readable storage medium of equipment unique mark
CN110020166B (en) * 2017-12-21 2023-02-10 腾讯科技(深圳)有限公司 Data analysis method and related equipment
CN108363655B (en) * 2018-02-11 2021-07-13 百度在线网络技术(北京)有限公司 User behavior characteristic analysis method and device
CN110213202B (en) * 2018-03-01 2021-10-15 腾讯科技(深圳)有限公司 Identification encryption matching method and device, and identification processing method and device
CN108965114B (en) * 2018-09-10 2021-08-10 每日互动股份有限公司 Method for pushing information to terminal equipment
CN109257764B (en) * 2018-10-24 2022-02-22 北京小米移动软件有限公司 User portrait data processing method and device
CN109543105B (en) * 2018-11-15 2021-09-07 万惠投资管理有限公司 Content recommendation method and system for mobile device application
CN110866241A (en) * 2019-10-08 2020-03-06 北京百度网讯科技有限公司 Evaluation model generation and equipment association method, device and storage medium
CN111241502B (en) * 2020-01-14 2023-01-17 恩亿科(北京)数据科技有限公司 Cross-device user identification method and device, electronic device and storage medium
CN111368184B (en) * 2020-02-24 2024-03-22 百度在线网络技术(北京)有限公司 Screen protection delivery method and device for intelligent voice equipment and storage medium
CN111708948B (en) * 2020-06-19 2024-04-16 北京达佳互联信息技术有限公司 Content item recommendation method, device, server and computer readable storage medium
CN111859136B (en) * 2020-07-23 2024-03-15 深圳前海微众银行股份有限公司 Personalized recommendation method, device, equipment and readable storage medium
CN111930995B (en) * 2020-08-18 2023-12-22 湖南快乐阳光互动娱乐传媒有限公司 Data processing method and device
CN112286712B (en) * 2020-12-25 2021-04-02 成都数联铭品科技有限公司 Fault-tolerant repair method based on unique ID
CN113453216B (en) * 2021-06-16 2023-09-05 中国联合网络通信集团有限公司 Method and device for determining user terminal equipment
CN113515515A (en) * 2021-07-30 2021-10-19 广东电网有限责任公司 Customer data and power grid equipment data fusion method, device, equipment and medium
CN116662639B (en) * 2022-11-14 2023-11-28 荣耀终端有限公司 Content pushing method, electronic equipment and communication system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103533530A (en) * 2013-09-26 2014-01-22 林毅 Cross-device user corresponding and user tracking methods and systems
CN103793420A (en) * 2012-10-31 2014-05-14 国际商业机器公司 Cross-site data analysis method and system
CN103873698A (en) * 2014-03-31 2014-06-18 宇龙计算机通信科技(深圳)有限公司 Terminal association device and terminal association method
CN104104660A (en) * 2013-04-07 2014-10-15 中国移动通信集团浙江有限公司 Method of acquiring user data and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103793420A (en) * 2012-10-31 2014-05-14 国际商业机器公司 Cross-site data analysis method and system
CN104104660A (en) * 2013-04-07 2014-10-15 中国移动通信集团浙江有限公司 Method of acquiring user data and system
CN103533530A (en) * 2013-09-26 2014-01-22 林毅 Cross-device user corresponding and user tracking methods and systems
CN103873698A (en) * 2014-03-31 2014-06-18 宇龙计算机通信科技(深圳)有限公司 Terminal association device and terminal association method

Also Published As

Publication number Publication date
CN106202117A (en) 2016-12-07

Similar Documents

Publication Publication Date Title
CN106202117B (en) Data processing method and device and server
CN108696399B (en) Business service test method and device
CN110609937A (en) Crawler identification method and device
CN107872534B (en) Information pushing method and device, server and readable storage medium
CN104917643A (en) Abnormal account detection method and device
CN104541245A (en) Mobile application management
KR101249059B1 (en) Information exchange apparatus, method and managing system applied thereto
US10057302B2 (en) Context-based selection of instruction sets for connecting through captive portals
CN107786601B (en) Information processing method, terminal and server
CN107995285B (en) Information display method, device, server and readable storage medium
US11240777B2 (en) Device positioning method and apparatus
CN107404481A (en) User profile recognition methods and device
CN105553770B (en) Data acquisition control method and device
EP2577584A1 (en) Method of identifying remote users of websites
CN102317935A (en) Context-based communication service
CN106067879B (en) The detection method and device of information
CN112507211A (en) Message pushing method and device, storage medium and electronic device
CN113412607A (en) Content pushing method and device, mobile terminal and storage medium
CN110807050B (en) Performance analysis method, device, computer equipment and storage medium
CN103856568A (en) Terminal and system for prompting safety state of user terminal and implementation method
WO2010131259A2 (en) A monitoring system for capturing information relating to the wireless smart device for behavioral analytics
CN103347029A (en) Method, terminal, server and system for service data interaction
CN110599278B (en) Method, apparatus, and computer storage medium for aggregating device identifiers
CN103634348A (en) Terminal device and method for releasing information
CN106354365A (en) Interface selection method and mobile device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant