CN115225609A - User data processing method and device and server - Google Patents

Info

Publication number
CN115225609A
Authority
CN
China
Prior art keywords
user
address
data
address data
family
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110424109.8A
Other languages
Chinese (zh)
Inventor
宁坤
胡忠磊
孙恒
赵婷婷
严杰
倪成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Daikin China Investment Co ltd
Original Assignee
Daikin China Investment Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Daikin China Investment Co ltd
Priority to CN202110424109.8A
Publication of CN115225609A

Abstract

The embodiments of the invention provide a user data processing method, a user data processing device, and a server. By combining normalized addresses with user unique identifiers, a large amount of user-associated data with unified addresses can be obtained. The family relationship of users can be determined simply and accurately based on the normalized addresses and identified with a family ID, yielding a large amount of user data associated with the family ID, so that efficient, systematic applications can be built on the user data and both data processing efficiency and user experience are improved.

Description

User data processing method and device and server
Technical Field
The present invention relates to the field of data processing, and in particular, to a method and an apparatus for processing user data, and a server.
Background
With economic development and the spread of big data applications, the processing of user data has become an increasingly active area of research.
Existing user data processing is mainly based on collecting and integrating data of individual users to obtain an individual information data table, and applications for individual users are built on that table. However, data processing and applications based on individual users are limited, and the resulting services are poorly targeted and inefficient.
In recent years, techniques for determining the family relationship of users through data processing have appeared.
For example, the relationship between users is determined from the wireless network addresses the users have in common; as another example, the family relationship of a user is determined according to the residence time of the user terminal at each base station and the call information; as yet another example, the family relationship of users is determined based on the frequency of association between the users and an association index.
It should be noted that the above description of the background art is provided for the sake of clarity and complete description of the technical solutions of the present invention, and for the understanding of those skilled in the art. Such solutions are not considered to be known to the person skilled in the art merely because they have been set forth in the background section of the invention.
Disclosure of Invention
However, the inventors found that, in the above prior art, the wireless network addresses of users, the residence time at each base station, and the call information must be obtained through special channels. Both the data volume and the data sources are limited, so a large amount of user data cannot be obtained conveniently. In addition, information such as the association frequency and the association index must be calculated from the users' calls, short messages, and similar information, so a large amount of user data cannot be obtained and the calculation process is complex.
The inventors also found that, although a large amount of user address information is available, the basic data of conventional user systems contains only isolated address information, and that information suffers from inconsistent entry formats and inaccurate input. As a result, the address information is redundant and disordered and cannot be used effectively.
To solve at least one of the above problems, embodiments of the present invention provide a user data processing method, a user data processing apparatus, and a server. The address data of a user is normalized, the resulting normalized address data is associated with the user unique identifier, and a first user data table containing the associated user address data is generated, so that a large amount of user-associated data with unified addresses can be obtained by combining the normalized address with the user unique identifier. A family ID is then determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. In this way, the family relationship of users can be determined simply and accurately based on the normalized address and identified with the family ID, yielding a large amount of user data associated with the family ID, so that efficient, systematic applications can be built on the user data and both data processing efficiency and user experience are improved.
According to a first aspect of the embodiments of the present invention, there is provided a user data processing method, including: acquiring an identifier of a user; acquiring address data related to the identifier of the user; normalizing the address data to obtain normalized address data; generating a user unique identifier according to the identifier of the user; associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data; determining that different user unique identifiers corresponding to the same normalized address in the first user data table belong to the same family ID; and integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
According to a second aspect of the embodiments of the present invention, there is provided a user data processing apparatus, including: a first acquisition unit, configured to acquire an identifier of a user; a second acquisition unit, configured to acquire address data related to the identifier of the user; a normalization unit, configured to normalize the address data to obtain normalized address data; a first generating unit, configured to generate a user unique identifier according to the identifier of the user; a second generating unit, configured to associate the normalized address data with the user unique identifier and generate a first user data table containing associated user address data; a first determining unit, configured to determine that different user unique identifiers corresponding to the same normalized address in the first user data table belong to the same family ID; and a third generating unit, configured to integrate the data belonging to the same family ID in the first user data table and generate a second user data table.
According to a third aspect of embodiments of the present invention, there is provided a server including the user data processing apparatus according to the second aspect of embodiments of the present invention.
One of the beneficial effects of the embodiments of the invention is as follows: the address data of a user is normalized, the resulting normalized address data is associated with the user unique identifier, and a first user data table containing the associated user address data is generated, so that a large amount of user-associated data with unified addresses can be obtained by combining the normalized address with the user unique identifier. A family ID is then determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. In this way, the family relationship of users can be determined simply and accurately based on the normalized address and identified with the family ID, yielding a large amount of user data associated with the family ID, so that efficient, systematic applications can be built on the user data and both data processing efficiency and user experience are improved.
Specific embodiments of the present invention are disclosed in detail with reference to the following description and drawings, indicating the manner in which the principles of the invention may be employed. It should be understood that the embodiments of the invention are not so limited in scope. The embodiments of the invention include many variations, modifications and equivalents within the spirit and scope of the appended claims.
The feature information described and illustrated with respect to one embodiment may be used in the same or similar manner in one or more other embodiments, in combination with or instead of the feature information in the other embodiments.
It should be emphasized that the term "comprises/comprising" when used herein, is taken to specify the presence of stated features, integers, steps or components but does not preclude the presence or addition of one or more other features, integers, steps or components.
Drawings
Many aspects of the invention can be better understood with reference to the following drawings. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention. For convenience in illustrating and describing some parts of the present invention, corresponding parts may be enlarged or reduced in the drawings. Elements and feature information described in one figure or embodiment of the invention may be combined with elements and feature information shown in one or more other figures or embodiments. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views, and may be used to designate corresponding parts for use in more than one embodiment.
In the drawings:
Fig. 1 is a flowchart of a user data processing method according to embodiment 1 of the present invention;
Fig. 2 is a schematic diagram of the address data obtained in step 102 according to embodiment 1 of the present invention;
Fig. 3 is a flowchart of a method of normalizing address data according to embodiment 1 of the present invention;
Fig. 4 is a schematic diagram of structured address information according to embodiment 1 of the present invention;
Fig. 5 is a schematic diagram of a user unique identifier according to embodiment 1 of the present invention;
Fig. 6 is a schematic diagram of a part of a first user data table according to embodiment 1 of the present invention;
Fig. 7 is a schematic diagram, according to embodiment 1 of the present invention, in which data associated with the same normalized address is determined to belong to the same family ID;
Fig. 8 is a schematic diagram of a part of a second user data table according to embodiment 1 of the present invention;
Fig. 9 is a block diagram of a user data processing apparatus according to embodiment 2 of the present invention;
Fig. 10 is a block diagram of a normalization unit 903 according to embodiment 2 of the present invention;
Fig. 11 is a schematic diagram of a server according to embodiment 3 of the present invention;
Fig. 12 is a schematic block diagram of a system configuration of a server according to embodiment 3 of the present invention.
Detailed Description
Preferred embodiments of the present invention will be described below with reference to the accompanying drawings.
Example 1
The embodiment 1 of the invention provides a user data processing method. Fig. 1 is a flowchart of a user data processing method according to embodiment 1 of the present invention. As shown in fig. 1, the method includes:
step 101: acquiring an identifier of a user;
step 102: acquiring address data related to the identifier of the user;
step 103: normalizing the address data to obtain normalized address data;
step 104: generating a user unique identifier according to the identifier of the user;
step 105: associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data;
step 106: determining a family ID according to an address ID identifying the normalized address data and the user unique identifier in the first user data table; and
step 107: integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
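In this way, the address data of the user is normalized, the resulting normalized address data is associated with the user unique identifier, and a first user data table containing the associated user address data is generated, so that a large amount of user-associated data with unified addresses can be obtained by combining the normalized address with the user unique identifier. A family ID is then determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate a second user data table. The family relationship of users can thus be determined simply and accurately based on the normalized address and identified with the family ID, yielding a large amount of user data associated with the family ID, so that efficient, systematic applications can be built on the user data and both data processing efficiency and user experience are improved.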
In the embodiment of the present invention, the identifier of the user may be any of various identifiers capable of distinguishing different users. For example, the identifier of the user includes at least one of a mobile phone or telephone number of the user, a member ID, cookie data, a website access identifier of the user, and a social software identifier.
In the embodiment of the invention, the social software identifier may be the identifier of any of various social software applications.
For example, the social software identifier is a WeChat identifier, such as a WeChat ID, and may include at least one of an openid and a unionid.
As another example, the social software identifier is a QQ identifier, such as a QQ number.
In step 101, the identifier of the user is collected; it may be obtained in real time or from historical records.
For example, the user's identification may be obtained from the underlying database of the respective platform or system as well as a historical database. The platform or system may be various types of platforms or systems, such as a commercial platform or system, a utility platform or system, and so forth.
In the embodiment of the invention, user data may be collected for all users in the database, or for a subset of users selected according to actual needs.
In the embodiment of the present invention, data of a limited number of users is used as an example for convenience of explanation.
In step 102, address data associated with the identifier of the user is obtained, for example, by user matching. The matching may use one of the user's identifiers, for example, the user's mobile phone or telephone number.
In the embodiment of the present invention, the address data may be obtained from the database of each platform or system, and the address data may be historical data, that is, stock data, or data obtained in real time, that is, incremental data.
Since address data is typically present in the information users leave behind, that is, in retained records, a large amount of it is available and it is easy to obtain. However, the address data obtained in step 102 may be non-standard, redundant, and disordered.
Fig. 2 is a schematic diagram of the address data obtained in step 102 in embodiment 1 of the present invention. As shown in fig. 2, matching is performed in the address database according to the telephone number of the user, and an address related to the telephone number of the user, which may also be referred to as a source address, and a source of the data are obtained.
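As an illustration of the matching in step 102, the following minimal sketch indexes address records by telephone number and looks up the source addresses for one user. The record layout, the field names (`phone`, `address`, `source`), and the sample values are assumptions made for this example; the source does not specify how the address database is organized.

```python
from collections import defaultdict

# Hypothetical address records; field names and values are illustrative only.
ADDRESS_DB = [
    {"phone": "13800000001", "address": "No. 1 Example Road, Room 101", "source": "installation"},
    {"phone": "13800000001", "address": "1 Example Rd., Rm 101", "source": "membership"},
    {"phone": "13800000002", "address": "No. 8 Sample Street", "source": "maintenance"},
]


def index_by_phone(records):
    """Group (source address, data source) pairs under each telephone number."""
    index = defaultdict(list)
    for rec in records:
        index[rec["phone"]].append((rec["address"], rec["source"]))
    return index


def source_addresses(index, phone):
    """Return every source address matched to the given telephone number."""
    return index.get(phone, [])
```

A user can have several source addresses, possibly written differently, which is why normalization follows in step 103.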
In step 103, normalization processing is performed on the address data to obtain normalized address data. The following describes a method of address data normalization processing in detail.
Fig. 3 is a flowchart of a method for normalizing address data according to embodiment 1 of the present invention. As shown in fig. 3, the method includes:
step 301: inputting the address data into an address standardization model to obtain structured address information; and
step 302: performing similarity matching calculation on the structured address information, and merging addresses based on the calculated similarity to obtain merged address data.
Therefore, the structured address is obtained through the address standardization model, the merged address data is obtained through similarity matching, the address data can be effectively integrated, and the utilization efficiency of the address data is improved.
In step 301, the address data, i.e. the original address, is input into an address standardization model to obtain structured address information. Fig. 4 is a schematic diagram of the structured address information according to embodiment 1 of the present invention. As shown in Fig. 4, the original address is a complete detailed address, and the structured address is that address segmented into 15 levels.
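To make the idea of a structured address concrete, the sketch below splits a Chinese-style address string into a few levels with a regular expression. It is only a rule-based stand-in, not the address standardization model of the source, and the six level names used here are assumptions; the source states that there are 15 levels but does not enumerate them. The sample addresses are made up.

```python
import re

# Hypothetical level names; a real model would cover all 15 levels.
_LEVELS = re.compile(
    r"(?P<province>[^省]+省)?"
    r"(?P<city>[^市]+市)?"
    r"(?P<district>[^区县]+[区县])?"
    r"(?P<road>[^路街]+[路街])?"
    r"(?P<number>\d+号)?"
    r"(?P<detail>.*)",
    flags=re.DOTALL,
)


def structure_address(raw):
    """Split a raw address into named levels; missing levels become ''."""
    match = _LEVELS.fullmatch(raw.strip())
    return {level: (value or "") for level, value in match.groupdict().items()}
```

Because every level is optional and the last group accepts any remainder, the pattern always matches; text that fits no level ends up in `detail`.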
In step 302, similarity matching calculation is performed on the structured address information, and addresses are merged based on the calculated similarity to obtain merged address data. For example, address data that is written differently but actually refers to the same address is merged into a single address. Various existing similarity matching methods may be used.
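The merging in step 302 can be sketched as follows. Since the source leaves the similarity method open, this example assumes a simple character-level ratio from Python's standard `difflib` module and a greedy grouping against the first member of each group; the threshold value and the function names are illustrative assumptions.

```python
from difflib import SequenceMatcher


def canonical(text):
    """Lower-case the text and drop punctuation and spaces before comparing."""
    return "".join(ch for ch in text.lower() if ch.isalnum())


def similarity(a, b):
    """Character-level similarity in [0, 1] between two address strings."""
    return SequenceMatcher(None, canonical(a), canonical(b)).ratio()


def merge_addresses(addresses, threshold=0.85):
    """Greedily group addresses whose similarity to a group's first member
    reaches the threshold; each group stands for one merged address."""
    groups = []
    for addr in addresses:
        for group in groups:
            if similarity(addr, group[0]) >= threshold:
                group.append(addr)
                break
        else:
            groups.append([addr])
    return groups
```

In practice the comparison would run on the structured levels rather than on raw strings, and the threshold would need tuning on real data.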
As shown in fig. 3, for example, the method may further include:
step 303: performing over-normalization correction on the merged address data to obtain the normalized address data.
Thus, the address can be prevented from being excessively normalized, and the accuracy of the normalized address can be further improved.
In the embodiment of the present invention, for example, the normalized address data is obtained by performing matching processing on the merged address data using a Geographic Information System (GIS) algorithm.
For example, according to the GIS algorithm, two or more addresses that would be merged into the same address but that, according to the geographic information, should not be the same address are not treated as one normalized address.
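One way to picture the over-normalization correction is a distance check on geocoded coordinates. The sketch below assumes that each address already has a latitude/longitude pair (geocoding itself is not shown) and splits a merged group whenever a member lies too far from the group's first member. The haversine formula, the 50 m threshold, and all names are assumptions for illustration; the source only says that a GIS algorithm is used.

```python
import math

EARTH_RADIUS_M = 6371000.0


def haversine_m(p, q):
    """Great-circle distance in metres between two (lat, lon) pairs in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    a = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def split_over_merged(group, coords, max_distance_m=50.0):
    """Split one merged group so that members geographically far from a
    sub-group's first member are not treated as the same address."""
    subgroups = []
    for addr in group:
        for sub in subgroups:
            if haversine_m(coords[addr], coords[sub[0]]) <= max_distance_m:
                sub.append(addr)
                break
        else:
            subgroups.append([addr])
    return subgroups
```

Each resulting sub-group would then be treated as one normalized address.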
In the embodiment of the present invention, one address ID may be used to identify the normalized address, that is, to identify address data representing the same address in the normalized address data. Therefore, the address data can be conveniently normalized and identified.
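A minimal sketch of the address ID assignment, assuming the normalized addresses are available as groups of source addresses that denote the same place. The ID format (`addr_0001`) and the function name are hypothetical.

```python
def assign_address_ids(groups, prefix="addr_"):
    """Give every source address in the same group the same address ID."""
    address_ids = {}
    for number, group in enumerate(groups, start=1):
        for addr in group:
            address_ids[addr] = f"{prefix}{number:04d}"
    return address_ids
```

Looking up any spelling of an address then yields the single ID of its normalized address.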
Fig. 4 is a schematic diagram of a normalized address in embodiment 1 of the present invention. As shown in fig. 4, the same address ID indicates address data that is actually the same address.
In the embodiment of the present invention, the address standardization model used may be a pre-established model or an online learning model.
For example, the address standardization model is established using an address segmentation processing algorithm according to collected address data, Location Based Service (LBS) address information, external database information, and national address standard information.
In the embodiment of the present invention, the address segmentation algorithm used is, for example, a Natural Language Processing (NLP) algorithm, and by using this algorithm, the address segmentation can be performed quickly and accurately.
In step 104, a user unique identifier is generated from the identifier of the user obtained in step 101. In the embodiment of the present invention, the execution order of step 104 relative to steps 102 to 103 is not limited; that is, steps 102 to 103 and step 104 may be executed sequentially or in parallel.
In the embodiment of the invention, a plurality of identifiers of the same user can be integrated by generating a single identifier (also called user OneID) of the user.
Fig. 5 is a schematic diagram of a user unique identifier according to embodiment 1 of the present invention. As shown in Fig. 5, identifiers of the user such as the telephone number, the WeChat openid, the WeChat unionid, and various member IDs are integrated under the user unique identifier, i.e., the user OneID.
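The integration of several identifiers under one OneID can be sketched with a union-find structure: identifiers that appear together in one record are linked, and each connected group receives one OneID. The source does not describe how the OneID is computed, so the linking rule, the record layout, and the `oneid_0001` format are assumptions.

```python
def build_one_ids(records):
    """Map each (kind, value) identifier to a OneID shared by all
    identifiers that are linked through co-occurrence in a record."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for rec in records:
        keys = [(kind, value) for kind, value in rec.items() if value]
        if not keys:
            continue
        first_root = find(keys[0])
        for key in keys[1:]:
            root = find(key)
            if root != first_root:
                parent[root] = first_root

    one_id_of_root, one_ids = {}, {}
    for key in list(parent):
        root = find(key)
        if root not in one_id_of_root:
            one_id_of_root[root] = f"oneid_{len(one_id_of_root) + 1:04d}"
        one_ids[key] = one_id_of_root[root]
    return one_ids
```

For example, a record holding a telephone number and a WeChat openid, and another holding the same openid and a member ID, would place all three identifiers under one OneID.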
In step 105, the normalized address data obtained in step 103 is associated with the user unique identifier obtained in step 104, and a first user data table containing associated user address data is generated.
That is, in the generated first user data table, normalized address data identified by an address ID is stored in association with the user OneID.
Fig. 6 is a schematic diagram of a part of the first user data table according to embodiment 1 of the present invention. As shown in fig. 6, the user OneID is used to integrate the user identification data associated with the user OneID and the normalized address data identified by the address ID, so as to obtain the first user data table.
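Under the assumption that each user is matched by telephone number, the first user data table can be sketched as a join of three pieces: the matched source addresses, the OneID of each telephone number, and the address ID of each source address. The column names and the input shapes are hypothetical.

```python
def build_first_table(user_addresses, one_id_by_phone, address_id_by_address):
    """Return rows that associate a OneID with an address ID.

    user_addresses: iterable of (phone, source_address) pairs.
    """
    rows = []
    for phone, source_address in user_addresses:
        rows.append({
            "one_id": one_id_by_phone[phone],
            "phone": phone,
            "address_id": address_id_by_address[source_address],
            "source_address": source_address,
        })
    return rows
```

Each row keeps the source address next to its address ID so that the original wording remains traceable.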
In step 106, a family ID is determined in the first user data table according to the address ID identifying the normalized address data and the user unique identifier.
For example, data corresponding to the same address ID is determined as belonging to the same family ID, and data corresponding to the same user unique identifier is determined as belonging to the same family ID.
That is, all data associated with the same normalized address, and all data associated with the same user OneID, is considered to be data belonging to the same family. This data may contain data of different user OneIDs, and the data belonging to the same family is identified by and associated with a family ID.
Fig. 7 is a schematic diagram of embodiment 1 of the present invention in which data associated with the same normalized address is determined to belong to the same family ID. As shown in Fig. 7, the addresses listed in the address field are determined in step 103 to belong to the same normalized address, that is, to have the same address ID, so the data corresponding to these addresses, for example, the mobile phone numbers and OneID data of the users, is determined to belong to the same family, that is, is assigned the same family ID.
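The two rules of step 106 can be read as a grouping over a graph in which every row links one user OneID to one address ID. The sketch below takes that reading and computes connected groups with union-find, so that rows sharing an address ID or a OneID receive the same family ID. Treating the rules transitively is an interpretation, and the `family_001` format and column names are assumptions.

```python
def assign_family_ids(rows):
    """Add a 'family_id' to each row; rows connected through a shared
    address ID or a shared OneID get the same family ID."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for row in rows:
        address_root = find(("address", row["address_id"]))
        user_root = find(("user", row["one_id"]))
        if address_root != user_root:
            parent[user_root] = address_root

    family_of_root, result = {}, []
    for row in rows:
        root = find(("user", row["one_id"]))
        if root not in family_of_root:
            family_of_root[root] = f"family_{len(family_of_root) + 1:03d}"
        result.append({**row, "family_id": family_of_root[root]})
    return result
```

With this reading, a user who appears at two addresses pulls both addresses into one family, which may or may not be desirable for a given data set.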
In step 107, data belonging to the same family ID in the first user data table is integrated to generate a second user data table.
That is to say, various data associated with the family ID are integrated through the family ID to obtain a second user data table with a large information amount, so that efficient systematic application can be performed based on the user data, and data processing efficiency and user experience are improved.
Fig. 8 is a schematic diagram of a part of the second user data table of embodiment 1 of the present invention. As shown in Fig. 8, data corresponding to the same normalized address, i.e., the same address ID (also referred to as the address normalization ID), is first determined to belong to the same family ID, and data corresponding to the same user OneID (or a telephone number corresponding to the user OneID) is then determined to belong to the same family ID, so that family_001 in Fig. 8 is obtained.
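The integration in step 107 can be sketched as a group-by on the family ID. The example assumes that the rows of the first user data table already carry a `family_id` column, and the output columns are hypothetical.

```python
def build_second_table(rows_with_family):
    """Collect, per family ID, the distinct OneIDs, phones and address IDs."""
    table = {}
    for row in rows_with_family:
        entry = table.setdefault(row["family_id"], {
            "family_id": row["family_id"],
            "one_ids": [],
            "phones": [],
            "address_ids": [],
        })
        for column, key in (("one_ids", "one_id"), ("phones", "phone"),
                            ("address_ids", "address_id")):
            if row[key] not in entry[column]:
                entry[column].append(row[key])
    return list(table.values())
```

Each entry of the result describes one family and lists every member and address found for it.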
In the embodiment of the present invention, after the second user data table is obtained, data supplementation and/or various applications may be performed based on the second user data table.
For example, as shown in fig. 1, the method may further include:
step 108: obtaining additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, maintenance, and complaints; and
step 109: adding the additional information to the second user data table in association with the family ID.
Therefore, additional information of at least one of consumption, interaction, installation, maintenance and complaint is added into the second user data table, so that the second user data table contains more available user data, more user applications can be carried out based on the expanded second user data table, and the effectiveness and the accuracy of user services are improved.
In the embodiment of the present invention, step 108 and step 109 may obtain new additional information in real time or periodically and add it to the second user data table, so that the second user data table is kept up to date.
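Steps 108 and 109 can be sketched as appending events to the matching family entry. The event layout (`family_id`, `kind`, `detail`) and the `additional` column are assumptions for this example; the five kinds mirror the categories named in step 108.

```python
ALLOWED_KINDS = {"consumption", "interaction", "installation", "maintenance", "complaint"}


def add_additional_info(second_table, events):
    """Attach each event to the entry with the same family ID; events with an
    unknown family ID or kind are skipped. The table is updated in place."""
    by_family = {entry["family_id"]: entry for entry in second_table}
    for event in events:
        entry = by_family.get(event["family_id"])
        if entry is None or event["kind"] not in ALLOWED_KINDS:
            continue
        entry.setdefault("additional", {}).setdefault(event["kind"], []).append(event["detail"])
    return second_table
```

Calling the function again with newly arrived events is one simple way to keep the table updated in real time or on a schedule.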
For example, as shown in fig. 1, the method may further include:
step 110: providing services to the family user corresponding to the family ID and/or to the individual users belonging to that family user by using the information associated with the family ID in the second user data table.
In this way, the information associated with the family ID in the second user data table makes it possible to provide accurate services to the family user corresponding to the family ID and/or to the individual users belonging to that family user.
In an embodiment of the present invention, steps 108-110 are optional steps, represented by the dashed box in FIG. 1.
In the embodiment of the present invention, services may be provided to a family as a whole or to each user in the family. When services are provided to each user in the family, the same service may be provided based on the family relationship of the users, or a service consisting of a common part and a personalized part may be provided, which further enhances the user experience.
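A service made of a common part and a personalized part might be assembled as in the sketch below. The family entry layout, the message fields, and the function name are hypothetical, and the content of the messages is left to the caller.

```python
def build_service_messages(family_entry, common_part, personalized_parts):
    """Build one message per family member: every member receives the common
    part, plus a personalized part when one is available for that OneID."""
    messages = {}
    for one_id in family_entry["one_ids"]:
        messages[one_id] = {
            "family_id": family_entry["family_id"],
            "common": common_part,
            "personalized": personalized_parts.get(one_id, ""),
        }
    return messages
```

Passing an empty mapping for `personalized_parts` corresponds to providing the same service to every member of the family.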
In the embodiment of the present invention, the provided service may be various types of services, for example, generating and/or pushing at least one of user survey information, commodity information, offer information, commercial activity information, and after-sales information.
As can be seen from the above embodiments, address data of a user is normalized, the obtained normalized address data is associated with a user unique identifier, and a first user data table containing associated user address data is generated, so that a large amount of data associated with the user and having uniform addresses can be obtained by combining the normalized address with the user unique identifier; and determining the family ID according to the address ID of the normalized address and the user single identification, integrating data belonging to the same family ID in the first user data table, and generating the second user data table, so that the family relation of the user can be simply and accurately determined based on the normalized address, the family ID is used for identification, a large amount of user data associated with the family ID is obtained, efficient systematic application can be performed based on the user data, and the data processing efficiency and the user experience are improved.
Example 2
Embodiment 2 of the present invention provides a user data processing apparatus corresponding to the user data processing method described in embodiment 1, and the specific implementation thereof may refer to the implementation of the method described in embodiment 1, and the same or related contents are not described again.
In the embodiment of the present invention, the user data processing apparatus may be a single device, or may be wholly or partially disposed in the server. For example, the server may be a cloud server.
Fig. 9 is a block diagram of a user data processing apparatus according to embodiment 2 of the present invention. As shown in fig. 9, the user data processing apparatus 900 includes:
a first acquisition unit 901, configured to acquire an identifier of a user;
a second acquisition unit 902, configured to acquire address data related to the identifier of the user;
a normalization unit 903, configured to normalize the address data to obtain normalized address data;
a first generating unit 904, configured to generate a user unique identifier according to the identifier of the user;
a second generating unit 905, configured to associate the normalized address data with the user unique identifier and generate a first user data table containing associated user address data;
a first determining unit 906, configured to determine a family ID according to an address ID identifying the normalized address data and the user unique identifier in the first user data table; and
a third generating unit 907, configured to integrate the data belonging to the same family ID in the first user data table and generate a second user data table.
Fig. 10 is a block diagram of the normalization unit 903 according to embodiment 2 of the present invention. As shown in fig. 10, the normalization unit 903 includes:
a structuring unit 1001 configured to input the address data into an address standardization model to obtain structured address information; and
a first matching unit 1002, configured to perform similarity matching calculation on the structured address information, and perform address merging based on the calculated similarity, so as to obtain merged address data.
For example, as shown in fig. 10, the normalization unit 903 may further include:
a correcting unit 1003, configured to perform over-normalization correction on the merged address data to obtain the normalized address data.
In this embodiment of the present invention, the correcting unit 1003 may perform matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
For example, the first determining unit 906 may include: a second determining unit, configured to determine data corresponding to the same address ID as belonging to the same family ID; and a third determining unit, configured to determine data corresponding to the same user unique identifier as belonging to the same family ID.
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a modeling unit 908, configured to build the address standardization model using an address segmentation processing algorithm according to collected address data, Location Based Service (LBS) address information, external database information, and national address standard information.
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a third acquisition unit 909, configured to acquire additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, maintenance, and complaints; and
an adding unit 910, configured to add the additional information to the second user data table in association with the family ID.
For example, as shown in fig. 9, the user data processing apparatus 900 may further include:
a service unit 911, configured to provide a service to the family user corresponding to the family ID and/or to the individual users belonging to that family user by using the information associated with the family ID in the second user data table.
In the embodiment of the present invention, the functions of the above units can be realized by referring to the specific contents of the corresponding steps in embodiment 1, and a description thereof is not repeated here.
It can be known from the above embodiment that address data of a user is normalized, the obtained normalized address data is associated with a user single identifier, and a first user data table containing associated user address data is generated, so that a large amount of data associated with the user and having uniform addresses can be obtained by combining the normalized address with the user single identifier; and determining the family ID according to the address ID of the normalized address and the user single identification, integrating data belonging to the same family ID in the first user data table, and generating the second user data table, so that the family relation of the user can be simply and accurately determined based on the normalized address, the family ID is used for identification, a large amount of user data associated with the family ID is obtained, efficient systematic application can be performed based on the user data, and the data processing efficiency and the user experience are improved.
Example 3
An embodiment of the present invention further provides a server, and fig. 11 is a schematic diagram of a server according to embodiment 3 of the present invention. As shown in fig. 11, the server 1100 includes the user data processing apparatus 900, and the structure and function of the user data processing apparatus 900 are the same as those described in embodiment 2, and are not described again here.
In an embodiment of the present invention, the server 1100 may be various types of servers, for example, a cloud server.
In an embodiment of the present invention, the functions of the user data processing apparatus 900 may be performed by the server 1100.
Fig. 12 is a schematic block diagram of a system configuration of a server according to embodiment 3 of the present invention. As shown in fig. 12, the server 1200 may include a processor 1201 and a memory 1202; the memory 1202 is coupled to the processor 1201. The figure is exemplary; other types of structures may also be used in addition to or in place of the structure to implement telecommunications or other functions.
As shown in fig. 12, the server 1200 may further include: an input unit 1203, a display 1204, a power supply 1205.
In one embodiment, the functions of the user data processing apparatus described in embodiment 2 may be integrated into the processor 1201. The processor 1201 may be configured to: acquire an identification of a user; acquire address data related to the identification of the user; normalize the address data to obtain normalized address data; generate a user unique identifier according to the identification of the user; associate the normalized address data with the user unique identifier to generate a first user data table containing associated user address data; determine, in the first user data table, a family ID according to an address ID for identifying the normalized address data and the user unique identifier; and integrate the data belonging to the same family ID in the first user data table to generate a second user data table.
For example, the normalizing the address data to obtain normalized address data includes: inputting the address data into an address standardization model to obtain structured address information; and performing similarity matching calculation on the structured address information, and performing address merging based on the calculated similarity to obtain merged address data.
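The similarity matching and merging step can be illustrated with a minimal sketch. The embodiment does not specify the similarity measure, so the sketch assumes a per-field string ratio from Python's standard `difflib` module, averaged over the fields. The field names (`city`, `road`, `building`, `room`), the threshold of 0.9, and the function names are hypothetical and chosen only for illustration.

```python
from difflib import SequenceMatcher

# Hypothetical fields of a structured address; the real model output may differ.
FIELDS = ("city", "road", "building", "room")


def similarity(a, b):
    """Average the per-field string similarity of two structured addresses."""
    scores = [SequenceMatcher(None, a[f], b[f]).ratio() for f in FIELDS]
    return sum(scores) / len(scores)


def merge_addresses(addresses, threshold=0.9):
    """Greedily merge each address into the first representative it resembles.

    Returns the list of representative addresses and, for every input
    address, the index of the representative it was merged into.
    """
    representatives = []
    assignment = []
    for addr in addresses:
        for idx, rep in enumerate(representatives):
            if similarity(addr, rep) >= threshold:
                assignment.append(idx)
                break
        else:
            # No representative is similar enough: start a new merged address.
            representatives.append(addr)
            assignment.append(len(representatives) - 1)
    return representatives, assignment
```

Under these assumptions, two records that differ only in "Nanjing Road" versus "Nanjing Rd" fall into the same merged address, while a record from a different city and building remains separate.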
For example, the normalizing the address data to obtain normalized address data further includes: performing over-normalization correction on the merged address data to obtain the normalized address data.
For example, the performing an over-normalization correction on the merged address data to obtain the normalized address data includes: and matching the merged address data by utilizing a Geographic Information System (GIS) algorithm to obtain the normalized address data.
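The embodiment does not disclose which GIS algorithm is used. As one possible reading, over-normalization means that distinct addresses were merged by mistake, and a geographic check can separate them again. The sketch below assumes that each record in a merged group already carries coordinates from some geocoding step, and it splits off records that lie too far from the group representative. The 50 m threshold and all function names are hypothetical.

```python
import math


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two points given in degrees."""
    radius_m = 6371000.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    # Clamp to 1.0 to guard against floating-point overshoot.
    return 2 * radius_m * math.asin(math.sqrt(min(1.0, a)))


def split_over_merged(group, max_distance_m=50.0):
    """Separate records that are too far from the group's first record.

    group: list of (raw_address, latitude, longitude); the first element is
    treated as the representative of the merged address.
    Returns (kept, split_off).
    """
    kept, split_off = [group[0]], []
    _, lat0, lon0 = group[0]
    for item in group[1:]:
        _, lat, lon = item
        if haversine_m(lat0, lon0, lat, lon) <= max_distance_m:
            kept.append(item)
        else:
            split_off.append(item)
    return kept, split_off
```

Records in `split_off` would then be treated as separate normalized addresses rather than as variants of the representative.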
For example, the processor 1201 may also be configured to: establish the address standardization model using an address segmentation processing algorithm according to collected address data, Location Based Service (LBS) address information, external database information, and national address standard information.
For example, the determining, in the first user data table, the family ID according to the address ID for identifying the normalized address data and the user unique identifier includes: determining data corresponding to the same address ID as belonging to the same family ID; and determining data corresponding to the same user unique identifier as belonging to the same family ID.
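The two rules above amount to grouping rows that are linked, directly or transitively, by a shared address ID or a shared user unique identifier. The embodiment does not name an algorithm for this. The sketch below uses a union-find (disjoint-set) structure as one common way to compute such groups. The row layout `(user_uid, address_id)` and the `F1`, `F2`, ... label format are assumptions made for illustration.

```python
def assign_family_ids(rows):
    """Assign a family ID to every row of the first user data table.

    rows: list of (user_uid, address_id) pairs.
    Rows sharing an address ID or a user unique identifier, directly or
    through a chain of other rows, receive the same family ID.
    """
    parent = list(range(len(rows)))

    def find(i):
        # Follow parent links to the root, shortening the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(i, j):
        ri, rj = find(i), find(j)
        if ri != rj:
            # Keep the smaller index as root so labels follow input order.
            parent[max(ri, rj)] = min(ri, rj)

    first_seen = {}
    for idx, (uid, addr) in enumerate(rows):
        for key in (("user", uid), ("addr", addr)):
            if key in first_seen:
                union(idx, first_seen[key])
            else:
                first_seen[key] = idx

    labels = {}
    family_ids = []
    for idx in range(len(rows)):
        root = find(idx)
        if root not in labels:
            labels[root] = f"F{len(labels) + 1}"
        family_ids.append(labels[root])
    return family_ids
```

For instance, if user U1 and user U2 both appear at address A1, and U2 also appears at address A2, all three rows receive the same family ID, while an unrelated user at a different address receives a new one.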
For example, the processor 1201 may also be configured to: obtain additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, repair, and complaints; and add the additional information to the second user data table in association with the family ID.
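The integration into the second user data table and the attachment of additional information can be sketched as follows. The table layout (a dictionary keyed by family ID), the row keys, and the function names are hypothetical. Only the categories of additional information are taken from the text.

```python
# Categories of additional information named in the embodiment.
ALLOWED_KINDS = {"consumption", "interaction", "installation", "repair", "complaint"}


def build_second_table(first_table):
    """Integrate first-table rows that share a family ID into one entry.

    first_table: list of dicts with keys "user_uid", "address_id", "family_id".
    """
    second = {}
    for row in first_table:
        entry = second.setdefault(
            row["family_id"],
            {"user_uids": set(), "address_ids": set(), "additional": {}},
        )
        entry["user_uids"].add(row["user_uid"])
        entry["address_ids"].add(row["address_id"])
    return second


def add_additional_info(second_table, family_id, kind, record):
    """Attach one record of additional information to a family entry."""
    if kind not in ALLOWED_KINDS:
        raise ValueError(f"unsupported kind of additional information: {kind}")
    # Raises KeyError if the family ID is not present in the table.
    second_table[family_id]["additional"].setdefault(kind, []).append(record)
```

A service step could then read a single family entry to see every user, address, and service record linked to that household.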
For example, the processor 1201 may also be configured to: provide services to the family user corresponding to the family ID and/or an individual user belonging to the family user by using the information associated with the family ID in the second user data table.
For example, the identification of the user includes at least one of a mobile phone number or telephone number of the user, a member ID, cookie data, a website access identifier of the user, and a social software identifier.
In another embodiment, the user data processing apparatus described in embodiment 2 may be configured separately from the processor 1201. For example, the user data processing apparatus may be configured as a chip connected to the processor 1201, and the functions of the user data processing apparatus are realized under the control of the processor 1201.
It is not necessary for the server 1200 to include all of the components shown in fig. 12 in this embodiment.
As shown in fig. 12, the processor 1201, also sometimes referred to as a controller or operational control, may include a microprocessor or other processor device and/or logic device, and the processor 1201 receives input and controls operation of the various components of the server 1200.
The memory 1202 may be, for example, one or more of a buffer, a flash memory, a hard drive, a removable media, a volatile memory, a non-volatile memory, or other suitable device. And the processor 1201 can execute the program stored in the memory 1202 to realize information storage or processing or the like. The functions of other parts are similar to the prior art and are not described in detail here. The components of the server 1200 may be implemented in dedicated hardware, firmware, software, or combinations thereof, without departing from the scope of the invention.
As can be seen from the above embodiment, the address data of a user is normalized, the resulting normalized address data is associated with the user unique identifier, and a first user data table containing the associated user address data is generated. Combining the normalized address with the user unique identifier thus yields a large amount of user-associated data with unified addresses. In addition, the family ID is determined according to the address ID of the normalized address and the user unique identifier, and the data belonging to the same family ID in the first user data table is integrated to generate the second user data table. The family relationship of the user can therefore be determined simply and accurately based on the normalized address and identified by the family ID, yielding a large amount of user data associated with the family ID. Efficient, systematic applications can then be built on this user data, improving data processing efficiency and user experience.
An embodiment of the present invention also provides a computer-readable program, where when the program is executed in a user data processing apparatus or a server, the program causes a computer to execute the user data processing method described in embodiment 1 in the user data processing apparatus or the server.
An embodiment of the present invention further provides a storage medium storing a computer-readable program, where the computer-readable program enables a computer to execute the user data processing method described in embodiment 1 in a user data processing apparatus or a server.
The user data processing method performed in the user data processing apparatus or server described in connection with the embodiments of the present invention may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. For example, one or more of the functional blocks and/or one or more combinations of the functional blocks shown in fig. 9 may correspond to individual software modules of a computer program flow, or may correspond to individual hardware modules. These software modules may correspond to the steps shown in fig. 1, respectively. These hardware modules may be realized, for example, by implementing the software modules in a Field Programmable Gate Array (FPGA).
A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. A storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium; or the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The software module may be stored in the memory of the mobile terminal or in a memory card that is insertable into the mobile terminal. For example, if the server employs a relatively large MEGA-SIM card or a large flash memory device, the software module may be stored in the MEGA-SIM card or the large flash memory device.
One or more of the functional blocks and/or one or more combinations of the functional blocks described with respect to fig. 9 may be implemented as a general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any suitable combination thereof designed to perform the functions described herein. One or more of the functional blocks and/or one or more combinations of the functional blocks described with respect to fig. 9 may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in communication with a DSP, or any other such configuration.
While the invention has been described with reference to specific embodiments, it will be apparent to those skilled in the art that these descriptions are illustrative and not intended to limit the scope of the invention. Various modifications and alterations of this invention will become apparent to those skilled in the art based upon the spirit and principles of this invention, and such modifications and alterations are also within the scope of this invention.

Claims (19)

1. A method for processing user data, the method comprising:
acquiring an identification of a user;
acquiring address data related to the identification of the user;
normalizing the address data to obtain normalized address data;
generating a user unique identifier according to the identification of the user;
associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data;
determining, in the first user data table, a family ID according to an address ID for identifying the normalized address data and the user unique identifier; and
integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
2. The method according to claim 1, wherein the normalizing the address data to obtain normalized address data includes:
inputting the address data into an address standardization model to obtain structured address information; and
performing similarity matching calculation on the structured address information, and performing address merging based on the calculated similarity to obtain merged address data.
3. The user data processing method according to claim 2, wherein the normalizing the address data to obtain normalized address data further comprises:
performing over-normalization correction on the merged address data to obtain the normalized address data.
4. The method according to claim 3, wherein the performing over-normalization correction on the merged address data to obtain the normalized address data includes:
matching the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
5. The method of claim 2, further comprising:
establishing the address standardization model using an address segmentation processing algorithm according to collected address data, Location Based Service (LBS) address information, external database information, and national address standard information.
6. The method according to claim 1, wherein determining, in the first user data table, the family ID according to the address ID for identifying the normalized address data and the user unique identifier comprises:
determining data corresponding to the same address ID as belonging to the same family ID; and
determining data corresponding to the same user unique identifier as belonging to the same family ID.
7. The method of claim 1, further comprising:
obtaining additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, repair, and complaints; and
adding the additional information to the second user data table in association with the family ID.
8. The user data processing method according to any of claims 1-7, characterized in that the method further comprises:
providing services to the family user corresponding to the family ID and/or an individual user belonging to the family user by using the information associated with the family ID in the second user data table.
9. The user data processing method according to any one of claims 1-7, wherein
the identification of the user comprises at least one of a mobile phone number or telephone number of the user, a member ID, cookie data, a website access identifier of the user, and a social software identifier.
10. A user data processing apparatus, characterized in that the apparatus comprises:
a first acquisition unit for acquiring an identification of a user;
a second acquisition unit for acquiring address data related to the identification of the user;
a normalization unit for normalizing the address data to obtain normalized address data;
a first generating unit for generating a user unique identifier according to the identification of the user;
a second generating unit for associating the normalized address data with the user unique identifier to generate a first user data table containing associated user address data;
a first determining unit for determining, in the first user data table, a family ID according to an address ID for identifying the normalized address data and the user unique identifier; and
a third generating unit for integrating the data belonging to the same family ID in the first user data table to generate a second user data table.
11. The apparatus according to claim 10, wherein the normalization unit comprises:
a structuring unit for inputting the address data into an address standardization model to obtain structured address information; and
a first matching unit for performing similarity matching calculation on the structured address information and performing address merging based on the calculated similarity to obtain merged address data.
12. The apparatus according to claim 11, wherein the normalization unit further comprises:
a correction unit for performing over-normalization correction on the merged address data to obtain the normalized address data.
13. The user data processing apparatus of claim 12, wherein
the correction unit performs matching processing on the merged address data by using a Geographic Information System (GIS) algorithm to obtain the normalized address data.
14. The apparatus according to claim 11, wherein the apparatus further comprises:
a modeling unit for building the address standardization model using an address segmentation processing algorithm according to collected address data, Location Based Service (LBS) address information, external database information, and national address standard information.
15. The apparatus according to claim 10, wherein the first determining unit comprises:
a second determining unit for determining data corresponding to the same address ID as belonging to the same family ID; and
a third determining unit for determining data corresponding to the same user unique identifier as belonging to the same family ID.
16. The apparatus as claimed in claim 10, further comprising:
a third acquisition unit for acquiring additional information related to the family ID, the additional information concerning at least one of consumption, interaction, installation, maintenance, and complaints; and
an adding unit for adding the additional information to the second user data table in association with the family ID.
17. The user data processing apparatus according to any of claims 10-16, characterized in that the apparatus further comprises:
a service unit for providing services to the family user corresponding to the family ID and/or an individual user belonging to the family user by using the information associated with the family ID in the second user data table.
18. The user data processing apparatus according to any of claims 10-16, wherein
the identification of the user comprises at least one of a mobile phone number or telephone number of the user, a member ID, cookie data, a website access identifier of the user, and a social software identifier.
19. A server, characterized in that the server comprises a user data processing device according to any of claims 10-18.
CN202110424109.8A 2021-04-20 2021-04-20 User data processing method and device and server Pending CN115225609A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110424109.8A CN115225609A (en) 2021-04-20 2021-04-20 User data processing method and device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110424109.8A CN115225609A (en) 2021-04-20 2021-04-20 User data processing method and device and server

Publications (1)

Publication Number Publication Date
CN115225609A true CN115225609A (en) 2022-10-21

Family

ID=83604605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110424109.8A Pending CN115225609A (en) 2021-04-20 2021-04-20 User data processing method and device and server

Country Status (1)

Country Link
CN (1) CN115225609A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101883129A (en) * 2009-05-07 2010-11-10 杨兵红 Household electronic equipment capable of realizing positioning, system and method
US20130246449A1 (en) * 2012-03-16 2013-09-19 Research In Motion Limited Methods and devices for identifying a relationship between contacts
KR20160000446A (en) * 2014-06-24 2016-01-04 경희대학교 산학협력단 System for identifying human relationships around users and coaching based on identified human relationships
US20170337292A1 (en) * 2015-02-13 2017-11-23 Alibaba Group Holding Limited Text address processing method and apparatus
CN111711534A (en) * 2020-05-27 2020-09-25 新浪网技术(中国)有限公司 Network service quality analysis method, device, system, equipment and storage medium
CN111917773A (en) * 2020-07-31 2020-11-10 中国工商银行股份有限公司 Service data processing method and device and server
CN112269805A (en) * 2020-11-18 2021-01-26 杭州米雅信息科技有限公司 Data processing method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN109474578B (en) Message checking method, device, computer equipment and storage medium
CN108427705B (en) Electronic device, distributed system log query method and storage medium
CN108415952B (en) User data storage method, label calculation method and calculation equipment
CN112613917A (en) Information pushing method, device and equipment based on user portrait and storage medium
CN104133765A (en) Test case sending method and test case server of network activities
CN108684044B (en) User behavior detection system, method and device
CN110807547A (en) Method and system for predicting family population structure
CN112328631A (en) Production fault analysis method and device, electronic equipment and storage medium
CN111914101B (en) File association relationship abnormality identification method and device and computer equipment
CN113434542A (en) Data relation identification method and device, electronic equipment and storage medium
CN110737820A (en) Method and apparatus for generating event information
CN111949845A (en) Method, apparatus, computer device and storage medium for processing mapping information
CN115225609A (en) User data processing method and device and server
CN111182465A (en) Method and device for determining terminal affiliation
CN113360889B (en) Rights management method and apparatus, server, and computer-readable storage medium
CN116094907A (en) Complaint information processing method, complaint information processing device and storage medium
CN113127207B (en) Crowd-sourced task resource allocation method and device, electronic equipment and storage medium
CN114791914A (en) User behavior statistical method, device, equipment and medium based on Bitmap
CN109885710B (en) User image depicting method based on differential evolution algorithm and server
CN115967906A (en) User resident position identification method, terminal, electronic device and storage medium
CN108763498B (en) User identity identification method and device, electronic equipment and readable storage medium
CN114265740A (en) Error information processing method, device, equipment and storage medium
CN114327377B (en) Method and device for generating demand tracking matrix, computer equipment and storage medium
CN111475505A (en) Data acquisition method and equipment
CN112347095B (en) Data table processing method, device and server

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination