WO2020257993A1 - Content pushing method and apparatus, server, and storage medium - Google Patents

Content pushing method and apparatus, server, and storage medium Download PDF

Info

Publication number
WO2020257993A1
WO2020257993A1 PCT/CN2019/092594 CN2019092594W WO2020257993A1 WO 2020257993 A1 WO2020257993 A1 WO 2020257993A1 CN 2019092594 W CN2019092594 W CN 2019092594W WO 2020257993 A1 WO2020257993 A1 WO 2020257993A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
natural person
real
time
user
Prior art date
Application number
PCT/CN2019/092594
Other languages
French (fr)
Chinese (zh)
Inventor
吴旭镇
Original Assignee
深圳市欢太科技有限公司
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市欢太科技有限公司, Oppo广东移动通信有限公司 filed Critical 深圳市欢太科技有限公司
Priority to CN201980091615.0A priority Critical patent/CN113412608B/en
Priority to PCT/CN2019/092594 priority patent/WO2020257993A1/en
Publication of WO2020257993A1 publication Critical patent/WO2020257993A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40Network security protocols

Definitions

  • This application relates to the field of communication technology, and specifically relates to a content push method, device, server and storage medium.
  • the server pushes content to the client (for example, the application client)
  • the server pushes the content according to the user portrait of the user identity (ID) reported by the client. Since the accuracy of the user portrait of a single user ID is not high, the accuracy of the content pushed by the server is low.
  • the embodiments of the present application provide a content pushing method, device, server, and storage medium, which can improve the accuracy of content pushing.
  • an embodiment of the present application provides a content pushing method, including:
  • an embodiment of the present application provides a content pushing device, the content pushing device includes an acquiring unit, a calculating unit, a determining unit, and a pushing unit, wherein:
  • the acquiring unit is configured to acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generate the real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs,
  • the real-time characteristic parameter includes the current time point;
  • the calculation unit is configured to calculate the similarity between the real-time user feature of the target natural person ID and at least one historical feature of the target historical time period of the target natural person ID, and the current time point is the same as the target historical time period Corresponding;
  • the determining unit is configured to determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target corresponding to the first target historical feature Content label
  • the pushing unit is configured to push the content corresponding to the first target content tag to the target natural person ID.
  • an embodiment of the present application provides a server, including a processor and a memory, the memory is used to store one or more programs, and the one or more programs are configured to be executed by the processor.
  • the program includes instructions for executing the steps in the first aspect of the embodiments of the present application.
  • an embodiment of the present application provides a computer-readable storage medium, wherein the foregoing computer-readable storage medium stores a computer program for electronic data exchange, wherein the foregoing computer program enables a computer to execute Some or all of the steps described in one aspect.
  • embodiments of the present application provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute Example part or all of the steps described in the first aspect.
  • the computer program product may be a software installation package.
  • the content pushing method described in the embodiment of this application specifically includes the following steps: acquiring real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and real-time user characteristics based on multiple user IDs Generate the real-time user characteristics of the target natural person ID, the real-time characteristic parameters include the current time point; calculate the similarity between the real-time user characteristics of the target natural person ID and the target historical time period of the target natural person ID, the current time point and the target Corresponding to the historical time period; determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among at least one historical feature, and determine the first target content label corresponding to the first target historical feature; send the target natural person ID Push the content corresponding to the first target content tag.
  • the content to be pushed to the target natural person ID when pushing content to the target natural person ID, can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID And push the content corresponding to the label to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
  • FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application
  • FIG. 2 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application.
  • Figure 3 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application.
  • Fig. 4 is a schematic structural diagram of a server disclosed in an embodiment of the present application.
  • the mobile terminals involved in the embodiments of this application may include various handheld devices with wireless communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to wireless modems, as well as various forms of user equipment (User Equipment, UE), mobile station (Mobile Station, MS), terminal device (terminal device), etc.
  • UE User Equipment
  • MS Mobile Station
  • terminal device terminal device
  • FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application. As shown in FIG. 1, the content pushing method includes the following steps.
  • the server acquires real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generates real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include the current Point in time.
  • the server serves the client, and the content of the service includes providing resources to the client and storing client data.
  • the server is a targeted service program, and the device running the server can be called a server.
  • the server can establish connections with multiple clients at the same time, and can provide services to multiple clients at the same time.
  • the services provided by the server for the client in the embodiments of the present application mainly include content push services.
  • the content push service may include: an advertisement content push service.
  • the advertisement content push service may include: browser content push service, application content push service, game content push service, etc.
  • the server can include application server, browser server, game server, etc.
  • User ID can include any one or more of the following types: single sign on identity (SSOID), OpenID, integrated circuit card identity (ICCID), international mobile device identity (International Mobile) Equipment Identity, IMEI), telephone number (telephone, TEL), Globally Unique Identifier (GUID), etc.
  • SSO is in multiple application systems. Users only need to log in once to access all mutually trusted application systems.
  • the natural person ID in the embodiment of this application corresponds to a natural person.
  • This natural person may correspond to a mobile terminal (for example, a mobile phone), at least one phone number, at least one application account, at least one OpenID, one SSOID, at least one ICCID, and at least one IMEI.
  • a mobile terminal for example, a mobile phone
  • the IMEI, phone number, and 5 application accounts of the mobile phone are labeled with a natural person ID.
  • the user behavior data corresponding to these 5 application accounts all belong to the user behavior data of this natural person ID.
  • a real natural person can have many user IDs (for example, the IMEI of a mobile phone, a phone number, and 5 application accounts), but only one unique natural person ID is corresponding.
  • the specific presentation form of the natural person ID can be a string of characters.
  • the natural person ID can correspond to an identification of a mobile terminal.
  • the user characteristics of the user ID may include device characteristics, positioning characteristics, and application (APP) characteristics.
  • Device characteristics may include the model of the device, the identification of the device, and so on.
  • the positioning features may include global positioning system (Global Positioning System, GPS) positioning information, mobile location-based service (Location Based Service, LBS) location trajectory, etc.
  • the application program characteristics may include the cumulative running time of the application program, the number of startup times of the application program, the usage frequency and times of the application program, and the usage of the application program function. Among them, the application function usage includes the types of advertisements that are paid attention to in the application, the application preference, etc.
  • the real-time user characteristics of the user ID are the user characteristics of the user ID collected at the current point in time.
  • the server can generate the real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of multiple user IDs. For example, if there are 5 user IDs corresponding to the target natural person identification ID, the server can obtain 5 real-time user characteristics of these 5 user IDs, and the server can integrate the 5 real-time user characteristics of these 5 user IDs , Get the real-time user characteristics of the target natural person ID.
  • the 5 real-time user characteristics include: the advertisement A followed by the first APP, the login information of the second APP (not logged in), the login information of the third APP (not logged in), and the login information of the fourth APP ( Not logged in), the login information of the fifth APP (not logged in), then only one user feature (ad A followed by the first APP) among these 5 real-time user features is a useful feature, then the ads followed by the first APP A is the real-time user characteristic of the target natural person ID.
  • the real-time user features of multiple user IDs at a point in time there is generally only one user feature with at most one user ID as a useful feature.
  • the user may open multiple APPs at the same time.
  • multiple user ID characteristics may be useful characteristics.
  • multiple user ID characteristics need to be integrated. For another example, if 5 real-time user characteristics include: Advertisement A followed by the first APP, Advertisement B followed by the second APP, Advertisement C followed by the third APP, login information of the fourth APP (not logged in), and the fifth APP Login information (not logged in).
  • Advertisement A, Advertisement B, and Advertisement C are determined in, if the types of Advertisement A, Advertisement B, and Advertisement C are the same, and they are all type XX advertisements, it can be determined that the real-time user characteristics of the natural person ID are type XX advertisements.
  • step 101 the following steps may be performed:
  • the server obtains the correspondence between the natural person ID and the user ID;
  • the server determines multiple user IDs corresponding to the target natural person ID according to the corresponding relationship between the natural person ID and the user ID.
  • the corresponding relationship between the natural person ID and the user ID can be stored in the database of the server.
  • the server can quickly determine multiple user IDs corresponding to the target natural person ID according to the correspondence between the natural person ID and the user ID.
  • step (11) before performing step (11), the following steps can also be performed:
  • the server obtains the historical behavior data of N user IDs reported by multiple clients, and calculates the similarity between the historical behavior data of N user IDs, where N is a positive integer;
  • the server constructs a relationship pair between user IDs based on the similarity between the historical behavior data of N user IDs;
  • the server constructs the corresponding relationship between the natural person ID and the user ID according to the relationship between the user IDs.
  • one natural person ID corresponds to at least two user IDs.
  • the historical behavior data may include: historical device characteristics, historical positioning characteristics, and historical application (Application, APP) characteristics of N user IDs.
  • Historical device characteristics can include the model of the device, the identification of the device, and the usage habits of the device (for example, the backlight brightness of the device, the volume of the device, the holding posture of the device, the average use time of the device, the boot time of the device, and the Shutdown time, etc.).
  • Historical positioning features may include global positioning system (Global Positioning System, GPS) positioning information, mobile location-based service (Location Based Service, LBS) location trajectory, etc.
  • Historical application features can include application setting parameters (for example, application brightness, application volume, application refresh frequency), application opening time, application closing time, application function usage, application Program continuous running time, cumulative application running time, application installation data, application uninstallation data, etc.
  • the server can use the PageRank algorithm, the shortest path algorithm, and the Alternating Least Squares (ALS) algorithm to calculate the similarity between the historical behavior data of N user IDs.
  • a relationship pair is established for user IDs whose similarity is greater than a preset similarity threshold. For example, SSOID1 ⁇ ->IMEI2, OPENID1 ⁇ ->ICCID3, SSOID2 ⁇ ->TEL2, IMEI2 ⁇ ->TEL3, IMEI2 ⁇ ->ICCID1, SSOID2 ⁇ ->OPENID1, IMEI1 ⁇ ->SSOID2.
  • SSOID1, IMEI2, TEL3, and ICCID1 correspond to one natural person ID (for example, natural person ID1)
  • OPENID1, ICCID3, SSOID2, TEL2, and IMEI1 correspond to another natural person ID (for example, natural person ID2). See Table 1 for details.
  • Table 1 is a table of correspondence between user IDs and natural person IDs disclosed in the embodiments of the present application. As shown in Table 1, the correspondence between natural person ID1 and SSOID1, IMEI2, TEL3, and ICCID1, and the correspondence between natural person ID2 and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1.
  • N is greater than a preset number threshold.
  • the server calculates the similarity between the historical behavior data of N user IDs, specifically:
  • the server uses a locally sensitive hash algorithm to calculate the similarity between the historical behavior data of N user IDs.
  • the number of N is large, the number of user IDs is very large, and if all the user IDs of the N user IDs are calculated one by one, the amount of calculation is very large.
  • Using local sensitive hashing algorithm to calculate the similarity between massive data can reduce the complexity of user similarity calculation.
  • the hash-sensitive algorithm can construct a hash function that puts user IDs with the same or similar characteristics into the same hash bucket, and then calculates the similarity of the user IDs in the hash bucket.
  • the local sensitive hash algorithm of this application can hash user IDs with the same or similar characteristics into the same hash bucket for similarity calculation, so that similar users can be assigned to the same hash bucket with a greater probability , Only need to calculate the similarity between user IDs in the bucket, thereby reducing the complexity of similarity calculation.
  • the same or similar features can be location features, and user IDs with similar latitude and longitude can be hashed into the same bucket.
  • the Euclidean distance calculation formula can be used to determine whether two user IDs have the same or similar location characteristics.
  • the location feature of the first user ID (longitude x 1 , latitude y 1 ) and the location of the second user ID can be obtained Features (longitude is x 2 , latitude y 2 ), calculate the position similarity between the first user ID and the second user ID:
  • d is less than or equal to the preset threshold, it indicates that the first user ID and the second user ID have the same or similar location characteristics, and the first user ID and the second user ID are placed in the same hash bucket. If d is greater than the preset threshold, it indicates that the first user ID and the second user ID do not have the same or similar location features.
  • the server can analyze the user behavior data of the newly registered user ID, analyze the user behavior data of the newly registered user ID and the user behavior data of all natural person IDs that have been stored, if the above has been stored.
  • the similarity of the natural person ID with the greatest similarity to the newly registered user ID among all the natural person IDs is greater than the preset similarity threshold, and the corresponding relationship between the natural person ID with the greatest similarity and the newly registered user ID is established.
  • the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period.
  • the historical feature of the target natural person ID is obtained by integrating the historical characteristics of multiple user IDs corresponding to the target natural person ID.
  • the server can obtain the user characteristics of multiple user IDs corresponding to the target natural person ID at various time points in the past month, and classify the user characteristics at each time point according to the time period to obtain the target natural person ID at each time. At least one historical feature for each time period. Specifically, the server can classify the user characteristics of multiple user IDs corresponding to the target natural person ID at 6-9 a.m.
  • the server can assign the target natural person
  • the user characteristics of the multiple user IDs corresponding to the ID in the past month from 9-11 a.m. are included in the historical characteristics of the second time period of the target natural person ID; the server can assign multiple user IDs corresponding to the target natural person ID in the past The user characteristics at 11-14 o'clock every day within a month are classified as the historical characteristics of the third time period of the target natural person ID; the server can assign multiple user IDs corresponding to the target natural person ID at 14-17 o'clock every day in the past month
  • the user characteristics are classified into the historical characteristics of the fourth time period of the target natural person ID; the server can include the user characteristics of multiple user IDs corresponding to the target natural person ID at 17-20 o'clock every day in the past month into the target natural person ID
  • the historical characteristics of the fifth time period; the server can classify the user characteristics of multiple user IDs corresponding to the target natural person ID at 20-24 o'clock every day in the
  • the current time point is the time point at which the server obtains the real-time user characteristics of the multiple user IDs corresponding to the target natural person identification ID. If the current time point is 7 am, the current time point corresponds to the first time period (6-9 o'clock), and the server calculates at least one of the real-time user characteristics of the target natural person ID and the first time period of the target natural person ID Similarity of historical features.
  • the server can obtain the content label corresponding to each historical feature in each time period.
  • the corresponding content tag may be advertisement A; if the historical feature is the focus on the first APP, the corresponding content tag may be the first APP.
  • the server obtains the content label corresponding to each historical feature in each time period, specifically:
  • the server acquires the first historical feature of the first time period, where the first historical feature is any one of all historical features of the first time period;
  • the server determines the group of natural persons with the first historical characteristic
  • the content label corresponding to the first historical feature in the first time period is the label corresponding to the natural person group.
  • the group of natural persons with the first historical characteristic indicates that the group of natural persons has the first historical characteristic, indicating that the target natural person ID also belongs to the group of natural persons and has the corresponding characteristics of the group of natural persons.
  • the target natural person ID pushes the content corresponding to the tag corresponding to the natural person group, thereby improving the accuracy of content pushing.
  • a group of natural persons refers to a collection of IDs of a type of natural persons with at least one common feature.
  • the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, specifically:
  • the server extracts the target digital parameter in the real-time user characteristics of the target natural person ID, and extracts at least one historical digital parameter of at least one historical characteristic of the target historical time period of the target natural person ID;
  • At least one Euclidean distance between the target digital parameter and the at least one historical digital parameter is calculated by using the Euclidean distance calculation formula.
  • the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, specifically:
  • the server extracts the target vector in the real-time user characteristics of the target natural person ID, and extracts at least one historical vector of at least one historical feature of the target historical time period of the target natural person ID;
  • the Hamming distance between the target vector and the at least one historical vector is calculated by a Hamming distance calculation formula.
  • the target vector is 10 bits
  • at least one history vector is 10 bits.
  • each bit of the target vector is the same. If they are the same, it indicates that the corresponding Hamming The distance is 0. If it is different, it means that the Hamming distance corresponding to the bit is 1.
  • the server determines the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determines the first target content tag corresponding to the first target historical feature.
  • the server pushes the content corresponding to the first target content tag to the target natural person ID.
  • each historical feature has at least one content tag corresponding to it.
  • the server After calculating the similarity between the real-time user feature of the target natural person ID and at least one historical feature of the target historical time period of the target natural person ID, the server determines the highest similarity between the at least one historical feature and the real-time user feature of the target natural person ID.
  • a target historical feature the server pushes the content corresponding to the first target content tag to the target natural person ID.
  • the first target content tag may be an advertisement content tag, and its corresponding content may be advertisement content.
  • accurate advertisement placement can be implemented in a specific time period according to the real-time characteristics of the user.
  • the real-time feature parameters also include current geographic location information.
  • the following steps may also be performed:
  • the server determines the geographic location tag corresponding to the current geographic location information.
  • Step 104 may specifically be:
  • the server pushes the content corresponding to the first target content tag and the geographic location tag to the target natural person ID.
  • the current geographic location information may include current GPS positioning information (for example, current longitude and latitude information).
  • current GPS positioning information for example, current longitude and latitude information.
  • the content to be pushed to the target natural person ID when pushing content to the target natural person ID, can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID And push the content corresponding to the tag to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
  • FIG. 2 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application.
  • Figure 2 is further optimized on the basis of Figure 1.
  • the content pushing method includes the following steps.
  • the server obtains real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generates real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include the current time point And current geographic location information.
  • the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period.
  • the server determines the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determines the first target content tag corresponding to the first target historical feature.
  • step 201 to step 203 can refer to step 101 to step 103 shown in FIG. 1, which will not be repeated here.
  • the server calculates the similarity between the first target content tag and the current geographic location information.
  • the server pushes the content corresponding to the first target content tag to the target natural person ID.
  • the current geographic location information may include current GPS positioning information (for example, current longitude and latitude information).
  • the server calculates the similarity between the first target content tag and the current geographic location information, specifically: the server extracts the location subtag in the first target content tag, and calculates the location subtag in the first target content tag and the current geographic location information The similarity.
  • the server can calculate the similarity between the location subtag in the first target content tag and the current geographic location information through the Euclidean distance calculation formula.
  • the method shown in FIG. 2 may further include the following steps:
  • the server determines the second target with the second highest similarity to the real-time user feature of the target natural person ID in at least one historical feature Historical characteristics, determine the second target content label corresponding to the second target historical characteristics;
  • the server calculates the similarity between the second target content label and the current geographic location information
  • the server pushes the content corresponding to the second target content tag to the target natural person ID.
  • the similarity between the first target content tag and the current geographic location information is less than the preset similarity threshold, it indicates that the current geographic location is not suitable for pushing content corresponding to the first target content tag. For example, if the first target content tag is a travel-related tag, and the current geographic location information is located in a hospital, the similarity between the two is small.
  • the server determines the second highest similarity between the at least one historical feature and the real-time user feature of the target natural person ID.
  • the second target historical feature determines the second target content tag corresponding to the second target historical feature. If the similarity between the second target content tag and the current geographic location information is greater than or equal to the preset similarity threshold, the server sends the target natural person ID Push the content corresponding to the second target content tag. If the similarity between the second target content label and the current geographic location information is less than the preset similarity threshold, step 201 is executed again.
  • the server includes hardware structures and/or software modules corresponding to each function.
  • the present invention can be implemented in the form of hardware or a combination of hardware and computer software. Whether a certain function is executed by hardware or computer software-driven hardware depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the present invention.
  • the embodiment of the present application may divide the server side into functional units according to the foregoing method examples.
  • each functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. It should be noted that the division of units in the embodiments of the present application is illustrative, and is only a logical function division, and there may be other division methods in actual implementation.
  • FIG. 3 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application.
  • the content pushing device 300 includes an acquiring unit 301, a calculating unit 302, a determining unit 303, and a pushing unit 304, wherein:
  • the acquiring unit 301 is configured to acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to a target natural person identification ID, and generate real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs ,
  • the real-time characteristic parameter includes the current time point;
  • the calculation unit 302 is configured to calculate the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point is the same as the target historical time Segment corresponding
  • the determining unit 303 is configured to determine the first target historical feature with the highest similarity to the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target historical feature corresponding to the first target historical feature.
  • Target content label
  • the pushing unit 304 is configured to push the content corresponding to the first target content tag to the target natural person ID.
  • the real-time characteristic parameter also includes current geographic location information.
  • the determining unit 303 is further configured to determine a geographic location tag corresponding to the current geographic location information
  • the pushing unit 304 pushes the content corresponding to the first target content tag to the target natural person ID, specifically: pushing the content corresponding to the first target content tag and the geographic location tag to the target natural person ID content.
  • the real-time characteristic parameters also include current geographic location information
  • the calculation unit 302 is further configured to calculate the similarity between the first target content tag and the current geographic location information
  • the pushing unit 304 is further configured to push the target natural person ID to the target natural person ID when the similarity between the first target content tag and the current geographic location information is greater than or equal to a preset similarity threshold. Content corresponding to a target content tag.
  • the determining unit 303 is further configured to determine the at least one historical feature when the similarity between the first target content label and the current geographic location information is less than the preset similarity threshold Determining the second target content tag corresponding to the second target historical feature in the second target historical feature with the second highest similarity to the real-time user characteristics of the target natural person identification ID;
  • the calculation unit 302 is further configured to calculate the similarity between the second target content tag and the current geographic location information
  • the pushing unit 304 is further configured to push the target natural person ID to the target natural person ID when the similarity between the second target content tag and the current geographic location information is greater than or equal to a preset similarity threshold. 2. Content corresponding to the target content tag.
  • the acquiring unit 301 is further configured to acquire the corresponding relationship between the natural person ID and the user ID before acquiring the real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID;
  • the determining unit 303 is further configured to determine multiple user IDs corresponding to the target natural person ID according to the corresponding relationship between the natural person ID and the user ID.
  • the content pushing apparatus 300 may further include a processing unit 305.
  • the obtaining unit 301 is further configured to obtain historical behavior data of N user IDs reported by multiple clients before obtaining the corresponding relationship between the natural person ID and the user ID;
  • the calculation unit 302 is also used to calculate the similarity between the historical behavior data of the N user IDs, where N is a positive integer;
  • the processing unit 305 is configured to construct a relationship pair between user IDs based on the similarity between the historical behavior data of the N user IDs; construct a pair of natural person IDs and user IDs based on the relationship between the user IDs Correspondence, in the correspondence between the natural person ID and the user ID, one natural person ID corresponds to at least two user IDs.
  • the calculation unit 302 calculates the similarity between the historical behavior data of the N user IDs, specifically: using a local sensitive hash algorithm to calculate the number of the N user IDs The similarity between historical behavior data.
  • the acquiring unit 301, the calculating unit 302, the determining unit 303, the pushing unit 304, and the processing unit 305 in FIG. 3 may be processors.
  • the target natural person ID when pushing content to the target natural person ID, can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID.
  • the tag of the content pushed by the ID, and the content corresponding to the tag is pushed to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
  • FIG. 4 is a schematic structural diagram of a server disclosed in an embodiment of the present application.
  • the server 400 includes a processor 401 and a memory 402.
  • the server 400 may also include a bus 403.
  • the processor 401 and the memory 402 may be connected to each other through the bus 403.
  • the bus 403 may be a peripheral component. Connect the standard (Peripheral Component Interconnect, referred to as PCI) bus or extended industry standard architecture (Extended Industry Standard Architecture, referred to as EISA) bus, etc.
  • PCI Peripheral Component Interconnect
  • EISA Extended Industry Standard Architecture
  • the bus 403 can be divided into an address bus, a data bus, a control bus, and so on. For ease of presentation, only one thick line is used to represent in FIG.
  • the server 400 may also include an input and output device 404.
  • the memory 402 is used to store one or more programs containing instructions; the processor 401 is used to call the instructions stored in the memory 402 to execute some or all of the method steps in FIGS. 1 to 2.
  • the target natural person ID when pushing content to the target natural person ID, can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID.
  • the tag of the pushed content, and the content corresponding to the tag is pushed to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
  • the embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a computer program for electronic data exchange, and the computer program causes the computer to execute any part of the content push method recorded in the above method embodiment Or all steps.
  • the embodiments of the present application also provide a computer program product.
  • the computer program product includes a non-transitory computer-readable storage medium storing a computer program.
  • the computer program is operable to cause a computer to execute any of the methods described in the foregoing method embodiments. Part or all of the steps of a content push method.
  • the disclosed device may be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable memory.
  • the technical solution of the present invention essentially or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory, A number of instructions are included to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention.
  • the aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other various media that can store program codes.
  • the program can be stored in a computer-readable memory, and the memory can include: flash disk , Read-only memory (English: Read-Only Memory, abbreviated as: ROM), random access device (English: Random Access Memory, abbreviated as: RAM), magnetic disk or optical disc, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Disclosed in embodiments of the present application are a content pushing method and apparatus, a server, and a storage medium. The method comprises: obtaining real-time user characteristics of a plurality of user IDs corresponding to a target natural person identifier ID and real-time characteristic parameters, and generating a real-time user characteristic of the target natural person identifier ID on the basis of the real-time user characteristics of the plurality of user IDs, the real-time characteristic parameters comprising a current time point; calculating similarity between the real-time user characteristic of the target natural person identifier ID and at least one historical characteristic of a target historical time period of the target natural person ID, the current time point corresponding to the target historical time period; determining a first target historical characteristic having the highest similarity with the real-time user characteristic of the target natural person identifier ID from the at least one historical characteristic, and determining a first target content tag corresponding to the first target historical characteristic; and pushing content corresponding to the first target content tag to the target natural person ID. The embodiments of the present application can improve the accuracy of content pushing.

Description

内容推送方法、装置、服务端及存储介质Content pushing method, device, server and storage medium 技术领域Technical field
本申请涉及通信技术领域,具体涉及一种内容推送方法、装置、服务端及存储介质。This application relates to the field of communication technology, and specifically relates to a content push method, device, server and storage medium.
背景技术Background technique
目前,服务端(比如,服务器)向客户端(比如,应用程序客户端)进行内容推送时,服务端根据客户端上报的用户标识(Identity,ID)的用户画像进行内容推送。由于单个的用户ID的用户画像的准确度不高,服务端推送的内容的准确性较低。Currently, when the server (for example, the server) pushes content to the client (for example, the application client), the server pushes the content according to the user portrait of the user identity (ID) reported by the client. Since the accuracy of the user portrait of a single user ID is not high, the accuracy of the content pushed by the server is low.
发明内容Summary of the invention
本申请实施例提供了一种内容推送方法、装置、服务端及存储介质,可以提高内容推送的准确性。The embodiments of the present application provide a content pushing method, device, server, and storage medium, which can improve the accuracy of content pushing.
第一方面,本申请实施例提供一种内容推送方法,包括:In the first aspect, an embodiment of the present application provides a content pushing method, including:
获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于所述多个用户ID的实时用户特征生成所述目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点;Acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, generate real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include current Point in time
计算所述目标自然人标识ID的实时用户特征与所述目标自然人ID的目标历史时间段的至少一个历史特征的相似度,所述当前时间点与所述目标历史时间段相对应;Calculating the similarity between the real-time user characteristic of the target natural person identification ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period;
确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与所述第一目标历史特征对应的第一目标内容标签;Determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target content tag corresponding to the first target historical feature;
向所述目标自然人ID推送与所述第一目标内容标签对应的内容。Push the content corresponding to the first target content tag to the target natural person ID.
第二方面,本申请实施例提供了一种内容推送装置,所述内容推送装置包括获取单元、计算单元、确定单元和推送单元,其中:In the second aspect, an embodiment of the present application provides a content pushing device, the content pushing device includes an acquiring unit, a calculating unit, a determining unit, and a pushing unit, wherein:
所述获取单元,用于获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于所述多个用户ID的实时用户特征生成所述目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点;The acquiring unit is configured to acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generate the real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, The real-time characteristic parameter includes the current time point;
所述计算单元,用于计算所述目标自然人标识ID的实时用户特征与所述目标自然人ID的目标历史时间段的至少一个历史特征的相似度,所述当前时间点与所述目标历史时间段相对应;The calculation unit is configured to calculate the similarity between the real-time user feature of the target natural person ID and at least one historical feature of the target historical time period of the target natural person ID, and the current time point is the same as the target historical time period Corresponding;
所述确定单元,用于确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与所述第一目标历史特征对应的第一目标内容标签;The determining unit is configured to determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target corresponding to the first target historical feature Content label
所述推送单元,用于向所述目标自然人ID推送与所述第一目标内容标签对应的内容。The pushing unit is configured to push the content corresponding to the first target content tag to the target natural person ID.
第三方面,本申请实施例提供一种服务端,包括处理器、存储器,所述存储器用于存储一个或多个程序,所述一个或多个程序被配置成由所述处理器执行,上述程序包括用于执行本申请实施例第一方面中的步骤的指令。In a third aspect, an embodiment of the present application provides a server, including a processor and a memory, the memory is used to store one or more programs, and the one or more programs are configured to be executed by the processor. The program includes instructions for executing the steps in the first aspect of the embodiments of the present application.
第四方面,本申请实施例提供了一种计算机可读存储介质,其中,上述计算机可读存储介质存储用于电子数据交换的计算机程序,其中,上述计算机程序使得计算机执行如本申请实施例第一方面中所描述的部分或全部步骤。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the foregoing computer-readable storage medium stores a computer program for electronic data exchange, wherein the foregoing computer program enables a computer to execute Some or all of the steps described in one aspect.
第五方面,本申请实施例提供了一种计算机程序产品,其中,上述计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,上述计算机程序可操作来使计算机执行如本申请实施例第一方面中所描述的部分或全部步骤。该计算机程序产品可以为一个软件安装包。In a fifth aspect, embodiments of the present application provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute Example part or all of the steps described in the first aspect. The computer program product may be a software installation package.
可以看出,本申请实施例中所描述的内容推送方法,具体包括如下步骤:获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于多个用户ID的实时用户特征生成目标自然人标识ID的实时用户特征,实时特征参数包括当前时间点;计算目标自然人标识ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度,当前时间点与目标历史时间段相对应;确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与第一目标历史特征对应的第一目标内容标签;向目标自然人ID推送与第一目标内容标签对应的内容。 实施本申请实施例,在向目标自然人ID推送内容时,可以基于该目标自然人ID的实时用户特征以及实时特征参数与该目标自然人ID的历史特征的相似度来确定向该目标自然人ID推送的内容的标签,并向该目标自然人ID推送该标签对应的内容。由于用于对比的历史特征是基于该自然人ID对应的多个用户ID得到的,与采用单一用户ID的历史特征相比,使得历史特征得到极大的丰富,提高了内容标签的准确度,从而可以提高内容推送的准确性。It can be seen that the content pushing method described in the embodiment of this application specifically includes the following steps: acquiring real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and real-time user characteristics based on multiple user IDs Generate the real-time user characteristics of the target natural person ID, the real-time characteristic parameters include the current time point; calculate the similarity between the real-time user characteristics of the target natural person ID and the target historical time period of the target natural person ID, the current time point and the target Corresponding to the historical time period; determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among at least one historical feature, and determine the first target content label corresponding to the first target historical feature; send the target natural person ID Push the content corresponding to the first target content tag. In implementing the embodiments of this application, when pushing content to the target natural person ID, the content to be pushed to the target natural person ID can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID And push the content corresponding to the label to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
附图说明Description of the drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1是本申请实施例公开的一种内容推送方法的流程示意图;FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application;
图2是本申请实施例公开的另一种内容推送方法的流程示意图;2 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application;
图3是本申请实施例公开的一种内容推送装置的结构示意图;Figure 3 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application;
图4是本申请实施例公开的一种服务端的结构示意图。Fig. 4 is a schematic structural diagram of a server disclosed in an embodiment of the present application.
具体实施方式Detailed ways
为了使本技术领域的人员更好地理解本发明方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其他步骤或单元。The terms "first", "second", etc. in the specification and claims of the present invention and the above-mentioned drawings are used to distinguish different objects, rather than to describe a specific sequence. In addition, the terms "including" and "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not limited to the listed steps or units, but optionally includes unlisted steps or units, or optionally also includes Other steps or units inherent to these processes, methods, products or equipment.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本发明的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。Reference to "embodiments" herein means that a specific feature, structure or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present invention. The appearance of the phrase in various places in the specification does not necessarily refer to the same embodiment, nor is it an independent or alternative embodiment mutually exclusive with other embodiments. Those skilled in the art clearly and implicitly understand that the embodiments described herein can be combined with other embodiments.
本申请实施例所涉及到的移动终端可以包括各种具有无线通信功能的手持设备、车载设备、可穿戴设备、计算设备或连接到无线调制解调器的其他处理设备,以及各种形式的用户设备(User Equipment,UE),移动台(Mobile Station,MS),终端设备(terminal device)等等。为方便描述,上面提到的设备统称为移动终端。The mobile terminals involved in the embodiments of this application may include various handheld devices with wireless communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to wireless modems, as well as various forms of user equipment (User Equipment, UE), mobile station (Mobile Station, MS), terminal device (terminal device), etc. For ease of description, the devices mentioned above are collectively referred to as mobile terminals.
下面对本申请实施例进行详细介绍。The following describes the embodiments of the present application in detail.
请参阅图1,图1是本申请实施例公开的一种内容推送方法的流程示意图,如图1所示,该内容推送方法包括如下步骤。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application. As shown in FIG. 1, the content pushing method includes the following steps.
101,服务端获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于多个用户ID的实时用户特征生成目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点。101. The server acquires real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generates real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include the current Point in time.
本申请实施例中,服务端是为客户端服务的,服务的内容诸如向客户端提供资源,保存客户端数据等。服务端是一种有针对性的服务程序,运行服务端的设备可以称为服务器。服务端可以同时与多个客户端建立连接,可以同时为多个客户端提供服务。本申请实施例中服务端为客户端提供的服务主要包括内容推送服务。内容推送服务可以包括:广告内容推送服务。广告内容推送服务可以包括:浏览器内容推送服务、应用程序内容推送服务、游戏内容推送服务等。服务端可以包括应用程序服务端、浏览器服务端、游戏服务端等。In the embodiments of the present application, the server serves the client, and the content of the service includes providing resources to the client and storing client data. The server is a targeted service program, and the device running the server can be called a server. The server can establish connections with multiple clients at the same time, and can provide services to multiple clients at the same time. The services provided by the server for the client in the embodiments of the present application mainly include content push services. The content push service may include: an advertisement content push service. The advertisement content push service may include: browser content push service, application content push service, game content push service, etc. The server can include application server, browser server, game server, etc.
用户ID可以包括如下任意一种或多种类型:单点登录标识(single sign on identity,SSOID)、OpenID、集成电路卡识别码(Integrate circuit card identity,ICCID)、国际移动设备识别码(International Mobile Equipment Identity,IMEI)、电话号码(telephone,TEL)、全局唯一标识符(Globally Unique Identifier,GUID)等。SSO是在多个应用系统中,用户只需登录一次就可以访问所有相互信任的应用系统。User ID can include any one or more of the following types: single sign on identity (SSOID), OpenID, integrated circuit card identity (ICCID), international mobile device identity (International Mobile) Equipment Identity, IMEI), telephone number (telephone, TEL), Globally Unique Identifier (GUID), etc. SSO is in multiple application systems. Users only need to log in once to access all mutually trusted application systems.
其中,本申请实施例中的自然人ID会对应一个自然人。这个自然人可能会对应一个移动终端(比如,手机)、至少一个电话号码、至少一个应用程序账号、至少一个OpenID、一个SSOID、至少一个ICCID、至少一个IMEI。比如说,一个自然人有用一部手机、一个电话号码、5个应用程序账号,则将手机的IMEI、电话号码、5个应用程序账号打上一个自然人ID的标签。这5个应用程序账号对应的用户行为数据都属于这个自然人ID的用户行为数据。这样,一个真实的自然人,可以有很多个用户ID(比如,一个手机的IMEI、一个电话号码、5个应用程序账号),但是却只对应一个唯一的自然人ID。自然人ID的具体呈现形式可以是一串字符。该自然人ID可以对应一个移动终端的标识,服务端向该自然人ID推送内容时,可以向该自然人ID对应的移动终端推送内容,无需单独向应用程序账号发送推送内容,从而提高推送效率。Among them, the natural person ID in the embodiment of this application corresponds to a natural person. This natural person may correspond to a mobile terminal (for example, a mobile phone), at least one phone number, at least one application account, at least one OpenID, one SSOID, at least one ICCID, and at least one IMEI. For example, if a natural person has a mobile phone, a phone number, and 5 application accounts, the IMEI, phone number, and 5 application accounts of the mobile phone are labeled with a natural person ID. The user behavior data corresponding to these 5 application accounts all belong to the user behavior data of this natural person ID. In this way, a real natural person can have many user IDs (for example, the IMEI of a mobile phone, a phone number, and 5 application accounts), but only one unique natural person ID is corresponding. The specific presentation form of the natural person ID can be a string of characters. The natural person ID can correspond to an identification of a mobile terminal. When the server pushes content to the natural person ID, it can push the content to the mobile terminal corresponding to the natural person ID without sending the pushed content to the application account separately, thereby improving the push efficiency.
用户ID的用户特征可以包括设备特征、定位特征和应用程序(Application,APP)特征。设备特征可以包括设备的型号、设备的标识等。定位特征可以包括全球定位系统(Global Positioning System,GPS)定位信息、基于移动位置服务(Location Based Service,LBS)位置轨迹等。应用程序特征可以包括应用程序的累计运行时长、应用程序的启动次数、应用程序的使用频率和次数、应用程序功能使用情况等。其中,应用程序功能使用情况包括在应用程序内关注的广告的类型、应用程序的使用偏好等。The user characteristics of the user ID may include device characteristics, positioning characteristics, and application (APP) characteristics. Device characteristics may include the model of the device, the identification of the device, and so on. The positioning features may include global positioning system (Global Positioning System, GPS) positioning information, mobile location-based service (Location Based Service, LBS) location trajectory, etc. The application program characteristics may include the cumulative running time of the application program, the number of startup times of the application program, the usage frequency and times of the application program, and the usage of the application program function. Among them, the application function usage includes the types of advertisements that are paid attention to in the application, the application preference, etc.
其中,用户ID的实时用户特征是在当前时间点采集的关于该用户ID的用户特征。Among them, the real-time user characteristics of the user ID are the user characteristics of the user ID collected at the current point in time.
服务端可以基于多个用户ID的实时用户特征生成目标自然人标识ID的实时用户特征。举例来说,如果目标自然人标识ID对应的5个用户ID,则服务端可以获取这5个用户ID的5个实时用户特征,服务端可以将这5个用户ID的5个实时用户特征进行整合,得到该目标自然人ID的实时用户特征。具体的,例如,如果5个实时用户特征包括:第一APP关注的广告A、第二APP的登录信息(未登录)、第三APP的登录信息(未登录)、第四APP的登录信息(未登录)、第五APP的登录信息(未登录),则这5个实时用户特征中只有1个用户特征(第一APP关注的广告A)是有用的特征,则将第一APP关注的广告A作为该目标自然人ID的实时用户特征。一般而言,在一个时间点, 多个用户ID的实时用户特征中,一般最多只有一个用户ID的用户特征是有用特征。在某些情况下,可能用户同时打开了多个APP,多个用户ID的实时用户特征中,可能有多个用户ID特征是有用特征,此时则需要将多个用户ID特征进行整合。又例如,如果5个实时用户特征包括:第一APP关注的广告A、第二APP关注的广告B、第三APP关注的广告C、第四APP的登录信息(未登录)、第五APP的登录信息(未登录)。则果5个实时用户特征中只有3个用户特征(第一APP关注的广告A、第二APP关注的广告B、第三APP关注的广告C)是有用的特征,则从这3个用户特征中确定广告A、广告B和广告C的类型,如果广告A、广告B和广告C的类型相同,均为XX类型广告,则可以确定自然人ID的实时用户特征为XX类型广告。The server can generate the real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of multiple user IDs. For example, if there are 5 user IDs corresponding to the target natural person identification ID, the server can obtain 5 real-time user characteristics of these 5 user IDs, and the server can integrate the 5 real-time user characteristics of these 5 user IDs , Get the real-time user characteristics of the target natural person ID. Specifically, for example, if the 5 real-time user characteristics include: the advertisement A followed by the first APP, the login information of the second APP (not logged in), the login information of the third APP (not logged in), and the login information of the fourth APP ( Not logged in), the login information of the fifth APP (not logged in), then only one user feature (ad A followed by the first APP) among these 5 real-time user features is a useful feature, then the ads followed by the first APP A is the real-time user characteristic of the target natural person ID. Generally speaking, among the real-time user features of multiple user IDs at a point in time, there is generally only one user feature with at most one user ID as a useful feature. In some cases, the user may open multiple APPs at the same time. Among the real-time user characteristics of multiple user IDs, multiple user ID characteristics may be useful characteristics. In this case, multiple user ID characteristics need to be integrated. For another example, if 5 real-time user characteristics include: Advertisement A followed by the first APP, Advertisement B followed by the second APP, Advertisement C followed by the third APP, login information of the fourth APP (not logged in), and the fifth APP Login information (not logged in). If only 3 user characteristics (Ad A followed by the first APP, Ad B followed by the second APP, and Ad C followed by the third APP) of the 5 real-time user characteristics are useful features, then from these 3 user characteristics The types of Advertisement A, Advertisement B, and Advertisement C are determined in, if the types of Advertisement A, Advertisement B, and Advertisement C are the same, and they are all type XX advertisements, it can be determined that the real-time user characteristics of the natural person ID are type XX advertisements.
可选的,在执行步骤101之前,可以执行如下步骤:Optionally, before performing step 101, the following steps may be performed:
(11)服务端获取自然人ID与用户ID的对应关系;(11) The server obtains the correspondence between the natural person ID and the user ID;
(12)服务端根据自然人ID与用户ID的对应关系确定与目标自然人ID对应的多个用户ID。(12) The server determines multiple user IDs corresponding to the target natural person ID according to the corresponding relationship between the natural person ID and the user ID.
本申请实施例中,服务端的数据库中可以存储自然人ID与用户ID的对应关系。服务端可以根据自然人ID与用户ID的对应关系快速确定目标自然人ID对应的多个用户ID。In the embodiment of the present application, the corresponding relationship between the natural person ID and the user ID can be stored in the database of the server. The server can quickly determine multiple user IDs corresponding to the target natural person ID according to the correspondence between the natural person ID and the user ID.
可选的,在执行步骤(11)之前,还可以执行如下步骤:Optionally, before performing step (11), the following steps can also be performed:
(21)服务端获取多个客户端上报的N个用户ID的历史行为数据,计算N个用户ID的历史行为数据之间的相似度,N为正整数;(21) The server obtains the historical behavior data of N user IDs reported by multiple clients, and calculates the similarity between the historical behavior data of N user IDs, where N is a positive integer;
(22)服务端基于N个用户ID的历史行为数据之间的相似度构建用户ID之间的关系对;(22) The server constructs a relationship pair between user IDs based on the similarity between the historical behavior data of N user IDs;
(23)服务端根据用户ID之间的关系对构建自然人ID与用户ID的对应关系,自然人ID与用户ID的对应关系中,一个自然人ID对应至少两个用户ID。(23) The server constructs the corresponding relationship between the natural person ID and the user ID according to the relationship between the user IDs. In the corresponding relationship between the natural person ID and the user ID, one natural person ID corresponds to at least two user IDs.
本申请实施例中,历史行为数据可以包括:N个用户ID的历史设备特征、历史定位特征和历史应用程序(Application,APP)特征。历史设备特征可以包括设备的型号、设备的标识、设备的使用习惯(比如,设备的背光亮度大小、设备的音量大小、设备的握持姿势、设备的平均使用时长、设备的开机时间、 设备的关机时间等)。历史定位特征可以包括全球定位系统(Global Positioning System,GPS)定位信息、基于移动位置服务(Location Based Service,LBS)位置轨迹等。历史应用程序特征可以包括应用程序的设置参数(比如,应用程序的亮度、应用程序的音量、应用程序的刷新频率)、应用程序开启时间点、应用程序关闭时间点、应用程序功能使用情况、应用程序持续运行时长、应用程序累计运行时长、应用程序安装数据、应用程序卸载数据等。In the embodiment of the present application, the historical behavior data may include: historical device characteristics, historical positioning characteristics, and historical application (Application, APP) characteristics of N user IDs. Historical device characteristics can include the model of the device, the identification of the device, and the usage habits of the device (for example, the backlight brightness of the device, the volume of the device, the holding posture of the device, the average use time of the device, the boot time of the device, and the Shutdown time, etc.). Historical positioning features may include global positioning system (Global Positioning System, GPS) positioning information, mobile location-based service (Location Based Service, LBS) location trajectory, etc. Historical application features can include application setting parameters (for example, application brightness, application volume, application refresh frequency), application opening time, application closing time, application function usage, application Program continuous running time, cumulative application running time, application installation data, application uninstallation data, etc.
服务端可以采用网页排名(PageRank)算法、最短路径算法、交替最小二乘(Alternating Least Squares,ALS)算法计算N个用户ID的历史行为数据之间的相似度。将相似度大于预设相似度阈值的用户ID建立关系对。比如,SSOID1<->IMEI2,OPENID1<->ICCID3,SSOID2<->TEL2,IMEI2<->TEL3,IMEI2<->ICCID1,SSOID2<->OPENID1,IMEI1<->SSOID2。则SSOID1、IMEI2、TEL3、ICCID1对应一个自然人ID(比如,自然人ID1),OPENID1、ICCID3、SSOID2、TEL2、IMEI1对应另一个自然人ID(比如,自然人ID2)。具体可以参见表1。The server can use the PageRank algorithm, the shortest path algorithm, and the Alternating Least Squares (ALS) algorithm to calculate the similarity between the historical behavior data of N user IDs. A relationship pair is established for user IDs whose similarity is greater than a preset similarity threshold. For example, SSOID1<->IMEI2, OPENID1<->ICCID3, SSOID2<->TEL2, IMEI2<->TEL3, IMEI2<->ICCID1, SSOID2<->OPENID1, IMEI1<->SSOID2. Then SSOID1, IMEI2, TEL3, and ICCID1 correspond to one natural person ID (for example, natural person ID1), and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1 correspond to another natural person ID (for example, natural person ID2). See Table 1 for details.
表1Table 1
用户IDUser ID 自然人IDNatural person ID
SSOID1、IMEI2、TEL3、ICCID1SSOID1, IMEI2, TEL3, ICCID1 自然人ID1Natural person ID1
OPENID1、ICCID3、SSOID2、TEL2、IMEI1OPENID1, ICCID3, SSOID2, TEL2, IMEI1 自然人ID2Natural person ID2
表1是本申请实施例公开的一种用户ID与自然人ID的对应关系表。如表1所示,自然人ID1与SSOID1、IMEI2、TEL3、ICCID1的对应、自然人ID2与OPENID1、ICCID3、SSOID2、TEL2、IMEI1的对应。Table 1 is a table of correspondence between user IDs and natural person IDs disclosed in the embodiments of the present application. As shown in Table 1, the correspondence between natural person ID1 and SSOID1, IMEI2, TEL3, and ICCID1, and the correspondence between natural person ID2 and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1.
可选的,N大于预设数量阈值,步骤(21)中,服务端计算N个用户ID的历史行为数据之间的相似度,具体为:Optionally, N is greater than a preset number threshold. In step (21), the server calculates the similarity between the historical behavior data of N user IDs, specifically:
服务端采用局部敏感哈希算法计算N个用户ID的历史行为数据之间的相似度。The server uses a locally sensitive hash algorithm to calculate the similarity between the historical behavior data of N user IDs.
本申请实施例中,如果N的数字较大,用户ID的数量非常庞大,如果将N个用户ID的所有用户ID之间逐个进行相似度计算,则计算量非常庞大。采用局部敏感哈希算法计算海量数据之间的相似度,可以降低用户相似度计算的 复杂度。In the embodiment of the present application, if the number of N is large, the number of user IDs is very large, and if all the user IDs of the N user IDs are calculated one by one, the amount of calculation is very large. Using local sensitive hashing algorithm to calculate the similarity between massive data can reduce the complexity of user similarity calculation.
具体的,哈希敏感算法可以构建哈希函数,该哈希函数将具有相同或相似特征的用户ID放入同一个哈系桶内,然后对哈希桶内的用户ID进行相似度计算。本申请的局部敏感哈希算法可以将具有相同或相似特征的用户ID哈希到同一个哈系桶内进行相似度计算,这样,相似的用户以较大的概率分配到同一个哈系桶内,只需要计算桶内的用户ID之间的相似度,从而降低了相似度计算的复杂度。Specifically, the hash-sensitive algorithm can construct a hash function that puts user IDs with the same or similar characteristics into the same hash bucket, and then calculates the similarity of the user IDs in the hash bucket. The local sensitive hash algorithm of this application can hash user IDs with the same or similar characteristics into the same hash bucket for similarity calculation, so that similar users can be assigned to the same hash bucket with a greater probability , Only need to calculate the similarity between user IDs in the bucket, thereby reducing the complexity of similarity calculation.
举例来说,相同或相似特征可以是位置特征,可以将相近经纬度的用户ID哈希到同一个桶内。具体的,可以通过欧式距离计算公式确定两个用户ID是否具有相同或相似的位置特征。For example, the same or similar features can be location features, and user IDs with similar latitude and longitude can be hashed into the same bucket. Specifically, the Euclidean distance calculation formula can be used to determine whether two user IDs have the same or similar location characteristics.
举例来说,计算第一用户ID与第二用户ID是否具有相同或相似的位置特征时,可以获取第一用户ID的位置特征(经度为x 1,纬度y 1),第二用户ID的位置特征(经度为x 2,纬度y 2),计算第一用户ID与第二用户ID的位置相似度: For example, when calculating whether the first user ID and the second user ID have the same or similar location features, the location feature of the first user ID (longitude x 1 , latitude y 1 ) and the location of the second user ID can be obtained Features (longitude is x 2 , latitude y 2 ), calculate the position similarity between the first user ID and the second user ID:
Figure PCTCN2019092594-appb-000001
Figure PCTCN2019092594-appb-000001
如果d小于或等于预设阈值,表明第一用户ID与第二用户ID具有相同或相似的位置特征,将第一用户ID和第二用户ID放入同一个哈系桶内。如果d大于预设阈值,表明第一用户ID与第二用户ID不具有相同或相似的位置特征。If d is less than or equal to the preset threshold, it indicates that the first user ID and the second user ID have the same or similar location characteristics, and the first user ID and the second user ID are placed in the same hash bucket. If d is greater than the preset threshold, it indicates that the first user ID and the second user ID do not have the same or similar location features.
可选的,服务端可以对新注册的用户ID的用户行为数据进行分析,分析该新注册的用户ID的用户行为数据与已经存储的所有自然人ID的用户行为数据进行分析,如果上述已经存储的所有自然人ID中与该新注册的用户ID的相似度最大的自然人ID的相似度大于预设相似度阈值,则建立上述相似度最大的自然人ID与该新注册的用户ID的对应关系。Optionally, the server can analyze the user behavior data of the newly registered user ID, analyze the user behavior data of the newly registered user ID and the user behavior data of all natural person IDs that have been stored, if the above has been stored The similarity of the natural person ID with the greatest similarity to the newly registered user ID among all the natural person IDs is greater than the preset similarity threshold, and the corresponding relationship between the natural person ID with the greatest similarity and the newly registered user ID is established.
102,服务端计算目标自然人标识ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度,当前时间点与目标历史时间段相对应。102. The server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period.
本申请实施例中,目标自然人ID的历史特征是根据该目标自然人ID对应的多个用户ID的历史特征进行整合得到的。比如,服务端可以获取目标自然人ID对应的多个用户ID在过去一个月内的各个时间点的用户特征,将所述各 个时间点的用户特征按照时间段进行归类,得到目标自然人ID在每个时间段的至少一个历史特征。具体的,服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天6-9点的用户特征归入该目标自然人ID的第一时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天9-11点的用户特征归入该目标自然人ID的第二时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天11-14点的用户特征归入该目标自然人ID的第三时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天14-17点的用户特征归入该目标自然人ID的第四时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天17-20点的用户特征归入该目标自然人ID的第五时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天20-24点的用户特征归入该目标自然人ID的第六时间段的历史特征;服务端可以将目标自然人ID对应的多个用户ID在过去一个月内每天0-6点的用户特征归入该目标自然人ID的第七时间段的历史特征。当前时间点是服务端获取目标自然人标识ID对应的多个用户ID的实时用户特征的时间点。如果当前时间点为上午7点,则当前时间点与第一时间段(6-9点)对应,则服务端计算目标自然人标识ID的实时用户特征与目标自然人ID的第一时间段的至少一个历史特征的相似度。In the embodiment of the present application, the historical feature of the target natural person ID is obtained by integrating the historical characteristics of multiple user IDs corresponding to the target natural person ID. For example, the server can obtain the user characteristics of multiple user IDs corresponding to the target natural person ID at various time points in the past month, and classify the user characteristics at each time point according to the time period to obtain the target natural person ID at each time. At least one historical feature for each time period. Specifically, the server can classify the user characteristics of multiple user IDs corresponding to the target natural person ID at 6-9 a.m. in the past month into the historical characteristics of the first time period of the target natural person ID; the server can assign the target natural person The user characteristics of the multiple user IDs corresponding to the ID in the past month from 9-11 a.m. are included in the historical characteristics of the second time period of the target natural person ID; the server can assign multiple user IDs corresponding to the target natural person ID in the past The user characteristics at 11-14 o'clock every day within a month are classified as the historical characteristics of the third time period of the target natural person ID; the server can assign multiple user IDs corresponding to the target natural person ID at 14-17 o'clock every day in the past month The user characteristics are classified into the historical characteristics of the fourth time period of the target natural person ID; the server can include the user characteristics of multiple user IDs corresponding to the target natural person ID at 17-20 o'clock every day in the past month into the target natural person ID The historical characteristics of the fifth time period; the server can classify the user characteristics of multiple user IDs corresponding to the target natural person ID at 20-24 o'clock every day in the past month into the historical characteristics of the sixth time period of the target natural person ID; service The terminal may categorize the user characteristics of multiple user IDs corresponding to the target natural person ID at 0-6 o'clock every day in the past month into the historical characteristics of the seventh time period of the target natural person ID. The current time point is the time point at which the server obtains the real-time user characteristics of the multiple user IDs corresponding to the target natural person identification ID. If the current time point is 7 am, the current time point corresponds to the first time period (6-9 o'clock), and the server calculates at least one of the real-time user characteristics of the target natural person ID and the first time period of the target natural person ID Similarity of historical features.
服务端获取目标自然人ID在每个时间段的至少一个历史特征后,服务端可以获取在每个时间段的每个历史特征对应的内容标签。比如,如果历史特征为关注广告A,则对应的内容标签可以为广告A;如果历史特征为关注第一APP,则对应的内容标签可以为第一APP。After the server obtains at least one historical feature of the target natural person ID in each time period, the server can obtain the content label corresponding to each historical feature in each time period. For example, if the historical feature is the focus on advertisement A, the corresponding content tag may be advertisement A; if the historical feature is the focus on the first APP, the corresponding content tag may be the first APP.
服务端获取在每个时间段的每个历史特征对应的内容标签,具体为:The server obtains the content label corresponding to each historical feature in each time period, specifically:
服务端获取第一时间段的第一历史特征,第一历史特征为所述第一时间段的所有历史特征中的任一个;The server acquires the first historical feature of the first time period, where the first historical feature is any one of all historical features of the first time period;
服务端确定具有所述第一历史特征的自然人群体;The server determines the group of natural persons with the first historical characteristic;
确定所述自然人群体对应的标签;Determine the label corresponding to the group of natural persons;
确定所述第一时间段的第一历史特征对应的内容标签为所述自然人群体对应的标签。It is determined that the content label corresponding to the first historical feature in the first time period is the label corresponding to the natural person group.
本申请实施例中,具有第一历史特征的自然人群体,表明这个自然人群体具都有该第一历史特征,表明该目标自然人ID也属于该自然人群体,具有该自然人群体的相应特征,后续向该目标自然人ID推送与该自然人群体对应的标签对应的内容,从而提高内容推送的准确度。自然人群体是指具有至少一个共有特征的一类自然人ID的集合。In the examples of this application, the group of natural persons with the first historical characteristic indicates that the group of natural persons has the first historical characteristic, indicating that the target natural person ID also belongs to the group of natural persons and has the corresponding characteristics of the group of natural persons. The target natural person ID pushes the content corresponding to the tag corresponding to the natural person group, thereby improving the accuracy of content pushing. A group of natural persons refers to a collection of IDs of a type of natural persons with at least one common feature.
可选的,服务端计算目标自然人ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度,具体为:Optionally, the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, specifically:
服务端提取所述目标自然人ID的实时用户特征中的目标数字参数,提取所述目标自然人ID的目标历史时间段的至少一个历史特征的至少一个历史数字参数;The server extracts the target digital parameter in the real-time user characteristics of the target natural person ID, and extracts at least one historical digital parameter of at least one historical characteristic of the target historical time period of the target natural person ID;
通过欧式距离计算公式分别计算所述目标数字参数与所述至少一个历史数字参数之间的至少一个欧式距离大小。At least one Euclidean distance between the target digital parameter and the at least one historical digital parameter is calculated by using the Euclidean distance calculation formula.
其中,计算的欧式距离越小,其相似度越高;计算的欧式距离越大,其相似度越低。Among them, the smaller the calculated Euclidean distance, the higher the similarity; the larger the calculated Euclidean distance, the lower the similarity.
可选的,服务端计算目标自然人ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度,具体为:Optionally, the server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, specifically:
服务端提取所述目标自然人ID的实时用户特征中的目标向量,提取所述目标自然人ID的目标历史时间段的至少一个历史特征的至少一个历史向量;The server extracts the target vector in the real-time user characteristics of the target natural person ID, and extracts at least one historical vector of at least one historical feature of the target historical time period of the target natural person ID;
通过汉明距离计算公式分别计算所述目标向量与所述至少一个历史向量之间的汉明距离大小。The Hamming distance between the target vector and the at least one historical vector is calculated by a Hamming distance calculation formula.
其中,计算的汉明距离越小,其相似度越高;计算的汉明距离越大,其相似度越低。Among them, the smaller the calculated Hamming distance, the higher the similarity; the larger the calculated Hamming distance, the lower the similarity.
举例来说,目标向量为10位,至少一个历史向量均为10位,通过比较目标向量的每一位与至少一个历史向量的每一位是否相同,若相同,则表明该位对应的汉明距离为0,若不同,则表明该位对应的汉明距离为1,将所有位的汉明距离相加,得到向量之间的最终汉明距离,最终汉明距离为0~10之间。最终汉明距离越大,其相似度越低;最终汉明距离越小,其相似度越高。For example, the target vector is 10 bits, and at least one history vector is 10 bits. By comparing each bit of the target vector with each bit of at least one history vector, it is the same. If they are the same, it indicates that the corresponding Hamming The distance is 0. If it is different, it means that the Hamming distance corresponding to the bit is 1. Add the Hamming distances of all bits to obtain the final Hamming distance between the vectors. The final Hamming distance is between 0 and 10. The larger the final Hamming distance, the lower the similarity; the smaller the final Hamming distance, the higher the similarity.
103,服务端确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与第一目标历史特征对应的第一目标 内容标签。103. The server determines the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determines the first target content tag corresponding to the first target historical feature.
104,服务端向目标自然人ID推送与第一目标内容标签对应的内容。104. The server pushes the content corresponding to the first target content tag to the target natural person ID.
本申请实施例中,每个历史特征都有与其对应的至少一个内容标签。服务端计算目标自然人标识ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度后,确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,服务端向该目标自然人ID推送与第一目标内容标签对应的内容。In the embodiment of the present application, each historical feature has at least one content tag corresponding to it. After calculating the similarity between the real-time user feature of the target natural person ID and at least one historical feature of the target historical time period of the target natural person ID, the server determines the highest similarity between the at least one historical feature and the real-time user feature of the target natural person ID. A target historical feature, the server pushes the content corresponding to the first target content tag to the target natural person ID.
其中,第一目标内容标签可以是广告内容标签,其对应的内容可以为广告内容。本申请实施例可以根据用户的实时特征在特定时段可以实现精准的广告投放。Wherein, the first target content tag may be an advertisement content tag, and its corresponding content may be advertisement content. According to the embodiments of the present application, accurate advertisement placement can be implemented in a specific time period according to the real-time characteristics of the user.
可选的,实时特征参数还包括当前地理位置信息,在执行步骤104之前,还可以执行如下步骤:Optionally, the real-time feature parameters also include current geographic location information. Before step 104 is performed, the following steps may also be performed:
服务端确定与当前地理位置信息对应的地理位置标签。The server determines the geographic location tag corresponding to the current geographic location information.
步骤104具体可以为:Step 104 may specifically be:
服务端向目标自然人ID推送与第一目标内容标签和地理位置标签对应的内容。The server pushes the content corresponding to the first target content tag and the geographic location tag to the target natural person ID.
本申请实施例中,当前地理位置信息可以包括当前的GPS定位信息(比如,当前的经纬度信息)。本申请实施例在进行内容推送时,不仅考虑时间因素,还考虑地理位置信息,从而进一步提高向目标自然人ID推送的内容的准确度。In the embodiment of the present application, the current geographic location information may include current GPS positioning information (for example, current longitude and latitude information). In the embodiment of the present application, when content is pushed, not only the time factor is considered, but also geographic location information is also considered, thereby further improving the accuracy of the content pushed to the target natural person ID.
实施本申请实施例,在向目标自然人ID推送内容时,可以基于该目标自然人ID的实时用户特征以及实时特征参数与该目标自然人ID的历史特征的相似度来确定向该目标自然人ID推送的内容的标签,并向该目标自然人ID推送该标签对应的内容。由于用于对比的历史特征是基于该自然人ID对应的多个用户ID得到的,与采用单一用户ID的历史特征相比,使得历史特征得到极大的丰富,提高了内容标签的准确度,从而可以提高内容推送的准确性。In implementing the embodiments of this application, when pushing content to the target natural person ID, the content to be pushed to the target natural person ID can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID And push the content corresponding to the tag to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
请参阅图2,图2是本申请实施例公开的另一种内容推送方法的流程示意图。图2是在图1的基础上进一步优化得到的,如图2所示,该内容推送方法包括如下步骤。Please refer to FIG. 2. FIG. 2 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application. Figure 2 is further optimized on the basis of Figure 1. As shown in Figure 2, the content pushing method includes the following steps.
201,服务端获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于多个用户ID的实时用户特征生成目标自然人标识ID的实时用户特征,实时特征参数包括当前时间点和当前地理位置信息。201. The server obtains real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generates real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include the current time point And current geographic location information.
202,服务端计算目标自然人标识ID的实时用户特征与目标自然人ID的目标历史时间段的至少一个历史特征的相似度,当前时间点与目标历史时间段相对应。202. The server calculates the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period.
203,服务端确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与第一目标历史特征对应的第一目标内容标签。203. The server determines the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determines the first target content tag corresponding to the first target historical feature.
其中,步骤201至步骤203的具体实施可以参见图1所示的步骤101至步骤103,此处不再赘述。Among them, the specific implementation of step 201 to step 203 can refer to step 101 to step 103 shown in FIG. 1, which will not be repeated here.
204,服务端计算第一目标内容标签与当前地理位置信息的相似度。204. The server calculates the similarity between the first target content tag and the current geographic location information.
205,若第一目标内容标签与当前地理位置信息的相似度大于或等于预设相似度阈值,服务端向目标自然人ID推送与第一目标内容标签对应的内容。205. If the similarity between the first target content tag and the current geographic location information is greater than or equal to the preset similarity threshold, the server pushes the content corresponding to the first target content tag to the target natural person ID.
本申请实施例中,当前地理位置信息可以包括当前的GPS定位信息(比如,当前的经纬度信息)。服务端计算第一目标内容标签与当前地理位置信息的相似度,具体为:服务端提取第一目标内容标签中的位置子标签,计算第一目标内容标签中的位置子标签与当前地理位置信息的相似度。In the embodiment of the present application, the current geographic location information may include current GPS positioning information (for example, current longitude and latitude information). The server calculates the similarity between the first target content tag and the current geographic location information, specifically: the server extracts the location subtag in the first target content tag, and calculates the location subtag in the first target content tag and the current geographic location information The similarity.
其中,服务端可以通过欧式距离计算公式算第一目标内容标签中的位置子标签与当前地理位置信息的相似度。Among them, the server can calculate the similarity between the location subtag in the first target content tag and the current geographic location information through the Euclidean distance calculation formula.
本申请实施例在进行内容推送时,不仅考虑时间因素,还考虑地理位置信息,从而进一步提高向目标自然人ID推送的内容的准确度。In the embodiment of the present application, when content is pushed, not only the time factor is considered, but also geographic location information is also considered, thereby further improving the accuracy of the content pushed to the target natural person ID.
可选的,图2所示的方法还可以包括如下步骤:Optionally, the method shown in FIG. 2 may further include the following steps:
(31)若第一目标内容标签与当前地理位置信息的相似度小于预设相似度阈值,服务端确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度第二高的第二目标历史特征,确定与第二目标历史特征对应的第二目标内容标签;(31) If the similarity between the first target content tag and the current geographic location information is less than the preset similarity threshold, the server determines the second target with the second highest similarity to the real-time user feature of the target natural person ID in at least one historical feature Historical characteristics, determine the second target content label corresponding to the second target historical characteristics;
(32)服务端计算第二目标内容标签与当前地理位置信息的相似度;(32) The server calculates the similarity between the second target content label and the current geographic location information;
(33)若第二目标内容标签与当前地理位置信息的相似度大于或等于预设 相似度阈值,服务端向目标自然人ID推送与第二目标内容标签对应的内容。(33) If the similarity between the second target content tag and the current geographic location information is greater than or equal to the preset similarity threshold, the server pushes the content corresponding to the second target content tag to the target natural person ID.
本申请实施例中,如果第一目标内容标签与当前地理位置信息的相似度小于预设相似度阈值,表明在当前地理位置,并不适合推送与第一目标内容标签对应的内容。比如,第一目标内容标签为旅行有关的标签,而当前地理位置信息定位在医院,则二者相似度小。In this embodiment of the application, if the similarity between the first target content tag and the current geographic location information is less than the preset similarity threshold, it indicates that the current geographic location is not suitable for pushing content corresponding to the first target content tag. For example, if the first target content tag is a travel-related tag, and the current geographic location information is located in a hospital, the similarity between the two is small.
本申请实施例中,当第一目标内容标签与当前地理位置信息的相似度小于预设相似度阈值,服务端确定至少一个历史特征中与目标自然人标识ID的实时用户特征相似度第二高的第二目标历史特征,确定与第二目标历史特征对应的第二目标内容标签,若第二目标内容标签与当前地理位置信息的相似度大于或等于预设相似度阈值,服务端向目标自然人ID推送与第二目标内容标签对应的内容。若第二目标内容标签与当前地理位置信息的相似度小于预设相似度阈值,则重新执行步骤201。In this embodiment of the application, when the similarity between the first target content tag and the current geographic location information is less than the preset similarity threshold, the server determines the second highest similarity between the at least one historical feature and the real-time user feature of the target natural person ID. The second target historical feature determines the second target content tag corresponding to the second target historical feature. If the similarity between the second target content tag and the current geographic location information is greater than or equal to the preset similarity threshold, the server sends the target natural person ID Push the content corresponding to the second target content tag. If the similarity between the second target content label and the current geographic location information is less than the preset similarity threshold, step 201 is executed again.
本申请实施例在进行内容推送时,不仅考虑时间因素,还考虑地理位置信息,从而进一步提高向目标自然人ID推送的内容的准确度。In the embodiment of the present application, when content is pushed, not only the time factor is considered, but also geographic location information is also considered, thereby further improving the accuracy of the content pushed to the target natural person ID.
上述主要从方法侧执行过程的角度对本申请实施例的方案进行了介绍。可以理解的是,服务端为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本发明能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。The foregoing mainly introduces the solution of the embodiment of the present application from the perspective of the execution process on the method side. It can be understood that, in order to realize the above-mentioned functions, the server includes hardware structures and/or software modules corresponding to each function. Those skilled in the art should easily realize that in combination with the units and algorithm steps of the examples described in the embodiments disclosed herein, the present invention can be implemented in the form of hardware or a combination of hardware and computer software. Whether a certain function is executed by hardware or computer software-driven hardware depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the present invention.
本申请实施例可以根据上述方法示例对服务端进行功能单元的划分,例如,可以对应各个功能划分各个功能单元,也可以将两个或两个以上的功能集成在一个处理单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。需要说明的是,本申请实施例中对单元的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。The embodiment of the present application may divide the server side into functional units according to the foregoing method examples. For example, each functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. It should be noted that the division of units in the embodiments of the present application is illustrative, and is only a logical function division, and there may be other division methods in actual implementation.
请参阅图3,图3是本申请实施例公开的一种内容推送装置的结构示意图。如图3所示,该内容推送装置300包括获取单元301、计算单元302、确定单 元303和推送单元304,其中:Please refer to FIG. 3, which is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application. As shown in Fig. 3, the content pushing device 300 includes an acquiring unit 301, a calculating unit 302, a determining unit 303, and a pushing unit 304, wherein:
所述获取单元301,用于获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于所述多个用户ID的实时用户特征生成所述目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点;The acquiring unit 301 is configured to acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to a target natural person identification ID, and generate real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs , The real-time characteristic parameter includes the current time point;
所述计算单元302,用于计算所述目标自然人标识ID的实时用户特征与所述目标自然人ID的目标历史时间段的至少一个历史特征的相似度,所述当前时间点与所述目标历史时间段相对应;The calculation unit 302 is configured to calculate the similarity between the real-time user characteristic of the target natural person ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point is the same as the target historical time Segment corresponding
所述确定单元303,用于确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与所述第一目标历史特征对应的第一目标内容标签;The determining unit 303 is configured to determine the first target historical feature with the highest similarity to the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target historical feature corresponding to the first target historical feature. Target content label;
所述推送单元304,用于向所述目标自然人ID推送与所述第一目标内容标签对应的内容。The pushing unit 304 is configured to push the content corresponding to the first target content tag to the target natural person ID.
可选的,所述实时特征参数还包括当前地理位置信息。Optionally, the real-time characteristic parameter also includes current geographic location information.
所述确定单元303,还用于确定与所述当前地理位置信息对应的地理位置标签;The determining unit 303 is further configured to determine a geographic location tag corresponding to the current geographic location information;
所述推送单元304向所述目标自然人ID推送与所述第一目标内容标签对应的内容,具体为:向所述目标自然人ID推送与所述第一目标内容标签和所述地理位置标签对应的内容。The pushing unit 304 pushes the content corresponding to the first target content tag to the target natural person ID, specifically: pushing the content corresponding to the first target content tag and the geographic location tag to the target natural person ID content.
可选的,所述实时特征参数还包括当前地理位置信息,Optionally, the real-time characteristic parameters also include current geographic location information,
所述计算单元302,还用于计算所述第一目标内容标签与所述当前地理位置信息的相似度;The calculation unit 302 is further configured to calculate the similarity between the first target content tag and the current geographic location information;
所述推送单元304,还用于在所述第一目标内容标签与所述当前地理位置信息的相似度大于或等于预设相似度阈值的情况下,向所述目标自然人ID推送与所述第一目标内容标签对应的内容。The pushing unit 304 is further configured to push the target natural person ID to the target natural person ID when the similarity between the first target content tag and the current geographic location information is greater than or equal to a preset similarity threshold. Content corresponding to a target content tag.
可选的,所述确定单元303,还用于在所述第一目标内容标签与所述当前地理位置信息的相似度小于所述预设相似度阈值的情况下,确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度第二高的第二目标历史特征,确定与所述第二目标历史特征对应的第二目标内容标签;Optionally, the determining unit 303 is further configured to determine the at least one historical feature when the similarity between the first target content label and the current geographic location information is less than the preset similarity threshold Determining the second target content tag corresponding to the second target historical feature in the second target historical feature with the second highest similarity to the real-time user characteristics of the target natural person identification ID;
所述计算单元302,还用于计算所述第二目标内容标签与所述当前地理位 置信息的相似度;The calculation unit 302 is further configured to calculate the similarity between the second target content tag and the current geographic location information;
所述推送单元304,还用于在所述第二目标内容标签与所述当前地理位置信息的相似度大于或等于预设相似度阈值的情况下,向所述目标自然人ID推送与所述第二目标内容标签对应的内容。The pushing unit 304 is further configured to push the target natural person ID to the target natural person ID when the similarity between the second target content tag and the current geographic location information is greater than or equal to a preset similarity threshold. 2. Content corresponding to the target content tag.
可选的,所述获取单元301,还用于在获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数之前,获取自然人ID与用户ID的对应关系;Optionally, the acquiring unit 301 is further configured to acquire the corresponding relationship between the natural person ID and the user ID before acquiring the real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID;
所述确定单元303,还用于根据自然人ID与用户ID的对应关系确定与所述目标自然人ID对应的多个用户ID。The determining unit 303 is further configured to determine multiple user IDs corresponding to the target natural person ID according to the corresponding relationship between the natural person ID and the user ID.
可选的,该内容推送装置300还可以包括处理单元305。Optionally, the content pushing apparatus 300 may further include a processing unit 305.
所述获取单元301,还用于获取自然人ID与用户ID的对应关系之前,获取多个客户端上报的N个用户ID的历史行为数据;The obtaining unit 301 is further configured to obtain historical behavior data of N user IDs reported by multiple clients before obtaining the corresponding relationship between the natural person ID and the user ID;
所述计算单元302,还用于计算所述N个用户ID的历史行为数据之间的相似度,N为正整数;The calculation unit 302 is also used to calculate the similarity between the historical behavior data of the N user IDs, where N is a positive integer;
所述处理单元305,用于基于所述N个用户ID的历史行为数据之间的相似度构建用户ID之间的关系对;根据所述用户ID之间的关系对构建自然人ID与用户ID的对应关系,所述自然人ID与用户ID的对应关系中,一个自然人ID对应至少两个用户ID。The processing unit 305 is configured to construct a relationship pair between user IDs based on the similarity between the historical behavior data of the N user IDs; construct a pair of natural person IDs and user IDs based on the relationship between the user IDs Correspondence, in the correspondence between the natural person ID and the user ID, one natural person ID corresponds to at least two user IDs.
可选的,N大于预设数量阈值,所述计算单元302计算所述N个用户ID的历史行为数据之间的相似度,具体为:采用局部敏感哈希算法计算所述N个用户ID的历史行为数据之间的相似度。Optionally, if N is greater than a preset number threshold, the calculation unit 302 calculates the similarity between the historical behavior data of the N user IDs, specifically: using a local sensitive hash algorithm to calculate the number of the N user IDs The similarity between historical behavior data.
其中,图3中的获取单元301、计算单元302、确定单元303、推送单元304和处理单元305可以是处理器、。Wherein, the acquiring unit 301, the calculating unit 302, the determining unit 303, the pushing unit 304, and the processing unit 305 in FIG. 3 may be processors.
实施图3所示的内容推送装置,在向目标自然人ID推送内容时,可以基于该目标自然人ID的实时用户特征以及实时特征参数与该目标自然人ID的历史特征的相似度来确定向该目标自然人ID推送的内容的标签,并向该目标自然人ID推送该标签对应的内容。由于用于对比的历史特征是基于该自然人ID对应的多个用户ID得到的,与采用单一用户ID的历史特征相比,使得历史特征得到极大的丰富,提高了内容标签的准确度,从而可以提高内容推送的准确 性。Implementing the content pushing device shown in Figure 3, when pushing content to the target natural person ID, the target natural person ID can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID. The tag of the content pushed by the ID, and the content corresponding to the tag is pushed to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
请参阅图4,图4是本申请实施例公开的一种服务端的结构示意图。如图4所示,该服务端400包括处理器401和存储器402,其中,服务端400还可以包括总线403,处理器401和存储器402可以通过总线403相互连接,总线403可以是外设部件互连标准(Peripheral Component Interconnect,简称PCI)总线或扩展工业标准结构(Extended Industry Standard Architecture,简称EISA)总线等。总线403可以分为地址总线、数据总线、控制总线等。为便于表示,图4中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。其中,服务端400还可以包括输入输出设备404。存储器402用于存储包含指令的一个或多个程序;处理器401用于调用存储在存储器402中的指令执行上述图1至图2中的部分或全部方法步骤。Please refer to FIG. 4, which is a schematic structural diagram of a server disclosed in an embodiment of the present application. As shown in FIG. 4, the server 400 includes a processor 401 and a memory 402. The server 400 may also include a bus 403. The processor 401 and the memory 402 may be connected to each other through the bus 403. The bus 403 may be a peripheral component. Connect the standard (Peripheral Component Interconnect, referred to as PCI) bus or extended industry standard architecture (Extended Industry Standard Architecture, referred to as EISA) bus, etc. The bus 403 can be divided into an address bus, a data bus, a control bus, and so on. For ease of presentation, only one thick line is used to represent in FIG. 4, but it does not mean that there is only one bus or one type of bus. The server 400 may also include an input and output device 404. The memory 402 is used to store one or more programs containing instructions; the processor 401 is used to call the instructions stored in the memory 402 to execute some or all of the method steps in FIGS. 1 to 2.
实施图4所示的服务端,在向目标自然人ID推送内容时,可以基于该目标自然人ID的实时用户特征以及实时特征参数与该目标自然人ID的历史特征的相似度来确定向该目标自然人ID推送的内容的标签,并向该目标自然人ID推送该标签对应的内容。由于用于对比的历史特征是基于该自然人ID对应的多个用户ID得到的,与采用单一用户ID的历史特征相比,使得历史特征得到极大的丰富,提高了内容标签的准确度,从而可以提高内容推送的准确性。Implementing the server shown in Figure 4, when pushing content to the target natural person ID, the target natural person ID can be determined based on the real-time user characteristics of the target natural person ID and the similarity between the real-time feature parameters and the historical characteristics of the target natural person ID. The tag of the pushed content, and the content corresponding to the tag is pushed to the target natural person ID. Since the historical features used for comparison are obtained based on multiple user IDs corresponding to the natural person ID, compared with the historical features using a single user ID, the historical features are greatly enriched, and the accuracy of content tags is improved, thereby Can improve the accuracy of content push.
本申请实施例还提供一种计算机存储介质,其中,该计算机存储介质存储用于电子数据交换的计算机程序,该计算机程序使得计算机执行如上述方法实施例中记载的任何一种内容推送方法的部分或全部步骤。The embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a computer program for electronic data exchange, and the computer program causes the computer to execute any part of the content push method recorded in the above method embodiment Or all steps.
本申请实施例还提供一种计算机程序产品,该计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,该计算机程序可操作来使计算机执行如上述方法实施例中记载的任何一种内容推送方法的部分或全部步骤。The embodiments of the present application also provide a computer program product. The computer program product includes a non-transitory computer-readable storage medium storing a computer program. The computer program is operable to cause a computer to execute any of the methods described in the foregoing method embodiments. Part or all of the steps of a content push method.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。It should be noted that for the foregoing method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should know that the present invention is not limited by the described sequence of actions. Because according to the present invention, certain steps can be performed in other order or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the involved actions and modules are not necessarily required by the present invention.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详 述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail in an embodiment, reference may be made to related descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed device may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储器中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储器中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储器包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable memory. Based on this understanding, the technical solution of the present invention essentially or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory, A number of instructions are included to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention. The aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other various media that can store program codes.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储器中,存储器可以包括:闪存盘、只读存储器(英文:Read-Only Memory,简称:ROM)、随机存取器(英文:Random Access Memory,简称:RAM)、 磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing relevant hardware through a program. The program can be stored in a computer-readable memory, and the memory can include: flash disk , Read-only memory (English: Read-Only Memory, abbreviated as: ROM), random access device (English: Random Access Memory, abbreviated as: RAM), magnetic disk or optical disc, etc.
以上对本申请实施例进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。The embodiments of the present application are described in detail above, and specific examples are used in this article to illustrate the principles and implementation of the present invention. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present invention; Those of ordinary skill in the art, based on the idea of the present invention, will have changes in the specific implementation and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (10)

  1. 一种内容推送方法,其特征在于,包括:A content pushing method, characterized in that it comprises:
    获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于所述多个用户ID的实时用户特征生成所述目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点;Acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, generate real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, and the real-time characteristic parameters include current Point in time
    计算所述目标自然人标识ID的实时用户特征与所述目标自然人ID的目标历史时间段的至少一个历史特征的相似度,所述当前时间点与所述目标历史时间段相对应;Calculating the similarity between the real-time user characteristic of the target natural person identification ID and at least one historical characteristic of the target historical time period of the target natural person ID, and the current time point corresponds to the target historical time period;
    确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与所述第一目标历史特征对应的第一目标内容标签;Determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target content tag corresponding to the first target historical feature;
    向所述目标自然人ID推送与所述第一目标内容标签对应的内容。Push the content corresponding to the first target content tag to the target natural person ID.
  2. 根据权利要求1所述的方法,其特征在于,所述实时特征参数还包括当前地理位置信息,所述向所述目标自然人ID推送与所述第一目标内容标签对应的内容之前,所述方法还包括:The method according to claim 1, wherein the real-time characteristic parameter further includes current geographic location information, and before the content corresponding to the first target content tag is pushed to the target natural person ID, the method Also includes:
    确定与所述当前地理位置信息对应的地理位置标签;Determining a geographic location tag corresponding to the current geographic location information;
    所述向所述目标自然人ID推送与所述第一目标内容标签对应的内容,包括:The pushing content corresponding to the first target content tag to the target natural person ID includes:
    向所述目标自然人ID推送与所述第一目标内容标签和所述地理位置标签对应的内容。Push the content corresponding to the first target content tag and the geographic location tag to the target natural person ID.
  3. 根据权利要求1所述的方法,其特征在于,所述实时特征参数还包括当前地理位置信息,所述确定与所述第一目标历史特征对应的第一目标内容标签之后,所述方法还包括:The method according to claim 1, wherein the real-time feature parameter further includes current geographic location information, and after the first target content tag corresponding to the first target historical feature is determined, the method further comprises :
    计算所述第一目标内容标签与所述当前地理位置信息的相似度;Calculating the similarity between the first target content label and the current geographic location information;
    若所述第一目标内容标签与所述当前地理位置信息的相似度大于或等于预设相似度阈值,执行所述向所述目标自然人ID推送与所述第一目标内容标 签对应的内容的步骤。If the similarity between the first target content tag and the current geographic location information is greater than or equal to a preset similarity threshold, execute the step of pushing content corresponding to the first target content tag to the target natural person ID .
  4. 根据权利要求3所述的方法,其特征在于,所述方法还包括:The method according to claim 3, wherein the method further comprises:
    若所述第一目标内容标签与所述当前地理位置信息的相似度小于所述预设相似度阈值,确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度第二高的第二目标历史特征,确定与所述第二目标历史特征对应的第二目标内容标签;If the similarity between the first target content tag and the current geographic location information is less than the preset similarity threshold, it is determined that the real-time user feature similarity between the at least one historical feature and the target natural person identification ID is second High second target historical feature, and determine the second target content tag corresponding to the second target historical feature;
    计算所述第二目标内容标签与所述当前地理位置信息的相似度;Calculating the similarity between the second target content tag and the current geographic location information;
    若所述第二目标内容标签与所述当前地理位置信息的相似度大于或等于预设相似度阈值,向所述目标自然人ID推送与所述第二目标内容标签对应的内容。If the similarity between the second target content tag and the current geographic location information is greater than or equal to a preset similarity threshold, the content corresponding to the second target content tag is pushed to the target natural person ID.
  5. 根据权利要求1所述的方法,其特征在于,所述获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数之前,所述方法还包括:The method according to claim 1, wherein before said acquiring real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, the method further comprises:
    获取自然人ID与用户ID的对应关系;Obtain the correspondence between natural person ID and user ID;
    根据自然人ID与用户ID的对应关系确定与所述目标自然人ID对应的多个用户ID。A plurality of user IDs corresponding to the target natural person ID are determined according to the corresponding relationship between the natural person ID and the user ID.
  6. 根据权利要求5所述的方法,其特征在于,所述获取自然人ID与用户ID的对应关系之前,所述方法还包括:The method according to claim 5, characterized in that, before the obtaining the correspondence between the natural person ID and the user ID, the method further comprises:
    获取多个客户端上报的N个用户ID的历史行为数据,计算所述N个用户ID的历史行为数据之间的相似度,N为正整数;Acquiring historical behavior data of N user IDs reported by multiple clients, and calculating the similarity between the historical behavior data of the N user IDs, where N is a positive integer;
    基于所述N个用户ID的历史行为数据之间的相似度构建用户ID之间的关系对;Constructing a relationship pair between user IDs based on the similarity between the historical behavior data of the N user IDs;
    根据所述用户ID之间的关系对构建自然人ID与用户ID的对应关系,所述自然人ID与用户ID的对应关系中,一个自然人ID对应至少两个用户ID。The corresponding relationship between the natural person ID and the user ID is constructed according to the relationship between the user IDs. In the corresponding relationship between the natural person ID and the user ID, one natural person ID corresponds to at least two user IDs.
  7. 根据权利要求6所述的方法,其特征在于,N大于预设数量阈值,所 述计算所述N个用户ID的历史行为数据之间的相似度,包括:The method according to claim 6, wherein N is greater than a preset number threshold, and the calculating the similarity between historical behavior data of the N user IDs includes:
    采用局部敏感哈希算法计算所述N个用户ID的历史行为数据之间的相似度。A local sensitive hash algorithm is used to calculate the similarity between the historical behavior data of the N user IDs.
  8. 一种内容推送装置,其特征在于,所述内容推送装置包括获取单元、计算单元、确定单元和推送单元,其中:A content pushing device, characterized in that the content pushing device includes an acquisition unit, a calculation unit, a determination unit and a pushing unit, wherein:
    所述获取单元,用于获取目标自然人标识ID对应的多个用户ID的实时用户特征以及实时特征参数,基于所述多个用户ID的实时用户特征生成所述目标自然人标识ID的实时用户特征,所述实时特征参数包括当前时间点;The acquiring unit is configured to acquire real-time user characteristics and real-time characteristic parameters of multiple user IDs corresponding to the target natural person identification ID, and generate the real-time user characteristics of the target natural person identification ID based on the real-time user characteristics of the multiple user IDs, The real-time characteristic parameter includes the current time point;
    所述计算单元,用于计算所述目标自然人标识ID的实时用户特征与所述目标自然人ID的目标历史时间段的至少一个历史特征的相似度,所述当前时间点与所述目标历史时间段相对应;The calculation unit is configured to calculate the similarity between the real-time user feature of the target natural person ID and at least one historical feature of the target historical time period of the target natural person ID, and the current time point is the same as the target historical time period Corresponding;
    所述确定单元,用于确定所述至少一个历史特征中与所述目标自然人标识ID的实时用户特征相似度最高的第一目标历史特征,确定与所述第一目标历史特征对应的第一目标内容标签;The determining unit is configured to determine the first target historical feature that has the highest similarity with the real-time user feature of the target natural person identification ID among the at least one historical feature, and determine the first target corresponding to the first target historical feature Content label
    所述推送单元,用于向所述目标自然人ID推送与所述第一目标内容标签对应的内容。The pushing unit is configured to push the content corresponding to the first target content tag to the target natural person ID.
  9. 一种服务端,其特征在于,包括处理器以及存储器,所述存储器用于存储一个或多个程序,所述一个或多个程序被配置成由所述处理器执行,所述程序包括用于执行如权利要求1~7任一项所述的方法。A server is characterized by comprising a processor and a memory, the memory is used to store one or more programs, the one or more programs are configured to be executed by the processor, and the programs include Perform the method according to any one of claims 1-7.
  10. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质用于存储电子数据交换的计算机程序,其中,所述计算机程序使得计算机执行如权利要求1~7任一项所述的方法。A computer-readable storage medium, characterized in that the computer-readable storage medium is used to store a computer program for electronic data exchange, wherein the computer program causes a computer to execute any one of claims 1-7 method.
PCT/CN2019/092594 2019-06-24 2019-06-24 Content pushing method and apparatus, server, and storage medium WO2020257993A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201980091615.0A CN113412608B (en) 2019-06-24 2019-06-24 Content pushing method and device, server and storage medium
PCT/CN2019/092594 WO2020257993A1 (en) 2019-06-24 2019-06-24 Content pushing method and apparatus, server, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/092594 WO2020257993A1 (en) 2019-06-24 2019-06-24 Content pushing method and apparatus, server, and storage medium

Publications (1)

Publication Number Publication Date
WO2020257993A1 true WO2020257993A1 (en) 2020-12-30

Family

ID=74060434

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/092594 WO2020257993A1 (en) 2019-06-24 2019-06-24 Content pushing method and apparatus, server, and storage medium

Country Status (2)

Country Link
CN (1) CN113412608B (en)
WO (1) WO2020257993A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112699676A (en) * 2020-12-31 2021-04-23 中国农业银行股份有限公司 Address similarity relation generation method and device
CN113742599A (en) * 2021-11-05 2021-12-03 太平金融科技服务(上海)有限公司深圳分公司 Content recommendation method, device, equipment and computer readable storage medium
CN114187037A (en) * 2021-11-30 2022-03-15 北京深演智能科技股份有限公司 Information pushing method and device and nonvolatile storage medium
CN114676288A (en) * 2022-03-17 2022-06-28 北京悠易网际科技发展有限公司 ID pull-through method and device
CN114860557A (en) * 2022-04-08 2022-08-05 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN115796932A (en) * 2022-11-10 2023-03-14 永道工程咨询有限公司 Engineering cost prediction method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014112362A1 (en) * 2013-01-18 2014-07-24 パナソニック株式会社 Content presentation method and prog ram
CN105447147A (en) * 2015-11-26 2016-03-30 晶赞广告(上海)有限公司 Data processing method and apparatus
CN105610929A (en) * 2015-12-24 2016-05-25 北京奇虎科技有限公司 Personalized data pushing method and device
CN107277115A (en) * 2017-05-27 2017-10-20 深圳大学 A kind of content delivery method and device
CN107332807A (en) * 2016-04-29 2017-11-07 高德信息技术有限公司 A kind of information-pushing method and device
CN109145146A (en) * 2018-09-07 2019-01-04 北京奇艺世纪科技有限公司 A kind of data object recommended method, device and electronic equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103106259B (en) * 2013-01-25 2016-01-20 西北工业大学 A kind of mobile webpage content recommendation method based on situation
CN105100029B (en) * 2014-05-22 2018-10-30 阿里巴巴集团控股有限公司 The method and apparatus that authentication is carried out to user
CN104517217A (en) * 2014-11-24 2015-04-15 形山科技(深圳)有限公司 Data processing method and terminal
CN107846393B (en) * 2017-09-11 2020-01-14 阿里巴巴集团控股有限公司 Real person authentication method and device
CN108011928A (en) * 2017-11-10 2018-05-08 深圳市金立通信设备有限公司 A kind of information-pushing method, terminal device and computer-readable medium
CN108363821A (en) * 2018-05-09 2018-08-03 深圳壹账通智能科技有限公司 A kind of information-pushing method, device, terminal device and storage medium
CN108985954B (en) * 2018-07-02 2022-06-21 武汉斗鱼网络科技有限公司 Method for establishing association relation of each identifier and related equipment
CN109299327A (en) * 2018-11-16 2019-02-01 广州市百果园信息技术有限公司 Video recommendation method, device, equipment and storage medium
CN109495770B (en) * 2018-11-23 2021-03-16 武汉斗鱼网络科技有限公司 Live broadcast room recommendation method, device, equipment and medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014112362A1 (en) * 2013-01-18 2014-07-24 パナソニック株式会社 Content presentation method and prog ram
CN105447147A (en) * 2015-11-26 2016-03-30 晶赞广告(上海)有限公司 Data processing method and apparatus
CN105610929A (en) * 2015-12-24 2016-05-25 北京奇虎科技有限公司 Personalized data pushing method and device
CN107332807A (en) * 2016-04-29 2017-11-07 高德信息技术有限公司 A kind of information-pushing method and device
CN107277115A (en) * 2017-05-27 2017-10-20 深圳大学 A kind of content delivery method and device
CN109145146A (en) * 2018-09-07 2019-01-04 北京奇艺世纪科技有限公司 A kind of data object recommended method, device and electronic equipment

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112699676A (en) * 2020-12-31 2021-04-23 中国农业银行股份有限公司 Address similarity relation generation method and device
CN112699676B (en) * 2020-12-31 2024-04-12 中国农业银行股份有限公司 Address similarity relation generation method and device
CN113742599A (en) * 2021-11-05 2021-12-03 太平金融科技服务(上海)有限公司深圳分公司 Content recommendation method, device, equipment and computer readable storage medium
CN113742599B (en) * 2021-11-05 2022-03-18 太平金融科技服务(上海)有限公司深圳分公司 Content recommendation method, device, equipment and computer readable storage medium
CN114187037A (en) * 2021-11-30 2022-03-15 北京深演智能科技股份有限公司 Information pushing method and device and nonvolatile storage medium
CN114676288A (en) * 2022-03-17 2022-06-28 北京悠易网际科技发展有限公司 ID pull-through method and device
CN114860557A (en) * 2022-04-08 2022-08-05 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN114860557B (en) * 2022-04-08 2023-05-26 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN115796932A (en) * 2022-11-10 2023-03-14 永道工程咨询有限公司 Engineering cost prediction method and device, electronic equipment and storage medium
CN115796932B (en) * 2022-11-10 2023-10-03 永道工程咨询有限公司 Project cost prediction method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN113412608A (en) 2021-09-17
CN113412608B (en) 2022-11-15

Similar Documents

Publication Publication Date Title
WO2020257993A1 (en) Content pushing method and apparatus, server, and storage medium
US10278005B2 (en) Method, apparatus, and system for determining a location of a terminal
US10897685B2 (en) Matching users in a location-based service
US9813309B2 (en) Method, apparatus, and system for determining a location corresponding to an IP address
WO2018192496A1 (en) Trend information generation method and device, storage medium and electronic device
CN106296344B (en) Malicious address identification method and device
CN105898085B (en) Harass recognition methods and the device of communications account
WO2019019385A1 (en) Cross-platform data matching method and apparatus, computer device and storage medium
CN109582844A (en) A kind of method, apparatus and system identifying crawler
EP2939200A1 (en) Method and apparatus for secure advertising
CN105871585A (en) Terminal association method and device
WO2020252639A1 (en) Content pushing method and related product
WO2017020550A1 (en) Method and device for generating access point attribute information of wireless access point
CN115145587A (en) Product parameter checking method and device, electronic equipment and storage medium
CN105701224A (en) Security information customized service system based on big data
CN105959306A (en) IP address identification method and device
CN110557351B (en) Method and apparatus for generating information
WO2020258101A1 (en) User similarity calculation method and apparatus, server end, and storage medium
CN115470399A (en) ID (identity) communication method, device, equipment and storage medium based on big data
US20170171330A1 (en) Method for pushing information and electronic device
CN113383360B (en) Content pushing method, device, server side and storage medium
WO2020233093A1 (en) Association graph generation method and apparatus, computer device, and storage medium
CN109561406B (en) SIM card selection method, device, system, electronic equipment and medium
TWM565357U (en) Geographic information system of automated service equipment
CN106844396B (en) Information processing method and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19935507

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19935507

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 27/05/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19935507

Country of ref document: EP

Kind code of ref document: A1