WO2020258102A1 - Content pushing method and apparatus, mobile terminal and storage medium - Google Patents

Content pushing method and apparatus, mobile terminal and storage medium Download PDF

Info

Publication number
WO2020258102A1
WO2020258102A1 PCT/CN2019/093110 CN2019093110W WO2020258102A1 WO 2020258102 A1 WO2020258102 A1 WO 2020258102A1 CN 2019093110 W CN2019093110 W CN 2019093110W WO 2020258102 A1 WO2020258102 A1 WO 2020258102A1
Authority
WO
WIPO (PCT)
Prior art keywords
natural person
user
ids
target
target natural
Prior art date
Application number
PCT/CN2019/093110
Other languages
French (fr)
Chinese (zh)
Inventor
喻婷
Original Assignee
深圳市欢太科技有限公司
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市欢太科技有限公司, Oppo广东移动通信有限公司 filed Critical 深圳市欢太科技有限公司
Priority to PCT/CN2019/093110 priority Critical patent/WO2020258102A1/en
Priority to CN201980091547.8A priority patent/CN113412607B/en
Publication of WO2020258102A1 publication Critical patent/WO2020258102A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/40Network security protocols

Definitions

  • This application relates to the field of communication technology, and in particular to a content pushing method, device, mobile terminal and storage medium.
  • a server for example, a server
  • a user for example, an application client
  • it generally pushes content according to the user profile of the user's identity (ID) logged in to the client.
  • ID the user profile of the user's identity
  • the server will push the wrong content to the client.
  • the embodiments of the present application provide a content pushing method, device, mobile terminal, and storage medium, which can improve the accuracy of content pushing.
  • an embodiment of the present application provides a content pushing method, including:
  • the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
  • an embodiment of the present application provides a content pushing device, the content pushing device includes a capturing unit, a determining unit, a detecting unit, and a processing unit, wherein:
  • the grabbing unit is used to grab M user identification IDs
  • the determining unit is configured to determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
  • the detection unit is configured to detect whether a target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
  • the processing unit is configured to refuse to push content to the target natural person ID when the detection unit detects that the target natural person ID is an abnormal natural person ID.
  • an embodiment of the present application provides a mobile terminal, including a processor and a memory, the memory is used to store one or more programs, and the one or more programs are configured to be executed by the processor.
  • the program includes instructions for executing the steps in the first aspect of the embodiments of the present application.
  • an embodiment of the present application provides a computer-readable storage medium, wherein the foregoing computer-readable storage medium stores a computer program for electronic data exchange, wherein the foregoing computer program enables a computer to execute Some or all of the steps described in one aspect.
  • embodiments of the present application provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute Example part or all of the steps described in the first aspect.
  • the computer program product may be a software installation package.
  • the content push method described in the embodiment of the application specifically includes the following steps: grab M user identification IDs, and determine N natural person IDs corresponding to the M user IDs, where M and N are both positive Integer, N is less than or equal to M; check whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of N natural person IDs; if so, refuse to push content to the target natural person ID.
  • the number of natural person IDs can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of content pushing on the server, thereby increasing The speed of content delivery.
  • FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application
  • Figure 2 is a schematic diagram of judging whether a natural person ID is a substitute ID disclosed in an embodiment of the present application
  • FIG. 3 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application.
  • Figure 4 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application.
  • Fig. 5 is a schematic structural diagram of a mobile terminal disclosed in an embodiment of the present application.
  • the mobile terminals involved in the embodiments of this application may include various handheld devices with wireless communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to wireless modems, as well as various forms of user equipment (User Equipment, UE), mobile station (Mobile Station, MS), terminal device (terminal device), etc.
  • UE User Equipment
  • MS Mobile Station
  • terminal device terminal device
  • FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application. As shown in FIG. 1, the content pushing method includes the following steps.
  • the server grabs M user identification IDs, and determines N natural person IDs corresponding to the M user IDs. Both M and N are positive integers, and N is less than or equal to M.
  • the server serves the client, and the content of the service includes providing resources to the client and storing client data.
  • the server is a targeted service program, and the device running the server can be called a server.
  • the server can establish connections with multiple clients at the same time, and can provide services to multiple clients at the same time.
  • the services provided by the server for the client in the embodiments of the present application mainly include content push services.
  • the content push service may include: browser content push service, application download push service, game content push service, etc.
  • the server can include application server, browser server, game server, etc.
  • the server can grab M user identities (IDs) from the database, and can push specific content to the clients logged in by the M user IDs.
  • User ID can include any one or more of the following types: single sign on identity (SSOID), OpenID, integrated circuit card identity (ICCID), international mobile device identity (International Mobile) Equipment (Identity, IMEI), telephone number (telephone, TEL), etc.
  • SSO is in multiple application systems. Users only need to log in once to access all mutually trusted application systems.
  • the server can randomly grab M user IDs from the database, and subsequently can push different content to the normal natural person IDs among the N natural person IDs corresponding to the M user IDs.
  • the server can also grab M user IDs with the same characteristics from the database, and subsequently can push the same content to the normal natural person IDs among the N natural person IDs corresponding to the M user IDs.
  • the server determines the N natural person IDs corresponding to the M user IDs, which may specifically be: the server determines the N natural person IDs corresponding to the M user IDs according to the generated correspondence between the user ID and the natural person ID.
  • the first situation if the M user IDs are all in the corresponding relationship between the user IDs and the natural person IDs that have been generated, the N natural person IDs corresponding to the M user IDs are directly determined according to the above method.
  • the second case if none of the M user IDs are in the corresponding relationship between the generated user ID and the natural person ID, the server can obtain all the natural person IDs in the corresponding relationship between the generated user ID and the natural person ID, according to
  • the user behavior data of M user IDs analyzes the similarity between the M user IDs and each of the above-mentioned natural person IDs. If the first user ID (the first user ID is any one of the above-mentioned M user IDs) and If the similarity of the natural person ID with the highest similarity among all the above natural person IDs is greater than the preset similarity threshold, then the first user ID is associated with the natural person ID with the highest similarity among all the natural person IDs.
  • the user behavior data of the first user ID is matched with the user behavior data of other user IDs in the database. If the similarity between the user behavior data and the user ID with the highest similarity among other user IDs in the aforementioned database is greater than the preset similarity threshold, the first user ID is established with the user ID with the highest similarity among other user IDs in the aforementioned database The corresponding relationship forms a new natural person ID, which corresponds to the user ID with the highest similarity among the first user ID and other user IDs in the database.
  • the third case if part of the M user IDs are in the corresponding relationship between the user IDs and natural person IDs that have been generated above, and another part is not in the corresponding relationship between the user IDs that have been generated and the natural person IDs, then they have been generated above
  • the user ID in the corresponding relationship between the user ID and the natural person ID directly determines the corresponding natural person ID according to the corresponding relationship, and the user ID that is not in the corresponding relationship between the user ID and the natural person ID that has been generated above is determined according to the method in the second case above The corresponding natural person ID.
  • the natural person ID in the embodiment of this application corresponds to a natural person.
  • This natural person may correspond to at least one mobile terminal (for example, mobile phone), at least one phone number, at least one application account, at least one OpenID, one SSOID, at least one ICCID, and at least one IMEI.
  • the IMEI, phone number, and 5 application accounts of the mobile phone are labeled with a natural person ID.
  • the user behavior data corresponding to these 5 application accounts all belong to the user behavior data of this natural person ID.
  • a real natural person can have many user IDs (for example, the IMEI of a mobile phone, a phone number, and 5 application accounts), but only one unique natural person ID is corresponding.
  • the server detects whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the above N natural person IDs. If yes, go to step 103, if not, go to step 104.
  • the abnormal natural person ID includes: all user IDs corresponding to the abnormal natural person ID are offline for a long time, and all user IDs corresponding to the abnormal natural person ID have user IDs with abnormal logins (for example, a certain application account After the account is stolen, a large number of spam messages are sent to friends in the contact list), and all user IDs corresponding to the abnormal natural person ID have user IDs that have swiping behavior (for example, the user ID frequently downloads and uninstalls applications, the user Recharge accounts with a large number of IDs, etc.), and all user IDs corresponding to the abnormal natural person ID have user IDs with substitute behaviors (for example, the user behavior habits of the user ID are very different from previous user behavior habits).
  • abnormal logins for example, a certain application account After the account is stolen, a large number of spam messages are sent to friends in the contact list
  • all user IDs corresponding to the abnormal natural person ID have user IDs that have swiping behavior (for example, the user ID frequently downloads and uninstalls applications,
  • the server If the server detects that the target natural person ID is an abnormal natural person ID, it will not push content to the target natural person ID, reducing unnecessary content pushing, reducing the pressure of the server content pushing, thereby increasing the speed of content pushing.
  • the server detects that the target natural person ID is not an abnormal natural person ID, it will push content to the target natural person ID. Specifically, the server may push content to any one of the multiple user IDs corresponding to the target natural person ID. For example, if the target natural person ID corresponds to 5 different application accounts (for example, A browser application account, B browser application account, C browser application account, D browser application account, E browser App account), if there is breaking news, the server only needs to send and push the breaking news message to one of the 5 different application accounts, so there is no need to send the breaking news to the other 4 applications. Sending the same push message to an account can improve the efficiency of message push and avoid repeated push of messages.
  • 5 different application accounts for example, A browser application account, B browser application account, C browser application account, D browser application account, E browser App account
  • the content pushed by the server can be notification messages (for example, news messages), marketing messages (for example, product promotion information), etc.
  • the server detects whether the target natural person ID is an abnormal natural person ID, including:
  • the server obtains the user behavior data of the target natural person ID, and analyzes whether the target natural person ID is a swipe ID based on the user behavior data of the target natural person ID;
  • the server determines that the target natural person ID is an abnormal natural person ID
  • the server uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID
  • the server determines that the target natural person ID is an abnormal natural person ID.
  • the server first analyzes whether the target natural person ID is a swipe ID. If it is a swipe ID, the target natural person ID is considered to be an abnormal natural person ID. If it is not a swipe ID, it further detects whether the target natural person ID is The substitute ID, if it is a substitute ID, the target natural person ID is considered to be an abnormal natural person ID, and if it is not a substitute ID, the target natural person ID is considered to be a normal natural person ID and belongs to a natural person ID that meets the content push conditions.
  • the brush ID has certain behavioral characteristics: For example, for the brush ID of the APP rankings, the number of apps installed per day is 100+ times, the number of apps uninstalled per day is 100+ times, and the brush ID is always positioned at In the same place, the amount of ID is used to purchase VIPs for one month, only active for a week or even one or two days, the payment amount of the amount of ID is 3000+, and so on.
  • the embodiment of the present application uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID.
  • the embodiments of this application can use random forests, decision trees, xgboost, local sensitive hashes in machine learning based on user location characteristics, travel preferences, APP usage habits and interests, device usage characteristics, and the unique identifier corresponding to the device itself.
  • LSH Large-Sensitive Hashing
  • the graph database method connects various information islands, thereby enriching the normal life cycle of a user's equipment.
  • FIG. 2 is a schematic diagram of judging whether a natural person ID is a substitute ID disclosed in an embodiment of the present application.
  • the server obtains multiple application data logged in by the user through multiple application IDs in advance, determines multiple sets of features based on the multiple application data, and trains the multiple sets of features to obtain a machine learning model. After each login, collect some current data and perform feature extraction to obtain the features, and input the features into the machine learning model to obtain the substitution weight value of the account, so as to better judge whether the account has substitution behavior.
  • the above-mentioned user behavior data includes application installation data, application uninstallation data, and payment amount.
  • the server analyzes whether the target natural person ID is a credit ID based on the user behavior data of the target natural person ID. for:
  • the server detects whether there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the preset upper threshold of the number of installations; whether there is a single-day uninstallation data that is greater than the total number of uninstallation data of the target natural person ID Data of the preset upper limit threshold of the number of unloading; detecting whether the payment amount of the target natural person ID exceeds the preset amount threshold;
  • the server determines the target natural person ID as the amount ID.
  • the user behavior data of the target natural person ID may include user behavior data of multiple user IDs corresponding to the target natural person ID. For example, if there are 5 application program accounts corresponding to the target natural person ID, the user behavior data of the target natural person ID includes user behavior data of the 5 application program accounts. As long as the user behavior data of any one of the five application accounts is abnormal, the target natural person ID is considered to be an abnormal natural person ID.
  • User behavior data may include: application installation data, application uninstallation data, payment quota, application activity data, positioning data, and the like.
  • the server determines The target natural person ID is the amount ID
  • the server determines that the target natural person ID is Brush ID;
  • the server determines that the target natural person ID is the amount ID.
  • the embodiment of the application provides a method for determining whether the target natural person ID is a credit ID, which can quickly identify whether the target natural person ID is a credit ID, avoid subsequent content pushing to the credit ID, and reduce the pressure of content pushing on the server.
  • the server uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID, specifically:
  • the server determines at least one user ID corresponding to the target natural person ID
  • the server After the first user ID corresponding to the target natural person ID logs in to the target device, the server obtains the location characteristics of the target device, the travel preference characteristics of the first user ID, the application usage habit characteristics of the first user ID, and the first user Interest characteristics of the ID, device usage characteristics of the target device, and target device identification; the first user ID is any one of at least one user ID;
  • the server combines the location characteristics of the target device, the travel preference characteristics of the first user ID, the application usage habit characteristics of the first user ID, the interest characteristics of the first user ID, the device usage characteristics of the target device, and the target device identification Input the trained machine learning model to get the surrogate weight value of the first user ID;
  • the server determines the target natural person ID as the substitute ID.
  • the location feature of the target device may be extracted from GPS location information periodically reported by the target device.
  • the travel preference feature can be extracted from the itinerary and GPS location information reported by the first user ID.
  • the application usage habit characteristics of the first user ID can be reported from the application opening time, application closing time, application brightness, application volume, application function usage, and application continuous operation reported by the first user ID It is extracted from the duration and accumulated running time of the application.
  • the interest feature of the first user ID can be extracted from the web browsing records, favorite records, and search records reported by the first user ID.
  • the device usage characteristics of the target device can be extracted from the usage habits of the target device (for example, the boot time of the target device, the shutdown time, the backlight brightness of the target device, the volume of the target device, the holding posture of the target device, etc.).
  • the trained machine learning model can be obtained by training according to historical user behavior data of at least one user ID corresponding to the target natural person ID.
  • the trained machine learning model is obtained by training based on the historical user behavior data of all user IDs corresponding to the target natural person ID.
  • the trained machine learning model can better reflect a user's habit of using equipment
  • the user behavior data of the user ID corresponding to the natural person ID can be input into the trained machine learning model to obtain a substitute weight value.
  • the higher the substitution weight value the higher the possibility of the user ID being substituted.
  • the preset weight threshold can be set in advance for judging the substitution behavior of the user ID.
  • natural person IDs correspond to three user IDs (openid1, openid2, openid3)
  • APP rules include application usage habits
  • geographic location rules include target device location characteristics
  • account system rules include travel preference characteristics and interest characteristics .
  • the preset weight threshold can be set to 0.5, then for openid1, it is recognized as a substitute ID, and for openid2 and openid3, it is recognized as a self-use ID. Since one of the three user IDs corresponding to the natural person ID is a substitute ID, it can be determined that the natural person ID is an abnormal natural person ID.
  • step (131) the following steps can also be performed:
  • the server obtains in advance multiple application data reported by at least one user ID, determines multiple sets of features based on the multiple application data, and trains the multiple sets of features to obtain a trained machine learning model.
  • the trained machine learning model is a set of features (APP feature, location feature, target device) extracted from multiple application data reported by the server according to at least one user ID corresponding to the target natural person ID obtained in advance.
  • Use feature for training, the trained machine learning model is determined for the user habits of the target natural person ID, and can accurately screen the use of other natural persons who are not the target natural person, and accurately identify whether the user ID is Out of modern use.
  • the server refuses to push content to the target natural person ID.
  • the server pushes content to the target natural person ID.
  • the number of natural person IDs can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of content pushing on the server, thereby increasing The speed of content delivery.
  • the embodiments of this application can identify normal natural person IDs and abnormal natural person IDs (fake users, interfering users), eliminate abnormal natural person IDs in time, reduce unnecessary economic and analysis losses; push content of normal natural person IDs, thereby improving the accuracy of the push Performance, reduce resource waste, and achieve precise operation.
  • FIG. 3 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application.
  • Fig. 3 is obtained by further optimization on the basis of Fig. 1.
  • the content pushing method includes the following steps.
  • the server obtains the behavior data of multiple user IDs reported by multiple clients, and uses a graph storage calculation engine to calculate the similarity between the behavior data of the multiple user IDs.
  • the server constructs a relationship pair between user IDs based on the similarity between the behavior data of the multiple user IDs.
  • the server constructs a corresponding relationship between the user ID and the natural person ID according to the relationship between the user IDs.
  • the graph storage calculation engine may also be referred to as a graph calculation engine.
  • the graph storage calculation engine uses the PageRank algorithm, the shortest path algorithm, and the Alternating Least Squares (ALS) algorithm to calculate the The similarity between the behavior data of multiple user IDs.
  • a relationship pair is established for user IDs whose similarity is greater than a preset similarity threshold. For example, SSOID1 ⁇ ->IMEI2, OPENID1 ⁇ ->ICCID3, SSOID2 ⁇ ->TEL2, IMEI2 ⁇ ->TEL3, IMEI2 ⁇ ->ICCID1, SSOID2 ⁇ ->OPENID1, IMEI1 ⁇ ->SSOID2.
  • SSOID1, IMEI2, TEL3, and ICCID1 correspond to one natural person ID (for example, natural person ID1)
  • OPENID1, ICCID3, SSOID2, TEL2, and IMEI1 correspond to another natural person ID (for example, natural person ID2). See Table 1 for details.
  • Table 1 is a table of correspondence between user IDs and natural person IDs disclosed in the embodiments of the present application. As shown in Table 1, the correspondence between natural person ID1 and SSOID1, IMEI2, TEL3, and ICCID1, and the correspondence between natural person ID2 and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1.
  • the server grabs M user identification IDs, and the server acquires the corresponding relationship between the user ID and the natural person ID.
  • the server determines N natural person IDs corresponding to the M user IDs according to the corresponding relationship between the user ID and the natural person ID.
  • the correspondence between the user ID and the natural person ID can be obtained in advance.
  • the correspondence between the user ID and the natural person ID can be determined in steps 301 to 303.
  • Step 304 is executed after step 301 to step 303.
  • the correspondence between user IDs and natural person IDs does not need to be generated in real time, and N natural person IDs corresponding to M user IDs can be quickly determined.
  • the server detects whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the above N natural person IDs. If yes, go to step 307, if not, go to step 308.
  • the server refuses to push content to the target natural person ID.
  • the server pushes content to the target natural person ID.
  • step 306 to step 308 in the embodiment of the present application, reference may be made to the description of step 102 to step 104 shown in FIG. 1, which will not be repeated here.
  • the M user IDs are converted into N natural person IDs, the number of IDs pushed is reduced, and then the abnormal natural person ID in the natural person ID is identified, and the abnormal natural person ID is not reported to the abnormal natural person.
  • ID pushes content further reduces the number of natural person IDs that are pushed, can avoid pushing content to abnormal natural person IDs, and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can be reduced The pressure of server-side content push, thereby increasing the speed of content push.
  • the mobile terminal includes hardware structures and/or software modules corresponding to each function.
  • the present invention can be implemented in the form of hardware or a combination of hardware and computer software. Whether a certain function is executed by hardware or computer software-driven hardware depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the present invention.
  • the embodiments of the present application may divide the mobile terminal into functional units according to the foregoing method examples.
  • each functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. It should be noted that the division of units in the embodiments of the present application is illustrative, and is only a logical function division, and there may be other division methods in actual implementation.
  • FIG. 4 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application.
  • the content pushing device 400 includes a grabbing unit 401, a determining unit 402, a detecting unit 403, and a processing unit 404, wherein:
  • the grabbing unit 401 is configured to grab M user identification IDs
  • the determining unit 402 is configured to determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
  • the detection unit 403 is configured to detect whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
  • the processing unit 404 is configured to refuse to push content to the target natural person ID when the detection unit 403 detects that the target natural person ID is an abnormal natural person ID.
  • the processing unit 404 is further configured to push content to the target natural person ID when the detection unit 403 detects that the target natural person ID is not an abnormal natural person ID.
  • the detection unit 403 detects whether the target natural person ID is an abnormal natural person ID, specifically: acquiring user behavior data of the target natural person ID, and analyzing whether the target natural person ID is based on the user behavior data of the target natural person ID Is the amount ID; if the target natural person ID is the amount ID, the target natural person ID is determined to be an abnormal natural person ID; if the target natural person ID is not the amount ID, a machine learning algorithm is used to detect whether the target natural person ID Is a substitute ID; if the target natural person ID is a substitute ID, it is determined that the target natural person ID is an abnormal natural person ID.
  • the user behavior data includes application installation data, application uninstallation data, and payment amount
  • the detection unit 403 analyzes whether the target natural person ID is a credit ID based on the user behavior data of the target natural person ID, Specifically: detecting whether there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the preset upper limit of the number of installations; detecting whether there is a single-day uninstallation data in the application uninstallation data of the target natural person ID The amount of data is greater than the preset upper limit threshold of the number of uninstalls; it is detected whether the payment amount of the target natural person ID exceeds the preset amount threshold; if there is a total number of single-day installations in the target natural person ID application installation data greater than the preset The data of the upper limit threshold of the number of installations, and there is data in the application uninstallation data of the target natural person ID that the total amount of uninstallation in a single day is greater than the preset upper limit of the uninstallation number threshold, and the payment amount of the target natural
  • the detection unit 403 uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID, specifically: determining at least one user ID corresponding to the target natural person ID; After the user ID logs in to the target device, obtain the location feature of the target device, the travel preference feature of the first user ID, the application usage habit feature of the first user ID, the interest feature of the first user ID, The device usage feature of the target device and the target device identifier; the first user ID is any one of the at least one user ID; the location feature of the target device and the travel preference feature of the first user ID , The application usage habit characteristics of the first user ID, the interest characteristics of the first user ID, the device usage characteristics of the target device, and the target device identification input a trained machine learning model to obtain the first user The substitution weight value of the ID; if the substitution weight value of the first user ID is greater than the preset weight threshold, it is determined that the target natural person ID is the substitution ID.
  • the detecting unit 403 determines the at least one user ID corresponding to the target natural person ID, it is further configured to obtain in advance multiple pieces of application data reported by the at least one user ID, and determine the multiple pieces according to the multiple pieces of application data. Group features, training the multiple groups of features to obtain a trained machine learning model.
  • the determining unit 402 determines the N natural person IDs corresponding to the M user IDs, specifically: acquiring the corresponding relationship between the user ID and the natural person ID; and determining the corresponding relationship according to the corresponding relationship between the user ID and the natural person ID. N natural person IDs corresponding to the M user IDs.
  • the determining unit 402 before the determining unit 402 obtains the corresponding relationship between the user ID and the natural person ID, it is also used to obtain the behavior data of multiple user IDs reported by multiple clients, and calculate the multiple user IDs using a graph storage calculation engine.
  • the similarity between the behavior data of the user ID; the relationship between the user ID is constructed based on the similarity between the behavior data of the multiple user IDs; the relationship between the user ID and the natural person ID is constructed based on the relationship between the user IDs Correspondence.
  • Implement the content pushing device shown in Figure 4 first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify abnormal natural person IDs in natural person IDs, and do not push content to abnormal natural person IDs. Further narrowing the number of natural person IDs pushed can avoid pushing content to abnormal natural person IDs, and improve the accuracy of content pushing. Since the number of natural person IDs that meet the push conditions is less than M user identification IDs, it can reduce the amount of server-side content pushing Pressure to increase the speed of content delivery.
  • FIG. 5 is a schematic structural diagram of a mobile terminal disclosed in an embodiment of the present application.
  • the mobile terminal 500 includes a processor 501 and a memory 502.
  • the mobile terminal 500 may also include a bus 503.
  • the processor 501 and the memory 502 may be connected to each other through the bus 503.
  • the bus 503 may be a peripheral component. Connect the standard (Peripheral Component Interconnect, referred to as PCI) bus or extended industry standard architecture (Extended Industry Standard Architecture, referred to as EISA) bus, etc.
  • PCI Peripheral Component Interconnect
  • EISA Extended Industry Standard Architecture
  • the bus 503 can be divided into an address bus, a data bus, a control bus, and so on. For ease of presentation, only one thick line is used in FIG.
  • the mobile terminal 500 may also include an input and output device 504, and the input and output device 504 may include a display screen, such as a liquid crystal display screen.
  • the memory 502 is used to store one or more programs containing instructions; the processor 501 is used to call the instructions stored in the memory 502 to execute some or all of the method steps in FIGS. 1 to 3.
  • Implement the mobile terminal shown in Figure 5 first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify abnormal natural person IDs in natural person IDs, and do not push content to abnormal natural person IDs, and further Reducing the number of natural person IDs pushed can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of server content pushing , Thereby improving the speed of content push.
  • the embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a computer program for electronic data exchange, and the computer program causes the computer to execute any part of the content push method recorded in the above method embodiment Or all steps.
  • the embodiments of the present application also provide a computer program product.
  • the computer program product includes a non-transitory computer-readable storage medium storing a computer program.
  • the computer program is operable to cause a computer to execute any of the methods described in the foregoing method embodiments. Part or all of the steps of a content push method.
  • the disclosed device can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable memory.
  • the technical solution of the present invention essentially or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory, A number of instructions are included to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention.
  • the aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other various media that can store program codes.
  • the program can be stored in a computer-readable memory, and the memory can include: flash disk , Read-only memory (English: Read-Only Memory, abbreviation: ROM), random access device (English: Random Access Memory, abbreviation: RAM), magnetic disk or optical disc, etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

Provided is a content pushing method, comprising: grabbing M user identifiers (IDs), and determining N natural person IDs corresponding to the M user IDs; checking whether a target natural person ID is an abnormal natural person ID, wherein the target natural person ID is any one of the N natural person IDs; and if so, refusing to push content to the target natural person ID. According to the method, the accuracy of content pushing can be improved.

Description

内容推送方法、装置、移动终端及存储介质Content pushing method, device, mobile terminal and storage medium 技术领域Technical field
本申请涉及通信技术领域,具体涉及一种内容推送方法、装置、移动终端及存储介质。This application relates to the field of communication technology, and in particular to a content pushing method, device, mobile terminal and storage medium.
背景技术Background technique
目前,服务端(比如,服务器)向用户端(比如,应用程序客户端)进行内容推送时,一般根据登录用户端的用户标识(Identity,ID)的兴趣、爱好等用户画像进行内容推送。然而,一旦用户ID的用户画像判断有误,则服务端向客户端会推送错误的内容。At present, when a server (for example, a server) pushes content to a user (for example, an application client), it generally pushes content according to the user profile of the user's identity (ID) logged in to the client. However, once the user portrait of the user ID is judged incorrectly, the server will push the wrong content to the client.
发明内容Summary of the invention
本申请实施例提供了一种内容推送方法、装置、移动终端及存储介质,可以提高内容推送的准确性。The embodiments of the present application provide a content pushing method, device, mobile terminal, and storage medium, which can improve the accuracy of content pushing.
第一方面,本申请实施例提供一种内容推送方法,包括:In the first aspect, an embodiment of the present application provides a content pushing method, including:
抓取M个用户标识ID,确定与所述M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M;Grab M user identification IDs, and determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
检测目标自然人ID是否为异常自然人ID,所述目标自然人ID为所述N个自然人ID中的任意一个;Detecting whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
若是,拒绝向所述目标自然人ID进行内容推送。If yes, refuse to push content to the target natural person ID.
第二方面,本申请实施例提供了一种内容推送装置,所述内容推送装置包括抓取单元、确定单元、检测单元和处理单元,其中:In a second aspect, an embodiment of the present application provides a content pushing device, the content pushing device includes a capturing unit, a determining unit, a detecting unit, and a processing unit, wherein:
所述抓取单元,用于抓取M个用户标识ID;The grabbing unit is used to grab M user identification IDs;
所述确定单元,用于确定与所述M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M;The determining unit is configured to determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
所述检测单元,用于检测目标自然人ID是否为异常自然人ID,所述目标自然人ID为所述N个自然人ID中的任意一个;The detection unit is configured to detect whether a target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
所述处理单元,用于在所述检测单元检测出所述目标自然人ID为异常自然人ID的情况下,拒绝向所述目标自然人ID进行内容推送。The processing unit is configured to refuse to push content to the target natural person ID when the detection unit detects that the target natural person ID is an abnormal natural person ID.
第三方面,本申请实施例提供一种移动终端,包括处理器、存储器,所述存储器用于存储一个或多个程序,所述一个或多个程序被配置成由所述处理器执行,上述程序包括用于执行本申请实施例第一方面中的步骤的指令。In a third aspect, an embodiment of the present application provides a mobile terminal, including a processor and a memory, the memory is used to store one or more programs, and the one or more programs are configured to be executed by the processor. The program includes instructions for executing the steps in the first aspect of the embodiments of the present application.
第四方面,本申请实施例提供了一种计算机可读存储介质,其中,上述计算机可读存储介质存储用于电子数据交换的计算机程序,其中,上述计算机程序使得计算机执行如本申请实施例第一方面中所描述的部分或全部步骤。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the foregoing computer-readable storage medium stores a computer program for electronic data exchange, wherein the foregoing computer program enables a computer to execute Some or all of the steps described in one aspect.
第五方面,本申请实施例提供了一种计算机程序产品,其中,上述计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,上述计算机程序可操作来使计算机执行如本申请实施例第一方面中所描述的部分或全部步骤。该计算机程序产品可以为一个软件安装包。In a fifth aspect, embodiments of the present application provide a computer program product, wherein the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to execute Example part or all of the steps described in the first aspect. The computer program product may be a software installation package.
可以看出,本申请实施例中所描述的内容推送方法,具体包括如下步骤:抓取M个用户标识ID,确定与M个用户ID对应的N个自然人ID,其中,M、N均为正整数,N小于或等于M;检测目标自然人ID是否为异常自然人ID,目标自然人ID为N个自然人ID中的任意一个;若是,拒绝向目标自然人ID进行内容推送。实施本申请实施例,首先将M个用户ID转换为N个自然人ID,减小推送的ID的数量,然后识别自然人ID中的异常自然人ID,不向异常自然人ID进行内容推送,进一步缩小推送的自然人ID的数量,可以避免向异常自然人ID进行内容推送,提高内容推送的准确性,由于符合推送条件的自然人ID的数量要小于M个用户标识ID,可以降低服务端内容推送的压力,从而提高内容推送的速度。It can be seen that the content push method described in the embodiment of the application specifically includes the following steps: grab M user identification IDs, and determine N natural person IDs corresponding to the M user IDs, where M and N are both positive Integer, N is less than or equal to M; check whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of N natural person IDs; if so, refuse to push content to the target natural person ID. To implement the embodiments of this application, first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify abnormal natural person IDs in natural person IDs, and do not push content to abnormal natural person IDs, further reducing the number of pushes The number of natural person IDs can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of content pushing on the server, thereby increasing The speed of content delivery.
附图说明Description of the drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly describe the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1是本申请实施例公开的一种内容推送方法的流程示意图;FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application;
图2是本申请实施例公开的一种判断自然人ID是否为代用ID的示意图;Figure 2 is a schematic diagram of judging whether a natural person ID is a substitute ID disclosed in an embodiment of the present application;
图3是本申请实施例公开的另一种内容推送方法的流程示意图;FIG. 3 is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application;
图4是本申请实施例公开的一种内容推送装置的结构示意图;Figure 4 is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application;
图5是本申请实施例公开的一种移动终端的结构示意图。Fig. 5 is a schematic structural diagram of a mobile terminal disclosed in an embodiment of the present application.
具体实施方式Detailed ways
为了使本技术领域的人员更好地理解本发明方案,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或单元的过程、方法、系统、产品或设备没有限定于已列出的步骤或单元,而是可选地还包括没有列出的步骤或单元,或可选地还包括对于这些过程、方法、产品或设备固有的其他步骤或单元。The terms "first", "second", etc. in the specification and claims of the present invention and the above-mentioned drawings are used to distinguish different objects, rather than to describe a specific sequence. In addition, the terms "including" and "having" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not limited to the listed steps or units, but optionally includes unlisted steps or units, or optionally also includes Other steps or units inherent to these processes, methods, products or equipment.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本发明的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。Reference to "embodiments" herein means that a specific feature, structure or characteristic described in conjunction with the embodiments may be included in at least one embodiment of the present invention. The appearance of the phrase in various places in the specification does not necessarily refer to the same embodiment, nor is it an independent or alternative embodiment mutually exclusive with other embodiments. Those skilled in the art clearly and implicitly understand that the embodiments described herein can be combined with other embodiments.
本申请实施例所涉及到的移动终端可以包括各种具有无线通信功能的手持设备、车载设备、可穿戴设备、计算设备或连接到无线调制解调器的其他处理设备,以及各种形式的用户设备(User Equipment,UE),移动台(Mobile Station,MS),终端设备(terminal device)等等。为方便描述,上面提到的设备统称为移动终端。The mobile terminals involved in the embodiments of this application may include various handheld devices with wireless communication functions, vehicle-mounted devices, wearable devices, computing devices or other processing devices connected to wireless modems, as well as various forms of user equipment (User Equipment, UE), mobile station (Mobile Station, MS), terminal device (terminal device), etc. For ease of description, the devices mentioned above are collectively referred to as mobile terminals.
下面对本申请实施例进行详细介绍。The following describes the embodiments of the present application in detail.
请参阅图1,图1是本申请实施例公开的一种内容推送方法的流程示意图, 如图1所示,该内容推送方法包括如下步骤。Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a content pushing method disclosed in an embodiment of the present application. As shown in FIG. 1, the content pushing method includes the following steps.
101,服务端抓取M个用户标识ID,确定与M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M。101. The server grabs M user identification IDs, and determines N natural person IDs corresponding to the M user IDs. Both M and N are positive integers, and N is less than or equal to M.
本申请实施例中,服务端是为客户端服务的,服务的内容诸如向客户端提供资源,保存客户端数据等。服务端是一种有针对性的服务程序,运行服务端的设备可以称为服务器。服务端可以同时与多个客户端建立连接,可以同时为多个客户端提供服务。本申请实施例中服务端为客户端提供的服务主要包括内容推送服务。内容推送服务可以包括:浏览器内容推送服务、应用程序下载推送服务、游戏内容推送服务等。服务端可以包括应用程序服务端、浏览器服务端、游戏服务端等。In the embodiments of the present application, the server serves the client, and the content of the service includes providing resources to the client and storing client data. The server is a targeted service program, and the device running the server can be called a server. The server can establish connections with multiple clients at the same time, and can provide services to multiple clients at the same time. The services provided by the server for the client in the embodiments of the present application mainly include content push services. The content push service may include: browser content push service, application download push service, game content push service, etc. The server can include application server, browser server, game server, etc.
服务端可以从数据库中抓取M个用户标识(identity,ID),可以向这M个用户ID所登录的客户端推送特定的内容。用户ID可以包括如下任意一种或多种类型:单点登录标识(single sign on identity,SSOID)、OpenID、集成电路卡识别码(Integrate circuit card identity,ICCID)、国际移动设备识别码(International Mobile Equipment Identity,IMEI)、电话号码(telephone,TEL)等。SSO是在多个应用系统中,用户只需登录一次就可以访问所有相互信任的应用系统。The server can grab M user identities (IDs) from the database, and can push specific content to the clients logged in by the M user IDs. User ID can include any one or more of the following types: single sign on identity (SSOID), OpenID, integrated circuit card identity (ICCID), international mobile device identity (International Mobile) Equipment (Identity, IMEI), telephone number (telephone, TEL), etc. SSO is in multiple application systems. Users only need to log in once to access all mutually trusted application systems.
服务端可以从数据库中随机抓取M个用户ID,后续可以向这M个用户ID中对应的N个自然人ID中的正常自然人ID进行不同的内容推送。服务端也可以从数据库中抓取具有相同特征的M个用户ID,后续可以向这M个用户ID中对应的N个自然人ID中的正常自然人ID进行相同的内容推送。The server can randomly grab M user IDs from the database, and subsequently can push different content to the normal natural person IDs among the N natural person IDs corresponding to the M user IDs. The server can also grab M user IDs with the same characteristics from the database, and subsequently can push the same content to the normal natural person IDs among the N natural person IDs corresponding to the M user IDs.
服务端确定与M个用户ID对应的N个自然人ID,具体可以为:服务端根据已经生成的用户ID与自然人ID的对应关系确定与M个用户ID对应的N个自然人ID。The server determines the N natural person IDs corresponding to the M user IDs, which may specifically be: the server determines the N natural person IDs corresponding to the M user IDs according to the generated correspondence between the user ID and the natural person ID.
第一种情况:若M个用户ID均在上述已经生成的用户ID与自然人ID的对应关系中,则直接按照上述方法确定与M个用户ID对应的N个自然人ID。The first situation: if the M user IDs are all in the corresponding relationship between the user IDs and the natural person IDs that have been generated, the N natural person IDs corresponding to the M user IDs are directly determined according to the above method.
第二中情况:若M个用户ID都不在上述已经生成的用户ID与自然人ID的对应关系中,则服务端可以获取上述已经生成的用户ID与自然人ID的对应关系中的所有自然人ID,根据M个用户ID的用户行为数据分析该M个用户 ID与上述所有自然人ID中每个自然人ID的相似度,如果第一用户ID(第一用户ID为上述M个用户ID中的任一个)与上述所有自然人ID中相似度最高的自然人ID的相似度大于预设相似度阈值,则将第一用户ID与上述所有自然人ID中相似度最高的自然人ID建立对应关系,如果第一用户ID与上述所有自然人ID中相似度最高的自然人ID的相似度大于预设相似度阈值,则将第一用户ID的用户行为数据与数据库中的其他用户ID的用户行为数据进行匹配,若第一用户ID的用户行为数据与上述数据库中的其他用户ID中相似度最高的用户ID的相似度大于预设相似度阈值,则将第一用户ID与上述数据库中的其他用户ID中相似度最高的用户ID建立对应关系,形成新的自然人ID,该新的自然人ID对应该第一用户ID和上述数据库中的其他用户ID中相似度最高的用户ID。The second case: if none of the M user IDs are in the corresponding relationship between the generated user ID and the natural person ID, the server can obtain all the natural person IDs in the corresponding relationship between the generated user ID and the natural person ID, according to The user behavior data of M user IDs analyzes the similarity between the M user IDs and each of the above-mentioned natural person IDs. If the first user ID (the first user ID is any one of the above-mentioned M user IDs) and If the similarity of the natural person ID with the highest similarity among all the above natural person IDs is greater than the preset similarity threshold, then the first user ID is associated with the natural person ID with the highest similarity among all the natural person IDs. If the first user ID is If the similarity of the natural person ID with the highest similarity among all natural person IDs is greater than the preset similarity threshold, the user behavior data of the first user ID is matched with the user behavior data of other user IDs in the database. If the similarity between the user behavior data and the user ID with the highest similarity among other user IDs in the aforementioned database is greater than the preset similarity threshold, the first user ID is established with the user ID with the highest similarity among other user IDs in the aforementioned database The corresponding relationship forms a new natural person ID, which corresponds to the user ID with the highest similarity among the first user ID and other user IDs in the database.
第三中情况:若M个用户ID有部分在上述已经生成的用户ID与自然人ID的对应关系中,有另外一部分不在上述已经生成的用户ID与自然人ID的对应关系中,则在上述已经生成的用户ID与自然人ID的对应关系中的用户ID直接按照对应关系确定对应的自然人ID,不在上述已经生成的用户ID与自然人ID的对应关系中的用户ID按照上述第二种情况的方法确定与之对应的自然人ID。The third case: if part of the M user IDs are in the corresponding relationship between the user IDs and natural person IDs that have been generated above, and another part is not in the corresponding relationship between the user IDs that have been generated and the natural person IDs, then they have been generated above The user ID in the corresponding relationship between the user ID and the natural person ID directly determines the corresponding natural person ID according to the corresponding relationship, and the user ID that is not in the corresponding relationship between the user ID and the natural person ID that has been generated above is determined according to the method in the second case above The corresponding natural person ID.
其中,本申请实施例中的自然人ID会对应一个自然人。这个自然人可能会对应至少一个移动终端(比如,手机)、至少一个电话号码、至少一个应用程序账号、至少一个OpenID、一个SSOID、至少一个ICCID、至少一个IMEI。比如说,一个自然人有用一部手机、一个电话号码、5个应用程序账号,则将手机的IMEI、电话号码、5个应用程序账号打上一个自然人ID的标签。这5个应用程序账号对应的用户行为数据都属于这个自然人ID的用户行为数据。这样,一个真实的自然人,可以有很多个用户ID(比如,一个手机的IMEI、一个电话号码、5个应用程序账号),但是却只对应一个唯一的自然人ID。Among them, the natural person ID in the embodiment of this application corresponds to a natural person. This natural person may correspond to at least one mobile terminal (for example, mobile phone), at least one phone number, at least one application account, at least one OpenID, one SSOID, at least one ICCID, and at least one IMEI. For example, if a natural person has a mobile phone, a phone number, and 5 application accounts, the IMEI, phone number, and 5 application accounts of the mobile phone are labeled with a natural person ID. The user behavior data corresponding to these 5 application accounts all belong to the user behavior data of this natural person ID. In this way, a real natural person can have many user IDs (for example, the IMEI of a mobile phone, a phone number, and 5 application accounts), but only one unique natural person ID is corresponding.
102,服务端检测目标自然人ID是否为异常自然人ID,目标自然人ID为上述N个自然人ID中的任意一个。若是,则执行步骤103,若否,则执行步骤104。102. The server detects whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the above N natural person IDs. If yes, go to step 103, if not, go to step 104.
本申请实施例中,异常自然人ID包括:该异常自然人ID对应的所有用户 ID长时间处于离线状态、该异常自然人ID对应的所有用户ID中存在登录异常的用户ID(比如,某个应用程序账号被盗号后向联系人列表中的好友发送大量的垃圾消息)、该异常自然人ID对应的所有用户ID中存在刷量行为的用户ID(比如,该用户ID频繁的下载和卸载应用程序,该用户ID大量的充值账号等)、该异常自然人ID对应的所有用户ID中存在代用行为的用户ID(比如,该用户ID的用户行为习惯与之前的用户行为习惯差别很大)。In the embodiment of the present application, the abnormal natural person ID includes: all user IDs corresponding to the abnormal natural person ID are offline for a long time, and all user IDs corresponding to the abnormal natural person ID have user IDs with abnormal logins (for example, a certain application account After the account is stolen, a large number of spam messages are sent to friends in the contact list), and all user IDs corresponding to the abnormal natural person ID have user IDs that have swiping behavior (for example, the user ID frequently downloads and uninstalls applications, the user Recharge accounts with a large number of IDs, etc.), and all user IDs corresponding to the abnormal natural person ID have user IDs with substitute behaviors (for example, the user behavior habits of the user ID are very different from previous user behavior habits).
如果服务端检测到目标自然人ID为异常自然人ID,则不会向该目标自然人ID进行内容推送,减少不必要的内容推送,降低服务端内容推送的压力,从而提高内容推送的速度。If the server detects that the target natural person ID is an abnormal natural person ID, it will not push content to the target natural person ID, reducing unnecessary content pushing, reducing the pressure of the server content pushing, thereby increasing the speed of content pushing.
如果服务端检测到目标自然人ID不是异常自然人ID,则向该目标自然人ID进行内容推送。具体的,服务端可以向该目标自然人ID对应的多个用户ID中的任意一个进行内容推送。举例来说,如果该目标自然人ID对应5个不同的应用程序账号(比如,A浏览器应用程序账号、B浏览器应用程序账号、C浏览器应用程序账号、D浏览器应用程序账号、E浏览器应用程序账号),如果有突发性新闻,则服务端只需向这5个不同的应用程序账号中的其中一个发送推送该突发性新闻消息即可,因而无需向另外4个应用程序账号发送相同的推送消息,可以提高消息推送的效率,避免消息重复推送。If the server detects that the target natural person ID is not an abnormal natural person ID, it will push content to the target natural person ID. Specifically, the server may push content to any one of the multiple user IDs corresponding to the target natural person ID. For example, if the target natural person ID corresponds to 5 different application accounts (for example, A browser application account, B browser application account, C browser application account, D browser application account, E browser App account), if there is breaking news, the server only needs to send and push the breaking news message to one of the 5 different application accounts, so there is no need to send the breaking news to the other 4 applications. Sending the same push message to an account can improve the efficiency of message push and avoid repeated push of messages.
其中,服务端推送的内容可以是通知类消息(比如,新闻类消息)、营销类消息(比如,商品促销信息)等。Among them, the content pushed by the server can be notification messages (for example, news messages), marketing messages (for example, product promotion information), etc.
可选的,步骤102中,服务端检测目标自然人ID是否为异常自然人ID,包括:Optionally, in step 102, the server detects whether the target natural person ID is an abnormal natural person ID, including:
(11)服务端获取目标自然人ID的用户行为数据,基于目标自然人ID的用户行为数据分析目标自然人ID是否为刷量ID;(11) The server obtains the user behavior data of the target natural person ID, and analyzes whether the target natural person ID is a swipe ID based on the user behavior data of the target natural person ID;
(12)若目标自然人ID为刷量ID,服务端确定目标自然人ID为异常自然人ID;(12) If the target natural person ID is a swipe ID, the server determines that the target natural person ID is an abnormal natural person ID;
(13)若目标自然人ID不是刷量ID,服务端采用机器学习算法检测目标自然人ID是否为代用ID;(13) If the target natural person ID is not a swipe ID, the server uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID;
(14)若目标自然人ID为代用ID,服务端确定目标自然人ID为异常自然人ID。(14) If the target natural person ID is a substitute ID, the server determines that the target natural person ID is an abnormal natural person ID.
本申请实施例中,服务端首先分析目标自然人ID是否为刷量ID,若是刷量ID,则认为该目标自然人ID为异常自然人ID,若不是刷量ID,则进一步检测该目标自然人ID是否为代用ID,若是代用ID,则认为该目标自然人ID为异常自然人ID,若不是代用ID,则认为该目标自然人ID为正常自然人ID,是属于符合内容推送条件的自然人ID。In the embodiment of this application, the server first analyzes whether the target natural person ID is a swipe ID. If it is a swipe ID, the target natural person ID is considered to be an abnormal natural person ID. If it is not a swipe ID, it further detects whether the target natural person ID is The substitute ID, if it is a substitute ID, the target natural person ID is considered to be an abnormal natural person ID, and if it is not a substitute ID, the target natural person ID is considered to be a normal natural person ID and belongs to a natural person ID that meets the content push conditions.
其中,刷量用户的最核心的诉求是获取更多的关注,从而带来更大的流量,吸引更多用户。通过观察大盘用户的用户行为数据发现刷量用户几个比较明显的特征:对于软件商店来说,刷量用户更多是刷下载,对游戏来说刷充值,想让平台给其更好的展示位置,这种用户跟正常用户相比是比较容易识别出来的,观察大盘用户的行为分布情况,就能制定相应的规则条件,初步剔除非常明显的刷量用户。Among them, the core demand of scalping users is to get more attention, thereby bringing in greater traffic and attracting more users. By observing the user behavior data of large-cap users, we found several more obvious characteristics of users who scam users: For software stores, users who scam users are more of downloads, and for games, they want to let the platform show them better. Location, compared with normal users, this kind of user is easier to identify. Observing the behavior distribution of large-cap users, you can formulate corresponding rules and conditions, and preliminarily eliminate very obvious users who are brushing.
刷量ID具有一定的行为特征:比如,对于刷APP排行榜的刷量ID而言,每天安装的APP的次数100+次、每天卸载的APP的次数100+次、该刷量ID一直定位在同一个地方、该刷量ID购买一月的VIP,仅活跃了一周甚至一两天、该刷量ID的付费额度3000+,等等。The brush ID has certain behavioral characteristics: For example, for the brush ID of the APP rankings, the number of apps installed per day is 100+ times, the number of apps uninstalled per day is 100+ times, and the brush ID is always positioned at In the same place, the amount of ID is used to purchase VIPs for one month, only active for a week or even one or two days, the payment amount of the amount of ID is 3000+, and so on.
相较于刷量ID,代用ID则难以识别,本申请实施例采用机器学习算法检测目标自然人ID是否为代用ID。具体的,本申请实施例可以根据用户位置特征,出行偏好,APP使用习惯和兴趣,设备使用特征,以及设备自身对应的唯一标识,利用机器学习中随机森林,决策树,xgboost,局部敏感哈希(Locality-Sensitive Hashing,LSH),图数据库方法将各个信息孤岛连接起来,从而丰富一个用户使用设备的正常生命周期。举例来说,请参阅图2,图2是本申请实施例公开的一种判断自然人ID是否为代用ID的示意图。如图2所示,服务端预先获取用户通过多个应用ID登录的多个应用数据,根据该多个应用数据确定多组特征,对该多组特征进行训练,得到一个机器学习模型,在用户每次登录后,采集当前的一些数据,并进行特征提取,得到特征,将特征输入到机器学习模型,得到账号的代用权重值,从而更好判断这个账号是否是有代用行为。Compared with the amount of ID, the substitute ID is difficult to identify. The embodiment of the present application uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID. Specifically, the embodiments of this application can use random forests, decision trees, xgboost, local sensitive hashes in machine learning based on user location characteristics, travel preferences, APP usage habits and interests, device usage characteristics, and the unique identifier corresponding to the device itself. (Locality-Sensitive Hashing, LSH), the graph database method connects various information islands, thereby enriching the normal life cycle of a user's equipment. For example, please refer to FIG. 2, which is a schematic diagram of judging whether a natural person ID is a substitute ID disclosed in an embodiment of the present application. As shown in Figure 2, the server obtains multiple application data logged in by the user through multiple application IDs in advance, determines multiple sets of features based on the multiple application data, and trains the multiple sets of features to obtain a machine learning model. After each login, collect some current data and perform feature extraction to obtain the features, and input the features into the machine learning model to obtain the substitution weight value of the account, so as to better judge whether the account has substitution behavior.
可选的,上述用户行为数据包括应用程序安装数据、应用程序卸载数据以及付费额度,上述步骤(11)中,服务端基于目标自然人ID的用户行为数据 分析目标自然人ID是否为刷量ID,具体为:Optionally, the above-mentioned user behavior data includes application installation data, application uninstallation data, and payment amount. In the above step (11), the server analyzes whether the target natural person ID is a credit ID based on the user behavior data of the target natural person ID. for:
(111)服务端检测目标自然人ID的应用程序安装数据中是否存在单日安装总量大于预设安装数量上限阈值的数据;检测目标自然人ID的应用程序卸载数据中是否存在单日卸载总量大于预设卸载数量上限阈值的数据;检测目标自然人ID的付费额度是否超过预设额度阈值;(111) The server detects whether there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the preset upper threshold of the number of installations; whether there is a single-day uninstallation data that is greater than the total number of uninstallation data of the target natural person ID Data of the preset upper limit threshold of the number of unloading; detecting whether the payment amount of the target natural person ID exceeds the preset amount threshold;
(112)若目标自然人ID的应用程序安装数据中存在单日安装总量大于预设安装数量上限阈值的数据,并且目标自然人ID的应用程序卸载数据中存在单日卸载总量大于预设卸载数量上限阈值的数据,并且目标自然人ID的付费额度超过预设额度阈值,服务端确定目标自然人ID为刷量ID。(112) If there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the upper threshold of the preset number of installations, and there is data in the application uninstallation data of the target natural person ID that the total number of single-day uninstalls is greater than the preset uninstallation number Data of the upper limit threshold, and the payment amount of the target natural person ID exceeds the preset amount threshold, the server determines the target natural person ID as the amount ID.
本申请实施例中,目标自然人ID的用户行为数据可以包括该目标自然人ID对应的多个用户ID的用户行为数据。比如,该目标自然人ID对应的5个应用程序账号,则该目标自然人ID的用户行为数据包括5个应用程序账号的用户行为数据。只要这5个应用程序账号的其中任意一个的用户行为数据异常,则认为该目标自然人ID为异常自然人ID。用户行为数据可以包括:应用程序安装数据、应用程序卸载数据、付费额度、应用程序活跃数据、定位数据等。In the embodiment of the present application, the user behavior data of the target natural person ID may include user behavior data of multiple user IDs corresponding to the target natural person ID. For example, if there are 5 application program accounts corresponding to the target natural person ID, the user behavior data of the target natural person ID includes user behavior data of the 5 application program accounts. As long as the user behavior data of any one of the five application accounts is abnormal, the target natural person ID is considered to be an abnormal natural person ID. User behavior data may include: application installation data, application uninstallation data, payment quota, application activity data, positioning data, and the like.
可选的,若目标自然人ID的应用程序安装数据中存在单日安装总量大于预设安装数量上限阈值的数据,并且目标自然人ID的定位数据在预设时长内没有发生变化,则服务端确定目标自然人ID为刷量ID;Optionally, if there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the upper threshold of the preset number of installations, and the positioning data of the target natural person ID does not change within the preset time period, the server determines The target natural person ID is the amount ID;
若目标自然人ID的应用程序卸载数据中存在单日卸载总量大于预设卸载数量上限阈值的数据,并且目标自然人ID的定位数据在预设时长内没有发生变化,则服务端确定目标自然人ID为刷量ID;If there is data in the application uninstallation data of the target natural person ID that the total number of uninstalls in a single day is greater than the upper threshold of the preset number of uninstalls, and the positioning data of the target natural person ID does not change within the preset time period, the server determines that the target natural person ID is Brush ID;
若目标自然人ID的付费额度超过预设额度阈值,并且目标自然人ID的定位数据在预设时长内没有发生变化,则服务端确定目标自然人ID为刷量ID。If the payment amount of the target natural person ID exceeds the preset amount threshold, and the positioning data of the target natural person ID does not change within the preset time period, the server determines that the target natural person ID is the amount ID.
本申请实施例提供一种确定目标自然人ID是否为刷量ID的方法,可以快速识别目标自然人ID是否为刷量ID,避免后续向刷量ID进行内容推送,降低服务端内容推送的压力。The embodiment of the application provides a method for determining whether the target natural person ID is a credit ID, which can quickly identify whether the target natural person ID is a credit ID, avoid subsequent content pushing to the credit ID, and reduce the pressure of content pushing on the server.
可选的,上述(13)中,服务端采用机器学习算法检测目标自然人ID是否为代用ID,具体为:Optionally, in the above (13), the server uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID, specifically:
(131)服务端确定目标自然人ID对应的至少一个用户ID;(131) The server determines at least one user ID corresponding to the target natural person ID;
(132)在目标自然人ID对应的第一用户ID登录目标设备后,服务端获取目标设备的位置特征、第一用户ID的出行偏好特征、第一用户ID的应用程序使用习惯特征、第一用户ID的兴趣特征、目标设备的设备使用特征以及目标设备标识;第一用户ID为至少一个用户ID中的任意一个;(132) After the first user ID corresponding to the target natural person ID logs in to the target device, the server obtains the location characteristics of the target device, the travel preference characteristics of the first user ID, the application usage habit characteristics of the first user ID, and the first user Interest characteristics of the ID, device usage characteristics of the target device, and target device identification; the first user ID is any one of at least one user ID;
(133)服务端将目标设备的位置特征、第一用户ID的出行偏好特征、第一用户ID的应用程序使用习惯特征、第一用户ID的兴趣特征、目标设备的设备使用特征以及目标设备标识输入训练好的机器学习模型,得到第一用户ID的代用权重值;(133) The server combines the location characteristics of the target device, the travel preference characteristics of the first user ID, the application usage habit characteristics of the first user ID, the interest characteristics of the first user ID, the device usage characteristics of the target device, and the target device identification Input the trained machine learning model to get the surrogate weight value of the first user ID;
(134)若第一用户ID的代用权重值大于预设权重阈值,服务端确定目标自然人ID为代用ID。(134) If the substitute weight value of the first user ID is greater than the preset weight threshold, the server determines the target natural person ID as the substitute ID.
本申请实施例中,目标设备的位置特征可以从目标设备周期性上报的GPS位置信息中提取。出行偏好特征可以从第一用户ID上报的行程单、GPS位置信息中提取。第一用户ID的应用程序使用习惯特征可以从第一用户ID上报的应用程序开启时间点、应用程序关闭时间点、应用程序的亮度、应用程序的音量、应用程序功能使用情况、应用程序持续运行时长、应用程序累计运行时长中提取。第一用户ID的兴趣特征可以从第一用户ID上报的网页浏览记录、收藏记录、搜索记录从提取。目标设备的设备使用特征可以从目标设备的使用习惯(比如,目标设备的开机时间、关机时间、目标设备的背光亮度、目标设备的音量、目标设备的握持姿势等)中提取。In the embodiment of the present application, the location feature of the target device may be extracted from GPS location information periodically reported by the target device. The travel preference feature can be extracted from the itinerary and GPS location information reported by the first user ID. The application usage habit characteristics of the first user ID can be reported from the application opening time, application closing time, application brightness, application volume, application function usage, and application continuous operation reported by the first user ID It is extracted from the duration and accumulated running time of the application. The interest feature of the first user ID can be extracted from the web browsing records, favorite records, and search records reported by the first user ID. The device usage characteristics of the target device can be extracted from the usage habits of the target device (for example, the boot time of the target device, the shutdown time, the backlight brightness of the target device, the volume of the target device, the holding posture of the target device, etc.).
其中,训练好的机器学习模型可以根据该目标自然人ID对应的至少一个用户ID的历史用户行为数据进行训练得到。Wherein, the trained machine learning model can be obtained by training according to historical user behavior data of at least one user ID corresponding to the target natural person ID.
本申请实施例中,训练好的机器学习模型是基于该目标自然人ID对应的所有用户ID的历史用户行为数据进行训练得到,该训练好的机器学习模型可以较好的反映一个用户使用设备的习惯,后续当该自然人ID对应的用户ID登录后,可以将该自然人ID对应的该用户ID的用户行为数据输入该训练好的机器学习模型,得到代用权重值。一般而言,代用权重值越高,则表明该用户ID被代用的可能性越高。预设权重阈值可以预先进行设定,用于对用户ID的代用行为进行判断。如图2所示,自然人ID对应三个用户ID(openid1、openid2、 openid3),APP规则包括应用程序使用习惯特征,地理位置规则包括目标设备的位置特征,账号体系规则包括出行偏好特征、兴趣特征。可以设置预设权重阈值为0.5,则对于openid1而言,则认定为代用ID,对于openid2和openid3而言,则认定为自用ID。由于该自然人ID对应的三个用户ID有一个用户ID为代用ID,则可以判定该自然人ID为异常自然人ID。In the embodiment of this application, the trained machine learning model is obtained by training based on the historical user behavior data of all user IDs corresponding to the target natural person ID. The trained machine learning model can better reflect a user's habit of using equipment After the user ID corresponding to the natural person ID logs in, the user behavior data of the user ID corresponding to the natural person ID can be input into the trained machine learning model to obtain a substitute weight value. Generally speaking, the higher the substitution weight value, the higher the possibility of the user ID being substituted. The preset weight threshold can be set in advance for judging the substitution behavior of the user ID. As shown in Figure 2, natural person IDs correspond to three user IDs (openid1, openid2, openid3), APP rules include application usage habits, geographic location rules include target device location characteristics, and account system rules include travel preference characteristics and interest characteristics . The preset weight threshold can be set to 0.5, then for openid1, it is recognized as a substitute ID, and for openid2 and openid3, it is recognized as a self-use ID. Since one of the three user IDs corresponding to the natural person ID is a substitute ID, it can be determined that the natural person ID is an abnormal natural person ID.
可选的,在执行步骤(131)之前,还可以执行如下步骤:Optionally, before step (131) is performed, the following steps can also be performed:
服务端预先获取至少一个用户ID上报的多个应用数据,根据多个应用数据确定多组特征,对多组特征进行训练,得到训练好的机器学习模型。The server obtains in advance multiple application data reported by at least one user ID, determines multiple sets of features based on the multiple application data, and trains the multiple sets of features to obtain a trained machine learning model.
本申请实施例中,训练好的机器学习模型是服务端根据预先获取的该目标自然人ID对应的至少一个用户ID上报的多个应用数据中提取的多组特征(APP特征、位置特征、目标设备使用特征)进行训练得到的,该训练好的机器学习模型的是针对该目标自然人ID的用户习惯确定的,可以准确的非该目标自然人的其他自然人的使用进行甄别,准确的识别出用户ID是否出现代用情况。In the embodiment of this application, the trained machine learning model is a set of features (APP feature, location feature, target device) extracted from multiple application data reported by the server according to at least one user ID corresponding to the target natural person ID obtained in advance. Use feature) for training, the trained machine learning model is determined for the user habits of the target natural person ID, and can accurately screen the use of other natural persons who are not the target natural person, and accurately identify whether the user ID is Out of modern use.
103,服务端拒绝向目标自然人ID进行内容推送。103. The server refuses to push content to the target natural person ID.
104,服务端向目标自然人ID进行内容推送。104. The server pushes content to the target natural person ID.
本申请实施例中,首先将M个用户ID转换为N个自然人ID,减小推送的ID的数量,然后识别自然人ID中的异常自然人ID,不向异常自然人ID进行内容推送,进一步缩小推送的自然人ID的数量,可以避免向异常自然人ID进行内容推送,提高内容推送的准确性,由于符合推送条件的自然人ID的数量要小于M个用户标识ID,可以降低服务端内容推送的压力,从而提高内容推送的速度。In the embodiment of this application, first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify the abnormal natural person IDs in the natural person IDs, and do not push content to the abnormal natural person IDs, further reducing the number of pushes The number of natural person IDs can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of content pushing on the server, thereby increasing The speed of content delivery.
本申请实施例,可以识别正常自然人ID和异常自然人ID(虚假用户、干扰用户),及时剔除异常自然人ID,减少不必要的经济和分析损失;对正常自然人ID进行内容推送,从而提高推送的准确性,减少资源浪费,并且能实现精准运营。The embodiments of this application can identify normal natural person IDs and abnormal natural person IDs (fake users, interfering users), eliminate abnormal natural person IDs in time, reduce unnecessary economic and analysis losses; push content of normal natural person IDs, thereby improving the accuracy of the push Performance, reduce resource waste, and achieve precise operation.
请参阅图3,图3是本申请实施例公开的另一种内容推送方法的流程示意图。图3是在图1的基础上进一步优化得到的,如图3所示,该内容推送方法包括如下步骤。Please refer to FIG. 3, which is a schematic flowchart of another content pushing method disclosed in an embodiment of the present application. Fig. 3 is obtained by further optimization on the basis of Fig. 1. As shown in Fig. 3, the content pushing method includes the following steps.
301,服务端获取多个客户端上报的多个用户ID的行为数据,使用图存储计算引擎计算所述多个用户ID的行为数据之间的相似度。301. The server obtains the behavior data of multiple user IDs reported by multiple clients, and uses a graph storage calculation engine to calculate the similarity between the behavior data of the multiple user IDs.
302,服务端基于所述多个用户ID的行为数据之间的相似度构建用户ID之间的关系对。302. The server constructs a relationship pair between user IDs based on the similarity between the behavior data of the multiple user IDs.
303,服务端根据所述用户ID之间的关系对构建用户ID与自然人ID的对应关系。303. The server constructs a corresponding relationship between the user ID and the natural person ID according to the relationship between the user IDs.
本申请实施例中,图存储计算引擎,也可以称为图计算引擎,图存储计算引擎采用网页排名(PageRank)算法、最短路径算法、交替最小二乘(Alternating Least Squares,ALS)算法计算所述多个用户ID的行为数据之间的相似度。将相似度大于预设相似度阈值的用户ID建立关系对。比如,SSOID1<->IMEI2,OPENID1<->ICCID3,SSOID2<->TEL2,IMEI2<->TEL3,IMEI2<->ICCID1,SSOID2<->OPENID1,IMEI1<->SSOID2。则SSOID1、IMEI2、TEL3、ICCID1对应一个自然人ID(比如,自然人ID1),OPENID1、ICCID3、SSOID2、TEL2、IMEI1对应另一个自然人ID(比如,自然人ID2)。具体可以参见表1。In the embodiments of the present application, the graph storage calculation engine may also be referred to as a graph calculation engine. The graph storage calculation engine uses the PageRank algorithm, the shortest path algorithm, and the Alternating Least Squares (ALS) algorithm to calculate the The similarity between the behavior data of multiple user IDs. A relationship pair is established for user IDs whose similarity is greater than a preset similarity threshold. For example, SSOID1<->IMEI2, OPENID1<->ICCID3, SSOID2<->TEL2, IMEI2<->TEL3, IMEI2<->ICCID1, SSOID2<->OPENID1, IMEI1<->SSOID2. Then SSOID1, IMEI2, TEL3, and ICCID1 correspond to one natural person ID (for example, natural person ID1), and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1 correspond to another natural person ID (for example, natural person ID2). See Table 1 for details.
表1Table 1
用户IDUser ID 自然人IDNatural person ID
SSOID1、IMEI2、TEL3、ICCID1SSOID1, IMEI2, TEL3, ICCID1 自然人ID1Natural person ID1
OPENID1、ICCID3、SSOID2、TEL2、IMEI1OPENID1, ICCID3, SSOID2, TEL2, IMEI1 自然人ID2Natural person ID2
表1是本申请实施例公开的一种用户ID与自然人ID的对应关系表。如表1所示,自然人ID1与SSOID1、IMEI2、TEL3、ICCID1的对应、自然人ID2与OPENID1、ICCID3、SSOID2、TEL2、IMEI1的对应。Table 1 is a table of correspondence between user IDs and natural person IDs disclosed in the embodiments of the present application. As shown in Table 1, the correspondence between natural person ID1 and SSOID1, IMEI2, TEL3, and ICCID1, and the correspondence between natural person ID2 and OPENID1, ICCID3, SSOID2, TEL2, and IMEI1.
304,服务端抓取M个用户标识ID,服务端获取用户ID与自然人ID的对应关系。304. The server grabs M user identification IDs, and the server acquires the corresponding relationship between the user ID and the natural person ID.
305,服务端根据该用户ID与自然人ID的对应关系确定与M个用户ID对应的N个自然人ID。305. The server determines N natural person IDs corresponding to the M user IDs according to the corresponding relationship between the user ID and the natural person ID.
本申请实施中,用户ID与自然人ID的对应关系可以预先获取。可以在步骤301至步骤303中确定用户ID与自然人ID的对应关系。步骤304在步骤301至步骤303之后执行,用户ID与自然人ID的对应关系无需实时生成,可 以快速确定M个用户ID对应的N个自然人ID。In the implementation of this application, the correspondence between the user ID and the natural person ID can be obtained in advance. The correspondence between the user ID and the natural person ID can be determined in steps 301 to 303. Step 304 is executed after step 301 to step 303. The correspondence between user IDs and natural person IDs does not need to be generated in real time, and N natural person IDs corresponding to M user IDs can be quickly determined.
306,服务端检测目标自然人ID是否为异常自然人ID,目标自然人ID为上述N个自然人ID中的任意一个。若是,则执行步骤307,若否,则执行步骤308。306. The server detects whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the above N natural person IDs. If yes, go to step 307, if not, go to step 308.
307,服务端拒绝向目标自然人ID进行内容推送。307. The server refuses to push content to the target natural person ID.
308,服务端向目标自然人ID进行内容推送。308. The server pushes content to the target natural person ID.
本申请实施例中的步骤306至步骤308的具体实施可以参见图1所示的步骤102至步骤104的描述,此处不再赘述。For the specific implementation of step 306 to step 308 in the embodiment of the present application, reference may be made to the description of step 102 to step 104 shown in FIG. 1, which will not be repeated here.
本申请实施例中,首先根据用户ID与自然人ID的对应关系将M个用户ID转换为N个自然人ID,减小推送的ID的数量,然后识别自然人ID中的异常自然人ID,不向异常自然人ID进行内容推送,进一步缩小推送的自然人ID的数量,可以避免向异常自然人ID进行内容推送,提高内容推送的准确性,由于符合推送条件的自然人ID的数量要小于M个用户标识ID,可以降低服务端内容推送的压力,从而提高内容推送的速度。In the embodiment of this application, first, according to the corresponding relationship between the user ID and the natural person ID, the M user IDs are converted into N natural person IDs, the number of IDs pushed is reduced, and then the abnormal natural person ID in the natural person ID is identified, and the abnormal natural person ID is not reported to the abnormal natural person. ID pushes content, further reduces the number of natural person IDs that are pushed, can avoid pushing content to abnormal natural person IDs, and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can be reduced The pressure of server-side content push, thereby increasing the speed of content push.
上述主要从方法侧执行过程的角度对本申请实施例的方案进行了介绍。可以理解的是,移动终端为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本发明能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。The foregoing mainly introduces the solution of the embodiment of the present application from the perspective of the execution process on the method side. It can be understood that, in order to implement the above-mentioned functions, the mobile terminal includes hardware structures and/or software modules corresponding to each function. Those skilled in the art should easily realize that in combination with the units and algorithm steps of the examples described in the embodiments disclosed herein, the present invention can be implemented in the form of hardware or a combination of hardware and computer software. Whether a certain function is executed by hardware or computer software-driven hardware depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the present invention.
本申请实施例可以根据上述方法示例对移动终端进行功能单元的划分,例如,可以对应各个功能划分各个功能单元,也可以将两个或两个以上的功能集成在一个处理单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。需要说明的是,本申请实施例中对单元的划分是示意性的,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。The embodiments of the present application may divide the mobile terminal into functional units according to the foregoing method examples. For example, each functional unit may be divided corresponding to each function, or two or more functions may be integrated into one processing unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. It should be noted that the division of units in the embodiments of the present application is illustrative, and is only a logical function division, and there may be other division methods in actual implementation.
请参阅图4,图4是本申请实施例公开的一种内容推送装置的结构示意图。如图4所示,该内容推送装置400包括抓取单元401、确定单元402、检测单 元403和处理单元404,其中:Please refer to FIG. 4, which is a schematic structural diagram of a content pushing device disclosed in an embodiment of the present application. As shown in Figure 4, the content pushing device 400 includes a grabbing unit 401, a determining unit 402, a detecting unit 403, and a processing unit 404, wherein:
所述抓取单元401,用于抓取M个用户标识ID;The grabbing unit 401 is configured to grab M user identification IDs;
所述确定单元402,用于确定与所述M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M;The determining unit 402 is configured to determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
所述检测单元403,用于检测目标自然人ID是否为异常自然人ID,所述目标自然人ID为所述N个自然人ID中的任意一个;The detection unit 403 is configured to detect whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
所述处理单元404,用于在所述检测单元403检测出所述目标自然人ID为异常自然人ID的情况下,拒绝向所述目标自然人ID进行内容推送。The processing unit 404 is configured to refuse to push content to the target natural person ID when the detection unit 403 detects that the target natural person ID is an abnormal natural person ID.
可选的,所述处理单元404,还用于在所述检测单元403检测出所述目标自然人ID不是异常自然人ID的情况下,向所述目标自然人ID进行内容推送。Optionally, the processing unit 404 is further configured to push content to the target natural person ID when the detection unit 403 detects that the target natural person ID is not an abnormal natural person ID.
可选的,所述检测单元403检测目标自然人ID是否为异常自然人ID,具体为:获取所述目标自然人ID的用户行为数据,基于所述目标自然人ID的用户行为数据分析所述目标自然人ID是否为刷量ID;若所述目标自然人ID为刷量ID,则确定所述目标自然人ID为异常自然人ID;若所述目标自然人ID不是刷量ID,采用机器学习算法检测所述目标自然人ID是否为代用ID;若所述目标自然人ID为代用ID,则确定所述目标自然人ID为异常自然人ID。Optionally, the detection unit 403 detects whether the target natural person ID is an abnormal natural person ID, specifically: acquiring user behavior data of the target natural person ID, and analyzing whether the target natural person ID is based on the user behavior data of the target natural person ID Is the amount ID; if the target natural person ID is the amount ID, the target natural person ID is determined to be an abnormal natural person ID; if the target natural person ID is not the amount ID, a machine learning algorithm is used to detect whether the target natural person ID Is a substitute ID; if the target natural person ID is a substitute ID, it is determined that the target natural person ID is an abnormal natural person ID.
可选的,所述用户行为数据包括应用程序安装数据、应用程序卸载数据以及付费额度,所述检测单元403基于所述目标自然人ID的用户行为数据分析所述目标自然人ID是否为刷量ID,具体为:检测所述目标自然人ID的应用程序安装数据中是否存在单日安装总量大于预设安装数量上限阈值的数据;检测所述目标自然人ID的应用程序卸载数据中是否存在单日卸载总量大于预设卸载数量上限阈值的数据;检测所述目标自然人ID的付费额度是否超过预设额度阈值;若所述目标自然人ID的应用程序安装数据中存在单日安装总量大于所述预设安装数量上限阈值的数据,并且所述目标自然人ID的应用程序卸载数据中存在单日卸载总量大于所述预设卸载数量上限阈值的数据,并且所述目标自然人ID的付费额度超过所述预设额度阈值,则确定目标自然人ID为刷量ID。Optionally, the user behavior data includes application installation data, application uninstallation data, and payment amount, and the detection unit 403 analyzes whether the target natural person ID is a credit ID based on the user behavior data of the target natural person ID, Specifically: detecting whether there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the preset upper limit of the number of installations; detecting whether there is a single-day uninstallation data in the application uninstallation data of the target natural person ID The amount of data is greater than the preset upper limit threshold of the number of uninstalls; it is detected whether the payment amount of the target natural person ID exceeds the preset amount threshold; if there is a total number of single-day installations in the target natural person ID application installation data greater than the preset The data of the upper limit threshold of the number of installations, and there is data in the application uninstallation data of the target natural person ID that the total amount of uninstallation in a single day is greater than the preset upper limit of the uninstallation number threshold, and the payment amount of the target natural person ID exceeds the preset If the quota threshold is set, the target natural person ID is determined as the amount ID.
可选的,所述检测单元403采用机器学习算法检测所述目标自然人ID是否为代用ID,具体为:确定所述目标自然人ID对应的至少一个用户ID;在 所述目标自然人ID对应的第一用户ID登录目标设备后,获取所述目标设备的位置特征、所述第一用户ID的出行偏好特征、所述第一用户ID的应用程序使用习惯特征、所述第一用户ID的兴趣特征、所述目标设备的设备使用特征以及目标设备标识;所述第一用户ID为所述至少一个用户ID中的任意一个;将所述目标设备的位置特征、所述第一用户ID的出行偏好特征、所述第一用户ID的应用程序使用习惯特征、所述第一用户ID的兴趣特征、所述目标设备的设备使用特征以及目标设备标识输入训练好的机器学习模型,得到所述第一用户ID的代用权重值;若所述第一用户ID的代用权重值大于预设权重阈值,则确定所述目标自然人ID为代用ID。Optionally, the detection unit 403 uses a machine learning algorithm to detect whether the target natural person ID is a substitute ID, specifically: determining at least one user ID corresponding to the target natural person ID; After the user ID logs in to the target device, obtain the location feature of the target device, the travel preference feature of the first user ID, the application usage habit feature of the first user ID, the interest feature of the first user ID, The device usage feature of the target device and the target device identifier; the first user ID is any one of the at least one user ID; the location feature of the target device and the travel preference feature of the first user ID , The application usage habit characteristics of the first user ID, the interest characteristics of the first user ID, the device usage characteristics of the target device, and the target device identification input a trained machine learning model to obtain the first user The substitution weight value of the ID; if the substitution weight value of the first user ID is greater than the preset weight threshold, it is determined that the target natural person ID is the substitution ID.
可选的,所述检测单元403确定所述目标自然人ID对应的至少一个用户ID之前,还用于预先获取所述至少一个用户ID上报的多个应用数据,根据所述多个应用数据确定多组特征,对所述多组特征进行训练,得到训练好的机器学习模型。Optionally, before the detecting unit 403 determines the at least one user ID corresponding to the target natural person ID, it is further configured to obtain in advance multiple pieces of application data reported by the at least one user ID, and determine the multiple pieces according to the multiple pieces of application data. Group features, training the multiple groups of features to obtain a trained machine learning model.
可选的,所述确定单元402确定与所述M个用户ID对应的N个自然人ID,具体为:获取用户ID与自然人ID的对应关系;根据所述用户ID与自然人ID的对应关系确定与所述M个用户ID对应的N个自然人ID。Optionally, the determining unit 402 determines the N natural person IDs corresponding to the M user IDs, specifically: acquiring the corresponding relationship between the user ID and the natural person ID; and determining the corresponding relationship according to the corresponding relationship between the user ID and the natural person ID. N natural person IDs corresponding to the M user IDs.
可选的,所述确定单元402获取用户ID与自然人ID的对应关系之前,还用于获取多个客户端上报的多个用户ID的行为数据,使用图存储计算引擎计算所述多个用户ID的行为数据之间的相似度;基于所述多个用户ID的行为数据之间的相似度构建用户ID之间的关系对;根据所述用户ID之间的关系对构建用户ID与自然人ID的对应关系。Optionally, before the determining unit 402 obtains the corresponding relationship between the user ID and the natural person ID, it is also used to obtain the behavior data of multiple user IDs reported by multiple clients, and calculate the multiple user IDs using a graph storage calculation engine. The similarity between the behavior data of the user ID; the relationship between the user ID is constructed based on the similarity between the behavior data of the multiple user IDs; the relationship between the user ID and the natural person ID is constructed based on the relationship between the user IDs Correspondence.
实施图4所示的内容推送装置,首先将M个用户ID转换为N个自然人ID,减小推送的ID的数量,然后识别自然人ID中的异常自然人ID,不向异常自然人ID进行内容推送,进一步缩小推送的自然人ID的数量,可以避免向异常自然人ID进行内容推送,提高内容推送的准确性,由于符合推送条件的自然人ID的数量要小于M个用户标识ID,可以降低服务端内容推送的压力,从而提高内容推送的速度。Implement the content pushing device shown in Figure 4, first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify abnormal natural person IDs in natural person IDs, and do not push content to abnormal natural person IDs. Further narrowing the number of natural person IDs pushed can avoid pushing content to abnormal natural person IDs, and improve the accuracy of content pushing. Since the number of natural person IDs that meet the push conditions is less than M user identification IDs, it can reduce the amount of server-side content pushing Pressure to increase the speed of content delivery.
请参阅图5,图5是本申请实施例公开的一种移动终端的结构示意图。如图5所示,该移动终端500包括处理器501和存储器502,其中,移动终端500 还可以包括总线503,处理器501和存储器502可以通过总线503相互连接,总线503可以是外设部件互连标准(Peripheral Component Interconnect,简称PCI)总线或扩展工业标准结构(Extended Industry Standard Architecture,简称EISA)总线等。总线503可以分为地址总线、数据总线、控制总线等。为便于表示,图5中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。其中,移动终端500还可以包括输入输出设备504,输入输出设备504可以包括显示屏,例如液晶显示屏。存储器502用于存储包含指令的一个或多个程序;处理器501用于调用存储在存储器502中的指令执行上述图1至图3中的部分或全部方法步骤。Please refer to FIG. 5, which is a schematic structural diagram of a mobile terminal disclosed in an embodiment of the present application. As shown in FIG. 5, the mobile terminal 500 includes a processor 501 and a memory 502. The mobile terminal 500 may also include a bus 503. The processor 501 and the memory 502 may be connected to each other through the bus 503. The bus 503 may be a peripheral component. Connect the standard (Peripheral Component Interconnect, referred to as PCI) bus or extended industry standard architecture (Extended Industry Standard Architecture, referred to as EISA) bus, etc. The bus 503 can be divided into an address bus, a data bus, a control bus, and so on. For ease of presentation, only one thick line is used in FIG. 5 to represent, but it does not mean that there is only one bus or one type of bus. The mobile terminal 500 may also include an input and output device 504, and the input and output device 504 may include a display screen, such as a liquid crystal display screen. The memory 502 is used to store one or more programs containing instructions; the processor 501 is used to call the instructions stored in the memory 502 to execute some or all of the method steps in FIGS. 1 to 3.
实施图5所示的移动终端,首先将M个用户ID转换为N个自然人ID,减小推送的ID的数量,然后识别自然人ID中的异常自然人ID,不向异常自然人ID进行内容推送,进一步缩小推送的自然人ID的数量,可以避免向异常自然人ID进行内容推送,提高内容推送的准确性,由于符合推送条件的自然人ID的数量要小于M个用户标识ID,可以降低服务端内容推送的压力,从而提高内容推送的速度。Implement the mobile terminal shown in Figure 5, first convert M user IDs into N natural person IDs, reduce the number of IDs pushed, and then identify abnormal natural person IDs in natural person IDs, and do not push content to abnormal natural person IDs, and further Reducing the number of natural person IDs pushed can avoid pushing content to abnormal natural person IDs and improve the accuracy of content pushing. Since the number of natural person IDs that meet the pushing conditions is less than M user identification IDs, it can reduce the pressure of server content pushing , Thereby improving the speed of content push.
本申请实施例还提供一种计算机存储介质,其中,该计算机存储介质存储用于电子数据交换的计算机程序,该计算机程序使得计算机执行如上述方法实施例中记载的任何一种内容推送方法的部分或全部步骤。The embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a computer program for electronic data exchange, and the computer program causes the computer to execute any part of the content push method recorded in the above method embodiment Or all steps.
本申请实施例还提供一种计算机程序产品,该计算机程序产品包括存储了计算机程序的非瞬时性计算机可读存储介质,该计算机程序可操作来使计算机执行如上述方法实施例中记载的任何一种内容推送方法的部分或全部步骤。The embodiments of the present application also provide a computer program product. The computer program product includes a non-transitory computer-readable storage medium storing a computer program. The computer program is operable to cause a computer to execute any of the methods described in the foregoing method embodiments. Part or all of the steps of a content push method.
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明并不受所描述的动作顺序的限制,因为依据本发明,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本发明所必须的。It should be noted that for the foregoing method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should know that the present invention is not limited by the described sequence of actions. Because according to the present invention, certain steps can be performed in other order or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the involved actions and modules are not necessarily required by the present invention.
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own focus. For parts that are not described in detail in an embodiment, reference may be made to related descriptions of other embodiments.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置,可通过其 它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed device can be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储器中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储器中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储器包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a computer readable memory. Based on this understanding, the technical solution of the present invention essentially or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a memory, A number of instructions are included to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention. The aforementioned memory includes: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other various media that can store program codes.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储器中,存储器可以包括:闪存盘、只读存储器(英文:Read-Only Memory,简称:ROM)、随机存取器(英文:Random Access Memory,简称:RAM)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing relevant hardware through a program. The program can be stored in a computer-readable memory, and the memory can include: flash disk , Read-only memory (English: Read-Only Memory, abbreviation: ROM), random access device (English: Random Access Memory, abbreviation: RAM), magnetic disk or optical disc, etc.
以上对本申请实施例进行了详细介绍,本文中应用了具体个例对本发明的 原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。The embodiments of the present application are described in detail above, and specific examples are used in this article to illustrate the principles and implementation of the present invention. The descriptions of the above embodiments are only used to help understand the methods and core ideas of the present invention; Persons of ordinary skill in the art, based on the idea of the present invention, will have changes in the specific implementation and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (10)

  1. 一种内容推送方法,其特征在于,包括:A content pushing method, characterized in that it comprises:
    抓取M个用户标识ID,确定与所述M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M;Grab M user identification IDs, and determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
    检测目标自然人ID是否为异常自然人ID,所述目标自然人ID为所述N个自然人ID中的任意一个;Detecting whether the target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
    若是,拒绝向所述目标自然人ID进行内容推送。If yes, refuse to push content to the target natural person ID.
  2. 根据权利要求1所述的方法,其特征在于,所述检测目标自然人ID是否为异常自然人ID,包括:The method according to claim 1, wherein the detecting whether the target natural person ID is an abnormal natural person ID comprises:
    获取所述目标自然人ID的用户行为数据,基于所述目标自然人ID的用户行为数据分析所述目标自然人ID是否为刷量ID;Obtain user behavior data of the target natural person ID, and analyze whether the target natural person ID is a credit ID based on the user behavior data of the target natural person ID;
    若所述目标自然人ID为刷量ID,则确定所述目标自然人ID为异常自然人ID;If the target natural person ID is a brush amount ID, it is determined that the target natural person ID is an abnormal natural person ID;
    若所述目标自然人ID不是刷量ID,采用机器学习算法检测所述目标自然人ID是否为代用ID;If the target natural person ID is not a swipe ID, use a machine learning algorithm to detect whether the target natural person ID is a substitute ID;
    若所述目标自然人ID为代用ID,则确定所述目标自然人ID为异常自然人ID。If the target natural person ID is a substitute ID, it is determined that the target natural person ID is an abnormal natural person ID.
  3. 根据权利要求2所述的方法,其特征在于,所述用户行为数据包括应用程序安装数据、应用程序卸载数据以及付费额度,所述基于所述目标自然人ID的用户行为数据分析所述目标自然人ID是否为刷量ID,包括:The method according to claim 2, wherein the user behavior data includes application installation data, application uninstallation data, and payment amount, and the user behavior data based on the target natural person ID analyzes the target natural person ID Whether it is the brush ID, including:
    检测所述目标自然人ID的应用程序安装数据中是否存在单日安装总量大于预设安装数量上限阈值的数据;检测所述目标自然人ID的应用程序卸载数据中是否存在单日卸载总量大于预设卸载数量上限阈值的数据;检测所述目标自然人ID的付费额度是否超过预设额度阈值;Detect whether there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the preset upper limit threshold of the number of installations; detect whether there is a single-day uninstallation data that is greater than the expected amount in the application uninstallation data of the target natural person ID Set the data of the upper limit threshold of the number of uninstalls; detect whether the payment amount of the target natural person ID exceeds the preset amount threshold;
    若所述目标自然人ID的应用程序安装数据中存在单日安装总量大于所述预设安装数量上限阈值的数据,并且所述目标自然人ID的应用程序卸载数据 中存在单日卸载总量大于所述预设卸载数量上限阈值的数据,并且所述目标自然人ID的付费额度超过所述预设额度阈值,则确定目标自然人ID为刷量ID。If there is data in the application installation data of the target natural person ID that the total number of single-day installations is greater than the upper limit threshold of the preset number of installations, and the application uninstallation data of the target natural person ID includes the total number of single-day uninstalls greater than all According to the data of the upper limit threshold of the preset amount of unloading, and the payment amount of the target natural person ID exceeds the preset amount threshold, it is determined that the target natural person ID is the amount ID.
  4. 根据权利要求2所述的方法,其特征在于,所述采用机器学习算法检测所述目标自然人ID是否为代用ID,包括:The method according to claim 2, wherein the detecting whether the target natural person ID is a substitute ID by using a machine learning algorithm comprises:
    确定所述目标自然人ID对应的至少一个用户ID;Determine at least one user ID corresponding to the target natural person ID;
    在所述目标自然人ID对应的第一用户ID登录目标设备后,获取所述目标设备的位置特征、所述第一用户ID的出行偏好特征、所述第一用户ID的应用程序使用习惯特征、所述第一用户ID的兴趣特征、所述目标设备的设备使用特征以及目标设备标识;所述第一用户ID为所述至少一个用户ID中的任意一个;After the first user ID corresponding to the target natural person ID logs in to the target device, the location characteristics of the target device, the travel preference characteristics of the first user ID, the application usage habit characteristics of the first user ID, The interest feature of the first user ID, the device usage feature of the target device, and the target device identifier; the first user ID is any one of the at least one user ID;
    将所述目标设备的位置特征、所述第一用户ID的出行偏好特征、所述第一用户ID的应用程序使用习惯特征、所述第一用户ID的兴趣特征、所述目标设备的设备使用特征以及目标设备标识输入训练好的机器学习模型,得到所述第一用户ID的代用权重值;The location feature of the target device, the travel preference feature of the first user ID, the application usage habit feature of the first user ID, the interest feature of the first user ID, and the device usage of the target device Input the trained machine learning model to the characteristics and the target device identification to obtain the substitute weight value of the first user ID;
    若所述第一用户ID的代用权重值大于预设权重阈值,则确定所述目标自然人ID为代用ID。If the substitute weight value of the first user ID is greater than a preset weight threshold, it is determined that the target natural person ID is the substitute ID.
  5. 根据权利要求4所述的方法,其特征在于,所述确定所述目标自然人ID对应的至少一个用户ID之前,所述方法还包括:The method according to claim 4, wherein before the determining at least one user ID corresponding to the target natural person ID, the method further comprises:
    预先获取所述至少一个用户ID上报的多个应用数据,根据所述多个应用数据确定多组特征,对所述多组特征进行训练,得到训练好的机器学习模型。Acquire multiple application data reported by the at least one user ID in advance, determine multiple sets of features based on the multiple application data, and train the multiple sets of features to obtain a trained machine learning model.
  6. 根据权利要求1~5任一项所述的方法,其特征在于,所述确定与所述M个用户ID对应的N个自然人ID,包括:The method according to any one of claims 1 to 5, wherein the determining N natural person IDs corresponding to the M user IDs comprises:
    获取用户ID与自然人ID的对应关系;Obtain the correspondence between the user ID and the natural person ID;
    根据所述用户ID与自然人ID的对应关系确定与所述M个用户ID对应的N个自然人ID。The N natural person IDs corresponding to the M user IDs are determined according to the correspondence between the user ID and the natural person ID.
  7. 根据权利要求6所述的方法,其特征在于,所述获取用户ID与自然人ID的对应关系之前,所述方法还包括:The method according to claim 6, characterized in that, before the obtaining the correspondence between the user ID and the natural person ID, the method further comprises:
    获取多个客户端上报的多个用户ID的行为数据,使用图存储计算引擎计算所述多个用户ID的行为数据之间的相似度;Obtain the behavior data of multiple user IDs reported by multiple clients, and use a graph storage calculation engine to calculate the similarity between the behavior data of the multiple user IDs;
    基于所述多个用户ID的行为数据之间的相似度构建用户ID之间的关系对;Constructing a relationship pair between user IDs based on the similarity between the behavior data of the multiple user IDs;
    根据所述用户ID之间的关系对构建用户ID与自然人ID的对应关系。The corresponding relationship between the user ID and the natural person ID is constructed according to the relationship between the user IDs.
  8. 一种内容推送装置,其特征在于,所述内容推送装置包括抓取单元、确定单元、检测单元和处理单元,其中:A content pushing device, characterized in that the content pushing device includes a grabbing unit, a determining unit, a detecting unit and a processing unit, wherein:
    所述抓取单元,用于抓取M个用户标识ID;The grabbing unit is used to grab M user identification IDs;
    所述确定单元,用于确定与所述M个用户ID对应的N个自然人ID,M、N均为正整数,N小于或等于M;The determining unit is configured to determine N natural person IDs corresponding to the M user IDs, where M and N are both positive integers, and N is less than or equal to M;
    所述检测单元,用于检测目标自然人ID是否为异常自然人ID,所述目标自然人ID为所述N个自然人ID中的任意一个;The detection unit is configured to detect whether a target natural person ID is an abnormal natural person ID, and the target natural person ID is any one of the N natural person IDs;
    所述处理单元,用于在所述检测单元检测出所述目标自然人ID为异常自然人ID的情况下,拒绝向所述目标自然人ID进行内容推送。The processing unit is configured to refuse to push content to the target natural person ID when the detection unit detects that the target natural person ID is an abnormal natural person ID.
  9. 一种移动终端,其特征在于,包括处理器以及存储器,所述存储器用于存储一个或多个程序,所述一个或多个程序被配置成由所述处理器执行,所述程序包括用于执行如权利要求1~7任一项所述的方法。A mobile terminal, characterized by comprising a processor and a memory, the memory is used to store one or more programs, the one or more programs are configured to be executed by the processor, and the programs include Perform the method according to any one of claims 1-7.
  10. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质用于存储电子数据交换的计算机程序,其中,所述计算机程序使得计算机执行如权利要求1~7任一项所述的方法。A computer-readable storage medium, characterized in that the computer-readable storage medium is used to store a computer program for electronic data exchange, wherein the computer program causes a computer to execute any one of claims 1-7 method.
PCT/CN2019/093110 2019-06-26 2019-06-26 Content pushing method and apparatus, mobile terminal and storage medium WO2020258102A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2019/093110 WO2020258102A1 (en) 2019-06-26 2019-06-26 Content pushing method and apparatus, mobile terminal and storage medium
CN201980091547.8A CN113412607B (en) 2019-06-26 2019-06-26 Content pushing method and device, mobile terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/093110 WO2020258102A1 (en) 2019-06-26 2019-06-26 Content pushing method and apparatus, mobile terminal and storage medium

Publications (1)

Publication Number Publication Date
WO2020258102A1 true WO2020258102A1 (en) 2020-12-30

Family

ID=74059881

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/093110 WO2020258102A1 (en) 2019-06-26 2019-06-26 Content pushing method and apparatus, mobile terminal and storage medium

Country Status (2)

Country Link
CN (1) CN113412607B (en)
WO (1) WO2020258102A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650746A (en) * 2021-01-12 2021-04-13 浪潮云信息技术股份公司 Method for automatically capturing exchange service data applied to government affair handling system
CN113435505A (en) * 2021-06-28 2021-09-24 中电积至(海南)信息技术有限公司 Construction method and device for safe user portrait
CN114860557A (en) * 2022-04-08 2022-08-05 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN115914153A (en) * 2022-11-15 2023-04-04 浪潮云信息技术股份公司 Method and system for combining Instant Messaging (IM) and public number based on WeChat

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104639550A (en) * 2015-02-12 2015-05-20 北京嘀嘀无限科技发展有限公司 User identification method and equipment
CN105550348A (en) * 2015-12-25 2016-05-04 时趣互动(北京)科技有限公司 Cheating user decision method and device based on on-line behavior data of user
CN106572048A (en) * 2015-10-09 2017-04-19 腾讯科技(深圳)有限公司 Identification method and system of user information in social network
EP3487200A1 (en) * 2017-11-17 2019-05-22 Bundesdruckerei GmbH Behaviour-based authentication with fall-back position

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105095891A (en) * 2014-05-05 2015-11-25 深圳市贝尔信智能系统有限公司 Human face capturing method, device and system
CN108898042B (en) * 2017-12-27 2021-10-22 浩云科技股份有限公司 Method for detecting abnormal user behavior in ATM cabin
CN108416015A (en) * 2018-03-05 2018-08-17 重庆电子工程职业学院 A kind of information security method for pushing
CN108985954B (en) * 2018-07-02 2022-06-21 武汉斗鱼网络科技有限公司 Method for establishing association relation of each identifier and related equipment
CN109241343B (en) * 2018-07-27 2022-07-22 北京奇艺世纪科技有限公司 System, method and device for identifying brush amount user

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104639550A (en) * 2015-02-12 2015-05-20 北京嘀嘀无限科技发展有限公司 User identification method and equipment
CN106572048A (en) * 2015-10-09 2017-04-19 腾讯科技(深圳)有限公司 Identification method and system of user information in social network
CN105550348A (en) * 2015-12-25 2016-05-04 时趣互动(北京)科技有限公司 Cheating user decision method and device based on on-line behavior data of user
EP3487200A1 (en) * 2017-11-17 2019-05-22 Bundesdruckerei GmbH Behaviour-based authentication with fall-back position

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112650746A (en) * 2021-01-12 2021-04-13 浪潮云信息技术股份公司 Method for automatically capturing exchange service data applied to government affair handling system
CN113435505A (en) * 2021-06-28 2021-09-24 中电积至(海南)信息技术有限公司 Construction method and device for safe user portrait
CN114860557A (en) * 2022-04-08 2022-08-05 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN114860557B (en) * 2022-04-08 2023-05-26 广东联想懂的通信有限公司 User behavior information generation method, device, equipment and readable storage medium
CN115914153A (en) * 2022-11-15 2023-04-04 浪潮云信息技术股份公司 Method and system for combining Instant Messaging (IM) and public number based on WeChat

Also Published As

Publication number Publication date
CN113412607A (en) 2021-09-17
CN113412607B (en) 2022-09-09

Similar Documents

Publication Publication Date Title
WO2020258102A1 (en) Content pushing method and apparatus, mobile terminal and storage medium
US11270343B2 (en) Method and apparatus for generating targeted label, and storage medium
CN104468249B (en) Account abnormity detection method and device
US10375242B2 (en) System and method for user notification regarding detected events
CN108737535B (en) Message pushing method, storage medium and server
US20190342415A1 (en) Event information push method, event information push apparatus, and storage medium
CN104579912B (en) Data pushing method and device
CN109669795B (en) Crash information processing method and device
CN109218390B (en) User screening method and device
WO2020248658A1 (en) Abnormal account detection method and apparatus
CN107370780B (en) Media pushing method, device and system based on Internet
US9578119B2 (en) Method, server, and client for pushing and displaying splash screen
US11537751B2 (en) Using machine learning algorithm to ascertain network devices used with anonymous identifiers
CN107515915A (en) User based on user behavior data identifies correlating method
WO2020257993A1 (en) Content pushing method and apparatus, server, and storage medium
CN106713290B (en) Method for identifying main user account and server
CN114095567B (en) Data access request processing method and device, computer equipment and medium
CN105491444A (en) Data identification processing method and device
WO2020257991A1 (en) User identification method and related product
CN109034867B (en) Click traffic detection method and device and storage medium
CN109495378A (en) Detect method, apparatus, server and the storage medium of abnormal account number
CN108804501A (en) A kind of method and device of detection effective information
CN110460593B (en) Network address identification method, device and medium for mobile traffic gateway
CN111182018B (en) User identification method, device, equipment and medium
CN115470399A (en) ID (identity) communication method, device, equipment and storage medium based on big data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19934874

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19934874

Country of ref document: EP

Kind code of ref document: A1