WO2017202006A1 - Dispositif et procédé de traitement de données, et support d'informations informatique - Google Patents

Dispositif et procédé de traitement de données, et support d'informations informatique Download PDF

Info

Publication number
WO2017202006A1
WO2017202006A1 PCT/CN2016/109729 CN2016109729W WO2017202006A1 WO 2017202006 A1 WO2017202006 A1 WO 2017202006A1 CN 2016109729 W CN2016109729 W CN 2016109729W WO 2017202006 A1 WO2017202006 A1 WO 2017202006A1
Authority
WO
WIPO (PCT)
Prior art keywords
account
variable
data
feature
behavior
Prior art date
Application number
PCT/CN2016/109729
Other languages
English (en)
Chinese (zh)
Inventor
陈玲
陈谦
陈培炫
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2017202006A1 publication Critical patent/WO2017202006A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance

Definitions

  • the present invention relates to the field of data processing, and in particular to a data processing method and apparatus, and a computer storage medium.
  • the data processing can be used for each service, taking the existing personal credit information business as an example, and the data processing process is described as follows:
  • the basic database of credit information includes credit information, public records and inquiry records. Credit information includes credit card records, bank loan records, personal asset records and other credit loan records. Public records include personal housing provident fund, personal pension insurance, etc., and the inquiry records include personal addresses. And contact information, etc.
  • the bank's credit information is used as the primary basis. Obtain an individual's credit rating by means of a sample survey. However, due to the slow update of the bank's credit information, it is impossible to reflect the true credit level of the individual in time, resulting in inaccurate credit levels. At the same time, the data obtained by means of the prior art sample survey cannot fully reflect the true credit level of the bank account, and the resulting credit level is inaccurate, resulting in inaccurate data.
  • the embodiment of the invention provides a data processing method and device, and a computer storage medium, to solve at least the technical problem that the credit level of the account cannot be accurately obtained and the data is inaccurate.
  • a data processing method including: Collecting behavior data of the first account, the behavior data includes online behavior data and offline behavior data based on the Internet; acquiring a first feature variable of the first account according to the behavior data, wherein the first feature a variable for indicating a behavior characteristic of the first account; inputting the first feature variable into a data analysis model, wherein the data analysis model is configured to output a first value according to the first feature variable, the first The value is used to indicate a probability value that the behavior of the first account does not satisfy the preset condition; and the first value output by the data analysis model is recorded.
  • a data processing apparatus including: an collecting unit, configured to collect behavior data of a first account, where the behavior data includes online behavior data and offline behavior based on the Internet a data acquisition unit, configured to acquire, according to the behavior data, a first feature variable of the first account, where the first feature variable is used to represent behavior characteristics of the first account, and an input unit is configured to The first feature variable is input to a data analysis model, wherein the data analysis model is configured to output a first value according to the first feature variable, where the first value is used to indicate that the behavior of the first account is not satisfied a probability value of the condition; a recording unit configured to record the first value output by the data analysis model.
  • the collecting unit, the obtaining unit, the input unit, and the recording unit may use a central processing unit (CPU), a digital signal processor (DSP, Digital Singnal Processor), or Field-Programmable Gate Array (FPGA) implementation.
  • CPU central processing unit
  • DSP digital signal processor
  • FPGA Field-Programmable Gate Array
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are configured to execute the data processing method described above.
  • the first feature variable is used to represent the behavior characteristic of the first account, and the behavior characteristic of the first account is obtained based on the behavior data of the first account based on the Internet, and then the first feature variable is input into the data analysis model. , the probability value that the behavior of the first account does not satisfy the preset condition can be obtained. Since the behavior data of the first account in the social application can be relatively wide coverage The behavior of an account, the behavior data input into the data analysis model can fully reflect the behavior of the first account, so that the analyzed probability value of the behavior of the first account does not meet the preset condition is more accurate, thereby solving the inaccuracy Get technical questions about the credit level of your account.
  • FIG. 1 is a schematic diagram of a network architecture in accordance with an embodiment of the present invention.
  • FIG. 2 is a flow chart of a data processing method according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of a model architecture in accordance with an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of a data processing apparatus according to an embodiment of the present invention.
  • FIG. 5 is a hardware configuration diagram of a server according to an embodiment of the present invention.
  • an embodiment of a method that can be performed by an embodiment of the apparatus of the present application is provided. It is noted that the steps illustrated in the flowchart of the accompanying drawings can be in a computer system such as a set of computer executable instructions. The execution is performed, and although the logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in a different order than the ones described herein.
  • a data processing method is provided.
  • the data processing method can be applied to a hardware environment formed by the terminal 102 and the server 104 as shown in FIG. 1.
  • the terminal 102 is connected to the server 104 through a network.
  • the network includes but is not limited to: a mobile communication network, a wide area network, a metropolitan area network, or a local area network.
  • the terminal 102 may be a mobile phone terminal, or may be a PC terminal or a notebook terminal. Or a tablet terminal.
  • the server 104 collects behavior data of the plurality of terminals 102, including behavior data obtained by the terminal 102 performing actions through the Internet (for example, chatting in an instant messaging application, watching videos, games, etc.) and combining the terminal 102 via the Internet and offline actions. Behavioral data obtained by the action (such as storing motion data in the cloud through a wearable device during exercise, etc.).
  • the server 104 analyzes the feature variables of one or more terminals 102 according to the collected behavior data, and then acquires the probability that the behavior of a certain terminal satisfies a preset condition according to the feature variables of one or more terminals 102 (eg, a certain terminal). Credit rating). Further, when the credits of a certain terminal 102 are acquired through the feature variables of the plurality of terminals 102, the plurality of terminals have an association relationship (such as a friend relationship) with the one terminal 102.
  • the first account is based on social application-based behavior data, it is not limited to the prior art silver.
  • Row data the collected behavior data covers a wider range, and can reflect the probability value of the behavior of the first account meeting the preset condition from multiple aspects, thereby improving the accuracy of the obtained probability value, thereby solving the existing Technology can't accurately get the technical problem of the credit level of the account.
  • FIG. 2 is a flowchart of a data processing method according to an embodiment of the present invention.
  • the data processing method provided by the embodiment of the present invention is specifically described below with reference to FIG. 2 .
  • the data processing method mainly includes the following steps:
  • Step S202 Collect behavior data of the first account, and the behavior data includes online data and offline data based on the Internet.
  • Step S204 Acquire a first feature variable of the first account according to the behavior data, where the first feature variable is used to represent a behavior feature of the first account.
  • Step S206 the first feature variable is input into the data analysis model, wherein the data analysis model is configured to output the first value according to the first feature variable, where the first value is used to indicate a probability value that the behavior of the first account does not satisfy the preset condition.
  • Step S208 recording a first value output by the data analysis model.
  • the first characteristic variable is used to represent the behavior characteristic of the first account, and the behavior characteristic of the first account is obtained based on the behavior data of the first account based on the Internet, and then the first feature variable is input into the data analysis model, and the first account is obtained.
  • the behavior does not satisfy the probability value of the preset condition. Since the behavior data of the first account in the social application can cover the behavior of the first account relatively widely, the behavior data input into the data analysis model can fully reflect the behavior of the first account, thereby causing the analysis of the first account. The probability value that the behavior does not satisfy the preset condition is more accurate.
  • the behavior data includes online behavior data and offline behavior data of the first account based on the Internet.
  • the virtual space behavior data on the Internet includes not only:
  • User's basic demographic attribute information such as name, age, gender, region, education, occupation, etc.
  • Virtual value-added service data such as virtual account role dressing, game item purchase, film and television membership service, cloud storage space value-added service, music flow package, etc.
  • Economic behavior data such as payment, wealth management, shopping, stocks, funds, P2P, finance, etc.
  • the online data can be obtained through the user's mobile phone, tablet or PC computer instant messaging application, game client, APP download platform, financial platform, shopping software, etc. to collect the information filled by the user or the application is actively reported.
  • Offline related scene data includes not only:
  • O2O online to offline, such as housekeeping services, urban services, beauty care, etc.
  • wear device data such as medical health, sports, etc.
  • LBS location based service
  • Travel data such as ticket ordering, hotel reservations, etc.
  • the behavior data includes actions in various online and offline scenes, and almost includes behavior data of various aspects of life. Therefore, the probability values obtained according to the behavior data also more accurately reflect the true probability value of the account.
  • the behavior data changes, it will be immediately fed back to the server or instantly obtained by the server. Therefore, the behavior data is updated quickly, and the probability value obtained according to the behavior data of the instant update may reflect that the behavior of the first account is not satisfied.
  • Set the probability of the condition The probability value that does not satisfy the preset condition may be the probability of default, such as not complying with the contract Definite behavior, etc.
  • the account chat behavior of the user A in the instant messaging application may be collected.
  • Extracting the first feature variable from the behavior data separately can obtain the first feature variable of different categories.
  • the first feature variable of the instant communication class, the first feature variable of the video class, and the first feature variable of the download class all of the first feature variables of the different classes are input into the data analysis model, and the first value is output. It is also possible to input portions of the first characteristic variables of different classes into the data analysis model.
  • the friend of the user A has a similarity with the user A, and the behavior data of the friend of the user A can also reflect the probability that the behavior of the user A does not satisfy the preset condition. Therefore, when the first feature variable is input to the data analysis model, the feature variable associated with the friend of the user A can also be input at the same time.
  • the inputting the first feature variable into the data analysis model includes: acquiring the second feature variable, wherein the second feature variable is used to represent behavior characteristics of the plurality of second account accounts having an association relationship with the first account; And the second characteristic variable is input to the data analysis model, wherein the data analysis model is further configured to output the first value according to the first feature variable and the second feature variable.
  • the method of obtaining the second feature variable is the same as the method of acquiring the first feature variable, which will be described in detail later.
  • the first account with the associated relationship and the plurality of the second account are friends, and the plurality of second accounts are friends of the first account.
  • Both the online behavior and the offline behavior in the above example can be mapped to the behavior of an application account through a certain correspondence.
  • the second account registers the navigation service and the instant messaging application by using the mobile phone number, and collects the behavior data of the second account in the navigation service and the behavior of the second account in the instant messaging application when acquiring the behavior data of the second account. data.
  • inputting the first feature variable and the second feature variable to the data analysis model includes: acquiring intimacy between the plurality of second account accounts and the first account, wherein the intimacy is according to each second The interaction behavior between the account and the first account is generated; the third feature variable is obtained according to the intimacy and the second feature variable by using the following formula:
  • ⁇ ' f(( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n ),( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n )),
  • ⁇ ' represents the third characteristic variable
  • i represents the i-th second account
  • ⁇ i is the intimacy of the i-th second account and the first account
  • ⁇ i is the second characteristic variable of the i-th second account
  • f is a weighted average of the second characteristic variable and the intimacy of the first n second accounts in the order of indicating the high to low intimacy
  • the first characteristic variable and the third characteristic variable are input to the data analysis model .
  • the second feature variable of the second account is processed to make it better reflect the behavior characteristics of the first account. Therefore, when the second variable is acquired, each second characteristic variable is multiplied by the corresponding weight value, and then the weighted average is performed.
  • the weight value indicates the intimacy of the first account and the second account. The closer the first account is to the second account, the greater the weight value; conversely, the smaller the weight value.
  • the intimacy can be measured by the interaction between the first account and the second account. For example, the more chats between the first account and the second account, the more intimate the relationship. The higher the degree of overlap between the first account and the second account, the more intimate the relationship between the two accounts. The intimacy and coincidence can be obtained by means of a training model.
  • Interactive behaviors include interactions in the circle of friends, payment interactions (such as red envelopes), and sports interactions (such as walking 10,000 steps).
  • Intimacy can be reflected by the above information interaction, including the number of times the information is sent and received, the number of days, etc., as well as the ratio of sending and receiving information, the number of times of information interaction every day.
  • This information includes text information, video information, and voice information.
  • Intimacy can also be obtained by commenting, praising, marking a friend as a special friend, giving a gift, or pulling black.
  • the third characteristic variable That is, the intimacy is a weighted average of the second characteristic variable and the intimacy of the top 10 friends arranged in order of intimacy.
  • the general characteristics of a group can reflect the characteristics of a certain user in this group. Therefore, the probability value of the behavior that does not satisfy the preset condition can be obtained according to the characteristics of a group, and the credit degree of the user can be more accurately reflected.
  • the second feature of acquiring multiple second accounts is obtained.
  • the first n second accounts of the intimacy are selected according to the intimacy of the first account and the first account, and the third feature variable is generated according to the intimacy and the second feature variable.
  • the abnormal data may be data that is obviously beyond a certain range. For example, the normal person's age does not exceed one hundred. If the collected data shows that the age is 100, the abnormal data is deleted. If the collected data shows that the age includes 0 and 49, 0 and 49 are in the range of 0 to 100, however, most of the other data are between 18 and 45, so 0 and 49 belong to the singular point with a large fluctuation value.
  • behavioral data is divided into multiple dimensions based on data sources and business characteristics. For example, basic information, social interaction, financial management, etc., are classified and integrated into the database.
  • basic information, social interaction, financial management, etc. are classified and integrated into the database.
  • the above basic attributes, social interaction behavior, purchase behavior, taxi behavior and friend attributes can all reflect the behavior characteristics of the first account.
  • acquiring the first feature variable of the first account according to the behavior data includes: acquiring an information gain of the feature in the behavior data, the information gain is used to indicate the amount of information included in the behavior data; and determining whether the information gain is Within the preset value range; if the information gain is within the preset value range, the derived variable is constructed according to the behavior data, wherein the derived variable is the merged or split behavior data; if the information gain is outside the preset value range, Then, the feature corresponding to the information gain outside the preset value range is deleted, and the derivative variable is constructed according to the remaining features; the derived variable is used as the first feature variable.
  • deleting a feature corresponding to an information gain outside a preset value range, and then constructing a derivative variable according to the remaining feature includes: deleting a feature corresponding to an information gain outside a preset value range, Obtaining the correlation coefficient of the remaining features; combining the features whose correlation coefficient is greater than or equal to the preset coefficient into one merge feature; and using the merged feature as a derivative variable.
  • the collected behavior data including the number of text chats, the number of voice calls, the amount of payment, etc., are all characteristics in the behavior data. For example, text chat 9 times, voice call 10 times and payment amount 100, the numbers are called feature values.
  • the information gain can reflect the amount of information of a feature. If the amount of information is less than the threshold, the feature can be deleted. For example, the information gain is sorted for each type of feature, and the feature with the information gain less than the threshold is deleted. Then, the correlation of the remaining features is detected. If there are more relevant features, the features with stronger correlation are combined to obtain the first feature variable. If a feature is weakly correlated and highly significant, then this feature can be refined into multiple features. For example, split the number of chats into evening chats, daytime chats, weekend chats, and weekday chats. Number of days, etc. Conversely, you can combine night chats and day chats into chats.
  • the behavior data can be flexibly split and merged to construct the first feature variable, and when splitting and merging, the same or different methods can be used for multiple features (for example, some features adopt the principal component)
  • the analysis method and other features using clustering methods, etc.) increase the flexibility of constructing the first feature variable.
  • each subcategory when the data analysis model is established, each subcategory may be generated corresponding to each category according to the category classified by the behavior data, and each submodel can output a first sub-value. Processing these first sub-values yields the first value of the data analysis model output. Further, when the sub-model is established, the sub-model may be trained according to the sample data of each category, and each category may be further divided, and a low-level model is established for the divided data, and multiple low-level models are constructed. The submodel, and then the submodel constitutes the data analysis model.
  • the method before inputting the first feature variable and the second feature variable to the data analysis model, the method further includes: dividing the behavior data into a plurality of categories; respectively for each of the plurality of categories The category establishes a sub-model, wherein each sub-model is configured to output a first sub-value according to the first characteristic variable and/or the second characteristic variable, wherein the first sub-value is used to represent the category corresponding to the sub-model,
  • the behavior of an account does not satisfy the probability value of the preset condition; and the plurality of sub-models corresponding to the plurality of categories are constructed as a data analysis model.
  • establishing a sub-model for each of the plurality of categories includes: establishing a sub-model for each category using the same or different training models; or using the same or different training The model establishes a low-level model for each subcategory under each category, and constructs a low-level model corresponding to multiple subcategories under each category as a sub-model.
  • the training models used to build sub-models in each category can be the same or different. For example, among the 10 categories, 5 categories use the decision tree training model, and the other 5 use the neural network training sub-model. type.
  • the constructing the plurality of sub-models corresponding to the plurality of categories into the data analysis model comprises: constructing the plurality of sub-models into the data analysis model in the following manner:
  • P is the first value
  • i is the i-th sub-model of the plurality of sub-models
  • n is the number of the plurality of sub-models
  • P i ' is the first sub-value output of the i-th sub-model
  • P 0 is a constant feature
  • dividing the behavior data into a plurality of categories includes: dividing the behavior data into a plurality of categories according to the type of the service included in the behavior data; or dividing the data including the target object in the behavior data into one category, and not in the behavior data The data including the target object is divided into another category.
  • the three division methods may use any one of them to separately construct the sub-model, or may construct the sub-model by any two or three combinations. For example, first establish a sub-model according to whether or not the target object is divided, and then divide the sub-model below the sub-model according to the business type.
  • the sub-services mainly refer to the previous data categories, such as basic information, value-added services, social interactions, economic behaviors, etc.; grouping is mainly based on business characteristics. For example, in economic activities, there are credit cards and users without credit cards in payment, shopping, wealth management, etc. The behavioral performance is quite different, so it can be divided into two groups to build the model separately.
  • the layering is mainly at the level of the whole model architecture. For example, the sub-model layer can also be divided into multiple dimension layers, each layer is adopted. Machine learning algorithms can be quite different.
  • the data processing method of this embodiment is mainly divided into four parts, including data acquisition, data processing, feature mining and model construction.
  • Data collection This includes collecting online data and offline scene data.
  • Online data includes data on games, finance, apps, shopping, social, and education, such as game titles, purchase amounts, and more.
  • Offline scene data includes data such as life, navigation, travel, check-in, medical, and sports. For example, medical records, booking hotels, tourist locations, etc.
  • Data processing Includes cleaning, integration, and standardization. Cleaning includes deduplication, deletion of singularities, removal of abnormal data and information supplementation, integration includes dividing the same category of data into the same category, and normalization includes normalization of data types and normalization of storage data structures.
  • the processed data is mined, for example, using graph calculation and text mining methods.
  • the characteristics of mining include data on user basic information, social interaction, personality traits, hobbies, emotional orientation, life circle, physical health and financial management.
  • Model construction Classify the mined features and build a model for each category. For example, social interaction classes, hobbies, health and personality. Create a model for each category type. And each model can be obtained using different learning and training methods.
  • the characteristics of the social interaction class can also be subdivided into chat features, phone features and video features. After building the submodel, the total model is obtained. The first feature variable and the third feature variable are then input into the sub-model to obtain a first value of the total model output.
  • the first feature variable includes the feature a1, the feature a2, and the feature a3.
  • the feature a1, the feature a2, and the feature a3 and the feature b1, the feature b2, and the feature b3 are three pairs of features that sequentially correspond.
  • feature a1 represents the payment amount of the first account
  • feature b1 represents the payment amount of the second account
  • feature a2 represents the game type of the first account
  • feature b2 represents the game type of the second account
  • feature a3 represents the movement of the first account.
  • the number of times, feature b3 represents the number of times of movement of the second account.
  • the first value in order to improve the readability of the first value, is converted into a credit program capable of embodying the first account.
  • the first value represents the probability value of the first account default, and after converting to the third value, the credit level of the first account may be indicated. That is, after recording the first value output by the data analysis model, the method further comprises: converting the first value to the third value S by using the following method:
  • S is used to indicate the degree to which the behavior of the first account satisfies the preset condition
  • b represents a reference value
  • p represents a first value
  • st represents a step size
  • the features used comprehensively cover the online and offline behavior characteristics of the user, including not only basic user information, social interaction, financial activities, hobbies, life circles, but also deepening the user's personality characteristics, emotional inclination, etc. It is more able to characterize the stable features of the user's mental outlook and personality.
  • the multi-layered machine learning algorithm is adopted, which can improve the complexity and predictive ability of the algorithm while improving the accuracy of the user's credit program.
  • the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation.
  • the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
  • the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
  • a data processing apparatus for implementing the data processing method
  • the data processing apparatus is mainly used to perform the data processing method provided by the foregoing content of the embodiment of the present invention, and the following is an embodiment of the present invention.
  • the data processing device provided is specifically introduced:
  • the data processing apparatus mainly includes an acquisition unit 10, an acquisition unit 20, an input unit 30, and a recording unit 40.
  • the collecting unit 10 is configured to collect behavior data of the first account, and the behavior data includes online behavior data and offline behavior data based on the Internet.
  • the obtaining unit 20 is configured to obtain a first feature variable of the first account according to the behavior data, where the first feature variable is used to represent a behavior feature of the first account.
  • the input unit 30 is configured to input the first feature variable into the data analysis model, wherein the data analysis
  • the model is configured to output a first value according to the first feature variable, where the first value is used to indicate a probability value that the behavior of the first account does not satisfy the preset condition.
  • the recording unit 40 is for recording the first value of the data analysis model output.
  • the first characteristic variable is used to represent the behavior characteristic of the first account, and the behavior characteristic of the first account is obtained based on the behavior data of the first account based on the Internet, and then the first feature variable is input into the data analysis model, and the first account is obtained.
  • the behavior does not satisfy the probability value of the preset condition. Since the behavior data of the first account in the social application can cover the behavior of the first account relatively widely, the behavior data input into the data analysis model can fully reflect the behavior of the first account, thereby causing the analysis of the first account. The probability value that the behavior does not satisfy the preset condition is more accurate.
  • the behavior data includes online behavior data and offline behavior data of the first account based on the Internet.
  • Behavioral data includes actions in a variety of online and offline scenarios, including behavioral data for all aspects of life. Therefore, the probability values obtained from these behavioral data also more accurately reflect the true probability value of the account.
  • the behavior data changes, it will be immediately fed back to the server or instantly obtained by the server. Therefore, the behavior data is updated quickly, and the probability value obtained according to the behavior data of the instant update may reflect that the current behavior of the first account is not satisfied.
  • Set the probability of the condition The probability value that does not satisfy the preset condition may be a probability of default, such as non-compliance with the contracted behavior.
  • the account chat behavior of the user A in the instant messaging application may be collected.
  • Extracting the first feature variable from the behavior data separately can obtain the first feature variable of different categories.
  • the first feature variable of the instant communication class, the first feature variable of the video class, and the first feature variable of the download class all of the first feature variables of the different classes are input into the data analysis model, and the first value is output. It is also possible to input portions of the first characteristic variables of different classes into the data analysis model.
  • the friend of the user A has a similarity with the user A, and the behavior data of the friend of the user A can also reflect the probability that the behavior of the user A does not satisfy the preset condition. Therefore, when the first feature variable is input to the data analysis model, the feature variable associated with the friend of the user A can also be input at the same time.
  • the input unit includes: a first acquiring subunit, configured to acquire a second feature variable, wherein the second feature variable is used to represent a behavior characteristic of the plurality of second account accounts having an association relationship with the first account; the input subunit, And a method for inputting the first feature variable and the second feature variable to the data analysis model, wherein the data analysis model is further configured to output the first value according to the first feature variable and the second feature variable.
  • the method of obtaining the second feature variable is the same as the method of acquiring the first feature variable, which will be described in detail later.
  • the first account with the associated relationship and the plurality of the second account are friends, and the plurality of second accounts are friends of the first account.
  • Both the online behavior and the offline behavior in the above example can be mapped to the behavior of an application account through a certain correspondence.
  • the second account registers the navigation service and the instant messaging application by using the mobile phone number, and collects the behavior data of the second account in the navigation service and the behavior of the second account in the instant messaging application when acquiring the behavior data of the second account. data.
  • the input subunit includes: a first obtaining module, configured to acquire a closeness between the plurality of second accounts and the first account, wherein the intimacy is generated according to an interaction behavior of each second account and the first account;
  • the third feature variable is obtained from the intimacy and the second characteristic variable using the following formula:
  • ⁇ ' f(( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n ),( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n )),
  • ⁇ ' represents the third characteristic variable
  • i represents the i-th second account
  • ⁇ i is the intimacy of the i-th second account and the first account
  • ⁇ i is the second characteristic variable of the i-th second account
  • f is a weighted average of the second characteristic variable and the intimacy of the first n second accounts in the order of indicating the intimacy from high to low
  • the input module is configured to use the first feature variable and the third feature variable Enter the data analysis model.
  • the second feature variable of the second account is processed to make it better reflect the behavior characteristics of the first account. Therefore, when the second variable is acquired, each second characteristic variable is multiplied by the corresponding weight value, and then the weighted average is performed.
  • the weight value indicates the intimacy of the first account and the second account. The closer the first account is to the second account, the greater the weight value; conversely, the smaller the weight value.
  • the intimacy can be measured by the interaction between the first account and the second account. For example, the more chats between the first account and the second account, the more intimate the relationship. The higher the degree of overlap between the first account and the second account, the more intimate the relationship between the two accounts. The intimacy and coincidence can be obtained by means of a training model.
  • Interactive behaviors include interactions in the circle of friends, payment interactions (such as red envelopes), and sports interactions (such as walking 10,000 steps).
  • Intimacy can be reflected by the above information interaction, including the number of times the information is sent and received, the number of days, etc., as well as the ratio of sending and receiving information, the number of times of information interaction every day.
  • This information includes text information, video information, and voice information.
  • Intimacy can also be obtained by commenting, praising, marking a friend as a special friend, giving a gift, or pulling black.
  • the third characteristic variable That is, the intimacy is a weighted average of the second characteristic variable and the intimacy of the top 10 friends arranged in order of intimacy.
  • the general characteristics of a group can reflect the characteristics of a certain user in this group. Therefore, the probability value of the behavior that does not satisfy the preset condition can be obtained according to the characteristics of a group, and the credit degree of the user can be more accurately reflected. It should be noted that, when acquiring the second feature variable of the plurality of second accounts, first selecting the intimacy ranking of the first n second accounts according to the intimacy of the first account and the first account, and then according to the intimacy And generating a third feature variable with the second feature variable.
  • the abnormal data may be data that is obviously beyond a certain range. For example, usually the age of the person does not exceed one hundred, if the collected data shows the age is 100, the exception data is deleted. If the collected data shows that the age includes 0 and 49, 0 and 49 are in the range of 0 to 100, however, most of the other data are between 18 and 45, so 0 and 49 belong to the singular point with a large fluctuation value.
  • behavioral data is divided into multiple dimensions based on data sources and business characteristics. For example, basic information, social interaction, financial management, etc., are classified and integrated into the database.
  • basic information, social interaction, financial management, etc. are classified and integrated into the database.
  • the above basic attributes, social interaction behavior, purchase behavior, taxi behavior and friend attributes can all reflect the behavior characteristics of the first account.
  • the acquiring unit includes: an acquiring subunit, configured to acquire an information gain of a feature in the behavior data, where the information gain is used to represent the amount of information included in the behavior data; a determining subunit for determining whether the information gain is within a preset value range; constructing a subunit for constructing a derivative variable according to the behavior data when the information gain is within a preset value range, wherein the derived variable is a merge or Decomposed behavior data; delete sub-units, when the information gain is outside the preset value range, delete the feature corresponding to the information gain outside the preset value range, and then construct the derivative variable according to the remaining features; determine the sub-unit Used to use the derived variable as the first feature variable.
  • the deleting the subunit includes: a second acquiring module, configured to acquire a correlation coefficient of the remaining feature after deleting the feature corresponding to the information gain outside the preset numerical range; The feature that the correlation coefficient is greater than or equal to the preset coefficient is merged into one merge feature; the determining module is configured to use the merged feature as a derivative variable.
  • the collected behavior data including the number of text chats, the number of voice calls, the amount of payment, etc., are all characteristics in the behavior data. For example, text chat 9 times, voice call 10 times and payment amount 100, the numbers are called feature values.
  • the information gain can reflect the amount of information of a feature. If the amount of information is less than the threshold, the feature can be deleted. For example, the information gain is sorted for each type of feature, and the feature with the information gain less than the threshold is deleted. Then, the correlation of the remaining features is detected. If there are more relevant features, the features with stronger correlation are combined to obtain the first feature variable. If a feature is weakly correlated and highly significant, then this feature can be refined into multiple features. For example, split the number of chats into evening chats, daytime chats, weekend chats, and weekday chats. Conversely, you can combine night chats and day chats into chats.
  • the behavior data can be flexibly split and merged to construct the first feature variable, and when splitting and merging, the same or different methods can be used for multiple features (for example, some features adopt the principal component)
  • the analysis method and other features using clustering methods, etc.) increase the flexibility of constructing the first feature variable.
  • the apparatus further includes: a dividing unit, configured to divide the behavior data into a plurality of categories before inputting the first feature variable and the second feature variable to the data analysis model; a sub-model for each of the plurality of categories, wherein each sub-model is configured to output a first sub-value according to the first characteristic variable and/or the second characteristic variable, wherein the first sub-value is used In the category corresponding to the sub-model, the behavior of the first account does not satisfy the probability value of the preset condition; and the second establishing unit is configured to construct the plurality of sub-models corresponding to the plurality of categories as the data analysis model.
  • the first establishing unit includes: a first establishing subunit, configured to respectively establish a submodel for each category by using the same or different training models; or a second establishing subunit, for A low-level model is established for each subcategory under each category by using the same or different training models, and a low-level model corresponding to multiple subcategories under each category is constructed as a sub-model.
  • the training models used to build sub-models in each category can be the same or different. For example, among the 10 categories, 5 categories use the decision tree training model, and the other 5 use the neural network training sub-model.
  • the second establishing unit is further configured to construct multiple sub-models into a data analysis model in the following manner:
  • P always represents the first value
  • i is the i-th sub-model of the plurality of sub-models
  • n is the number of the plurality of sub-models
  • P i ' is the first sub-value output of the i-th sub-model
  • P 0 is a constant.
  • the dividing unit includes: a first dividing subunit, configured to divide the behavior data into multiple categories according to a service type included in the behavior data; or a second dividing subunit, configured to perform the behavior Data in the data including the target object is divided into one class, which will behave The data in the data that does not include the target object is divided into another category.
  • the apparatus further includes: a converting unit, configured to convert the first value into the third value S by using the following method after recording the first value output by the data analysis model:
  • S is used to indicate the degree to which the behavior of the first account satisfies the preset condition
  • b represents a reference value
  • p represents a first value
  • st represents a step size
  • the features used comprehensively cover the online and offline behavior characteristics of the user, including not only basic user information, social interaction, financial activities, hobbies, life circles, but also deepening the user's personality characteristics, emotional inclination, etc. It is more able to characterize the stable features of the user's mental outlook and personality.
  • the multi-layered machine learning algorithm is adopted, which can improve the complexity and predictive ability of the algorithm while improving the accuracy of the user's credit program.
  • a server for implementing the above data processing method is further provided.
  • the server mainly includes a processor 501, a data interface 503, a memory 505, and a network interface 507, where:
  • the data interface 503 transmits the behavior data acquired by the third party tool to the processor 501 mainly by means of data transmission.
  • the memory 505 is mainly used to store behavior data and data analysis models.
  • the network interface 507 is mainly used for network communication with the server, and obtains behavior data provided by the terminal from other servers.
  • the processor 501 is mainly configured to perform the following operations:
  • the behavior data includes online behavior data and offline behavior data based on the Internet; and acquiring the first feature of the first account according to the behavior data And the first feature variable is used to represent a behavior characteristic of the first account; the first feature variable is input to a data analysis model, wherein the data analysis model is used according to the first feature variable And outputting a first value, where the first value is used to indicate a probability value that the behavior of the first account does not satisfy a preset condition; and the first value output by the data analysis model is recorded.
  • the processor 501 is further configured to acquire a second feature variable, where the second feature variable is used to represent behavior characteristics of a plurality of second account accounts that have an association relationship with the first account; the first feature variable and The second characteristic variable is input to the data analysis model, wherein the data analysis model is further configured to output the first value according to the first feature variable and the second feature variable.
  • the processor 501 is further configured to acquire the intimacy between the plurality of second accounts and the first account, where the intimacy is generated according to an interaction behavior of each of the second accounts and the first account. Obtaining a third characteristic variable according to the intimacy and the second characteristic variable by using the following formula:
  • ⁇ ' f(( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n ),( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n )),
  • ⁇ ' denotes the third characteristic variable
  • i denotes the i-th second account
  • ⁇ i is the intimacy of the i-th second account and the first account
  • ⁇ i is the ith second
  • f is a weighted average value of the second characteristic variable and the intimacy of the first n second account numbers in the order of indicating the intimacy from high to low
  • the first feature variable and the third feature variable are input to the data analysis model.
  • Embodiments of the present invention also provide a storage medium.
  • the above storage medium may be used to store program codes of the data processing method of the embodiment of the present invention.
  • the foregoing storage medium may be located in a mobile communication network, a wide area network, or a metropolitan area. At least one of a plurality of network devices in a network of a network or a local area network.
  • the storage medium is arranged to store program code for performing the following steps:
  • S1 Collect behavior data of the first account, where the behavior data includes online behavior data and offline behavior data based on the Internet.
  • the first feature variable is input to a data analysis model, wherein the data analysis model is configured to output a first value according to the first feature variable, where the first value is used to represent behavior of the first account The probability value that does not satisfy the preset condition.
  • the storage medium may include, but is not limited to, a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a mobile hard disk, and a magnetic memory.
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • a mobile hard disk and a magnetic memory.
  • the processor performs acquiring the second feature variable according to the stored program code in the storage medium, where the second feature variable is used to indicate that the relationship relationship with the first account is Behavioral characteristics of the second account; inputting the first feature variable and the second feature variable to the data analysis model, wherein the data analysis model is further configured to use the first feature variable and the The second characteristic variable outputs the first value.
  • the processor performs, according to the stored program code in the storage medium, acquiring the intimacy between the plurality of second accounts and the first account, wherein the intimacy is according to each The interaction behavior of the second account with the first account is generated; and the third feature variable is obtained according to the intimacy and the second feature variable by using the following formula:
  • ⁇ ' f(( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n ),( ⁇ 1 , ⁇ 2 ,..., ⁇ i ,..., ⁇ n )),
  • ⁇ ' represents the third characteristic variable
  • i represents the i-th second account
  • ⁇ i is the intimacy of the i-th second account and the first account
  • ⁇ i is the ith second
  • f is a weighted average value of the second characteristic variable and the intimacy of the first n second account numbers in the order of indicating the intimacy from high to low
  • the first feature variable and the third feature variable are input to the data analysis model.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are configured to execute the data processing method described above.
  • the integrated unit in the above embodiment if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in the above-described computer readable storage medium.
  • the technical solution of the present invention may contribute to the prior art or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium.
  • a number of instructions are included to cause one or more computer devices (which may be a personal computer, server or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the disclosed client may be implemented in other manners.
  • the device embodiments described above are merely illustrative.
  • the division of the unit is only a logical function division.
  • multiple units or components may be combined or may be Integrate into another system, or some features can be ignored or not executed.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be through some interface, indirect coupling of the unit or module or The communication connection can be in electrical or other form.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the first feature variable is used to represent the behavior characteristic of the first account, and the behavior characteristic of the first account is obtained based on the behavior data of the first account based on the Internet, and then the first feature variable is input into the data analysis model. , the probability value that the behavior of the first account does not satisfy the preset condition can be obtained. Since the behavior data of the first account in the social application can cover the behavior of the first account relatively widely, the behavior data input into the data analysis model can fully reflect the behavior of the first account, thereby causing the analysis of the first account. The probability value that the behavior does not satisfy the preset condition is more accurate, thereby solving the technical problem that the credit level of the account cannot be accurately obtained.

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

La présente invention concerne un dispositif et un procédé de traitement de données, et un support d'informations informatique. Le procédé consiste : à collecter des données de comportement d'un premier compte, les données de comportement comprenant des données de comportement en ligne basées sur Internet et des données de comportement hors ligne (S202) ; à obtenir une première variable caractéristique du premier compte conformément aux données de comportement (S204), la première variable caractéristique étant utilisée pour représenter les caractéristiques de comportement du premier compte ; à entrer la première variable caractéristique dans un modèle d'analyse de données (S206), le modèle d'analyse de données étant utilisé pour émettre une première valeur numérique conformément à la première variable caractéristique, et la première valeur numérique est utilisée pour représenter la valeur de probabilité qu'un comportement du premier compte ne remplit pas une condition prédéfinie ; et à enregistrer la première valeur numérique émise par le modèle d'analyse de données (S208).
PCT/CN2016/109729 2016-05-25 2016-12-13 Dispositif et procédé de traitement de données, et support d'informations informatique WO2017202006A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610354926.X 2016-05-25
CN201610354926.XA CN106056444A (zh) 2016-05-25 2016-05-25 数据处理方法和装置

Publications (1)

Publication Number Publication Date
WO2017202006A1 true WO2017202006A1 (fr) 2017-11-30

Family

ID=57174694

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/109729 WO2017202006A1 (fr) 2016-05-25 2016-12-13 Dispositif et procédé de traitement de données, et support d'informations informatique

Country Status (2)

Country Link
CN (1) CN106056444A (fr)
WO (1) WO2017202006A1 (fr)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110874645A (zh) * 2019-11-14 2020-03-10 北京首汽智行科技有限公司 一种数据还原方法
CN111178934A (zh) * 2019-11-29 2020-05-19 北京深演智能科技股份有限公司 获取目标对象的方法及装置
CN111539532A (zh) * 2020-04-01 2020-08-14 深圳市魔数智擎人工智能有限公司 一种面向模型构建的特征自动化衍生方法
CN111598159A (zh) * 2020-05-14 2020-08-28 清华大学 机器学习模型的训练方法、装置、设备及存储介质
CN111652259A (zh) * 2019-04-16 2020-09-11 上海铼锶信息技术有限公司 一种清洗数据的方法及系统
CN112883689A (zh) * 2020-11-27 2021-06-01 苏宁消费金融有限公司 一种征信二代信用报告指衍生变量加工方法
US11436430B2 (en) 2017-02-13 2022-09-06 Tencent Technology (Shenzhen) Company Limited Feature information extraction method, apparatus, server cluster, and storage medium

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106056444A (zh) * 2016-05-25 2016-10-26 腾讯科技(深圳)有限公司 数据处理方法和装置
CN108066990B (zh) * 2016-11-18 2021-01-22 腾讯科技(深圳)有限公司 一种从候选用户列表中选择用户的方法、装置及服务器
CN106775268B (zh) * 2016-11-28 2020-10-30 浙江翼信科技有限公司 一种消息显示方法和设备
CN108322317B (zh) * 2017-01-16 2022-07-29 腾讯科技(深圳)有限公司 一种账号识别关联方法及服务器
CN108280757B (zh) * 2017-02-13 2021-08-17 腾讯科技(深圳)有限公司 用户信用评估方法及装置
CN106683680B (zh) * 2017-03-10 2022-03-25 百度在线网络技术(北京)有限公司 说话人识别方法及装置、计算机设备及计算机可读介质
CN108734565B (zh) * 2017-04-14 2020-11-17 腾讯科技(深圳)有限公司 一种征信分实时调整处理方法、装置及处理服务器
CN108805689A (zh) * 2017-04-26 2018-11-13 腾讯科技(深圳)有限公司 一种贷款风险评估控制方法及装置
CN108510071B (zh) * 2017-05-10 2020-01-10 腾讯科技(深圳)有限公司 数据的特征提取方法、装置及计算机可读存储介质
CN107871286A (zh) * 2017-07-20 2018-04-03 上海前隆信息科技有限公司 用户与联系人间亲密度判定方法/系统、存储介质及设备
CN109427010B (zh) * 2017-08-31 2022-05-27 腾讯科技(深圳)有限公司 通信费用透支额度分配方法、装置、存储介质和计算机设备
CN107730283A (zh) * 2017-11-03 2018-02-23 中国银行股份有限公司 一种中小企业的征信方法及装置
CN109829593B (zh) * 2017-11-23 2023-05-16 广州腾讯科技有限公司 目标对象的信用度确定方法、装置、存储介质及电子装置
CN109871514B (zh) * 2017-12-05 2022-11-04 财付通支付科技有限公司 一种数据处理方法、装置及存储介质
CN108280759A (zh) * 2018-01-17 2018-07-13 深圳市和讯华谷信息技术有限公司 风控模型优化方法、终端及计算机可读存储介质
CN109191185A (zh) * 2018-08-15 2019-01-11 深圳市和讯华谷信息技术有限公司 一种客群分类方法及系统
TWI709923B (zh) * 2018-10-03 2020-11-11 臺灣土地銀行股份有限公司 行為模式信評系統
CN109657793B (zh) * 2018-12-26 2020-09-22 广州小狗机器人技术有限公司 模型训练方法及装置、存储介质及电子设备

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101493913A (zh) * 2008-01-23 2009-07-29 阿里巴巴集团控股有限公司 一种评估网上用户信用的方法及系统
KR20140047863A (ko) * 2012-10-15 2014-04-23 주식회사 우리은행 신용 상태 평가 방법, 이를 실행하는 서버 및 이를 실행하는 시스템
CN104866969A (zh) * 2015-05-25 2015-08-26 百度在线网络技术(北京)有限公司 个人信用数据处理方法和装置
CN105225149A (zh) * 2015-09-07 2016-01-06 腾讯科技(深圳)有限公司 一种征信评分确定方法及装置
CN105243566A (zh) * 2015-10-28 2016-01-13 联动优势科技有限公司 利用运营商不同手机号信息评估用户信用的办法和装置
CN106056444A (zh) * 2016-05-25 2016-10-26 腾讯科技(深圳)有限公司 数据处理方法和装置

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102880691B (zh) * 2012-09-19 2015-08-19 北京航空航天大学深圳研究院 一种基于用户亲密度的混合推荐系统及方法
CN105389714B (zh) * 2015-10-23 2022-07-05 北京慧辰资道资讯股份有限公司 一种从行为数据识别用户特性的方法
CN105302911B (zh) * 2015-11-10 2018-12-21 珠海多玩信息技术有限公司 一种数据筛选引擎建立方法及数据筛选引擎

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101493913A (zh) * 2008-01-23 2009-07-29 阿里巴巴集团控股有限公司 一种评估网上用户信用的方法及系统
KR20140047863A (ko) * 2012-10-15 2014-04-23 주식회사 우리은행 신용 상태 평가 방법, 이를 실행하는 서버 및 이를 실행하는 시스템
CN104866969A (zh) * 2015-05-25 2015-08-26 百度在线网络技术(北京)有限公司 个人信用数据处理方法和装置
CN105225149A (zh) * 2015-09-07 2016-01-06 腾讯科技(深圳)有限公司 一种征信评分确定方法及装置
CN105243566A (zh) * 2015-10-28 2016-01-13 联动优势科技有限公司 利用运营商不同手机号信息评估用户信用的办法和装置
CN106056444A (zh) * 2016-05-25 2016-10-26 腾讯科技(深圳)有限公司 数据处理方法和装置

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11436430B2 (en) 2017-02-13 2022-09-06 Tencent Technology (Shenzhen) Company Limited Feature information extraction method, apparatus, server cluster, and storage medium
CN111652259A (zh) * 2019-04-16 2020-09-11 上海铼锶信息技术有限公司 一种清洗数据的方法及系统
CN111652259B (zh) * 2019-04-16 2024-03-08 上海铼锶信息技术有限公司 一种清洗数据的方法及系统
CN110874645A (zh) * 2019-11-14 2020-03-10 北京首汽智行科技有限公司 一种数据还原方法
CN111178934A (zh) * 2019-11-29 2020-05-19 北京深演智能科技股份有限公司 获取目标对象的方法及装置
CN111178934B (zh) * 2019-11-29 2024-03-08 北京深演智能科技股份有限公司 获取目标对象的方法及装置
CN111539532A (zh) * 2020-04-01 2020-08-14 深圳市魔数智擎人工智能有限公司 一种面向模型构建的特征自动化衍生方法
CN111598159A (zh) * 2020-05-14 2020-08-28 清华大学 机器学习模型的训练方法、装置、设备及存储介质
CN111598159B (zh) * 2020-05-14 2024-04-26 清华大学 机器学习模型的训练方法、装置、设备及存储介质
CN112883689A (zh) * 2020-11-27 2021-06-01 苏宁消费金融有限公司 一种征信二代信用报告指衍生变量加工方法

Also Published As

Publication number Publication date
CN106056444A (zh) 2016-10-26

Similar Documents

Publication Publication Date Title
WO2017202006A1 (fr) Dispositif et procédé de traitement de données, et support d'informations informatique
US11164105B2 (en) Intelligent recommendations implemented by modelling user profile through deep learning of multimodal user data
CN110313009B (zh) 用于为请求实体调整第二实体的信任得分的方法和系统
US11995112B2 (en) System and method for information recommendation
TWI599901B (zh) 用於更新信任分數之方法及系統
US10127522B2 (en) Automatic profiling of social media users
IL261207A (en) A method and system for searching for entities based on a score of trust and geography
US20180285936A1 (en) Intelligent visual object management system
US20180033027A1 (en) Interactive user-interface based analytics engine for creating a comprehensive profile of a user
WO2019084922A1 (fr) Procédé et système de traitement d'informations, serveur, terminal et support de stockage informatique
CN111177473B (zh) 人员关系分析方法、装置和可读存储介质
US11704495B2 (en) Prediction of film success-quotient
Dhingra et al. Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop
US10497045B2 (en) Social network data processing and profiling
US11436446B2 (en) Image analysis enhanced related item decision
Masyutin Credit scoring based on social network data
Ilakkuvan et al. Cameras for public health surveillance: a methods protocol for crowdsourced annotation of point-of-sale photographs
CN114491255A (zh) 推荐方法、系统、电子设备和介质
CN109829593B (zh) 目标对象的信用度确定方法、装置、存储介质及电子装置
US11373057B2 (en) Artificial intelligence driven image retrieval
US20240080280A1 (en) Understanding social media user behavior
US10762154B2 (en) Relative weighting for social collaboration comments
CN117795502A (zh) 消息传递系统中主题的演进
CN114529399A (zh) 用户数据处理方法、装置、计算机设备和存储介质
US20190304040A1 (en) System and Method for Vetting Potential Jurors

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16902990

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16902990

Country of ref document: EP

Kind code of ref document: A1