WO2022089190A1 - Product recommendation method and apparatus, and electronic device and readable storage medium - Google Patents

Product recommendation method and apparatus, and electronic device and readable storage medium Download PDF

Info

Publication number
WO2022089190A1
WO2022089190A1 PCT/CN2021/123176 CN2021123176W WO2022089190A1 WO 2022089190 A1 WO2022089190 A1 WO 2022089190A1 CN 2021123176 W CN2021123176 W CN 2021123176W WO 2022089190 A1 WO2022089190 A1 WO 2022089190A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
recommended
product
historical data
grouping
Prior art date
Application number
PCT/CN2021/123176
Other languages
French (fr)
Chinese (zh)
Inventor
张�杰
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2022089190A1 publication Critical patent/WO2022089190A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/24323Tree-organised classifiers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance

Definitions

  • the present application relates to the field of artificial intelligence technology and data analysis, and in particular, to a product recommendation method, device, electronic device, and readable storage medium.
  • the product recommendations provided in this application include:
  • Parse the product recommendation request sent by the user based on the client obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
  • a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  • the present application also provides a product recommendation device, the device comprising:
  • the grouping module is used for obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain multiple user groups;
  • a calculation module configured to calculate the product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data
  • the parsing module is configured to parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, and obtain the second historical data of the user to be recommended from the database based on the identifier , and determine the target user group corresponding to the user to be recommended;
  • a determination module configured to obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into The attribute analysis model obtains the target attribute value of the user to be recommended;
  • a recommendation module configured to recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group when the target attribute value is less than a preset threshold.
  • the present application also provides an electronic device, the electronic device comprising:
  • the memory stores a product recommendation program executable by the at least one processor, and the product recommendation program is executed by the at least one processor to enable the at least one processor to perform the following steps:
  • Parse the product recommendation request sent by the user based on the client obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
  • a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  • the present application also provides a computer-readable storage medium, where a product recommendation program is stored on the computer-readable storage medium, and the product recommendation program can be executed by one or more processors to implement the following steps:
  • Parse the product recommendation request sent by the user based on the client obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
  • a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  • FIG. 1 is a schematic flowchart of a product recommendation method provided by an embodiment of the present application.
  • FIG. 2 is a schematic diagram of a module of a product recommendation device provided by an embodiment of the present application
  • FIG. 3 is a schematic structural diagram of an electronic device for implementing a product recommendation method provided by an embodiment of the present application
  • the embodiments of the present application may acquire and process related data based on artificial intelligence technology.
  • Artificial Intelligence is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
  • the basic technologies of artificial intelligence generally include technologies such as sensors, special artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, and mechatronics.
  • Artificial intelligence software technology mainly includes computer vision technology, robotics technology, biometrics technology, speech processing technology, natural language processing technology, and machine learning/deep learning.
  • the present application provides a product recommendation method.
  • FIG. 1 it is a schematic flowchart of a product recommendation method according to an embodiment of the present application.
  • the method may be performed by an electronic device, which may be implemented by software and/or hardware.
  • the product recommendation method includes:
  • the first historical data includes the basic information and asset information of the user
  • the first characteristic factor includes the user's gender, age, educational background, age of accounts, holdings in the basic information.
  • the first feature is obtained by splicing the feature values corresponding to each feature in the basic information and the asset information.
  • the grouping of each user based on the first feature to obtain a plurality of user groups includes:
  • K-means clustering algorithm uses the K-means clustering algorithm to group each user based on the first feature, where K respectively takes a value of each natural number within a preset value range, and a value of K corresponds to a grouping result, and obtains multiple grouping results;
  • K represents the number of user groups.
  • K is any natural number from 3 to 10, then it can be divided into 3 user groups, 4 user groups, ..., 9 user groups, Divided into 10 user groups with a total of 8 grouping results.
  • A2 determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the corresponding silhouette coefficient of each kind of grouping result based on the first feature of the center user;
  • the existing calculation method of the silhouette coefficient has high complexity, wherein the calculation formula of the silhouette coefficient corresponding to the user is:
  • S ij represents the silhouette coefficient corresponding to the j th user in the ith grouping result, represents the average distance from the first feature of the jth user in the i-th grouping result to the first features of other users in the same user group, Indicates the minimum value of the average distance between the first feature of the jth user and the first features of other user groups in the ith grouping result.
  • S i represents the contour coefficient corresponding to the i-th grouping result
  • S ij represents the contour coefficient corresponding to the j-th user in the i-th grouping result
  • n represents the total number of users.
  • the silhouette coefficient is an evaluation method for the quality of the grouping results, which reflects the cohesion and separation of the clustering method. If the inner clustering of the same cluster is higher, and the separation degree of different clusters is higher, the clustering effect is better, and the closer S ij is to 1, it means The smaller the value, the better the clustering effect.
  • the present application proposes a simplified method for calculating the silhouette coefficient.
  • the centroid of each cluster is known due to the K-means algorithm.
  • the centroid is the core point of each cluster, and other points in the cluster are close to it, so the centroid can approximately represent the entire cluster.
  • the determining of the central user (ie the centroid) of each user group corresponding to each of the multiple grouping results includes:
  • the central user of each user group may also be determined in other manners, and the present application does not limit the determination manner of the central user.
  • S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result, represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group, Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
  • the present application simplifies the complexity of calculating the silhouette coefficient from n 2 to mn, where n is the total number of users and m is the number of user groups. Therefore, the present application greatly improves the grouping efficiency.
  • the grouping result with the contour coefficient closest to the preset value is used as the target grouping result.
  • the preset value is 1, and the grouping result with the contour coefficient closest to 1 is used as the target grouping result.
  • the calculating, based on the first historical data, an array of product penetration rates corresponding to each of the multiple user groups includes:
  • the first historical data also includes the user's product purchase information
  • the product penetration rate in the product penetration rate array corresponding to a certain user group is the number of people who purchased each product in the user group and the total users of the user group percentage of the quantity.
  • the product penetration rate array corresponding to user group 1 is ⁇ 20%, 5%, 50%, 2%, 6%, ..., 2% ⁇ . According to the product penetration rate array, different user groups can be analyzed. The degree of preference for different products can be used for product recommendation.
  • the historical data of the user to be recommended can be obtained from the database, it means that the user to be recommended is an existing user, and the target user group information corresponding to the user to be recommended can be obtained based on the user ID.
  • the absolute value of the difference between the recommended user and the first characteristic of the central user of each user group, and the user group to which the central user corresponding to the smallest absolute value of the difference belongs is used as the target user group of the user to be recommended.
  • the method further includes:
  • the user to be recommended is a new user and no data information has been generated. At this time, the preconfigured product list for the new user is recommended to the new user.
  • the attribute analysis model is used to analyze the loss probability of users, and the attribute analysis model is a random forest model.
  • the second feature factor includes volatility feature, trend feature, product hobby feature, and activity feature.
  • time slices are used to construct the second feature of the user, for example, the first 6 months of data are selected from the second historical data. The data and the data of the previous 3 months are used to determine the eigenvalues corresponding to each eigenfactor of the user.
  • the characteristic values corresponding to the volatility characteristics include: the asset range in the past three months (the difference between the maximum asset value and the asset minimum value), the asset standard deviation in the past three months, the asset range in the past six months, and the asset range in the past six months.
  • the standard deviation of assets in 6 months; the characteristic values corresponding to trend characteristics include: asset growth rate in the past 3 months, growth rate of the number of individual gold products held in the past 3 months, asset growth rate in the past 6 months, and recent asset growth rate.
  • the feature values corresponding to the feature include: ageing, average number of transactions in the past 3 months, average number of transactions in the past 6 months, etc.
  • the second feature is obtained by splicing the feature values corresponding to the aforementioned volatility feature, trend feature, product hobby feature, and activity feature, and the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended (ie, loss probability) .
  • the target attribute value that is, the churn probability
  • the preset threshold it means that the probability of user churn is low.
  • the product penetration rate array corresponding to the target user group to which the user to be recommended belongs is obtained, and the products with the highest product penetration rate are used as The target product is recommended to the user to be recommended.
  • the method further includes:
  • the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
  • the preset index is a preset index that has a greater impact on the churn rate, and the user churn can be determined based on the preset index Reasons and recommend suitable products.
  • the standard deviation of assets is a preset indicator. When the standard deviation of assets is greater than the indicator threshold, it means that the assets of the users to be recommended have changed greatly. In order to reduce the standard deviation of assets, you can recommend large-denomination certificates of deposit such as Long-term financial products are recommended to users to be recommended.
  • each user is grouped according to the first feature to obtain a plurality of user groups, and the product penetration rate array corresponding to each user group is calculated based on the first historical data.
  • the degree of preference of different user groups for different products is analyzed, which can be helpful for accurate recommendation; then, the second historical data of the user to be recommended is obtained, and the target user group corresponding to the user to be recommended is determined.
  • the historical data determines the second feature of the user to be recommended, and the second feature is input into the attribute analysis model to obtain the target attribute value (ie, the churn rate) of the user to be recommended.
  • the product corresponding to the target user group The penetration rate array is the target product recommended by the user to be recommended.
  • product recommendation is performed based on the product penetration rate array when the user churn rate is low, so that the success rate of product recommendation is higher. Therefore, the present application improves the success rate of product recommendation.
  • FIG. 2 it is a schematic diagram of a module of a product recommendation apparatus provided by an embodiment of the present application.
  • the product recommendation apparatus 100 described in this application may be installed in an electronic device. According to the implemented functions, the product recommendation apparatus 100 may include a grouping module 110 , a calculation module 120 , a parsing module 130 , a determination module 140 and a recommendation module 150 .
  • the modules described in this application may also be referred to as units, which refer to a series of computer program segments that can be executed by the processor of an electronic device and can perform fixed functions, and are stored in the memory of the electronic device.
  • each module/unit is as follows:
  • the grouping module 110 is configured to obtain the first historical data of each user in the database, determine the first characteristic of each user based on the first characteristic factor and the first historical data, and group each user based on the first characteristic to obtain Multiple user groups.
  • the first historical data includes the basic information and asset information of the user
  • the first characteristic factor includes the user's gender, age, educational background, age of accounts, holdings in the basic information.
  • the first feature is obtained by splicing the feature values corresponding to each feature in the basic information and the asset information.
  • the grouping of each user based on the first feature to obtain a plurality of user groups includes:
  • K-means clustering algorithm uses the K-means clustering algorithm to group each user based on the first feature, where K respectively takes a value of each natural number within a preset value range, and a value of K corresponds to a grouping result, and obtains multiple grouping results;
  • K represents the number of user groups.
  • K is any natural number from 3 to 10, then it can be divided into 3 user groups, 4 user groups, ..., 9 user groups, Divided into 10 user groups with a total of 8 grouping results.
  • A2 determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the corresponding silhouette coefficient of each kind of grouping result based on the first feature of the center user;
  • the existing calculation method of the silhouette coefficient has high complexity, wherein the calculation formula of the silhouette coefficient corresponding to the user is:
  • S ij represents the silhouette coefficient corresponding to the j th user in the ith grouping result, represents the average distance from the first feature of the jth user in the i-th grouping result to the first features of other users in the same user group, Indicates the minimum value of the average distance between the first feature of the jth user and the first features of other user groups in the ith grouping result.
  • S i represents the contour coefficient corresponding to the i-th grouping result
  • S ij represents the contour coefficient corresponding to the j-th user in the i-th grouping result
  • n represents the total number of users.
  • the silhouette coefficient is an evaluation method for the quality of the grouping results, which reflects the cohesion and separation of the clustering method. If the inner clustering of the same cluster is higher, and the separation degree of different clusters is higher, the clustering effect is better, and the closer S ij is to 1, it means The smaller the value, the better the clustering effect.
  • the present application proposes a simplified method for calculating the silhouette coefficient.
  • the centroid of each cluster is known due to the K-means algorithm.
  • the centroid is the core point of each cluster, and other points in the cluster are close to it, so the centroid can approximately represent the entire cluster.
  • the determining of the central user (ie the centroid) of each user group corresponding to each of the multiple grouping results includes:
  • the central user of each user group may also be determined in other manners, and the present application does not limit the determination manner of the central user.
  • S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result, represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group, Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
  • the present application simplifies the complexity of calculating the silhouette coefficient from n 2 to mn, where n is the total number of users and m is the number of user groups. Therefore, the present application greatly improves the grouping efficiency.
  • the grouping result with the contour coefficient closest to the preset value is used as the target grouping result.
  • the preset value is 1, and the grouping result with the contour coefficient closest to 1 is used as the target grouping result.
  • the calculation module 120 is configured to calculate a product penetration rate array corresponding to each user group in the plurality of user groups based on the first historical data.
  • the calculating, based on the first historical data, an array of product penetration rates corresponding to each of the multiple user groups includes:
  • the first historical data also includes the user's product purchase information
  • the product penetration rate in the product penetration rate array corresponding to a certain user group is the number of people who purchased each product in the user group and the total users of the user group percentage of the quantity.
  • the product penetration rate array corresponding to user group 1 is ⁇ 20%, 5%, 50%, 2%, 6%, ..., 2% ⁇ . According to the product penetration rate array, different user groups can be analyzed. The degree of preference for different products can be used for product recommendation.
  • the parsing module 130 is configured to parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, and obtain the second history of the user to be recommended from the database based on the identifier data, and determine the target user group corresponding to the user to be recommended;
  • the historical data of the user to be recommended can be obtained from the database, it means that the user to be recommended is an existing user, and the target user group information corresponding to the user to be recommended can be obtained based on the user ID.
  • the absolute value of the difference between the recommended user and the first characteristic of the central user of each user group, and the user group to which the central user corresponding to the smallest absolute value of the difference belongs is used as the target user group of the user to be recommended.
  • the parsing module 130 is further configured to:
  • the user to be recommended is a new user and no data information has been generated. At this time, the preconfigured product list for the new user is recommended to the new user.
  • the determining module 140 is configured to obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and assign the second characteristic
  • the attribute analysis model is input to obtain the target attribute value of the user to be recommended.
  • the attribute analysis model is used to analyze the loss probability of users, and the attribute analysis model is a random forest model.
  • the second feature factor includes volatility feature, trend feature, product hobby feature, and activity feature.
  • time slices are used to construct the second feature of the user, for example, the first 6 months of data are selected from the second historical data. The data and the data of the previous 3 months are used to determine the eigenvalues corresponding to each eigenfactor of the user.
  • the characteristic values corresponding to the volatility characteristics include: the asset range in the past three months (the difference between the maximum asset value and the asset minimum value), the asset standard deviation in the past three months, the asset range in the past six months, and the asset range in the past six months.
  • the standard deviation of assets in 6 months; the characteristic values corresponding to trend characteristics include: asset growth rate in the past 3 months, growth rate of the number of individual gold products held in the past 3 months, asset growth rate in the past 6 months, and recent asset growth rate.
  • the feature values corresponding to the feature include: ageing, average number of transactions in the past 3 months, average number of transactions in the past 6 months, etc.
  • the second feature is obtained by splicing the feature values corresponding to the aforementioned volatility feature, trend feature, product hobby feature, and activity feature, and the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended (ie, loss probability) .
  • the recommendation module 150 is configured to recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group when the target attribute value is less than a preset threshold.
  • the target attribute value that is, the churn probability
  • the preset threshold it means that the probability of user churn is low.
  • the product penetration rate array corresponding to the target user group to which the user to be recommended belongs is obtained, and the products with the highest product penetration rate are used as The target product is recommended to the user to be recommended.
  • the recommendation module 150 is further configured to:
  • the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
  • the preset index is a preset index that has a greater impact on the churn rate, and the user churn can be determined based on the preset index Reasons and recommend suitable products.
  • the standard deviation of assets is a preset indicator. When the standard deviation of assets is greater than the indicator threshold, it means that the assets of the users to be recommended have changed greatly. In order to reduce the standard deviation of assets, you can recommend large-denomination certificates of deposit such as Long-term financial products are recommended to users to be recommended.
  • FIG. 3 it is a schematic structural diagram of an electronic device for implementing a product recommendation method according to an embodiment of the present application.
  • the electronic device 1 is a device that can automatically perform numerical calculation and/or information processing according to pre-set or stored instructions.
  • the electronic device 1 may be a computer, a single network server, a server group composed of multiple network servers, or a cloud based on cloud computing composed of a large number of hosts or network servers, wherein cloud computing is a kind of distributed computing, A super virtual computer consisting of a collection of loosely coupled computers.
  • the electronic device 1 includes, but is not limited to, a memory 11 , a processor 12 , and a network interface 13 that can be communicatively connected to each other through a system bus, the memory 11 stores a product recommendation program 10 , and the product recommendation program 10 is executable by the processor 12 .
  • FIG. 3 only shows the electronic device 1 having the components 11-13 and the product recommendation program 10. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation on the electronic device 1, and may include a Fewer or more components are shown, or some components are combined, or a different arrangement of components.
  • the memory 11 includes a memory and at least one type of readable storage medium.
  • the memory provides a cache for the operation of the electronic device 1;
  • the readable storage medium can be, for example, flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM) ), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc. non-volatile storage media.
  • the readable storage medium may also be a volatile storage medium.
  • the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1; in other embodiments, the storage medium may also be an external storage device of the electronic device 1, such as A pluggable hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash memory card (Flash Card), etc., are equipped on the electronic device 1.
  • the readable storage medium of the memory 11 is generally used to store the operating system and various application software installed in the electronic device 1 , for example, to store the code of the product recommendation program 10 in an embodiment of the present application.
  • the memory 11 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 12 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips.
  • the processor 12 is generally used to control the overall operation of the electronic device 1, such as performing control and processing related to data interaction or communication with other devices.
  • the processor 12 is configured to run the program code or process data stored in the memory 11, for example, run the product recommendation program 10 and the like.
  • the network interface 13 may include a wireless network interface or a wired network interface, and the network interface 13 is used to establish a communication connection between the electronic device 1 and a client (not shown in the figure).
  • the electronic device 1 may further include a user interface, and the user interface may include a display (Display), an input unit such as a keyboard (Keyboard), and an optional user interface may also include a standard wired interface and a wireless interface.
  • the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, and the like.
  • the display may also be appropriately called a display screen or a display unit, which is used for displaying information processed in the electronic device 1 and for displaying a visualized user interface.
  • the product recommendation program 10 stored in the memory 11 in the electronic device 1 is a combination of multiple instructions, and when running in the processor 12, can realize:
  • Parse the product recommendation request sent by the user based on the client obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
  • a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  • the above-mentioned product recommendation program 10 by the processor 12, reference may be made to the description of the relevant steps in the corresponding embodiment of FIG. 1 , which will not be repeated here. It should be emphasized that, in order to further ensure the privacy and security of the above-mentioned first and second historical data, the above-mentioned first and second historical data may also be stored in a node of a blockchain.
  • the modules/units integrated in the electronic device 1 may be stored in a computer-readable storage medium.
  • the computer-readable storage medium may be non-volatile or non-volatile.
  • the computer-readable storage medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory) ).
  • a product recommendation program 10 is stored on the computer-readable storage medium, and the product recommendation program 10 can be executed by one or more processors to realize the following steps:
  • Parse the product recommendation request sent by the user based on the client obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
  • a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  • modules described as separate components may or may not be physically separated, and the components shown as modules may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional module in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware, or can be implemented in the form of hardware plus software function modules.
  • the blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.

Abstract

The present application relates to the technical field of artificial intelligence. Disclosed is a product recommendation method, comprising: determining first features of users, and grouping the users on the basis of the first features, so as to obtain a plurality of user groups; calculating a product permeability array corresponding to each user group in the plurality of user groups; determining a target user group corresponding to a user to be given a recommendation; acquiring a second feature factor corresponding to a product recommendation request, determining, on the basis of the second feature factor and second historical data, a second feature of the user to be given a recommendation, and inputting the second feature into an attribute analysis model, so as to obtain a target attribute value of the user to be given a recommendation; and when the target attribute value is less than a pre-set threshold value, on the basis of the product permeability array corresponding to the target user group, recommending a target product to the user to be given a recommendation. Further provided are a product recommendation apparatus, an electronic device and a readable storage medium. By means of the present application, the success rate of product recommendation is improved.

Description

产品推荐方法、装置、电子设备及可读存储介质Product recommendation method, device, electronic device and readable storage medium
本申请要求于2020年11月2日提交中国专利局、申请号为CN202011206222.0、名称为“产品推荐方法、装置、电子设备及可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number CN202011206222.0 and the title of "Product Recommendation Method, Apparatus, Electronic Equipment and Readable Storage Medium" filed with the China Patent Office on November 2, 2020, the entire contents of which are Incorporated herein by reference.
技术领域technical field
本申请涉及人工智能技术领域和数据分析领域,尤其涉及一种产品推荐方法、装置、电子设备及可读存储介质。The present application relates to the field of artificial intelligence technology and data analysis, and in particular, to a product recommendation method, device, electronic device, and readable storage medium.
背景技术Background technique
随着科技的发展,产品推荐在人们生活中的应用越来越广泛,例如,文章推送、商品信息推荐等,发明人意识到,目前通常单一的根据用户自身相关的信息,例如历史浏览信息、购买信息进行推荐,使得推荐成功率不高。因此,亟需一种产品推荐方法,以提高推荐成功率。With the development of science and technology, the application of product recommendation in people's life is more and more widely, for example, article push, product information recommendation, etc. The inventor realized that at present, it is usually based on the user's own related information, such as historical browsing information, Purchase information for recommendation, so that the success rate of recommendation is not high. Therefore, a product recommendation method is urgently needed to improve the success rate of recommendation.
发明内容SUMMARY OF THE INVENTION
本申请提供的产品推荐方法,包括:The product recommendations provided in this application include:
获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
本申请还提供一种产品推荐装置,所述装置包括:The present application also provides a product recommendation device, the device comprising:
分组模块,用于获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;The grouping module is used for obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain multiple user groups;
计算模块,用于基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;a calculation module, configured to calculate the product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析模块,用于解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;The parsing module is configured to parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, and obtain the second historical data of the user to be recommended from the database based on the identifier , and determine the target user group corresponding to the user to be recommended;
确定模块,用于获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;A determination module, configured to obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into The attribute analysis model obtains the target attribute value of the user to be recommended;
推荐模块,用于当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。A recommendation module, configured to recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group when the target attribute value is less than a preset threshold.
本申请还提供一种电子设备,所述电子设备包括:The present application also provides an electronic device, the electronic device comprising:
至少一个处理器;以及,at least one processor; and,
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
所述存储器存储有可被所述至少一个处理器执行的产品推荐程序,所述产品推荐程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores a product recommendation program executable by the at least one processor, and the product recommendation program is executed by the at least one processor to enable the at least one processor to perform the following steps:
获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有产品推荐程序,所述产品推荐程序可被一个或者多个处理器执行,以实现如下步骤:The present application also provides a computer-readable storage medium, where a product recommendation program is stored on the computer-readable storage medium, and the product recommendation program can be executed by one or more processors to implement the following steps:
获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
附图说明Description of drawings
图1为本申请一实施例提供的产品推荐方法的流程示意图;1 is a schematic flowchart of a product recommendation method provided by an embodiment of the present application;
图2为本申请一实施例提供的产品推荐装置的模块示意图;FIG. 2 is a schematic diagram of a module of a product recommendation device provided by an embodiment of the present application;
图3为本申请一实施例提供的实现产品推荐方法的电子设备的结构示意图;3 is a schematic structural diagram of an electronic device for implementing a product recommendation method provided by an embodiment of the present application;
本申请目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the purpose of the present application will be further described with reference to the accompanying drawings in conjunction with the embodiments.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
需要说明的是,在本申请中涉及“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本申请要求的保护范围之内。It should be noted that the descriptions involving "first", "second", etc. in this application are only for the purpose of description, and should not be construed as indicating or implying their relative importance or implying the number of indicated technical features . Thus, a feature delimited with "first", "second" may expressly or implicitly include at least one of that feature. In addition, the technical solutions between the various embodiments can be combined with each other, but must be based on the realization by those of ordinary skill in the art. When the combination of technical solutions is contradictory or cannot be realized, it should be considered that the combination of such technical solutions does not exist. , is not within the scope of protection claimed in this application.
本申请实施例可以基于人工智能技术对相关的数据进行获取和处理。其中,人工智能(Artificial Intelligence,AI)是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩 展人的智能,感知环境、获取知识并使用知识获得最佳结果的理论、方法、技术及应用系统。The embodiments of the present application may acquire and process related data based on artificial intelligence technology. Among them, Artificial Intelligence (AI) is a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .
人工智能基础技术一般包括如传感器、专用人工智能芯片、云计算、分布式存储、大数据处理技术、操作/交互系统、机电一体化等技术。人工智能软件技术主要包括计算机视觉技术、机器人技术、生物识别技术、语音处理技术、自然语言处理技术以及机器学习/深度学习等几大方向。The basic technologies of artificial intelligence generally include technologies such as sensors, special artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, and mechatronics. Artificial intelligence software technology mainly includes computer vision technology, robotics technology, biometrics technology, speech processing technology, natural language processing technology, and machine learning/deep learning.
本申请提供一种产品推荐方法。参照图1所示,为本申请一实施例提供的产品推荐方法的流程示意图。该方法可以由一个电子设备执行,该电子设备可以由软件和/或硬件实现。The present application provides a product recommendation method. Referring to FIG. 1 , it is a schematic flowchart of a product recommendation method according to an embodiment of the present application. The method may be performed by an electronic device, which may be implemented by software and/or hardware.
本实施例中,产品推荐方法包括:In this embodiment, the product recommendation method includes:
S1、获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组。S1. Acquire the first historical data of each user in the database, determine the first characteristic of each user based on the first characteristic factor and the first historical data, and group each user based on the first characteristic to obtain a plurality of user groups .
本实施例以为银行客户推荐产品为例进行说明,所述第一历史数据包括用户的基本信息及资产信息,所述第一特征因子包括基本信息中用户的性别、年龄、学历、账龄、持卡等级、客户等级、是否为会员、是否完成风险测评、是否绑定三方支付等和资产信息中时点AUM(资产总市值)、历史最大AUM、PPC值(当前持有产品数)、AUM月日均值、AUM年日均值等。所述第一特征为将基本信息及资产信息中的各个特征对应的特征值进行拼接得到的。This embodiment is described by taking a product recommended by a bank customer as an example, the first historical data includes the basic information and asset information of the user, and the first characteristic factor includes the user's gender, age, educational background, age of accounts, holdings in the basic information. Card level, customer level, whether you are a member, whether you have completed risk assessment, whether you are bound to third-party payment, etc. and asset information in time-point AUM (total market value of assets), historical maximum AUM, PPC value (current number of products held), AUM month Daily average value, AUM annual daily average value, etc. The first feature is obtained by splicing the feature values corresponding to each feature in the basic information and the asset information.
所述基于所述第一特征对各个用户进行分组,得到多个用户组包括:The grouping of each user based on the first feature to obtain a plurality of user groups includes:
A1、基于所述第一特征采用K均值聚类算法对各个用户进行分组,其中,K分别取值为预设数值范围内的各个自然数,K的一个取值对应一种分组结果,得到多种分组结果;A1. Use the K-means clustering algorithm to group each user based on the first feature, where K respectively takes a value of each natural number within a preset value range, and a value of K corresponds to a grouping result, and obtains multiple grouping results;
K表示用户组的数量,本实施例中,K为3~10中的任一个自然数,则可以得到分为3个用户组、分为4个用户组、……、分为9个用户组、分为10个用户组共8种分组结果。K represents the number of user groups. In this embodiment, K is any natural number from 3 to 10, then it can be divided into 3 user groups, 4 user groups, ..., 9 user groups, Divided into 10 user groups with a total of 8 grouping results.
以K=3举例说明用户分组过程:任取3个用户的第一特征作为三个初始聚类中心,然后计算剩余用户与各个聚类中心之间的第一特征的距离,把每个用户分配给距离它最近的聚类,每分配一个用户,聚类的聚类中心会根据聚类中现有的用户被重新计算,如此循环直至将所有用户分组完成。Take K=3 as an example to illustrate the user grouping process: arbitrarily take the first features of 3 users as the three initial cluster centers, then calculate the distance between the remaining users and the first features of each cluster center, and assign each user to To the cluster closest to it, each time a user is assigned, the cluster center of the cluster will be recalculated according to the existing users in the cluster, and so on until all users are grouped.
A2、确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户,基于所述中心用户的第一特征分别计算每种分组结果对应的轮廓系数;A2, determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the corresponding silhouette coefficient of each kind of grouping result based on the first feature of the center user;
现有的轮廓系数的计算方法复杂度较高,其中,用户对应的轮廓系数的计算公式为:The existing calculation method of the silhouette coefficient has high complexity, wherein the calculation formula of the silhouette coefficient corresponding to the user is:
Figure PCTCN2021123176-appb-000001
Figure PCTCN2021123176-appb-000001
其中,S ij表示第i种分组结果中第j个用户对应的轮廓系数,
Figure PCTCN2021123176-appb-000002
表示第i种分组结果中第j个用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
Figure PCTCN2021123176-appb-000003
表示第i种分组结果中第j个用户的第一特征到其他用户组的第一特征的平均距离的最小值。
Among them, S ij represents the silhouette coefficient corresponding to the j th user in the ith grouping result,
Figure PCTCN2021123176-appb-000002
represents the average distance from the first feature of the jth user in the i-th grouping result to the first features of other users in the same user group,
Figure PCTCN2021123176-appb-000003
Indicates the minimum value of the average distance between the first feature of the jth user and the first features of other user groups in the ith grouping result.
分组结果对应的轮廓系数的计算公式为:The calculation formula of the contour coefficient corresponding to the grouping result is:
Figure PCTCN2021123176-appb-000004
Figure PCTCN2021123176-appb-000004
其中,S i表示第i种分组结果对应的轮廓系数,S ij表示第i种分组结果中第j个用户对应的轮廓系数,n表示用户的总数量。 Among them, S i represents the contour coefficient corresponding to the i-th grouping result, S ij represents the contour coefficient corresponding to the j-th user in the i-th grouping result, and n represents the total number of users.
轮廓系数是分组结果好坏的一种评价方式,反映了该聚类方法的内聚度和分离度。若同一个簇的内聚类越高,不同簇的分离度越高,则聚类效果越好,S ij越接近1表示
Figure PCTCN2021123176-appb-000005
越小,聚类效果越好。
The silhouette coefficient is an evaluation method for the quality of the grouping results, which reflects the cohesion and separation of the clustering method. If the inner clustering of the same cluster is higher, and the separation degree of different clusters is higher, the clustering effect is better, and the closer S ij is to 1, it means
Figure PCTCN2021123176-appb-000005
The smaller the value, the better the clustering effect.
由上述公式可知,在计算每个用户对应的轮廓系数的时候,要分别计算其与其他用户的第一特征值的距离,最终再将所有用户的距离平均,时间复杂度为n 2。为了降低时间复 杂度,本申请提出一种简化的轮廓系数计算方法。在聚类过程中,由于采用K均值算法,因此每个簇的质心均已知。质心是每一个簇最为核心的点,簇内的其他点都与之相近,因此质心可以近似代表整个簇。在计算轮廓系数时,仅计算质心之间的距离,即本实施例计算轮廓系数的公式不变,但参数意义不同,计算量大大降低了。 It can be seen from the above formula that when calculating the silhouette coefficient corresponding to each user, the distance from the first eigenvalue of other users should be calculated separately, and finally the distances of all users are averaged, and the time complexity is n 2 . In order to reduce the time complexity, the present application proposes a simplified method for calculating the silhouette coefficient. During the clustering process, the centroid of each cluster is known due to the K-means algorithm. The centroid is the core point of each cluster, and other points in the cluster are close to it, so the centroid can approximately represent the entire cluster. When calculating the contour coefficient, only the distance between the centroids is calculated, that is, the formula for calculating the contour coefficient in this embodiment remains unchanged, but the meanings of the parameters are different, and the amount of calculation is greatly reduced.
所述确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户(即质心)包括:The determining of the central user (ie the centroid) of each user group corresponding to each of the multiple grouping results includes:
计算每种分组结果对应的各个用户组中所有用户的第一特征的平均值,将各个用户组中第一特征与所述平均值的差值绝对值最小的用户作为各个用户组的中心用户。Calculate the average value of the first characteristics of all users in each user group corresponding to each grouping result, and take the user with the smallest absolute value of the difference between the first characteristic and the average value in each user group as the central user of each user group.
在其他实施例中也可以采用其他方式确定各个用户组的中心用户,本申请不对中心用户的确定方式进行限制。In other embodiments, the central user of each user group may also be determined in other manners, and the present application does not limit the determination manner of the central user.
本实施例中轮廓系数的计算公式为:The calculation formula of the contour coefficient in this embodiment is:
Figure PCTCN2021123176-appb-000006
Figure PCTCN2021123176-appb-000006
Figure PCTCN2021123176-appb-000007
Figure PCTCN2021123176-appb-000007
其中,S pq表示第p种分组结果中第q个用户组对应的轮廓系数,
Figure PCTCN2021123176-appb-000008
表示第p种分组结果中第q个用户组的中心用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
Figure PCTCN2021123176-appb-000009
表示第p种分组结果中第q个用户组的中心用户的第一特征到其他用户组的第一特征的平均距离的最小值,S p表示第p种分组结果对应的轮廓系数,m表示第p种分组结果中用户组的总数量。
Among them, S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result,
Figure PCTCN2021123176-appb-000008
represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group,
Figure PCTCN2021123176-appb-000009
Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
可见,本申请将计算轮廓系数的复杂度由n 2简化为mn,其中,n为用户的总数量,m为用户组的数量,故而,本申请大大提高了分组效率。 It can be seen that the present application simplifies the complexity of calculating the silhouette coefficient from n 2 to mn, where n is the total number of users and m is the number of user groups. Therefore, the present application greatly improves the grouping efficiency.
A3、将轮廓系数最接近预设数值的分组结果作为目标分组结果。A3. The grouping result with the contour coefficient closest to the preset value is used as the target grouping result.
本实施例中,预设数值为1,将轮廓系数最接近1的分组结果作为目标分组结果。In this embodiment, the preset value is 1, and the grouping result with the contour coefficient closest to 1 is used as the target grouping result.
S2、基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组。S2. Calculate, based on the first historical data, a product penetration rate array corresponding to each user group in the multiple user groups.
所述基于所述第一历史数据计算所述多个用户组中各个用户组对应的产品渗透率数组包括:The calculating, based on the first historical data, an array of product penetration rates corresponding to each of the multiple user groups includes:
B1、基于所述第一历史数据确定各个用户组中每个用户对应的产品;B1. Determine the product corresponding to each user in each user group based on the first historical data;
B2、计算各个用户组中各种产品对应的用户占比;B2. Calculate the proportion of users corresponding to various products in each user group;
B3、基于所述用户占比确定各个用户组对应的产品渗透率数组。B3. Determine the product penetration rate array corresponding to each user group based on the user ratio.
本实施例中,所述第一历史数据中还包括用户的产品购买信息,某一用户组对应的产品渗透率数组中的产品渗透率为该用户组中购买各个产品的人数与用户组总用户数量的百分比。In this embodiment, the first historical data also includes the user's product purchase information, the product penetration rate in the product penetration rate array corresponding to a certain user group is the number of people who purchased each product in the user group and the total users of the user group percentage of the quantity.
例如,各个用户组中各个产品对应的用户占比如下表1所示:For example, the proportion of users corresponding to each product in each user group is shown in Table 1 below:
   活期current 定期regular 理财financial management 基金fund 保险Insurance ……... 信托trust
用户组1User group 1 20%20% 5%5% 50%50% 2%2% 6%6% ……... 2%2%
用户组2User group 2 30%30% 1%1% 1%1% 62%62% 1%1% ……... 1%1%
用户组3User group 3 13%13% 20%20% 1%1% 2%2% 52%52% ……... 2%2%
用户组4User group 4 9%9% 78%78% 2%2% 5%5% 2%2% ……... 1%1%
表1Table 1
由上表1可知,用户组1对应的产品渗透率数组为{20%,5%,50%,2%,6%,……,2%},根据产品渗透率数组可分析出不同用户组对不同产品的喜好程度,可用于产品推荐。It can be seen from Table 1 above that the product penetration rate array corresponding to user group 1 is {20%, 5%, 50%, 2%, 6%, ..., 2%}. According to the product penetration rate array, different user groups can be analyzed. The degree of preference for different products can be used for product recommendation.
S3、解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;S3. Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the target user group corresponding to the user to be recommended;
若可从数据库中获取待推荐用户的历史数据,则说明待推荐用户为存量用户,基于用户标识即可获取待推荐用户对应的目标用户组信息,若无法确定目标用户组,则计算所述待推荐用户与各个用户组的中心用户的第一特征的差值绝对值,将最小的差值绝对值对应的中心用户所属的用户组作为所述待推荐用户的目标用户组。If the historical data of the user to be recommended can be obtained from the database, it means that the user to be recommended is an existing user, and the target user group information corresponding to the user to be recommended can be obtained based on the user ID. The absolute value of the difference between the recommended user and the first characteristic of the central user of each user group, and the user group to which the central user corresponding to the smallest absolute value of the difference belongs is used as the target user group of the user to be recommended.
本实施例中,所述方法还包括:In this embodiment, the method further includes:
若无法从所述数据库中获取所述待推荐用户的第二历史数据,则将预设产品清单推荐给所述待推荐用户。If the second historical data of the user to be recommended cannot be obtained from the database, a preset product list is recommended to the user to be recommended.
若无法获取历史数据,则说明待推荐用户为新用户,还没有产生数据信息,此时,将预先配置的针对新用户的产品清单推荐给所述新用户。If the historical data cannot be obtained, it means that the user to be recommended is a new user and no data information has been generated. At this time, the preconfigured product list for the new user is recommended to the new user.
S4、获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值。S4. Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into an attribute analysis model Obtain the target attribute value of the user to be recommended.
本实施例中,所述属性分析模型用于分析用户的流失概率,所述属性分析模型为随机森林模型。In this embodiment, the attribute analysis model is used to analyze the loss probability of users, and the attribute analysis model is a random forest model.
所述第二特征因子包括波动性特征、趋势性特征、产品爱好度特征、活跃度特征,本实施例利用时间切片构建用户的第二特征,例如从第二历史数据中筛选出前6个月的数据及前3个月的数据以确定用户的各个特征因子对应的特征值。The second feature factor includes volatility feature, trend feature, product hobby feature, and activity feature. In this embodiment, time slices are used to construct the second feature of the user, for example, the first 6 months of data are selected from the second historical data. The data and the data of the previous 3 months are used to determine the eigenvalues corresponding to each eigenfactor of the user.
所述波动性特征对应的特征值包括:近3个月的资产极差(资产最大值与资产最小值的差)、近3个月的资产标准差、近6个月的资产极差及近6个月的资产标准差;趋势性特征对应的特征值包括:近3个月的资产增长率、近3个月的个金产品持有数量增长率、近6个月的资产增长率及近6个月的个金产品持有数量增长率;产品爱好度特征对应的特征值包括:活期占比、定期占比、理财占比、基金占比、资管占比、保险占比;活跃度特征对应的特征值包括:账龄、近3个月平均动账交易次数、近6个月平均动账交易次数等。The characteristic values corresponding to the volatility characteristics include: the asset range in the past three months (the difference between the maximum asset value and the asset minimum value), the asset standard deviation in the past three months, the asset range in the past six months, and the asset range in the past six months. The standard deviation of assets in 6 months; the characteristic values corresponding to trend characteristics include: asset growth rate in the past 3 months, growth rate of the number of individual gold products held in the past 3 months, asset growth rate in the past 6 months, and recent asset growth rate. The growth rate of the number of individual gold products held in 6 months; the characteristic values corresponding to the characteristics of product hobbies include: the proportion of current accounts, the proportion of regular accounts, the proportion of wealth management, the proportion of funds, the proportion of asset management, and the proportion of insurance; activeness The feature values corresponding to the feature include: ageing, average number of transactions in the past 3 months, average number of transactions in the past 6 months, etc.
将上述波动性特征、趋势性特征、产品爱好度特征及活跃度特征对应的特征值进行拼接得到第二特征,将第二特征输入属性分析模型得到待推荐用户的目标属性值(即流失概率)。The second feature is obtained by splicing the feature values corresponding to the aforementioned volatility feature, trend feature, product hobby feature, and activity feature, and the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended (ie, loss probability) .
S5、当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。S5. When the target attribute value is less than a preset threshold, recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group.
当目标属性值(即流失概率)小于预设阈值时,说明用户流失概率低,此时获取待推荐用户所属的目标用户组对应的产品渗透率数组,将产品渗透率靠前的多个产品作为目标产品推荐给所述待推荐用户。When the target attribute value (that is, the churn probability) is less than the preset threshold, it means that the probability of user churn is low. At this time, the product penetration rate array corresponding to the target user group to which the user to be recommended belongs is obtained, and the products with the highest product penetration rate are used as The target product is recommended to the user to be recommended.
本实施例中,在将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值之后,所述方法还包括:In this embodiment, after inputting the second feature into the attribute analysis model to obtain the target attribute value of the user to be recommended, the method further includes:
若所述目标属性值大于预设阈值,则基于所述第二历史数据计算多个预设指标的指标值,当所述多个预设指标中某一指定指标对应的指标值大于指标阈值时,基于所述指定指标为所述待推荐用户推荐目标产品。If the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
当目标属性值(即流失概率)大于预设阈值时,说明用户流失概率高,所述预设指标为预先设置的对流失率影响较大的指标,基于所述预设指标可确定用户的流失原因并推荐合适的产品,例如,资产标准差为预设指标,当资产标准差大于指标阈值时,说明待推荐 用户的资产变化幅度较大,为了减少资产标准差,可以推荐大额存单这样的周期较长的金融产品给待推荐用户。When the target attribute value (that is, the churn probability) is greater than the preset threshold, it indicates that the user churn probability is high, the preset index is a preset index that has a greater impact on the churn rate, and the user churn can be determined based on the preset index Reasons and recommend suitable products. For example, the standard deviation of assets is a preset indicator. When the standard deviation of assets is greater than the indicator threshold, it means that the assets of the users to be recommended have changed greatly. In order to reduce the standard deviation of assets, you can recommend large-denomination certificates of deposit such as Long-term financial products are recommended to users to be recommended.
由上述实施例可知,本申请提出的产品推荐方法,首先,根据第一特征对各个用户进行分组,得到多个用户组,基于第一历史数据计算各个用户组对应的产品渗透率数组,该步骤分析出了不同用户组对不同产品的喜好程度,可有助于精准推荐;接着,获取待推荐用户的第二历史数据,确定待推荐用户对应的目标用户组,根据第二特征因子及第二历史数据确定待推荐用户的第二特征,将第二特征输入属性分析模型得到待推荐用户的目标属性值(即流失率),当目标属性值小于预设阈值时,根据目标用户组对应的产品渗透率数组为待推荐用户推荐目标产品,本步骤在用户流失率低的情况下基于产品渗透率数组进行产品推荐,使得产品推荐成功率更高。因此,本申请提高了产品推荐成功率。It can be seen from the above embodiments that, in the product recommendation method proposed by the present application, firstly, each user is grouped according to the first feature to obtain a plurality of user groups, and the product penetration rate array corresponding to each user group is calculated based on the first historical data. The degree of preference of different user groups for different products is analyzed, which can be helpful for accurate recommendation; then, the second historical data of the user to be recommended is obtained, and the target user group corresponding to the user to be recommended is determined. The historical data determines the second feature of the user to be recommended, and the second feature is input into the attribute analysis model to obtain the target attribute value (ie, the churn rate) of the user to be recommended. When the target attribute value is less than the preset threshold, the product corresponding to the target user group The penetration rate array is the target product recommended by the user to be recommended. In this step, product recommendation is performed based on the product penetration rate array when the user churn rate is low, so that the success rate of product recommendation is higher. Therefore, the present application improves the success rate of product recommendation.
如图2所示,为本申请一实施例提供的产品推荐装置的模块示意图。As shown in FIG. 2 , it is a schematic diagram of a module of a product recommendation apparatus provided by an embodiment of the present application.
本申请所述产品推荐装置100可以安装于电子设备中。根据实现的功能,所述产品推荐装置100可以包括分组模块110、计算模块120、解析模块130、确定模块140及推荐模块150。本申请所述模块也可以称之为单元,是指一种能够被电子设备处理器所执行,并且能够完成固定功能的一系列计算机程序段,其存储在电子设备的存储器中。The product recommendation apparatus 100 described in this application may be installed in an electronic device. According to the implemented functions, the product recommendation apparatus 100 may include a grouping module 110 , a calculation module 120 , a parsing module 130 , a determination module 140 and a recommendation module 150 . The modules described in this application may also be referred to as units, which refer to a series of computer program segments that can be executed by the processor of an electronic device and can perform fixed functions, and are stored in the memory of the electronic device.
在本实施例中,关于各模块/单元的功能如下:In this embodiment, the functions of each module/unit are as follows:
分组模块110,用于获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组。The grouping module 110 is configured to obtain the first historical data of each user in the database, determine the first characteristic of each user based on the first characteristic factor and the first historical data, and group each user based on the first characteristic to obtain Multiple user groups.
本实施例以为银行客户推荐产品为例进行说明,所述第一历史数据包括用户的基本信息及资产信息,所述第一特征因子包括基本信息中用户的性别、年龄、学历、账龄、持卡等级、客户等级、是否为会员、是否完成风险测评、是否绑定三方支付等和资产信息中时点AUM(资产总市值)、历史最大AUM、PPC值(当前持有产品数)、AUM月日均值、AUM年日均值等。所述第一特征为将基本信息及资产信息中的各个特征对应的特征值进行拼接得到的。This embodiment is described by taking a product recommended by a bank customer as an example, the first historical data includes the basic information and asset information of the user, and the first characteristic factor includes the user's gender, age, educational background, age of accounts, holdings in the basic information. Card level, customer level, whether you are a member, whether you have completed risk assessment, whether you are bound to third-party payment, etc. and asset information in time-point AUM (total market value of assets), historical maximum AUM, PPC value (current number of products held), AUM month Daily average value, AUM annual daily average value, etc. The first feature is obtained by splicing the feature values corresponding to each feature in the basic information and the asset information.
所述基于所述第一特征对各个用户进行分组,得到多个用户组包括:The grouping of each user based on the first feature to obtain a plurality of user groups includes:
A1、基于所述第一特征采用K均值聚类算法对各个用户进行分组,其中,K分别取值为预设数值范围内的各个自然数,K的一个取值对应一种分组结果,得到多种分组结果;A1. Use the K-means clustering algorithm to group each user based on the first feature, where K respectively takes a value of each natural number within a preset value range, and a value of K corresponds to a grouping result, and obtains multiple grouping results;
K表示用户组的数量,本实施例中,K为3~10中的任一个自然数,则可以得到分为3个用户组、分为4个用户组、……、分为9个用户组、分为10个用户组共8种分组结果。K represents the number of user groups. In this embodiment, K is any natural number from 3 to 10, then it can be divided into 3 user groups, 4 user groups, ..., 9 user groups, Divided into 10 user groups with a total of 8 grouping results.
以K=3举例说明用户分组过程:任取3个用户的第一特征作为三个初始聚类中心,然后计算剩余用户与各个聚类中心之间的第一特征的距离,把每个用户分配给距离它最近的聚类,每分配一个用户,聚类的聚类中心会根据聚类中现有的用户被重新计算,如此循环直至将所有用户分组完成。Take K=3 as an example to illustrate the user grouping process: arbitrarily take the first features of 3 users as the three initial cluster centers, then calculate the distance between the remaining users and the first features of each cluster center, and assign each user to To the cluster closest to it, each time a user is assigned, the cluster center of the cluster will be recalculated according to the existing users in the cluster, and so on until all users are grouped.
A2、确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户,基于所述中心用户的第一特征分别计算每种分组结果对应的轮廓系数;A2, determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the corresponding silhouette coefficient of each kind of grouping result based on the first feature of the center user;
现有的轮廓系数的计算方法复杂度较高,其中,用户对应的轮廓系数的计算公式为:The existing calculation method of the silhouette coefficient has high complexity, wherein the calculation formula of the silhouette coefficient corresponding to the user is:
Figure PCTCN2021123176-appb-000010
Figure PCTCN2021123176-appb-000010
其中,S ij表示第i种分组结果中第j个用户对应的轮廓系数,
Figure PCTCN2021123176-appb-000011
表示第i种分组结果中第j个用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
Figure PCTCN2021123176-appb-000012
表示第i种分组结果中第j个用户的第一特征到其他用户组的第一特征的平均距离的最小值。
Among them, S ij represents the silhouette coefficient corresponding to the j th user in the ith grouping result,
Figure PCTCN2021123176-appb-000011
represents the average distance from the first feature of the jth user in the i-th grouping result to the first features of other users in the same user group,
Figure PCTCN2021123176-appb-000012
Indicates the minimum value of the average distance between the first feature of the jth user and the first features of other user groups in the ith grouping result.
分组结果对应的轮廓系数的计算公式为:The calculation formula of the contour coefficient corresponding to the grouping result is:
Figure PCTCN2021123176-appb-000013
Figure PCTCN2021123176-appb-000013
其中,S i表示第i种分组结果对应的轮廓系数,S ij表示第i种分组结果中第j个用户对应的轮廓系数,n表示用户的总数量。 Among them, S i represents the contour coefficient corresponding to the i-th grouping result, S ij represents the contour coefficient corresponding to the j-th user in the i-th grouping result, and n represents the total number of users.
轮廓系数是分组结果好坏的一种评价方式,反映了该聚类方法的内聚度和分离度。若同一个簇的内聚类越高,不同簇的分离度越高,则聚类效果越好,S ij越接近1表示
Figure PCTCN2021123176-appb-000014
越小,聚类效果越好。
The silhouette coefficient is an evaluation method for the quality of the grouping results, which reflects the cohesion and separation of the clustering method. If the inner clustering of the same cluster is higher, and the separation degree of different clusters is higher, the clustering effect is better, and the closer S ij is to 1, it means
Figure PCTCN2021123176-appb-000014
The smaller the value, the better the clustering effect.
由上述公式可知,在计算每个用户对应的轮廓系数的时候,要分别计算其与其他用户的第一特征值的距离,最终再将所有用户的距离平均,时间复杂度为n 2。为了降低时间复杂度,本申请提出一种简化的轮廓系数计算方法。在聚类过程中,由于采用K均值算法,因此每个簇的质心均已知。质心是每一个簇最为核心的点,簇内的其他点都与之相近,因此质心可以近似代表整个簇。在计算轮廓系数时,仅计算质心之间的距离,即本实施例计算轮廓系数的公式不变,但参数意义不同,计算量大大降低了。 It can be seen from the above formula that when calculating the silhouette coefficient corresponding to each user, the distance from the first eigenvalue of other users should be calculated separately, and finally the distances of all users are averaged, and the time complexity is n 2 . In order to reduce the time complexity, the present application proposes a simplified method for calculating the silhouette coefficient. During the clustering process, the centroid of each cluster is known due to the K-means algorithm. The centroid is the core point of each cluster, and other points in the cluster are close to it, so the centroid can approximately represent the entire cluster. When calculating the contour coefficient, only the distance between the centroids is calculated, that is, the formula for calculating the contour coefficient in this embodiment remains unchanged, but the meanings of the parameters are different, and the amount of calculation is greatly reduced.
所述确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户(即质心)包括:The determining of the central user (ie the centroid) of each user group corresponding to each of the multiple grouping results includes:
计算每种分组结果对应的各个用户组中所有用户的第一特征的平均值,将各个用户组中第一特征与所述平均值的差值绝对值最小的用户作为各个用户组的中心用户。Calculate the average value of the first characteristics of all users in each user group corresponding to each grouping result, and take the user with the smallest absolute value of the difference between the first characteristic and the average value in each user group as the central user of each user group.
在其他实施例中也可以采用其他方式确定各个用户组的中心用户,本申请不对中心用户的确定方式进行限制。In other embodiments, the central user of each user group may also be determined in other manners, and the present application does not limit the determination manner of the central user.
本实施例中轮廓系数的计算公式为:The calculation formula of the contour coefficient in this embodiment is:
Figure PCTCN2021123176-appb-000015
Figure PCTCN2021123176-appb-000015
Figure PCTCN2021123176-appb-000016
Figure PCTCN2021123176-appb-000016
其中,S pq表示第p种分组结果中第q个用户组对应的轮廓系数,
Figure PCTCN2021123176-appb-000017
表示第p种分组结果中第q个用户组的中心用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
Figure PCTCN2021123176-appb-000018
表示第p种分组结果中第q个用户组的中心用户的第一特征到其他用户组的第一特征的平均距离的最小值,S p表示第p种分组结果对应的轮廓系数,m表示第p种分组结果中用户组的总数量。
Among them, S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result,
Figure PCTCN2021123176-appb-000017
represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group,
Figure PCTCN2021123176-appb-000018
Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
可见,本申请将计算轮廓系数的复杂度由n 2简化为mn,其中,n为用户的总数量,m为用户组的数量,故而,本申请大大提高了分组效率。 It can be seen that the present application simplifies the complexity of calculating the silhouette coefficient from n 2 to mn, where n is the total number of users and m is the number of user groups. Therefore, the present application greatly improves the grouping efficiency.
A3、将轮廓系数最接近预设数值的分组结果作为目标分组结果。A3. The grouping result with the contour coefficient closest to the preset value is used as the target grouping result.
本实施例中,预设数值为1,将轮廓系数最接近1的分组结果作为目标分组结果。In this embodiment, the preset value is 1, and the grouping result with the contour coefficient closest to 1 is used as the target grouping result.
计算模块120,用于基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组。The calculation module 120 is configured to calculate a product penetration rate array corresponding to each user group in the plurality of user groups based on the first historical data.
所述基于所述第一历史数据计算所述多个用户组中各个用户组对应的产品渗透率数组包括:The calculating, based on the first historical data, an array of product penetration rates corresponding to each of the multiple user groups includes:
B1、基于所述第一历史数据确定各个用户组中每个用户对应的产品;B1. Determine the product corresponding to each user in each user group based on the first historical data;
B2、计算各个用户组中各种产品对应的用户占比;B2. Calculate the proportion of users corresponding to various products in each user group;
B3、基于所述用户占比确定各个用户组对应的产品渗透率数组。B3. Determine the product penetration rate array corresponding to each user group based on the user ratio.
本实施例中,所述第一历史数据中还包括用户的产品购买信息,某一用户组对应的产品渗透率数组中的产品渗透率为该用户组中购买各个产品的人数与用户组总用户数量的百分比。In this embodiment, the first historical data also includes the user's product purchase information, the product penetration rate in the product penetration rate array corresponding to a certain user group is the number of people who purchased each product in the user group and the total users of the user group percentage of the quantity.
例如,各个用户组中各个产品对应的用户占比如上表1所示。For example, the proportion of users corresponding to each product in each user group is shown in Table 1 above.
由上表1可知,用户组1对应的产品渗透率数组为{20%,5%,50%,2%,6%,……,2%},根 据产品渗透率数组可分析出不同用户组对不同产品的喜好程度,可用于产品推荐。It can be seen from Table 1 above that the product penetration rate array corresponding to user group 1 is {20%, 5%, 50%, 2%, 6%, ..., 2%}. According to the product penetration rate array, different user groups can be analyzed. The degree of preference for different products can be used for product recommendation.
解析模块130,用于解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;The parsing module 130 is configured to parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, and obtain the second history of the user to be recommended from the database based on the identifier data, and determine the target user group corresponding to the user to be recommended;
若可从数据库中获取待推荐用户的历史数据,则说明待推荐用户为存量用户,基于用户标识即可获取待推荐用户对应的目标用户组信息,若无法确定目标用户组,则计算所述待推荐用户与各个用户组的中心用户的第一特征的差值绝对值,将最小的差值绝对值对应的中心用户所属的用户组作为所述待推荐用户的目标用户组。If the historical data of the user to be recommended can be obtained from the database, it means that the user to be recommended is an existing user, and the target user group information corresponding to the user to be recommended can be obtained based on the user ID. The absolute value of the difference between the recommended user and the first characteristic of the central user of each user group, and the user group to which the central user corresponding to the smallest absolute value of the difference belongs is used as the target user group of the user to be recommended.
本实施例中,所述解析模块130还用于:In this embodiment, the parsing module 130 is further configured to:
若无法从所述数据库中获取所述待推荐用户的第二历史数据,则将预设产品清单推荐给所述待推荐用户。If the second historical data of the user to be recommended cannot be obtained from the database, a preset product list is recommended to the user to be recommended.
若无法获取历史数据,则说明待推荐用户为新用户,还没有产生数据信息,此时,将预先配置的针对新用户的产品清单推荐给所述新用户。If the historical data cannot be obtained, it means that the user to be recommended is a new user and no data information has been generated. At this time, the preconfigured product list for the new user is recommended to the new user.
确定模块140,用于获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值。The determining module 140 is configured to obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and assign the second characteristic The attribute analysis model is input to obtain the target attribute value of the user to be recommended.
本实施例中,所述属性分析模型用于分析用户的流失概率,所述属性分析模型为随机森林模型。In this embodiment, the attribute analysis model is used to analyze the loss probability of users, and the attribute analysis model is a random forest model.
所述第二特征因子包括波动性特征、趋势性特征、产品爱好度特征、活跃度特征,本实施例利用时间切片构建用户的第二特征,例如从第二历史数据中筛选出前6个月的数据及前3个月的数据以确定用户的各个特征因子对应的特征值。The second feature factor includes volatility feature, trend feature, product hobby feature, and activity feature. In this embodiment, time slices are used to construct the second feature of the user, for example, the first 6 months of data are selected from the second historical data. The data and the data of the previous 3 months are used to determine the eigenvalues corresponding to each eigenfactor of the user.
所述波动性特征对应的特征值包括:近3个月的资产极差(资产最大值与资产最小值的差)、近3个月的资产标准差、近6个月的资产极差及近6个月的资产标准差;趋势性特征对应的特征值包括:近3个月的资产增长率、近3个月的个金产品持有数量增长率、近6个月的资产增长率及近6个月的个金产品持有数量增长率;产品爱好度特征对应的特征值包括:活期占比、定期占比、理财占比、基金占比、资管占比、保险占比;活跃度特征对应的特征值包括:账龄、近3个月平均动账交易次数、近6个月平均动账交易次数等。The characteristic values corresponding to the volatility characteristics include: the asset range in the past three months (the difference between the maximum asset value and the asset minimum value), the asset standard deviation in the past three months, the asset range in the past six months, and the asset range in the past six months. The standard deviation of assets in 6 months; the characteristic values corresponding to trend characteristics include: asset growth rate in the past 3 months, growth rate of the number of individual gold products held in the past 3 months, asset growth rate in the past 6 months, and recent asset growth rate. The growth rate of the number of individual gold products held in 6 months; the characteristic values corresponding to the characteristics of product hobbies include: the proportion of current accounts, the proportion of regular accounts, the proportion of wealth management, the proportion of funds, the proportion of asset management, and the proportion of insurance; activeness The feature values corresponding to the feature include: ageing, average number of transactions in the past 3 months, average number of transactions in the past 6 months, etc.
将上述波动性特征、趋势性特征、产品爱好度特征及活跃度特征对应的特征值进行拼接得到第二特征,将第二特征输入属性分析模型得到待推荐用户的目标属性值(即流失概率)。The second feature is obtained by splicing the feature values corresponding to the aforementioned volatility feature, trend feature, product hobby feature, and activity feature, and the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended (ie, loss probability) .
推荐模块150,用于当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。The recommendation module 150 is configured to recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group when the target attribute value is less than a preset threshold.
当目标属性值(即流失概率)小于预设阈值时,说明用户流失概率低,此时获取待推荐用户所属的目标用户组对应的产品渗透率数组,将产品渗透率靠前的多个产品作为目标产品推荐给所述待推荐用户。When the target attribute value (that is, the churn probability) is less than the preset threshold, it means that the probability of user churn is low. At this time, the product penetration rate array corresponding to the target user group to which the user to be recommended belongs is obtained, and the products with the highest product penetration rate are used as The target product is recommended to the user to be recommended.
本实施例中,在将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值之后,所述推荐模块150还用于:In this embodiment, after inputting the second feature into the attribute analysis model to obtain the target attribute value of the user to be recommended, the recommendation module 150 is further configured to:
若所述目标属性值大于预设阈值,则基于所述第二历史数据计算多个预设指标的指标值,当所述多个预设指标中某一指定指标对应的指标值大于指标阈值时,基于所述指定指标为所述待推荐用户推荐目标产品。If the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
当目标属性值(即流失概率)大于预设阈值时,说明用户流失概率高,所述预设指标为预先设置的对流失率影响较大的指标,基于所述预设指标可确定用户的流失原因并推荐合适的产品,例如,资产标准差为预设指标,当资产标准差大于指标阈值时,说明待推荐用户的资产变化幅度较大,为了减少资产标准差,可以推荐大额存单这样的周期较长的金融产品给待推荐用户。When the target attribute value (that is, the churn probability) is greater than the preset threshold, it indicates that the user churn probability is high, the preset index is a preset index that has a greater impact on the churn rate, and the user churn can be determined based on the preset index Reasons and recommend suitable products. For example, the standard deviation of assets is a preset indicator. When the standard deviation of assets is greater than the indicator threshold, it means that the assets of the users to be recommended have changed greatly. In order to reduce the standard deviation of assets, you can recommend large-denomination certificates of deposit such as Long-term financial products are recommended to users to be recommended.
如图3所示,为本申请一实施例提供的实现产品推荐方法的电子设备的结构示意图。As shown in FIG. 3 , it is a schematic structural diagram of an electronic device for implementing a product recommendation method according to an embodiment of the present application.
所述电子设备1是一种能够按照事先设定或者存储的指令,自动进行数值计算和/或信息处理的设备。所述电子设备1可以是计算机、也可以是单个网络服务器、多个网络服务器组成的服务器组或者基于云计算的由大量主机或者网络服务器构成的云,其中云计算是分布式计算的一种,由一群松散耦合的计算机集组成的一个超级虚拟计算机。The electronic device 1 is a device that can automatically perform numerical calculation and/or information processing according to pre-set or stored instructions. The electronic device 1 may be a computer, a single network server, a server group composed of multiple network servers, or a cloud based on cloud computing composed of a large number of hosts or network servers, wherein cloud computing is a kind of distributed computing, A super virtual computer consisting of a collection of loosely coupled computers.
在本实施例中,电子设备1包括,但不仅限于,可通过系统总线相互通信连接的存储器11、处理器12、网络接口13,该存储器11中存储有产品推荐程序10,所述产品推荐程序10可被所述处理器12执行。图3仅示出了具有组件11-13以及产品推荐程序10的电子设备1,本领域技术人员可以理解的是,图3示出的结构并不构成对电子设备1的限定,可以包括比图示更少或者更多的部件,或者组合某些部件,或者不同的部件布置。In this embodiment, the electronic device 1 includes, but is not limited to, a memory 11 , a processor 12 , and a network interface 13 that can be communicatively connected to each other through a system bus, the memory 11 stores a product recommendation program 10 , and the product recommendation program 10 is executable by the processor 12 . FIG. 3 only shows the electronic device 1 having the components 11-13 and the product recommendation program 10. Those skilled in the art can understand that the structure shown in FIG. 3 does not constitute a limitation on the electronic device 1, and may include a Fewer or more components are shown, or some components are combined, or a different arrangement of components.
其中,存储器11包括内存及至少一种类型的可读存储介质。内存为电子设备1的运行提供缓存;可读存储介质可为如闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等的非易失性存储介质。可读存储介质也可以是易失性存储介质。在一些实施例中,可读存储介质可以是电子设备1的内部存储单元,例如该电子设备1的硬盘;在另一些实施例中,该存储介质也可以是电子设备1的外部存储设备,例如电子设备1上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。本实施例中,存储器11的可读存储介质通常用于存储安装于电子设备1的操作系统和各类应用软件,例如存储本申请一实施例中的产品推荐程序10的代码等。此外,存储器11还可以用于暂时地存储已经输出或者将要输出的各类数据。The memory 11 includes a memory and at least one type of readable storage medium. The memory provides a cache for the operation of the electronic device 1; the readable storage medium can be, for example, flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM) ), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc. non-volatile storage media. The readable storage medium may also be a volatile storage medium. In some embodiments, the readable storage medium may be an internal storage unit of the electronic device 1, such as a hard disk of the electronic device 1; in other embodiments, the storage medium may also be an external storage device of the electronic device 1, such as A pluggable hard disk, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, a flash memory card (Flash Card), etc., are equipped on the electronic device 1. In this embodiment, the readable storage medium of the memory 11 is generally used to store the operating system and various application software installed in the electronic device 1 , for example, to store the code of the product recommendation program 10 in an embodiment of the present application. In addition, the memory 11 can also be used to temporarily store various types of data that have been output or will be output.
处理器12在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器12通常用于控制所述电子设备1的总体操作,例如执行与其他设备进行数据交互或者通信相关的控制和处理等。本实施例中,所述处理器12用于运行所述存储器11中存储的程序代码或者处理数据,例如运行产品推荐程序10等。In some embodiments, the processor 12 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips. The processor 12 is generally used to control the overall operation of the electronic device 1, such as performing control and processing related to data interaction or communication with other devices. In this embodiment, the processor 12 is configured to run the program code or process data stored in the memory 11, for example, run the product recommendation program 10 and the like.
网络接口13可包括无线网络接口或有线网络接口,该网络接口13用于在所述电子设备1与客户端(图中未画出)之间建立通信连接。The network interface 13 may include a wireless network interface or a wired network interface, and the network interface 13 is used to establish a communication connection between the electronic device 1 and a client (not shown in the figure).
可选的,所述电子设备1还可以包括用户接口,用户接口可以包括显示器(Display)、输入单元比如键盘(Keyboard),可选的用户接口还可以包括标准的有线接口、无线接口。可选的,在一些实施例中,显示器可以是LED显示器、液晶显示器、触控式液晶显示器以及OLED(Organic Light-Emitting Diode,有机发光二极管)触摸器等。其中,显示器也可以适当的称为显示屏或显示单元,用于显示在电子设备1中处理的信息以及用于显示可视化的用户界面。Optionally, the electronic device 1 may further include a user interface, and the user interface may include a display (Display), an input unit such as a keyboard (Keyboard), and an optional user interface may also include a standard wired interface and a wireless interface. Optionally, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an OLED (Organic Light-Emitting Diode, organic light-emitting diode) touch device, and the like. The display may also be appropriately called a display screen or a display unit, which is used for displaying information processed in the electronic device 1 and for displaying a visualized user interface.
应该了解,所述实施例仅为说明之用,在专利申请范围上并不受此结构的限制。It should be understood that the embodiments are only used for illustration, and are not limited by this structure in the scope of the patent application.
所述电子设备1中的所述存储器11存储的产品推荐程序10是多个指令的组合,在所述处理器12中运行时,可以实现:The product recommendation program 10 stored in the memory 11 in the electronic device 1 is a combination of multiple instructions, and when running in the processor 12, can realize:
获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐 用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
具体地,所述处理器12对上述产品推荐程序10的具体实现方法可参考图1对应实施例中相关步骤的描述,在此不赘述。需要强调的是,为进一步保证上述第一、第二历史数据的私密和安全性,上述第一、第二历史数据还可以存储于一区块链的节点中。Specifically, for the specific implementation method of the above-mentioned product recommendation program 10 by the processor 12, reference may be made to the description of the relevant steps in the corresponding embodiment of FIG. 1 , which will not be repeated here. It should be emphasized that, in order to further ensure the privacy and security of the above-mentioned first and second historical data, the above-mentioned first and second historical data may also be stored in a node of a blockchain.
进一步地,所述电子设备1集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。所述计算机可读存储介质可以是非易失性的,也可以是非易失性的。所述计算机可读存储介质可以包括:能够携带所述计算机程序代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)。Further, if the modules/units integrated in the electronic device 1 are implemented in the form of software functional units and sold or used as independent products, they may be stored in a computer-readable storage medium. The computer-readable storage medium may be non-volatile or non-volatile. The computer-readable storage medium may include: any entity or device capable of carrying the computer program code, a recording medium, a U disk, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM, Read-Only Memory) ).
所述计算机可读存储介质上存储有产品推荐程序10,所述产品推荐程序10可被一个或者多个处理器执行,以实现如下步骤:A product recommendation program 10 is stored on the computer-readable storage medium, and the product recommendation program 10 can be executed by one or more processors to realize the following steps:
获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
在本申请所提供的几个实施例中,应该理解到,所揭露的设备,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In the several embodiments provided in this application, it should be understood that the disclosed apparatus, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the modules is only a logical function division, and there may be other division manners in actual implementation.
所述作为分离部件说明的模块可以是或者也可以不是物理上分开的,作为模块显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The modules described as separate components may or may not be physically separated, and the components shown as modules may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本申请各个实施例中的各功能模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能模块的形式实现。In addition, each functional module in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware, or can be implemented in the form of hardware plus software function modules.
对于本领域技术人员而言,显然本申请不限于上述示范性实施例的细节,而且在不背离本申请的精神或基本特征的情况下,能够以其他的具体形式实现本申请。It will be apparent to those skilled in the art that the present application is not limited to the details of the above-described exemplary embodiments, but that the present application can be implemented in other specific forms without departing from the spirit or essential characteristics of the present application.
因此,无论从哪一点来看,均应将实施例看作是示范性的,而且是非限制性的,本申请的范围由所附权利要求而不是上述说明限定,因此旨在将落在权利要求的等同要件的含义和范围内的所有变化涵括在本申请内。不应将权利要求中的任何附关联图标记视为限制所涉及的权利要求。Accordingly, the embodiments are to be regarded in all respects as illustrative and not restrictive, and the scope of the application is to be defined by the appended claims rather than the foregoing description, which is therefore intended to fall within the scope of the claims. All changes within the meaning and scope of the equivalents of , are included in this application. Any reference signs in the claims shall not be construed as limiting the involved claim.
本申请所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。The blockchain referred to in this application is a new application mode of computer technologies such as distributed data storage, point-to-point transmission, consensus mechanism, and encryption algorithm. Blockchain, essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information to verify its Validity of information (anti-counterfeiting) and generation of the next block. The blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
此外,显然“包括”一词不排除其他单元或步骤,单数不排除复数。系统权利要求中陈 述的多个单元或装置也可以由一个单元或装置通过软件或者硬件来实现。第二等词语用来表示名称,而并不表示任何特定的顺序。Furthermore, it is clear that the word "comprising" does not exclude other units or steps and the singular does not exclude the plural. A plurality of units or means recited in the system claims can also be implemented by one unit or means by software or hardware. Second-class terms are used to denote names and do not denote any particular order.
最后应说明的是,以上实施例仅用以说明本申请的技术方案而非限制,尽管参照较佳实施例对本申请进行了详细说明,本领域的普通技术人员应当理解,可以对本申请的技术方案进行修改或等同替换,而不脱离本申请技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present application rather than limitations. Although the present application has been described in detail with reference to the preferred embodiments, those of ordinary skill in the art should understand that the technical solutions of the present application can be Modifications or equivalent substitutions can be made without departing from the spirit and scope of the technical solutions of the present application.

Claims (20)

  1. 一种产品推荐方法,其中,所述方法包括:A product recommendation method, wherein the method includes:
    获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
    基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
    解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
    获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
    当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  2. 如权利要求1所述的产品推荐方法,其中,所述基于所述第一特征对各个用户进行分组,得到多个用户组包括:The product recommendation method according to claim 1, wherein the grouping of each user based on the first characteristic to obtain a plurality of user groups comprises:
    基于所述第一特征采用K均值聚类算法对各个用户进行分组,其中,K分别取值为预设数值范围内的各个自然数,K的一个取值对应一种分组结果,得到多种分组结果;Based on the first feature, the K-means clustering algorithm is used to group each user, wherein K respectively takes the value of each natural number within a preset value range, and one value of K corresponds to one kind of grouping result, and various grouping results are obtained. ;
    确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户,基于所述中心用户的第一特征分别计算每种分组结果对应的轮廓系数;Determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the silhouette coefficient corresponding to each kind of grouping result based on the first feature of the center user;
    将轮廓系数最接近预设数值的分组结果作为目标分组结果。The grouping result with the silhouette coefficient closest to the preset value is taken as the target grouping result.
  3. 如权利要求2所述的产品推荐方法,其中,所述确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户包括:The product recommendation method according to claim 2, wherein said determining the central user of each user group corresponding to each of the plurality of grouping results comprises:
    计算每种分组结果对应的各个用户组中所有用户的第一特征的平均值,将各个用户组中第一特征与所述平均值的差值绝对值最小的用户作为各个用户组的中心用户。Calculate the average value of the first characteristics of all users in each user group corresponding to each grouping result, and take the user with the smallest absolute value of the difference between the first characteristic and the average value in each user group as the central user of each user group.
  4. 如权利要求2所述的产品推荐方法,其中,所述轮廓系数的计算公式为:The product recommendation method according to claim 2, wherein the calculation formula of the silhouette coefficient is:
    Figure PCTCN2021123176-appb-100001
    Figure PCTCN2021123176-appb-100001
    Figure PCTCN2021123176-appb-100002
    Figure PCTCN2021123176-appb-100002
    其中,S pq表示第p种分组结果中第q个用户组对应的轮廓系数,
    Figure PCTCN2021123176-appb-100003
    表示第p种分组结果中第q个用户组的中心用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
    Figure PCTCN2021123176-appb-100004
    表示第p种分组结果中第q个用户组的中心用户的第一特征到其他用户组的第一特征的平均距离的最小值,S p表示第p种分组结果对应的轮廓系数,m表示第p种分组结果中用户组的总数量。
    Among them, S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result,
    Figure PCTCN2021123176-appb-100003
    represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group,
    Figure PCTCN2021123176-appb-100004
    Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
  5. 如权利要求1所述的产品推荐方法,其中,在将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值之后,所述方法还包括:The product recommendation method according to claim 1, wherein after inputting the second feature into an attribute analysis model to obtain the target attribute value of the user to be recommended, the method further comprises:
    若所述目标属性值大于预设阈值,则基于所述第二历史数据计算多个预设指标的指标值,当所述多个预设指标中某一指定指标对应的指标值大于指标阈值时,基于所述指定指标为所述待推荐用户推荐目标产品。If the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
  6. 如权利要求1所述的产品推荐方法,其中,所述基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组包括:The product recommendation method according to claim 1, wherein calculating the product penetration rate array corresponding to each user group in the plurality of user groups based on the first historical data comprises:
    基于所述第一历史数据确定各个用户组中每个用户对应的产品;Determine the product corresponding to each user in each user group based on the first historical data;
    计算各个用户组中各种产品对应的用户占比;Calculate the proportion of users corresponding to various products in each user group;
    基于所述用户占比确定各个用户组对应的产品渗透率数组。The product penetration rate array corresponding to each user group is determined based on the user ratio.
  7. 如权利要求1所述的产品推荐方法,其中,在基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据之后,所述方法还包括:The product recommendation method according to claim 1, wherein after acquiring the second historical data of the user to be recommended from the database based on the identifier, the method further comprises:
    若无法从所述数据库中获取所述待推荐用户的第二历史数据,则将预设产品清单推荐给所述待推荐用户。If the second historical data of the user to be recommended cannot be obtained from the database, a preset product list is recommended to the user to be recommended.
  8. 一种产品推荐装置,其中,所述装置包括:A product recommendation device, wherein the device includes:
    分组模块,用于获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;The grouping module is used for obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain multiple user groups;
    计算模块,用于基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;a calculation module, configured to calculate the product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
    解析模块,用于解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;The parsing module is configured to parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, and obtain the second historical data of the user to be recommended from the database based on the identifier , and determine the target user group corresponding to the user to be recommended;
    确定模块,用于获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;A determination module, configured to obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into The attribute analysis model obtains the target attribute value of the user to be recommended;
    推荐模块,用于当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。A recommendation module, configured to recommend a target product for the user to be recommended based on the product penetration rate array corresponding to the target user group when the target attribute value is less than a preset threshold.
  9. 一种电子设备,其中,所述电子设备包括:An electronic device, wherein the electronic device comprises:
    至少一个处理器;以及,at least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的产品推荐程序,所述产品推荐程序被所述至少一个处理器执行,以使所述至少一个处理器能够执行如下步骤:The memory stores a product recommendation program executable by the at least one processor, and the product recommendation program is executed by the at least one processor to enable the at least one processor to perform the following steps:
    获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
    基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
    解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
    获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
    当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  10. 如权利要求9所述的电子设备,其中,所述基于所述第一特征对各个用户进行分组,得到多个用户组包括:The electronic device according to claim 9, wherein the grouping of each user based on the first characteristic to obtain a plurality of user groups comprises:
    基于所述第一特征采用K均值聚类算法对各个用户进行分组,其中,K分别取值为预设数值范围内的各个自然数,K的一个取值对应一种分组结果,得到多种分组结果;Based on the first feature, the K-means clustering algorithm is used to group each user, wherein K respectively takes the value of each natural number within a preset value range, and one value of K corresponds to one kind of grouping result, and various grouping results are obtained. ;
    确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户,基于所述中心用户的第一特征分别计算每种分组结果对应的轮廓系数;Determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the silhouette coefficient corresponding to each kind of grouping result based on the first feature of the center user;
    将轮廓系数最接近预设数值的分组结果作为目标分组结果。The grouping result with the silhouette coefficient closest to the preset value is taken as the target grouping result.
  11. 如权利要求10所述的电子设备,其中,所述确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户包括:The electronic device according to claim 10, wherein the determining the central user of each user group corresponding to each of the plurality of grouping results comprises:
    计算每种分组结果对应的各个用户组中所有用户的第一特征的平均值,将各个用户组中第一特征与所述平均值的差值绝对值最小的用户作为各个用户组的中心用户。Calculate the average value of the first characteristics of all users in each user group corresponding to each grouping result, and take the user with the smallest absolute value of the difference between the first characteristic and the average value in each user group as the central user of each user group.
  12. 如权利要求10所述的电子设备,其中,所述轮廓系数的计算公式为:The electronic device according to claim 10, wherein the calculation formula of the silhouette coefficient is:
    Figure PCTCN2021123176-appb-100005
    Figure PCTCN2021123176-appb-100005
    Figure PCTCN2021123176-appb-100006
    Figure PCTCN2021123176-appb-100006
    其中,S pq表示第p种分组结果中第q个用户组对应的轮廓系数,
    Figure PCTCN2021123176-appb-100007
    表示第p种分组结果中第q个用户组的中心用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
    Figure PCTCN2021123176-appb-100008
    表示第p种分组结果中第q个用户组的中心用户的第一特征到其他用户组的第一特征的平均距离的最小值,S p表示第p种分组结果对应的轮廓系数,m表示第p种分组结果中用户组的总数量。
    Among them, S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result,
    Figure PCTCN2021123176-appb-100007
    represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group,
    Figure PCTCN2021123176-appb-100008
    Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
  13. 如权利要求9所述的电子设备,其中,在将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值之后,所述产品推荐程序被处理器执行时还实现如下步骤:The electronic device according to claim 9, wherein after the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended, the product recommendation program further implements the following steps when executed by the processor:
    若所述目标属性值大于预设阈值,则基于所述第二历史数据计算多个预设指标的指标值,当所述多个预设指标中某一指定指标对应的指标值大于指标阈值时,基于所述指定指标为所述待推荐用户推荐目标产品。If the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
  14. 如权利要求9所述的电子设备,其中,所述基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组包括:The electronic device according to claim 9, wherein calculating the product penetration rate array corresponding to each user group in the plurality of user groups based on the first historical data comprises:
    基于所述第一历史数据确定各个用户组中每个用户对应的产品;Determine the product corresponding to each user in each user group based on the first historical data;
    计算各个用户组中各种产品对应的用户占比;Calculate the proportion of users corresponding to various products in each user group;
    基于所述用户占比确定各个用户组对应的产品渗透率数组。The product penetration rate array corresponding to each user group is determined based on the user ratio.
  15. 如权利要求9所述的电子设备,其中,在基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据之后,所述产品推荐程序被处理器执行时还实现如下步骤:The electronic device according to claim 9, wherein after acquiring the second historical data of the user to be recommended from the database based on the identifier, the product recommendation program further implements the following steps when executed by the processor:
    若无法从所述数据库中获取所述待推荐用户的第二历史数据,则将预设产品清单推荐给所述待推荐用户。If the second historical data of the user to be recommended cannot be obtained from the database, a preset product list is recommended to the user to be recommended.
  16. 一种计算机可读存储介质,其中,所述计算机可读存储介质上存储有产品推荐程序,所述产品推荐程序可被一个或者多个处理器执行,以实现如下步骤:A computer-readable storage medium, wherein a product recommendation program is stored on the computer-readable storage medium, and the product recommendation program can be executed by one or more processors to realize the following steps:
    获取数据库中各个用户的第一历史数据,基于第一特征因子及所述第一历史数据确定各个用户的第一特征,基于所述第一特征对各个用户进行分组,得到多个用户组;Obtaining the first historical data of each user in the database, determining the first characteristic of each user based on the first characteristic factor and the first historical data, and grouping each user based on the first characteristic to obtain a plurality of user groups;
    基于所述第一历史数据计算所述多个用户组中每个用户组对应的产品渗透率数组;Calculate a product penetration rate array corresponding to each user group in the multiple user groups based on the first historical data;
    解析用户基于客户端发出的产品推荐请求,获取所述产品推荐请求携带的待推荐用户的标识,基于所述标识从所述数据库中获取所述待推荐用户的第二历史数据,并确定所述待推荐用户对应的目标用户组;Parse the product recommendation request sent by the user based on the client, obtain the identifier of the user to be recommended carried in the product recommendation request, acquire the second historical data of the user to be recommended from the database based on the identifier, and determine the The target user group corresponding to the user to be recommended;
    获取所述产品推荐请求对应的第二特征因子,基于所述第二特征因子及所述第二历史数据确定所述待推荐用户的第二特征,将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值;Obtain the second characteristic factor corresponding to the product recommendation request, determine the second characteristic of the user to be recommended based on the second characteristic factor and the second historical data, and input the second characteristic into the attribute analysis model to obtain the result. Describe the target attribute value of the user to be recommended;
    当所述目标属性值小于预设阈值时,基于所述目标用户组对应的产品渗透率数组为所述待推荐用户推荐目标产品。When the target attribute value is less than the preset threshold, a target product is recommended for the user to be recommended based on the product penetration rate array corresponding to the target user group.
  17. 如权利要求16所述的计算机可读存储介质,其中,所述基于所述第一特征对各个用户进行分组,得到多个用户组包括:The computer-readable storage medium of claim 16, wherein the grouping of the respective users based on the first characteristic to obtain a plurality of user groups comprises:
    基于所述第一特征采用K均值聚类算法对各个用户进行分组,其中,K分别取值为预设数值范围内的各个自然数,K的一个取值对应一种分组结果,得到多种分组结果;Based on the first feature, the K-means clustering algorithm is used to group each user, wherein K respectively takes the value of each natural number within a preset value range, and one value of K corresponds to one kind of grouping result, and various grouping results are obtained. ;
    确定所述多种分组结果中每种分组结果对应的各个用户组的中心用户,基于所述中心用户的第一特征分别计算每种分组结果对应的轮廓系数;Determine the center user of each user group corresponding to each kind of grouping result in the multiple grouping results, and calculate the silhouette coefficient corresponding to each kind of grouping result based on the first feature of the center user;
    将轮廓系数最接近预设数值的分组结果作为目标分组结果。The grouping result with the silhouette coefficient closest to the preset value is taken as the target grouping result.
  18. 如权利要求17所述的计算机可读存储介质,其中,所述确定所述多种分组结果 中每种分组结果对应的各个用户组的中心用户包括:The computer-readable storage medium of claim 17, wherein the central user of each user group corresponding to each of the plurality of grouping results is determined comprising:
    计算每种分组结果对应的各个用户组中所有用户的第一特征的平均值,将各个用户组中第一特征与所述平均值的差值绝对值最小的用户作为各个用户组的中心用户。Calculate the average value of the first characteristics of all users in each user group corresponding to each grouping result, and take the user with the smallest absolute value of the difference between the first characteristic and the average value in each user group as the central user of each user group.
  19. 如权利要求17所述的计算机可读存储介质,其中,所述轮廓系数的计算公式为:The computer-readable storage medium of claim 17, wherein the calculation formula of the silhouette coefficient is:
    Figure PCTCN2021123176-appb-100009
    Figure PCTCN2021123176-appb-100009
    Figure PCTCN2021123176-appb-100010
    Figure PCTCN2021123176-appb-100010
    其中,S pq表示第p种分组结果中第q个用户组对应的轮廓系数,
    Figure PCTCN2021123176-appb-100011
    表示第p种分组结果中第q个用户组的中心用户的第一特征到同一个用户组中其他用户的第一特征的平均距离,
    Figure PCTCN2021123176-appb-100012
    表示第p种分组结果中第q个用户组的中心用户的第一特征到其他用户组的第一特征的平均距离的最小值,S p表示第p种分组结果对应的轮廓系数,m表示第p种分组结果中用户组的总数量。
    Among them, S pq represents the silhouette coefficient corresponding to the qth user group in the pth grouping result,
    Figure PCTCN2021123176-appb-100011
    represents the average distance from the first feature of the central user of the qth user group in the pth grouping result to the first features of other users in the same user group,
    Figure PCTCN2021123176-appb-100012
    Represents the minimum value of the average distance between the first feature of the central user of the qth user group and the first features of other user groups in the pth grouping result, Sp represents the contour coefficient corresponding to the pth grouping result, m represents the th The total number of user groups in p types of grouping results.
  20. 如权利要求16所述的计算机可读存储介质,其中,在将所述第二特征输入属性分析模型得到所述待推荐用户的目标属性值之后,所述产品推荐程序被处理器执行时还实现如下步骤:The computer-readable storage medium according to claim 16, wherein after the second feature is input into the attribute analysis model to obtain the target attribute value of the user to be recommended, the product recommendation program further implements when the processor is executed. Follow the steps below:
    若所述目标属性值大于预设阈值,则基于所述第二历史数据计算多个预设指标的指标值,当所述多个预设指标中某一指定指标对应的指标值大于指标阈值时,基于所述指定指标为所述待推荐用户推荐目标产品。If the target attribute value is greater than the preset threshold, calculate the index values of multiple preset indexes based on the second historical data, and when the index value corresponding to a specified index among the multiple preset indexes is greater than the index threshold , recommending a target product for the user to be recommended based on the specified index.
PCT/CN2021/123176 2020-11-02 2021-10-12 Product recommendation method and apparatus, and electronic device and readable storage medium WO2022089190A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011206222.0A CN112148995A (en) 2020-11-02 2020-11-02 Product recommendation method and device, electronic equipment and readable storage medium
CN202011206222.0 2020-11-02

Publications (1)

Publication Number Publication Date
WO2022089190A1 true WO2022089190A1 (en) 2022-05-05

Family

ID=73953773

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/123176 WO2022089190A1 (en) 2020-11-02 2021-10-12 Product recommendation method and apparatus, and electronic device and readable storage medium

Country Status (2)

Country Link
CN (1) CN112148995A (en)
WO (1) WO2022089190A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116522105A (en) * 2023-02-01 2023-08-01 深圳中汇能科技有限公司 Method, device, equipment and medium for integrally constructing data based on cloud computing

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112148995A (en) * 2020-11-02 2020-12-29 深圳壹账通智能科技有限公司 Product recommendation method and device, electronic equipment and readable storage medium
CN113158053B (en) * 2021-04-25 2022-09-30 平安科技(深圳)有限公司 Service product recommendation method and device, computer equipment and storage medium
CN113627653B (en) * 2021-07-14 2023-10-20 深圳索信达数据技术有限公司 Method and device for determining activity prediction strategy of mobile banking user
CN113407857A (en) * 2021-08-19 2021-09-17 明品云(北京)数据科技有限公司 Insurance recommendation method, system, device and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106919647A (en) * 2017-01-17 2017-07-04 浙江工业大学 A kind of network structure similitude based on cluster recommends method
CN108492194A (en) * 2018-03-06 2018-09-04 平安科技(深圳)有限公司 Products Show method, apparatus and storage medium
CN109447728A (en) * 2018-09-07 2019-03-08 平安科技(深圳)有限公司 Financial product recommended method, device, computer equipment and storage medium
CN110825723A (en) * 2019-10-09 2020-02-21 上海电力大学 Residential user classification method based on power load analysis
CN112148995A (en) * 2020-11-02 2020-12-29 深圳壹账通智能科技有限公司 Product recommendation method and device, electronic equipment and readable storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106919647A (en) * 2017-01-17 2017-07-04 浙江工业大学 A kind of network structure similitude based on cluster recommends method
CN108492194A (en) * 2018-03-06 2018-09-04 平安科技(深圳)有限公司 Products Show method, apparatus and storage medium
CN109447728A (en) * 2018-09-07 2019-03-08 平安科技(深圳)有限公司 Financial product recommended method, device, computer equipment and storage medium
CN110825723A (en) * 2019-10-09 2020-02-21 上海电力大学 Residential user classification method based on power load analysis
CN112148995A (en) * 2020-11-02 2020-12-29 深圳壹账通智能科技有限公司 Product recommendation method and device, electronic equipment and readable storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116522105A (en) * 2023-02-01 2023-08-01 深圳中汇能科技有限公司 Method, device, equipment and medium for integrally constructing data based on cloud computing
CN116522105B (en) * 2023-02-01 2023-09-22 深圳中汇能科技有限公司 Method, device, equipment and medium for integrally constructing data based on cloud computing

Also Published As

Publication number Publication date
CN112148995A (en) 2020-12-29

Similar Documents

Publication Publication Date Title
WO2022089190A1 (en) Product recommendation method and apparatus, and electronic device and readable storage medium
US11665072B2 (en) Parallel computational framework and application server for determining path connectivity
WO2022105115A1 (en) Question and answer pair matching method and apparatus, electronic device and storage medium
JP6371870B2 (en) Machine learning service
US20200192894A1 (en) System and method for using data incident based modeling and prediction
US20200265293A1 (en) Secure broker-mediated data analysis and prediction
WO2019169756A1 (en) Product recommendation method and apparatus, and storage medium
WO2020048051A1 (en) Financial product recommendation method, server and computer readable storage medium
WO2021174944A1 (en) Message push method based on target activity, and related device
US20140006166A1 (en) System and method for determining offers based on predictions of user interest
Amram et al. Optimal policy trees
WO2022105496A1 (en) Intelligent follow-up contact method and apparatus, and electronic device and readable storage medium
US20200357063A1 (en) Systems and computer-implemented processes for model-based underwriting
EP3459208A1 (en) Systems and methods for determining an optimal strategy
WO2022160442A1 (en) Answer generation method and apparatus, electronic device, and readable storage medium
CN112597135A (en) User classification method and device, electronic equipment and readable storage medium
CN111639077A (en) Data management method and device, electronic equipment and storage medium
US20160232376A1 (en) Privacy fractal mirroring of transaction data
CN114706985A (en) Text classification method and device, electronic equipment and storage medium
WO2019192130A1 (en) Customer classification method and device and storage medium
CN112395401A (en) Adaptive negative sample pair sampling method and device, electronic equipment and storage medium
WO2021174881A1 (en) Multi-dimensional information combination prediction method, apparatus, computer device, and medium
CN115204971A (en) Product recommendation method and device, electronic equipment and computer-readable storage medium
CN113987351A (en) Artificial intelligence based intelligent recommendation method and device, electronic equipment and medium
US9892462B1 (en) Heuristic model for improving the underwriting process

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21884919

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 07.08.2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21884919

Country of ref document: EP

Kind code of ref document: A1