WO2019061976A1 - Fund product recommendation method and apparatus, terminal device, and storage medium - Google Patents

Fund product recommendation method and apparatus, terminal device, and storage medium Download PDF

Info

Publication number
WO2019061976A1
WO2019061976A1 PCT/CN2018/074570 CN2018074570W WO2019061976A1 WO 2019061976 A1 WO2019061976 A1 WO 2019061976A1 CN 2018074570 W CN2018074570 W CN 2018074570W WO 2019061976 A1 WO2019061976 A1 WO 2019061976A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
data
cluster
fund
target
Prior art date
Application number
PCT/CN2018/074570
Other languages
French (fr)
Chinese (zh)
Inventor
姬马婧雯
何军
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019061976A1 publication Critical patent/WO2019061976A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/06Asset management; Financial planning or analysis

Definitions

  • the present application relates to the field of financial data processing, and in particular, to a fund product recommendation method, device, terminal device and storage medium.
  • Fund investment refers to a kind of financial management means for investors to purchase fund products through the fund trading system to realize the management and distribution of assets.
  • the fund products are divided into fund types such as stock type, index type, hybrid type, bond type and currency type. Users select fund products of different fund types according to fund type and investment conditions. transaction.
  • the current fund trading system does not have the function of recommending fund products based on the user's investment conditions, which makes investors' investment conditions and fund products to be positioned, which affects the return rate of investment funds due to low positioning accuracy.
  • the application provides a fund product recommendation method, device, terminal device and storage medium to solve the problem that the current fund transaction system does not have the fund product recommendation based on the user's investment condition.
  • the present application provides a fund product recommendation method, including:
  • the current user portrait data including at least one current feature data
  • the target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  • the application provides a fund product recommendation device, including:
  • a current user image data obtaining module configured to acquire current user portrait data, where the current user portrait data includes at least one current feature data
  • a user data model obtaining module configured to acquire a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
  • a target clustering cluster determining module configured to acquire, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
  • the target fund product determining module is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
  • the present application provides a terminal device including a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, the processor executing the computer The following steps are implemented when reading the instruction:
  • the current user portrait data including at least one current feature data
  • the target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  • the present application provides a computer readable storage medium storing computer readable instructions that, when executed by a processor, implement the following steps:
  • the current user portrait data including at least one current feature data
  • the target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  • the target risk assessment value is determined based on the current user portrait data and the user data type, and the target target fund product is determined by using the target risk assessment value, so that The target fund products recommended to the target users accurately locate the user's own investment conditions and fund products, which helps the target users to improve the investment return rate.
  • Embodiment 1 is a flow chart of a fund product recommendation method in Embodiment 1.
  • FIG. 3 is a specific flow chart of step S50 of FIG. 2.
  • step S30 of FIG. 1 is a specific flow chart of step S30 of FIG. 1.
  • FIG. 5 is a specific flowchart of step S40 in FIG. 1.
  • FIG. 6 is a schematic block diagram of a fund product recommendation device in Embodiment 2.
  • Figure 7 is a schematic diagram of a terminal device in Embodiment 4.
  • FIG. 1 shows a flow chart of a fund product recommendation method in this embodiment.
  • the fund product recommendation method is applied in the fund trading system, and is used to recommend the target fund product to the user according to the investment condition of the user, so as to avoid the investor's low investment accuracy of the investment condition and the fund product, which affects the profit rate of the investment fund.
  • the fund product recommendation method includes the following steps:
  • S10 Acquire current user portrait data, and the current user portrait data includes at least one current feature data.
  • the current user portrait data is user portrait data of a user who needs to recommend a fund product.
  • the user who needs to recommend the fund product is simply referred to as the target user.
  • User portraits are a tagged user model abstracted from information such as user social attributes, lifestyle habits, and consumer behavior.
  • the user portrait data is data for constructing the user portrait.
  • the current user portrait data can reflect the investment conditions of the target user.
  • the current feature data is data related to the target user's own investment conditions.
  • the current feature data includes, but is not limited to, age, occupation, income, investment experience, investment ratio, risk preference, and bearish loss value in the present embodiment. That is, the current user portrait data includes at least one current feature data, specifically, at least one of age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value.
  • the age in the current feature data is the age of the target user, and the age of the target user is related to the fund type of the fund product that the target user may purchase. Generally speaking, the older the target user is, the more biased it is to purchase a fund product with lower risk. Conversely, the younger the target user is, the more biased it is to purchase a fund product with higher risk.
  • the occupation in the current feature data is the occupation of the target user. The occupation of the target user is generally related to the personality of the target user. The target users with different personalities may select fund products with different risks; otherwise, the target users with the same personality may choose to purchase the same risk. Fund products.
  • the income in the current characteristic data may be the disposable income of the target user, or the net income of the target user, and the income may also affect the fund type of the target user selecting the fund product.
  • the investment experience in the current characteristic data refers to the experience of the target user in investing in funds or investing in other financial wealth management products. The less general investment experience, the more biased it is to purchase fund products with lower risk.
  • the proportion of investment in the current characteristic data refers to the ratio of the target user's purchase of the fund product to all the investment. According to the proportion of the investment, it can be determined which kind of fund product the user prefers to purchase.
  • the risk preference in the current feature data is the risk preference of the target user, and the fund products of different risks are recommended according to the risk preference.
  • the value of the loss in the current characteristic data is the value that the target user can bear the loss, and the fund product with different risk can be recommended based on the loss value.
  • S20 Acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
  • the user data model is a model in which the fund trading system obtains a risk assessment value associated with an investment fund after training based on training user portrait data.
  • the training user portrait data is user image data of the training user
  • the training user portrait data is data for training the user data model.
  • the training user is a user who has previously opened an account in the fund trading system and conducted a fund transaction.
  • the clustering cluster is a collection of similar training user image data obtained by clustering the training user image data by using a clustering algorithm.
  • Each cluster cluster corresponds to a risk assessment value, which is an evaluation value of the risk that the training user corresponding to the training user portrait data in any cluster cluster can bear the risk when investing in the fund. Understandably, the risk assessment value is related to the fund type of the fund product.
  • the training user with the higher risk assessment value tends to invest in the fund product with high risk and high return. On the contrary, the less the risk assessment value, the more the training user is biased. Fund products with low investment risk and low returns.
  • the fund product recommendation method further includes:
  • the training feature data is data related to training the user's own investment conditions.
  • the training characteristic data includes, but is not limited to, age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value in the present embodiment.
  • the current user portrait data includes at least one training feature data, specifically, at least one of age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value.
  • training the user data model based on the training user portrait data refers to clustering at least one training feature data in the training user portrait data, and using the set of similar training user portrait data as a cluster cluster to image all the training users.
  • the data is divided into at least two cluster clusters, and the risk assessment values corresponding to each cluster cluster are obtained, thereby forming a user data model.
  • the age in the training feature data is the age of the training user, and the age of the training user is related to the fund type of the fund product that the training user may purchase. Generally speaking, the older the training user is, the more he prefers to buy a fund product with lower risk. On the contrary, the younger the training user, the more he prefers to buy a higher risk fund product.
  • the occupation in the training characteristic data is the occupation of the training user. The occupation of the training user is generally related to the personality of the training user. The training users with different personality may select the fund products with different risks; on the contrary, the training users with the same personality may choose to purchase the same risk. Fund products.
  • the income in the training characteristic data may be the disposable income of the training user, or the net income of the training user, and the income may also affect the fund type of the training user to select the fund product.
  • the investment experience in the training characteristic data refers to the experience of training users in investing in funds or investing in other financial wealth management products. The less general investment experience, the more biased it is to purchase fund products with lower risk.
  • the proportion of investment in the training characteristic data refers to the ratio of training users to purchase fund products in all investments. According to the proportion of investment, it can determine which kind of risk fund products the user prefers to purchase.
  • the risk preference in the training feature data is to train the user's risk preference, and recommend different risk fund products according to the risk preference.
  • the value of the loss in the training characteristic data is the value that the training user can bear the loss, and the fund product with different risk can be recommended based on the loss value.
  • step S50 the user data model is trained based on the training user image data, and specifically includes the following steps:
  • S51 Perform normalization processing on at least one training feature data in the training user portrait data, so that the training user image data includes at least one standard feature data.
  • data normalization is to scale the data to a small specific interval, to remove the unit limit of the data, and convert it into a pure value of the infinite level, which is convenient for different units or magnitude indicators.
  • a conversion process may be performed using min-max normalization to acquire at least one standard feature data.
  • min-max normalization is also called deviation normalization, which refers to the process of linearly transforming the original data by using a conversion function to make the result fall into a preset interval, wherein the conversion function Min is the minimum value of the sample data, max is the maximum value of the sample data, and N is the interval size of the preset interval. It can be understood that if N is 1, the result after the min-max normalization process falls within the range of [0, 1]; if N is 10, the result after the min-max normalization process falls in [0] , 10] within this range.
  • K-means clustering algorithm is used to cluster at least one standard feature data in the training user image data, and at least two clustering clusters are obtained, and each clustering cluster corresponds to a centroid user image data.
  • K-means clustering algorithm is a clustering algorithm based on distance evaluation similarity, that is, the closer the distance between two objects, the larger the similarity is.
  • the K-means clustering algorithm evaluates the similarity of two objects according to the Euclidean distance by calculating the Euclidean distance of two objects.
  • Euclidean metric also known as Euclidean metric refers to the true distance between two points in m-dimensional space, or the natural length of the vector (ie, the distance from the point to the origin). Euclidean distance of any two n-dimensional vectors a(X i1 ,X i2 ,...,X in ) and b(X j1 ,X j2 ,...,X jn )
  • the training user portrait data is user portrait data of a training user for training a user data model, and the training user portrait data includes at least one training feature data.
  • Step S51 Convert each training feature data in the training user image data into standard feature data, so that in step S52, the K-means clustering algorithm is used to cluster at least one standard feature data in the training user image data to obtain at least Two clustering clusters, each of which includes training user portrait data corresponding to a plurality of training users.
  • there is a centroid user image data corresponding to the centroid user in the training user image data of the plurality of training users so that the sum of the distances of the other training user image data to the centroid user portrait data is minimized.
  • the centroid user portrait data is one of all training user portrait data in any cluster cluster, and therefore, the centroid user portrait data also includes at least one standard feature data converted from the training feature data.
  • Step S52 will be described in detail below with reference to specific examples.
  • the training user image data of the m training users is acquired, and at least one training feature data in the training user image data is normalized to form at least one standard feature data, and the user data matrix R is obtained (as shown in the following table).
  • the user data matrix R is an m*n matrix, m is the number of training users, and n is the number of standard feature data.
  • the comparison table of the occupational and standard characteristic data may be preset, and the comparison table may be obtained based on the training characteristic data of the occupation in the training user portrait data, and the corresponding standard feature may be obtained. data.
  • different standardized scores may be enumerated according to occupations, and each standardized score corresponds to different occupations of different enterprises, so as to query corresponding standardized scores based on the training characteristic data of the occupation in the training user portrait data, as Corresponding standard feature data.
  • other feature data is standardized by using a conversion function or a comparison table.
  • Step (1) an n-dimensional map is established, and m data points in the n-dimensional graph are drawn according to the values of the standard feature data of each training user Ui in the user data matrix R. Ui, where i ⁇ m, each data point Ui corresponds to a training user.
  • step (3) one data point Ui is randomly selected in each data set Gj as the centroid Ci such that there are K centroids Ci in all data sets.
  • step (4) the Euclidean distance Di of any of the data points Ui and the K centroids Gi in each data set Gj is calculated, and the data points Ui are classified into a data set Gj having the smallest Euclidean distance Di.
  • step (5) all data points Ui are executed in step (4) to form a new data set G.
  • S53 weighting the centroid user image data by using a weighting algorithm to determine that the centroid user image data corresponds to a risk assessment value.
  • the weighting algorithm is among them, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data.
  • the weight of each standard feature data may be a value obtained by statistically processing the standard feature data of each training user's portrait data in advance using a multivariate linear regression model, so as to be directly called when a risk assessment is required.
  • x 0 1
  • is the row vector
  • the row vector contains the parameters in the linear regression model
  • X is the sample feature matrix.
  • S54 Acquire a user data model based on the clustering cluster and the risk assessment value.
  • the centroid user image data of each cluster cluster and the same cluster cluster is similar, and the risk evaluation value corresponding to the centroid user portrait data can be used as the risk evaluation value of other training user portrait data in the clustering cluster to determine the user data model.
  • S60 Store the user data model in a database.
  • the user data model trained in step S50 is stored in MySQL, Oracle or other database, so that the pre-trained user data model is called from the database when the fund product needs to be recommended to the target user.
  • step S20 includes: obtaining a user data model from a database. Since the user data model is pre-trained and stored in the database, when the user data model needs to be recommended for the fund product, the user data model can be directly called from the database, and the corresponding recommendation processing can be performed, and the operation process is simple and fast.
  • S30 Acquire, according to the current user portrait data, a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters.
  • the target clustering cluster refers to the cluster cluster in which the centroid user portrait data closest to the current user image data is located.
  • the target clustering cluster specifically refers to a clustering cluster corresponding to the centroid user portrait data closest to the current user portrait data formed by the at least one current feature data.
  • step S30 specifically includes the following steps:
  • S31 Calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
  • K cluster clusters are stored in the user data model, and each cluster cluster corresponds to a centroid user image data.
  • the current user image data is an n-dimensional vector a (X i1 , X i2 ,. .., X in )
  • the centroid user image data of any cluster cluster is n-dimensional vector b (X j1 , X j2 ,..., X jn )
  • the current user image data and the centroid user image data are Distance
  • the dimension n of the vector a corresponds to the number of current feature data in the current user portrait data
  • the dimension n of the vector b corresponds to the number of training feature data in the centroid user portrait data.
  • step S31 at least one current feature data in the current user image data needs to be normalized to include at least one standard feature data in the current user image data, so as to calculate current user image data and a centroid user image.
  • the Euclidean distance between the data helps to simplify the calculation process and improve the calculation efficiency.
  • the process of normalizing at least one current training data in the current user image data is the same as the process of normalizing at least one training feature data in the training user image data. To avoid repetition, details are not described herein.
  • S32 Select a cluster cluster of the centroid user image data corresponding to the minimum value of the at least two Euclidean distances as the target cluster cluster corresponding to the at least one current feature data.
  • the cluster class of the centroid user portrait data corresponding to the minimum value is selected from the K Euclidean distances D a,b .
  • the cluster is a target cluster class cluster corresponding to at least one current feature data in the current user portrait data.
  • S40 Determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
  • the risk assessment value corresponding to the target cluster cluster is defined as the target risk assessment value.
  • the centroid user image data of each cluster cluster is weighted by a weighting algorithm to determine a risk assessment value corresponding to the centroid user portrait data, and the risk assessment value is used as a corresponding cluster cluster.
  • Risk assessment value is used as the target risk assessment value.
  • the target clustering cluster is one of at least two clustering clusters, and the risk assessment value corresponding to the target clustering cluster may be used as the target risk assessment value.
  • the target fund product is a fund product recommended by the fund trading system based on the obtained target risk assessment value to the target user.
  • the target fund product is associated with the current user portrait data of the target user, so that the target fund product meets the investment condition of the target user; and the target fund product is recommended based on the target risk assessment value, so that it is related to the fund type of the fund product,
  • the positioning of fund products is more accurate.
  • the fund trading system displays the target fund product through a terminal such as a smart phone or a tablet computer, so that the target user understands the target fund product, and performs fund trading based on the target fund product to assist the user.
  • a terminal such as a smart phone or a tablet computer
  • step S40 specifically includes the following steps:
  • S41 Determine a corresponding target fund type based on the risk assessment value corresponding to the target clustering cluster.
  • the type of fund is based on the type of risk of the fund.
  • the fund types include, but are not limited to, stock type, index type, hybrid type, bond type, and currency type.
  • the corresponding relationship between fund risk and fund type is preset. If the risk range of the fund is (0, 10), divide the risk risk interval (0, 2), (2, 4), (4, 6], (6, 8) and (8, 10) into five.
  • the risk levels correspond to the five fund types of currency type, bond type, hybrid type, index type and stock type, and make the fund products of the fund trading system uniquely correspond to one type of fund. In this embodiment, it can be standardized. Therefore, the obtained target risk assessment value is within the interval (0, 10), so that the corresponding fund type is directly determined according to the target risk assessment value.
  • the fund products to be recommended are all fund products of the fund trading system whose fund type is the target fund type.
  • the fund evaluation index is used to evaluate the quality of the fund product.
  • the fund evaluation index is related to the investment income and/or investment risk of the fund product.
  • the fund evaluation index is associated with the fund type, and the fund evaluation index for evaluating the corresponding fund product may be determined based on the target fund type.
  • the fund evaluation index includes, but is not limited to, an average return rate, an alpha coefficient, a standard deviation, a beta coefficient, a morning star risk coefficient, a Sharpe ratio, and an R square in the present embodiment.
  • the average rate of return is an indicator related to income, which is used to evaluate the indicator of return on investment. The larger the average rate of return, the better.
  • the alpha coefficient is an indicator related to income and is a relative index. The larger the alpha coefficient, the greater the ability of the fund to obtain excess returns.
  • the standard deviation is a risk-related indicator, reflecting the fluctuation range of the fund recovery rate. The smaller the standard deviation, the better. Specifically, it refers to the deviation of the fund's monthly rate of return relative to the average monthly rate of return.
  • the beta coefficient is a risk-related indicator used to measure price volatility, which is used to assess the volatility of a stock or a stock fund relative to the entire market. The bigger the beta coefficient is, the better the bull market or the rising phase, and the smaller the beta coefficient in the bear market or the down phase, the better.
  • the Morningstar risk factor is a risk-related indicator used to calculate the risk of a downward float relative to a similar fund over a certain period of time. The greater the Morningstar risk indicator, the greater the risk of downward fluctuation. Therefore, the Morningstar risk coefficient is as small as possible.
  • the Sharpe ratio is an indicator related to both income and risk, and is a standardized indicator of fund performance evaluation.
  • R-square is a measure related to both income and risk, which is used to reflect the change of performance.
  • R-square is a measure of the extent to which a change in the performance of a fund can be explained by the change of the benchmark index, from 0 to 100, the closer to 100 The more reliable the alpha and beta coefficients are.
  • the fund types according to different risk levels may be recommended according to different fund evaluation indicators.
  • the fund products of the two types of funds such as currency type and bond type
  • the fund product can be based on the Sharpe ratio.
  • R-square two fund-based evaluation indicators based on income and risk
  • fund products of both index type and stock type there are three risk-based fund evaluations based on standard deviation, beta coefficient and morningstar risk coefficient. Indicators are recommended.
  • the quick-sorting algorithm is used to sort the recommended fund products, and the target fund products are determined.
  • the basic idea of the quick sort algorithm is to divide the data to be sorted into two independent parts by one sorting, in which all the data of one part is smaller than all the data of the other part, and then the two parts of data are used according to this method.
  • Quick sorting is performed separately, and the entire sorting process can be performed recursively, so that the entire data becomes an ordered sequence.
  • the fast sorting algorithm is the fastest algorithm in the internal sorting algorithm based on keyword comparison, and the algorithm is highly efficient.
  • the quick-sorting algorithm is used to sort the fund products to be recommended by the fund trading system, and the target fund products are obtained through the sorting result of the fund evaluation indexes corresponding to the fund products to be recommended, and the smart phone is obtained through the smart phone.
  • the display interface of the terminal such as a tablet computer displays the target fund product. For example, taking the fund evaluation index as the average rate of return, using the quick sort algorithm to rank the average rate of return of the recommended fund products, the sorting result of the average rate of return can be obtained, and the highest rate of return among the ranked results corresponds to the fund to be recommended.
  • Target fund products For example, taking the fund evaluation index as the average rate of return, using the quick sort algorithm to rank the average rate of return of the recommended fund products, the sorting result of the average rate of return can be obtained, and the highest rate of return among the ranked results corresponds to the fund to be recommended.
  • Target fund products For example, taking the fund evaluation index as the average rate of return, using the quick sort algorithm to rank the average rate of return of
  • the target fund products viewed by the target users through terminals such as smart phones and tablet computers are sorted according to the fund evaluation indicators, so that the target users can understand the target fund products that match the investment conditions of the users, and may have It will help improve the accuracy of investors' purchase of fund products and reduce the risk of fund purchases.
  • the fund product recommendation method further includes: obtaining a product recommendation instruction, so that the current user portrait data is acquired based on the product recommendation instruction in step S10, and the training is invoked based on the product recommendation instruction in step S20.
  • the “fund recommendation” button may be displayed in the display interface, and the user may click the “fund recommendation” button to name the fund trading system to obtain the product recommendation instruction.
  • the user may preset settings, and when the user logs into the fund trading system by using the pre-registered login account, the fund trading system may be triggered to obtain the product recommendation instruction, so that the display interface displays the target fund product.
  • the target risk assessment value is determined based on the current user portrait data and the user data type, and the target target fund product is determined by using the target risk assessment value, so that the target fund recommended to the target user is obtained.
  • the product accurately locates the user's own investment conditions and fund products, which helps the target users to improve the investment return rate.
  • Fig. 6 is a block diagram showing the principle of the fund product recommendation device corresponding to the fund product recommendation method in the first embodiment.
  • the fund product recommendation device includes a current user portrait data acquisition module 10, a user data model acquisition module 20, a target cluster cluster determination module 30, and a target fund product determination module 40.
  • the implementation functions of the current user portrait data obtaining module 10, the user data model invoking module 20, the target clustering cluster determining module 30, and the target fund product determining module 40 correspond to the steps corresponding to the fund product recommendation method in the first embodiment. In order to avoid redundancy, the present embodiment will not be described in detail.
  • the current user portrait data obtaining module 10 is configured to acquire current user portrait data, and the current user portrait data includes at least one current feature data.
  • the user data model obtaining module 20 is configured to acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
  • the target clustering cluster determining module 30 is configured to acquire a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters based on the current user image data.
  • the target fund product determining module 40 is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
  • the fund product recommendation device further includes a user data model training module 50 and a user data model storage module 60.
  • the user data model training module 50 is configured to train the user data model based on the training user image data, and the training user image data includes at least one training feature data.
  • the user data model storage module 60 is configured to store the user data model in a database.
  • the user data model obtaining module 20 is configured to obtain a user data model from a database.
  • the user data model training module 50 includes a normalization processing unit 51, a cluster cluster acquisition unit 52, a risk evaluation value acquisition unit 53, and a data model acquisition unit 54.
  • a normalization processing unit 51 configured to perform normalization processing on at least one standard feature data in the training user image data, so that the training user image data includes at least one standard feature data;
  • the clustering cluster acquiring unit 52 is configured to cluster at least one standard feature data in the training user portrait data by using a K-means clustering algorithm, and acquire at least two clustering clusters, each cluster cluster corresponding to a centroid User portrait data.
  • the risk evaluation value obtaining unit 53 is configured to perform weighting processing on the centroid user portrait data by using a weighting operation algorithm, and determine that the centroid user portrait data corresponds to a risk evaluation value; the weighting operation algorithm is among them, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data.
  • the data model obtaining unit 54 is configured to acquire a user data model based on the clustering cluster and the risk assessment value.
  • the target clustering class cluster determining module 30 includes an Euclidean distance acquiring unit 31 and a target clustering cluster selecting unit 32.
  • the Euclidean distance obtaining unit 31 is configured to calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
  • the target clustering cluster selection unit 32 is configured to select a cluster cluster in which the centroid user portrait data corresponding to the minimum value of the at least two Euclidean distances is located as the target cluster cluster corresponding to the at least one current feature data.
  • the target fund product determining module 40 includes a fund type determining unit 41, an evaluation index obtaining unit 42, and a target fund product determining unit 43.
  • the fund type determining unit 41 is configured to determine a target fund type corresponding to the target risk assessment value based on the target risk assessment value.
  • the evaluation index obtaining unit 42 is configured to obtain the fund products to be recommended and the fund evaluation indicators corresponding to the fund type according to the fund type.
  • the target fund product determining unit 43 is configured to sort the recommended fund products by using a quick sorting algorithm according to the fund evaluation index, and determine the target fund product.
  • the embodiment provides a computer readable storage medium having stored thereon computer readable instructions, which are implemented by the processor to implement the fund product recommendation method in Embodiment 1, in order to avoid duplication, here No longer.
  • the computer readable instructions are executed by the processor, the functions of the modules/units in the fund product recommendation device in Embodiment 2 are implemented. To avoid repetition, details are not described herein again.
  • FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application.
  • the terminal device 70 of this embodiment includes a processor 71, a memory 72, and computer readable instructions 73 stored in the memory 72 and operable on the processor 71, the processor 71 executing computer readable instructions
  • Each step of the fund product recommendation method in Embodiment 1 is implemented at 73 hours, such as steps S10, S20, S30, and S40 shown in FIG.
  • the processor 71 executes the computer readable instructions 73, the functions of the modules/units of the fund product recommendation device in Embodiment 2 are implemented.
  • the current user profile data acquisition module 10 the user data model acquisition module 20, and the target aggregation
  • the class cluster determination module 30 and the target fund product determination module 40 function.
  • computer readable instructions 73 may be partitioned into one or more modules/units, one or more modules/units being stored in memory 72 and executed by processor 71 to complete the application.
  • the one or more modules/units may be an instruction segment of a series of computer readable instructions 73 capable of performing a particular function, which is used to describe the execution of computer readable instructions 73 in the terminal device 70.
  • the computer readable instructions 73 may be divided into a current user portrait data acquisition module 10, a user data model acquisition module 20, a target cluster cluster determination module 30, a target fund product determination module 40, a user data model training module 50, and a user.
  • the data model storage module 60 has the following functions:
  • the current user portrait data obtaining module 10 is configured to acquire current user portrait data, and the current user portrait data includes at least one current feature data.
  • the user data model obtaining module 20 is configured to acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
  • the target clustering cluster determining module 30 is configured to acquire a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters based on the current user image data.
  • the target fund product determining module 40 is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
  • the fund product recommendation device further includes a user data model training module 50 and a user data model storage module 60.
  • the user data model training module 50 is configured to train the user data model based on the training user image data, and the training user image data includes at least one training feature data.
  • the user data model storage module 60 is configured to store the user data model in a database.
  • the user data model obtaining module 20 is configured to obtain a user data model from a database.
  • the user data model training module 50 includes a normalization processing unit 51, a cluster cluster acquisition unit 52, a risk evaluation value acquisition unit 53, and a data model acquisition unit 54.
  • a normalization processing unit 51 configured to perform normalization processing on at least one standard feature data in the training user image data, so that the training user image data includes at least one standard feature data;
  • the clustering cluster acquiring unit 52 is configured to cluster at least one standard feature data in the training user portrait data by using a K-means clustering algorithm, and acquire at least two clustering clusters, each cluster cluster corresponding to a centroid User portrait data.
  • the risk evaluation value obtaining unit 53 is configured to perform weighting processing on the centroid user portrait data by using a weighting operation algorithm, and determine that the centroid user portrait data corresponds to a risk evaluation value; the weighting operation algorithm is among them, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data.
  • the data model obtaining unit 54 is configured to acquire a user data model based on the clustering cluster and the risk assessment value.
  • the target clustering class cluster determining module 30 includes an Euclidean distance acquiring unit 31 and a target clustering cluster selecting unit 32.
  • the Euclidean distance obtaining unit 31 is configured to calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
  • the target clustering cluster selection unit 32 is configured to select a cluster cluster in which the centroid user portrait data corresponding to the minimum value of the at least two Euclidean distances is located as the target cluster cluster corresponding to the at least one current feature data.
  • the target fund product determining module 40 includes a fund type determining unit 41, an evaluation index obtaining unit 42, and a target fund product determining unit 43.
  • the fund type determining unit 41 is configured to determine a target fund type corresponding to the target risk assessment value based on the target risk assessment value.
  • the evaluation index obtaining unit 42 is configured to obtain the fund products to be recommended and the fund evaluation indicators corresponding to the fund type according to the fund type.
  • the target fund product determining unit 43 is configured to sort the recommended fund products by using a quick sorting algorithm according to the fund evaluation index, and determine the target fund product.
  • the terminal device 70 can be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server.
  • the terminal device may include, but is not limited to, a processor 71, a memory 72. It will be understood by those skilled in the art that FIG. 7 is only an example of the terminal device 70, and does not constitute a limitation of the terminal device 70, and may include more or less components than those illustrated, or combine some components, or different components.
  • the terminal device may further include an input/output device, a network access device, a bus, and the like.
  • the processor 71 may be a central processing unit (CPU), or may be other general-purpose processors, a digital signal processor (DSP), an application specific integrated circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc.
  • the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
  • the memory 72 may be an internal storage unit of the terminal device 70, such as a hard disk or a memory of the terminal device 70.
  • the memory 72 may also be an external storage device of the terminal device 70, such as a plug-in hard disk provided on the terminal device 70, a smart memory card (SMC), a Secure Digital (SD) card, and a flash memory card (Flash). Card) and so on.
  • the memory 72 may also include both an internal storage unit of the terminal device 70 and an external storage device.
  • the memory 72 is used to store computer readable instructions 73 and other programs and data required by the terminal device.
  • the memory 72 can also be used to temporarily store data that has been or will be output.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated modules/units if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium.
  • the present application implements all or part of the processes in the foregoing embodiments, and may also be implemented by computer readable instructions, which may be stored in a computer readable storage medium.
  • the computer readable instructions when executed by a processor, may implement the steps of the various method embodiments described above.
  • the computer readable instructions comprise computer readable instruction code, which may be in the form of source code, an object code form, an executable file or some intermediate form or the like.
  • the computer readable medium can include any entity or device capable of carrying the computer readable instruction code, a recording medium, a USB flash drive, a removable hard drive, a magnetic disk, an optical disk, a computer memory, a read only memory (ROM, Read-Only) Memory), random access memory (RAM), electrical carrier signals, telecommunications signals, and software distribution media.
  • a recording medium a USB flash drive
  • a removable hard drive a magnetic disk, an optical disk
  • a computer memory a read only memory (ROM, Read-Only) Memory
  • RAM random access memory

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Operations Research (AREA)
  • Technology Law (AREA)
  • Human Resources & Organizations (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

The present application discloses a fund product recommendation method and apparatus, a terminal device, and a storage medium. Said fund product recommendation method comprises: acquiring current user profile data, the current user profile data comprising at least one piece of current characteristic data; acquiring a user data model, the user data model comprising at least two clustered clusters, each clustered cluster corresponding to a risk evaluation value; on the basis of the current user profile data, acquiring from the at least two clustered clusters a target clustered cluster corresponding to the at least one piece of current characteristic data; and on the basis of a risk evaluation value corresponding to the target clustered cluster, determining a target fund product. Said fund product recommendation method can recommend a fund product on the basis of the investment condition of a user, improving the accuracy of positioning of fund products made by investors.

Description

基金产品推荐方法、装置、终端设备及存储介质Fund product recommendation method, device, terminal device and storage medium
本专利申请以2017年9月28日提交的申请号为201710899347.8,名称为“基于基金产品推荐方法、装置、终端设备及存储介质”的中国发明专利申请为基础,并要求其优先权。This patent application is based on the Chinese invention patent application filed on September 28, 2017, with the application number of 201710899347.8, entitled "Recommended method, device, terminal equipment and storage medium based on fund products", and requires priority.
技术领域Technical field
本申请涉及金融数据处理领域,尤其涉及一种基金产品推荐方法、装置、终端设备及存储介质。The present application relates to the field of financial data processing, and in particular, to a fund product recommendation method, device, terminal device and storage medium.
背景技术Background technique
基金投资是指投资者通过基金交易系统购买基金产品以实现对资产的管理和分配的一种理财手段。当前基金交易系统中依据投资风险的高低将基金产品依次划分成股票型、指数型、混合型、债券型和货币型等基金类型,用户根据基金类型和自身投资条件选取不同基金类型的基金产品进行交易。当前基金交易系统不具有基于用户的投资条件进行基金产品推荐的功能,使得投资者对自身投资条件和基金产品进行定位时,因定位准确性较低而影响投资基金的收益率。Fund investment refers to a kind of financial management means for investors to purchase fund products through the fund trading system to realize the management and distribution of assets. In the current fund trading system, according to the level of investment risk, the fund products are divided into fund types such as stock type, index type, hybrid type, bond type and currency type. Users select fund products of different fund types according to fund type and investment conditions. transaction. The current fund trading system does not have the function of recommending fund products based on the user's investment conditions, which makes investors' investment conditions and fund products to be positioned, which affects the return rate of investment funds due to low positioning accuracy.
发明内容Summary of the invention
本申请提供一种基金产品推荐方法、装置、终端设备及存储介质,以解决当前基金交易系统不具有基于用户的投资条件进行基金产品推荐的问题。The application provides a fund product recommendation method, device, terminal device and storage medium to solve the problem that the current fund transaction system does not have the fund product recommendation based on the user's investment condition.
第一方面,本申请提供一种基金产品推荐方法,包括:In a first aspect, the present application provides a fund product recommendation method, including:
获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
第二方面,本申请提供一种基金产品推荐装置,包括:In a second aspect, the application provides a fund product recommendation device, including:
当前用户画像数据获取模块,用于获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;a current user image data obtaining module, configured to acquire current user portrait data, where the current user portrait data includes at least one current feature data;
用户数据模型获取模块,用于获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;a user data model obtaining module, configured to acquire a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
目标聚类类簇确定模块,用于基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;a target clustering cluster determining module, configured to acquire, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
目标基金产品确定模块,用于基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product determining module is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
第三方面,本申请提供一种一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:In a third aspect, the present application provides a terminal device including a memory, a processor, and computer readable instructions stored in the memory and executable on the processor, the processor executing the computer The following steps are implemented when reading the instruction:
获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
第四方面,本申请提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如下步骤:In a fourth aspect, the present application provides a computer readable storage medium storing computer readable instructions that, when executed by a processor, implement the following steps:
获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
本申请所提供的的基金产品推荐方法、装置、设备及存储介质中,基于当前用户画像数据和用户数据类型,确定目标风险评估值,并利用目标风险评估值确定对应的目标基金产品,以使推荐给目标用户的目标基金产品对用户自身投资条件和基金产品进行准确定位,有助于目标用户提高投资收益率。In the fund product recommendation method, device, device and storage medium provided by the present application, the target risk assessment value is determined based on the current user portrait data and the user data type, and the target target fund product is determined by using the target risk assessment value, so that The target fund products recommended to the target users accurately locate the user's own investment conditions and fund products, which helps the target users to improve the investment return rate.
附图说明DRAWINGS
为了更清楚地说明本申请的技术方案,下面将对本申请的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the present application, the drawings to be used in the description of the present application will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present application. For ordinary technicians, other drawings can be obtained based on these drawings without paying for creative labor.
图1是实施例1中基金产品推荐方法的一流程图。1 is a flow chart of a fund product recommendation method in Embodiment 1.
图2是实施例1中基金产品推荐方法的另一流程图。2 is another flow chart of the fund product recommendation method in Embodiment 1.
图3是图2中步骤S50的一具体流程图。FIG. 3 is a specific flow chart of step S50 of FIG. 2.
图4是图1中步骤S30的一具体流程图。4 is a specific flow chart of step S30 of FIG. 1.
图5是图1中步骤S40的一具体流程图。FIG. 5 is a specific flowchart of step S40 in FIG. 1.
图6是实施例2中基金产品推荐装置的一原理框图。6 is a schematic block diagram of a fund product recommendation device in Embodiment 2.
图7是实施例4中终端设备的一示意图。Figure 7 is a schematic diagram of a terminal device in Embodiment 4.
具体实施方式Detailed ways
下面将结合本申请中的附图,对本申请中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the present application will be clearly and completely described in the following with reference to the drawings in the present application. It is obvious that the described embodiments are a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without departing from the inventive scope are the scope of the present application.
实施例1Example 1
图1示出本实施例中基金产品推荐方法的流程图。该基金产品推荐方法应用在基金交易系统中,用于根据用户的投资条件给用户推荐目标基金产品,以避免投资者对自身投资条件和基金产品定位准确性较低而影响投资基金的收益率。如图1所示,该基金产品推荐方法包括如下步骤:FIG. 1 shows a flow chart of a fund product recommendation method in this embodiment. The fund product recommendation method is applied in the fund trading system, and is used to recommend the target fund product to the user according to the investment condition of the user, so as to avoid the investor's low investment accuracy of the investment condition and the fund product, which affects the profit rate of the investment fund. As shown in Figure 1, the fund product recommendation method includes the following steps:
S10:获取当前用户画像数据,当前用户画像数据包括至少一个当前特征数据。S10: Acquire current user portrait data, and the current user portrait data includes at least one current feature data.
其中,当前用户画像数据是需要推荐基金产品的用户的用户画像数据。本实施例中,将需要推荐基金产品的用户简称为目标用户。用户画像是根据用户社会属性、生活习惯和消费行为等信息抽象出的一个标签化的用户模型。用户画像数据是构建该用户画像的数据。该当前用户画像数据可体现目标用户自身的投资条件。Among them, the current user portrait data is user portrait data of a user who needs to recommend a fund product. In this embodiment, the user who needs to recommend the fund product is simply referred to as the target user. User portraits are a tagged user model abstracted from information such as user social attributes, lifestyle habits, and consumer behavior. The user portrait data is data for constructing the user portrait. The current user portrait data can reflect the investment conditions of the target user.
当前特征数据是与目标用户自身投资条件相关的数据。该当前特征数据包括但不限于本实施例中的年龄、职业、收入、投资经验、投资比例、风险偏好和承受亏损值。即当前用户画像数据包括至少一个当前特征数据,具体是指包括年龄、职业、收入、投资经验、 投资比例、风险偏好和承受亏损值中的至少一个。The current feature data is data related to the target user's own investment conditions. The current feature data includes, but is not limited to, age, occupation, income, investment experience, investment ratio, risk preference, and bearish loss value in the present embodiment. That is, the current user portrait data includes at least one current feature data, specifically, at least one of age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value.
当前特征数据中的年龄是目标用户的年龄,目标用户的年龄与目标用户可能购买的基金产品的基金类型具有相关性。一般而言,目标用户的年龄越大,越偏向于购买风险较低的基金产品;反之,目标用户的年龄越小,越偏向于购买风险较高的基金产品。当前特征数据中的职业是目标用户的职业,目标用户的职业一般与目标用户的性格相关,性格不同的目标用户可能选择不同风险的基金产品;反之,性格相同的目标用户可能选择购买相同风险的基金产品。当前特征数据中的收入可以是目标用户的可支配收入,也可以是目标用户的纯收入,收入也可能影响目标用户选择基金产品的基金类型。当前特征数据中的投资经验是指目标用户在投资基金或者投资其他金融理财产品上的经验,一般投资经验越少,越偏向于购买风险较低的基金产品。当前特征数据中的投资比例是指目标用户购买基金产品占所有投资的比例,根据投资比例的高低,可确定该用户偏向于购买哪种风险的基金产品。当前特征数据中的风险偏好是目标用户的风险偏好,根据风险偏好推荐不同风险的基金产品。当前特征数据中的承受亏损值是目标用户可以承受亏损的值,可基于承受亏损值推荐风险不同的基金产品。The age in the current feature data is the age of the target user, and the age of the target user is related to the fund type of the fund product that the target user may purchase. Generally speaking, the older the target user is, the more biased it is to purchase a fund product with lower risk. Conversely, the younger the target user is, the more biased it is to purchase a fund product with higher risk. The occupation in the current feature data is the occupation of the target user. The occupation of the target user is generally related to the personality of the target user. The target users with different personalities may select fund products with different risks; otherwise, the target users with the same personality may choose to purchase the same risk. Fund products. The income in the current characteristic data may be the disposable income of the target user, or the net income of the target user, and the income may also affect the fund type of the target user selecting the fund product. The investment experience in the current characteristic data refers to the experience of the target user in investing in funds or investing in other financial wealth management products. The less general investment experience, the more biased it is to purchase fund products with lower risk. The proportion of investment in the current characteristic data refers to the ratio of the target user's purchase of the fund product to all the investment. According to the proportion of the investment, it can be determined which kind of fund product the user prefers to purchase. The risk preference in the current feature data is the risk preference of the target user, and the fund products of different risks are recommended according to the risk preference. The value of the loss in the current characteristic data is the value that the target user can bear the loss, and the fund product with different risk can be recommended based on the loss value.
S20:获取用户数据模型,用户数据模型包括至少两个聚类类簇,每一聚类类簇对应一风险评估值。S20: Acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
用户数据模型是基金交易系统预先基于训练用户画像数据训练后获取与投资基金的风险评估值关联的模型。其中,训练用户画像数据是训练用户的用户画像数据,该训练用户画像数据是用于训练用户数据模型的数据。该训练用户是预先在基金交易系统开户并进行过基金交易的用户。聚类类簇是采用聚类算法对训练用户画像数据进行聚类后,获取的相似训练用户画像数据的集合。每一聚类类簇对应一风险评估值,该风险评估值是任一聚类类簇中训练用户画像数据对应的训练用户在投资基金时可承受风险的评估值。可以理解地,风险评估值与基金产品的基金类型相关联,风险评估值越大的训练用户越偏向于投资风险高且收益高的基金产品,反之,风险评估值越小的训练用户越偏向于投资风险低且收益低的基金产品。The user data model is a model in which the fund trading system obtains a risk assessment value associated with an investment fund after training based on training user portrait data. The training user portrait data is user image data of the training user, and the training user portrait data is data for training the user data model. The training user is a user who has previously opened an account in the fund trading system and conducted a fund transaction. The clustering cluster is a collection of similar training user image data obtained by clustering the training user image data by using a clustering algorithm. Each cluster cluster corresponds to a risk assessment value, which is an evaluation value of the risk that the training user corresponding to the training user portrait data in any cluster cluster can bear the risk when investing in the fund. Understandably, the risk assessment value is related to the fund type of the fund product. The training user with the higher risk assessment value tends to invest in the fund product with high risk and high return. On the contrary, the less the risk assessment value, the more the training user is biased. Fund products with low investment risk and low returns.
在一具体实施方式中,如图2所示,该基金产品推荐方法还包括:In a specific embodiment, as shown in FIG. 2, the fund product recommendation method further includes:
S50:基于训练用户画像数据训练用户数据模型,训练用户画像数据包括至少一个训练特征数据。S50: Train the user data model based on the training user image data, and the training user image data includes at least one training feature data.
其中,训练特征数据是与训练用户自身投资条件相关的数据。该训练特征数据包括但不限于本实施例中的年龄、职业、收入、投资经验、投资比例、风险偏好和承受亏损值。 即当前用户画像数据包括至少一个训练特征数据,具体是指包括年龄、职业、收入、投资经验、投资比例、风险偏好和承受亏损值中的至少一个。具体地,基于训练用户画像数据训练用户数据模型是指对训练用户画像数据中至少一个训练特征数据进行聚类,将相似训练用户画像数据的集合作为一聚类类簇,以将所有训练用户画像数据划分成至少两个聚类类簇,并获取每一聚类类簇对应的风险评估值,即可形成用户数据模型。Among them, the training feature data is data related to training the user's own investment conditions. The training characteristic data includes, but is not limited to, age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value in the present embodiment. That is, the current user portrait data includes at least one training feature data, specifically, at least one of age, occupation, income, investment experience, investment ratio, risk preference, and loss tolerance value. Specifically, training the user data model based on the training user portrait data refers to clustering at least one training feature data in the training user portrait data, and using the set of similar training user portrait data as a cluster cluster to image all the training users. The data is divided into at least two cluster clusters, and the risk assessment values corresponding to each cluster cluster are obtained, thereby forming a user data model.
训练特征数据中的年龄是训练用户的年龄,训练用户的年龄与训练用户可能购买的基金产品的基金类型具有相关性。一般而言,训练用户的年龄越大,越偏向于购买风险较低的基金产品;反之,训练用户的年龄越小,越偏向于购买风险较高的基金产品。训练特征数据中的职业是训练用户的职业,训练用户的职业一般与训练用户的性格相关,性格不同的训练用户可能选择不同风险的基金产品;反之,性格相同的训练用户可能选择购买相同风险的基金产品。训练特征数据中的收入可以是训练用户的可支配收入,也可以是训练用户的纯收入,收入也可能影响训练用户选择基金产品的基金类型。训练特征数据中的投资经验是指训练用户在投资基金或者投资其他金融理财产品上的经验,一般投资经验越少,越偏向于购买风险较低的基金产品。训练特征数据中的投资比例是指训练用户购买基金产品占所有投资的比例,根据投资比例的高低,可确定该用户偏向于购买哪种风险的基金产品。训练特征数据中的风险偏好是训练用户的风险偏好,根据风险偏好推荐不同风险的基金产品。训练特征数据中的承受亏损值是训练用户可以承受亏损的值,可基于承受亏损值推荐风险不同的基金产品。The age in the training feature data is the age of the training user, and the age of the training user is related to the fund type of the fund product that the training user may purchase. Generally speaking, the older the training user is, the more he prefers to buy a fund product with lower risk. On the contrary, the younger the training user, the more he prefers to buy a higher risk fund product. The occupation in the training characteristic data is the occupation of the training user. The occupation of the training user is generally related to the personality of the training user. The training users with different personality may select the fund products with different risks; on the contrary, the training users with the same personality may choose to purchase the same risk. Fund products. The income in the training characteristic data may be the disposable income of the training user, or the net income of the training user, and the income may also affect the fund type of the training user to select the fund product. The investment experience in the training characteristic data refers to the experience of training users in investing in funds or investing in other financial wealth management products. The less general investment experience, the more biased it is to purchase fund products with lower risk. The proportion of investment in the training characteristic data refers to the ratio of training users to purchase fund products in all investments. According to the proportion of investment, it can determine which kind of risk fund products the user prefers to purchase. The risk preference in the training feature data is to train the user's risk preference, and recommend different risk fund products according to the risk preference. The value of the loss in the training characteristic data is the value that the training user can bear the loss, and the fund product with different risk can be recommended based on the loss value.
本实施例中,如图3所示,步骤S50中,基于训练用户画像数据训练用户数据模型,具体包括如下步骤:In this embodiment, as shown in FIG. 3, in step S50, the user data model is trained based on the training user image data, and specifically includes the following steps:
S51:对训练用户画像数据中的至少一个训练特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据。S51: Perform normalization processing on at least one training feature data in the training user portrait data, so that the training user image data includes at least one standard feature data.
其中,数据标准化(normalization)是将数据按比例缩放,使之落入一个小的特定区间,用于去除数据的单位限制,将其转化为无量级的纯数值,便于不同单位或量级的指标能够进行比较和加权。具体地,在对训练用户画像数据中的至少一个训练特征数据进行标准化处理时,可采用min-max标准化(Min-max normalization)进行转换处理,获取至少一个标准特征数据。其中,min-max标准化(Min-max normalization)也称为离差标准化,是指采用转换函数对原始数据进行线性变换,使结果落到预设区间的过程,其中,转换函数
Figure PCTCN2018074570-appb-000001
min为样本数据的最小值,max为样本数据的最大值,N为预设区间的区间大小。可以理解地,若N为1,则采用min-max标准化处理后的结果落在[0,1]这个 区间范围内;若N为10,则采用min-max标准化处理后的结果落在[0,10]这个区间范围内。
Among them, data normalization is to scale the data to a small specific interval, to remove the unit limit of the data, and convert it into a pure value of the infinite level, which is convenient for different units or magnitude indicators. Ability to compare and weight. Specifically, when normalizing at least one training feature data in the training user portrait data, a conversion process may be performed using min-max normalization to acquire at least one standard feature data. Among them, min-max normalization is also called deviation normalization, which refers to the process of linearly transforming the original data by using a conversion function to make the result fall into a preset interval, wherein the conversion function
Figure PCTCN2018074570-appb-000001
Min is the minimum value of the sample data, max is the maximum value of the sample data, and N is the interval size of the preset interval. It can be understood that if N is 1, the result after the min-max normalization process falls within the range of [0, 1]; if N is 10, the result after the min-max normalization process falls in [0] , 10] within this range.
S52:采用K-means聚类算法对训练用户画像数据中至少一个标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据。S52: K-means clustering algorithm is used to cluster at least one standard feature data in the training user image data, and at least two clustering clusters are obtained, and each clustering cluster corresponds to a centroid user image data.
其中,K-means聚类算法是一种基于距离评估相似度的聚类算法,即两个对象的距离越近,其相似度越大的聚类算法。K-means聚类算法通过计算两个对象的欧氏距离,根据欧氏距离的大小评价两个对象的相似性。欧氏距离(euclidean metric,又称欧几里得度量)是指在m维空间中两个点之间的真实距离,或者向量的自然长度(即该点到原点的距离)。任意两个n维向量a(X i1,X i2,...,X in)与b(X j1,X j2,...,X jn)的欧氏距离
Figure PCTCN2018074570-appb-000002
Among them, K-means clustering algorithm is a clustering algorithm based on distance evaluation similarity, that is, the closer the distance between two objects, the larger the similarity is. The K-means clustering algorithm evaluates the similarity of two objects according to the Euclidean distance by calculating the Euclidean distance of two objects. Euclidean metric (also known as Euclidean metric) refers to the true distance between two points in m-dimensional space, or the natural length of the vector (ie, the distance from the point to the origin). Euclidean distance of any two n-dimensional vectors a(X i1 ,X i2 ,...,X in ) and b(X j1 ,X j2 ,...,X jn )
Figure PCTCN2018074570-appb-000002
训练用户画像数据是用于训练用户数据模型的训练用户的用户画像数据,训练用户画像数据包括至少一个训练特征数据。步骤S51将训练用户画像数据中的每一训练特征数据转换成标准特征数据,使得步骤S52中需采用K-means聚类算法对训练用户画像数据中至少一个标准特征数据进行聚类,以获取至少两个聚类类簇,每一聚类类簇中包括多个训练用户对应的训练用户画像数据。任一聚类类簇中,多个训练用户的训练用户画像数据中存在一个质心用户对应的质心用户画像数据,使其他训练用户画像数据到质心用户画像数据的距离之和最小。可以理解地,质心用户画像数据是任一聚类类簇中所有训练用户画像数据中的一个,因此,该质心用户画像数据也包括至少一个由训练特征数据转换来的标准特征数据。以下结合具体示例对步骤S52进行详细说明。The training user portrait data is user portrait data of a training user for training a user data model, and the training user portrait data includes at least one training feature data. Step S51: Convert each training feature data in the training user image data into standard feature data, so that in step S52, the K-means clustering algorithm is used to cluster at least one standard feature data in the training user image data to obtain at least Two clustering clusters, each of which includes training user portrait data corresponding to a plurality of training users. In any of the clustering clusters, there is a centroid user image data corresponding to the centroid user in the training user image data of the plurality of training users, so that the sum of the distances of the other training user image data to the centroid user portrait data is minimized. It can be understood that the centroid user portrait data is one of all training user portrait data in any cluster cluster, and therefore, the centroid user portrait data also includes at least one standard feature data converted from the training feature data. Step S52 will be described in detail below with reference to specific examples.
首先,获取m个训练用户的训练用户画像数据,对训练用户画像数据中至少一个训练特征数据进行标准化处理,形成至少一个标准特征数据,获取用户数据矩阵R(如下表所示)。该用户数据矩阵R为m*n矩阵,m为训练用户的数量,n为标准特征数据的数量。在对年龄这一训练特征数据进行标准化处理时,若年龄的最小值为18,最大值为88,N的大小为10,若一训练用户的年龄为48,则采用转换函数获取的标准特征数据为5。在对职业这一训练特征数据进行标准化处理时,可预先设置职业与标准特征数据的对照表,基于训练用户画像数据中的职业这一训练特征数据查询该对照表,即可获取相应的标准特征数据。其中,该对照表中可依职业列举不同的标准化分值,每一标准化分值对应不同企业的不同职业,以便基于训练用户画像数据中的职业这一训练特征数据查询相应的标准化分值,作为对应的标准特征数据。同理,采用转换函数或对照表对其他特征数据进行标准化处理。First, the training user image data of the m training users is acquired, and at least one training feature data in the training user image data is normalized to form at least one standard feature data, and the user data matrix R is obtained (as shown in the following table). The user data matrix R is an m*n matrix, m is the number of training users, and n is the number of standard feature data. When the training characteristic data of the age is normalized, if the minimum value of the age is 18, the maximum value is 88, and the size of N is 10, if the age of a training user is 48, the standard characteristic data acquired by the conversion function is used. Is 5. When standardizing the training characteristic data of the occupation, the comparison table of the occupational and standard characteristic data may be preset, and the comparison table may be obtained based on the training characteristic data of the occupation in the training user portrait data, and the corresponding standard feature may be obtained. data. Wherein, in the comparison table, different standardized scores may be enumerated according to occupations, and each standardized score corresponds to different occupations of different enterprises, so as to query corresponding standardized scores based on the training characteristic data of the occupation in the training user portrait data, as Corresponding standard feature data. Similarly, other feature data is standardized by using a conversion function or a comparison table.
  U1U1 U2U2 U3U3 U4U4 U5U5 U6U6 ……...... UmUm
年龄age 55 33 22 33 11 44 ……...... 77
职业Career 55 55 33 44 22 66 ……...... 66
收入income 66 44 11 66 44 77 ……...... 55
投资经验Investment experience 88 66 22 77 44 77 ……...... 66
投资比例Investment ratio 44 66 33 99 33 55 ……...... 44
风险偏好Risk preference 66 44 66 77 22 88 ……...... 77
承受亏损值Bear the loss 66 55 33 77 22 55 ……...... 88
……...... ……...... ……...... ……...... ……...... ……...... ……...... ……...... ……......
然后,对用户数据矩阵R中标准特征数据的值采用K-means聚类算法进行聚类。采用K-means聚类算法进行聚类过程如下:步骤(1),建立n维图,根据用户数据矩阵R中每个训练用户Ui的标准特征数据的值绘制出n维图中m个数据点Ui,其中,i∈m,每一数据点Ui对应一训练用户。步骤(2),预定义K值,根据K值可将m个数据点划分成K个数据集G=[G1,G2,G3,G4,…Gj…,Gk],其中,K≥2,j∈k。步骤(3),在每个数据集Gj中随机选择一个数据点Ui作为质心Ci,使得所有数据集中存在K个质心Ci。步骤(4),计算每一数据集Gj中任一数据点Ui与K个质心Gi的欧氏距离Di,将数据点Ui归入欧氏距离Di最小的一个数据集Gj中。步骤(5),使所有数据点Ui执行步骤(4),形成新的数据集G。重复步骤(3)-(5),使得任一数据集Gj中新的质心Ci与旧的质心Ci小于预设的阈值时,K-means聚类算法终止,形成K个聚类类簇,每个聚类类簇有一个质心用户,该质心用户对应质心用户画像数据。Then, the values of the standard feature data in the user data matrix R are clustered by a K-means clustering algorithm. The K-means clustering algorithm is used to perform the clustering process as follows: Step (1), an n-dimensional map is established, and m data points in the n-dimensional graph are drawn according to the values of the standard feature data of each training user Ui in the user data matrix R. Ui, where i∈m, each data point Ui corresponds to a training user. Step (2), pre-defining the K value, according to the K value, the m data points can be divided into K data sets G=[G1, G2, G3, G4, ... Gj..., Gk], where K≥2,j ∈k. In step (3), one data point Ui is randomly selected in each data set Gj as the centroid Ci such that there are K centroids Ci in all data sets. In step (4), the Euclidean distance Di of any of the data points Ui and the K centroids Gi in each data set Gj is calculated, and the data points Ui are classified into a data set Gj having the smallest Euclidean distance Di. In step (5), all data points Ui are executed in step (4) to form a new data set G. Repeat steps (3)-(5) such that when the new centroid Ci and the old centroid Ci in any data set Gj are less than a preset threshold, the K-means clustering algorithm terminates, forming K cluster clusters, each The cluster cluster has a centroid user, and the centroid user corresponds to the centroid user portrait data.
S53:采用加权运算算法对质心用户画像数据进行加权处理,确定质心用户画像数据对应一风险评估值。S53: weighting the centroid user image data by using a weighting algorithm to determine that the centroid user image data corresponds to a risk assessment value.
其中,加权运算算法为
Figure PCTCN2018074570-appb-000003
其中,
Figure PCTCN2018074570-appb-000004
Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重。每一标准特征数据的权重可以是预先采用多变量线性回归模型对各训练用户画像数据的标准特征数据进行统计处理后获取的值,以便在需进行风险评估时直接调用。该多变量线性回归模型为h θ(x)=θ 01x 12x 2+...+θ nx n,其中,h θ(x)为假设函数,各个θ为输入值间的夹角向量,各个x为对应的特征,在上式中加入x 0令x 0=1,则有 h θ(x)=θ 0x 01x 12x 2+...+θ nx n=θ TX。其中,θ是行向量,行向量里包含了线性回归模型中的参数,X是样本特征矩阵。
Wherein, the weighting algorithm is
Figure PCTCN2018074570-appb-000003
among them,
Figure PCTCN2018074570-appb-000004
Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data. The weight of each standard feature data may be a value obtained by statistically processing the standard feature data of each training user's portrait data in advance using a multivariate linear regression model, so as to be directly called when a risk assessment is required. The multivariate linear regression model is h θ (x)=θ 01 x 12 x 2 +...+θ n x n , where h θ (x) is a hypothesis function and each θ is an input The angle vector between values, each x is a corresponding feature. When x 0 is added to the above formula, x 0 =1, then h θ (x)=θ 0 x 01 x 12 x 2 + ... + θ n x n = θ T X. Where θ is the row vector, the row vector contains the parameters in the linear regression model, and X is the sample feature matrix.
S54:基于聚类类簇和风险评估值,获取用户数据模型。S54: Acquire a user data model based on the clustering cluster and the risk assessment value.
本实施例中,由于采用K-means聚类算法将用户数据矩阵R中所有训练用户画像数据划分成K个聚类类簇,每个聚类类簇的质心用户画像数据与同一聚类类簇中其他训练用户画像数据相似,可将质心用户画像数据对应的风险评估值作为该聚类类簇中其他训练用户画像数据的风险评估值,从而确定用户数据模型。In this embodiment, since the K-means clustering algorithm is used to divide all training user image data in the user data matrix R into K cluster clusters, the centroid user image data of each cluster cluster and the same cluster cluster The other training user image data is similar, and the risk evaluation value corresponding to the centroid user portrait data can be used as the risk evaluation value of other training user portrait data in the clustering cluster to determine the user data model.
S60:将用户数据模型存储在数据库中。S60: Store the user data model in a database.
本实施例中,将步骤S50中训练出的用户数据模型存储在MySQL、Oracle或其他数据库中,以便于在需要给目标用户推荐基金产品时,从数据库中调用该预先训练好的用户数据模型。In this embodiment, the user data model trained in step S50 is stored in MySQL, Oracle or other database, so that the pre-trained user data model is called from the database when the fund product needs to be recommended to the target user.
在该具体实施方式中,步骤S20包括:从数据库中获取用户数据模型。由于用户数据模型预先训练好并存储在数据库中,在需要使得用户数据模型进行基金产品推荐时,可直接从数据库中调用该用户数据模型,即可进行相应推荐处理,操作过程简单快捷。In this embodiment, step S20 includes: obtaining a user data model from a database. Since the user data model is pre-trained and stored in the database, when the user data model needs to be recommended for the fund product, the user data model can be directly called from the database, and the corresponding recommendation processing can be performed, and the operation process is simple and fast.
S30:基于当前用户画像数据,从至少两个聚类类簇中获取与至少一个当前特征数据相对应的目标聚类类簇。S30: Acquire, according to the current user portrait data, a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters.
其中,目标聚类类簇是指与当前用户画像数据最相近的质心用户画像数据所在的聚类类簇。目标聚类类簇具体是指与至少一个当前特征数据所形成的当前用户画像数据距离最近的质心用户画像数据对应的聚类类簇。The target clustering cluster refers to the cluster cluster in which the centroid user portrait data closest to the current user image data is located. The target clustering cluster specifically refers to a clustering cluster corresponding to the centroid user portrait data closest to the current user portrait data formed by the at least one current feature data.
在一具体实施方式中,如图4所示,步骤S30具体包括如下步骤:In a specific implementation, as shown in FIG. 4, step S30 specifically includes the following steps:
S31:将当前用户画像数据分别与用户数据模型中至少两个聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离。S31: Calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
本实施例中,用户数据模型中存储有K个聚类类簇,每一聚类类簇对应一质心用户画像数据,若设当前用户画像数据为n维向量a(X i1,X i2,...,X in),任一聚类类簇的质心用户画像数据为n维向量b(X j1,X j2,...,X jn),则当前用户画像数据与质心用户画像数据的欧氏距离
Figure PCTCN2018074570-appb-000005
其中,向量a的维数n与当前用户画像数据中当前特征数据的数量相对应;相应地,向量b的维数n与质心用户画像数据中训练特征数据的数量相对应。
In this embodiment, K cluster clusters are stored in the user data model, and each cluster cluster corresponds to a centroid user image data. If the current user image data is an n-dimensional vector a (X i1 , X i2 ,. .., X in ), the centroid user image data of any cluster cluster is n-dimensional vector b (X j1 , X j2 ,..., X jn ), then the current user image data and the centroid user image data are Distance
Figure PCTCN2018074570-appb-000005
Wherein, the dimension n of the vector a corresponds to the number of current feature data in the current user portrait data; accordingly, the dimension n of the vector b corresponds to the number of training feature data in the centroid user portrait data.
可以理解地,在步骤S31之前,需对当前用户画像数据中至少一个当前特征数据进行 标准化处理,以使当前用户画像数据中包括至少一个标准特征数据,以便于计算当前用户画像数据与质心用户画像数据之间的欧氏距离,有利于简化计算过程,提高计算效率。其中,对当前用户画像数据中至少一个当前训练数据进行标准化处理过程与对训练用户画像数据中至少一个训练特征数据进行标准化处理过程相同,为避免重复,在此不一一赘述。It can be understood that, before step S31, at least one current feature data in the current user image data needs to be normalized to include at least one standard feature data in the current user image data, so as to calculate current user image data and a centroid user image. The Euclidean distance between the data helps to simplify the calculation process and improve the calculation efficiency. The process of normalizing at least one current training data in the current user image data is the same as the process of normalizing at least one training feature data in the training user image data. To avoid repetition, details are not described herein.
S32:选取至少两个欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个当前特征数据相对应的目标聚类类簇。S32: Select a cluster cluster of the centroid user image data corresponding to the minimum value of the at least two Euclidean distances as the target cluster cluster corresponding to the at least one current feature data.
由于步骤S31中获取K个当前用户画像数据与质心用户画像数据的欧氏距离D a,b,从K个欧氏距离D a,b中选取最小值对应的质心用户画像数据所在的聚类类簇,作为与当前用户画像数据中至少一个当前特征数据相对应的目标聚类类簇。 Since the Euclidean distance D a,b of the K current user portrait data and the centroid user portrait data is obtained in step S31, the cluster class of the centroid user portrait data corresponding to the minimum value is selected from the K Euclidean distances D a,b . The cluster is a target cluster class cluster corresponding to at least one current feature data in the current user portrait data.
S40:基于目标聚类类簇对应的风险评估值,确定目标基金产品。S40: Determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
本实施例中,将目标聚类类簇对应的风险评估值定义为目标风险评估值。于步骤S52中,对每一聚类类簇的质心用户画像数据采用加权运算算法进行加权处理,确定与质心用户画像数据相对应的风险评估值,并将该风险评估值作为相应聚类类簇的风险评估值。本实施例中,目标聚类类簇是至少两个聚类类簇中的一个,可将该目标聚类类簇对应的风险评估值作为目标风险评估值In this embodiment, the risk assessment value corresponding to the target cluster cluster is defined as the target risk assessment value. In step S52, the centroid user image data of each cluster cluster is weighted by a weighting algorithm to determine a risk assessment value corresponding to the centroid user portrait data, and the risk assessment value is used as a corresponding cluster cluster. Risk assessment value. In this embodiment, the target clustering cluster is one of at least two clustering clusters, and the risk assessment value corresponding to the target clustering cluster may be used as the target risk assessment value.
其中,目标基金产品是基金交易系统基于获取到的目标风险评估值推荐给目标用户的基金产品。该目标基金产品与目标用户的当前用户画像数据关联,以使目标基金产品符合目标用户的投资条件;且该目标基金产品基于目标风险评估值进行推荐,使得其与基金产品的基金类型相关,对基金产品的定位较为准确。本实施例中,基金交易系统在获取到目标基金产品后,通过智能手机、平板电脑等终端显示目标基金产品,以使目标用户了解目标基金产品,并基于目标基金产品进行基金交易,以辅助用户对自身投资条件和基金产品的定位有更准确的认识,有助于目标用户提高投资收益率。Among them, the target fund product is a fund product recommended by the fund trading system based on the obtained target risk assessment value to the target user. The target fund product is associated with the current user portrait data of the target user, so that the target fund product meets the investment condition of the target user; and the target fund product is recommended based on the target risk assessment value, so that it is related to the fund type of the fund product, The positioning of fund products is more accurate. In this embodiment, after obtaining the target fund product, the fund trading system displays the target fund product through a terminal such as a smart phone or a tablet computer, so that the target user understands the target fund product, and performs fund trading based on the target fund product to assist the user. Have a more accurate understanding of their own investment conditions and the positioning of fund products, which will help target users to improve their investment returns.
在一具体实施方式中,如图5所示,步骤S40具体包括如下步骤:In a specific implementation, as shown in FIG. 5, step S40 specifically includes the following steps:
S41:基于目标聚类类簇对应的风险评估值,确定对应的目标基金类型。S41: Determine a corresponding target fund type based on the risk assessment value corresponding to the target clustering cluster.
其中,基金类型是依据基金风险高低进行划分的类型。本实施例中,基金类型包括但不限于股票型、指数型、混合型、债券型和货币型。在基金交易系统中,预先设置基金风险与基金类型的对应关系。若设基金风险的区间为(0,10),将基金风险的区间划(0,2],(2,4],(4,6],(6,8]和(8,10)分成五个风险等级,分别对应货币型、债券型、混合型、指数型和股票型这五个基金类型,并使基金交易系统的基金产品唯一对应一种基金类型。本实施例中,可通过标准化处理,使得获取的目标风险评估值的范围在区间(0, 10)内,以便根据目标风险评估值直接确定对应基金类型。Among them, the type of fund is based on the type of risk of the fund. In this embodiment, the fund types include, but are not limited to, stock type, index type, hybrid type, bond type, and currency type. In the fund trading system, the corresponding relationship between fund risk and fund type is preset. If the risk range of the fund is (0, 10), divide the risk risk interval (0, 2), (2, 4), (4, 6], (6, 8) and (8, 10) into five. The risk levels correspond to the five fund types of currency type, bond type, hybrid type, index type and stock type, and make the fund products of the fund trading system uniquely correspond to one type of fund. In this embodiment, it can be standardized. Therefore, the obtained target risk assessment value is within the interval (0, 10), so that the corresponding fund type is directly determined according to the target risk assessment value.
S42:根据基金类型,获取与基金类型相对应的待荐基金产品和基金评价指标。S42: According to the fund type, obtain the fund products and fund evaluation indicators corresponding to the fund type.
其中,待荐基金产品是基金交易系统中基金类型为目标基金类型的所有基金产品。基金评价指标是用于评价基金产品好坏的指标,该基金评价指标与基金产品的投资收益和/或投资风险关联。具体地,基金评价指标与基金类型关联,可基于目标基金类型确定用于评价相应基金产品的基金评价指标。Among them, the fund products to be recommended are all fund products of the fund trading system whose fund type is the target fund type. The fund evaluation index is used to evaluate the quality of the fund product. The fund evaluation index is related to the investment income and/or investment risk of the fund product. Specifically, the fund evaluation index is associated with the fund type, and the fund evaluation index for evaluating the corresponding fund product may be determined based on the target fund type.
具体地,基金评价指标包括但不限于本实施例中的平均回报率、阿尔法系数、标准差、贝塔系数、晨星风险系数、夏普比率和R平方等。其中,平均回报率是与收益相关的指标,用于评价投资回报的指标,平均回报率越大越好。阿尔法系数是与收益相关的指标,是一种相对指数,阿尔法系数越大说明其基金获得超额收益的能力越大。标准差是与风险相关的指标,反映基金回收率的波动幅度,标准差越小越好,具体是指基金每个月的收益率相对于平均月收益率的偏差幅度的大小。基金的每月收益波动越大,相对应的标准差也就越大。贝塔系数是与风险相关的指标,用于衡量价格波动情况,即用以评估某只股票或某只股票型基金相对于整个市场的波动情况。在牛市或上升阶段贝塔系数越大越好,而熊市或下跌阶段贝塔系数越小越好。晨星风险系数是与风险相关的指标,用于计算一定期间相对同类基金,收益向下浮动的风险,晨星风险指标越大,向下浮动的风险越大,因此,晨星风险系数越小越好。夏普比率是与收益和风险都相关的指标,是基金绩效评价标准化指标,夏普比率越高越好。R平方是与收益和风险都相关的指标,用于反映业绩变化情况,R平方是衡量一只基金业绩变化在多大程度上可以由基准指数的变动来解释,以0至100计,越接近100阿尔法系数和贝塔系数越可靠。Specifically, the fund evaluation index includes, but is not limited to, an average return rate, an alpha coefficient, a standard deviation, a beta coefficient, a morning star risk coefficient, a Sharpe ratio, and an R square in the present embodiment. Among them, the average rate of return is an indicator related to income, which is used to evaluate the indicator of return on investment. The larger the average rate of return, the better. The alpha coefficient is an indicator related to income and is a relative index. The larger the alpha coefficient, the greater the ability of the fund to obtain excess returns. The standard deviation is a risk-related indicator, reflecting the fluctuation range of the fund recovery rate. The smaller the standard deviation, the better. Specifically, it refers to the deviation of the fund's monthly rate of return relative to the average monthly rate of return. The greater the monthly income fluctuation of the fund, the greater the corresponding standard deviation. The beta coefficient is a risk-related indicator used to measure price volatility, which is used to assess the volatility of a stock or a stock fund relative to the entire market. The bigger the beta coefficient is, the better the bull market or the rising phase, and the smaller the beta coefficient in the bear market or the down phase, the better. The Morningstar risk factor is a risk-related indicator used to calculate the risk of a downward float relative to a similar fund over a certain period of time. The greater the Morningstar risk indicator, the greater the risk of downward fluctuation. Therefore, the Morningstar risk coefficient is as small as possible. The Sharpe ratio is an indicator related to both income and risk, and is a standardized indicator of fund performance evaluation. The higher the Sharpe ratio, the better. R-square is a measure related to both income and risk, which is used to reflect the change of performance. R-square is a measure of the extent to which a change in the performance of a fund can be explained by the change of the benchmark index, from 0 to 100, the closer to 100 The more reliable the alpha and beta coefficients are.
本实施例中,对于依据不同风险等级的基金类型可依据不同的基金评价指标进行推荐。如对于货币型和债券型这两种基金类型的基金产品可基于平均回报率和阿尔法系数这两种基于收益的基金评价指标进行推荐;对于混合型这一种基金类型的基金产品可基于夏普比率和R平方这两种基于收益和风险的基金评价指标进行推荐;对于指数型和股票型这两种基金类型的基金产品可基于标准差、贝塔系数和晨星风险系数这有三种基于风险的基金评价指标进行推荐。In this embodiment, the fund types according to different risk levels may be recommended according to different fund evaluation indicators. For example, the fund products of the two types of funds, such as currency type and bond type, can be recommended based on two kinds of income-based fund evaluation indicators: average return rate and alpha coefficient; for the hybrid fund type, the fund product can be based on the Sharpe ratio. And R-square, two fund-based evaluation indicators based on income and risk; for fund products of both index type and stock type, there are three risk-based fund evaluations based on standard deviation, beta coefficient and morningstar risk coefficient. Indicators are recommended.
S43:依据基金评价指标,采用快速排序算法对待荐基金产品进行排序,确定目标基金产品。S43: According to the fund evaluation index, the quick-sorting algorithm is used to sort the recommended fund products, and the target fund products are determined.
其中,快速排序算法的基本思想是通过一趟排序将要排序的数据分割成独立的两部分,其中一部分的所有数据都比另外一部分的所有数据都要小,然后再按此方法对这两部 分数据分别进行快速排序,整个排序过程可以递归进行,以此达到整个数据变成有序序列。快速排序算法是基于关键字比较的内部排序算法中速度最快的一种算法,算法效率高。Among them, the basic idea of the quick sort algorithm is to divide the data to be sorted into two independent parts by one sorting, in which all the data of one part is smaller than all the data of the other part, and then the two parts of data are used according to this method. Quick sorting is performed separately, and the entire sorting process can be performed recursively, so that the entire data becomes an ordered sequence. The fast sorting algorithm is the fastest algorithm in the internal sorting algorithm based on keyword comparison, and the algorithm is highly efficient.
本实施例中,依据基金评价指标,采用快速排序算法对基金交易系统获取的待荐基金产品进行排序,通过待荐基金产品对应的基金评价指标的排序结果来获取目标基金产品,并通过智能手机、平板电脑等终端的显示界面显示该目标基金产品。例如取基金评价指标为平均回报率,采用快速排序算法对待荐基金产品的平均回报率进行排序,可以得到平均回报率的排序结果,该排序结果中排序最高的平均回报率对应的待荐基金产品目标基金产品。可以理解地,目标用户通过智能手机、平板电脑等终端查看到的目标基金产品是依据基金评估指标进行排序的,以使目标用户可了解与用户自身的投资条件相匹配的目标基金产品,可有利于提高投资者购买基金产品的准确率,降低基金购买的风险性。In this embodiment, according to the fund evaluation index, the quick-sorting algorithm is used to sort the fund products to be recommended by the fund trading system, and the target fund products are obtained through the sorting result of the fund evaluation indexes corresponding to the fund products to be recommended, and the smart phone is obtained through the smart phone. The display interface of the terminal such as a tablet computer displays the target fund product. For example, taking the fund evaluation index as the average rate of return, using the quick sort algorithm to rank the average rate of return of the recommended fund products, the sorting result of the average rate of return can be obtained, and the highest rate of return among the ranked results corresponds to the fund to be recommended. Target fund products. Understandably, the target fund products viewed by the target users through terminals such as smart phones and tablet computers are sorted according to the fund evaluation indicators, so that the target users can understand the target fund products that match the investment conditions of the users, and may have It will help improve the accuracy of investors' purchase of fund products and reduce the risk of fund purchases.
本实施例中,步骤S10和步骤S20之前,该基金产品推荐方法还包括获取产品推荐指令,以使步骤S10中基于产品推荐指令获取当前用户画像数据,步骤S20中基于产品推荐指令调用训练好的用户数据模型。在基金交易系统中,可在显示界面中显示“基金推荐”按钮,用户可点击该“基金推荐”按钮即可命名基金交易系统获取产品推荐指令。或者,在基金交易系统中,用户可预设设置,在用户采用预先注册的登录帐号登陆基金交易系统时,触发基金交易系统即可获取产品推荐指令,以使其显示界面显示目标基金产品。In this embodiment, before the step S10 and the step S20, the fund product recommendation method further includes: obtaining a product recommendation instruction, so that the current user portrait data is acquired based on the product recommendation instruction in step S10, and the training is invoked based on the product recommendation instruction in step S20. User data model. In the fund trading system, the “fund recommendation” button may be displayed in the display interface, and the user may click the “fund recommendation” button to name the fund trading system to obtain the product recommendation instruction. Alternatively, in the fund trading system, the user may preset settings, and when the user logs into the fund trading system by using the pre-registered login account, the fund trading system may be triggered to obtain the product recommendation instruction, so that the display interface displays the target fund product.
本实施例所提供的基金产品推荐方法中,基于当前用户画像数据和用户数据类型,确定目标风险评估值,并利用目标风险评估值确定对应的目标基金产品,以使推荐给目标用户的目标基金产品对用户自身投资条件和基金产品进行准确定位,有助于目标用户提高投资收益率。In the fund product recommendation method provided by the embodiment, the target risk assessment value is determined based on the current user portrait data and the user data type, and the target target fund product is determined by using the target risk assessment value, so that the target fund recommended to the target user is obtained. The product accurately locates the user's own investment conditions and fund products, which helps the target users to improve the investment return rate.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请的实施过程构成任何限定。It should be understood that the size of the serial number of each step in the above embodiments does not mean the order of execution order, and the order of execution of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the present application.
实施例2Example 2
图6示出与实施例1中基金产品推荐方法一一对应的基金产品推荐装置的原理框图。如图6所示,该基金产品推荐装置装置包括当前用户画像数据获取模块10、用户数据模型获取模块20、目标聚类类簇确定模块30和目标基金产品确定模块40。其中,当前用户画像数据获取模块10、用户数据模型调用模块20、目标聚类类簇确定模块30和目标基金产品确定模块40的实现功能与实施例1中基金产品推荐方法对应的步骤一一对应,为避免赘述,本实施例不一一详述。Fig. 6 is a block diagram showing the principle of the fund product recommendation device corresponding to the fund product recommendation method in the first embodiment. As shown in FIG. 6, the fund product recommendation device includes a current user portrait data acquisition module 10, a user data model acquisition module 20, a target cluster cluster determination module 30, and a target fund product determination module 40. The implementation functions of the current user portrait data obtaining module 10, the user data model invoking module 20, the target clustering cluster determining module 30, and the target fund product determining module 40 correspond to the steps corresponding to the fund product recommendation method in the first embodiment. In order to avoid redundancy, the present embodiment will not be described in detail.
当前用户画像数据获取模块10,用于获取当前用户画像数据,当前用户画像数据包 括至少一个当前特征数据。The current user portrait data obtaining module 10 is configured to acquire current user portrait data, and the current user portrait data includes at least one current feature data.
用户数据模型获取模块20,用于获取用户数据模型,用户数据模型包括至少两个聚类类簇,每一聚类类簇对应一风险评估值。The user data model obtaining module 20 is configured to acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
目标聚类类簇确定模块30,用于基于当前用户画像数据,从至少两个聚类类簇中获取与至少一个当前特征数据相对应的目标聚类类簇。The target clustering cluster determining module 30 is configured to acquire a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters based on the current user image data.
目标基金产品确定模块40,用于基于目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product determining module 40 is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
优选地,基金产品推荐装置还包括用户数据模型训练模块50和用户数据模型存储模块60。Preferably, the fund product recommendation device further includes a user data model training module 50 and a user data model storage module 60.
用户数据模型训练模块50,用于基于训练用户画像数据训练用户数据模型,训练用户画像数据包括至少一个训练特征数据。The user data model training module 50 is configured to train the user data model based on the training user image data, and the training user image data includes at least one training feature data.
用户数据模型存储模块60,用于将用户数据模型存储在数据库中。The user data model storage module 60 is configured to store the user data model in a database.
用户数据模型获取模块20,用于从数据库中获取用户数据模型。The user data model obtaining module 20 is configured to obtain a user data model from a database.
优选地,用户数据模型训练模块50包括标准化处理单元51、聚类类簇获取单元52、风险评估值获取单元53和数据模型获取单元54。Preferably, the user data model training module 50 includes a normalization processing unit 51, a cluster cluster acquisition unit 52, a risk evaluation value acquisition unit 53, and a data model acquisition unit 54.
标准化处理单元51,用于对训练用户画像数据中的至少一个标准特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;a normalization processing unit 51, configured to perform normalization processing on at least one standard feature data in the training user image data, so that the training user image data includes at least one standard feature data;
聚类类簇获取单元52,用于采用K-means聚类算法对训练用户画像数据中至少一个标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据。The clustering cluster acquiring unit 52 is configured to cluster at least one standard feature data in the training user portrait data by using a K-means clustering algorithm, and acquire at least two clustering clusters, each cluster cluster corresponding to a centroid User portrait data.
风险评估值获取单元53,用于采用加权运算算法对质心用户画像数据进行加权处理,确定质心用户画像数据对应一风险评估值;加权运算算法为
Figure PCTCN2018074570-appb-000006
其中,
Figure PCTCN2018074570-appb-000007
Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重。
The risk evaluation value obtaining unit 53 is configured to perform weighting processing on the centroid user portrait data by using a weighting operation algorithm, and determine that the centroid user portrait data corresponds to a risk evaluation value; the weighting operation algorithm is
Figure PCTCN2018074570-appb-000006
among them,
Figure PCTCN2018074570-appb-000007
Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data.
数据模型获取单元54,用于基于聚类类簇和风险评估值,获取用户数据模型。The data model obtaining unit 54 is configured to acquire a user data model based on the clustering cluster and the risk assessment value.
优选地,目标聚类类簇确定模块30包括欧氏距离获取单元31和目标聚类类簇选取单元32。Preferably, the target clustering class cluster determining module 30 includes an Euclidean distance acquiring unit 31 and a target clustering cluster selecting unit 32.
欧氏距离获取单元31,用于将当前用户画像数据分别与用户数据模型中至少两个聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离。The Euclidean distance obtaining unit 31 is configured to calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
目标聚类类簇选取单元32,用于选取至少两个欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个当前特征数据相对应的目标聚类类簇。The target clustering cluster selection unit 32 is configured to select a cluster cluster in which the centroid user portrait data corresponding to the minimum value of the at least two Euclidean distances is located as the target cluster cluster corresponding to the at least one current feature data.
优选地,目标基金产品确定模块40包括基金类型确定单元41、评估指标获取单元42和目标基金产品确定单元43。Preferably, the target fund product determining module 40 includes a fund type determining unit 41, an evaluation index obtaining unit 42, and a target fund product determining unit 43.
基金类型确定单元41,用于基于目标风险评估值,确定与目标风险评估值对应的目标基金类型。The fund type determining unit 41 is configured to determine a target fund type corresponding to the target risk assessment value based on the target risk assessment value.
评估指标获取单元42,用于根据基金类型,获取与基金类型相对应的待荐基金产品和基金评价指标。The evaluation index obtaining unit 42 is configured to obtain the fund products to be recommended and the fund evaluation indicators corresponding to the fund type according to the fund type.
目标基金产品确定单元43,用于依据基金评价指标,采用快速排序算法对待荐基金产品进行排序,确定目标基金产品。The target fund product determining unit 43 is configured to sort the recommended fund products by using a quick sorting algorithm according to the fund evaluation index, and determine the target fund product.
实施例3Example 3
本实施例提供一计算机可读存储介质,该计算机可读存储介质上存储有计算机可读指令,该计算机可读指令被处理器执行时实现实施例1中基金产品推荐方法,为避免重复,这里不再赘述。或者,该计算机可读指令被处理器执行时实现实施例2中基金产品推荐装置中各模块/单元的功能,为避免重复,这里不再赘述。The embodiment provides a computer readable storage medium having stored thereon computer readable instructions, which are implemented by the processor to implement the fund product recommendation method in Embodiment 1, in order to avoid duplication, here No longer. Alternatively, when the computer readable instructions are executed by the processor, the functions of the modules/units in the fund product recommendation device in Embodiment 2 are implemented. To avoid repetition, details are not described herein again.
实施例4Example 4
图7是本申请一实施例提供的终端设备的示意图。如图7所示,该实施例的终端设备70包括:处理器71、存储器72以及存储在存储器72中并可在处理器71上运行的计算机可读指令73,处理器71执行计算机可读指令73时实现实施例1中基金产品推荐方法的各个步骤,例如图1所示的步骤S10、S20、S30和S40。或者,处理器71执行计算机可读指令73时实现实施例2中基金产品推荐装置各模块/单元的功能,如图6所示当前用户画像数据获取模块10、用户数据模型获取模块20、目标聚类类簇确定模块30和目标基金产品确定模块40的功能。FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application. As shown in FIG. 7, the terminal device 70 of this embodiment includes a processor 71, a memory 72, and computer readable instructions 73 stored in the memory 72 and operable on the processor 71, the processor 71 executing computer readable instructions Each step of the fund product recommendation method in Embodiment 1 is implemented at 73 hours, such as steps S10, S20, S30, and S40 shown in FIG. Alternatively, when the processor 71 executes the computer readable instructions 73, the functions of the modules/units of the fund product recommendation device in Embodiment 2 are implemented. As shown in FIG. 6, the current user profile data acquisition module 10, the user data model acquisition module 20, and the target aggregation The class cluster determination module 30 and the target fund product determination module 40 function.
示例性的,计算机可读指令73可以被分割成一个或多个模块/单元,一个或者多个模块/单元被存储在存储器72中,并由处理器71执行,以完成本申请。一个或多个模块/单元可以是能够完成特定功能的一系列计算机可读指令73的指令段,该指令段用于描述计算机可读指令73在终端设备70中的执行过程。例如,计算机可读指令73可以被分割成当前用户画像数据获取模块10、用户数据模型获取模块20、目标聚类类簇确定模块30、目标基金产品确定模块40、用户数据模型训练模块50和用户数据模型存储模块60,其功能如下:Illustratively, computer readable instructions 73 may be partitioned into one or more modules/units, one or more modules/units being stored in memory 72 and executed by processor 71 to complete the application. The one or more modules/units may be an instruction segment of a series of computer readable instructions 73 capable of performing a particular function, which is used to describe the execution of computer readable instructions 73 in the terminal device 70. For example, the computer readable instructions 73 may be divided into a current user portrait data acquisition module 10, a user data model acquisition module 20, a target cluster cluster determination module 30, a target fund product determination module 40, a user data model training module 50, and a user. The data model storage module 60 has the following functions:
当前用户画像数据获取模块10,用于获取当前用户画像数据,当前用户画像数据包括至少一个当前特征数据。The current user portrait data obtaining module 10 is configured to acquire current user portrait data, and the current user portrait data includes at least one current feature data.
用户数据模型获取模块20,用于获取用户数据模型,用户数据模型包括至少两个聚类类簇,每一聚类类簇对应一风险评估值。The user data model obtaining module 20 is configured to acquire a user data model, where the user data model includes at least two clustering clusters, and each clustering cluster corresponds to a risk assessment value.
目标聚类类簇确定模块30,用于基于当前用户画像数据,从至少两个聚类类簇中获取与至少一个当前特征数据相对应的目标聚类类簇。The target clustering cluster determining module 30 is configured to acquire a target clustering cluster corresponding to the at least one current feature data from the at least two clustering clusters based on the current user image data.
目标基金产品确定模块40,用于基于目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product determining module 40 is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
优选地,基金产品推荐装置还包括用户数据模型训练模块50和用户数据模型存储模块60。Preferably, the fund product recommendation device further includes a user data model training module 50 and a user data model storage module 60.
用户数据模型训练模块50,用于基于训练用户画像数据训练用户数据模型,训练用户画像数据包括至少一个训练特征数据。The user data model training module 50 is configured to train the user data model based on the training user image data, and the training user image data includes at least one training feature data.
用户数据模型存储模块60,用于将用户数据模型存储在数据库中。The user data model storage module 60 is configured to store the user data model in a database.
用户数据模型获取模块20,用于从数据库中获取用户数据模型。The user data model obtaining module 20 is configured to obtain a user data model from a database.
优选地,用户数据模型训练模块50包括标准化处理单元51、聚类类簇获取单元52、风险评估值获取单元53和数据模型获取单元54。Preferably, the user data model training module 50 includes a normalization processing unit 51, a cluster cluster acquisition unit 52, a risk evaluation value acquisition unit 53, and a data model acquisition unit 54.
标准化处理单元51,用于对训练用户画像数据中的至少一个标准特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;a normalization processing unit 51, configured to perform normalization processing on at least one standard feature data in the training user image data, so that the training user image data includes at least one standard feature data;
聚类类簇获取单元52,用于采用K-means聚类算法对训练用户画像数据中至少一个标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据。The clustering cluster acquiring unit 52 is configured to cluster at least one standard feature data in the training user portrait data by using a K-means clustering algorithm, and acquire at least two clustering clusters, each cluster cluster corresponding to a centroid User portrait data.
风险评估值获取单元53,用于采用加权运算算法对质心用户画像数据进行加权处理,确定质心用户画像数据对应一风险评估值;加权运算算法为
Figure PCTCN2018074570-appb-000008
其中,
Figure PCTCN2018074570-appb-000009
Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重。
The risk evaluation value obtaining unit 53 is configured to perform weighting processing on the centroid user portrait data by using a weighting operation algorithm, and determine that the centroid user portrait data corresponds to a risk evaluation value; the weighting operation algorithm is
Figure PCTCN2018074570-appb-000008
among them,
Figure PCTCN2018074570-appb-000009
Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data.
数据模型获取单元54,用于基于聚类类簇和风险评估值,获取用户数据模型。The data model obtaining unit 54 is configured to acquire a user data model based on the clustering cluster and the risk assessment value.
优选地,目标聚类类簇确定模块30包括欧氏距离获取单元31和目标聚类类簇选取单元32。Preferably, the target clustering class cluster determining module 30 includes an Euclidean distance acquiring unit 31 and a target clustering cluster selecting unit 32.
欧氏距离获取单元31,用于将当前用户画像数据分别与用户数据模型中至少两个聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离。The Euclidean distance obtaining unit 31 is configured to calculate the current user image data and the centroid user image data of at least two cluster clusters in the user data model to obtain at least two Euclidean distances.
目标聚类类簇选取单元32,用于选取至少两个欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个当前特征数据相对应的目标聚类类簇。The target clustering cluster selection unit 32 is configured to select a cluster cluster in which the centroid user portrait data corresponding to the minimum value of the at least two Euclidean distances is located as the target cluster cluster corresponding to the at least one current feature data.
优选地,目标基金产品确定模块40包括基金类型确定单元41、评估指标获取单元42 和目标基金产品确定单元43。Preferably, the target fund product determining module 40 includes a fund type determining unit 41, an evaluation index obtaining unit 42, and a target fund product determining unit 43.
基金类型确定单元41,用于基于目标风险评估值,确定与目标风险评估值对应的目标基金类型。The fund type determining unit 41 is configured to determine a target fund type corresponding to the target risk assessment value based on the target risk assessment value.
评估指标获取单元42,用于根据基金类型,获取与基金类型相对应的待荐基金产品和基金评价指标。The evaluation index obtaining unit 42 is configured to obtain the fund products to be recommended and the fund evaluation indicators corresponding to the fund type according to the fund type.
目标基金产品确定单元43,用于依据基金评价指标,采用快速排序算法对待荐基金产品进行排序,确定目标基金产品。The target fund product determining unit 43 is configured to sort the recommended fund products by using a quick sorting algorithm according to the fund evaluation index, and determine the target fund product.
该终端设备70可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。终端设备可包括,但不仅限于,处理器71、存储器72。本领域技术人员可以理解,图7仅仅终端设备70的示例,并不构成对终端设备70的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如终端设备还可以包括输入输出设备、网络接入设备、总线等。The terminal device 70 can be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The terminal device may include, but is not limited to, a processor 71, a memory 72. It will be understood by those skilled in the art that FIG. 7 is only an example of the terminal device 70, and does not constitute a limitation of the terminal device 70, and may include more or less components than those illustrated, or combine some components, or different components. For example, the terminal device may further include an input/output device, a network access device, a bus, and the like.
所称处理器71可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 71 may be a central processing unit (CPU), or may be other general-purpose processors, a digital signal processor (DSP), an application specific integrated circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
存储器72可以是终端设备70的内部存储单元,例如终端设备70的硬盘或内存。存储器72也可以是终端设备70的外部存储设备,例如终端设备70上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,存储器72还可以既包括终端设备70的内部存储单元也包括外部存储设备。存储器72用于存储计算机可读指令73以及终端设备所需的其他程序和数据。存储器72还可以用于暂时地存储已经输出或者将要输出的数据。The memory 72 may be an internal storage unit of the terminal device 70, such as a hard disk or a memory of the terminal device 70. The memory 72 may also be an external storage device of the terminal device 70, such as a plug-in hard disk provided on the terminal device 70, a smart memory card (SMC), a Secure Digital (SD) card, and a flash memory card (Flash). Card) and so on. Further, the memory 72 may also include both an internal storage unit of the terminal device 70 and an external storage device. The memory 72 is used to store computer readable instructions 73 and other programs and data required by the terminal device. The memory 72 can also be used to temporarily store data that has been or will be output.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。It will be clearly understood by those skilled in the art that, for convenience and brevity of description, only the division of each functional unit and module described above is exemplified. In practical applications, the above functions may be assigned to different functional units according to needs. The module is completed by dividing the internal structure of the device into different functional units or modules to perform all or part of the functions described above.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既 可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的模块/单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实现上述实施例方法中的全部或部分流程,也可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一计算机可读存储介质中,该计算机可读指令在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机可读指令包括计算机可读指令代码,所述计算机可读指令代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质可以包括:能够携带所述计算机可读指令代码的任何实体或装置、记录介质、U盘、移动硬盘、磁碟、光盘、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质等。需要说明的是,所述计算机可读介质包含的内容可以根据司法管辖区内立法和专利实践的要求进行适当的增减,例如在某些司法管辖区,根据立法和专利实践,计算机可读介质不包括是电载波信号和电信信号。The integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, the present application implements all or part of the processes in the foregoing embodiments, and may also be implemented by computer readable instructions, which may be stored in a computer readable storage medium. The computer readable instructions, when executed by a processor, may implement the steps of the various method embodiments described above. Wherein, the computer readable instructions comprise computer readable instruction code, which may be in the form of source code, an object code form, an executable file or some intermediate form or the like. The computer readable medium can include any entity or device capable of carrying the computer readable instruction code, a recording medium, a USB flash drive, a removable hard drive, a magnetic disk, an optical disk, a computer memory, a read only memory (ROM, Read-Only) Memory), random access memory (RAM), electrical carrier signals, telecommunications signals, and software distribution media. It should be noted that the content contained in the computer readable medium may be appropriately increased or decreased according to the requirements of legislation and patent practice in a jurisdiction, for example, in some jurisdictions, according to legislation and patent practice, computer readable media It does not include electrical carrier signals and telecommunication signals.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to explain the technical solutions of the present application, and are not limited thereto; although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still implement the foregoing embodiments. The technical solutions described in the examples are modified or equivalently replaced with some of the technical features; and the modifications or substitutions do not deviate from the spirit and scope of the technical solutions of the embodiments of the present application, and should be included in the present disclosure. Within the scope of protection of the application.

Claims (20)

  1. 一种基金产品推荐方法,其特征在于,包括:A fund product recommendation method, characterized in that it comprises:
    获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
    获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
    基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
    基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  2. 如权利要求1所述的基金产品推荐方法,其特征在于,所述获取用户数据模型之前,所述基金产品推荐方法还包括:The method for recommending a fund product according to claim 1, wherein the method for recommending the fund product further comprises:
    基于训练用户画像数据训练所述用户数据模型,所述训练用户画像数据包括至少一个训练特征数据;Training the user data model based on training user portrait data, the training user portrait data including at least one training feature data;
    将所述用户数据模型存储在数据库中;Storing the user data model in a database;
    所述获取用户数据模型,包括:从所述数据库中获取所述用户数据模型。The acquiring the user data model includes: obtaining the user data model from the database.
  3. 如权利要求2所述的基金产品推荐方法,其特征在于,所述基于训练用户画像数据训练所述用户数据模型,包括:The fund product recommendation method according to claim 2, wherein the training the user data model based on training user portrait data comprises:
    对所述训练用户画像数据中的至少一个所述训练特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;Performing normalization processing on at least one of the training feature data in the training user image data, so that the training user image data includes at least one standard feature data;
    采用K-means聚类算法对所述训练用户画像数据中至少一个所述标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据;The K-means clustering algorithm is used to cluster at least one of the standard feature data in the training user image data to obtain at least two clustering clusters, and each clustering cluster corresponds to a centroid user image data;
    采用加权运算算法对所述质心用户画像数据进行加权处理,确定所述质心用户画像数据对应一风险评估值,所述加权运算算法为P i=∑V i·W i,其中,∑W i=1,Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重; The weighting operation algorithm is used to perform weighting processing on the centroid user image data, and the centroid user image data is determined to correspond to a risk evaluation value, where the weighting operation algorithm is P i =∑V i ·W i , where ∑W i = 1, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data;
    基于所述聚类类簇和所述风险评估值,获取所述用户数据模型。The user data model is obtained based on the cluster cluster and the risk assessment value.
  4. 如权利要求1所述的基金产品推荐方法,其特征在于,所述基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇,包括:The fund product recommendation method according to claim 1, wherein the acquiring a target corresponding to at least one of the current feature data from at least two of the cluster clusters based on the current user portrait data Clustering clusters, including:
    将所述当前用户画像数据分别与所述用户数据模型中至少两个所述聚类类簇的质心 用户画像数据进行计算,获取至少两个欧氏距离;Calculating, respectively, the current user portrait data and the centroid user portrait data of at least two of the cluster clusters in the user data model to obtain at least two Euclidean distances;
    选取至少两个所述欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个所述当前特征数据相对应的所述目标聚类类簇。And selecting a cluster cluster in which the centroid user image data corresponding to the minimum value of the at least two of the Euclidean distances is located as the target cluster cluster corresponding to the at least one of the current feature data.
  5. 如权利要求1所述的基金产品推荐方法,其特征在于,所述基于所述目标聚类类簇对应的风险评估值,确定目标基金产品,包括:The fund product recommendation method according to claim 1, wherein the determining the target fund product based on the risk assessment value corresponding to the target clustering cluster comprises:
    基于所述目标聚类类簇对应的风险评估值,确定对应的目标基金类型;Determining a corresponding target fund type based on the risk assessment value corresponding to the target clustering cluster;
    根据所述基金类型,获取与所述基金类型相对应的待荐基金产品和基金评价指标;Obtaining a fund product and fund evaluation index corresponding to the fund type according to the type of the fund;
    依据所述基金评价指标,采用快速排序算法对所述待荐基金产品进行排序,确定所述目标基金产品。According to the fund evaluation index, the quick-sorting algorithm is used to sort the products to be recommended to determine the target fund product.
  6. 一种基金产品推荐装置,其特征在于,包括:A fund product recommendation device, comprising:
    当前用户画像数据获取模块,用于获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;a current user image data obtaining module, configured to acquire current user portrait data, where the current user portrait data includes at least one current feature data;
    用户数据模型获取模块,用于获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;a user data model obtaining module, configured to acquire a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
    目标聚类类簇确定模块,用于基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;a target clustering cluster determining module, configured to acquire, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
    目标基金产品确定模块,用于基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product determining module is configured to determine the target fund product based on the risk assessment value corresponding to the target clustering cluster.
  7. 如权利要求6所述的基金产品推荐装置,其特征在于,所述基金产品推荐装置还包括:The fund product recommendation device according to claim 6, wherein the fund product recommendation device further comprises:
    用户数据模型训练模块,用于基于训练用户画像数据训练所述用户数据模型,所述训练用户画像数据包括至少一个训练特征数据;a user data model training module, configured to train the user data model based on training user portrait data, the training user portrait data including at least one training feature data;
    用户数据模型存储模块,用于将所述用户数据模型存储在数据库中;a user data model storage module, configured to store the user data model in a database;
    所述用户数据模型获取模块,用于从所述数据库中获取所述用户数据模型。The user data model obtaining module is configured to obtain the user data model from the database.
  8. 如权利要求7所述的基金产品推荐装置,其特征在于,所述用户数据模型训练模块包括:The fund product recommendation device according to claim 7, wherein the user data model training module comprises:
    标准化处理单元,用于对所述训练用户画像数据中的至少一个所述标准特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;a normalization processing unit, configured to perform normalization processing on at least one of the standard feature data in the training user image data, so that the training user image data includes at least one standard feature data;
    聚类类簇获取单元,用于采用K-means聚类算法对所述训练用户画像数据中至少一个所述标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像 数据;a clustering cluster acquiring unit, configured to cluster at least one of the standard feature data in the training user portrait data by using a K-means clustering algorithm, to acquire at least two clustering clusters, each cluster cluster Corresponding to a centroid user image data;
    风险评估值获取单元,用于采用加权运算算法对所述质心用户画像数据进行加权处理,确定所述质心用户画像数据对应一风险评估值;a risk evaluation value obtaining unit, configured to perform weighting processing on the centroid user image data by using a weighting operation algorithm, and determine that the centroid user image data corresponds to a risk evaluation value;
    数据模型获取单元,用于基于所述聚类类簇和所述风险评估值,获取所述用户数据模型。And a data model obtaining unit, configured to acquire the user data model based on the clustering cluster and the risk assessment value.
  9. 如权利要求6所述的基金产品推荐装置,其特征在于,所述目标聚类类簇确定模块包括:The fund product recommendation device according to claim 6, wherein the target clustering cluster determining module comprises:
    欧氏距离获取单元,用于将所述当前用户画像数据分别与所述用户数据模型中至少两个所述聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离;An Euclidean distance acquisition unit, configured to calculate the current user image data and the centroid user image data of at least two of the cluster clusters in the user data model to obtain at least two Euclidean distances;
    目标聚类类簇选取单元,用于选取至少两个所述欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个所述当前特征数据相对应的所述目标聚类类簇。a target clustering cluster selecting unit, configured to select a clustering cluster in which at least two of the at least two of the Euclidean distances correspond to the centroid user image data as the target cluster corresponding to the at least one of the current feature data Class class cluster.
  10. 如权利要求6所述的基金产品推荐装置,其特征在于,所述目标基金产品确定模块包括:The fund product recommendation device according to claim 6, wherein the target fund product determination module comprises:
    基金类型确定单元,用于基于所述目标风险评估值,确定与所述目标风险评估值相对应的目标基金类型;a fund type determining unit, configured to determine a target fund type corresponding to the target risk assessment value based on the target risk assessment value;
    评估指标获取单元,用于根据所述基金类型,获取与所述基金类型相对应的待荐基金产品和基金评价指标;An evaluation index obtaining unit, configured to obtain, according to the fund type, a fund product and fund evaluation index corresponding to the fund type;
    目标基金产品确定单元,用于依据所述基金评价指标对所述待荐基金产品采用快速排序算法进行排序,确定所述目标基金产品。The target fund product determining unit is configured to sort the to-be-recommended fund products by using a quick sorting algorithm according to the fund evaluation index, and determine the target fund product.
  11. 一种终端设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现如下步骤:A terminal device comprising a memory, a processor, and computer readable instructions stored in the memory and operable on the processor, wherein the processor executes the computer readable instructions as follows step:
    获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
    获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
    基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
    基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  12. 如权利要求11所述的终端设备,其特征在于,所述获取用户数据模型的步骤之前,所述处理器执行所述计算机可读指令时还实现如下步骤:The terminal device according to claim 11, wherein before said step of acquiring a user data model, said processor further implements the following steps when said computer readable instructions are executed:
    基于训练用户画像数据训练所述用户数据模型,所述训练用户画像数据包括至少一个训练特征数据;Training the user data model based on training user portrait data, the training user portrait data including at least one training feature data;
    将所述用户数据模型存储在数据库中;Storing the user data model in a database;
    所述获取用户数据模型,包括:从所述数据库中获取所述用户数据模型。The acquiring the user data model includes: obtaining the user data model from the database.
  13. 如权利要求12所述的终端设备,其特征在于,所述基于训练用户画像数据训练所述用户数据模型,包括:The terminal device according to claim 12, wherein the training the user data model based on training user portrait data comprises:
    对所述训练用户画像数据中的至少一个所述训练特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;Performing normalization processing on at least one of the training feature data in the training user image data, so that the training user image data includes at least one standard feature data;
    采用K-means聚类算法对所述训练用户画像数据中至少一个所述标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据;The K-means clustering algorithm is used to cluster at least one of the standard feature data in the training user image data to obtain at least two clustering clusters, and each clustering cluster corresponds to a centroid user image data;
    采用加权运算算法对所述质心用户画像数据进行加权处理,确定所述质心用户画像数据对应一风险评估值,所述加权运算算法为P i=∑V i·W i,其中,∑W i=1,Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重; The weighting operation algorithm is used to perform weighting processing on the centroid user image data, and the centroid user image data is determined to correspond to a risk evaluation value, where the weighting operation algorithm is P i =∑V i ·W i , where ∑W i = 1, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data;
    基于所述聚类类簇和所述风险评估值,获取所述用户数据模型。The user data model is obtained based on the cluster cluster and the risk assessment value.
  14. 如权利要求11所述的终端设备,其特征在于,所述基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇,包括:The terminal device according to claim 11, wherein the acquiring, based on the current user portrait data, target clusters corresponding to at least one of the current feature data from at least two cluster clusters Class clusters, including:
    将所述当前用户画像数据分别与所述用户数据模型中至少两个所述聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离;Calculating, by the current user image data, the centroid user image data of at least two of the cluster clusters in the user data model, to obtain at least two Euclidean distances;
    选取至少两个所述欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个所述当前特征数据相对应的所述目标聚类类簇。And selecting a cluster cluster in which the centroid user image data corresponding to the minimum value of the at least two of the Euclidean distances is located as the target cluster cluster corresponding to the at least one of the current feature data.
  15. 如权利要求11所述的终端设备,其特征在于,所述基于所述目标聚类类簇对应的风险评估值,确定目标基金产品,包括:The terminal device according to claim 11, wherein the determining the target fund product based on the risk assessment value corresponding to the target clustering cluster comprises:
    基于所述目标聚类类簇对应的风险评估值,确定对应的目标基金类型;Determining a corresponding target fund type based on the risk assessment value corresponding to the target clustering cluster;
    根据所述基金类型,获取与所述基金类型相对应的待荐基金产品和基金评价指标;Obtaining a fund product and fund evaluation index corresponding to the fund type according to the type of the fund;
    依据所述基金评价指标,采用快速排序算法对所述待荐基金产品进行排序,确定所述目标基金产品。According to the fund evaluation index, the quick-sorting algorithm is used to sort the products to be recommended to determine the target fund product.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被处理器执行时实现如下步骤:A computer readable storage medium storing computer readable instructions, wherein the computer readable instructions, when executed by a processor, implement the following steps:
    获取当前用户画像数据,所述当前用户画像数据包括至少一个当前特征数据;Obtaining current user portrait data, the current user portrait data including at least one current feature data;
    获取用户数据模型,所述用户数据模型包括至少两个聚类类簇,每一所述聚类类簇对应一风险评估值;Obtaining a user data model, where the user data model includes at least two clustering clusters, each of the clustering clusters corresponding to a risk assessment value;
    基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇;And acquiring, according to the current user portrait data, a target clustering cluster corresponding to at least one of the current feature data from at least two of the clustering clusters;
    基于所述目标聚类类簇对应的风险评估值,确定目标基金产品。The target fund product is determined based on the risk assessment value corresponding to the target clustering cluster.
  17. 如权利要求16所述的计算机可读存储介质,其特征在于,在所述获取用户数据模型的步骤之前,所述计算机可读指令被所述处理器执行时还实现如下步骤:The computer readable storage medium of claim 16 wherein said computer readable instructions, when executed by said processor, further implement the following steps prior to said step of obtaining a user data model:
    基于训练用户画像数据训练所述用户数据模型,所述训练用户画像数据包括至少一个训练特征数据;Training the user data model based on training user portrait data, the training user portrait data including at least one training feature data;
    将所述用户数据模型存储在数据库中;Storing the user data model in a database;
    所述获取用户数据模型,包括:从所述数据库中获取所述用户数据模型。The acquiring the user data model includes: obtaining the user data model from the database.
  18. 如权利要求17所述的计算机可读存储介质,其特征在于,所述基于训练用户画像数据训练所述用户数据模型,包括:The computer readable storage medium of claim 17, wherein the training the user data model based on training user portrait data comprises:
    对所述训练用户画像数据中的至少一个所述训练特征数据进行标准化处理,以使训练用户画像数据包括至少一个标准特征数据;Performing normalization processing on at least one of the training feature data in the training user image data, so that the training user image data includes at least one standard feature data;
    采用K-means聚类算法对所述训练用户画像数据中至少一个所述标准特征数据进行聚类,获取至少二个聚类类簇,每一聚类类簇对应一质心用户画像数据;The K-means clustering algorithm is used to cluster at least one of the standard feature data in the training user image data to obtain at least two clustering clusters, and each clustering cluster corresponds to a centroid user image data;
    采用加权运算算法对所述质心用户画像数据进行加权处理,确定所述质心用户画像数据对应一风险评估值,所述加权运算算法为P i=∑V i·W i,其中,∑W i=1,Pi为质心用户的风险评估值,Vi为质心用户画像数据中每一标准特征数据的值,Wi是每一种标准特征数据的权重; The weighting operation algorithm is used to perform weighting processing on the centroid user image data, and the centroid user image data is determined to correspond to a risk evaluation value, where the weighting operation algorithm is P i =∑V i ·W i , where ∑W i = 1, Pi is the risk assessment value of the centroid user, Vi is the value of each standard feature data in the centroid user portrait data, and Wi is the weight of each standard feature data;
    基于所述聚类类簇和所述风险评估值,获取所述用户数据模型。The user data model is obtained based on the cluster cluster and the risk assessment value.
  19. 如权利要求16所述的计算机可读存储介质,其特征在于,所述基于所述当前用户画像数据,从至少两个所述聚类类簇中获取与至少一个所述当前特征数据相对应的目标聚类类簇,包括:The computer readable storage medium according to claim 16, wherein said acquiring, based on said current user portrait data, at least two of said clustering clusters corresponds to at least one of said current feature data Target clustering clusters, including:
    将所述当前用户画像数据分别与所述用户数据模型中至少两个所述聚类类簇的质心用户画像数据进行计算,获取至少两个欧氏距离;Calculating, by the current user image data, the centroid user image data of at least two of the cluster clusters in the user data model, to obtain at least two Euclidean distances;
    选取至少两个所述欧氏距离中最小值对应的质心用户画像数据所在的聚类类簇作为与至少一个所述当前特征数据相对应的所述目标聚类类簇。And selecting a cluster cluster in which the centroid user image data corresponding to the minimum value of the at least two of the Euclidean distances is located as the target cluster cluster corresponding to the at least one of the current feature data.
  20. 如权利要求16所述的计算机可读存储介质,其特征在于,所述基于所述目标聚类类簇对应的风险评估值,确定目标基金产品,包括:The computer readable storage medium according to claim 16, wherein the determining the target fund product based on the risk assessment value corresponding to the target clustering cluster comprises:
    基于所述目标聚类类簇对应的风险评估值,确定对应的目标基金类型;Determining a corresponding target fund type based on the risk assessment value corresponding to the target clustering cluster;
    根据所述基金类型,获取与所述基金类型相对应的待荐基金产品和基金评价指标;依据所述基金评价指标,采用快速排序算法对所述待荐基金产品进行排序,确定所述目标基金产品。According to the fund type, obtaining a fund product and fund evaluation index corresponding to the fund type; according to the fund evaluation index, using a quick sorting algorithm to sort the fund products to be recommended, and determining the target fund product.
PCT/CN2018/074570 2017-09-28 2018-01-30 Fund product recommendation method and apparatus, terminal device, and storage medium WO2019061976A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710899347.8 2017-09-28
CN201710899347.8A CN107679946B (en) 2017-09-28 2017-09-28 Fund product recommendation method and device, terminal equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2019061976A1 true WO2019061976A1 (en) 2019-04-04

Family

ID=61139156

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/074570 WO2019061976A1 (en) 2017-09-28 2018-01-30 Fund product recommendation method and apparatus, terminal device, and storage medium

Country Status (2)

Country Link
CN (1) CN107679946B (en)
WO (1) WO2019061976A1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135694A (en) * 2019-04-12 2019-08-16 深圳壹账通智能科技有限公司 Product risks appraisal procedure, device, computer equipment and storage medium
CN110223155A (en) * 2019-04-25 2019-09-10 深圳壹账通智能科技有限公司 Method for pushing, device and the computer equipment of investment recommendation information
CN110458600A (en) * 2019-07-08 2019-11-15 平安科技(深圳)有限公司 Portrait model training method, device, computer equipment and storage medium
CN110490729A (en) * 2019-08-16 2019-11-22 南京汇银迅信息技术有限公司 A kind of financial user classification method based on user's portrait model
CN111177505A (en) * 2019-12-31 2020-05-19 中国移动通信集团江苏有限公司 Training method, recommendation method and device of index anomaly detection model
CN111209953A (en) * 2020-01-03 2020-05-29 腾讯科技(深圳)有限公司 Method and device for recalling neighbor vector, computer equipment and storage medium
CN111444368A (en) * 2020-03-25 2020-07-24 平安科技(深圳)有限公司 Method and device for constructing user portrait, computer equipment and storage medium
CN111475719A (en) * 2020-03-30 2020-07-31 招商局金融科技有限公司 Information pushing method and device based on data mining and storage medium
CN112418956A (en) * 2020-12-16 2021-02-26 国网雄安金融科技集团有限公司 Financial product recommendation method and device
CN112926816A (en) * 2020-09-08 2021-06-08 广东电网有限责任公司 Supplier evaluation method, supplier evaluation device, computer equipment and storage medium
CN113065739A (en) * 2021-02-24 2021-07-02 广州互联网法院 Executed person fulfillment ability evaluation method and device and electronic equipment
CN113744030A (en) * 2021-09-08 2021-12-03 未鲲(上海)科技服务有限公司 Recommendation method, device, server and medium based on AI user portrait
CN114398557A (en) * 2022-01-18 2022-04-26 平安国际智慧城市科技股份有限公司 Information recommendation method and device based on double portraits, electronic equipment and storage medium
CN114398557B (en) * 2022-01-18 2024-04-30 平安国际智慧城市科技股份有限公司 Information recommendation method and device based on double images, electronic equipment and storage medium

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108492194A (en) * 2018-03-06 2018-09-04 平安科技(深圳)有限公司 Products Show method, apparatus and storage medium
CN108389133A (en) * 2018-03-19 2018-08-10 朱将中 A kind of intelligent auxiliary throws the decision-making technique of Gu
CN110288112A (en) * 2018-03-19 2019-09-27 朱将中 A kind of intelligence wide towards range throws the judgment method of Gu
CN108665355B (en) * 2018-05-18 2023-06-02 深圳壹账通智能科技有限公司 Financial product recommendation method, apparatus, device and computer storage medium
CN108985935A (en) * 2018-07-06 2018-12-11 兴业证券股份有限公司 Financial product recommended method and storage medium
CN108985638B (en) * 2018-07-25 2020-07-24 腾讯科技(深圳)有限公司 User investment risk assessment method and device and storage medium
CN109191307A (en) * 2018-08-08 2019-01-11 平安科技(深圳)有限公司 Asset Allocation reasonability judgment method, system, computer equipment and storage medium
CN109447728A (en) * 2018-09-07 2019-03-08 平安科技(深圳)有限公司 Financial product recommended method, device, computer equipment and storage medium
CN109766454A (en) * 2019-01-18 2019-05-17 国家电网有限公司 A kind of investor's classification method, device, equipment and medium
CN109903082B (en) * 2019-01-24 2022-10-28 平安科技(深圳)有限公司 Clustering method based on user portrait, electronic device and storage medium
CN110033378A (en) * 2019-01-29 2019-07-19 阿里巴巴集团控股有限公司 A kind of resource allocation method, device and electronic equipment
CN111724007B (en) * 2019-03-18 2022-12-06 马上消费金融股份有限公司 Risk evaluation method, evaluation device, intelligent system and storage device
CN110009503A (en) * 2019-04-03 2019-07-12 平安信托有限责任公司 Finance product recommended method, device, computer equipment and storage medium
CN110163723A (en) * 2019-05-20 2019-08-23 深圳市和讯华谷信息技术有限公司 Recommended method, device, computer equipment and storage medium based on product feature
CN110428322A (en) * 2019-06-12 2019-11-08 平安科技(深圳)有限公司 A kind of adaptation method and device of business datum
CN110929155B (en) * 2019-11-28 2023-12-19 中国银行股份有限公司 Product information recommendation method and device, electronic equipment and storage medium
CN111210201B (en) * 2020-01-02 2021-02-26 平安科技(深圳)有限公司 Occupational label establishing method and device, electronic equipment and storage medium
CN111429232A (en) * 2020-04-12 2020-07-17 中信银行股份有限公司 Product recommendation method and device, electronic equipment and computer-readable storage medium
CN112330412B (en) * 2020-11-17 2024-04-05 中国平安财产保险股份有限公司 Product recommendation method and device, computer equipment and storage medium
CN113610580B (en) * 2021-08-10 2023-09-19 平安科技(深圳)有限公司 Product recommendation method and device, electronic equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090132347A1 (en) * 2003-08-12 2009-05-21 Russell Wayne Anderson Systems And Methods For Aggregating And Utilizing Retail Transaction Records At The Customer Level
CN106097044A (en) * 2016-06-01 2016-11-09 腾讯科技(深圳)有限公司 A kind of data recommendation processing method and device
CN106228399A (en) * 2016-07-20 2016-12-14 福建工程学院 A kind of stock trader's customer risk preference categories method based on big data
CN106504099A (en) * 2015-09-07 2017-03-15 国家计算机网络与信息安全管理中心 A kind of system for building user's portrait
CN106991609A (en) * 2016-01-21 2017-07-28 阿里巴巴集团控股有限公司 The recommendation method and apparatus of investment product

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5775425B2 (en) * 2011-11-18 2015-09-09 日本電信電話株式会社 Purchase data analysis apparatus, method, and program
CN105760957B (en) * 2016-02-23 2017-05-31 国元证券股份有限公司 A kind of Forecasting Methodology of the soft customer revenue of security
CN106530073A (en) * 2016-09-12 2017-03-22 国网辽宁省电力有限公司鞍山供电公司 Method of analyzing user credit rate based on CART algorithm
CN106503438A (en) * 2016-10-20 2017-03-15 上海科瓴医疗科技有限公司 A kind of H RFM user modeling method and system for pharmacy member analysis
CN106600369A (en) * 2016-12-09 2017-04-26 广东奡风科技股份有限公司 Real-time recommendation system and method of financial products of banks based on Naive Bayesian classification

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090132347A1 (en) * 2003-08-12 2009-05-21 Russell Wayne Anderson Systems And Methods For Aggregating And Utilizing Retail Transaction Records At The Customer Level
CN106504099A (en) * 2015-09-07 2017-03-15 国家计算机网络与信息安全管理中心 A kind of system for building user's portrait
CN106991609A (en) * 2016-01-21 2017-07-28 阿里巴巴集团控股有限公司 The recommendation method and apparatus of investment product
CN106097044A (en) * 2016-06-01 2016-11-09 腾讯科技(深圳)有限公司 A kind of data recommendation processing method and device
CN106228399A (en) * 2016-07-20 2016-12-14 福建工程学院 A kind of stock trader's customer risk preference categories method based on big data

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110135694A (en) * 2019-04-12 2019-08-16 深圳壹账通智能科技有限公司 Product risks appraisal procedure, device, computer equipment and storage medium
CN110223155A (en) * 2019-04-25 2019-09-10 深圳壹账通智能科技有限公司 Method for pushing, device and the computer equipment of investment recommendation information
CN110458600A (en) * 2019-07-08 2019-11-15 平安科技(深圳)有限公司 Portrait model training method, device, computer equipment and storage medium
CN110490729A (en) * 2019-08-16 2019-11-22 南京汇银迅信息技术有限公司 A kind of financial user classification method based on user's portrait model
CN110490729B (en) * 2019-08-16 2022-11-18 南京汇银迅信息技术有限公司 Financial user classification method based on user portrait model
CN111177505A (en) * 2019-12-31 2020-05-19 中国移动通信集团江苏有限公司 Training method, recommendation method and device of index anomaly detection model
CN111209953A (en) * 2020-01-03 2020-05-29 腾讯科技(深圳)有限公司 Method and device for recalling neighbor vector, computer equipment and storage medium
CN111209953B (en) * 2020-01-03 2024-01-16 腾讯科技(深圳)有限公司 Recall method, recall device, computer equipment and storage medium for neighbor vector
CN111444368A (en) * 2020-03-25 2020-07-24 平安科技(深圳)有限公司 Method and device for constructing user portrait, computer equipment and storage medium
CN111475719A (en) * 2020-03-30 2020-07-31 招商局金融科技有限公司 Information pushing method and device based on data mining and storage medium
CN111475719B (en) * 2020-03-30 2023-04-07 招商局金融科技有限公司 Information pushing method and device based on data mining and storage medium
CN112926816A (en) * 2020-09-08 2021-06-08 广东电网有限责任公司 Supplier evaluation method, supplier evaluation device, computer equipment and storage medium
CN112926816B (en) * 2020-09-08 2023-09-22 广东电网有限责任公司 Vendor evaluation method, device, computer device and storage medium
CN112418956A (en) * 2020-12-16 2021-02-26 国网雄安金融科技集团有限公司 Financial product recommendation method and device
CN113065739B (en) * 2021-02-24 2023-07-04 广州互联网法院 Method and device for evaluating performance capability of executed person and electronic equipment
CN113065739A (en) * 2021-02-24 2021-07-02 广州互联网法院 Executed person fulfillment ability evaluation method and device and electronic equipment
CN113744030A (en) * 2021-09-08 2021-12-03 未鲲(上海)科技服务有限公司 Recommendation method, device, server and medium based on AI user portrait
CN114398557A (en) * 2022-01-18 2022-04-26 平安国际智慧城市科技股份有限公司 Information recommendation method and device based on double portraits, electronic equipment and storage medium
CN114398557B (en) * 2022-01-18 2024-04-30 平安国际智慧城市科技股份有限公司 Information recommendation method and device based on double images, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN107679946A (en) 2018-02-09
CN107679946B (en) 2021-09-10

Similar Documents

Publication Publication Date Title
WO2019061976A1 (en) Fund product recommendation method and apparatus, terminal device, and storage medium
US20070282730A1 (en) Consolidation, sharing and analysis of investment information
WO2019205325A1 (en) Method for determining risk level of user, terminal device, and computer-readable storage medium
US20100005035A1 (en) Systems and Methods for a Cross-Linked Investment Trading Platform
US20090265283A1 (en) Systems and Methods for Ranking Investors and Rating Investment Positions
WO2022026433A1 (en) Environmental, social, and governance (esg) performance trends
CN110737917A (en) Data sharing device and method based on privacy protection and readable storage medium
CN113643117A (en) Method, device and storage medium for dynamically adjusting standard cost based on settlement report
US8468081B2 (en) Private company valuation
CN116596576B (en) Target recommendation method and device
Staňková Threshold moving approach with logit models for bankruptcy prediction
US10242068B1 (en) Methods and systems for ranking leads based on given characteristics
WO2019218517A1 (en) Server, method for processing text data and storage medium
US20170061548A1 (en) Advice engine
US10394834B1 (en) Methods and systems for ranking leads based on given characteristics
Kumar Examining the dynamic and non-linear linkages between crude oil price and Indian stock market volatility
Andria et al. Fintech payment application in improving customer services
CN110852392A (en) User grouping method, device, equipment and medium
Koshiyama et al. A derivatives trading recommendation system: The mid‐curve calendar spread case
US11836143B1 (en) Apparatus and methods for generating an instruction set for a user
US11397783B1 (en) Ranking similar users based on values and personal journeys
CN110941714A (en) Classification rule base construction method, application classification method and device
Dixon et al. A Bayesian approach to ranking private companies based on predictive indicators
US11663668B1 (en) Apparatus and method for generating a pecuniary program
US20240020771A1 (en) Apparatus and method for generating a pecuniary program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18861852

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24/09/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18861852

Country of ref document: EP

Kind code of ref document: A1