WO2020221022A1 - Service object recommendation method - Google Patents

Service object recommendation method Download PDF

Info

Publication number
WO2020221022A1
WO2020221022A1 PCT/CN2020/085254 CN2020085254W WO2020221022A1 WO 2020221022 A1 WO2020221022 A1 WO 2020221022A1 CN 2020085254 W CN2020085254 W CN 2020085254W WO 2020221022 A1 WO2020221022 A1 WO 2020221022A1
Authority
WO
WIPO (PCT)
Prior art keywords
parameter
business object
user
model
characteristic data
Prior art date
Application number
PCT/CN2020/085254
Other languages
French (fr)
Chinese (zh)
Inventor
彭艺
李楠
刘家豪
王超
谢淼
王寅
Original Assignee
阿里巴巴集团控股有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 阿里巴巴集团控股有限公司 filed Critical 阿里巴巴集团控股有限公司
Publication of WO2020221022A1 publication Critical patent/WO2020221022A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0278Product appraisal
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions

Definitions

  • This application relates to the field of data processing technology, and specifically relates to a method for recommending business objects.
  • the recommendation system is to use e-commerce websites to provide customers with product information and suggestions, help users decide what products should be purchased, and simulate sales staff to help customers complete the purchase process.
  • Product cold start refers to the recommendation of products that are lacking in user behavior. Because of the lack of a data basis for recommendation in the case of product cold start, cold start has become a classic problem in the recommendation system.
  • a typical cold-start method of a recommendation system is based on the upper limit of the confidence limit of the multi-armed gambling machine.
  • the processing process includes the following steps: 1) Data collection is performed to construct a product data set, and the product data in the product data set Perform preprocessing to obtain the explicit features of the products in a standardized format; construct the invisible features of the commodities based on the explicit features of the commodities, based on the latent Dirichlet algorithm, set the output invisible feature dimensions, and relabel the commodities; 2) Construct candidates based on the commodity data set Commodity set: cluster the commodity data set according to the invisible characteristics of the commodity, and cluster the commodities. The commodities in the same cluster have similar properties, and the commodities in different clusters are quite different.
  • a product is randomly selected to construct a candidate product set; 3) The selection of the best product from the candidate product set is regarded as a multi-armed gambling machine problem, and the product with the highest estimated score is calculated based on the upper bound algorithm of the confidence interval as the recommended product; 4) After recommending the product with the highest score in the candidate product set to the user, update the user characteristics and weight parameters according to the feedback.
  • the inventor found that the technical solution has at least the following problems: because the above solution requires that the product has sufficient user behavior characteristic data, that is, the user behavior characteristic data must be large enough to correctly evaluate the product. Therefore, it is only suitable for the application scenarios of personalized product recommendation for new users based on parameterized modeling of product value. However, in practical applications, the distribution of more product characteristics is unknown, that is, some products do not have sufficient user behavior characteristic data, and it is impossible to correctly evaluate the value of the product based on a parameterized model constructed based on user behavior data.
  • the new products in the second-hand transaction products account for a relatively large proportion and most of them are single products (orphan products)
  • the corresponding transaction cycle is short, which leads to short exposure time, and because of the exposure flow on the products.
  • the distribution is relatively uniform, so the user behavior characteristic data that can be collected for second-hand goods will be relatively insufficient, that is, the value of second-hand goods cannot be determined based on the user behavior characteristics of a limited dimension, and cold-start recommendations for products with unknown commodity characteristic distribution
  • the above solution cannot correctly evaluate the value of the product, which causes the recommendation result to fail to gradually converge, and thus it is impossible to screen out the products that users are interested in.
  • This application provides a method for recommending business objects to solve the problem in the prior art that the products of interest to users cannot be filtered out in the cold-start scenario of products.
  • This application provides a method for recommending business objects, including:
  • the score of the candidate business object is determined according to the first characteristic data of the candidate business object;
  • the first characteristic data includes user behavior characteristic data;
  • the first The parameter includes a weight parameter related to the first characteristic data, and the second parameter includes an unknown second characteristic data distribution parameter;
  • the business object includes:
  • Optional also includes:
  • the first user feedback information includes operation behavior information and browsing behavior information of the user on the business object.
  • the updating the first parameter and the second parameter according to the first user feedback information includes:
  • the first parameter and the second parameter are updated according to the generated training samples and historical samples.
  • Optional also includes:
  • Optional also includes:
  • Optional also includes:
  • the initializing the first parameter and the second parameter includes:
  • the first parameter and the second parameter to be initialized are determined.
  • the first parameter includes: a parameter of a linear machine learning model or a parameter of a nonlinear machine learning model;
  • the second parameter includes: statistical items related to the Gaussian process, statistical items related to the Dirichlet process, and statistical items related to the infinite-dimensional distribution.
  • the business objects include: commodity objects, video objects, and news objects.
  • the present application also provides a computer-readable storage medium having instructions stored in the computer-readable storage medium, which when run on a computer, cause the computer to execute the above-mentioned various methods.
  • the present application also provides a computer program product including instructions, which when run on a computer, causes the computer to execute the above-mentioned various methods.
  • the score of the candidate business object is determined according to the first characteristic data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model; the score is determined according to the score
  • the set of business objects recommended to the user are sent back to the client; this processing method allows the business object value evaluation model to be divided into parameterized items and non-parameterized items, and comprehensive parameter models and non-parametric models Evaluate the value of business objects with unknown feature distributions. Because non-parametric terms enable the model to fit the feature distribution of unknown business objects, it can continuously narrow the gap between the parametric model and the real environment; therefore, it can effectively improve the performance of business objects with unknown feature distributions. Value accuracy, so that the single recommendation time step loss can converge, and then the accuracy of business object recommendation can be gradually improved.
  • FIG. 1 is a flowchart of an embodiment of a method for recommending a business object provided by this application
  • FIG. 2 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application.
  • FIG. 3 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application.
  • FIG. 4 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application.
  • FIG. 5 is a specific flowchart of an embodiment of a method for recommending business objects provided by this application
  • Fig. 6 is a specific flowchart of an embodiment of a method for recommending a business object provided by the present application.
  • the business object recommendation technical solution provided by the embodiments of this application has the technical idea of dividing the business object value evaluation model into parameterized items and non-parameterized items, and comprehensive parameter models and non-parametric models to evaluate the value of business objects whose characteristic distribution is unknown , And then determine the business object recommended to the user based on the value. Since the non-parametric term allows the model to fit the unknown product feature distribution, it can continuously narrow the gap between the parameter model and the real environment, so it can effectively improve the value accuracy of business objects with unknown feature distribution, so that the single recommended time step loss can be Convergence can gradually improve the accuracy of business object recommendation.
  • FIG. 1 is a flowchart of an embodiment of a method for recommending a business object provided by this application.
  • the execution body of the method includes a device for recommending a business object.
  • a business object recommendation method provided by this application includes:
  • Step S101 Determine the score of the candidate business object according to the feature data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model.
  • the recommendation device is usually deployed on a server, but is not limited to a server, and can also be any device that can implement the business object recommendation method.
  • the equipment equipped with the recommendation device can actively start the recommendation device to perform business object recommendation processing, or submit a business object recommendation request according to the user client, provide the user with a business object recommendation service, and according to the user’s recommendation results We continuously optimize the value evaluation model of business objects, so as to gradually improve the evaluation accuracy of business object scores.
  • the recommending device first receives the business object recommendation request sent by the client.
  • the client terminal includes, but is not limited to, mobile communication equipment, that is, a mobile phone or a smart phone in general, and also includes terminal equipment such as personal computers, PADs, and iPads.
  • the business objects include but are not limited to: commodity objects, video objects, news objects, and so on.
  • commodity objects include but are not limited to: commodity objects, video objects, news objects, and so on.
  • the method provided in the embodiments of the present application will be described below by taking a commodity object as an example.
  • the application scenario of the method provided in the embodiments of the present application may be a recommendation scenario of a business object whose business object value is determined by the first feature data and the second feature data.
  • the first feature data refers to feature data whose data distribution is known, which can be artificially set features, including but not limited to feature data related to user behavior (referred to as user behavior feature data), such as a product being bought in one day The number of times a user clicks on the home user, the number of buyer users who have collected goods in seven days, the number of buyer users who communicated with seller users who sold goods, etc.; the first characteristic data may also include other characteristic data that has nothing to do with user behavior, Such as commodity price, commodity classification, seller location, etc.
  • the second feature data refers to feature data whose data distribution is unknown, that is, features that cannot be clearly expressed in the form of feature data.
  • This application abbreviates this scenario as a scenario with unknown data distribution, also known as a commodity cold start scenario.
  • second-hand commodities sold on a second-hand commodity trading platform account for a large proportion of new products and most of them are orphans (single Product), the corresponding transaction cycle is short, which leads to short product exposure time.
  • the exposure flow is more evenly distributed on multiple products, the behavior data that can be collected by the product will be relatively insufficient, that is, according to these values Relatively insufficient user behavior data cannot accurately assess the value of goods, and second-hand product recommendation scenarios are scenarios with unknown data distribution.
  • the application scenarios of the method provided in the embodiments of this application are not limited to scenarios with unknown data distribution.
  • the methods provided in this application can also be used in other scenarios where business objects need to be recommended to users.
  • the value of business objects can be described as The recommended scenario of the business object directly determined by the first feature data, which is referred to as a linear scenario in this application.
  • a linear scenario for example, for non-second-hand commodities sold on ordinary commodity trading platforms, since the transaction commodities are ordinary commodities with a certain amount of inventory, the corresponding transaction cycle is longer, so the commodity exposure time is longer, so the behavior data that can be collected by the commodity will be quite sufficient That is to say, according to the user behavior data with sufficient data volume, the value of the product can be more accurately evaluated, so the common product recommendation scenario is a linear scenario.
  • the method provided in the embodiments of this application can also be applied to application scenarios where a linear scenario is combined with a scenario where data distribution is unknown.
  • the method provided in this application can be used in scenarios where similar business objects are recommended. Recommendations of business objects.
  • the target user opens a mobile App (such as a second-hand commodity trading App, etc.) in a smart phone, and the App sends a business object recommendation request to the server.
  • the business object recommendation request may include information such as a user ID.
  • the server can obtain user information according to the user ID, and recommend a business object that meets the user's interest characteristics through the method provided in the embodiment of the present application.
  • the business object recommendation request may also not include the user identification.
  • the method provided in the embodiment of the present application can recommend business objects irrelevant to the user's interest characteristics, that is, non-personalized recommendation business objects.
  • the business object value evaluation model refers to a model for determining the value of a business object based on the characteristics of the business object (including known first characteristic data and unknown second characteristic data).
  • the input data of the model includes the first feature data whose distribution of the business object is known, and the model output data includes the score of the business object, and the score can be used as a basis for recommendation of the business object.
  • the business object value evaluation model includes a first parameter and a second parameter, the first parameter includes a weight parameter related to the first feature data whose distribution is known, and the second parameter includes a distribution related to the business object.
  • Statistical parameters related to the unknown second feature data that can reflect the difference between the real environment and the parameter model.
  • the first parameter includes a weight parameter related to the first feature data.
  • the first parameter is called a parameter item, and the model corresponding to the first parameter is called a parameter model.
  • the parameter model can be a linear machine learning model, such as linear UCB or linear Thompson Sampling; the parameter model can also be a nonlinear machine learning model, such as Mirror Descent, gradient descent (Gradient Descent) algorithm, and so on.
  • the second parameter includes a statistical parameter that reflects the gap between the parameter model and the real environment.
  • the second parameter is called a non-parametric item
  • the model corresponding to the model of the second parameter is called a non-parametric model.
  • Non-parametric models can be Gaussian processes, Dirichlet processes, and non-parametric methods corresponding to infinite-dimensional distributions, such as Kernel Regression, Decision Trees, and so on.
  • the parameter based on the linear UCB method is used as the first parameter
  • the parameter based on the Gaussian process is used as the second parameter.
  • the confidence interval radius ⁇ of the non-parameter item calculates the confidence interval radius ⁇ of the non-parameter item, and combine the parameter item radius ⁇ to obtain the upper bound U of the semi-parametric confidence interval, which is the score of the commodity.
  • the mathematical expression formula of the process of determining the score is given below to intuitively explain the method of determining the score.
  • the business object is a commodity object
  • L such as 24
  • the confidence interval radius ⁇ of the non-parameter items of the commodity object
  • t represents the t-th business object recommendation
  • T t-1(e) represents the total number of recommendations of the business object e at the t-1th recommendation time
  • ⁇ t-1 (e) represents the product object e at the t-th
  • U t (e) represents the upper bound of the confidence interval of business object e at the t-th recommendation time, that is, the score of business object e (business object value); Represents the non-parametric statistics of business object e at the t-1th recommendation time; Represents the parameter item statistics of business object e at the t-1th recommendation time; ⁇ t-1 (e) represents the sum of the radius of business object e at the t-1th recommendation time; ⁇ X t,e represents the business object e at The difference between the first feature data at the t-th recommendation time and the first feature data estimate (such as the average value) of the business object e at the t-th recommendation time.
  • Step S103 Determine the set of business objects recommended to the user according to the score.
  • the score of the business object serves as a basis for recommendation of the business object, and the set of business objects recommended to the user can be determined according to the score.
  • the value score of a commodity is the upper bound of the confidence interval of the commodity. Since the upper bounds of the confidence interval of different commodities are not independent, this embodiment is based on the upper bound of the confidence interval of the commodity and is calculated according to the offline combination optimization algorithm The best combination of products.
  • the mathematical expression of the process of determining the set of business objects is as follows:
  • a t represents a t-th set of business objects recommended time
  • k represents the number of elements of the set of business objects
  • combinatorial optimization problem A type of optimization problem that finds the optimal solution in a finite set of feasible solutions is called combinatorial optimization problem, which is an important branch of operations research.
  • Combination optimization algorithm optical combination algorithm
  • Combination optimization algorithm is a type of problem that seeks extreme values in a discrete state. Since the combinatorial optimization algorithm is a relatively mature existing technology, it will not be repeated here.
  • the value scores of different commodities are independent of each other. Therefore, according to the order of commodity scores from high to low, a preset number of high-ranking commodities can be selected as a combination of commodities recommended to users.
  • Step S105 Push the set of business objects to the client.
  • the server sends the determined business object back to the client, so that the client can display the business object to the target user for viewing, so as to help the user find the business object of interest, thereby promoting the transaction rate of the business object.
  • the method provided in the embodiments of the present application may be a method of updating the business object value evaluation model online or offline, and determining the business object score through the updated model, and then determining the recommended business object based on the score.
  • FIG. 2 is a specific flowchart of an embodiment of a method for recommending business objects provided by this application.
  • the model is updated online, and the method further includes the following steps:
  • Step S201 Obtain first user feedback information for the business object set.
  • the first user feedback information may include operation behavior information of the user on the business object pushed by the recommendation system, and may also include browsing behavior information.
  • the operation behavior information includes, but is not limited to, the following information: which business objects the user clicks (such as viewing the detailed information of the product), which business objects the user saves, the user stay time, and so on.
  • the browsing behavior information refers to which business objects the user has browsed. For example, 20 business objects are shown to the user and displayed in 2 pages, with 10 business objects displayed on each page. In this case, the user may only view Since the business objects displayed on page 1 are displayed, the browsing behavior information may only include the identities of these 10 business objects.
  • the user can perform operations such as clicking, bookmarking, etc., on the business objects recommended by the system through the client, and these operation information will be collected by the server through the network to form the first user feedback information.
  • the mathematical expression of the first user feedback information includes: O t and W t , where O t represents the business object information that the user has browsed at the t-th recommendation time, and W t represents the user's operation at the t-th recommendation time (such as clicking , Favorites, etc.) business object information.
  • Step S203 Update the first parameter and the second parameter according to the first user feedback information.
  • the model can be updated according to the first user feedback information.
  • step S203 may include the following specific sub-steps:
  • Step S2031 Update the user behavior characteristic data according to the operation behavior information.
  • Step S2033 Generate training samples according to the updated user behavior characteristic data and the browsing behavior information.
  • the user every time the user is shown 20 recommended product objects, for a certain recommendation result, the user only browses the first 10 product objects and clicks on 3 of them to view the product details; in this case, you can Generate 10 new training samples, including: training samples corresponding to each browsed commodity object, the training samples including the user behavior characteristic data of the business object and the corresponding relationship with the sample label information.
  • the training samples corresponding to 3 commodity objects include updated user behavior characteristic data, and the sample label information is 1, indicating that the commodity object has been clicked by the user; the training samples corresponding to the other 7 commodity objects can be It is the user behavior characteristic data at the last recommendation moment, and the sample label information is 0, indicating that the commodity object was not clicked by the user at the current recommendation moment.
  • Step S2035 Update the first parameter and the second parameter according to the generated training samples and historical samples.
  • the newly-added samples and the historical samples of the model can be combined to update the first parameter and the second parameter. Update the first parameter and the second parameter, that is, update the model.
  • the updated model can be used to process the next business object recommendation request submitted through the client. Gradually improve the value accuracy of business objects, and then improve the accuracy of business object recommendations.
  • the process of updating the parameter item (the first parameter) can be expressed as follows:
  • X t represents the first feature data set at the t-th recommendation time (referred to as the newly added first feature data for short)
  • X t-1 represents the first feature data set at the t-1th recommendation time (referred to as the historical The first characteristic data)
  • the business object The difference between the first feature data estimates (such as the average) at the tth recommendation time
  • the business object The difference between the first feature data estimates (such as the average value) at the t-th recommendation time.
  • Y t represents the training sample set at the tth recommendation time
  • Y t-1 represents the training sample set at the t-1th recommendation time
  • W t (e) represents the user clicks (or favorites, etc.) at the tth recommendation time Etc.
  • the business object e, ⁇ W t (e) represents the second feature data of the business object e at the tth recommendation time, and the second feature data estimate (such as the average value) of the business object e at the tth recommendation time The difference between.
  • V t represents the cumulative matrix at the t-th recommendation time.
  • the elements in the matrix represent the correlation between two business objects.
  • V i, j represents the correlation between business object i and business object j
  • V t-1 represents the cumulative matrix at the t-1th recommendation time
  • the parameter estimate is determined by X t and Y t .
  • the first parameter includes 100 parameter items, A column vector composed of the estimated values of these 100 parameter items.
  • ⁇ t represents the parameter item radius at the t-th recommendation time.
  • this embodiment updates the first feature data set X t , the training sample set Y t and the accumulation matrix V t according to the collected user feedback O t and w t at each recommendation moment, and estimates through ridge regression parameter And calculate the updated parameter term radius ⁇ t .
  • O t is the browsing behavior information
  • w t is the operation behavior information.
  • T t (e) ⁇ T t-1 (e) the meaning of this formula is to use the first feature data of business object e at the t-1th recommendation time as the first feature data of business object e at the t-th recommendation time The initial value of.
  • the meaning of this formula is to take the k-th business object browsed by the user at the t-th recommendation time as the business object e to be processed.
  • this formula means to take the first feature data of the business object e that the user has viewed at the t-th recommendation time (such as the number of times the product is clicked by the user in a day, etc.) ) Accumulate 1.
  • this embodiment updates the statistical value of the business object based on user feedback at each recommendation moment And feature mean
  • step S203 after updating the model through step S203, the following steps may be further included:
  • Step S401 Determine whether the model converges according to the first parameter and the second parameter before the update, and the first parameter and the second parameter after the update.
  • the difference between the first parameter before the update and the first parameter after the update is less than the first preset difference threshold
  • the difference between the second parameter before the update and the second parameter after the update is less than The second preset difference threshold is determined to converge the model.
  • Step S403 If the above judgment result is yes, stop updating the model.
  • Step S405 If the above judgment result is no, continue to update the model.
  • the method provided by the embodiment of the application uses the online method shown in Figure 2 to update the model. After each business object is recommended to the user, user feedback information is collected in real time, and the user behavior of the business object is updated in real time according to the user feedback information. Feature data, thereby updating the model to improve the accuracy of business object recommendation; this processing method enables real-time collection of user behavior data and rapid accumulation of user behavior feature data of business objects, making the value of user behavior feature data more sufficient; Therefore, it is more suitable for scenarios with unknown data distribution, such as second-hand merchandise sales scenarios.
  • the processing method of updating the model in offline mode can also be used, so that sufficient user behavior characteristic data existing in the product can be used to avoid the occupation of more computing resources caused by real-time updating of user behavior data, so it is more suitable for linear scenarios .
  • the method provided by the embodiments of this application is not limited to the cold start phase of the business object.
  • the model is updated online, and after this phase, the online update of the model can be stopped; this method also
  • the cold start stage of non-business objects that is, it can be applied to the stage where the product has been placed for a period of time and has sufficient user interaction behavior data, that is, always collect user behavior data and update the model based on real-time user behavior data.
  • the method may further include the following steps:
  • Step S501 Initialize the first parameter and the second parameter included in the business object value evaluation model.
  • the model By initializing the model, the model has an initial business object value evaluation capability. At this time, the accuracy of the value evaluation of the model is usually low.
  • user feedback information is continuously collected, thereby continuously improving model parameters, and gradually increasing the recommendation accuracy, until the user no longer gives feedback information, or until the model converges, that is, before and after The difference between the two models stabilized.
  • step S401 is a specific flowchart of step S401 in an embodiment of a method for recommending a business object provided by this application.
  • the step of initializing the first parameter and the second parameter may include the following sub-steps:
  • Step S5011 Show the candidate business object to the user at least once.
  • the at least one candidate business object includes all business objects that the recommendation system can recommend to the user.
  • the recommendation system first releases all business objects in the system to the user client once to collect the initial user feedback information, that is, the second user feedback information.
  • Step S5013 Obtain second user feedback information for the at least one candidate business object.
  • the second user feedback information may include operation behavior information of the user on the business object that the recommendation system recommends to the user for the first time, and may also include browsing behavior information.
  • Step S5015 Generate training samples of the model according to the second user feedback information.
  • the user behavior characteristic data is first updated according to the operation behavior information, and then the initial training sample of the model is generated based on the updated user behavior characteristic data and the browsing behavior information.
  • Step S5017 Determine the first parameter and the second parameter to be initialized according to the training sample.
  • the first parameter and the second parameter can be determined according to the initial training samples.
  • initializing the model may include the following specific steps: 1) Set the first feature data set X 0 and the training sample set Y 0 to empty sets, set the cumulative matrix V 0 to the unit matrix, and set the parameters Item estimate Set to 0; 2) Put all products once each, collect user feedback, and initialize product features based on user feedback Non-parametric statistics among them, Represents the average value of the first feature data of all commodity objects at the initial time t 0 , Represents the non-parametric statistics at the initial time t 0 .
  • the business object recommendation method determines the score of the candidate business object according to the first characteristic data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model Determine the set of business objects recommended to the user according to the score; send the set of business objects back to the client; this processing method allows the business object value evaluation model to be divided into parameterized items and non-parameterized items, comprehensive Parametric models and non-parametric models evaluate the value of business objects with unknown feature distributions.
  • the computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
  • processors CPU
  • input/output interfaces network interfaces
  • memory volatile and non-volatile memory
  • the memory may include non-permanent memory in computer readable media, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
  • RAM random access memory
  • ROM read-only memory
  • flash RAM flash memory
  • Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology.
  • the information can be computer-readable instructions, data structures, program modules, or other data.
  • Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices.
  • computer-readable media does not include non-transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
  • this application can be provided as methods, systems or computer program products. Therefore, this application may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, this application may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.
  • a computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed is a service object recommendation method, comprising: by means of a first parameter and a second parameter comprised in a service object value evaluation model, determining, according to first feature data of a candidate service object, a score of the candidate service object; determining, according to the score, a service object set recommended to a user; and pushing the service object set to a client. By means of the use of such a processing means, the service object value evaluation model is divided into a parameterization item and a non-parameterization item, and the values of service objects with unknown feature distributions are evaluated in combination with a parameter model and a non-parameter model; the non-parameterization item causes the models to fit the unknown service object feature distributions, and a gap between the parameter model and a real environment can thus be continuously reduced; therefore, the value accuracy of the service objects with the unknown feature distributions can be effectively improved, such that the single recommendation time step loss can be converged, and the accuracy of service object recommendation can then be improved step by step.

Description

业务对象推荐方法Business object recommendation method
本申请要求2019年04月28日递交的申请号为201910350833.3、发明名称为“业务对象推荐方法”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 201910350833.3 and the invention title of "Business Object Recommendation Method" filed on April 28, 2019, the entire content of which is incorporated into this application by reference.
技术领域Technical field
本申请涉及数据处理技术领域,具体涉及业务对象推荐方法。This application relates to the field of data processing technology, and specifically relates to a method for recommending business objects.
背景技术Background technique
推荐系统是利用电子商务网站向客户提供商品信息和建议,帮助用户决定应该购买什么产品,模拟销售人员帮助客户完成购买过程。商品冷启动是指针对用户行为匮乏商品的推荐,由于在商品冷启动情况下缺少进行推荐的数据基础,因此冷启动成为推荐系统中的一个经典问题。The recommendation system is to use e-commerce websites to provide customers with product information and suggestions, help users decide what products should be purchased, and simulate sales staff to help customers complete the purchase process. Product cold start refers to the recommendation of products that are lacking in user behavior. Because of the lack of a data basis for recommendation in the case of product cold start, cold start has become a classic problem in the recommendation system.
目前,一种典型的推荐系统冷启动方法是基于多臂赌博机置信上限的方法,其处理过程包括如下步骤:1)进行数据收集,以构建商品数据集,并对该商品数据集中的商品数据进行预处理,获得格式规范的商品显性特征;根据商品显性特征,基于潜在狄利克雷算法构造商品隐形特征,设置输出的商品隐形特征维度,重新标记商品;2)基于商品数据集构建候选商品集:根据商品隐形特征对商品数据集进行聚类,将商品聚类,在同一类簇中的商品具有相似的性质,在不同类簇中的商品差异性较大,从每个类簇中分别随机抽取一个商品,构建候选商品集;3)将从候选商品集中挑选最优商品视为多臂赌博机问题,基于置信区间上界算法计算出估分最高的商品,作为推荐商品;4)将候选商品集中评分最高的商品推荐给用户后,根据反馈更新用户特征以及权重参数。At present, a typical cold-start method of a recommendation system is based on the upper limit of the confidence limit of the multi-armed gambling machine. The processing process includes the following steps: 1) Data collection is performed to construct a product data set, and the product data in the product data set Perform preprocessing to obtain the explicit features of the products in a standardized format; construct the invisible features of the commodities based on the explicit features of the commodities, based on the latent Dirichlet algorithm, set the output invisible feature dimensions, and relabel the commodities; 2) Construct candidates based on the commodity data set Commodity set: cluster the commodity data set according to the invisible characteristics of the commodity, and cluster the commodities. The commodities in the same cluster have similar properties, and the commodities in different clusters are quite different. From each cluster A product is randomly selected to construct a candidate product set; 3) The selection of the best product from the candidate product set is regarded as a multi-armed gambling machine problem, and the product with the highest estimated score is calculated based on the upper bound algorithm of the confidence interval as the recommended product; 4) After recommending the product with the highest score in the candidate product set to the user, update the user characteristics and weight parameters according to the feedback.
然而,在实现本发明过程中,发明人发现该技术方案至少存在如下问题:由于上述方案要求商品具有较为充足的用户行为特征数据,也就是说,用户行为特征数据要足够大到能够正确评估商品价值,因而只适合基于对商品价值进行参数化建模的针对新用户进行个性化商品推荐的应用场景。但是,在实际应用中更多的商品特征分布是未知的,也就是说,有些商品并不具有较为充足的用户行为特征数据,无法仅根据用户行为数据等构建的参数化模型正确评估商品价值。例如,在二手商品推荐场景中,由于二手成交商品中新品的占比较大且多为单品(孤品),对应的成交周期较短,因而导致曝光时间短,又由于曝光流量在商品上的分布又较为均匀,因而对于二手商品可采集的用户行为特征数据会相对不足,也就是说,二手商品的价值无法根据有限维度的用户行为特征确 定,对于这种商品特征分布未知的商品冷启动推荐应用场景,上述方案无法正确评估出商品价值,因而导致推荐结果不能逐步收敛,从而无法筛选出用户感兴趣的商品。However, in the process of implementing the present invention, the inventor found that the technical solution has at least the following problems: because the above solution requires that the product has sufficient user behavior characteristic data, that is, the user behavior characteristic data must be large enough to correctly evaluate the product. Therefore, it is only suitable for the application scenarios of personalized product recommendation for new users based on parameterized modeling of product value. However, in practical applications, the distribution of more product characteristics is unknown, that is, some products do not have sufficient user behavior characteristic data, and it is impossible to correctly evaluate the value of the product based on a parameterized model constructed based on user behavior data. For example, in the second-hand commodity recommendation scenario, because the new products in the second-hand transaction products account for a relatively large proportion and most of them are single products (orphan products), the corresponding transaction cycle is short, which leads to short exposure time, and because of the exposure flow on the products. The distribution is relatively uniform, so the user behavior characteristic data that can be collected for second-hand goods will be relatively insufficient, that is, the value of second-hand goods cannot be determined based on the user behavior characteristics of a limited dimension, and cold-start recommendations for products with unknown commodity characteristic distribution In application scenarios, the above solution cannot correctly evaluate the value of the product, which causes the recommendation result to fail to gradually converge, and thus it is impossible to screen out the products that users are interested in.
发明内容Summary of the invention
本申请提供业务对象推荐方法,以解决现有技术存在的在商品冷启动场景下无法筛选出用户感兴趣商品的问题。This application provides a method for recommending business objects to solve the problem in the prior art that the products of interest to users cannot be filtered out in the cold-start scenario of products.
本申请提供一种业务对象推荐方法,包括:This application provides a method for recommending business objects, including:
通过业务对象价值评估模型包括的第一参数和第二参数,根据候选业务对象的第一特征数据确定所述候选业务对象的得分;所述第一特征数据包括用户行为特征数据;所述第一参数包括与所述第一特征数据相关的权重参数,所述第二参数包括未知的第二特征数据分布的参数;According to the first parameter and the second parameter included in the business object value evaluation model, the score of the candidate business object is determined according to the first characteristic data of the candidate business object; the first characteristic data includes user behavior characteristic data; the first The parameter includes a weight parameter related to the first characteristic data, and the second parameter includes an unknown second characteristic data distribution parameter;
根据所述得分确定向用户推荐的业务对象集;Determining a set of business objects recommended to the user according to the score;
向客户端推送所述业务对象集。Push the set of business objects to the client.
可选的,所述业务对象包括:Optionally, the business object includes:
业务对象价值由所述第一特征数据和第二特征数据共同确定的业务对象,和/或业务对象价值由所述第一特征数据确定的业务对象。A business object whose business object value is determined jointly by the first characteristic data and the second characteristic data, and/or a business object whose business object value is determined by the first characteristic data.
可选的,还包括:Optional, also includes:
获取针对所述业务对象集的第一用户反馈信息;Acquiring first user feedback information for the set of business objects;
根据所述第一用户反馈信息更新所述第一参数和所述第二参数。Updating the first parameter and the second parameter according to the first user feedback information.
可选的,所述第一用户反馈信息包括用户对业务对象的操作行为信息和浏览行为信息。Optionally, the first user feedback information includes operation behavior information and browsing behavior information of the user on the business object.
可选的,所述根据所述第一用户反馈信息更新所述第一参数和所述第二参数,包括:Optionally, the updating the first parameter and the second parameter according to the first user feedback information includes:
根据所述操作行为信息更新所述用户行为特征数据;Updating the user behavior characteristic data according to the operation behavior information;
根据更新后的用户行为特征数据和所述浏览行为信息,生成训练样本;Generating training samples according to the updated user behavior characteristic data and the browsing behavior information;
根据生成的训练样本和历史样本,更新所述第一参数和所述第二参数。The first parameter and the second parameter are updated according to the generated training samples and historical samples.
可选的,还包括:Optional, also includes:
根据更新前的第一参数和第二参数、和更新后的第一参数和第二参数,判断所述模型是否收敛;Judging whether the model converges according to the first parameter and the second parameter before the update, and the first parameter and the second parameter after the update;
若上述判断结果为是,则停止更新所述模型。If the above judgment result is yes, stop updating the model.
可选的,还包括:Optional, also includes:
若上述判断结果为否,则继续更新所述模型。If the above judgment result is no, continue to update the model.
可选的,还包括:Optional, also includes:
初始化所述第一参数和第二参数。Initialize the first parameter and the second parameter.
可选的,所述初始化所述第一参数和第二参数,包括:Optionally, the initializing the first parameter and the second parameter includes:
向用户展示至少一次候选业务对象;Show users at least one candidate business object;
获取针对所述至少一次候选业务对象的第二用户反馈信息;Acquiring second user feedback information for the at least one candidate business object;
根据所述第二用户反馈信息生成所述模型的训练样本;Generating training samples of the model according to the second user feedback information;
根据所述训练样本,确定初始化的第一参数和第二参数。According to the training sample, the first parameter and the second parameter to be initialized are determined.
可选的,所述第一参数包括:线性机器学习模型的参数或非线性机器学习模型的参数;Optionally, the first parameter includes: a parameter of a linear machine learning model or a parameter of a nonlinear machine learning model;
所述第二参数包括:与高斯过程相关的统计项,与狄利克雷过程相关的统计项,与无限维分布相关的统计项。The second parameter includes: statistical items related to the Gaussian process, statistical items related to the Dirichlet process, and statistical items related to the infinite-dimensional distribution.
可选的,所述业务对象包括:商品对象,视频对象,新闻对象。Optionally, the business objects include: commodity objects, video objects, and news objects.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述各种方法。The present application also provides a computer-readable storage medium having instructions stored in the computer-readable storage medium, which when run on a computer, cause the computer to execute the above-mentioned various methods.
本申请还提供一种包括指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述各种方法。The present application also provides a computer program product including instructions, which when run on a computer, causes the computer to execute the above-mentioned various methods.
与现有技术相比,本申请具有以下优点:Compared with the prior art, this application has the following advantages:
本申请实施例提供的业务对象推荐方法,通过业务对象价值评估模型包括的第一参数和第二参数,根据候选业务对象的第一特征数据确定所述候选业务对象的得分;根据所述得分确定向用户推荐的业务对象集;向所述客户端回送所述业务对象集;这种处理方式,使得将业务对象价值评估模型划分为参数化项和非参数化项,综合参数模型与非参模型评估特征分布未知的业务对象的价值,由于非参数化项使得模型能够拟合未知的业务对象特征分布,可以不断缩小参数模型与真实环境的差距;因此,可以有效提升特征分布未知的业务对象的价值准确度,从而使得单推荐时间步损失能够收敛,进而可以逐步提升业务对象推荐的准确度。In the business object recommendation method provided by the embodiment of the application, the score of the candidate business object is determined according to the first characteristic data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model; the score is determined according to the score The set of business objects recommended to the user; the set of business objects are sent back to the client; this processing method allows the business object value evaluation model to be divided into parameterized items and non-parameterized items, and comprehensive parameter models and non-parametric models Evaluate the value of business objects with unknown feature distributions. Because non-parametric terms enable the model to fit the feature distribution of unknown business objects, it can continuously narrow the gap between the parametric model and the real environment; therefore, it can effectively improve the performance of business objects with unknown feature distributions. Value accuracy, so that the single recommendation time step loss can converge, and then the accuracy of business object recommendation can be gradually improved.
附图说明Description of the drawings
图1是本申请提供的业务对象推荐方法的实施例的流程图;FIG. 1 is a flowchart of an embodiment of a method for recommending a business object provided by this application;
图2是本申请提供的业务对象推荐方法的实施例的具体流程图;FIG. 2 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application;
图3是本申请提供的业务对象推荐方法的实施例的具体流程图;FIG. 3 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application;
图4是本申请提供的业务对象推荐方法的实施例的具体流程图;FIG. 4 is a specific flowchart of an embodiment of the business object recommendation method provided by the present application;
图5是本申请提供的业务对象推荐方法的实施例的具体流程图;FIG. 5 is a specific flowchart of an embodiment of a method for recommending business objects provided by this application;
图6是本申请提供的业务对象推荐方法的实施例的具体流程图。Fig. 6 is a specific flowchart of an embodiment of a method for recommending a business object provided by the present application.
具体实施方式Detailed ways
在下面的描述中阐述了很多具体细节以便于充分理解本申请。但是本申请能够以很多不同于在此描述的其它方式来实施,本领域技术人员可以在不违背本申请内涵的情况下做类似推广,因此本申请不受下面公开的具体实施的限制。In the following description, many specific details are explained in order to fully understand this application. However, this application can be implemented in many other ways different from those described here, and those skilled in the art can make similar promotion without violating the connotation of this application. Therefore, this application is not limited by the specific implementation disclosed below.
本申请实施例提供的业务对象推荐技术方案,其技术思想为:将业务对象价值评估模型划分为参数化项和非参数化项,综合参数模型与非参模型评估特征分布未知的业务对象的价值,进而根据该价值确定向用户推荐的业务对象。由于非参数化项使得模型能够拟合未知的商品特征分布,可以不断缩小参数模型与真实环境的差距,因此可以有效提升特征分布未知的业务对象的价值准确度,从而使得单推荐时间步损失能够收敛,进而可以逐步提升业务对象推荐的准确度。The business object recommendation technical solution provided by the embodiments of this application has the technical idea of dividing the business object value evaluation model into parameterized items and non-parameterized items, and comprehensive parameter models and non-parametric models to evaluate the value of business objects whose characteristic distribution is unknown , And then determine the business object recommended to the user based on the value. Since the non-parametric term allows the model to fit the unknown product feature distribution, it can continuously narrow the gap between the parameter model and the real environment, so it can effectively improve the value accuracy of business objects with unknown feature distribution, so that the single recommended time step loss can be Convergence can gradually improve the accuracy of business object recommendation.
第一实施例First embodiment
请参考图1,其为本申请提供的一种业务对象推荐方法实施例的流程图,该方法的执行主体包括业务对象推荐装置。本申请提供的一种业务对象推荐方法包括:Please refer to FIG. 1, which is a flowchart of an embodiment of a method for recommending a business object provided by this application. The execution body of the method includes a device for recommending a business object. A business object recommendation method provided by this application includes:
步骤S101:通过业务对象价值评估模型包括的第一参数和第二参数,根据候选业务对象的特征数据确定所述候选业务对象的得分。Step S101: Determine the score of the candidate business object according to the feature data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model.
所述推荐装置,通常部署于服务器,但并不局限于服务器,也可以是能够实现所述业务对象推荐方法的任何设备。部署有所述推荐装置的设备,可主动启动所述推荐装置执行业务对象推荐处理,也可根据用户客户端提交业务对象推荐请求,向用户提供业务对象推荐服务,并且根据用户对每次推荐结果的反馈信息,不断优化业务对象价值评估模型,以便于逐步提升业务对象得分的评估准确度。The recommendation device is usually deployed on a server, but is not limited to a server, and can also be any device that can implement the business object recommendation method. The equipment equipped with the recommendation device can actively start the recommendation device to perform business object recommendation processing, or submit a business object recommendation request according to the user client, provide the user with a business object recommendation service, and according to the user’s recommendation results We continuously optimize the value evaluation model of business objects, so as to gradually improve the evaluation accuracy of business object scores.
在本实施例中,所述推荐装置首先接收客户端发送的业务对象推荐请求。所述的客户端包括但不限于移动通讯设备,即:通常所说的手机或者智能手机,还包括个人电脑、PAD、iPad等终端设备。In this embodiment, the recommending device first receives the business object recommendation request sent by the client. The client terminal includes, but is not limited to, mobile communication equipment, that is, a mobile phone or a smart phone in general, and also includes terminal equipment such as personal computers, PADs, and iPads.
从业务对象类别角度而言,所述业务对象包括但不限于:商品对象,也可以是视频对象,新闻对象等等。为了便于描述,下面将以商品对象为例对本申请实施例提供的方 法进行说明。From the perspective of business object categories, the business objects include but are not limited to: commodity objects, video objects, news objects, and so on. For ease of description, the method provided in the embodiments of the present application will be described below by taking a commodity object as an example.
从应用场景角度而言,本申请实施例提供的方法,其应用场景可以是业务对象价值由所述第一特征数据和第二特征数据共同确定的业务对象的推荐场景。所述第一特征数据是指数据分布已知的特征数据,可以是人为设定的特征,包括但不限于与用户行为有关的特征数据(简称用户行为特征数据),如商品在一天内被买家用户点击的次数,七天内收藏商品的买家用户数量,与售卖商品的卖家用户沟通过的买家用户数量等等;所述第一特征数据还可以包括其它与用户行为无关的特征数据,如商品价格、商品分类、卖家所在地等等。所述第二特征数据,是指数据分布未知的特征数据,也就是说,无法明确用特征数据形式表达的特征。本申请将该场景简称为数据分布未知场景,又称为商品冷启动场景,例如,在二手商品交易平台上售卖的二手商品,由于成交商品中新品的占比较大、且多为孤品(单品),对应的成交周期较短,因而导致商品曝光时间短,同时由于曝光流量在多个商品上的分布又较为均匀,因而商品可采集的行为数据会相对不足,也就是说,根据这些数值相对不足的用户行为数据无法准确评估商品价值,二手商品推荐场景属于数据分布未知场景。From the perspective of application scenarios, the application scenario of the method provided in the embodiments of the present application may be a recommendation scenario of a business object whose business object value is determined by the first feature data and the second feature data. The first feature data refers to feature data whose data distribution is known, which can be artificially set features, including but not limited to feature data related to user behavior (referred to as user behavior feature data), such as a product being bought in one day The number of times a user clicks on the home user, the number of buyer users who have collected goods in seven days, the number of buyer users who communicated with seller users who sold goods, etc.; the first characteristic data may also include other characteristic data that has nothing to do with user behavior, Such as commodity price, commodity classification, seller location, etc. The second feature data refers to feature data whose data distribution is unknown, that is, features that cannot be clearly expressed in the form of feature data. This application abbreviates this scenario as a scenario with unknown data distribution, also known as a commodity cold start scenario. For example, second-hand commodities sold on a second-hand commodity trading platform account for a large proportion of new products and most of them are orphans (single Product), the corresponding transaction cycle is short, which leads to short product exposure time. At the same time, because the exposure flow is more evenly distributed on multiple products, the behavior data that can be collected by the product will be relatively insufficient, that is, according to these values Relatively insufficient user behavior data cannot accurately assess the value of goods, and second-hand product recommendation scenarios are scenarios with unknown data distribution.
本申请实施例提供的方法,其应用场景也不仅仅局限于数据分布未知场景,在其它需要向用户推荐业务对象的场景下也可以采用本申请所提供的方法,例如,业务对象价值可由所述第一特征数据直接确定的业务对象的推荐场景,本申请将该场景称为线性场景。例如,在普通商品交易平台上售卖的非二手商品,由于成交商品为具有一定库存量的普通商品,对应的成交周期较长,因而商品曝光时间较长,因此商品可采集的行为数据会相当充足,也就是说,根据这些数据量较为充足的用户行为数据可以较为准确评估商品价值,因此普通商品推荐场景属于线性场景。The application scenarios of the method provided in the embodiments of this application are not limited to scenarios with unknown data distribution. The methods provided in this application can also be used in other scenarios where business objects need to be recommended to users. For example, the value of business objects can be described as The recommended scenario of the business object directly determined by the first feature data, which is referred to as a linear scenario in this application. For example, for non-second-hand commodities sold on ordinary commodity trading platforms, since the transaction commodities are ordinary commodities with a certain amount of inventory, the corresponding transaction cycle is longer, so the commodity exposure time is longer, so the behavior data that can be collected by the commodity will be quite sufficient That is to say, according to the user behavior data with sufficient data volume, the value of the product can be more accurately evaluated, so the common product recommendation scenario is a linear scenario.
此外,本申请实施例提供的方法,还可以应用在是线性场景与数据分布未知场景相结合的应用场景,换句话说,在有类似业务对象推荐的场景下都可以应用本申请提供的方法进行业务对象的推荐。In addition, the method provided in the embodiments of this application can also be applied to application scenarios where a linear scenario is combined with a scenario where data distribution is unknown. In other words, the method provided in this application can be used in scenarios where similar business objects are recommended. Recommendations of business objects.
在本实施例中,目标用户在智能手机中打开移动App(如二手商品交易App等等),App向服务器发送业务对象推荐请求。所述业务对象推荐请求,可包括用户标识等等信息,在这种情况下,服务器可根据用户标识获取用户信息,通过本申请实施例提供的方法向用户推荐符合其兴趣特点的业务对象。所述业务对象推荐请求,也可不包括用户标识,在这种情况下,通过本申请实施例提供的方法可向用户推荐与其兴趣特点无关的业务对象,也就是说,非个性化推荐业务对象。In this embodiment, the target user opens a mobile App (such as a second-hand commodity trading App, etc.) in a smart phone, and the App sends a business object recommendation request to the server. The business object recommendation request may include information such as a user ID. In this case, the server can obtain user information according to the user ID, and recommend a business object that meets the user's interest characteristics through the method provided in the embodiment of the present application. The business object recommendation request may also not include the user identification. In this case, the method provided in the embodiment of the present application can recommend business objects irrelevant to the user's interest characteristics, that is, non-personalized recommendation business objects.
所述业务对象价值评估模型,是指根据业务对象特征(包括已知的第一特征数据和未知的第二特征数据)确定业务对象的价值的模型。所述模型的输入数据包括业务对象的分布已知的第一特征数据,模型输出数据包括业务对象的得分,该得分可作为业务对象推荐依据。The business object value evaluation model refers to a model for determining the value of a business object based on the characteristics of the business object (including known first characteristic data and unknown second characteristic data). The input data of the model includes the first feature data whose distribution of the business object is known, and the model output data includes the score of the business object, and the score can be used as a basis for recommendation of the business object.
所述业务对象价值评估模型包括第一参数和第二参数,所述第一参数包括与业务对象的分布已知的第一特征数据相关的权重参数,所述第二参数包括与业务对象的分布未知的第二特征数据相关的、能够体现真实环境与参数模型间差异的统计参数。采用这种处理方式,使得引入了非参项的估计,可以不断缩小参数模型与真实环境的差距,因此可以有效提升价值评估准确度,使得单时间步损失能够收敛,从而有效提升推荐准确度。The business object value evaluation model includes a first parameter and a second parameter, the first parameter includes a weight parameter related to the first feature data whose distribution is known, and the second parameter includes a distribution related to the business object. Statistical parameters related to the unknown second feature data that can reflect the difference between the real environment and the parameter model. By adopting this processing method, non-parametric estimation is introduced, and the gap between the parameter model and the real environment can be continuously narrowed. Therefore, the accuracy of value evaluation can be effectively improved, the single time step loss can be converged, and the recommendation accuracy can be effectively improved.
所述第一参数包括与第一特征数据相关的权重参数,本实施例将第一参数称为参数项,将与第一参数对应的模型称为参数模型。参数模型可以是线性机器学习模型,如线性UCB或线性Thompson Sampling等等;参数模型也可以是非线性机器学习模型,如Mirror Descent、梯度下降(Gradient Descent)算法等等。The first parameter includes a weight parameter related to the first feature data. In this embodiment, the first parameter is called a parameter item, and the model corresponding to the first parameter is called a parameter model. The parameter model can be a linear machine learning model, such as linear UCB or linear Thompson Sampling; the parameter model can also be a nonlinear machine learning model, such as Mirror Descent, gradient descent (Gradient Descent) algorithm, and so on.
所述第二参数包括体现参数模型与真实环境间差距的统计参数,本实施例将第二参数称为非参数项,将与第二参数的模型对应的模型称为非参数模型。非参数模型可以是高斯过程、狄利克雷过程以及无限维分布对应的非参方法,如核回归模型(Kernel Regression)、决策树(Decision Trees)等等。The second parameter includes a statistical parameter that reflects the gap between the parameter model and the real environment. In this embodiment, the second parameter is called a non-parametric item, and the model corresponding to the model of the second parameter is called a non-parametric model. Non-parametric models can be Gaussian processes, Dirichlet processes, and non-parametric methods corresponding to infinite-dimensional distributions, such as Kernel Regression, Decision Trees, and so on.
在本实施例中,将基于线性UCB方法的参数作为第一参数,将基于高斯过程的参数作为第二参数。例如,对一个商品e,计算非参项的置信区间半径α,结合参数项半径β得到半参置信区间上界U,即商品的得分。下面给出确定所述得分的过程的数学表达公式,以直观地说明所述得分的确定方式。In this embodiment, the parameter based on the linear UCB method is used as the first parameter, and the parameter based on the Gaussian process is used as the second parameter. For example, for a commodity e, calculate the confidence interval radius α of the non-parameter item, and combine the parameter item radius β to obtain the upper bound U of the semi-parametric confidence interval, which is the score of the commodity. The mathematical expression formula of the process of determining the score is given below to intuitively explain the method of determining the score.
在本实施例中,业务对象为商品对象,且要向用户推荐L(如24)个商品对象,对一个商品对象e,通过下述公式计算该商品对象的非参项的置信区间半径α:In this embodiment, the business object is a commodity object, and L (such as 24) commodity objects are recommended to the user. For a commodity object e, the confidence interval radius α of the non-parameter items of the commodity object is calculated by the following formula:
Figure PCTCN2020085254-appb-000001
Figure PCTCN2020085254-appb-000001
其中,t表示第t次业务对象推荐;T t-1(e)表示业务对象e在第t-1次推荐时刻的总推荐次数,α t-1(e)表示商品对象e在第t-1次推荐时刻的非参项的置信区间半径。 Among them, t represents the t-th business object recommendation; T t-1(e) represents the total number of recommendations of the business object e at the t-1th recommendation time, and α t-1 (e) represents the product object e at the t-th The radius of the confidence interval of the non-parameter at the moment of 1 recommendation
同时,结合参数项半径β,通过下述公式得到置信区间上界U:At the same time, combined with the parameter term radius β, the upper bound U of the confidence interval is obtained by the following formula:
Figure PCTCN2020085254-appb-000002
Figure PCTCN2020085254-appb-000002
Figure PCTCN2020085254-appb-000003
Figure PCTCN2020085254-appb-000003
Figure PCTCN2020085254-appb-000004
Figure PCTCN2020085254-appb-000004
其中,U t(e)表示业务对象e在第t次推荐时刻的置信区间上界,也就是业务对象e的得分(业务对象价值);
Figure PCTCN2020085254-appb-000005
表示业务对象e在第t-1次推荐时刻的非参项统计;
Figure PCTCN2020085254-appb-000006
表示业务对象e在第t-1次推荐时刻的参数项统计;γ t-1(e)表示业务对象e在第t-1次推荐时刻的半径之和;ΔX t,e表示业务对象e在第t次推荐时刻的第一特征数据、与业务对象e在第t次推荐时刻的第一特征数据估计(如平均值)之间的差异。
Among them, U t (e) represents the upper bound of the confidence interval of business object e at the t-th recommendation time, that is, the score of business object e (business object value);
Figure PCTCN2020085254-appb-000005
Represents the non-parametric statistics of business object e at the t-1th recommendation time;
Figure PCTCN2020085254-appb-000006
Represents the parameter item statistics of business object e at the t-1th recommendation time; γ t-1 (e) represents the sum of the radius of business object e at the t-1th recommendation time; ΔX t,e represents the business object e at The difference between the first feature data at the t-th recommendation time and the first feature data estimate (such as the average value) of the business object e at the t-th recommendation time.
步骤S103:根据所述得分确定向用户推荐的业务对象集。Step S103: Determine the set of business objects recommended to the user according to the score.
所述业务对象的得分作为业务对象推荐的依据,根据所述得分即可确定向用户推荐的业务对象集。在本实施例中,商品的价值得分为商品的置信区间上界,由于不同商品的置信区间上界之间并不独立,因此本实施例基于商品的置信区间上界,根据离线组合优化算法计算出最优的商品组合。确定所述业务对象集的过程的数学表达如下所述:The score of the business object serves as a basis for recommendation of the business object, and the set of business objects recommended to the user can be determined according to the score. In this embodiment, the value score of a commodity is the upper bound of the confidence interval of the commodity. Since the upper bounds of the confidence interval of different commodities are not independent, this embodiment is based on the upper bound of the confidence interval of the commodity and is calculated according to the offline combination optimization algorithm The best combination of products. The mathematical expression of the process of determining the set of business objects is as follows:
Figure PCTCN2020085254-appb-000007
Figure PCTCN2020085254-appb-000007
其中,A t表示第t次推荐时刻的业务对象集,k表示业务对象集的元素数量,这k个业务对象根据所有业务对象e在第t次推荐时刻的得分U t确定。 Wherein, A t represents a t-th set of business objects recommended time, k represents the number of elements of the set of business objects, business objects which k is determined based on all the U-score t e business objects in the t-th time recommended.
在有限个可行解的集合中找出最优解的一类优化问题称为组合最优化问题,它是运筹学中的一个重要分支。组合优化算法(optimal combination algorithm)是一类在离散状态下求极值的问题。由于组合优化算法属于较为成熟的现有技术,因此此处不再赘述。A type of optimization problem that finds the optimal solution in a finite set of feasible solutions is called combinatorial optimization problem, which is an important branch of operations research. Combination optimization algorithm (optimal combination algorithm) is a type of problem that seeks extreme values in a discrete state. Since the combinatorial optimization algorithm is a relatively mature existing technology, it will not be repeated here.
在另一个示例中,不同商品的价值得分相互独立,因此也可以根据商品得分从高到低的顺序,选取出预设数量的排在高位的商品,作为向用户推荐的商品组合。In another example, the value scores of different commodities are independent of each other. Therefore, according to the order of commodity scores from high to low, a preset number of high-ranking commodities can be selected as a combination of commodities recommended to users.
步骤S105:向客户端推送所述业务对象集。Step S105: Push the set of business objects to the client.
服务器将确定的业务对象回送至所述客户端,以便于客户端将业务对象展示给目标用户查看,以帮助用户发现感兴趣的业务对象,从而促进业务对象的成交率。The server sends the determined business object back to the client, so that the client can display the business object to the target user for viewing, so as to help the user find the business object of interest, thereby promoting the transaction rate of the business object.
本申请实施例提供的方法,可以是一种通过在线方式或离线方式更新业务对象价值评估模型,并通过更新的模型确定业务对象得分,进而根据得分确定推荐的业务对象的方法。The method provided in the embodiments of the present application may be a method of updating the business object value evaluation model online or offline, and determining the business object score through the updated model, and then determining the recommended business object based on the score.
请参考图2,其为本申请提供的一种业务对象推荐方法实施例的具体流程图。在本 实施例中,以在线方式更新所述模型,所述方法还包括如下步骤:Please refer to FIG. 2, which is a specific flowchart of an embodiment of a method for recommending business objects provided by this application. In this embodiment, the model is updated online, and the method further includes the following steps:
步骤S201:获取针对所述业务对象集的第一用户反馈信息。Step S201: Obtain first user feedback information for the business object set.
所述第一用户反馈信息,可包括用户对推荐系统推送的业务对象的操作行为信息,还可包括浏览行为信息。所述操作行为信息,包括但不限于以下信息:用户点击(如查看商品的详情信息)了哪些业务对象,用户收藏了哪些业务对象,用户停留时间等等。所述浏览行为信息,是指用户浏览了哪些业务对象,例如,向用户展示了20个业务对象,并分为2页显示,每页显示10个业务对象,这种情况下,用户可能只查看了第1页中显示的业务对象,因此,所述浏览行为信息可只包括这10个业务对象的标识。The first user feedback information may include operation behavior information of the user on the business object pushed by the recommendation system, and may also include browsing behavior information. The operation behavior information includes, but is not limited to, the following information: which business objects the user clicks (such as viewing the detailed information of the product), which business objects the user saves, the user stay time, and so on. The browsing behavior information refers to which business objects the user has browsed. For example, 20 business objects are shown to the user and displayed in 2 pages, with 10 business objects displayed on each page. In this case, the user may only view Since the business objects displayed on page 1 are displayed, the browsing behavior information may only include the identities of these 10 business objects.
具体实施时,用户可通过所述客户端对系统推荐的业务对象进行点击、收藏等等操作,通过网络这些操作信息都会被服务器端采集到,形成所述第一用户反馈信息。During specific implementation, the user can perform operations such as clicking, bookmarking, etc., on the business objects recommended by the system through the client, and these operation information will be collected by the server through the network to form the first user feedback information.
所述第一用户反馈信息的数学表达包括:O t和W t,其中O t表示用户在第t次推荐时刻浏览过的业务对象信息,W t表示用户在第t次推荐时刻操作(如点击、收藏等等操作)过的业务对象信息。 The mathematical expression of the first user feedback information includes: O t and W t , where O t represents the business object information that the user has browsed at the t-th recommendation time, and W t represents the user's operation at the t-th recommendation time (such as clicking , Favorites, etc.) business object information.
步骤S203:根据所述第一用户反馈信息更新所述第一参数和所述第二参数。Step S203: Update the first parameter and the second parameter according to the first user feedback information.
在获取到所述第一用户反馈信息后,由于这些信息反映出了业务对象的与用户行为有关的特征数据的变化情况,因此可以根据所述第一用户反馈信息更新所述模型。After the first user feedback information is obtained, since the information reflects changes in the characteristic data of the business object related to the user behavior, the model can be updated according to the first user feedback information.
如图3所示,在本实施例中,步骤S203可包括如下具体子步骤:As shown in FIG. 3, in this embodiment, step S203 may include the following specific sub-steps:
步骤S2031:根据所述操作行为信息更新所述用户行为特征数据。Step S2031: Update the user behavior characteristic data according to the operation behavior information.
例如,向用户展示了20个商品对象,用户点击了其中3个商品对象,收藏了其中1个商品对象,这种情况下,可以对这3个商品对象的用户在1天内点击的次数累积加1,对其中1个商品对象的收藏用户数量累积加1等等。For example, 20 product objects are shown to the user, the user clicks on 3 of the product objects, and one of the product objects is bookmarked. In this case, the number of clicks by users of these 3 product objects in a day can be cumulatively added. 1. Add 1 cumulatively to the number of favorite users of one of the commodity objects.
步骤S2033:根据更新后的用户行为特征数据和所述浏览行为信息,生成训练样本。Step S2033: Generate training samples according to the updated user behavior characteristic data and the browsing behavior information.
例如,每次向用户展示20个推荐的商品对象,对于某一次推荐结果,用户只浏览了前10个商品对象,并点击了其中3个商品对象,以查看商品详情;这种情况下,可生成10个新增的训练样本,包括:与每个浏览过的商品对象对应的训练样本,所述训练样本包括业务对象的用户行为特征数据、与样本标注信息间的对应关系。在本实施例中,其中3个商品对象对应的训练样本包括更新后的用户行为特征数据,其样本标注信息为1,表示该商品对象已被用户点击;另外7个商品对象对应的训练样本可以是上一推荐时刻的用户行为特征数据,其样本标注信息为0,表示该商品对象在本次推荐时刻未被用户点击。For example, every time the user is shown 20 recommended product objects, for a certain recommendation result, the user only browses the first 10 product objects and clicks on 3 of them to view the product details; in this case, you can Generate 10 new training samples, including: training samples corresponding to each browsed commodity object, the training samples including the user behavior characteristic data of the business object and the corresponding relationship with the sample label information. In this embodiment, the training samples corresponding to 3 commodity objects include updated user behavior characteristic data, and the sample label information is 1, indicating that the commodity object has been clicked by the user; the training samples corresponding to the other 7 commodity objects can be It is the user behavior characteristic data at the last recommendation moment, and the sample label information is 0, indicating that the commodity object was not clicked by the user at the current recommendation moment.
步骤S2035:根据生成的训练样本和历史样本,更新所述第一参数和所述第二参数。Step S2035: Update the first parameter and the second parameter according to the generated training samples and historical samples.
在生成所述模型的新增训练样本后,就可以结合该新增样本及模型的历史样本,更新所述第一参数和所述第二参数。更新所述第一参数和所述第二参数,也就是更新所述模型,在更新所述模型后,就可以使用更新后的模型对通过客户端提交的下一个业务对象推荐请求进行处理,从而逐步提高业务对象的价值准确度,进而提升业务对象的推荐准确度。After the newly-added training samples of the model are generated, the newly-added samples and the historical samples of the model can be combined to update the first parameter and the second parameter. Update the first parameter and the second parameter, that is, update the model. After the model is updated, the updated model can be used to process the next business object recommendation request submitted through the client. Gradually improve the value accuracy of business objects, and then improve the accuracy of business object recommendations.
下面给出更新所述第一参数和所述第二参数的过程的数学表达公式,以直观地说明模型更新处理方式。The mathematical expression formula of the process of updating the first parameter and the second parameter is given below to intuitively explain the model update processing method.
在本实施例中,更新参数项(第一参数)的过程可采用如下数学表达:In this embodiment, the process of updating the parameter item (the first parameter) can be expressed as follows:
Figure PCTCN2020085254-appb-000008
Figure PCTCN2020085254-appb-000008
其中,X t表示第t次推荐时刻的第一特征数据集合(简称为新增的第一特征数据),X t-1表示第t-1次推荐时刻的第一特征数据集合(简称为历史的第一特征数据),
Figure PCTCN2020085254-appb-000009
表示用户在第t次推荐时刻浏览过的第一个业务对象的更新后的第一特征数据、与业务对象
Figure PCTCN2020085254-appb-000010
在第t次推荐时刻的第一特征数据估计(如平均值)之间的差异,
Figure PCTCN2020085254-appb-000011
表示用户在第t次推荐时刻浏览过的第O t个业务对象的更新后的第一特征数据、与业务对象
Figure PCTCN2020085254-appb-000012
在第t次推荐时刻的第一特征数据估计(如平均值)之间的差异。采用这种处理方式,使得仅根据用户浏览过的业务对象信息更新所述模型;因此,可以有效提升模型准确度,同时节约存储资源和计算资源。
Among them, X t represents the first feature data set at the t-th recommendation time (referred to as the newly added first feature data for short), and X t-1 represents the first feature data set at the t-1th recommendation time (referred to as the historical The first characteristic data),
Figure PCTCN2020085254-appb-000009
Represents the updated first feature data of the first business object viewed by the user at the t-th recommendation time, and the business object
Figure PCTCN2020085254-appb-000010
The difference between the first feature data estimates (such as the average) at the tth recommendation time,
Figure PCTCN2020085254-appb-000011
Represents the updated first feature data of the O t business object viewed by the user at the t recommendation time, and the business object
Figure PCTCN2020085254-appb-000012
The difference between the first feature data estimates (such as the average value) at the t-th recommendation time. By adopting this processing method, the model is updated only according to the business object information that the user has browsed; therefore, the accuracy of the model can be effectively improved while saving storage resources and computing resources.
Figure PCTCN2020085254-appb-000013
Figure PCTCN2020085254-appb-000013
Figure PCTCN2020085254-appb-000014
Figure PCTCN2020085254-appb-000014
其中,Y t表示第t次推荐时刻的训练样本集合,Y t-1表示第t-1次推荐时刻的训练样本集合,W t(e)表示用户在第t次推荐时刻点击(或收藏等等)过业务对象e,ΔW t(e)表示业务对象e在第t次推荐时刻的第二特征数据、与业务对象e在第t次推荐时刻的第二特征数据估计(如平均值)之间的差异。 Among them, Y t represents the training sample set at the tth recommendation time, Y t-1 represents the training sample set at the t-1th recommendation time, and W t (e) represents the user clicks (or favorites, etc.) at the tth recommendation time Etc.) The business object e, ΔW t (e) represents the second feature data of the business object e at the tth recommendation time, and the second feature data estimate (such as the average value) of the business object e at the tth recommendation time The difference between.
Figure PCTCN2020085254-appb-000015
Figure PCTCN2020085254-appb-000015
其中,V t表示第t次推荐时刻的累积矩阵,该矩阵中的元素表示两个业务对象之间的相关度,如V i,j表示业务对象i和业务对象j之间的相关度,V t-1表示第t-1次推荐时刻 的累积矩阵,
Figure PCTCN2020085254-appb-000016
表示在用户浏览过的业务对象中包括的两两业务对象之间的相关度之和。
Among them, V t represents the cumulative matrix at the t-th recommendation time. The elements in the matrix represent the correlation between two business objects. For example, V i, j represents the correlation between business object i and business object j, and V t-1 represents the cumulative matrix at the t-1th recommendation time,
Figure PCTCN2020085254-appb-000016
Represents the sum of the correlation between two business objects included in the business objects browsed by the user.
Figure PCTCN2020085254-appb-000017
Figure PCTCN2020085254-appb-000017
其中,
Figure PCTCN2020085254-appb-000018
表示第t次推荐时刻的参数项估计,该参数项估计由X t和Y t确定。在本实施例中,第一参数包括100个参数项,
Figure PCTCN2020085254-appb-000019
为这100个参数项的估计值构成的列向量。
among them,
Figure PCTCN2020085254-appb-000018
Indicates the parameter estimate at the t-th recommendation time. The parameter estimate is determined by X t and Y t . In this embodiment, the first parameter includes 100 parameter items,
Figure PCTCN2020085254-appb-000019
A column vector composed of the estimated values of these 100 parameter items.
Figure PCTCN2020085254-appb-000020
Figure PCTCN2020085254-appb-000020
其中,β t表示第t次推荐时刻的参数项半径。 Among them, β t represents the parameter item radius at the t-th recommendation time.
综上所述,本实施例在每一个推荐时刻,根据采集到的用户反馈O t和w t,更新第一特征数据集合X t、训练样本集合Y t和累积矩阵V t,通过岭回归估计参数
Figure PCTCN2020085254-appb-000021
并计算更新后的参数项半径β t。其中,O t为所述浏览行为信息,w t为所述操作行为信息。
In summary, this embodiment updates the first feature data set X t , the training sample set Y t and the accumulation matrix V t according to the collected user feedback O t and w t at each recommendation moment, and estimates through ridge regression parameter
Figure PCTCN2020085254-appb-000021
And calculate the updated parameter term radius β t . Wherein, O t is the browsing behavior information, and w t is the operation behavior information.
在本实施例中,更新非参数项的过程可采用如下数学表达:In this embodiment, the process of updating non-parametric items can be expressed as follows:
1)假设共向用户推荐L个业务对象,对于每个业务对象e,执行如下计算:1) Assuming that a total of L business objects are recommended to users, for each business object e, the following calculation is performed:
T t(e)←T t-1(e),该公式含义是将业务对象e在第t-1次推荐时刻的第一特征数据作为业务对象e在第t次推荐时刻的第一特征数据的初始值。 T t (e)←T t-1 (e), the meaning of this formula is to use the first feature data of business object e at the t-1th recommendation time as the first feature data of business object e at the t-th recommendation time The initial value of.
2)对于k=1,…,min{O t,|A t|},其中|A t|表示向用户推荐的业务对象数量,O t表示用户浏览过的业务对象数量,执行如下计算: 2) For k=1,...,min{O t ,|A t |}, where |A t | represents the number of business objects recommended to the user, and O t represents the number of business objects browsed by the user, the following calculation is performed:
Figure PCTCN2020085254-appb-000022
该公式含义是将用户在第t次推荐时刻浏览过的第k个业务对象作为待处理的业务对象e。
Figure PCTCN2020085254-appb-000022
The meaning of this formula is to take the k-th business object browsed by the user at the t-th recommendation time as the business object e to be processed.
T t(e)←T t(e)+1,该公式含义是将用户在第t次推荐时刻浏览过的业务对象e的第一特征数据(如商品在一天内被用户点击的次数等等)累加1。 T t (e)←T t (e)+1, this formula means to take the first feature data of the business object e that the user has viewed at the t-th recommendation time (such as the number of times the product is clicked by the user in a day, etc.) ) Accumulate 1.
Figure PCTCN2020085254-appb-000023
该公式含义是业务对象e在第t次推荐时刻的非参项。
Figure PCTCN2020085254-appb-000023
The meaning of this formula is the non-parameter of the business object e at the t-th recommendation time.
Figure PCTCN2020085254-appb-000024
该公式含义是业务对象e在第t次推荐时刻的参数项特征均值。
Figure PCTCN2020085254-appb-000024
The meaning of this formula is the mean value of the parameter item feature of the business object e at the t-th recommendation time.
综上所述,本实施例在每一个推荐时刻,根据用户反馈更新业务对象的统计价值
Figure PCTCN2020085254-appb-000025
和特征均值
Figure PCTCN2020085254-appb-000026
In summary, this embodiment updates the statistical value of the business object based on user feedback at each recommendation moment
Figure PCTCN2020085254-appb-000025
And feature mean
Figure PCTCN2020085254-appb-000026
如图4所示,在本实施例中,在通过步骤S203更新所述模型后,还可以包括如下步骤:As shown in Fig. 4, in this embodiment, after updating the model through step S203, the following steps may be further included:
步骤S401:根据更新前的第一参数和第二参数、及更新后的第一参数和第二参数,判断所述模型是否收敛。Step S401: Determine whether the model converges according to the first parameter and the second parameter before the update, and the first parameter and the second parameter after the update.
在本实施例中,如果更新前的第一参数和更新后的第一参数之间的差异小于第一预设差异阈值,更新前的第二参数和更新后的第二参数之间的差异小于第二预设差异阈值,则判定所述模型收敛。In this embodiment, if the difference between the first parameter before the update and the first parameter after the update is less than the first preset difference threshold, the difference between the second parameter before the update and the second parameter after the update is less than The second preset difference threshold is determined to converge the model.
步骤S403:若上述判断结果为是,则停止更新所述模型。Step S403: If the above judgment result is yes, stop updating the model.
如果判定所述模型收敛,则表示所述模型的各种参数已经相对稳定,可以正确评估业务对象的价值得分,从而使得推荐结果的准确度可以逐渐提升。在这种情况下,就可以停止搜集用户反馈信息,停止更新所述模型,以节省服务器的计算资源。If it is determined that the model is convergent, it means that various parameters of the model are relatively stable, and the value score of the business object can be correctly evaluated, so that the accuracy of the recommendation result can be gradually improved. In this case, you can stop collecting user feedback information and stop updating the model to save the server's computing resources.
步骤S405:若上述判断结果为否,则继续更新所述模型。Step S405: If the above judgment result is no, continue to update the model.
如果判定所述模型未收敛,则表示所述模型的各种参数还未稳定,无法正确评估业务对象的价值得分,因而需要继续搜集用户反馈信息,继续更新所述模型的第一参数和第二参数,以使得逐步提升模型准确度,从而提升业务对象的价值评估准确度,进而提升推荐结果的准确度,使得推荐结果逐渐收敛。If it is determined that the model does not converge, it means that the various parameters of the model are not stable, and the value score of the business object cannot be correctly evaluated. Therefore, it is necessary to continue to collect user feedback information and continue to update the first and second parameters of the model. Parameters, so as to gradually improve the accuracy of the model, thereby improving the accuracy of the value evaluation of the business object, and then improving the accuracy of the recommendation result, so that the recommendation result gradually converges.
本申请实施例提供的方法,采用如图2所示的在线方式更新所述模型,通过在每次向用户推荐业务对象后,实时采集用户反馈信息,根据用户反馈信息实时更新业务对象的用户行为特征数据,从而更新所述模型,以提升业务对象推荐的准确度;这种处理方式,使得实时搜集用户行为数据,快速累积业务对象的用户行为特征数据,使得用户行为特征数据的数值更加充足;因此,更适用于数据分布未知场景,如二手商品售卖场景。The method provided by the embodiment of the application uses the online method shown in Figure 2 to update the model. After each business object is recommended to the user, user feedback information is collected in real time, and the user behavior of the business object is updated in real time according to the user feedback information. Feature data, thereby updating the model to improve the accuracy of business object recommendation; this processing method enables real-time collection of user behavior data and rapid accumulation of user behavior feature data of business objects, making the value of user behavior feature data more sufficient; Therefore, it is more suitable for scenarios with unknown data distribution, such as second-hand merchandise sales scenarios.
具体实施时,也可以采用离线方式更新模型的处理方式,使得可利用商品已有的较为充足的用户行为特征数据,避免实时更新用户行为数据导致的占用较多计算资源,因此更适用于线性场景。In specific implementation, the processing method of updating the model in offline mode can also be used, so that sufficient user behavior characteristic data existing in the product can be used to avoid the occupation of more computing resources caused by real-time updating of user behavior data, so it is more suitable for linear scenarios .
从应用时间角度而言,本申请实施例提供的方法不仅仅局限于业务对象冷启动的阶段,该阶段以在线方式更新所述模型,在该阶段后可以停止在线更新所述模型;该方法也同样适用于非业务对象冷启动的阶段,也就是说,可以适用于商品已投放一段时间且具有较为充足的用户交互行为数据的阶段,即始终搜集用户行为数据,根据实时用户行为数据更新模型。From the perspective of application time, the method provided by the embodiments of this application is not limited to the cold start phase of the business object. In this phase, the model is updated online, and after this phase, the online update of the model can be stopped; this method also The same applies to the cold start stage of non-business objects, that is, it can be applied to the stage where the product has been placed for a period of time and has sufficient user interaction behavior data, that is, always collect user behavior data and update the model based on real-time user behavior data.
如图5所示,在本实施例中,所述方法还可包括如下步骤:As shown in Fig. 5, in this embodiment, the method may further include the following steps:
步骤S501:初始化业务对象价值评估模型包括的第一参数和第二参数。Step S501: Initialize the first parameter and the second parameter included in the business object value evaluation model.
通过初始化所述模型,使得所述模型具备初始的业务对象价值评估能力,此时模型的价值评估准确度通常较低。随着在向用户进行多次业务对象推荐过程中,不断采集用户反馈信息,从而不断改进模型参数,进而逐渐提升推荐准确率,直至用户不再给出反馈信息,或者是直至模型收敛,即前后两次模型间差异趋于稳定。By initializing the model, the model has an initial business object value evaluation capability. At this time, the accuracy of the value evaluation of the model is usually low. As the process of recommending business objects to users for many times, user feedback information is continuously collected, thereby continuously improving model parameters, and gradually increasing the recommendation accuracy, until the user no longer gives feedback information, or until the model converges, that is, before and after The difference between the two models stabilized.
请参考图6,其为本申请提供的一种业务对象推荐方法实施例的步骤S401的具体流程图。在本实施例中,初始化所述第一参数和第二参数的步骤,可包括如下子步骤:Please refer to FIG. 6, which is a specific flowchart of step S401 in an embodiment of a method for recommending a business object provided by this application. In this embodiment, the step of initializing the first parameter and the second parameter may include the following sub-steps:
步骤S5011:向用户展示至少一次候选业务对象。Step S5011: Show the candidate business object to the user at least once.
所述至少一次候选业务对象,包括推荐系统可向用户推荐的所有业务对象。在本实施例中,推荐系统首先将系统内所有业务对象向用户客户端投放一次,以采集初始的用户反馈信息,即第二用户反馈信息。The at least one candidate business object includes all business objects that the recommendation system can recommend to the user. In this embodiment, the recommendation system first releases all business objects in the system to the user client once to collect the initial user feedback information, that is, the second user feedback information.
步骤S5013:获取针对所述至少一次候选业务对象的第二用户反馈信息。Step S5013: Obtain second user feedback information for the at least one candidate business object.
所述第二用户反馈信息,可包括用户对推荐系统向用户首次推荐的业务对象的操作行为信息,还可包括浏览行为信息。The second user feedback information may include operation behavior information of the user on the business object that the recommendation system recommends to the user for the first time, and may also include browsing behavior information.
步骤S5015:根据所述第二用户反馈信息生成所述模型的训练样本。Step S5015: Generate training samples of the model according to the second user feedback information.
在本实施例中,首先根据所述操作行为信息更新所述用户行为特征数据,然后根据更新后的用户行为特征数据和所述浏览行为信息生成模型的初始训练样本。In this embodiment, the user behavior characteristic data is first updated according to the operation behavior information, and then the initial training sample of the model is generated based on the updated user behavior characteristic data and the browsing behavior information.
步骤S5017:根据所述训练样本,确定初始化的第一参数和第二参数。Step S5017: Determine the first parameter and the second parameter to be initialized according to the training sample.
在生成所述模型的初始训练样本后,就可以根据初始训练样本确定所述第一参数和所述第二参数。After the initial training samples of the model are generated, the first parameter and the second parameter can be determined according to the initial training samples.
在本实施例中,初始化所述模型可包括如下具体步骤:1)将第一特征数据集合X 0和训练样本集合Y 0置为空集,将累积矩阵V 0置为单位阵,以及将参数项估计
Figure PCTCN2020085254-appb-000027
置为0;2)投放所有商品各1次,并收集用户反馈,根据用户反馈初始化商品特征
Figure PCTCN2020085254-appb-000028
与非参项统计
Figure PCTCN2020085254-appb-000029
其中,
Figure PCTCN2020085254-appb-000030
表示所有商品对象在初始时刻t 0的第一特征数据的平均值,
Figure PCTCN2020085254-appb-000031
表示在初始时刻t 0的非参项统计。
In this embodiment, initializing the model may include the following specific steps: 1) Set the first feature data set X 0 and the training sample set Y 0 to empty sets, set the cumulative matrix V 0 to the unit matrix, and set the parameters Item estimate
Figure PCTCN2020085254-appb-000027
Set to 0; 2) Put all products once each, collect user feedback, and initialize product features based on user feedback
Figure PCTCN2020085254-appb-000028
Non-parametric statistics
Figure PCTCN2020085254-appb-000029
among them,
Figure PCTCN2020085254-appb-000030
Represents the average value of the first feature data of all commodity objects at the initial time t 0 ,
Figure PCTCN2020085254-appb-000031
Represents the non-parametric statistics at the initial time t 0 .
从上述实施例可见,本申请实施例提供的业务对象推荐方法,通过业务对象价值评估模型包括的第一参数和第二参数,根据候选业务对象的第一特征数据确定所述候选业务对象的得分;根据所述得分确定向用户推荐的业务对象集;向所述客户端回送所述业务对象集;这种处理方式,使得将业务对象价值评估模型划分为参数化项和非参数化项, 综合参数模型与非参模型评估特征分布未知的业务对象的价值,由于非参数化项使得模型能够拟合未知的业务对象特征分布,可以不断缩小参数模型与真实环境的差距;因此,可以有效提升特征分布未知的业务对象的价值准确度,从而使得单推荐时间步损失能够收敛,进而提升业务对象推荐的准确度。It can be seen from the foregoing embodiments that the business object recommendation method provided by the embodiment of the present application determines the score of the candidate business object according to the first characteristic data of the candidate business object through the first parameter and the second parameter included in the business object value evaluation model Determine the set of business objects recommended to the user according to the score; send the set of business objects back to the client; this processing method allows the business object value evaluation model to be divided into parameterized items and non-parameterized items, comprehensive Parametric models and non-parametric models evaluate the value of business objects with unknown feature distributions. Because non-parametric terms enable the model to fit the unknown business object feature distributions, the gap between the parametric model and the real environment can be continuously reduced; therefore, the features can be effectively improved The value accuracy of the unknown business object is distributed, so that the single recommendation time step loss can be converged, thereby improving the accuracy of the business object recommendation.
本申请虽然以较佳实施例公开如上,但其并不是用来限定本申请,任何本领域技术人员在不脱离本申请的精神和范围内,都可以做出可能的变动和修改,因此本申请的保护范围应当以本申请权利要求所界定的范围为准。Although this application is disclosed as above in preferred embodiments, it is not intended to limit the application. Any person skilled in the art can make possible changes and modifications without departing from the spirit and scope of the application. Therefore, this application The scope of protection shall be subject to the scope defined by the claims of this application.
在一个典型的配置中,计算设备包括一个或多个处理器(CPU)、输入/输出接口、网络接口和内存。In a typical configuration, the computing device includes one or more processors (CPU), input/output interfaces, network interfaces, and memory.
内存可能包括计算机可读介质中的非永久性存储器,随机存取存储器(RAM)和/或非易失性内存等形式,如只读存储器(ROM)或闪存(flash RAM)。内存是计算机可读介质的示例。The memory may include non-permanent memory in computer readable media, random access memory (RAM) and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of computer readable media.
1、计算机可读介质包括永久性和非永久性、可移动和非可移动媒体可以由任何方法或技术来实现信息存储。信息可以是计算机可读指令、数据结构、程序的模块或其他数据。计算机的存储介质的例子包括,但不限于相变内存(PRAM)、静态随机存取存储器(SRAM)、动态随机存取存储器(DRAM)、其他类型的随机存取存储器(RAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、快闪记忆体或其他内存技术、只读光盘只读存储器(CD-ROM)、数字多功能光盘(DVD)或其他光学存储、磁盒式磁带,磁带磁磁盘存储或其他磁性存储设备或任何其他非传输介质,可用于存储可以被计算设备访问的信息。按照本文中的界定,计算机可读介质不包括非暂存电脑可读媒体(transitory media),如调制的数据信号和载波。1. Computer-readable media include permanent and non-permanent, removable and non-removable media, and information storage can be realized by any method or technology. The information can be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include non-transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
2、本领域技术人员应明白,本申请的实施例可提供为方法、系统或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。2. Those skilled in the art should understand that the embodiments of the present application can be provided as methods, systems or computer program products. Therefore, this application may adopt the form of a complete hardware embodiment, a complete software embodiment, or an embodiment combining software and hardware. Moreover, this application may adopt the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program codes.

Claims (10)

  1. 一种业务对象推荐方法,其特征在于,包括:A method for recommending business objects, which is characterized in that it includes:
    通过业务对象价值评估模型包括的第一参数和第二参数,根据候选业务对象的第一特征数据确定所述候选业务对象的得分;所述第一特征数据包括用户行为特征数据;所述第一参数包括与所述第一特征数据相关的权重参数,所述第二参数包括未知的第二特征数据分布的参数;According to the first parameter and the second parameter included in the business object value evaluation model, the score of the candidate business object is determined according to the first characteristic data of the candidate business object; the first characteristic data includes user behavior characteristic data; the first The parameter includes a weight parameter related to the first characteristic data, and the second parameter includes an unknown second characteristic data distribution parameter;
    根据所述得分确定向用户推荐的业务对象集;Determining a set of business objects recommended to the user according to the score;
    向客户端推送所述业务对象集。Push the set of business objects to the client.
  2. 根据权利要求1所述的方法,其特征在于,所述业务对象包括:The method according to claim 1, wherein the business object comprises:
    业务对象价值由所述第一特征数据和第二特征数据共同确定的业务对象,和/或业务对象价值由所述第一特征数据确定的业务对象。A business object whose business object value is determined jointly by the first characteristic data and the second characteristic data, and/or a business object whose business object value is determined by the first characteristic data.
  3. 根据权利要求1所述的方法,其特征在于,还包括:The method according to claim 1, further comprising:
    获取针对所述业务对象集的第一用户反馈信息;Acquiring first user feedback information for the set of business objects;
    根据所述第一用户反馈信息更新所述第一参数和所述第二参数。Updating the first parameter and the second parameter according to the first user feedback information.
  4. 根据权利要求3所述的方法,其特征在于,The method according to claim 3, wherein:
    所述第一用户反馈信息包括用户对业务对象的操作行为信息和浏览行为信息。The first user feedback information includes operation behavior information and browsing behavior information of the user on the business object.
  5. 根据权利要求4所述的方法,其特征在于,所述根据所述第一用户反馈信息更新所述第一参数和所述第二参数,包括:The method according to claim 4, wherein said updating said first parameter and said second parameter according to said first user feedback information comprises:
    根据所述操作行为信息更新所述用户行为特征数据;Updating the user behavior characteristic data according to the operation behavior information;
    根据更新后的用户行为特征数据和所述浏览行为信息,生成训练样本;Generating training samples according to the updated user behavior characteristic data and the browsing behavior information;
    根据生成的训练样本和历史样本,更新所述第一参数和所述第二参数。The first parameter and the second parameter are updated according to the generated training samples and historical samples.
  6. 根据权利要求3所述的方法,其特征在于,还包括:The method according to claim 3, further comprising:
    根据更新前的第一参数和第二参数、和更新后的第一参数和第二参数,判断所述模型是否收敛;Judging whether the model converges according to the first parameter and the second parameter before the update, and the first parameter and the second parameter after the update;
    若上述判断结果为是,则停止更新所述模型。If the above judgment result is yes, stop updating the model.
  7. 根据权利要求6所述的方法,其特征在于,还包括:The method according to claim 6, further comprising:
    若上述判断结果为否,则继续更新所述模型。If the above judgment result is no, continue to update the model.
  8. 根据权利要求1所述的方法,其特征在于,还包括:The method according to claim 1, further comprising:
    初始化所述第一参数和第二参数。Initialize the first parameter and the second parameter.
  9. 根据权利要求8所述的方法,其特征在于,所述初始化所述第一参数和第二参数, 包括:The method according to claim 8, wherein the initializing the first parameter and the second parameter comprises:
    向用户展示至少一次候选业务对象;Show users at least one candidate business object;
    获取针对所述至少一次候选业务对象的第二用户反馈信息;Acquiring second user feedback information for the at least one candidate business object;
    根据所述第二用户反馈信息生成所述模型的训练样本;Generating training samples of the model according to the second user feedback information;
    根据所述训练样本,确定初始化的第一参数和第二参数。According to the training sample, the first parameter and the second parameter to be initialized are determined.
  10. 根据权利要求1所述的方法,其特征在于,The method according to claim 1, wherein:
    所述第一参数包括:线性机器学习模型的参数或非线性机器学习模型的参数;The first parameter includes: a parameter of a linear machine learning model or a parameter of a nonlinear machine learning model;
    所述第二参数包括:与高斯过程相关的统计项,与狄利克雷过程相关的统计项,与无限维分布相关的统计项。The second parameter includes: statistical items related to the Gaussian process, statistical items related to the Dirichlet process, and statistical items related to the infinite-dimensional distribution.
PCT/CN2020/085254 2019-04-28 2020-04-17 Service object recommendation method WO2020221022A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910350833.3A CN111861605A (en) 2019-04-28 2019-04-28 Business object recommendation method
CN201910350833.3 2019-04-28

Publications (1)

Publication Number Publication Date
WO2020221022A1 true WO2020221022A1 (en) 2020-11-05

Family

ID=72966348

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/085254 WO2020221022A1 (en) 2019-04-28 2020-04-17 Service object recommendation method

Country Status (2)

Country Link
CN (1) CN111861605A (en)
WO (1) WO2020221022A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116055074A (en) * 2021-10-27 2023-05-02 北京字节跳动网络技术有限公司 Method and device for managing recommendation strategy
CN114881736B (en) * 2022-05-16 2023-12-19 阿里巴巴(中国)有限公司 Recommendation method, display method and device
CN114943585B (en) * 2022-05-27 2023-05-05 天翼爱音乐文化科技有限公司 Service recommendation method and system based on generation of countermeasure network
CN116501972B (en) * 2023-05-06 2024-01-05 广州市巨应信息科技有限公司 Content pushing method and AI intelligent pushing system based on big data online service

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103499A (en) * 2017-04-26 2017-08-29 中国人民大学 Method is recommended in a kind of cross-site cold start-up based on deep learning
US9838743B2 (en) * 2016-02-25 2017-12-05 Adobe Systems Incorporated Techniques for context aware video recommendation
CN107563867A (en) * 2017-09-06 2018-01-09 天津大学 A kind of commending system cold start-up method based on multi-arm fruit machine confidence upper limit
WO2018053257A1 (en) * 2016-09-16 2018-03-22 Second Spectrum, Inc. Methods and systems of spatiotemporal pattern recognition for video content development

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020845B (en) * 2012-12-14 2018-08-10 百度在线网络技术(北京)有限公司 A kind of method for pushing and system of mobile application
CN103744917B (en) * 2013-12-27 2018-04-06 东软集团股份有限公司 Method and system are recommended in mixing
CN106503022B (en) * 2015-09-08 2020-12-01 北京邮电大学 Method and device for pushing recommendation information
US11113704B2 (en) * 2015-12-07 2021-09-07 Daniel J. Towriss Systems and methods for interactive annuity product services using machine learning modeling
CN107203518A (en) * 2016-03-16 2017-09-26 阿里巴巴集团控股有限公司 Method, system and device, the electronic equipment of on-line system personalized recommendation
CN106022865A (en) * 2016-05-10 2016-10-12 江苏大学 Goods recommendation method based on scores and user behaviors
CN105959374B (en) * 2016-05-12 2019-05-03 腾讯科技(深圳)有限公司 A kind of data recommendation method and its equipment
CN107256508A (en) * 2017-05-27 2017-10-17 上海交通大学 Commercial product recommending system and its method based on Novel Temporal Scenario
CN107578332A (en) * 2017-09-22 2018-01-12 深圳乐信软件技术有限公司 A kind of method, apparatus, equipment and storage medium for recommending cash commodity
CN107729542B (en) * 2017-10-31 2021-05-04 咪咕音乐有限公司 Information scoring method and device and storage medium
CN107743249A (en) * 2017-11-27 2018-02-27 四川长虹电器股份有限公司 A kind of CTR predictor methods based on Model Fusion
CN109241425B (en) * 2018-08-31 2022-02-18 腾讯科技(深圳)有限公司 Resource recommendation method, device, equipment and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9838743B2 (en) * 2016-02-25 2017-12-05 Adobe Systems Incorporated Techniques for context aware video recommendation
WO2018053257A1 (en) * 2016-09-16 2018-03-22 Second Spectrum, Inc. Methods and systems of spatiotemporal pattern recognition for video content development
CN107103499A (en) * 2017-04-26 2017-08-29 中国人民大学 Method is recommended in a kind of cross-site cold start-up based on deep learning
CN107563867A (en) * 2017-09-06 2018-01-09 天津大学 A kind of commending system cold start-up method based on multi-arm fruit machine confidence upper limit

Also Published As

Publication number Publication date
CN111861605A (en) 2020-10-30

Similar Documents

Publication Publication Date Title
WO2020221022A1 (en) Service object recommendation method
JP6134444B2 (en) Method and system for recommending information
JP6425735B2 (en) Determination of temporary trade limits
KR20200123015A (en) Information recommendation method, apparatus, device and medium
CN105469263A (en) Commodity recommendation method and device
Xue et al. Pricing personalized bundles: A new approach and an empirical study
TW201804400A (en) Data object pushing method, device and system
CN112598472A (en) Product recommendation method, device, system, medium and program product
CN111429214B (en) Transaction data-based buyer and seller matching method and device
CN111292109A (en) Method, system, device and storage medium for determining price interval of product
Nurchayati et al. Antecedents of user attitude towards e-commerce and future purchase intention
Jiang et al. Intertemporal pricing via nonparametric estimation: Integrating reference effects and consumer heterogeneity
US20170068962A1 (en) Forecasting customer channel choice using cross-channel loyalty
US10096045B2 (en) Tying objective ratings to online items
CN112348590A (en) Method and device for determining value of article, electronic equipment and storage medium
CN110796505B (en) Business object recommendation method and device
CN110796520A (en) Commodity recommendation method and device, computing equipment and medium
CN115827994A (en) Data processing method, device, equipment and storage medium
JP6664580B2 (en) Calculation device, calculation method and calculation program
CN114529399A (en) User data processing method, device, computer equipment and storage medium
CN114549125A (en) Item recommendation method and device, electronic equipment and computer-readable storage medium
CN110738538B (en) Method and device for identifying similar objects
CN113362141A (en) Associated commodity recommendation method, device, medium and electronic equipment
CN113792952A (en) Method and apparatus for generating a model
JP6809148B2 (en) Program and combination extraction system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20798037

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20798037

Country of ref document: EP

Kind code of ref document: A1