CN108629665B - Personalized commodity recommendation method and system - Google Patents
Personalized commodity recommendation method and system Download PDFInfo
- Publication number
- CN108629665B CN108629665B CN201810433175.XA CN201810433175A CN108629665B CN 108629665 B CN108629665 B CN 108629665B CN 201810433175 A CN201810433175 A CN 201810433175A CN 108629665 B CN108629665 B CN 108629665B
- Authority
- CN
- China
- Prior art keywords
- commodity
- behavior data
- user
- commodities
- historical behavior
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 68
- 230000006399 behavior Effects 0.000 claims abstract description 162
- 238000012549 training Methods 0.000 claims abstract description 65
- 230000002452 interceptive effect Effects 0.000 claims abstract description 47
- 238000013136 deep learning model Methods 0.000 claims abstract description 44
- 238000004364 calculation method Methods 0.000 claims abstract description 17
- 239000013598 vector Substances 0.000 claims description 57
- 230000000306 recurrent effect Effects 0.000 claims description 25
- 238000003062 neural network model Methods 0.000 claims description 20
- 230000003993 interaction Effects 0.000 claims description 17
- 238000012163 sequencing technique Methods 0.000 claims description 13
- 238000012216 screening Methods 0.000 claims description 11
- 238000010606 normalization Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 230000000694 effects Effects 0.000 abstract description 5
- 238000013528 artificial neural network Methods 0.000 description 6
- 238000001914 filtration Methods 0.000 description 6
- 230000006870 function Effects 0.000 description 3
- 230000009286 beneficial effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 241000531116 Blitum bonus-henricus Species 0.000 description 1
- 235000008645 Chenopodium bonus henricus Nutrition 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000000516 sunscreening agent Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0631—Item recommendations
Abstract
The invention discloses a personalized commodity recommendation method and a system, wherein the method comprises the following steps: acquiring historical behavior data of a plurality of users in a preset time period, and sorting the historical behavior data according to a preset rule to obtain a first training sample; obtaining an influence factor based on a cosine similarity method to serve as a second training sample; training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model; and outputting the list of the commodities predicted by the model to be interested by the user. According to the invention, the time sequence information of the commodities in the historical behaviors of the user is effectively utilized, so that the commodities in the historical behaviors have different weighted values in the calculation of the recommendation system according to the time sequence of the occurrence of the interactive behaviors, the commodity influence factors reflect the global characteristics of the commodities and the interest degree of the user on the commodities, the characteristic quantity obtained by the deep learning model is effectively increased, and the personalized recommendation effect on the cold-start user is effectively improved.
Description
Technical Field
The invention relates to a personalized commodity recommendation method and system, and belongs to the technical field of recommendation.
Background
With the popularization of the internet and the rapid development of electronic commerce, more and more users browse and purchase goods through an electronic commerce platform. The variety of commodities in the e-commerce platform is various, and it is difficult to accurately recommend commodities which are interested by a user to the user without the help of a personalized recommendation engine.
One of the mainstream methods of the personalized recommendation method adopted by the current e-commerce platform is as follows: the collaborative filtering method recommends the commodities, which are similar to the historical browsing or purchasing behaviors of the current user, of the users interested in the behaviors to the current user by searching for the users similar to the historical browsing or purchasing behaviors of the current user. The personalized recommendation method based on collaborative filtering gives the same weight to all the commodities browsed or purchased by the current user in history when calculating the similarity with other commodities, for example, a commodity A browsed by the current user ten days ago, a commodity B browsed by the current user one hour ago, and the roles of the commodity A and the commodity B in the collaborative filtering method for calculating the commodities which are interested by the current user are the same. However, from experience, the reference value of the current personalized commodity recommendation of the commodity B browsed before one hour is often higher than that of the commodity A browsed before ten days, so that the personalized recommendation method based on collaborative filtering cannot reflect the weight difference of commodities in different sequences in historical behaviors in the current personalized recommendation, and the recommendation effect is not accurate enough. In addition, the effect of the collaborative filtering algorithm is not good when the user has too little historical browsing or purchasing data, i.e. there is a cold start problem.
The second main method is: the method based on the recurrent neural network trains and predicts the commodities which the user is interested in by sequentially inputting the historical behaviors of the user into the recurrent neural network. The method can utilize the commodity time sequence characteristics in the user historical behaviors, so that the effect is generally superior to that of a collaborative filtering algorithm under the conditions of sufficient training samples and proper adjustment of recurrent neural network parameters. Under the condition that the historical behavior data of a user is less, the personalized recommendation method based on the recurrent neural network has the cold start problem because the historical browsing or commodity purchasing data which can be used for predicting the interestingness of the recurrent network is too less.
Both of the above methods have a cold start problem, that is, when the available user history operation records are less, the commodity cannot be recommended well, and the user experience is affected. The cold start problem is a common problem in the field of personalized recommendation, and since the grasped historical behavior data of the user to be recommended currently is too little, it is difficult to accurately provide the user with commodities which the user may be interested in.
One of the main solutions to the cold start problem in the existing solutions: the method based on the user information recommends according to basic information such as the age, sex, residence and the like of the user. However, since some users may not fill in detailed personal information, the user information that can be acquired by the recommendation system is limited. And the recommendation made according to the information of the gender, the age and the like of the user only represents the general interest of the user group, and is not personalized recommendation for the user.
The second main solution to the cold start problem in the existing solution is: according to the label selection method, various classified labels are provided for the user to select when the user registers for use for the first time, and the commodities under the label classification selected by the user are recommended to the user under the condition that user behavior data are insufficient. The problem with the tag selection based approach is: some users may not fill in category labels at the beginning of use, and may choose to "skip this step" or choose at will, resulting in the selected labels not really representing the user's interests. And the goods that the user is interested in may change with time, so there is great difference between the interest label of original choice and the goods of current interest, for example, a certain user may be "sun cream" in summer, change to "down coat" in winter, if recommend according to the originally selected label of user, will produce very big deviation. Furthermore, due to the variety of goods, the label cannot completely cover all goods features.
Disclosure of Invention
Aiming at the defects, the invention provides a personalized commodity recommendation method and a system, after historical behavior data of a user is obtained, the data are analyzed and sorted, influence factors of each commodity corresponding to the current user are obtained by calculation according to historical behavior information of the current user and global characteristics corresponding to the commodities in the historical behaviors, the commodities in the historical behaviors of the user and the influence factors corresponding to the commodities are sequentially input into a preset deep learning model according to the time sequence of interaction between the user and the commodities to obtain predicted commodities which are interested by the current user, the commodities are ranked according to the predicted interest degrees, and a plurality of commodities with the highest interest degrees of the user are recommended to the user according to the ranked sequence, wherein the deep learning model adopts a recurrent neural network model.
The invention can effectively utilize the time sequence information of the commodities in the historical behaviors of the user, so that the commodities in the historical behaviors have different weighted values according to the time sequence of the occurrence of the interactive behaviors in the calculation of the recommendation system.
In order to achieve the purpose, the invention is concretely realized by the following technical scheme:
the invention provides a personalized commodity recommendation method, which comprises the following steps:
acquiring historical behavior data of a plurality of users in a preset time period, and sorting the historical behavior data according to a preset rule to obtain a first training sample;
calculating an influence factor corresponding to each commodity in the historical behavior data of each user based on a cosine similarity method by using the sorted historical behavior data as a second training sample;
training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model;
and inputting a first training sample and a second training sample of the pre-recommended user into the trained deep learning model, outputting a commodity list predicted by the model and interested by the user, and recommending the commodity list to the pre-recommended user according to the commodity sequence in the commodity list.
Further, the step of obtaining the first training sample after the arrangement according to the predetermined rule includes:
screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period;
converting the information in the interactive behavior data of the specific category into a unique number form for storage;
and sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
Further, the specific step of calculating the influence factor corresponding to each commodity in the historical behavior data of each user based on the cosine similarity method includes:
calculating to obtain N-dimensional vector of specific commodity based on cosine similarity methodThe vector is subjected to maximum value normalization, so that each value in the vector is in [0,1]]In the method, the N-dimensional vector is an influence factor corresponding to the commodity, N is the total quantity of the commodity in the preprocessed historical behavior data or the quantity of candidate commodities screened in advance by certain rules from the preprocessed historical behavior data, and the Value of the ith number in the influence factor represented by the N-dimensional vector corresponding to the commodityiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user.
Further, the deep learning model includes, but is not limited to, a recurrent neural network model.
Further, the recommending to the pre-recommending user according to the commodity sequence in the commodity list includes:
recommending a plurality of commodities ranked in the front to the pre-recommending user according to the sequence of the interest degrees of the commodities predicted in the commodity list from high to low.
The invention also provides a personalized commodity recommendation system, which comprises:
the acquisition module is used for acquiring historical behavior data of a plurality of users in a preset time period, and obtaining a first training sample after the historical behavior data is arranged according to a preset rule;
the calculation module is used for calculating an influence factor corresponding to each commodity in the historical behavior data of each user based on a cosine similarity method by using the sorted historical behavior data as a second training sample;
the training module is used for training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model;
and the recommending module is used for inputting the first training sample and the second training sample of the pre-recommending user into the trained deep learning model, outputting a commodity list predicted by the model and interested by the user, and recommending the commodity list to the pre-recommending user according to the commodity sequence in the commodity list.
The acquisition module includes:
the screening unit is used for screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period;
the conversion unit is used for converting the information in the interactive behavior data of the specific category into a unique number form for storage;
and the sequencing unit is used for sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
The calculation module comprises:
a calculating unit for calculating N-dimensional vector of specific commodity based on cosine similarity method and normalizing the maximum value of the vector to make each value in the vector be [0, 1%]In the method, the N-dimensional vector is an influence factor corresponding to the commodity, N is the total quantity of the commodity in the preprocessed historical behavior data or the quantity of candidate commodities screened in advance by certain rules from the preprocessed historical behavior data, and the Value of the ith number in the influence factor represented by the N-dimensional vector corresponding to the commodityiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user.
The deep learning model includes, but is not limited to, a recurrent neural network model.
The recommending to the pre-recommending user according to the commodity sequence in the commodity list comprises the following steps:
recommending a plurality of commodities ranked in the front to the pre-recommending user according to the sequence of the interest degrees of the commodities predicted in the commodity list from high to low.
The invention has the beneficial effects that:
by the technical scheme provided by the invention, the user historical behavior data containing the time sequence information can be processed, and due to the fact that the correlation between the commodities interacted at different times and the commodities interested by the current user is different, the time sequence information contained in the user historical behavior is extracted and utilized in the deep learning model, and commodity personalized recommendation can be more accurately carried out on the user. The invention creatively provides commodity influence factors, global characteristics of each commodity are found from historical behaviors of a plurality of users, influence factors corresponding to the commodities are calculated, and commodity vector expression forms in the preprocessed historical behaviors of the users and vector expression forms of the corresponding influence factors are jointly input into the deep learning model.
Drawings
Fig. 1 is a flowchart illustrating an embodiment of a method for recommending personalized goods according to the present invention.
Detailed Description
The technical solutions of the present invention are specifically described below, it should be noted that the technical solutions of the present invention are not limited to the embodiments described in the examples, and those skilled in the art should refer to and refer to the contents of the technical solutions of the present invention, and make improvements and designs on the basis of the present invention, and shall fall into the protection scope of the present invention.
Example one
The embodiment of the invention provides a personalized commodity recommendation method, which comprises the following steps of S110-S140:
in step S110, historical behavior data of a plurality of users within a preset time period is obtained, and a first training sample is obtained after the historical behavior data is collated according to a predetermined rule.
The specific value of the preset time period and the number of extracted users can be set according to actual conditions, for example, if the time period is set to be one month and the number of extracted users is Num, historical behavior data of Num users randomly selected in one month are extracted from all current data. Since personalized recommendation is usually time-efficient, for example, a user browses 'down jackets' half a year ago and browses 'one-piece dress' recently, if browsing data of the user half a year ago is still considered, the recommendation effect may be the right of the way, and therefore specific values of the preset time period need to be set according to actual application conditions.
Further, the step of obtaining the first training sample after the arrangement according to the predetermined rule includes: screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period; converting the information in the interactive behavior data of the specific category into a unique number form for storage; and sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
The step of sorting according to a preset rule is a preprocessing process of the invention, and data sorting is carried out according to historical behavior data extracted in a preset time period. Because the specific implementation scenes are different, the behavior data types may be different, for example, the behavior data may be in various forms such as browsing, paying attention to, listening to, watching and the like, so that the behavior data is generated by using "interaction" to refer to the user and the commodity. The preprocessing process specifically comprises the steps of converting information in the interactive behavior data of a specific category into a unique number form for storage; and sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence, and the like.
And screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period, namely selecting behavior data of the specific category from all the interactive behavior data of the users according to the requirements of specific applications. For example, behavior data of a user in a shopping website can include behaviors of browsing, adding in a favorite, adding in a shopping cart, purchasing and the like, if personalized recommendation is planned only by using the browsing data of the user, browsing commodity data of the user is extracted, and interactive behaviors of the user in adding in the favorite and the like are ignored.
The information in the interaction behavior data of the specific category is converted into a unique number form to be stored, that is, the specific information is converted into a number form to be replaced, specifically, the user is replaced in the number form, the commodity is replaced in the number form, and the numbers of the same commodities interacted by different users are the same. And then, the commodity numbers are converted into vector representation, and each commodity has a unique vector representation form and is convenient to be used as an input of a deep learning model. There are many ways to convert the commodity numbers into vector representation, as long as it is ensured that each converted commodity has a unique vector representation form and is not confused with other commodities. For example, one method for converting the commodity number into a vector representation may be: and N is the total number of commodities in the preprocessed historical behavior data or the number of candidate commodities screened from the preprocessed historical behavior data through some rules in advance, an N-dimensional vector is constructed, wherein the commodity A is represented as [1,0,0, …,0,0], namely the other N-1 bits of the vector are 0 except for the first bit of 1, and the commodity B can be represented as [0,1,0, …,0,0], namely the other N-1 bits of the vector are 0 except for the second bit of 1, and the N commodities in the commodity library are converted into N vector forms for representation.
And sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence, namely sequencing the commodities of the same user after screening out the specific interactive behaviors according to the time sequence of the interactive behaviors, wherein the commodity which is interacted firstly is arranged in front of the commodity which is interacted secondly.
In step S120, using the sorted historical behavior data, an influence factor corresponding to each commodity in the historical behavior data of each user is calculated based on a cosine similarity method to serve as a second training sample.
Further, the specific step of calculating the influence factor corresponding to each commodity in the historical behavior data of each user based on the cosine similarity method includes:
calculating to obtain an N-dimensional vector of a specific commodity based on a cosine similarity method, and then carrying out maximum value normalization on the vector to ensure that each numerical value in the vector is [0,1]]In the method, the N-dimensional vector is an influence factor corresponding to the commodity, N is the total quantity of the commodity in the preprocessed historical behavior data or the quantity of candidate commodities screened in advance by certain rules from the preprocessed historical behavior data, and the Value of the ith number in the influence factor represented by the N-dimensional vector corresponding to the commodityiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user. The N-dimensional vector of the specific commodity is obtained after the maximum value normalization is calculated by the method, each numerical value in the vector is between [0 and 1], and the N-dimensional vector is the influence factor corresponding to the specific commodity. The larger the value of the ith number in the vector is, the greater the correlation between the specific product and the product corresponding to the ith number is.
And judging the heat degree of the commodity, the times of the two commodities being interested by the same user and the like according to the times of the commodities appearing in the historical behavior data by utilizing the preprocessed historical behavior data. Generally, the commodities interacted by more users are more likely to cause the current user interaction, and the fact that two commodities are frequently and simultaneously appeared in the historical behavior data of the same user can indicate that the two commodities are relatively strong in relevance, and if the current user interacts with one commodity, the other commodity is interested with a relatively high probability.
In step S130, the first training sample and the second training sample are trained as training samples of the deep learning model, so as to obtain a trained deep learning model.
And (4) inputting the historical behavior data obtained after the arrangement according to the preset rules in the step (S110) and the influence factors of the commodities obtained in the step (S120) as training samples into the deep learning model for training. The deep learning model adopted by the method is a recurrent neural network model which can be an RNN model and an improved model thereof, such as LSTM and the like.
And randomly initializing each parameter in the deep learning model, and then sequentially inputting a vector expression corresponding to the interactive commodity and a vector expression of an influence factor into the recurrent neural network model for each user according to the interaction occurrence time sequence of the interactive commodity corresponding to the user according to the user number in the preprocessed historical behavior data to train the neural network.
For the recurrent neural network model, the ith input is the vector expression form of the ith commodity interacted by the user and the vector expression form of the influence factor thereof in the preprocessed behavior data, at this time, the model obtains a predicted output, the output is compared with the (i + 1) th commodity interacted by the user in the preprocessed behavior data, the deviation of the recurrent neural network is calculated, and the parameter of the neural network model is continuously corrected according to the deviation. After the vector expression forms of all the commodities interacted by one user in the preprocessed behavior data and the vector expression forms of the influence factors of the commodities are input into the deep learning model in sequence for training, the vector expression form of the interactive commodity of the next user in the preprocessed behavior data and the vector expression form of the influence factors of the interactive commodity are input into the trained recurrent neural network model in sequence according to time, and training is continued until the vector expression forms of the interactive commodities of all the users and the vector expression forms of the influence factors of the interactive commodities in the preprocessed behavior data are all input into the deep learning model for training. At this time, a trained deep learning model is obtained. Because the recurrent neural network model is generally used in the field of deep learning, the specific construction method of the recurrent neural network model is not repeated.
The model can obtain the next predicted commodity by sequentially inputting the historical behavior data preprocessed by the user to be subjected to personalized recommendation into the model, and the specific output of the next predicted commodity is the predicted scores of all commodities in the selected commodity library and is arranged from high to low according to the scores. The selected commodity library may be all commodities in a platform to which the recommendation method is applied, or a candidate commodity library that is pre-screened according to some principles. For example, if the shopping platform wants to recommend new commodities to the user and selects a plurality of commodities from 2000 new online commodities to recommend to the user, the candidate commodity library at this time contains the 2000 new online commodities.
In step S140, the first training sample and the second training sample of the pre-recommended user are input into the trained deep learning model, and the commodity list predicted by the model to be interested by the user is output and recommended to the pre-recommended user according to the commodity sequence in the commodity list.
When the pre-recommended user or the current user is recommended in an individualized manner, the historical behavior data of the pre-recommended user or the current user in the selected time period is acquired, and the historical behavior data of the pre-recommended user or the current user sorted according to the predetermined rule and the influence factors corresponding to the commodities, which are acquired in the step S110 and the step S120, are input into the deep learning model trained in the step S130 for calculation. And the trained deep learning model obtains the scores of the commodities in the predicted commodity library, the commodities are ranked from high to low according to the scores, the commodities are a commodity list interested by the current user, and a plurality of commodities ranked in the front are recommended to the current user according to the sequence of the predicted interest degree from high to low. The higher the predicted interest level, the more forward the merchandise is ranked.
If only the commodities in the candidate commodity library, for example 2000 new online commodities, are recommended to the user, the prediction scores of the 2000 commodities are only calculated and ranked. And then selecting a plurality of commodities ranked in the front from the commodity list in which the current user is interested to recommend the commodities to the current user according to the specific commodity quantity recommended to the current user as required, and completing the personalized recommendation to the current user.
The commodities are not limited to physical commodities such as clothes, daily necessities and the like, and for a multimedia platform, specific music and movies can be regarded as commodities under the conditions that the music is recommended to a user in a personalized manner according to a song listening list of the user or the interactive information between other users and a music platform, or the movies are recommended to the user in a personalized manner according to the movie watching of the user or the other interactive information between the user and the movies.
The method can collect the interactive behaviors of a plurality of users in a specific time period, and can be used for carrying out personalized recommendation by predicting the commodities interested by the users by using an improved recurrent neural network model according to the interactive behaviors of the users on an Internet platform and containing time sequence information. The improved recurrent neural network model may be the number of neurons of the network model, the number of layers of the network model, the addition threshold function and the like, and if the recurrent neural network model with the improved structure does not directly influence the manner of adding the commodity time sequence information and the influence factors into the recurrent neural network model, which is provided by the method, the method can be regarded as one of the implementation methods of the method.
The deep learning model can be a recurrent neural network model or a model obtained by improving the recurrent neural network model, such as an LSTM model, and if the deep learning model is a model obtained by recursively ordering the commodities according to a certain rule, and sequentially inputting the corresponding features of the current commodity, and then outputting the corresponding deep learning which is the features of the next commodity, the deep learning model can be regarded as the recurrent neural network model and the improved model thereof.
The interactive behavior can be one or a combination of behaviors of browsing commodities, watching or listening, adding to favorites, adding to shopping carts, purchasing and the like.
If only the coefficient of the influence factor calculation method is changed, the method still belongs to the protection scope of the method.
Example two
The second embodiment of the present invention further provides a personalized commodity recommendation system, including:
the acquisition module is used for acquiring historical behavior data of a plurality of users in a preset time period, and obtaining a first training sample after the historical behavior data is arranged according to a preset rule;
the calculation module is used for calculating an influence factor corresponding to each commodity in the historical behavior data of each user based on a cosine similarity method by using the sorted historical behavior data as a second training sample;
the training module is used for training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model;
and the recommending module is used for inputting the first training sample and the second training sample of the pre-recommending user into the trained deep learning model, outputting a commodity list predicted by the model and interested by the user, and recommending the commodity list to the pre-recommending user according to the commodity sequence in the commodity list.
The acquisition module includes:
the screening unit is used for screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period;
the conversion unit is used for converting the information in the interactive behavior data of the specific category into a unique number form for storage;
and the sequencing unit is used for sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
The calculation module comprises:
a calculating unit for calculating N-dimensional vector of specific commodity based on cosine similarity method and normalizing the maximum value of the vector to make each value in the vector be [0, 1%]In between, the N-dimensional vector is the influence factor corresponding to the commodity, N is preThe total number of commodities in the processed historical behavior data or the number of candidate commodities screened in advance from the preprocessed historical behavior data through some rules, and the Value of the ith number in the influence factors expressed by the N-dimensional vectors corresponding to the commoditiesiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user.
The deep learning model includes, but is not limited to, a recurrent neural network model.
The recommending to the pre-recommending user according to the commodity sequence in the commodity list comprises the following steps:
recommending a plurality of commodities ranked in the front to the pre-recommending user according to the sequence of the interest degrees of the commodities predicted in the commodity list from high to low.
The specific implementation functions and processing modes refer to specific steps described in the first embodiment of the method.
Since the processing and functions implemented by the system of the second embodiment substantially correspond to the embodiment, the principle and the example of the method shown in fig. 1, the description of the embodiment is not detailed, and reference may be made to the related description in the foregoing embodiment, which is not described herein again.
The invention has the beneficial effects that:
by the technical scheme provided by the invention, the user historical behavior data containing the time sequence information can be processed, and due to the fact that the correlation between the commodities interacted at different times and the commodities interested by the current user is different, the time sequence information contained in the user historical behavior is extracted and utilized in the deep learning model, and commodity personalized recommendation can be more accurately carried out on the user. The invention creatively provides commodity influence factors, global characteristics of each commodity are found from historical behaviors of a plurality of users, influence factors corresponding to the commodities are calculated, and commodity vector expression forms in the preprocessed historical behaviors of the users and vector expression forms of the corresponding influence factors are jointly input into the deep learning model.
Through the above description of the embodiments, those skilled in the art will clearly understand that the present invention may be implemented by hardware, or by software plus a necessary general hardware platform. Based on such understanding, the technical solution of the present invention can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.), and includes several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments of the present invention.
The above disclosure is only for a few specific embodiments of the present invention, however, the present invention is not limited to the above embodiments, and any variations that can be made by those skilled in the art are intended to fall within the scope of the present invention.
Claims (8)
1. A method for recommending personalized goods, the method comprising:
acquiring historical behavior data of a plurality of users in a preset time period, and sorting the behavior data obtained after sorting according to a preset rule according to a time sequence to obtain a first training sample;
calculating an influence factor corresponding to each commodity in the historical behavior data of each user based on a cosine similarity method by using the sorted historical behavior data as a second training sample;
training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model;
inputting a first training sample and a second training sample of a pre-recommended user into a trained deep learning model, outputting a commodity list predicted by the model and interested by the user, and recommending the commodity list to the pre-recommended user according to the commodity sequence in the commodity list;
the specific steps of calculating the influence factor corresponding to each commodity in the historical behavior data of each user based on the cosine similarity method comprise:
calculating to obtain an N-dimensional vector of a specific commodity based on a cosine similarity method, and then carrying out maximum value normalization on the vector to ensure that each numerical value in the vector is [0,1]]In the method, the N-dimensional vector is an influence factor corresponding to the commodity, N is the total quantity of the commodity in the preprocessed historical behavior data or the quantity of candidate commodities screened in advance by certain rules from the preprocessed historical behavior data, and the Value of the ith number in the influence factor represented by the N-dimensional vector corresponding to the commodityiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user.
2. The method of claim 1, wherein the step of obtaining the first training sample after the sorting according to the predetermined rule comprises:
screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period;
converting the information in the interactive behavior data of the specific category into a unique number form for storage;
and sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
3. The method of claim 1, in which the deep learning model comprises a recurrent neural network model.
4. The method of claim 1, wherein the recommending to the pre-recommending user in the order of the items in the item list comprises:
recommending a plurality of commodities ranked in the front to the pre-recommending user according to the sequence of the interest degrees of the commodities predicted in the commodity list from high to low.
5. A personalized goods recommendation system, comprising:
the acquisition module is used for acquiring historical behavior data of a plurality of users in a preset time period, and sequencing the behavior data obtained after the behavior data is sorted according to a preset rule in a time sequence to obtain a first training sample;
the calculation module is used for calculating an influence factor corresponding to each commodity in the historical behavior data of each user based on a cosine similarity method by using the sorted historical behavior data as a second training sample;
the training module is used for training the first training sample and the second training sample as training samples of the deep learning model to obtain a trained deep learning model;
the recommendation module is used for inputting a first training sample and a second training sample of a pre-recommended user into the trained deep learning model, outputting a commodity list predicted by the model and interested by the user, and recommending the commodity list to the pre-recommended user according to the commodity sequence in the commodity list;
the calculation module comprises:
a calculating unit for calculating N-dimensional vector of specific commodity based on cosine similarity method and normalizing the maximum value of the vector to make each value in the vector be [0, 1%]In the method, the N-dimensional vector is an influence factor corresponding to the commodity, N is the total quantity of the commodity in the preprocessed historical behavior data or the quantity of candidate commodities screened in advance by certain rules from the preprocessed historical behavior data, and the Value of the ith number in the influence factor represented by the N-dimensional vector corresponding to the commodityiThe specific calculation method comprises the following steps:
the method comprises the steps that Count (specific commodity & ith commodity) represents the times of the specific commodity and the ith commodity appearing in preprocessed historical behavior data together, i is larger than or equal to 1 and smaller than or equal to N, Count (specific commodity) represents the total interaction times of the specific commodity in the historical behavior data sorted according to a preset rule, Count (ith commodity) represents the total interaction times of the ith commodity in the historical behavior data sorted according to the preset rule, and Max (values) represents the maximum value of influence factors corresponding to all commodities in the historical behavior data of each user.
6. The system of claim 5, wherein the acquisition module comprises:
the screening unit is used for screening out interactive behavior data of a specific category based on the acquired historical behavior data of a plurality of users in a preset time period;
the conversion unit is used for converting the information in the interactive behavior data of the specific category into a unique number form for storage;
and the sequencing unit is used for sequencing the interactive behavior data of each user in the interactive behavior data of the specific category according to the time sequence.
7. The system of claim 5, in which the deep learning model comprises a recurrent neural network model.
8. The system of claim 5, wherein the recommending to the pre-recommending user in the order of the items in the item list comprises:
recommending a plurality of commodities ranked in the front to the pre-recommending user according to the sequence of the interest degrees of the commodities predicted in the commodity list from high to low.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810433175.XA CN108629665B (en) | 2018-05-08 | 2018-05-08 | Personalized commodity recommendation method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810433175.XA CN108629665B (en) | 2018-05-08 | 2018-05-08 | Personalized commodity recommendation method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108629665A CN108629665A (en) | 2018-10-09 |
CN108629665B true CN108629665B (en) | 2021-07-16 |
Family
ID=63695947
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810433175.XA Active CN108629665B (en) | 2018-05-08 | 2018-05-08 | Personalized commodity recommendation method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108629665B (en) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109614544B (en) * | 2018-10-30 | 2023-11-03 | 北京奇虎科技有限公司 | Method and device for predicting personal information of user |
CN111126110B (en) * | 2018-10-31 | 2024-01-05 | 杭州海康威视数字技术股份有限公司 | Commodity information identification method, settlement method, device and unmanned retail system |
CN109242654A (en) * | 2018-11-15 | 2019-01-18 | 中国联合网络通信集团有限公司 | A kind of item recommendation method and system |
CN109446430B (en) * | 2018-11-29 | 2021-10-01 | 西安电子科技大学 | Product recommendation method and device, computer equipment and readable storage medium |
CN109829761A (en) * | 2019-01-31 | 2019-05-31 | 广州视源电子科技股份有限公司 | A kind of commodity selection method, apparatus, equipment and storage medium |
CN111695023A (en) * | 2019-03-11 | 2020-09-22 | 北京京东尚科信息技术有限公司 | Information recommendation method and device, storage medium and equipment |
CN110070148B (en) * | 2019-03-15 | 2021-06-29 | 北京木业邦科技有限公司 | Forestry product feature analysis method and device and computer readable medium |
CN109919737B (en) * | 2019-03-20 | 2021-09-28 | 中电科大数据研究院有限公司 | Recommendation method and system for producing and selling commodities |
CN110232153A (en) * | 2019-05-29 | 2019-09-13 | 华南理工大学 | A kind of cross-cutting recommended method based on content |
CN110442810B (en) * | 2019-08-08 | 2023-06-13 | 广州华建工智慧科技有限公司 | Mobile terminal BIM model intelligent caching method based on deep FM recommendation algorithm |
CN110555753A (en) * | 2019-08-14 | 2019-12-10 | 中国平安人寿保险股份有限公司 | recommendation-based ranking control method and device, computer equipment and storage medium |
CN110851718B (en) * | 2019-11-11 | 2022-06-28 | 重庆邮电大学 | Movie recommendation method based on long and short term memory network and user comments |
CN111325572A (en) * | 2020-01-21 | 2020-06-23 | 深圳前海微众银行股份有限公司 | Data processing method and device |
CN111340115B (en) * | 2020-02-27 | 2023-05-12 | 青岛创新奇智科技集团股份有限公司 | Recommendation model training method and system based on feature dynamic construction |
CN113495966B (en) * | 2020-03-18 | 2023-06-23 | 北京达佳互联信息技术有限公司 | Interactive operation information determining method and device and video recommendation system |
CN113538079A (en) * | 2020-04-17 | 2021-10-22 | 北京金山数字娱乐科技有限公司 | Recommendation model training method and device, and recommendation method and device |
CN111639989B (en) * | 2020-04-28 | 2024-02-02 | 上海秒针网络科技有限公司 | Commodity recommendation method and readable storage medium |
CN111612588B (en) * | 2020-05-27 | 2023-12-26 | 腾讯科技(深圳)有限公司 | Commodity presenting method and device, computing equipment and computer readable storage medium |
CN111951051B (en) * | 2020-08-14 | 2023-09-26 | 中国工商银行股份有限公司 | Method, device and system for recommending products to clients |
CN112330362A (en) * | 2020-11-04 | 2021-02-05 | 江苏瑞祥科技集团有限公司 | Rapid data intelligent analysis method for internet mall user behavior habits |
CN112329172B (en) * | 2020-11-05 | 2022-12-27 | 上海交通大学 | Shield tunneling machine cutter head torque prediction method and system based on parallel neural network |
CN113360773B (en) * | 2021-07-07 | 2023-07-04 | 脸萌有限公司 | Recommendation method and device, storage medium and electronic equipment |
CN113705792A (en) * | 2021-08-31 | 2021-11-26 | 平安银行股份有限公司 | Personalized recommendation method, device, equipment and medium based on deep learning model |
CN114820142B (en) * | 2022-06-29 | 2022-09-16 | 国能(北京)商务网络有限公司 | Commodity information recommendation method for B-side purchasing user |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103700005A (en) * | 2013-12-17 | 2014-04-02 | 南京信息工程大学 | Association-rule recommending method based on self-adaptive multiple minimum supports |
CN106294758A (en) * | 2016-09-23 | 2017-01-04 | 华南师范大学 | Collaborative recommendation method based on the change of user cognition degree |
CN106846082A (en) * | 2016-12-10 | 2017-06-13 | 江苏途致信息科技有限公司 | Tourism cold start-up consumer products commending system and method based on hardware information |
CN107256494A (en) * | 2017-05-18 | 2017-10-17 | 深圳大学 | A kind of item recommendation method and device |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10373230B2 (en) * | 2012-10-05 | 2019-08-06 | Raise Marketplace, Llc | Computer-implemented method for recommendation system input management |
-
2018
- 2018-05-08 CN CN201810433175.XA patent/CN108629665B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103700005A (en) * | 2013-12-17 | 2014-04-02 | 南京信息工程大学 | Association-rule recommending method based on self-adaptive multiple minimum supports |
CN106294758A (en) * | 2016-09-23 | 2017-01-04 | 华南师范大学 | Collaborative recommendation method based on the change of user cognition degree |
CN106846082A (en) * | 2016-12-10 | 2017-06-13 | 江苏途致信息科技有限公司 | Tourism cold start-up consumer products commending system and method based on hardware information |
CN107256494A (en) * | 2017-05-18 | 2017-10-17 | 深圳大学 | A kind of item recommendation method and device |
Also Published As
Publication number | Publication date |
---|---|
CN108629665A (en) | 2018-10-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108629665B (en) | Personalized commodity recommendation method and system | |
CN108648049B (en) | Sequence recommendation method based on user behavior difference modeling | |
CN110717098B (en) | Meta-path-based context-aware user modeling method and sequence recommendation method | |
US7805431B2 (en) | System and method for generating a display of tags | |
US8407104B2 (en) | Catalog based price search | |
CN111461841B (en) | Article recommendation method, device, server and storage medium | |
CN111709812A (en) | E-commerce platform commodity recommendation method and system based on user dynamic classification | |
Bauer et al. | Recommender systems based on quantitative implicit customer feedback | |
CN109064285B (en) | Commodity recommendation sequence and commodity recommendation method | |
CN102902691A (en) | Recommending method and recommending system | |
CN111737418B (en) | Method, apparatus and storage medium for predicting relevance of search term and commodity | |
US20210366006A1 (en) | Ranking of business object | |
US11682060B2 (en) | Methods and apparatuses for providing search results using embedding-based retrieval | |
CN112508613A (en) | Commodity recommendation method and device, electronic equipment and readable storage medium | |
Levinas | An analysis of memory based collaborative filtering recommender systems with improvement proposals | |
CN115131101A (en) | Individualized intelligent recommendation system for insurance products | |
CN112488781A (en) | Search recommendation method and device, electronic equipment and readable storage medium | |
CN113761347A (en) | Commodity recommendation method, commodity recommendation device, storage medium and commodity recommendation system | |
Salampasis et al. | Comparison of RNN and Embeddings Methods for Next-item and Last-basket Session-based Recommendations | |
CN115131108A (en) | E-commerce commodity screening system | |
CN114298787A (en) | Automatic commodity recommendation method and device | |
Ghadekar et al. | Image-Based Product Recommendations Using Market Basket Analysis | |
CN116645167B (en) | Commodity recommendation system and method based on intelligent decision | |
CN117391824B (en) | Method and device for recommending articles based on large language model and search engine | |
Chawade et al. | The Book Forum: Recommendation Application System using Collaborative Filtering and Autoencoders |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |