CN114297521A - Product recommendation sorting algorithm based on big data - Google Patents

Product recommendation sorting algorithm based on big data Download PDF

Info

Publication number
CN114297521A
CN114297521A CN202111674049.1A CN202111674049A CN114297521A CN 114297521 A CN114297521 A CN 114297521A CN 202111674049 A CN202111674049 A CN 202111674049A CN 114297521 A CN114297521 A CN 114297521A
Authority
CN
China
Prior art keywords
recommendation
information
user
users
product
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111674049.1A
Other languages
Chinese (zh)
Inventor
谢洋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yiyun Cloud Computing Co ltd
Original Assignee
Shenzhen Yiyun Cloud Computing Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Yiyun Cloud Computing Co ltd filed Critical Shenzhen Yiyun Cloud Computing Co ltd
Priority to CN202111674049.1A priority Critical patent/CN114297521A/en
Publication of CN114297521A publication Critical patent/CN114297521A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a big data-based product recommendation sorting algorithm, which is used for generating recommendation candidate information comprising products expected by users by using a big data analysis algorithm installed in advance, generating a similar tendency historical relationship information set based on the calculated historical relationship information by collecting historical relationship information among the users, recommending candidate products according to the information set which has the similar tendency with the users and generates the recommendation candidate information by the arrangement of the recommendation algorithm generating the recommendation candidate information, and extracting the information of a preset number of the recommendation candidate products as the recommendation product information. By combining the historical relationship information and the recommendation algorithm arrangement, the purchased products can be recommended and sorted in a similarity accumulation mode when a plurality of users extracting approximate historical behaviors through big data generate new product purchasing behaviors, so that possible purchasing behaviors of the predicted users are generated, and the recommended products are sorted.

Description

Product recommendation sorting algorithm based on big data
Technical Field
The invention relates to the technical field of big data intelligent analysis and application, in particular to a product recommendation sequencing algorithm based on big data.
Background
With the rapid development of information technology and the rise of network technology, the network data is in an explosive growth trend by the internet technology, and more data information and services are full of networks. Meanwhile, the internet technology also becomes a common channel for people to gather and collect information. However, the data information resources have uneven quality and complex structure, so that people can hardly find their own requirements in huge information. In the prior art, for example, chinese patent application with publication number CN112380451A discloses a favorite content recommendation method based on big data, which includes acquiring target user information by acquiring a face image of a target person, acquiring user history information clustered with the target user information according to the target user information, optimizing the user history information, calculating similarity between the target user information and the acquired user history information to score unevaluated items of the target user, and finally pushing the items with high scores to the target user according to the score. The similarity calculation formula based on the project coincidence dependency improves the precision of the similarity between projects, and the collaborative filtering recommendation algorithm based on the projects is utilized to relieve the sparsity of the scoring data, so that the recommendation effect of the collaborative filtering recommendation algorithm based on the users is greatly improved.
Disclosure of Invention
The invention aims to provide a product recommendation sequencing algorithm based on big data, which can improve user experience and push accurately and efficiently aiming at the prior art.
The product recommendation sorting algorithm based on big data is used for generating recommendation candidate information comprising products expected by users by using a big data analysis algorithm installed in advance, recommending candidate products according to the arrangement of the recommendation algorithm generating the recommendation candidate information and the historical relationship information set with the same tendency of the users by collecting historical relationship information among the users and generating the historical relationship information set with the same tendency based on the calculated historical relationship information, and extracting the information of a preset number of the recommendation candidate products as the recommendation product information; extracting the similarity of the first m recommended candidate products and the first n users, accumulating the similarity of each user to the m recommended candidate products, obtaining the recommendation scores of the m recommended candidate products, and recommending the product information in the order of the scores. By combining the historical relationship information and the recommendation algorithm arrangement, the purchased products can be recommended and sorted in a similarity accumulation mode when a plurality of users extracting approximate historical behaviors through big data generate new product purchasing behaviors, so that possible purchasing behaviors of the predicted users are generated, and the recommended products are sorted.
In order to further optimize the technical scheme, the adopted measures further comprise:
the historical relationship information includes: browsing, purchasing, searching, collecting operation and order adding; the recommendation candidate information contains product name, product introduction, and purchase information.
Historical relationship information is stored in a user profile, context data for the user may be accessed, the context data indicating user activity; the user profile corresponding to the activities of the user is created through induction, comparison, clustering and the like, and the habit and the intention which are similar to each other among the users are evaluated.
The user similarity is a similarity between the contextual data in the user profile of the first user and the contextual data in the user profile of the second user. The similarity can be calculated by adopting a classical text or data similarity comparison method and then carrying out normalization.
User activities include browsing, purchasing, searching, collecting operations, joining orders and corresponding product names, product introductions, purchasing information. The records of the user activities are used for comparing the similarity between the users, under the condition that the departure times of the common behaviors are more, the corresponding users are gradually classified as users with similar behaviors and intentions, the corresponding users are used for predicting the behaviors which may occur to new (current) users with similar characteristics, and then the product information related to the behaviors which may occur is recommended to the new (current) users.
The invention also discloses a computer program of the product recommendation sorting algorithm based on the big data.
A storage medium storing the computer program of the big data based product recommendation ranking algorithm.
The invention adopts a product recommendation sorting algorithm based on big data, which is characterized in that: the recommendation system comprises a database, a recommendation server and a recommendation server, wherein the database is used for generating recommendation candidate information comprising products expected by users by using a big data analysis algorithm installed in advance, generating a similar tendency historical relationship information set by collecting historical relationship information among the users and based on the calculated historical relationship information, recommending candidate products according to the arrangement of the recommendation algorithm generating the recommendation candidate information and the similar tendency historical relationship information set of the users, and extracting a preset amount of recommendation candidate product information as recommendation product information; extracting the similarity of the first m recommended candidate products and the first n users, accumulating the similarity of each user to the m recommended candidate products, obtaining the recommendation scores of the m recommended candidate products, and recommending the product information in the order of the scores. By combining the historical relationship information and the recommendation algorithm arrangement, the purchased products can be recommended and sorted in a similarity accumulation mode when a plurality of users extracting approximate historical behaviors through big data generate new product purchasing behaviors, so that possible purchasing behaviors of the predicted users are generated, and the recommended products are sorted. Therefore, the method and the device have the advantages of improving user experience and pushing accurately and efficiently.
Drawings
FIG. 1 is a schematic diagram of a prior art method mode of an embodiment of the present invention;
FIG. 2 is a schematic diagram of an improved method mode of an embodiment of the present invention;
FIG. 3 is a flowchart illustrating a sequence of steps according to an embodiment of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the following examples.
Example (b):
referring to fig. 1 to 3, a big data based product recommendation sorting algorithm for generating recommendation candidate information including a product desired by a user using a big data analysis algorithm installed in advance, recommending candidate products according to an information set having a history relation of similar tendencies with the user in an arrangement of recommendation algorithms generating the recommendation candidate information by collecting history relation information between users and generating a history relation information set of similar tendencies based on the calculated history relation information, and extracting a predetermined number of recommendation candidate product information as recommendation product information; extracting the similarity of the first m recommended candidate products and the first n users, accumulating the similarity of each user to the m recommended candidate products, obtaining the recommendation scores of the m recommended candidate products, and recommending the product information in the order of the scores. By combining the historical relationship information and the recommendation algorithm arrangement, the purchased products can be recommended and sorted in a similarity accumulation mode when a plurality of users extracting approximate historical behaviors through big data generate new product purchasing behaviors, so that possible purchasing behaviors of the predicted users are generated, and the recommended products are sorted. Referring to fig. 1 and 2, the present invention improves the prior art by abstracting the predicted behavior, which is derived from the simple single-user behavior of the original user, into a data set through learning, and then performing behavior prediction. The abstracted behaviors are pre-judged and are presumed as the next intention of the current entity user, so that the judgment of a plurality of historical user behaviors is reduced, on one hand, the calculation loss is reduced, and the method is more efficient. On the other hand, obviously, the stability of the obtained abstract behavior prejudgment is higher through learning and clustering.
In order to further optimize the technical scheme, the adopted measures further comprise:
the historical relationship information includes: browsing, purchasing, searching, collecting operation and order adding; the recommendation candidate information contains product name, product introduction, and purchase information.
Historical relationship information is stored in a user profile, context data for the user may be accessed, the context data indicating user activity; the user profile corresponding to the activities of the user is created through induction, comparison, clustering and the like, and the habit and the intention which are similar to each other among the users are evaluated.
The user similarity is a similarity between the contextual data in the user profile of the first user and the contextual data in the user profile of the second user. The similarity can be calculated by adopting a classical text or data similarity comparison method and then carrying out normalization.
User activities include browsing, purchasing, searching, collecting operations, joining orders and corresponding product names, product introductions, purchasing information. The records of the user activities are used for comparing the similarity between the users, under the condition that the departure times of the common behaviors are more, the corresponding users are gradually classified as users with similar behaviors and intentions, the corresponding users are used for predicting the behaviors which may occur to new (current) users with similar characteristics, and then the product information related to the behaviors which may occur is recommended to the new (current) users.
Historical relationship information is typically stored in a user profile, and context data for the user may be accessed, the context data indicating user activity (e.g., browsing, joining an order). The user profile may determine an activity of the user and may thereby indicate that at least a portion of the user profile corresponds to the determined activity of the user. User profiles having corresponding activities of the users can be created by induction, comparison, clustering, etc., and habits and intentions having similar behaviors and intentions among the users can be evaluated. This may have the effect of creating a plurality of user profiles (e.g. taste profiles) corresponding to a plurality of activities engaged in or performed by the user, respectively. For example, when a user conducts a merchandise search, the user profile may describe the user, the user behavior, the information of the purchased merchandise.
The historical relationship information extraction pseudo code is as follows:
input given data set (i.e., set of standard user history data) X ═ X1,x2,…,xN]TWherein x ∈ RdD is a characteristic dimension, and the set of historical relationship information Y corresponding to a given dataset X is [ Y ═ Y1,y2,…,yN]TThe test set (i.e., the set of current user history data) T ═ T1,t2,…,tM]TWherein, tM∈RdD is the sample characteristic dimension, yiE {1,2, C }; parameter k in process of constructing computing network by using KNN algorithm1Truncation distance dc, width parameter σ, parameter k in KNN algorithm when implementing physical similarity assumptions2A damping coefficient lambda, a maximum iteration number H and a threshold theta when an iteration termination condition is met.
And outputting a computation history relation information set aiming at the test set.
Training phase
And Step1, constructing a computing network Q according to a given training set X and a corresponding historical relationship information set Y by using a KNN algorithm.
Step2 calculation of the concentration ρ of each node in the network Q by using the equation (7)j
Step3 respective use formula(6) Equation (5) calculates the weight ε of the ith node in subnetwork qc iAnd the weight ε of the subnetwork qc
Step4, setting h to 0;
step5 looping until an iteration end condition is met
Figure BDA0003450294090000051
Or H > H.
Step5.1:h=h+1;
Step5.2, circulating until j is N;
step5.2.1 updating node impact factor In using equation (1)h j
Calculation phase
Step6, setting M to 1, and circulating the program until M is larger than M.
Step6.1 determining test samples T in the test set T by applying similarity hypothesis through KNN algorithm by using formula (2)mV set of neighboring nodesi
Step6.2 calculating the maximum probability c using the equations (5) and (4)*Then according to c*Will determine the test sample tmAnd t is the historical relationship information typemClassification into class c;
Step6.3:m=m+1
step7 outputting a set of computation history relation information [ y ] for the test set T1,y2,…,yM]
And Step8, exporting the user information in the historical relationship information set to generate a similar tendency user association information set.
End up
The calculation history relation information contains
Ini hRepresenting the impact factor of the node i during the h-th iteration.
Figure BDA0003450294090000052
vtKnn (t), formula (2)
In formula (1), when h is 0, the moietyThe concentration of the dots is set as the influence factor of the node, i.e.
Figure BDA0003450294090000053
Representing the impact factor of the node i during the h-th iteration. deiRepresenting the degree of departure of node i, i.e. directed edge eijThe number of (2).
In formula (2), vtK sets of neighbor nodes, v, representing ttThe nodes in (a) may be from different sub-networks qcC is more than or equal to 1 and less than or equal to C. To be able to implement the style similarity assumption from a data point of view, we will implement the maximum probability c of t for the c-th class (i.e. sub-network qc)*=arg maxcΨcFormula (3); ΨcRepresenting the probability that t belongs to the c-th category as measured from a stylistic perspective.
Defining Ψ by weights of sub-networks and influence factors of nodes in a computational network Qc
Figure BDA0003450294090000061
In the formula (4), the reaction mixture is,
Figure BDA0003450294090000062
representative set vtThe ith neighbor node of t, and
Figure BDA0003450294090000063
as its influencing factor.
The weights defining the sub-network qc are as follows:
Figure BDA0003450294090000064
sub-network qcIs the average of the weights of all nodes it contains, so that the sub-network qcThe weight of the ith node in (1) is:
Figure BDA0003450294090000065
eijrepresenting a subnetwork qcDirected edge from node i to node j, NiRepresenting a directed edge eijThe number of (2).
The local concentrations of the nodes are not equal, and the influence factors of the nodes calculated by using 1/N iteration do not accord with the actual distribution condition of the nodes. Here, the concentration of node j (i.e., corresponding to the jth training sample in the training set) is defined as follows:
Figure BDA0003450294090000071
n represents the total number of nodes in the computational network Q, djlRepresenting the Euclidean distance between the node j and the node l, dc representing the truncation distance, and χ (…) representing the distance function when
Figure BDA0003450294090000072
When χ (…) ═ 1, conversely χ (…) ═ 0.
The invention also discloses a computer program of the product recommendation sorting algorithm based on the big data and a storage medium for storing the computer program.
While the invention has been described in connection with a preferred embodiment, it is not intended to limit the invention, and it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the spirit and scope of the invention.

Claims (7)

1. The product recommendation sorting algorithm based on big data is characterized in that: the recommendation system comprises a database, a recommendation server and a recommendation server, wherein the database is used for generating recommendation candidate information comprising products expected by users by using a big data analysis algorithm installed in advance, generating a similar tendency historical relationship information set by collecting historical relationship information among the users and based on the calculated historical relationship information, recommending candidate products according to the arrangement of the recommendation algorithm generating the recommendation candidate information and the similar tendency historical relationship information set of the users, and extracting a preset amount of recommendation candidate product information as recommendation product information; extracting the similarity of the previous m recommended candidate products and the previous n users, accumulating the similarity of each user to the m recommended candidate products, obtaining the recommendation scores of the m recommended candidate products, and recommending the product information in the order of the scores.
2. The big-data based product recommendation ranking algorithm of claim 1, wherein: the historical relationship information comprises: browsing, purchasing, searching, collecting operation and order adding; the recommendation candidate information contains product name, product introduction, and purchase information.
3. The big-data based product recommendation ranking algorithm of claim 1, wherein: said historical relationship information is stored in a user profile, context data of the user is accessible, the context data being indicative of user activity; the user profile corresponding to the activities of the user is created through induction, comparison, clustering and the like, and the habit and the intention which are similar to each other among the users are evaluated.
4. The big-data based product recommendation ranking algorithm of claim 3 wherein: the user similarity is the similarity between the contextual data in the user profile of the first user and the contextual data in the user profile of the second user.
5. The big-data based product recommendation ranking algorithm of claim 3 wherein: the user activities comprise browsing, purchasing, searching, collecting operation, order adding and corresponding product names, product introduction and purchasing information.
6. A computer program implementing a big-data based product recommendation ranking algorithm according to claim 1.
7. A storage medium storing the computer program of claim 6.
CN202111674049.1A 2021-12-31 2021-12-31 Product recommendation sorting algorithm based on big data Pending CN114297521A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111674049.1A CN114297521A (en) 2021-12-31 2021-12-31 Product recommendation sorting algorithm based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111674049.1A CN114297521A (en) 2021-12-31 2021-12-31 Product recommendation sorting algorithm based on big data

Publications (1)

Publication Number Publication Date
CN114297521A true CN114297521A (en) 2022-04-08

Family

ID=80974892

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111674049.1A Pending CN114297521A (en) 2021-12-31 2021-12-31 Product recommendation sorting algorithm based on big data

Country Status (1)

Country Link
CN (1) CN114297521A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116777529A (en) * 2023-08-11 2023-09-19 腾讯科技(深圳)有限公司 Object recommendation method, device, equipment, storage medium and program product

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116777529A (en) * 2023-08-11 2023-09-19 腾讯科技(深圳)有限公司 Object recommendation method, device, equipment, storage medium and program product
CN116777529B (en) * 2023-08-11 2024-02-06 腾讯科技(深圳)有限公司 Object recommendation method, device, equipment, storage medium and program product

Similar Documents

Publication Publication Date Title
Zahra et al. Novel centroid selection approaches for KMeans-clustering based recommender systems
Bagher et al. User trends modeling for a content-based recommender system
CN102053971B (en) Recommending method and equipment for sequencing-oriented collaborative filtering
CN107451894B (en) Data processing method, device and computer readable storage medium
Yang et al. Venue recommendation: Submitting your paper with style
US20150066904A1 (en) Integrating and extracting topics from content of heterogeneous sources
CN106548255A (en) A kind of Method of Commodity Recommendation based on mass users behavior
US20160170993A1 (en) System and method for ranking news feeds
Han Personalized news recommendation and simulation based on improved collaborative filtering algorithm
Lumauag et al. An enhanced recommendation algorithm based on modified user-based collaborative filtering
CN110727872A (en) Method and device for mining ambiguous selection behavior based on implicit feedback
Wasid et al. Multi-criteria clustering-based recommendation using Mahalanobis distance
Hamada et al. A fuzzy-based approach for modelling preferences of users in multi-criteria recommender systems
CN111209469A (en) Personalized recommendation method and device, computer equipment and storage medium
CN114297521A (en) Product recommendation sorting algorithm based on big data
Li et al. Multidimensional clustering based collaborative filtering approach for diversified recommendation
Zhao et al. Improving top-N recommendation performance using missing data
Ifada et al. Do-rank: DCG optimization for learning-to-rank in tag-based item recommendation systems
Gao et al. Construction of digital marketing recommendation model based on random forest algorithm
Regi et al. A survey on recommendation techniques in E-Commerce
CN114638316A (en) Data clustering method, device and equipment
CN113988149A (en) Service clustering method based on particle swarm fuzzy clustering
CN114329167A (en) Hyper-parameter learning, intelligent recommendation, keyword and multimedia recommendation method and device
Gangwar et al. An adaptive boosting technique to mitigate popularity bias in recommender system
Menendez Clustering: finding patterns in the darkness

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination