CN108921670A - A kind of potential interest of fusion user, the Drug trading recommended method of space-time data and classification popularity - Google Patents
A kind of potential interest of fusion user, the Drug trading recommended method of space-time data and classification popularity Download PDFInfo
- Publication number
- CN108921670A CN108921670A CN201810724191.4A CN201810724191A CN108921670A CN 108921670 A CN108921670 A CN 108921670A CN 201810724191 A CN201810724191 A CN 201810724191A CN 108921670 A CN108921670 A CN 108921670A
- Authority
- CN
- China
- Prior art keywords
- user
- drug
- matrix
- medicine
- category
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000003814 drug Substances 0.000 title claims abstract description 231
- 229940079593 drug Drugs 0.000 title claims abstract description 133
- 238000000034 method Methods 0.000 title claims abstract description 28
- 230000004927 fusion Effects 0.000 title claims abstract description 8
- 239000011159 matrix material Substances 0.000 claims abstract description 150
- 238000000354 decomposition reaction Methods 0.000 claims abstract description 21
- 238000004422 calculation algorithm Methods 0.000 claims description 12
- 238000013519 translation Methods 0.000 claims description 6
- 238000004364 calculation method Methods 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 238000004091 panning Methods 0.000 claims description 3
- 238000005295 random walk Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0631—Item recommendations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Mathematical Physics (AREA)
- Development Economics (AREA)
- Mathematical Analysis (AREA)
- Entrepreneurship & Innovation (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- General Business, Economics & Management (AREA)
- Educational Administration (AREA)
- Marketing (AREA)
- Computational Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Quality & Reliability (AREA)
- Game Theory and Decision Science (AREA)
- Algebra (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Operations Research (AREA)
- Tourism & Hospitality (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of potential interest of fusion user, the Drug trading recommended method of space-time data and classification popularity, including obtaining the purchaser record data of user's drug purchase from the data set of electric business platform, and purchaser record data are arranged to obtain user-drug rating matrix;Purchaser record based on similar users in purchaser record data establishes the potential interest model of user, and obtains the potential interesting data of user based on the potential interest model of user;The potential interesting data of user is merged into user-drug rating matrix;The popularity for the drug generic bought based on user in purchaser record data and user establish classification correlation model to the preference of the category;Matrix decomposition is carried out to the user-drug rating matrix for incorporating the potential interesting data of user, and obtained user preference prediction matrix and classification correlation model progress linear fusion generation recommendation list will be decomposed.The present invention solves the problems, such as that rating matrix sparsity impacts recommendation efficiency in the prior art.
Description
Technical Field
The invention relates to the technical field of computers, in particular to a medicine transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of a user.
Background
In recent years, electronic commerce has been actively carried out with the development of internet and information technology, and more consumers have started shopping online. Electronic commerce not only opens up a new commercial profit channel, but also subverts the traditional sales mode, and endows more convenience and autonomy to both sides of the transaction from space and time. In particular, medicine, as a necessity of daily life, has gradually come into the field of electronic commerce in recent years, and more pharmaceutical enterprises have acquired the qualification of establishing a pharmaceutical electronic commerce platform, and the development prospect of electronic commerce in the pharmaceutical industry is clear.
Because the medicine e-commerce platform contains multiple types and large quantities of medicines, a user needs to spend a large amount of time and energy to screen out the required medicines, and the user experience of the platform is greatly reduced. In order to solve the problem that the user experience is poor due to the fact that the user consumes too much time in massive medicines, it is necessary to introduce a personalized recommendation technology into a medicine e-commerce platform.
In the medical e-commerce platform, due to the particularity of medicines, the number of the medicines scored by a user is far lower than that of the articles (music and movies) scored by the user in the traditional recommendation, a user-medicine scoring matrix is quite sparse, and the recommendation of the medical e-commerce platform faces a more serious data cold start problem than the traditional recommendation.
In the face of massive and diversified medicines in the field of medical and e-commerce, how to design an excellent recommendation algorithm to provide accurate recommendation for users is a puzzling problem. Currently, some recommendation algorithms exist in the field, but most of the algorithms are performed on an original user-medicine scoring matrix, and the sparsity of the scoring matrix is greatly influenced.
Therefore, how to effectively alleviate the influence of the sparsity of the scoring matrix on the recommendation efficiency is an urgent problem to be solved.
Disclosure of Invention
In view of the above, the invention provides a medicine transaction recommendation method fusing user potential interest, spatio-temporal data and category popularity, which learns the user potential interest through historical purchase data of a user, and then fills the user potential interest into a user-medicine scoring matrix, thereby effectively solving the problem that the scoring matrix sparsity affects the recommendation efficiency in the prior art.
In order to achieve the above object, the present invention provides a drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity, the method comprising the steps of:
s1, acquiring purchase record data of the medicine purchased by the user from the data set of the E-commerce platform, and sorting the purchase record data to obtain a user-medicine scoring matrix;
s2, establishing a user potential interest model based on the purchase records of similar users in the purchase record data, and obtaining user potential interest data based on the user potential interest model;
s3, merging the potential interest data of the user into a user-medicine scoring matrix; the influence of the matrix sparsity on the recommendation result is relieved, and the recommendation efficiency is improved;
s4, establishing a category correlation model based on the popularity of the category of the medicine purchased by the user in the purchase record data and the preference of the user for the category;
and S5, performing matrix decomposition on the user-medicine scoring matrix combined with the potential interest data of the user, and performing linear fusion on the user preference prediction matrix obtained by decomposition and the category correlation model in the step S4 to generate a recommendation list.
Preferably, the step S1 includes the steps of:
s1-1, collating the purchase record data of the user, the purchase record data including the user' S score, time of purchase, and type of medicine, and obtaining a user set U ═ { U ═1,u2,...,ui...,unD ═ D } and drug set1,d2,...,dj...,dmU represents a user, i represents an ID of the user; d represents a drug, j represents the ID of the drug;
s1-2, counting the number of the medicines purchased and scored by each user, and if the number of the medicines purchased and scored by the user is lower than a preset value, deleting the user; to obtain a user containing sufficient user information;
s1-3, counting the times of purchasing and grading each medicine, and if the frequency of purchasing the medicines is lower than a preset value, deleting the related records of the medicines; due to the loss of data, noise is easy to occur;
and S1-4, obtaining an original user-medicine scoring matrix based on the sorted purchase record data.
Preferably, the step S2 includes the steps of:
s2-1, merging similar user set F of time factorsi:
1) Dividing a year into T discrete time periods by adopting a time discretization method, and dividing the original user-medicine scoring matrix in the step S1 into T time periods-user-medicine scoring matrices according to the purchasing scoring time;
2) given a target user i, defining a scoring vector of the user i in a time period T (T ∈ T) as: r isi,t={ri,t,1,ri,t,2,..ri,t,mWherein r isi,t,mIndicating the value of the user i's score for the drug m over time period t. For a user i, calculating the time interval t of the user in any two time intervalspAnd tqScore vector ofAndthen taking the average value of the cosine similarity values of all the users in the two time periods as the similarity of the two time periods, thereby obtaining the similarity of the users between any two time periods in the discrete time period;
3) representing the similarity of all users between any two discrete time periods as a time period similarity matrix TS, and translating the time period-user-medicine scoring matrix by using the time period similarity matrix TS, wherein the specific translation formula is as follows:
wherein,is the new time period-user-drug scoring matrix to be used for calculation obtained after the panning;is to indicate the periods t and t*Time interval similarity of (d), t*∈[1,T];Is the user i's score for drug j over time period t;
then, the matrix after translation is used for calculating the similarity of the users, and s users with the highest similarity are obtained for the user i and serve as similar users Fi;
S2-2, based on similar users FiObtaining potential interest data of a user:
for user i, the step is similar user F of the user in S2-1iAnd the medicine purchased but not purchased by the user i is used as the standby potential interest medicine of the user i, and a user potential interest model is established to learn the potential interest of the user, so that the potential interest data of the user is obtained.
Preferably, the step S3 includes the steps of:
s3-1, filling the user potential interest data into the original user-medicine scoring matrix in the step S1, and for each user i, dividing medicines into three categories: diIs a collection of drugs purchased by the user; piIs a set of potential purchases of drugs by the user; u shapeiIs a set of not purchased and not potentially purchased drugs by the user, the original user-drug scoring matrix is transformed into a new scoring matrix and weighting matrix:
wherein NewR is a new scoring matrix, NewRi,jRepresents the user i's score for drug j; NewW is a new weight matrix, NewWi,jPreference of drug j for user i;when the medicine is a potential medicine purchased by the user, the user can use the medicineIs a number between 0 and 1; μ is the tuning parameter, here taken to be 0.3, multiplied by.
Preferably, the step S4 includes the steps of:
s4-1, establishing a scoring matrix B of the user for a certain medicine category through the scoring matrix of the user for the medicine and the type of the medicineN,|C|Where N is the number of users, | C | is the number of types of drugs, each element in the scoring matrix represents the user's score for the category to which the purchased drug belongs;
s4-2, constructing a medicine popularity matrix P|C|,MWhere | C | is the number of types of drugs, M is the number of drugs, each element in the drug popularity matrix represents the popularity of the drug in the category to which it belongs, and the number of times a drug in a category is purchased is used to represent the popularity of the drug in the category;
s4-3, obtaining a category-related model of the medicine purchased by the user as follows:
wherein, yi,jRepresenting the grade of the drug j by the user i under the category model; b isi,c∈BN,|C|,Pc,j∈P|C|,M。
Preferably, the step S5 includes the following steps:
s5-1, decomposing the obtained new scoring matrix and the weight matrix by using a matrix decomposition algorithm, wherein an error function in the decomposition process is as follows:
where i denotes a user, j denotes a medicine, N denotes the number of users, M denotes the number of medicines,the product of the user implicit factor matrix and the drug implicit factor matrix vector represents the score of the user i on the drug j; gamma represents the weight of the user and the drug; | U | represents a user hidden factor matrix, | D | represents a drug hidden factor matrix,the square of the frobenius norm representing the user hidden factor matrix,a square of a Frobenius norm representing a drug hidden factor matrix;
s5-2, decomposing the new scoring matrix and the new weighting matrix to obtain a user hidden feature matrix and a medicine hidden feature matrix, multiplying the two matrixes obtained after decomposition to obtain a user preference prediction matrix, and then combining the user preference prediction matrix and the category correlation model to obtain a final recommendation model as follows:
wherein,is the user i's score for drug j;the product of the updated user hidden factor matrix and the medicine hidden factor matrix vector represents the prediction score of the user i on the medicine j; y isi,jRepresenting the grade of the drug j by the user i under the category model; oc means proportional to; denotes multiplication.
S5-3, according toThe magnitude of the score values is sorted in order,and then selecting the medicines with the score values ranked from high to low to generate a recommendation list, and recommending the recommendation list to the user.
In summary, the invention discloses a medicine transaction recommendation method fusing the potential interest, the time-space data and the category popularity of a user, which comprises the steps of firstly obtaining the purchase record data of medicines purchased by the user from the data set of an e-commerce platform, and sorting the purchase record data to obtain a user-medicine scoring matrix; then, establishing a user potential interest model based on the purchase records of similar users in the purchase record data, and obtaining user potential interest data based on the user potential interest model; then merging the potential interest data of the users into a user-medicine scoring matrix; establishing a category correlation model based on the popularity of the category of the medicine purchased by the user in the purchase record data and the preference of the user to the category; and finally, performing matrix decomposition on the user-medicine scoring matrix combined with the user potential interest data, and performing linear fusion on the user preference prediction matrix obtained by decomposition and the category correlation model to generate a recommendation list. According to the method and the device, the potential interest of the user is learned through the historical purchase data of the user, and then the potential interest of the user is filled into the user-medicine scoring matrix, so that the problem that the sparsity of the scoring matrix influences the recommendation efficiency in the prior art is effectively solved.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a basic flow chart of a method for recommending a drug transaction according to the present invention, which combines the user's potential interest, spatiotemporal data and category popularity;
FIG. 2 is a schematic diagram of a learning algorithm of potential interest of a user according to the present disclosure;
FIG. 3 is a schematic diagram of a process for creating a category-dependent model according to the present disclosure.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the description of the present invention, it is to be understood that the terms "longitudinal", "lateral", "upper", "lower", "front", "rear", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", and the like, indicate orientations or positional relationships based on those shown in the drawings, and are used merely for convenience of description and for simplicity of description, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed in a particular orientation, and be operated, and thus, are not to be construed as limiting the present invention.
In the description of the present invention, unless otherwise specified and limited, it is to be noted that the terms "mounted," "connected," and "connected" are to be interpreted broadly, and may be, for example, a mechanical connection or an electrical connection, a communication between two elements, a direct connection, or an indirect connection via an intermediate medium, and specific meanings of the terms may be understood by those skilled in the art according to specific situations.
The invention provides a medicine transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of a user, which comprises the following steps as shown in figures 1-3:
s1, acquiring purchase record data of the medicine purchased by the user from the data set of the E-commerce platform, and sorting the purchase record data to obtain a user-medicine scoring matrix;
s2, establishing a user potential interest model based on the purchase records of similar users in the purchase record data, and obtaining user potential interest data based on the user potential interest model;
s3, merging the potential interest data of the user into a user-medicine scoring matrix; the influence of the matrix sparsity on the recommendation result is relieved, and the recommendation efficiency is improved;
s4, establishing a category correlation model based on the popularity of the category of the medicine purchased by the user in the purchase record data and the preference of the user for the category;
and S5, performing matrix decomposition on the user-medicine scoring matrix combined with the potential interest data of the user, and performing linear fusion on the user preference prediction matrix obtained by decomposition and the category correlation model in the step S4 to generate a recommendation list.
Preferably, step S1 includes the steps of:
s1-1, collating the purchase record data of the user, the purchase record data including the user' S score, time of purchase, and type of medicine, and obtaining a user set U ═ { U ═1,u2,...,ui...,unD ═ D } and drug set1,d2,...,dj...,dmU represents a user, i represents an ID of the user; d represents a drug, j represents the ID of the drug;
s1-2, counting the number of the medicines purchased and scored by each user, and if the number of the medicines purchased and scored by the user is lower than a preset value, deleting the user; to obtain a user containing sufficient user information;
s1-3, counting the times of purchasing and grading each medicine, and if the frequency of purchasing the medicines is lower than a preset value, deleting the related records of the medicines; due to the loss of data, noise is easy to occur;
and S1-4, obtaining an original user-medicine scoring matrix based on the sorted purchase record data.
Preferably, step S2 includes the steps of:
s2-1, merging similar user set F of time factorsi:
1) Dividing a year into T discrete time periods by adopting a time discretization method, and dividing the original user-medicine scoring matrix in the step S1 into T time periods-user-medicine scoring matrices according to the purchasing scoring time;
2) given a target user i, defining a scoring vector of the user i in a time period T (T ∈ T) as: r isi,t={ri,t,1,ri,t,2,..ri,t,mIn which r isi,t,mIndicating the value of the user i's score for the drug m over time period t. For a user i, calculating the time interval t of the user in any two time intervalspAnd tqScore vector ofAndthen taking the average value of the cosine similarity values of all the users in the two time periods as the similarity of the two time periods, thereby obtaining the similarity of the users between any two time periods in the discrete time period;
3) representing the similarity of all users between any two discrete time periods as a time period similarity matrix TS, and translating the time period-user-medicine scoring matrix by using the time period similarity matrix TS, wherein the specific translation formula is as follows:
wherein,is the new time period-user-drug scoring matrix to be used for calculation obtained after the panning;is to indicate the periods t and t*Time interval similarity of (d), t*∈[1,T];Is the user i's score for drug j over time period t.
Then, the matrix after translation is used for calculating the similarity of the users, and s users with the highest similarity are obtained for the user i and serve as similar users Fi;
S2-2, based on similar users FiObtaining potential interest data of a user:
for user i, the step is similar user F of the user in S2-1iAnd the medicine purchased but not purchased by the user i is used as the standby potential interest medicine of the user i, and a user potential interest model is established to learn the potential interest of the user, so that the potential interest data of the user is obtained.
Preferably, step S3 includes the steps of:
s3-1, filling the user potential interest data into the original user-medicine scoring matrix in the step S1, and for each user i, dividing medicines into three categories: diIs a collection of drugs purchased by the user; piIs a set of potential purchases of drugs by the user; u shapeiIs a set of not purchased and not potentially purchased drugs by the user, the original user-drug scoring matrix is transformed into a new scoring matrix and weighting matrix:
wherein NewR is a new scoring matrix, NewRi,jRepresents the user i's score for drug j; NewW is a new weight matrix, NewWi,jPreference of drug j for user i;when the medicine is a potential purchased medicine of the user, the score of the user for the medicine is a numerical value between 0 and 1; μ is the tuning parameter, here taken to be 0.3, multiplied by.
Preferably, step S4 includes the steps of:
s4-1, establishing a scoring matrix B of the user for a certain medicine category through the scoring matrix of the user for the medicine and the type of the medicineN,|C|Where N is the number of users, | C | is the number of types of drugs, each element in the scoring matrix represents the user's score for the category to which the purchased drug belongs;
s4-2, constructing a medicine popularity matrix P|C|,MWhere | C | is the number of types of drugs, M is the number of drugs, each element in the drug popularity matrix represents the popularity of the drug in the category to which it belongs, and the number of times a drug in a category is purchased is used to represent the popularity of the drug in the category;
s4-3, obtaining a category-related model of the medicine purchased by the user as follows:
wherein, yi,jRepresenting the grade of the drug j by the user i under the category model; b isi,c∈BN,|C|,Pc,j∈P|C|,M。
Preferably, step S5 includes the following steps:
s5-1, decomposing the obtained new scoring matrix and the weight matrix by using a matrix decomposition algorithm, wherein an error function in the decomposition process is as follows:
where i denotes a user, j denotes a medicine, N denotes the number of users, M denotes the number of medicines,the product of the user implicit factor matrix and the drug implicit factor matrix vector represents the score of the user i on the drug j; gamma represents the weight of the user and the drug; | U | represents a user hidden factor matrix, | D | represents a drug hidden factor matrix,the square of the frobenius norm representing the user hidden factor matrix,the square of the Frobenius norm representing the drug hidden factor matrix.
S5-2, decomposing the new scoring matrix and the new weighting matrix to obtain a user hidden feature matrix and a medicine hidden feature matrix, multiplying the two matrixes obtained after decomposition to obtain a user preference prediction matrix, and then combining the user preference prediction matrix and the category correlation model to obtain a final recommendation model as follows:
wherein,is the user i's score for drug j;the product of the updated user hidden factor matrix and the medicine hidden factor matrix vector represents the prediction score of the user i on the medicine j; y isi,jRepresenting the grade of the drug j by the user i under the category model; oc means proportional to; denotes multiplication.
S5-3, according toThe size of the score values is sorted, and then the medicines with the score values ranked from large to small at the top k are selected to generate a recommendation list.
In summary, the invention discloses a medicine transaction recommendation method fusing the potential interest, the time-space data and the category popularity of a user, which comprises the steps of firstly obtaining the purchase record data of medicines purchased by the user from the data set of an e-commerce platform, and sorting the purchase record data to obtain a user-medicine scoring matrix; then, establishing a user potential interest model based on the purchase records of similar users in the purchase record data, and obtaining user potential interest data based on the user potential interest model; then merging the potential interest data of the users into a user-medicine scoring matrix; establishing a category correlation model based on the popularity of the category of the medicine purchased by the user in the purchase record data and the preference of the user to the category; and finally, performing matrix decomposition on the user-medicine scoring matrix combined with the user potential interest data, and performing linear fusion on the user preference prediction matrix obtained by decomposition and the category correlation model to generate a recommendation list. According to the method and the device, the potential interest of the user is learned through the historical purchase data of the user, and then the potential interest of the user is filled into the user-medicine scoring matrix, so that the problem that the sparsity of the scoring matrix influences the recommendation efficiency in the prior art is effectively solved.
Specifically, in the above embodiment, the user potential interest model is established in step S2-2 to learn the user potential interest, and the user potential interest model can specifically be learned by the following two selection algorithms:
the first selection algorithm is a maximum value selection strategy, which represents the preference of the user by using the similar user of the target user i who purchased the medicine j with the highest similarity to the target user, and the linear model is represented as follows:
wherein, pri,jIndicating the user i's score for the drug j,is the similarity of the preference of the user i and the related users for the medicine j, F belongs to FiAre relevant users of user i.
The second selection algorithm is a meta-path selection strategy in the heterogeneous network G<V,E,A>In (1), V is a set of nodes, E is a set of edges, and A is a set of node categories. A meta path is defined as a path of the formWherein A isi∈A,RiRepresenting relationships existing between nodes, RiE.g. { U-U, U-D, D-D }. Then for this meta path P, if there is an instance path P ═ { v ═ v1,v2...vn+1Is the instance of the meta-path, and all such instance paths are defined as instance paths P' of the meta-path P. For each instance path, the paper defines a feature value concept for describing the node v1And vn+1Is denoted cor (p), then the feature value of the meta-path is the sum of the feature values of all the instance paths, denoted as:
for example path p ═ a1,a2...an+1},a1E.g. U is a user node, an+1e.D is the node of the drug, other aiIs an intermediate node in the instance path. Indicating the p start node of the pathThe degree of association between points cor (p) is the idea of random walk, assuming an object from node a1Starting from random walk in the network, defining cor (p) as object to walk to node a according to example path pn+1Since each of the random walks are assumed to be independent of each other. The probability of the object walking according to p is equal to the product of the probabilities of each step of walking, and the calculation formula is as follows
Wherein Pro (a)i,ai+1) Representing the slave node a in the random walk processiDirectly to node ai+1The probability of (c). In a heterogeneous network, its formula is defined as:
wherein N (a)i) Is represented byi+1Node types of consistent type.
The end user interests are expressed as:
pri,j=Eig(Pi,j)
and finally, obtaining potential interest points of the target user.
Specifically, in the above embodiment, the matrix decomposition algorithm in step S5-1 adopts the following pseudo code of the hidden matrix learning algorithm:
it should be noted that the system structures or method flows shown in fig. 1 to fig. 3 of the present invention are only some preferred embodiments of the present invention, and the illustration is only for the convenience of understanding the present invention and is not to be construed as a limitation of the present invention.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
While embodiments of the invention have been shown and described, it will be understood by those of ordinary skill in the art that: various changes, modifications, substitutions and alterations can be made to the embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.
Claims (6)
1. A drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity is characterized by comprising the following steps:
s1, acquiring purchase record data of the medicine purchased by the user from the data set of the E-commerce platform, and sorting the purchase record data to obtain a user-medicine scoring matrix;
s2, establishing a user potential interest model based on the purchase records of similar users in the purchase record data, and obtaining user potential interest data based on the user potential interest model;
s3, merging the potential interest data of the user into a user-medicine scoring matrix;
s4, establishing a category correlation model based on the popularity of the category of the medicine purchased by the user in the purchase record data and the preference of the user for the category;
and S5, performing matrix decomposition on the user-medicine scoring matrix combined with the potential interest data of the user, and performing linear fusion on the user preference prediction matrix obtained by decomposition and the category correlation model in the step S4 to generate a recommendation list.
2. The drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity according to claim 1, wherein the step S1 comprises the steps of:
s1-1, collating the purchase record data of the user, the purchase record data including the user' S score, time of purchase, and type of medicine, and obtaining a user set U ═ { U ═1,u2,...,ui...,unD ═ D } and drug set1,d2,...,dj...,dmU represents a user, i represents an ID of the user; d represents a drug, j represents the ID of the drug;
s1-2, counting the number of the medicines purchased and scored by each user, and if the number of the medicines purchased and scored by the user is lower than a preset value, deleting the user;
s1-3, counting the times of purchasing and grading each medicine, and if the frequency of purchasing the medicines is lower than a preset value, deleting the related records of the medicines;
and S1-4, obtaining a user-drug scoring matrix based on the sorted purchase record data.
3. The drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity according to claim 1, wherein the step S2 comprises the steps of:
s2-1, merging similar user set F of time factorsi:
1) Dividing a year into T discrete time periods by adopting a time discretization method, and dividing the original user-medicine scoring matrix in the step S1 into T time periods-user-medicine scoring matrices according to the purchasing scoring time;
2) given a target user i, defining a scoring vector of the user i in a time period T (T ∈ T) as: r isi,t={ri,t,1,ri,t,2,..ri,t,mIn which r isi,t,mThe score value of the medicine m of the user i in the time period t is represented, and for the user i, the user i is calculated in any two time periods tpAnd tqScore vector ofAndthen taking the average value of the cosine similarity values of all the users in the two time periods as the similarity of the two time periods, thereby obtaining the similarity of the users between any two time periods in the discrete time period;
3) representing the similarity of all users between any two discrete time periods as a time period similarity matrix TS, and translating the time period-user-medicine scoring matrix by using the time period similarity matrix TS, wherein the specific translation formula is as follows:
wherein,is the new time period-user-drug scoring matrix to be used for calculation obtained after the panning;is to indicate the periods t and t*Time interval similarity of (d), t*∈[1,T];Is the user i's score for drug j over time period t;
then, the matrix after translation is used for calculating the similarity of the users, and s users with the highest similarity are obtained for the user i and serve as similar users Fi;
S2-2, based on similar users FiObtaining potential interest data of a user:
for user i, the step is similar user F of the user in S2-1iAnd the medicine purchased but not purchased by the user i is used as the standby potential interest medicine of the user i, and a user potential interest model is established to learn the potential interest of the user, so that the potential interest data of the user is obtained.
4. The drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity according to claim 1, said step S3 comprising the steps of:
s3-1, filling the user potential interest data into the user-medicine scoring matrix in the step S1, and for each user i, dividing medicines into three categories: diIs a collection of drugs purchased by the user; piIs a set of potential purchases of drugs by the user; u shapeiIs a set of not purchased and not potentially purchased drugs by the user, the original user-drug scoring matrix is transformed into a new scoring matrix and weighting matrix:
wherein NewR is a new scoring matrix, NewRi,jRepresents the user i's score for drug j; NewW is a new weight matrix, NewWi,jPreference of drug j for user i;is when the medicine is the userWhen a user purchases a drug, the user's score for the drug is a value between 0 and 1; μ is the tuning parameter.
5. The drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity according to claim 1, wherein the step S4 comprises the steps of:
s4-1, establishing a scoring matrix B of the user for a certain medicine category through the scoring matrix of the user for the medicine and the type of the medicineN,|C|Where N is the number of users, | C | is the number of types of drugs, each element in the scoring matrix represents the user's score for the category to which the purchased drug belongs;
s4-2, constructing a medicine popularity matrix P|C|,MWhere | C | is the number of types of drugs, M is the number of drugs, each element in the drug popularity matrix represents the popularity of the drug in the category to which it belongs, and the number of times a drug in a category is purchased is used to represent the popularity of the drug in the category;
s4-3, obtaining a category-related model of the medicine purchased by the user as follows:
wherein, yi,jRepresenting the grade of the drug j by the user i under the category model; b isi,c∈BN,|C|,Pc,j∈P|C|,M。
6. The drug transaction recommendation method fusing user potential interest, spatiotemporal data and category popularity according to claim 1, said step S5 comprising the steps of:
s5-1, decomposing the obtained new scoring matrix and the weight matrix by using a matrix decomposition algorithm, wherein an error function in the decomposition process is as follows:
where i denotes a user, j denotes a medicine, N denotes the number of users, M denotes the number of medicines,the product of the user implicit factor matrix and the drug implicit factor matrix vector represents the score of the user i on the drug j; gamma represents the weight of the user and the drug; | U | represents a user hidden factor matrix, | D | represents a drug hidden factor matrix,the square of the frobenius norm representing the user hidden factor matrix,a square of a Frobenius norm representing a drug hidden factor matrix;
s5-2, decomposing the new scoring matrix and the new weighting matrix to obtain a user hidden feature matrix and a medicine hidden feature matrix, multiplying the two matrixes obtained after decomposition to obtain a user preference prediction matrix, and then combining the user preference prediction matrix and the category correlation model to obtain a final recommendation model as follows:
wherein,is the user i's score for drug j;the product of the updated user hidden factor matrix and the medicine hidden factor matrix vector represents the prediction score of the user i on the medicine j; y isi,jRepresenting the grade of the drug j by the user i under the category model; oc means proportional to; denotes multiplication;
s5-3, according toThe size of the score values is sorted, and then the medicines with the score values ranked from large to small at the top k are selected to generate a recommendation list.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810724191.4A CN108921670B (en) | 2018-07-04 | 2018-07-04 | Drug transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of user |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810724191.4A CN108921670B (en) | 2018-07-04 | 2018-07-04 | Drug transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of user |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108921670A true CN108921670A (en) | 2018-11-30 |
CN108921670B CN108921670B (en) | 2022-06-14 |
Family
ID=64424469
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810724191.4A Active CN108921670B (en) | 2018-07-04 | 2018-07-04 | Drug transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of user |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108921670B (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110085292A (en) * | 2019-04-28 | 2019-08-02 | 广东技术师范大学 | Drug recommended method, device and computer readable storage medium |
CN110442797A (en) * | 2019-08-19 | 2019-11-12 | 重庆华医康道科技有限公司 | A kind of internet hospital products configuration optimization method |
CN111311324A (en) * | 2020-02-18 | 2020-06-19 | 电子科技大学 | User-commodity preference prediction system and method based on stable neural collaborative filtering |
CN111325419A (en) * | 2018-12-13 | 2020-06-23 | 北京沃东天骏信息技术有限公司 | Method and device for identifying blacklist user |
CN111564201A (en) * | 2020-05-08 | 2020-08-21 | 深圳市万佳安人工智能数据技术有限公司 | Particle swarm optimization-based intelligent prediction method and device for children diet |
CN111815351A (en) * | 2020-05-29 | 2020-10-23 | 杭州览众数据科技有限公司 | Cooperative filtering and association rule-based clothing recommendation method |
CN113221000A (en) * | 2021-05-17 | 2021-08-06 | 上海博亦信息科技有限公司 | Talent data intelligent retrieval and recommendation method |
CN113449210A (en) * | 2021-07-01 | 2021-09-28 | 深圳市数字尾巴科技有限公司 | Personalized recommendation method and device based on space-time characteristics, electronic equipment and storage medium |
CN113569155A (en) * | 2021-07-30 | 2021-10-29 | 西南大学 | Recommendation recall method and system based on improved recurrent neural network algorithm |
CN114881689A (en) * | 2022-04-26 | 2022-08-09 | 驰众信息技术(上海)有限公司 | Building recommendation method and system based on matrix decomposition |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106339502A (en) * | 2016-09-18 | 2017-01-18 | 电子科技大学 | Modeling recommendation method based on user behavior data fragmentation cluster |
CN107463645A (en) * | 2017-07-21 | 2017-12-12 | 雷锤智能科技南京有限公司 | The personalized recommendation system and its recommendation method being oriented to based on user property scoring |
US20180075512A1 (en) * | 2016-09-13 | 2018-03-15 | Adobe Systems Incorporated | Item recommendation techniques |
-
2018
- 2018-07-04 CN CN201810724191.4A patent/CN108921670B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20180075512A1 (en) * | 2016-09-13 | 2018-03-15 | Adobe Systems Incorporated | Item recommendation techniques |
CN106339502A (en) * | 2016-09-18 | 2017-01-18 | 电子科技大学 | Modeling recommendation method based on user behavior data fragmentation cluster |
CN107463645A (en) * | 2017-07-21 | 2017-12-12 | 雷锤智能科技南京有限公司 | The personalized recommendation system and its recommendation method being oriented to based on user property scoring |
Non-Patent Citations (1)
Title |
---|
郁钢等: "基于用户兴趣模型的个性推荐算法", 《智能计算机与应用》 * |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111325419A (en) * | 2018-12-13 | 2020-06-23 | 北京沃东天骏信息技术有限公司 | Method and device for identifying blacklist user |
CN110085292A (en) * | 2019-04-28 | 2019-08-02 | 广东技术师范大学 | Drug recommended method, device and computer readable storage medium |
CN110085292B (en) * | 2019-04-28 | 2022-07-26 | 广东技术师范大学 | Medicine recommendation method and device and computer-readable storage medium |
CN110442797B (en) * | 2019-08-19 | 2022-02-08 | 重庆华医康道科技有限公司 | Internet hospital product configuration optimization method |
CN110442797A (en) * | 2019-08-19 | 2019-11-12 | 重庆华医康道科技有限公司 | A kind of internet hospital products configuration optimization method |
CN111311324A (en) * | 2020-02-18 | 2020-06-19 | 电子科技大学 | User-commodity preference prediction system and method based on stable neural collaborative filtering |
CN111311324B (en) * | 2020-02-18 | 2022-05-20 | 电子科技大学 | User-commodity preference prediction system and method based on stable neural collaborative filtering |
CN111564201A (en) * | 2020-05-08 | 2020-08-21 | 深圳市万佳安人工智能数据技术有限公司 | Particle swarm optimization-based intelligent prediction method and device for children diet |
CN111815351A (en) * | 2020-05-29 | 2020-10-23 | 杭州览众数据科技有限公司 | Cooperative filtering and association rule-based clothing recommendation method |
CN111815351B (en) * | 2020-05-29 | 2024-06-21 | 杭州览众数据科技有限公司 | Clothing recommendation method based on collaborative filtering and association rules |
CN113221000A (en) * | 2021-05-17 | 2021-08-06 | 上海博亦信息科技有限公司 | Talent data intelligent retrieval and recommendation method |
CN113221000B (en) * | 2021-05-17 | 2023-02-28 | 上海博亦信息科技有限公司 | Talent data intelligent retrieval and recommendation method |
CN113449210A (en) * | 2021-07-01 | 2021-09-28 | 深圳市数字尾巴科技有限公司 | Personalized recommendation method and device based on space-time characteristics, electronic equipment and storage medium |
CN113449210B (en) * | 2021-07-01 | 2023-01-31 | 深圳市数字尾巴科技有限公司 | Personalized recommendation method and device based on space-time characteristics, electronic equipment and storage medium |
CN113569155A (en) * | 2021-07-30 | 2021-10-29 | 西南大学 | Recommendation recall method and system based on improved recurrent neural network algorithm |
CN113569155B (en) * | 2021-07-30 | 2022-05-03 | 西南大学 | Recommendation recall method and system based on improved recurrent neural network algorithm |
CN114881689A (en) * | 2022-04-26 | 2022-08-09 | 驰众信息技术(上海)有限公司 | Building recommendation method and system based on matrix decomposition |
Also Published As
Publication number | Publication date |
---|---|
CN108921670B (en) | 2022-06-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108921670B (en) | Drug transaction recommendation method fusing potential interest, spatio-temporal data and category popularity of user | |
CN111259133B (en) | Personalized recommendation method integrating multiple information | |
Naeem et al. | Pythagorean m-polar fuzzy sets and TOPSIS method for the selection of advertisement mode | |
Alkahtani et al. | E-agricultural supply chain management coupled with blockchain effect and cooperative strategies | |
CN111259263B (en) | Article recommendation method and device, computer equipment and storage medium | |
CN104063481B (en) | A kind of film personalized recommendation method based on the real-time interest vector of user | |
Hu et al. | Movie collaborative filtering with multiplex implicit feedbacks | |
CN109508419A (en) | A kind of recommended method and system of knowledge based study | |
CN104657133B (en) | A kind of motivational techniques for single-time-window task in mobile intelligent perception | |
CN101482884A (en) | Cooperation recommending system based on user predilection grade distribution | |
CN103353880B (en) | A kind of utilization distinctiveness ratio cluster and the data digging method for associating | |
CN103425763B (en) | User based on SNS recommends method and device | |
CN112258260A (en) | Page display method, device, medium and electronic equipment based on user characteristics | |
CN105630742B (en) | Feature vector calculation method and device | |
CN113643103A (en) | Product recommendation method, device, equipment and storage medium based on user similarity | |
CN107292648A (en) | A kind of user behavior analysis method and device | |
CN112347362A (en) | Personalized recommendation method based on graph self-encoder | |
CN108182268A (en) | A kind of collaborative filtering recommending method and system based on community network | |
CN115860880B (en) | Personalized commodity recommendation method and system based on multi-layer heterogeneous graph convolution model | |
CN112328832A (en) | Movie recommendation method integrating labels and knowledge graph | |
CN115186197A (en) | User recommendation method based on end-to-end hyperbolic space | |
CN115860875A (en) | Commodity recommendation method based on bilinear pooling and multi-mode knowledge fusion | |
Zhang et al. | Micro-blog topic recommendation based on knowledge flow and user selection | |
CN114493786A (en) | Information recommendation method and device | |
Serrano | Intelligent recommender system for big data applications based on the random neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |