CN109166017A

CN109166017A - Method for pushing, device, computer equipment and storage medium based on reunion class

Info

Publication number: CN109166017A
Application number: CN201811191703.1A
Authority: CN
Inventors: 吴壮伟
Original assignee: Ping An Technology Shenzhen Co Ltd
Current assignee: Ping An Technology Shenzhen Co Ltd
Priority date: 2018-10-12
Filing date: 2018-10-12
Publication date: 2019-01-08
Anticipated expiration: 2038-10-12
Also published as: CN109166017B; WO2020073534A1

Abstract

The invention discloses method for pushing, device, computer equipment and storage mediums based on reunion class.This method is by carrying out DBSCAN cluster to row vector each in user-rating matrix, obtain the cluster group to classify by user, and with each one-to-one child user-rating matrix of cluster group, the row vector selected in child user-rating matrix is obtained as target user, according to the similar users group's rating matrix for obtaining target user in child user-rating matrix where target user, and commercial product recommending row vector corresponding with similar users group's rating matrix, commercial product recommending list is obtained according to commercial product recommending row vector, the commercial product recommending list is pushed into the corresponding receiving end of target user.The method achieve user-rating matrix is divided into multiple submatrixs to safeguard respectively, maintenance cost is reduced, and merchandise news push precisely can be carried out to target user according to submatrix cross.

Description

Method for pushing, device, computer equipment and storage medium based on reunion class

Technical field

The present invention relates to information advancing technique field more particularly to a kind of method for pushing based on reunion class, device, calculating Machine equipment and storage medium.

Background technique

Currently, progress shopping at network is more and more frequent on online store Internet-based, these online stores pair When user carries out commercial product recommending, usually used is that (collaborative filtering, principle are users to the proposed algorithm based on collaborative filtering The commodity that those users with similar interests liked are liked, for example your friend likes film Harry Potter I, then will You is recommended, this is the simplest collaborative filtering based on user).

Proposed algorithm based on collaborative filtering commonly uses user-rating matrix, and user-rating matrix indicates user to project The scoring of (project can be understood as specific commodity), user-rating matrix horizontal axis are project, and the longitudinal axis is user, in the middle Value is scoring of the user i to project j.As the data volume of commodity is increasing, user-rating matrix scoring of full dose is safeguarded The cost of system can be higher and higher.

Summary of the invention

The embodiment of the invention provides a kind of method for pushing based on reunion class, device, computer equipment and storage medium, Aim to solve the problem that the full dose user-corresponding points-scoring system of rating matrix in online store in the prior art with the increasing of commodity amount It is more, it is more and more too fat to move, cause to full dose user-rating matrix problem difficult in maintenance.

In a first aspect, the embodiment of the invention provides a kind of method for pushing based on reunion class comprising:

By DBSCAN cluster acquired user-rating matrix is clustered, obtain at least one cluster group, and with Each one-to-one child user-rating matrix of cluster group；

According to the corresponding target user of the row vector chosen in child user-rating matrix, the row of target user is obtained The corresponding cluster group of vector；

In the corresponding cluster group of target user, calculates and obtain between each scoring row vector and the row vector of target user Euclidean distance obtains the corresponding scoring row of Euclidean distance before ranking is located at preset first rank threshold in each Euclidean distance Vector, to form similar users group's rating matrix；

According to the row vector that respectively scores in similar users group's rating matrix, similar users group is obtained to the comprehensive score of each commodity Value, to form commercial product recommending row vector；And

Comprehensive grading value institute before being located at preset second rank threshold by the ranking that scores in commercial product recommending row vector is right Commodity are answered to obtain commercial product recommending list, the commercial product recommending list is pushed into the corresponding receiving end of target user.

Second aspect, the embodiment of the invention provides a kind of driving means based on reunion class comprising:

User's cluster cell, for by DBSCAN cluster acquired user-rating matrix is clustered, obtain to A few cluster group, and with each one-to-one child user-rating matrix of cluster group；

Judging unit is clustered, for according to the corresponding target user of row vector that is chosen in child user-rating matrix, Obtain the corresponding cluster group of row vector of target user；

Similar users rating matrix acquiring unit, for calculating and obtaining each scoring in the corresponding cluster group of target user Euclidean distance between row vector and the row vector of target user obtains ranking in each Euclidean distance and is located at preset first ranking The corresponding scoring row vector of Euclidean distance before threshold value, to form similar users group's rating matrix；

Commercial product recommending row vector acquiring unit, for obtaining according to the row vector that respectively scores in similar users group's rating matrix Similar users group is to the comprehensive grading values of each commodity, to form commercial product recommending row vector；

Information push unit, for being located at before preset second rank threshold by the ranking that scores in commercial product recommending row vector Comprehensive grading value corresponding to commodity to obtain commercial product recommending list, it is corresponding that the commercial product recommending list is pushed into target user Receiving end.

The third aspect, the embodiment of the present invention provide a kind of computer equipment again comprising memory, processor and storage On the memory and the computer program that can run on the processor, the processor execute the computer program Method for pushing based on reunion class described in the above-mentioned first aspect of Shi Shixian.

Fourth aspect, the embodiment of the invention also provides a kind of computer readable storage mediums, wherein the computer can It reads storage medium and is stored with computer program, it is above-mentioned that the computer program when being executed by a processor executes the processor Method for pushing based on reunion class described in first aspect.

The embodiment of the invention provides a kind of method for pushing based on reunion class, device, computer equipment and storage mediums. This method by row vector each in user-rating matrix carry out DBSCAN cluster, obtain by user classify cluster group, and with Each one-to-one child user-rating matrix of cluster group obtains the row vector conduct selected in child user-rating matrix Target user, according in child user-rating matrix where target user obtain target user similar users group's rating matrix, And commercial product recommending row vector corresponding with similar users group's rating matrix, commercial product recommending column are obtained according to commercial product recommending row vector The commercial product recommending list is pushed to the corresponding receiving end of target user by table.The method achieve draw user-rating matrix It is divided into multiple submatrixs to be safeguarded respectively, reduces maintenance cost, and can precisely carry out to target user according to submatrix cross Merchandise news push.

Detailed description of the invention

Technical solution in order to illustrate the embodiments of the present invention more clearly, below will be to needed in embodiment description Attached drawing is briefly described, it should be apparent that, drawings in the following description are some embodiments of the invention, general for this field For logical technical staff, without creative efforts, it is also possible to obtain other drawings based on these drawings.

Fig. 1 is the flow diagram of the method for pushing provided in an embodiment of the present invention based on reunion class；

Fig. 2 is the sub-process schematic diagram of the method for pushing provided in an embodiment of the present invention based on reunion class；

Fig. 3 is another sub-process schematic diagram of the method for pushing provided in an embodiment of the present invention based on reunion class；

Fig. 4 is the schematic block diagram of the driving means provided in an embodiment of the present invention based on reunion class；

Fig. 5 is the subelement schematic block diagram of the driving means provided in an embodiment of the present invention based on reunion class；

Fig. 6 is another subelement schematic block diagram of the driving means provided in an embodiment of the present invention based on reunion class；

Fig. 7 is the schematic block diagram of computer equipment provided in an embodiment of the present invention.

Specific embodiment

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are some of the embodiments of the present invention, instead of all the embodiments.Based on this hair Embodiment in bright, every other implementation obtained by those of ordinary skill in the art without making creative efforts Example, shall fall within the protection scope of the present invention.

It should be appreciated that ought use in this specification and in the appended claims, term " includes " and "comprising" instruction Described feature, entirety, step, operation, the presence of element and/or component, but one or more of the other feature, whole is not precluded Body, step, operation, the presence or addition of element, component and/or its set.

It is also understood that mesh of the term used in this description of the invention merely for the sake of description specific embodiment And be not intended to limit the present invention.As description of the invention and it is used in the attached claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singular, "one" and "the" are intended to include plural form.

It will be further appreciated that the term "and/or" used in description of the invention and the appended claims is Refer to any combination and all possible combinations of one or more of associated item listed, and including these combinations.

Referring to Fig. 1, Fig. 1 is the flow diagram of the method for pushing provided in an embodiment of the present invention based on reunion class, it should Method for pushing based on reunion class is applied in management server, and this method passes through the application software that is installed in management server It is executed, management server is the enterprise terminal for carrying out the push based on reunion class.

As shown in Figure 1, the method comprising the steps of S110~S150.

S110, acquired user-rating matrix is clustered by DBSCAN cluster, obtains at least one cluster Group, and with each one-to-one child user-rating matrix of cluster group.

In the present embodiment, user-rating matrix indicates user to commodity (project can be understood as specific commodity) Scoring, user-rating matrix horizontal axis are project, and the longitudinal axis is user, and value in the middle is scoring of the user i to project j.Such as with The matrix that family-rating matrix S is 4 × 5, such as:

Wherein, the row vector of the first row indicates the commenting for commodity 1- commodity 5 respectively of user 1 in user-rating matrix S Point, the row vector of the second row indicates that user 2 is directed to the scoring of commodity 1- commodity 5 respectively, and the row vector of the third line indicates that user 3 divides Scoring of the safety pin to commodity 1- commodity 5, the row vector of fourth line indicate that user 4 is directed to the scoring of commodity 1- commodity 5 respectively.

Each row vector in user-rating matrix is clustered by DBSCAN Clustering Model, is realized according to user couple Similar user is divided into same cluster group by the scoring of each commodity, and each user comments each commodity in same cluster group It is approximate (namely smaller to point difference of the scoring of each commodity) for dividing.By the way that the user of full dose-rating matrix is carried out DBSCAN clustering is after multiple child user-rating matrixs, only need be safeguarded i.e. respectively to each child user-rating matrix Can, improve the efficiency of maintenance.

In one embodiment, as shown in Fig. 2, step S110 includes:

S111, using any one row vector in user-rating matrix as initial cluster center；

S112, points are included according to preset minimum, obtains the spacing between initial cluster center in preset scanning Row vector within radius, using as initial clustering group；

S113, using row vector each in initial clustering group as cluster centre, obtain in user-rating matrix and in cluster The row vector that the direct density of the heart is reachable, density is reachable or density is connected, using as cluster group adjusted.

In the present embodiment, for the clearer detailed process for understanding DBSCAN cluster, below in DBSCAN cluster Related Feature Words are introduced.

Eps indicates sweep radius；

MinPts indicates minimum comprising points；

ξ neighborhood, indicates centered on given object, the region within the scope of the sweep radius for giving object；

Kernel object, if indicating, object number included in the ξ neighborhood of given object includes more than or equal to minimum Points, then the given object is referred to as kernel object；

Direct density is reachable, indicates for sample set D, if sample point q, in the ξ neighborhood of p, and p is kernel object, So object q is reachable from the direct density of object p；

Density is reachable, indicates to give a string of sample point p for sample set D₁、p₂、……、p_nIf p₁=q and p_n=p, If object p_iFrom p_i-1Direct density is reachable, then object q is reachable from object p density；

Density is connected, and indicates that there are the point o in sample set D, if object o to object p and object q are that density can It reaches, then p with q density is connected.

DBSCAN (Density-Based Spatial Clustering of Applications with Noise) is One more representational density-based algorithms.

DBSCAN needs two parameters: sweep radius (eps) and minimum include points (minPts).Optional one is not interviewed It asks that the point of (unvisited) starts, finds out all points nearby with its distance within eps (including eps).

If quantity >=the minPts nearby put, current point and its one cluster of point formation nearby, and starting point is marked It is denoted as and has accessed (visited).Then recurrence handles all in the cluster be not labeled as having accessed in the same way (visited) point, to be extended to cluster.If quantity < the minPts nearby put, which is temporarily labeled to be used as and make an uproar Sound point.If cluster is fully extended, i.e., all the points in cluster are marked as having accessed, and then go processing not with same algorithm Accessed point.

The corresponding target user of row vector that S120, basis are chosen in child user-rating matrix, obtains target user The corresponding cluster group of row vector.

In the present embodiment, in the corresponding child user-rating matrix of one of cluster group in multiple cluster groups, choosing In a row vector as object vector after, while cluster group belonging to the row vector can be obtained, can quickly judge mesh in this way Mark the similar users of user.

S130, in the corresponding cluster group of target user, calculate obtain it is each scoring row vector and target user row vector Between Euclidean distance, the Euclidean distance obtained before ranking is located at preset first rank threshold in each Euclidean distance is corresponding Score row vector, to form similar users group's rating matrix.

It in the present embodiment, can be in order to judge user similar with target user in cluster group belonging to target user The Euclidean distance respectively to score between row vector and the row vector of target user in cluster group is obtained, each Euclidean distance is arranged in descending order After sequence, the corresponding scoring row of Euclidean distance before ranking is located at preset first rank threshold in each Euclidean distance is obtained Vector, to form similar users group's rating matrix.11 such as are set by the first rank threshold, then obtains ranking in each Euclidean distance The corresponding scoring row vector of 1-10 Euclidean distances forms similar users group's rating matrix with this 10 row vectors.It obtains After the similar users of target user, it can be calculated by row vector selected on a small quantity, without calling the user-of full dose Rating matrix reduces the calculation amount in operational process.

In one embodiment, include: in step S130

Obtain the corresponding scoring row of Euclidean distance before ranking is located at preset first rank threshold in each Euclidean distance Vector is arranged according to scoring row vector sequencing of row serial number in corresponding child user-rating matrix, is obtained similar User group rating matrix.

In the present embodiment, occur according to each scoring row vector each row in corresponding child user-rating matrix successive Sequentially, the scoring row vectors of each similar users is sequentially obtained, aforesaid way can accurately obtain the corresponding scoring of each similar users Similar users group's rating matrix of row vector composition, convenient for subsequent calculating user to the comprehensive grading value of each commodity.

S140, according to the row vector that respectively scores in similar users group's rating matrix, obtain similar users group to the comprehensive of each commodity Score value is closed, to form commercial product recommending row vector.

In the present embodiment, after obtaining the similar users of target user to the scoring of each commodity, each quotient can be directed to Product calculate similar users to its comprehensive grading value.Euclidean in comprehensively considering target user and similar users between each user away from From and similar users to the score values of each commodity, can operation obtain commercial product recommending row vector, with commercial product recommending row vector work For the foundation of commercial product recommending.

In one embodiment, as shown in figure 3, step S140 includes:

S141, according to respectively scoring row vector in similar users group's rating matrix respectively between the row vector of target user Euclidean distance, to form similar users group's Euclidean distance row vector；

S142, it is multiplied to obtain similar users with similar users group's rating matrix according to similar users group's Euclidean distance row vector Group is to the comprehensive grading values of each commodity, to form commercial product recommending row vector.

In the present embodiment, after such as carrying out DBSCAN cluster by user-rating matrix S, the first cluster group and the are obtained Dimerization monoid, wherein the first cluster group includes the scoring row vector of user 1 and user 2, the second cluster group includes user 3 and user 4 scoring row vector.At this time first the corresponding child user-rating matrix of group is clustered by the scoring row vector group of user 1 and user 2 At the corresponding child user-rating matrix of the second cluster group is made of the scoring row vector of user 3 and user 4.

If having selected user 1 as target user, the corresponding cluster group of the scoring row vector of user 1 is the first cluster Group further includes the scoring row vector of user 2 in addition to the scoring vector including user 1 in the first cluster group.At this point, similar use Family group's rating matrix is [0 413 2], the scoring row vector of similar users group rating matrix [0 413 2] and target user Euclidean distance between [1 315 2] isBy similar users group's Euclidean distance row vectorWith the row vector of mark user [1 315 2] it is multiplied and obtainsSimilar users group has been obtained to the synthesis of each commodity The commercial product recommending row vector of score value composition.

It is commented when the corresponding scoring row vector of each user in the cluster group according to belonging to target user and with target user Commercial product recommending row vector is calculated in the Euclidean distance of branch's vector, arranges in commercial product recommending row vector the scoring of each commodity The forward commodity in position can be used as one of the component of commercial product recommending list, be pushed away in this way by what commercial product recommending row vector obtained Hobby of the commodity due to having fully considered approximated user is recommended, therefore can accurately reflect the hobby of target user.

S150, the comprehensive grading value being located at before preset second rank threshold by the ranking that scores in commercial product recommending row vector Corresponding commodity push to the corresponding receiving end of target user to obtain commercial product recommending list, by the commercial product recommending list.

In the present embodiment, in calculating obtained commercial product recommending row vector, you can learn that gathering belonging to target user Each user is located at before the second rank threshold (such as the comprehensive score of each commodity with the ranking that scores in the comprehensive score in monoid The second rank threshold, which is arranged, can be used as the recommendation items of commercial product recommending list for commodity corresponding to scoring 4).

For example, before scoring is located in the corresponding commercial product recommending row vector of similar users (user 2) of target user (user 1) 3 commodity are commodity 2, commodity 4 and commodity 5 respectively, and above-mentioned 3 commodity are pushed to target as items list at this time and are used Family.

By the above-mentioned calculating based on Euclidean distance, the commodity of similar users can be liked and recommend quotient as to target user The principal element considered when product can more reasonably carry out commercial product recommending.

In one embodiment, before step S110 further include:

History merchandise news set is obtained, by word frequency-inverse document frequency model to the history merchandise news collection Each history merchandise news carries out key word information extraction in conjunction, and it is crucial to obtain commodity corresponding with each history merchandise news Set of words；

Pass through the corresponding term vector of commodity keyword each in each commodity keyword set of Word2Vec model acquisition；

Obtain the average value of term vector corresponding to each commodity keyword in each commodity keyword set, with obtain with it is each The corresponding statistical vector of commodity keyword set；

The corresponding statistical vector of commodity keyword set is clustered by DBSCAN Clustering Model, obtains at least one Commercial articles clustering cluster；

If in user-rating matrix including blank value, according to the corresponding product name of the blank value, with acquisition and commodity The corresponding statistical vector of title；

Obtain the commercial articles clustering cluster that statistical vector corresponding with product name is belonged to；

According to the commercial articles clustering cluster that statistical vector corresponding with product name is belonged to, obtain corresponding with the blank value Product name similar product name accordingly, using as similar commodity result；

According to the corresponding row vector of the blank value, obtain corresponding with each product name in the similar commodity result Scoring；

It is weighted and averaged according to scoring corresponding with product name each in the similar commodity result, obtains the sky The corresponding commodity weighted scoring of white value, is updated to corresponding commodity weighted scoring for blank value.

In the present embodiment, in the user oriented interface UI provided in management server, user can choose a variety of One of commodity are a variety of and bought.It is to be stored with history merchandise news set, history commodity in the management server Each history merchandise news includes by product name and item property in information aggregate, and wherein the attribute of commodity includes the valence of commodity Lattice, label, brand and function etc..

When by TF-IDF model, (i.e. term frequency-inverse document frequency indicates word Frequently-inverse document frequency model) keyword letter is carried out to each history merchandise news in the history merchandise news set Breath extracts, and the simplification of each history merchandise news can be expressed as corresponding commodity keyword set.

Word frequency-inverse document frequency model is a kind of common weighting technique for information retrieval and data mining.TF Mean that word frequency (Term Frequency), IDF mean inverse document frequency (Inverse Document Frequency).TF-IDF is a kind of statistical method, to assess a words in a file set or a corpus The significance level of a copy of it file.The importance of words is with the directly proportional increase of number that it occurs hereof, but simultaneously Can be inversely proportional decline with the frequency that it occurs in corpus.

For example, there is the basketball of a Spalding brand XX model YY member, there are also the merchandise news abundant such as the place of production, size, But after TF-IDF model carries out keyword abstraction, the commodity keyword set finally obtained is combined into " basketball+Spalding+XX Model ".In this way, each history merchandise news is reduced to corresponding commodity keyword set, can be convenient for being converted into term vector.

It, can after converting commodity keyword set corresponding with each history merchandise news for history merchandise news set again By Word2Vec model, (Word2Vec is a kind of mould for learning semantic knowledge in unsupervised mode from a large amount of corpus of text Type) it converts commodity keyword set to and each one-to-one term vector of history merchandise news.

Such as in corpus, basketball, Spalding, XX model one vector of each correspondence, only one value is 1 in vector, Remaining is all 0, can convert the corresponding vector input Word2Vec model of above- mentioned information to the successive value of low dimensional, that is, Dense vector, and wherein the word of similar import will be mapped to that similar position in vector space.

The term vector of each commodity keyword in obtaining commodity keyword set, then take the word of each commodity keyword to Measure statistical vector of the average value as the commodity.It is at this time that history merchandise news each in history merchandise news set is equal Be converted into corresponding statistical vector, later will by DBSCAN Clustering Model to the corresponding statistical vector of commodity keyword set into Row cluster, can be obtained at least one commercial articles clustering cluster.

Since it is determined may know that is after the row vector that the blank value is belonged in initial user-rating matrix Which user is blank value for the commodity scoring of which commodity, first obtains the corresponding product name of the blank value at this time Know statistical vector corresponding to the product name.Then judge the commercial articles clustering cluster that the statistical vector is belonged to, can obtain The similar product name of other commodity in the commercial articles clustering cluster, using the similar commodity as the corresponding product name of the blank value As a result.In the row vector belonged in initial user-rating matrix by the blank value, it would know that the user for similar The scoring of each similar product name in commodity result.Finally it is weighted according to the user for the scoring of each similar product name It is average, obtain the corresponding commodity weighted scoring of the blank value.

In one embodiment, basis scoring corresponding with product name each in the similar commodity result is added Weight average obtains the corresponding commodity weighted scoring of the blank value, comprising:

Using the corresponding statistical vector of product name each in the similar commodity result as statistical vector group, by the sky The corresponding statistical vector of white value corresponding goods title obtains each in the statistical vector group as commodity to be predicted scoring vector The distance between statistical vector and commodity to be predicted scoring vector, to obtain vector distance set；

By the corresponding scoring of product name each in the similar commodity result multiplied by corresponding vector in vector distance set Distance is simultaneously summed, and commodity weighting overall score is obtained；

By commodity weighting overall score divided by the sum of each vector distance in vector distance set, it is corresponding to obtain the blank value Commodity weighted scoring.

In the present embodiment, if by between each statistical vector in the statistical vector group and commodity to be predicted scoring vector Distance be denoted as d_ck, by the corresponding user of row vector where the blank value to each product name in the similar commodity result Corresponding scoring is denoted as S_ic, the corresponding commodity weighted scoring of the blank value is calculated by following formula:

Wherein, Score_kIndicate that the commodity weighted scoring of the blank value corresponding goods k, m are the similar commodity result In similar commodity c total number.

For example, user 1 is blank value for the scoring of commodity 2, and obtaining the corresponding similar commodity result of commodity 2 is commodity 4 and commodity 5, and user 1 is respectively 3 and 4 for the scoring of commodity 4 and commodity 5, the corresponding statistical vector of commodity 4 and commodity 2 are right The distance between statistical vector answered is 0.5, between the corresponding statistical vector of commodity 5 statistical vector corresponding with commodity 2 away from From being 1, then:

Score₂=(0.5*3+1*4)/(0.5+1)=11/3；

At this point, by the above-mentioned Score being calculated₂As the corresponding commodity weighted scoring of the blank value.

It, can effective completion user-by the prediction technique based on content by scoring commodity lacking in new user Rating matrix avoids the problem of being cold-started in recommendation process.

The method achieve user-rating matrix is divided into multiple submatrixs to safeguard respectively, maintenance cost is reduced, And merchandise news push precisely can be carried out to target user according to submatrix cross.

The embodiment of the present invention also provides a kind of driving means based on reunion class, is somebody's turn to do the driving means based on reunion class and is used for Execute any embodiment of the aforementioned method for pushing based on reunion class.Specifically, referring to Fig. 4, Fig. 4 is that the embodiment of the present invention mentions The schematic block diagram of the driving means based on reunion class supplied.The driving means 100 based on reunion class can be configured at management In server.

As shown in figure 4, the driving means 100 based on reunion class include user's cluster cell 110, cluster judging unit 120, Similar users rating matrix acquiring unit 130, commercial product recommending row vector acquiring unit 140 and information push unit 150.

User's cluster cell 110 is obtained for being clustered by DBSCAN cluster to acquired user-rating matrix To at least one cluster group, and with each one-to-one child user-rating matrix of cluster group.

In one embodiment, as shown in figure 5, user's cluster cell 110 includes:

Initial center acquiring unit 111, for using any one row vector in user-rating matrix as in initial clustering The heart；

Initial clustering group acquiring unit 112, for according to preset minimum comprising points, obtain with initial cluster center it Between row vector of the spacing within preset sweep radius, using as initial clustering group；

Group's adjustment unit 113 is clustered, for obtaining user-using row vector each in initial clustering group as cluster centre In rating matrix with the row vector that the direct density of cluster centre is reachable, density is reachable or density is connected, using as adjusted poly- Monoid.

Judging unit 120 is clustered, for using according to the corresponding target of the row vector chosen in child user-rating matrix Family obtains the corresponding cluster group of row vector of target user.

Similar users rating matrix acquiring unit 130 is respectively commented in the corresponding cluster group of target user, calculating to obtain Euclidean distance between branch's vector and the row vector of target user obtains ranking in each Euclidean distance and is located at preset first row The corresponding scoring row vector of Euclidean distance before name threshold value, to form similar users group's rating matrix.

In one embodiment, it obtains ranking in each Euclidean distance and is located at the Euclidean distance before preset first rank threshold Corresponding scoring row vector, to form similar users group's rating matrix, comprising:

Commercial product recommending row vector acquiring unit 140, for obtaining according to the row vector that respectively scores in similar users group's rating matrix Take similar users group to the comprehensive grading value of each commodity, to form commercial product recommending row vector.

In one embodiment, as shown in fig. 6, commercial product recommending row vector acquiring unit 140 includes:

Euclidean distance row vector acquiring unit 141, for according to the row vector point that respectively scores in similar users group's rating matrix Euclidean distance not between the row vector of target user, to form similar users group's Euclidean distance row vector；

Comprehensive grading value computing unit 142, for being commented according to similar users group's Euclidean distance row vector with similar users group Sub-matrix is multiplied to obtain similar users group to the comprehensive grading value of each commodity, to form commercial product recommending row vector.

If having selected user 1 as target user, the corresponding cluster group of the scoring row vector of user 1 is the first cluster Group further includes the scoring row vector of user 2 in addition to the scoring vector including user 1 in the first cluster group.At this point, similar use Family group's rating matrix is [0 413 2], the scoring row vector of similar users group rating matrix [0 413 2] and target user Euclidean distance between [1 315 2] isBy similar users group's Euclidean distance row vectorWith mark user row to Amount [1 315 2], which is multiplied, to be obtainedSimilar users group has been obtained to the comprehensive of each commodity Close the commercial product recommending row vector of score value composition.

Information push unit 150, for being located at preset second rank threshold by the ranking that scores in commercial product recommending row vector Commodity corresponding to comprehensive grading value before push to target user to obtain commercial product recommending list, by the commercial product recommending list Corresponding receiving end.

In one embodiment, based on the driving means 100 of reunion class, further includes:

History keyword set of words acquiring unit is referred to for obtaining history merchandise news set by the inverse text frequency of word frequency- Exponential model carries out key word information extraction to each history merchandise news in the history merchandise news set, obtain with it is each The corresponding commodity keyword set of history merchandise news；

Term vector conversion unit, it is crucial for obtaining each commodity in each commodity keyword set by Word2Vec model The corresponding term vector of word；

Statistical vector acquiring unit, for obtaining term vector corresponding to each commodity keyword in each commodity keyword set Average value, to obtain statistical vector corresponding with each commodity keyword set；

Commercial articles clustering unit, for being carried out by DBSCAN Clustering Model to the corresponding statistical vector of commodity keyword set Cluster, obtains at least one commercial articles clustering cluster；

Vector acquiring unit to be predicted, if for including blank value in user-rating matrix, it is corresponding according to the blank value Product name, to obtain corresponding with product name statistical vector；

Commercial articles clustering cluster belongs to judging unit, poly- for obtaining the commodity that statistical vector corresponding with product name is belonged to Class cluster；

Similar commodity result acquiring unit, the commercial articles clustering for being belonged to according to statistical vector corresponding with product name Cluster obtains the corresponding similar product name of corresponding with blank value product name, using as similar commodity result；

Similar commodity scoring acquiring unit, for according to the corresponding row vector of the blank value, acquisition and the similar quotient The corresponding scoring of each product name in product result；

Commodity weighted scoring acquiring unit, for according to each product name is corresponding in the similar commodity result comments Divide and be weighted and averaged, obtains the corresponding commodity weighted scoring of the blank value, blank value is updated to corresponding commodity and is added Power scoring.

In one embodiment, commodity weighted scoring acquiring unit, comprising:

Vector distance set acquiring unit, for by each product name in the similar commodity result it is corresponding count to Amount be used as statistical vector group, using the corresponding statistical vector of blank value corresponding goods title as commodity to be predicted score to Amount obtains the distance between each statistical vector and commodity to be predicted scoring vector in the statistical vector group, to obtain vector Distance set；

Commodity weight overall score acquiring unit, for multiplying the corresponding scoring of product name each in the similar commodity result It with corresponding vector distance in vector distance set and sums, obtains commodity weighting overall score；

Average mark acquiring unit, for by commodity weighting overall score divided by the sum of each vector distance in vector distance set, Obtain the corresponding commodity weighted scoring of the blank value.

Score₂=(0.5*3+1*4)/(0.5+1)=11/3；

The arrangement achieves user-rating matrix is divided into multiple submatrixs to safeguard respectively, maintenance cost is reduced, And merchandise news push precisely can be carried out to target user according to submatrix cross.

The above-mentioned driving means based on reunion class can be implemented as the form of computer program, which can be It is run in computer equipment as shown in Figure 7.

Referring to Fig. 7, Fig. 7 is the schematic block diagram of computer equipment provided in an embodiment of the present invention.

Refering to Fig. 7, which includes processor 502, memory and the net connected by system bus 501 Network interface 505, wherein memory may include non-volatile memory medium 503 and built-in storage 504.

The non-volatile memory medium 503 can storage program area 5031 and computer program 5032.The computer program 5032 are performed, and processor 502 may make to execute the method for pushing based on reunion class.

The processor 502 supports the operation of entire computer equipment 500 for providing calculating and control ability.

The built-in storage 504 provides environment for the operation of the computer program 5032 in non-volatile memory medium 503, should When computer program 5032 is executed by processor 502, processor 502 may make to execute the method for pushing based on reunion class.

The network interface 505 is for carrying out network communication, such as the transmission of offer data information.Those skilled in the art can To understand, structure shown in Fig. 7, only the block diagram of part-structure relevant to the present invention program, is not constituted to this hair The restriction for the computer equipment 500 that bright scheme is applied thereon, specific computer equipment 500 may include than as shown in the figure More or fewer components perhaps combine certain components or with different component layouts.

Wherein, the processor 502 is for running computer program 5032 stored in memory, to realize following function Can: by DBSCAN cluster acquired user-rating matrix is clustered, obtain at least one cluster group, and with it is each Cluster the one-to-one child user-rating matrix of group；According to the corresponding mesh of the row vector chosen in child user-rating matrix User is marked, the corresponding cluster group of row vector of target user is obtained；In the corresponding cluster group of target user, calculates to obtain and respectively comment Euclidean distance between branch's vector and the row vector of target user obtains ranking in each Euclidean distance and is located at preset first row The corresponding scoring row vector of Euclidean distance before name threshold value, to form similar users group's rating matrix；According to similar users group Respectively score row vector in rating matrix, obtains similar users group to the comprehensive grading values of each commodity, with form commercial product recommending row to Amount；And as scoring corresponding to the comprehensive grading value that ranking is located at before preset second rank threshold in commercial product recommending row vector Commodity push to the corresponding receiving end of target user to obtain commercial product recommending list, by the commercial product recommending list.

In one embodiment, processor 502 gathers user-rating matrix in described clustered by DBSCAN of execution Class performs the following operations when obtaining the step of at least one cluster group: any one row vector in user-rating matrix is made For initial cluster center；Include points according to preset minimum, the spacing obtained between initial cluster center is swept preset The row vector within radius is retouched, using as initial clustering group；Using row vector each in initial clustering group as cluster centre, obtain In user-rating matrix with the row vector that the direct density of cluster centre is reachable, density is reachable or density is connected, as after adjustment Cluster group.

In one embodiment, processor 502 execute it is described according to the row vector that respectively scores in similar users group's rating matrix, Similar users group is obtained to the comprehensive grading values of each commodity, when step to form commercial product recommending row vector, performed the following operations: According to Euclidean distance of the row vector respectively between the row vector of target user that respectively score in similar users group's rating matrix, with group At similar users group's Euclidean distance row vector；According to similar users group's Euclidean distance row vector and similar users group's rating matrix phase It is multiplied to arrive similar users group to the comprehensive grading value of each commodity, to form commercial product recommending row vector.

In one embodiment, the ranking in executing each Euclidean distance of acquisition of processor 502 is located at preset first row Euclidean distance corresponding scoring row vector before name threshold value when step to form similar users group's rating matrix, executes such as Lower operation: obtain the corresponding scoring row of Euclidean distance before ranking is located at preset first rank threshold in each Euclidean distance to Amount is arranged according to scoring row vector sequencing of row serial number in corresponding child user-rating matrix, obtains similar use Family group's rating matrix.

In one embodiment, processor 502 execute it is described by DBSCAN cluster to acquired user-rating matrix Clustered, obtain at least one cluster group, and the step of child user-rating matrix one-to-one with each cluster group it Before, it also performs the following operations: history merchandise news set is obtained, by word frequency-inverse document frequency model to the history Each history merchandise news carries out key word information extraction in merchandise news set, obtains corresponding with each history merchandise news Commodity keyword set；Pass through the corresponding word of commodity keyword each in each commodity keyword set of Word2Vec model acquisition Vector；The average value of term vector corresponding to each commodity keyword in each commodity keyword set is obtained, to obtain and each quotient The corresponding statistical vector of product keyword set；By DBSCAN Clustering Model to the corresponding statistical vector of commodity keyword set into Row cluster, obtains at least one commercial articles clustering cluster；It is corresponding according to the blank value if in user-rating matrix including blank value Product name, to obtain corresponding with product name statistical vector；Statistical vector corresponding with product name is obtained to be belonged to Commercial articles clustering cluster；According to the commercial articles clustering cluster that statistical vector corresponding with product name is belonged to, obtain and the blank value Corresponding product name similar product name accordingly, using as similar commodity result；According to the corresponding row of the blank value to Amount obtains scoring corresponding with product name each in the similar commodity result；According to every to the similar commodity result The corresponding scoring of one product name is weighted and averaged, and the corresponding commodity weighted scoring of the blank value is obtained, by blank value It is updated to corresponding commodity weighted scoring.

In one embodiment, processor 502 is executing the basis and each product name in the similar commodity result Corresponding scoring is weighted and averaged, and when obtaining the step of the corresponding commodity weighted scoring of the blank value, is performed the following operations: It is using the corresponding statistical vector of product name each in the similar commodity result as statistical vector group, the blank value is corresponding The corresponding statistical vector of product name obtains each statistical vector in the statistical vector group as commodity to be predicted scoring vector The distance between commodity to be predicted scoring vector, to obtain vector distance set；By each quotient in the similar commodity result The name of an article claims corresponding scoring multiplied by corresponding vector distance in vector distance set and sums, and obtains commodity weighting overall score；By quotient Product weight overall score divided by the sum of each vector distance in vector distance set, obtain the corresponding commodity weighting of the blank value and comment Point.

It will be understood by those skilled in the art that the embodiment of computer equipment shown in Fig. 7 is not constituted to computer The restriction of equipment specific composition, in other embodiments, computer equipment may include components more more or fewer than diagram, or Person combines certain components or different component layouts.For example, in some embodiments, computer equipment can only include depositing Reservoir and processor, in such embodiments, the structure and function of memory and processor are consistent with embodiment illustrated in fig. 7, Details are not described herein.

It should be appreciated that in embodiments of the present invention, processor 502 can be central processing unit (Central ProcessingUnit, CPU), which can also be other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic Device, discrete gate or transistor logic, discrete hardware components etc..Wherein, general processor can be microprocessor or Person's processor is also possible to any conventional processor etc..

Computer readable storage medium is provided in another embodiment of the invention.The computer readable storage medium can be with For non-volatile computer readable storage medium.The computer-readable recording medium storage has computer program, wherein calculating Machine program performs the steps of when being executed by processor gathers acquired user-rating matrix by DBSCAN cluster Class, obtain at least one cluster group, and with each one-to-one child user-rating matrix of cluster group；It is commented according in child user- The corresponding target user of the row vector chosen in sub-matrix obtains the corresponding cluster group of row vector of target user；In target In the corresponding cluster group of user, the Euclidean distance obtained between each scoring row vector and the row vector of target user is calculated, is obtained Ranking is located at the corresponding scoring row vector of Euclidean distance before preset first rank threshold in each Euclidean distance, to form phase Like user group rating matrix；According to the row vector that respectively scores in similar users group's rating matrix, similar users group is obtained to each commodity Comprehensive grading value, to form commercial product recommending row vector；And it is located at preset the by the ranking that scores in commercial product recommending row vector Commodity corresponding to comprehensive grading value before two rank thresholds are pushed the commercial product recommending list with obtaining commercial product recommending list To the corresponding receiving end of target user.

In one embodiment, described that user-rating matrix is clustered by DBSCAN cluster, it is poly- to obtain at least one Monoid, comprising: using any one row vector in user-rating matrix as initial cluster center；Include according to preset minimum Points obtain row vector of the spacing within preset sweep radius between initial cluster center, using as initial clustering Group；Using row vector each in initial clustering group as cluster centre, obtain in user-rating matrix with the direct density of cluster centre The row vector that reachable, density is reachable or density is connected, using as cluster group adjusted.

In one embodiment, described according to the row vector that respectively scores in similar users group's rating matrix, obtain similar users group To the comprehensive grading value of each commodity, to form commercial product recommending row vector, comprising: respectively score according in similar users group's rating matrix The row vector Euclidean distance between the row vector of target user respectively, to form similar users group's Euclidean distance row vector；Root It is multiplied to obtain similar users group to the comprehensive of each commodity with similar users group's rating matrix according to similar users group's Euclidean distance row vector Score value is closed, to form commercial product recommending row vector.

In one embodiment, European before ranking is located at preset first rank threshold in each Euclidean distance of acquisition Apart from corresponding scoring row vector, to form similar users group's rating matrix, comprising: obtain ranking in each Euclidean distance and be located in advance If the first rank threshold before the corresponding scoring row vector of Euclidean distance, according to scoring row vector in corresponding child user- The sequencing of row serial number is arranged in rating matrix, obtains similar users group's rating matrix.

In one embodiment, it is described by DBSCAN cluster acquired user-rating matrix is clustered, obtain to A few cluster group, and with before each one-to-one child user-rating matrix of cluster group, further includes: acquisition history commodity Information aggregate, by word frequency-inverse document frequency model to each history merchandise news in the history merchandise news set Key word information extraction is carried out, commodity keyword set corresponding with each history merchandise news is obtained；Pass through Word2Vec Model obtains the corresponding term vector of each commodity keyword in each commodity keyword set；It obtains in each commodity keyword set The average value of term vector corresponding to each commodity keyword, to obtain statistical vector corresponding with each commodity keyword set；It is logical It crosses DBSCAN Clustering Model to cluster the corresponding statistical vector of commodity keyword set, obtains at least one commercial articles clustering Cluster；If in user-rating matrix including blank value, according to the corresponding product name of the blank value, with acquisition and product name Corresponding statistical vector；Obtain the commercial articles clustering cluster that statistical vector corresponding with product name is belonged to；According to product name The commercial articles clustering cluster that corresponding statistical vector is belonged to obtains the corresponding similar commodity of corresponding with blank value product name Title, using as similar commodity result；According to the corresponding row vector of the blank value, obtain every to the similar commodity result The corresponding scoring of one product name；It is weighted according to scoring corresponding with product name each in the similar commodity result flat , the corresponding commodity weighted scoring of the blank value is obtained, blank value is updated to corresponding commodity weighted scoring.

In one embodiment, basis scoring corresponding with product name each in the similar commodity result is added Weight average obtains the corresponding commodity weighted scoring of the blank value, comprising: by each product name in the similar commodity result Corresponding statistical vector is as statistical vector group, using the corresponding statistical vector of blank value corresponding goods title as to be predicted Commodity scoring vector obtains the distance between each statistical vector and commodity to be predicted scoring vector in the statistical vector group, To obtain vector distance set；By the corresponding scoring of product name each in the similar commodity result multiplied by vector distance set Corresponding vector distance is simultaneously summed, and commodity weighting overall score is obtained；By commodity weighting overall score divided by each in vector distance set The sum of vector distance obtains the corresponding commodity weighted scoring of the blank value.

It is apparent to those skilled in the art that for convenience of description and succinctly, foregoing description is set The specific work process of standby, device and unit, can refer to corresponding processes in the foregoing method embodiment, and details are not described herein. Those of ordinary skill in the art may be aware that unit described in conjunction with the examples disclosed in the embodiments of the present disclosure and algorithm Step can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware and software Interchangeability generally describes each exemplary composition and step according to function in the above description.These functions are studied carefully Unexpectedly the specific application and design constraint depending on technical solution are implemented in hardware or software.Professional technician Each specific application can be used different methods to achieve the described function, but this realization is it is not considered that exceed The scope of the present invention.

In several embodiments provided by the present invention, it should be understood that disclosed unit and method, it can be with It realizes by another way.For example, the apparatus embodiments described above are merely exemplary, for example, the unit It divides, only logical function partition, there may be another division manner in actual implementation, can also will be with the same function Unit set is at a unit, such as multiple units or components can be combined or can be integrated into another system or some Feature can be ignored, or not execute.In addition, shown or discussed mutual coupling, direct-coupling or communication connection can Be through some interfaces, the indirect coupling or communication connection of device or unit, be also possible to electricity, mechanical or other shapes Formula connection.

The unit as illustrated by the separation member may or may not be physically separated, aobvious as unit The component shown may or may not be physical unit, it can and it is in one place, or may be distributed over multiple In network unit.Some or all of unit therein can be selected to realize the embodiment of the present invention according to the actual needs Purpose.

It, can also be in addition, the functional units in various embodiments of the present invention may be integrated into one processing unit It is that each unit physically exists alone, is also possible to two or more units and is integrated in one unit.It is above-mentioned integrated Unit both can take the form of hardware realization, can also realize in the form of software functional units.

If the integrated unit is realized in the form of SFU software functional unit and sells or use as independent product When, it can store in one storage medium.Based on this understanding, technical solution of the present invention is substantially in other words to existing The all or part of part or the technical solution that technology contributes can be embodied in the form of software products, should Computer software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be Personal computer, server or network equipment etc.) execute all or part of step of each embodiment the method for the present invention Suddenly.And storage medium above-mentioned include: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or The various media that can store program code such as person's CD.

The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right It is required that protection scope subject to.

Claims

1. a kind of method for pushing based on reunion class characterized by comprising

By DBSCAN cluster acquired user-rating matrix is clustered, obtain at least one cluster group, and with it is each Cluster the one-to-one child user-rating matrix of group；

According to the corresponding target user of the row vector chosen in child user-rating matrix, the row vector of target user is obtained Corresponding cluster group；

In the corresponding cluster group of target user, calculate European between each scoring row vector of acquisition and the row vector of target user Distance, obtain the corresponding scoring row of Euclidean distance before ranking is located at preset first rank threshold in each Euclidean distance to Amount, to form similar users group's rating matrix；

According to the row vector that respectively scores in similar users group's rating matrix, similar users group is obtained to the comprehensive grading value of each commodity, To form commercial product recommending row vector；And

As the quotient corresponding to the comprehensive grading value that ranking is located at before preset second rank threshold that scores in commercial product recommending row vector Product push to the corresponding receiving end of target user to obtain commercial product recommending list, by the commercial product recommending list.

2. the method for pushing according to claim 1 based on reunion class, which is characterized in that described to pass through DBSCAN cluster pair User-rating matrix clusters, and obtains at least one cluster group, comprising:

Using any one row vector in user-rating matrix as initial cluster center；

Include points according to preset minimum, obtains the spacing between initial cluster center within preset sweep radius Row vector, using as initial clustering group；

Using row vector each in initial clustering group as cluster centre, obtain directly close with cluster centre in user-rating matrix The row vector that reachable, density is reachable or density is connected is spent, using as cluster group adjusted.

3. the method for pushing according to claim 1 based on reunion class, which is characterized in that described to be commented according to similar users group Respectively score row vector in sub-matrix, obtains similar users group to the comprehensive grading values of each commodity, to form commercial product recommending row vector, Include:

According to Euclidean distance of the row vector respectively between the row vector of target user that respectively score in similar users group's rating matrix, To form similar users group's Euclidean distance row vector；

It is multiplied to obtain similar users group to each quotient with similar users group's rating matrix according to similar users group's Euclidean distance row vector The comprehensive grading value of product, to form commercial product recommending row vector.

4. the method for pushing according to claim 1 based on reunion class, which is characterized in that described to obtain in each Euclidean distance Ranking is located at the corresponding scoring row vector of Euclidean distance before preset first rank threshold, to form similar users group scoring Matrix, comprising:

It obtains ranking in each Euclidean distance and is located at the corresponding scoring row vector of Euclidean distance before preset first rank threshold, It is arranged according to scoring row vector sequencing of row serial number in corresponding child user-rating matrix, obtains similar users Group's rating matrix.

5. the method for pushing according to claim 1 based on reunion class, which is characterized in that described to pass through DBSCAN cluster pair Acquired user-rating matrix clusters, and obtains at least one cluster group, and use with the one-to-one son of each cluster group Before family-rating matrix, further includes:

History merchandise news set is obtained, by word frequency-inverse document frequency model in the history merchandise news set Each history merchandise news carries out key word information extraction, obtains commodity keyword set corresponding with each history merchandise news It closes；

The average value of term vector corresponding to each commodity keyword in each commodity keyword set is obtained, to obtain and each commodity The corresponding statistical vector of keyword set；

The corresponding statistical vector of commodity keyword set is clustered by DBSCAN Clustering Model, obtains at least one commodity Clustering cluster；

If in user-rating matrix including blank value, according to the corresponding product name of the blank value, with acquisition and product name Corresponding statistical vector；

According to the commercial articles clustering cluster that statistical vector corresponding with product name is belonged to, commodity corresponding with the blank value are obtained Title similar product name accordingly, using as similar commodity result；

According to the corresponding row vector of the blank value, obtain and each product name is corresponding in the similar commodity result comments Point；

It is weighted and averaged according to scoring corresponding with product name each in the similar commodity result, obtains the blank value Blank value is updated to corresponding commodity weighted scoring by corresponding commodity weighted scoring.

6. the method for pushing according to claim 5 based on reunion class, which is characterized in that the basis and the similar quotient Corresponding score of each product name is weighted and averaged in product result, obtains the corresponding commodity weighted scoring of the blank value, Include:

Using the corresponding statistical vector of product name each in the similar commodity result as statistical vector group, by the blank value The corresponding statistical vector of corresponding goods title obtains each statistics in the statistical vector group as commodity to be predicted scoring vector The distance between vector and commodity to be predicted scoring vector, to obtain vector distance set；

By the corresponding scoring of product name each in the similar commodity result multiplied by corresponding vector distance in vector distance set And sum, obtain commodity weighting overall score；

By commodity weighting overall score divided by the sum of each vector distance in vector distance set, the corresponding commodity of the blank value are obtained Weighted scoring.

7. a kind of driving means based on reunion class characterized by comprising

User's cluster cell obtains at least one for clustering by DBSCAN cluster to acquired user-rating matrix A cluster group, and with each one-to-one child user-rating matrix of cluster group；

Judging unit is clustered, for obtaining according to the corresponding target user of row vector chosen in child user-rating matrix The corresponding cluster group of the row vector of target user；

Similar users rating matrix acquiring unit, in the corresponding cluster group of target user, calculate obtain each scoring it is capable to Euclidean distance between amount and the row vector of target user, obtains ranking in each Euclidean distance and is located at preset first rank threshold The corresponding scoring row vector of Euclidean distance before, to form similar users group's rating matrix；

Commercial product recommending row vector acquiring unit, for obtaining similar according to the row vector that respectively scores in similar users group's rating matrix User group is to the comprehensive grading values of each commodity, to form commercial product recommending row vector；

Information push unit, for comprehensive before being located at preset second rank threshold by the ranking that scores in commercial product recommending row vector Commodity corresponding to score value are closed to obtain commercial product recommending list, the commercial product recommending list pushes to target user is corresponding to be connect Receiving end.

8. the driving means according to claim 7 based on reunion class, which is characterized in that user's cluster cell, packet It includes:

Initial center acquiring unit, for using any one row vector in user-rating matrix as initial cluster center；

Initial clustering group's acquiring unit, for including points according to preset minimum, between obtaining between initial cluster center Away from the row vector within preset sweep radius, using as initial clustering group；

Group's adjustment unit is clustered, for obtaining user-rating matrix using row vector each in initial clustering group as cluster centre In with the row vector that the direct density of cluster centre is reachable, density is reachable or density is connected, using as cluster group adjusted.

9. a kind of computer equipment, including memory, processor and it is stored on the memory and can be on the processor The computer program of operation, which is characterized in that the processor realizes such as claim 1 to 6 when executing the computer program Any one of described in the method for pushing based on reunion class.

10. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has computer journey Sequence, the computer program make the processor execute such as base as claimed in any one of claims 1 to 6 when being executed by a processor In the method for pushing of reunion class.