CN104731867B - A kind of method and apparatus that object is clustered - Google Patents

A kind of method and apparatus that object is clustered Download PDF

Info

Publication number
CN104731867B
CN104731867B CN201510090184.XA CN201510090184A CN104731867B CN 104731867 B CN104731867 B CN 104731867B CN 201510090184 A CN201510090184 A CN 201510090184A CN 104731867 B CN104731867 B CN 104731867B
Authority
CN
China
Prior art keywords
transfer
keyword
information
user
transfer case
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510090184.XA
Other languages
Chinese (zh)
Other versions
CN104731867A (en
Inventor
周泽伟
程涛远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510090184.XA priority Critical patent/CN104731867B/en
Publication of CN104731867A publication Critical patent/CN104731867A/en
Application granted granted Critical
Publication of CN104731867B publication Critical patent/CN104731867B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention designs the method clustered to object in computer equipment, and this method includes:The transfer case information of multiple objects is obtained, the transfer case information is used to indicate transfer case based on object information acquisition behavior, that user is in the multiple object;According to the transfer case information, the multiple object is clustered, obtains the cluster result of the multiple object.The present invention can cluster object by analyzing transfer case information of the user in object, therefore identified object classification is more objective, accurate.

Description

A kind of method and apparatus that object is clustered
Technical field
The present invention relates to field of computer technology more particularly to a kind of method and apparatus clustered to object.
Background technology
In the prior art, natural language analysis is usually carried out by the description text to object, to classify to object. Particularly, when object is related to commercial use, such as when object is brand, in addition to brand name carry out natural language analysis with Outside, can also be in conjunction with the data from object angle, industry and region, the sales situation of brand and market as belonging to brand need The factors such as seek, to classify to brand.
Invention content
The mesh of the present invention includes providing a kind of method and apparatus clustered to object.
According to an aspect of the present invention, a kind of method for being clustered to object in computer equipment is provided, Wherein, this method includes:
The transfer case information of multiple objects is obtained, the transfer case information is used to indicate to be obtained based on object information and be gone For, transfer case that user is in the multiple object;
According to the transfer case information, the multiple object is clustered, obtains the cluster knot of the multiple object Fruit.
According to another aspect of the present invention, it additionally provides a kind of for being clustered to object in computer equipment Device, wherein the device includes:
Device for the transfer case information for obtaining multiple objects, the transfer case information are used to indicate based on object Information acquirement behavior, transfer case that user is in the multiple object;
For according to the transfer case information, being clustered to the multiple object, the poly- of the multiple object is obtained The device of class result.
Compared with prior art, the present invention has the following advantages:1) the solution of the present invention has broken the prejudice of this field, energy It is enough that object is clustered by analyzing transfer case information of the user in object;2) number from object angle is compared According to the scheme carried out to object in the present invention by analyzing transfer case of the user in multiple objects, closer to user angle Degree, more can intuitively reflect understanding of the user to object, therefore, object classification determined by the solution of the present invention is more objective, Accurately;Even if 3) in the data from user perspective, transfer case information of the invention is not common data yet, true On, if clearly referring to the data from user perspective, those skilled in the art are easier it is envisioned that from the user direct Evaluation (such as gives a mark, comments on word).
Description of the drawings
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, of the invention other Feature, objects and advantages will become more apparent upon:
Fig. 1 is the method flow schematic diagram clustered to object of a preferred embodiment of the invention;
Fig. 2 is the structural schematic diagram of the clustering apparatus clustered to object of a preferred embodiment of the invention;
Fig. 3 shows the schematic diagram of transfer path of the user of a preferred embodiment in multiple objects;
Fig. 4 shows the schematic diagram of transfer path of the user of a preferred embodiment in multiple keywords;
Fig. 5 shows the transfer path of a reticular structure from keyword to the transfer path of the reticular structure of object The specific example of conversion;
Fig. 6 show preferred embodiment, from an object to the schematic diagram of the transfer of multiple objects;
Fig. 7 shows a specific example of Fig. 6.
Same or analogous reference numeral represents same or analogous component in attached drawing.
Specific implementation mode
Present invention is further described in detail below in conjunction with the accompanying drawings.
Fig. 1 is the method flow schematic diagram clustered to object of a preferred embodiment of the invention.Wherein, this implementation The method of example mainly realizes that the computer equipment includes the network equipment and user equipment by computer equipment.The network Equipment includes but not limited to single network server, the server group of multiple network servers composition or is based on cloud computing (Cloud Computing the cloud being made of a large amount of computers or network server), wherein cloud computing is one kind of Distributed Calculation, by One super virtual computer of the computer collection composition of a group loose couplings;Network residing for the network equipment includes but not It is limited to internet, wide area network, Metropolitan Area Network (MAN), LAN, VPN network etc..The user equipment includes but not limited to PC machine, tablet electricity Brain, smart mobile phone, PDA, IPTV etc..
It should be noted that the computer equipment and network are only for example, other are existing or are likely to occur from now on Computing device or network are such as applicable to the present invention, should also be included within the scope of the present invention, and include by reference In this.
Method according to the present embodiment includes step S1 and step S2.
In step sl, computer equipment obtains the transfer case information of multiple objects.
Wherein, the object may include any object that can be clustered.Preferably, the object has commerciality Matter.It is highly preferred that the object includes brand.
Wherein, the transfer case information is used to indicate obtains behavior, user the multiple right based on object information Transfer case as in.Wherein, the object information obtains the behavior that behavior includes any information that can be used in obtaining object; For example, it includes obtaining the behavior of object information by searching for keyword related with object that the object information, which obtains behavior,; In another example it includes obtaining object information by clicking and browsing content related with object that the object information, which obtains behavior, Behavior.Wherein, described " obtaining behavior based on object information " indicates that the transfer case reflects user and obtained in object information The transfer case generated in behavior, it is preferable that the transfer case needs to obtain behavior based on object information to be determined;Example Such as, the object search changed in search behavior by counting multiple users, or by counting multiple users in search behavior Change with the associated search key of object, to determine transfer case information etc. of the user in middle object.
Preferably, the transfer case information of the multiple object includes but not limited at least one of following:
1) transfer path information of the user in multiple objects.
Wherein, the transfer path information indicates transfer path of the user in multiple objects.For example, right there are three As Object1, Object2 and Object3, transfer path information indicates transfer path packet of multiple users in three objects It includes:It is transferred to Object2 from Object1, and, it is transferred to Object3 from Object1.
2) transfer number information of the user between each object.
Wherein, the transfer number information indicates transfer number of the user between each object.For example, there are three Object Object1, Object2 and Object3, transfer number information indicate transfer of multiple users between three objects Number includes:It is transferred to Object2 five times from Object1, and, it is transferred to Object3 eight times from Object1.
3) transition probability information of the user between each object.
Wherein, the transition probability information indicates transition probability of the user between each object.For example, there are three Object Object1, Object2 and Object3, transition probability information indicate transfer of multiple users between three objects Probability includes:The probability that Object2 is transferred to from Object1 is 38.46%, and, it is transferred to Object3's from Object1 Probability is 61.54%.
It should be noted that transfer path may be not present between partial objects in multiple objects, (i.e. user is in object Do not carried out transfer in information acquirement behavior between such partial objects), then the transfer number between such partial objects and Transition probability is zero.Furthermore, it is possible to the case where in the presence of the object itself is transferred to from an object;For example, user is searching for Possible continuous several times search for the information of the same object using different search keys in behavior, to generate from an object The case where being transferred to the object itself.
Preferably, which can be used a variety of storage modes.
For example, the transfer case information storage is table, and transfer road of the user in multiple objects is had recorded in table The transfer number and transition probability of diameter and user between each object, as shown in table 1 below.
Transfer path Transfer number Transition probability
Object1→Object2 5 38.46%
Object1→Object3 8 61.54%
Table 1
In another example the transfer case information includes:It is stored as the transfer path of reticular structure, and, in the reticular structure Transfer number and/or transition probability between each node (i.e. between each object).Such as extremely for 9 object Object1 Object9, the transfer case information of 9 objects include transfer path as shown in Figure 3, and, there is arrow connection in Fig. 3 Each node between (such as from Object1 to Object2, from Object1 to Object3, from Object1 to Object4 Deng) transfer number and/or transition probability.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that any be used to indicate obtains behavior, user the multiple right based on object information The transfer case information of transfer case as in, should be included in the scope of the present invention.
Specifically, the mode that computer equipment obtains the transfer case information of multiple objects includes but not limited to:
1) computer equipment directly acquires the transfer case information of pre-determining, multiple object.
For example, computer equipment reads the transfer case of pre-determining, multiple object from local or other equipment Information.
2) step S1 further comprises step S11 and step S12.
In step s 11, computer equipment obtains the transfer case information of multiple keywords.
Wherein, the transfer case information of the multiple keyword is used to indicate obtains behavior, user based on object information Transfer case in multiple keywords.Preferably, the multiple keyword is related to the object in object information acquisition behavior Connection;If for example, object information obtain behavior be object search behavior, keyword can be the search behavior in it is input by user or The search key etc. of selection.
Preferably, the transfer case information of the multiple keyword includes at least one of following:
A) transfer path information of the user in the multiple keyword.
Wherein, transfer path information of the user in the multiple keyword indicates user and turns in multiple keywords Move path.For example, there are three keywords Query1, Query2 and Query3, transfer path information indicates multiple users at this Transfer path in three keywords includes:It is transferred to Query2 from Query1, and, it is transferred to Query3 from Query1.
B) transfer number information of the user between each keyword.
Wherein, transfer number information of the user between each keyword indicates user and turns between each keyword Move number.For example, there are three keywords Query1, Query2 and Query3, transfer number information indicates multiple users at this Transfer number between three keywords includes:It is transferred to Query2 five times from Query1, and, it is transferred to from Query1 Query3 eight times.
It should be noted that transfer path may be not present between Partial key word in multiple keywords, (i.e. user exists In object information acquisition behavior transfer was not carried out between such Partial key word), then between such Partial key word Transfer number is zero.
Preferably, a variety of storage modes can be used in the transfer case information of multiple keyword.
For example, the transfer case information storage is table, and transfer of the user in multiple keywords is had recorded in table The transfer number of path and user between each keyword, as shown in table 2 below.
Transfer path Transfer number
Query1→Query2 5
Query1→Query3 8
Table 2
In another example the transfer case information includes:It is stored as the transfer path of reticular structure, and, in the reticular structure Transfer number between each node (between i.e. each keyword).Such as 9 keyword Query1 to Query9, this 9 The transfer case information of object includes transfer path as shown in Figure 4, and, each node with arrow connection in Fig. 4 it Between transfer number.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that any be used to indicate obtains behavior, user in multiple keywords based on object information In transfer case transfer case information, should be included in the scope of the present invention.
Specifically, the realization method that computer equipment obtains the transfer case information of multiple keywords includes but not limited to:
A) computer equipment directly acquires the transfer case information of pre-determining, multiple keyword.
For example, computer equipment reads the transfer feelings of pre-determining, multiple keyword from local or other equipment Condition information.
B) computer equipment obtains the keyword concern record of at least one user, and is paid close attention to and remembered according to the keyword Record, determines the transfer case information of the multiple keyword.
Wherein, keyword concern record includes the key that the multiple user paid close attention in object information acquisition behavior The temporal information that word and the keyword are concerned.Preferably, it includes search behavior that object information, which obtains behavior, the concern The keyword crossed includes searched keyword;Preferably, it includes navigation patterns that object information, which obtains behavior, described to pay close attention to Keyword includes being clicked come browsing the keyword of contents of object.
Record is paid close attention to preferably for the keyword of each user, computer equipment is according in keyword concern record Including the temporal information that is concerned of keyword, determine between transfer path and each keyword of the user in keyword Transfer number;Also, computer equipment is by merging transfer path and each keyword of each user in keyword Between transfer number, determine the transfer case information of the multiple keyword.
For example, computer equipment obtains the keyword concern record of user A and user B;Wherein, user A and user B Keyword concern record is respectively as shown in the following table 3 and table 4:
The keyword of concern The time that keyword is concerned
Query1 2014-12-13-10:40
Query3 2014-12-13-10:36
Table 3
The keyword of concern The time that keyword is concerned
Query1 2014-11-10-00:14
Query2 2014-11-10-00:23
Table 4
Then the keyword of user A is paid close attention to and is recorded, computer equipment determines transfer path packets of the user A in keyword It includes " Query1 → Query3 ", and the transfer number of " Query1 → Query3 " is 1;Similarly, computer equipment determines user B Transfer path in keyword includes " Query1 → Query2 ", and the transfer number of " Query1 → Query2 " is 1.It connects It, computer equipment merges the transfer number between the transfer path and each keyword of user A and B in keyword, really The transfer case information of fixed the multiple keyword is as shown in table 5 below.
Transfer path Transfer number
Query1→Query2 1
Query1→Query3 1
Table 5
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the realization method of any transfer case information for obtaining multiple keywords, should all include Within the scope of the invention.
In step s 12, computer equipment according to multiple keywords be respectively associated to object and the multiple key The transfer case information of word determines the transfer case information of multiple objects.
Specifically, computer equipment can be according to transfer path information of the user in multiple keywords and multiple keywords Be respectively associated to object, determine transfer path information of the user in multiple objects, also, according to user in each keyword Between transfer number information and multiple keywords be respectively associated to object, determine transfer of the user between each object Number and/or transition probability information.
For example, the transfer case information of multiple keywords is as shown in Table 2 above, and Query1, Query2 and Query3 points It is not associated with to Object1, Object2 and Object3;Computer equipment is according to the transfer case information of keyword shown in table 2 And aforementioned incidence relation, determine that transfer path of the user in object Object1, Object2 and Object3 includes " Object1 → Object2 " and " Object1 → Object3 ", and the transfer number of 2 transfer paths is respectively 5 and 8;It connects It, computer equipment calculates transition probability=5/ of " Object1 → Object2 " according to the transfer number of 2 transfer paths (5+8)=38.46%, transition probability=8/ (5+8)=61.54% of " Object1 → Object3 ", that is, computer equipment Obtain transfer case information as shown in Table 1.
It should be noted that being associated with to same since a user may pay close attention in continuous several times object information acquisition behavior The different keywords (plain keyword is such as searched using the difference of the corresponding same object in multiple search) of one object, therefore, There may be the object transition probability of itself is transferred to from an object, as p shown in Fig. 6 may be present00Deng.The one of Fig. 6 A specific example can be found in Fig. 7.As shown in Figure 7, " Gymboree " probability of itself is transferred to from " Gymboree " may be up to 71.86%.
It should be noted that preferably, computer equipment can be based on following formula come computing object i to the transfer of object j Probability pij
Wherein, aijIndicate the transfer number of object i to object j,Transfer numbers of the expression object i to all objects.
For example, as shown in fig. 6, object Object0 is transferred to its own and other multiple object Object1 extremely Object13;For being transferred to Object8 from object Object1, transition probabilities of the object Objectp0 to object Object8Wherein,Indicate from object Object0 to Object0 itself and object Object1 to All transfer numbers of Object13.
It should be noted that multiple keywords be respectively associated to object can be determined in advance, Fig. 5 is shown from keyword Reticular structure transfer path to object reticular structure transfer path transform instances.In Fig. 5, the reticular structure of top In each node be keyword, each node in the reticular structure of lower section is and the keyword pair in the respective nodes of top The object answered.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the realization method of any transfer case information for obtaining multiple objects, should be included in In the scope of the present invention.
In step s 2, computer equipment is according to the transfer case information of the multiple objects obtained in step sl, to this Multiple objects are clustered, and the cluster result of the multiple object is obtained.
Wherein, the cluster result of the multiple object can behave as diversified forms;For example, the cluster result includes multiple collection It closes, each gathers the object for including and belong to a kind of;In another example the cluster result includes:Object ID and corresponding with the object ID Category IDs, then can determine the classification belonging to the object by the corresponding category IDs of each object ID.
Specifically, computer equipment clusters multiple object according to the transfer case information of multiple objects, obtains The realization method of the cluster result of the multiple object includes but not limited to:
1) computer equipment directly clusters multiple objects according to the transfer case information of multiple objects, obtains more The cluster result of a object.Wherein, the transition probability between two objects or transfer number are higher, then two objects, which are gathered, is A kind of possibility is higher.
For example, the transfer case information that computer equipment obtains in step sl is as shown in aforementioned table 1, then computer equipment Be less than predetermined threshold 60% according to the transition probability between Object1 and Object2 38.46%, determine Object1 and Object2 cannot gather for one kind, also, computer equipment is according to the transition probability 61.54% between Object1 and Object3 More than predetermined threshold 60%, determine that Object1 and Object3 gather for one kind.Then computer equipment acquisition shows as two set Cluster result [Object1, Object3], [Object2];Wherein, which belongs to Same category, Object2 individually belong to a classification.
It should be noted that in the multiple object it is existing gathered for a kind of object in the case of it is (such as multiple right As it is middle may include to have been gathered for a kind of object by artificial or computer equipment operation), in fact it could happen that judgement, which has gathered, is Whether a kind of multiple objects and other one or more objects can gather for a kind of situation, then:One object is with having gathered Transition probability or transfer number are higher between one or more of a kind of multiple objects object, then an object with gathered It is that a kind of possibility is higher to be gathered for a kind of multiple objects;Gather for one or more of a kind of multiple objects object Transition probability between one or more of a kind of multiple objects object is gathered with other or transfer number is higher, then should Gathered for a kind of multiple objects gathered with other be a kind of multiple objects by gathered be one kind possibility it is higher.
2) computer equipment is by based on the transfer distance between the transfer case information acquisition object, to described more A object is clustered, and the cluster result of the multiple object is obtained.
Specifically, computer equipment can first obtain the transfer distance between all objects, further according to transfer distance to multiple Object is clustered, and the cluster result of the multiple object is obtained;Alternatively, computer equipment can perform multiple cluster operation to obtain Multiple objects cluster result, such as in each cluster operation from multiple objects selected section object, and determination the part Transfer distance needed between object, to carry out cluster operation to the partial objects.
Preferably, the transfer distance between the object includes but not limited at least one of following:
A) transfer distance between an object in the multiple object and another object in the multiple object. Wherein, the transfer distance is smaller, and in the multiple object a object is gathered with another object in the multiple object It is bigger for a kind of possibility.
Wherein, the transfer distance between two objects by transfer number information in transfer case information and/or can turn Probabilistic information is moved to determine.
For example, the transfer distance between two objects can be determined by following formula:
Wherein, dijIndicate the transfer distance between object i and object j, pijIndicate that the transfer between object i to object j is general Rate, pjiIndicate that the transition probability between object j to object i, r expression parameters, the parameter can manually be set.
It should be noted that above-mentioned formula can adjust as needed, such as by the (p in formulaij+pji)/2 are adjusted toDeng.
B) transfer distance between an object in the multiple object and multiple objects in the multiple object.Its In, the transfer distance is smaller, and it is one that in the multiple object a object is gathered with multiple objects in the multiple object The possibility of class is bigger.Preferably, multiple objects in the multiple object are usually gathered for one kind.
Wherein, the transfer between an object in the multiple object and multiple objects in the multiple object away from From, can according to the transfer distance between one or more of multiple objects in an object and multiple object object come It determines, it also can be according to the transfer time between one or more of multiple objects in an object to multiple object object Number/transition probability determines.
For example, co-existing in 9 object Object1 to Object9, wherein it is one that an object Object1 and three, which have gathered, Transfer distance between object Object4, Object7 and Object8 of class can be determined by following any mode:
The first:By between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer distance in minimum transfer distance as the transfer between Object1 and Object4, Object7 and Object8 away from From.
Second:By between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer distance in maximum transfer distance as the transfer between Object1 and Object4, Object7 and Object8 away from From.
The third:Between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between three transfer distances calculated, such as averaged, and using result of calculation as Object1 and Object4, Transfer distance between Object7 and Object8.
4th kind:Determine between Object1 and Object4, between Object1 and Object7, Object1 and Object8 Between transfer number/transition probability in maximum transfer number/transition probability, and according to the maximum transfer number/transfer Probability seeks transfer distance, as the transfer distance between Object1 and Object4, Object7 and Object8.
5th kind:Determine between Object1 and Object4, between Object1 and Object7, Object1 and Object8 Between transfer number/transition probability in minimum transfer number/transition probability, and according to transfer number/transfer of the minimum Probability seeks transfer distance, as the transfer distance between Object1 and Object4, Object7 and Object8.
6th kind:Between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer number/transition probability calculated, such as averaged, and transfer distance is sought according to result of calculation, as Transfer distance between Object1 and Object4, Object7 and Object8.
C) transfer between other multiple objects in the multiple objects and the multiple object in the multiple object away from From.Wherein, the transfer distance is smaller, multiple objects in the multiple object and other multiple objects in the multiple object It is bigger for the possibility of one kind by being gathered.
Wherein, the transfer between other multiple objects in the multiple objects and the multiple object in the multiple object Distance, can be according between one or more of one or more of multiple object object and other multiple objects object Transfer distance determine, also can be according to one in one or more of multiple object object and other multiple objects Or between multiple objects between transfer number/transition probability determine.
For example, co-existing in 9 object Object1 to Object9, wherein gathered for a kind of two object Object1 and Object3 and two transfer distance gathered between a kind of object Object4 and Object8, can be by following any Mode determines:
The first:By between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, transfer distance minimum in the transfer distance between Object3 and Object8 as Object1 and Object3 with Transfer distance between Object4 and Object8.
Second:By between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, in the transfer distance between Object3 and Object8 maximum transfer distance as Object1 and Object3 with Transfer distance between Object4 and Object8.
The third:Between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, the shifting science and technology in four directions distance between Object3 and Object8 calculated, such as averaged, and using result of calculation as Transfer distance between Object1 and Object3 and Object4 and Object8.
4th kind:Determine between Object1 and Object4, between Object1 and Object8, Object3 and Object4 Between, maximum transfer number/transition probability in transfer number/transition probability between Object3 and Object8, and according to Maximum transfer number/the transition probability seeks transfer distance, as Object1 and Object3 and Object4 and Object8 Between transfer distance.
5th kind:Determine between Object1 and Object4, between Object1 and Object8, Object3 and Object4 Between, transfer number/transition probability minimum in transfer number/transition probability between Object3 and Object8, and according to Transfer number/transition probability of the minimum seeks transfer distance, as Object1 and Object3 and Object4 and Object8 Between transfer distance.
6th kind:Between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, transfer number/transition probability between Object3 and Object8 calculated, such as averaged, and according to calculating As a result transfer distance is sought, as the transfer distance between Object1 and Object3 and Object4 and Object8.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the transfer distance between any object, should be included in the scope of the present invention.
Realization method 2 as step S2) one of preferred embodiment, step S2 further comprise step S21, step S22, Step S23, step S24 and step S25.
In the step s 21, computer equipment selects first part's object and second part object in the multiple object.
Wherein, first part's object, which can be one or more of the multiple object object, second part object, to be The one or more objects different from first part's object in the multiple object.Preferably, when first part's object or second Should include that first part's object of multiple objects or second part object belong to a kind of when partial objects are multiple.
In step S22, computer equipment, which obtains, to be based on having with first part's object and the second part object Transfer distance between the transfer case information of pass determines, first part's object and second part object.
It should be noted that before step S22, the transfer distance between first part's object and second part object can It can have existed;For example, transfer distance between first part's object and second part object may be previous in step Determined by computer equipment etc..
Preferably, when the transfer distance between first part's object and second part object is existing, computer Equipment directly reads the transfer distance between first part's object and second part object.As computer equipment is directly read Transfer distance between local already present first part's object and second part object.
In the absence of the transfer distance between first part's object and second part object, computer equipment according to Determined based on the transfer case information between first part's object and the second part object, first part's object One or more of transfer distance between one or more of object and the second part object object, determine described Transfer distance between a part of object and second part object.Wherein, how to determine between two objects, an object with it is more The mode of transfer distance between a object, between multiple objects and multiple objects is directed to the " transfer between object aforementioned Distance " is described in detail in illustrating, details are not described herein.If in addition, one or more of first part's object object with this Transfer distance between one or more of two partial objects object has existed before the execution of this step, then in this step In directly read, if not yet obtaining the transfer distance when this step executes, need be based on first part's object and institute The transfer case information between second part object is stated to determine the transfer distance.
In step S23, computer equipment according to the transfer distance between first part's object and second part object, Determine whether first part's object gathers with second part object for one kind.
Wherein, the transfer distance is smaller, and first part's object is gathered higher for the possibility of one kind with second part object; The transfer distance is bigger, and first part's object is gathered smaller for the possibility of one kind with second part object.
In step s 24, computer equipment reselects first part's object and second part object, wherein selects again It was not carried out cluster operation between the first part's object selected and second part object.
In step s 25, computer equipment repeats step S22, step S23 and step S24, until acquisition is the multiple right The cluster result of elephant.Preferably, various ways can be used in computer equipment, to determine whether having obtained the poly- of the multiple object Class result;For example, whether number of repetition has been more than predetermined repetition threshold value, if be not present and be not carried out the first of cluster operation Partial objects and second part object etc..
It gives an example below, this preferred embodiment is better described:
For example, co-existing in 5 objects Object1, Object2, Object3, Object4, Object5.
In the step s 21, computer equipment selects Object1 as first part's object, selects Object2 as second Partial objects.Then, in step S22, computer equipment is according to the transfer case information between Object1 and Object2, really Determine transfer distance between Object1 and Object2.Then, in step S23, computer equipment according to Object1 with Transfer distance between Object2 determines that Object1 and Object2 gathers for one kind;Then, in step s 24, computer equipment It selects to have gathered for a kind of Object1 and Object2, as first part's object, to select Object3 as second part object.
Then, computer equipment repeats step S22 to step S23, determines that Object1 and Object2 and Object3 cannot Gather for one kind, and repeat step S24, select to have gathered for a kind of Object1 and Object2, as first part's object, to select Object4 is as second part object.
Then, computer equipment repeats step S22 to step S23, determines that Object1 and Object2 and Object4 cannot Gather for one kind, and repeat step S24, select to have gathered for a kind of Object1 and Object2, as first part's object, to select Object5 is as second part object.
Then, computer equipment repeats step S22 to step S23, determines that Object1 and Object2 and Object5 cannot Gather for one kind, and repeat step S24, selects Object3 as first part's object, select Object4 as second part pair As.
Then, computer equipment repeats step S22 to step S23, determines that Object3 and Object4 gather for one kind, lays equal stress on Multiple step S24 select to have gathered Object3 and Object4 for one kind as first part's object, selects Object5 as the Two partial objects.
Then, computer equipment repeats step S22 to step S23, determines that Object3 and Object4 gather with Object5 and is One kind, and step S24 is repeated, it selects to have gathered for a kind of Object1 and Object2 as first part's object, selection has gathered It is a kind of Object3, Object4 and Object5 as second part object.
Then, computer equipment repeats step S22 to step S23, determine Object1 and Object2 and Object3, Object4, Object5 cannot gather for one kind.Also, computer equipment, which judges currently to be not present, did not carried out the first of cluster Partial objects and second part object stop cluster operation.Then object Object1, Object2, Object3, Object4, The cluster result of Object5 is:[Object1, Object2], [Object3, Object4, Object5].
In the prior art, natural language analysis is usually carried out by the description text to object, to classify to object. Particularly, when object is related to commercial use, such as when object is brand, the influence that artificial supervisor judges is received, in addition to object Title carries out other than natural language analysis, can also in conjunction with from the data of object angle, industry and region as belonging to object, The factors such as the sales situation and the market demand of object, to classify to object.That is, to design commercial use object into When row classification, those skilled in the art can have such prejudice:According to the commercial data from object angle, to right As classifying.
The solution of the present invention has broken above-mentioned prejudice, can by analyze transfer case information of the user in object come pair Object is clustered;Also, compared to the data from object angle, by analyzing user in multiple objects in the present invention Transfer case more can intuitively reflect understanding of the user to object come the scheme clustered to object closer to user perspective, Therefore, object classification determined by the solution of the present invention is more objective, accurate;Even if in addition, in the number from user perspective In, transfer case information of the invention is not common data yet, if in fact, clearly referring to the number from user perspective According to those skilled in the art are easier it is envisioned that direct evaluation from the user (such as give a mark, comment on word).
Fig. 2 is the structural schematic diagram of the clustering apparatus clustered to object of a preferred embodiment of the invention.This is poly- Class device can mount in computer equipment, which includes:Dress for the transfer case information for obtaining multiple objects It sets (hereinafter referred to as " acquisition device 1 "), for according to the transfer case information, being clustered to the multiple object, obtains The device (hereinafter referred to as " sub- clustering apparatus 2 ") of the cluster result of the multiple object.
Acquisition device 1 obtains the transfer case information of multiple objects.
Wherein, the object may include any object that can be clustered.Preferably, the object has commerciality Matter.It is highly preferred that the object includes brand.
Wherein, the transfer case information is used to indicate obtains behavior, user the multiple right based on object information Transfer case as in.Wherein, the object information obtains the behavior that behavior includes any information that can be used in obtaining object; For example, it includes obtaining the behavior of object information by searching for keyword related with object that the object information, which obtains behavior,; In another example it includes obtaining object information by clicking and browsing content related with object that the object information, which obtains behavior, Behavior.Wherein, described " obtaining behavior based on object information " indicates that the transfer case reflects user and obtained in object information The transfer case generated in behavior, it is preferable that the transfer case needs to obtain behavior based on object information to be determined;Example Such as, the object search changed in search behavior by counting multiple users, or by counting multiple users in search behavior Change with the associated search key of object, to determine transfer case information etc. of the user in middle object.
Preferably, the transfer case information of the multiple object includes but not limited at least one of following:
1) transfer path information of the user in multiple objects.
Wherein, the transfer path information indicates transfer path of the user in multiple objects.For example, right there are three As Object1, Object2 and Object3, transfer path information indicates transfer path packet of multiple users in three objects It includes:It is transferred to Object2 from Object1, and, it is transferred to Object3 from Object1.
2) transfer number information of the user between each object.
Wherein, the transfer number information indicates transfer number of the user between each object.For example, there are three Object Object1, Object2 and Object3, transfer number information indicate transfer of multiple users between three objects Number includes:It is transferred to Object2 five times from Object1, and, it is transferred to Object3 eight times from Object1.
3) transition probability information of the user between each object.
Wherein, the transition probability information indicates transition probability of the user between each object.For example, there are three Object Object1, Object2 and Object3, transition probability information indicate transfer of multiple users between three objects Probability includes:The probability that Object2 is transferred to from Object1 is 38.46%, and, it is transferred to Object3's from Object1 Probability is 61.54%.
It should be noted that transfer path may be not present between partial objects in multiple objects, (i.e. user is in object Do not carried out transfer in information acquirement behavior between such partial objects), then the transfer number between such partial objects and Transition probability is zero.Furthermore, it is possible to the case where in the presence of the object itself is transferred to from an object;For example, user is searching for Possible continuous several times search for the information of the same object using different search keys in behavior, to generate from an object The case where being transferred to the object itself.
Preferably, which can be used a variety of storage modes.
For example, the transfer case information storage is table, and transfer road of the user in multiple objects is had recorded in table The transfer number and transition probability of diameter and user between each object, as shown in aforementioned table 1.
In another example the transfer case information includes:It is stored as the transfer path of reticular structure, and, in the reticular structure Transfer number and/or transition probability between each node (i.e. between each object).Such as extremely for 9 object Object1 Object9, the transfer case information of 9 objects include transfer path as shown in Figure 3, and, there is arrow connection in Fig. 3 Each node between (such as from Object1 to Object2, from Object1 to Object3, from Object1 to Object4 Deng) transfer number and/or transition probability.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that any be used to indicate obtains behavior, user the multiple right based on object information The transfer case information of transfer case as in, should be included in the scope of the present invention.
Specifically, the mode that acquisition device 1 obtains the transfer case information of multiple objects includes but not limited to:
1) acquisition device 1 is standby directly acquires pre-determining, multiple object transfer case information.
For example, acquisition device 1 reads the transfer case letter of pre-determining, multiple object from local or other equipment Breath.
2) acquisition device 1 further comprises the device of the transfer case information for obtaining multiple keywords (hereinafter referred to as It is " the first sub- acquisition device ", not shown) and for according to multiple keywords be respectively associated to object and the multiple key The transfer case information of word, determine the transfer case information of multiple objects device (hereinafter referred to as " the first determining device ", figure not Show).
First sub- acquisition device obtains the transfer case information of multiple keywords.
Wherein, the transfer case information of the multiple keyword is used to indicate obtains behavior, user based on object information Transfer case in multiple keywords.Preferably, the multiple keyword is related to the object in object information acquisition behavior Connection;If for example, object information obtain behavior be object search behavior, keyword can be the search behavior in it is input by user or The search key etc. of selection.
Preferably, the transfer case information of the multiple keyword includes at least one of following:
A) transfer path information of the user in the multiple keyword.
Wherein, transfer path information of the user in the multiple keyword indicates user and turns in multiple keywords Move path.For example, there are three keywords Query1, Query2 and Query3, transfer path information indicates multiple users at this Transfer path in three keywords includes:It is transferred to Query2 from Query1, and, it is transferred to Query3 from Query1.
B) transfer number information of the user between each keyword.
Wherein, transfer number information of the user between each keyword indicates user and turns between each keyword Move number.For example, there are three keywords Query1, Query2 and Query3, transfer number information indicates multiple users at this Transfer number between three keywords includes:It is transferred to Query2 five times from Query1, and, it is transferred to from Query1 Query3 eight times.
It should be noted that transfer path may be not present between Partial key word in multiple keywords, (i.e. user exists In object information acquisition behavior transfer was not carried out between such Partial key word), then between such Partial key word Transfer number is zero.
Preferably, a variety of storage modes can be used in the transfer case information of multiple keyword.
For example, the transfer case information storage is table, and transfer of the user in multiple keywords is had recorded in table The transfer number of path and user between each keyword, as shown in Table 2 above.
In another example the transfer case information includes:It is stored as the transfer path of reticular structure, and, in the reticular structure Transfer number between each node (between i.e. each keyword).Such as 9 keyword Query1 to Query9, this 9 The transfer case information of object includes transfer path as shown in Figure 4, and, each node with arrow connection in Fig. 4 it Between transfer number.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that any be used to indicate obtains behavior, user in multiple keywords based on object information In transfer case transfer case information, should be included in the scope of the present invention.
Specifically, the realization method that the first sub- acquisition device obtains the transfer case information of multiple keywords includes but unlimited In:
A) the first sub- acquisition device directly acquires the transfer case information of pre-determining, multiple keyword.
For example, the first sub- acquisition device reads turn from local or other equipment Condition of shifting one's love information.
B) the first sub- acquisition device obtains the keyword concern record of at least one user, and is paid close attention to according to the keyword Record, determines the transfer case information of the multiple keyword.
Wherein, keyword concern record includes the key that the multiple user paid close attention in object information acquisition behavior The temporal information that word and the keyword are concerned.Preferably, it includes search behavior that object information, which obtains behavior, the concern The keyword crossed includes searched keyword;Preferably, it includes navigation patterns that object information, which obtains behavior, described to pay close attention to Keyword includes being clicked come browsing the keyword of contents of object.
Record is paid close attention to preferably for the keyword of each user, the first sub- acquisition device is paid close attention to according to the keyword to be remembered The temporal information that the keyword for including in record is concerned determines transfer path and each keyword of the user in keyword Between transfer number;Also, the first sub- acquisition device is by merging transfer path in keyword of each user and each Transfer number between a keyword determines the transfer case information of the multiple keyword.
For example, the first sub- acquisition device obtains the keyword concern record of user A and user B;Wherein, user A and use The keyword concern record of family B is respectively as shown in aforementioned table 3 and table 4.
Then the keyword of user A is paid close attention to and is recorded, the first sub- acquisition device determines transfer roads of the user A in keyword Diameter includes " Query1 → Query3 ", and the transfer number of " Query1 → Query3 " is 1;Similarly, the first sub- acquisition device Determine that transfer paths of the user B in keyword includes " Query1 → Query2 ", and the transfer of " Query1 → Query2 " is secondary Number is 1.Then, between transfer paths and each keyword of first sub- acquisition device merging the user A and B in keyword Transfer number determines the transfer case information of the multiple keyword as shown in aforementioned table 5.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the realization method of any transfer case information for obtaining multiple keywords, should all include Within the scope of the invention.
First determining device according to multiple keywords be respectively associated to object and the multiple keyword transfer feelings Condition information determines the transfer case information of multiple objects.
Specifically, the first determining device can be according to transfer path information of the user in multiple keywords and multiple keys Word be respectively associated to object, determine transfer path information of the user in multiple objects, also, the first determining device according to Transfer number information and multiple keyword of the family between each keyword be respectively associated to object, determine user each Transfer number between object and/or transition probability information.
For example, the transfer case information of multiple keywords is as shown in Table 2 above, and Query1, Query2 and Query3 points It is not associated with to Object1, Object2 and Object3;First determining device is believed according to the transfer case of keyword shown in table 2 Breath and aforementioned incidence relation, determine that transfer path of the user in object Object1, Object2 and Object3 includes " Object1 → Object2 " and " Object1 → Object3 ", and the transfer number of 2 transfer paths is respectively 5 and 8;It connects , the first determining device according to the transfer numbers of 2 transfer paths, calculate the transition probability of " Object1 → Object2 "= 5/ (5+8)=38.46%, transition probability=8/ (5+8)=61.54% of " Object1 → Object3 ", that is, first determines Device obtains the transfer case information as shown in aforementioned table 1.
It should be noted that being associated with to same since a user may pay close attention in continuous several times object information acquisition behavior The different keywords (plain keyword is such as searched using the difference of the corresponding same object in multiple search) of one object, therefore, There may be the object transition probability of itself is transferred to from an object, as p shown in Fig. 6 may be present00Deng.The one of Fig. 6 A specific example can be found in Fig. 7.As shown in Figure 7, " Gymboree " probability of itself is transferred to from " Gymboree " may be up to 71.86%.
It should be noted that preferably, the first determining device can be turned based on following formula come computing object i to object j Move Probability pij
Wherein, aijIndicate the transfer number of object i to object j,Transfer numbers of the expression object i to all objects.
For example, as shown in fig. 6, object Object0 is transferred to its own and other multiple object Object1 extremely Object13;For being transferred to Object8 from object Object1, transition probabilities of the object Objectp0 to object Object8Wherein,Indicate from object Object0 to Object0 itself and object Object1 to All transfer numbers of Object13.
It should be noted that multiple keywords be respectively associated to object can be determined in advance, Fig. 5 is shown from keyword Reticular structure transfer path to object reticular structure transfer path transform instances.In Fig. 5, the reticular structure of top In each node be keyword, each node in the reticular structure of lower section is and the keyword pair in the respective nodes of top The object answered.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the realization method of any transfer case information for obtaining multiple objects, should be included in In the scope of the present invention.
The transfer case information for multiple objects that sub- clustering apparatus 2 is obtained according to acquisition device 1 carries out multiple object Cluster, obtains the cluster result of the multiple object.
Wherein, the cluster result of the multiple object can behave as diversified forms;For example, the cluster result includes multiple collection It closes, each gathers the object for including and belong to a kind of;In another example the cluster result includes:Object ID and corresponding with the object ID Category IDs, then can determine the classification belonging to the object by the corresponding category IDs of each object ID.
Specifically, sub- clustering apparatus 2 clusters multiple object, obtains according to the transfer case information of multiple objects The realization method of cluster result of the multiple object includes but not limited to:
1) sub- clustering apparatus 2 directly clusters multiple objects according to the transfer case information of multiple objects, obtains The cluster result of multiple objects.Wherein, the transition probability between two objects or transfer number are higher, then two objects are gathered It is higher for a kind of possibility.
For example, the transfer case information that acquisition device 1 obtains is as shown in aforementioned table 1, then 2 basis of sub- clustering apparatus Transition probability 38.46% between Object1 and Object2 is less than predetermined threshold 60%, determines Object1 and Object2 It cannot gather for one kind, also, sub- clustering apparatus 2 is more than pre- according to the transition probability 61.54% between Object1 and Object3 Determine threshold value 60%, determines that Object1 and Object3 gather for one kind.Then sub- clustering apparatus 2 obtains the cluster for showing as two set As a result [Object1, Object3], [Object2];Wherein, which belongs to same class Not, Object2 individually belongs to a classification.
It should be noted that in the multiple object it is existing gathered for a kind of object in the case of it is (such as multiple right As it is middle may include to have been gathered for a kind of object by artificial or clustering apparatus operation), in fact it could happen that it is one to judge to have gathered Whether multiple objects of class and other one or more objects can gather for a kind of situation, then:One object and to have gathered be one Transition probability or transfer number are higher between one or more of multiple objects of class object, then an object is with having gathered A kind of multiple objects are gathered higher for the possibility of one kind;Gathered for one or more of a kind of multiple objects object with Other have gathered transition probability between one or more of a kind of multiple objects object or transfer number is higher, then this has been Gather for a kind of multiple objects gathered with other be a kind of multiple objects by gathered be one kind possibility it is higher.
2) sub- clustering apparatus 2 is by based on the transfer distance between the transfer case information acquisition object, to described Multiple objects are clustered, and the cluster result of the multiple object is obtained.
Specifically, sub- clustering apparatus 2 can first obtain the transfer distance between all objects, further according to transfer distance to multiple Object is clustered, and the cluster result of the multiple object is obtained;Alternatively, sub- clustering apparatus 2 can perform multiple cluster operation Obtain the cluster result of multiple objects, such as in each cluster operation from multiple objects selected section object, and determine the portion Transfer distance needed between point object, to carry out cluster operation to the partial objects.
Preferably, the transfer distance between the object includes but not limited at least one of following:
A) transfer distance between an object in the multiple object and another object in the multiple object. Wherein, the transfer distance is smaller, and in the multiple object a object is gathered with another object in the multiple object It is bigger for a kind of possibility.
Wherein, the transfer distance between two objects by transfer number information in transfer case information and/or can turn Probabilistic information is moved to determine.
For example, the transfer distance between two objects can be determined by following formula:
Wherein, dijIndicate the transfer distance between object i and object j, pijIndicate that the transfer between object i to object j is general Rate, pjiIndicate that the transition probability between object j to object i, r expression parameters, the parameter can manually be set.
It should be noted that above-mentioned formula can adjust as needed, such as by the (p in formulaij+pji)/2 are adjusted toDeng.
B) transfer distance between an object in the multiple object and multiple objects in the multiple object.Its In, the transfer distance is smaller, and it is one that in the multiple object a object is gathered with multiple objects in the multiple object The possibility of class is bigger.Preferably, multiple objects in the multiple object are usually gathered for one kind.
Wherein, the transfer between an object in the multiple object and multiple objects in the multiple object away from From, can according to the transfer distance between one or more of multiple objects in an object and multiple object object come It determines, it also can be according to the transfer time between one or more of multiple objects in an object to multiple object object Number/transition probability determines.
For example, co-existing in 9 object Object1 to Object9, wherein it is one that an object Object1 and three, which have gathered, Transfer distance between object Object4, Object7 and Object8 of class can be determined by following any mode:
The first:By between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer distance in minimum transfer distance as the transfer between Object1 and Object4, Object7 and Object8 away from From.
Second:By between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer distance in maximum transfer distance as the transfer between Object1 and Object4, Object7 and Object8 away from From.
The third:Between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between three transfer distances calculated, such as averaged, and using result of calculation as Object1 and Object4, Transfer distance between Object7 and Object8.
4th kind:Determine between Object1 and Object4, between Object1 and Object7, Object1 and Object8 Between transfer number/transition probability in maximum transfer number/transition probability, and according to the maximum transfer number/transfer Probability seeks transfer distance, as the transfer distance between Object1 and Object4, Object7 and Object8.
5th kind:Determine between Object1 and Object4, between Object1 and Object7, Object1 and Object8 Between transfer number/transition probability in minimum transfer number/transition probability, and according to transfer number/transfer of the minimum Probability seeks transfer distance, as the transfer distance between Object1 and Object4, Object7 and Object8.
6th kind:Between Object1 and Object4, between Object1 and Object7, Object1 and Object8 it Between transfer number/transition probability calculated, such as averaged, and transfer distance is sought according to result of calculation, as Transfer distance between Object1 and Object4, Object7 and Object8.
C) transfer between other multiple objects in the multiple objects and the multiple object in the multiple object away from From.Wherein, the transfer distance is smaller, multiple objects in the multiple object and other multiple objects in the multiple object It is bigger for the possibility of one kind by being gathered.
Wherein, the transfer between other multiple objects in the multiple objects and the multiple object in the multiple object Distance, can be according between one or more of one or more of multiple object object and other multiple objects object Transfer distance determine, also can be according to one in one or more of multiple object object and other multiple objects Or between multiple objects between transfer number/transition probability determine.
For example, co-existing in 9 object Object1 to Object9, wherein gathered for a kind of two object Object1 and Object3 and two transfer distance gathered between a kind of object Object4 and Object8, can be by following any Mode determines:
The first:By between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, transfer distance minimum in the transfer distance between Object3 and Object8 as Object1 and Object3 with Transfer distance between Object4 and Object8.
Second:By between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, in the transfer distance between Object3 and Object8 maximum transfer distance as Object1 and Object3 with Transfer distance between Object4 and Object8.
The third:Between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, the shifting science and technology in four directions distance between Object3 and Object8 calculated, such as averaged, and using result of calculation as Transfer distance between Object1 and Object3 and Object4 and Object8.
4th kind:Determine between Object1 and Object4, between Object1 and Object8, Object3 and Object4 Between, maximum transfer number/transition probability in transfer number/transition probability between Object3 and Object8, and according to Maximum transfer number/the transition probability seeks transfer distance, as Object1 and Object3 and Object4 and Object8 Between transfer distance.
5th kind:Determine between Object1 and Object4, between Object1 and Object8, Object3 and Object4 Between, transfer number/transition probability minimum in transfer number/transition probability between Object3 and Object8, and according to Transfer number/transition probability of the minimum seeks transfer distance, as Object1 and Object3 and Object4 and Object8 Between transfer distance.
6th kind:Between Object1 and Object4, between Object1 and Object8, Object3 and Object4 it Between, transfer number/transition probability between Object3 and Object8 calculated, such as averaged, and according to calculating As a result transfer distance is sought, as the transfer distance between Object1 and Object3 and Object4 and Object8.
It should be noted that the above-mentioned examples are merely illustrative of the technical solutions of the present invention, rather than to the limit of the present invention System, it should be appreciated by those skilled in the art that the transfer distance between any object, should be included in the scope of the present invention.
Realization method 2 as sub- clustering apparatus 2) one of preferred embodiment, sub- clustering apparatus 2 further comprise for The device of first part's object and second part object is selected in multiple objects, and (hereinafter referred to as " first choice device ", figure is not Show), for obtain it is being determined based on transfer case information related with first part's object and the second part object, Device (hereinafter referred to as " the second sub- acquisition device ", the figure of transfer distance between first part's object and second part object Do not show), for according to the transfer distance between first part's object and second part object, determine first part's object with Whether second part object gather for a kind of device (hereinafter referred to as " the second determining device ", not shown), for reselecting the The device (hereinafter referred to as " the second selection device ", not shown) of a part of object and second part object and for triggering described Two sub- acquisition device, second determining device, second selection device repeat operation, until acquisition is the multiple right The device (hereinafter referred to as " trigger device ", not shown) of the cluster result of elephant.
First choice device selects first part's object and second part object in multiple objects.
Wherein, first part's object, which can be one or more of the multiple object object, second part object, to be The one or more objects different from first part's object in the multiple object.Preferably, when first part's object or second Should include that first part's object of multiple objects or second part object belong to a kind of when partial objects are multiple.
Second sub- acquisition device, which obtains, is based on transfer related with first part's object and the second part object Transfer distance between situation information determines, first part's object and second part object.
It should be noted that before the second sub- acquisition device executes operation, first part's object and second part object Between transfer distance may have existed;For example, the transfer distance between first part's object and second part object may Device determination etc. is clustered in operating previous.
Preferably, when the transfer distance between first part's object and second part object is existing, the second son Acquisition device directly reads the transfer distance between first part's object and second part object.Such as the second sub- acquisition device Directly read the transfer distance between local already present first part's object and second part object.
In the absence of the transfer distance between first part's object and second part object, the second sub- acquisition device According to determined based on the transfer case information between the first part's object and the second part object, first part Transfer distance between one or more of one or more of object object and the second part object object, determines institute State the transfer distance between first part's object and second part object.Wherein, how to determine between two objects, an object The mode of transfer distance between multiple objects, between multiple objects and multiple objects, aforementioned for " between object Transfer distance " is described in detail in illustrating, details are not described herein.If in addition, one or more of first part's object object with Transfer distance between one or more of second part object object before the second sub- acquisition device executes operation Through existing, then the second sub- acquisition device is directly read, if not yet obtaining this turn when the second sub- acquisition device executes operation Distance is moved, then needs to determine the transfer based on the transfer case information between first part's object and the second part object Distance.
Second determining device according to the transfer distance between first part's object and second part object, determine this first Whether partial objects gather with second part object for one kind.
Wherein, the transfer distance is smaller, and first part's object is gathered higher for the possibility of one kind with second part object; The transfer distance is bigger, and first part's object is gathered smaller for the possibility of one kind with second part object.
Second selection device reselects first part's object and second part object, wherein first reselected Divide and was not carried out cluster operation between object and second part object.
Trigger device triggers the second sub- acquisition device, second determining device, second selection device and repeats Operation is executed, until obtaining the cluster result of the multiple object.Preferably, various ways can be used in trigger device, to judge Whether the cluster result of the multiple object has been obtained;For example, whether number of repetition has been more than predetermined repetition threshold value, if not In the presence of the first part's object and second part object etc. for being not carried out cluster operation.
It gives an example below, this preferred embodiment is better described:
For example, co-existing in 5 objects Object1, Object2, Object3, Object4, Object5.
First choice device selects Object1 as first part's object, selects Object2 as second part object. Then, the second sub- acquisition device is according to the transfer case information between Object1 and Object2, determine Object1 with Transfer distance between Object2.Then, the second determining device is determined according to transfer distance between Object1 and Object2 Object1 and Object2 gathers for one kind;Then, the second selection device selects to have gathered and make for a kind of Object1 and Object2 For first part's object, select Object3 as second part object.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object1 and Object2 and Object3 cannot gather for one kind, and trigger device triggers the second selection device and repeats operation, It selects to have gathered as a kind of Object1 and Object2 as first part's object, selects Object4 as second part pair As.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object1 and Object2 and Object4 cannot gather for one kind, and trigger device triggers the second selection device and repeats operation, It selects to have gathered as a kind of Object1 and Object2 as first part's object, selects Object5 as second part pair As.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object1 and Object2 and Object5 cannot gather for one kind, and trigger device triggers the second selection device and repeats operation, It selects the Object3 as first part's object, selects Object4 as second part object.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object3 and Object4 gathers for one kind, and trigger device triggers the second selection device and repeats operation, come select to have gathered for A kind of Object3 and Object4 selects Object5 as second part object as first part's object.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object3 and Object4 and Object5 gather for one kind, and trigger device triggers the second selection device and repeats operation, to select Select and gathered for a kind of Object1 and Object2 as first part's object, select to have gathered the Object3 for one kind, Object4 and Object5 is as second part object.
Then, trigger device triggers the second sub- acquisition device and the second determining device repeats operation, to determine Object1 and Object2 and Object3, Object4, Object5 cannot gather for one kind.Also, trigger device judges currently There is no the first part's objects and second part object that did not carried out cluster, stop cluster operation.Then object Object1, The cluster result of Object2, Object3, Object4, Object5 is:[Object1, Object2], [Object3, Object4, Object5].
In the prior art, natural language analysis is usually carried out by the description text to object, to classify to object. Particularly, when object is related to commercial use, such as when object is brand, the influence that artificial supervisor judges is received, in addition to object Title carries out other than natural language analysis, can also in conjunction with from the data of object angle, industry and region as belonging to object, The factors such as the sales situation and the market demand of object, to classify to object.That is, to design commercial use object into When row classification, those skilled in the art can have such prejudice:According to the commercial data from object angle, to right As classifying.
The solution of the present invention has broken above-mentioned prejudice, can by analyze transfer case information of the user in object come pair Object is clustered;Also, compared to the data from object angle, by analyzing user in multiple objects in the present invention Transfer case carrys out the scheme carried out to object, closer to user perspective, more can intuitively reflect understanding of the user to object, because This, object classification determined by the solution of the present invention is more objective, accurate;Even if in addition, in the data from user perspective In, transfer case information of the invention is not common data yet, if in fact, clearly refer to the data from user perspective, Those skilled in the art are easier it is envisioned that direct evaluation from the user (such as give a mark, comment on word).
It should be noted that the present invention can be carried out in the assembly of software and/or software and hardware, for example, this hair Application-specific integrated circuit (ASIC) can be used in bright each device or any other is realized similar to hardware device.In one embodiment In, software program of the invention can be executed by processor to realize steps described above or function.Similarly, of the invention Software program (including relevant data structure) can be stored in computer readable recording medium storing program for performing, for example, RAM memory, Magnetic or optical driver or floppy disc and similar devices.In addition, hardware can be used to realize in some steps or function of the present invention, example Such as, coordinate to execute the circuit of each step or function as with processor.
It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie In the case of without departing substantially from spirit or essential attributes of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power Profit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent requirements of the claims Variation includes within the present invention.Any reference signs in the claims should not be construed as limiting the involved claims.This Outside, it is clear that one word of " comprising " is not excluded for other units or step, and odd number is not excluded for plural number.That is stated in system claims is multiple Unit or device can also be realized by a unit or device by software or hardware.The first, the second equal words are used for table Show title, and does not represent any particular order.

Claims (16)

1. a kind of method for being clustered to object in computer equipment, wherein this method includes:
The transfer case information of multiple objects is obtained, the transfer case information is used to indicate obtains behavior based on object information , the transfer case that user is in the multiple object;
According to the transfer case information, the multiple object is clustered, obtains the cluster result of the multiple object;
Wherein, the step of obtaining the transfer case information include:
Obtain the transfer case information of multiple keywords, wherein the transfer case information of the multiple keyword is used to indicate base Behavior, transfer case that user is in multiple keywords are obtained in object information;
According to the multiple keyword be respectively associated to multiple objects and the multiple keyword transfer case information, really Determine transfer case information of the user in the multiple object;
Wherein, the step of transfer case information for obtaining the multiple keyword includes:
The keyword concern record of at least one user is obtained, keyword concern record includes that the multiple user believes in object The temporal information that the keyword and the keyword paid close attention in breath acquisition behavior are concerned;
It is paid close attention to and is recorded according to the keyword, determine the transfer case information of the multiple keyword.
2. according to the method described in claim 1, wherein, described the step of being clustered, includes:
By based on the transfer distance between the transfer case information acquisition object, to be clustered to the multiple object, Obtain the cluster result of the multiple object.
3. according to the method described in claim 2, wherein, described the step of being clustered, includes:
First part's object and second part object are selected in the multiple object;
Obtain it is being determined based on transfer case information related with first part's object and the second part object, this Transfer distance between a part of object and second part object;
According to the transfer distance between first part's object and second part object, first part's object and second are determined Divide whether object gathers for one kind;
Reselect first part's object and second part object, wherein the first part's object and second part reselected Cluster operation was not carried out between object;
Repeat the transfer distance obtained between first part's object and second part object, the determination first part pair As whether gathering for a kind of, described the step of reselecting first part's object and second part object with second part object, directly To the cluster result for obtaining the multiple object.
4. according to the method described in claim 3, wherein, obtaining and turning between first part's object and second part object Move apart from the step of include:
When the transfer distance between first part's object and second part object is existing, described first is directly read Divide the transfer distance between object and second part object;
In the absence of the transfer distance between first part's object and second part object, according to based on described first One or more of the transfer case information divided between object and the second part object determines, first part's object Transfer distance between one or more of object and the second part object object determines first part's object and Transfer distance between two partial objects.
5. method according to any one of claim 2 to 4, wherein the transfer distance between the object includes following At least one of:
Transfer distance between an object in the multiple object and another object in the multiple object;
Transfer distance between an object in the multiple object and multiple objects in the multiple object;
The transfer distance between other multiple objects in multiple objects and the multiple object in the multiple object.
6. according to the method described in claim 1, wherein, the transfer case information of the multiple keyword includes following at least one :
Transfer path information of the user in the multiple keyword;
Transfer number information of the user between each keyword.
7. method according to claim 1 to 4, wherein the transfer case information of the multiple object includes At least one of below:
Transfer path information of the user in the multiple object;
Transfer number information of the user between each object;
Transition probability information of the user between each object.
8. method according to claim 1 to 4, wherein the object includes brand.
9. a kind of device for being clustered to object in computer equipment, wherein the device includes:
Device for the transfer case information for obtaining multiple objects, the transfer case information are used to indicate based on object information Acquisition behavior, transfer case that user is in the multiple object;
For according to the transfer case information, being clustered to the multiple object, the cluster knot of the multiple object is obtained The device of fruit;
Wherein, the device for obtaining the transfer case information includes:
Device for the transfer case information for obtaining multiple keywords, wherein the transfer case information of the multiple keyword It is used to indicate transfer cases based on object information acquisition behavior, that user is in multiple keywords;
For according to the multiple keyword be respectively associated to multiple objects and the multiple keyword transfer case believe Breath, determines the device of transfer case information of the user in the multiple object;
Wherein, the device of the transfer case information for obtaining the multiple keyword includes:
Keyword for obtaining at least one user pays close attention to the device of record, and keyword concern record includes the multiple use The temporal information that the keyword and the keyword that family was paid close attention in object information acquisition behavior are concerned;
It is recorded for being paid close attention to according to the keyword, determines the device of the transfer case information of the multiple keyword.
10. device according to claim 9, wherein include for the device clustered:
For by based on the transfer distance between the transfer case information acquisition object, to gather to the multiple object Class obtains the device of the cluster result of the multiple object.
11. device according to claim 10, wherein include for the device clustered:
Device for selecting first part's object and second part object in the multiple object;
For obtain it is being determined based on transfer case information related with first part's object and the second part object, The device of transfer distance between first part's object and second part object;
For according to the transfer distance between first part's object and second part object, determining first part's object and Whether two partial objects gather for a kind of device;
Device for reselecting first part's object and second part object, wherein the first part's object reselected It was not carried out cluster operation between second part object;
For triggering device for obtaining the transfer distance between first part's object and second part object, for determine should Whether first part's object gathers with second part object for a kind of device, for reselecting first part's object and second The device of object is divided to repeat operation, until obtaining the device of the cluster result of the multiple object.
12. according to the devices described in claim 11, wherein for obtain first part's object and second part object it Between the device of transfer distance include:
For when the transfer distance between first part's object and second part object is existing, directly reading described The device of transfer distance between a part of object and second part object;
For in the absence of the transfer distance between first part's object and second part object, according to based on described One that transfer case information between a part of object and the second part object determines, in first part's object or Transfer distance between one or more of multiple objects and the second part object object determines first part's object The device of transfer distance between second part object.
13. device according to any one of claims 10 to 12, wherein the transfer distance between the object include with It is at least one of lower:
Transfer distance between an object in the multiple object and another object in the multiple object;
Transfer distance between an object in the multiple object and multiple objects in the multiple object;
The transfer distance between other multiple objects in multiple objects and the multiple object in the multiple object.
14. device according to claim 9, wherein the transfer case information of the multiple keyword include it is following at least One:
Transfer path information of the user in the multiple keyword;
Transfer number information of the user between each keyword.
15. the device according to any one of claim 9 to 12, wherein the transfer case packet of the multiple object It includes at least one of following:
Transfer path information of the user in the multiple object;
Transfer number information of the user between each object;
Transition probability information of the user between each object.
16. the device according to any one of claim 9 to 12, wherein the object includes brand.
CN201510090184.XA 2015-02-27 2015-02-27 A kind of method and apparatus that object is clustered Active CN104731867B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510090184.XA CN104731867B (en) 2015-02-27 2015-02-27 A kind of method and apparatus that object is clustered

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510090184.XA CN104731867B (en) 2015-02-27 2015-02-27 A kind of method and apparatus that object is clustered

Publications (2)

Publication Number Publication Date
CN104731867A CN104731867A (en) 2015-06-24
CN104731867B true CN104731867B (en) 2018-09-07

Family

ID=53455754

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510090184.XA Active CN104731867B (en) 2015-02-27 2015-02-27 A kind of method and apparatus that object is clustered

Country Status (1)

Country Link
CN (1) CN104731867B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110069542B (en) * 2017-09-26 2021-06-29 北京国双科技有限公司 Keyword evaluation method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5504887A (en) * 1993-09-10 1996-04-02 International Business Machines Corporation Storage clustering and packing of objects on the basis of query workload ranking
CN101527000A (en) * 2009-04-03 2009-09-09 南京航空航天大学 Fast movable object orbit clustering method based on sampling
CN104142950A (en) * 2013-05-10 2014-11-12 中国人民大学 Microblog user classifying method based on keyword extraction and gini coefficient
CN104199969A (en) * 2014-09-22 2014-12-10 北京国双科技有限公司 Webpage data analysis method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5504887A (en) * 1993-09-10 1996-04-02 International Business Machines Corporation Storage clustering and packing of objects on the basis of query workload ranking
CN101527000A (en) * 2009-04-03 2009-09-09 南京航空航天大学 Fast movable object orbit clustering method based on sampling
CN104142950A (en) * 2013-05-10 2014-11-12 中国人民大学 Microblog user classifying method based on keyword extraction and gini coefficient
CN104199969A (en) * 2014-09-22 2014-12-10 北京国双科技有限公司 Webpage data analysis method and device

Also Published As

Publication number Publication date
CN104731867A (en) 2015-06-24

Similar Documents

Publication Publication Date Title
Yin et al. Adapting to user interest drift for poi recommendation
CN105447730B (en) Target user orientation method and device
Cai et al. What are popular: exploring twitter features for event detection, tracking and visualization
Zhang et al. Evaluating geo-social influence in location-based social networks
WO2019233258A1 (en) Method, apparatus and system for sending information, and computer-readable storage medium
Pham et al. S3g2: A scalable structure-correlated social graph generator
KR101347407B1 (en) Digital image retrieval by aggregating search results based on visual annotations
US8161050B2 (en) Visualizing hyperlinks in a search results list
JP2013225319A (en) Visual multidimensional retrieval
CN102135983A (en) Group dividing method and device based on network user behavior
CN111125422A (en) Image classification method and device, electronic equipment and storage medium
CN105723402A (en) Systems and methods for determining influencers in a social data network
Mo et al. Event recommendation in social networks based on reverse random walk and participant scale control
Ying et al. Semantic trajectory-based high utility item recommendation system
CN108959595B (en) Website construction and experience method and device based on virtual and reality
US10135723B2 (en) System and method for supervised network clustering
CN109190033A (en) A kind of user's friend recommendation method and system
US20140147048A1 (en) Document quality measurement
Lim et al. A topological approach for detecting twitter communities with common interests
US20170236226A1 (en) Computerized systems, processes, and user interfaces for globalized score for a set of real-estate assets
Zhao et al. Detecting profilable and overlapping communities with user-generated multimedia contents in LBSNs
Wang et al. Knowledge graph-based spatial-aware user community preference query algorithm for lbsns
CN104252487B (en) A kind of method and apparatus for generating entry information
Zhu et al. Exploring sequential probability tree for movement-based community discovery
CN104731867B (en) A kind of method and apparatus that object is clustered

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant