CN104750789B - The recommendation method and device of label - Google Patents

Publication number
CN104750789B
Authority
CN
China
Prior art keywords: label, candidate, user, data, application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510107973.XA
Other languages
Chinese (zh)
Other versions
CN104750789A (en)
Inventor
李国洪
匡柘溪
杨帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201510107973.XA
Publication of CN104750789A
Application granted
Publication of CN104750789B
Legal status: Active (current)
Anticipated expiration

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a method and device for recommending labels. In embodiments of the present invention, at least one candidate label is obtained as a recommended label according to the description data of candidate labels and the description data of the user, where the description data of the user is obtained from the behavior data the user generated in the determined application and the behavior data the user generated in other applications besides that application. The obtained recommended labels can then be displayed to the user. Because the recommended labels are obtained from behavior data the user generated across the whole network, they are likely to be labels the user is interested in. This guides the user to perform related operations according to the recommended labels, such as label customization, and thereby effectively improves the operating efficiency of labels.

Description

Method and device for recommending labels
【Technical field】
The present invention relates to label recommendation technology, and in particular to a method and device for recommending labels.
【Background art】
A social label (social tagging), referred to simply as a label, is a flexible and engaging way of classifying content: it allows users to freely annotate various resources, such as web pages, academic papers, and multimedia. Social labels help users organize, classify, and query information, and are widely used in social tagging websites (for example, Flickr, Picasa, YouTube, Plaxo), blogs (for example, Blogger, WordPress, LiveJournal), encyclopedias (for example, Wikipedia, PBWiki), microblogs (for example, Twitter, Jaiku), and similar systems.
Recommending labels of interest to users has become an active research topic.
【Summary of the invention】
Aspects of the present invention provide a method and device for recommending labels, so as to recommend labels of interest to a user.
One aspect of the present invention provides a method for recommending labels, including:
determining the application currently used by the user;
obtaining at least one candidate label as a recommended label according to the description data of candidate labels and the description data of the user, where the description data of the user is obtained from the historical behavior data of the user, and the historical behavior data of the user includes behavior data the user generated in the application and behavior data the user generated in other applications besides the application; and
displaying the recommended label.
In the above aspect and any possible implementation, an implementation is further provided in which the description data of a candidate label includes at least one of the semantic description data of the candidate label, the distribution description data of the candidate label, and the behavior description data of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the semantic description data of a candidate label includes at least one of the following:
the label name of the candidate label;
the keywords under the candidate label;
the label names of extended labels semantically related to the candidate label; and
extended keywords semantically related to the keywords under the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which, before obtaining at least one candidate label as a recommended label according to the description data of candidate labels and the description data of the user, the method further includes:
obtaining a text collection according to a candidate label; and
obtaining the keywords under that candidate label according to the text collection.
In the above aspect and any possible implementation, an implementation is further provided in which the distribution description data of a candidate label includes at least one of the following:
the category distribution of the candidate label; and
the topic distribution of the candidate label, where the topic distribution includes distribution information over M specific topics, and M is an integer greater than or equal to 1.
In the above aspect and any possible implementation, an implementation is further provided in which the behavior description data of a candidate label includes at least one of the following:
related labels associated with the behavior of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the description data of a candidate label further includes the application characteristic data of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the application characteristic data of a candidate label includes at least one of the following:
the popularity of the candidate label in the application; and
the time distribution of the candidate label in the application.
Another aspect of the present invention provides a device for recommending labels, including:
a determination unit, configured to determine the application currently used by the user;
an obtaining unit, configured to obtain at least one candidate label as a recommended label according to the description data of candidate labels and the description data of the user, where the description data of the user is obtained from the historical behavior data of the user, and the historical behavior data of the user includes behavior data the user generated in the application and behavior data the user generated in other applications besides the application; and
a display unit, configured to display the recommended label.
In the above aspect and any possible implementation, an implementation is further provided in which the description data of a candidate label includes at least one of the semantic description data of the candidate label, the distribution description data of the candidate label, and the behavior description data of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the semantic description data of a candidate label includes at least one of the following:
the label name of the candidate label;
the keywords under the candidate label;
the label names of extended labels semantically related to the candidate label; and
extended keywords semantically related to the keywords under the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the device further includes a mining unit, configured to:
obtain a text collection according to a candidate label; and
obtain the keywords under that candidate label according to the text collection.
In the above aspect and any possible implementation, an implementation is further provided in which the distribution description data of a candidate label includes at least one of the following:
the category distribution of the candidate label; and
the topic distribution of the candidate label, where the topic distribution includes distribution information over M specific topics, and M is an integer greater than or equal to 1.
In the above aspect and any possible implementation, an implementation is further provided in which the behavior description data of a candidate label includes at least one of the following:
related labels associated with the behavior of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the description data of a candidate label further includes the application characteristic data of the candidate label.
In the above aspect and any possible implementation, an implementation is further provided in which the application characteristic data of a candidate label includes at least one of the following:
the popularity of the candidate label in the application; and
the time distribution of the candidate label in the application.
As can be seen from the above technical solutions, in embodiments of the present invention, at least one candidate label is obtained as a recommended label according to the description data of candidate labels and the description data of the user, where the description data of the user is obtained from the behavior data the user generated in the determined application and the behavior data the user generated in other applications besides that application. The obtained recommended labels can then be displayed to the user. Because the recommended labels are obtained from behavior data the user generated across the whole network, they are likely to be labels the user is interested in. This guides the user to perform related operations according to the recommended labels, such as label customization, and thereby effectively improves the operating efficiency of labels.
In addition, with the technical solution provided by the present invention, because the description data of a candidate label includes at least one of the semantic description data, the distribution description data, and the behavior description data of the candidate label, the candidate label can be described from multiple dimensions, which effectively improves the reliability of label recommendation.
In addition, with the technical solution provided by the present invention, adding the application characteristic data of a candidate label to its description data allows the candidate label to be described along the application dimension without describing it separately for each application, which effectively improves the efficiency of label recommendation.
In addition, with the technical solution provided by the present invention, because the behavior data the user generated in other applications besides the determined application is taken into account, the user can be described along the dimension of those other applications. Label recommendation therefore remains possible when the application is in a cold-start situation, and the coverage of label recommendation is effectively improved as well.
【Brief description of the drawings】
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a label recommendation method provided by an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of a label recommendation device provided by another embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a label recommendation device provided by another embodiment of the present invention.
【Detailed description】
To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.
It should be noted that the terminals involved in the embodiments of the present invention may include, but are not limited to, mobile phones, personal digital assistants (Personal Digital Assistant, PDA), wireless handheld devices, tablet computers (Tablet Computer), personal computers (Personal Computer, PC), MP3 players, MP4 players, and wearable devices (for example, smart glasses, smart watches, and smart bracelets).
In addition, the term "and/or" in this document merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean A alone, both A and B, or B alone. The character "/" in this document generally indicates an "or" relationship between the objects before and after it.
Fig. 1 is a schematic flowchart of a label recommendation method provided by an embodiment of the present invention. As shown in Fig. 1:
101. Determine the application currently used by the user.
102. Obtain at least one candidate label as a recommended label according to the description data of candidate labels and the description data of the user. The description data of the user is obtained from the historical behavior data of the user, and the historical behavior data of the user includes behavior data the user generated in the application and behavior data the user generated in other applications besides the application.
The application currently used by the user and the other applications are different applications. They may be applications installed on different types of terminals, for example, the Taobao application installed on a PC and the Taobao application installed on a mobile phone. They may also be different applications installed on the same type of terminal, for example, the Baidu Maps application and the Baidu Search application installed on a mobile phone. This embodiment imposes no particular limitation on this.
It can be understood that the other applications besides the one currently used by the user can also be approximately regarded as all applications on the entire internet, including the one currently in use.
103. Display the recommended label.
It should be noted that some or all of 101 to 103 may be executed by an application located on a local terminal, by a functional unit such as a plug-in or software development kit (Software Development Kit, SDK) within an application on the local terminal, by a processing engine in a network-side server, or by a network-side distributed system. This embodiment imposes no particular limitation on this.
It can be understood that the application may be a native program (nativeApp) installed on the terminal, or a web program (webApp) running in a browser on the terminal. This embodiment imposes no particular limitation on this.
In this way, at least one candidate label is obtained as a recommended label according to the description data of candidate labels and the description data of the user, where the description data of the user is obtained from the behavior data the user generated in the determined application and the behavior data the user generated in other applications besides that application. The obtained recommended labels can then be displayed to the user. Because the recommended labels are obtained from behavior data the user generated across the whole network, they are likely to be labels the user is interested in. This guides the user to perform related operations according to the recommended labels, such as label customization, and thereby effectively improves the operating efficiency of labels.
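The flow of 101 to 103 can be illustrated with a minimal sketch. The text does not say how the description data of a candidate label is compared with the description data of the user, so the sketch assumes both are weighted term dictionaries compared by cosine similarity. The names `Description`, `cosine`, and `recommend_labels` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Description:
    """Weighted terms standing in for 'description data' (an assumption)."""
    weights: dict = field(default_factory=dict)


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dictionaries."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = sum(w * w for w in a.values()) ** 0.5
    norm_b = sum(w * w for w in b.values()) ** 0.5
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend_labels(candidates, user_desc, top_k=2):
    """Step 102 (assumed scoring): rank candidate labels against the user."""
    scored = [
        (cosine(desc.weights, user_desc.weights), name)
        for name, desc in candidates.items()
    ]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [name for score, name in scored[:top_k] if score > 0]


if __name__ == "__main__":
    candidates = {
        "travel": Description({"flight": 1.0, "hotel": 1.0}),
        "cooking": Description({"recipe": 1.0, "oven": 1.0}),
        "photography": Description({"camera": 1.0, "lens": 1.0}),
    }
    # User description built from behavior in the current and other apps.
    user = Description({"hotel": 2.0, "camera": 1.0})
    print(recommend_labels(candidates, user))  # step 103: display
```

Any other similarity or ranking model could take the place of the cosine score; the point is only that candidate and user descriptions share a comparable representation.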
Optionally, in a possible implementation of this embodiment, the description data of a candidate label used in 102 may include, but is not limited to, at least one of the semantic description data of the candidate label, the distribution description data of the candidate label, and the behavior description data of the candidate label. This embodiment imposes no particular limitation on this.
The semantic description data of a candidate label describes, at a fine granularity and from the perspective of the label's own semantics, the meaning the candidate label carries. Because a candidate label carries little semantic information on its own, the content data provided by the application currently used by the user can additionally be fed to an unsupervised machine learning model, for example Word2Vec or RandomWalk, to extend the label's own semantic description with semantically related terms.
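As a rough illustration of semantic expansion, the sketch below assumes that term vectors have already been learned by some embedding model (the text names Word2Vec as one option) and simply picks the nearest neighbors of a label name by cosine similarity. The vectors are toy values and `nearest_terms` is a hypothetical helper, not part of any library.

```python
import math


def nearest_terms(term, vectors, top_n=2):
    """Return the top_n terms whose vectors are closest (by cosine) to `term`."""
    def cos(u, v):
        dot = sum(x * y for x, y in zip(u, v))
        norm_u = math.sqrt(sum(x * x for x in u))
        norm_v = math.sqrt(sum(y * y for y in v))
        return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

    target = vectors[term]
    scored = [
        (cos(target, vec), other)
        for other, vec in vectors.items()
        if other != term
    ]
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [other for _, other in scored[:top_n]]


if __name__ == "__main__":
    # Toy 3-dimensional vectors; real embeddings would be learned from the
    # content data of the application (an assumption for illustration).
    toy_vectors = {
        "basketball": [0.9, 0.1, 0.0],
        "nba": [0.8, 0.2, 0.0],
        "football": [0.7, 0.0, 0.3],
        "recipe": [0.0, 0.1, 0.9],
    }
    print(nearest_terms("basketball", toy_vectors))
```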
In a specific implementation, the semantic description data of a candidate label may include, but is not limited to, at least one of the following:
the label name of the candidate label;
the keywords under the candidate label;
the label names of extended labels semantically related to the candidate label; and
extended keywords semantically related to the keywords under the candidate label.
The keywords under a candidate label are words, phrases, or similar content that express the subject matter of the candidate label. Specifically, before 102, a text collection can be obtained according to a candidate label, for example, the question-and-answer content data corresponding to that label in a Q&A application, and the keywords under the candidate label can then be obtained from the text collection.
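The text does not specify how keywords are extracted from the text collection. One simple possibility, shown here purely as an assumption, is to count term frequency after removing stop words. The stop-word list and the function name `keywords_for_label` are illustrative.

```python
import re
from collections import Counter

# A tiny illustrative stop-word list (an assumption, not from the patent).
STOPWORDS = {"the", "a", "is", "to", "how", "what", "for", "of", "in", "and"}


def keywords_for_label(text_collection, top_n=3):
    """Pick the most frequent non-stop-word terms in a label's text collection."""
    counts = Counter()
    for text in text_collection:
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOPWORDS)
    # Most frequent first; ties broken alphabetically for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]


if __name__ == "__main__":
    # Hypothetical Q&A titles collected under the candidate label "travel".
    texts = [
        "How to book a cheap flight",
        "What is the best hotel for a family",
        "Cheap hotel and flight deals",
    ]
    print(keywords_for_label(texts))
```

For Chinese text the regular-expression tokenizer would be replaced by a word segmenter, and frequency could be replaced by a weighting such as TF-IDF.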
The distribution description data of a candidate label describes, at a coarse granularity and from the perspective of the label system, the concept and classification hierarchy the candidate label belongs to.
In a specific implementation, the distribution description data of a candidate label may include, but is not limited to, at least one of the following:
the category distribution of the candidate label; and
the topic distribution of the candidate label, where the topic distribution includes distribution information over M specific topics, and M is an integer greater than or equal to 1.
Specifically, in one example, the category distribution and the topic distribution of a candidate label can be set manually.
Specifically, in another example, a machine mining method can be used to generate a classifier over several specified categories. For example, historical labels and the content data associated with them can be used to train a historical-label classifier over the specified categories, and that classifier is then used to generate the category distribution of a candidate label.
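The text does not name the classifier. As one possible reading, the sketch below trains a small multinomial naive Bayes model on tokens from historical labels and their associated content, then normalizes the per-category scores into a category distribution. The choice of naive Bayes, the toy data, and the function names are all assumptions.

```python
import math
from collections import Counter, defaultdict


def train(samples):
    """samples: list of (category, tokens) built from historical labels."""
    word_counts = defaultdict(Counter)
    cat_counts = Counter()
    for category, tokens in samples:
        cat_counts[category] += 1
        word_counts[category].update(tokens)
    vocab = {t for counter in word_counts.values() for t in counter}
    return word_counts, cat_counts, vocab


def category_distribution(tokens, model):
    """Return a normalized {category: probability} for a candidate label."""
    word_counts, cat_counts, vocab = model
    total = sum(cat_counts.values())
    log_scores = {}
    for cat in cat_counts:
        # Laplace smoothing so unseen tokens do not zero out a category.
        denom = sum(word_counts[cat].values()) + len(vocab)
        score = math.log(cat_counts[cat] / total)
        for t in tokens:
            score += math.log((word_counts[cat][t] + 1) / denom)
        log_scores[cat] = score
    peak = max(log_scores.values())
    exp = {c: math.exp(s - peak) for c, s in log_scores.items()}
    norm = sum(exp.values())
    return {c: v / norm for c, v in exp.items()}


if __name__ == "__main__":
    history = [
        ("sports", ["nba", "game", "score"]),
        ("sports", ["football", "game"]),
        ("food", ["recipe", "oven"]),
        ("food", ["recipe", "dinner"]),
    ]
    model = train(history)
    print(category_distribution(["game", "score"], model))
```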
Specifically, in another example, a supervised method can be used to generate a classifier over M specific topics. For example, the number of specific topics and the definition of each specific topic are determined, training data is collected, and the training data is used to train a historical-label classifier over the M specific topics. That classifier is then used to generate the topic distribution of a candidate label. This method has high precision but low recall.
Specifically, in another example, an unsupervised method can be used to build a topic model over M specific topics. For example, a historical label and the titles in the content data associated with it can be combined into a segment, and the segment is word-segmented to generate training data. The training data is used to train a topic model (Topic Model). The topic model is then optimized, for example by semantics-based deletion and semantics-based deduplication, to obtain a topic model over M specific topics. That topic model is then used to generate the topic distribution of a candidate label. This method has slightly lower precision but high recall.
It can be understood that, in another example, the classifier over M specific topics generated by the supervised method and the topic model over M specific topics built by the unsupervised method can also be integrated to obtain a more reliable model over the M specific topics.
Specifically, the classifier over M specific topics generated by the supervised method can be used to generate the topic distribution of a candidate label. For example, the historical labels in the historical behavior data of all users are collected as candidate labels, and the classifier is then used to generate the topic distribution of the candidate labels.
Specifically, the topic model over M specific topics built by the unsupervised method can be used to generate the topic distribution of a candidate label. For example, the historical labels in the historical behavior data of all users are collected as candidate labels, together with the titles in the content data associated with each candidate label. A candidate label and those titles are combined into a segment, the segment is word-segmented to produce a segmentation result, and the topic model is then used to generate the topic distribution of the candidate label.
It can be understood that the topic distributions generated by the two methods above can also be integrated to obtain a more reliable topic distribution for a candidate label.
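How the two topic distributions are integrated is not stated. A simple option, given here as an assumption, is linear interpolation followed by renormalization. The `weight` parameter and the function name `merge_topic_distributions` are hypothetical.

```python
def merge_topic_distributions(supervised, unsupervised, weight=0.5):
    """Blend two {topic: probability} distributions and renormalize.

    `weight` is the share given to the supervised (high-precision) result;
    the remainder goes to the unsupervised (high-recall) result.
    """
    topics = set(supervised) | set(unsupervised)
    blended = {
        t: weight * supervised.get(t, 0.0)
        + (1.0 - weight) * unsupervised.get(t, 0.0)
        for t in topics
    }
    total = sum(blended.values())
    return {t: v / total for t, v in blended.items()} if total else blended


if __name__ == "__main__":
    from_classifier = {"sports": 0.8, "travel": 0.2}
    from_topic_model = {"sports": 0.4, "travel": 0.4, "food": 0.2}
    print(merge_topic_distributions(from_classifier, from_topic_model))
```

A larger `weight` favors precision and a smaller one favors recall, which mirrors the trade-off between the two methods described above.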
The behavior description data of a candidate label describes the dynamic behavioral features of the candidate label from the perspective of the feedback behavior users produce after a label recommendation.
In a specific implementation, the behavior description data of a candidate label may include, but is not limited to, at least one of the following:
related labels associated with the behavior of the candidate label.
Specifically, in one example, the behavior of a candidate label may be the customization behavior for the candidate label, or the application behavior for the candidate label, such as question-and-answer behavior in a Q&A application. This embodiment imposes no particular limitation on this.
For example, the historical question-and-answer behavior data corresponding to a candidate label can be used to compute the co-occurrence probability between that candidate label and other labels. The other labels whose co-occurrence probability satisfies a preset correlation condition are determined to be the related labels associated with the behavior of the candidate label.
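The co-occurrence computation can be sketched as follows. The sketch assumes that each historical Q&A record carries a set of labels, estimates the conditional probability that another label appears given that the candidate label does, and uses a probability threshold as the preset correlation condition. The threshold value and the name `related_labels` are illustrative.

```python
from collections import Counter


def related_labels(records, label, min_prob=0.5):
    """Find labels that co-occur with `label` often enough.

    records: iterable of label sets, one per historical Q&A record.
    Returns [(other_label, P(other | label)), ...] with P >= min_prob.
    """
    label_count = 0
    pair_counts = Counter()
    for labels in records:
        if label in labels:
            label_count += 1
            pair_counts.update(l for l in set(labels) if l != label)
    if label_count == 0:
        return []
    probs = {other: n / label_count for other, n in pair_counts.items()}
    ranked = sorted(probs.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(other, p) for other, p in ranked if p >= min_prob]


if __name__ == "__main__":
    records = [
        {"python", "pandas"},
        {"python", "numpy", "pandas"},
        {"python", "django"},
        {"java", "spring"},
    ]
    print(related_labels(records, "python"))
```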
Optionally, in a possible implementation of this embodiment, the description data of a candidate label used in 102 may further include, but is not limited to, the application characteristic data of the candidate label. This embodiment imposes no particular limitation on this.
The application characteristic data of a candidate label likewise describes the dynamic behavioral features of the candidate label from the perspective of the feedback behavior users produce after a label recommendation.
In a specific implementation, the application characteristic data of a candidate label may include, but is not limited to, at least one of the following:
the popularity of the candidate label in the application; and
the time distribution of the candidate label in the application.
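The two application characteristics can be computed from an application's event log. The text defines neither quantity precisely, so the sketch assumes that popularity is the label's share of all logged label events and that the time distribution is the share of the label's events falling in each hour of the day. The log format and the name `label_app_stats` are hypothetical.

```python
from collections import Counter
from datetime import datetime


def label_app_stats(events, label):
    """Compute (popularity, hour_distribution) for `label` in one application.

    events: list of (label_name, ISO-8601 timestamp) pairs from the app log.
    """
    hours = Counter(
        datetime.fromisoformat(ts).hour
        for name, ts in events
        if name == label
    )
    hits = sum(hours.values())
    popularity = hits / len(events) if events else 0.0
    distribution = (
        {h: n / hits for h, n in sorted(hours.items())} if hits else {}
    )
    return popularity, distribution


if __name__ == "__main__":
    log = [
        ("travel", "2015-03-01T09:15:00"),
        ("travel", "2015-03-01T09:40:00"),
        ("travel", "2015-03-02T21:05:00"),
        ("food", "2015-03-02T12:00:00"),
    ]
    print(label_app_stats(log, "travel"))
```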
Optionally, in a possible implementation of this embodiment, the description data of the user used in 102 may include, but is not limited to, the description data about the user in the application currently used by the user and the description data about the user in other applications besides the one currently used by the user. This embodiment imposes no particular limitation on this.
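One way to combine the two sources of user description data is sketched below. It assumes each source is a dictionary of interest-term counts and, as an illustrative cold-start rule, gives the other applications full weight when the user has little history in the current application. The parameters `min_current` and `other_weight` and the function name are hypothetical.

```python
def user_description(current_app, other_apps, min_current=3, other_weight=0.5):
    """Combine interest counts from the current app and from other apps.

    current_app: {term: count} from behavior in the currently used app.
    other_apps:  list of {term: count}, one per other application.
    """
    combined = dict(current_app)
    # Cold start: with little history in the current app, rely fully on
    # what is known about the user from other applications.
    cold_start = sum(current_app.values()) < min_current
    weight = 1.0 if cold_start else other_weight
    for interests in other_apps:
        for term, count in interests.items():
            combined[term] = combined.get(term, 0.0) + weight * count
    return combined


if __name__ == "__main__":
    # New user in the current app, with history in two other apps.
    print(user_description({}, [{"hotel": 2}, {"camera": 1}]))
    # Established user: other apps contribute at reduced weight.
    print(user_description({"recipe": 4}, [{"hotel": 2}]))
```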
The description data about the user in the application currently used by the user can be obtained from the behavior data the user generated in that application.
In a specific implementation, the description data about the user in the currently used application may include, but is not limited to, at least one of the semantic description data of the user, the distribution description data of the user, and the behavior description data of the user. This embodiment imposes no particular limitation on this.
The semantic description data of the user may include, but is not limited to, at least one of the user's interest description in the currently used application and the user's set of historically customized labels.
Specifically, in one example, the user's interest description in the currently used application, for example a set of interest words, can be obtained from the user's registration information in that application.
Specifically, in another example, the user's interest description in the currently used application, for example a set of interest words, can be obtained from the user's historical behavior data in that application.
The distribution description data of the user may include, but is not limited to, at least one of the category distribution and the topic distribution of the user's interests in the currently used application.
Specifically, in one example, a machine mining method can be used to generate a classifier over several specified categories. For example, user interests (the collected interests of all users) and the content data associated with them can be used to train a user-interest classifier over the specified categories. That classifier is then used to generate the category distribution of the user's interests in the currently used application.
Specifically, in another example, a supervised method can be used to generate a classifier over M specific topics. For example, the number of specific topics and the definition of each specific topic are determined, training data is collected, and the training data is used to train a user-interest classifier over the M specific topics. That classifier is then used to generate the topic distribution of the user's interests in the currently used application. This method has high precision but low recall.
Specifically, in another example, an unsupervised method can be used to build a topic model over M specific topics. For example, a user interest and the titles in the content data associated with it can be combined into a segment, and the segment is word-segmented to generate training data. The training data is used to train a topic model (Topic Model). The topic model is then optimized, for example by semantics-based deletion and semantics-based deduplication, to obtain a topic model over M specific topics. That topic model is then used to generate the topic distribution of the user's interests in the currently used application. This method has slightly lower precision but high recall.
It can be understood that, in another example, the classifier over M specific topics generated by the supervised method and the topic model over M specific topics built by the unsupervised method can also be integrated to obtain a more reliable model over the M specific topics.
Specifically, the classifier over M specific topics generated by the supervised method can be used to generate the topic distribution of the user's interests in the currently used application. For example, the interests in the user's historical behavior data in the currently used application are collected as the user's interests in that application, and the classifier is then used to generate their topic distribution.
Specifically, the topic model over M specific topics built by the unsupervised method can be used to generate the topic distribution of the user's interests in the currently used application. For example, the interests in the historical behavior data of all users are collected as user interests in the currently used application, together with the titles in the content data associated with those interests. The interests and those titles are combined into a segment, the segment is word-segmented to produce a segmentation result, and the topic model is then used to generate the topic distribution of the user's interests in the currently used application.
It can be understood that the topic distributions of the user's interests generated by the two methods above can also be integrated to obtain a more reliable topic distribution of the user's interests in the currently used application.
The behavior description data of the user may include, but is not limited to, at least one item of behavior description data of associated users who are related to the user.
Specifically, in one example, the behavior description data of an associated user describes the behavior of that associated user. It may be the associated user's customization behavior, or the associated user's application behavior, such as question-and-answer behavior in a Q&A application; this embodiment does not specifically limit this.
In another specific implementation, the description data of the user in the application currently used by the user may further include application characteristic data of the user; this embodiment does not specifically limit this.
The application characteristic data of the user may include, but is not limited to, at least one of the following:
The time distribution, within the application, of the user's interests in the currently used application.
The description data of the user in applications other than the one the user is currently using may be obtained from the behavior data generated by the user in those other applications.
In a specific implementation, the description data of the user in applications other than the currently used one may include, but is not limited to, at least one of semantic description data of the user and distribution description data of the user; this embodiment does not specifically limit this.
The semantic description data of the user may include, but is not limited to, at least one of the user's interest words in applications other than the currently used one and the user's set of historically customized labels in those applications.
Specifically, in one example, the user's interest words in the other applications, for example a set of interest words, may be obtained from the user's registration information in the other applications.
Specifically, in another example, the user's interest words in the other applications, for example a set of interest words, may be obtained from the user's historical behavior data in the other applications.
Optionally, in a possible implementation of this embodiment, in 102, a relevance parameter between the description data of a candidate label and the description data of the user may be calculated, and the candidate labels whose relevance parameters satisfy a preset recommendation condition are then taken as recommended labels.
The relevance parameter may include, but is not limited to, at least one of the following:
The relevance score between the semantic description data of the candidate label and the semantic description data of the user;
The relevance score between the distribution description data of the candidate label and the distribution description data of the user;
The relevance score between the behavior description data of the candidate label and the behavior description data of the user; and
The relevance score between the application characteristic data of the candidate label and the application characteristic data of the user.
In a specific implementation, a cross-correlation approach may be used. Each sub-dimension of the user's semantic description data (the user's interest words in the currently used application, the user's set of historically customized labels in the currently used application, the user's interest words in the other applications, and the user's set of historically customized labels in the other applications) is compared with each sub-dimension of the candidate label's semantic description data (the label name of the candidate label, the keywords under the candidate label, the label names of extended labels semantically related to the candidate label, and the extended keywords semantically related to the keywords under the candidate label), and the degree of cross-correlation of each pair is calculated. The weight of each sub-dimension is then applied, in a weighted-fusion manner, to obtain the relevance score between the semantic description data of the candidate label and the semantic description data of the user.
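The cross-correlation and weighted fusion described above can be sketched as follows. The text does not name a correlation measure, so this sketch assumes Jaccard overlap between term sets; the sub-dimension names, the weights, and the function names are all illustrative.

```python
def jaccard(a, b):
    """Overlap between two term sets (assumed measure, not from the source)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def semantic_relevance(user_dims, label_dims, weights):
    """Weighted fusion of pairwise overlaps between sub-dimensions.

    `weights` maps (user_dim, label_dim) to a weight; missing pairs count as 0.
    """
    score = 0.0
    total_weight = 0.0
    for u_name, u_terms in user_dims.items():
        for l_name, l_terms in label_dims.items():
            w = weights.get((u_name, l_name), 0.0)
            score += w * jaccard(u_terms, l_terms)
            total_weight += w
    return score / total_weight if total_weight else 0.0


user_dims = {
    "interest_words_in_app": {"python", "ml"},
    "custom_labels_other_apps": {"ai"},
}
label_dims = {
    "label_name": {"ml"},
    "keywords": {"ml", "ai", "python"},
}
weights = {
    ("interest_words_in_app", "label_name"): 2.0,
    ("interest_words_in_app", "keywords"): 1.0,
    ("custom_labels_other_apps", "keywords"): 1.0,
}
score = semantic_relevance(user_dims, label_dims, weights)
```

Dividing by the total weight keeps the fused score in the range 0 to 1, which makes it easier to combine with the other relevance scores later.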
In another specific implementation, a cross-correlation approach may be used. Each sub-dimension of the user's distribution description data (the category distribution of the user's interests in the currently used application and the theme distribution of the user's interests in the currently used application) is compared with each sub-dimension of the candidate label's distribution description data (the category distribution of the candidate label and the theme distribution of the candidate label), and the degree of cross-correlation is calculated. The weight of each sub-dimension is then applied, in a weighted-fusion manner, to obtain the relevance score between the distribution description data of the candidate label and the distribution description data of the user.
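As a rough illustration of this step, the sketch below compares distributions with cosine similarity and fuses the results with per-dimension weights. Both the similarity measure and the simplification of pairing only like with like (category with category, theme with theme) are assumptions made for this example.

```python
import math


def cosine(p, q):
    """Cosine similarity between two equal-length distributions."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def distribution_relevance(user_dists, label_dists, weights):
    """Weighted fusion of per-dimension similarities (weights are hypothetical)."""
    total_weight = sum(weights.values())
    if not total_weight:
        return 0.0
    fused = sum(w * cosine(user_dists[name], label_dists[name])
                for name, w in weights.items())
    return fused / total_weight


user_dists = {"category": [1.0, 0.0], "theme": [0.5, 0.5]}
label_dists = {"category": [1.0, 0.0], "theme": [1.0, 0.0]}
score = distribution_relevance(user_dists, label_dists,
                               {"category": 1.0, "theme": 1.0})
```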
In another specific implementation, a matrix factorization (Matrix Factorization, MF) algorithm, for example a collaborative filtering (Collaborative Filtering, CF) algorithm, singular value decomposition (Singular Value Decomposition, SVD), or the like, may be applied to the behavior description data of the candidate label and the behavior description data of the user, to obtain the relevance score between the behavior description data of the candidate label and the behavior description data of the user.
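A full matrix factorization is beyond a short example, so the sketch below swaps in a simpler neighborhood-style collaborative filtering: two labels count as behaviorally related when the same users customized both. The data layout (a mapping from user to customized labels) and the function names are assumptions for illustration only.

```python
import math


def label_similarity(customizations, label_a, label_b):
    """Cosine similarity between two labels over the users who customized them."""
    users_a = {u for u, labels in customizations.items() if label_a in labels}
    users_b = {u for u, labels in customizations.items() if label_b in labels}
    if not users_a or not users_b:
        return 0.0
    return len(users_a & users_b) / math.sqrt(len(users_a) * len(users_b))


def behavior_relevance(customizations, user, candidate):
    """Average similarity between the candidate and the user's existing labels."""
    owned = customizations.get(user, set()) - {candidate}
    if not owned:
        return 0.0
    return sum(label_similarity(customizations, candidate, label)
               for label in owned) / len(owned)


customizations = {
    "alice": {"python", "ml"},
    "bob": {"python", "ml", "go"},
    "carol": {"go"},
    "dave": {"python"},
}
ml_score = behavior_relevance(customizations, "dave", "ml")
go_score = behavior_relevance(customizations, "dave", "go")
```

In this toy data "ml" scores higher than "go" for the user "dave", because more of the users who customized "python" also customized "ml".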
In another specific implementation, the application characteristic data of the candidate label and the application characteristic data of the user may be combined into a query state, which is looked up in a heuristic rule base; the query result is taken as the relevance score between the application characteristic data of the candidate label and the application characteristic data of the user.
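The rule-base lookup can be pictured as a dictionary keyed by the combined state. The feature names (`popularity`, `peak_period`, `active_period`), the rule values, and the default score below are invented for illustration; the source does not specify the contents of the rule base.

```python
def application_relevance(label_features, user_features, rule_base, default=0.0):
    """Combine both feature sets into one query state and look it up."""
    state = (
        label_features["popularity"],    # e.g. how hot the label is in the app
        label_features["peak_period"],   # when the label is most active
        user_features["active_period"],  # when the user's interests are active
    )
    return rule_base.get(state, default)


# Hypothetical heuristic rules: state -> relevance score.
rule_base = {
    ("hot", "evening", "evening"): 0.9,
    ("hot", "evening", "morning"): 0.5,
    ("cold", "evening", "evening"): 0.4,
}
score = application_relevance(
    {"popularity": "hot", "peak_period": "evening"},
    {"active_period": "evening"},
    rule_base,
)
```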
After these relevance scores are obtained, each relevance score may be taken separately as a relevance parameter between the description data of the candidate label and the description data of the user, and recommended labels are then obtained separately according to each relevance parameter. Alternatively, the relevance scores may be combined by weighted fusion, using the weight of each score, into a single relevance parameter between the description data of the candidate label and the description data of the user, from which the recommended labels are then obtained. This embodiment does not specifically limit this.
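The weighted-fusion option can be sketched as a normalized weighted sum. The score names and weight values here are placeholders, not values given in the source.

```python
def fuse_scores(scores, weights):
    """Weighted average of the relevance scores that are present."""
    total_weight = sum(weights[name] for name in scores)
    if not total_weight:
        return 0.0
    return sum(weights[name] * value for name, value in scores.items()) / total_weight


# Hypothetical weights for the four kinds of relevance score.
weights = {"semantic": 3.0, "distribution": 1.0, "behavior": 2.0, "application": 1.0}
relevance_parameter = fuse_scores({"semantic": 0.8, "distribution": 0.4}, weights)
```

Only the scores actually supplied contribute, which matches the "at least one of" wording used for the relevance parameter.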
In another specific implementation, the relevance parameters may be sorted in descending order, and the candidate labels corresponding to the top P relevance parameters are then selected as recommended labels.
In another specific implementation, a relevance threshold may be preset, and the candidate labels whose relevance parameters are greater than or equal to the relevance threshold are then selected as recommended labels.
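Both selection rules are short enough to show side by side. In this sketch `relevance` is assumed to be a mapping from candidate label to its relevance parameter; the function names are illustrative.

```python
def top_p(relevance, p):
    """Labels with the P largest relevance parameters, best first."""
    ranked = sorted(relevance.items(), key=lambda item: item[1], reverse=True)
    return [label for label, _ in ranked[:p]]


def above_threshold(relevance, threshold):
    """Labels whose relevance parameter is at least the preset threshold."""
    return [label for label, score in relevance.items() if score >= threshold]


relevance = {"python": 0.9, "go": 0.4, "ml": 0.7}
by_rank = top_p(relevance, 2)
by_threshold = above_threshold(relevance, 0.7)
```

Top-P always returns a fixed number of labels, while the threshold rule may return none or many, so the choice depends on whether a stable list length matters for display.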
The technical solution provided by this embodiment can be abstracted into three layers: a metadata layer, a feature layer and a model layer. The metadata layer mainly includes the user behavior data in the application currently used by the user and in applications other than the currently used one, for example, in a Q&A platform scenario, the questions posted by the user, the answers, and the data generated by the user's answering on the platform. The feature layer, on the basis of the user behavior data accumulated in the currently used application, completes the description of the candidate labels to be recommended and of the user within the application, and, on the basis of the user behavior data accumulated in the other applications, completes the description of the user in the other applications; these two kinds of description constitute the intermediate data. The model layer can be divided into several recommendation subsystems, each of which generates relevance parameters according to its own recommendation strategy; finally, the recommended labels are determined according to the relevance parameters produced by the subsystems.
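One way to picture the model layer is as a list of scoring functions, one per recommendation subsystem, whose picks are merged. The merge rule used here (each subsystem contributes its best candidates, duplicates dropped) is only one possible reading, and all names are hypothetical.

```python
def recommend_from_subsystems(candidates, user, subsystems, per_subsystem=1):
    """Let each subsystem rank the candidates and merge their top picks."""
    recommended = []
    for score in subsystems:
        ranked = sorted(candidates, key=lambda label: score(label, user),
                        reverse=True)
        for label in ranked[:per_subsystem]:
            if label not in recommended:
                recommended.append(label)
    return recommended


# Two toy subsystems with different strategies.
def semantic(label, user):
    return 1.0 if label in user["interest_words"] else 0.0


def popularity(label, user):
    return {"go": 0.9, "ml": 0.2, "python": 0.1}[label]


user = {"interest_words": {"python"}}
labels = recommend_from_subsystems(["ml", "python", "go"], user,
                                   [semantic, popularity])
```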
In this embodiment, at least one candidate label is obtained as a recommended label according to the description data of the candidate labels and the description data of the user, the latter being obtained from the behavior data generated by the user in the determined application and the behavior data generated by the user in applications other than that application, so that the obtained recommended labels can be displayed to the user. Because the recommended labels are obtained on the basis of the behavior data generated by the user across the whole network, they are likely to be labels the user is interested in, which guides the user to perform related operations, for example label customization, according to these recommended labels. In this way, the operating efficiency of labels can be effectively improved.
In addition, with the technical solution provided by the present invention, because the description data of a candidate label includes at least one of the semantic description data of the candidate label, the distribution description data of the candidate label and the behavior description data of the candidate label, the candidate label can be described from multiple dimensions, and the reliability of label recommendation can therefore be effectively improved.
In addition, with the technical solution provided by the present invention, because the application characteristic data of the candidate label is added to the description data of the candidate label, the candidate label can be described from the application dimension without being described separately for each application, and the efficiency of label recommendation can therefore be effectively improved.
In addition, with the technical solution provided by the present invention, because the behavior data generated by the user in applications other than the determined application is taken into account, the user can be described from the dimension of those other applications. Label recommendation can therefore be achieved when the application is in a cold-start state, and the coverage of label recommendation can also be effectively improved.
It should be noted that, for simplicity of description, each of the foregoing method embodiments is expressed as a series of combined actions, but those skilled in the art should understand that the present invention is not limited by the described order of actions, because according to the present invention certain steps may be performed in other orders or simultaneously. Furthermore, those skilled in the art should also understand that the embodiments described in this specification are preferred embodiments, and that the actions and modules involved are not necessarily required by the present invention.
In the above embodiments, the description of each embodiment has its own emphasis; for parts not described in detail in one embodiment, refer to the related descriptions of other embodiments.
Fig. 2 is a schematic structural diagram of a label recommendation apparatus provided by another embodiment of the present invention. As shown in Fig. 2, the label recommendation apparatus of this embodiment may include a determining unit 21, an obtaining unit 22 and a display unit 23. The determining unit 21 is configured to determine the application currently used by the user. The obtaining unit 22 is configured to obtain at least one candidate label as a recommended label according to the description data of the candidate labels and the description data of the user; the description data of the user is obtained from the historical behavior data of the user, and the historical behavior data of the user includes the behavior data generated by the user in the application and the behavior data generated by the user in applications other than the application. The display unit 23 is configured to display the recommended label.
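The division into three units can be mirrored in code as three pluggable callables. This is only a structural sketch: the class name, the callable signatures, and the stand-in lambdas are assumptions, and a real obtaining unit would compute relevance parameters as described for the method embodiment.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class LabelRecommendationDevice:
    determine_application: Callable[[str], str]     # determining unit 21
    obtain_labels: Callable[[str, str], List[str]]  # obtaining unit 22
    display: Callable[[List[str]], None]            # display unit 23

    def recommend(self, user_id: str) -> List[str]:
        app = self.determine_application(user_id)
        labels = self.obtain_labels(user_id, app)
        self.display(labels)
        return labels


shown: List[str] = []
device = LabelRecommendationDevice(
    determine_application=lambda user_id: "qa_app",
    obtain_labels=lambda user_id, app: ["python", "ml"] if app == "qa_app" else [],
    display=shown.extend,  # stand-in for rendering the labels to the user
)
result = device.recommend("user-1")
```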
It should be noted that part or all of the label recommendation apparatus provided by this embodiment may be an application located on a local terminal, or a functional unit such as a plug-in or a software development kit (Software Development Kit, SDK) in an application located on a local terminal, or a processing engine in a server on the network side, or a distributed system on the network side; this embodiment does not specifically limit this.
It is understood that the application may be a native program (nativeApp) installed on the terminal, or a web program (webApp) of a browser on the terminal; this embodiment does not specifically limit this.
Optionally, in a possible implementation of this embodiment, the description data of the candidate label used by the obtaining unit 22 may include, but is not limited to, at least one of the semantic description data of the candidate label, the distribution description data of the candidate label and the behavior description data of the candidate label; this embodiment does not specifically limit this.
In a specific implementation, the semantic description data of the candidate label may include, but is not limited to, at least one of the following:
The label name of the candidate label;
Keywords under the candidate label;
Label names of extended labels semantically related to the candidate label; and
Extended keywords semantically related to the keywords under the candidate label.
Specifically, as shown in Fig. 3, the label recommendation apparatus provided by this embodiment may further include a mining unit 31, configured to obtain a text collection according to a candidate label, and to obtain the keywords under the candidate label according to the text collection.
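The two steps of the mining unit can be sketched with plain term counting. How the text collection is gathered and how keywords are ranked is not specified here, so this example assumes the collection is every text that mentions the label and that keywords are the most frequent remaining words; the names and the whitespace tokenization are illustrative.

```python
from collections import Counter


def mine_keywords(label, corpus, top_n=3, stopwords=frozenset()):
    """Collect texts mentioning the label, then return their most frequent words."""
    label = label.lower()
    # Step 1: obtain the text collection for the candidate label.
    texts = [text.lower() for text in corpus if label in text.lower()]
    # Step 2: obtain keywords under the label from that collection.
    counts = Counter(
        word
        for text in texts
        for word in text.split()
        if word != label and word not in stopwords
    )
    return [word for word, _ in counts.most_common(top_n)]


corpus = [
    "python pandas tutorial",
    "python pandas dataframe",
    "golang channels",
]
keywords = mine_keywords("python", corpus, top_n=1)
```

For Chinese text the whitespace split would need to be replaced by a word segmenter, in line with the word segmentation mentioned for the topic model.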
In a specific implementation, the distribution description data of the candidate label may include, but is not limited to, at least one of the following:
The category distribution of the candidate label; and
The theme distribution of the candidate label, the theme distribution including distribution information of M specific subjects, where M is an integer greater than or equal to 1.
In a specific implementation, the behavior description data of the candidate label may include, but is not limited to, at least one of the following:
Related labels that are related to the candidate label by behavior.
Optionally, in a possible implementation of this embodiment, the description data of the candidate label used by the obtaining unit 22 may further include, but is not limited to, application characteristic data of the candidate label; this embodiment does not specifically limit this.
In a specific implementation, the application characteristic data of the candidate label may include, but is not limited to, at least one of the following:
The popularity of the candidate label in the application; and
The time distribution of the candidate label in the application.
It should be noted that the method in the embodiment corresponding to Fig. 1 may be implemented by the label recommendation apparatus provided by this embodiment. For a detailed description, refer to the related content in the embodiment corresponding to Fig. 1; details are not repeated here.
In this embodiment, the obtaining unit obtains at least one candidate label as a recommended label according to the description data of the candidate labels and the description data of the user, the latter being obtained from the behavior data generated by the user in the application determined by the determining unit and the behavior data generated by the user in applications other than that application, so that the display unit can display the obtained recommended labels to the user. Because the recommended labels are obtained on the basis of the behavior data generated by the user across the whole network, they are likely to be labels the user is interested in, which guides the user to perform related operations, for example label customization, according to these recommended labels. In this way, the operating efficiency of labels can be effectively improved.
In addition, with the technical solution provided by the present invention, because the description data of a candidate label includes at least one of the semantic description data of the candidate label, the distribution description data of the candidate label and the behavior description data of the candidate label, the candidate label can be described from multiple dimensions, and the reliability of label recommendation can therefore be effectively improved.
In addition, with the technical solution provided by the present invention, because the application characteristic data of the candidate label is added to the description data of the candidate label, the candidate label can be described from the application dimension without being described separately for each application, and the efficiency of label recommendation can therefore be effectively improved.
In addition, with the technical solution provided by the present invention, because the behavior data generated by the user in applications other than the determined application is taken into account, the user can be described from the dimension of those other applications. Label recommendation can therefore be achieved when the application is in a cold-start state, and the coverage of label recommendation can also be effectively improved.
Those skilled in the art can clearly understand that, for convenience and brevity of description, for the specific working processes of the system, apparatus and units described above, reference may be made to the corresponding processes in the foregoing method embodiments; details are not repeated here.
In the several embodiments provided by the present invention, it should be understood that the disclosed system, apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. For example, the division into units is only a division by logical function, and there may be other ways of division in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, apparatuses or units, and may be electrical, mechanical or in other forms.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.
The integrated unit implemented in the form of a software functional unit may be stored in a computer-readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device or the like) or a processor to execute some of the steps of the methods described in the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), a magnetic disk or an optical disc.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions described in the foregoing embodiments, or make equivalent replacements of some of the technical features; such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (14)

1. A label recommendation method, characterized by comprising:
determining the application currently used by a user;
obtaining at least one candidate label as a recommended label according to description data of candidate labels and description data of the user, wherein the description data of the user is obtained from historical behavior data of the user, the historical behavior data of the user comprises behavior data generated by the user in the application and behavior data generated by the user in applications other than the application, and the description data of the candidate label comprises at least one of semantic description data of the candidate label, distribution description data of the candidate label and behavior description data of the candidate label; and
displaying the recommended label.
2. The method according to claim 1, characterized in that the semantic description data of the candidate label comprises at least one of the following:
the label name of the candidate label;
keywords under the candidate label;
label names of extended labels semantically related to the candidate label; and
extended keywords semantically related to the keywords under the candidate label.
3. The method according to claim 2, characterized in that, before the obtaining of at least one candidate label as a recommended label according to the description data of the candidate labels and the description data of the user, the method further comprises:
obtaining a text collection according to a candidate label; and
obtaining the keywords under the candidate label according to the text collection.
4. The method according to any one of claims 1 to 3, characterized in that the distribution description data of the candidate label comprises at least one of the following:
the category distribution of the candidate label; and
the theme distribution of the candidate label, the theme distribution comprising distribution information of M specific subjects, M being an integer greater than or equal to 1.
5. The method according to any one of claims 1 to 3, characterized in that the behavior description data of the candidate label comprises at least one of the following:
related labels that are related to the candidate label by behavior.
6. The method according to any one of claims 1 to 3, characterized in that the description data of the candidate label further comprises application characteristic data of the candidate label.
7. The method according to claim 6, characterized in that the application characteristic data of the candidate label comprises at least one of the following:
the popularity of the candidate label in the application; and
the time distribution of the candidate label in the application.
8. A label recommendation apparatus, characterized by comprising:
a determining unit, configured to determine the application currently used by a user;
an obtaining unit, configured to obtain at least one candidate label as a recommended label according to description data of candidate labels and description data of the user, wherein the description data of the user is obtained from historical behavior data of the user, the historical behavior data of the user comprises behavior data generated by the user in the application and behavior data generated by the user in applications other than the application, and the description data of the candidate label comprises at least one of semantic description data of the candidate label, distribution description data of the candidate label and behavior description data of the candidate label; and
a display unit, configured to display the recommended label.
9. The apparatus according to claim 8, characterized in that the semantic description data of the candidate label comprises at least one of the following:
the label name of the candidate label;
keywords under the candidate label;
label names of extended labels semantically related to the candidate label; and
extended keywords semantically related to the keywords under the candidate label.
10. The apparatus according to claim 9, characterized in that the apparatus further comprises a mining unit, configured to:
obtain a text collection according to a candidate label; and
obtain the keywords under the candidate label according to the text collection.
11. The apparatus according to any one of claims 8 to 10, characterized in that the distribution description data of the candidate label comprises at least one of the following:
the category distribution of the candidate label; and
the theme distribution of the candidate label, the theme distribution comprising distribution information of M specific subjects, M being an integer greater than or equal to 1.
12. The apparatus according to any one of claims 8 to 10, characterized in that the behavior description data of the candidate label comprises at least one of the following:
related labels that are related to the candidate label by behavior.
13. The apparatus according to any one of claims 8 to 10, characterized in that the description data of the candidate label further comprises application characteristic data of the candidate label.
14. The apparatus according to claim 13, characterized in that the application characteristic data of the candidate label comprises at least one of the following:
the popularity of the candidate label in the application; and
the time distribution of the candidate label in the application.
CN201510107973.XA 2015-03-12 2015-03-12 The recommendation method and device of label Active CN104750789B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510107973.XA CN104750789B (en) 2015-03-12 2015-03-12 The recommendation method and device of label


Publications (2)

Publication Number Publication Date
CN104750789A CN104750789A (en) 2015-07-01
CN104750789B true CN104750789B (en) 2018-10-16

Family

ID=53590473


Country Status (1)

Country Link
CN (1) CN104750789B (en)



Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102760163A (en) * 2012-06-12 2012-10-31 奇智软件(北京)有限公司 Personalized recommendation method and device of characteristic information
CN104216881A (en) * 2013-05-29 2014-12-17 腾讯科技(深圳)有限公司 Method and device for recommending individual labels
CN104123360A (en) * 2014-07-18 2014-10-29 腾讯科技(深圳)有限公司 Application recommendation data acquisition method, device and system and electronic device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
User tag recommendation in social networks; 陆子龙; China Master's Theses Full-text Database, Information Science and Technology; 20140315; I139-110, pp. 21-32 *

Also Published As

Publication number Publication date
CN104750789A (en) 2015-07-01

Similar Documents

Publication Publication Date Title
CN104750789B (en) Label recommendation method and device
Ozsoy From word embeddings to item recommendation
US11995409B2 (en) Content generation using target content derived modeling and unsupervised language modeling
Abdel-Hafez et al. A survey of user modelling in social media websites
US9910930B2 (en) Scalable user intent mining using a multimodal restricted Boltzmann machine
US11487946B2 (en) Content editing using content modeling and semantic relevancy scoring
Kanwal et al. A review of text-based recommendation systems
US20170024389A1 (en) Method and system for multimodal clue based personalized app function recommendation
Guo et al. An effective and economical architecture for semantic-based heterogeneous multimedia big data retrieval
CN103577549A (en) Crowd portrayal system and method based on microblog label
CN103136188A (en) Method and system used for sentiment estimation of web browsing user
CN107103049A (en) Recommendation method and network device
CN105518661A (en) Browsing images via mined hyperlinked text snippets
US10176260B2 (en) Measuring semantic incongruity within text data
CN103678304A (en) Method and device for pushing specific content for predetermined webpage
Jiang et al. Cloud service recommendation based on unstructured textual information
CN103955464A (en) Recommendation method based on situation fusion sensing
CN104751354A (en) Advertisement cluster screening method
CN110069713B (en) Personalized recommendation method based on user context perception
CN105389329A (en) Open source software recommendation method based on group comments
CN104077415A (en) Searching method and device
JP2014106661A (en) User state prediction device, method and program
CN104142990A (en) Search method and device
CN106462588B (en) Content creation from extracted content
JP2014146218A (en) Information providing device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant