CN105893397A - Video recommendation method and apparatus - Google Patents

Video recommendation method and apparatus Download PDF

Info

Publication number
CN105893397A
CN105893397A CN201510379649.3A CN201510379649A CN105893397A CN 105893397 A CN105893397 A CN 105893397A CN 201510379649 A CN201510379649 A CN 201510379649A CN 105893397 A CN105893397 A CN 105893397A
Authority
CN
China
Prior art keywords
participle
coefficient
search word
recommendation
historical search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510379649.3A
Other languages
Chinese (zh)
Other versions
CN105893397B (en
Inventor
孙国宇
周燕红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing IQIYI Science and Technology Co Ltd
Original Assignee
Beijing IQIYI Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing IQIYI Science and Technology Co Ltd filed Critical Beijing IQIYI Science and Technology Co Ltd
Priority to CN201510379649.3A priority Critical patent/CN105893397B/en
Publication of CN105893397A publication Critical patent/CN105893397A/en
Application granted granted Critical
Publication of CN105893397B publication Critical patent/CN105893397B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/735Filtering based on additional data, e.g. user or group profiles

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present invention disclose a video recommendation method and apparatus, which are applied to a server. The method comprises: acquiring a text string input by a user, and identifying at least one word segment included in the text string; based on text information of each identified word segment, determining a text coefficient corresponding to each word segment; separately determining whether each word segment is the same as a historical search word in a preset weighted word library, and if a determination result is yes, weighting the text coefficient corresponding to the word segment by using a weighting coefficient corresponding to the historical search word that is the same as the word segment, so as to obtain a recommendation coefficient corresponding to the word segment; if the determination result is no, weighting the text coefficient corresponding to the word segment by using a pre-determined weighting coefficient, so as to obtain a recommendation coefficient corresponding to the word segment; normalizing the recommendation coefficient of each word segment; and based on a standard recommendation coefficient that is obtained through normalization and corresponds to each word segment, recommending a video to the user in a preset manner. According to the method and apparatus disclosed by the embodiments of the present invention, video recommendation is more accurate, and user experience is better.

Description

A kind of video recommendation method and device
Technical field
The present invention relates to Video Applications system, particularly to a kind of video recommendation method and device.
Background technology
Video website is set up and is equipped with the function recommending video to user now, and this function can be supplied to user more The selection of many associated videos, further increases click volume and the attention rate of video website.
When prior art recommends video, it it is self text envelope of the text string utilizing user to input in the search box Breath is recommended, and concrete a kind of implementation is: first, it is thus achieved that the text string of user's input;Secondly, Word frequency-reverse document-frequency (TF-IDF) algorithm is utilized to determine the first weight of each participle in text string; Again, with reference to the part of speech of each participle in text string, to each participle on above-mentioned first weighted basis Weighting, obtains text coefficient;Finally, by text coefficient directly as the recommendation coefficient of video, and according to The size of this recommendation coefficient recommends video to user.In above-mentioned specific implementation, general acquiescence noun Weight coefficient is more than verb or adjectival weight coefficient, but when the most famous in the text string of user's input Word has again verb, and the expective right of verb is great when noun, such as: " cracking of password " this text string, Mainly " cracking " desired by user, the expective right that i.e. verb " cracks " is great in noun " password ", According to above-mentioned specific implementation, part of speech weighting is misfitted with user's expection, is caused the recommendation system finally obtained Number is inaccurate, and then the most inaccurate to the video of user's recommendation, and user's impression is the best.
Summary of the invention
Based on the problems referred to above, the embodiment of the invention discloses a kind of video recommendation method and device, so that pushing away The video recommended is more accurate.Technical scheme is as follows:
Embodiments provide a kind of video recommendation method, be applied to server, may include that
Obtain the text string of user's input, and identify at least one participle included in described text string;
Self text message based on each participle identified, determines the text coefficient that each participle is corresponding;
Judge that each participle is the most identical with the historical search word in default weighting dictionary respectively, if it is determined that Result is yes, utilizes weight coefficient historical search word corresponding to identical with described participle, to described participle Corresponding text coefficient weighting, obtains the recommendation coefficient that described participle is corresponding;If it is judged that be no, profit Use predetermined weight coefficient, described participle correspondence text coefficient is weighted, obtains the recommendation system that described participle is corresponding Number, wherein, the weight coefficient corresponding to described historical search word is more than described predetermined weight coefficient;
The recommendation coefficient of each participle is normalized, obtains the standard recommendation system that each participle is corresponding Number;
Based on the standard recommendation coefficient corresponding to each participle, recommend video according to predetermined manner to user.
Optionally, the forming process of the weighting dictionary preset, including:
Obtain multiple historical search word and volumes of searches corresponding to each historical search word;
Volumes of searches corresponding for each historical search word is normalized, to obtain each historical search word Corresponding weight coefficient, using the set of each historical search word and the weight coefficient of correspondence thereof as weighting dictionary.
Optionally, described volumes of searches corresponding for each historical search word is normalized, every to obtain The weight coefficient that individual historical search word is corresponding, including:
Use min-max standardized method, volumes of searches corresponding for each historical search word is mapped to 0.0~1.0 models In enclosing, obtain primary weight coefficient;
In the range of using sigmoid function that described primary weight coefficient is mapped to 0.5~1.0, obtain each history The weight coefficient that search word is corresponding.
Optionally, described predetermined weight coefficient is more than 0 and to be less than the designated value in the range of 0.5.
Optionally, the described recommendation coefficient to each participle is normalized, and obtains each participle corresponding Standard recommendation coefficient, including:
Using all participles recommend the value of quadratic sum evolution of coefficient as denominator, with the recommendation system of each participle Number, as molecule, calculates the standard recommendation coefficient that each participle is corresponding.
The embodiment of the present invention additionally provides a kind of video recommendations device, is applied to server, may include that knowledge Other unit, determine unit, weighted units, normalization unit and recommendation unit;Wherein,
Described recognition unit, for obtaining the text string of user's input, and identifies in described text string and is wrapped At least one participle included;
Described determine unit, self text message of each participle for identifying based on described recognition unit, Determine the text coefficient that each participle is corresponding;
Described weighted units, for judging whether each participle is searched with the history in default weighting dictionary respectively Rope word is identical, if it is judged that be yes, utilizes adding corresponding to the historical search word identical with described participle Weight coefficient, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding;As Really judged result is no, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains institute Stating the recommendation coefficient that participle is corresponding, wherein, the weight coefficient corresponding to described historical search word is more than described pre- Determine weight coefficient;
Described normalization unit, the recommendation coefficient of each participle for obtaining described weighted units is returned One change processes, and obtains the standard recommendation coefficient that each participle is corresponding;
Described recommendation unit, for based on the standard obtained by the described normalization unit corresponding to each participle Recommend coefficient, recommend video according to predetermined manner to user.
Optionally, also include: dictionary forms unit, and wherein, described dictionary forms unit, including: obtain Subelement and formation subelement,
Described acquisition subelement, for obtaining multiple historical search word and each corresponding searching of historical search word Suo Liang;
Described formation subelement, for volumes of searches corresponding for each historical search word is normalized, The weight coefficient corresponding to obtain each historical search word, by each historical search word and the weighting system of correspondence thereof The set of number is as weighting dictionary.
Optionally, described formation subelement, including: the first mapping block, the second mapping block;Wherein,
Described first mapping block, is used for using min-max standardized method, and each historical search word is corresponding Volumes of searches be mapped to 0.0~1.0 in the range of, obtain primary weight coefficient;
Described second mapping block, is used for using sigmoid function to be mapped to by described primary weight coefficient In the range of 0.5~1.0, obtain the weight coefficient that each historical search word is corresponding, by each historical search word and The set of corresponding weight coefficient is as weighting dictionary.
Optionally, described predetermined weight coefficient is more than 0 and to be less than the designated value in the range of 0.5.
Optionally, described normalization unit, specifically for: open with the quadratic sum recommending coefficient of all participles The value of side as denominator, using the recommendation coefficient of each participle as molecule, calculates each participle corresponding Standard recommendation coefficient.
In the embodiment of the present invention, server obtains the text string of user's input, and identifies in described text string At least one included participle;Self text message based on each participle identified, determines each point The text coefficient that word is corresponding;Judge respectively each participle whether with the historical search word in default weighting dictionary Identical, if it is judged that be yes, utilize weighting system historical search word corresponding to identical with described participle Number, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding;If sentenced Disconnected result is no, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains described point The recommendation coefficient that word is corresponding;The recommendation coefficient of each participle of normalization;Each participle obtained based on normalization Corresponding standard recommendation coefficient, recommends video according to predetermined manner to user.Compared with prior art, originally On the text coefficient basis of the part participle that text string that inventive embodiments inputs user is corresponding, according to weighting Weight coefficient in storehouse, weights text coefficient, and another part participle weights according to pre-determined factor, Jin Ergen Carry out recommending video according to the recommendation coefficient after weighting.Because the weight coefficient in weighting storehouse is according to numerous users The factor such as historical search amount determine, so, the text coefficient of participle is directly determined by hinge structure For recommending coefficient, the recommendation coefficient after weighting is truer, it is possible to reflect the temperature of participle more accurately.Profit The video recommended with such recommendation coefficient is the most accurate, more conforms to the wish of user, and user experiences More preferably.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to enforcement In example or description of the prior art, the required accompanying drawing used is briefly described, it should be apparent that, describe below In accompanying drawing be only some embodiments of the present invention, for those of ordinary skill in the art, do not paying On the premise of going out creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
The flow chart of a kind of video recommendation method that Fig. 1 is provided by the embodiment of the present invention;
Fig. 2 is the substantially mapping scattergram of primary weight coefficient;
Fig. 3 is the substantially mapping scattergram of weight coefficient;
A kind of structural representation of a kind of video recommendations device that Fig. 4 is provided by the embodiment of the present invention;
The another kind of structural representation of a kind of video recommendations device that Fig. 5 is provided by the embodiment of the present invention;
The another kind of structural representation of a kind of video recommendations device that Fig. 6 is provided by the embodiment of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clearly Chu, be fully described by, it is clear that described embodiment be only a part of embodiment of the present invention rather than Whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art are not making creation The every other embodiment obtained under property work premise, broadly falls into the scope of protection of the invention.
Embodiments provide a kind of video recommendation method and device, be applied to server, described method Comprise the following steps:
Obtain the text string of user's input, and identify at least one participle included in described text string;
Self text message based on each participle identified, determines the text coefficient that each participle is corresponding; Judge that each participle is the most identical with the historical search word in default weighting dictionary respectively, if it is judged that It is yes, utilizes weight coefficient historical search word corresponding to identical with described participle, corresponding to described participle Text coefficient weighting, obtain the recommendation coefficient that described participle is corresponding;If it is judged that be no, utilize pre- Determine weight coefficient, described participle correspondence text coefficient weighted, obtains the recommendation coefficient that described participle is corresponding, Wherein, the weight coefficient corresponding to described historical search word is more than described predetermined weight coefficient;
The recommendation coefficient of each participle is normalized, obtains the standard recommendation system that each participle is corresponding Number;
Based on the standard recommendation coefficient corresponding to each participle, recommend video according to predetermined manner to user.
It should be noted that a kind of video recommendation method that the embodiment of the present invention is provided is applied to server, User is typically by the search box input text string of the software such as search engine, video website, and above-mentioned server is i.e. For the server that the software such as search engine, website is corresponding, text string can be title or the mark of movie and television play Sign, it is also possible to be the description to movie and television play.
Compared with prior art, the literary composition of the part participle that the embodiment of the present invention inputs user text string is corresponding On this coefficient basis, according to the weight coefficient in weighting storehouse, weighting text coefficient, another part participle is pressed Weight according to pre-determined factor, and then carry out recommending video according to the recommendation coefficient after weighting, because in weighting storehouse Weight coefficient is that the factors such as the historical search amount according to numerous users determine, so, hinge structure is straight Connecing and be defined as the text coefficient of participle recommending coefficient, the recommendation coefficient after weighting is truer, it is possible to more accurate The temperature of true reflection participle, the video utilizing such recommendation coefficient to recommend is the most accurate, more accords with Share the wish at family, user experiences more preferably.
A kind of flow chart of a kind of video recommendation method that Fig. 1 is provided by the embodiment of the present invention, the method should For server, may comprise steps of:
S101, it is thus achieved that the text string of user's input, and identify in described text string included at least one Participle;
It should be noted that the text string of above-mentioned user's input can be individual character, word, it is also possible to be to retouch Stating one section of word of character, text string can be divided at least one participle, wherein, this at least one participle Part of speech can be noun, verb or adjective.
Concrete, when user is by the search box input text string of the software with function of search, should be with The text string of the software receipt user input of function of search, and text string is sent to server, then press According to corresponding segmenting method, text string is divided into some words.Conventional a kind of segmenting method is: should Text string goes to compare with the participle in locally stored dictionary, obtains several participles that text string is corresponding; It is of course also possible to utilize other segmenting methods known in those skilled in the art, the embodiment of the present invention is not This is defined.If simply an individual character or single word in text string, server identify only It it is a participle;If text string is one section of description, what server identified is then several participles.
S102, self text message based on each participle identified, determine the text that each participle is corresponding Coefficient;
It should be noted that self text message above-mentioned refers to the expressing information of the various dimensions for participle, This dimension of participle part of speech such as can be utilized to express this participle, it is also possible to utilize participle word to anticipate this dimension table Reach this participle, it is, of course, also possible to utilize this participle of feature representation of other dimension.Self text envelope of participle Breath is only relevant with participle itself, unrelated with the linguistic context residing for participle, source etc..
Concrete, in S101 step, after identifying at least one participle included by text string, each participle Self text message can apparent, utilize self text message of each participle, it may be determined that each participle Corresponding text coefficient.Text coefficient can be that self text message based on participle is precalculated, Directly invoke when needed, it is also possible to calculate temporarily.Determine that the mode of text coefficient is a lot, letter here Single one of introducing: utilize word frequency-reverse document-frequency (TF-IDF) algorithm and the method for part of speech weighting, determine The text coefficient of each participle.
Wherein, word frequency-reverse document-frequency (TF-IDF) algorithm is a kind of statistical method, in order to assess one Words is for the significance level of a copy of it file in a file set or a corpus.Words important Property is directly proportional increase along with the number of times that it occurs hereof, but can occur in corpus along with it simultaneously Frequency is inversely proportional to decline.Part of speech is the character of participle, such as: the participle of noun character, verb character Participle etc..
Describe for convenience, here clearly to determine the process of its text coefficient, this point for any one participle Word is relevant with a video content participle.Utilize word frequency-reverse document-frequency (TF-IDF) algorithm and word Property weighting method, determine the concretely comprising the following steps of text coefficient of any one participle: for the literary composition identified Any one participle in this string, adds up its occurrence number in a default videotext, then divided by All words sum of this videotext, obtains this participle probability in this videotext, is TF;System Count and concentrate at default videotext, the videotext quantity of this participle occurs, this videotext concentrates Business, divided by there is the videotext quantity of this participle, is then taken the logarithm by all videotexts, the numerical value obtained For IDF;Last TF is multiplied by IDF and is the TF-IDF coefficient of this participle;According to default part of speech weight coefficient pair The TF-IDF coefficient weighting obtained, the result obtained is the text coefficient of this participle.The text system of other participles Number can also be obtained by said method, repeats no more here.
Certainly, the method for above-mentioned word frequency-reverse document-frequency (TF-IDF) algorithm and part of speech weighting only determines that A kind of common method of participle text coefficient, those skilled in the art can also utilize method known in other to determine The text coefficient of participle, this is not defined by the embodiment of the present invention.
S103, judges that each participle is the most identical with the historical search word in default weighting dictionary respectively, as Really judged result is yes, performs S104 step;If it is judged that be no, perform S105 step;
S104, utilizes weight coefficient historical search word corresponding to identical with described participle, to described participle Corresponding text coefficient weighting, obtains the recommendation coefficient that described participle is corresponding;
S105, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains described participle Corresponding recommendation coefficient;
Wherein, the weight coefficient corresponding to described historical search word is more than described predetermined weight coefficient;
It should be noted that above-mentioned weighting dictionary at least includes herein below: historical search word and its correspondence Weight coefficient, this historical search word is user's search word in historical time section, according to weight coefficient Coefficient after the volumes of searches normalization of historical search word.This weighting dictionary can real-time update, it is provided that to user Numerical value accurately, and most of word that user searched for appears in this weighting dictionary, certainly, big portion The weight coefficient of participle language is different.
Concrete, each participle that S101 step identifies compares with the historical search word in this weighting storehouse, Judge whether this weighting storehouse exists the historical search word identical with the participle identified, it is judged that after terminating, right In there is the participle of identical historical search word, according to weight coefficient corresponding to identical historical search word to by The text coefficient weighting that S102 step determines, obtains the recommendation coefficient of such participle;Identical for not existing The participle of historical search word, according to the text coefficient weighting to being determined by S102 step of the predetermined weight coefficient, Recommendation coefficient to such participle.Generally, the historical search word in weighting dictionary is that user searches for The word that frequency ratio is higher, so weight coefficient corresponding to historical search word is more than predetermined weight coefficient.
Below the forming process in weighted words storehouse is described in detail.The forming process of the weighting dictionary preset, May comprise steps of:
Obtain multiple historical search word and volumes of searches corresponding to each historical search word;
Volumes of searches corresponding for each historical search word is normalized, to obtain each historical search word Corresponding weight coefficient, using the set of each historical search word and the weight coefficient of correspondence thereof as weighting dictionary.
It should be noted that the historical search word of user is stored in specific file, corresponding historical search word Corresponding volumes of searches can be added up and be stored in this specific file, certainly, and the search that historical search word is corresponding Amount can not also store, but calculated when needed temporarily.Server can directly obtain from this specific file Take this historical search word, obtain the volumes of searches of correspondence.For Baidu user searches for record, historical search Word can periodically write in the tables of data of Long Yuan, and server can obtain history in real time from the tables of data of this Long Yuan and search Rope word, obtains the volumes of searches of correspondence, it is also possible to periodically obtained from the tables of data of this Long Yuan by timed task Historical search word, obtains the volumes of searches of correspondence.
Concrete, obtain multiple historical search word and volumes of searches corresponding to each historical search word, the most right The volumes of searches that each historical search word is corresponding is normalized, and obtains each corresponding adding of historical search word Weight coefficient, the weight coefficient of these all historical search words obtained and correspondence collectively forms weighting dictionary.
In reality, historical search word is probably the word in different field, so the volumes of searches of correspondence just may be used Can have different dimensions, so be unfavorable for comparing and analyzing, so often by search corresponding for historical search word Amount is normalized, and normalized method is numerous, and conventional has two kinds: min-max Standardization Act and Z-score Standardized method, because both approaches is technology known in those skilled in the art, thus the most right Min-max Standardization Act is simply introduced down, and Z-score standardized method is no longer introduced, and can refer to correlation technique literary composition Shelves are known.
Utilize min-max Standardization Act, normalized is divided into data prediction and sigmoid function normalization Two steps are carried out.Then at the second step of the default forming process weighting dictionary: described by each history The volumes of searches that search word is corresponding is normalized, the weight coefficient corresponding to obtain each historical search word, May comprise steps of:
A uses min-max standardized method, and volumes of searches corresponding for each historical search word is mapped to 0.0~1.0 In the range of, obtain primary weight coefficient;
In the range of b uses sigmoid function that described primary weight coefficient is mapped to 0.5~1.0, obtain each going through The weight coefficient that history search word is corresponding.
Concrete, in a step, use formula (1) that the volumes of searches that historical search word is corresponding is calculated, In the range of volumes of searches is mapped to 0.0~1.0, obtain primary weight coefficient;
m = log ( 1 + x ) - log ( 1 + m i n ) log ( 1 + m a x ) - log ( 1 + m i n ) - - - ( 1 )
Wherein x is the volumes of searches that historical search word is corresponding, and m is the primary weight coefficient obtained.
In b step, the primary weight coefficient using formula (2) to obtain a step calculates, by primary In the range of weight coefficient is mapped to 0.5~1.0, obtain the weight coefficient that each historical search word is corresponding.
n = 1.0 1.0 + e - 5 m - - - ( 2 )
Wherein m is primary weight coefficient, and n is the weight coefficient finally obtained.
Illustrate, in the specific file of server, there is four pairs of historical search words and corresponding volumes of searches, point The most as follows:
Wherein, left column is historical search word, and the right side is classified as the volumes of searches of correspondence.
The substantially mapping scattergram of the primary weight coefficient obtained after a step is as in figure 2 it is shown, wherein, horizontal Axle represents volumes of searches, and the longitudinal axis is the primary weight coefficient obtained after min-max standardization.Through b step After the substantially mapping scattergram of weight coefficient that obtains as it is shown on figure 3, wherein, transverse axis represents primary weight coefficient, The longitudinal axis is the weight coefficient obtained after sigmoid function normalization.
Through above-mentioned steps, the weighting dictionary of formation includes herein below:
Wherein, left column is historical search word, and the right side is classified as the weight coefficient of correspondence.Thus by historical search word In the range of corresponding volumes of searches is normalized to 0.5~1.0, thus obtain weighting number of times.
Generally, the historical search word in weighting dictionary is the word that user's search rate is higher, So weight coefficient corresponding to historical search word is more than predetermined weight coefficient.Above-mentioned weight coefficient is at 0.5~1.0 models In enclosing, therefore described predetermined weight coefficient can be set as the designated value in the range of more than 0 and less than 0.5.Assume Predetermined weight coefficient is 0.4, then be then to the participle identical with historical search word, according to right in weighting dictionary Answer weight coefficient weighting, this weight coefficient is all the numerical value between 0.5~1.0, and other search less than with The participle that historical search word is identical, then according to 0.4 weighting.
S106, is normalized the recommendation coefficient of each participle, obtains the standard that each participle is corresponding Recommend coefficient;
Concrete, S103 step obtains the recommendation coefficient that participle is corresponding, and this recommendation coefficient is at text coefficient base The numerical value after the weight coefficient of correspondence it is multiplied by plinth.The text coefficient obtained based on participle self text message is not There is uniformity, accordingly, it is recommended that coefficient does not the most have uniformity, it is impossible to directly compare.Therefore this They are normalized by Shi Changchang, so that recommending coefficient to be at the same order of magnitude, the most more having can Ratio property, the standard recommendation coefficient that after normalization, available each participle is corresponding.
Below a kind of method for normalizing is sketched, then, in this step, the recommendation coefficient of each participle is entered Row normalized, obtains the standard recommendation coefficient that each participle is corresponding, may include that
Using all participles recommend the value of quadratic sum evolution of coefficient as denominator, with the recommendation system of each participle Number, as molecule, calculates the standard recommendation coefficient that each participle is corresponding.
Concrete, using the value of the quadratic sum evolution recommending coefficient of all participles as denominator, with each participle Recommendation coefficient as molecule, the standard recommendation coefficient that each participle is corresponding can be drawn.Illustrate, literary composition The recommendation coefficient of all participles that this string is corresponding is successively: 0.5,1.2, then according to after this method for normalizing To standard recommendation coefficient be then followed successively by:Approximate the most successively: 0.4, 0.9。
S107, based on the standard recommendation coefficient corresponding to each participle, recommends to regard to user according to predetermined manner Frequently.
Concrete, after S106 step obtains standard recommendation coefficient, recommend video according to predetermined manner to user. This predetermined manner is the method chosen standard recommendation coefficient according to predetermined rule, it is recommended that during video, permissible Choose participle corresponding to maximum standard recommendation coefficient to go to recommend video, it is also possible to the top being sized The participle that standard recommendation coefficient is corresponding together decides on the video of recommendation.Such as: the mark of the participle that text string is corresponding Quasi-recommendation coefficient is followed successively by: 0.51,0.49,0.54,0.88,0,78, if predetermined manner is to choose maximum Standard recommendation coefficient recommends video, then in this example, be then to recommend video according to the participle of 0.88 correspondence.
In the embodiment of the present invention, server obtains the text string of user's input, and identifies in described text string At least one included participle;Self text message based on each participle identified, determines each point The text coefficient that word is corresponding;Judge respectively each participle whether with the historical search word in default weighting dictionary Identical, if it is judged that be yes, utilize weighting system historical search word corresponding to identical with described participle Number, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding;If sentenced Disconnected result is no, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains described point The recommendation coefficient that word is corresponding;The recommendation coefficient of each participle of normalization;Each participle obtained based on normalization Corresponding standard recommendation coefficient, recommends video according to predetermined manner to user.Compared with prior art, originally On the text coefficient basis of the part participle that text string that inventive embodiments inputs user is corresponding, according to weighting Weight coefficient in storehouse, weights text coefficient, and another part participle weights according to pre-determined factor, Jin Ergen Carry out recommending video according to the recommendation coefficient after weighting.Because the weight coefficient in weighting storehouse is according to numerous users The factor such as historical search amount determine, so, the text coefficient of participle is directly determined by hinge structure For recommending coefficient, the recommendation coefficient after weighting is truer, it is possible to reflect the temperature of participle more accurately.Profit The video recommended with such recommendation coefficient is the most accurate, more conforms to the wish of user, and user experiences More preferably.
Corresponding to said method embodiment, the embodiment of the present invention additionally provides a kind of video recommendations device, such as Fig. 4 Shown in, this device may include that recognition unit 210, determines unit 220, weighted units 230, normalization list Unit 220 and recommendation unit 250;Wherein,
Described recognition unit 210, for obtaining the text string of user's input, and identifies institute in described text string Including at least one participle;
Described determine unit 220, self literary composition of each participle for identifying based on described recognition unit 210 This information, determines the text coefficient that each participle is corresponding;
Described weighted units 230, for judge respectively each participle whether with the history in default weighting dictionary Search word is identical, if it is judged that be yes, utilizes corresponding to the historical search word identical with described participle Weight coefficient, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding; If it is judged that be no, utilize predetermined weight coefficient, described participle correspondence text coefficient is weighted, obtains The recommendation coefficient that described participle is corresponding, wherein, the weight coefficient corresponding to described historical search word is more than described Predetermined weight coefficient;
Described normalization unit 240, for the recommendation coefficient to each participle that described weighted units 230 obtains It is normalized, obtains the standard recommendation coefficient that each participle is corresponding;
Described recommendation unit 250, obtained by based on the described normalization unit 240 corresponding to each participle Standard recommendation coefficient, recommend video according to predetermined manner to user.
In the embodiment of the present invention, server obtains the text string of user's input, and identifies in described text string At least one included participle;Self text message based on each participle identified, determines each point The text coefficient that word is corresponding;Judge respectively each participle whether with the historical search word in default weighting dictionary Identical, if it is judged that be yes, utilize weighting system historical search word corresponding to identical with described participle Number, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding;If sentenced Disconnected result is no, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains described point The recommendation coefficient that word is corresponding;The recommendation coefficient of each participle of normalization;Each participle obtained based on normalization Corresponding standard recommendation coefficient, recommends video according to predetermined manner to user.Compared with prior art, originally On the text coefficient basis of the part participle that text string that inventive embodiments inputs user is corresponding, according to weighting Weight coefficient in storehouse, weights text coefficient, and another part participle weights according to pre-determined factor, Jin Ergen Carry out recommending video according to the recommendation coefficient after weighting.Because the weight coefficient in weighting storehouse is according to numerous users The factor such as historical search amount determine, so, the text coefficient of participle is directly determined by hinge structure For recommending coefficient, the recommendation coefficient after weighting is truer, it is possible to reflect the temperature of participle more accurately.Profit The video recommended with such recommendation coefficient is the most accurate, more conforms to the wish of user, and user experiences More preferably.
As it is shown in figure 5, the another kind of structural representation of a kind of video recommendations device provided by the embodiment of the present invention Figure, compared with Fig. 4 shown device, Fig. 5 shown device also includes: dictionary forms unit, wherein, institute's predicate Storehouse forms unit, including: obtain subelement 310a and form subelement 310b,
Described acquisition subelement 310a, is used for obtaining multiple historical search word and each historical search word is corresponding Volumes of searches;
Described formation subelement 310b, for being normalized place by volumes of searches corresponding for each historical search word Reason, the weight coefficient corresponding to obtain each historical search word, by adding of each historical search word and correspondence thereof The set of weight coefficient is as weighting dictionary.
As shown in Figure 6, the another kind of structural representation of a kind of video recommendations device provided by the embodiment of the present invention Figure, in Fig. 6 shown device, the formation subelement 310b in Fig. 5 shown device, may include that first reflects Penetrate module 310b1, the second mapping block 310b2;Wherein,
Described first mapping block 310b1, is used for using min-max standardized method, by each historical search In the range of the volumes of searches that word is corresponding is mapped to 0.0~1.0, obtain primary weight coefficient;
Described second mapping block 310b2, is used for using sigmoid function to be mapped to by described primary weight coefficient In the range of 0.3~1.0, obtain the weight coefficient that each historical search word is corresponding, by each historical search word and The set of corresponding weight coefficient is as weighting dictionary.
On the basis of embodiment illustrated in fig. 6, described predetermined weight coefficient is more than 0 and to be less than in the range of 0.5 Designated value.
On the basis of embodiment illustrated in fig. 4, described normalization unit 240, specifically for: with all participles The value of the quadratic sum evolution of recommendation coefficient is as denominator, using the recommendation coefficient of each participle as molecule, through meter Calculate and obtain the standard recommendation coefficient that each participle is corresponding.
For system or device embodiment, owing to it is substantially similar to embodiment of the method, so describe Fairly simple, relevant part sees the part of embodiment of the method and illustrates.
It should be noted that in this article, the relational terms of such as first and second or the like be used merely to by One entity or operation separate with another entity or operating space, and not necessarily require or imply these Relation or the order of any this reality is there is between entity or operation.And, term " includes ", " comprising " Or its any other variant is intended to comprising of nonexcludability, so that include the mistake of a series of key element Journey, method, article or equipment not only include those key elements, but also other including being not expressly set out Key element, or also include the key element intrinsic for this process, method, article or equipment.Do not having In the case of more restrictions, statement " including ... " key element limited, it is not excluded that including described wanting Process, method, article or the equipment of element there is also other identical element.
One of ordinary skill in the art will appreciate that all or part of step realizing in said method embodiment The program that can be by completes to instruct relevant hardware, and described program can be stored in computer-readable Take in storage medium, the storage medium obtained designated herein, such as: ROM/RAM, magnetic disc, CD etc..
The foregoing is only presently preferred embodiments of the present invention, be not intended to limit protection scope of the present invention. All any modification, equivalent substitution and improvement etc. made within the spirit and principles in the present invention, are all contained in In protection scope of the present invention.

Claims (10)

1. a video recommendation method, it is characterised in that be applied to server, including:
Obtain the text string of user's input, and identify at least one participle included in described text string;
Self text message based on each participle identified, determines the text coefficient that each participle is corresponding;
Judge that each participle is the most identical with the historical search word in default weighting dictionary respectively, if it is determined that Result is yes, utilizes weight coefficient historical search word corresponding to identical with described participle, to described participle Corresponding text coefficient weighting, obtains the recommendation coefficient that described participle is corresponding;If it is judged that be no, profit Use predetermined weight coefficient, described participle correspondence text coefficient is weighted, obtains the recommendation system that described participle is corresponding Number, wherein, the weight coefficient corresponding to described historical search word is more than described predetermined weight coefficient;
The recommendation coefficient of each participle is normalized, obtains the standard recommendation system that each participle is corresponding Number;
Based on the standard recommendation coefficient corresponding to each participle, recommend video according to predetermined manner to user.
Method the most according to claim 1, it is characterised in that the forming process of default weighting dictionary, Including:
Obtain multiple historical search word and volumes of searches corresponding to each historical search word;
Volumes of searches corresponding for each historical search word is normalized, to obtain each historical search word Corresponding weight coefficient, searches the set of weight coefficient of element word and correspondence thereof as weighting dictionary using each history.
Method the most according to claim 2, it is characterised in that described that each historical search word is corresponding Volumes of searches be normalized, the weight coefficient corresponding to obtain each historical search word, including:
Use min-max standardized method, volumes of searches corresponding for each historical search word is mapped to 0.0~1.0 models In enclosing, obtain primary weight coefficient;
In the range of using sigmoid function that described primary weight coefficient is mapped to 0.5~1.0, obtain each history The weight coefficient that search word is corresponding.
Method the most according to claim 3, it is characterised in that described predetermined weight coefficient be more than 0 and Designated value in the range of less than 0.5.
Method the most according to claim 1, it is characterised in that the described recommendation coefficient to each participle It is normalized, obtains the standard recommendation coefficient that each participle is corresponding, including:
Using all participles recommend the value of quadratic sum evolution of coefficient as denominator, with the recommendation system of each participle Number, as molecule, calculates the standard recommendation coefficient that each participle is corresponding.
6. a video recommendations device, it is characterised in that be applied to server, including: recognition unit, really Cell, weighted units, normalization unit and recommendation unit;Wherein,
Described recognition unit, for obtaining the text string of user's input, and identifies in described text string and is wrapped At least one participle included;
Described determine unit, self text message of each participle for identifying based on described recognition unit, Determine the text coefficient that each participle is corresponding;
Described weighted units, for judging whether each participle is searched with the history in default weighting dictionary respectively Rope word is identical, if it is judged that be yes, utilizes adding corresponding to the historical search word identical with described participle Weight coefficient, the text coefficient weighting corresponding to described participle, obtain the recommendation coefficient that described participle is corresponding;As Really judged result is no, utilizes predetermined weight coefficient, weights described participle correspondence text coefficient, obtains institute Stating the recommendation coefficient that participle is corresponding, wherein, the weight coefficient corresponding to described historical search word is more than described pre- Determine weight coefficient;
Described normalization unit, the recommendation coefficient of each participle for obtaining described weighted units is returned One change processes, and obtains the standard recommendation coefficient that each participle is corresponding;
Described recommendation unit, for based on the standard obtained by the described normalization unit corresponding to each participle Recommend coefficient, recommend video according to predetermined manner to user.
Device the most according to claim 6, it is characterised in that also include: dictionary forms unit, its In, described dictionary forms unit, including: obtain subelement and form subelement,
Described acquisition subelement, for obtaining multiple historical search word and each corresponding searching of historical search word Suo Liang;
Described formation subelement, for volumes of searches corresponding for each historical search word is normalized, The weight coefficient corresponding to obtain each historical search word, by each historical search word and the weighting system of correspondence thereof The set of number is as weighting dictionary.
Device the most according to claim 7, it is characterised in that described formation subelement, including: the One mapping block, the second mapping block;Wherein,
Described first mapping block, is used for using min-max standardized method, and each historical search word is corresponding Volumes of searches be mapped to 0.0~1.0 in the range of, obtain primary weight coefficient;
Described second mapping block, is used for using sigmoid function to be mapped to by described primary weight coefficient In the range of 0.5~1.0, obtain the weight coefficient that each historical search word is corresponding, by each historical search word and The set of corresponding weight coefficient is as weighting dictionary.
Device the most according to claim 8, it is characterised in that described predetermined weight coefficient be more than 0 and Designated value in the range of less than 0.5.
Device the most according to claim 6, it is characterised in that described normalization unit, specifically for: Using the value of the quadratic sum evolution recommending coefficient of all participles as denominator, make with the recommendation coefficient of each participle For molecule, calculate the standard recommendation coefficient that each participle is corresponding.
CN201510379649.3A 2015-06-30 2015-06-30 A kind of video recommendation method and device Active CN105893397B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510379649.3A CN105893397B (en) 2015-06-30 2015-06-30 A kind of video recommendation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510379649.3A CN105893397B (en) 2015-06-30 2015-06-30 A kind of video recommendation method and device

Publications (2)

Publication Number Publication Date
CN105893397A true CN105893397A (en) 2016-08-24
CN105893397B CN105893397B (en) 2019-03-15

Family

ID=57001745

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510379649.3A Active CN105893397B (en) 2015-06-30 2015-06-30 A kind of video recommendation method and device

Country Status (1)

Country Link
CN (1) CN105893397B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111475725A (en) * 2020-04-01 2020-07-31 百度在线网络技术(北京)有限公司 Method, apparatus, device, and computer-readable storage medium for searching for content
CN111858564A (en) * 2019-04-28 2020-10-30 腾讯科技(深圳)有限公司 Data processing method, service processing method, device, terminal and storage medium
CN113807090A (en) * 2021-08-10 2021-12-17 三峡大学 Word sense weighting-based TF-IDF disease characterization word extraction method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102246169A (en) * 2008-12-15 2011-11-16 摩托罗拉移动公司 Assigning an indexing weight to a search term
CN103020164A (en) * 2012-11-26 2013-04-03 华北电力大学 Semantic search method based on multi-semantic analysis and personalized sequencing
CN103729359A (en) * 2012-10-12 2014-04-16 阿里巴巴集团控股有限公司 Method and system for recommending search terms
CN104077327A (en) * 2013-03-29 2014-10-01 阿里巴巴集团控股有限公司 Core word importance recognition method and equipment and search result sorting method and equipment
CN104123332A (en) * 2014-01-24 2014-10-29 腾讯科技(深圳)有限公司 Search result display method and device
CN104376065A (en) * 2014-11-05 2015-02-25 百度在线网络技术(北京)有限公司 Determination method and device for importance degree of search word

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102246169A (en) * 2008-12-15 2011-11-16 摩托罗拉移动公司 Assigning an indexing weight to a search term
CN103729359A (en) * 2012-10-12 2014-04-16 阿里巴巴集团控股有限公司 Method and system for recommending search terms
CN103020164A (en) * 2012-11-26 2013-04-03 华北电力大学 Semantic search method based on multi-semantic analysis and personalized sequencing
CN104077327A (en) * 2013-03-29 2014-10-01 阿里巴巴集团控股有限公司 Core word importance recognition method and equipment and search result sorting method and equipment
CN104123332A (en) * 2014-01-24 2014-10-29 腾讯科技(深圳)有限公司 Search result display method and device
CN104376065A (en) * 2014-11-05 2015-02-25 百度在线网络技术(北京)有限公司 Determination method and device for importance degree of search word

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111858564A (en) * 2019-04-28 2020-10-30 腾讯科技(深圳)有限公司 Data processing method, service processing method, device, terminal and storage medium
CN111475725A (en) * 2020-04-01 2020-07-31 百度在线网络技术(北京)有限公司 Method, apparatus, device, and computer-readable storage medium for searching for content
CN111475725B (en) * 2020-04-01 2023-11-07 百度在线网络技术(北京)有限公司 Method, apparatus, device and computer readable storage medium for searching content
CN113807090A (en) * 2021-08-10 2021-12-17 三峡大学 Word sense weighting-based TF-IDF disease characterization word extraction method
CN113807090B (en) * 2021-08-10 2024-04-30 三峡大学 TF-IDF disease characterization word extraction method based on word sense weighting

Also Published As

Publication number Publication date
CN105893397B (en) 2019-03-15

Similar Documents

Publication Publication Date Title
US10489399B2 (en) Query language identification
US7783629B2 (en) Training a ranking component
CN109815314B (en) Intent recognition method, recognition device and computer readable storage medium
CN105893476B (en) Intelligent answer method, knowledge base optimization method and device, Intelligence repository
US8386240B2 (en) Domain dictionary creation by detection of new topic words using divergence value comparison
CN111797214A (en) FAQ database-based problem screening method and device, computer equipment and medium
JP5379138B2 (en) Creating an area dictionary
CN110334356B (en) Article quality determining method, article screening method and corresponding device
CN106407420B (en) Multimedia resource recommendation method and system
CN107291939B (en) Hotel information clustering matching method and system
CN109597874B (en) Information recommendation method, device and server
CN110968800A (en) Information recommendation method and device, electronic equipment and readable storage medium
CN110276009B (en) Association word recommendation method and device, electronic equipment and storage medium
KR101541306B1 (en) Computer enabled method of important keyword extraction, server performing the same and storage media storing the same
CN110032622B (en) Keyword determination method, keyword determination device, keyword determination equipment and computer readable storage medium
CN111767393A (en) Text core content extraction method and device
US20140289260A1 (en) Keyword Determination
CN111144112A (en) Text similarity analysis method and device and storage medium
CN107085568A (en) A kind of text similarity method of discrimination and device
Widiyaningtyas et al. Sentiment Analysis Of Hotel Review Using N-Gram And Naive Bayes Methods
CN105893397A (en) Video recommendation method and apparatus
CN105677664A (en) Compactness determination method and device based on web search
US8005775B2 (en) System and method for detecting human judgment drift and variation control
CN110909532B (en) User name matching method and device, computer equipment and storage medium
CN109213937B (en) Intelligent search method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant