CN103870507B - Method and device of searching based on category - Google Patents

Method and device of searching based on category Download PDF

Info

Publication number
CN103870507B
CN103870507B CN201210548686.9A CN201210548686A CN103870507B CN 103870507 B CN103870507 B CN 103870507B CN 201210548686 A CN201210548686 A CN 201210548686A CN 103870507 B CN103870507 B CN 103870507B
Authority
CN
China
Prior art keywords
classification
matching
similarity
global
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210548686.9A
Other languages
Chinese (zh)
Other versions
CN103870507A (en
Inventor
王全剑
汤佳宇
林锋
翁晓颖
韦丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Tmall Technology Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201210548686.9A priority Critical patent/CN103870507B/en
Publication of CN103870507A publication Critical patent/CN103870507A/en
Application granted granted Critical
Publication of CN103870507B publication Critical patent/CN103870507B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device of searching based on category, and aims at solving the problems of complex searching method, long time taken and low efficiency in original category search. The method comprises the steps that a search request sent by a user in a platform, wherein the search request comprises a search keyword; global categories in a global category library are matched by using the search keyword so as to obtain first matched categories, and a first similarity of the search keyword and the first matched categories is calculated, wherein categories defined in the platform are taken as the global categories and stored in the global category library; the personalized information of the user is obtained, the first matched categories are matched secondarily based on the personalized information so as to obtain second matched categories, and a second similarity of the second matched categories and the search keyword is calculated; and the first matched categories are ranked according to the first similarity and the second similarity before feedback.

Description

A kind of searching method and device based on classification
Technical field
The application is related to search technique, more particularly to a kind of searching method and device based on classification.
Background technology
Resource on network is very enriched, and user can search all trades and professions, the information in each face of each side from network. Resource on network can be divided according to respective classification, facilitate user's storage resource and search resource.
In some comprehensive websites, generally big classification just has tens, after the big classification is accurately divided, specifically Classification may have thousand of.When the comprehensive website is browsed, user can issue resource to user, it is also possible to search, under Resource is carried, therefore website can provide classification navigation bar to facilitate user to find the resource wanted, and also allow for user's locating resource Suitable classification.
But, user will be from thousand of classifications of classification navigation bar, according to the descending sequential search of scope to suitable The classification of conjunction is extremely difficult.Possible searching route is when user searches for when such as doing shopping:Clothing → women's dress → chiffon shirt → Cotta ... → crew neck → pullover → cultivate one's moral character etc..And for example, during uploaded videos, the searching route of user may be:Video → electricity Depending on play → Hong Kong and Taiwan ... → policemen and bandits → 2012 etc..The lookup of classification, searching method are very loaded down with trivial details, and the time of consuming is long, Efficiency comparison is low.In addition, when user is according to scope descending sequential search classification, client used by user is needed repeatedly Inquiry request is sent to server, when number of users is larger, tremendous influence certainly will be caused to the access pressure of server.Example Such as:The searching route of user is:Clothing → women's dress → chiffon shirt → cotta ... → crew neck → pullover → cultivate one's moral character, when user clicks on When " clothing ", subscription client can send inquiry request to server, and " women's dress " is fed back to user by server after computing, When user clicks on " women's dress ", subscription client can again send inquiry request to server, and server will " snow after computing Spin shirt " user is fed back to, by that analogy, it is seen that this access stencil can cause larger access pressure to server.
Therefore, the technical problem that those skilled in the art urgently solve is to propose a kind of searcher based on classification Method, to solve original class heading search in searching method it is loaded down with trivial details, the time of consuming is long, and efficiency comparison is low and causes for server The technical problem of larger access pressure.
The content of the invention
The application provides a kind of searching method and device based on classification, to solve original class heading search in searching method it is numerous Trivial, the time of consuming is long, and efficiency comparison is low and causes the larger technical problem for accessing pressure for server.
In order to solve the above problems, this application discloses a kind of searching method based on classification, including:
The searching request that user sends in receiving platform, wherein, the searching request includes search keyword;
The global classification in global classification storehouse is matched using the search keyword, is obtained first and is matched classification, And the first similarity of the search keyword and the first matching classification is calculated, wherein, using the classification defined in platform as complete Office's classification is stored in global classification storehouse;
The individual information of user is obtained, based on the individual information the described first matching classification is carried out Secondary Match to obtain The second matching classification is obtained, and calculates the second similarity of the second matching classification and search keyword;
According to first similarity and the second similarity, the first matching classification is ranked up and is fed back.
In the embodiment of the present application, after the searching request of the receive user, also include:Search in searching request is closed Keyword is processed, and obtains at least one in following search terms:Centre word, word, centre word phrase and holophrastic.
In the embodiment of the present application, the global classification in global classification storehouse is matched using the search keyword, obtained The first matching classification is taken, and calculates the search keyword and first and match the first similarity of classification, including:It is respectively adopted each Individual search terms is matched to the global classification in global classification storehouse, obtains global classification that each search terms matched as One matching classification, and calculate the probit of Corresponding matching;The global weight of each search terms is obtained, according to the complete of each search terms Office's weight is weighted to respective probit, calculates the first similarity of the global classification of the search keyword and matching.
In the embodiment of the present application, the individual information of user is obtained, based on the individual information to the described first matching classification Secondary Match is carried out to obtain the second matching classification, and it is similar to the second of search keyword to calculate the second matching classification Degree, including:The individual information of user is obtained, and Secondary Match is carried out to the described first matching classification using the individual information, Obtain corresponding second matching classification;The centre word and/or word in search terms is obtained, the search terms for obtaining is calculated to described the The probit of two matching classifications;The individual character weight of each search terms is obtained, according to the individual character weight of each search terms to each general Rate value is weighted, and calculates the second similarity of the search keyword and the second matching classification.
In the embodiment of the present application, after calculating the first similarity of the search keyword and the first matching classification, also wrap Include:According to default screening threshold value, the described first matching classification is screened according to respective first similarity, obtain screening The first matching classification afterwards.
In the embodiment of the present application, according to first similarity and the second similarity, the first matching classification is ranked up And user is fed back to, including:The first classification weight for matching classification is obtained, according to the classification weight to the first matching classification First similarity is weighted;For based on individual information carry out Secondary Match first matching classification, by weighting after first Similarity matches the second similarity summation of classification with second that Secondary Match is obtained;Described first matching classification is ranked up And feed back.
In the embodiment of the present application, in e-commerce field, the global classification storehouse is to carry out process to product title to obtain After taking product treatment item, calculate what is constituted after the probability of product treatment item described in the corresponding classification of the product title.
In the embodiment of the present application, in e-commerce field, if user is seller, the individual information of the user is right The product title that the seller issues is carried out processing and obtained after seller's processing item, is calculated and is sold described in the corresponding classification of the seller Constitute after the probability of family's processing item.
In the embodiment of the present application, in e-commerce field, if user is buyer, the individual information of the user is right The product title that the buyer browses is carried out processing and obtained after buyer's processing item, calculates institute in the corresponding classification of the product title State what is constituted after the probability of buyer's processing item.
Accordingly, the invention also discloses a kind of searcher based on classification, including:
Receiver module, for the searching request that user in receiving platform sends, wherein, the searching request includes search Key word;
Global search module, for being matched to the global classification in global classification storehouse using the search keyword, Obtain first and match classification, and calculate the search keyword and first and match the first similarity of classification, wherein, by platform The classification of definition is stored in global classification storehouse as global classification;
Personalized search module, for obtaining the individual information of user, based on the individual information to the described first matching class Mesh carries out Secondary Match to obtain the second matching classification, and it is similar to the second of search keyword to calculate the second matching classification Degree;
Sequence and feedback module, for according to first similarity and the second similarity, carrying out to the first matching classification Sort and feed back to user.
Compared with prior art, the application includes advantages below:
First, prior art will successively be searched when classification is searched for according to the descending order of scope, and method is very numerous Trivial, the time of consuming is long, and efficiency comparison is low.The application can obtain search keyword from the searching request of user, so Carry out the matching based on global classification storehouse and the Secondary Match based on individual information respectively afterwards, user only needs input search crucial Word, it is relatively time-consuming and efficiency is higher without the lookup one by one of machinery, and avoid in prior art according to scope by Arriving little order greatly carries out sending the access pressure caused by inquiry request to server repeatedly due to needing when classification is inquired about.Its In, by being matched and be calculated the first similarity to all of global classification in platform in global classification storehouse, now obtain The the first matching classification for taking is very comprehensive;On this basis, when carrying out Secondary Match according still further to individual information and second is calculated Similarity, the second matching classification for now obtaining very meets the historical behavior of user, the demand of the user that can more fit. Then the first matching classification is ranked up according still further to respective similarity and feeds back to user.It is non-that the application searches out the classification for coming Normal is comprehensive, and meets the demand of user, may provide the user with the class heading search result of accurate, comprehensive laminating demand.
Secondly, the application can be processed the search keyword in searching request, obtains the search terms of refinement, then According to each search terms, the first matching classification is obtained after matching to the global classification in global classification storehouse, then based on individual character Information carries out Secondary Match, obtains second and matches classification.Such that it is able to acquire more accurate matching result, and then search for Key word and first matching classification the first similarity, and search keyword and second matching classification the second similarity all compare Relatively accurately, to user feedback accurately, thorough result.
Again, the application can preset screening threshold value, the first matching classification be screened, such that it is able to reduce feedback one The first low matching classification of a little similarity-rough sets, reduces the waste of resource.
Again, the application can apply in e-commerce field, and it is suitable to recommend when both can search for product for buyer Classification, or recommend suitable classification during seller's release product, application very comprehensively, user's request of fitting.
Description of the drawings
Fig. 1 is a kind of searching method flow chart based on classification described in the embodiment of the present application;
Fig. 2 is the matching flow chart in global classification storehouse in a kind of searching method based on classification described in the embodiment of the present application;
Fig. 3 is the matching flow chart of individual information in a kind of searching method based on classification described in the embodiment of the present application;
Fig. 4 is the flow chart for sorting in a kind of searching method based on classification described in the application preferred embodiment and feeding back;
Fig. 5 is a kind of searching method schematic diagram based on classification described in the application preferred embodiment;
Fig. 6 is a kind of searcher structure chart based on classification described in the embodiment of the present application;
Fig. 7 is a kind of searcher structure chart based on classification described in the application preferred embodiment.
Specific embodiment
It is understandable to enable the above-mentioned purpose of the application, feature and advantage to become apparent from, it is below in conjunction with the accompanying drawings and concrete real Apply mode to be described in further detail the application.
With reference to Fig. 1, a kind of searching method flow chart based on classification described in the embodiment of the present application is given.
Step 101, the searching request that user sends in receiving platform;
In some comprehensive websites, the big classification of generally definition just has tens, and the big classification is accurately divided Afterwards, specific classification may have thousand of.If regarding a website as a platform, the classification defined in the platform may There are thousand of, when user carries out class heading search in platform, be input into after search keyword in search box, can trigger and send out Searching request is sent, wherein it is possible to the search keyword is then wrapped as the Transfer Parameters of searching request in the searching request Search keyword is included, therefore correspondence is received after the searching request of user's transmission, can be with from the Transfer Parameters of searching request Obtain search keyword.
Search keyword that subsequently can be in searching request, carries out global search and personalized search, wherein, by platform Defined in classification as global classification, therefore the global search is that all classifications defined in platform are scanned for.And Individual character classification is that the classification to matching in global search carries out Secondary Match, so that the classification of matching more meets user's Demand.
Step 102, is matched using the search keyword in global search to the global classification in global classification storehouse, Obtain first and match classification, and calculate the first similarity of the search keyword and the first matching classification;
It is stored in the classification defined in platform as global classification in global classification storehouse, in global search in the application In, the global classification in global classification storehouse can be matched using the search keyword, for example, search keyword is " apple ", then the global classification for coming is matched from global classification storehouse can include cell phone type, panel computer class, notebook electricity Brain class, fruits and dry fruit etc., using the above-mentioned global classification for matching as the first matching classification.
Then the first similarity of the search keyword and the first matching classification can be calculated, wherein, X and Y similarities The X possibility degree related to Y is referred to, then what the first similarity referred to can be search keyword and the first matching classification matching Probability.
A kind of computational methods can be as, calculate the global classification C in the case of given search keyword Q probit P (C | Q), using probit P (C | Q) as the search keyword and the first similarity of the first matching classification.
Wherein, probability, also known as probability, chance rate, probability or probability, is the basic conception of mathematical probabilities opinion, is one Real number between 0 to 1, is the tolerance of the probability occurred to random event.Then represent that the probability that an event occurs is big Little number, is called the probability of the event, then probit is exactly the value of the probability that an event occurs.
Then P (C | Q) what is referred to can be the probability that search keyword Q belongs to global classification C value, namely search keyword Q and the probability of global classification C matchings, therefore can be using P (C | Q) as first similarity of Q and C.Such as P (C | Q)=30%, The probability that then illustrating search keyword Q has 30% belongs to global classification C, then illustrate the search keyword and the first matching class The probability of mesh matching is 0.3, i.e., the first similarity is 0.3.
Step 103, obtains the individual information of user, the described first matching classification is carried out based on the individual information secondary Matching calculates the second similarity of the second matching classification and search keyword to obtain the second matching classification;
Each user in platform has its event trace in platform, and for example, browsing pages send message etc., Event trace in these platforms may be constructed the individual information of user.
In order that Search Results more meet the demand of user, the first above-mentioned matching classification can be carried out based on user Individual information Secondary Match, and calculate the second similarity of the second matching classification and the search keyword.
As above search keyword " apple " in example, the first matching classification includes cell phone type, panel computer class, notebook Computer, fruits and dry fruit.If only including electronic product in the individual information of user, entered based on individual information During row Secondary Match, the second matching classification for getting is cell phone type, panel computer class and notebook computer class.
The second similarity of the search keyword and the second matching classification is further calculated, then second similarity What is referred to can be the possible degree of search keyword and the second matching classification matching.Equally can calculate in given search keyword Probit P (C ' | Q) of the second matching classification C ' in the case of Q, that is, determine that search keyword Q belongs to the second matching classification C ''s Probability, has also determined that the possibility degree of search keyword Q and the second matching classification C ' matching, you can with by the probit P (C ' | Q) as the search keyword and the second similarity of the second matching classification.
Step 104, according to first similarity and the second similarity, is ranked up and feeds back to the first matching classification.
It is above-mentioned to have got the first matching classification and the first similarity, and the first matching classification is entered by individual information After row Secondary Match, get to the second matching classification and the second similarity, can be according to similarity to the first matching class Mesh is ranked up.
Wherein, some first matching classifications are simultaneously comprising the first similarity and the second similarity, and some first matching classes Mesh only includes the first similarity, therefore, before being ranked up to the global classification for matching, can first calculate each the first matching Total similarity of classification.Therefore, total similarity of some the first matching classifications is the sum of the first similarity and the second similarity, such as Cell phone type, panel computer class and notebook computer class in upper example;And total similarity of some the first matching classifications is first Similarity, as above fruits and dry fruit in example.Then classification is matched by all first, is carried out according to corresponding total similarity Sequence, and the result after sequence is fed back to into user.
In sum, prior art will successively be searched when classification is searched for according to the descending order of scope, and method is non- Often loaded down with trivial details, the time of consuming is long, and efficiency comparison is low and access pressure of server is larger.The application can searching from user Search keyword is obtained in rope request, the matching based on global classification storehouse and secondary based on individual information are then carried out respectively Match somebody with somebody, user only needs to be input into search keyword, and without the lookup one by one of machinery, relatively time-consuming, efficiency is higher, and reduces The access pressure of server.Wherein, all of global classification in platform can be matched and is counted by global classification storehouse The first similarity is calculated, now the first matching classification is very comprehensive;On this basis, Secondary Match is carried out according still further to individual information When and calculate the second similarity, the second matching classification for now obtaining very meets the historical behavior of user, can more stick on Share the demand at family.Then the first matching classification is ranked up according still further to respective similarity and feeds back to user.The application is searched Rope classification out is very comprehensive, and meets the demand of user, may provide the user with accurate, comprehensive, laminating demand Class heading search result.
Preferably, after the searching request of the receive user, also include:
Search keyword in searching request is processed, at least one in following search terms is obtained:Centre word, list Word, centre word phrase and holophrastic.
For example, in e-commerce website, can be comprising phrase or word, wherein institute in the search keyword of user input Stating word can be divided into two classes on semantic understanding, and a class is the product word of clear and definite basic product type, such as car, bus etc., This kind of word is referred to as centre word (CenterWord);An other class is to modify other vocabulary of the said goods word, such as blue car In blue, this kind of word referred to as modifies vocabulary (NormalWord), or is directly referred to as word.
For for phrase (Noun Phrase, NP), can be according to above-mentioned centre word and word(Modification vocabulary), together Phrase is divided into two classes, the i.e. phrase comprising centre word and is also referred to as prime phrase (CenterNP) by sample, and the phrase comprising word then claims For holophrastic (NormalNP).
Assume that search keyword Q is obtained after centre word extraction:
Q={NWs,CWs,NPs,CNPs} (1)
Wherein, NWs represents the set of all words, and CWs represents the set of all centre words, and NPs represents that all words are short The set of language, CNPs represents the set of all prime phrases.
For the text message of all products in website, such as product title can do identical understanding.Can be with the application Realized to product title or the place of search keyword using centre word extraction tool (Center Word Extractor, CWE) Reason.In the dependency of predicted query keyword and classification, centre word, word, centre word phrase and holophrastic for matching The impact of global classification be different, can be weighed by weight in being embodied as.
Preferably, in e-commerce field, the global classification storehouse is that product title is carried out processing to obtain at product After reason item, calculate what is constituted after the probability of product treatment item described in the corresponding classification of the product title.
The text handling method of above-mentioned discussion is equally applicable to the process to product title, specifically includes:
2.1 text-processing;
It is short using the product treatment item in centre word extraction tool extraction product title, including centre word, word, centre word Language and common phrase, and root is carried out to the product word process, such as lights and lighting go root to process after when light。
2.2 probability calculation;
The probability distribution of global classification, that is, determine that the product treatment item belongs to this in the case of the given product treatment item of calculating The probability of global classification, then with the product treatment item as key word(KEY)Set up the index in global classification storehouse.Pass through The key word(KEY)Each global classification can be found, and finds the probability that KEY belongs to each global classification.
For example, after text-processing, the global classification (GetInitialCandidate) of the candidate of lookup, Ke Yijian Sign an agreement, a contract, a receipt, etc. allusion quotation, including the co-occurrence dictionary (coccurIndex) of centre word and centre word periphery word, such as " a b c " is a product Title, c is centre word, then using centre word periphery word " a b " and " bc " all as KEY " a c " and " b c " can exist respectively The number of times that each global class occurs now is used as key value(Value)In being added to the index in global classification storehouse:
{“a c”,(cat1,cnt1)(cat2,cnt2).....}
{“b c”,(cat1,cnt1)(cat2,cnt2).....}
The prior probability index of word or phrase in global class now is similar, and correspondence can obtain prior probability rope Draw(catTokenIndex).
By above-mentioned method, subsequently get after search keyword, can be by the search keyword and co-occurrence dictionary Matched, it is determined that the KEY and corresponding Value of matching, and the first matching classification is determined according to Value, then subsequently calculated the One similarity.
Further, model training can also be carried out on the basis of above-mentioned process, so as to obtain each product treatment item Global weight, it is specific as follows:
2.3 model training;
Product treatment item and global classification in preset search key word, sets up training data set, and each described in labelling The value of the dependency of individual product treatment item and global classification, then can take RANK-SVM models to learn on the training data, Obtain the global weight of every kind of product treatment item, i.e. centre word, word, centre word phrase and holophrastic global weight.
Wherein, RANK-SVM models are selected to train global weight, its ultimate principle is to estimate general by SVM model solutions The parameter of rate P, then the parameter be global weight, the global weight is product treatment item relative to the important of global classification Degree.Product treatment item is divided into centre word, word, centre word phrase and holophrastic, and for example, centre word is iphone 4s, single Word is apple, then determine that the number of times for belonging to electronic product is more than by iphone4s and belong to electronic product by apple determinations Number of times, it is therefore to be understood that for same global classification, determining that product title belongs to the global classification with centre word Number of times, more than the number of times for belonging to the global classification with word determination.It is for centre word importance phase for global classification To higher, the importance of word is relatively lower, for example, by calculating centre word, word, centre word phrase and holophrastic Global weight be respectively 0.4,0.2,0.3,0.1.
Wherein, the SVM(Support Vector Machine, support vector machine)It is a kind of trainable machine learning Method.In present application example in terms of machine learning method, in addition to using RANK-SVM models, other machines can also be adopted Device learning algorithm, such as Pranking, Rankboost and other Learning-To-Rank models.
Preferably, in e-commerce field, if user is seller, the individual information includes what the seller issued Product title, then the individual information of the user is that the product title that the seller issues is carried out processing to obtain seller's processing item Afterwards, calculate what is constituted after the probability of seller's processing item described in the corresponding classification of the seller.If user is buyer, described Property information include the product title that browses of the buyer, then the individual information of the user is the product mark browsed to the buyer Topic is carried out processing and obtained after buyer's processing item, described in the corresponding classification of the calculating product title after the probability of buyer's processing item Constitute.
When the data to user are processed to obtain individual information, text can be processed.Wherein, if user For seller, then processed mainly for the product title of seller issue;If user is buyer, buy mainly for described The product title that family browses is processed.Wherein, seller's processing item and buyer's processing item can include centre word and word, can be with Do not mark off centre word phrase and holophrastic, and concrete processing method and 2.1 texts in above-mentioned global classification storehouse method for building up Process basically identical, here is omitted.
In probability calculation, if user is seller, seller's processing item is calculated general in the corresponding classification of the seller Rate is distributed.Such as the mainly electronic product of seller sale, the corresponding classification of the seller includes mobile phone, mp3/mp4 and electricity Brain accessory, then carry out processing after acquisition seller's processing item to the product title that the seller issues, and just calculates seller's processing item and exists Probability distribution in mobile phone, mp3/mp4 and computer fittings.Then the individual information of the seller can be got.
If user is buyer, buyer's processing item buyer's processing item described in the corresponding classification of the product title is calculated Probability.The product title for such as being browsed according to buyer, gets the corresponding classification of the product title, i.e., what described seller browsed Classification includes:One-piece dress, women's shoes, suitcase and shirt, then carry out processing at acquisition buyer to the product title that the buyer browses After reason item, probability distribution of the buyer's processing item in one-piece dress, women's shoes, suitcase and shirt can be calculated, and then obtain institute State the individual information of buyer.
The method of above-mentioned " 2.2 probability calculation " can be subsequently adopted to calculate probability, and using in above-mentioned " 2.3 model training " Method, determine the individual character weight of each buyer's processing item and seller's processing item, here is omitted.
With reference to Fig. 2, give global classification storehouse in a kind of searching method based on classification described in the embodiment of the present application With flow chart.
Global search be to platform in global classification match, concrete grammar includes:
The global classification in global classification storehouse is matched using the search keyword, is obtained first and is matched classification, And calculate the search keyword and first matching classification the first similarity, including:
Step 201, is respectively adopted each search terms and the global classification in global classification storehouse is matched, and obtains each and searches The global classification that rope item is matched calculates the probit of Corresponding matching as the first matching classification;
Search terms after search keyword is processed has been got by said method, including centre word, word, centre word be short Language and holophrastic.Also include product treatment item in global classification storehouse, i.e., the corresponding centre word of described product title, word, in Heart word phrase and holophrastic, and the probability in the corresponding global classification of the product title.
Therefore, it is possible to each search terms is respectively adopted match to the global classification in global classification storehouse, first is obtained Matching classification.Then in the global classification storehouse, the probit that each search terms correspondence first matches classification is obtained, that is, is given The probit of the first matching classification in the case of search terms.As in the centre word in search terms, with global classification storehouse certain The corresponding centre word matching of global classification, then probit of the centre word in global classification storehouse is in the search terms The probit of centre word.
Step 202, obtains the global weight of each search terms, according to the global weight of each search terms to respective probit It is weighted, calculates the first similarity of the search keyword and the first matching classification.
First similarity of the search keyword and the first matching classification, is made up of the probit of each search terms, Because the significance level relative to the first matching classification of each search terms is different, therefore it is accomplished by by global weight in mark The significance level of each search terms is noted, i.e., described global weight refers to the probit of each search terms relative to first similarity Significance level.If search terms is matched with the product treatment item in global classification storehouse in the embodiment of the present application, the product treatment Global weight can oppose the global weight of the search terms.
The global weight of each search terms can be obtained by above-mentioned processing method, then according to each search terms is complete Office's weight is weighted to respective probit, is calculated the first similarity of the search keyword and the first matching classification.
A kind of processing method can be in above-mentioned formula(1)On the basis of, calculate first in the case of given search keyword Probit with classification, concrete formula is as follows:
P (C | Q)=P (C | (NWs, CWs, NPs, CNPs))(2)
It is different with the holophrastic impact in the prediction of global classification in view of centre word, word, centre word phrase, i.e., Centre word, word, centre word phrase and the holophrastic global weighted in the global classification.And centre word, list Word, centre word phrase and it is holophrastic between mutually independent, the new probability formula of above-mentioned condition probability(2)Can be converted into:
P (C | Q)=P (C | (NWs, CWs, NPs, CNPs))
=p (C | NWs)wBWs*p(C|CWs)wCWs*p(C|NPs)wNPs*p(C|CNPs)wCNPs(3)
Wherein:WNWs is the corresponding global weight of word for NWs, and wCWs is the corresponding global weight of centre word for CWs, WNPs is holophrastic corresponding global weight for NPs, and wCNPs is the corresponding global weight of prime phrase for CNPs.
Wherein, probability distribution of the P (C | NWs) for word in the first matching classification C, therefore its probability Estimation can be launched For the form of equation below:
P (C | NWs)=P (C | (nw1, nw2...nwk))=P (C | nw1) * P (C | nw2) ... P (C | nwk)(4)
Also, formula(3)In each factor can take such as formula(4)Form launched.
Then the similarity of search keyword Q and the first matching classification C can be according to above-mentioned formula(3)Calculated.And In order to solve convenience in Practical Calculation, to formula(3)Both sides carry out logarithm log calculating, and concrete formula is as follows:
Log (p (C | Q))=wNWs*log (p (C | NWs))+wCWs*log (p (c | CWs))+wNPs*
log(p(c|NPs))+wCNPs*log(p(c|CNPs))
(5)
Through above-mentioned conversion, the probability calculation formula of search keyword Q and the first matching classification C can be converted to linearly The Solve problems of model parameter, formula(5)Can also be used as the amount of the similarity between search keyword and the first matching classification Change the computing formula of index.
In formula(3)In, search keyword Q and first matching classification C probability rely on two data, one be NWs, The probability distribution of CWs, NPs and CNPs under the first matching classification C, another is the corresponding power of NWs, CWs, NPs and CNPs Weight, i.e. wNWs, wCWs, wNPs and wCNPs.
In being embodied as, such as in e-commerce platform, probability distribution P of word (C | W) can be by statistics platform The corresponding word of product title, obtains in all global classes prior distribution now, i.e. frequencies of the word W in global classification C is removed Estimation of the sum frequency occurred using word W as P (C | W).Wherein, centre word, holophrastic and prime phrase probability distribution Computational methods are identical, and here is omitted.In addition, for word W can give acquiescence without when occurring in C to it Value.
With reference to Fig. 3, individual information matching stream in a kind of searching method based on classification described in the embodiment of the present application is given Cheng Tu.
In addition to scanning for the overall situation, it is also contemplated that the preference of user itself, based on global classification storehouse Get in matching somebody with somebody on the basis of matching result, rearrangement can be entered to the classification of global search according to user preference, i.e., secondary Match somebody with somebody.
Generally in e-commerce website, the field that seller is managed is limited and concentrates that is, seller is generally managed The classification of product be limited, and concentrate in several classifications.Such as " Plastic Mat " divides in seller A and seller B Not Dui Ying tea cup mat and ground cushion, in above-mentioned global search, seller A will obtain the consistent recommendation results of sequence with B.The application Further improve on the basis of global search and recommend, realize recommending for the individual character of different sellers, according to seller A and can sell The difference of the classification of the product that family B is managed, recommends the inconsistent recommendation results of different sellers' sequences.Consider user's itself Preference, the method matched to individual information is specifically included:
Preferably, the individual information for obtaining user, is carried out based on the individual information to the described first matching classification Secondary Match calculates the second matching classification and the second similarity of search keyword to obtain the second matching classification, wraps Include:
Step 301, obtains the individual information of user, and carries out two to the described first matching classification using the individual information Secondary matching, obtains corresponding second matching classification;
When matching to individual information, the individual information of user can be obtained, then using the individual information to institute Stating the first matching classification carries out Secondary Match, obtains corresponding second matching classification.
Step 302, obtains the centre word and/or word in search terms, calculates the search terms for obtaining and matches to described second The probit of classification;
When processing individual information, the centre word or word in search terms can be obtained, or while in acquisition Heart word and word.Then the probit of the second matching classification can in the case of given search terms, be calculated.
Step 303, obtains the individual character weight of each search terms, according to the individual character weight of each search terms to respective probit It is weighted, calculates the second similarity of the search keyword and the second matching classification.
Because the impact of the relative second matching classification of different search termses is different in the embodiment of the present application, therefore can adopt Significance level of the search terms to the second matching classification, computational methods and the overall situation of the individual character weight are marked with individual character weight Weight is basically identical, and here is omitted.
After the individual character weight for obtaining each search terms, can be according to the individual character weight of each search terms to respective probit It is weighted, then the probit after weighting is sued for peace, calculates the second similarity of the search keyword and the second matching classification, Circular is similar with the computational methods in above-mentioned global search, and here is omitted.
The above-mentioned process based on individual information can be considered personalized recommendation, due in the firm-wide that is located in seller, will Holophrastic and prime phrase is relatively difficult to form statistical significance as the feature of personalized recommendation, therefore mainly considers global search In Data Representation in user scope of word NWs and centre word CNWs.By taking seller as an example, the distribution limit of word or centre word It is scheduled in seller's firm-wide, that is, is limited in the range of the corresponding classification of product title that the seller issues.For example, word W In probability distribution P (C | (W, Comp)) of classification C under company Comp, the number of times under classification C is occurred in company Comp by W The counted prior probability of total degree occurred in Comp divided by W is represented.
Preferably, after calculating the first similarity of the search keyword and the first matching classification, also include:
According to default screening threshold value, the described first matching classification is screened according to respective first similarity, obtained Take the first matching classification after screening.
Search keyword Q is assumed after global search, the first matching classification collection for obtaining recommending is combined into { C1,C2, C3,C4..., the first matching classification is according to formula(5)Fraction arrange from big to small.
For the Search Results for being supplied to user more accurate, in view of the classification belonging to a product is limited, and reduce The waste of resource, therefore the application has preset screening threshold value, the value of some similarities in the first matching classification set of recommendation Than relatively low, it is impossible to which the first matching classification for reaching screening threshold requirement will be rejected directly, i.e., only similarity reaches screening threshold The first matching classification that value is required can just enter personalized recommendation flow process.
It is for instance possible to use whether the first matching classification and the optimum first fraction ratio matched between classification reach to measure To the requirement of screening threshold value.Wherein, the optimum first matching classification refers to that similarity highest first matches classification.
Such as formula(5)The result for obtaining is negative, therefore matches classification and optimum first when coming below first Matching classification C1Between multiple exceed screening threshold value T when, it is believed that this first matching classification and afterwards first matching class All with search keyword Q dependency less, i.e. the first similarity is unable to reach the requirement of screening threshold value to mesh, will not enter follow-up In personalized search flow process.
Certainly, the screening threshold value can be used for limiting the first matching classification quantity, for example, arrange screening threshold It is worth for 10, then only obtains the front 10 first matchings classification in the first matching classification set of recommendation.The application is for first Screening technique with classification is not limited.
With reference to Fig. 4, give in a kind of searching method based on classification described in the application preferred embodiment and sort and feed back Flow chart.
Preferably, according to first similarity and the second similarity, the first matching classification is ranked up and is fed back to User, including:
Step 401, obtains the first classification weight for matching classification, according to the classification weight to the first matching classification the One similarity is weighted.
Because the first matching classification and second matches impact difference of the classification for final Search Results, therefore the application For the first matching classification is also provided with classification weight, the computational methods of the classification weight and above basic in embodiment Cause, here is omitted.
Therefore just the first similarity of the first matching classification can be weighted according to the classification weight.So that it is determined that Go out the similarity of each the first matching classification.
Step 402, detects whether the first matching classification has carried out Secondary Match based on individual information.
Some first matching classifications are only comprising the first similarity in the embodiment of the present application, and some the first matching classification also bases Secondary Match is carried out in individual information, so that it is determined that the second matching classification, i.e. the first matching classification had both included first phase Like spending, the second similarity of the second matching classification for also obtaining comprising Secondary Match.Therefore, matching to first according to similarity Before classification is ranked up, total similarity of each first matching classification is first determined.
Then total similarity of only the comprising the first similarity first matching classification is exactly the first similarity after above-mentioned weighting, As above total similarity of fruits and dry fruit is exactly the first similarity after its weighting in example.
If so, i.e. described first matching classification has carried out Secondary Match based on individual information, then subsequent execution step 403; If it is not, i.e. described first matching classification does not carry out Secondary Match, then subsequent execution step 404.
Step 403, by weighting after the first similarity and match the second similar of classification with second that Secondary Match is obtained Degree summation.
And be directed to based on individual information carried out Secondary Match first matching classification, by weighting after the first similarity and It is exactly first similar after weighting that second obtained with Secondary Match match the second similarity summation of classification, i.e. its total similarity The sum of degree and second similarity.As above cell phone type in example, panel computer class and notebook computer class, its total similarity is just Be weighting after the first similarity and second similarity sum.
Step 403, is ranked up and feeds back to the described first matching classification.
Most at last each the first matching classification is ranked up according to respective total similarity, and by the Search Results after sequence Feed back to user.
For example, the first matching classification is got in global search for A1, B1, C1 and D1, the value point of the first similarity of correspondence Not Wei 15,9,8,2, first matching classification classification weight be 1.5.In personalized search obtain the second matching classification for B1 and D1, the value of its corresponding second similarity is 10 and 5.Then total similarity of final each the first matching classification is respectively:
A1:15*1.5=22.5;
B1:9*1.5+10=23.5;
C1:8*1.5=12;
D1:2*1.5+5=8
First matching classification is ranked up, the Search Results for feeding back to user are B, A, C and D.
With reference to Fig. 5, a kind of searching method schematic diagram based on classification described in the application preferred embodiment is given.
Search keyword Query enters the global classification of candidate after pretreatment (preprocess), i.e. CWE process and looks into Process is looked for, i.e., global class heading search (catGlobalRec).
It is the corresponding global weight of search terms that each feature will be loaded in global search flow process, while searching prior probability Index (catTokenIndex) determines the first matching classification, and according to formula(5)It is the first similarity to calculate recommender score Value.
After global search, the fractional multiple sieve between classification is matched according to the first matching classification after sequence and optimum first The first matching classification is selected, the first matching classification after screening enters the personalized search stage.
Personalized search can search the information under user's dimension, the i.e. rearrangement on the basis of global recommendation (catCompanyRerank)。
Subsequently into last handling process (postprocess), mainly recommendation results are done with some letters according to requirement of engineering Single-filtering, such as limit first matching classification number N of recommendation etc., finally obtains N number of first matching classification and recommends user.
By above-mentioned method, on the first matching classification of first three recommendation, can be with analog subscriber to recommendation first Matching classification mark is correct(The classification that i.e. user to be searched for is related)And mistake(That is the classification to be searched for of user is uncorrelated), one In the individual search set (comprising 1000 search keywords) for including 35 big industries, the accuracy rate such as table 1 below of acquisition:
First matching classification ranking (Position) Accuracy rate (Precision)
1 95%
2 92%
3 86%
Table 1
By table 1, the accuracy rate of first first matching classification of recommendation is 95%, second first of recommendation Accuracy rate with classification is 92%, and the accuracy rate of the 3rd first matching classification of recommendation is 86%.
The application is reduced in e-commerce field, causes the product of seller's issue on platform because recommendation is inaccurate The misplaced ratio of classification.The global classification of the product that random inspection website is newly issued, the ratio declines 2%.The application is therefore either Still directly or indirectly affect had lifting by a relatively large margin from it from recommendation precision, and can be all search keywords Accurately classification recommendation service is provided.
In sum, the application can be processed the search keyword in searching request, obtains the search terms of refinement, Then according to each search terms, the first matching classification is obtained after matching to the global classification in global classification storehouse, then is based on Individual information carries out Secondary Match, obtains second and matches classification.Such that it is able to acquire more accurate matching result, and then First similarity of search keyword and the first matching classification, and the second similarity of search keyword and the second matching classification It is all relatively more accurate, to user feedback is accurate, comprehensive result.
Secondly, the application can preset screening threshold value, the first matching classification be screened, such that it is able to reduce feedback one The first low matching classification of a little similarity-rough sets, reduces the waste of resource.
Again, the application can apply in e-commerce field, and it is suitable to recommend when both can search for product for buyer Classification, or recommend suitable classification during seller's release product, application very comprehensively, user's request of fitting.
It is a kind of searcher structure chart based on classification described in the embodiment of the present application with reference to Fig. 6.
Accordingly, present invention also provides a kind of searcher based on classification, including:Receiver module 11, global search Module 12, personalized search module 13 and sequence and feedback module 14, wherein:
Receiver module 11, for the searching request that user in receiving platform sends, wherein, the searching request includes searching Rope key word;
Global search module 12, for the search keyword to be adopted in global search to the global class in global classification storehouse Mesh is matched, the matching classification of acquisition first, and calculates the first similarity of the search keyword and the first matching classification, its In, it is stored in the classification defined in platform as global classification in global classification storehouse;
Personalized search module 13, for obtaining the individual information of user, is matched based on the individual information to described first Classification carries out Secondary Match to obtain the second matching classification, and calculates the second phase of the second matching classification and search keyword Like degree;
Sequence and feedback module 14, for according to first similarity and the second similarity, entering to the first matching classification Row sequence simultaneously feeds back to user.
With reference to Fig. 7, a kind of searcher structure chart based on classification described in the application preferred embodiment.
Preferably, present invention also provides a kind of searcher for being preferably based on classification, including:
Receiver module 21, for the searching request that user in receiving platform sends, wherein, the searching request includes searching Rope key word;
Preferably, described device also includes:
Key word processing module 22, for processing the search keyword in searching request, obtains following search terms In at least one:Centre word, word, centre word phrase and holophrastic.
Global search module 23, for the search keyword to be adopted in global search to the global class in global classification storehouse Mesh is matched, the matching classification of acquisition first, and calculates the first similarity of the search keyword and the first matching classification, its In, it is stored in the classification defined in platform as global classification in global classification storehouse;
Preferably, global search module 23 includes:
Matched sub-block 231, matches for each search terms to be respectively adopted to the global classification in global classification storehouse, Global classification that each search terms matched is obtained as the first matching classification, and calculates the probit of Corresponding matching;
Calculating sub module 232, for obtaining the global weight of each search terms, according to the global weight pair of each search terms Each probit is weighted, and calculates the first similarity of the search keyword and the first matching classification.
Preferably, described device also includes:
Screening module 24, for according to default screening threshold value, matching to described first according to respective first similarity Classification is screened, and obtains the first matching classification after screening.
Personalized search module 25, for obtaining the individual information of user, is matched based on the individual information to described first Classification carries out Secondary Match to obtain the second matching classification, and calculates the second phase of the second matching classification and search keyword Like degree;
Preferably, the personalized search module 25 includes:
Acquisition submodule 251, for obtaining the individual information of user, and is matched using the individual information to described first Classification carries out Secondary Match, obtains corresponding second matching classification;
First calculating sub module 252, for obtaining search terms in centre word and/or word, calculate obtain search terms Probit to the described second matching classification;
Second calculating sub module 253, for obtaining the individual character weight of each search terms, weighs according to the individual character of each search terms Respective probit is weighted again, calculates the second similarity of the search keyword and the second matching classification.
Sequence and feedback module 26, for according to first similarity and the second similarity, entering to the first matching classification Row sequence simultaneously feeds back to user.
Preferably, simultaneously feedback module 26 includes for the sequence:
Weighting submodule 261, for obtaining the first classification weight for matching classification, according to the classification weight to first The first similarity with classification is weighted;
Summation submodule 262, for the first matching classification for carrying out Secondary Match based on individual information, after weighting The first similarity match with second that Secondary Match is obtained classification the second similarity summation;
Submodule 263 is sorted and feeds back, for the described first matching classification to be ranked up and fed back.
Preferably, in e-commerce field, the global classification storehouse is that product title is carried out processing to obtain at product After reason item, calculate what is constituted after the probability of product treatment item described in the corresponding classification of the product title.
Preferably, in e-commerce field, if user is seller, the individual information of the user is to the seller The product title of issue is carried out processing and obtained after seller's processing item, calculates seller's processing item described in the corresponding classification of the seller Probability after constitute.
Preferably, in e-commerce field, if user is buyer, the individual information of the user is to the buyer The product title for browsing is carried out processing and obtained after buyer's processing item, described in the corresponding classification of the calculating product title at buyer Constitute after the probability of reason item.
For device embodiment, due to itself and embodiment of the method basic simlarity, so description is fairly simple, it is related Part is illustrated referring to the part of embodiment of the method.
Each embodiment in this specification is described by the way of progressive, what each embodiment was stressed be with The difference of other embodiment, between each embodiment identical similar part mutually referring to.
Those skilled in the art are it should be appreciated that embodiments herein can be provided as method, system or computer program Product.Therefore, the application can be using complete hardware embodiment, complete software embodiment or with reference to the reality in terms of software and hardware Apply the form of example.And, the application can be adopted and wherein include the computer of computer usable program code at one or more Usable storage medium(Including but not limited to disk memory, CD-ROM, optical memory etc.)The computer program of upper enforcement is produced The form of product.
Although having been described for the preferred embodiment of the application, those skilled in the art once know basic creation Property concept, then can make other change and modification to these embodiments.So, claims are intended to be construed to include excellent Select embodiment and fall into having altered and changing for the application scope.
The application is with reference to method, the equipment according to the embodiment of the present application(System)And the flow process of computer program Figure and/or block diagram are describing.It should be understood that can be by computer program instructions flowchart and/or each stream in block diagram The combination of journey and/or square frame and flow chart and/or the flow process in block diagram and/or square frame.These computer programs can be provided The processor of general purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that produced for reality by the instruction of computer or the computing device of other programmable data processing devices The device of the function of specifying in present one flow process of flow chart or one square frame of multiple flow processs and/or block diagram or multiple square frames.
These computer program instructions may be alternatively stored in can guide computer or other programmable data processing devices with spy In determining the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory is produced to be included referring to Make the manufacture of device, the command device realize in one flow process of flow chart or one square frame of multiple flow processs and/or block diagram or The function of specifying in multiple square frames.
These computer program instructions also can be loaded in computer or other programmable data processing devices so that in meter Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented process, so as in computer or The instruction performed on other programmable devices is provided for realizing in one flow process of flow chart or multiple flow processs and/or block diagram one The step of function of specifying in individual square frame or multiple square frames.
Finally, in addition it is also necessary to explanation, herein, such as first and second or the like relational terms be used merely to by One entity or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or operation Between there is any this actual relation or order.And, term " including ", "comprising" or its any other variant meaning Covering including for nonexcludability, so that a series of process, method, commodity or equipment including key elements not only includes that A little key elements, but also including other key elements being not expressly set out, or also include for this process, method, commodity or The intrinsic key element of equipment.In the absence of more restrictions, the key element for being limited by sentence "including a ...", does not arrange Except also there is other identical element in including the process of the key element, method, commodity or equipment.
Above to the searching method based on classification and device provided herein, it is described in detail, herein should The principle and embodiment of the application are set forth with specific case, the explanation of above example is only intended to help and manages Solution the present processes and its core concept;Simultaneously for one of ordinary skill in the art, according to the thought of the application, Will change in specific embodiment and range of application, in sum, this specification content should not be construed as to this Shen Restriction please.

Claims (10)

1. a kind of searching method based on classification, it is characterised in that include:
The searching request that user sends in receiving platform, wherein, the searching request includes search keyword;
The global classification in global classification storehouse is matched using the search keyword, is obtained first and is matched classification, and counted The first similarity of the search keyword and the first matching classification is calculated, wherein, using the classification defined in platform as global class Mesh is stored in global classification storehouse;
The individual information of user is obtained, based on the individual information the described first matching classification is carried out Secondary Match to obtain the Two matching classifications, and the second similarity of the second matching classification and search keyword is calculated, wherein, by user in platform Event trace constitute user individual information;
According to first similarity and the second similarity, the first matching classification is ranked up and is fed back.
2. method according to claim 1, it is characterised in that after the searching request of the receive user, also include:
Search keyword in searching request is processed, at least one in following search terms is obtained:Centre word, word, Centre word phrase and holophrastic.
3. method according to claim 2, it is characterised in that using the search keyword to global classification storehouse in it is complete Office's classification is matched, and is obtained first and is matched classification, and calculates the first similar of the search keyword and the first matching classification Degree, including:
Each search terms is respectively adopted to match the global classification in global classification storehouse, obtains what each search terms was matched Global classification calculates the probit of Corresponding matching as the first matching classification;
The global weight of each search terms is obtained, respective probit is weighted according to the global weight of each search terms, counted Calculate the first similarity of the search keyword and the first matching classification.
4. method according to claim 2, it is characterised in that the individual information of user is obtained, based on the individual information Secondary Match is carried out to the described first matching classification to obtain the second matching classification, and calculates the second matching classification and search Second similarity of key word, including:
The individual information of user is obtained, and Secondary Match is carried out to the described first matching classification using the individual information, obtained Corresponding second matching classification;
The centre word and/or word in search terms is obtained, probit of the search terms for obtaining to the described second matching classification is calculated;
The individual character weight of each search terms is obtained, respective probit is weighted according to the individual character weight of each search terms, counted Calculate the second similarity of the search keyword and the second matching classification.
5. according to the arbitrary described method of claim 1 or 3, it is characterised in that calculate the search keyword and the first matching After first similarity of classification, also include:
According to default screening threshold value, the described first matching classification is screened according to respective first similarity, obtain sieve The first matching classification after choosing.
6. method according to claim 1, it is characterised in that according to first similarity and the second similarity, to One matching classification is ranked up and feeds back to user, including:
The first classification weight for matching classification is obtained, the first similarity of the first matching classification is carried out according to the classification weight Weighting;
For based on individual information carry out Secondary Match first matching classification, by weighting after the first similarity and Secondary Match The second similarity summation of the second matching classification for obtaining;
Described first matching classification is ranked up and is fed back.
7. according to the arbitrary described method of claim 1 or 3, it is characterised in that in e-commerce field, the global classification Storehouse is that product title is carried out processing to obtain after product treatment item, described in the corresponding classification of the calculating product title at product Constitute after the probability of reason item.
8. according to the arbitrary described method of claim 1 or 4, it is characterised in that in e-commerce field, if user is to sell Family, then the individual information of the user is that the product title that the seller issues is carried out processing to obtain after seller's processing item, meter Calculate what is constituted after the probability of seller's processing item described in the corresponding classification of the seller.
9. according to the arbitrary described method of claim 1 or 4, it is characterised in that in e-commerce field, if user is to buy Family, then the individual information of the user is that the product title that the buyer browses is carried out processing to obtain after buyer's processing item, meter Calculate what is constituted after the probability of buyer's processing item described in the corresponding classification of the product title.
10. a kind of searcher based on classification, it is characterised in that include:
Receiver module, for the searching request that user in receiving platform sends, wherein, the searching request includes that search is crucial Word;
Global search module, for being matched to the global classification in global classification storehouse using the search keyword, is obtained First matching classification, and the first similarity of the search keyword and the first matching classification is calculated, wherein, defined in platform Classification be stored in global classification storehouse as global classification;
Personalized search module, for obtaining the individual information of user, is entered based on the individual information to the described first matching classification Row Secondary Match calculates the described second the second similarity for matching classification and search keyword to obtain the second matching classification, Wherein, the event trace by user in platform constitutes the individual information of user;
Sequence and feedback module, for according to first similarity and the second similarity, being ranked up to the first matching classification And feed back to user.
CN201210548686.9A 2012-12-17 2012-12-17 Method and device of searching based on category Active CN103870507B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210548686.9A CN103870507B (en) 2012-12-17 2012-12-17 Method and device of searching based on category

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210548686.9A CN103870507B (en) 2012-12-17 2012-12-17 Method and device of searching based on category

Publications (2)

Publication Number Publication Date
CN103870507A CN103870507A (en) 2014-06-18
CN103870507B true CN103870507B (en) 2017-04-12

Family

ID=50909048

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210548686.9A Active CN103870507B (en) 2012-12-17 2012-12-17 Method and device of searching based on category

Country Status (1)

Country Link
CN (1) CN103870507B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104484380A (en) * 2014-12-09 2015-04-01 百度在线网络技术(北京)有限公司 Personalized search method and personalized search device
CN106708817B (en) * 2015-07-17 2020-11-06 腾讯科技(深圳)有限公司 Information searching method and device
CN106294674A (en) * 2016-08-02 2017-01-04 郑州悉知信息科技股份有限公司 A kind of information detecting method and device
CN108108380B (en) * 2016-11-25 2022-05-31 阿里巴巴集团控股有限公司 Search sorting method, search sorting device, search method and search device
CN108319888B (en) * 2017-01-17 2023-04-07 阿里巴巴集团控股有限公司 Video type identification method and device and computer terminal
CN107977866A (en) * 2017-12-14 2018-05-01 北京木业邦科技有限公司 Wood products recommend method, apparatus and computer-readable recording medium
CN108897734B (en) * 2018-06-13 2023-08-22 康键信息技术(深圳)有限公司 User portrait generation method, device, computer equipment and storage medium
CN110674429B (en) * 2018-07-03 2022-05-31 百度在线网络技术(北京)有限公司 Method, apparatus, device and computer readable storage medium for information retrieval
CN111221928B (en) * 2018-11-27 2024-02-23 上海擎感智能科技有限公司 Thematic map display method and vehicle-mounted terminal
CN111428123A (en) * 2019-01-09 2020-07-17 阿里巴巴集团控股有限公司 Query method and device
CN112015762A (en) * 2019-05-30 2020-12-01 广州慧睿思通信息科技有限公司 Case retrieval method and device, computer equipment and storage medium
CN112115177A (en) * 2020-09-30 2020-12-22 欧冶云商股份有限公司 Processing service resource searching and matching method, matching system and medium
CN112860978B (en) * 2021-03-31 2024-05-17 中国工商银行股份有限公司 Global searching method and device
CN113536118B (en) * 2021-06-29 2023-04-07 未鲲(上海)科技服务有限公司 Configuration method and device for searching default words, computer equipment and storage medium
CN113570413B (en) * 2021-07-28 2023-12-05 杭州王道控股有限公司 Advertisement keyword generation method and device, storage medium and electronic equipment
CN115708085A (en) * 2021-08-09 2023-02-21 腾讯科技(深圳)有限公司 Business processing method, neural network model training method, device, equipment and medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101231638A (en) * 2007-01-24 2008-07-30 阿里巴巴公司 Method, system and apparatus for implementation of class heading search
CN102063468A (en) * 2010-12-03 2011-05-18 百度在线网络技术(北京)有限公司 Device and method for confirming query sort of query sequence
WO2011137125A1 (en) * 2010-04-30 2011-11-03 Alibaba Group Holding Limited Vertical search-based query method, system and apparatus

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040260677A1 (en) * 2003-06-17 2004-12-23 Radhika Malpani Search query categorization for business listings search
KR20080087591A (en) * 2007-03-27 2008-10-01 삼성전자주식회사 Content search method and apparatus using category based on meta-data, and recording medium thereof

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101231638A (en) * 2007-01-24 2008-07-30 阿里巴巴公司 Method, system and apparatus for implementation of class heading search
WO2011137125A1 (en) * 2010-04-30 2011-11-03 Alibaba Group Holding Limited Vertical search-based query method, system and apparatus
CN102063468A (en) * 2010-12-03 2011-05-18 百度在线网络技术(北京)有限公司 Device and method for confirming query sort of query sequence

Also Published As

Publication number Publication date
CN103870507A (en) 2014-06-18

Similar Documents

Publication Publication Date Title
CN103870507B (en) Method and device of searching based on category
Ramzan et al. An intelligent data analysis for recommendation systems using machine learning
CN103870505B (en) One kind inquiry words recommending method and query word commending system
US9818142B2 (en) Ranking product search results
US9910930B2 (en) Scalable user intent mining using a multimodal restricted boltzmann machine
US20110225161A1 (en) Categorizing products
US20130290320A1 (en) Recommending keywords
US20140279774A1 (en) Classifying Resources Using a Deep Network
US10229190B2 (en) Latent semantic indexing in application classification
US20190066185A1 (en) Method and system for attribute extraction from product titles using sequence labeling algorithms
US20140012840A1 (en) Generating search results
JP2016532173A (en) Semantic information, keyword expansion and related keyword search method and system
US11741094B2 (en) Method and system for identifying core product terms
CN110909182A (en) Multimedia resource searching method and device, computer equipment and storage medium
CN108763321A (en) A kind of related entities recommendation method based on extensive related entities network
CN104933100A (en) Keyword recommendation method and device
US10970296B2 (en) System and method for data mining and similarity estimation
US20180246880A1 (en) System for generating synthetic sentiment using multiple points of reference within a hierarchical head noun structure
CN110427480B (en) Intelligent personalized text recommendation method and device and computer readable storage medium
US11249993B2 (en) Answer facts from structured content
CN106815265B (en) Method and device for searching referee document
TW201401088A (en) Search method and apparatus
CN112100396A (en) Data processing method and device
CN111428100A (en) Data retrieval method and device, electronic equipment and computer-readable storage medium
CN110489531B (en) Method and device for determining high-frequency problem

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1195143

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1195143

Country of ref document: HK

TR01 Transfer of patent right

Effective date of registration: 20221116

Address after: Room 507, floor 5, building 3, No. 969, Wenyi West Road, Wuchang Street, Yuhang District, Hangzhou City, Zhejiang Province

Patentee after: ZHEJIANG TMALL TECHNOLOGY Co.,Ltd.

Address before: Box 847, four, Grand Cayman capital, Cayman Islands, UK

Patentee before: ALIBABA GROUP HOLDING Ltd.

TR01 Transfer of patent right