CN103020224B - A kind of intelligent search method and device - Google Patents

A kind of intelligent search method and device Download PDF

Info

Publication number
CN103020224B
CN103020224B CN201210536673.XA CN201210536673A CN103020224B CN 103020224 B CN103020224 B CN 103020224B CN 201210536673 A CN201210536673 A CN 201210536673A CN 103020224 B CN103020224 B CN 103020224B
Authority
CN
China
Prior art keywords
keyword
demand
word
terminal
search
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210536673.XA
Other languages
Chinese (zh)
Other versions
CN103020224A (en
Inventor
何晏成
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210536673.XA priority Critical patent/CN103020224B/en
Publication of CN103020224A publication Critical patent/CN103020224A/en
Application granted granted Critical
Publication of CN103020224B publication Critical patent/CN103020224B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of intelligent search method and devices, the method comprise the steps that A. obtains the searching request of user;B. according to described search request in terminal attribute judge the type of the user, when the type of the user is mobile subscriber, execute step C;C. according to described search request in keyword judge whether the demand of the user related to terminal, if it is, by described search request in keyword expansion be keyword relevant to terminal;D. search result is obtained using the keyword relevant to terminal.By the above-mentioned means, the present invention improves the accuracy of search.

Description

A kind of intelligent search method and device
[technical field]
The present invention relates to search technique, in particular to a kind of intelligent search method and device.
[background technique]
With the development of mobile internet, also more and more using the user of mobile terminal Internet access, and existing search is drawn Hold up, the response mode to the user for using all terminals to scan for be it is identical, i.e., either which kind of terminal use user, Obtained search result is the same.Since existing search engine cannot obtain automatically symbol according to the terminal situation that user uses The search result of user demand is closed, therefore the accuracy searched for is lower.
[summary of the invention]
Technical problem to be solved by the invention is to provide a kind of intelligent search method and devices, to improve the accurate of search Property.
The present invention is in order to solve the technical problem and the technical solution adopted is that provide a kind of intelligent search method, comprising: A. is obtained Take the searching request at family;B. according to described search request in terminal attribute judge the type of the user, as the user Type be mobile subscriber when, execute step C;C. according to described search request in keyword judge that the demand of the user is It is no related to terminal, if it is, by described search request in keyword expansion be keyword relevant to terminal;D. it utilizes The keyword relevant to terminal obtains search result.
One of according to the present invention preferred embodiment, the step C includes: that C1. searches the demand vocabulary obtained in advance, with true The keyword in described search request is determined in the demand vocabulary with the presence or absence of matching word, if it is, determining the use The demand at family is related to terminal, executes step C2;C2. by described search request in terminal attribute and described search request in Keyword merges into keyword relevant to terminal.
One of according to the present invention preferred embodiment, the step of obtaining demand vocabulary in advance include: that S1. obtains search log; S2. cutting is carried out to described search log as unit of session, wherein each cutting segment corresponds to a conversation recording;S3. needle To each cutting segment, the keyword pair in the segment is extracted, wherein two keywords of the keyword centering are according in the piece Appearance sequence in section arranges, and is only contained in the word in first keyword and preset terminal attribute vocabulary in rear keyword; S4. demand word is chosen, from the first keyword of the keyword pair of extraction to obtain demand vocabulary.
One of according to the present invention preferred embodiment, choosing the mode of demand word in the step S4 includes being greater than conversion ratio The first keyword of setting value is chosen for demand word, wherein the conversion ratio of each first keyword calculates according to the following formula:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate the keyword centering extracted The number of keyword pair comprising w, t2 (w) indicate the number that w occurs in search log.
The present invention is in order to solve the technical problem and the technical solution adopted is that provide a kind of intelligent search device, comprising: receives Unit, for obtaining the searching request of user;First judging unit, for the terminal attribute judgement in being requested according to described search The type of the user, when the type of the user is mobile subscriber, triggering second judgment unit is executed;Second judgement is single Member judges whether the demand of the user is related to terminal for the keyword in requesting according to described search, if it is, will Keyword expansion in described search request is keyword relevant to terminal;Search unit, for using described with terminal phase The keyword of pass obtains search result.
Preferred embodiment, the second judgment unit include: matching unit one of according to the present invention, are obtained in advance for searching The demand vocabulary taken, to determine that the keyword in described search request whether there is matching word in the demand vocabulary, such as Fruit is, it is determined that the demand of the user is related to terminal, and triggers expanding element execution;Expanding element, for being searched described Terminal attribute in rope request merges into keyword relevant with terminal to the keyword in described search request.
Preferred embodiment, described device further comprise one of according to the present invention: vocabulary acquiring unit, for obtaining in advance The demand vocabulary, the vocabulary acquiring unit include: log acquisition unit, for obtaining search log;Log cutting unit, For carrying out cutting to described search log as unit of session, wherein each cutting segment corresponds to a conversation recording;Word pair Extraction unit extracts the keyword pair in the segment, wherein two of the keyword centering close for being directed to each cutting segment Keyword is arranged according to the appearance sequence in the segment, and is only contained in first keyword and preset terminal category in rear keyword Word in property vocabulary;Demand word selection unit, for choosing demand word in the first keyword of the keyword pair from extraction, with To demand vocabulary.
One of according to the present invention preferred embodiment, it includes that will convert that the demand selected ci poem, which takes the mode of unit selection demand word, The first keyword that rate is greater than the set value is chosen for demand word, wherein the conversion ratio of each first keyword is counted according to the following formula It calculates:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate the keyword centering extracted The number of keyword pair comprising w, t2 (w) indicate the number that w occurs in search log.
As can be seen from the above technical solutions, the present invention realizes a kind of end that the search need by user is used with user End type combines the method to obtain search result, and this method, which does not need the accurate keyword of user's input, to be automatically User returns to the very high search result of suitability, improves the accuracy of search.
[Detailed description of the invention]
Fig. 1 is the flow diagram of the embodiment of intelligent search method in the present invention;
Fig. 2 is the schematic diagram of the search result got in the present invention using keyword relevant to terminal;
Fig. 3 is the structural schematic block diagram of the embodiment one of intelligent search device in the present invention;
Fig. 4 is the structural schematic block diagram of the embodiment two of intelligent search device in the present invention;
Fig. 5 is the structural schematic block diagram of one embodiment of the second judgment unit 303 in the present invention.
[specific embodiment]
To make the objectives, technical solutions, and advantages of the present invention clearer, right in the following with reference to the drawings and specific embodiments The present invention is described in detail.
Referring to FIG. 1, Fig. 1 is the flow diagram of the embodiment of intelligent search method in the present invention.As shown in Figure 1, should Embodiment includes:
Step S101: the searching request of user is obtained.
Step S102: judging the type of user according to the terminal attribute in searching request, when the type of user is mobile uses When family, step S103 is executed.
Step S103: judge whether the demand of user is related to terminal according to the keyword (query) in searching request, such as Fruit is, then is keyword relevant to terminal by the keyword expansion in searching request.
Step S104: search result is obtained using keyword relevant to terminal.
Above-mentioned steps are described in detail below.
Several parameters are contained in the searching request of user, one of parameter indicates to be made when user issues searching request With the attribute of terminal, a parameter indicates the keyword that user is used to obtain search result and uses.Terminal attribute can be end The information such as model, the manufacturer at end, such as " Samsung " or " iphone5 " information.
When user issues searching request using search client, search client can call the system interface of equipment, from And it gets terminal attributive information and is encapsulated in the searching request of user.But if search client can not get terminal Attribute information, then the terminal attribute in the searching request of user can be null value.
After search server receives the searching request of user in step s101, in step s 102, so that it may from Terminal attribute is extracted in the searching request at family, if the terminal attribute extracted is null value either PC model etc Information, it may be considered that initiation user's non-moving subscribers of searching request, if the terminal attribute extracted is " iphone5 " etc Mobile terminal model etc information, then can determine that the initiation user of searching request is mobile subscriber, and execute step S103。
As an implementation, step S103 is specifically included:
Step S1031: the demand vocabulary obtained in advance is searched, to determine the keyword in searching request in the demand vocabulary In with the presence or absence of matching word, if it is, determine user demand it is related to terminal, execution step S1032.
Step S1032: the terminal attribute in searching request is merged into the keyword in searching request related with terminal Keyword.
The mode of acquisition demand vocabulary will be introduced subsequent.The word for including in the demand vocabulary, can represent user Demand be relevant to terminal, if the keyword in searching request matches with the word in the demand vocabulary, illustrate to use The demand at family is related to terminal.Such as the keyword in searching request is " mobile phone EMS memory deficiency is what if ", which is needing Asking in vocabulary has matching word, then may determine that using the demand of the user of the keyword be relevant to terminal, therefore executes Step S1032.
The keyword in searching request is extended in step S1032, it is a kind of to obtain keyword relevant to terminal Mode is that the terminal attribute in searching request is incorporated as keyword relevant with terminal to the keyword in searching request.Example If the terminal attribute in searching request is " HTC ", and the keyword in searching request is " what if is mobile phone EMS memory deficiency ", then may be used Using by " what if is HTC mobile phone EMS memory deficiency " as keyword relevant to terminal.
Referring to FIG. 2, Fig. 2 is the signal of the search result got in the present invention using keyword relevant to terminal Figure.As it can be seen that by means of the invention it is also possible to the search need of adaptive user, improves the accuracy of search.
Demand vocabulary in the present invention can be the known vocabulary of third party's offer, in addition, demand vocabulary can also pass through Mode acquires below.
Specifically, the mode for obtaining demand vocabulary includes step S201, step S202, step S203 and step S204.
Step S201: search log is obtained.Search log is the file for recording multiple user's search behaviors.It is searching for In log, the search behavior of the same user may be recorded in multiple sessions (session), and session is to indicate one section of user behaviour Make the unit of time.In a session, it can recorde the same user to be used to obtain multiple keywords that search result uses, Such as user elder generation search key " what if is mobile phone EMS memory deficiency ", then modifying keyword is that " iphone mobile phone EMS memory is insufficient What if " scan for, if the time of the two search operations all within a session period, is searching for the same of log In a conversation recording, the information such as the two keywords and its time searched respectively are just had.
Step S202: being that unit carries out cutting to search log with session (session), wherein each cutting segment is corresponding One conversation recording.By the narration of front it is known that search log contains multiple conversation recordings, step S202 will then be searched Each conversation recording of Suo Zhi is as a cutting segment.
Step S203: being directed to each cutting segment, extract the keyword pair in the segment, wherein the keyword centering extracted Two keywords are arranged according to the appearance sequence in the segment, and are only contained in first keyword and preset in rear keyword Word in terminal attribute vocabulary.Terminal attribute vocabulary can be collected to obtain by data mining, include in the vocabulary and mobile terminal The relevant various information of attribute, such as can have the type of the brand name (such as iphone, nokia) of mobile terminal, mobile terminal Number title (such as I9100, galaxy S3), mobile terminal affiliated system platform title (such as ios, android).In cutting piece The appearance sequence of Duan Zhong, keyword represent time sequencing when being searched for by user, and first keyword is the pass that user first searches for Keyword, the keyword searched for after rear keyword is user.If being only contained in rear keyword and first closing in a cutting segment Word in keyword and terminal attribute vocabulary, then the two there are the first keyword of inclusion relation and in rear keyword extraction Out, as a keyword pair.Such as have recorded three keywords in a cutting segment respectively sequentially in time, respectively It is:
Keyword A: mobile phone EMS memory
Keyword B: mobile phone EMS memory is insufficient
Keyword C: Samsung mobile phone EMS memory
It include first keyword A in rear keyword B, and further include word " deficiency " for keyword A and B, due to " deficiency " is not the word in terminal attribute vocabulary, so keyword A and B are not as keyword pair.For keyword B and C, Keyword C does not include first keyword B afterwards, therefore keyword B and C are also not as keyword pair.For keyword A and C, Keyword C includes first keyword A afterwards, and further includes word " Samsung " in rear keyword C, and " Samsung " is terminal attribute word Word in table, therefore keyword A and C can be extracted as a keyword pair.
Step S204: demand word is chosen, from the first keyword of the keyword pair of extraction to obtain demand vocabulary.As All first keywords of the keyword centering of extraction can be chosen for demand word by a kind of embodiment.But it is more highly preferred to Mode, be first to calculate the conversion ratio of each first keyword, and the first keyword that conversion ratio is greater than the set value is chosen for Demand word.Wherein, the conversion ratio of each first keyword can calculate according to the following formula:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate that the keyword centering extracted includes the key of w The number of word pair, t2 (w) indicate the number that w occurs in search log.
The demand word of selection is recorded in database table, so that it may obtain the demand vocabulary in the present invention.Above to this The mode for obtaining demand vocabulary in invention in advance has carried out complete introduction.
The feasible embodiment of demand another kind whether relevant to terminal that user is judged in step S103 is described below.
Whether the demand in order to judge user in step s 103 is related to terminal, can also advance with training corpus pair The disaggregated model of one unknown parameter is trained, and utilizes the disaggregated model of known parameters after this training in step s 103 Classify to the keyword in searching request to determine whether user demand is related to terminal.
The acquisition process of the disaggregated model of known parameters is introduced below.
Firstly, a collection of demand keyword relevant to terminal can be marked in search log as training corpus.Then from Characteristic of division is extracted in the training corpus marked.The finally classification using the training corpus with characteristic of division to unknown parameter Model is trained, and after training, disaggregated model is just provided with known parameter, can be used for the key in searching request Word is classified.The disaggregated model of unknown parameter is trained, can be carried out by existing various machine learning methods, herein Repeat no more its realization process.And in the present embodiment, the type of characteristic of division includes at least following one kind:
Feature one: the text feature of keyword.The text feature of word refers to after segmenting to keyword, word N-gram information, in the present embodiment, N can be 1 or 2.
Feature two: the title feature of the page is clicked caused by keyword.Feature two can obtain in the following manner: to key Page title is clicked caused by word carries out cutting, point to match from extraction in participle with the word in preset terminal attribute vocabulary Word is as feature two.The page is clicked caused by keyword, is to search after user searches for the keyword in the correlation that search engine returns The page clicked in hitch fruit.The acquisition modes and meaning of terminal attribute vocabulary in terminal attribute vocabulary and preceding embodiment It is all the same.
Feature three: expansion word feature of the keyword in search log.Feature three can obtain in the following manner: from search The expansion word of keyword is extracted in log as feature three, wherein the expansion word of keyword X refers to the keyword using X as substring Y subtracts the remaining character string after X.Such as occur keyword X " surfing Internet with cell phone setting " and keyword Y " connection in search log Logical surfing Internet with cell phone setting ", since X is the substring of Y, Y subtracts the expansion word that remaining character string " connection " after X is exactly X.
Feature four: conversion ratio feature of the keyword in search log.Feature four can obtain in the following manner: with session Be that unit carries out cutting to search log, and is directed to keyword X, statistics and X appear in same session in rear keyword Quantity t1, wherein each going out in search log in the word that rear keyword only includes in X and terminal attribute vocabulary, and statistics X Existing number t2 obtains conversion ratio feature=t1/t2 of the keyword X in search log.
Complete introduction has been carried out to the acquisition process of the disaggregated model of known parameters above.In this way, in step s 103, First to the characteristic of division of keyword extraction and training corpus same type in searching request, then will have in searching request The keyword of characteristic of division is input to the disaggregated model of known parameters, and the key in searching request can be exported by the disaggregated model Word generic, it will be understood that the category includes that user demand is related to terminal or user demand is uncorrelated with terminal.
Referring to FIG. 3, Fig. 3 is the structural schematic block diagram of the embodiment one of intelligent search device in the present invention.Such as Fig. 3 institute Show, which includes: receiving unit 301, the first judging unit 302, second judgment unit 303 and search unit 304.
Wherein, receiving unit 301, for obtaining the searching request of user.If containing trepang in the searching request of user Number, one of parameter indicate that user issues the attribute of searching request when institute using terminal, and a parameter indicates user for obtaining The keyword for taking search result and using.Terminal attribute can be the information such as the model of terminal, manufacturer, such as " Samsung " or Information such as " iphone5 ".
When user issues searching request using search client, search client can call the system interface of equipment, from And it gets terminal attributive information and is encapsulated in the searching request of user.But if search client can not get terminal Attribute information, then the terminal attribute in the searching request of user can be null value.
First judging unit 302, for judging the type of user according to the terminal attribute in searching request, when the class of user When type is mobile subscriber, triggering second judgment unit 303 is executed.First judging unit 302 can be from the searching request of user Terminal attribute is extracted, if the terminal attribute extracted is the information of null value either PC model etc, can be recognized For initiation user's non-moving subscribers of searching request, if the terminal attribute extracted is the mobile terminal type of " iphone5 " etc Number etc information, then can determine that the initiation user of searching request is mobile subscriber, thus just triggering second judgment unit 303 It executes.
Second judgment unit 303, for judged according to the keyword in searching request user demand whether with terminal phase It closes, if it is, being keyword relevant to terminal by the keyword expansion in searching request.Specifically, second judgment unit 303 include matching unit 3031 and expanding element 3032.
Wherein, matching unit 3031, for searching the demand vocabulary obtained in advance, to determine the keyword in searching request With the presence or absence of matching word in demand vocabulary, if it is, determining that the demand of user is related to terminal, and expanding element is triggered 3032 execute.Expanding element 3032, for by the keyword in the terminal attribute and searching request in searching request merge into The relevant keyword of terminal.
In the present embodiment, demand vocabulary can be the known vocabulary of third party's offer, the word for including in the demand vocabulary, It is relevant to terminal that the demand of user, which can be represented, if the keyword in searching request and the word phase in the demand vocabulary Match, then illustrates that the demand of user is related to terminal.Such as the keyword in searching request is " mobile phone EMS memory deficiency is what if ", it should Keyword has matching word in demand vocabulary, then matching unit 3031 may determine that the demand of the user using the keyword is It is relevant to terminal, therefore trigger expanding element 3032 and execute.Expanding element 3032 expands the keyword in searching request Exhibition, to obtain keyword relevant to terminal, a kind of mode is by the pass in the terminal attribute and searching request in searching request Keyword is incorporated as keyword relevant to terminal.Such as the terminal attribute in searching request is " HTC ", and in searching request Keyword is " what if is mobile phone EMS memory deficiency ", then can be by " what if is HTC mobile phone EMS memory deficiency " as relevant to terminal Keyword.
Search unit 304, for obtaining search result using keyword relevant to terminal.Referring to FIG. 2, Fig. 2 is this The schematic diagram of the search result got in invention using keyword relevant to terminal.As it can be seen that by means of the present invention, it can With the search need of adaptive user, the accuracy of search is improved.
Referring to FIG. 4, Fig. 4 is the structural schematic block diagram of the embodiment two of intelligent search device in the present invention.Such as Fig. 4 institute Show, embodiment two further includes vocabulary acquiring unit 305 compared with embodiment one.Wherein vocabulary acquiring unit 305 includes log again Acquiring unit 3051, log cutting unit 3052, word are to extraction unit 3053 and demand word selection unit 3054.
Log acquisition unit 3051, for obtaining search log.Searching for log is for recording multiple user's search behaviors File.In search log, the search behavior of the same user may be recorded in multiple sessions (session), and session is Indicate the unit of one section of operating time of user.In a session, can recorde the same user makes for obtaining search result Multiple keywords, such as user elder generation search key " what if is mobile phone EMS memory deficiency ", then modifying keyword is " what if is iphone mobile phone EMS memory deficiency " scans for, if the time of the two search operations is all a session period It is interior, then in the same conversation recording of search log, just have the letter such as the two keywords and its time searched respectively Breath.
Log cutting unit 3052, for carrying out cutting to search log using session as unit, wherein each cutting segment A corresponding conversation recording.
Word extracts the keyword pair in the segment, wherein keyword for being directed to each cutting segment to extraction unit 3053 Two keywords of centering according in the segment appearance sequence arrange, and rear keyword be only contained in first keyword and Word in preset terminal attribute vocabulary.
Such as have recorded three keywords in a cutting segment respectively sequentially in time, it is respectively:
Keyword A: mobile phone EMS memory
Keyword B: mobile phone EMS memory is insufficient
Keyword C: Samsung mobile phone EMS memory
It include first keyword A in rear keyword B, and further include word " deficiency " for keyword A and B, due to " deficiency " is not the word in terminal attribute vocabulary, so keyword A and B are not as keyword pair.For keyword B and C, Keyword C does not include first keyword B afterwards, therefore keyword B and C are also not as keyword pair.For keyword A and C, Keyword C includes first keyword A afterwards, and further includes word " Samsung " in rear keyword C, and " Samsung " is terminal attribute word Word in table, therefore keyword A and C can be extracted as a keyword pair.
Demand word selection unit 3054, for choosing demand word in the first keyword of the keyword pair from extraction, with To demand vocabulary.As an implementation, demand word selection unit 3054 can by the keyword centering of extraction it is all First keyword is chosen for demand word.But more preferably mode, demand word selection unit 3054 first calculate each first key The conversion ratio of word, and the first keyword that conversion ratio is greater than the set value is chosen for demand word.Wherein, each first keyword Conversion ratio can calculate according to the following formula:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate that the keyword centering extracted includes the key of w The number of word pair, t2 (w) indicate the number that w occurs in search log.
Referring to FIG. 5, Fig. 5 is the structural schematic block diagram of another embodiment of the second judgment unit 303 in the present invention. As shown in figure 5, second judgment unit 303 includes taxon 303a and expanding element 303b in the embodiment.Wherein grouping sheet First 303a classifies to the keyword in searching request using the disaggregated model of known parameters, to determine that the demand of user is It is no related to terminal, if it is, triggering expanding element 303b is executed.Expanding element 303b, for by the end in searching request Attribute is held to merge into keyword relevant with terminal to the keyword in searching request.
The disaggregated model of above-mentioned known parameters is to utilize to be extracted classification of the training corpus to unknown parameter of characteristic of division What model obtained after being trained.Training corpus can be by marking a collection of demand key relevant to terminal in search log Word obtains.In the present embodiment, the type of characteristic of division includes at least following one kind:
Feature one: the text feature of keyword.The text feature of word refers to after segmenting to keyword, word N-gram information, in the present embodiment, N can be 1 or 2.
Feature two: the title feature of the page is clicked caused by keyword.Feature two can obtain in the following manner: to key Page title is clicked caused by word carries out cutting, point to match from extraction in participle with the word in preset terminal attribute vocabulary Word is as feature two.Here the acquisition modes and meaning of terminal attribute vocabulary and the terminal attribute vocabulary in preceding embodiment are equal It is identical.
Feature three: expansion word feature of the keyword in search log.Feature three can obtain in the following manner: from search The expansion word of keyword is extracted in log as feature three, wherein the expansion word of keyword X refers to the keyword using X as substring Y subtracts the remaining character string after X.Such as occur keyword X " surfing Internet with cell phone setting " and keyword Y " connection in search log Logical surfing Internet with cell phone setting ", since X is the substring of Y, Y subtracts the expansion word that remaining character string " connection " after X is exactly X.
Feature four: conversion ratio feature of the keyword in search log.Feature four can obtain in the following manner: with session Be that unit carries out cutting to search log, for keyword X, statistics and X appear in same session in rear keyword quantity T1, wherein each in the word that rear keyword only includes in X and terminal attribute vocabulary, and count what X occurred in search log Number t2 obtains conversion ratio feature=t1/t2 of the keyword X in search log.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all in essence of the invention Within mind and principle, any modification, equivalent substitution, improvement and etc. done be should be included within the scope of the present invention.

Claims (4)

1. a kind of intelligent search method, comprising:
A. the searching request of user is obtained;
B. according to described search request in terminal attribute judge the type of the user, when the type of the user is mobile uses When family, C is executed;
C. search the demand vocabulary obtained in advance, with determine described search request in keyword in the demand vocabulary whether In the presence of matching word, if it is, determine that the demand of the user is related to terminal, by described search request in terminal attribute Keyword relevant with terminal is merged into the keyword in described search request;
D. search result is obtained using the keyword relevant to terminal;
Wherein, demand vocabulary is obtained in advance in the following manner:
S1. search log is obtained;
S2. cutting is carried out to described search log as unit of session, wherein each cutting segment corresponds to a conversation recording;
S3. it is directed to each cutting segment, the keyword pair in the segment is extracted, wherein two keywords of the keyword centering are pressed The appearance sequence impinged upon in the segment arranges, and is only contained in first keyword and preset terminal attribute vocabulary in rear keyword In word;
S4. demand word is chosen, from the first keyword of the keyword pair of extraction to obtain demand vocabulary.
2. the method according to claim 1, wherein the mode for choosing demand word in the S4 includes by conversion ratio The first keyword being greater than the set value is chosen for demand word, wherein the conversion ratio of each first keyword calculates according to the following formula:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate that the keyword centering extracted includes The number of the keyword pair of w, t2 (w) indicate the number that w occurs in search log.
3. a kind of intelligent search device, comprising:
Receiving unit, for obtaining the searching request of user;
First judging unit judges the type of the user for the terminal attribute in requesting according to described search, when the use When the type at family is mobile subscriber, triggering second judgment unit is executed;
Second judgment unit, for searching the demand vocabulary obtained in advance, to determine the keyword in described search request in institute It states with the presence or absence of matching word in demand vocabulary, if it is, determining that the demand of the user is related to terminal, by described search Terminal attribute in request merges into keyword relevant with terminal to the keyword in described search request;
Search unit, for obtaining search result using the keyword relevant to terminal;
Vocabulary acquiring unit, for obtaining the demand vocabulary in advance, the vocabulary acquiring unit includes:
Log acquisition unit, for obtaining search log;
Log cutting unit, for carrying out cutting to described search log as unit of session, wherein each cutting segment is corresponding One conversation recording;
Word extracts the keyword pair in the segment, wherein the keyword centering for being directed to each cutting segment to extraction unit Two keywords arranged according to the appearance sequence in the segment, and be only contained in first keyword and default in rear keyword Terminal attribute vocabulary in word;
Demand word selection unit, for choosing demand word in the first keyword of the keyword pair from extraction, to obtain demand word Table.
4. device according to claim 3, which is characterized in that the demand selected ci poem takes the mode packet of unit selection demand word It includes the first keyword for being greater than the set value conversion ratio and is chosen for demand word, wherein the conversion ratio of each first keyword is according to following Formula calculates:
Wherein p (w) indicates that the conversion ratio of first keyword w, t1 (w) indicate that the keyword centering extracted includes The number of the keyword pair of w, t2 (w) indicate the number that w occurs in search log.
CN201210536673.XA 2012-12-12 2012-12-12 A kind of intelligent search method and device Active CN103020224B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210536673.XA CN103020224B (en) 2012-12-12 2012-12-12 A kind of intelligent search method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210536673.XA CN103020224B (en) 2012-12-12 2012-12-12 A kind of intelligent search method and device

Publications (2)

Publication Number Publication Date
CN103020224A CN103020224A (en) 2013-04-03
CN103020224B true CN103020224B (en) 2019-01-15

Family

ID=47968828

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210536673.XA Active CN103020224B (en) 2012-12-12 2012-12-12 A kind of intelligent search method and device

Country Status (1)

Country Link
CN (1) CN103020224B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9754034B2 (en) * 2013-11-27 2017-09-05 Microsoft Technology Licensing, Llc Contextual information lookup and navigation
CN104063455B (en) * 2014-06-24 2017-04-12 北京奇虎科技有限公司 Method and device for acquiring counseling messages of disease based on searching

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1677389A (en) * 2004-03-31 2005-10-05 宇龙计算机通信科技(深圳)有限公司 Mobile internet intelligent information retrieval engine based on key-word retrieval
CN101008946A (en) * 2006-01-26 2007-08-01 冉征 Search method of Chinese mobile communication information and device thereof
WO2007098710A1 (en) * 2006-03-03 2007-09-07 China Mobile Communications Corporation Searching system and method based on personalized information
CN101052181A (en) * 2007-05-22 2007-10-10 中国移动通信集团浙江有限公司 Wireless searching system and its wireless searching method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1677389A (en) * 2004-03-31 2005-10-05 宇龙计算机通信科技(深圳)有限公司 Mobile internet intelligent information retrieval engine based on key-word retrieval
CN101008946A (en) * 2006-01-26 2007-08-01 冉征 Search method of Chinese mobile communication information and device thereof
WO2007098710A1 (en) * 2006-03-03 2007-09-07 China Mobile Communications Corporation Searching system and method based on personalized information
CN101052181A (en) * 2007-05-22 2007-10-10 中国移动通信集团浙江有限公司 Wireless searching system and its wireless searching method

Also Published As

Publication number Publication date
CN103020224A (en) 2013-04-03

Similar Documents

Publication Publication Date Title
CN102866990B (en) A kind of theme dialogue method and device
CN103760991B (en) Physical input method and physical input device
CN105183787A (en) Information input method and apparatus
CN103916436B (en) Information-pushing method, device, terminal and server
CN103440243B (en) A kind of teaching resource recommendation method and device thereof
CN102402619A (en) Search method and device
CN104360994A (en) Natural language understanding method and natural language understanding system
CN109582969A (en) Methodology for Entities Matching, device and electronic equipment
CN104468959A (en) Method, device and mobile terminal displaying image in communication process of mobile terminal
CN102905233A (en) Method and device for recommending terminal function
CN109739961A (en) A kind of man-machine language exchange method and device
CN107846426A (en) The tracking and device of user trajectory in page access
CN105989120B (en) A kind of personalization content recommendation method and individualized content recommender system
JP2018014765A5 (en)
KR101696499B1 (en) Apparatus and method for interpreting korean keyword search phrase
CN108304452A (en) Article processing method and processing device, storage medium
CN109522405A (en) File information processing method, electronic equipment and computer-readable deposit medium
CN106095998B (en) Topic method and device is precisely searched applied to intelligent terminal
CN103020224B (en) A kind of intelligent search method and device
CN103853771B (en) A kind of method for pushing and system of search result
CN109558531A (en) News information method for pushing, device and computer equipment
CN107784024B (en) Construct the method and device of party's portrait
CN103997492B (en) A kind of adaption system and method
CN105589935A (en) Social group recognition method
CN105094810B (en) Data processing method and device based on CGI(Common gateway interface) plug-in unit

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant