CN107229659A - A kind of information search method and device - Google Patents

A kind of information search method and device Download PDF

Info

Publication number
CN107229659A
CN107229659A CN201610179888.9A CN201610179888A CN107229659A CN 107229659 A CN107229659 A CN 107229659A CN 201610179888 A CN201610179888 A CN 201610179888A CN 107229659 A CN107229659 A CN 107229659A
Authority
CN
China
Prior art keywords
keyword
word
information
scope
conjunctive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610179888.9A
Other languages
Chinese (zh)
Other versions
CN107229659B (en
Inventor
蒋亿松
刘燚灵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201610179888.9A priority Critical patent/CN107229659B/en
Publication of CN107229659A publication Critical patent/CN107229659A/en
Application granted granted Critical
Publication of CN107229659B publication Critical patent/CN107229659B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of information search method and device, the accuracy to improve the information search result in information seeking processes.A kind of information retrieval device includes:Inquiry request acquisition module, for obtaining inquiry request;Keyword acquisition module, for obtaining at least one keyword from inquiry request;Scope prescribed information acquisition module, for obtaining scope prescribed information;Conjunctive word searching modul, for each keyword at least one keyword, search meet scope prescribed information limit in the range of the keyword one or more conjunctive words;Search module, for according to each keyword found one or more conjunctive words carry out information search, obtain be located at scope prescribed information limit in the range of information search result.During due to carrying out information search by the conjunctive word that finds, in the range of obtained information search result is limited positioned at scope prescribed information, thus information search result is more accurate.

Description

A kind of information search method and device
Technical field
The present invention relates to technical field of information processing, more particularly to a kind of information search method and device.
Background technology
With the arrival of information age, people will face the information of a large amount of numerous and complicateds daily, such as:Mutually Information in networking, then, how to correctly search for out the information of needs from substantial amounts of information to be presented to User, is a urgent problem.
By taking the information search in internet as an example, during information search, a kind of common searching method It is to be scanned for according to keyword.But, keyword generally semantically has complexity, such as:One Word generally can all have multiple synonyms, it is also possible to there are multiple near synonym, if the pass only inputted to user Keyword is retrieved, it will usually cause the entry searched less, so the pass that generally can be all inputted to user Keyword and its synonym, near synonym are scanned for, now, how to select synonym, near synonym generally to determine The accuracy of information search result.
Therefore, synonym and/or near synonym how are accurately determined, to improve the accuracy of information search result It is a urgent problem to be solved in information seeking processes.
The content of the invention
The embodiment of the present invention provides a kind of information search method and device, to solve to believe in information seeking processes The problem of accuracy of breath search result is low.
In a first aspect, a kind of information search method of the embodiment of the present invention, this method can be applied to search into row information On the server of rope, wherein, the server obtains the inquiry request for information search, and from the inquiry At least one keyword is obtained in request;In addition, the server obtains the scope for prescribed information search Scope prescribed information, server is at least one keyword described in being obtained from the inquiry request Each keyword, search meet the scope prescribed information limit in the range of one of the keyword or Multiple conjunctive words, wherein, conjunctive word may include synonym and/or near synonym;And according to each found One or more of conjunctive words of keyword carry out information search, obtain being located at the scope prescribed information institute Information search result in the range of restriction.
Using such scheme, server can according to the scope prescribed information of at least one keyword got, For each keyword at least one keyword, find out and meet scope prescribed information and limit scope Interior one or more conjunctive words, and enter according to one or more conjunctive words of each keyword found Row information search for, obtain be located at scope prescribed information limit in the range of information search result.Wherein, close Joining word includes synonym and/or near synonym.
By the conjunctive word found out be meet scope prescribed information limit in the range of conjunctive word, therefore When carrying out information search according to the conjunctive word found, obtained information search result is limited also in scope Information limit in the range of information search result so that information search result accuracy is higher.
In a kind of possible implementation, if getting a keyword, server is searched entering row information Suo Shi, can enter row information according to one or more of conjunctive words of the one keyword found and search Rope;Or according to one or more of conjunctive words of the one keyword found, and it is one Keyword carries out information search.
Using such scheme, information search or the conjunctive word according to the keyword are carried out according to a keyword Information search is carried out, is compared with the method for only carrying out information search according to keyword, information search can be expanded Scope.
Wherein, the situation only scanned for for former according to keyword, the search knot that information search is obtained Fruit may not include the result searched for and obtained according to conjunctive word;For latter both according to keyword, also according to pass Join the situation of word search, the search result that information search is obtained includes searching for obtained result according to conjunctive word.
There is provided the implementation of two kinds of information searches in this optional implementation.
In a kind of possible implementation, if server gets at least two keywords, server exists , can also be by the institute found before the conjunctive word progress information search found after lookup conjunctive word It is combined between the conjunctive word for stating the different keywords at least two keywords, and at least two by described in Partial key word in individual keyword and it is combined between the conjunctive word of remaining keyword found;
In information search, information search can be carried out according to each combination of formation;Or
According at least two keyword, and each combination formed carries out information search.Using upper State scheme, due to carry out information search when be according between keyword and conjunctive word use different combination sides What the combination that formula is obtained was carried out, therefore all possible combination can be scanned for, ensureing to search On the premise of the accuracy of hitch fruit, make search result more complete.
Wherein, the situation only scanned for for former according to keyword, the search knot that information search is obtained Fruit may not include the result searched for and obtained according to conjunctive word;For latter both according to keyword, also according to pass Join the situation of word search, the search result that information search is obtained includes the result obtained according to keyword search. There is provided the optional implementation of two kinds of information searches.
In a kind of possible implementation, server can search conjunctive word as follows:
Search all conjunctive words of a keyword;For each conjunctive word found, the association is obtained The information of the scope of application of word;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make For meet the scope prescribed information limit in the range of the keyword conjunctive word.
Using such scheme, the information of the scope of application of each conjunctive word due to obtaining keyword, and Therefrom filter out the scope of application and scope prescribed information limited range identical conjunctive word, it is thus possible to arrange Except the different conjunctive word of the scope of application, make the conjunctive word filtered out more accurate, so that search result is more Accurately.
In a kind of possible implementation, server can obtain the inquiry request from client;Server Find meet the scope prescribed information limit in the range of one or many of one keyword After individual conjunctive word, one or more of conjunctive words are sent to the client, and each to transmission Conjunctive word, sends the information of the scope of application of the conjunctive word.
Using such scheme, because server to client have sent one or more conjunctive words, and to sending Each conjunctive word, send the information of the scope of application of the conjunctive word, thus can have selection in client Property conjunctive word and its scope of application are shown, facilitate user selection which conjunctive word is row information is entered using Search.
In a kind of possible implementation, server obtains the inquiry request from client;
If getting at least two keywords, server after conjunctive word is found, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found, And by between the Partial key word and the conjunctive word of remaining keyword found at least two keywords It is combined;
For each combination of formation, the scope of application of the combination is determined;
Wherein, if a combination includes keyword, by the scope prescribed information limited range with Common factor between the scope of application of each conjunctive word in the combination, is used as the scope of application of the combination;If Do not include keyword in one combination, then by the friendship between the scope of application of each conjunctive word in combining Collection, is used as the scope of application of the combination;
Server can send one or more combinations with the non-NULL scope of application to the client, and to hair Each combination sent, sends the information of the scope of application of the combination.
Using such scheme, one or more there is the non-NULL scope of application because server have sent to client Combination, and to each combination of transmission, send the information of the scope of application of the combination, thus can be Client there is the scope of application of combination and the combination of the non-NULL scope of application to be shown each, convenient User's selection carries out information search using which combination.
In a kind of possible implementation, information of the server in the scope of application for obtaining each conjunctive word Before, a conjunctive word can be obtained from text;
Server judges whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including server is by the word of the scope of application for describing the conjunctive word, labeled as the association The information of the scope of application of word.
Using such scheme, due to obtaining the word of the scope of application for describing a conjunctive word from text Language, and using the word as the use scope of conjunctive word information there is provided it is a kind of determine conjunctive word be applicable The method of scope.
In a kind of possible implementation, server can obtain the scope from the inquiry request and limit Information;Or
If server gets a keyword and the meaning of a word of one keyword defines information search Scope, then server generate for describing the information search scope that the meaning of a word of one keyword is limited, It is used as the scope prescribed information;Or
If server gets part or all of at least two keywords and at least two keyword The meaning of a word of keyword defines the scope of information search, then server can determine that the part or all of keyword In each keyword the information search scope that is limited of the meaning of a word, and by each keyword of determination Common factor is taken between the information search scope that the meaning of a word is limited, the common factor is regard as the scope prescribed information.
Using such scheme, limited due to obtaining scope from inquiry request or from the meaning of a word of keyword There is provided the method for obtaining scope prescribed information for information.
In a kind of possible implementation, server can obtain the inquiry request from client;Server Obtain be located at the scope prescribed information limit in the range of information search result after, can be to the visitor Family end sends obtained information search result, and to each entry in information search result, sends described Scope prescribed information.
Using such scheme, because server to the client sends obtained information search result, and it is right Each entry in information search result, sends the scope prescribed information, thus can be in client pair The scope prescribed information of each entry in information search result and information search result is shown, and is made Search result is more directly perceived.
Second aspect, the embodiment of the present invention provides a kind of information retrieval device, and the information retrieval device has real The function of the information search method of existing above-mentioned first aspect.The function can be realized by hardware, can also Corresponding software is performed by hardware to realize.The hardware or software include one or more with above-mentioned functions phase Corresponding module.
In a kind of optional implementation, described information searcher includes:Inquiry request acquisition module, Keyword acquisition module, scope prescribed information acquisition module, conjunctive word searching modul and search module.
Alternatively, word combination module, conjunctive word sending module, scope of application information flag can also be included Module and search result sending module.
Inquiry request acquisition module is configured as supporting information retrieval device to perform above-mentioned first aspect and provided Method in acquisition inquiry request function;Keyword acquisition module is configured as supporting information retrieval device Perform the function of the acquisition keyword in the method that above-mentioned first aspect is provided;Scope prescribed information obtains mould Block is configured as supporting information retrieval device to perform the acquisition scope in the method that above-mentioned first aspect is provided The function of prescribed information;Conjunctive word searching modul is configured as supporting information retrieval device to perform above-mentioned first party The function of the conjunctive word of lookup keyword in the method that face is provided;Search module is configured as supporting information Searcher performs the function of the search in the method that above-mentioned first aspect is provided;Word combination module by with It is set to the function for the word combination for supporting information retrieval device to perform in the method that above-mentioned first aspect is provided; Conjunctive word sending module is configured as supporting information retrieval device to perform the method that above-mentioned first aspect is provided In transmission conjunctive word function;Scope of application information flag module is configured as supporting information retrieval device to hold The function of the scope of application information of mark conjunctive word in the method that the above-mentioned first aspect of row is provided;Search knot Fruit sending module is configured as supporting information retrieval device to perform in the method that above-mentioned first aspect is provided The function of search result is sent to client.
The third aspect, the embodiment of the present invention provides a kind of information search system, including:Client, for sending out Send inquiry request and receive search result;
Server, for performing the information search method that above-mentioned first aspect is provided;
Memory, is returned for the database access request of the reception server transmission and by database query result Back to server.
Fourth aspect, the embodiment of the present invention provides a kind of computer-readable storage medium, for being stored as above-mentioned second The computer software instructions used in information retrieval device described in aspect, it, which is included, is used to perform above-mentioned aspect institute The program of design.
5th aspect, the embodiment of the present invention is provided in a kind of information acquisition method, this method, and server is from text One or more conjunctive words of a keyword are obtained in this, wherein, conjunctive word includes synonym and/or nearly justice Word;For each conjunctive word of acquisition, server is searched for describing being applicable for the conjunctive word in the text The word of scope;And by the word found, labeled as the information of the scope of application of the conjunctive word.
In a kind of possible implementation, server can find the keyword and the key from text The conjunctive word marker character of word;Server determines the matching range of conjunctive word marker character in the text;Then, take Business device obtains one or more conjunctive words out of matching range.
Wherein, conjunctive word marker character is used for the conjunctive word for marking the keyword with the incidence relation of the keyword, Matching range is used to mark the position range that conjunctive word is likely to occur in the text.
6th aspect, the embodiment of the present invention provides a kind of information acquisition device, and the device, which has, realizes above-mentioned the The function of the method for five aspects.The function can be realized by hardware, can also be performed by hardware corresponding Software realize.The hardware or software include one or more modules corresponding with above-mentioned functions.
In a kind of optional implementation, the information acquisition device includes:Conjunctive word acquisition module, word Searching modul and range flags module.
Alternatively, keyword lookup module, conjunctive word marker character searching modul and matching range can also be included Determining module.
Conjunctive word acquisition module is configured as supporting information acquisition device to perform what above-mentioned 5th aspect was provided The function of acquisition conjunctive word in method;Word searching modul is configured as supporting in information acquisition device execution State the function that the lookup in the method that the 5th aspect is provided is used to describe the word of the scope of application of conjunctive word; Range flags module is configured as supporting information acquisition device to perform in the method that above-mentioned 5th aspect is provided Mark the conjunctive word scope of application function;Keyword lookup module is configured as supporting information acquisition device to hold The function of lookup keyword in the method that above-mentioned 5th aspect of row is provided;Conjunctive word marker character searching modul It is configured as supporting information acquisition device to perform the lookup conjunctive word in the method that above-mentioned 5th aspect is provided The function of marker character;Matching range determining module is configured as supporting information acquisition device to perform above-mentioned 5th side The function of the matching range of determination conjunctive word marker character in the method that face is provided.
7th aspect, the embodiment of the present invention provides a kind of Information Acquisition System, including:
Client, for sending keyword and receiving acquired information;
Server, for performing the information search method that above-mentioned 5th aspect is provided;
Memory, is returned for the database access request of the reception server transmission and by database query result Back to server.
Eighth aspect, the embodiment of the present invention provides a kind of computer-readable storage medium, for saving as the above-mentioned 6th The computer software instructions used in information acquisition device described in aspect, it, which is included, is used to perform above-mentioned aspect institute The program of design.
To sum up, the embodiment of the present invention provides a kind of information search method and device, wherein, according to what is got The scope prescribed information of at least one keyword, for each keyword at least one keyword, is looked into Find out meet scope prescribed information limit in the range of one or more conjunctive words, it is and every according to what is found One or more conjunctive words of one keyword carry out information search, obtain being limited positioned at scope prescribed information In the range of information search result.Wherein conjunctive word includes synonym and/or near synonym.
By the conjunctive word found out be meet scope prescribed information limit in the range of conjunctive word, therefore When carrying out information search according to the conjunctive word found, obtained information search result is limited also in scope Information limit in the range of information search result so that information search result accuracy is higher.
Brief description of the drawings
Fig. 1 is a kind of schematic diagram of the network architecture of information search system provided in an embodiment of the present invention;
Fig. 2 is a kind of structural representation of server for information search provided in an embodiment of the present invention;
Fig. 3 is a kind of flow chart of information search method provided in an embodiment of the present invention;
Fig. 4 is a kind of scope of application for showing that each is combined and each is combined provided in an embodiment of the present invention The schematic diagram of the mode of information;
Fig. 5 is a kind of one or more combinations with the non-NULL scope of application of displaying provided in an embodiment of the present invention Mode schematic diagram;
Fig. 6 is a kind of signal of the mode of client exhibition information search result provided in an embodiment of the present invention Figure;
Fig. 7 is a kind of flow chart of information acquisition method provided in an embodiment of the present invention;
Fig. 8 is the flow chart of another information search method provided in an embodiment of the present invention;
Fig. 9 is the flow chart of another information acquisition method provided in an embodiment of the present invention;
Figure 10 is a kind of structural representation of information retrieval device provided in an embodiment of the present invention;
Figure 11 is a kind of structural representation of information acquisition device provided in an embodiment of the present invention;
Figure 12 is the structural representation of another information retrieval device provided in an embodiment of the present invention.
Embodiment
The above-mentioned purpose of embodiment, scheme and advantage for a better understanding of the present invention, provided hereinafter detailed Description.The detailed description by using the accompanying drawings such as block diagram, flow chart and/or example, illustrate device and/or The various embodiments of method.In these block diagrams, flow chart and/or example, one or more functions are included And/or operation.It will be understood by the skilled person that:Each function in these block diagrams, flow chart or example And/or operation, can separately or cooperatively it be implemented by various hardware, software, firmware, or pass through Any combination of hardware, software and firmware is implemented.
The embodiment of the present invention provides a kind of information search method and device, wherein, according at least one got The scope prescribed information of individual keyword, for each keyword at least one keyword, finds out symbol Close scope prescribed information limit in the range of one or more conjunctive words, and according to find each close One or more conjunctive words of keyword carry out information search, obtain limiting positioned at scope prescribed information in the range of Information search result.Wherein conjunctive word includes synonym and/or near synonym.
, can be according to scope prescribed information to each keyword using scheme provided in an embodiment of the present invention One or more conjunctive words are screened, and obtain meeting the conjunctive word of scope prescribed information limited range, And the one or more conjunctive words obtained according to each keyword and screening carry out information search.Therefore, may be used To be screened according to scope prescribed information to conjunctive word, and then it can be carried out when carrying out information search Garbled, more accurate search result.
Below, in order to make it easy to understand, introducing the concept being related in the embodiment of the present invention.
First, keyword and conjunctive word
In information search, it will usually carry out information search, these keywords according to one or more keywords It can be inputted by user, it is also possible to obtained from text.These keywords, which are used for representative, to be searched for Information in main contents.
In the embodiment of the present invention, the conjunctive word of a keyword can include the synonym of the keyword and/or near Adopted word.
Such as, " gardenia also known as cape jasmine, Yellow Fructus Gardeniae ", then can be assumed that the same of gardenia according to this text Adopted word is cape jasmine or Yellow Fructus Gardeniae.For another example, " discrimination " refers to resolution, difference, and " discriminating " refers to by investigating And the property or feature of things are determined, the two similar import, it is believed that " discriminating " is the near of " discrimination " Adopted word.The synonym or near synonym of one word can be called the conjunctive word of this word.
The synonym and/or near synonym of word can over time or the change of the scope of application such as region and it is different. During social development, the words of some words sense over time or the change of the scope of application such as region and become Change, such as:In the Yuan Dynasty, woman servant is synonymous with father, in ancient times, and brother is synonymous with sister;Also some words exist Some region has identical implication, such as in Sichuan province, and people " capsicum " are called " hot pepper ", in Shaanxi Area, people " capsicum " are called " long and thin hot pepper ".
Existing searching method does not account for being applicable for the keyword to be searched for and its synonym and/or near synonym Scope, influences the accuracy of search result.Such as, for " in Sichuan province, people capsicum is called hot pepper, In In Shanxi Area, people capsicum is called long and thin hot pepper " this synonym matched text, when search " Chili Peppers In Sichuan Province, China " When, " Sichuan hot pepper " and " Sichuan long and thin hot pepper " the two search suggestions and corresponding search result can be provided.But It is that due to not accounting for keyword " Sichuan " this scope of application prescribed information in search, thus can give Go out " Sichuan long and thin hot pepper " this search suggestion and corresponding search result, this search suggestion and search result are obvious Required for not being the user scanned for, thus this search suggestion and search result are redundancies, influence The accuracy of search result.
2nd, scope prescribed information
Scope prescribed information refers to that being used in inquiry request shows the limit of the query context of this inquiry request Determine information, such as temporal information or regional information.
When scope prescribed information is temporal information, show inquiry request needs inquiry is the temporal information In the time range that is characterized, the search result that includes the keyword in inquiry request;When scope prescribed information During for regional information, show the inquiry request need inquire about be it is in the territorial scope, comprising inquiry request In keyword search result.
Scope prescribed information can be obtained from inquiry request, can also be from the word of the keyword in inquiry request Obtained in justice.
Alternatively, priority can be set to the mode that above two obtains scope prescribed information.Such as, can be with Set:The priority of the scope prescribed information obtained from inquiry request is higher than to be obtained from the meaning of a word of keyword Scope prescribed information.
The mode of scope prescribed information is obtained from inquiry request can a variety of, and three kinds are only enumerated below from looking into The example that scope prescribed information is obtained in request is ask, actual acquisition modes are not limited to following three kinds:
Scope prescribed information in the keyword inputted in mode one, inquiry request.
Such as, " the Yuan Dynasty " this scope can be obtained from " woman servant's the Yuan Dynasty " this inquiry request and limits letter Breath.
Mode two, by setting the functional module of input range prescribed information obtain scope prescribed information.
Such as, input range prescribed information can be used in inquiry request page setup window or plug-in unit.
Mode three, the typing for carrying out by the representation of agreement Query Information.
Such as, it is to need behind scope prescribed information, colon to be before colon when can arrange input inquiry request The keyword to be inquired about, such as input " Ming Dynasty:During Liu Baiwen ", mark is limited in " Ming Dynasty " this scope Search " Liu Baiwen " this keyword in the time range that information is limited.
The mode of scope prescribed information is obtained from the meaning of a word of keyword to be:Some personalities, ancient books, Historical events etc. is substantially associated with the scope prescribed information such as some times, region, then make when these words When being input to for keyword in inquiry request, these scope prescribed informations can be obtained, the inquiry request is used as In scope prescribed information.For example, when including " Cao Xueqin " in the keyword of input, can be from " Cao This keyword of snow celery " is associated with " Qing Dynasty " this region, so that please as this inquiry by " Qing Dynasty " The scope prescribed information asked.
Alternatively, when scope prescribed information be temporal information or regional information, judge some keyword whether with Temporal information or regional information are associated, can by set a time tag storehouse or region tag library come Realize.
Record and (such as gone through with the time corresponding to personality, ancient books, historical events etc. in time tag storehouse The age that the time of historical event part generation, personality are present) information, when bag in the keyword in inquiry request During containing these personalities, ancient books, historical events, can by the personality in time tag storehouse, ancient books, The corresponding temporal information such as historical events as the inquiry request scope prescribed information.
Record and (such as gone through with the region corresponding to personality, ancient books, historical events etc. in the tag library of region The region of history locale, personality's birth or life) information, when the key in inquiry request In word include these personalities, ancient books, historical events when, can by the personality in the tag library of region, The corresponding regional information such as ancient books, historical events as the inquiry request scope prescribed information.
If in addition, scope prescribed information is regional information, can also pass through the user's asked input inquiry Understood or obtain ground by positioner positioning in IP (Internet Protocol, procotol) address Domain information.
3rd, the scope of application of conjunctive word
Search some keyword conjunctive word when, the conjunctive word might not under any circumstance all with key Word is synonymous, but synonymous with the keyword in some scope of application.Such as, in Sichuan province, people claim Capsicum is hot pepper, then hot pepper is not all synonymous with capsicum in all regions, but only " Sichuan " this It is synonymous with capsicum in one scope of application;For another example, woman servant is synonymous with father in the Yuan Dynasty, then woman servant is not It is all synonymous with father in all dynasties, but it is only synonymous with father in " the Yuan Dynasty " this scope of application.On State the scope of application that " Sichuan " and " the Yuan Dynasty " is conjunctive word.
Alternatively, the scope of application of conjunctive word can be time or region.
4th, conjunctive word marker character
Text in internet or database is analyzed, and then obtains the conjunctive word of some keyword During, conjunctive word marker character is used for marking the incidence relation that the keyword is associated between word.For example, For " gardenia is also known as cape jasmine, Yellow Fructus Gardeniae." this text, searching for the conjunctive word of " gardenia " During, pass through " being also known as " in the text behind " gardenia ", it can be appreciated that after " being also known as " The word in face is the conjunctive word of " gardenia "." being also known as " is a kind of conjunctive word marker character.
Conjunctive word marker character is not limited to a kind of above-mentioned form, and it can be that word can also be symbol.Such as, Exist in the entry of gardenia "【Alias】:Cape jasmine, yellow chicken, yellow sprout, Yellow Fructus Gardeniae, yellow Cape jasmine, mountain Yellow Cape jasmine, beautiful lotus etc.." this text, wherein "【Alias】:" it is also a kind of conjunctive word marker character.
5th, the matching range of conjunctive word marker character in the text
The matching range of conjunctive word marker character in the text is used to mark what conjunctive word was likely to occur in the text Position range.
Such as, for " gardenia is also known as cape jasmine, Yellow Fructus Gardeniae." this matched text, finding conjunctive word After marker character, in addition it is also necessary to know that the conjunctive word of the sphere of action of conjunctive word marker character, i.e. gardenia is likely to occur Position range.By analyzing fullstop last in the text it is recognised that the word after fullstop is no longer Cape jasmine The conjunctive word of son flower, the i.e. matching range of conjunctive word marker character in the text terminates to fullstop.
Fig. 1 shows a kind of network architecture of information search system.As shown in figure 1, information search system bag Include:Server 101, client 102 and memory 103, server 101 can also include processor, Memory and I/O interfaces.
Server 101 passes through processor pair by inquiry request of the I/O interfaces from client 102 The inquiry request of reception is handled, and the search result obtained after processing can be returned into client 102 and entered Row displaying.The programmed instruction stored in the run memory of server 101, is handled inquiry request.This Outside, server 101 can also be by the ephemeral data storage produced during processing inquiry request in memory. Server 101 handle inquiry request when may need access database (such as thesaurus, time tag storehouse, Region tag library etc.) server 101 memory of itself is can come from, it can be from the memory of outside 103。
Wherein, thesaurus is used for the scope of application for conjunctive word and each conjunctive word for storing keyword Information, one kind optionally realizes that structure refers to table 1;Time tag storehouse be used for record such as personality, (the year that such as time of historical events generation, personality are present time corresponding to ancient books, historical events etc. Generation) information, when including these personalities, ancient books, historical events in the keyword in inquiry request, The corresponding temporal information such as the personality in time tag storehouse, ancient books, historical events can be looked into as this Ask the scope prescribed information of request;Region tag library is used to record such as personality, ancient books, historical events Deng corresponding region (region of place, personality's birth or life that such as historical events occurs) information, , can be by region when including these personalities, ancient books, historical events in the keyword in inquiry request The corresponding regional informations such as personality, ancient books, historical events in tag library as the inquiry request model Enclose prescribed information.
Wherein, the inquiry request of client 102 can be that the search instruction inputted from user (such as, exists The search instruction inputted on webpage).
Alternatively, client 102 can select exhibition after the search result of the return of server 101 is received Show the search result.Memory in server 101 can be disk, CD, flash memory.Memory 103 Can be disk array, hard disk, flash memory, CD, the memory technology of use can be conventional storage technologies, It can also be cloud storage technology.
Fig. 2 is a kind of structural representation of server for information search, letter provided in an embodiment of the present invention Breath searching method can be applied in server 101 as shown in Figure 2, and the server 101 can be applied to figure In information search system shown in 1, including I/O interfaces 201, processor 202 and memory 203.
Memory 203 can be used for storage program, database.Memory 203 can be CD, hard disk, interior Deposit.Wherein, database can be that server execution information searching method in the embodiment of the present invention is called Program and used database (such as above-mentioned thesaurus, time tag storehouse, region tag library); Server 101 receives the inquiry request from client by I/O interfaces 201, by 202 pairs of processor The inquiry request of reception is handled, and can return the search result obtained after processing by I/O interfaces 201 It is shown back to client.The programmed instruction stored in the run memory 203 of server 101, to inquiry Request is handled.In addition, the ephemeral data produced in processing procedure can also be stored in by processor 202 In memory 203.Processor 202 may need the database accessed (such as synonymous when handling inquiry request Dictionary, time tag storehouse, region tag library etc.) memory 203 of server 101 itself is can come from, It can be from the memory of outside;I/O interfaces 201 are used to connect various input/output devices, can be used for Receive outside search instruction and search result is exported.
Wherein, thesaurus is used for the scope of application for conjunctive word and each conjunctive word for storing keyword Information, one kind optionally realizes that structure refers to table 1;Time tag storehouse be used for record such as personality, (the year that such as time of historical events generation, personality are present time corresponding to ancient books, historical events etc. Generation) information, when including these personalities, ancient books, historical events in the keyword in inquiry request, The corresponding temporal information such as the personality in time tag storehouse, ancient books, historical events can be looked into as this Ask the scope prescribed information of request;Region tag library is used to record such as personality, ancient books, historical events Deng corresponding region (region of place, personality's birth or life that such as historical events occurs) information, , can be by region when including these personalities, ancient books, historical events in the keyword in inquiry request The corresponding regional informations such as personality, ancient books, historical events in tag library as the inquiry request model Enclose prescribed information.
Below, various embodiments of the present invention are described in detail.
Fig. 3 is a kind of flow chart of information search method provided in an embodiment of the present invention.This method can be by Fig. 1 Performed with the server 101 shown in Fig. 2.As shown in figure 3, the flow comprises the following steps:
S301:Obtain the inquiry request for information search;
Alternatively, it can obtain inquiry request from client to obtain inquiry request.
S302:At least one keyword is obtained from inquiry request;
Wherein, maximum forward matching algorithm can be used, passes through the individual character and participle in the inquiry request by input Dictionary carries out Forward Maximum Method, and participle is carried out to Chinese, the result after participle is extracted, so as to be formed at least One keyword.Such as, acquisition " woman servant " and " the Yuan Dynasty " two after participle is carried out to " woman servant's the Yuan Dynasty " Keyword.Alternatively it is also possible to carry out participle with reverse maximum matching method and bi-directional matching method.
S303:Obtain scope prescribed information;
Wherein, scope prescribed information is used for the scope that prescribed information is searched for;
Alternatively, the mode of acquisition scope prescribed information can be:Scope is obtained from inquiry request and limits letter Breath;Or, if a keyword is got, and the meaning of a word of a keyword defines the scope of information search, Then generate the scope prescribed information for describing the information search scope that the meaning of a word of a keyword is limited;Or Person, if two keywords are got, and the meaning of a word of the part or all of keyword in multiple keywords is defined The scope of information search, it is determined that what the meaning of a word of each keyword in part or all of keyword was limited The scope of information search;The scope for the information search that the meaning of a word of each keyword of determination is limited takes friendship Collection;Generate the scope prescribed information for describing the common factor.
Wherein, the concrete mode that scope prescribed information is obtained can refer to the explanation previously with regard to scope prescribed information Provided in acquisition modes.
S304:For each keyword at least one keyword, lookup meets scope prescribed information institute One or more conjunctive words of the keyword in the range of restriction;
Wherein, conjunctive word includes synonym and/or near synonym.
Alternatively, search meet scope prescribed information limit in the range of a keyword it is one or more The mode of conjunctive word can be:Search all conjunctive words of the keyword;For each association found Word, obtains the information of the scope of application of the conjunctive word;The scope of application and scope prescribed information are limited into scope Have overlapping conjunctive word, as meet scope prescribed information limit in the range of the keyword conjunctive word.
Alternatively, before the information of the scope of application for obtaining conjunctive word, a pass can also be obtained from text Join word;Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;If including, Then by the word of the scope of application for describing the conjunctive word, labeled as the letter of the scope of application of the conjunctive word Breath.For example, according to " capsicum another name for Sichuan Province claims hot pepper " this text, obtaining keyword " capsicum " and this being crucial In the conjunctive word " hot pepper " of word, the text, the word " another name for Sichuan Province " comprising the scope of application for describing the conjunctive word, So " another name for Sichuan Province " can serve as the information of the scope of application of the conjunctive word.
Alternatively, if getting a keyword, meet scope prescribed information finding and limit scope After one or more conjunctive words (step S304) of an interior keyword, it can also be sent to client One or more conjunctive words, and to each conjunctive word of transmission, send the letter of the scope of application of the conjunctive word Breath, the information for showing one or more conjunctive words and its corresponding scope of application in client.
Alternatively, if getting at least two keywords, each in at least one keyword Keyword, find meet scope prescribed information limit in the range of the keyword one or more associations , can also be by the pass of the different keywords at least two keywords found after word (step S304) Connection word between be combined, and by the Partial key word at least two keywords and find remaining pass It is combined between the conjunctive word of keyword;For each combination of formation, the scope of application of the combination is determined; One or more combinations with the non-NULL scope of application are sent to client, and to each combination of transmission, Send the information of the scope of application of the combination, for client show each combination and each combination The information of the scope of application.
Wherein, if a combination includes keyword, by scope prescribed information limited range and the group The common factor of the scope of application of each conjunctive word in conjunction, is used as the scope of application of the combination;If a combination In do not include keyword, then by the common factor of the scope of application of each conjunctive word in combining, be used as the group The scope of application of conjunction.
Wherein, show that the mode of the information of each combination and the scope of application of each combination can in client To be to add the information of the scope of application of the combination in the front or behind of each combination, as shown in Figure 4; Or, when scope prescribed information is two or more, under same scope prescribed information, displaying Relative one or more combinations with the non-NULL scope of application, for example, when scope prescribed information is Between information and during regional information, the exhibition methods of one or more combinations with the non-NULL scope of application can be as Shown in Fig. 5.
S305:Information search is carried out according to one or more conjunctive words of each keyword found, is obtained Information search result in the range of being limited positioned at scope prescribed information.
Alternatively, if getting a keyword, according to one or many of each keyword found Individual conjunctive word carries out information search, including:Only according to one or more associations of the keyword found Word carries out information search;Or according to one or more conjunctive words of the keyword found, and one Keyword carries out information search.
Alternatively, if getting at least two keywords, each in at least two keywords Keyword, find meet scope prescribed information limit in the range of the keyword one or more associations After word (step S304), believed according to one or more conjunctive words of each keyword found , can also be by the different keywords at least two keywords found before breath search (step S305) Conjunctive word between be combined, and by the Partial key word at least two keywords and find its It is combined between the conjunctive word of remaining keyword;According to one or more passes of each keyword found Join word and carry out information search, can include:Information search is carried out according to each combination of formation;Or according to At least two keywords, and each combination formed carry out information search.
Alternatively, obtain be located at scope prescribed information limit in the range of information search result after, and also Including:Obtained information search result is sent to client, and to each entry in information search result, Range of transmission prescribed information, in client exhibition information search result.
Wherein, the mode of client exhibition information search result can be with as shown in fig. 6, in information search result The above or below label range prescribed information of (content title for searching for obtained entry), i.e. displaying are searched While the content title for the entry that rope is obtained, the scope prescribed information associated with the title is shown.
Fig. 7 is a kind of flow chart of information acquisition method of the offer of the embodiment of the present invention, and this method is mainly used In the information of the scope of application of the conjunctive word of one keyword of acquisition and each conjunctive word from text, information The result of acquisition can provide the scope prescribed information of some keyword for abovementioned steps S303.Such as Fig. 7 institutes Show, the flow of this method is as follows:
S701:One or more conjunctive words of a keyword are obtained from text;
Conjunctive word includes synonym and/or near synonym;
Alternatively, can also be from before one or more conjunctive words of a keyword are obtained from text Keyword is found in text;The conjunctive word marker character of keyword is found from text;Determine conjunctive word marker character Matching range in the text, matching range is used to mark the position model that conjunctive word is likely to occur in the text Enclose;One or more conjunctive words of a keyword, Ke Yishi are obtained from text:Obtained out of matching range Take one or more conjunctive words.
Wherein, conjunctive word marker character is used to mark the conjunctive word of keyword and the incidence relation of keyword.
S702:For each conjunctive word of acquisition, searching is used for the applicable model for describing the conjunctive word in text The word enclosed;
S703:By the scope of application representated by the word found, labeled as the scope of application of the conjunctive word.
Fig. 8 is the flow chart of another information search method provided in an embodiment of the present invention.Wherein, with key Word is two, conjunctive word is exemplified by synonym, scope prescribed information are temporal information and regional information, to provide One example of method shown in Fig. 3.
S801:Obtain the inquiry request for information search;
Such as:Obtain user and input " woman servant's the Yuan Dynasty " this inquiry request in searched page query frame.
Alternatively, inquiry request can be the inquiry request of user's input or by a certain device or be The inquiry request of system generation.
S802:Extract inquiry content keyword;
Keyword in inquiry content is obtained using certain technological means.It can such as be matched and calculated using maximum forward Method, reverse maximum matching algorithm or bi-directional matching algorithm carry out participle.Being extracted from the result after participle will The keyword of inquiry.Such as, participle is carried out to " woman servant's the Yuan Dynasty " and obtains " woman servant " and " the Yuan Dynasty " two Individual keyword.
Wherein, step S802 can be considered an abovementioned steps S302 example.
S803:Searched for first using the keyword of acquisition, obtain search result first;
The keyword of acquisition is scanned for searching algorithm or instrument, the result obtained here is referred to as " search result first ", i.e., the search obtained in the case where being not introduced into the synonym of the keyword to be inquired about As a result.Such as, " woman servant " and " the Yuan Dynasty " two keywords obtained in S803 are scanned for, obtained Obtain search result first.
The keyword of acquisition is scanned for be considered as and existing searching method identical in step S803 Searching method, the search result first of acquisition can be with the binary search knot that is obtained in step S811 below Fruit is merged.
S804:Judge whether contain time or regional information in keyword;If so, performing step S806; If it is not, performing step S805;
Wherein, temporal information, regional information can be considered as an example of aforementioned range prescribed information.
Such as:If the meaning of a word of only one keyword and a keyword defines the scope of information search, Then generate for describing the information search scope that the meaning of a word of a keyword is limited, limited as foregoing scope Determine information.
For another example:If there is the part or all of keyword at least two keywords and at least two keywords The meaning of a word define the scope of information search, it is determined that each keyword in part or all of keyword The information search scope that the meaning of a word is limited, and the information that the meaning of a word of each keyword of determination is limited searches Common factor is taken between rope scope, scope prescribed information is used as using occuring simultaneously.
One word related to time or regional information, is not limited only to the word clearly word containing temporal information, Such as time, place name.It should also also include some words that can be substantially associated with time or regional information, The time or dynasty in place or books writing as where " Mount Huang " or " A Dream of Red Mansions " can be associated with things.
How to judge that a word is related to time or regional information, can be by setting up a time tag storehouse or ground Domain tag library is realized.
Be associated with personality in time tag storehouse or region tag library, ancient books, the time of historical events etc. or Regional information.If the keyword in inquiry content is included in time tag storehouse, itself and temporal information phase Association;If inquiring about the keyword in content to be included in the tag library of region, it is associated with regional information.
After analyzing keyword, the temporal information or regional information of the crucial word association are exported.If Keyword has different temporal informations or different regional informations, then these information is taken and occur simultaneously and export. If the keyword in inquiry content is not associated with temporal information or regional information, then it is assumed that the inquiry request Associate all times or regional information.
Such as:" woman servant " and " the Yuan Dynasty " two keywords are analyzed, the keyword with association in time is obtained. " woman servant " and " the Yuan Dynasty " is contrasted with time tag storehouse successively during analysis.In time mark " woman servant " does not have temporal information in label storehouse, and " the Yuan Dynasty " possesses temporal information.Therefore " member is obtained Temporal information associated by court ".The temporal information of association can be towards code name:The Yuan Dynasty or time Section:1271~1236 Christian era, while can also be the information of other expression times.
In the present invention, when the temporal information or regional information of crucial word association can also be asked by input inquiry Specified, such as to user by providing window or plug-in unit come the temporal information or regional information of input inquiry. In this case, it is higher than by providing window or the temporal information of plug-in unit input or the priority of regional information The temporal information or regional information obtained after being contrasted by keyword and tag library.
In addition, the acquisition of the regional information of keyword in inquiry request it is also possible to use IP address understand, it is fixed The modes such as position device positioning are realized.
Wherein, judge in step S804 keyword whether the purpose containing time or regional information and foregoing step Rapid S303 is identical, is to find scope prescribed information.
S805:All synonyms are obtained from the thesaurus with time and regional information;
Certainly, thesaurus also can only include temporal information, or only include regional information, or, for portion Divide synonym, these synonyms have temporal information;And for other synonyms, these synonyms have low In information.These information are used for the scope of application for limiting synonym, for screening synonym.
S806:From the thesaurus with time and regional information obtain with step S804 in obtain when Between information or the corresponding synonym of regional information;
Such as:Analyze " woman servant " in the synonym in period in the Yuan Dynasty when, " father " and " mother " this two Temporal information associated by individual synonym is " the Yuan Dynasty ", then " father " and " mother " believes for the corresponding time Cease (the Yuan Dynasty) corresponding synonym.
Wherein, synonym is an example of foregoing conjunctive word, time or regional information associated by synonym The information of the scope of application of conjunctive word in as abovementioned steps S304.
Wherein, step S805 and step S806 can be considered an abovementioned steps S304 example.
S807:Original keyword is substituted using the synonym of acquisition, and the additional phase after the completion of replacement The temporal information answered, forms new crucial phrase;
Such as:The new keywords group in " woman servant's the Yuan Dynasty " is " woman servant member ", " father's member ", " father's the Yuan Dynasty ", " mother's member ", " mother's the Yuan Dynasty " etc..
Here the alternative of synonym can be using " full combination " method, the Chinese key group of such as input " Chinese word 2 " of Chinese word 1, Chinese word 1 has 5 synonyms, and Chinese word 2 has 4 synonyms, The new keywords group then formed is 29 kinds (29=6*5-1).Wherein, " Chinese is not included in new keywords group This crucial phrase of the Chinese word 2 " of word 1.
Particular/special requirement is not made to the replacement method of synonym in embodiments herein, as long as can realize synonymous The replacement of word.
Wherein, step S807 can be considered in abovementioned steps S304, " will when there is at least two keywords It is combined between the conjunctive word of different keywords at least two keywords found, and will at least Partial key word in two keywords and be combined between the conjunctive word of remaining keyword found " One example.
S808:New keywords group with time or regional information is handled, search suggestion is formed;
In step S808, foregoing can be considered to the new keywords group progress processing with time or regional information In step S304, it will be carried out between the conjunctive word of the different keywords at least two keywords found Combination, and by the Partial key word at least two keywords and the conjunctive word of remaining keyword found Between be combined after, one of process of the scope of application of the combination is determined to each obtained combination Example.
Carry out logicality analysis to the new keywords group of acquisition first, such as " Chinese word 2 " of Chinese word 1 it is new Crucial phrase is " synonym 1-1 synonym 2-1 " then analyze synonym 1-1 and synonym 2-1 time It is new crucial if overlapped, then it is assumed that it is an effective new keywords group or whether regional information overlaps The temporal information of phrase is the common factor of synonym 1-1 and synonym 2-1 temporal information, new keywords group The common factor of regional information synonym 1-1 and synonym 2-1 regional information.If synonym 1-1 and synonymous Word 2-1 temporal information or regional information is misaligned, then it is assumed that it is an invalid new keywords group.
Wherein, new keywords group is " by least two keywords found in abovementioned steps S304 Different keywords conjunctive word between be combined, and by the Partial key word at least two keywords Be combined between the conjunctive word of remaining keyword found " after an obtained example of combination, newly The temporal information or regional information of crucial phrase are the " scope of application of the combination in abovementioned steps S304 Information " example.
After effective new keywords group is obtained, according to the formation search suggestion of effective new keywords group.Searching In Suo Jianyi forming process, effective new keywords group can be ranked up, according to setting output wherein It is one or more, formed search suggestion.The degree of correlation of current new keywords group and former crucial phrase is such as evaluated, Arranged according to descending, extract the formation search suggestion of the first two new keywords group.Here the evaluation of the degree of correlation can be with Different modes are taken, are such as ranked up by the length of time span or the size of region of new keywords group, Or be ranked up according to the number of the historical search number of times of new keywords group.It is right in embodiments of the invention Sort method is not construed as limiting.
Such as:In new keywords group " woman servant's member ", " father's member ", " father's the Yuan Dynasty ", " mother's member " is " female It may be selected when search suggestion is chosen in close the Yuan Dynasty " etc. containing a pair of minimum crucial phrases of time range, it is such as " female Parent " and the synonymy of " woman servant " are not limited to the Yuan Dynasty, and define the Yuan Dynasty in the keyword this time searched for This scope prescribed information, therefore prioritizing selection contains the search suggestion of " father ", " mother " then conduct Not preferred search suggestion;And " the Yuan Dynasty " can accurately more state temporal information than " member ", thus it is excellent First search of the selection containing " the Yuan Dynasty " advises that " member " then advises as not preferred search.Ultimately form " father's the Yuan Dynasty " this search is advised.
It is alternatively possible to be built obtaining effective new keywords group and forming search according to these new keywords groups The search suggestion of formation is shown after view, the exhibition method of search suggestion may be referred to Fig. 4 and Fig. 5 Shown exhibition method.
S809:Judge whether to perform the search suggestion formed in step S808;If so, step S811 is performed, If it is not, performing step S810;
S810:Search result first is obtained, step S813 is performed;
S811:The search formed in step S808 is performed to advise and form binary search result;
For " search result first " for making the search result obtained in step S811 with being obtained in step S803 Distinguish, the search result obtained in step S811 is referred to as " binary search result ".
Alternatively, if the search suggestion formed in step S808 has multiple, it can be selected in search suggestion In it is one or more perform.
S812:Merge search result and binary search result first;
Wherein, the mode of fusion can be that the result retrieved is ranked up with searching order rule, such as According to the matching degree of keyword, position, the frequency, the link quality of appearance etc. in webpage, calculate and respectively search The degree of correlation and ranking grade of hitch fruit, then according to degree of association height, in order return to search result User.
Such as:Suggestion, search result and the search " woman servant's the Yuan Dynasty " of acquisition must be searched for by performing " father's the Yuan Dynasty " Acquired search result is blended.
S813:Return to search result and search is advised.
If performing the search suggestion obtained in step S809, the search result returned is search knot first Search result after really being merged with binary search result, in addition, the search formed in also return to step S808 It is recommended that.
If being not carried out the search suggestion obtained in step S809, the search result returned is search knot first Really, in addition, the search suggestion formed in also return to step S808.
Search suggestion is returned to after client, client can select displaying search suggestion, such as, searching " father (the Yuan Dynasty) " is shown in Suo Jianyi columns, " (the Yuan Dynasty) father ", " father, the Yuan Dynasty ", " the Yuan Dynasty, The search that father " etc. has temporal information is advised.It should be noted that the presentation of final search result is disobeyed The displaying of Lai Yu search suggestions.
Method shown in Fig. 8 can be considered as a citing of method shown in Fig. 3.In the flow of method shown in Fig. 8 In the embodiment be not described in detail can refer to the description of method shown in Fig. 3.
Fig. 9 shows the flow chart of another information acquisition method provided in an embodiment of the present invention.Shown in Fig. 9 Method can be considered an example of method shown in Fig. 7.Below, with reference to Fig. 9, the present invention is illustrated real A kind of information acquisition method of example offer is provided.
Fig. 9 gives the synonym that a keyword is obtained from text, and time/region of synonym is believed The flow chart of the method for breath, may finally form one using the information acquired in this method has time/region The form of the thesaurus of information, the thesaurus is different from existing thesaurus, and it includes time letter Breath and regional information, the thesaurus can be by adding temporal information and region in existing thesaurus Information realization, its structure can be as shown in table 1.
In table 1, keyword is Chinese word, its may have multiple synonyms (synonym 1, it is synonymous Word 2, synonym 3 etc.).Also to associated by each synonym while the synonym of keyword is recorded Time or regional information are recorded.
Herein, temporal information can be the dynasty, and in the time, the information such as period, regional information can be region, The information such as province.
It should be noted that the thesaurus with time or regional information is not limited to the knot shown in table 1 The structure of structure, other times that can embody synonym or regional information also may be used.
Table 1
Wherein, before the thesaurus with time or regional information obtained in the method shown in Fig. 9 can be used for State the time in step S806 associated by acquisition synonym or regional information.
S901:Obtain synonym matched text;
Web page text is read by spiders technology, or mode is imported etc. by database text and is obtained With text.Explaining in detail for entry, citation explanation etc. are obtained such as by websites such as " Chinese allusion quotations ".
Wherein, synonym matched text is an example of the text in abovementioned steps S701.
S902:Extract synonym marker character;
Wherein, synonym marker character is an example of the conjunctive word marker character in abovementioned steps S701. The position that the conjunctive word of the keyword to be searched to mark occurs in the text.
Matched text is traveled through, synonym marker character contained in all matched texts is extracted, such as " abbreviation ", " again Name " etc..Obtaining the mode of synonym marker character can be, by by the word and mark in synonym matched text Quasi-synonym marker character storehouse is compared, so as to obtain synonym marker character.
Wherein, standard synonym marker character storehouse is used to record all synonym marker characters.
S903:Judge whether to have analyzed all synonym marker characters;
If so, step S909 is performed, if it is not, performing step S904.
S904:The matching range of next synonym marker character is analyzed, the synonym in the range of this is obtained;
The matching range of synonym marker character be conjunctive word marker character in abovementioned steps S701 in the text Matching range an example.
There may be multiple synonyms in the matching range of one synonym marker character.Such as " gardenia also known as Cape jasmine There is the synonym of two " gardenia " in son, Yellow Fructus Gardeniae ":" cape jasmine ", " Yellow Fructus Gardeniae ".So, exist The matching range of the synonym marker character is also obtained after obtaining synonymous unified word marker character, to determine which is arrived Untill word or which punctuate, in text behind word be no longer the keyword synonym.
The acquisition of matching range can be divided by words, sentence is divided, and paragraph is drawn grading mode and realized. During some knowledge class texts are explained, during such as the entry of " Chinese allusion quotation " is explained, synonym marker character is more special, As " word explanation ", " citation is explained " ensuing several sections of texts may be explained all to the entry Content, this several sections of texts belong to matching range.
S905:Import time tag storehouse and region tag library;
Time tag storehouse and region tag library are used to record and historical events, personage, books, the correlation such as article The temporal information and regional information of connection.The temporal information such as associated with " Cao Xueqin " can for " Qing Dynasty " or It is lived the time;The regional information associated with " Mount Huang " can be " Anhui " or " Mt. Huang in Anhui city " etc. Regional information.
S906:Obtain the time in synonym marker character or regional information;
As having and temporal information in the synonym marker character " Ming Dynasty claims " in " the sub- Ming Dynasty Cheng Dong gardens of The South Pool " Related word " Ming Dynasty ", " Ming Dynasty " can as " Dong Yuan " this synonym temporal information.Herein to same Time or regional information in adopted word marker character do not make particular/special requirement, are not limited to the above method.
S907:The time in matching range or regional information are obtained, and it is associated with synonym;
Time or regional information in matching range obtain can also by comprising content of text carry out Acquired results are simultaneously contrasted and realized by participle with time tag storehouse and region tag library.
After time or the regional information in matching range is obtained, it is associated on the synonym that it includes. Establishment on the time in matching range or regional information and synonym to correlation time information, can use but It is not limited to following method.
I) contain one or more synonyms in matching range, contain a time or regional information.It is all Synonymous word association unique time or regional information.
II) contain one or more synonyms in matching range, contain multiple times or regional information.Each Closest time or regional information in sentence where synonymous word association or paragraph.If nothing in current paragraph Temporal information association " modern times " or " current " etc. the expression of correlation time and regional information, the then synonym Current temporal information, regional information associates the regional information that " Zone Full " etc. represents all regions.
III one or more synonyms pair, no time or regional information) are contained in matching range.By synonym To the current temporal information of temporal information association " modern times " or " current " etc. expression, association " whole areas Domain " etc. represents the temporal information in all regions.
Wherein, step S907 can be considered an abovementioned steps S702~step S703 example
S908:The step S907 synonyms with time and regional information obtained are added into thesaurus In.
It is alternatively possible to carry out filtration treatment to the synonym for adding thesaurus, i.e.,:If existed Time associated by the synonym or regional information, then be added to as shown in table 1 same by identical synonym Temporal information or the column of regional information one in adopted dictionary.If there is no identical synonym, then when will have Between or the synonym of regional information be added in thesaurus, and record the temporal information associated by the synonym Or regional information.
Perform after step S908, return to step S903.That is, step S903~step S908 is a circulation Process, until synonym marker character all in text all analyzes completion, cyclic process terminates, and output has The thesaurus of temporal information and regional information.
S909:Thesaurus of the output with time and regional information.
Method shown in Fig. 9 can be considered as not detailed in an example of method shown in Fig. 7, method shown in Fig. 9 The part of description, which can refer in Fig. 7, accordingly to be described.
Figure 10 is a kind of structural representation of information retrieval device provided in an embodiment of the present invention, and the information is searched Rope device is used to perform the information search method shown in Fig. 3.As shown in Figure 10, the device includes:
Inquiry request acquisition module 1001, for obtaining the inquiry request for information search;
Keyword acquisition module 1002, for obtaining at least one keyword from inquiry request;
Scope prescribed information acquisition module 1003, for obtaining scope prescribed information, scope prescribed information is used for The scope of prescribed information search;
Conjunctive word searching modul 1004, for for each keyword at least one keyword, searching Meet scope prescribed information limit in the range of the keyword one or more conjunctive words, conjunctive word includes Synonym and/or near synonym;
Search module 1005, for each keyword for being found according to conjunctive word searching modul 1004 One or more conjunctive words carry out information searches, obtain being located at scope prescribed information limit in the range of information Search result.
Alternatively, search module 1005 is pressed when keyword acquisition module 1002 gets a keyword One or more conjunctive words of the keyword found according to conjunctive word searching modul 1004 enter row information and searched Rope;Or one or more conjunctive words of the keyword found according to conjunctive word searching modul 1004, And the keyword that keyword acquisition module 1002 is got carries out information search.
Alternatively, the information retrieval device also includes:Word combination module, in keyword acquisition module 1002 when getting at least two keywords, in conjunctive word searching modul 1004 at least two keywords In each keyword, find meet scope prescribed information limit in the range of one of the keyword Or after multiple conjunctive words, search module 1005 according to conjunctive word searching modul 1004 find it is each One or more conjunctive words of individual keyword are carried out before information search, and conjunctive word searching modul 1004 is looked into It is combined between the conjunctive word of different keywords at least two keywords found, and will at least two Partial key word in individual keyword and it is combined between the conjunctive word of remaining keyword found;
Search module 1005 specifically for:At least two keywords are got in keyword acquisition module 1002 When, carry out information search according to each combination of word combination module formation;Or it is crucial according at least two Word, and each combination of word combination module formation carry out information search.
Wherein, search module 1005, can be merely with conjunctive word searching modul 1004 when carrying out information search The conjunctive word found is scanned for, the keyword that can also be obtained using keyword acquisition module 1002 And the conjunctive word that conjunctive word searching modul 1004 is found is scanned for.
Alternatively, conjunctive word searching modul 1004 specifically for:For each at least one keyword Individual keyword, searches all conjunctive words of the keyword;For each conjunctive word found, obtaining should The information of the scope of application of conjunctive word;Have overlapping between with scope prescribed information the scope of application is limited into scope Conjunctive word, as meet scope prescribed information limit in the range of the keyword conjunctive word.
Alternatively, inquiry request acquisition module 1001 specifically for:Inquiry request is obtained from client;
The information retrieval device also includes:Conjunctive word sending module, is used for:
When keyword acquisition module 1002 gets a keyword, looked into conjunctive word searching modul 1004 Find meet scope prescribed information limit in the range of a keyword one or more conjunctive words after, One or more conjunctive words are sent to client, and to each conjunctive word of transmission, send the conjunctive word The information of the scope of application;
Or be used for:
When keyword acquisition module 1002 gets at least two keywords, in conjunctive word searching modul 1004 for each keyword at least two keywords, finds and meets scope prescribed information and limited In the range of the keyword one or more conjunctive words after, conjunctive word searching modul 1004 is found At least two keywords in different keywords conjunctive word between be combined, and by least two close Partial key word in keyword and it is combined between the conjunctive word of remaining keyword found;For being formed Each combination, determine the scope of application of the combination;Send one or more suitable with non-NULL to client With the combination of scope, and to each combination of transmission, send the information of the scope of application of the combination.
Wherein, if a combination includes keyword, by scope prescribed information limited range and the group Common factor between the scope of application of each conjunctive word in conjunction, is used as the scope of application of the combination;
If not including keyword in a combination, the scope of application of each conjunctive word during this is combined it Between common factor, be used as the scope of application of the combination.
Alternatively, the information retrieval device also includes:Scope of application information flag module, in conjunctive word Searching modul 1004 is obtained before the information of the scope of application of each conjunctive word, and one is obtained from text Conjunctive word;Judge whether include being used to describe the word of the scope of application of the conjunctive word in text;If including, Then by the word of the scope of application for describing the conjunctive word, labeled as the letter of the scope of application of the conjunctive word Breath.
Alternatively, scope prescribed information acquisition module 1003 specifically for:
Scope prescribed information is obtained from inquiry request;Or
If keyword acquisition module 1002 gets a keyword and the meaning of a word of a keyword is defined The scope of information search, then generate for describing the information search scope that the meaning of a word of a keyword is limited, It is used as scope prescribed information;Or
If keyword acquisition module 1002 is got at least two keywords and at least two keywords The meaning of a word of part or all of keyword defines the scope of information search, it is determined that in part or all of keyword Each keyword the information search scope that is limited of the meaning of a word, and by the word of each keyword of determination Common factor is taken between the information search scope that justice is limited, scope prescribed information is used as using occuring simultaneously.
Alternatively, inquiry request acquisition module 1001 specifically for:Inquiry request is obtained from client;Dress Putting also includes:Search result sending module, for obtaining being located at scope prescribed information in search module 1005 After information search result in the range of limiting, obtained information search result is sent to client, and it is right Each entry in information search result, range of transmission prescribed information.
In Figure 10 shown devices, inquiry request, which obtains mould 1001, to be used to perform abovementioned steps S301;It is crucial Word acquisition module 1002 is used to perform abovementioned steps S302;Scope prescribed information acquisition module 1003 is used to hold Row abovementioned steps S303;Conjunctive word searching modul 1004 is used to perform abovementioned steps S304;Search module 1005 are used to perform abovementioned steps S305;Word combination module is used to perform difference in abovementioned steps S304 It is combined between the conjunctive word of keyword and carries out the conjunctive word of Partial key word and remaining keyword The step of combination;Conjunctive word composite module, which is used to perform in abovementioned steps S304, to be sent conjunctive word and its is applicable The step of scope;Scope of application information flag module is used to perform the mark conjunctive word in abovementioned steps S304 The scope of application the step of;Search result sending module is used for after performing abovementioned steps S305, will search for As a result it is sent to client.
The function not being described in detail in each module shown in Figure 10 and operation, in detail as shown in Figure 3 in flow Corresponding description.
The modules included by device shown in Figure 10, can be by the processor 202 in Fig. 2 when realizing The programmed instruction that is stored in run memory 203 is realized., may when each module performs corresponding operation Can be related to server 101 and other equipment, such as:Client 102 or outside memory 103 it Between interaction, can be controlled when realizing by processor 202 I/O interfaces 201 complete these interaction.In addition, When modules perform corresponding operation, the access to memory 203 may be related to, can be by when realizing Processor 202 obtains data storage from memory 203.
A kind of structural representation for information acquisition device that Figure 11 provides for the application, as shown in figure 11, should Device includes:
Conjunctive word acquisition module 1101, one or more associations for obtaining a keyword from text Word, conjunctive word includes synonym and/or near synonym;
Word searching modul 1102, for each conjunctive word obtained for conjunctive word module, in the text Search the word of the scope of application for describing the conjunctive word;
Range flags module 1103, for the word for finding word searching modul, labeled as the conjunctive word The scope of application information.
Alternatively, the device also includes:
Keyword lookup module, one or more conjunctive words for obtaining keyword in conjunctive word acquisition module Before, keyword is found from text;
Conjunctive word marker character searching modul, states the conjunctive word marker character that keyword is found in text, conjunctive word mark Note symbol is used to mark the conjunctive word of keyword and the incidence relation of keyword;
Matching range determining module, for determining the matching range of conjunctive word marker character in the text, matches model Enclose the position range for marking conjunctive word to be likely to occur in the text;
Range flags module specifically for:
One or more conjunctive words are obtained out of matching range.
In information acquisition device shown in Figure 11, conjunctive word acquisition module 1101 is used to perform abovementioned steps S701, word searching modul 1102 is used to perform abovementioned steps S702, and range flags module 1103 is used to hold Row abovementioned steps S703, the lookup keyword that keyword lookup module is used to perform in abovementioned steps S701 Operation, conjunctive word marker character searching modul is used to perform the lookup conjunctive word marker character in abovementioned steps S701 Operation, matching range determining module is used to perform determination conjunctive word marker character in abovementioned steps S701 The operation of matching range.
The modules included by device shown in Figure 11, can be by the processor 202 in Fig. 2 when realizing The program stored in memory 203 is called to realize.When each module performs corresponding operation, it may relate to And to server 101 and other equipment, such as:Between the memory 103 of client 102 or outside Interaction, can be controlled I/O interfaces 201 to complete these interactions by processor 202 when realizing.In addition, at each When module performs corresponding operation, the access to memory 203 may be related to, can be by handling when realizing Device 202 obtains data storage from memory 203.
The detailed flow as shown in Figure 7 of function or operation that information acquisition device shown in Figure 11 is not described in detail In corresponding description.
Below, with reference to Figure 12, another information retrieval device provided in an embodiment of the present invention is illustrated.Its In, Figure 12 using keyword be at least two, conjunctive word as synonym, scope prescribed information be temporal information Exemplified by regional information, an example of Figure 10 shown devices is provided.
As shown in figure 12, the information retrieval device includes:
Keyword acquisition module 1201, the keyword for obtaining search from client.Wherein, keyword can To pass through the keyword that participle is obtained for the search statement that is inputted by user, or user specifies or selected The keyword selected, or select or input by some setting input windows and obtain keyword etc..
Thesaurus memory module 1202 with time or regional information, for storing keyword acquisition module The synonym of 1201 keywords obtained, the synonym has time or regional information, and its structure can be table Structure shown in 1.In table 1, temporal information can be dynasty, time, the information such as period, region letter It can be region to cease, the information such as province.
Thesaurus memory module with time or regional information is not limited to the structure described in table 1, its It can embody the time of synonym or the result of regional information also may be used.
Synonym processing module 1205 is when handling synonym according to time or regional information The temporal information or regional information that thesaurus memory module 1202 obtains each synonym are (i.e. foregoing to close Join an example of the information of the scope of application of word).
Time/region tag library memory module 1203, for record and historical events, personage, books, thing The words such as product are associated time or regional information, in the embodiment shown in fig. 12, keyword processing module It will be recorded when 1204 pairs of keywords are handled in keyword and time/region tag library memory module 1203 Word is contrasted, and obtains the temporal information included in keyword or regional information (i.e. aforementioned range restriction One example of information).
Keyword processing module 1204, for judging whether keyword is related to time or regional information, if It is related then obtain corresponding time or regional information.By keyword and time/region tag library memory module The word recorded in 1203 is contrasted, if keyword is included in time/region tag library memory module 1203 In, then obtain the time of the keyword or regional information in time/region tag library memory module 1203.This Outside, if the not no keyword related to time or regional information, its output time information can be following two: Without time or regional information, or all times or regional information.As needed one of which can be selected defeated Go out mode.
Synonym processing module 1205, for the keyword that obtains keyword acquisition module 1201 with having The synonym of the keyword in the thesaurus memory module 1202 of time or regional information is substituted, And additional period or regional information, form the synonym crucial phrase with time or regional information (i.e. foregoing One example of the new keywords group in step S807).
Search suggestion processing module 1206, the synonym with time or regional information obtained for filtering Crucial phrase, forms search suggestion.
Search suggestion sending module 1207, builds for sending the search with time or regional information to client View.
Search module 1208, is scanned for for treating searching keyword group and its synonym crucial phrase.
Search result is stored and sending module 1209, for storing search result and sending search knot to client Really.
In Figure 12, keyword acquisition module 1201 is an example of foregoing keyword acquisition module 1001; Thesaurus memory module 1202 with time or regional information has time or ground for what is obtained in Fig. 9 One example of the thesaurus of domain information, for meeting for foregoing conjunctive word searching modul 1004 in lookup Scope prescribed information limit in the range of the keyword one or more conjunctive words when provide information sum According to;Time/region tag library memory module 1203 is that aforementioned range prescribed information acquisition module 1003 is being obtained Information and data are provided during scope prescribed information;Keyword processing module 1204 is aforementioned range prescribed information One example of acquisition module 1003;Synonym processing module 1205 is one of foregoing word combination module Example;Search suggestion processing module 1206 provides conjunctive word and its applicable model for foregoing keyword sending module The information enclosed;Search suggestion sending module 1207 is an example of foregoing conjunctive word sending module;Search Module 1208 is an example of previous searches module 1005;Search result is stored and sending module 1209 For an example of previous searches result sending module.
The function for each module not being described in detail in Figure 12 and operation refer to the corresponding description in Figure 10.
The modules included by device shown in Figure 12, can be by the processor 202 in Fig. 2 when realizing The programmed instruction that is stored in run memory 203 is realized., may when each module performs corresponding operation Can be related to server 101 and other equipment, such as:Client 102 or outside memory 103 it Between interaction, can be controlled when realizing by processor 202 I/O interfaces 201 complete these interaction.In addition, When modules perform corresponding operation, the access to memory 203 may be related to, can be by when realizing Processor 202 obtains data storage from memory 203.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or meter Calculation machine program product.Therefore, the present invention can be using complete hardware embodiment, complete software embodiment or knot The form of embodiment in terms of conjunction software and hardware.Wherein wrapped one or more moreover, the present invention can be used Containing computer usable program code computer-usable storage medium (include but is not limited to magnetic disk storage, CD-ROM, optical memory etc.) on the form of computer program product implemented.
The present invention is with reference to the production of method according to embodiments of the present invention, equipment (system) and computer program The flow chart and/or block diagram of product is described.It should be understood that can by computer program instructions implementation process figure and / or each flow and/or square frame in block diagram and the flow in flow chart and/or block diagram and/ Or the combination of square frame.These computer program instructions can be provided to all-purpose computer, special-purpose computer, insertion Formula processor or the processor of other programmable data processing devices are to produce a machine so that pass through and calculate The instruction of the computing device of machine or other programmable data processing devices is produced for realizing in flow chart one The device for the function of being specified in individual flow or multiple flows and/or one square frame of block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or the processing of other programmable datas to set In the standby computer-readable memory worked in a specific way so that be stored in the computer-readable memory Instruction produce include the manufacture of command device, the command device realization in one flow or multiple of flow chart The function of being specified in one square frame of flow and/or block diagram or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices, made Obtain and perform series of operation steps on computer or other programmable devices to produce computer implemented place Reason, so that the instruction performed on computer or other programmable devices is provided for realizing in flow chart one The step of function of being specified in flow or multiple flows and/or one square frame of block diagram or multiple square frames.
, but those skilled in the art once know base although preferred embodiments of the present invention have been described This creative concept, then can make other change and modification to these embodiments.So, appended right will Ask and be intended to be construed to include preferred embodiment and fall into having altered and changing for the scope of the invention.
Obviously, those skilled in the art can carry out various changes and modification without departing from this hair to the present invention Bright spirit and scope.So, if the present invention these modifications and variations belong to the claims in the present invention and Within the scope of its equivalent technologies, then the present invention is also intended to comprising including these changes and modification.

Claims (22)

1. a kind of information search method, it is characterised in that including:
Obtain the inquiry request for information search;
At least one keyword is obtained from the inquiry request;
Scope prescribed information is obtained, the scope prescribed information is used for the scope that prescribed information is searched for;
For each keyword at least one described keyword, lookup meets the scope prescribed information One or more conjunctive words of the keyword in the range of limiting, the conjunctive word includes synonym and/or near Adopted word;
Information search is carried out according to one or more of conjunctive words of each keyword found, is obtained Positioned at the scope prescribed information limit in the range of information search result.
2. the method as described in claim 1, it is characterised in that if getting a keyword, press Information search is carried out according to one or more of conjunctive words of each keyword found, including:
Information search is carried out according to one or more of conjunctive words of the one keyword found;Or
According to one or more of conjunctive words of the one keyword found, and one pass Keyword carries out information search.
3. the method as described in claim 1, it is characterised in that if getting at least two keywords, Then each keyword in at least two keyword, finds and meets the scope restriction letter Breath limit in the range of the keyword one or more conjunctive words after, according to find each close One or more of conjunctive words of keyword are carried out before information search, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found, And by the Partial key word at least two keyword and the conjunctive word of remaining keyword found Between be combined;
One or more of conjunctive words according to each keyword found carry out information search, bag Include:
Information search is carried out according to each combination of formation;Or
According at least two keyword, and each combination formed carries out information search.
4. the method as described in any one of claims 1 to 3, it is characterised in that lookup meets the scope Prescribed information limit in the range of a keyword one or more conjunctive words, including:
Search all conjunctive words of the keyword;
For each conjunctive word found, the information of the scope of application of the conjunctive word is obtained;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make For meet the scope prescribed information limit in the range of the keyword conjunctive word.
5. method as claimed in claim 4, it is characterised in that obtain the inquiry request, including: The inquiry request is obtained from client;
If getting a keyword, meet finding in the range of the scope prescribed information limits After one or more conjunctive words of one keyword, in addition to:
One or more of conjunctive words are sent to the client, and to each conjunctive word of transmission, hair Give the information of the scope of application of the conjunctive word.
6. method as claimed in claim 4, it is characterised in that obtain the inquiry request, including: The inquiry request is obtained from client;
If getting at least two keywords, each in at least two keyword is crucial Word, find meet the scope prescribed information limit in the range of the keyword one or more associations After word, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found, And by between the Partial key word and the conjunctive word of remaining keyword found at least two keywords It is combined;
For each combination of formation, the scope of application of the combination is determined;Wherein, if being wrapped in a combination Include keyword, then each conjunctive word during the scope prescribed information limited range is combined with this Common factor between the scope of application, is used as the scope of application of the combination;If not including keyword in a combination, Then by the common factor between the scope of application of each conjunctive word in combining, the applicable model of the combination is used as Enclose;
One or more combinations with the non-NULL scope of application are sent to the client, and to each of transmission Individual combination, sends the information of the scope of application of the combination.
7. the method as described in any one of claim 4~6, it is characterised in that obtain each conjunctive word The scope of application information before, in addition to:
A conjunctive word is obtained from text;
Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including by the word of the scope of application for describing the conjunctive word, labeled as the suitable of the conjunctive word With the information of scope.
8. the method as described in any one of claim 1~7, it is characterised in that obtain the scope and limit Information, including:
The scope prescribed information is obtained from the inquiry request;Or
If the meaning of a word for getting a keyword and one keyword defines the scope of information search, Then generate for describing the information search scope that the meaning of a word of one keyword is limited, be used as the scope Prescribed information;Or
If getting the part or all of keyword at least two keywords and at least two keyword The meaning of a word define the scope of information search, it is determined that each in the part or all of keyword is crucial The information search scope that the meaning of a word of word is limited, and the letter that the meaning of a word of each keyword of determination is limited Breath takes common factor between hunting zone, regard the common factor as the scope prescribed information.
9. the method as described in any one of claim 1~8, it is characterised in that obtain the inquiry request, Including:The inquiry request is obtained from client;
Obtain be located at the scope prescribed information limit in the range of information search result after, also wrap Include:
Obtained information search result is sent to the client, and to each in information search result Mesh, sends the scope prescribed information.
10. a kind of information acquisition method, it is characterised in that including:
From text obtain a keyword one or more conjunctive words, the conjunctive word include synonym and / or near synonym;
For each conjunctive word of acquisition, the applicable model for describing the conjunctive word is searched in the text The word enclosed;
By the word found, labeled as the information of the scope of application of the conjunctive word.
11. the method stated such as claim 10 a, it is characterised in that keyword is being obtained from text One or more conjunctive words before, in addition to:
The keyword is found from the text;
The conjunctive word marker character of the keyword is found from the text, the conjunctive word marker character is used to mark Remember the conjunctive word of the keyword and the incidence relation of the keyword;
Matching range of the conjunctive word marker character in the text is determined, the matching range is used to mark The position range that the conjunctive word is likely to occur in the text;
One or more conjunctive words of a keyword are obtained from text, including:
One or more of conjunctive words are obtained out of described matching range.
12. a kind of information retrieval device, it is characterised in that including:
Inquiry request acquisition module, for obtaining the inquiry request for information search;
Keyword acquisition module, for obtaining at least one keyword from the inquiry request;
Scope prescribed information acquisition module, for obtaining scope prescribed information, the scope prescribed information is used for The scope of prescribed information search;
Conjunctive word searching modul, for for each keyword at least one described keyword, searching Meet the scope prescribed information limit in the range of the keyword one or more conjunctive words, the pass Joining word includes synonym and/or near synonym;
Search module, for found according to the conjunctive word searching modul described the one of each keyword Individual or multiple conjunctive words carry out information searches, obtain being located at the scope prescribed information limit in the range of letter Cease search result.
13. device as claimed in claim 12, it is characterised in that the search module specifically for: When the keyword acquisition module gets a keyword,
One or more of passes of the one keyword found according to the conjunctive word searching modul Join word and carry out information search;Or
One or more of passes of the one keyword found according to the conjunctive word searching modul Join word, and one keyword that the keyword acquisition module is got carries out information search.
14. device as claimed in claim 12, it is characterised in that
Described device also includes:Word combination module, for being got at least in the keyword acquisition module During two keywords, in the conjunctive word searching modul for each pass at least two keyword Keyword, find meet the scope prescribed information limit in the range of the keyword one or more passes After connection word, each keyword found in the search module according to the conjunctive word searching modul One or more of conjunctive words are carried out before information search, the institute that the conjunctive word searching modul is found It is combined between the conjunctive word for stating the different keywords at least two keywords, and at least two is closed Partial key word in keyword and it is combined between the conjunctive word of remaining keyword found;
The search module specifically for:At least two keywords are got in the keyword acquisition module When,
Information search is carried out according to each combination of word combination module formation;Or
Each combination according at least two keyword, and word combination module formation is carried out Information search.
15. the device as described in any one of claim 12~14, it is characterised in that the conjunctive word is searched Module specifically for:
For each keyword at least one described keyword, the institute for searching the keyword is relevant Word;
For each conjunctive word found, the information of the scope of application of the conjunctive word is obtained;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make For meet the scope prescribed information limit in the range of the keyword conjunctive word.
16. device as claimed in claim 15, it is characterised in that
The inquiry request acquisition module specifically for:The inquiry request is obtained from client;
Described device also includes:Conjunctive word sending module, is used for:
When the keyword acquisition module gets a keyword, searched in the conjunctive word searching modul To meet the scope prescribed information limit in the range of one keyword one or more associations After word, one or more of conjunctive words are sent to the client, and to each conjunctive word of transmission, Send the information of the scope of application of the conjunctive word.
17. device as claimed in claim 15, it is characterised in that
The inquiry request acquisition module specifically for:The inquiry request is obtained from client;
Described device also includes:Conjunctive word sending module, is used for:
When the keyword acquisition module gets at least two keywords, in the conjunctive word searching modul For each keyword at least two keyword, find and meet the scope prescribed information institute After one or more conjunctive words of the keyword in the range of restriction, the conjunctive word searching modul is searched To at least two keyword in different keywords conjunctive word between be combined, and will at least Partial key word in two keywords and it is combined between the conjunctive word of remaining keyword found;
For each combination of formation, the scope of application of the combination is determined;
Wherein, if a combination includes keyword, by the scope prescribed information limited range with Common factor between the scope of application of each conjunctive word in the combination, is used as the scope of application of the combination;
If not including keyword in a combination, the scope of application of each conjunctive word during this is combined it Between common factor, be used as the scope of application of the combination;
One or more combinations with the non-NULL scope of application are sent to the client, and to each of transmission Individual combination, sends the information of the scope of application of the combination.
18. the device as described in any one of claim 15~17, it is characterised in that described device also includes: Scope of application information flag module, for obtaining being applicable for each conjunctive word in the conjunctive word searching modul Before the information of scope,
A conjunctive word is obtained from text;
Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including by the word of the scope of application for describing the conjunctive word, labeled as the suitable of the conjunctive word With the information of scope.
19. the device as described in any one of claim 12~18, it is characterised in that the scope limits letter Cease acquisition module specifically for:
The scope prescribed information is obtained from the inquiry request;Or
If the keyword acquisition module gets a keyword and the meaning of a word of one keyword is limited The scope of information search, then generate for describing the information search that the meaning of a word of one keyword is limited Scope, is used as the scope prescribed information;Or
If the keyword acquisition module is got at least two keywords and at least two keyword The meaning of a word of part or all of keyword define the scope of information search, it is determined that it is described part or all of to close The information search scope that the meaning of a word of each keyword in keyword is limited, and each by determination is crucial Common factor is taken between the information search scope that the meaning of a word of word is limited, is used as the scope to limit the common factor and believes Breath.
20. the device as described in any one of claim 12~19, it is characterised in that the inquiry request is obtained Modulus block specifically for:The inquiry request is obtained from client;
Described device also includes:Search result sending module, for being obtained in the search module positioned at described Scope prescribed information limit in the range of information search result after, send obtained letter to the client Search result is ceased, and to each entry in information search result, sends the scope prescribed information.
21. a kind of information acquisition device, it is characterised in that including:
Conjunctive word acquisition module, one or more conjunctive words for obtaining a keyword from text, institute Stating conjunctive word includes synonym and/or near synonym;
Word searching modul, for each conjunctive word obtained for the conjunctive word module, in the text The word of the scope of application for describing the conjunctive word is searched in this;
Range flags module, for the word for finding the word searching modul, labeled as the conjunctive word The scope of application information.
22. device as claimed in claim 21, it is characterised in that described device also includes:
Keyword lookup module, for obtaining one or many of the keyword in the conjunctive word acquisition module Before individual conjunctive word, the keyword is found from the text;
Conjunctive word marker character searching modul, the conjunctive word for finding the keyword from the text is marked Symbol, the conjunctive word marker character is used to mark the conjunctive word of the keyword and associating for the keyword System;
Matching range determining module, for determining matching model of the conjunctive word marker character in the text Enclose, the matching range is used to mark the position range that the conjunctive word is likely to occur in the text;
The range flags module specifically for:
One or more of conjunctive words are obtained out of described matching range.
CN201610179888.9A 2016-03-25 2016-03-25 Information searching method and device Active CN107229659B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610179888.9A CN107229659B (en) 2016-03-25 2016-03-25 Information searching method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610179888.9A CN107229659B (en) 2016-03-25 2016-03-25 Information searching method and device

Publications (2)

Publication Number Publication Date
CN107229659A true CN107229659A (en) 2017-10-03
CN107229659B CN107229659B (en) 2021-06-22

Family

ID=59931969

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610179888.9A Active CN107229659B (en) 2016-03-25 2016-03-25 Information searching method and device

Country Status (1)

Country Link
CN (1) CN107229659B (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108446345A (en) * 2018-03-07 2018-08-24 维沃移动通信有限公司 A kind of data search method and mobile terminal
CN109684633A (en) * 2018-12-14 2019-04-26 北京百度网讯科技有限公司 Search processing method, device, equipment and storage medium
CN110941609A (en) * 2019-10-12 2020-03-31 贝壳技术有限公司 Multi-dimensional searching method and system
CN111241126A (en) * 2020-01-16 2020-06-05 联想(北京)有限公司 Data searching method and device and query interaction method
CN111382374A (en) * 2020-02-29 2020-07-07 中国平安人寿保险股份有限公司 Information display method and device, electronic equipment and storage medium
CN111435376A (en) * 2019-01-15 2020-07-21 北京京东尚科信息技术有限公司 Information processing method and system, computer system, and computer-readable storage medium
CN112464081A (en) * 2020-09-08 2021-03-09 广东省华南技术转移中心有限公司 Project information matching method, device and storage medium
CN112596646A (en) * 2020-12-21 2021-04-02 维沃移动通信有限公司 Information display method and device and electronic equipment
CN112650839A (en) * 2021-01-12 2021-04-13 深圳市鹰硕技术有限公司 Retrieval information optimization method and device
CN112825088A (en) * 2019-11-21 2021-05-21 阿里巴巴集团控股有限公司 Information display method, device, equipment and storage medium
CN113743981A (en) * 2021-08-03 2021-12-03 深圳市东信时代信息技术有限公司 Material putting cost prediction method and device, computer equipment and storage medium
CN114697748A (en) * 2020-12-25 2022-07-01 深圳Tcl新技术有限公司 Video recommendation method based on voice recognition and computer equipment
WO2022262621A1 (en) * 2021-06-17 2022-12-22 华为技术有限公司 Method and apparatus for searching point of information
CN117112736A (en) * 2023-10-24 2023-11-24 云南瀚文科技有限公司 Information retrieval analysis method and system based on semantic analysis model
CN118277537A (en) * 2024-06-03 2024-07-02 福建省君诺科技成果转化服务有限公司 Intellectual property retrieval management method and device based on big data

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09231227A (en) * 1996-02-20 1997-09-05 Inter Group:Kk Information retrieval device and method therefor
CN101888503A (en) * 2010-06-12 2010-11-17 中山大学 Classification retrieving method for digital television program
CN103123632A (en) * 2011-11-21 2013-05-29 阿里巴巴集团控股有限公司 Determining method for searching headword and device of searching headword, searching method and searching equipment
CN103353894A (en) * 2013-07-19 2013-10-16 武汉睿数信息技术有限公司 Data searching method and system based on semantic analysis
CN104268175A (en) * 2014-09-15 2015-01-07 乐视网信息技术(北京)股份有限公司 Data search device and method thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09231227A (en) * 1996-02-20 1997-09-05 Inter Group:Kk Information retrieval device and method therefor
CN101888503A (en) * 2010-06-12 2010-11-17 中山大学 Classification retrieving method for digital television program
CN103123632A (en) * 2011-11-21 2013-05-29 阿里巴巴集团控股有限公司 Determining method for searching headword and device of searching headword, searching method and searching equipment
CN103353894A (en) * 2013-07-19 2013-10-16 武汉睿数信息技术有限公司 Data searching method and system based on semantic analysis
CN104268175A (en) * 2014-09-15 2015-01-07 乐视网信息技术(北京)股份有限公司 Data search device and method thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王屾: ""基于Lucene的同义词扩展检索的研究与实现"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108446345A (en) * 2018-03-07 2018-08-24 维沃移动通信有限公司 A kind of data search method and mobile terminal
CN109684633A (en) * 2018-12-14 2019-04-26 北京百度网讯科技有限公司 Search processing method, device, equipment and storage medium
CN109684633B (en) * 2018-12-14 2023-05-16 北京百度网讯科技有限公司 Search processing method, device, equipment and storage medium
CN111435376A (en) * 2019-01-15 2020-07-21 北京京东尚科信息技术有限公司 Information processing method and system, computer system, and computer-readable storage medium
CN110941609B (en) * 2019-10-12 2023-10-20 贝壳找房(北京)科技有限公司 Multi-dimensional searching method and system
CN110941609A (en) * 2019-10-12 2020-03-31 贝壳技术有限公司 Multi-dimensional searching method and system
CN112825088A (en) * 2019-11-21 2021-05-21 阿里巴巴集团控股有限公司 Information display method, device, equipment and storage medium
CN111241126A (en) * 2020-01-16 2020-06-05 联想(北京)有限公司 Data searching method and device and query interaction method
CN111382374A (en) * 2020-02-29 2020-07-07 中国平安人寿保险股份有限公司 Information display method and device, electronic equipment and storage medium
CN112464081A (en) * 2020-09-08 2021-03-09 广东省华南技术转移中心有限公司 Project information matching method, device and storage medium
CN112596646A (en) * 2020-12-21 2021-04-02 维沃移动通信有限公司 Information display method and device and electronic equipment
CN112596646B (en) * 2020-12-21 2022-05-20 维沃移动通信有限公司 Information display method and device and electronic equipment
CN114697748B (en) * 2020-12-25 2024-05-03 深圳Tcl新技术有限公司 Video recommendation method and computer equipment based on voice recognition
CN114697748A (en) * 2020-12-25 2022-07-01 深圳Tcl新技术有限公司 Video recommendation method based on voice recognition and computer equipment
CN112650839A (en) * 2021-01-12 2021-04-13 深圳市鹰硕技术有限公司 Retrieval information optimization method and device
WO2022262621A1 (en) * 2021-06-17 2022-12-22 华为技术有限公司 Method and apparatus for searching point of information
CN113743981B (en) * 2021-08-03 2023-11-28 深圳市东信时代信息技术有限公司 Material delivery cost prediction method and device, computer equipment and storage medium
CN113743981A (en) * 2021-08-03 2021-12-03 深圳市东信时代信息技术有限公司 Material putting cost prediction method and device, computer equipment and storage medium
CN117112736A (en) * 2023-10-24 2023-11-24 云南瀚文科技有限公司 Information retrieval analysis method and system based on semantic analysis model
CN117112736B (en) * 2023-10-24 2024-01-05 云南瀚文科技有限公司 Information retrieval analysis method and system based on semantic analysis model
CN118277537A (en) * 2024-06-03 2024-07-02 福建省君诺科技成果转化服务有限公司 Intellectual property retrieval management method and device based on big data

Also Published As

Publication number Publication date
CN107229659B (en) 2021-06-22

Similar Documents

Publication Publication Date Title
CN107229659A (en) A kind of information search method and device
CN109710701B (en) Automatic construction method for big data knowledge graph in public safety field
CN104933113B (en) A kind of expression input method and device based on semantic understanding
CN105393263B (en) Feature in compuman's interactive learning is completed
Chen Information visualization: Beyond the horizon
US8972440B2 (en) Method and process for semantic or faceted search over unstructured and annotated data
CN113065003B (en) Knowledge graph generation method based on multiple indexes
CN108268580A (en) The answering method and device of knowledge based collection of illustrative plates
CN105528437B (en) A kind of question answering system construction method extracted based on structured text knowledge
CN104462056B (en) For the method and information handling systems of knouledge-based information to be presented
CN106104518A (en) For the framework extracted according to the data of example
CN110909170B (en) Interest point knowledge graph construction method and device, electronic equipment and storage medium
CN106663117A (en) Constructing a graph that facilitates provision of exploratory suggestions
CN105843796A (en) Microblog emotional tendency analysis method and device
CN109582799A (en) The determination method, apparatus and electronic equipment of knowledge sample data set
CN103617192B (en) The clustering method and device of a kind of data object
CN107784014A (en) Information search method, equipment and electronic equipment
CN104331438B (en) To novel web page contents selectivity abstracting method and device
CN104239570B (en) The searching method and device of paper
CN110309432A (en) Method, map point of interest processing method are determined based on the synonym of point of interest
CN109857952A (en) A kind of search engine and method for quickly retrieving with classification display
CN113190593A (en) Search recommendation method based on digital human knowledge graph
CN105653546A (en) Method and system for searching target theme
Menezes et al. Building a massive corpus for named entity recognition using free open data sources
Castellani Ribeiro et al. An urban data profiler

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200201

Address after: 518129 Bantian HUAWEI headquarters office building, Longgang District, Guangdong, Shenzhen

Applicant after: HUAWEI TECHNOLOGIES Co.,Ltd.

Address before: 210012 HUAWEI Nanjing base, 101 software Avenue, Yuhuatai District, Jiangsu, Nanjing

Applicant before: Huawei Technologies Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant