CN107229659A - A kind of information search method and device - Google Patents
A kind of information search method and device Download PDFInfo
- Publication number
- CN107229659A CN107229659A CN201610179888.9A CN201610179888A CN107229659A CN 107229659 A CN107229659 A CN 107229659A CN 201610179888 A CN201610179888 A CN 201610179888A CN 107229659 A CN107229659 A CN 107229659A
- Authority
- CN
- China
- Prior art keywords
- keyword
- word
- information
- scope
- conjunctive
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of information search method and device, the accuracy to improve the information search result in information seeking processes.A kind of information retrieval device includes:Inquiry request acquisition module, for obtaining inquiry request;Keyword acquisition module, for obtaining at least one keyword from inquiry request;Scope prescribed information acquisition module, for obtaining scope prescribed information;Conjunctive word searching modul, for each keyword at least one keyword, search meet scope prescribed information limit in the range of the keyword one or more conjunctive words;Search module, for according to each keyword found one or more conjunctive words carry out information search, obtain be located at scope prescribed information limit in the range of information search result.During due to carrying out information search by the conjunctive word that finds, in the range of obtained information search result is limited positioned at scope prescribed information, thus information search result is more accurate.
Description
Technical field
The present invention relates to technical field of information processing, more particularly to a kind of information search method and device.
Background technology
With the arrival of information age, people will face the information of a large amount of numerous and complicateds daily, such as:Mutually
Information in networking, then, how to correctly search for out the information of needs from substantial amounts of information to be presented to
User, is a urgent problem.
By taking the information search in internet as an example, during information search, a kind of common searching method
It is to be scanned for according to keyword.But, keyword generally semantically has complexity, such as:One
Word generally can all have multiple synonyms, it is also possible to there are multiple near synonym, if the pass only inputted to user
Keyword is retrieved, it will usually cause the entry searched less, so the pass that generally can be all inputted to user
Keyword and its synonym, near synonym are scanned for, now, how to select synonym, near synonym generally to determine
The accuracy of information search result.
Therefore, synonym and/or near synonym how are accurately determined, to improve the accuracy of information search result
It is a urgent problem to be solved in information seeking processes.
The content of the invention
The embodiment of the present invention provides a kind of information search method and device, to solve to believe in information seeking processes
The problem of accuracy of breath search result is low.
In a first aspect, a kind of information search method of the embodiment of the present invention, this method can be applied to search into row information
On the server of rope, wherein, the server obtains the inquiry request for information search, and from the inquiry
At least one keyword is obtained in request;In addition, the server obtains the scope for prescribed information search
Scope prescribed information, server is at least one keyword described in being obtained from the inquiry request
Each keyword, search meet the scope prescribed information limit in the range of one of the keyword or
Multiple conjunctive words, wherein, conjunctive word may include synonym and/or near synonym;And according to each found
One or more of conjunctive words of keyword carry out information search, obtain being located at the scope prescribed information institute
Information search result in the range of restriction.
Using such scheme, server can according to the scope prescribed information of at least one keyword got,
For each keyword at least one keyword, find out and meet scope prescribed information and limit scope
Interior one or more conjunctive words, and enter according to one or more conjunctive words of each keyword found
Row information search for, obtain be located at scope prescribed information limit in the range of information search result.Wherein, close
Joining word includes synonym and/or near synonym.
By the conjunctive word found out be meet scope prescribed information limit in the range of conjunctive word, therefore
When carrying out information search according to the conjunctive word found, obtained information search result is limited also in scope
Information limit in the range of information search result so that information search result accuracy is higher.
In a kind of possible implementation, if getting a keyword, server is searched entering row information
Suo Shi, can enter row information according to one or more of conjunctive words of the one keyword found and search
Rope;Or according to one or more of conjunctive words of the one keyword found, and it is one
Keyword carries out information search.
Using such scheme, information search or the conjunctive word according to the keyword are carried out according to a keyword
Information search is carried out, is compared with the method for only carrying out information search according to keyword, information search can be expanded
Scope.
Wherein, the situation only scanned for for former according to keyword, the search knot that information search is obtained
Fruit may not include the result searched for and obtained according to conjunctive word;For latter both according to keyword, also according to pass
Join the situation of word search, the search result that information search is obtained includes searching for obtained result according to conjunctive word.
There is provided the implementation of two kinds of information searches in this optional implementation.
In a kind of possible implementation, if server gets at least two keywords, server exists
, can also be by the institute found before the conjunctive word progress information search found after lookup conjunctive word
It is combined between the conjunctive word for stating the different keywords at least two keywords, and at least two by described in
Partial key word in individual keyword and it is combined between the conjunctive word of remaining keyword found;
In information search, information search can be carried out according to each combination of formation;Or
According at least two keyword, and each combination formed carries out information search.Using upper
State scheme, due to carry out information search when be according between keyword and conjunctive word use different combination sides
What the combination that formula is obtained was carried out, therefore all possible combination can be scanned for, ensureing to search
On the premise of the accuracy of hitch fruit, make search result more complete.
Wherein, the situation only scanned for for former according to keyword, the search knot that information search is obtained
Fruit may not include the result searched for and obtained according to conjunctive word;For latter both according to keyword, also according to pass
Join the situation of word search, the search result that information search is obtained includes the result obtained according to keyword search.
There is provided the optional implementation of two kinds of information searches.
In a kind of possible implementation, server can search conjunctive word as follows:
Search all conjunctive words of a keyword;For each conjunctive word found, the association is obtained
The information of the scope of application of word;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make
For meet the scope prescribed information limit in the range of the keyword conjunctive word.
Using such scheme, the information of the scope of application of each conjunctive word due to obtaining keyword, and
Therefrom filter out the scope of application and scope prescribed information limited range identical conjunctive word, it is thus possible to arrange
Except the different conjunctive word of the scope of application, make the conjunctive word filtered out more accurate, so that search result is more
Accurately.
In a kind of possible implementation, server can obtain the inquiry request from client;Server
Find meet the scope prescribed information limit in the range of one or many of one keyword
After individual conjunctive word, one or more of conjunctive words are sent to the client, and each to transmission
Conjunctive word, sends the information of the scope of application of the conjunctive word.
Using such scheme, because server to client have sent one or more conjunctive words, and to sending
Each conjunctive word, send the information of the scope of application of the conjunctive word, thus can have selection in client
Property conjunctive word and its scope of application are shown, facilitate user selection which conjunctive word is row information is entered using
Search.
In a kind of possible implementation, server obtains the inquiry request from client;
If getting at least two keywords, server after conjunctive word is found, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found,
And by between the Partial key word and the conjunctive word of remaining keyword found at least two keywords
It is combined;
For each combination of formation, the scope of application of the combination is determined;
Wherein, if a combination includes keyword, by the scope prescribed information limited range with
Common factor between the scope of application of each conjunctive word in the combination, is used as the scope of application of the combination;If
Do not include keyword in one combination, then by the friendship between the scope of application of each conjunctive word in combining
Collection, is used as the scope of application of the combination;
Server can send one or more combinations with the non-NULL scope of application to the client, and to hair
Each combination sent, sends the information of the scope of application of the combination.
Using such scheme, one or more there is the non-NULL scope of application because server have sent to client
Combination, and to each combination of transmission, send the information of the scope of application of the combination, thus can be
Client there is the scope of application of combination and the combination of the non-NULL scope of application to be shown each, convenient
User's selection carries out information search using which combination.
In a kind of possible implementation, information of the server in the scope of application for obtaining each conjunctive word
Before, a conjunctive word can be obtained from text;
Server judges whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including server is by the word of the scope of application for describing the conjunctive word, labeled as the association
The information of the scope of application of word.
Using such scheme, due to obtaining the word of the scope of application for describing a conjunctive word from text
Language, and using the word as the use scope of conjunctive word information there is provided it is a kind of determine conjunctive word be applicable
The method of scope.
In a kind of possible implementation, server can obtain the scope from the inquiry request and limit
Information;Or
If server gets a keyword and the meaning of a word of one keyword defines information search
Scope, then server generate for describing the information search scope that the meaning of a word of one keyword is limited,
It is used as the scope prescribed information;Or
If server gets part or all of at least two keywords and at least two keyword
The meaning of a word of keyword defines the scope of information search, then server can determine that the part or all of keyword
In each keyword the information search scope that is limited of the meaning of a word, and by each keyword of determination
Common factor is taken between the information search scope that the meaning of a word is limited, the common factor is regard as the scope prescribed information.
Using such scheme, limited due to obtaining scope from inquiry request or from the meaning of a word of keyword
There is provided the method for obtaining scope prescribed information for information.
In a kind of possible implementation, server can obtain the inquiry request from client;Server
Obtain be located at the scope prescribed information limit in the range of information search result after, can be to the visitor
Family end sends obtained information search result, and to each entry in information search result, sends described
Scope prescribed information.
Using such scheme, because server to the client sends obtained information search result, and it is right
Each entry in information search result, sends the scope prescribed information, thus can be in client pair
The scope prescribed information of each entry in information search result and information search result is shown, and is made
Search result is more directly perceived.
Second aspect, the embodiment of the present invention provides a kind of information retrieval device, and the information retrieval device has real
The function of the information search method of existing above-mentioned first aspect.The function can be realized by hardware, can also
Corresponding software is performed by hardware to realize.The hardware or software include one or more with above-mentioned functions phase
Corresponding module.
In a kind of optional implementation, described information searcher includes:Inquiry request acquisition module,
Keyword acquisition module, scope prescribed information acquisition module, conjunctive word searching modul and search module.
Alternatively, word combination module, conjunctive word sending module, scope of application information flag can also be included
Module and search result sending module.
Inquiry request acquisition module is configured as supporting information retrieval device to perform above-mentioned first aspect and provided
Method in acquisition inquiry request function;Keyword acquisition module is configured as supporting information retrieval device
Perform the function of the acquisition keyword in the method that above-mentioned first aspect is provided;Scope prescribed information obtains mould
Block is configured as supporting information retrieval device to perform the acquisition scope in the method that above-mentioned first aspect is provided
The function of prescribed information;Conjunctive word searching modul is configured as supporting information retrieval device to perform above-mentioned first party
The function of the conjunctive word of lookup keyword in the method that face is provided;Search module is configured as supporting information
Searcher performs the function of the search in the method that above-mentioned first aspect is provided;Word combination module by with
It is set to the function for the word combination for supporting information retrieval device to perform in the method that above-mentioned first aspect is provided;
Conjunctive word sending module is configured as supporting information retrieval device to perform the method that above-mentioned first aspect is provided
In transmission conjunctive word function;Scope of application information flag module is configured as supporting information retrieval device to hold
The function of the scope of application information of mark conjunctive word in the method that the above-mentioned first aspect of row is provided;Search knot
Fruit sending module is configured as supporting information retrieval device to perform in the method that above-mentioned first aspect is provided
The function of search result is sent to client.
The third aspect, the embodiment of the present invention provides a kind of information search system, including:Client, for sending out
Send inquiry request and receive search result;
Server, for performing the information search method that above-mentioned first aspect is provided;
Memory, is returned for the database access request of the reception server transmission and by database query result
Back to server.
Fourth aspect, the embodiment of the present invention provides a kind of computer-readable storage medium, for being stored as above-mentioned second
The computer software instructions used in information retrieval device described in aspect, it, which is included, is used to perform above-mentioned aspect institute
The program of design.
5th aspect, the embodiment of the present invention is provided in a kind of information acquisition method, this method, and server is from text
One or more conjunctive words of a keyword are obtained in this, wherein, conjunctive word includes synonym and/or nearly justice
Word;For each conjunctive word of acquisition, server is searched for describing being applicable for the conjunctive word in the text
The word of scope;And by the word found, labeled as the information of the scope of application of the conjunctive word.
In a kind of possible implementation, server can find the keyword and the key from text
The conjunctive word marker character of word;Server determines the matching range of conjunctive word marker character in the text;Then, take
Business device obtains one or more conjunctive words out of matching range.
Wherein, conjunctive word marker character is used for the conjunctive word for marking the keyword with the incidence relation of the keyword,
Matching range is used to mark the position range that conjunctive word is likely to occur in the text.
6th aspect, the embodiment of the present invention provides a kind of information acquisition device, and the device, which has, realizes above-mentioned the
The function of the method for five aspects.The function can be realized by hardware, can also be performed by hardware corresponding
Software realize.The hardware or software include one or more modules corresponding with above-mentioned functions.
In a kind of optional implementation, the information acquisition device includes:Conjunctive word acquisition module, word
Searching modul and range flags module.
Alternatively, keyword lookup module, conjunctive word marker character searching modul and matching range can also be included
Determining module.
Conjunctive word acquisition module is configured as supporting information acquisition device to perform what above-mentioned 5th aspect was provided
The function of acquisition conjunctive word in method;Word searching modul is configured as supporting in information acquisition device execution
State the function that the lookup in the method that the 5th aspect is provided is used to describe the word of the scope of application of conjunctive word;
Range flags module is configured as supporting information acquisition device to perform in the method that above-mentioned 5th aspect is provided
Mark the conjunctive word scope of application function;Keyword lookup module is configured as supporting information acquisition device to hold
The function of lookup keyword in the method that above-mentioned 5th aspect of row is provided;Conjunctive word marker character searching modul
It is configured as supporting information acquisition device to perform the lookup conjunctive word in the method that above-mentioned 5th aspect is provided
The function of marker character;Matching range determining module is configured as supporting information acquisition device to perform above-mentioned 5th side
The function of the matching range of determination conjunctive word marker character in the method that face is provided.
7th aspect, the embodiment of the present invention provides a kind of Information Acquisition System, including:
Client, for sending keyword and receiving acquired information;
Server, for performing the information search method that above-mentioned 5th aspect is provided;
Memory, is returned for the database access request of the reception server transmission and by database query result
Back to server.
Eighth aspect, the embodiment of the present invention provides a kind of computer-readable storage medium, for saving as the above-mentioned 6th
The computer software instructions used in information acquisition device described in aspect, it, which is included, is used to perform above-mentioned aspect institute
The program of design.
To sum up, the embodiment of the present invention provides a kind of information search method and device, wherein, according to what is got
The scope prescribed information of at least one keyword, for each keyword at least one keyword, is looked into
Find out meet scope prescribed information limit in the range of one or more conjunctive words, it is and every according to what is found
One or more conjunctive words of one keyword carry out information search, obtain being limited positioned at scope prescribed information
In the range of information search result.Wherein conjunctive word includes synonym and/or near synonym.
By the conjunctive word found out be meet scope prescribed information limit in the range of conjunctive word, therefore
When carrying out information search according to the conjunctive word found, obtained information search result is limited also in scope
Information limit in the range of information search result so that information search result accuracy is higher.
Brief description of the drawings
Fig. 1 is a kind of schematic diagram of the network architecture of information search system provided in an embodiment of the present invention;
Fig. 2 is a kind of structural representation of server for information search provided in an embodiment of the present invention;
Fig. 3 is a kind of flow chart of information search method provided in an embodiment of the present invention;
Fig. 4 is a kind of scope of application for showing that each is combined and each is combined provided in an embodiment of the present invention
The schematic diagram of the mode of information;
Fig. 5 is a kind of one or more combinations with the non-NULL scope of application of displaying provided in an embodiment of the present invention
Mode schematic diagram;
Fig. 6 is a kind of signal of the mode of client exhibition information search result provided in an embodiment of the present invention
Figure;
Fig. 7 is a kind of flow chart of information acquisition method provided in an embodiment of the present invention;
Fig. 8 is the flow chart of another information search method provided in an embodiment of the present invention;
Fig. 9 is the flow chart of another information acquisition method provided in an embodiment of the present invention;
Figure 10 is a kind of structural representation of information retrieval device provided in an embodiment of the present invention;
Figure 11 is a kind of structural representation of information acquisition device provided in an embodiment of the present invention;
Figure 12 is the structural representation of another information retrieval device provided in an embodiment of the present invention.
Embodiment
The above-mentioned purpose of embodiment, scheme and advantage for a better understanding of the present invention, provided hereinafter detailed
Description.The detailed description by using the accompanying drawings such as block diagram, flow chart and/or example, illustrate device and/or
The various embodiments of method.In these block diagrams, flow chart and/or example, one or more functions are included
And/or operation.It will be understood by the skilled person that:Each function in these block diagrams, flow chart or example
And/or operation, can separately or cooperatively it be implemented by various hardware, software, firmware, or pass through
Any combination of hardware, software and firmware is implemented.
The embodiment of the present invention provides a kind of information search method and device, wherein, according at least one got
The scope prescribed information of individual keyword, for each keyword at least one keyword, finds out symbol
Close scope prescribed information limit in the range of one or more conjunctive words, and according to find each close
One or more conjunctive words of keyword carry out information search, obtain limiting positioned at scope prescribed information in the range of
Information search result.Wherein conjunctive word includes synonym and/or near synonym.
, can be according to scope prescribed information to each keyword using scheme provided in an embodiment of the present invention
One or more conjunctive words are screened, and obtain meeting the conjunctive word of scope prescribed information limited range,
And the one or more conjunctive words obtained according to each keyword and screening carry out information search.Therefore, may be used
To be screened according to scope prescribed information to conjunctive word, and then it can be carried out when carrying out information search
Garbled, more accurate search result.
Below, in order to make it easy to understand, introducing the concept being related in the embodiment of the present invention.
First, keyword and conjunctive word
In information search, it will usually carry out information search, these keywords according to one or more keywords
It can be inputted by user, it is also possible to obtained from text.These keywords, which are used for representative, to be searched for
Information in main contents.
In the embodiment of the present invention, the conjunctive word of a keyword can include the synonym of the keyword and/or near
Adopted word.
Such as, " gardenia also known as cape jasmine, Yellow Fructus Gardeniae ", then can be assumed that the same of gardenia according to this text
Adopted word is cape jasmine or Yellow Fructus Gardeniae.For another example, " discrimination " refers to resolution, difference, and " discriminating " refers to by investigating
And the property or feature of things are determined, the two similar import, it is believed that " discriminating " is the near of " discrimination "
Adopted word.The synonym or near synonym of one word can be called the conjunctive word of this word.
The synonym and/or near synonym of word can over time or the change of the scope of application such as region and it is different.
During social development, the words of some words sense over time or the change of the scope of application such as region and become
Change, such as:In the Yuan Dynasty, woman servant is synonymous with father, in ancient times, and brother is synonymous with sister;Also some words exist
Some region has identical implication, such as in Sichuan province, and people " capsicum " are called " hot pepper ", in Shaanxi
Area, people " capsicum " are called " long and thin hot pepper ".
Existing searching method does not account for being applicable for the keyword to be searched for and its synonym and/or near synonym
Scope, influences the accuracy of search result.Such as, for " in Sichuan province, people capsicum is called hot pepper,
In In Shanxi Area, people capsicum is called long and thin hot pepper " this synonym matched text, when search " Chili Peppers In Sichuan Province, China "
When, " Sichuan hot pepper " and " Sichuan long and thin hot pepper " the two search suggestions and corresponding search result can be provided.But
It is that due to not accounting for keyword " Sichuan " this scope of application prescribed information in search, thus can give
Go out " Sichuan long and thin hot pepper " this search suggestion and corresponding search result, this search suggestion and search result are obvious
Required for not being the user scanned for, thus this search suggestion and search result are redundancies, influence
The accuracy of search result.
2nd, scope prescribed information
Scope prescribed information refers to that being used in inquiry request shows the limit of the query context of this inquiry request
Determine information, such as temporal information or regional information.
When scope prescribed information is temporal information, show inquiry request needs inquiry is the temporal information
In the time range that is characterized, the search result that includes the keyword in inquiry request;When scope prescribed information
During for regional information, show the inquiry request need inquire about be it is in the territorial scope, comprising inquiry request
In keyword search result.
Scope prescribed information can be obtained from inquiry request, can also be from the word of the keyword in inquiry request
Obtained in justice.
Alternatively, priority can be set to the mode that above two obtains scope prescribed information.Such as, can be with
Set:The priority of the scope prescribed information obtained from inquiry request is higher than to be obtained from the meaning of a word of keyword
Scope prescribed information.
The mode of scope prescribed information is obtained from inquiry request can a variety of, and three kinds are only enumerated below from looking into
The example that scope prescribed information is obtained in request is ask, actual acquisition modes are not limited to following three kinds:
Scope prescribed information in the keyword inputted in mode one, inquiry request.
Such as, " the Yuan Dynasty " this scope can be obtained from " woman servant's the Yuan Dynasty " this inquiry request and limits letter
Breath.
Mode two, by setting the functional module of input range prescribed information obtain scope prescribed information.
Such as, input range prescribed information can be used in inquiry request page setup window or plug-in unit.
Mode three, the typing for carrying out by the representation of agreement Query Information.
Such as, it is to need behind scope prescribed information, colon to be before colon when can arrange input inquiry request
The keyword to be inquired about, such as input " Ming Dynasty:During Liu Baiwen ", mark is limited in " Ming Dynasty " this scope
Search " Liu Baiwen " this keyword in the time range that information is limited.
The mode of scope prescribed information is obtained from the meaning of a word of keyword to be:Some personalities, ancient books,
Historical events etc. is substantially associated with the scope prescribed information such as some times, region, then make when these words
When being input to for keyword in inquiry request, these scope prescribed informations can be obtained, the inquiry request is used as
In scope prescribed information.For example, when including " Cao Xueqin " in the keyword of input, can be from " Cao
This keyword of snow celery " is associated with " Qing Dynasty " this region, so that please as this inquiry by " Qing Dynasty "
The scope prescribed information asked.
Alternatively, when scope prescribed information be temporal information or regional information, judge some keyword whether with
Temporal information or regional information are associated, can by set a time tag storehouse or region tag library come
Realize.
Record and (such as gone through with the time corresponding to personality, ancient books, historical events etc. in time tag storehouse
The age that the time of historical event part generation, personality are present) information, when bag in the keyword in inquiry request
During containing these personalities, ancient books, historical events, can by the personality in time tag storehouse, ancient books,
The corresponding temporal information such as historical events as the inquiry request scope prescribed information.
Record and (such as gone through with the region corresponding to personality, ancient books, historical events etc. in the tag library of region
The region of history locale, personality's birth or life) information, when the key in inquiry request
In word include these personalities, ancient books, historical events when, can by the personality in the tag library of region,
The corresponding regional information such as ancient books, historical events as the inquiry request scope prescribed information.
If in addition, scope prescribed information is regional information, can also pass through the user's asked input inquiry
Understood or obtain ground by positioner positioning in IP (Internet Protocol, procotol) address
Domain information.
3rd, the scope of application of conjunctive word
Search some keyword conjunctive word when, the conjunctive word might not under any circumstance all with key
Word is synonymous, but synonymous with the keyword in some scope of application.Such as, in Sichuan province, people claim
Capsicum is hot pepper, then hot pepper is not all synonymous with capsicum in all regions, but only " Sichuan " this
It is synonymous with capsicum in one scope of application;For another example, woman servant is synonymous with father in the Yuan Dynasty, then woman servant is not
It is all synonymous with father in all dynasties, but it is only synonymous with father in " the Yuan Dynasty " this scope of application.On
State the scope of application that " Sichuan " and " the Yuan Dynasty " is conjunctive word.
Alternatively, the scope of application of conjunctive word can be time or region.
4th, conjunctive word marker character
Text in internet or database is analyzed, and then obtains the conjunctive word of some keyword
During, conjunctive word marker character is used for marking the incidence relation that the keyword is associated between word.For example,
For " gardenia is also known as cape jasmine, Yellow Fructus Gardeniae." this text, searching for the conjunctive word of " gardenia "
During, pass through " being also known as " in the text behind " gardenia ", it can be appreciated that after " being also known as "
The word in face is the conjunctive word of " gardenia "." being also known as " is a kind of conjunctive word marker character.
Conjunctive word marker character is not limited to a kind of above-mentioned form, and it can be that word can also be symbol.Such as,
Exist in the entry of gardenia "【Alias】:Cape jasmine, yellow chicken, yellow sprout, Yellow Fructus Gardeniae, yellow Cape jasmine, mountain
Yellow Cape jasmine, beautiful lotus etc.." this text, wherein "【Alias】:" it is also a kind of conjunctive word marker character.
5th, the matching range of conjunctive word marker character in the text
The matching range of conjunctive word marker character in the text is used to mark what conjunctive word was likely to occur in the text
Position range.
Such as, for " gardenia is also known as cape jasmine, Yellow Fructus Gardeniae." this matched text, finding conjunctive word
After marker character, in addition it is also necessary to know that the conjunctive word of the sphere of action of conjunctive word marker character, i.e. gardenia is likely to occur
Position range.By analyzing fullstop last in the text it is recognised that the word after fullstop is no longer Cape jasmine
The conjunctive word of son flower, the i.e. matching range of conjunctive word marker character in the text terminates to fullstop.
Fig. 1 shows a kind of network architecture of information search system.As shown in figure 1, information search system bag
Include:Server 101, client 102 and memory 103, server 101 can also include processor,
Memory and I/O interfaces.
Server 101 passes through processor pair by inquiry request of the I/O interfaces from client 102
The inquiry request of reception is handled, and the search result obtained after processing can be returned into client 102 and entered
Row displaying.The programmed instruction stored in the run memory of server 101, is handled inquiry request.This
Outside, server 101 can also be by the ephemeral data storage produced during processing inquiry request in memory.
Server 101 handle inquiry request when may need access database (such as thesaurus, time tag storehouse,
Region tag library etc.) server 101 memory of itself is can come from, it can be from the memory of outside
103。
Wherein, thesaurus is used for the scope of application for conjunctive word and each conjunctive word for storing keyword
Information, one kind optionally realizes that structure refers to table 1;Time tag storehouse be used for record such as personality,
(the year that such as time of historical events generation, personality are present time corresponding to ancient books, historical events etc.
Generation) information, when including these personalities, ancient books, historical events in the keyword in inquiry request,
The corresponding temporal information such as the personality in time tag storehouse, ancient books, historical events can be looked into as this
Ask the scope prescribed information of request;Region tag library is used to record such as personality, ancient books, historical events
Deng corresponding region (region of place, personality's birth or life that such as historical events occurs) information,
, can be by region when including these personalities, ancient books, historical events in the keyword in inquiry request
The corresponding regional informations such as personality, ancient books, historical events in tag library as the inquiry request model
Enclose prescribed information.
Wherein, the inquiry request of client 102 can be that the search instruction inputted from user (such as, exists
The search instruction inputted on webpage).
Alternatively, client 102 can select exhibition after the search result of the return of server 101 is received
Show the search result.Memory in server 101 can be disk, CD, flash memory.Memory 103
Can be disk array, hard disk, flash memory, CD, the memory technology of use can be conventional storage technologies,
It can also be cloud storage technology.
Fig. 2 is a kind of structural representation of server for information search, letter provided in an embodiment of the present invention
Breath searching method can be applied in server 101 as shown in Figure 2, and the server 101 can be applied to figure
In information search system shown in 1, including I/O interfaces 201, processor 202 and memory 203.
Memory 203 can be used for storage program, database.Memory 203 can be CD, hard disk, interior
Deposit.Wherein, database can be that server execution information searching method in the embodiment of the present invention is called
Program and used database (such as above-mentioned thesaurus, time tag storehouse, region tag library);
Server 101 receives the inquiry request from client by I/O interfaces 201, by 202 pairs of processor
The inquiry request of reception is handled, and can return the search result obtained after processing by I/O interfaces 201
It is shown back to client.The programmed instruction stored in the run memory 203 of server 101, to inquiry
Request is handled.In addition, the ephemeral data produced in processing procedure can also be stored in by processor 202
In memory 203.Processor 202 may need the database accessed (such as synonymous when handling inquiry request
Dictionary, time tag storehouse, region tag library etc.) memory 203 of server 101 itself is can come from,
It can be from the memory of outside;I/O interfaces 201 are used to connect various input/output devices, can be used for
Receive outside search instruction and search result is exported.
Wherein, thesaurus is used for the scope of application for conjunctive word and each conjunctive word for storing keyword
Information, one kind optionally realizes that structure refers to table 1;Time tag storehouse be used for record such as personality,
(the year that such as time of historical events generation, personality are present time corresponding to ancient books, historical events etc.
Generation) information, when including these personalities, ancient books, historical events in the keyword in inquiry request,
The corresponding temporal information such as the personality in time tag storehouse, ancient books, historical events can be looked into as this
Ask the scope prescribed information of request;Region tag library is used to record such as personality, ancient books, historical events
Deng corresponding region (region of place, personality's birth or life that such as historical events occurs) information,
, can be by region when including these personalities, ancient books, historical events in the keyword in inquiry request
The corresponding regional informations such as personality, ancient books, historical events in tag library as the inquiry request model
Enclose prescribed information.
Below, various embodiments of the present invention are described in detail.
Fig. 3 is a kind of flow chart of information search method provided in an embodiment of the present invention.This method can be by Fig. 1
Performed with the server 101 shown in Fig. 2.As shown in figure 3, the flow comprises the following steps:
S301:Obtain the inquiry request for information search;
Alternatively, it can obtain inquiry request from client to obtain inquiry request.
S302:At least one keyword is obtained from inquiry request;
Wherein, maximum forward matching algorithm can be used, passes through the individual character and participle in the inquiry request by input
Dictionary carries out Forward Maximum Method, and participle is carried out to Chinese, the result after participle is extracted, so as to be formed at least
One keyword.Such as, acquisition " woman servant " and " the Yuan Dynasty " two after participle is carried out to " woman servant's the Yuan Dynasty "
Keyword.Alternatively it is also possible to carry out participle with reverse maximum matching method and bi-directional matching method.
S303:Obtain scope prescribed information;
Wherein, scope prescribed information is used for the scope that prescribed information is searched for;
Alternatively, the mode of acquisition scope prescribed information can be:Scope is obtained from inquiry request and limits letter
Breath;Or, if a keyword is got, and the meaning of a word of a keyword defines the scope of information search,
Then generate the scope prescribed information for describing the information search scope that the meaning of a word of a keyword is limited;Or
Person, if two keywords are got, and the meaning of a word of the part or all of keyword in multiple keywords is defined
The scope of information search, it is determined that what the meaning of a word of each keyword in part or all of keyword was limited
The scope of information search;The scope for the information search that the meaning of a word of each keyword of determination is limited takes friendship
Collection;Generate the scope prescribed information for describing the common factor.
Wherein, the concrete mode that scope prescribed information is obtained can refer to the explanation previously with regard to scope prescribed information
Provided in acquisition modes.
S304:For each keyword at least one keyword, lookup meets scope prescribed information institute
One or more conjunctive words of the keyword in the range of restriction;
Wherein, conjunctive word includes synonym and/or near synonym.
Alternatively, search meet scope prescribed information limit in the range of a keyword it is one or more
The mode of conjunctive word can be:Search all conjunctive words of the keyword;For each association found
Word, obtains the information of the scope of application of the conjunctive word;The scope of application and scope prescribed information are limited into scope
Have overlapping conjunctive word, as meet scope prescribed information limit in the range of the keyword conjunctive word.
Alternatively, before the information of the scope of application for obtaining conjunctive word, a pass can also be obtained from text
Join word;Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;If including,
Then by the word of the scope of application for describing the conjunctive word, labeled as the letter of the scope of application of the conjunctive word
Breath.For example, according to " capsicum another name for Sichuan Province claims hot pepper " this text, obtaining keyword " capsicum " and this being crucial
In the conjunctive word " hot pepper " of word, the text, the word " another name for Sichuan Province " comprising the scope of application for describing the conjunctive word,
So " another name for Sichuan Province " can serve as the information of the scope of application of the conjunctive word.
Alternatively, if getting a keyword, meet scope prescribed information finding and limit scope
After one or more conjunctive words (step S304) of an interior keyword, it can also be sent to client
One or more conjunctive words, and to each conjunctive word of transmission, send the letter of the scope of application of the conjunctive word
Breath, the information for showing one or more conjunctive words and its corresponding scope of application in client.
Alternatively, if getting at least two keywords, each in at least one keyword
Keyword, find meet scope prescribed information limit in the range of the keyword one or more associations
, can also be by the pass of the different keywords at least two keywords found after word (step S304)
Connection word between be combined, and by the Partial key word at least two keywords and find remaining pass
It is combined between the conjunctive word of keyword;For each combination of formation, the scope of application of the combination is determined;
One or more combinations with the non-NULL scope of application are sent to client, and to each combination of transmission,
Send the information of the scope of application of the combination, for client show each combination and each combination
The information of the scope of application.
Wherein, if a combination includes keyword, by scope prescribed information limited range and the group
The common factor of the scope of application of each conjunctive word in conjunction, is used as the scope of application of the combination;If a combination
In do not include keyword, then by the common factor of the scope of application of each conjunctive word in combining, be used as the group
The scope of application of conjunction.
Wherein, show that the mode of the information of each combination and the scope of application of each combination can in client
To be to add the information of the scope of application of the combination in the front or behind of each combination, as shown in Figure 4;
Or, when scope prescribed information is two or more, under same scope prescribed information, displaying
Relative one or more combinations with the non-NULL scope of application, for example, when scope prescribed information is
Between information and during regional information, the exhibition methods of one or more combinations with the non-NULL scope of application can be as
Shown in Fig. 5.
S305:Information search is carried out according to one or more conjunctive words of each keyword found, is obtained
Information search result in the range of being limited positioned at scope prescribed information.
Alternatively, if getting a keyword, according to one or many of each keyword found
Individual conjunctive word carries out information search, including:Only according to one or more associations of the keyword found
Word carries out information search;Or according to one or more conjunctive words of the keyword found, and one
Keyword carries out information search.
Alternatively, if getting at least two keywords, each in at least two keywords
Keyword, find meet scope prescribed information limit in the range of the keyword one or more associations
After word (step S304), believed according to one or more conjunctive words of each keyword found
, can also be by the different keywords at least two keywords found before breath search (step S305)
Conjunctive word between be combined, and by the Partial key word at least two keywords and find its
It is combined between the conjunctive word of remaining keyword;According to one or more passes of each keyword found
Join word and carry out information search, can include:Information search is carried out according to each combination of formation;Or according to
At least two keywords, and each combination formed carry out information search.
Alternatively, obtain be located at scope prescribed information limit in the range of information search result after, and also
Including:Obtained information search result is sent to client, and to each entry in information search result,
Range of transmission prescribed information, in client exhibition information search result.
Wherein, the mode of client exhibition information search result can be with as shown in fig. 6, in information search result
The above or below label range prescribed information of (content title for searching for obtained entry), i.e. displaying are searched
While the content title for the entry that rope is obtained, the scope prescribed information associated with the title is shown.
Fig. 7 is a kind of flow chart of information acquisition method of the offer of the embodiment of the present invention, and this method is mainly used
In the information of the scope of application of the conjunctive word of one keyword of acquisition and each conjunctive word from text, information
The result of acquisition can provide the scope prescribed information of some keyword for abovementioned steps S303.Such as Fig. 7 institutes
Show, the flow of this method is as follows:
S701:One or more conjunctive words of a keyword are obtained from text;
Conjunctive word includes synonym and/or near synonym;
Alternatively, can also be from before one or more conjunctive words of a keyword are obtained from text
Keyword is found in text;The conjunctive word marker character of keyword is found from text;Determine conjunctive word marker character
Matching range in the text, matching range is used to mark the position model that conjunctive word is likely to occur in the text
Enclose;One or more conjunctive words of a keyword, Ke Yishi are obtained from text:Obtained out of matching range
Take one or more conjunctive words.
Wherein, conjunctive word marker character is used to mark the conjunctive word of keyword and the incidence relation of keyword.
S702:For each conjunctive word of acquisition, searching is used for the applicable model for describing the conjunctive word in text
The word enclosed;
S703:By the scope of application representated by the word found, labeled as the scope of application of the conjunctive word.
Fig. 8 is the flow chart of another information search method provided in an embodiment of the present invention.Wherein, with key
Word is two, conjunctive word is exemplified by synonym, scope prescribed information are temporal information and regional information, to provide
One example of method shown in Fig. 3.
S801:Obtain the inquiry request for information search;
Such as:Obtain user and input " woman servant's the Yuan Dynasty " this inquiry request in searched page query frame.
Alternatively, inquiry request can be the inquiry request of user's input or by a certain device or be
The inquiry request of system generation.
S802:Extract inquiry content keyword;
Keyword in inquiry content is obtained using certain technological means.It can such as be matched and calculated using maximum forward
Method, reverse maximum matching algorithm or bi-directional matching algorithm carry out participle.Being extracted from the result after participle will
The keyword of inquiry.Such as, participle is carried out to " woman servant's the Yuan Dynasty " and obtains " woman servant " and " the Yuan Dynasty " two
Individual keyword.
Wherein, step S802 can be considered an abovementioned steps S302 example.
S803:Searched for first using the keyword of acquisition, obtain search result first;
The keyword of acquisition is scanned for searching algorithm or instrument, the result obtained here is referred to as
" search result first ", i.e., the search obtained in the case where being not introduced into the synonym of the keyword to be inquired about
As a result.Such as, " woman servant " and " the Yuan Dynasty " two keywords obtained in S803 are scanned for, obtained
Obtain search result first.
The keyword of acquisition is scanned for be considered as and existing searching method identical in step S803
Searching method, the search result first of acquisition can be with the binary search knot that is obtained in step S811 below
Fruit is merged.
S804:Judge whether contain time or regional information in keyword;If so, performing step S806;
If it is not, performing step S805;
Wherein, temporal information, regional information can be considered as an example of aforementioned range prescribed information.
Such as:If the meaning of a word of only one keyword and a keyword defines the scope of information search,
Then generate for describing the information search scope that the meaning of a word of a keyword is limited, limited as foregoing scope
Determine information.
For another example:If there is the part or all of keyword at least two keywords and at least two keywords
The meaning of a word define the scope of information search, it is determined that each keyword in part or all of keyword
The information search scope that the meaning of a word is limited, and the information that the meaning of a word of each keyword of determination is limited searches
Common factor is taken between rope scope, scope prescribed information is used as using occuring simultaneously.
One word related to time or regional information, is not limited only to the word clearly word containing temporal information,
Such as time, place name.It should also also include some words that can be substantially associated with time or regional information,
The time or dynasty in place or books writing as where " Mount Huang " or " A Dream of Red Mansions " can be associated with things.
How to judge that a word is related to time or regional information, can be by setting up a time tag storehouse or ground
Domain tag library is realized.
Be associated with personality in time tag storehouse or region tag library, ancient books, the time of historical events etc. or
Regional information.If the keyword in inquiry content is included in time tag storehouse, itself and temporal information phase
Association;If inquiring about the keyword in content to be included in the tag library of region, it is associated with regional information.
After analyzing keyword, the temporal information or regional information of the crucial word association are exported.If
Keyword has different temporal informations or different regional informations, then these information is taken and occur simultaneously and export.
If the keyword in inquiry content is not associated with temporal information or regional information, then it is assumed that the inquiry request
Associate all times or regional information.
Such as:" woman servant " and " the Yuan Dynasty " two keywords are analyzed, the keyword with association in time is obtained.
" woman servant " and " the Yuan Dynasty " is contrasted with time tag storehouse successively during analysis.In time mark
" woman servant " does not have temporal information in label storehouse, and " the Yuan Dynasty " possesses temporal information.Therefore " member is obtained
Temporal information associated by court ".The temporal information of association can be towards code name:The Yuan Dynasty or time
Section:1271~1236 Christian era, while can also be the information of other expression times.
In the present invention, when the temporal information or regional information of crucial word association can also be asked by input inquiry
Specified, such as to user by providing window or plug-in unit come the temporal information or regional information of input inquiry.
In this case, it is higher than by providing window or the temporal information of plug-in unit input or the priority of regional information
The temporal information or regional information obtained after being contrasted by keyword and tag library.
In addition, the acquisition of the regional information of keyword in inquiry request it is also possible to use IP address understand, it is fixed
The modes such as position device positioning are realized.
Wherein, judge in step S804 keyword whether the purpose containing time or regional information and foregoing step
Rapid S303 is identical, is to find scope prescribed information.
S805:All synonyms are obtained from the thesaurus with time and regional information;
Certainly, thesaurus also can only include temporal information, or only include regional information, or, for portion
Divide synonym, these synonyms have temporal information;And for other synonyms, these synonyms have low
In information.These information are used for the scope of application for limiting synonym, for screening synonym.
S806:From the thesaurus with time and regional information obtain with step S804 in obtain when
Between information or the corresponding synonym of regional information;
Such as:Analyze " woman servant " in the synonym in period in the Yuan Dynasty when, " father " and " mother " this two
Temporal information associated by individual synonym is " the Yuan Dynasty ", then " father " and " mother " believes for the corresponding time
Cease (the Yuan Dynasty) corresponding synonym.
Wherein, synonym is an example of foregoing conjunctive word, time or regional information associated by synonym
The information of the scope of application of conjunctive word in as abovementioned steps S304.
Wherein, step S805 and step S806 can be considered an abovementioned steps S304 example.
S807:Original keyword is substituted using the synonym of acquisition, and the additional phase after the completion of replacement
The temporal information answered, forms new crucial phrase;
Such as:The new keywords group in " woman servant's the Yuan Dynasty " is " woman servant member ", " father's member ", " father's the Yuan Dynasty ",
" mother's member ", " mother's the Yuan Dynasty " etc..
Here the alternative of synonym can be using " full combination " method, the Chinese key group of such as input
" Chinese word 2 " of Chinese word 1, Chinese word 1 has 5 synonyms, and Chinese word 2 has 4 synonyms,
The new keywords group then formed is 29 kinds (29=6*5-1).Wherein, " Chinese is not included in new keywords group
This crucial phrase of the Chinese word 2 " of word 1.
Particular/special requirement is not made to the replacement method of synonym in embodiments herein, as long as can realize synonymous
The replacement of word.
Wherein, step S807 can be considered in abovementioned steps S304, " will when there is at least two keywords
It is combined between the conjunctive word of different keywords at least two keywords found, and will at least
Partial key word in two keywords and be combined between the conjunctive word of remaining keyword found "
One example.
S808:New keywords group with time or regional information is handled, search suggestion is formed;
In step S808, foregoing can be considered to the new keywords group progress processing with time or regional information
In step S304, it will be carried out between the conjunctive word of the different keywords at least two keywords found
Combination, and by the Partial key word at least two keywords and the conjunctive word of remaining keyword found
Between be combined after, one of process of the scope of application of the combination is determined to each obtained combination
Example.
Carry out logicality analysis to the new keywords group of acquisition first, such as " Chinese word 2 " of Chinese word 1 it is new
Crucial phrase is " synonym 1-1 synonym 2-1 " then analyze synonym 1-1 and synonym 2-1 time
It is new crucial if overlapped, then it is assumed that it is an effective new keywords group or whether regional information overlaps
The temporal information of phrase is the common factor of synonym 1-1 and synonym 2-1 temporal information, new keywords group
The common factor of regional information synonym 1-1 and synonym 2-1 regional information.If synonym 1-1 and synonymous
Word 2-1 temporal information or regional information is misaligned, then it is assumed that it is an invalid new keywords group.
Wherein, new keywords group is " by least two keywords found in abovementioned steps S304
Different keywords conjunctive word between be combined, and by the Partial key word at least two keywords
Be combined between the conjunctive word of remaining keyword found " after an obtained example of combination, newly
The temporal information or regional information of crucial phrase are the " scope of application of the combination in abovementioned steps S304
Information " example.
After effective new keywords group is obtained, according to the formation search suggestion of effective new keywords group.Searching
In Suo Jianyi forming process, effective new keywords group can be ranked up, according to setting output wherein
It is one or more, formed search suggestion.The degree of correlation of current new keywords group and former crucial phrase is such as evaluated,
Arranged according to descending, extract the formation search suggestion of the first two new keywords group.Here the evaluation of the degree of correlation can be with
Different modes are taken, are such as ranked up by the length of time span or the size of region of new keywords group,
Or be ranked up according to the number of the historical search number of times of new keywords group.It is right in embodiments of the invention
Sort method is not construed as limiting.
Such as:In new keywords group " woman servant's member ", " father's member ", " father's the Yuan Dynasty ", " mother's member " is " female
It may be selected when search suggestion is chosen in close the Yuan Dynasty " etc. containing a pair of minimum crucial phrases of time range, it is such as " female
Parent " and the synonymy of " woman servant " are not limited to the Yuan Dynasty, and define the Yuan Dynasty in the keyword this time searched for
This scope prescribed information, therefore prioritizing selection contains the search suggestion of " father ", " mother " then conduct
Not preferred search suggestion;And " the Yuan Dynasty " can accurately more state temporal information than " member ", thus it is excellent
First search of the selection containing " the Yuan Dynasty " advises that " member " then advises as not preferred search.Ultimately form
" father's the Yuan Dynasty " this search is advised.
It is alternatively possible to be built obtaining effective new keywords group and forming search according to these new keywords groups
The search suggestion of formation is shown after view, the exhibition method of search suggestion may be referred to Fig. 4 and Fig. 5
Shown exhibition method.
S809:Judge whether to perform the search suggestion formed in step S808;If so, step S811 is performed,
If it is not, performing step S810;
S810:Search result first is obtained, step S813 is performed;
S811:The search formed in step S808 is performed to advise and form binary search result;
For " search result first " for making the search result obtained in step S811 with being obtained in step S803
Distinguish, the search result obtained in step S811 is referred to as " binary search result ".
Alternatively, if the search suggestion formed in step S808 has multiple, it can be selected in search suggestion
In it is one or more perform.
S812:Merge search result and binary search result first;
Wherein, the mode of fusion can be that the result retrieved is ranked up with searching order rule, such as
According to the matching degree of keyword, position, the frequency, the link quality of appearance etc. in webpage, calculate and respectively search
The degree of correlation and ranking grade of hitch fruit, then according to degree of association height, in order return to search result
User.
Such as:Suggestion, search result and the search " woman servant's the Yuan Dynasty " of acquisition must be searched for by performing " father's the Yuan Dynasty "
Acquired search result is blended.
S813:Return to search result and search is advised.
If performing the search suggestion obtained in step S809, the search result returned is search knot first
Search result after really being merged with binary search result, in addition, the search formed in also return to step S808
It is recommended that.
If being not carried out the search suggestion obtained in step S809, the search result returned is search knot first
Really, in addition, the search suggestion formed in also return to step S808.
Search suggestion is returned to after client, client can select displaying search suggestion, such as, searching
" father (the Yuan Dynasty) " is shown in Suo Jianyi columns, " (the Yuan Dynasty) father ", " father, the Yuan Dynasty ", " the Yuan Dynasty,
The search that father " etc. has temporal information is advised.It should be noted that the presentation of final search result is disobeyed
The displaying of Lai Yu search suggestions.
Method shown in Fig. 8 can be considered as a citing of method shown in Fig. 3.In the flow of method shown in Fig. 8
In the embodiment be not described in detail can refer to the description of method shown in Fig. 3.
Fig. 9 shows the flow chart of another information acquisition method provided in an embodiment of the present invention.Shown in Fig. 9
Method can be considered an example of method shown in Fig. 7.Below, with reference to Fig. 9, the present invention is illustrated real
A kind of information acquisition method of example offer is provided.
Fig. 9 gives the synonym that a keyword is obtained from text, and time/region of synonym is believed
The flow chart of the method for breath, may finally form one using the information acquired in this method has time/region
The form of the thesaurus of information, the thesaurus is different from existing thesaurus, and it includes time letter
Breath and regional information, the thesaurus can be by adding temporal information and region in existing thesaurus
Information realization, its structure can be as shown in table 1.
In table 1, keyword is Chinese word, its may have multiple synonyms (synonym 1, it is synonymous
Word 2, synonym 3 etc.).Also to associated by each synonym while the synonym of keyword is recorded
Time or regional information are recorded.
Herein, temporal information can be the dynasty, and in the time, the information such as period, regional information can be region,
The information such as province.
It should be noted that the thesaurus with time or regional information is not limited to the knot shown in table 1
The structure of structure, other times that can embody synonym or regional information also may be used.
Table 1
Wherein, before the thesaurus with time or regional information obtained in the method shown in Fig. 9 can be used for
State the time in step S806 associated by acquisition synonym or regional information.
S901:Obtain synonym matched text;
Web page text is read by spiders technology, or mode is imported etc. by database text and is obtained
With text.Explaining in detail for entry, citation explanation etc. are obtained such as by websites such as " Chinese allusion quotations ".
Wherein, synonym matched text is an example of the text in abovementioned steps S701.
S902:Extract synonym marker character;
Wherein, synonym marker character is an example of the conjunctive word marker character in abovementioned steps S701.
The position that the conjunctive word of the keyword to be searched to mark occurs in the text.
Matched text is traveled through, synonym marker character contained in all matched texts is extracted, such as " abbreviation ", " again
Name " etc..Obtaining the mode of synonym marker character can be, by by the word and mark in synonym matched text
Quasi-synonym marker character storehouse is compared, so as to obtain synonym marker character.
Wherein, standard synonym marker character storehouse is used to record all synonym marker characters.
S903:Judge whether to have analyzed all synonym marker characters;
If so, step S909 is performed, if it is not, performing step S904.
S904:The matching range of next synonym marker character is analyzed, the synonym in the range of this is obtained;
The matching range of synonym marker character be conjunctive word marker character in abovementioned steps S701 in the text
Matching range an example.
There may be multiple synonyms in the matching range of one synonym marker character.Such as " gardenia also known as Cape jasmine
There is the synonym of two " gardenia " in son, Yellow Fructus Gardeniae ":" cape jasmine ", " Yellow Fructus Gardeniae ".So, exist
The matching range of the synonym marker character is also obtained after obtaining synonymous unified word marker character, to determine which is arrived
Untill word or which punctuate, in text behind word be no longer the keyword synonym.
The acquisition of matching range can be divided by words, sentence is divided, and paragraph is drawn grading mode and realized.
During some knowledge class texts are explained, during such as the entry of " Chinese allusion quotation " is explained, synonym marker character is more special,
As " word explanation ", " citation is explained " ensuing several sections of texts may be explained all to the entry
Content, this several sections of texts belong to matching range.
S905:Import time tag storehouse and region tag library;
Time tag storehouse and region tag library are used to record and historical events, personage, books, the correlation such as article
The temporal information and regional information of connection.The temporal information such as associated with " Cao Xueqin " can for " Qing Dynasty " or
It is lived the time;The regional information associated with " Mount Huang " can be " Anhui " or " Mt. Huang in Anhui city " etc.
Regional information.
S906:Obtain the time in synonym marker character or regional information;
As having and temporal information in the synonym marker character " Ming Dynasty claims " in " the sub- Ming Dynasty Cheng Dong gardens of The South Pool "
Related word " Ming Dynasty ", " Ming Dynasty " can as " Dong Yuan " this synonym temporal information.Herein to same
Time or regional information in adopted word marker character do not make particular/special requirement, are not limited to the above method.
S907:The time in matching range or regional information are obtained, and it is associated with synonym;
Time or regional information in matching range obtain can also by comprising content of text carry out
Acquired results are simultaneously contrasted and realized by participle with time tag storehouse and region tag library.
After time or the regional information in matching range is obtained, it is associated on the synonym that it includes.
Establishment on the time in matching range or regional information and synonym to correlation time information, can use but
It is not limited to following method.
I) contain one or more synonyms in matching range, contain a time or regional information.It is all
Synonymous word association unique time or regional information.
II) contain one or more synonyms in matching range, contain multiple times or regional information.Each
Closest time or regional information in sentence where synonymous word association or paragraph.If nothing in current paragraph
Temporal information association " modern times " or " current " etc. the expression of correlation time and regional information, the then synonym
Current temporal information, regional information associates the regional information that " Zone Full " etc. represents all regions.
III one or more synonyms pair, no time or regional information) are contained in matching range.By synonym
To the current temporal information of temporal information association " modern times " or " current " etc. expression, association " whole areas
Domain " etc. represents the temporal information in all regions.
Wherein, step S907 can be considered an abovementioned steps S702~step S703 example
S908:The step S907 synonyms with time and regional information obtained are added into thesaurus
In.
It is alternatively possible to carry out filtration treatment to the synonym for adding thesaurus, i.e.,:If existed
Time associated by the synonym or regional information, then be added to as shown in table 1 same by identical synonym
Temporal information or the column of regional information one in adopted dictionary.If there is no identical synonym, then when will have
Between or the synonym of regional information be added in thesaurus, and record the temporal information associated by the synonym
Or regional information.
Perform after step S908, return to step S903.That is, step S903~step S908 is a circulation
Process, until synonym marker character all in text all analyzes completion, cyclic process terminates, and output has
The thesaurus of temporal information and regional information.
S909:Thesaurus of the output with time and regional information.
Method shown in Fig. 9 can be considered as not detailed in an example of method shown in Fig. 7, method shown in Fig. 9
The part of description, which can refer in Fig. 7, accordingly to be described.
Figure 10 is a kind of structural representation of information retrieval device provided in an embodiment of the present invention, and the information is searched
Rope device is used to perform the information search method shown in Fig. 3.As shown in Figure 10, the device includes:
Inquiry request acquisition module 1001, for obtaining the inquiry request for information search;
Keyword acquisition module 1002, for obtaining at least one keyword from inquiry request;
Scope prescribed information acquisition module 1003, for obtaining scope prescribed information, scope prescribed information is used for
The scope of prescribed information search;
Conjunctive word searching modul 1004, for for each keyword at least one keyword, searching
Meet scope prescribed information limit in the range of the keyword one or more conjunctive words, conjunctive word includes
Synonym and/or near synonym;
Search module 1005, for each keyword for being found according to conjunctive word searching modul 1004
One or more conjunctive words carry out information searches, obtain being located at scope prescribed information limit in the range of information
Search result.
Alternatively, search module 1005 is pressed when keyword acquisition module 1002 gets a keyword
One or more conjunctive words of the keyword found according to conjunctive word searching modul 1004 enter row information and searched
Rope;Or one or more conjunctive words of the keyword found according to conjunctive word searching modul 1004,
And the keyword that keyword acquisition module 1002 is got carries out information search.
Alternatively, the information retrieval device also includes:Word combination module, in keyword acquisition module
1002 when getting at least two keywords, in conjunctive word searching modul 1004 at least two keywords
In each keyword, find meet scope prescribed information limit in the range of one of the keyword
Or after multiple conjunctive words, search module 1005 according to conjunctive word searching modul 1004 find it is each
One or more conjunctive words of individual keyword are carried out before information search, and conjunctive word searching modul 1004 is looked into
It is combined between the conjunctive word of different keywords at least two keywords found, and will at least two
Partial key word in individual keyword and it is combined between the conjunctive word of remaining keyword found;
Search module 1005 specifically for:At least two keywords are got in keyword acquisition module 1002
When, carry out information search according to each combination of word combination module formation;Or it is crucial according at least two
Word, and each combination of word combination module formation carry out information search.
Wherein, search module 1005, can be merely with conjunctive word searching modul 1004 when carrying out information search
The conjunctive word found is scanned for, the keyword that can also be obtained using keyword acquisition module 1002
And the conjunctive word that conjunctive word searching modul 1004 is found is scanned for.
Alternatively, conjunctive word searching modul 1004 specifically for:For each at least one keyword
Individual keyword, searches all conjunctive words of the keyword;For each conjunctive word found, obtaining should
The information of the scope of application of conjunctive word;Have overlapping between with scope prescribed information the scope of application is limited into scope
Conjunctive word, as meet scope prescribed information limit in the range of the keyword conjunctive word.
Alternatively, inquiry request acquisition module 1001 specifically for:Inquiry request is obtained from client;
The information retrieval device also includes:Conjunctive word sending module, is used for:
When keyword acquisition module 1002 gets a keyword, looked into conjunctive word searching modul 1004
Find meet scope prescribed information limit in the range of a keyword one or more conjunctive words after,
One or more conjunctive words are sent to client, and to each conjunctive word of transmission, send the conjunctive word
The information of the scope of application;
Or be used for:
When keyword acquisition module 1002 gets at least two keywords, in conjunctive word searching modul
1004 for each keyword at least two keywords, finds and meets scope prescribed information and limited
In the range of the keyword one or more conjunctive words after, conjunctive word searching modul 1004 is found
At least two keywords in different keywords conjunctive word between be combined, and by least two close
Partial key word in keyword and it is combined between the conjunctive word of remaining keyword found;For being formed
Each combination, determine the scope of application of the combination;Send one or more suitable with non-NULL to client
With the combination of scope, and to each combination of transmission, send the information of the scope of application of the combination.
Wherein, if a combination includes keyword, by scope prescribed information limited range and the group
Common factor between the scope of application of each conjunctive word in conjunction, is used as the scope of application of the combination;
If not including keyword in a combination, the scope of application of each conjunctive word during this is combined it
Between common factor, be used as the scope of application of the combination.
Alternatively, the information retrieval device also includes:Scope of application information flag module, in conjunctive word
Searching modul 1004 is obtained before the information of the scope of application of each conjunctive word, and one is obtained from text
Conjunctive word;Judge whether include being used to describe the word of the scope of application of the conjunctive word in text;If including,
Then by the word of the scope of application for describing the conjunctive word, labeled as the letter of the scope of application of the conjunctive word
Breath.
Alternatively, scope prescribed information acquisition module 1003 specifically for:
Scope prescribed information is obtained from inquiry request;Or
If keyword acquisition module 1002 gets a keyword and the meaning of a word of a keyword is defined
The scope of information search, then generate for describing the information search scope that the meaning of a word of a keyword is limited,
It is used as scope prescribed information;Or
If keyword acquisition module 1002 is got at least two keywords and at least two keywords
The meaning of a word of part or all of keyword defines the scope of information search, it is determined that in part or all of keyword
Each keyword the information search scope that is limited of the meaning of a word, and by the word of each keyword of determination
Common factor is taken between the information search scope that justice is limited, scope prescribed information is used as using occuring simultaneously.
Alternatively, inquiry request acquisition module 1001 specifically for:Inquiry request is obtained from client;Dress
Putting also includes:Search result sending module, for obtaining being located at scope prescribed information in search module 1005
After information search result in the range of limiting, obtained information search result is sent to client, and it is right
Each entry in information search result, range of transmission prescribed information.
In Figure 10 shown devices, inquiry request, which obtains mould 1001, to be used to perform abovementioned steps S301;It is crucial
Word acquisition module 1002 is used to perform abovementioned steps S302;Scope prescribed information acquisition module 1003 is used to hold
Row abovementioned steps S303;Conjunctive word searching modul 1004 is used to perform abovementioned steps S304;Search module
1005 are used to perform abovementioned steps S305;Word combination module is used to perform difference in abovementioned steps S304
It is combined between the conjunctive word of keyword and carries out the conjunctive word of Partial key word and remaining keyword
The step of combination;Conjunctive word composite module, which is used to perform in abovementioned steps S304, to be sent conjunctive word and its is applicable
The step of scope;Scope of application information flag module is used to perform the mark conjunctive word in abovementioned steps S304
The scope of application the step of;Search result sending module is used for after performing abovementioned steps S305, will search for
As a result it is sent to client.
The function not being described in detail in each module shown in Figure 10 and operation, in detail as shown in Figure 3 in flow
Corresponding description.
The modules included by device shown in Figure 10, can be by the processor 202 in Fig. 2 when realizing
The programmed instruction that is stored in run memory 203 is realized., may when each module performs corresponding operation
Can be related to server 101 and other equipment, such as:Client 102 or outside memory 103 it
Between interaction, can be controlled when realizing by processor 202 I/O interfaces 201 complete these interaction.In addition,
When modules perform corresponding operation, the access to memory 203 may be related to, can be by when realizing
Processor 202 obtains data storage from memory 203.
A kind of structural representation for information acquisition device that Figure 11 provides for the application, as shown in figure 11, should
Device includes:
Conjunctive word acquisition module 1101, one or more associations for obtaining a keyword from text
Word, conjunctive word includes synonym and/or near synonym;
Word searching modul 1102, for each conjunctive word obtained for conjunctive word module, in the text
Search the word of the scope of application for describing the conjunctive word;
Range flags module 1103, for the word for finding word searching modul, labeled as the conjunctive word
The scope of application information.
Alternatively, the device also includes:
Keyword lookup module, one or more conjunctive words for obtaining keyword in conjunctive word acquisition module
Before, keyword is found from text;
Conjunctive word marker character searching modul, states the conjunctive word marker character that keyword is found in text, conjunctive word mark
Note symbol is used to mark the conjunctive word of keyword and the incidence relation of keyword;
Matching range determining module, for determining the matching range of conjunctive word marker character in the text, matches model
Enclose the position range for marking conjunctive word to be likely to occur in the text;
Range flags module specifically for:
One or more conjunctive words are obtained out of matching range.
In information acquisition device shown in Figure 11, conjunctive word acquisition module 1101 is used to perform abovementioned steps
S701, word searching modul 1102 is used to perform abovementioned steps S702, and range flags module 1103 is used to hold
Row abovementioned steps S703, the lookup keyword that keyword lookup module is used to perform in abovementioned steps S701
Operation, conjunctive word marker character searching modul is used to perform the lookup conjunctive word marker character in abovementioned steps S701
Operation, matching range determining module is used to perform determination conjunctive word marker character in abovementioned steps S701
The operation of matching range.
The modules included by device shown in Figure 11, can be by the processor 202 in Fig. 2 when realizing
The program stored in memory 203 is called to realize.When each module performs corresponding operation, it may relate to
And to server 101 and other equipment, such as:Between the memory 103 of client 102 or outside
Interaction, can be controlled I/O interfaces 201 to complete these interactions by processor 202 when realizing.In addition, at each
When module performs corresponding operation, the access to memory 203 may be related to, can be by handling when realizing
Device 202 obtains data storage from memory 203.
The detailed flow as shown in Figure 7 of function or operation that information acquisition device shown in Figure 11 is not described in detail
In corresponding description.
Below, with reference to Figure 12, another information retrieval device provided in an embodiment of the present invention is illustrated.Its
In, Figure 12 using keyword be at least two, conjunctive word as synonym, scope prescribed information be temporal information
Exemplified by regional information, an example of Figure 10 shown devices is provided.
As shown in figure 12, the information retrieval device includes:
Keyword acquisition module 1201, the keyword for obtaining search from client.Wherein, keyword can
To pass through the keyword that participle is obtained for the search statement that is inputted by user, or user specifies or selected
The keyword selected, or select or input by some setting input windows and obtain keyword etc..
Thesaurus memory module 1202 with time or regional information, for storing keyword acquisition module
The synonym of 1201 keywords obtained, the synonym has time or regional information, and its structure can be table
Structure shown in 1.In table 1, temporal information can be dynasty, time, the information such as period, region letter
It can be region to cease, the information such as province.
Thesaurus memory module with time or regional information is not limited to the structure described in table 1, its
It can embody the time of synonym or the result of regional information also may be used.
Synonym processing module 1205 is when handling synonym according to time or regional information
The temporal information or regional information that thesaurus memory module 1202 obtains each synonym are (i.e. foregoing to close
Join an example of the information of the scope of application of word).
Time/region tag library memory module 1203, for record and historical events, personage, books, thing
The words such as product are associated time or regional information, in the embodiment shown in fig. 12, keyword processing module
It will be recorded when 1204 pairs of keywords are handled in keyword and time/region tag library memory module 1203
Word is contrasted, and obtains the temporal information included in keyword or regional information (i.e. aforementioned range restriction
One example of information).
Keyword processing module 1204, for judging whether keyword is related to time or regional information, if
It is related then obtain corresponding time or regional information.By keyword and time/region tag library memory module
The word recorded in 1203 is contrasted, if keyword is included in time/region tag library memory module 1203
In, then obtain the time of the keyword or regional information in time/region tag library memory module 1203.This
Outside, if the not no keyword related to time or regional information, its output time information can be following two:
Without time or regional information, or all times or regional information.As needed one of which can be selected defeated
Go out mode.
Synonym processing module 1205, for the keyword that obtains keyword acquisition module 1201 with having
The synonym of the keyword in the thesaurus memory module 1202 of time or regional information is substituted,
And additional period or regional information, form the synonym crucial phrase with time or regional information (i.e. foregoing
One example of the new keywords group in step S807).
Search suggestion processing module 1206, the synonym with time or regional information obtained for filtering
Crucial phrase, forms search suggestion.
Search suggestion sending module 1207, builds for sending the search with time or regional information to client
View.
Search module 1208, is scanned for for treating searching keyword group and its synonym crucial phrase.
Search result is stored and sending module 1209, for storing search result and sending search knot to client
Really.
In Figure 12, keyword acquisition module 1201 is an example of foregoing keyword acquisition module 1001;
Thesaurus memory module 1202 with time or regional information has time or ground for what is obtained in Fig. 9
One example of the thesaurus of domain information, for meeting for foregoing conjunctive word searching modul 1004 in lookup
Scope prescribed information limit in the range of the keyword one or more conjunctive words when provide information sum
According to;Time/region tag library memory module 1203 is that aforementioned range prescribed information acquisition module 1003 is being obtained
Information and data are provided during scope prescribed information;Keyword processing module 1204 is aforementioned range prescribed information
One example of acquisition module 1003;Synonym processing module 1205 is one of foregoing word combination module
Example;Search suggestion processing module 1206 provides conjunctive word and its applicable model for foregoing keyword sending module
The information enclosed;Search suggestion sending module 1207 is an example of foregoing conjunctive word sending module;Search
Module 1208 is an example of previous searches module 1005;Search result is stored and sending module 1209
For an example of previous searches result sending module.
The function for each module not being described in detail in Figure 12 and operation refer to the corresponding description in Figure 10.
The modules included by device shown in Figure 12, can be by the processor 202 in Fig. 2 when realizing
The programmed instruction that is stored in run memory 203 is realized., may when each module performs corresponding operation
Can be related to server 101 and other equipment, such as:Client 102 or outside memory 103 it
Between interaction, can be controlled when realizing by processor 202 I/O interfaces 201 complete these interaction.In addition,
When modules perform corresponding operation, the access to memory 203 may be related to, can be by when realizing
Processor 202 obtains data storage from memory 203.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or meter
Calculation machine program product.Therefore, the present invention can be using complete hardware embodiment, complete software embodiment or knot
The form of embodiment in terms of conjunction software and hardware.Wherein wrapped one or more moreover, the present invention can be used
Containing computer usable program code computer-usable storage medium (include but is not limited to magnetic disk storage,
CD-ROM, optical memory etc.) on the form of computer program product implemented.
The present invention is with reference to the production of method according to embodiments of the present invention, equipment (system) and computer program
The flow chart and/or block diagram of product is described.It should be understood that can by computer program instructions implementation process figure and
/ or each flow and/or square frame in block diagram and the flow in flow chart and/or block diagram and/
Or the combination of square frame.These computer program instructions can be provided to all-purpose computer, special-purpose computer, insertion
Formula processor or the processor of other programmable data processing devices are to produce a machine so that pass through and calculate
The instruction of the computing device of machine or other programmable data processing devices is produced for realizing in flow chart one
The device for the function of being specified in individual flow or multiple flows and/or one square frame of block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or the processing of other programmable datas to set
In the standby computer-readable memory worked in a specific way so that be stored in the computer-readable memory
Instruction produce include the manufacture of command device, the command device realization in one flow or multiple of flow chart
The function of being specified in one square frame of flow and/or block diagram or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices, made
Obtain and perform series of operation steps on computer or other programmable devices to produce computer implemented place
Reason, so that the instruction performed on computer or other programmable devices is provided for realizing in flow chart one
The step of function of being specified in flow or multiple flows and/or one square frame of block diagram or multiple square frames.
, but those skilled in the art once know base although preferred embodiments of the present invention have been described
This creative concept, then can make other change and modification to these embodiments.So, appended right will
Ask and be intended to be construed to include preferred embodiment and fall into having altered and changing for the scope of the invention.
Obviously, those skilled in the art can carry out various changes and modification without departing from this hair to the present invention
Bright spirit and scope.So, if the present invention these modifications and variations belong to the claims in the present invention and
Within the scope of its equivalent technologies, then the present invention is also intended to comprising including these changes and modification.
Claims (22)
1. a kind of information search method, it is characterised in that including:
Obtain the inquiry request for information search;
At least one keyword is obtained from the inquiry request;
Scope prescribed information is obtained, the scope prescribed information is used for the scope that prescribed information is searched for;
For each keyword at least one described keyword, lookup meets the scope prescribed information
One or more conjunctive words of the keyword in the range of limiting, the conjunctive word includes synonym and/or near
Adopted word;
Information search is carried out according to one or more of conjunctive words of each keyword found, is obtained
Positioned at the scope prescribed information limit in the range of information search result.
2. the method as described in claim 1, it is characterised in that if getting a keyword, press
Information search is carried out according to one or more of conjunctive words of each keyword found, including:
Information search is carried out according to one or more of conjunctive words of the one keyword found;Or
According to one or more of conjunctive words of the one keyword found, and one pass
Keyword carries out information search.
3. the method as described in claim 1, it is characterised in that if getting at least two keywords,
Then each keyword in at least two keyword, finds and meets the scope restriction letter
Breath limit in the range of the keyword one or more conjunctive words after, according to find each close
One or more of conjunctive words of keyword are carried out before information search, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found,
And by the Partial key word at least two keyword and the conjunctive word of remaining keyword found
Between be combined;
One or more of conjunctive words according to each keyword found carry out information search, bag
Include:
Information search is carried out according to each combination of formation;Or
According at least two keyword, and each combination formed carries out information search.
4. the method as described in any one of claims 1 to 3, it is characterised in that lookup meets the scope
Prescribed information limit in the range of a keyword one or more conjunctive words, including:
Search all conjunctive words of the keyword;
For each conjunctive word found, the information of the scope of application of the conjunctive word is obtained;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make
For meet the scope prescribed information limit in the range of the keyword conjunctive word.
5. method as claimed in claim 4, it is characterised in that obtain the inquiry request, including:
The inquiry request is obtained from client;
If getting a keyword, meet finding in the range of the scope prescribed information limits
After one or more conjunctive words of one keyword, in addition to:
One or more of conjunctive words are sent to the client, and to each conjunctive word of transmission, hair
Give the information of the scope of application of the conjunctive word.
6. method as claimed in claim 4, it is characterised in that obtain the inquiry request, including:
The inquiry request is obtained from client;
If getting at least two keywords, each in at least two keyword is crucial
Word, find meet the scope prescribed information limit in the range of the keyword one or more associations
After word, in addition to:
It will be combined between the conjunctive word of different keywords at least two keyword found,
And by between the Partial key word and the conjunctive word of remaining keyword found at least two keywords
It is combined;
For each combination of formation, the scope of application of the combination is determined;Wherein, if being wrapped in a combination
Include keyword, then each conjunctive word during the scope prescribed information limited range is combined with this
Common factor between the scope of application, is used as the scope of application of the combination;If not including keyword in a combination,
Then by the common factor between the scope of application of each conjunctive word in combining, the applicable model of the combination is used as
Enclose;
One or more combinations with the non-NULL scope of application are sent to the client, and to each of transmission
Individual combination, sends the information of the scope of application of the combination.
7. the method as described in any one of claim 4~6, it is characterised in that obtain each conjunctive word
The scope of application information before, in addition to:
A conjunctive word is obtained from text;
Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including by the word of the scope of application for describing the conjunctive word, labeled as the suitable of the conjunctive word
With the information of scope.
8. the method as described in any one of claim 1~7, it is characterised in that obtain the scope and limit
Information, including:
The scope prescribed information is obtained from the inquiry request;Or
If the meaning of a word for getting a keyword and one keyword defines the scope of information search,
Then generate for describing the information search scope that the meaning of a word of one keyword is limited, be used as the scope
Prescribed information;Or
If getting the part or all of keyword at least two keywords and at least two keyword
The meaning of a word define the scope of information search, it is determined that each in the part or all of keyword is crucial
The information search scope that the meaning of a word of word is limited, and the letter that the meaning of a word of each keyword of determination is limited
Breath takes common factor between hunting zone, regard the common factor as the scope prescribed information.
9. the method as described in any one of claim 1~8, it is characterised in that obtain the inquiry request,
Including:The inquiry request is obtained from client;
Obtain be located at the scope prescribed information limit in the range of information search result after, also wrap
Include:
Obtained information search result is sent to the client, and to each in information search result
Mesh, sends the scope prescribed information.
10. a kind of information acquisition method, it is characterised in that including:
From text obtain a keyword one or more conjunctive words, the conjunctive word include synonym and
/ or near synonym;
For each conjunctive word of acquisition, the applicable model for describing the conjunctive word is searched in the text
The word enclosed;
By the word found, labeled as the information of the scope of application of the conjunctive word.
11. the method stated such as claim 10 a, it is characterised in that keyword is being obtained from text
One or more conjunctive words before, in addition to:
The keyword is found from the text;
The conjunctive word marker character of the keyword is found from the text, the conjunctive word marker character is used to mark
Remember the conjunctive word of the keyword and the incidence relation of the keyword;
Matching range of the conjunctive word marker character in the text is determined, the matching range is used to mark
The position range that the conjunctive word is likely to occur in the text;
One or more conjunctive words of a keyword are obtained from text, including:
One or more of conjunctive words are obtained out of described matching range.
12. a kind of information retrieval device, it is characterised in that including:
Inquiry request acquisition module, for obtaining the inquiry request for information search;
Keyword acquisition module, for obtaining at least one keyword from the inquiry request;
Scope prescribed information acquisition module, for obtaining scope prescribed information, the scope prescribed information is used for
The scope of prescribed information search;
Conjunctive word searching modul, for for each keyword at least one described keyword, searching
Meet the scope prescribed information limit in the range of the keyword one or more conjunctive words, the pass
Joining word includes synonym and/or near synonym;
Search module, for found according to the conjunctive word searching modul described the one of each keyword
Individual or multiple conjunctive words carry out information searches, obtain being located at the scope prescribed information limit in the range of letter
Cease search result.
13. device as claimed in claim 12, it is characterised in that the search module specifically for:
When the keyword acquisition module gets a keyword,
One or more of passes of the one keyword found according to the conjunctive word searching modul
Join word and carry out information search;Or
One or more of passes of the one keyword found according to the conjunctive word searching modul
Join word, and one keyword that the keyword acquisition module is got carries out information search.
14. device as claimed in claim 12, it is characterised in that
Described device also includes:Word combination module, for being got at least in the keyword acquisition module
During two keywords, in the conjunctive word searching modul for each pass at least two keyword
Keyword, find meet the scope prescribed information limit in the range of the keyword one or more passes
After connection word, each keyword found in the search module according to the conjunctive word searching modul
One or more of conjunctive words are carried out before information search, the institute that the conjunctive word searching modul is found
It is combined between the conjunctive word for stating the different keywords at least two keywords, and at least two is closed
Partial key word in keyword and it is combined between the conjunctive word of remaining keyword found;
The search module specifically for:At least two keywords are got in the keyword acquisition module
When,
Information search is carried out according to each combination of word combination module formation;Or
Each combination according at least two keyword, and word combination module formation is carried out
Information search.
15. the device as described in any one of claim 12~14, it is characterised in that the conjunctive word is searched
Module specifically for:
For each keyword at least one described keyword, the institute for searching the keyword is relevant
Word;
For each conjunctive word found, the information of the scope of application of the conjunctive word is obtained;
There is overlapping conjunctive word between with the scope prescribed information scope of application is limited into scope, make
For meet the scope prescribed information limit in the range of the keyword conjunctive word.
16. device as claimed in claim 15, it is characterised in that
The inquiry request acquisition module specifically for:The inquiry request is obtained from client;
Described device also includes:Conjunctive word sending module, is used for:
When the keyword acquisition module gets a keyword, searched in the conjunctive word searching modul
To meet the scope prescribed information limit in the range of one keyword one or more associations
After word, one or more of conjunctive words are sent to the client, and to each conjunctive word of transmission,
Send the information of the scope of application of the conjunctive word.
17. device as claimed in claim 15, it is characterised in that
The inquiry request acquisition module specifically for:The inquiry request is obtained from client;
Described device also includes:Conjunctive word sending module, is used for:
When the keyword acquisition module gets at least two keywords, in the conjunctive word searching modul
For each keyword at least two keyword, find and meet the scope prescribed information institute
After one or more conjunctive words of the keyword in the range of restriction, the conjunctive word searching modul is searched
To at least two keyword in different keywords conjunctive word between be combined, and will at least
Partial key word in two keywords and it is combined between the conjunctive word of remaining keyword found;
For each combination of formation, the scope of application of the combination is determined;
Wherein, if a combination includes keyword, by the scope prescribed information limited range with
Common factor between the scope of application of each conjunctive word in the combination, is used as the scope of application of the combination;
If not including keyword in a combination, the scope of application of each conjunctive word during this is combined it
Between common factor, be used as the scope of application of the combination;
One or more combinations with the non-NULL scope of application are sent to the client, and to each of transmission
Individual combination, sends the information of the scope of application of the combination.
18. the device as described in any one of claim 15~17, it is characterised in that described device also includes:
Scope of application information flag module, for obtaining being applicable for each conjunctive word in the conjunctive word searching modul
Before the information of scope,
A conjunctive word is obtained from text;
Judge whether include being used to describe the word of the scope of application of the conjunctive word in the text;
If including by the word of the scope of application for describing the conjunctive word, labeled as the suitable of the conjunctive word
With the information of scope.
19. the device as described in any one of claim 12~18, it is characterised in that the scope limits letter
Cease acquisition module specifically for:
The scope prescribed information is obtained from the inquiry request;Or
If the keyword acquisition module gets a keyword and the meaning of a word of one keyword is limited
The scope of information search, then generate for describing the information search that the meaning of a word of one keyword is limited
Scope, is used as the scope prescribed information;Or
If the keyword acquisition module is got at least two keywords and at least two keyword
The meaning of a word of part or all of keyword define the scope of information search, it is determined that it is described part or all of to close
The information search scope that the meaning of a word of each keyword in keyword is limited, and each by determination is crucial
Common factor is taken between the information search scope that the meaning of a word of word is limited, is used as the scope to limit the common factor and believes
Breath.
20. the device as described in any one of claim 12~19, it is characterised in that the inquiry request is obtained
Modulus block specifically for:The inquiry request is obtained from client;
Described device also includes:Search result sending module, for being obtained in the search module positioned at described
Scope prescribed information limit in the range of information search result after, send obtained letter to the client
Search result is ceased, and to each entry in information search result, sends the scope prescribed information.
21. a kind of information acquisition device, it is characterised in that including:
Conjunctive word acquisition module, one or more conjunctive words for obtaining a keyword from text, institute
Stating conjunctive word includes synonym and/or near synonym;
Word searching modul, for each conjunctive word obtained for the conjunctive word module, in the text
The word of the scope of application for describing the conjunctive word is searched in this;
Range flags module, for the word for finding the word searching modul, labeled as the conjunctive word
The scope of application information.
22. device as claimed in claim 21, it is characterised in that described device also includes:
Keyword lookup module, for obtaining one or many of the keyword in the conjunctive word acquisition module
Before individual conjunctive word, the keyword is found from the text;
Conjunctive word marker character searching modul, the conjunctive word for finding the keyword from the text is marked
Symbol, the conjunctive word marker character is used to mark the conjunctive word of the keyword and associating for the keyword
System;
Matching range determining module, for determining matching model of the conjunctive word marker character in the text
Enclose, the matching range is used to mark the position range that the conjunctive word is likely to occur in the text;
The range flags module specifically for:
One or more of conjunctive words are obtained out of described matching range.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610179888.9A CN107229659B (en) | 2016-03-25 | 2016-03-25 | Information searching method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610179888.9A CN107229659B (en) | 2016-03-25 | 2016-03-25 | Information searching method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107229659A true CN107229659A (en) | 2017-10-03 |
CN107229659B CN107229659B (en) | 2021-06-22 |
Family
ID=59931969
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610179888.9A Active CN107229659B (en) | 2016-03-25 | 2016-03-25 | Information searching method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107229659B (en) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108446345A (en) * | 2018-03-07 | 2018-08-24 | 维沃移动通信有限公司 | A kind of data search method and mobile terminal |
CN109684633A (en) * | 2018-12-14 | 2019-04-26 | 北京百度网讯科技有限公司 | Search processing method, device, equipment and storage medium |
CN110941609A (en) * | 2019-10-12 | 2020-03-31 | 贝壳技术有限公司 | Multi-dimensional searching method and system |
CN111241126A (en) * | 2020-01-16 | 2020-06-05 | 联想(北京)有限公司 | Data searching method and device and query interaction method |
CN111382374A (en) * | 2020-02-29 | 2020-07-07 | 中国平安人寿保险股份有限公司 | Information display method and device, electronic equipment and storage medium |
CN111435376A (en) * | 2019-01-15 | 2020-07-21 | 北京京东尚科信息技术有限公司 | Information processing method and system, computer system, and computer-readable storage medium |
CN112464081A (en) * | 2020-09-08 | 2021-03-09 | 广东省华南技术转移中心有限公司 | Project information matching method, device and storage medium |
CN112596646A (en) * | 2020-12-21 | 2021-04-02 | 维沃移动通信有限公司 | Information display method and device and electronic equipment |
CN112650839A (en) * | 2021-01-12 | 2021-04-13 | 深圳市鹰硕技术有限公司 | Retrieval information optimization method and device |
CN112825088A (en) * | 2019-11-21 | 2021-05-21 | 阿里巴巴集团控股有限公司 | Information display method, device, equipment and storage medium |
CN113743981A (en) * | 2021-08-03 | 2021-12-03 | 深圳市东信时代信息技术有限公司 | Material putting cost prediction method and device, computer equipment and storage medium |
CN114697748A (en) * | 2020-12-25 | 2022-07-01 | 深圳Tcl新技术有限公司 | Video recommendation method based on voice recognition and computer equipment |
WO2022262621A1 (en) * | 2021-06-17 | 2022-12-22 | 华为技术有限公司 | Method and apparatus for searching point of information |
CN117112736A (en) * | 2023-10-24 | 2023-11-24 | 云南瀚文科技有限公司 | Information retrieval analysis method and system based on semantic analysis model |
CN118277537A (en) * | 2024-06-03 | 2024-07-02 | 福建省君诺科技成果转化服务有限公司 | Intellectual property retrieval management method and device based on big data |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09231227A (en) * | 1996-02-20 | 1997-09-05 | Inter Group:Kk | Information retrieval device and method therefor |
CN101888503A (en) * | 2010-06-12 | 2010-11-17 | 中山大学 | Classification retrieving method for digital television program |
CN103123632A (en) * | 2011-11-21 | 2013-05-29 | 阿里巴巴集团控股有限公司 | Determining method for searching headword and device of searching headword, searching method and searching equipment |
CN103353894A (en) * | 2013-07-19 | 2013-10-16 | 武汉睿数信息技术有限公司 | Data searching method and system based on semantic analysis |
CN104268175A (en) * | 2014-09-15 | 2015-01-07 | 乐视网信息技术(北京)股份有限公司 | Data search device and method thereof |
-
2016
- 2016-03-25 CN CN201610179888.9A patent/CN107229659B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH09231227A (en) * | 1996-02-20 | 1997-09-05 | Inter Group:Kk | Information retrieval device and method therefor |
CN101888503A (en) * | 2010-06-12 | 2010-11-17 | 中山大学 | Classification retrieving method for digital television program |
CN103123632A (en) * | 2011-11-21 | 2013-05-29 | 阿里巴巴集团控股有限公司 | Determining method for searching headword and device of searching headword, searching method and searching equipment |
CN103353894A (en) * | 2013-07-19 | 2013-10-16 | 武汉睿数信息技术有限公司 | Data searching method and system based on semantic analysis |
CN104268175A (en) * | 2014-09-15 | 2015-01-07 | 乐视网信息技术(北京)股份有限公司 | Data search device and method thereof |
Non-Patent Citations (1)
Title |
---|
王屾: ""基于Lucene的同义词扩展检索的研究与实现"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108446345A (en) * | 2018-03-07 | 2018-08-24 | 维沃移动通信有限公司 | A kind of data search method and mobile terminal |
CN109684633A (en) * | 2018-12-14 | 2019-04-26 | 北京百度网讯科技有限公司 | Search processing method, device, equipment and storage medium |
CN109684633B (en) * | 2018-12-14 | 2023-05-16 | 北京百度网讯科技有限公司 | Search processing method, device, equipment and storage medium |
CN111435376A (en) * | 2019-01-15 | 2020-07-21 | 北京京东尚科信息技术有限公司 | Information processing method and system, computer system, and computer-readable storage medium |
CN110941609B (en) * | 2019-10-12 | 2023-10-20 | 贝壳找房(北京)科技有限公司 | Multi-dimensional searching method and system |
CN110941609A (en) * | 2019-10-12 | 2020-03-31 | 贝壳技术有限公司 | Multi-dimensional searching method and system |
CN112825088A (en) * | 2019-11-21 | 2021-05-21 | 阿里巴巴集团控股有限公司 | Information display method, device, equipment and storage medium |
CN111241126A (en) * | 2020-01-16 | 2020-06-05 | 联想(北京)有限公司 | Data searching method and device and query interaction method |
CN111382374A (en) * | 2020-02-29 | 2020-07-07 | 中国平安人寿保险股份有限公司 | Information display method and device, electronic equipment and storage medium |
CN112464081A (en) * | 2020-09-08 | 2021-03-09 | 广东省华南技术转移中心有限公司 | Project information matching method, device and storage medium |
CN112596646A (en) * | 2020-12-21 | 2021-04-02 | 维沃移动通信有限公司 | Information display method and device and electronic equipment |
CN112596646B (en) * | 2020-12-21 | 2022-05-20 | 维沃移动通信有限公司 | Information display method and device and electronic equipment |
CN114697748B (en) * | 2020-12-25 | 2024-05-03 | 深圳Tcl新技术有限公司 | Video recommendation method and computer equipment based on voice recognition |
CN114697748A (en) * | 2020-12-25 | 2022-07-01 | 深圳Tcl新技术有限公司 | Video recommendation method based on voice recognition and computer equipment |
CN112650839A (en) * | 2021-01-12 | 2021-04-13 | 深圳市鹰硕技术有限公司 | Retrieval information optimization method and device |
WO2022262621A1 (en) * | 2021-06-17 | 2022-12-22 | 华为技术有限公司 | Method and apparatus for searching point of information |
CN113743981B (en) * | 2021-08-03 | 2023-11-28 | 深圳市东信时代信息技术有限公司 | Material delivery cost prediction method and device, computer equipment and storage medium |
CN113743981A (en) * | 2021-08-03 | 2021-12-03 | 深圳市东信时代信息技术有限公司 | Material putting cost prediction method and device, computer equipment and storage medium |
CN117112736A (en) * | 2023-10-24 | 2023-11-24 | 云南瀚文科技有限公司 | Information retrieval analysis method and system based on semantic analysis model |
CN117112736B (en) * | 2023-10-24 | 2024-01-05 | 云南瀚文科技有限公司 | Information retrieval analysis method and system based on semantic analysis model |
CN118277537A (en) * | 2024-06-03 | 2024-07-02 | 福建省君诺科技成果转化服务有限公司 | Intellectual property retrieval management method and device based on big data |
Also Published As
Publication number | Publication date |
---|---|
CN107229659B (en) | 2021-06-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107229659A (en) | A kind of information search method and device | |
CN109710701B (en) | Automatic construction method for big data knowledge graph in public safety field | |
CN104933113B (en) | A kind of expression input method and device based on semantic understanding | |
CN105393263B (en) | Feature in compuman's interactive learning is completed | |
Chen | Information visualization: Beyond the horizon | |
US8972440B2 (en) | Method and process for semantic or faceted search over unstructured and annotated data | |
CN113065003B (en) | Knowledge graph generation method based on multiple indexes | |
CN108268580A (en) | The answering method and device of knowledge based collection of illustrative plates | |
CN105528437B (en) | A kind of question answering system construction method extracted based on structured text knowledge | |
CN104462056B (en) | For the method and information handling systems of knouledge-based information to be presented | |
CN106104518A (en) | For the framework extracted according to the data of example | |
CN110909170B (en) | Interest point knowledge graph construction method and device, electronic equipment and storage medium | |
CN106663117A (en) | Constructing a graph that facilitates provision of exploratory suggestions | |
CN105843796A (en) | Microblog emotional tendency analysis method and device | |
CN109582799A (en) | The determination method, apparatus and electronic equipment of knowledge sample data set | |
CN103617192B (en) | The clustering method and device of a kind of data object | |
CN107784014A (en) | Information search method, equipment and electronic equipment | |
CN104331438B (en) | To novel web page contents selectivity abstracting method and device | |
CN104239570B (en) | The searching method and device of paper | |
CN110309432A (en) | Method, map point of interest processing method are determined based on the synonym of point of interest | |
CN109857952A (en) | A kind of search engine and method for quickly retrieving with classification display | |
CN113190593A (en) | Search recommendation method based on digital human knowledge graph | |
CN105653546A (en) | Method and system for searching target theme | |
Menezes et al. | Building a massive corpus for named entity recognition using free open data sources | |
Castellani Ribeiro et al. | An urban data profiler |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200201 Address after: 518129 Bantian HUAWEI headquarters office building, Longgang District, Guangdong, Shenzhen Applicant after: HUAWEI TECHNOLOGIES Co.,Ltd. Address before: 210012 HUAWEI Nanjing base, 101 software Avenue, Yuhuatai District, Jiangsu, Nanjing Applicant before: Huawei Technologies Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant |