CN102999556B - Text search method, device and terminal device - Google Patents

Info

Publication number
CN102999556B
Authority
CN
China
Prior art keywords
search results
search
described search
positional information
context
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210390486.5A
Other languages
Chinese (zh)
Other versions
CN102999556A (en
Inventor
刘娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210390486.5A
Publication of CN102999556A
Application granted
Publication of CN102999556B

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a text search method comprising the following steps: receiving a search term input by a user; searching a target document according to the search term to generate a search result set, wherein the search result set comprises a plurality of search results and each search result comprises position information of the search term in the target document; generating a plurality of context subsets from the position information in each search result of the search result set, wherein each context subset corresponds to one search result in the search result set; and screening the search results according to the context subsets and providing the screened search results to the user. The method addresses the technical problem of taking both the searched content and its context information into account during a text search; it improves the effectiveness and timeliness of the user's text search and is convenient and easy to use. The invention also discloses a text search device and a terminal device.

Description

Text search method, device and terminal device
Technical field
The present invention relates to the field of information search technology, and in particular to a text search method, a text search device and a terminal device.
Background art
At present, in text search, the user simply enters the content to be searched for, for example the word "AB", and every piece of content matching "AB" is then looked up throughout the text. Sometimes, although "AB" does occur in the file, a given occurrence is not the result the user actually wants. The user therefore has to check the occurrences one by one to decide whether each is the intended target, which makes searching and checking tedious and inefficient.
Summary of the invention
The present invention aims to solve at least one of the technical deficiencies described above.
To this end, a first object of the present invention is to propose a text search method that makes the user's text search more efficient and is convenient and easy to use. A second object of the present invention is to propose a text search device. A third object of the present invention is to propose a terminal device.
To achieve these objects, an embodiment of the first aspect of the present invention provides a text search method comprising the following steps: receiving a search term input by a user; searching a target document according to the search term to generate a search result set, wherein the search result set comprises a plurality of search results and each search result comprises position information of the search term in the target document; generating a plurality of context subsets from the position information in each search result of the search result set, wherein each context subset corresponds to one search result in the search result set; and screening the search results according to the context subsets and providing the screened search results to the user.
According to the text search method of the embodiment of the present invention, the target document is searched according to the user's search term to generate a search result set, a context subset is generated from the position information in each search result of the set, and the search results are screened on that basis before being provided to the user. The method makes the user's text search more efficient and is convenient and easy to use.
In one embodiment of the invention, screening the search results according to the context subsets further comprises:
obtaining the search result that the user has confirmed in the search result set, taking that search result as the standard search result, and taking its corresponding context subset as the standard context subset; calculating the similarity between each of the other context subsets and the standard context subset; and, if the similarity is greater than a preset threshold, deleting the search result corresponding to that context subset. This improves the accuracy of the text search.
In one embodiment of the invention, generating a plurality of context subsets from the position information in each search result of the search result set further comprises:
obtaining the context information of each search result according to the position information of that search result; and generating the context subset of each search result from the context information of that search result. This improves the efficiency and ease of use of the text search.
In one embodiment of the invention, the position information of each search result is further obtained according to the position information of each search result, and the context subset of each search result is generated from both the context information and the position information of that search result. This increases the options available to the text search method while keeping it efficient and easy to use.
In one embodiment of the invention, the punctuation information of each search result is obtained according to the position information of each search result, and the context subset of each search result is generated from both the context information and the punctuation information of that search result. This increases the options available to the text search method while keeping it efficient and easy to use.
An embodiment of the second aspect of the present invention proposes a text search device comprising: a receiving module that receives the search term input by a user; a search module that searches the target document according to the search term; a generation module for generating a search result set and for generating a plurality of context subsets from the position information in each search result of the search result set, wherein the search result set comprises a plurality of search results, each search result comprises position information of the search term in the target document, and each context subset corresponds to one search result in the search result set; a screening module that screens the search results according to the context subsets; and a sending module for providing the search results screened by the screening module to the user.
According to the text search device of the embodiment of the present invention, the search module searches the target document according to the search term received by the receiving module, the generation module generates the search result set and a context subset from the position information in each search result, the screening module screens the search results on that basis, and the sending module provides the screened results to the user. The device makes the user's text search more efficient and is convenient and easy to use.
In one embodiment of the invention, the screening module comprises:
an acquiring unit that obtains the search result the user has confirmed in the search result set, takes that search result as the standard search result, and takes its corresponding context subset as the standard context subset; a computing unit for calculating the similarity between each of the other context subsets and the standard context subset; and a judging unit for deleting the search result corresponding to a context subset if the similarity is judged to be greater than a preset threshold. This improves the accuracy of the text search.
In one embodiment of the invention, the generation module is used to obtain the context information of each search result according to the position information of that search result, and to generate the context subset of each search result from its context information. This improves the efficiency and ease of use of the text search.
In one embodiment of the invention, the generation module is also used to further obtain the position information of each search result according to the position information of each search result, and to generate the context subset of each search result from both its context information and its position information. This increases the options available to the text search method while keeping it efficient and easy to use.
In one embodiment of the invention, the generation module is also used to obtain the punctuation information of each search result according to the position information of each search result, and to generate the context subset of each search result from both its context information and its punctuation information. This increases the options available to the text search method while keeping it efficient and easy to use.
An embodiment of the third aspect of the present invention proposes a terminal device comprising the text search device provided by the above embodiment.
According to the terminal device of the embodiment of the present invention, the target document is searched according to the user's search term to generate a search result set, a context subset is generated from the position information in each search result of the set, and the search results are screened on that basis before being provided to the user. The terminal device makes the user's text search more efficient and is convenient and easy to use.
In one embodiment of the invention, the terminal device is a mobile phone, a personal computer (PC) or a tablet computer. This widens the range of terminal devices that can be used.
Additional aspects and advantages of the present invention will be set forth in part in the following description, and in part will become apparent from that description or be learned through practice of the invention.
Brief description of the drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:
Fig. 1 is a flow chart of a text search method according to an embodiment of the invention;
Fig. 2 is a schematic diagram of a text search device according to an embodiment of the invention;
Fig. 3 is a schematic diagram of a screening module according to an embodiment of the invention; and
Fig. 4 is a schematic diagram of a terminal device according to an embodiment of the invention.
Detailed description
Embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which identical or similar reference numerals denote, throughout, identical or similar elements or elements having identical or similar functions. The embodiments described below with reference to the drawings are exemplary; they serve only to explain the present invention and are not to be construed as limiting it.
In addition, the terms "first" and "second" are used for descriptive purposes only and are not to be understood as indicating or implying relative importance or implicitly indicating the number of technical features referred to. Thus, a feature qualified by "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality of" means two or more, unless specifically defined otherwise.
In the description of the present invention, it should be noted that, unless otherwise specified and defined, the terms "mounted", "connected" and "coupled" are to be understood broadly: a connection may, for example, be mechanical or electrical, may be internal to two elements, and may be direct or indirect through an intermediary. A person of ordinary skill in the art can understand the specific meaning of these terms according to the circumstances.
As shown in Fig. 1, the text search method of the embodiment of the present invention comprises the following steps:
Step S101: receive the search term input by the user. The user may enter it by, among other means, handwriting input, key input, or a combination of one or more such modes.
The user can enter a search term according to his or her own search needs; the search term may be any one of, or a combination of, words, numerals, characters and letters, for example "5 a.m.".
Step S102: search the target document according to the search term to generate a search result set. The search result set comprises a plurality of search results, and each search result comprises position information of the search term in the target document.
The target document may contain one or more occurrences of the search term entered by the user. The target document is therefore searched according to the search term received in step S101, yielding a plurality of search results that contain the search term. Each search result includes the position of the search term in the target document. For example, if "5 a.m." appears in both the body text and the title of the target document, the search result set comprises a first search result whose position is the body text and a second search result whose position is the title.
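Step S102 can be pictured with a minimal Python sketch. The patent does not prescribe any data layout, so the `SearchResult` record, the `region` labels and the idea of passing the document as a mapping from region name to text are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SearchResult:
    region: str  # hypothetical label for where the hit lies, e.g. "title" or "body"
    offset: int  # character offset of the hit inside that region


def search_document(document: dict, term: str) -> list:
    """Return one SearchResult for every occurrence of `term` in every region."""
    results = []
    if not term:
        return results
    for region, text in document.items():
        start = text.find(term)
        while start != -1:
            results.append(SearchResult(region, start))
            # Advance by one character so overlapping occurrences are found too.
            start = text.find(term, start + 1)
    return results


doc = {
    "title": "Departure at 5 a.m.",
    "body": "We leave at 5 a.m. sharp and return before noon.",
}
hits = search_document(doc, "5 a.m.")
```

Here `hits` holds one result for the title and one for the body, mirroring the first and second search results of the example.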
It should be noted that the content of a search result is not limited to the position information of the search term in the target document; it may also include other information related to the search term.
Step S103: generate a plurality of context subsets from the position information in each search result of the search result set, each context subset corresponding to one search result in the search result set. For example, a context subset is generated from the position information of each of the first search result, located in the body text, and the second search result, located in the title.
Specifically, the context information of each search result is obtained according to its position information. For example, since the positions of "5 a.m." in the target document are the body text and the title, the context information of "5 a.m." is obtained from the body text and from the title respectively.
Then, the context subset of each search result is generated from the context information of that search result. For example, the context information obtained for each occurrence of "5 a.m." is used to generate the context subset of the corresponding search result.
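One way to read "obtaining the context information from the position information" is to take a fixed number of characters on each side of the hit. The sketch below assumes exactly that; the `window` parameter and the dictionary keys are hypothetical, since the patent leaves the amount and form of context open.

```python
def extract_context(text: str, offset: int, term_len: int, window: int = 10) -> dict:
    """Collect the characters immediately before and after a hit."""
    before = text[max(0, offset - window):offset]
    after = text[offset + term_len:offset + term_len + window]
    return {"before": before, "after": after}


body = "We leave at 5 a.m. sharp and return before noon."
term = "5 a.m."
offset = body.find(term)
context = extract_context(body, offset, len(term), window=8)
```

The resulting dictionary plays the role of a very small context subset for the occurrence in the body text.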
Next, the position information of each search result is further obtained according to the position information of each search result. For example, since the positions of "5 a.m." in the target document are the body text and the title, the position information of "5 a.m." is further obtained from these two positions.
Once the position information of each search result has been obtained, the context subset of each search result is generated from both its context information and its position information. For example, since the positions of "5 a.m." in the target document are the body text and the title, the context subset of each search result is generated from the context information of "5 a.m." together with the two positions, body text and title.
Furthermore, the context subsets may also be generated from the position information in each search result of the search result set in the following manner:
First, the punctuation information of each search result is obtained according to its position information. Then, the context subset of each search result is generated from its context information and punctuation information.
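The patent does not define the punctuation information precisely. A plausible reading, taken up later in the description, is the number of characters between the hit and the nearest punctuation mark on each side. The sketch below assumes that reading; the `PUNCTUATION` set and the key names are illustrative choices.

```python
PUNCTUATION = set(".,;:!?，。；：！？")


def punctuation_distances(text: str, offset: int, term_len: int) -> dict:
    """Count the characters between a hit and the nearest punctuation on each side.

    A value of None means no punctuation mark was found on that side.
    """
    before = None
    for i in range(offset - 1, -1, -1):
        if text[i] in PUNCTUATION:
            before = offset - 1 - i
            break
    after = None
    end = offset + term_len
    for i in range(end, len(text)):
        if text[i] in PUNCTUATION:
            after = i - end
            break
    return {"chars_to_punct_before": before, "chars_to_punct_after": after}


sentence = "First, we meet at dawn near the gate; then we walk."
word = "dawn"
info = punctuation_distances(sentence, sentence.find(word), len(word))
```

These two counts can then be stored in the context subset next to the surrounding text.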
Step S104: screen the search results according to the context subsets.
Specifically, the search result that the user has confirmed in the search result set is first obtained; that search result is taken as the standard search result, and its corresponding context subset as the standard context subset.
Then, the similarity between each of the other context subsets and the standard context subset is calculated, for example with an existing mathematical probability comparison algorithm such as cosine similarity or BM25.
If the similarity between another context subset and the standard context subset is greater than the preset threshold, the search result corresponding to that context subset is deleted. The preset threshold is set by the user according to actual needs.
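The screening step can be sketched with cosine similarity, one of the two algorithms the text names. Representing each context subset as a plain string split on whitespace is an assumption, as are the function names and the sample threshold; the deletion rule follows the wording of this step.

```python
import math
from collections import Counter


def cosine_similarity(a: str, b: str) -> float:
    """Cosine similarity between whitespace-token count vectors of two strings."""
    va, vb = Counter(a.split()), Counter(b.split())
    dot = sum(va[t] * vb[t] for t in va)
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def screen_results(contexts: dict, standard_id: str, threshold: float) -> list:
    """Keep the standard result; drop others whose similarity exceeds the threshold."""
    standard = contexts[standard_id]
    kept = []
    for result_id, context in contexts.items():
        if result_id != standard_id and cosine_similarity(context, standard) > threshold:
            continue  # deleted, as this step is worded
        kept.append(result_id)
    return kept


contexts = {
    "r1": "train departs at the station",
    "r2": "train departs at the platform",
    "r3": "alarm rings in the bedroom",
}
kept = screen_results(contexts, standard_id="r1", threshold=0.5)
```

With these sample contexts, `r2` shares four of five tokens with the standard and is removed, while `r3` shares only one and is kept.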
Step S105: provide the screened search results to the user, so that the user can view the search results he or she wants.
According to the text search method of the embodiment of the present invention, the target document is searched according to the user's search term to generate a search result set, a context subset is generated from the position information in each search result of the set, and the search results are screened on that basis before being provided to the user. The method helps present search results that lie in different contexts in visibly different ways, so that the user can readily recognize results that are not the ones wanted; it also makes the user's text search more efficient and is convenient and easy to use.
The text search method is described in further detail below by way of example.
First, the search term "AB" input by the user is received, all results matching "AB" are searched for in the file, and a search result set is built. The search result set comprises a plurality of search results, each of which comprises position information of the search term in the target document. For each result in the result set in turn, the context information before and after it is looked up, and a context set associated with that result is built. In other words, a context subset is generated from the position information in each search result of the search result set. For example, the search result set is "AB1, AB2 ... ABn", and the context set of search result AB1 is "AB1-a, AB1-b ... AB1-m". A context set may contain several kinds of information. Preferably, it includes the specific location in which AB occurs, for example in the body text, in a title or in a table; the possible locations are not limited to these, and other cases are not enumerated here. Preferably, it includes some of the text adjacent to AB, for example the word before AB and the word after AB. Preferably, it includes whether related characters occur around AB, for example whether AB is preceded by B so as to form BAB, or followed by A so as to form ABA. Preferably, it includes attribute information related to the specific location in which AB occurs, such as the number of characters between AB and the nearest punctuation mark before and after it.
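Several of the attributes listed in this example can be gathered into one record per occurrence. The sketch below is only an illustration: the key names are hypothetical, the neighbouring "word" is reduced to a single character for brevity, and `term` is assumed to be non-empty.

```python
def build_context_subset(text: str, region: str, offset: int, term: str) -> dict:
    """Assemble a few of the listed attributes for one occurrence of `term`."""
    end = offset + len(term)
    prev_char = text[offset - 1] if offset > 0 else ""
    next_char = text[end] if end < len(text) else ""
    return {
        "region": region,          # e.g. "body", "title" or "table"
        "char_before": prev_char,  # text adjacent to the hit on the left
        "char_after": next_char,   # text adjacent to the hit on the right
        # For the term "AB": preceded by "B" gives BAB, followed by "A" gives ABA.
        "preceded_by_last_char": prev_char == term[-1],
        "followed_by_first_char": next_char == term[0],
    }


sample = "xBABy ABA"
first = build_context_subset(sample, "body", 2, "AB")
second = build_context_subset(sample, "body", 6, "AB")
```

In `sample`, the first occurrence of "AB" sits inside "BAB" and the second inside "ABA", so the two records differ in exactly the flags the example describes.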
The search result that the user has confirmed in the search result set is obtained; that search result is taken as the standard search result and its corresponding context subset as the standard context subset. That is, the context subset of the reference "AB" at the user's current page position is extracted to serve as the reference basis. The context set of each search result is then compared with it for similarity, for which an existing mathematical probability comparison algorithm can be used. If the similarity exceeds a certain threshold, meaning that the two are very likely alike, the result is displayed normally; if the similarity is below the threshold, meaning that the two differ considerably, the result is displayed highlighted.
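The display rule in this example reduces to a comparison against the threshold. The sketch below assumes the similarity scores have already been computed; the labels "normal" and "highlight" are hypothetical, and treating a score equal to the threshold as "normal" is an assumption, because the text does not cover that case.

```python
def label_results(similarities: dict, threshold: float) -> dict:
    """Map each result id to a display mode based on its similarity score."""
    return {
        result_id: ("normal" if score >= threshold else "highlight")
        for result_id, score in similarities.items()
    }


labels = label_results({"AB1": 0.9, "AB2": 0.2}, threshold=0.5)
```

Results whose context differs markedly from the reference, such as `AB2` here, are the ones flagged for the user's attention.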
According to the text search method of the embodiment of the present invention, the target document is searched according to the user's search term to generate a search result set, a context subset is generated from the position information in each search result of the set, and the search results are screened on that basis before being provided to the user. The method helps present search results that lie in different contexts in visibly different ways, so that the user can readily recognize results that are not the ones wanted; it also makes the user's text search more efficient and is convenient and easy to use.
As shown in Fig. 2, the text search device 300 of the embodiment of the present invention comprises a receiving module 310, a search module 320, a generation module 330, a screening module 340 and a sending module 350.
The receiving module 310 receives the search term input by the user. The input may be received by, among other means, handwriting input, key input, or a combination of one or more such modes. The user can enter a search term according to his or her own search needs; the search term may be any one of, or a combination of, words, numerals, characters and letters. For example, the receiving module 310 receives the search term "5 a.m." input by the user.
The search module 320 searches the target document according to the search term.
The target document may contain one or more occurrences of the search term entered by the user. The target document is therefore searched according to the search term received by the receiving module, yielding a plurality of search results that contain the search term. Each search result includes the position of the search term in the target document. For example, if "5 a.m." appears in both the body text and the title of the target document, the search result set comprises a first search result whose position is the body text and a second search result whose position is the title.
It should be noted that the content of a search result is not limited to the position information of the search term in the target document; it may also include other information related to the search term.
The generation module 330 generates the search result set and generates a plurality of context subsets from the position information in each search result of the search result set. For example, a context subset is generated from the position information of each of the first search result, located in the body text, and the second search result, located in the title. The search result set comprises a plurality of search results, each search result comprises position information of the search term in the target document, and each context subset corresponds to one search result in the search result set.
Further, the generation module 330 also obtains the context information of each search result according to its position information, and generates the context subset of each search result from its context information. For example, since the positions of "5 a.m." in the target document are the body text and the title, the context information of "5 a.m." is obtained from the body text and from the title respectively, and the context information so obtained is used to generate the context subset of each search result.
The generation module 330 also further obtains the position information of each search result according to the position information of each search result, and generates the context subset of each search result from both its context information and its position information. For example, since the positions of "5 a.m." in the target document are the body text and the title, the position information of "5 a.m." is further obtained from these two positions, and the context subset of each search result is then generated from the context information of "5 a.m." together with the two positions, body text and title.
The generation module 330 also obtains the punctuation information of each search result according to its position information, and generates the context subset of each search result from both its context information and its punctuation information. For example, since the positions of "5 a.m." in the target document are the body text and the title, the punctuation information of "5 a.m." is further obtained from these two positions. For example, if the number of characters from "5 a.m." to the nearest punctuation mark before and after it is preset to 10, the context subset of each search result is generated from the context information of "5 a.m." and this punctuation information.
The screening module 340 screens the search results according to the context subsets. As shown in Fig. 3, the screening module comprises an acquiring unit 301, a computing unit 302 and a judging unit 303.
The acquiring unit 301 obtains the search result that the user has confirmed in the search result set, takes that search result as the standard search result, and takes its corresponding context subset as the standard context subset.
The computing unit 302 calculates the similarity between each of the other context subsets and the standard context subset, for example with an existing mathematical probability comparison algorithm such as cosine similarity or BM25.
The judging unit 303 deletes the search result corresponding to a context subset if the similarity is judged to be greater than the preset threshold. The preset threshold is set by the user according to actual needs.
The sending module 350 provides the search results screened by the screening module 340 to the user, so that the user can view the search results he or she wants.
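The division of labour among the five modules can be sketched as one small Python class. This is a toy composition and not the device itself: the method names, the fixed character window used as context, and the Jaccard token overlap used as a stand-in for cosine similarity or BM25 are all assumptions.

```python
from typing import Callable


def jaccard(a: str, b: str) -> float:
    """Stand-in similarity: overlap of whitespace-separated tokens."""
    sa, sb = set(a.split()), set(b.split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


class TextSearchDevice:
    """Toy composition of the five modules; numbers refer to Fig. 2."""

    def __init__(self, similarity: Callable[[str, str], float], threshold: float):
        self.similarity = similarity
        self.threshold = threshold

    def receive(self, raw: str) -> str:  # receiving module 310
        return raw.strip()

    def search(self, text: str, term: str) -> list:  # search module 320
        if not term:
            return []
        offsets, start = [], text.find(term)
        while start != -1:
            offsets.append(start)
            start = text.find(term, start + 1)
        return offsets

    def generate(self, text: str, term: str, offsets: list, window: int = 8) -> dict:
        # generation module 330: one context string per hit
        return {o: text[max(0, o - window):o + len(term) + window] for o in offsets}

    def screen(self, contexts: dict, standard: int) -> list:  # screening module 340
        ref = contexts[standard]
        return [o for o, c in contexts.items()
                if o == standard or self.similarity(c, ref) <= self.threshold]

    def send(self, offsets: list) -> list:  # sending module 350
        return sorted(offsets)


device = TextSearchDevice(jaccard, threshold=0.5)
text = "AB one two. AB one two. AB six ten."
term = device.receive(" AB ")
offsets = device.search(text, term)
contexts = device.generate(text, term, offsets)
shown = device.send(device.screen(contexts, standard=offsets[0]))
```

In this run the first hit serves as the standard result; the second hit has a near-identical context and is dropped by the screening rule as worded, while the third is kept.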
According to the text search device of the embodiment of the present invention, the search module searches the target document according to the search term received by the receiving module, the generation module generates the search result set and a context subset from the position information in each search result, the screening module screens the search results on that basis, and the sending module provides the screened results to the user. The device makes the user's text search more efficient and is convenient and easy to use.
As shown in Fig. 4, the terminal device 400 of the embodiment of the present invention comprises the text search device 300. The terminal device may be a mobile phone, a personal computer (PC) or a tablet computer.
According to the terminal device of the embodiment of the present invention, the target document is searched according to the user's search term to generate a search result set, a context subset is generated from the position information in each search result of the set, and the search results are screened on that basis before being provided to the user. The terminal device makes the user's text search more efficient and is convenient and easy to use.
Describe and can be understood in process flow diagram or in this any process otherwise described or method, represent and comprise one or more for realizing the module of the code of the executable instruction of the step of specific logical function or process, fragment or part, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can not according to order that is shown or that discuss, comprise according to involved function by the mode while of basic or by contrary order, carry out n-back test, this should understand by embodiments of the invention person of ordinary skill in the field.
In flow charts represent or in this logic otherwise described and/or step, such as, the sequencing list of the executable instruction for realizing logic function can be considered to, may be embodied in any computer-readable medium, for instruction execution system, device or equipment (as computer based system, comprise the system of processor or other can from instruction execution system, device or equipment instruction fetch and perform the system of instruction) use, or to use in conjunction with these instruction execution systems, device or equipment.With regard to this instructions, " computer-readable medium " can be anyly can to comprise, store, communicate, propagate or transmission procedure for instruction execution system, device or equipment or the device that uses in conjunction with these instruction execution systems, device or equipment.The example more specifically (non-exhaustive list) of computer-readable medium comprises following: the electrical connection section (electronic installation) with one or more wiring, portable computer diskette box (magnetic device), random-access memory (ram), ROM (read-only memory) (ROM), erasablely edit ROM (read-only memory) (EPROM or flash memory), fiber device, and portable optic disk ROM (read-only memory) (CDROM).In addition, computer-readable medium can be even paper or other suitable media that can print described program thereon, because can such as by carrying out optical scanning to paper or other media, then carry out editing, decipher or carry out process with other suitable methods if desired and electronically obtain described program, be then stored in computer memory.
It should be understood that each part of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the embodiments above, multiple steps or methods may be implemented in software or firmware that is stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, they may be implemented by any one or a combination of the following technologies well known in the art: a discrete logic circuit having logic gates for implementing logical functions on data signals, an application-specific integrated circuit having suitable combinational logic gates, a programmable gate array (PGA), a field-programmable gate array (FPGA), and so on.
Those of ordinary skill in the art will understand that all or part of the steps carried by the method of the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one of the steps of the method embodiment or a combination thereof.
In addition, the functional units in the embodiments of the present invention may be integrated into one processing module, each unit may exist physically on its own, or two or more units may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented as a software functional module and is sold or used as an independent product, it may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.
In the description of this specification, a reference to the terms "an embodiment", "some embodiments", "an example", "a specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic uses of these terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described above, it should be understood that these embodiments are exemplary and are not to be construed as limiting the present invention. Those of ordinary skill in the art may change, modify, replace, and vary the above embodiments within the scope of the present invention without departing from its principles and purpose. The scope of the present invention is defined by the claims and their equivalents.

Claims (12)

1. A text search method, characterized by comprising the following steps:
receiving a search term input by a user;
searching a target document according to the search term to generate a search result set, wherein the search result set comprises a plurality of search results, and each search result comprises position information of the search term within the target document;
generating a plurality of result context subsets, one for the position information in each search result in the search result set, wherein each context subset corresponds to one search result in the search result set; and
screening the search results according to the similarity between the context subsets and a standard context subset, and providing the screened search results to the user.
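As an illustration only, the search and context-generation steps of claim 1 might be sketched as follows. The patent does not specify how a context subset is built; the fixed character window, the whitespace tokenization, and the names `SearchResult` and `search_with_context` are all assumptions made for this sketch.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass
class SearchResult:
    position: int            # character offset of the match in the target document
    context: FrozenSet[str]  # context subset: tokens found near the match


def search_with_context(document: str, term: str, window: int = 20) -> List[SearchResult]:
    """Find every occurrence of `term` and attach a context subset to each hit."""
    results: List[SearchResult] = []
    if not term:
        return results
    start = document.find(term)
    while start != -1:
        # Assumed context definition: `window` characters on each side of the match.
        left = document[max(0, start - window):start]
        right = document[start + len(term):start + len(term) + window]
        tokens = frozenset((left + " " + right).lower().split())
        results.append(SearchResult(position=start, context=tokens))
        start = document.find(term, start + 1)
    return results
```

Each returned result carries both its position and its own context subset, which gives the one-to-one correspondence between search results and context subsets that the claim requires. A window cut at a fixed character count can split words at its edges; a real implementation would likely align the window to token boundaries.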
2. The text search method as claimed in claim 1, characterized in that screening the search results according to the similarity between the context subsets and the standard context subset further comprises:
obtaining a search result determined by the user in the search result set, taking that search result as a standard search result, and taking the context subset corresponding to that search result as the standard context subset;
calculating the similarity between each of the other context subsets and the standard context subset; and
if the similarity is greater than a predetermined threshold, deleting the search result of the corresponding context subset.
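The screening step of claim 2 could be sketched as below. The claim does not name a similarity measure, so Jaccard similarity over token sets is used here purely as a stand-in; the function names, the default threshold, and the decision to always keep the standard result itself are likewise assumptions.

```python
from typing import Dict, Hashable, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two token sets (an assumed similarity measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def screen_results(contexts: Dict[Hashable, Set[str]],
                   standard_id: Hashable,
                   threshold: float = 0.5) -> List[Hashable]:
    """Return the ids of the results that survive screening.

    `contexts` maps a result id to its context subset, and `standard_id`
    is the result the user picked as the standard search result.
    """
    standard = contexts[standard_id]
    kept = []
    for result_id, context in contexts.items():
        if result_id == standard_id:
            kept.append(result_id)  # assumption: the standard result is retained
            continue
        # As worded in claim 2: similarity above the threshold means deletion.
        if jaccard(context, standard) > threshold:
            continue
        kept.append(result_id)
    return kept
```

Note that the comparison direction follows the claim text literally: results whose contexts closely resemble the standard context are the ones removed.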
3. The text search method as claimed in claim 1 or 2, characterized in that generating a plurality of result context subsets, one for the position information in each search result in the search result set, further comprises:
obtaining context information of each search result according to the position information of that search result; and
generating the context subset of each search result according to the context information of that search result.
4. The text search method as claimed in claim 3, characterized by further comprising:
obtaining position information of each search result according to the position information of that search result; and
generating the context subset of each search result according to the context information and the position information of that search result.
5. The text search method as claimed in claim 3, characterized by further comprising:
obtaining punctuation mark information of each search result according to the position information of that search result; and
generating the context subset of each search result according to the context information and the punctuation mark information of that search result.
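One plausible reading of claim 5 is that punctuation near a match helps delimit its context. The sketch below, offered under that assumption, expands from the match position to the nearest sentence-ending marks on either side and also reports the closing mark. The set of marks in `SENTENCE_END` and the helper name `sentence_context` are hypothetical and not taken from the patent.

```python
from typing import Tuple

# Assumed set of sentence-ending marks (ASCII and full-width CJK forms).
SENTENCE_END = ".!?;。！？；"


def sentence_context(document: str, position: int, term_len: int) -> Tuple[str, str]:
    """Return (enclosing sentence, closing punctuation mark) for a match.

    `position` is the character offset of the match and `term_len` its length.
    The closing mark is "" when the document ends without one.
    """
    # Walk left until just after the previous sentence-ending mark.
    left = position
    while left > 0 and document[left - 1] not in SENTENCE_END:
        left -= 1
    # Walk right until the next sentence-ending mark or the end of the document.
    right = position + term_len
    while right < len(document) and document[right] not in SENTENCE_END:
        right += 1
    closing = document[right] if right < len(document) else ""
    return document[left:right].strip(), closing
```

The returned sentence can serve as the context information and the closing mark as the punctuation information, both of which could then feed into the context subset.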
6. A text search device, characterized by comprising:
a receiving module, for receiving a search term input by a user;
a search module, for searching a target document according to the search term;
a generation module, for generating a search result set and for generating a plurality of result context subsets, one for the position information in each search result of the search result set, wherein the search result set comprises a plurality of search results, each search result comprises position information of the search term within the target document, and each context subset corresponds to one search result in the search result set;
a screening module, for screening the search results according to the similarity between the context subsets and a standard context subset; and
a sending module, for providing the search results screened by the screening module to the user.
7. The text search device as claimed in claim 6, characterized in that the screening module comprises:
an acquiring unit, for obtaining a search result determined by the user in the search result set, taking that search result as a standard search result, and taking the context subset corresponding to that search result as the standard context subset;
a computing unit, for calculating the similarity between each of the other context subsets and the standard context subset; and
a judging unit, for deleting the search result of the corresponding context subset if the similarity is judged to be greater than a predetermined threshold.
8. The text search device as claimed in claim 6 or 7, characterized in that the generation module is configured to obtain context information of each search result according to the position information of that search result, and to generate the context subset of each search result according to the context information of that search result.
9. The text search device as claimed in claim 8, characterized in that the generation module is further configured to obtain position information of each search result according to the position information of that search result, and to generate the context subset of each search result according to the context information and the position information of that search result.
10. The text search device as claimed in claim 8, characterized in that the generation module is further configured to obtain punctuation mark information of each search result according to the position information of that search result, and to generate the context subset of each search result according to the context information and the punctuation mark information of that search result.
11. A terminal device, characterized by comprising the text search device as claimed in any one of claims 6 to 10.
12. The terminal device as claimed in claim 11, characterized in that the terminal device is a mobile phone, a personal computer (PC), or a tablet computer.
CN201210390486.5A 2012-10-15 2012-10-15 Text search method, device and terminal device Active CN102999556B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210390486.5A CN102999556B (en) 2012-10-15 2012-10-15 Text search method, device and terminal device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210390486.5A CN102999556B (en) 2012-10-15 2012-10-15 Text search method, device and terminal device

Publications (2)

Publication Number Publication Date
CN102999556A CN102999556A (en) 2013-03-27
CN102999556B true CN102999556B (en) 2016-02-10

Family

ID=47928124

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210390486.5A Active CN102999556B (en) 2012-10-15 2012-10-15 Text search method, device and terminal device

Country Status (1)

Country Link
CN (1) CN102999556B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109308299B (en) * 2018-09-12 2020-01-14 北京字节跳动网络技术有限公司 Method and apparatus for searching information
CN109344299A (en) * 2018-11-12 2019-02-15 考拉征信服务有限公司 Object search method, apparatus, electronic equipment and computer readable storage medium
CN110674617A (en) * 2019-08-15 2020-01-10 阿里巴巴集团控股有限公司 Disease display method and device in health check process
CN112783918A (en) * 2021-03-15 2021-05-11 北京百度网讯科技有限公司 Search method, search apparatus, electronic device, storage medium, and program product

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101620631A (en) * 2008-07-02 2010-01-06 奥多比公司 Systems and methods for providing hi-fidelity contextual search results
CN101661484A (en) * 2008-08-29 2010-03-03 株式会社理光 Query method and query system
CN102663088A (en) * 2012-03-31 2012-09-12 百度在线网络技术(北京)有限公司 Method and equipment for providing search results

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3964630B2 (en) * 2001-03-07 2007-08-22 日本電信電話株式会社 Information search apparatus, information search program, and recording medium recording the program
KR100902172B1 (en) * 2007-12-12 2009-06-10 한국전자통신연구원 System and method for searching a document based on policy

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
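Design and Implementation of a Lucene-Based Secondary Full-Text Retrieval System; Wu Daiwen; China Master's Theses Full-text Database, Information Science and Technology; 20120315 (No. 3); 35-55 *
Research and Application of a Lucene-Based Enterprise Document Search Engine; Li Haifeng; China Master's Theses Full-text Database, Information Science and Technology; 20120315 (No. 3); 13-19, 44 *
Query Expansion Method Based on Document and Search Result Context; Jiang Hui et al.; Journal of Computer Applications; 20090301; Vol. 29 (No. 3); 852-853 *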

Also Published As

Publication number Publication date
CN102999556A (en) 2013-03-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant