CN101089850A - System for global search using comparison single work position relation - Google Patents
- Publication number
- CN101089850A
- Authority
- CN
- China
- Prior art keywords
- vocabulary
- file
- module
- individual character
- retrieval
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A full-text search system that compares the positional relations of individual characters. A dictionary database is connected to an input module and to a retrieval module; a character-splitting module is connected between the input module and the retrieval module; the retrieval module is connected to a display module; and the display module is connected to the input module.
Description
Technical field
The present invention relates to a full-text search system and method, and in particular to a system and method that perform full-text search by comparing the positional relations of individual characters, applicable to handheld data-processing devices with a dictionary function.
Background art
The retrieval methods for individual characters or words in existing electronic dictionaries fall broadly into the following kinds.
In the first method, every entry in the dictionary is first assigned a data number, and an index file is then built from the data numbers in which each individual character appears; the index file records the correspondence between each character and the data numbers. When the user enters a word to be searched, the search result is derived from the index file: every entry that contains each character of the word is judged to be relevant, and all such entries are listed in sorted order. For example, when the user searches for "China" (中国), the results also include entries such as "in the state" and "developing country", which have nothing to do with the word being searched. In other words, any entry that contains both the character "in" (中) and the character "state" (国) is judged to be highly relevant and is listed in the search results. In addition, data in an electronic dictionary system is usually stored as compressed files. To find out whether the entries listed in the search results really meet the search condition, the user must select the entries one by one so that each is decompressed, and then inspect them individually. This not only slows full-text search but also places an extra burden on the user, and makes it hard to meet the user's need for fast, time-saving, and accurate full-text search.
The known electronic-dictionary full-text search method readily produces results that are not directly relevant, so the user must read through the listed results one by one. Because most dictionary databases store data as compressed files, extra time is needed to decompress the data before its content can be read. The known electronic-dictionary retrieval method therefore cannot meet the user's need for accurate retrieval, and costs considerably more time.
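The limitation of this first method can be illustrated with a minimal Python sketch. The function names and the sample entries are invented for illustration and do not come from the patent; the sketch only assumes that the index maps each character to the numbers of the entries that contain it, with no position information.

```python
from collections import defaultdict


def build_char_index(entries):
    """Map each character to the set of entry numbers that contain it."""
    index = defaultdict(set)
    for entry_no, text in entries.items():
        for char in text:
            index[char].add(entry_no)
    return index


def search_without_positions(index, query):
    """Return every entry containing all query characters, in any order."""
    postings = [index.get(char, set()) for char in query]
    return sorted(set.intersection(*postings)) if postings else []


# Hypothetical sample entries, keyed by data number.
entries = {1: "中国", 2: "国中", 3: "中等国家", 4: "中间"}
index = build_char_index(entries)
hits = search_without_positions(index, "中国")
```

Under these assumptions, entries 2 and 3 are returned alongside entry 1 even though neither contains the query as written, because the index cannot tell where in an entry each character occurs.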
Summary of the invention
In view of the above defects or deficiencies of the prior art, the object of the present invention is to propose a system and method that perform full-text search by comparing the positional relations of individual characters, particularly suited to handheld data-processing devices with a dictionary function.
To achieve the above object, the present invention adopts the following technical solution:
A system for full-text search by comparing the positional relations of individual characters, the system comprising the dictionary database of a handheld data-processing device, characterized in that the system further comprises:
An input module, with which the user enters a word to be searched;
A character-splitting module, which splits the received word to be searched into a plurality of individual characters;
A retrieval module, which, based on the individual characters produced by the character-splitting module, searches the index file for vocabulary files that match the positional relations of the individual characters in the word to be searched, and generates a search result list;
A display module, which displays the search result list;
The dictionary database is connected to the input module and to the retrieval module; the character-splitting module is connected between the input module and the retrieval module; the retrieval module is also connected to the display module; and the display module is connected to the input module.
The dictionary database stores a plurality of vocabulary files and an index file. Each vocabulary file includes a file number, position numbers, and text data consisting of a plurality of individual characters; the index file records the file numbers and position numbers corresponding to each individual character.
The retrieval module further includes a file comparison module, a position comparison module, and a sorting module, which are connected to one another;
The file comparison module compares, in the index file, the file numbers corresponding to the split individual characters, to find the vocabulary files that contain those characters;
The position comparison module compares the position numbers of the individual characters within the vocabulary files that contain them, to find the vocabulary files that match the relative positions of the individual characters in the word to be searched, and generates the search result list;
The sorting module sorts the search result list by degree of relevance.
The search method of the above system for full-text search by comparing the positional relations of individual characters is characterized in that it comprises the following steps:
First, the plurality of vocabulary files in the dictionary database are numbered so that each vocabulary file has a file number; each individual character in each vocabulary file is then numbered so that each individual character has a position number;
Next, an index file is built that records the file numbers and position numbers corresponding to each individual character, the file numbers linking to the vocabulary files;
A word to be searched is read and split into a plurality of individual characters, and the file numbers corresponding to the split characters are compared in the index file to find the vocabulary files that contain those characters;
The position numbers of the individual characters are compared within the vocabulary files that contain the split characters, to find the vocabulary files that match the relative positions of the individual characters in the word to be searched;
The comparison result is sorted to generate the search result list, which is displayed.
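The first two steps above (numbering the files and characters, then building the index file) can be sketched in Python as follows. This is an illustration under stated assumptions rather than the patented implementation: the function names are hypothetical, file numbers and position numbers are assumed to start at 1, and each character of the text is treated as one individual character.

```python
def number_files(texts):
    """Assign consecutive file numbers, starting at 1, to the vocabulary files."""
    return {file_no: text for file_no, text in enumerate(texts, start=1)}


def build_index(files):
    """Record, for each character, the file numbers and position numbers where it occurs.

    The result has the shape {character: {file_no: [position, ...]}}.
    """
    index = {}
    for file_no, text in files.items():
        for position, char in enumerate(text, start=1):
            index.setdefault(char, {}).setdefault(file_no, []).append(position)
    return index


# Hypothetical sample vocabulary files.
files = number_files(["中国", "国中"])
index = build_index(files)
```

Because the index keeps the file number as its key, a search can go from a character to the files that contain it, and only then to the positions inside each file.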
The system and method of the present invention for full-text search using the relative positions of individual characters do not need to decompress each entry in the dictionary database before searching, which greatly reduces the time the user spends on retrieval. Using the relative positions of individual characters as the search criterion avoids irrelevant search results, improves the efficiency of full-text search, and meets the user's need for accurate retrieval.
Description of drawings
Fig. 1 is a schematic diagram of the system of the present invention for full-text search by comparing the positional relations of individual characters;
Fig. 2 is a schematic diagram of the index file of the present invention;
Fig. 3 is a flowchart of the method of the present invention for full-text search by comparing the positional relations of individual characters;
Fig. 4 is a schematic diagram of the retrieval module of the present invention.
The reference numerals in the figures are: 100, dictionary database; 101, file comparison module; 102, position comparison module; 103, sorting module; 110, vocabulary file; 120, index file; 130, input module; 140, character-splitting module; 150, retrieval module; 160, display module.
To provide a further understanding of the purpose, structure, features, and functions of the present invention, the invention is described in further detail below with reference to the accompanying drawings and the embodiment provided by the applicant. The description is sufficient for those skilled in the art to understand the technical content of the invention and its technical effects, and to implement the invention.
Embodiment
Please refer to Fig. 1, a schematic diagram of the system of the present invention for full-text search by comparing the positional relations of individual characters. As shown in the figure, the system is suitable for a handheld data-processing device with a dictionary function and includes a dictionary database 100, an input module 130, a character-splitting module 140, a retrieval module 150, and a display module 160. The dictionary database 100 stores a plurality of vocabulary files 110 and an index file 120. When a vocabulary file 110 is stored in the dictionary database 100, it is first assigned a file number, so that every vocabulary file 110 has a file number through which it can be reached. In addition, each individual character in a vocabulary file 110 has a position number that records the position of that character within the vocabulary file 110. Each vocabulary file 110 in the dictionary database therefore includes a file number, position numbers, and text data consisting of a plurality of individual characters. The text data is usually stored in the dictionary database 100 as compressed files, and the individual characters may be Chinese characters or English words. As shown in Fig. 2, the index file 120 records the file numbers and position numbers corresponding to each individual character. For example, the character "in" (中) appears in vocabulary files No. 15, 20, 70, and 100, and its positions in vocabulary file No. 15 are 2 and 5 (this is given only as an example of the embodiment and does not limit the application of the invention).
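One possible in-memory layout for such an index file is a sorted list of (character, file number, position number) records, sketched below in Python. The record layout and the `lookup` helper are assumptions made for illustration; Fig. 2 is not reproduced here, and only the positions 2 and 5 in file No. 15 come from the text. The positions shown for files No. 20, 70, and 100 are placeholders.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict

# Sorted (character, file number, position number) records.
INDEX_RECORDS = sorted([
    ("中", 15, 2), ("中", 15, 5),   # positions stated in the description
    ("中", 20, 1),                  # placeholder position, not from the patent
    ("中", 70, 3),                  # placeholder position, not from the patent
    ("中", 100, 4),                 # placeholder position, not from the patent
])


def lookup(records, char):
    """Return {file_no: [positions]} for one character using binary search."""
    lo = bisect_left(records, (char,))
    hi = bisect_right(records, (char, float("inf")))
    postings = defaultdict(list)
    for _, file_no, position in records[lo:hi]:
        postings[file_no].append(position)
    return dict(postings)


postings = lookup(INDEX_RECORDS, "中")
```

Keeping the records sorted means all occurrences of one character sit next to each other, so a lookup touches only that slice of the index.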
The character-splitting module 140 is connected to the input module 130 and splits the word or short phrase entered by the user into a plurality of independent individual characters.
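A splitting step of this kind might look like the following Python sketch. The patent does not specify how the split is done, so the rule used here is an assumption: each Chinese character in the basic CJK Unified Ideographs range becomes one unit, each run of Latin letters becomes one unit, and everything else (spaces, punctuation) is dropped. The name `split_query` is hypothetical.

```python
import re

# One CJK ideograph, or one run of Latin letters, per unit.
_UNIT = re.compile(r"[\u4e00-\u9fff]|[A-Za-z]+")


def split_query(text):
    """Split a query into independent units: Chinese characters or English words."""
    return _UNIT.findall(text)


units = split_query("中国 tea")
```

Treating an English word as a single unit matches the statement that the individual characters may be Chinese characters or English words.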
In addition, the retrieval module 150 of the system also comprises a file comparison module 101, a position comparison module 102, and a sorting module 103, which are connected to one another. The file comparison module 101 compares, in the index file 120, the file numbers corresponding to each split character, and finds the vocabulary files that contain all of the split characters. The position comparison module 102 then compares, through the index file 120, the position numbers of each character within those vocabulary files, finds the vocabulary files that match the relative positions of the individual characters in the word to be searched, and generates the search result list. The sorting module 103 sorts the search result list by degree of relevance, generally placing the most relevant entries first, so that the user can search further.
Please refer to Fig. 3, a flowchart of the method of the present invention for full-text search by comparing the positional relations of individual characters. The method is applied to a handheld data-processing device with a dictionary function; the device has a dictionary database, and the database contains a plurality of vocabulary files that explain the meanings of words.
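The three sub-modules can be sketched in Python as three small functions. This is a hedged illustration, not the patented implementation: the function names are hypothetical, "matching the relative positions" is interpreted as the characters appearing consecutively in query order, and relevance is assumed to be the number of such occurrences, since the text does not define how the degree of relevance is computed. The sample index reuses the positions 2 and 5 in file No. 15 from the description; all other positions are invented.

```python
def file_compare(index, chars):
    """File comparison: files whose index entries include every query character."""
    postings = [set(index.get(char, {})) for char in chars]
    return set.intersection(*postings) if postings else set()


def position_compare(index, chars, file_no):
    """Position comparison: start positions where the characters occur consecutively."""
    starts = None
    for offset, char in enumerate(chars):
        # Shift each position back by its offset in the query; a common value
        # across all characters marks the start of a consecutive match.
        shifted = {position - offset for position in index[char][file_no]}
        starts = shifted if starts is None else starts & shifted
    return sorted(starts or ())


def retrieve(index, chars):
    """Run both comparisons, then sort by match count (assumed relevance measure)."""
    hits = []
    for file_no in file_compare(index, chars):
        starts = position_compare(index, chars, file_no)
        if starts:
            hits.append((file_no, len(starts)))
    hits.sort(key=lambda hit: (-hit[1], hit[0]))
    return hits


# Hypothetical index in the shape {character: {file_no: [positions]}}.
index = {
    "中": {15: [2, 5], 20: [1], 70: [3], 100: [4]},
    "国": {15: [3, 6], 20: [4], 70: [2], 100: [1, 5]},
}
results = retrieve(index, ["中", "国"])
```

With this sample data, files No. 20 and 70 pass the file comparison but are rejected by the position comparison, because the two characters are not adjacent in query order there.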
First, the plurality of vocabulary files 110 stored in the dictionary database 100 are numbered so that each vocabulary file 110 has a file number (step 200). Each individual character contained in each vocabulary file 110 is then numbered so that each character has a position number (step 210). Once numbering is complete, an index file 120 is built to record the file numbers and position numbers corresponding to each individual character (step 220). The input module 130 reads the word to be searched entered by the user (step 230). The character-splitting module 140 receives this word and splits it into a plurality of individual characters (step 240). The retrieval module 150 then performs the full-text search on the split characters. First, the file comparison module 101 compares the file numbers corresponding to the split characters in the index file 120 to find the vocabulary files 110 that contain all of the split characters (step 250). Next, the position comparison module 102 compares the position numbers of the split characters within those vocabulary files 110 to find the vocabulary files 110 that match the relative positions of the individual characters in the word to be searched (step 260). The sorting module 103 then sorts by the degree of relevance of the vocabulary files 110 to generate the search result list, normally placing the most relevant first (step 270). Finally, the display module 160 (such as an LCD or OLED) presents the search results to the user (step 280), so that the user can select the vocabulary file 110 to consult.
The system and method of the present invention for full-text search by comparing the positional relations of individual characters can directly retrieve the words and short phrases the user wants to find without first decompressing the vocabulary files. This not only saves the time required for retrieval and speeds up data search, but also achieves accurate retrieval and avoids the irrelevant entries returned by the known method.
Although the present invention has been described in detail with the above preferred embodiment, the embodiment is not intended to limit the invention. Those skilled in the art should recognize that additions to the technical features, or substitutions with equivalents known in the art, made without departing from the technical features and scope of the technical solution of the present invention all fall within the scope of protection of the present invention.
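The point about avoiding decompression can be made concrete with a short Python sketch. It assumes, for illustration only, that each vocabulary file is compressed with zlib and that the index is built when the file is stored; the patent does not name a compression format. The class and method names are hypothetical, and `decompress_calls` exists only to show when decompression happens.

```python
import zlib


class DictionaryDatabase:
    """Compressed vocabulary files plus an uncompressed positional index."""

    def __init__(self, texts):
        self.compressed = {}
        self.index = {}            # {character: {file_no: [positions]}}
        self.decompress_calls = 0
        for file_no, text in enumerate(texts, start=1):
            self.compressed[file_no] = zlib.compress(text.encode("utf-8"))
            for position, char in enumerate(text, start=1):
                self.index.setdefault(char, {}).setdefault(file_no, []).append(position)

    def search(self, query):
        """Find files containing the query characters consecutively, using only the index."""
        chars = list(query)
        if not chars:
            return []
        hits = []
        for file_no, positions in self.index.get(chars[0], {}).items():
            for start in positions:
                if all(start + k in self.index.get(char, {}).get(file_no, ())
                       for k, char in enumerate(chars)):
                    hits.append(file_no)
                    break
        return sorted(hits)

    def open_entry(self, file_no):
        """Decompress one vocabulary file, only when the user selects it."""
        self.decompress_calls += 1
        return zlib.decompress(self.compressed[file_no]).decode("utf-8")


# Hypothetical sample vocabulary files.
db = DictionaryDatabase(["中国", "国中", "中等国家"])
found = db.search("中国")
```

In this sketch the search reads nothing but the index, so the compressed data is touched only when `open_entry` is called for the file the user chooses.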
Claims (4)
1. A system for full-text search by comparing the positional relations of individual characters, the system comprising the dictionary database of a handheld data-processing device, characterized in that the system further comprises:
An input module, with which the user enters a word to be searched;
A character-splitting module, which splits the received word to be searched into a plurality of individual characters;
A retrieval module, which, based on the individual characters produced by the character-splitting module, searches the index file for vocabulary files that match the positional relations of the individual characters in the word to be searched, and generates a search result list;
A display module, which displays the search result list;
The dictionary database is connected to the input module and to the retrieval module; the character-splitting module is connected between the input module and the retrieval module; the retrieval module is also connected to the display module; and the display module is connected to the input module.
2. The system for full-text search by comparing the positional relations of individual characters as claimed in claim 1, characterized in that the dictionary database stores a plurality of vocabulary files and an index file; each vocabulary file includes a file number, position numbers, and text data consisting of a plurality of individual characters; and the index file records the file numbers and position numbers corresponding to each individual character.
3. The system for full-text search by comparing the positional relations of individual characters as claimed in claim 1, characterized in that the retrieval module further includes a file comparison module, a position comparison module, and a sorting module, which are connected to one another;
The file comparison module compares, in the index file, the file numbers corresponding to the split individual characters, to find the vocabulary files that contain those characters;
The position comparison module compares the position numbers of the individual characters within the vocabulary files that contain them, to find the vocabulary files that match the relative positions of the individual characters in the word to be searched, and generates the search result list;
The sorting module sorts the search result list by degree of relevance.
4. A search method of the system for full-text search by comparing the positional relations of individual characters as claimed in claim 1, characterized in that it comprises the following steps:
First, the plurality of vocabulary files in the dictionary database are numbered so that each vocabulary file has a file number; each individual character in each vocabulary file is then numbered so that each individual character has a position number;
Next, an index file is built that records the file numbers and position numbers corresponding to each individual character, the file numbers linking to the vocabulary files;
A word to be searched is read and split into a plurality of individual characters, and the file numbers corresponding to the split characters are compared in the index file to find the vocabulary files that contain those characters;
The position numbers of the individual characters are compared within the vocabulary files that contain the split characters, to find the vocabulary files that match the relative positions of the individual characters in the word to be searched;
The comparison result is sorted to generate the search result list, which is displayed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200710018281 CN101089850A (en) | 2007-07-17 | 2007-07-17 | System for global search using comparison single work position relation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 200710018281 CN101089850A (en) | 2007-07-17 | 2007-07-17 | System for global search using comparison single work position relation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101089850A true CN101089850A (en) | 2007-12-19 |
Family
ID=38943210
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 200710018281 Pending CN101089850A (en) | 2007-07-17 | 2007-07-17 | System for global search using comparison single work position relation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101089850A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102810096A (en) * | 2011-06-02 | 2012-12-05 | 阿里巴巴集团控股有限公司 | Retrieval method and device based on separate character indexing system |
CN102043779B (en) * | 2009-10-20 | 2014-04-02 | 英业达股份有限公司 | Page type display system of search result of electronic dictionary and method thereof |
CN102004598B (en) * | 2009-09-01 | 2014-05-14 | 炬力集成电路设计有限公司 | Media player and character input method thereof |
-
2007
- 2007-07-17 CN CN 200710018281 patent/CN101089850A/en active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102004598B (en) * | 2009-09-01 | 2014-05-14 | 炬力集成电路设计有限公司 | Media player and character input method thereof |
CN102043779B (en) * | 2009-10-20 | 2014-04-02 | 英业达股份有限公司 | Page type display system of search result of electronic dictionary and method thereof |
CN102810096A (en) * | 2011-06-02 | 2012-12-05 | 阿里巴巴集团控股有限公司 | Retrieval method and device based on separate character indexing system |
CN102810096B (en) * | 2011-06-02 | 2016-03-16 | 阿里巴巴集团控股有限公司 | A kind of search method based on individual character directory system and device |
US9311389B2 (en) | 2011-06-02 | 2016-04-12 | Alibaba Group Holding Limited | Finding indexed documents |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10565273B2 (en) | Tenantization of search result ranking | |
CN107122400B (en) | Method, computing system and storage medium for refining query results using visual cues | |
Kaljuvee et al. | Efficient web form entry on pdas | |
CN101876878A (en) | Word prediction input system and method | |
CN100437585C (en) | Method for carrying out retrieval hint based on inverted list | |
US6735559B1 (en) | Electronic dictionary | |
US8099416B2 (en) | Generalized language independent index storage system and searching method | |
Kopparapu | Automatic extraction of usable information from unstructured resumes to aid search | |
CN101099129A (en) | Organizing pointers to objects | |
CN102725759A (en) | Semantic directory for search results | |
CN102782677B (en) | Use the improvement search of semantic key | |
CN101287206A (en) | Contact positioning method, system and mobile communication terminal | |
WO2011079415A1 (en) | Generating related input suggestions | |
CN102609452A (en) | Data storage method and data storage device | |
US20150127657A1 (en) | Method and Computer for Indexing and Searching Structures | |
CN106599153A (en) | Multi-data-source-based waste industry search system and method | |
CN102270243A (en) | Information search method and system | |
CN115391495B (en) | Method, device and equipment for searching keywords in Chinese context | |
CN111400323A (en) | Data retrieval method, system, device and storage medium | |
CN103235789B (en) | A kind of Chinese character is converted to the method for spelling and initial | |
US20090077031A1 (en) | System and method for creating full-text indexes of patent documents | |
CN101089850A (en) | System for global search using comparison single work position relation | |
CN103220387A (en) | Searching method and searching device for touch-screen phone | |
CN101446975B (en) | File location method and device | |
CN102546961A (en) | Contact lookup method and mobile terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |