CN101089850A - System for global search using comparison single work position relation - Google Patents

System for global search using comparison single work position relation

Info

Publication number
CN101089850A
Authority
CN
China
Prior art keywords
vocabulary
file
module
individual character
retrieval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200710018281
Other languages
Chinese (zh)
Inventor
陈淮琰
李重锦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Besta Xian Co Ltd
Original Assignee
Inventec Besta Xian Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2007-07-17
Filing date
2007-07-17
Publication date
2007-12-19
Application filed by Inventec Besta Xian Co Ltd
Priority to CN 200710018281
Publication of CN101089850A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A full-text search system that compares the positional relationships of individual characters. A dictionary database is connected to both an input module and a retrieval module; a character-splitting module is connected between the input module and the retrieval module; the retrieval module is connected to a display module; and the display module is connected to the input module.

Description

System and method for full-text search by comparing single-character positional relationships
Technical field
The present invention relates to a system and method for full-text search, and in particular to a system and method for full-text search by comparing single-character positional relationships, applicable to handheld data processing devices with a dictionary function.
Background art
In existing electronic dictionaries, the retrieval of individual characters or vocabulary broadly works in the following way.
In the first mode, every document in the dictionary is first given a data number, and an index file is then built from the data numbers in which each individual character appears. The index file records the correspondence between each individual character and the data numbers. When the user enters a vocabulary item to be retrieved, the search consults the index file, judges every document that contains each individual character of the vocabulary item to be relevant, and lists all of them in sorted order. For example, when the user searches for "China" (中国), the results also include entries unrelated to the query, such as "in the state" and "country in the exploitation". In other words, any document containing both the character "中" ("in") and the character "国" ("state") is judged highly relevant and listed in the search results. In addition, data in electronic dictionary systems is usually stored in compressed form. To check whether the listed documents really match the intended search condition, the user must tap each document with the stylus to decompress its content and then inspect the documents one by one. This not only slows down full-text search but also places an extra burden on the user, and it fails to satisfy the user's need for fast, time-saving, and accurate full-text search.
Known electronic-dictionary full-text search thus tends to produce results that are not directly relevant. The user must read through the listed results with the stylus, and because dictionary databases mostly store data in compressed form, extra time must be spent decompressing the data before its content can be read. Known electronic-dictionary retrieval therefore cannot meet the user's need for accurate retrieval, and it costs considerably more time.
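The weakness of the known index can be illustrated with a minimal sketch. The function names and the three sample entries below are hypothetical and are not taken from the patent; the sketch only assumes an index that maps each character to the data numbers containing it, with no position information.

```python
from collections import defaultdict


def build_naive_index(entries):
    """Map each character to the set of data numbers whose text contains it."""
    index = defaultdict(set)
    for data_no, text in entries.items():
        for ch in text:
            index[ch].add(data_no)
    return index


def naive_search(index, query):
    """Return data numbers that contain every query character, in any order."""
    postings = [index.get(ch, set()) for ch in query]
    return sorted(set.intersection(*postings)) if postings else []


# Hypothetical sample entries, chosen only to show the false positives.
entries = {1: "中国", 2: "国中", 3: "中立国"}
naive_index = build_naive_index(entries)
naive_hits = naive_search(naive_index, "中国")
print(naive_hits)
```

All three entries are returned, although only entry 1 contains the two characters next to each other in query order. Without position information, the only way to tell them apart is to open (and decompress) each entry.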
Summary of the invention
In view of the defects or shortcomings of the above prior art, the object of the present invention is to propose a system and method for full-text search by comparing single-character positional relationships, particularly suitable for handheld data processing devices with a dictionary function.
To achieve the above object, the present invention adopts the following technical solution:
A system for full-text search by comparing single-character positional relationships, the system comprising the dictionary database of a handheld data processing device, characterized in that the system further comprises:
An input module, for the user to enter the vocabulary to be retrieved;
A character-splitting module, for splitting the received vocabulary to be retrieved into a plurality of individual characters;
A retrieval module, for retrieving from the index file, according to the individual characters produced by the character-splitting module, the vocabulary files that satisfy the positional relationship of each individual character in the vocabulary to be retrieved, and for generating a search result list;
A display module, for displaying the search result list;
The dictionary database is connected to the input module and to the retrieval module; the character-splitting module is connected between the input module and the retrieval module; the retrieval module is also connected to the display module; and the display module is connected to the input module.
The dictionary database stores a plurality of vocabulary files and an index file. Each vocabulary file includes a file number, position numbers, and text data made up of a plurality of individual characters, and the index file records the file numbers and position numbers corresponding to each individual character.
The retrieval module further includes a file comparison module, a position comparison module, and a sorting module, which are connected to one another;
The file comparison module compares, in the index file, the file numbers corresponding to the split characters, to find the vocabulary files that contain the split characters;
The position comparison module compares the position numbers of the individual characters within the vocabulary files that contain the split characters, to find the vocabulary files that satisfy the relative position of each individual character in the vocabulary to be retrieved, and generates a search result list;
The sorting module sorts the search result list by degree of relevance.
The retrieval method of the above system for full-text search by comparing single-character positional relationships is characterized in that it comprises the following steps:
First, the plurality of vocabulary files in the dictionary database are numbered, so that each vocabulary file has a file number; each individual character in each vocabulary file is then numbered, so that each individual character has a position number;
Next, an index file is built, which records the file numbers and position numbers corresponding to each individual character; the file number links to the vocabulary file;
A vocabulary item to be retrieved is read and split into a plurality of individual characters, and the file numbers corresponding to the split characters are compared in the index file to find the vocabulary files that contain the split characters;
Within the vocabulary files that contain the split characters, the position numbers of the individual characters are compared to find the vocabulary files that satisfy the relative position of each individual character in the vocabulary to be retrieved;
The comparison result is turned into a sorted search result list and displayed.
The system and method of the present invention, which use the relative positions of individual characters for full-text search, do not need to decompress each document in the dictionary database before searching, which greatly reduces the time the user spends on retrieval. Using the relative positions of individual characters as the criterion for data search also avoids unrelated search results, improves the efficiency of full-text search, and satisfies the user's need for accurate retrieval.
Description of drawings
Fig. 1 is a schematic diagram of the system of the present invention for full-text search by comparing single-character positional relationships;
Fig. 2 is a schematic diagram of the index file of the present invention;
Fig. 3 is a flowchart of the method of the present invention for full-text search by comparing single-character positional relationships;
Fig. 4 is a schematic diagram of the retrieval module of the present invention.
Reference numerals in the figures: 100, dictionary database; 101, file comparison module; 102, position comparison module; 103, sorting module; 110, vocabulary file; 120, index file; 130, input module; 140, character-splitting module; 150, retrieval module; 160, display module.
To give a further understanding of the purpose, structure, features, and functions of the present invention, the invention is described in more detail below with reference to the accompanying drawings and the embodiment provided by the applicant. The description is sufficient for those skilled in the art to understand the technical content of the invention and its technical effects, and to implement the invention accordingly.
Embodiment
Refer to Fig. 1, a schematic diagram of the system of the present invention for full-text search by comparing single-character positional relationships. As shown, the system is suited to a handheld data processing device with a dictionary function, and comprises a dictionary database 100, an input module 130, a character-splitting module 140, a retrieval module 150, and a display module 160. The dictionary database 100 stores a plurality of vocabulary files 110 and an index file 120. When a vocabulary file 110 is stored in the dictionary database 100, it is first assigned a file number, so that every vocabulary file 110 has a file number through which it can be reached. In addition, each individual character in a vocabulary file 110 has a position number, which records the character's position within that vocabulary file 110. Each vocabulary file 110 in the dictionary database therefore includes a file number, position numbers, and text data made up of a plurality of individual characters. This text data is usually stored in the dictionary database 100 in compressed form, and the individual characters may be Chinese characters or English words. As shown in Fig. 2, the index file 120 records the file numbers and position numbers corresponding to each individual character. For example, the character "中" ("in") appears in vocabulary files No. 15, 20, 70, and 100, and its positions in vocabulary file No. 15 are 2 and 5 (this is only an example of the embodiment and does not limit the application of the present invention).
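One possible in-memory shape for the index file 120 is a nested mapping from character to file number to a list of position numbers. The sketch below is only an illustration: the function name, the sample texts, and the choice to count positions from 1 are assumptions, since the patent gives the example numbers for Fig. 2 but not a storage layout.

```python
from collections import defaultdict


def build_positional_index(vocabulary_files):
    """Build {character: {file_number: [position, ...]}} from {file_number: text}."""
    index = defaultdict(lambda: defaultdict(list))
    for file_no, text in vocabulary_files.items():
        # Positions are counted from 1 here; this starting point is an assumption.
        for position, ch in enumerate(text, start=1):
            index[ch][file_no].append(position)
    return index


# Hypothetical texts, picked so that the character appears in files 15, 20, 70
# and 100, and at positions 2 and 5 of file 15, as in the example above.
vocabulary_files = {
    15: "在中国的中部",
    20: "中文",
    70: "其中",
    100: "中间",
}
positional_index = build_positional_index(vocabulary_files)
print(sorted(positional_index["中"]))
print(positional_index["中"][15])
```

Keeping the position lists inside the index is what later allows position comparison without reading the vocabulary files themselves.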
The input module 130 provides the user with an input interface, which may take the form of a keyboard or a touch panel, so that the user can enter the vocabulary item or short phrase to be looked up.
The character-splitting module 140 is connected to the input module 130 and splits the vocabulary item or short phrase entered by the user into a plurality of independent individual characters.
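A minimal sketch of what module 140 might do is shown below. The patent states that the individual units may be Chinese characters or English words but does not say how they are separated, so the rule used here (a run of Latin letters is one unit, any other non-space character is its own unit) and the function name are assumptions.

```python
import re

# A run of Latin letters is kept together as one unit; every other
# non-whitespace character becomes a unit of its own.
_UNIT = re.compile(r"[A-Za-z]+|\S")


def split_into_units(query):
    """Split a query word or short phrase into individual search units."""
    return _UNIT.findall(query)


print(split_into_units("中国"))
print(split_into_units("full text"))
```

A real device would likely need extra rules for digits, punctuation, and case folding; those are left out to keep the sketch short.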
The retrieval module 150 is connected to the character-splitting module 140 and to the dictionary database 100. Using the individual characters produced by the character-splitting module 140, the retrieval module 150 retrieves from the index file 120 the vocabulary files 110 that satisfy the condition of the vocabulary to be retrieved, and generates a search result list from the retrieved results.
The display module 160 is connected to the retrieval module 150 and displays the search result list, so that the user can see the outcome of the query and then select, through the input module 130, the vocabulary file 110 to be consulted.
In addition, the retrieval module 150 of the system also comprises a file comparison module 101, a position comparison module 102, and a sorting module 103, which are connected to one another. The file comparison module 101 compares, in the index file 120, the file numbers corresponding to each split character, and finds the vocabulary files that contain all of the split characters. The position comparison module 102 then takes the vocabulary files that contain all of the split characters, compares the position numbers of each character through the index file 120, finds the vocabulary files that satisfy the relative position of each individual character in the vocabulary to be retrieved, and generates the search result list. The sorting module 103 sorts the search result list by degree of relevance, generally placing the more relevant data first, so that the user can refine the search further.
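The three stages of the retrieval module (101, 102, 103) can be sketched as three small functions. Two points are assumptions rather than statements from the patent: "relative position" is read here as the characters occurring at consecutive positions in query order, and "degree of relevance" is taken to be the number of such occurrences in a file. The function names and the sample index are hypothetical.

```python
def compare_files(index, chars):
    """File comparison (101): file numbers that contain every split character."""
    postings = [set(index.get(ch, {})) for ch in chars]
    return set.intersection(*postings) if postings else set()


def compare_positions(index, chars, candidates):
    """Position comparison (102): count, per candidate file, the start positions
    at which the characters occur consecutively in query order."""
    matches = {}
    for file_no in candidates:
        count = sum(
            all(start + offset in index[ch][file_no]
                for offset, ch in enumerate(chars))
            for start in index[chars[0]][file_no]
        )
        if count:
            matches[file_no] = count
    return matches


def sort_results(matches):
    """Sorting (103): most occurrences first, ties broken by file number."""
    return sorted(matches, key=lambda file_no: (-matches[file_no], file_no))


# Hypothetical index: {character: {file_number: [positions]}}.
sample_index = {
    "中": {1: [1], 2: [2], 3: [1, 4]},
    "国": {1: [2], 2: [1], 3: [2, 5]},
}
chars = ["中", "国"]
candidates = compare_files(sample_index, chars)
matches = compare_positions(sample_index, chars, candidates)
ranked = sort_results(matches)
print(ranked)
```

File 2 passes the file comparison but is dropped by the position comparison, because its two characters appear in the reverse order.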
Refer to Fig. 3, a flowchart of the method of the present invention for full-text search by comparing single-character positional relationships. The method is applied to a handheld data processing device with a dictionary function; the device has a dictionary database, and the database contains a plurality of vocabulary files that explain the meaning of vocabulary items.
First, the plurality of vocabulary files 110 stored in the dictionary database 100 are numbered, so that each vocabulary file 110 has a file number (step 200). Each individual character contained in each vocabulary file 110 is then numbered, so that each individual character has a position number (step 210). Once numbering is complete, an index file 120 is built to record the file numbers and position numbers corresponding to each individual character (step 220). The input module 130 reads the vocabulary to be retrieved that the user has entered (step 230). The character-splitting module 140 receives this vocabulary and splits it into a plurality of individual characters (step 240). The retrieval module 150 then carries out the full-text search using the split characters. First, the file comparison module 101 compares, in the index file 120, the file numbers corresponding to the split characters, to find the vocabulary files 110 that contain all of the split characters (step 250). Next, the position comparison module 102 compares the position numbers of each split character within the vocabulary files 110 that contain all of the split characters, to find the vocabulary files 110 that satisfy the relative position of each individual character in the vocabulary to be retrieved (step 260). The sorting module 103 then generates the search result list in sorted order according to the degree of relevance of the vocabulary files 110 (step 270), usually placing the most relevant first. Finally, the display module 160 (such as an LCD or OLED) presents the search results to the user (step 280), so that the user can go on to select the vocabulary file 110 to be consulted.
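Steps 200 to 280 can be strung together in a short end-to-end sketch. As before, this is an illustration under assumptions: positions start at 1, "relative position" is read as consecutive occurrence in query order, relevance is the number of occurrences, and all names and sample texts are hypothetical.

```python
from collections import defaultdict


def build_index(texts):
    """Steps 200-220: number the files, number the characters, build the index."""
    files = dict(enumerate(texts, start=1))           # step 200: file numbers
    index = defaultdict(dict)
    for file_no, text in files.items():
        for pos, ch in enumerate(text, start=1):      # step 210: position numbers
            index[ch].setdefault(file_no, set()).add(pos)  # step 220: record both
    return files, index


def search(index, query):
    """Steps 240-270: split, compare files, compare positions, sort."""
    chars = [ch for ch in query if not ch.isspace()]  # step 240
    if not chars:
        return []
    candidates = set(index.get(chars[0], {}))         # step 250
    for ch in chars[1:]:
        candidates &= set(index.get(ch, {}))
    scored = []
    for file_no in candidates:                        # step 260
        hits = sum(
            all(start + i in index[ch][file_no] for i, ch in enumerate(chars))
            for start in index[chars[0]][file_no]
        )
        if hits:
            scored.append((-hits, file_no))
    return [file_no for _, file_no in sorted(scored)]  # step 270


files, index = build_index(["中国", "国中", "中国和中国人", "美国"])
results = search(index, "中国")                        # step 230 supplies the query
for file_no in results:                                # step 280: show the list
    print(file_no, files[file_no])
```

The index is built once, when the files are stored, so each query only performs lookups and integer comparisons.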
The system and method of the present invention for full-text search by comparing single-character positional relationships can directly retrieve the vocabulary items and short phrases the user wants to find without first decompressing the vocabulary files. This not only saves the time needed for retrieval and speeds up data search, but also achieves accurate retrieval and avoids the unrelated documents returned by the known approach.
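The point about decompression can be made concrete with a small sketch in which the text is stored compressed while the index is kept alongside it. The patent does not name a compression format; `zlib` is used here only as a stand-in, and the class, its methods, and the call counter are hypothetical.

```python
import zlib
from collections import defaultdict


class CompressedDictionary:
    """Stores text compressed and answers queries from the index alone."""

    def __init__(self):
        self.blobs = {}                                   # file number -> compressed text
        self.index = defaultdict(lambda: defaultdict(set))
        self.decompress_calls = 0                         # counts how often text is opened

    def add(self, file_no, text):
        for pos, ch in enumerate(text, start=1):
            self.index[ch][file_no].add(pos)
        self.blobs[file_no] = zlib.compress(text.encode("utf-8"))

    def search(self, query):
        """Find files where the query characters occur consecutively, in order."""
        chars = list(query)
        if not chars or any(ch not in self.index for ch in chars):
            return []
        hits = []
        for file_no, starts in self.index[chars[0]].items():
            if any(
                all(s + i in self.index[ch].get(file_no, ()) for i, ch in enumerate(chars))
                for s in starts
            ):
                hits.append(file_no)
        return sorted(hits)

    def open(self, file_no):
        """Decompress one file, only when the user selects it."""
        self.decompress_calls += 1
        return zlib.decompress(self.blobs[file_no]).decode("utf-8")


dictionary = CompressedDictionary()
dictionary.add(1, "中国")
dictionary.add(2, "国中")
found = dictionary.search("中国")
calls_after_search = dictionary.decompress_calls
opened = dictionary.open(found[0])
print(found, calls_after_search, opened)
```

The search itself never calls `zlib.decompress`; decompression happens once, for the single entry the user chooses to read.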
Although the present invention has been described in detail through the above preferred embodiment, the embodiment is not intended to limit the invention. Those skilled in the art should recognize that additions to the technical features, or substitutions with equivalents known in the art, made without departing from the technical features and scope of the technical solution of the invention, all fall within the scope of protection of the present invention.

Claims (4)

1. A system for full-text search by comparing single-character positional relationships, the system comprising the dictionary database of a handheld data processing device, characterized in that the system further comprises:
An input module, for the user to enter the vocabulary to be retrieved;
A character-splitting module, for splitting the received vocabulary to be retrieved into a plurality of individual characters;
A retrieval module, for retrieving from the index file, according to the individual characters produced by the character-splitting module, the vocabulary files that satisfy the positional relationship of each individual character in the vocabulary to be retrieved, and for generating a search result list;
A display module, for displaying the search result list;
The dictionary database is connected to the input module and to the retrieval module; the character-splitting module is connected between the input module and the retrieval module; the retrieval module is also connected to the display module; and the display module is connected to the input module.
2. The system for full-text search by comparing single-character positional relationships as claimed in claim 1, characterized in that the dictionary database stores a plurality of vocabulary files and an index file, each vocabulary file includes a file number, position numbers, and text data made up of a plurality of individual characters, and the index file records the file numbers and position numbers corresponding to each individual character.
3. The system for full-text search by comparing single-character positional relationships as claimed in claim 1, characterized in that the retrieval module further includes a file comparison module, a position comparison module, and a sorting module, which are connected to one another;
The file comparison module compares, in the index file, the file numbers corresponding to the split characters, to find the vocabulary files that contain the split characters;
The position comparison module compares the position numbers of the individual characters within the vocabulary files that contain the split characters, to find the vocabulary files that satisfy the relative position of each individual character in the vocabulary to be retrieved, and generates a search result list;
The sorting module sorts the search result list by degree of relevance.
4. A retrieval method of the system for full-text search by comparing single-character positional relationships as claimed in claim 1, characterized in that it comprises the following steps:
First, the plurality of vocabulary files in the dictionary database are numbered, so that each vocabulary file has a file number; each individual character in each vocabulary file is then numbered, so that each individual character has a position number;
Next, an index file is built, which records the file numbers and position numbers corresponding to each individual character; the file number links to the vocabulary file;
A vocabulary item to be retrieved is read and split into a plurality of individual characters, and the file numbers corresponding to the split characters are compared in the index file to find the vocabulary files that contain the split characters;
Within the vocabulary files that contain the split characters, the position numbers of the individual characters are compared to find the vocabulary files that satisfy the relative position of each individual character in the vocabulary to be retrieved;
The comparison result is turned into a sorted search result list and displayed.
CN 200710018281 2007-07-17 2007-07-17 System for global search using comparison single work position relation Pending CN101089850A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200710018281 CN101089850A (en) 2007-07-17 2007-07-17 System for global search using comparison single work position relation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200710018281 CN101089850A (en) 2007-07-17 2007-07-17 System for global search using comparison single work position relation

Publications (1)

Publication Number Publication Date
CN101089850A true CN101089850A (en) 2007-12-19

Family

ID=38943210

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200710018281 Pending CN101089850A (en) 2007-07-17 2007-07-17 System for global search using comparison single work position relation

Country Status (1)

Country Link
CN (1) CN101089850A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102004598B (en) * 2009-09-01 2014-05-14 炬力集成电路设计有限公司 Media player and character input method thereof
CN102043779B (en) * 2009-10-20 2014-04-02 英业达股份有限公司 Page type display system of search result of electronic dictionary and method thereof
CN102810096A (en) * 2011-06-02 2012-12-05 阿里巴巴集团控股有限公司 Retrieval method and device based on separate character indexing system
CN102810096B (en) * 2011-06-02 2016-03-16 阿里巴巴集团控股有限公司 A kind of search method based on individual character directory system and device
US9311389B2 (en) 2011-06-02 2016-04-12 Alibaba Group Holding Limited Finding indexed documents


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication