CN1558344A - Code list data structure - Google Patents
- Publication number
- CN1558344A
- Authority
- CN
- China
- Prior art keywords
- multiword
- user
- data structure
- word
- index table
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention discloses a code table data structure that occupies little memory at run time and supports fast lookup. The code table data structure comprises six files: the first stores the system single-character table, the second stores the system multi-character table, the third stores the user multi-character table, the fourth stores the system single-character index table, the fifth stores the system multi-character index table, and the sixth stores the user multi-character index table. The invention can be used in mobile phones and other electronic products.
Description
Technical field
The present invention relates to a code table data structure.
Background technology
In daily life, electronic products such as computers and mobile phones have become indispensable tools. For Chinese users, Chinese character input is essential on all of these devices. In the existing code table data structure of handheld electronic devices (such as mobile phones), the Chinese characters are stored in memory as one monolithic character library. Because such a device has little memory of its own, when the user types, the CPU loads the entire character library into memory and stores it in stack fashion; on lookup, it traverses every character in the library to find the characters that match the code the user entered, so operating speed drops greatly. The existing code table data structure therefore occupies a large amount of memory at run time and is slow to search.
Summary of the invention
The technical problem to be solved by the present invention is to overcome the deficiencies of the prior art and provide a code table data structure that occupies little memory at run time and is fast to search.
The technical solution adopted by the present invention is as follows: the invention comprises two files, the first file storing the system single-character table and the second file storing the system multi-character table.
The contents of the system single-character table and the system multi-character table are fixed.
The invention further comprises a third file, which stores the user multi-character table; the content of the user multi-character table can be changed.
The system single-character table is divided into a high-frequency zone and a low-frequency zone.
The system multi-character table is divided into a system two-character zone, a system three-character zone, ... and a system N-character zone.
The user multi-character table is divided into a user two-character zone, a user three-character zone, ... and a user N-character zone.
The invention further comprises a fourth, a fifth, and a sixth file: the fourth file stores the system single-character index table, the fifth file stores the system multi-character index table, and the sixth file stores the user multi-character index table. The system single-character index table and the system multi-character index table are static index tables, and the user multi-character index table is a dynamic index table.
The system multi-character index table consists of a system two-character index table, a system three-character index table, ... and a system N-character index table.
The user multi-character index table consists of a user two-character index table, a user three-character index table, ... and a user N-character index table.
The beneficial effects of the invention are as follows. Because the invention divides the character library into a system single-character zone and a system multi-character zone, when the user enters a single character the CPU loads only the system single-character table into memory, and when the user enters a phrase the CPU loads only the system multi-character table, rather than loading the whole character library; memory use at run time is therefore small and lookup is fast. Because the invention also provides a user multi-character table in which the user's frequently used phrases can be listed, the CPU loads only the user multi-character table when the user types, which reduces memory use further and makes lookup faster. Because the invention also includes the system single-character index table, the system multi-character index table, and the user multi-character index table, which map each character or phrase to a corresponding index number, during typing the CPU looks up the index number from the code the user enters and then finds the required character or phrase from that index number, without traversing all entries on every lookup; lookup during typing is therefore faster still.
Description of drawings
Fig. 1 is a structural diagram of the present invention;
Fig. 2 is a flow chart of the use of the present invention.
Embodiment
As shown in Fig. 1, the invention comprises six files: the first file stores the system single-character table, the second file stores the system multi-character table, the third file stores the user multi-character table, the fourth file stores the system single-character index table, the fifth file stores the system multi-character index table, and the sixth file stores the user multi-character index table.
The system single-character table is divided into a high-frequency zone and a low-frequency zone.
The system multi-character table is divided into a system two-character zone, a system three-character zone, ... and a system N-character zone.
The user multi-character table is divided into a user two-character zone, a user three-character zone, ... and a user N-character zone.
The contents of the system single-character table and the system multi-character table are fixed, whereas the content of the user multi-character table can be changed as the user defines it. The system single-character index table and the system multi-character index table are static index tables, and the user multi-character index table is a dynamic index table.
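The six-file layout described above can be summarized in a short Python sketch. The class name `TableFile`, its field names, and the helper `files_to_load` are hypothetical and introduced only for illustration; the text numbers the files but does not specify names or on-disk formats.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TableFile:
    number: int                     # file number as used in the text (1..6)
    holds: str                      # what the file stores
    mutable: bool                   # True for user (changeable / dynamic) files
    index_of: Optional[int] = None  # for index files: the table file they index


# Layout taken from the description; everything else here is an assumption.
LAYOUT = [
    TableFile(1, "system single-character table", mutable=False),
    TableFile(2, "system multi-character table", mutable=False),
    TableFile(3, "user multi-character table", mutable=True),
    TableFile(4, "system single-character index table", mutable=False, index_of=1),
    TableFile(5, "system multi-character index table", mutable=False, index_of=2),
    TableFile(6, "user multi-character index table", mutable=True, index_of=3),
]


def files_to_load(table_number: int) -> list:
    """Return one table file plus its index file; the other four are not loaded."""
    return [f for f in LAYOUT
            if f.number == table_number or f.index_of == table_number]
```

For a lookup in the user multi-character table, `files_to_load(3)` selects only files 3 and 6, which is the memory-saving idea the description relies on.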
As shown in Fig. 2, during typing, when the user enters a Chinese character code, the CPU follows the code lookup algorithm: it first determines the zone and obtains the zone number, loads the system single-character table, the system multi-character table, or the user multi-character table into memory, then consults the index table to find the index number, looks up the entries corresponding to that index number, and displays the candidate list, from which the user selects the required entry.
Specific examples follow:
Single-character lookup
According to the code the user enters, the high-frequency zone of the single-character table is searched → matching high-frequency characters are found and stored in the returned candidate list; if the high-frequency zone does not contain the character sought → the single-character index table is searched → the index number is found → the low-frequency zone of the single-character table is searched according to the index number → matching characters are found and stored in the candidate list.
For example, take the character "王" (king): using the four-stroke Chinese character input method, the user presses the number keys "161"; the high-frequency zone of the single-character table is searched first for high-frequency characters matching 161; then, with "16" as the key, the index number is looked up in the single-character index table → the low-frequency zone of the single-character table is searched according to the index number for characters matching 161 → the characters found are stored in the candidate list.
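The single-character lookup can be sketched in Python as follows. This is a minimal illustration under several assumptions that the text does not state: the low-frequency zone is kept sorted by code, an "index number" is modeled as a (start, stop) range into that zone keyed by the first two digits of the code, and "matching" means prefix matching. The entries `hi-1`, `lo-2`, and so on are placeholders, not real table contents. The sketch follows the worked example (both zones are consulted); a fallback-only variant would return as soon as the high-frequency zone yields a match.

```python
# Placeholder data: (code, character) pairs; not taken from any real code table.
HIGH_FREQ = [("1610", "hi-1"), ("3381", "hi-2")]
LOW_FREQ = [("1211", "lo-1"), ("1612", "lo-2"),
            ("1620", "lo-3"), ("4836", "lo-4")]  # assumed sorted by code


def build_index(zone):
    """Map the first two digits of each code to a (start, stop) range in zone."""
    index = {}
    for pos, (code, _) in enumerate(zone):
        key = code[:2]
        start, _ = index.get(key, (pos, pos))
        index[key] = (start, pos + 1)
    return index


LOW_INDEX = build_index(LOW_FREQ)  # static index: built once, never changed


def lookup_single(code):
    """Return candidates: high-frequency matches first, then low-frequency ones."""
    # Step 1: scan the (small) high-frequency zone directly.
    candidates = [ch for c, ch in HIGH_FREQ if c.startswith(code)]
    # Step 2: use the first two digits as the key into the index table.
    span = LOW_INDEX.get(code[:2])
    if span is not None:
        start, stop = span
        # Step 3: search only the indexed range of the low-frequency zone.
        candidates += [ch for c, ch in LOW_FREQ[start:stop]
                       if c.startswith(code)]
    return candidates
```

With this data, entering "161" scans two low-frequency entries (those under key "16") instead of the whole zone.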
System multi-character table lookup
According to the code the user enters, the two-character zone of the system multi-character table is searched first: the index number of this code in the two-character zone is looked up in the system multi-character index table → the index number is found → entries are searched in the two-character zone of the system multi-character table according to the index number → matching entries are found and stored in the returned candidate list; the three-character zone, ... and the N-character zone are then searched in turn.
For example, take "Fan Yuyu": the user presses the number keys "643116"; the two-character zone of the system multi-character table is searched first → with "64" as the key, the index number of this code in the two-character zone is looked up in the system multi-character index table → entries matching "643116" are then searched in the two-character zone of the system multi-character table according to the index number → no entry is found → the three-character zone of the system multi-character table is searched next → the entry is found and stored in the candidate list → the remaining zones are searched in turn for phrases with the same code.
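The zone-by-zone phrase lookup can be sketched in Python as below. As before, this is an illustration under assumptions: each zone is a list sorted by code, each per-zone index maps the first two digits of a code to a (start, stop) range, and a phrase matches when its code equals the input. The function names are hypothetical, and the entries other than "Fan Yuyu" with code "643116" are placeholders.

```python
def build_zone_index(zone):
    """Map the first two digits of each code to a (start, stop) range in zone."""
    index = {}
    for pos, (code, _) in enumerate(zone):
        key = code[:2]
        start, _ = index.get(key, (pos, pos))
        index[key] = (start, pos + 1)
    return index


# Zones keyed by phrase length; each zone is assumed sorted by code.
SYSTEM_ZONES = {
    2: [("641234", "phrase-2a"), ("645678", "phrase-2b")],  # placeholders
    3: [("163348", "phrase-3a"), ("643116", "Fan Yuyu")],
}
# One index table per zone (two-character index, three-character index, ...).
SYSTEM_INDEX = {n: build_zone_index(z) for n, z in SYSTEM_ZONES.items()}


def lookup_phrase(code, zones=SYSTEM_ZONES, indexes=SYSTEM_INDEX):
    """Search the two-character zone first, then each longer zone in turn."""
    candidates = []
    for length in sorted(zones):
        span = indexes[length].get(code[:2])
        if span is None:
            continue  # this zone has no entry under the two-digit key
        start, stop = span
        candidates += [phrase for c, phrase in zones[length][start:stop]
                       if c == code]
    return candidates
```

Here `lookup_phrase("643116")` finds nothing in the two-character zone and then finds "Fan Yuyu" in the three-character zone, mirroring the example. The same function could serve the user multi-character table by passing the user's zones and dynamic indexes.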
User multi-character table operations
Storing an entry and generating the dynamic index: according to the entry the user inputs, the code of the entry is looked up in the code lookup file → the code of the entry is found → the zone in which the entry is to be stored is determined → the entry and its code are stored, in indexed fashion, in the designated zone of the user multi-character table → the user multi-character table is dynamically indexed.
For example, take "Xu Hao" (徐浩): the user inputs "Xu Hao" → the name of the code lookup file is determined from the first byte of the character "Xu" → the code "3381" of "Xu" is looked up in the code lookup file → the name of the code lookup file is then determined from the first byte of the character "Hao" → the code "4836" of "Hao" is looked up in the code lookup file, so the codes of the entry are found → the phrase code "338483" is generated according to the phrase coding rule → the entry and its code are stored, in indexed fashion, in the designated zone of the user multi-character table → the user multi-character table is dynamically indexed.
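The storing of a user entry can be sketched in Python as follows. Several points are assumptions rather than statements from the text. The phrase coding rule is not spelled out; the rule used here (three leading digits per character for two-character phrases, two per character for three-character phrases) is only inferred from the examples "338483", "643116", and "163348". The selection of a code lookup file by the first byte of each character is collapsed into a single dictionary. All function and variable names are hypothetical.

```python
import bisect

# Per-character codes; only these two are given in the text.
CHAR_CODES = {"徐": "3381", "浩": "4836"}

# Assumed rule inferred from the examples: digits taken from each character.
DIGITS_PER_CHAR = {2: 3, 3: 2}


def phrase_code(phrase):
    """Build the phrase code from the leading digits of each character's code."""
    per_char = DIGITS_PER_CHAR.get(len(phrase))
    if per_char is None:
        raise ValueError("coding rule for this phrase length is not given")
    return "".join(CHAR_CODES[ch][:per_char] for ch in phrase)


def build_zone_index(zone):
    """Map the first two digits of each code to a (start, stop) range in zone."""
    index = {}
    for pos, (code, _) in enumerate(zone):
        key = code[:2]
        start, _ = index.get(key, (pos, pos))
        index[key] = (start, pos + 1)
    return index


user_zones = {}  # phrase length -> sorted list of (code, phrase)
user_index = {}  # phrase length -> dynamic index for that zone


def add_user_entry(phrase):
    """Store a phrase in its zone and regenerate that zone's dynamic index."""
    code = phrase_code(phrase)
    zone = user_zones.setdefault(len(phrase), [])  # zone chosen by length
    bisect.insort(zone, (code, phrase))            # keep the zone sorted
    user_index[len(phrase)] = build_zone_index(zone)
    return code
```

Calling `add_user_entry("徐浩")` yields the code "338483" from the example and places the entry in the two-character zone. Rebuilding the whole zone index on each insert is the simplest choice for a sketch; an implementation on a memory-constrained device might instead shift only the affected ranges.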
Lookup during typing: according to the code entered, the two-character zone of the user multi-character table is searched first: the index number of this code in the two-character zone is looked up in the dynamic index table of the user multi-character table → the index number is found → entries are searched in the two-character zone of the user multi-character table according to the index number → matching entries are found and stored in the returned candidate list; the three-character zone, ... and the N-character zone are then searched in turn.
For example, take "Wang Yansong": the user presses the number keys "163348"; the two-character zone of the user multi-character table is searched first: with "16" as the key, the index number of this code in the two-character zone is looked up in the dynamic index table of the user multi-character table → entries matching "163348" are then searched in the two-character zone of the user multi-character table according to the index number → no entry is found → the three-character zone of the user multi-character table is searched next → the entry is found and stored in the candidate list → the remaining zones are searched in turn for phrases with the same code.
Claims (9)
1. A code table data structure, characterized in that it comprises two files, the first file storing a system single-character table and the second file storing a system multi-character table.
2. The code table data structure according to claim 1, characterized in that the contents of the system single-character table and the system multi-character table are fixed.
3. The code table data structure according to claim 1 or 2, characterized in that it further comprises a third file, the third file storing a user multi-character table, the content of the user multi-character table being changeable.
4. The code table data structure according to claim 3, characterized in that the system single-character table is divided into a high-frequency zone and a low-frequency zone.
5. The code table data structure according to claim 4, characterized in that the system multi-character table is divided into a system two-character zone, a system three-character zone, ... and a system N-character zone.
6. The code table data structure according to claim 5, characterized in that the user multi-character table is divided into a user two-character zone, a user three-character zone, ... and a user N-character zone.
7. The code table data structure according to claim 6, characterized in that it further comprises a fourth, a fifth, and a sixth file, the fourth file storing a system single-character index table, the fifth file storing a system multi-character index table, and the sixth file storing a user multi-character index table, the system single-character index table and the system multi-character index table being static index tables and the user multi-character index table being a dynamic index table.
8. The code table data structure according to claim 7, characterized in that the system multi-character index table consists of a system two-character index table, a system three-character index table, ... and a system N-character index table.
9. The code table data structure according to claim 8, characterized in that the user multi-character index table consists of a user two-character index table, a user three-character index table, ... and a user N-character index table.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2004100152462A CN1558344A (en) | 2004-01-21 | 2004-01-21 | Code list data structure |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1558344A true CN1558344A (en) | 2004-12-29 |
Family
ID=34351386
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004100152462A Pending CN1558344A (en) | 2004-01-21 | 2004-01-21 | Code list data structure |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1558344A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102609510A (en) * | 2012-02-06 | 2012-07-25 | 中国农业银行股份有限公司 | Chinese name data processing method and device |
CN102609510B (en) * | 2012-02-06 | 2014-05-28 | 中国农业银行股份有限公司 | Chinese name data processing method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1558344A (en) | Code list data structure | |
CN1801152A (en) | Multi-keyword matching method for text or network content analysis | |
CN116301656A (en) | Data storage method, system and equipment based on log structure merging tree | |
Wang et al. | Incremental web page template detection | |
CN1841281A (en) | Chinese phonetic transcription input method using digital code for mobile phone | |
CN1349157A (en) | Digital configuration code Chinese character input method | |
CN1822001A (en) | Single word searching method for hand held data processor | |
CN1700156A (en) | Method for linking phrases in Chinese character input method | |
CN1141634C (en) | Chinese character search and input stroke coding | |
CN1234061C (en) | General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard | |
CN1109957C (en) | Chinese character digital coding input method based on Chinese character basic elements and normal parts | |
CN1402114A (en) | Rotor type phonetic Chinese character input method and keyboard | |
CN1254136A (en) | Method for inquiring about index multi-media header data and its device | |
CN1744014A (en) | Digital two-stroke and Chinese character input method and key board | |
CN1243300C (en) | Three-stroke digital code Chinese character input method in computer | |
CN1425975A (en) | Stroke digital Chinese character input method | |
CN1534444A (en) | <<Chinese character structure> input method> | |
CN1102716A (en) | Method for putting Chinese character into computer by using numerals | |
CN1204799A (en) | Coding method of Chinese character unit stroke numbers | |
CN1851707A (en) | Full-text inquiry method in hand-held data processing device | |
CN1141632C (en) | Chinese character two-bit digital code input method | |
CN1121007C (en) | Chinese-character five tones-digital code input method and keyboard | |
CN1428683A (en) | Modern Chinese character four-code direct input and direct detection method | |
CN1344985A (en) | Digital-key Chinese character input method | |
CN1435745A (en) | Left and right digit Chinese character computer input method and keyboard thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |