CN1558344A - Code list data structure - Google Patents

Code list data structure Download PDF

Info

Publication number
CN1558344A
CN1558344A CNA2004100152462A CN200410015246A CN1558344A CN 1558344 A CN1558344 A CN 1558344A CN A2004100152462 A CNA2004100152462 A CN A2004100152462A CN 200410015246 A CN200410015246 A CN 200410015246A CN 1558344 A CN1558344 A CN 1558344A
Authority
CN
China
Prior art keywords
multiword
user
data structure
word
concordance list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2004100152462A
Other languages
Chinese (zh)
Inventor
王衍淞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNA2004100152462A priority Critical patent/CN1558344A/en
Publication of CN1558344A publication Critical patent/CN1558344A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention discloses one kind of code table data structure and aims at providing one kind of code table data structure with less memory occupied during running and fast looking up speed. The code table data structure includes six files, including the first one for saving system single character table, the second one for saving system multiple character table, the third one for saving user's multiple character table, the fourth one for saving system single character index table, the fifth one for saving system multiple character index table, and the sixth one for saving user's multiple character index table. The present invention may be used in cellphone and other electronic products.

Description

Code table data structure
Technical field
The present invention relates to a kind of code table data structure.
Background technology
In current life, electronic products such as computing machine, mobile phone have become the indispensable instrument of people.For all electronic tools, the input of Chinese character all is absolutely necessary for Chinese.Code table data structure in hand-hold electronic equipments (as mobile phone), be that Chinese character is deposited in the storer as a whole character library, because itself internal memory is little, when the user in when typewriting, CPU just calls in whole character library internal memory and stores in the mode of storehouse, when searching, according to the coding of user's input, required word searched in all words of traversal character library, so travelling speed can reduce greatly.Therefore, committed memory is big when operation, seek rate is slow for existing code table data structure.
Summary of the invention
Technical matters to be solved by this invention is to overcome the deficiencies in the prior art, the code table data structure that committed memory was little when a kind of the operation was provided, seek rate is fast.
The technical solution adopted in the present invention is: the present invention includes two files, first document storage system word list, second document storage system multiword table.
The content of described system word list and described system multiword table is fixed.
The present invention also comprises the 3rd file, described the 3rd document storage user multiword table, and the content of described user's multiword table is changeable.
Described system word list is divided into high frequency region and low frequency range.
Described system multiword table is divided into system two blocks, system three blocks ... system N block.
Described user's multiword table is divided into user two blocks, user three blocks ... user N block.
The present invention also comprises the 4th, the 5th, the 6th file, described the 4th document storage system individual character concordance list, described the 5th document storage system multiword concordance list, described the 6th document storage user multiword concordance list, described system individual character concordance list and described system multiword concordance list are the static index table, and described user's multiword concordance list is the dynamic index table.
Described system multiword concordance list is by system's two word indexing tables, system's three word indexing tables ... system N word indexing table is formed.
Described user's multiword concordance list is by user's two word indexing tables, user's three word indexing tables ... user N word indexing table is formed.
The invention has the beneficial effects as follows: because the present invention is divided into system individual character district and system multiword district with character library, when typewriting, when the user imports individual character, CPU calls in internal memory with system's word list, when the user imported phrase, CPU only needed system's multiword table is called in internal memory, and will all character libraries not call in internal memory, committed memory is little when therefore moving, and seek rate is also fast; Because the present invention also is provided with user's multiword table, the speech tabulation that the user can be used always, when the user typewrited, CPU only called in internal memory with user's multiword table, and committed memory is littler in the time of therefore can making operation, and seek rate is faster; Because the present invention also comprises system's individual character concordance list, system's multiword concordance list and user's multiword concordance list, it is with each word, the corresponding corresponding call number of speech, during typewriting, CPU is according to the corresponding call number of coding lookup of user's input, search required word, speech according to the call number correspondence again, and all travel through all words when not needing to search at every turn, therefore make the typewriting seek rate faster.
Description of drawings
Fig. 1 is a structural representation of the present invention;
Fig. 2 is a use schematic flow sheet of the present invention.
Embodiment
As shown in Figure 1, the present invention includes six files, first document storage system word list, second document storage system multiword table, the 3rd document storage user multiword table, the 4th document storage system individual character concordance list, the 5th document storage system multiword concordance list, the 6th document storage user multiword concordance list.
Described system word list is divided into high frequency region and low frequency range.
Described system multiword table is divided into system two blocks, system three blocks ... system N block.
Described user's multiword table is divided into user two blocks, user three blocks ... user N block.
The content of described system word list and described system multiword table is fixed, the content of described user's multiword table is self-defined changeable according to the user, described system individual character concordance list and described system multiword concordance list are the static index table, and described user's multiword concordance list is the dynamic index table.
As shown in Figure 2, when typewriting, when the user imports encode Chinese characters for computer, CPU just can be according to the coding lookup algorithm, judge that at first the zone gets area code, system's word list or system's multiword table or user's multiword table are called in internal memory, get concordance list again and search call number, search entry and show the entry tabulation according to the call number correspondence, the user just can select required entry according to the entry tabulation.
Specify below by example:
Individual character is searched
According to the coding of user's input, the high frequency region in word list is searched--→ find satisfactory high frequency word and deposit the entry tabulation of returning in; If high frequency region do not have the word that will search--→ search to the individual character concordance list--→ find call number--→ search to the low frequency range of word list according to call number--→ find satisfactory individual character and deposit entry in and tabulate.
As: with king's word is example, with the four-stroke Chinese character input method, the user presses numerical key " 161 ", the high frequency region that arrives first word list is searched and is met 161 high frequency word, is that key word is searched call number in the individual character concordance list with " 16 " again--→ search at the low frequency range of word list according to call number then meet 161 word--→ deposit the word that finds in the entry tabulation.
System's multiword table is searched
Coding according to user's input, search two words districts in system's multiword table earlier: in system's multiword concordance list, search the call number of this coding in two words districts-→ find call number-→ search entry according to call number two words districts in system's multiword table--→ find satisfactory entry and deposit the entry tabulation of returning in, and then search three words districts successively ... N words district.
As: with " Fan Yuyu " is example, the user presses numerical key " 643116 ", searches two words districts in system's multiword table earlier--→ with " 64 " call number that to be key word search two these codings of words district in system's multiword concordance list--→ search the entry that meets " 643116 " according to call number two words districts in system's multiword table then--→ do not find entry--→ search three words districts in system's multiword table again--→ find entry and deposit the entry tabulation in--→ search the phrase that each district sees if there is same-code successively.
The table handling of user's multiword
Deposit entry in and produce dynamic index:, in looking into coded file, search the coding of this entry according to the entry of user's input--→ find the coding of entry--→ determine the zone that entry deposits in--→ entry and coding are deposited in by indexed mode the appointed area of user's multiword table--→ user's multiword table is carried out dynamic index.
As: take " Xu Hao " as example, the user inputs " Xu Hao "--→ determine to look into the coded file name according to first byte of Chinese character " slowly "--→ in looking into coded file, search the coding " 3381 " of " slowly "--→ determine to look into the coded file name according to first byte of Chinese character " great " again--→ the coding " 4836 " of searching " great " in looking into coded file finds the coding of entry--→ coding rule according to phrase generates phrase coding " 338483 " again--→ entry and coding are deposited in by indexed mode the appointed area of user's multiword table--→ user's multiword table is carried out dynamic index.
Search during typewriting: according to the coding of input, search two words districts in user's multiword table earlier: in the dynamic index table of user's multiword table, search the call number of this coding in two words districts-→ find call number-→ search entry according to call number two words districts in user's multiword table--→ find satisfactory entry and deposit the entry tabulation of returning in, and then search three words districts successively ... N words district.
As: with " Wang Yansong " is example, the user presses numerical key " 163348 ", searches two words districts in user's multiword table earlier: with " 16 " call number that to be key word search two these codings of words district in the dynamic index table of user's multiword table--→ search the entry that meets " 163348 " according to call number two words districts in user's multiword table then--→ do not find entry--→ search three words districts in user's multiword table again--→ find entry and deposit the entry tabulation in--→ search the phrase that each district sees if there is same-code successively.

Claims (9)

1, a kind of code table data structure is characterized in that: it comprises two files, first document storage system word list, second document storage system multiword table.
2, code table data structure according to claim 1 is characterized in that: the content of described system word list and described system multiword table is fixed.
3, code table data structure according to claim 1 and 2 is characterized in that: it also comprises the 3rd file, described the 3rd document storage user multiword table, and the content of described user's multiword table is changeable.
4, code table data structure according to claim 3 is characterized in that: described system word list is divided into high frequency region and low frequency range.
5, code table data structure according to claim 4 is characterized in that: described system multiword table is divided into system two blocks, system three blocks ... system N block.
6, code table data structure according to claim 5 is characterized in that: described user's multiword table is divided into user two blocks, user three blocks ... user N block.
7, code table data structure according to claim 6, it is characterized in that: it also comprises the 4th, the 5th, the 6th file, described the 4th document storage system individual character concordance list, described the 5th document storage system multiword concordance list, described the 6th document storage user multiword concordance list, described system individual character concordance list and described system multiword concordance list are the static index table, and described user's multiword concordance list is the dynamic index table.
8, code table data structure according to claim 7 is characterized in that: described system multiword concordance list is by system's two word indexing tables, system's three word indexing tables ... system N word indexing table is formed.
9, code table data structure according to claim 8 is characterized in that: described user's multiword concordance list is by user's two word indexing tables, user's three word indexing tables ... user N word indexing table is formed.
CNA2004100152462A 2004-01-21 2004-01-21 Code list data structure Pending CN1558344A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2004100152462A CN1558344A (en) 2004-01-21 2004-01-21 Code list data structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2004100152462A CN1558344A (en) 2004-01-21 2004-01-21 Code list data structure

Publications (1)

Publication Number Publication Date
CN1558344A true CN1558344A (en) 2004-12-29

Family

ID=34351386

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2004100152462A Pending CN1558344A (en) 2004-01-21 2004-01-21 Code list data structure

Country Status (1)

Country Link
CN (1) CN1558344A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102609510A (en) * 2012-02-06 2012-07-25 中国农业银行股份有限公司 Chinese name data processing method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102609510A (en) * 2012-02-06 2012-07-25 中国农业银行股份有限公司 Chinese name data processing method and device
CN102609510B (en) * 2012-02-06 2014-05-28 中国农业银行股份有限公司 Chinese name data processing method and device

Similar Documents

Publication Publication Date Title
CN1558344A (en) Code list data structure
CN1801152A (en) Multi-keyword matching method for text or network content analysis
CN116301656A (en) Data storage method, system and equipment based on log structure merging tree
Wang et al. Incremental web page template detection
CN1841281A (en) Chinese phonetic transcription input method using digital code for mobile phone
CN1349157A (en) Digital configuration code Chinese character input method
CN1822001A (en) Single word searching method for hand held data processor
CN1700156A (en) Method for linking phrases in Chinese character input method
CN1141634C (en) Chinese character search and input stroke coding
CN1234061C (en) General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1109957C (en) Chinese character digital coding input method based on Chinese character basic elements and normal parts
CN1402114A (en) Rotor type phonetic Chinese character input method and keyboard
CN1254136A (en) Method for inquiring about index multi-media header data and its device
CN1744014A (en) Digital two-stroke and Chinese character input method and key board
CN1243300C (en) Three-stroke digital code Chinese character input method in computer
CN1425975A (en) Stroke digital Chinese character input method
CN1534444A (en) <<Chinese character structure> input method>
CN1102716A (en) Method for putting Chinese character into computer by using numerals
CN1204799A (en) Coding method of Chinese character unit stroke numbers
CN1851707A (en) Full-text inquiry method in hand-held data processing device
CN1141632C (en) Chinese character two-bit digital code input method
CN1121007C (en) Chinese-character five tones-digital code input method and keyboard
CN1428683A (en) Modern Chinese character four-code direct input and direct detection method
CN1344985A (en) Digital-key Chinese character input method
CN1435745A (en) Left and right digit Chinese character computer input method and keyboard thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication