Summary of the invention
Technical matters to be solved by this invention is, overcomes the defective that prior art exists, and provides that learnability is good, left-hand seat can be used, average code length, repeated code word and speech key select rate low, realize using random mixing of multi-level coding, can further improve a kind of five-element's code Chinese character entering method of Chinese character input speed.
The present invention addresses the above problem the technical scheme that is adopted: this five-element's code Chinese character entering method is characterized in: adopt the Chinese character input digit keyboard that indicates Hanzi component, described input method of Chinese character may further comprise the steps:
The Hanzi component of A, selection quantification is as the step of code symbols, and 30 Chinese-character stroke parts that preferably contain 5 singles picture parts and stroke member more than 25 are as code symbols, and described stroke member is:
Single is drawn parts one
Shu (亅) second Pie Dian (Fu)
The wooden extreme misery of the standing grain people Ren Chi Si Lv of many stroke members king Hitachi gold
The wide woman cun Chuo of soil Rui Bing Rolling Http Yan mouth;
B, Hanzi component is assigned to the step on the correspondent button position of described numeric keypad, above-mentioned 30 settings of Hanzi component on numeric keypad are divided into single and draw parts and two districts of many stroke members, five key positions, every district, each is provided with unique single picture parts five key positions of [1] in one district~[5], each is provided with stroke member more than five five key positions of [6] in two districts~[0], constitute ladder-type structure, " gold, wood, water, fire, soil " 5 radicals by which characters are arranged in traditional Chinese dictionaries parts are arranged on the same key position, and the stroke member of each key position is set to:
District's figure case parts
Single 1 one
Pen 2 Shu (亅)
Draw 3 Pie
The 4 Dian (Fu of portion)
Part 5 second
Many 6 king Hitachi standing grain
Pen 7 people Ren Chi Si Lv
Draw 8 metal, wood, water, fire and earth
The 9 Rui Bing Rolling Http Yan of portion
0 mouthful wide woman cun Chuo of part
By statistical study, each is provided with five key position [6]~[0] that many stroke members are set five preferred components and can obtains the minimum comprehensive repetition rate of coding.
C, utilize the step of the particular key position input Chinese character on the above-mentioned keyboard, adopt multi-level coding rule input Chinese character,
First level:
Draw parts with 5 singles and according to stroke order import, with five the numerical key codings in [1]~[5] in a district, its coding rule is:
According to stroke order preceding 4 strokes of individual character and stroke not;
According to stroke order preceding 4 strokes of two-character word lead-in, according to stroke order preceding 3 strokes of secondary word;
According to stroke order preceding 4 strokes of three words lead-ins, according to stroke order preceding 2 strokes of secondary word, not prefix stroke;
According to stroke order preceding 4 strokes of multi-character words lead-in, secondary word, the 3rd word and the first sum of picture of word not;
Second level:
Extend and expand the code symbols of first level, as code symbols, increase the key letter position in two districts with 30 Hanzi components, with [1]~[0] 10 numeric keys coding, its coding rule is:
According to stroke order preceding 4 strokes of individual character or parts and not stroke or parts;
According to stroke order preceding 4 strokes of two-character word lead-in or parts, according to stroke order preceding 3 strokes of secondary word or parts;
According to stroke order preceding 4 strokes of three words lead-ins or parts, according to stroke order preceding 2 strokes of secondary word or parts, not prefix stroke or parts;
According to stroke order preceding 4 strokes of multi-character words lead-in or parts, secondary word, the 3rd word and the not the first sum of picture or the parts of word.
The notch cuttype parts that 10 numeric keys are divided into two districts are provided with structure and corresponding coding rule, have realized that multi-level coding arbitrarily mixes the goal of the invention of use.From stroke coding incision according to stroke order, after using, user's left-hand seat can need not to re-use the high-level coding that improves input speed with switching.Use five-element's code Chinese character entering method in input Chinese-character text process, the Chinese character more than 70% is to be undertaken by the word mode more than two words, has shortened mean code length greatly.
Utilize second level of the particular key position input Chinese character on the above-mentioned keyboard to extend among the described step C of five-element's code Chinese character entering method of the present invention and be extended to tri-layer, when individual character is imported with the initial of this word phonetic as the tail sign indicating number, the coding rule of tri-layer is:
According to stroke order preceding 4 strokes of individual character or parts and not stroke or parts add the initial key of this word phonetic again.
The coding rule of tri-layer increases by a key when individual character is imported, can reduce by about 50% repeated code word, greatly reduces the repeated code word, the speech key selects rate, thereby has greatly improved Chinese character input speed.
It is not replaceable that the single that five-element's code Chinese character entering method of the present invention described step B Zhong Yi district [1]~[5] are provided with is drawn parts, and many stroke members that two districts [6]~[0] is provided with are replaceable, and the parts number average that is provided with on the key position, [6]~[0] is no more than 5.
The Hanzi component of selecting in the described steps A of five-element's code Chinese character entering method of the present invention as code symbols, according to the size and the intelligent degree of Input Software of coded character set, the many stroke members except that " gold, wood, water, fire, soil " can adjust.
The present invention compared with prior art has the following advantages and useful effect: 1, what the present invention adopted original creation presses the diverse location of parts in Chinese character, add up its preferred foundation of occurrence frequency conduct at contemporary Chinese character language material, select 30 Hanzi components as code symbols, by corresponding key position on the numeric keypad being provided with and determining corresponding coding rule, 27533 letters have been realized to standard GB 18030-2000 " expansion of infotech Chinese Character Set Code for Informati baseset " regulation, the multi-level digitally coded technology and the architecture of unsimplified Hanzi.Make repeated code word in the Chinese character input process, speech key select rate and word mean code length all to be better than the technical requirement of standard GB/T18031-2000 " the infotech digital keyboard Chinese character is imported general requirement " to the numerical coding input.2, to cover coded character set big for five-element's code Chinese character entering method, and to meet with Chinese be the crowd's of mother tongue thinking habit, and learnability is good, left-hand seat can usefulness, is to be suitable for the wide Chinese character digital coding input technology of crowd.Draw the multi-level input method of parts or many stroke members code symbols coding with single and can arbitrarily mix use, need not switch, realize being input to seamlessly transitting of quick input, be beneficial to the further raising of Chinese character input speed from begining to learn Chinese character.
Embodiment
Embodiment 1:
This five-element's code Chinese character entering method adopts the Chinese character input digit keyboard that indicates Hanzi component, and the Chinese character input comprises that preferred 30 Hanzi components are as code symbols; These 30 stroke members are assigned on ten correspondent button positions, [1]~[0] of numeric keypad; Utilize particular key position on the keyboard, according to three steps of coding rule input Chinese character.
1, preferred 30 Hanzi components are as the step of code symbols:
Many group word frequency when the present invention's code Design different from the past is selected stroke member for use from stroke member, but adopt original creation by the diverse location of parts in Chinese character add up its at the occurrence frequency of contemporary Chinese character language material as preferred foundation.The code symbols of Xuan Zeing, coding rule make input method receive that easy, average code length, repeated code word are few, the speech key selects the low good result of rate like this.Embodiment is the modern language material with 4,500 ten thousand words, 560 normal parts that the GF3001-1997 " information processing GB 13000.1 character set Hanzi component standards " that the State Language Work Committee is issued stipulates, the frequency that appears at diverse location in each Chinese character is made dynamically statistics, with the component locations frequency data of dynamic statistics data as preferred components.
According to the coding rule of setting, use component locations frequency data principle to be:
1. consider the frequency of first part in the Chinese character,, all will use first part because no matter still speech input imported in word.
2. consider in the Chinese character the not frequency of parts, because the word input must be used the parts of this position.
3. consider the frequency of time parts, because when word and the Chinese word coding below three words, will use the parts of this position.
4. when frequency is close, paying the utmost attention to the radicals by which characters are arranged in traditional Chinese dictionaries parts of the use of looking up the dictionary and the radical of people's custom.
Embodiment is according to above principle, at the preferred code symbols of following stroke member as five-element's code Chinese character entering method:
One
Shu (亅) Pie Dian (Fu) 5 singles of second are drawn;
Gold, wood, water, fire, soil, big, king, people, day, mouth, stand, standing grain, woman, wide, very little totally 15 radicals by which characters are arranged in traditional Chinese dictionaries character formation components;
Ren, Chi, Rui, Bing, Rolling, Si, Lv, Http, Yan, Chuo be totally 10 radicals.
According to the size and the intelligent degree of Input Software of coded character set, the many stroke members except that " gold, wood, water, fire, soil " can adjust.Select 21 such as many stroke members, will stand, woman, Chi, Http remove only choosing:
Gold, wood, water, fire, soil, big, king, people, day, mouth, standing grain, wide, very little totally 13 radicals by which characters are arranged in traditional Chinese dictionaries character formation components;
Ren, Rui, Bing, Rolling, Si, Lv, Yan, Chuo be totally 8 radicals.
2, these 30 stroke members are assigned to step on ten correspondent button positions, [1]~[0] of numeric keypad:
In order to realize that multi-level coding arbitrarily mixes the goal of the invention of using, embodiment is provided with structure with the notch cuttype parts that 10 numeric keys is divided into two districts.Above-mentioned 30 stroke members are provided with on numeric keypad are divided into single and draw parts and two districts of many stroke members, five key positions, every district.
Each is provided with unique single picture parts five key positions of [1] in one district~[5], specifically is set to:
The figure case parts
1 one
2 Shu (亅)
3 Pie
4 Dian (Fu)
5 second
Each is provided with stroke member more than five five key positions of [6] in two districts~[0], constitutes ladder-type structure, and " gold, wood, water, fire, soil " 5 radicals by which characters are arranged in traditional Chinese dictionaries parts are arranged on the same key position, specifically are set to:
The figure case parts
6 king Hitachi standing grain
7 people Ren Chi Si Lv
8 metal, wood, water, fire and earth
9 Rui Bing Rolling Http Yan
0 mouthful wide woman cun Chuo
It is not replaceable that the single that 1 district [1]~[5] are provided with on the keyboard is drawn parts, and many stroke members that 2 districts [6]~[0] is provided with can be replaced, as long as keep ladder-type structure.Such as can be according to following scheme setting:
The figure case parts
6 metal, wood, water, fire and earth
7 big royal people woman days
8 Ren Chi Si Lv Chuo
9 Rui Bing Rolling Http cun
0 mouthful wide Yan standing grain is upright
Select 21 the plan of establishment to be corresponding to above-mentioned many stroke members:
The figure case parts
6 metal, wood, water, fire and earth
7 king's day standing grain
8 Rui Bing Yan Chuo
9 Ren Si Lv cun
0 mouthful wide people Rolling
Repeated code word according to Hanzi features information coding derives from regular repeated code and two aspects of merger repeated code.
The example of rule repeated code:
Example word order of strokes observed in calligraphy order of strokes observed in calligraphy stroke coding
Soil one Shu 1
Scholar one Shu 1
Merger repeated code example:
Example word stroke parts order five-element coding
Body Ren one Shu Pie Dian 1
Benzene Lv one Shu Pie Dian 1
Last input causes in a key position [7] owing to " Ren " and " Lv " merger for " body ", " benzene " repeated code.
For reducing regular repeated code, need to increase addressable part quantity; For reducing the merger repeated code, need to reduce the number of components of same key position.Through statistical study, each is provided with five key position [6]~[0] that many stroke members are set five preferred components and can obtains the minimum comprehensive repetition rate of coding.
" gold, wood, water, fire, soil " is through being commonly used in " adopted portion " position that accounts for the phonogram of modern Chinese characters in common use more than 80%, the composition that the character literal meaning of being made up of them has " restriction or checking relation in five elements " promptly to repel, this " five-element " code symbols is arranged on the same key position to be imported, can less generation merger repeated code.
3, utilize particular key position on the keyboard, according to the step of coding rule input Chinese character:
Present embodiment has been formulated the Chinese character coding rule of two levels corresponding to above Chinese characters for keyboard inputting.
First level:
Utilizing 5 kinds of basic strokes is that single picture parts are according to stroke order imported, and with five the numerical keys codings in 1 district [1]~[5] on the keyboard, its coding rule is:
According to stroke order preceding 4 strokes of individual character and stroke not;
According to stroke order preceding 4 strokes of two-character word lead-in, according to stroke order preceding 3 strokes of secondary word;
According to stroke order preceding 4 strokes of three words lead-ins, according to stroke order preceding 2 strokes of secondary word, not prefix stroke;
According to stroke order preceding 4 strokes of multi-character words lead-in, secondary word, the 3rd word and the first sum of picture of word not.
Second level:
Extend and expand the code element of first level, increase the key letter position in keyboard 2 districts, with [1]~[0] 10 numeric keys coding, its coding rule is:
According to stroke order preceding 4 strokes of individual character or parts and not stroke or parts;
According to stroke order preceding 4 strokes of two-character word lead-in or parts, according to stroke order preceding 3 strokes of secondary word or parts;
According to stroke order preceding 4 strokes of three words lead-ins or parts, according to stroke order preceding 2 strokes of secondary word or parts, not word
The first sum of picture or parts;
According to stroke order preceding 4 strokes of multi-character words lead-in or parts, secondary word, the 3rd word and the not the first sum of picture or the parts of word.
Embodiment 2:
This embodiment extends and is extended to the coding rule of tri-layer on the encode Chinese characters for computer code element of second level of embodiment 1 and coding rule basis, when individual character is imported, increased Chinese character phonetic initial letters as code symbols, and its coding rule is:
According to stroke order preceding 4 strokes of individual character or parts and not stroke or parts, the initial key that adds this word phonetic is again made the tail sign indicating number.
The coding rule of two-character word, three words and multi-character words is with second level.
Other operation of present embodiment is with embodiment 1.
First level of three level codings of the present invention is that user's left-hand seat is just used from stroke coding incision according to stroke order, re-uses the high-level coding that improves input speed with need not to switch.
Example: three level codings of " telling " word are as follows:
First level Shu second Shu one input coding one by one is 25111
Second level mouth soil input coding is 08
Tri-layer mouth soil t input coding is 088
According to the input of first level according to stroke order, easily learn; Quick according to the input of second level; Can reduce repeated code by the tri-layer input.
Hanzi component setting and corresponding coding rule on the above figure case show that first level also is that single picture parts are encoded with 5 basic strokes on five key positions, [1]~[5] in 1 district.Second level coding is the extension and the expansion of first level coding, except 5 basic strokes, has increased the preferred stroke member more than 25 on [6]~[0] that is arranged on 2 districts, encodes on ten key positions, [1]~[0].With [6]~[0] coding, when not having preferred components, when perhaps being unwilling, still use [1]~[5] key position carries out order of strokes observed in calligraphy stroke coding, can realize not having switching and import, and is easy to use when preferred components occurring in the Chinese character with many stroke members codings.Tri-layer coding is the extension and the expansion of second level coding, and only when individual character was imported, the tail sign indicating number adds waited to import the Chinese character phonetic initial letters key, and this yard eliminated about 50% repeated code word, greatly reduces the repeated code word, the speech key selects rate, has improved input speed.
Five-element's code Chinese character entering method of the present invention said " five-element " draws from " The book of Changes five-element ", implication at this is, this input method is preferred " gold, wood, water, fire, soil " these five warps one-tenth word commonly used is as many stroke members code symbols, and this input method is arranged on the meaning of importing on the same key position with these five code symbols.