CN1841278A - Double-code detachment-free high efficiency Chinese character input technology - Google Patents

Double-code detachment-free high efficiency Chinese character input technology Download PDF

Info

Publication number
CN1841278A
CN1841278A CN 200510024802 CN200510024802A CN1841278A CN 1841278 A CN1841278 A CN 1841278A CN 200510024802 CN200510024802 CN 200510024802 CN 200510024802 A CN200510024802 A CN 200510024802A CN 1841278 A CN1841278 A CN 1841278A
Authority
CN
China
Prior art keywords
code
chinese
character
sign indicating
indicating number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 200510024802
Other languages
Chinese (zh)
Inventor
敬永权
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 200510024802 priority Critical patent/CN1841278A/en
Publication of CN1841278A publication Critical patent/CN1841278A/en
Pending legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The disclosed Hanzi Avoid-split Fast Input method brings convenience for user, which comprises: once learning alphabetic coding, user can convert easily into digital code for Hanzi by inputting number of 1-4 directly without turning page, or compiling a Hanzi dictionary fit to direct turn page and look up only by alphabetic code. This invention has dynamic coincident code rate as 0.54%, covers one-code and two-code Hanzi up to 69.2%, and improves speed 1-3 times to the alphabetic method.

Description

Double-code detachment-free high efficiency Chinese character input technology
Name of the present invention is called " double-code detachment-free high efficiency Chinese character input technology ", belongs to Technology of Chinese Information Processing and modern Chinese character applied research field.
The application of Chinese character input mainly contains three kinds, all fails to solve inefficiency well, finds it difficult to learn and easily forget, do not conform to the problem of standard:
1. alphabetic keypad input, the alphabetic keypad input is in leading position always in the computer Chinese-character input, but owing to there is not good font code, most of users can only use Chinese phonetic alphabet input method, efficient is low, will look up the dictionary earlier and could import unacquainted word;
2. in the Chinese character input of numeric keypads such as mobile phone, it is more to convert the numerical code repeated code to by Chinese phonetic alphabet, and efficient is lower; The font code that is used for numeric keypad is too low in computer Chinese-character input efficient; If the user will then will expend too many time and efforts to two kinds of different font codes of two kinds of keyboard study;
3. the Chinese dictionary searching is with indexing system for Chinese characters or all directly page turning searchings of stroke method, and efficient is very low; Four-corner system repeated code is many, the code fetch mistake is many; The Chinese character sort retrieval can not be connected mutually with Technology of Chinese Information Processing.
" double-code detachment-free high efficiency Chinese character input technology " can make the user only learn alphabetic coding just can grasp two kinds of alphabetic keypad and numeric keypads input method efficiently, it is computing machine, mobile phone, the total solution of every Chinese character input such as dictionary searching problem, it is characterized in that: after the font code input method of user's association's alphabetic coding in the computer Chinese-character input, can convert numerical code according to the easiest rule to and carry out the mobile phone Chinese character input, this alphabetic coding is suitably for Chinese character sort again, can be in order to realize the direct page turning searching of Chinese dictionary, so the Chinese character of various occasions input problem can all be resolved by using the duplex sign indicating number.
Duplex sign indicating number-alphabetic coding
Adopt the component type font code (patent No. ZL01105222.8) that has obtained " Chinese-character fast input method without splitting " of national inventing patent by me, observe " the information processing GB 13000.1 character set Hanzi component standards " of State Language Work Committee promulgation, with 26 letters whole addressable parts there is the expression of motivation, does not need to carry on the back formula memory; Carry out the Chinese character direct coding and do not learn the fractionation rule; Input efficiency improves 1-3 doubly than spelling input method, helps guiding user's use Chinese character that standardizes simultaneously.
Chinese character shape code must possess standardization, rapidity and learnability comprehensively, could satisfy the demand of each level user of society, because input is to use the purpose of font code fast, not good then the comparing with spelling input method of rapidity do not have strong point to say, can not attract people to learn usefulness; Meeting Chinese-character canonical is the precondition that designs font code, promotes the use of Chinese character shape code input method; And the font code of easily forgetting that finds it difficult to learn can make a lot of learners give up halfway, make us hang back.
The technical characterictic of duplex sign indicating number-alphabetic coding is as follows:
(1) is applicable to various Chinese Character Sets
Be applicable to GB2312-80 baseset Chinese character, also be applicable to big character library of GBK and GB18030-2000, maximum code length is all four yards, and to the GBK character library, the repeated code word does not need page turning to seek.
(2) meet various standards
Meet various relevant Chinese-character canonicals such as Hanzi internal code standard, normative stroke order, form of a stroke or a combination of strokes standard, Hanzi component standard, Hanzi structure rule and Chinese character use habit, consistent with language teaching.
(3) keep the basis of Chinese character parts complete
Addressable part is selected according to " information processing GB 13000.1 character set Hanzi component standards ", does not excessively split to keep the basis of Chinese character parts complete, helps rapid identification, and coding is fast, exempts from splitting direct coding for implementation and has created condition.
(4) one of motivation expression: phonography
Solid size character formation component and radicals by which characters are arranged in traditional Chinese dictionaries commonly used are pieced together the first letter of spelling or pronunciation information representation with it, easily learn easily note, as: shellfish Epileptic-B, very little Lv-C, big Dao-D, youngster Fu-E, square opening-F, dagger-axe Http-G, fire one-H, sub-Chuo-Z.(5) two of the motivation expression: the whole representation of dicode parts
For example: weight=ZT (T-soil), one-tenth=CG (G-dagger-axe), wherein first sign indicating number is this first letter of character formation component Chinese phonetic alphabet, second yard is the sound-form information of parts bottom, right part architectural feature or an end stroke.
Need in other font codes to do the complicated single character that splits as become, heavy, tooth, card, more, black, folder, two, year, bent, hang down, grasp, word such as non-, the duplex sign indicating number all is taken as the dicode parts and does whole the expression, need not split, two sign indicating numbers all have the memory foundation, and can accomplish that repeated code is minimum, dissolved the difficult point of maximum in the encode Chinese characters for computer.
Try heavier, as to become two words coding:
Duplex sign indicating number: weight=ZT, one-tenth=CG
Cognitive sign indicating number: weight=Z81 becomes=penta =W9
King's sign indicating number: heavy=Pie soil=TGJF on the one, become=Chang  Yin Pie=DNNT
Zheng's sign indicating number: heavy=thousand day two=MEKB, one-tenth=dagger-axe Pie =HMY
(Zheng's sign indicating number dicode, thousand=ME)
Counterweight, become such single character more as can be known by above-mentioned, does not split, directly getting it is the dicode addressable part, work has the integral body of memory foundation to express, and the easiest learning and memory can be given the Chinese character direct coding that comprises these parts easily, as: heel=KZZT, really=YCG, or the like.(mouth-K ends-Z, Yan-Y)
Represent with M all that as fruit tree and horse following repeated code word is then arranged:
Coltfoal drives to be ridden the astonished proud black horse of hunchbacked female and tests Hua fine horse white horse with a black mane ...,
A Chinese holly pivot chair girder nuclear bridge chess inspection birch shuttle lattice ...,
Get dicode parts horse=MH, guaranteed the motivation expression of parts, solved coincident code problem simultaneously.
In addition, the whole character formation component of expressing also has following benefit:
The one, coding rate is fast.For the people of skilled use Chinese character, identification single character, radicals by which characters are arranged in traditional Chinese dictionaries commonly used are exceedingly fast, but the stroke of Chinese character is constituted, and then can not remember, split out stroke by stroke according to Chinese character pattern, speed slowly many.
Secondly, the code of dicode parts promptly is the coding of these Chinese characters, does not add any complement code, and coding does not change during the input two-character word, and is very easy to use.
Adopt dicode parts person in the existing font code, when the dicode parts are got first yard, when get second yard, when got dicode, determine, increased learning difficulty widely with seven rules.The duplex sign indicating number adopts the dicode parts still only to need to abide by the regulation direct coding of first three back one, need not increase any new regulation, has kept the easily characteristics of note of easy.
(6) three of the motivation expression: noncharacter radical similar shape correspondent method
Noncharacter radical is directly represented by the letter similar to its shape, as:
Contraband-C, Qian-U , ㄒ Myeon-T, Ji-E , Qe-X , Jiu-Y, or the like.(said here word and non-word are to be as the criterion with the baseset Chinese character, and Fang, Qe, Myeon is word in the GBK character library.)
25 kinds of forms of a stroke or a combination of strokes in the Chinese character folding standard are also represented with the similar shape correspondent method.
The similar shape correspondent method is different with configuration code: only express the close parts of shape, do not do too much merger on the same group, " similar shape correspondence " is worthy of the name; Only express noncharacter radical, do not express character formation component, meet people's cognition custom.
(7) the implementation Chinese character is exempted to tear open direct coding and is not learned the fractionation rule
Combined characters realizes exempting from splitting direct coding according to the The Natural Divisions between each component part, without any splitting rule, has only a coding regulation: according to the regulation coding of the order of strokes observed in calligraphy by first three back one.
As follows to all kinds of routine word direct codings:
The Chinese character that constitutes by the solid size character formation component:
Power ≡ MY, to ≡ YC, village ≡ MC, tree ≡ MYC, (wood-M, again-Y, very little-C)
Throat=KYYM, make an uproar=KKKM (mouthful-K),
Mediate=YHHY, walk in small steps=KZYY (speech-Y, fire-H, only-ZOH).
The radicals by which characters are arranged in traditional Chinese dictionaries that include habitual title:
Ren-R (single side), Fu-E (by the ears), Chuo-Z (youngster who walks), one-H (stroke horizontal stroke),
Stop=RM, pay=RC, clump=RRH, deadlock=RHTH (wood-M, very little-C, people-R, the field-T),
Attached=ERC, accompany=ELK, portion=LKE (upright-L, mouthful-K), mistake=CZ forces=HKTZ.
Contain the similar shape corresponding component:
Contraband-C, Qian-U , Myeon-T, Ji-E , Qe-X:
Seek=EC, intelligent=FFEX (very little-C, rich-F, the heart-X),
Page or leaf=TB, clamor=KKTK (shellfish-B, mouthful-K),
District=CX, and frame=MCW (wood-M, the king-W).
Contain dicode parts: weight=ZT, horse=MH, power=LL, car-CS, husband=FR, standing grain=HM, arrow=SD,
Dong=CZT understands=XCZT,
Drive=MHCX, drive=LLKH (first three back one is driven in the word horse and only got second yard),
Wheel=CSRB, man-drawn carriage used in ancient times=FRFS (first three back one, car is only got second yard in man-drawn carriage used in ancient times's word),
Committee=HMN, short=SDHN (in order, standing grain is only got first yard in the short word).
(8) code is used and is better than unreasonable expression
When expressing parts with first letter of the Chinese phonetic alphabet, alphabetical I, O, U and V can't use, and the use of A is very little.For effectively reducing repeated code, must manage they are played one's part to the full.
1. code S is unequal to burden, and stone, parts such as ten have all been expressed with S, and the duplex sign indicating number uses O and V represents S:
Use O and express water Rui: buy=OSK, spring=BO (white-B),
Use V and express hand Rolling: batch=VBB, take=RHKV (people-R, one-H, mouthful-K).
2. the effect of code A is too little, and B will represent eight, shellfish, an ancient type of spoon, white, Epileptic, Bao or the like, and repeated code is a lot, so use A for B, expresses eight: for example: poor=ADB, always=AKX (cutter-D, mouth-K, the heart-X).
3. for to prevent that the parts of representing with Y are too much, adopted zero initial facture (removing the initial consonant Y of I, U front), that is: shooted a retrievable arrow, Yin, Tou (clothing prefix) represents with I, as, generation=RI;
The moon, unit, plumage, then, a kind of monkey mentioned in ancient literature, Yu etc. represent with U, as, object for appreciation=WU.
Code is used with zero initial and is handled in various spelling input methods all usefulness, learns not difficultly, is better than unreasonable expression.
(9) solid size parts word is a trigram
Solid size parts word code regulation: behind the code of solid size parts, add two codes by a first sum of and last form of a stroke or a combination of strokes.
Be five classes with the Chinese-character stroke merger often at present, horizontally propose unification, point is pressed down unification, and all folding pen unifications are used concrete stroke shapes and gone to distinguish with the form of a stroke or a combination of strokes again.Five stroke methods are used to add up stroke number and the research order of strokes observed in calligraphy is more convenient, but have reduced the ability of difference font.The duplex sign indicating number as required with the characteristics of noncharacter radical similar shape corresponding expression, will roll over pen and be divided into six classes by shape, simple and easy, distinguish in addition to carry and also be necessary with horizontal stroke and may.
1. for horizontal, vertical, cast aside, press down, carry five kinds of strokes, press pronunciation and express:
One-H, Shu-O, Pie-P, Dian-N ,/-T, (pronunciation of Shu is expressed it near water with O)
Example word: wood=MHN, thousand=QPO, very little=CHN, factory=CHP, or=GKT.
2. 25 kinds of forms of a stroke or a combination of strokes in the Chinese character folding standard are divided into six classes, express with the similar shape correspondent method:
L shaped folding: Yin ㄥ ㄑ-L comprises 5.4-5.8 in the standard, 5.11 and 5.19, and routine word: an ancient type of spoon=BPL;
Z-shaped folding: Yi I-Z comprises 5.12-5.14 in the standard, 5.16 and 5.22, and routine word: several=JPZ;
Half Z-shaped folding:   Ya-Z comprises 5.1-5.3 and 5.15 in the standard, routine word: ugly=CZH, again=YZN;
S shape folding: ㄅ-S comprises in the standard 5.17,5.18,5.24, routine word: bow=GZS;
Left-falling stroke hook folding: comprise in the standard 5.9, be expressed as P , Ji Wu=PN (in the GBK character library);
3 shapes folding: independent occur (5.20,5.21,5.23,5.25) in the sign indicating number of not being on the permanent staff.
(10) repeated code identification is simple
The duplex sign indicating number does not add complement code to three code words, only two code combination words is added two yards according to a parts end form of a stroke or a combination of strokes,
For example: all-key is looked for=VGTN, shoulder=VGTH, (Rolling-V, dagger-axe-G, worker-G ,/-T, one-H)
Brevity code is looked for=VG.
(11) mean code length minimum
The usage frequency of Chinese character is extremely unbalanced.According to the usage degree data in " Modern Chinese general words data statistic ", the usage degree summation that can calculate the Modern Chinese general words is 9,400,000 (these also are the usage degree summation approximate values of GB baseset Chinese character or whole Chinese characters).Arrange Chinese character from high to low by the usage degree size, can calculate following data (these data and Su Peicheng professor show data consistent in " modern Chinese character outline "):
Usage degree is arranged the usage degree sum of preceding 1500 Chinese characters, accounts for 95% of Chinese character usage degree summation
Usage degree is arranged the usage degree sum of preceding 1000 Chinese characters, accounts for 90% of Chinese character usage degree summation
Usage degree is arranged the usage degree sum of preceding 500 Chinese characters, accounts for 78% of Chinese character usage degree summation
Usage degree is arranged the usage degree sum of preceding 100 Chinese characters, accounts for 42% of Chinese character usage degree summation
Usage degree is arranged the usage degree sum of preceding 50 Chinese characters, accounts for 30% of Chinese character usage degree summation
Usage degree is arranged the usage degree sum of preceding 10 Chinese characters, accounts for 14% of Chinese character usage degree summation
Usage degree arrange first " " usage degree of word, account for 4.3% of Chinese character usage degree summation
Hence one can see that, reduce mean code length, the most important thing is to reduce the code element number of high frequency Chinese character; Simultaneously, high frequency Chinese character does not have repeated code.
The duplex sign indicating number is 26 one code words of configuration meticulously, all are that ultrahigh frequency word and 25 have the memory foundation, as:
Branch-A no-B, goes out-C, and is big-D, worker-G, one-H, be-I and-K ,-L, people-R, on-S, have-U, this-W ,-X, usefulness-Y, in-Z, or the like, word especially by arranging in right hand reference position ,-J.
Two codeword positions have 676, are important coding resources.The duplex sign indicating number constitutes all-key because two codes will be added in input two code combination words, gives in a planned way and has reserved the position by row's two code words.
Easily note two code words mainly contain three sources:
Dicode parts word: be formed in and coming the long electric two this year systems of power heavy just in sending out, or the like;
Two code combination word true forms are made brevity code: the time produce to leading that to stop peace existing when having removed word as changing, or the like;
The head and the tail brevity code: previous crops can ask that the arena kind must pass through all from spending the little industry evolution of sub-portion method etc.
By statistics to duplex sign indicating number one code word and two code word usage degrees, can provide following data: a code word usage degree sum, account for 20.6% (king's sign indicating number 19.7% of Chinese character usage degree summation, Zheng's sign indicating number 17.4%), two code word usage degree sums account for 48.6% (king's sign indicating number 35.3%, Zheng's sign indicating number 41.2%) of Chinese character usage degree summation, two totals, brevity code word usage degree accounts for 69.2% (king's sign indicating number 55%, Zheng's sign indicating number 58.6%) of Chinese character usage degree summation, and promptly the dynamic cover ratio of brevity code word reaches 69.2%, no repeated code, easily learn easily note, very easy to use, and hence one can see that, when single Chinese character is imported, the mean code length minimum of duplex sign indicating number.
(12) individual character input rate of dynamic coincident code is minimum
To the baseset Chinese character, the character library repetition rate of coding (static state) is 5.9% (king's sign indicating number 8%, Zheng's sign indicating number 7.4%).
As calculated, repeated code word usage degree sum is 51178, and repeated code word usage degree ratio is 0.54%, and this is repeated code word frequency that is rate of dynamic coincident code.(86 king's sign indicating number rate of dynamic coincident codes are 1.3-1.8%, are 3 times of duplex sign indicating number; Zheng's sign indicating number---3.8%, be 7 times of duplex sign indicating number.)
(13) the multiple code check of common phrase is low
Only enrolling common phrase, keep the low repetition rate of coding of input method to reach higher coverage rate again, is a kind of satisfactory to both parties selection.Duplex sign indicating number-alphabetic coding has added 21000 phrases in the GB2312-80 character library, based on the general two-character word that writtens language, other has three words, four words and multi-character words, with first-level Chinese characters repeated code not, has higher cover ratio, the phrase repetition rate of coding is 4.1% (86 king's sign indicating numbers, 5000 phrase versions, the repetition rate of coding 6.4%).
With the speed of duplex sign indicating number-alphabetic coding input Chinese character than doubly with the fast 1-3 of spelling input method.
Duplex sign indicating number-numerical code
Duplex sign indicating number-alphabetic coding is carried out letter and digital conversion according to following order corresponding relation, promptly obtains numerical code:
A、B、C-1, D、E、F?-2, G、H -3,
I、J -4, K、L -5, M、N -6,
O、P、Q-7, R、S、T?-8, U、V、W?-9,
X、Y、Z-0;
The numerical code that is converted to meets GB/T18031 infotech digital keyboard Chinese character and imports general requirement ", because duplex sign indicating number-alphabetic coding has been carried out meticulous adjustment, numerical code after the conversion is respectively organized the repeated code word less than 9, can directly select required Chinese character after 1-4 numeral of input during operation, seek without page turning, simple and direct efficient, for example:
Encode Chinese characters for computer converts numerical code numerical code input back prompting to
Give LRHK 5,835 1: give 2: what 3: breathe out 4: feed 5: tremnble
What KRGK 5835
Breathe out KRHK 5835
Feed KTHK 5835
KSGL 5835 tremnbles
Fruit GM 36 1: peace 2: fruit 3: committee 4: standing grain
Standing grain HM 36
Peace GN 36
The HN 36 of committee
The direct page turning searching of duplex sign indicating number dictionary
Duplex sign indicating number-alphabetic coding has preferably with indexing system for Chinese characters aspect following three and is connected:
1. have 80% to be chosen as addressable part in " radical table (draft) unified in Chinese character " that recommend the State Language Work Committee;
2. consistent with indexing system for Chinese characters in the merger of addressable part, as: Xiangxi is included into fire, and  is included into Dao ,  and is included into Jie, and blue prefix is included into eight, and factory is included on anti-word limit, and Nie is included into then, or the like;
3. in duplex sign indicating number indexing system of Chinese Characters dictionary, radicals by which characters are arranged in traditional Chinese dictionaries are positioned at the same radicals by which characters are arranged in traditional Chinese dictionaries word of left side and upside near arranging, and the word of dicode radicals by which characters are arranged in traditional Chinese dictionaries is more concentrated, and according to its sound, shape information representation, the duplex sign indicating number indexing system of Chinese Characters can correspondingly with indexing system for Chinese characters be contrasted to radicals by which characters are arranged in traditional Chinese dictionaries commonly used.
Realize the direct page turning searching of Chinese dictionary with the duplex sign indicating number indexing system of Chinese Characters, speed can surpass the speed with english dictionary verification certificate word than with fast several times of indexing system for Chinese characters or stroke method.
In duplex sign indicating number indexing system of Chinese Characters catalogue, the dicode parts are similar to sub-directory, have following form (G group)
G---worker bends dried melon dagger-axe Chuan Http Mi
The wide GU-bone of the blunt GP-of GG-GV-ghost GX-more
Can therefrom find Http portion in the indexing system for Chinese characters, bow portion, dagger-axe portion, blunt portion, wide portion, osseous part, terrible portion, or the like.Be duplex code letter coding before the form of dictionary, Chinese character, be the Chinese phonetic alphabet (not annotating the four tones of standard Chinese pronunciation) behind the Chinese character.Each Chinese character, preceding pronunciation is represented by the Chinese phonetic alphabet of being made up of the Latin alphabet in the back by the font code expression font information of being made up of the Latin alphabet, makes Chinese character all possess science aspect the expression of shape, message breath, be fit to the needs of information society, it is also extremely beneficial that Chinese character is gone to the world.The segment of this this dictionary following (omitting explanation):
GGT cultivates ken
GGWE is ji both
GGWH and ji
GGX entreats ken
GGZ moves back tui
The strong qiang of GKCC
The obstinate jiang of GKCN
GKK palace gong
GKT or huo
The GKTX huo that deludes
The wide guang of GP
The honest and clean lian of GPAJ
GPAP preface xu
GPB arrange pi
GPB low bei
GU bone gu
GUAN human skeleton lou
GUB thigh bi
GUC is preced with guan
GUCK epiphysis hou
GV ghost gui
GVES chief kui
GVFN demon of drought ba
GVLR demons and monsters liang
GVLV Chi chi
GX is geng more
As if GXE wan
GXED cuts out wan
GXU night xiao

Claims (1)

1. " double-code detachment-free high efficiency Chinese character input technology " can make the user only learn Chinese alphabet coding and just can grasp two kinds of alphabetic keypad and numeric keypads font code input method efficiently, it is computing machine, mobile phone, the total solution of three kinds of Chinese character inputs of dictionary searching problem, it is characterized in that: the user learns alphabetic coding in the computer Chinese-character input after, can convert numerical code according to the easiest rule to and carry out the mobile phone Chinese character input, this alphabetic coding is suitably for Chinese character sort again, can be in order to realize the direct page turning searching of Chinese dictionary, so three kinds of Chinese character input problems can all be resolved by using the duplex sign indicating number, division is as follows:
(1) duplex sign indicating number-alphabetic coding
Adopt the component type font code that has obtained " Chinese-character fast input method without splitting " of national inventing patent by me; Observe " information processing with GB13000.1 character set Hanzi component standard " of State Language Work Committee promulgation, whole addressable parts are had the expression of motivation, do not need to carry on the back formula memory with 26 letters; The implementation Chinese character is exempted to tear open direct coding and is not learned the fractionation rule; Input speed is higher than existing font code, improves 1-3 doubly than spelling input method, possesses standardization, rapidity and learnability, can satisfy student and the most social user demand to input method of Chinese character;
(2) duplex sign indicating number-numerical code
With duplex sign indicating number-alphabetic coding in the following order corresponding relation carry out the letter with the numeral conversion promptly:
A、B、C-1, D、E、F-2, G、H-3,
I、J-4, K、L-5, M、N-6,
0、P、Q-7, R、S、T-8, U、V、W-9,
X、Y、Z-0;
The numerical code that is converted to meets GB/T18031 " the infotech digital keyboard Chinese character is imported general requirement ", by means of meticulous adjustment to alphabetic coding, numerical code is respectively organized the repeated code word less than 9, import with 1-4 numeral in the operation, can directly select required Chinese character, seek without page turning, convenient, quick;
(3) the direct page turning searching of duplex sign indicating number dictionary
Duplex sign indicating number-alphabetic coding has preferably with indexing system for Chinese characters aspect following three and is connected:
1. have 80% to be chosen as addressable part in " radical table (draft) unified in Chinese character " that recommend the State Language Work Committee;
2. consistent with indexing system for Chinese characters in the merger of addressable part, as: Xiangxi is included into fire, and  is included into Dao ,  and is included into Jie, and blue prefix is included into eight, and factory is included on anti-word limit, or the like;
3. in duplex sign indicating number indexing system of Chinese Characters dictionary, radicals by which characters are arranged in traditional Chinese dictionaries are positioned at the same radicals by which characters are arranged in traditional Chinese dictionaries word of left side and upside near arranging, and the word of dicode radicals by which characters are arranged in traditional Chinese dictionaries is more concentrated, and according to its sound, shape information representation, the duplex sign indicating number indexing system of Chinese Characters can correspondingly with indexing system for Chinese characters be contrasted to radicals by which characters are arranged in traditional Chinese dictionaries commonly used;
Realize the direct page turning searching of Chinese dictionary with the duplex sign indicating number indexing system of Chinese Characters, speed surpasses the speed with english dictionary verification certificate word than with fast several times of indexing system for Chinese characters or stroke method;
The input method of Chinese character software of establishment has three kinds according to the present invention:
Duplex sign indicating number-alphabetic coding font code input method, i.e. " Chinese-character fast input method without splitting ",
Duplex sign indicating number-digital code inputting method,
Duplex sign indicating number input method arranged side by side is weaved into input method side by side with alphabetic coding and numerical code, uses for demonstration and study;
The relationship between expression of addressable part and code is as follows in duplex sign indicating number-alphabetic coding:
(1) is used for the addressable part table of GB2312-80 baseset Hanzi font library
-A-eight foretells the recessed=AO of AA=crust
The white inferior Bao Epileptic BA=of-B-shellfish an ancient type of spoon not this BP=of BH=is must BT=nose BW=worn-out
BX=grasps
Contraband-C_-Lv-factory goes out river cun ugly minister volume
Figure A2005100248020003C3
The CC=worm
Figure A2005100248020003C4
CG=becomes the vertical CN=of CH=to scold the CO=string
Figure A2005100248020003C5
CP=
Figure A2005100248020003C6
The Chang Zhang of CS=car CU=tooth CV=
-D-machete  Dao Ding Dan Bo
Figure A2005100248020003C7
DD=east DA=allusion quotation DH=beans DL=
-E-Fu Jie two Bing EE=ear Jian=EG EX=that
Figure A2005100248020003C10
=E ... E
Figure A2005100248020003C11
-F-side Feng Fei narrow-necked earthen jar father mouth
Figure A2005100248020003C12
The city FF=Pu FO=not non-FY=of FR=husband FS=sends out
-G-worker bends dried melon dagger-axe Chuan Http Mi
Figure A2005100248020003C13
The wide GU=bone of the sweet GM=fruit GP=of the blunt GE=of GG=
GV=ghost GX=more
Family  of-H-fire Xiangxi The yellow HM=standing grain of the black HA=of HH=HQ=
-I-day is shooted a retrievable arrow Yin Tou
Figure A2005100248020003C17
II=clothing Yi IL=IR=is smooth
-J-gold Jin and well are seen a few mortar Yin of first
Figure A2005100248020003C19
Pan 
Figure A2005100248020003C20
The huge JL-of JJ=jin JA=tool JD=folder JF=is own
JO=towel JP=nine JR=are of a specified duration The JX=card
Figure A2005100248020003C22
Figure A2005100248020003C23
-K-mouth is opened
Yin-L-has found the woods deer Si yarn one LL=power LA=makes LB=dragon Liu LD=that LM=comes
LV=is happy from LX=in LR=two LS=official's fork-like farm tool used in ancient China LT=
Jiong-female ware the people of M-wood
Figure A2005100248020003C24
MM=door MF=order M4=horse ML=fiber crops MP=lance
MQ=hair MX=end
Figure A2005100248020003C27
-N-woman twenty agricultural European-allies
Figure A2005100248020003C28
That NP=of Dian NN=ox Niu NE=is NS=in the NR=
The respectful ON=book of-O-water Rui Shu bundle scholar OA=
-P-Pie
Figure A2005100248020003C30
San sheet slit bamboo or chopped wood Zhuang PP=skin PR=Chi PS=is flat
Figure A2005100248020004C1
The bent ON=wife of its OH=of-Q-seven dog Quan, thousand mounds and the unanimous QA=of QQ=QO=asks
QR=owes QZ=gas
-R-people Ren sword slowly meat RN=is gone into
Bao-last 3 SS=of S-stone ten generation pigs show that Woo SD=vows SE=thing SF=Si SL=food Cannibals
SN=
Figure A2005100248020004C3
SP=corpse SQ=Shi SU=mountain SX=history
Xia-TU=is protruding in T-soil Tian Tian/TG= TQ=village
Qian-U-month unit plumage then a kind of monkey mentioned in ancient literature Yu UU=and UF=is said UG=Yue UH=fish UO=rain
UO=rain UR=Yu US=in
Si-V-hand Rolling VV=Shen VP=body
-W-king dies, and not have WG=penta WN=be WP=ten thousand to civilian Fan The-Fan watt of not towering WW=five WE=of Wei ball crow
WX=is not
Qe-X-heart Xin Xiao Xin sunset west habit Their-registered township
Figure A2005100248020004C6
XH=smokes XZ=
Figure A2005100248020004C7
Jiu-Y-speech Yan In-particular again with also industry drag YH=Asia, YY=sheep YD=centre YI=YO=at the tenth of the twelve Earthly Branches by
The YP=tooth
Figure A2005100248020004C8
YW=Yao
Pawl Zhao ends the special ZD=system of zhang Chuo  ZA=ZE=boat ZO=million ZP=among Yi -Z- Zizhou
Figure A2005100248020004C11
ZQ=insect without feet or legs ZT=is heavy
Other has 67 of classified components, and code is expressed and the relation of sorting out is:
A-Ha (eight)
B- (an ancient type of spoon)
C- (factory) C-
Figure A2005100248020004C13
(river) C-Guan (Lv) C-
Figure A2005100248020004C15
(volume) C-
Figure A2005100248020004C16
(Cao) C-
Figure A2005100248020004C18
(Contraband)
D-
Figure A2005100248020004C19
(Dao) DL-
Figure A2005100248020004C21
()
E- (Jie) E-
Figure A2005100248020004C22
(Bing) E-Ji
Figure A2005100248020004C23
Figure A2005100248020004C24
Figure A2005100248020004C25
Figure A2005100248020004C26
F-
Figure A2005100248020004C28
F- (mouth) F-
Figure A2005100248020004C29
Figure A2005100248020004C30
Figure A2005100248020004C31
Na (rich)
G-Huan (Chuan) GG-
Figure A2005100248020004C33
Figure A2005100248020004C34
H-Households (family)
I-
Figure A2005100248020004C36
(day)
K-
Figure A2005100248020004C38
L-
Figure A2005100248020004C39
() L-
Figure A2005100248020004C40
(Si)
M-
Figure A2005100248020004C41
N- (twenty)
O-Shui
Figure A2005100248020004C46
(water)
Q-
Figure A2005100248020004C47
() Q-
Figure A2005100248020004C48
Figure A2005100248020004C49
S-
Figure A2005100248020004C50
(pig) S- (thirty)
T- (Xia)
U-
Figure A2005100248020004C56
(moon) U-
Figure A2005100248020004C57
Nie (then) UU- (with)
V- (hand) V-
Figure A2005100248020004C60
(Si)
W-No (not) WE- (nothing)
X- (heart) X-Xi (west) X-※ (Qe) X-
Figure A2005100248020004C65
(little)
Y- (again) Y- (usefulness) Y-
Figure A2005100248020004C68
(Jiu) YY-  (sheep)
Z-
Figure A2005100248020004C69
Figure A2005100248020004C70
(ending)
(2) extend to the whole Chinese characters of GB13000.1 character set, promptly be used for the GBK Hanzi font library, need to increase new addressable part and complicated and simple corresponding addressable part:
B-
Figure A2005100248020005C1
C- G- I- L-
Figure A2005100248020005C5
L-
Figure A2005100248020005C6
N-
Figure A2005100248020005C7
N-Nian S-
Figure A2005100248020005C8
U-
U-
Figure A2005100248020005C10
Figure A2005100248020005C12
W-The-Fan W- Y-Nie
BF-Bi DD-
Figure A2005100248020005C15
EE-
Figure A2005100248020005C16
GL-
Figure A2005100248020005C17
HF-
ML-
Figure A2005100248020005C18
WE-
Figure A2005100248020005C19
WX-Swastika WX- YM-
Figure A2005100248020005C22
YY-Tuft-of-hair ZD-Ze
Si-Si
Figure A2005100248020005C25
Shi-Cannibals
Figure A2005100248020005C26
-volume Yue-first
By-by Ya-Xi Tony-Bei See-see Trucks-Che
Long-long East-Dong Door-Men Asia-Ya Ma-Ma
Ukraine-Wu Fish-fish is As-be Fly-fly Long-Long
Other has non-G row Chinese character own coding parts unlisted;
Utilize the relationship between expression between above-mentioned addressable part and the code to realize the computer Chiense character code input, its rule and step are respectively:
Single Chinese character is imported by following rule encoding:
1. combined characters exempts to tear open direct coding according to the The Natural Divisions between each component part, gets one or four yards of first three backs according to the order of strokes observed in calligraphy:
2. solid size parts word is added two codes according to a first sum of and last form of a stroke or a combination of strokes behind the code of solid size parts;
3. dicode parts word, two codes promptly are its codings;
4. two code combination words are carried out complement code and keep away heavily, add two codes according to an end form of a stroke or a combination of strokes of addressable part;
Four yards of maximum code length, four yards persons of less than add a space bar and finish when input, and required Chinese character is selected if any repeated code in the input back in prompt column;
High frequency Chinese character has one yard brevity code or two yards brevity codes of easy note, no repeated code in addition;
In GB2312-80 baseset Hanzi font library, be composed of 21000 phrases, import by following rule encoding:
1. two-character word: [first prefix coee] [the first word last code] [second prefix coee] [the second word last code]
2. three words: [first prefix coee] [second prefix coee] [the 3rd prefix coee] [the 3rd word last code]
8. four words: [first prefix coee] [second prefix coee] [the 3rd prefix coee] [the 4th prefix coee]
4. multi-character words: [first prefix coee] [second prefix coee] [the 3rd prefix coee] [the most last prefix coee]
Required phrase is selected if any repeated code in the input back in prompt column.
CN 200510024802 2005-03-31 2005-03-31 Double-code detachment-free high efficiency Chinese character input technology Pending CN1841278A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200510024802 CN1841278A (en) 2005-03-31 2005-03-31 Double-code detachment-free high efficiency Chinese character input technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200510024802 CN1841278A (en) 2005-03-31 2005-03-31 Double-code detachment-free high efficiency Chinese character input technology

Publications (1)

Publication Number Publication Date
CN1841278A true CN1841278A (en) 2006-10-04

Family

ID=37030330

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200510024802 Pending CN1841278A (en) 2005-03-31 2005-03-31 Double-code detachment-free high efficiency Chinese character input technology

Country Status (1)

Country Link
CN (1) CN1841278A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101158884B (en) * 2007-10-15 2010-04-21 敬永权 Disassembling-free easy-to-learn high-efficiency Chinese characters font code computer mobile phones integrated input technology

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101158884B (en) * 2007-10-15 2010-04-21 敬永权 Disassembling-free easy-to-learn high-efficiency Chinese characters font code computer mobile phones integrated input technology

Similar Documents

Publication Publication Date Title
CN85101817A (en) An zijie type Chinese-character stroke computer code's method and keyboard thereof
CN1900886A (en) Method for single click and multiple key combining click mixing input Chinese and English and keyboard
CN1019424B (en) High-speed chinese character inputting method using synthetic coding of pronunciations, forms and strokes and keyboard used
CN1841278A (en) Double-code detachment-free high efficiency Chinese character input technology
CN1166997C (en) Chinese-character fast input method without splitting
CN1054447C (en) Coordinate codes coding method for computer Chinese characters input
CN1164689A (en) Computer input method for Chinese characters' sound pattern meaning based on word and Chinese-Spanish compatible keyboard
CN1584798A (en) Chinese inputting method and keyboard thereof
CN1825255A (en) Sum code Chinese character shape code input method and single hand keyboard thereof
CN1374577A (en) General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1129058C (en) Chinese character phonetic code and keyboard design
CN1195260C (en) Chinese character encoding method grouping in consonants
CN1123820C (en) Chinese-character 'shape-pronunciation' input system
CN1107896C (en) Chinese character and coding and input method for automatic transition of simplified original complex form Chinese character
CN1661531A (en) Method of inputting Chinese characters through codes of sound and picture and implementation of inputting embedded type spelling/marking tones in one step
CN1123819C (en) Chinese character key-position code input method for computer
CN1266577C (en) Sound-digit-shape Chinese character input method
CN1150444C (en) Chinese-character 'letters' input method for computer
CN1081810C (en) Pictophonetic Chinese character input method for computer
CN1092815C (en) Chinese character dictionary retrieving and computer input method and keyboard
CN1079061A (en) Chinese character radical code input method for computer
CN1128398C (en) Chinese 'Latin-Chinese code' input system
CN87106169A (en) Two-dimensional character code
CN1357814A (en) Computer Chinese keyboard and its Chinese information inputting and processing method
CN1135614A (en) Three-phonetic code Chinese character input method for computer and its keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C57 Notification of unclear or unknown address
DD01 Delivery of document by public notice

Addressee: Jing Yongquan

Document name: Notice of publication of application for patent for invention

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C57 Notification of unclear or unknown address
DD01 Delivery of document by public notice

Addressee: Jing Yongquan

Document name: Notification of the application for patent for invention to go through the substantive examination procedure

C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication