CN1195263C - Chinese character input technology for instant dictionary - Google Patents

Info

Publication number
CN1195263C
Authority
CN
China
Prior art keywords
grapheme
word
key
input
chinese character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB021259909A
Other languages
Chinese (zh)
Other versions
CN1474254A (en
Inventor
萧忠义
萧志春
余锦凤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNB021259909A priority Critical patent/CN1195263C/en
Publication of CN1474254A publication Critical patent/CN1474254A/en
Application granted granted Critical
Publication of CN1195263C publication Critical patent/CN1195263C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The present invention relates to a Chinese character input technique for an instant dictionary, a keyboard input technique in which Chinese characters, punctuation, foreign-language characters and scientific and technical symbols are encoded in a unified way. It belongs to the technical field of multilingual information processing. Roughly one hundred graphemes (character elements) and their letters are used to encode the 27,484 Chinese characters of the GB13000.1-93 Chinese character set and its extension set, taking codes in stroke order or in clockwise or counterclockwise order. Each Chinese character takes three codes and each phrase takes four. The technique is general-purpose and lends itself to touch typing. Its auxiliary function is simple: by clicking the corresponding strokes or one of 25 representative graphemes in a grapheme summary table according to a rule, the user reaches the desired character or phrase step by step, without having to memorize the graphemes or the keys on which they sit.

Description

An Autotoll Chinese character input method
The Autotoll Chinese character input method is a keyboard input technique in which Chinese characters, digits, punctuation, foreign-language characters and scientific and technical symbols are encoded in a unified way. It belongs to the field of multilingual information processing within computing technology. A commonly accepted view of earlier Chinese character coding schemes is that schemes that are easy to learn are slow to type, while schemes that are fast to type are hard to learn. The Autotoll Chinese character input method changes this: it is both easy to learn and fast to type, because it combines the ease of learning of phonetic codes with the fast input of shape codes. The method uses roughly one hundred common radicals, together with the initial consonants of their pronunciations, to encode the more than 27,000 Chinese characters in the GB13000.1-93 Chinese character set and its extension set, taking codes in stroke order or in clockwise or counterclockwise order. Each character takes three codes and each phrase takes four codes; digits, punctuation, foreign-language characters and scientific and technical symbols are encoded together with the Chinese characters. The method overcomes the inherent weakness of phonetic codes, namely that a character the user cannot read cannot be entered, and it reduces fatigue during input, which makes it general-purpose. To protect the user's eyesight it deliberately disperses duplicate codes so that touch typing becomes possible, and it offers further auxiliary input means that make it easy to learn. It also requires little storage. Because it is fast, easy to learn and general-purpose, it is named the "Autotoll Chinese character input method".
One, coding principle
A view has long been held among those who work on Chinese character coding: what is fast to type is hard to learn, and what is easy to learn is slow to type. That is, shape codes are fast but hard to learn, and phonetic codes are easy to learn but slow. Breaking this pattern is the technical problem to be solved here.
To enter Chinese characters with the Scheme for the Chinese Phonetic Alphabet (pinyin), one must know how each character is pronounced. Most people learned pinyin in primary and secondary school, so they can enter characters with it without further study; within the range of everyday characters it is an input method that is easy to pick up. Pinyin input nevertheless has a bottleneck that cannot be overcome: people can usually read only four or five thousand characters correctly, and when they meet a character they cannot read they must find another method or they cannot enter it. Moreover, because pinyin coding produces a very high rate of duplicate codes, the eyes must frequently leave the manuscript and move to the candidate bar to select a character. The focus of gaze keeps shifting between manuscript and screen, and the more often this happens, the greater the strain on eyesight. After two or three hours of work the eyes feel uncomfortable, even unbearable, and some people's eyes water. As the eyes tire, people gradually grow weary, become reluctant to approach the computer, and may even come to reject it. There is also an increasingly prominent after-effect: some everyday characters can no longer be written by hand, and a typed report is full of wrong characters, most of them errors on homophones. Some have improved pinyin input by adding intelligent functions. Although this lowers the duplicate-code rate, the person typing cannot be sure that the character entered is the correct one: after a word is entered, an unexpected character may appear on screen, the eyes are drawn by this uncertainty, and the focus of gaze moves to the screen to confirm it. Attention is scattered, thinking is interrupted frequently, and the goal of typing as fast as one thinks cannot be reached. To handle the more than 27,000 Chinese characters in the GB13000.1-93 Chinese character set and its extension set, one would have to know the pronunciation of every one of them, which is very difficult for most people. In other words, pure pinyin input can only be a less-than-perfect Chinese character input method. On the other hand, users of shape codes usually have to memorize several hundred shape coding units, which are first hard to memorize and second easy to forget after a while, so shape codes are hard to learn. Once a shape code is mastered, however, its input speed is faster than that of pure pinyin input. Therefore, if Chinese characters are decomposed in the manner of shape codes, but only into common radicals with fixed pronunciations, the result is a Chinese character input technique with few duplicate codes that is both easy to learn and fast.
The coding principle of the Autotoll Chinese character input method is as follows: the structure of the Chinese character is analyzed and, following the nested-structure theory of Chinese characters, common radicals are used as graphemes to split the character in order; the resulting grapheme sequence is the code of the character. Because the initial consonants of common radicals are easy to recognize, read and remember, graphemes keyed by the initials of the common dictionary radicals can decompose all of the more than 27,000 Chinese characters in the GB13000.1-93 Chinese character set and its extension set simply and conveniently, so the situation in which an unfamiliar character cannot be encoded and entered never arises.
Two, Autotoll Hanzi keyboard input grapheme table
We select roughly one hundred coding units from the common radicals and call them graphemes. Keys are assigned according to the pronunciation of each radical, and each key carries one to four graphemes. For example: the Q key carries "seven, Quan, Quan, Qi"; the W key carries "five, king, watt"; the E key carries "youngster, Fu, the tenth of the twelve Earthly Branches, ear"; the T key carries "soil, Mi, village"; the U key carries the graphemes with initial ch, "penta, thousand, worm, ugly"; the I key carries the graphemes with initial sh, "scholar, corpse, mountain, food"; the P key carries "skin, The-Fan"; the S key carries "Si, four"; the D key carries "cutter, Dian, big"; the F key carries "rich, husband, just,
Figure C0212599000051
": have on the G key " Gu, leather, dagger-axe,
Figure C0212599000052
": have on the H key " , family,Fire "; " golden, several, Jiu, Jiu " arranged on the J key; Have on the K key in " mouth "; " power, Long, Come " arranged on the L key; " only, Chuo, foot " arranged on the Z key; " heart, west, little, cave " arranged on the X key; Have on the C key " Lv "; Have on the V key "
Figure C0212599000054
" and initial consonant be " in, gas, Zhao " of zh sound; " woman, Niu, Birds " arranged on the N key; " mother, ware, order, horse " arranged on the M key;
To protect the user's eyesight, duplicate codes must be dispersed and touch typing made possible. The first means of doing so is to spread the graphemes whose pronunciation begins with B, R or Y over several keys. Graphemes with initial B are on two keys, "B" and "[": the "B" key carries "Tony, , white, Epileptic" and the "[" key carries "than, crust, or not Http". Graphemes with initial R are on two keys, "O" and "R": the "O" key carries "day, zero" and the "R" key carries "Cui, people, Ra, Ji". Graphemes with initial Y are on four keys, "A", "Y", ";" and "/": the "A" key carries "speech, win", the "Y" key carries "one, again, clothing", the ";" key carries ", plumage, fish, rain", and the "/" key carries "also, the moon, Page,
Figure C0212599000056
".
The initial identical simplified and traditional body radical that pronounces is placed on the different keys and is discrete repeated code and realizes two of touch system means.The first stroke of " Si " is
Figure C0212599000057
Hollow parts with the S key lower left corner " Si " Can represent; “ Si " the initial of first grapheme " one " pronunciation be Y, so be located on the Y key: first grapheme of " Jin " be " ", the hollow parts of " gas " is represented on usefulness " V " key; " Jin " is same as " gold ", so be placed on " J " key; The first stroke of " Yan " is located on " D " key for " Dian ", and " speech " is on " A " key; The first stroke of " horse " is
Figure C0212599000059
Be located on "/" key, " horse " is on " M " key; The first stroke of " bird " is that " Pie " is at , “ Birds on the P key " on " N " key; " car " two "
Figure C02125990000510
On " Q " key, " Trucks " is on " F " key; " shellfish " two "  " is on " F " key, and " Tony " is on " B " key: " page or leaf " two
Figure C02125990000511
" on [" key, " Page " is on "/" key; A two Bi “  of " Cannibals " " "; " on the key; " food " is on " I " key.
" Rui " that usage frequency is high and " Rolling " are independent, as: " Rui, Dao are arranged on ", " key; "." " hand, folding, very little " arranged on the key.The grapheme of these settings and key position can replace or redefine, and the quantity of grapheme and key position can increase, subtract.
The positions of the roughly one hundred graphemes on the keyboard, and the graphemes carried by each key, are shown in the accompanying drawing, the "Autotoll Hanzi input keyboard grapheme table". The drawing has 31 coding keys: 29 keys hold graphemes according to the initial of the radical's pronunciation, and 2 keys ("," and ".") are used for "Rui" and "Rolling". The corresponding high-frequency word or most common punctuation mark is shown under each of these keys. One further key, the ' key, serves as the lead-in key for digits, foreign-language characters, and scientific, technical and graphic symbols.
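The key assignments described in this section can be sketched as a small lookup. This is only an illustration: `SPECIAL_INITIAL_KEYS` and `keys_for_initial` are hypothetical names, only the exceptions stated in the text (B, R, Y, ch, sh, zh) are listed, and the fallback that every other initial sits on the key of the same letter is an assumption drawn from the examples given (Q, W, T, P, S, D and so on), not a rule stated by the patent.

```python
# Exceptions named in the text: initials spread over several keys (B, R, Y)
# and two-letter initials placed on a single key (ch, sh, zh).
SPECIAL_INITIAL_KEYS = {
    "b": ["b", "["],
    "r": ["o", "r"],
    "y": ["a", "y", ";", "/"],
    "ch": ["u"],
    "sh": ["i"],
    "zh": ["v"],
}


def keys_for_initial(initial):
    """Return the keys that may carry graphemes with the given initial.

    Assumption: any initial not listed above sits on the key of the
    same letter, as in the examples for Q, W, T, P, S, D and so on.
    """
    initial = initial.lower()
    if initial in SPECIAL_INITIAL_KEYS:
        return list(SPECIAL_INITIAL_KEYS[initial])
    return [initial]


print(keys_for_initial("Y"))   # ['a', 'y', ';', '/']
print(keys_for_initial("sh"))  # ['i']
```

A lookup of this kind only narrows the search to a few keys; which of those keys carries a particular grapheme still has to come from the grapheme table itself.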
Three, the input of single Chinese character
For the input of a single Chinese character, the general principle is that each character takes at most three grapheme codes as its code. Three grapheme codes can provide 30,783 Chinese character codes. Namely:
(1) A character with no more than three graphemes is coded in stroke order. If all strokes of the character have been taken and there are still fewer than three codes, the space key is used as the end code.
(2) A character with more than three graphemes is entered as: character code = head grapheme → middle grapheme → tail grapheme. Starting from the head grapheme and generally following stroke order, the grapheme in the "Autotoll Hanzi input keyboard grapheme table" that is similar in shape and covers the most strokes is used as the input unit in place of the corresponding part of the character; input ends with the tail grapheme.
Here, the "head grapheme" is the grapheme in the "Autotoll Hanzi input keyboard grapheme table" that contains the first stroke written, covers the most strokes and is similar in shape. Head graphemes are always at the upper left corner, the upper part, the top or the left side of the character.
" middle grapheme " is defined as follows:
When 1. having only behind a grapheme or the polylith prefix block not only a grapheme behind the monolithic prefix word layer, " middle grapheme " is defined as by choosing corresponding grapheme behind the lead-in element clockwise.As: the iron of fine quality=v[b temporarily=q.o debates=ldw keeps away=Fan ilz=lyd is far away=vbz splits=ild Ji=fjt volume=shc
When 2. a grapheme was only arranged behind the monolithic prefix word layer behind a plurality of graphemes or the polylith prefix block, " middle grapheme " was defined as by choosing corresponding grapheme behind the lead-in element counterclockwise.As: to=hs, a piece of writing=vhc is luxuriant=the ie. Ji=.; B court=io/ frequency=zib parrot=fng is whole=and vhz holds high up=[eg
The "tail grapheme" is the grapheme in the "Autotoll Hanzi input keyboard grapheme table" that contains the last stroke of the character in stroke order, covers the most strokes and is similar in shape. The tail grapheme is chosen on the condition that it does not break the integrity of the middle grapheme or of the last group of graphemes of the character.
The Chinese character of forming by three above graphemes, if this word with horizontal stroke ' ' ending and horizontal last with it when not constituting a grapheme, the tail grapheme must be got its previous grapheme.As: " tail " of " wingceltis " gets " day "; " tail " of " boundary " gets " field "; Ancestral=d; / group=ss/ Tan=gho puts=sim value=rim parrot=fng.If left avertence is other and right avertence is other, go up radicals by which characters are arranged in traditional Chinese dictionaries and following radicals by which characters are arranged in traditional Chinese dictionaries all can decompose the time, then should decompose by radicals by which characters are arranged in traditional Chinese dictionaries and the left avertence, and only to get its tail grapheme mostly be to import unit radicals by which characters are arranged in traditional Chinese dictionaries and right avertence side down as far as possible.That is:
Only form unless 1. go up radicals by which characters are arranged in traditional Chinese dictionaries, otherwise following radicals by which characters are arranged in traditional Chinese dictionaries are only chosen the tail grapheme of the most last grapheme as this word in " towel, shellfish, hair, Xiangxi, water, stone, bird, meat, rice, angle " by a grapheme.
As: Supreme Being=six Mi Shu (lti) Zi, one of the lunar mansions=end an ancient type of spoon
Figure C0212599000061
(z[u) money=d; B bear=s[j Nu=woman is people (nyr) again
Green=king's spoken parts in traditional operas (wbk) slurry=div mandarin duck=; / g sacrifical grain in ancient times=d; L bangs=epq goods=r[b
Only forms unless 2. left avertence is other by a grapheme, otherwise, right avertence other " Jie, San, prop up, strike lightly, see, mao, jin, owe, an ancient weapon made of bamboo, melon, bird, page or leaf, See, Wind " in only choose the tail grapheme of the most last grapheme as this word.As: desire=eight everybody (brr) institute=.st coloured silk=vlp Felt=gfq money=scholar two people (ier) to have a rest=o/r wooden dipper=xez strikes=gky is ruined=the plb inkstone=[ke beats up=qyy crane=trg drum=scholar mouth (iky) Xi=lbi again
Note: once the head grapheme has been chosen, the middle grapheme may not reuse the head grapheme or any part that overlaps it. The tail grapheme may not reuse the middle grapheme or any part that overlaps it.
Whenever a character can be split into graphemes that are joined to one another, it is always split by the joining relation and not by the overlapping relation. For example, 禾 ("standing grain") is decomposed into 丿 ("Pie") and 木 ("wood"), not into 千 ("thousand") and 八 ("eight").
On the S key On the G key On the E key On
Figure C0212599000065
"; " on the key
Figure C0212599000066
And outer, other All exist On the key.As: Deng.
" " " flat, half, umbrella, folder, come, volume, the chief of a tribe " in only be used as " Ha " and handle.
For example: committee=pln Supreme Being=lti win=afb great waves=, f. spring=fzo rues=ces
Skill=iy is straight=imh resistance=e/h head=bpm we=kpm joint=c/i
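The figure of 30,783 codes quoted at the start of this section matches the number of key sequences of length one, two or three over the 31 coding keys (31 + 31² + 31³), and rule (1) above pads short codes with the space key. A minimal sketch of both points follows. The function names are hypothetical, the reading of 30,783 as that sum is an inference from the numbers, and choosing the head, middle and tail graphemes is left to the caller because it depends on the shape of the character.

```python
CODING_KEYS = 31      # coding keys on the grapheme keyboard, per the text
MAX_CODE_LENGTH = 3   # at most three grapheme codes per character


def code_capacity(keys=CODING_KEYS, max_len=MAX_CODE_LENGTH):
    """Count the key sequences of length 1..max_len over `keys` keys."""
    return sum(keys ** n for n in range(1, max_len + 1))


def single_character_keystrokes(grapheme_keys):
    """Join the keys of the chosen graphemes (head, middle, tail order).

    Codes shorter than three keys are closed with a space, as in rule (1).
    """
    if not 1 <= len(grapheme_keys) <= MAX_CODE_LENGTH:
        raise ValueError("expected one to three grapheme keys")
    code = "".join(grapheme_keys)
    if len(grapheme_keys) < MAX_CODE_LENGTH:
        code += " "
    return code


print(code_capacity())                                 # 30783
print(repr(single_character_keystrokes(["i", "y"])))   # 'iy '
print(repr(single_character_keystrokes(["l", "t", "i"])))  # 'lti'
```

The two sample calls reuse codes that appear in the examples above (`iy` and `lti`); the special rules for high-frequency, single-grapheme and two-grapheme characters are not covered by this sketch.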
To improve input speed, the following three rules are added:
(1) High-frequency word input
Each high-frequency word* needs only two keystrokes: the key on which the character sits, followed by the space bar. That is, high-frequency word = key of the character → space bar. There are 32 such characters; they and their key positions are listed below:
Q W E R T Y U I O P [
Especially I and people ground with 1,010 be flat not
A S D F G H J K L ; ′
But the people greatly send out wide one and and,
Z X C V B N M , . /
Good in the heart is hidden.Have
For example: I = W (space), = M (space), in = V (space), outstanding = Q (space), and so on.
29 high frequency words: in our vast good peaceful People's heart, one and ten not as good as thousand, hide and have plenty of, Yuva can come.
3 punctuate symbols:,.
(* High-frequency words are Chinese characters and symbols with a high frequency of use: 29 characters and three punctuation marks selected from statistical data, which together account for one fifth of all Chinese character occurrences in use.)
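The high-frequency rule reduces to one key plus the space bar. The sketch below lists the 32 key positions shown in the table above and builds the keystrokes for one of them. The names are hypothetical, the ASCII apostrophe stands in for the ′ key, and the character assigned to each key is deliberately left out because the table does not survive translation well enough to reproduce it.

```python
# The 32 key positions of the high-frequency table, row by row.
HIGH_FREQUENCY_KEYS = (
    list("qwertyuiop[") + list("asdfghjkl;'") + list("zxcvbnm,./")
)


def high_frequency_keystrokes(key):
    """Keystrokes for the high-frequency character on `key`: key, then space."""
    key = key.lower()
    if key not in HIGH_FREQUENCY_KEYS:
        raise ValueError("not a high-frequency key: %r" % key)
    return key + " "


print(len(HIGH_FREQUENCY_KEYS))                 # 32
print(repr(high_frequency_keystrokes("W")))     # 'w '
```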
(2) Input of grapheme words
When a single grapheme represents a Chinese character, the head code is the code of the key on which the grapheme sits, the middle code is the position symbol code of the grapheme on that key, and the tail code is the fixed code (n). That is:
Grapheme word = grapheme key → position symbol code of the grapheme on its key → n
Examples, grouped by the position symbol code of the grapheme on its key:
Position code C: one=ycn, west=xcn
Position code ",": village=t,n, dagger-axe=g,n
Position code F: again=yfn, order=mfn
Position code S: clothing=ysn, leather=gsn
(3) Input of two-grapheme characters
When a character is made up of two graphemes, the tail code is the position symbol code of the last grapheme on its key. That is:
Two-grapheme character = head grapheme → tail grapheme → position symbol code of the tail grapheme on its key
Illustration of the padding method, by position symbol code of the tail grapheme on its key:
Every vmc → the C of benevolence rec, ← kl in addition, the k[of city,
Standing grain pl fruit ol → empty S ← handleless cup vms is from rrs
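Rules (2) and (3) both fill the code up to three keys with a position symbol code. A minimal sketch follows, assuming the caller already knows the key of each grapheme and its position symbol code on that key. The function names are hypothetical, and the mapping from a grapheme's physical position on the key cap to its position symbol code is not modeled because the drawing is not available here.

```python
GRAPHEME_WORD_TAIL = "n"  # fixed tail code for single-grapheme characters


def grapheme_word_code(key, position_code):
    """Rule (2): grapheme key, then its position symbol code, then 'n'."""
    return key + position_code + GRAPHEME_WORD_TAIL


def two_grapheme_code(head_key, tail_key, tail_position_code):
    """Rule (3): head key, tail key, then the tail's position symbol code."""
    return head_key + tail_key + tail_position_code


print(grapheme_word_code("y", "c"))      # ycn
print(grapheme_word_code("t", ","))      # t,n
print(two_grapheme_code("v", "m", "c"))  # vmc
```

The sample calls reproduce codes quoted in the examples above (`ycn`, `t,n`, `vmc`).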
Four, digital foreign language science and technology graphical symbol input
All digits, foreign-language characters, and scientific, technical and graphic symbols in the GB character set can be entered directly under the coding system of the Chinese character input technology for instant dictionary. To enter one, first strike the key of the high-frequency character "," (the single closing quote key ' on a Western keyboard), then strike the other two keys according to the prescribed code. All such symbols are therefore entered with three codes. Because they are encoded together with Chinese characters, no function-key switching is needed when typing ordinary text, which greatly improves working efficiency.
Examples of codes for common punctuation, foreign-language characters, Arabic numerals, Roman numerals and scientific and technical symbols:
Symbol: ,  、  。  ∑  !  ?  ∷  ∪  ∩  ∮  ∫  :
Code:   ,  、  .  ′WF  ′CM  ′B/  ′WK  ′WH  ′WI  ′WS  ′WR  ′C;
Symbol: 1  2  3  ……  0  A  B  C  ……  Z
Code:   ′CA  ′CB  ′CC  ……  ′CJ  ′AA  ′AB  ′AC  ……  ′AZ
Symbol: a  b  c  ……  z  ①  ②  ③  ……  ⑩
Code:   ′BA  ′BB  ′BC  ……  ′BZ  ′RA  ′RB  ′RC  ……  ′RJ
Symbol: (1)  (2)  (3)  ……  (10)  1.  2.  3.  ……  10.
Code:   ′SA  ′SB  ′SC  ……  ′SJ  ′TA  ′TB  ′TC  ……  ′TJ
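The digit, letter and circled-number examples above follow a regular pattern: the lead-in key, a group letter, then a letter that counts through the series. The sketch below covers only those four series. `symbol_code` is a hypothetical name, the ASCII apostrophe stands in for the ′ key, and the codes for the elided middle of each series (for example 4 to 9) are an assumption that the series continues alphabetically between the endpoints shown.

```python
LEAD_KEY = "'"                 # lead-in key for symbols (shown as ′ above)
DIGITS = "1234567890"          # order used in the examples: 1 -> A ... 0 -> J
CIRCLED = "①②③④⑤⑥⑦⑧⑨⑩"


def _nth_letter(index):
    """Return the uppercase letter at a zero-based position (0 -> 'A')."""
    return chr(ord("A") + index)


def symbol_code(ch):
    """Three-key code for a digit, a Latin letter or a circled number."""
    if len(ch) != 1:
        raise ValueError("expected a single character")
    if ch in DIGITS:
        return LEAD_KEY + "C" + _nth_letter(DIGITS.index(ch))
    if "A" <= ch <= "Z":
        return LEAD_KEY + "A" + ch
    if "a" <= ch <= "z":
        return LEAD_KEY + "B" + ch.upper()
    if ch in CIRCLED:
        return LEAD_KEY + "R" + _nth_letter(CIRCLED.index(ch))
    raise KeyError(ch)


print(symbol_code("1"), symbol_code("0"))   # 'CA 'CJ
print(symbol_code("Z"), symbol_code("c"))   # 'AZ 'BC
print(symbol_code("⑩"))                     # 'RJ
```

Symbols such as ∑ or ∮ have individual codes in the table and would need an explicit dictionary rather than a computed pattern.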
Five, phrase input
Let the first character of a phrase be A, the second B, the third C and the last T. The code length of a phrase is fixed at 4; shorter codes must be padded. Let:
"A head" denote the head grapheme of the first character of the phrase, and "A mid" the middle grapheme of the first character;
"B head" denote the head grapheme of the second character, and "B mid" the middle grapheme of the second character;
"C head" denote the head grapheme of the third character, and "T head" the head grapheme of the last character.
The codes are entered as follows:
1. Two-character phrase = A head + A mid + B head + B mid
Examples: China=vvfw all one's life=hhnh strong=,ygk January=ee//
Rule=fffb figure=fphc beauty=bthf ugliness=uueh
2. Three-character phrase = A head + A mid + B head + C head
Examples: computer=d.vl editor's note=shi. have no alternative but=[[ps
3. Phrase of four or more characters = A head + B head + C head + T head
Examples: the descendants of the Yellow=hc; feel proud and elated=.[kv People's Republic of China (PRC)=vrrf
Note: when a character is a high-frequency word or consists of only one grapheme, its "head" also serves as its "mid", and its "mid" also serves as its "head".
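The three phrase rules can be sketched as one function that always returns four keys. This is an illustration under assumptions: `phrase_code` is a hypothetical name, each character is passed as a (head, mid) pair of keys already chosen by the rules for single characters, and "?" marks a key that the rule does not use and that the source does not give.

```python
def phrase_code(chars):
    """Build the four-key phrase code from (head, mid) key pairs.

    chars holds one (head, mid) pair per character of the phrase. For a
    high-frequency or single-grapheme character, pass the same key twice.
    """
    if len(chars) < 2:
        raise ValueError("a phrase needs at least two characters")
    heads = [head for head, _mid in chars]
    a_head, a_mid = chars[0]
    if len(chars) == 2:
        b_head, b_mid = chars[1]
        return a_head + a_mid + b_head + b_mid
    if len(chars) == 3:
        return a_head + a_mid + heads[1] + heads[2]
    # Four or more characters: heads of the first three and of the last.
    return heads[0] + heads[1] + heads[2] + heads[-1]


# "China" = vvfw: the first character is a high-frequency word on V.
print(phrase_code([("v", "v"), ("f", "w")]))               # vvfw
# "computer" = d.vl: only the heads of the 2nd and 3rd characters are used.
print(phrase_code([("d", "."), ("v", "?"), ("l", "?")]))   # d.vl
```

Passing the same key twice for a high-frequency or single-grapheme character is one way to apply the note that "head" and "mid" coincide for such characters.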
Six, auxiliary input function
We face the input of the more than 27,000 Chinese characters in the GB13000.1-93 Chinese character set and its extension set, many of which have unusual shapes. Entering all of them by pinyin alone is beyond ordinary users. The auxiliary input function addresses this problem well.
The auxiliary input function is based entirely on the fact that Chinese characters are composed of the basic strokes horizontal (一), vertical (丨), left-falling (丿), turning (乙) and dot (丶). That is, any Chinese character can be decomposed into a set of these five strokes. Note that here "horizontal" also includes the rising stroke, "dot" also includes the right-falling stroke, and "turning" is the general name for "亅, Ya, ㄅ, ,
Figure C0212599000082
Figure C0212599000083
" and the other turning strokes, more than 20 kinds in all. If a character were entered only as a set of these five strokes, input would certainly not be fast; if it is entered as a set of two-stroke units, input is somewhat faster; and if it is entered as a set of graphemes and strokes, input is faster still. The auxiliary input function is built on this idea. On the one hand it uses the five strokes on their own; on the other hand it groups the graphemes by their first two strokes into a summary table of 25 representative graphemes, so that the user need not remember which graphemes exist. In fact there are only 24 real combinations, because no grapheme in the "Chinese character input technology for instant dictionary keyboard grapheme table" has a horizontal (一) first stroke and a dot (丶) second stroke; "zero" is placed in that cell. By entering, according to the input rules, the corresponding stroke or the representative grapheme formed by the first two strokes of a grapheme, the user obtains the grapheme to be entered, and then the Chinese character or phrase to be entered. The table below is the "auxiliary input function grapheme summary table". The number of strokes, the stroke shapes, the merging of stroke shapes and the ordering of stroke shapes in this summary table can be changed or redefined.
Auxiliary input function grapheme summary table
Figure C0212599000091
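The layout of the summary table can be sketched as a 5 × 5 grid keyed by first and second stroke. This is an illustration only: the English stroke names and the function names are assumptions, and only two cells are filled in, namely the "two" cell (horizontal, horizontal) mentioned in the text and the "zero" filler for the one combination with no real grapheme. The other representatives appear only in the figure and are left as `None`.

```python
from itertools import product

STROKES = ["horizontal", "vertical", "left-falling", "turning", "dot"]
NO_REAL_GRAPHEME = ("horizontal", "dot")  # filled with "zero" in the table


def build_summary_table():
    """Map (first stroke, second stroke) to a representative grapheme."""
    table = {}
    for first, second in product(STROKES, repeat=2):
        table[(first, second)] = None  # representative not reproduced here
    table[("horizontal", "horizontal")] = "two"
    table[NO_REAL_GRAPHEME] = "zero"
    return table


def real_combinations(table):
    """Stroke pairs that correspond to actual graphemes."""
    return [pair for pair in table if pair != NO_REAL_GRAPHEME]


table = build_summary_table()
print(len(table), len(real_combinations(table)))  # 25 24
```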
In the upper left corner of the "auxiliary input function grapheme summary table", "head" denotes the first stroke of a grapheme and labels the first column, which is also used when a single stroke is entered; "second" denotes the second stroke of a grapheme and labels the first row. The grapheme at the intersection of a row and a column represents all graphemes whose first two strokes match it. Once the input rules for Chinese characters have been mastered, input can begin, because clicking the representative of a grapheme's first two strokes displays all graphemes related to it. For example, if the head grapheme is represented by "two", which sits where "head" and "second" are both horizontal (一), then clicking "two" displays the related graphemes on screen. The user selects the grapheme to be entered from these, and then clicks in the "auxiliary input function grapheme summary table" the representative grapheme for the first two strokes of the middle grapheme; the corresponding grapheme list appears, and it necessarily contains the middle grapheme. For example, when the middle grapheme is "little", the screen display is as follows:
Figure C0212599000093
Together with the candidate bar, the Chinese character to be entered can then be found. For a phrase, the four codes required by the phrase input rules can likewise be entered through the "auxiliary input function grapheme summary table".
Except for phonetic codes, all other shape codes, phonetic-stroke codes, shape-sound codes and numeric codes can use the auxiliary input function to reduce the burden of memorizing shape coding units. The appearance of this auxiliary input function ends the history in which coded Chinese character input methods were hard to learn and characters could not be typed.
Seven, the characteristics of Chinese character input technology for instant dictionary
Chinese character input technology for instant dictionary has following characteristics:
1. Because the arrangement of the graphemes (coding units) is based on the Chinese phonetic alphabet, the technique is genuinely easy to learn;
2. Because the design makes the codes of nearly all main radicals different, places simplified and traditional radicals with the same pronunciation on different keys, gives two or three codes to characters whose stroke order is ambiguous (as: zhang hy hpz dz is of a specified duration; Z; Zc P; Z), limits each group of duplicate codes to 4 or fewer, and continually counts character frequencies during input and applies a "high frequency first" technique, duplicate codes are greatly reduced. Touch typing is possible after a little training, which helps protect the user's eyesight;
3. Because the design also covers the coding of punctuation, foreign-language characters, digits, and scientific, technical and graphic symbols, there is no need to interrupt one's thinking to switch function keys during input, which greatly improves working efficiency;
4. For the 27,484 Chinese characters in CJK V2.0, the software displays the pronunciation of each character as it is entered, which provides a check on the correctness of the input;
5. Along with this input technique, a simple auxiliary input function is provided that can also be used by other shape codes, phonetic-stroke codes, shape-sound codes and numeric codes. It is based entirely on the fact that Chinese characters are composed of five basic strokes (horizontal, vertical, left-falling, turning and dot). On the one hand the five strokes are used on their own; on the other hand the graphemes are grouped by their first two strokes into a summary table of 25 representative graphemes. By clicking, according to the input rules, the corresponding stroke or the representative grapheme formed by the first two strokes of a grapheme, the user obtains the grapheme to be entered, and then the Chinese character or phrase to be entered. This reduces the burden of memorizing shape input units: without having to remember which graphemes exist, users can readily master coded Chinese character input methods. The auxiliary input function solves two long-standing technical problems in the history of Chinese character coding: that input methods are hard to learn, and that many characters cannot be typed even after long thought.
This input technique lets non-professional operators handle Chinese characters efficiently, so it can become the preferred input method of non-full-time operators. Chinese input on microcomputers, palmtop computers, PDAs, mobile phones, handheld terminals, electronic textbooks and e-readers need no longer be tiring or difficult.

Claims (1)

1. An Autotoll Chinese character input method, characterized in that:
(1) key positions are assigned according to the initial consonant of the pronunciation of roughly one hundred graphemes, with one to four graphemes arranged on each key: "Lv" on the C key; "cutter, Dian, big" on the D key; "youngster, Fu, the tenth of the twelve Earthly Branches, ear" on the E key; "rich, husband, just, [Figure C021259900002C1]" on the F key; "Gu, leather, dagger-axe, [Figure C021259900002C2]" on the G key; " , family, Back, fire" on the H key; "scholar, corpse, mountain, food", pronounced with the sh sound, on the I key; "golden, several, Jiu, Jiu" on the J key; "mouth" on the K key; "power, Long, Come" on the L key; "mother, ware, order, horse" on the M key; "woman, ox, Ukraine" on the N key; "skin, The-Fan" on the P key; "seven, Quan, Quan, Qi" on the Q key; "Si, four" on the S key; "soil, Mi, village" on the T key; "penta, Thousand, worm, ugly", pronounced with the ch sound, on the U key; "in, gas, Zhao", pronounced with the zh sound, on the V key; "five, king, watt" on the W key; "heart, west, little, cave" on the X key; "end, Chuo, foot" on the Z key;
(2) to protect the user's eyesight, duplicate codes must be dispersed and touch typing made possible. One means to this end is to spread the graphemes whose initial is B, R, or Y over several keys: graphemes with the initial B occupy two keys, "Tony, , white, Epileptic" on the B key and "than, cling to, or not Http" on the [ key; graphemes with the initial R occupy two keys, "day, " on the O key and "Cui, people, Ra, Ji" on the R key; graphemes with the initial Y occupy four keys, "speech, win" on the A key, "one, again, clothing" on the Y key, ", plumage, fish, rain" on the ; key, and "also, the moon, Page, " on the / key;
A second means of dispersing duplicate codes and enabling touch typing is to place the simplified and traditional forms of a radical that share the same initial on different keys: one "Si" is on S and the other "Si" on Y; one "Jin" is on V and the other "Jin" on J; "Yan" is on D and "speech" on A; one "horse" is on / and the other "horse" on M; "bird" is entered as a composite, while "Ukraine" is on N; "car" is on Q and "Trucks" on F; "shellfish" is on F and "Tony" on B; "page or leaf" is entered as a composite, while "Page" is on /; "Cannibals" is on the "," key and "food" on I;
The frequently used "Rui" and "Rolling" are placed on their own on soundless keys: "Rui, Dao" are on the "," key and "hand, folding, very little" are on the "." key. These grapheme and key assignments may be replaced or redefined, and the number of graphemes and key positions may be increased or reduced;
(3) characters whose stroke order is ambiguous are given two to three codes; the number of characters sharing one code is limited to 4 or fewer; and character frequencies are counted continuously during input, with a "high frequency first" technique applied. These are among the features of the present technique;
(4) the input of a single Chinese character, characterized in that at most three grapheme codes are taken as the code of each character:
(1) a character of no more than three graphemes is coded in stroke order; if all strokes of the character have been taken and the code still has fewer than three keys, the space bar is pressed as the end code;
(2) a character of more than three graphemes is coded as: character code = head grapheme → middle grapheme → tail grapheme
A. the "head grapheme" (lead-in element) is the grapheme in the "Autotoll Chinese character input keyboard grapheme table" that contains the first stroke written, has the most strokes, and is closest in shape;
B. the "middle grapheme" is defined as follows:
1. when only one grapheme follows a single-block head layer, or more than one grapheme follows a multi-block head block, the "middle grapheme" is the corresponding grapheme taken clockwise after the head grapheme;
2. when several graphemes follow a single-block head layer, or only one grapheme follows a multi-block head block, the "middle grapheme" is the corresponding grapheme taken counterclockwise after the head grapheme;
C. the "tail grapheme" is the grapheme in the "Autotoll Chinese character input keyboard grapheme table" that contains the last stroke of the character in stroke order, has the most strokes, and is closest in shape; the tail grapheme is chosen without breaking up the middle grapheme or the last group of graphemes of the character; for a character composed of more than three graphemes, if the character ends in a horizontal stroke and that stroke does not form a grapheme with what lies above it, the tail grapheme must be the preceding grapheme;
1. unless the upper radical consists of a single grapheme, when the lower radical is one of "towel, shellfish, hair, Xiangxi, water, stone, bird, meat, rice, angle", only its final grapheme is taken as the tail grapheme of the character;
2. unless the left component consists of a single grapheme, when the right component is one of "Jie, San, prop up, strike lightly, see, hair, jin, owe, an ancient weapon made of bamboo, melon, bird, page or leaf, See, Wind", only its final grapheme is taken as the tail grapheme of the character;
(3) to raise input speed, the following three input rules are added:
1. high-frequency character = key on which the character sits → space bar; the 32 high-frequency characters and their key positions are as follows:
Q W E R T Y U I O P [
Especially I and people ground with 1,010 be flat not
A S D F G H J K L : ′
But the people greatly send out wide one and and,
Z X C V B N M , . /
Good in the heart is hidden.Have
2. grapheme character = grapheme → position code of the grapheme on its key → n; position codes of a grapheme on its key: upper left corner is C, lower left is F, lower right corner is S, upper right corner is ",";
3. grapheme character = head grapheme → tail grapheme → position code of the tail grapheme on its key; position codes of the tail grapheme on its key: upper left corner is C, lower left is empty, lower right is S, upper right corner is ",";
(5) all numerals, foreign-language letters, and scientific and graphical symbols in the GB character set are entered with three codes, in the same unified way as Chinese characters, so that a user typing ordinary text does not have to break off a train of thought to switch modes with a function key; the codes are as follows:
Symbol: ,    、   。   ∑    !    ?    ∷    ∪    ∩    ∮    ∫    :
Code:   ,    、   .    ′WF  ′CM  ′B/  ′WK  ′WH  ′WI  ′WS  ′WR  ′C;
Symbol: 1    2    3    ……  0    A    B    C    ……  Z
Code:   ′CA  ′CB  ′CC  ……  ′CJ  ′AA  ′AB  ′AC  ……  ′AZ
Symbol: a    b    c    ……  z    ①    ②    ③    ……  ⑩
Code:   ′BA  ′BB  ′BC  ……  ′BZ  ′RA  ′RB  ′RC  ……  ′RJ
Symbol: (1)  (2)  (3)  ……  (10) 1.   2.   3.   ……  10.
Code:   ′SA  ′SB  ′SC  ……  ′SJ  ′TA  ′TB  ′TC  ……  ′TJ
(6) the input of each Chinese phrase, characterized by a four-key code:
two-character phrase = A head + A middle + B head + B middle; three-character phrase = A head + A middle + B head + C head;
phrase of four or more characters = A head + B head + C head + T head; where A is the first character of the phrase, B the second, C the third, and T the last; the phrase code length must be 4, and a shorter code must be padded to that length;
"A head" denotes the head grapheme of the first character of the phrase; "A middle" denotes the middle grapheme of the first character;
"B head" denotes the head grapheme of the second character of the phrase; "B middle" denotes the middle grapheme of the second character;
"C head" denotes the head grapheme of the third character of the phrase; "T head" denotes the head grapheme of the last character; when a character consists of a single grapheme or is a high-frequency character, its "head" also serves as its "middle" and its "middle" as its "head";
(7) an auxiliary input function is available to lighten the user's memorization burden. It rests on the fact that Chinese characters are built from the basic strokes horizontal (一), vertical (丨), left-falling (丿), turning (乙), and dot (丶): it uses these five strokes, together with a summary table of 25 representative graphemes formed from two strokes of a grapheme. The user does not need to remember which graphemes exist; entering, according to the input rules, the corresponding stroke or the representative grapheme formed by two strokes of a grapheme yields the desired grapheme and then the desired Chinese character or phrase.
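
The code-assembly rules in items (4) to (6) of the claim can be sketched as follows. This is a minimal Python sketch, not the patented implementation. The function names `char_code`, `phrase_code`, and `symbol_code` are hypothetical, and graphemes are represented only by the key they sit on. The choice of the middle grapheme (clockwise or counterclockwise) is left to the caller, and the tail grapheme is simplified to the last grapheme given. The ASCII apostrophe stands in for the ′ key, and the codes elided by "……" in the table of item (5) are assumed to run through consecutive letters.

```python
import string

SPACE = " "       # the space bar, used as the end code
APOSTROPHE = "'"  # stands in for the ′ key shown in the symbol table


def char_code(keys, middle=1):
    """Code of one character from the keys of its graphemes, in writing order.

    For more than three graphemes, `middle` is the index of the middle
    grapheme; picking it is left to the caller in this sketch.
    """
    if not keys:
        raise ValueError("a character needs at least one grapheme")
    if len(keys) <= 3:
        code = "".join(keys)
        # Fewer than three keys: close the code with the space bar.
        return code + SPACE if len(keys) < 3 else code
    if not 0 < middle < len(keys) - 1:
        raise ValueError("middle must lie strictly between head and tail")
    return keys[0] + keys[middle] + keys[-1]


def phrase_code(chars):
    """Four-key phrase code; each character is a (head, middle) key pair.

    A middle of None means the head also serves as the middle.
    """
    def head(c):
        return c[0]

    def mid(c):
        return c[1] if c[1] else c[0]

    if len(chars) < 2:
        raise ValueError("a phrase needs at least two characters")
    a, b = chars[0], chars[1]
    if len(chars) == 2:
        return head(a) + mid(a) + head(b) + mid(b)
    if len(chars) == 3:
        return head(a) + mid(a) + head(b) + head(chars[2])
    return head(a) + head(b) + head(chars[2]) + head(chars[-1])


# Family letter and members, following the endpoints listed in item (5).
SYMBOL_FAMILIES = [
    ("C", "1234567890"),
    ("A", string.ascii_uppercase),
    ("B", string.ascii_lowercase),
    ("R", "①②③④⑤⑥⑦⑧⑨⑩"),
]


def symbol_code(ch):
    """Three-key code: apostrophe, family letter, position letter (assumed)."""
    if len(ch) != 1:
        raise KeyError(ch)
    for family, members in SYMBOL_FAMILIES:
        index = members.find(ch)
        if index >= 0:
            return APOSTROPHE + family + string.ascii_uppercase[index]
    raise KeyError(ch)
```

Under these assumptions, `char_code(["K", "L"])` gives `"KL "` (two keys plus the space end code), and `symbol_code("0")` gives `"'CJ"`, which corresponds to the ′CJ entry in the table of item (5).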

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB021259909A CN1195263C (en) 2002-08-08 2002-08-08 Chinese character input technology for instant dictionary

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB021259909A CN1195263C (en) 2002-08-08 2002-08-08 Chinese character input technology for instant dictionary

Publications (2)

Publication Number Publication Date
CN1474254A CN1474254A (en) 2004-02-11
CN1195263C true CN1195263C (en) 2005-03-30

Family

ID=34143173

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021259909A Expired - Fee Related CN1195263C (en) 2002-08-08 2002-08-08 Chinese character input technology for instant dictionary

Country Status (1)

Country Link
CN (1) CN1195263C (en)

Also Published As

Publication number Publication date
CN1474254A (en) 2004-02-11

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20050330

Termination date: 20110808