CN1236914A - Chinese phrase enter method - Google Patents

Publication number
CN1236914A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 99101513
Other languages
Chinese (zh)
Other versions
CN1109287C (en
Inventor
钟明华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Application filed by Individual
Priority to CN 99101513
Publication of CN1236914A
Application granted
Publication of CN1109287C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A Chinese character input system for computers that uses phrase input as its primary means and single-character input as its basis. It combines three-dimensional positioning retrieval, code-length partitioning, high-frequency priority for common phrases and characters, and integration of shape with pronunciation, with three codes per character and four codes per phrase. It has its own 68000-entry phrase library. Its advantages are short code length, a low duplicate-code rate, high input speed, and ease of learning.

Description

Chinese phrase input method
Pinyin abbreviation of the invention: ZWCZ
The present invention is a Chinese character input scheme for computers. It belongs to the field of computer text applications and is intended mainly for the Chinese operating systems of various computers.
At present, the vast majority of keyboard Chinese character input methods used on Chinese operating platforms in China are built on codes of four or more elements. To varying degrees they suffer from bloated codes, characters and phrases that duplicate or shield one another's codes, the need for identification codes, and complicated operation, all of which make them harder to learn and use. The few input methods built on three-element codes, in turn, have many duplicate codes for single characters, few phrases, and no compatibility with the currently popular five-stroke (Wubi) input method.
The purpose of the present invention is to overcome these drawbacks and to provide a Chinese phrase input scheme that is more efficient, more scientific and standardized, and easier to learn and use than existing input methods, and that is compatible with the five-stroke input method, so as to bring computers closer to people and people closer to computers, and to promote the popularization and development of computing in China.
The present invention is a Chinese character keyboard input system in which phrase input is primary and single-character input is the basis. It combines three-dimensional positioning retrieval, code-length partitioning, and high-frequency priority, and consists of a radical keyboard layout, a character and phrase coding scheme, a lexicon scheme, an input operation scheme, and a compatibility scheme. Its characteristics are short codes, easy learning, simple operation, fast input, and accurate positioning.
I. Technical essentials
1. Three-dimensional positioning retrieval
The three-dimensional positioning retrieval technique of the present invention is an original technique. It retrieves the required character from three different directions using only three code elements, narrowing the search one window at a time. Retrieval is fast, positioning is accurate, and the scheme is concise.
Each retrieval dimension can be taken from a variety of angles, for example the initial, final, and tone in a phonetic-code input method, or the head, body, and tail of a character in a shape-code input method. Different retrieval angles select different targets, and the more the angles differ from one another, the more precise the result.
Each retrieval dimension can be divided into segment levels. The finer the division, the more precise the result and the lower the duplicate-code rate.
Another distinguishing feature of the technique is three-key positioning: only three keystrokes are needed to retrieve the required character. Any input method that uses three-dimensional positioning retrieval has a maximum single-character code length of 3, and all 6763 Chinese characters of the GB level-1 and level-2 character sets can be retrieved with three keys.
On a computer keyboard, the keys A to Z provide 26 codable key positions. The duplicate-free code space is 26 for one key, 676 for two keys, 17576 for three keys, and 456976 for four keys. For an input method that encodes the 6763 Chinese characters of the GB2312 character set (or the 15000 Chinese characters of the GBK large character set), coding with one or two keys is clearly impractical, while coding with four or more keys is bloated. Three-key coding is the most appropriate choice. It can be said that three-dimensional positioning retrieval with three-key codes is the most scientific and most concise single-character retrieval technique among keyboard Chinese character input methods.
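The code-space figures above follow from simple exponentiation. The short Python sketch below reproduces them and finds the shortest code length that can hold a given number of characters. The function names are illustrative only and are not part of the invention.

```python
KEYS = 26  # codable key positions A to Z, as stated in the text


def code_space(length: int, keys: int = KEYS) -> int:
    """Return how many distinct codes of exactly `length` keystrokes exist."""
    return keys ** length


def shortest_sufficient_length(char_count: int, keys: int = KEYS) -> int:
    """Return the smallest code length whose code space holds `char_count` characters."""
    length = 1
    while code_space(length, keys) < char_count:
        length += 1
    return length


for n in range(1, 5):
    print(n, code_space(n))  # 26, 676, 17576, 456976

print(shortest_sufficient_length(6763))   # GB2312 character count from the text
print(shortest_sufficient_length(15000))  # GBK character count as given in the text
```

Both character counts cited in the text exceed the 676 codes available with two keys and fit within the 17576 codes available with three.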
Through repeated practice and screening, the present invention settled on its own three retrieval directions: the first radical of the character's shape, the last radical of the character's shape, and the first letter of the character's pinyin, abbreviated as prefix, suffix, and sound head. Each direction is divided into 26 segment levels according to the codable key positions of the keyboard. The 26 codes of one-dimensional retrieval are all assigned to first-level short codes, the 676 codes of two-dimensional retrieval to second-level short codes, and the 17576 codes of three-dimensional retrieval to full single-character codes. All 6763 Chinese characters of the GB level-1 and level-2 character sets (or the 15000 Chinese characters of the GBK character set) can be reached with three keys, and the duplicate-code rate is extremely low.
2. Code-length partitioning
Code-length partitioning, as applied in the present invention, is likewise an original technique. Single characters and phrases are assigned to coding regions of different code lengths, which fundamentally prevents characters and phrases from duplicating or shielding one another's codes.
In the present invention it takes the following form:
① Three codes per character, four codes per phrase: all single-character codes lie in the region of 3 or fewer code elements, and all phrase codes lie in the region of 4 code elements.
② The coding regions of 3 keys or fewer are all assigned to single characters, and the 4-key coding region is all assigned to phrases.
③ The character and phrase coding regions are independent yet complementary, and no key press is needed to switch between character input and phrase input.
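The partitioning described above means that the length of a code alone decides whether it names a character or a phrase. A minimal Python sketch of that dispatch follows; the function name `classify_code` and its return labels are assumptions made for illustration.

```python
def classify_code(code: str) -> str:
    """Route a code to the character or phrase region purely by its length.

    Codes of 1 to 3 letters belong to single characters and codes of
    exactly 4 letters belong to phrases, as the text describes.
    """
    if not (code.isascii() and code.isalpha()):
        raise ValueError("codes consist of the letters A to Z only")
    if 1 <= len(code) <= 3:
        return "character"
    if len(code) == 4:
        return "phrase"
    raise ValueError("code length must be between 1 and 4")


print(classify_code("g"))     # character
print(classify_code("fgy"))   # character
print(classify_code("abcd"))  # phrase
```

Because the two regions never overlap, no mode-switch key is needed, which is the point of item ③.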
3. High-frequency priority
High-frequency priority means that, when the codes are arranged, commonly used characters and phrases are placed first, so that common characters and common phrases can be entered without selecting among candidates.
High-frequency priority can be implemented by dynamic or by static frequency adjustment. The present invention uses static frequency adjustment.
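Static frequency adjustment can be pictured as sorting the candidates that share a code once, using a fixed frequency table, so that the most common candidate always comes first. The sketch below is only an illustration: the candidate names and frequency values are invented, and the text does not say how the invention stores its frequencies.

```python
def order_candidates(candidates, static_frequency):
    """Return candidates sorted by a fixed frequency table, most frequent first.

    `sorted` is stable, so candidates with equal frequency keep their
    original relative order.
    """
    return sorted(candidates, key=lambda item: -static_frequency.get(item, 0))


# Hypothetical candidates that share one code, with made-up frequencies.
frequency = {"rare": 1, "common": 90, "medium": 10}
print(order_candidates(["rare", "common", "medium"], frequency))
# ['common', 'medium', 'rare']
```

With a static table the order never changes during use, so a given code always yields the same first candidate.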
II. Keyboard scheme
1. Radical keyboard layout
The radical keyboard layout is the mapping between the radicals of the input method and the key positions of the computer keyboard.
A radical is the basic unit from which the shape of a Chinese character is built, and every Chinese character can be regarded as composed of radicals. Taking the standard radicals of Chinese characters as its basis and remaining compatible with the five-stroke input method, the present invention defines 200 radicals. According to the first stroke of each radical, they are divided into five basic zones (horizontal, vertical, left-falling, right-falling, and folding) plus a special zone, and mapped to specific keys of the computer keyboard.
Radicals whose first stroke is horizontal correspond to the keys "G, F, D, S, A", with area codes "11, 12, 13, 14, 15";
Radicals whose first stroke is vertical correspond to the keys "H, J, K, L, M", with area codes "21, 22, 23, 24, 25";
Radicals whose first stroke is left-falling correspond to the keys "T, R, E, W, Q", with area codes "31, 32, 33, 34, 35";
Radicals whose first stroke is right-falling correspond to the keys "Y, U, I, O, P", with area codes "41, 42, 43, 44, 45";
Radicals whose first stroke is folding correspond to the keys "N, B, V, C, X", with area codes "51, 52, 53, 54, 55";
The seven radicals "Rolling" "very little" "" "car" "power" "skin" "youngster" correspond to the "Z" key, with area code "60".
The assignment of radicals to specific keys is shown in Figure 1.
The query wildcard key is the "*" key, which can stand in for any code element in a query.
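The zone and area-code assignment above can be written down as a small lookup table. The Python sketch below encodes exactly the keys and area codes listed in the text; the names `ZONES` and `area_code` are illustrative assumptions.

```python
# Zone number -> (first-stroke type, keys in area order), as listed in the text.
ZONES = {
    1: ("horizontal", "GFDSA"),
    2: ("vertical", "HJKLM"),
    3: ("left-falling", "TREWQ"),
    4: ("right-falling", "YUIOP"),
    5: ("folding", "NBVCX"),
}
SPECIAL_KEY = "Z"    # special zone
SPECIAL_AREA = "60"


def area_code(key: str) -> str:
    """Return the two-digit area code of a letter key."""
    key = key.upper()
    if len(key) != 1:
        raise ValueError("expected a single key")
    if key == SPECIAL_KEY:
        return SPECIAL_AREA
    for zone, (_stroke, keys) in ZONES.items():
        position = keys.find(key)
        if position != -1:
            return f"{zone}{position + 1}"
    raise KeyError(key)


print(area_code("G"))  # 11
print(area_code("M"))  # 25
print(area_code("Z"))  # 60
```

The five zones of five keys plus the "Z" key account for all 26 letters, each with a distinct area code.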
2. Character and phrase coding scheme
The present invention is a Chinese character input method that is based on character shape and combines sound and shape. Its character and phrase coding scheme follows the principles of three-dimensional positioning retrieval, code-length partitioning, and high-frequency priority.
Code symbols: ABCDEFGHIJKLMNOPQRSTUVWXYZ
Selection keys: 1234567890
Wildcard query key: *
Maximum single-character code length: 3
Phrase code length: 4
Code identifiers:
Type identifier: key-name character: j; first-level short code: y; second-level short code: e; character-forming radical: c; full code: q; two-character phrase: ce2; three-character phrase: ce3; phrase of four or more characters: ca4
Order identifier: forward order: p; reverse order: n; unordered: 0
Character index: range 1 to 15; ignored: 0
Sound/shape identifier: shape code: s; sound code: i; ignored: 0
Code element index: range 1 to 4; ignored (the code element is taken by key position only): 0
Coding expression:
[type identifier] = [order identifier][character index][sound/shape identifier][code element index] + ... + [order identifier][character index][sound/shape identifier][code element index]
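To make the notation concrete, the following Python sketch parses a coding expression such as q=p0s1+n0s1+p0i1 into its four fields per term. It assumes every term is exactly four symbols long, as in all expressions given in the text, so a two-digit character index (the stated range goes up to 15) is not handled. The class and function names are illustrative.

```python
from typing import List, NamedTuple, Tuple


class Term(NamedTuple):
    order: str       # 'p' forward, 'n' reverse, '0' unordered
    char_index: int  # position of the character in a phrase, 0 = ignored
    kind: str        # 's' shape code, 'i' sound code, '0' ignored
    element: int     # 1 to 4, or 0 = taken by key position only


def parse_term(text: str) -> Term:
    """Parse one four-symbol term such as 'p0s1'."""
    if len(text) != 4:
        raise ValueError(f"expected 4 symbols, got {text!r}")
    order, char_index, kind, element = text
    if order not in ("p", "n", "0") or kind not in ("s", "i", "0"):
        raise ValueError(f"unknown identifier in {text!r}")
    if char_index not in "0123456789" or element not in "01234":
        raise ValueError(f"bad index in {text!r}")
    return Term(order, int(char_index), kind, int(element))


def parse_expression(expression: str) -> Tuple[str, List[Term]]:
    """Split 'name=term+term+...' into the type identifier and its terms."""
    name, separator, body = expression.partition("=")
    if not separator:
        raise ValueError("expression must contain '='")
    return name.strip(), [parse_term(part.strip()) for part in body.split("+")]


print(parse_expression("q=p0s1+n0s1+p0i1"))
```

Read this way, p0s1 is the first shape-code element in forward order, n0s1 is the first shape-code element in reverse order, and p0i1 is the first sound-code element.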
Coding rules:
The coding rules of the present invention fall into two parts, single-character coding and phrase coding, with 8 rules in total:
① Single-character coding
Single-character coding comprises 5 rules: key-name characters, first-level short codes, second-level short codes, character-forming radicals, and full codes.
(1) Key-name characters. In the present invention, each key of the computer keyboard can be represented by one Chinese character, and the character that represents a key is its key-name character. The key-name characters are compatible with the five-stroke input method. The coding rule is:
Key-name character = home key + home key + home key
That is, press the key on which the key-name character sits three times in a row.
The expression is: j=0000+0000+0000
The present invention has 26 key-name characters:
王 (king) ggg, 土 (soil) fff, 大 (big) ddd, 木 (wood) sss, 工 (worker) aaa
目 (eye) hhh, 日 (sun) jjj, 口 (mouth) kkk, 田 (field) lll, 山 (mountain) mmm
禾 (standing grain) ttt, 白 (white) rrr, 月 (moon) eee, 人 (person) www, 金 (gold) qqq
言 (speech) yyy, 立 (stand) uuu, 水 (water) iii, 火 (fire) ooo, 之 ppp
已 nnn, 子 bbb, 女 (woman) vvv, 又 (again) ccc, 纟 (silk) xxx
力 (power) zzz
(2) First-level short codes. A code with only one code element is called a first-level short code. The first-level short codes are compatible with the five-stroke input method. The coding rule is:
First-level short code = prefix, or first-level short code = second prefix
That is, take the first or the second radical of the character's shape code in forward order.
The expression is: y=p0s1 or y=p0s2
The present invention has 26 first-level short codes:
一 (one) g, 地 (ground) f, 在 (at) d, 要 (want) s, 工 (worker) a
上 (up) h, 是 (is) j, 中 (middle) k, 国 (country) l, 同 (same) m
和 (and) t, 的 (of) r, 有 (have) e, 人 (person) w, 我 (I) q
主 (main) y, 产 (produce) u, 不 (not) i, 为 (for) o, 这 (this) p
民 (people) n, 了 b, 发 (send out) v, 以 (with) c, 经 (through) x
打 (beat) z
(3) Second-level short codes. A code composed of two code elements is called a second-level short code. The coding rule is:
Second-level short code = prefix + suffix
That is, take the first radical in forward order and the first radical in reverse order of the character's shape code.
The expression is: e=p0s1+n0s1
The present invention defines 665 second-level short codes, listed in Figure 2. In Figure 2, the code of a character is the code element of its row followed by the code element of its column.
The present invention is among the Chinese keyboard input methods with the largest number of second-level short codes. In an input method, the more second-level short codes that can be defined, the more evenly its radicals are distributed and the more efficient it is to operate.
(4) Character-forming radicals. Of the 200 radicals defined by the present invention, besides the 26 key-name characters, another 103 can stand alone as Chinese characters. A radical that can stand alone as a character is called a character-forming radical. The coding rule is:
Character-forming radical = home key + first stroke + last stroke
That is, first press the key on which the radical sits, then press the keys for the first stroke and the last stroke of the radical.
The expression is:
c=0000+p0a1+n0s1
The present invention has 95 character-forming radicals in total:
Bad ggy two fgg of one ggg king ggg, five ggg Jian ggt ten fgh do fgh scholar fgg
The ancient dgg stone of rain fgy dgg three dgt of dgg factory dog dgy fork-like farm tool used in ancient China dgy fourth sgh west sgg
But sgg sgh twenty agg seven agn at the tenth of the twelve Earthly Branches shoot a retrievable arrow the last hhg of agy dagger-axe agt leather agh ends hhg
_ hhn foretells hhy and says jhg worm jhy jhh river kth first lhh four lhg ware lhg early
Bone mhg is by mhg shellfish mhy towel mhh boat tyy body ttt bamboo tth hand rth jin rth
Gas rtn is etn insect without feet or legs ett pig egy eight wty qtn fish qtg sunset qty with eth
The little ihy of the wide yyt family yyt side yyn literary composition hot uyh sheep uyh of yyy six uyy door uyn
The own nnn of rice oyy industry oyg second nnn nnn in the sixth of the twelve Earthly Branches corpse nnt heart nyy plumage nng bnh
Also bnn ear bgg hole bng Ji vng mortar vtg nine vtn cutter vnt cling to cnn horse cng
The female xny of one xny is xnn cun several xtn of xgy skin xny car xgh of xnt an ancient type of spoon xtn bow not
Zgh cun zgy skin nhy of a few ztn car of ability zgt
In fact, many of the character-forming radicals above are also second-level short codes, for which keying in the first two codes is sufficient.
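The "home key + first stroke + last stroke" rule can be sketched in a few lines of Python. One assumption is made here and should be read as such: each stroke type is entered with the first key of its zone (G, H, T, Y, N, the keys with area codes 11, 21, 31, 41, 51). The text does not state this mapping outright, but it agrees with listed codes such as "ten fgh" and "rain fgy". The function and table names are illustrative.

```python
# Assumed stroke keys: the first key of each stroke zone (an inference, see above).
STROKE_KEY = {
    "horizontal": "g",
    "vertical": "h",
    "left-falling": "t",
    "right-falling": "y",  # assumed to cover the dot stroke as well
    "folding": "n",
}


def radical_code(home_key: str, first_stroke: str, last_stroke: str) -> str:
    """Build the three-key code: home key, then first-stroke and last-stroke keys."""
    return home_key.lower() + STROKE_KEY[first_stroke] + STROKE_KEY[last_stroke]


# "ten" sits on the F key; it starts with a horizontal and ends with a vertical stroke.
print(radical_code("F", "horizontal", "vertical"))       # fgh
# "rain" sits on the F key; it starts with a horizontal stroke and ends with a dot.
print(radical_code("F", "horizontal", "right-falling"))  # fgy
```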
Component radicals of Chinese characters are coded by the same rule as character-forming radicals, and in this sense they are character-forming radicals as well. Besides the character-forming radicals above, another 40 radicals can be entered from the keyboard:
Contraband agn Lv agh European-allies agh Jie bnh Fu bnh Qian bnh Shu hhl mouth lhg San ett
One ggl Pie ttl Dian yyl Ren wth Rui iyg Dao jhh door mhn Xin nyy Xiangxi oyy
Mi pyn Http pyn Woo Pyy Yi Pyy Chuo Pyy Yin pny Bao qtn Jin qtn Cannibals qtn
Quan qtt Rolling xgg Chi tth Pie ttt For-additional tty The-Fan tty Bing uyg Epileptic uyg Zhuang uyh
Chuan vnn Tou yyg Yan yyn system cny
(5) Full codes. A full code is the complete single-character code made up of three code elements. In the present invention every Chinese character has a full code. The coding rule is:
Full single-character code = prefix + suffix + sound head
That is, take the first radical in forward order and the first radical in reverse order of the character's shape code, and the first code element in forward order of its sound code.
The expression is:
q=p0s1+n0s1+p0i1
The full code is the principal application of three-dimensional positioning retrieval in the present invention. The prefix and suffix code elements correspond to the keys on which the respective radicals sit, and the sound-head code element corresponds to the letter key for the first letter of the pinyin. (The pinyin letter "ü" corresponds to the "V" key.)
In a sense, key-name characters and character-forming radicals are two special cases of the full code.
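As a rough illustration of the full-code rule, the Python sketch below joins the key of the first radical, the key of the last radical, and the first letter of the pinyin, mapping "ü" to "v" as the text specifies. How a character is decomposed into radicals is outside the sketch; the keys passed in the example are hypothetical inputs.

```python
def sound_head(pinyin: str) -> str:
    """Return the key for the first letter of a pinyin syllable ('ü' maps to 'v')."""
    syllable = pinyin.strip().lower()
    if not syllable:
        raise ValueError("pinyin must not be empty")
    first = syllable[0]
    return "v" if first == "ü" else first


def full_code(prefix_key: str, suffix_key: str, pinyin: str) -> str:
    """Build a three-key full code: prefix key + suffix key + sound head."""
    return (prefix_key + suffix_key).lower() + sound_head(pinyin)


# Hypothetical character whose first and last radicals both sit on the S key
# and whose pinyin is "lin".
print(full_code("S", "S", "lin"))  # ssl
```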
② Phrase coding
In accordance with code-length partitioning, all phrase codes of the present invention have four code elements. Phrase coding comprises 3 rules: two-character phrases, three-character phrases, and phrases of four or more characters.
(1) Two-character phrases.
The coding rule for two-character phrases is:
Two-character phrase = first character's prefix + first character's suffix + second character's prefix + second character's suffix
That is, take the first code and the second code of each character in turn.
The expression is:
ce2=p101+p102+p201+p202
(2) Three-character phrases. The coding rule for three-character phrases is:
Three-character phrase = first character's prefix + second character's prefix + last character's prefix + last character's suffix
That is, take the first code of each character in turn, then add the second code of the last character.
The expression is:
ce3=p101+p201+p301+p302
(3) Phrases of four or more characters. The coding rule is:
Phrase of four or more characters = first character's prefix + second character's prefix + third character's prefix + last character's prefix
That is, take the first code of the first, second, third, and last characters in turn.
The expression is:
ca4=p101+p201+p301+n101
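The three phrase rules can be combined into one small function. In the Python sketch below each character is represented by a pair (prefix key, suffix key), which is an assumed input format for illustration; the letters in the example are placeholders, not real codes.

```python
from typing import List, Tuple


def phrase_code(characters: List[Tuple[str, str]]) -> str:
    """Build the four-key phrase code from per-character (prefix, suffix) keys."""
    count = len(characters)
    if count < 2:
        raise ValueError("a phrase has at least two characters")
    if count == 2:
        # ce2: prefix and suffix of each character in turn
        keys = [characters[0][0], characters[0][1], characters[1][0], characters[1][1]]
    elif count == 3:
        # ce3: prefix of each character, then suffix of the last character
        keys = [characters[0][0], characters[1][0], characters[2][0], characters[2][1]]
    else:
        # ca4: prefix of the first, second, third, and last characters
        keys = [characters[0][0], characters[1][0], characters[2][0], characters[-1][0]]
    return "".join(keys).lower()


print(phrase_code([("a", "b"), ("c", "d")]))                                  # abcd
print(phrase_code([("a", "b"), ("c", "d"), ("e", "f")]))                      # acef
print(phrase_code([("a", "b"), ("c", "d"), ("e", "f"), ("g", "h"), ("i", "j")]))  # acei
```

Whatever the phrase length, the result always has four code elements, which is what keeps phrases out of the single-character code region.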
3. Lexicon scheme
The lexicon is an integral part of the present invention. It serves three particular purposes:
① Improving the input efficiency of continuous text;
② Removing the constraint of single-character duplicate codes;
③ Reducing text input errors.
The lexicon of the present invention has the following properties:
① Richness. The lexicon is based on the Modern Chinese Dictionary and also takes in colloquialisms, slang, and common expressions. It has 68000 entries in total: 43000 two-character phrases, 10000 three-character phrases, and 15000 phrases of four or more characters.
② Practicality. Phrase selection emphasizes practical use. Besides the formal phrases of the Modern Chinese Dictionary, the lexicon includes informal phrases made up of nouns, verbs, adverbs, and prepositions, so that phrase coverage of ordinary manuscripts exceeds 95%.
③ Currency. The present invention stays close to actual usage and to the times: current expressions and popular terms are included, so that the lexicon meets the practical needs of modern society.
4. Input operation scheme
① Single-character input.
(1) Key-name character: press the corresponding key three times in a row, then press the space bar;
(2) First-level short code: press the corresponding key once, then press the space bar;
(3) Second-level short code: press in turn the keys for the first and second codes, then press the space bar;
(4) Full code: press in turn the keys for the first, second, and last codes, then press the space bar.
(5) Character-forming radical: press the keys for its code elements in turn, then press the space bar.
First-level and second-level short codes have no duplicate codes. If any other code turns out to have duplicates, pressing the space bar commits the first candidate at the screen cursor position; to enter another candidate, select it with a number key.
② Phrase input.
(1) Press the keys for the phrase's code elements in turn, four codes per phrase.
(2) If a phrase code has duplicates, pressing the space bar commits the first phrase at the screen cursor position; to enter another phrase, select it with a number key.
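The selection behaviour described above, where the space bar commits the first candidate and a number key picks another, can be sketched as follows. The text does not say which digit selects which candidate, so this Python sketch assumes the selection keys 1234567890 address candidates one to ten in that order. The function name is illustrative.

```python
from typing import Sequence

SELECT_KEYS = "1234567890"  # selection keys listed in the coding scheme


def choose(candidates: Sequence[str], key: str) -> str:
    """Return the candidate committed by `key`.

    Space commits the first candidate. A digit selects by position, assuming
    '1' is the first candidate and '0' the tenth.
    """
    if key == " ":
        index = 0
    elif len(key) == 1 and key in SELECT_KEYS:
        index = SELECT_KEYS.index(key)
    else:
        raise ValueError(f"not a selection key: {key!r}")
    if index >= len(candidates):
        raise IndexError("no candidate at that position")
    return candidates[index]


candidates = ["first", "second", "third"]  # hypothetical duplicate-code candidates
print(choose(candidates, " "))  # first
print(choose(candidates, "2"))  # second
```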
③ Full-text input
(1) Phrase input is primary, and single-character input is secondary.
(2) Phrase input should rely as far as possible on two-character and four-character phrases. In practice these are the most frequently used, and the present invention's coverage of two-character and four-character phrases is also above 95%.
(3) Single-character input, as the supplementary part, should rely as far as possible on first-level and second-level short codes. The characters covered by these two kinds of code are the most frequently used, and neither kind has duplicate codes, which helps improve text input speed.
5. Compatibility scheme
At present, the 1986 edition of the five-stroke input method is the most widely used and most practical input method in China: more than 70% of computers have it installed, and more than 50% of computer operators use it. Many input methods on the market are limited in their use because they are not compatible with the five-stroke input method. The present invention, as a new input method, adheres fully to Chinese character standards while remaining as compatible as possible with the five-stroke input method. Considerable work went into this, so that five-stroke users can master the present invention quickly with only a little study.
The five-stroke input method is built on four-element codes and is a pure shape-code method, whereas the present invention is built on three-element codes and combines sound and shape. A close look at the coding scheme shows, however, that the combination of sound and shape refers mainly to the "prefix + suffix + sound head" rule of the full single-character code. If phrase input is used to avoid the sound head, or the "*" wildcard key is used in place of the sound head, the present invention becomes a pure shape-code input method. This lays a solid foundation for compatibility with the five-stroke input method.
Compatibility with the five-stroke input method begins with the radical keyboard layout. The inventor refined the five-stroke radical keyboard layout extensively, discarding what was weak and keeping what was sound, as follows:
① Adopt the five-stroke method's division of keys into zones, and cover its 130 radicals completely with 200 radicals that are more intuitive, more vivid, and more standard;
② Add the "Z" key as a code element, increasing the number of code symbols from 25 to 26;
③ Move radicals: the seven radicals "Rolling" "very little" "" "car" "several" "power" are moved from their original "R" "F" "L" "M" "L" keys to the "Z" key;
④ Change the wildcard query key from the "Z" key to the "*" key;
⑤ Add 28 character-forming radicals, "body" "boat" "already" "skin" "Quan" "insect without feet or legs" "bone" "skin" "family" "_" "leather" "fish" "sheep" "Woo" "Yi" "bad" "tenth of the twelve Earthly Branches" "fork-like farm tool used in ancient China" "gas" "ox" "towel" "not" "mother" "skin" "gas" "walk" "can" "", which is more intuitive and convenient for the user while remaining compatible with the two radicals that the five-stroke input method requires to form each of them.
With these five changes, compatibility with the five-stroke input method becomes very simple. A five-stroke user can master the present invention with only a little study, chiefly by noting the following three points:
(1) Note the three-dimensional coding rule "prefix + suffix + sound head": key-name characters are the original key-name characters, first-level short codes are the original first-level short codes, a second-level short code is "first radical + last radical", a full code is "first radical + last radical + first letter of the pinyin", and a character-forming radical is "home key + first stroke + last stroke".
(2) On meeting any of the 28 radicals "body" "boat" "already" "skin" "Quan" "insect without feet or legs" "bone" "skin" "family" "_" "leather" "fish" "sheep" "Woo" "Yi" "bad" "tenth of the twelve Earthly Branches" "fork-like farm tool used in ancient China" "gas" "ox" "towel" "not" "mother" "skin" "gas" "walk" "" "can", do not split it into two radicals again; simply press the key of its original first radical.
(3) Note the key changes: the wildcard key has moved from the "Z" key to the "*" key, and the radicals "skin" "several" "very little" "" "car" "Rolling" "power" have moved from their original "H" "M" "F" "L" "R" "L" keys to the "Z" key.
The present invention is compatible with the five-stroke input method but is not the same as it. Compared with the five-stroke input method, it differs clearly in many respects:
(1) Number of radicals: five-stroke input method 130, present invention 200;
(2) Number of second-level short codes: five-stroke input method 588, present invention 665;
(3) Maximum single-character code length: five-stroke input method 4 codes, present invention 3 codes;
(4) Identification codes required: five-stroke input method yes, present invention no;
(5) Radical mnemonic rhymes to memorize: five-stroke input method required, present invention not encouraged;
(6) Single-character duplicate-code rate: five-stroke input method very low, present invention fairly low;
(7) Phrase duplicate-code rate: five-stroke input method fairly low, present invention lowest;
(8) Shielding between characters and phrases: five-stroke input method partial, present invention none;
(9) Duplicate codes between characters and phrases: five-stroke input method partial, present invention none;
(10) Lexicon entries: five-stroke input method few, present invention 68000;
(11) Ease of learning: five-stroke input method harder, present invention easier;
(12) Ease of use: five-stroke input method good, present invention better;
(13) Proficient input speed: five-stroke input method 120-150 characters per minute, present invention 150-180 characters per minute.
It can be said that, compared with the five-stroke input method (including the 98 Wang code released in May 1998), the present invention is slightly behind only on the single-character duplicate-code rate and ahead on every other measure: it is easier, faster, and more capable, with an efficiency gain of about 25%.
The present invention is fully compatible not only with the five-stroke input method but also with the Chinese character and phrase input method that the inventor completed in May 1998. The two are fully compatible in radical layout, key-name characters, first-level short codes, and four-character phrase codes, and differ in second-level short codes, full codes, two-character phrases, three-character phrases, the wildcard query key, and coding rules. Users of the Chinese character and phrase input method likewise need only a little study to master the present invention, with fewer duplicate codes and higher efficiency.
IV. Distinctive advantages
The present invention is an original input method and also a technical scheme that draws widely on the strengths of others. It brings together the strong points of many Chinese character input methods to form its own style and advantages: short codes, easy learning, simple operation, and fast input.
① Short codes
The present invention uses three-dimensional positioning retrieval, so every Chinese character of the GB level-1 and level-2 character sets can be entered with three keys. Together with its first-level and second-level short codes and its large lexicon, this reduces the keystrokes for continuous text to a minimum: the average code length is only 2.1. It is currently the input method in China with the shortest codes, the fewest keystrokes, and the fastest input.
2., high versatility
Single-character input in the present invention uses a three-code scheme. The kernel is compact, uses little memory and runs fast; it runs comfortably on PCs of 80286 class or above with 2 MB of memory. On high-end Pentium-class machines with 32 MB of memory or more, it can take full advantage of 32-bit data transfer, so that the large phrase dictionary of the present invention performs to its full potential.
The present invention integrates with existing Chinese operating platforms and supports their features and all of their application software; almost no software fails to run with the present invention loaded.
3., easy to learn, simple to operate
(1), three codes per character: codes are short and easy to master;
(2), the radicals are based on the standard Chinese-character radicals, are visually intuitive, and follow the natural decomposition of Chinese characters;
(3), the keyboard radical layout is regular: dot, horizontal, vertical, left-falling, right-falling and hook strokes each have a corresponding key zone, and the arrangement is even, reasonable and highly regular;
(4), the present invention is compatible with the five-stroke keyboard layout; anyone who can use the five-stroke input method can master it with a little study;
(5), the present invention abandons identification codes entirely, greatly reducing the learning burden.
4., fast input
The present invention uses phrase input as the primary mode and single-character input as a supplement; almost an entire manuscript can be entered quickly using phrases. This rests on the following:
(1), an abundant phrase set. The present invention takes the Modern Chinese Dictionary as its source and also takes in colloquialisms, slang, couplets and common expressions. Besides formal phrases it includes the informal phrases needed in practice, 68,000 phrases in all, to meet practical needs.
(2), code-length partitioning. With three codes per character and four codes per phrase, the present invention fundamentally removes the conflict in which characters and phrases mask each other in the output or collide with each other. Phrase input is entirely unconstrained by single-character input, almost an entire manuscript can be entered purely in phrases, and phrase coverage exceeds 95%.
(3), low code collision. Shape-based input methods currently on the market build phrase codes on a "first radical + second radical" pattern. Characters that share a first radical therefore repeat code elements when combined into phrases, which causes many phrase collisions. In the five-stroke input method, for example, the code "qtqt" alone is shared by 12 phrases ("cunning", "ferocious", "ruthlessly", "rampantly", "wildness", "to stroll", "fox", "extremely awkward", "wolf dog", "hunting", "macaque", "adventurous headman"), and the code "qgqg" is shared by "crocodile", "carp", "butterfish" and many other fish names. The present invention, which introduces the three-dimensional positioning retrieval technique, takes phrase codes in "prefix + suffix" form (the first radical and the last radical of a character). This spreads phrase codes far more evenly and greatly reduces collisions. For example, "cunning" is coded "qqqe", "ferocious" is "qhqs" and "fox" is "qrqf"; "crocodile" is "qnqg", "carp" is "qfqg" and "butterfish" is "qjqg". Collisions are extremely rare.
(4), accurate positioning. Unlike some intelligent or phonetic input methods, which require going back to correct the text after a whole sentence has been entered, the present invention places the required character or phrase directly at the cursor position. Using it gives an exhilarating feeling, like the verse "flying straight down three thousand feet, as if the Silver River were falling from Heaven".
The greatest advantage of the present invention is fast, accurate phrase input.
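The collision comparison in item (3) can be reproduced with a few lines of code. This is a minimal sketch, not the inventor's tooling: it uses only the six phrase glosses and codes quoted above, and the helper name `collisions` is an illustrative assumption.

```python
from collections import defaultdict
from typing import Dict, List


def collisions(table: Dict[str, str]) -> Dict[str, List[str]]:
    """Group phrases by code and keep only codes shared by several phrases."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for phrase, code in table.items():
        groups[code].append(phrase)
    return {code: phrases for code, phrases in groups.items() if len(phrases) > 1}


# Codes quoted in the text for the five-stroke input method.
five_stroke_codes = {
    "cunning": "qtqt", "ferocious": "qtqt", "fox": "qtqt",
    "crocodile": "qgqg", "carp": "qgqg", "butterfish": "qgqg",
}

# Codes quoted in the text for the present invention ("prefix + suffix").
invention_codes = {
    "cunning": "qqqe", "ferocious": "qhqs", "fox": "qrqf",
    "crocodile": "qnqg", "carp": "qfqg", "butterfish": "qjqg",
}

print(collisions(five_stroke_codes))  # two codes, three phrases each
print(collisions(invention_codes))    # no shared codes in this sample
```

The sample covers only the phrases named in the text, so it illustrates the mechanism rather than measuring the collision rate of either full dictionary.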
5., broad market prospects
At present, speech input, handwriting input and scanning input methods are surging forward and emerging one after another. Compared with these new non-keyboard input methods, the present invention is slightly behind on only two indices, input speed and comfort. On the other indices, such as hardware adaptability, software adaptability, required hardware, required software, environmental requirements, versatility and input accuracy, it is in no way inferior, and on overall performance it is even slightly ahead. It is no exaggeration to say that the present invention not only surpasses existing keyboard input methods in function, operation and practicality, but also offers a practicality and maturity that non-keyboard input methods cannot match. It can fairly be regarded as one of the most practical keyboard input methods in China, and possibly the best Chinese character coding input method.
According to statistics from the relevant departments, China currently has between 20 million and 25 million microcomputers, and the number grows by 25-50% a year (growth exceeded 50% in both 1996 and 1997); by 2000 it is expected to reach 30 million. With the best overall performance, the present invention is fully capable of winning a place of its own among the many input methods and of attracting new users to adopt it of their own accord.
In addition, where there is a computer there is a keyboard; without a keyboard it is not a computer. For at least the next 10 years the keyboard will remain one of the basic devices of a computer, and keyboard input methods (including the present invention) will remain the main means of entering Chinese characters. It can be said that the present invention holds great social and economic benefits and great room for development, and that it has broad market prospects.
Five, means of implementation
As a keyboard input scheme, the present invention can be implemented with the tools that the various Chinese operating platforms provide, such as Limd of UCDOS and Keytooo of TWAY. The code table generator of Simplified Chinese Windows 3.2, developed jointly by Microsoft (U.S.) and the China New World Electronic Information Technology Research Institute, is another way to implement it. With this input method generator one can produce a Chinese phrase input method that has its own character, matches the style of the Windows operating system, and makes full use of Windows features.
Concrete steps are as follows:
1, create the code table source file for the Chinese phrase input method
1., start Chinese Windows 3.2, double-click "Accessories", then double-click "Write".
2., following the Windows 3.2 input method format and the coding rules of the present invention for first-level short codes, second-level short codes, full codes, character-forming radicals and phrases, create a plain-text code table source file with the .TXT extension, sorted by code:
[Description]
Name=Chinese phrase
MaxCodes=4
UsedCodes=abcdefghijklmnopqrstuvwxyz
WildChar=*
Sort=0
[Text]
One g
R
……
Two fg
Three dg
Key qvj
Dish tlp
……
Horse cng
Ware lbg
……
Chinese khyy
Phrase ykxg
……
Patent Office ftnk
Application form jyge
……
Chinese phrase kyyx
Affords a magnificent spectacle ayuc
……
Chinese phrase input method kyyi
The People's Republic of China kwwl
……
3., exit, saving the file as zwcz.txt in the Windows system directory.
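The layout of the code table source file shown above can be read back with a short parser. This is a minimal sketch under stated assumptions: the function name `parse_code_table` is illustrative, each `[Text]` line is assumed to be an entry followed by its lowercase code (with or without a space between them), and the English glosses stand in for the Chinese entries of the original file. It does not model the actual Windows 3.2 code table generator.

```python
import re
from typing import Dict, List, Tuple


def parse_code_table(text: str) -> Tuple[Dict[str, str], List[Tuple[str, str]]]:
    """Split a code table source into [Description] settings and [Text] entries."""
    description: Dict[str, str] = {}
    entries: List[Tuple[str, str]] = []
    section = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or set(line) <= {"…", "."}:
            continue  # skip blank lines and elision marks
        if line.startswith("[") and line.endswith("]"):
            section = line[1:-1]
        elif section == "Description" and "=" in line:
            key, value = line.split("=", 1)
            description[key.strip()] = value.strip()
        elif section == "Text":
            # Assumption: the code is the trailing run of lowercase letters.
            match = re.fullmatch(r"(.+?)\s*([a-z]+)", line)
            if match:
                entries.append((match.group(1), match.group(2)))
    return description, entries


SAMPLE = """[Description]
Name=Chinese phrase
MaxCodes=4
UsedCodes=abcdefghijklmnopqrstuvwxyz
WildChar=*
Sort=0
[Text]
Two fg
Key qvj
……
Chinese khyy
Application form jyge
Chinese phrase kyyx
"""

description, entries = parse_code_table(SAMPLE)
max_codes = int(description["MaxCodes"])
too_long = [entry for entry, code in entries if len(code) > max_codes]
print(entries, too_long)
```

Checking every code against `MaxCodes` is a cheap way to catch a malformed line before the file is handed to the generator.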
2, create the Chinese phrase input method
1., double-click "Code Table Generator" in the "Main" program group;
2., click the "Browse" button, select the zwcz.txt code table source file in the Windows system directory, and click the "Convert" button;
3., when the conversion completes, click the "Finish" button; the ZWCZ.MB code table file is generated;
4., exit the code table generator, start Control Panel, select the "Input Method" icon, and open the "Input Method" dialog box;
5., select "General Code Table Input Method", click the "Select" button, click the "Settings" button in the "Selected Input Methods" dialog box, and open the general code table dialog box;
6., click the "Install" button, select the ZWCZ.MB file, and press the "OK" button.
At this point the input window of the present invention appears at the bottom of the screen. Exit the "Input Method" dialog box and Control Panel; the present invention can now be used to enter Chinese characters.
The Chinese phrase input method generated by the above procedure is an input method file for the Simplified Chinese edition of Windows 3.x. Besides being used on PWindows 3.x, it can also be installed on Simplified Chinese Windows 95, Windows 97 and Windows 98. The concrete steps are as follows:
1., copy the ZWCZ.MB file to the system directory under WINDOWS on the machine running PWindows9x.
2., click the "Start" button, point to "Programs" and then "Accessories", and click "Input Method Generator".
3., click the "Open File" button and open the ZWCZ.MB code table file.
4., click "Reverse Convert" to generate a new code table source file, ZWCZ.TXT, in PWindows9x format.
5., select the "Create Input Method" tab, click "Browse", select the ZWCZ.TXT code table source file, fill in input method information such as "Chinese phrase", then click OK.
6., click the "Convert" button to regenerate a new ZWCZ.MB code table file.
7., click the "Create" button and fill in the version number and organization name.
8., click the "User-defined" option, then click the "Browse" button to select an icon (ICO file), bitmap (BMP file) and help file (HLP file) of your choice.
9., click the OK button to generate a Chinese phrase input method file (ZWCZ.IME) that carries the user's own customizations, matches the style of the PWindows9x Chinese edition, and makes full use of PWindows9x features.
10., after the input method is generated, the system asks whether to install it. If installation is selected, the system installs the input method automatically. The newly generated Chinese phrase input method is then part of Chinese Windows 9x, and the operator can use it like any pre-installed input method.
(Note: the reverse conversion function of the Microsoft PWindows9x input method generator can decompile the code table file of the present invention into text, which can then be recompiled into a code table file in PWindows9x format. It can also be used to inspect and verify in full the radical settings, character and phrase codes, and dictionary settings of the present invention.) (End of text)

Claims (10)

1, A computer Chinese-character input system, characterized in that it is supported by a three-dimensional positioning retrieval technique, a code-length partitioning technique and a high-frequency-priority technique, and consists of a specific keyboard radical layout scheme, a character and phrase coding scheme, a phrase dictionary scheme, an operating input scheme and a compatibility scheme; with three codes per character, four codes per phrase and a combination of sound and shape, it features short codes, easy learning, simple operation, fast input and accurate positioning.
2, The keyboard radical layout scheme according to claim 1, characterized in that:
1., the radicals are based on the standard Chinese-character indexing radicals, and 200 radicals are defined;
2., the radicals are divided, according to the shape of their first stroke, into five major zones (horizontal, vertical, left-falling, right-falling, fold) and a special zone;
3., radicals whose first stroke is horizontal correspond to the G, F, D, S, A keys of the computer keyboard;
radicals whose first stroke is vertical correspond to the H, J, K, L, M keys;
radicals whose first stroke is left-falling correspond to the T, R, E, W, Q keys;
radicals whose first stroke is right-falling correspond to the Y, U, I, O, P keys;
radicals whose first stroke is a fold correspond to the N, B, V, C, X keys;
the radicals "Rolling", "very little", " ", "car", "power", "skin" and "several" correspond to the Z key.
4., the wildcard query key is "*".
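The zone-to-key assignment in claim 2 can be written down as a small lookup table. This is a minimal sketch: the names `KEY_ZONES`, `WILDCARD` and `zone_of` are illustrative, and the vertical zone is taken as H, J, K, L, M, an assumption based on the five-stroke keyboard layout that the description says the invention is compatible with.

```python
import string

# First-stroke zone -> keys, as listed in claim 2 (vertical zone assumed HJKLM).
KEY_ZONES = {
    "horizontal": "GFDSA",
    "vertical": "HJKLM",
    "left-falling": "TREWQ",
    "right-falling": "YUIOP",
    "fold": "NBVCX",
    "special": "Z",
}
WILDCARD = "*"  # wildcard query key from item 4


def zone_of(key: str) -> str:
    """Return the stroke zone that a single letter key belongs to."""
    if len(key) != 1:
        raise ValueError("expected a single key")
    upper = key.upper()
    for zone, keys in KEY_ZONES.items():
        if upper in keys:
            return zone
    raise KeyError(key)


# The six zones together should cover each of the 26 letter keys exactly once.
all_keys = sorted("".join(KEY_ZONES.values()))
print(all_keys == list(string.ascii_uppercase))
```

With K in the vertical zone, the five zones of five keys plus Z account for all 26 letters with no key used twice.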
3, The coding scheme according to claim 1, characterized in that:
1., the coding scheme is divided into two classes, single-character coding and phrase coding, with eight rules in total;
The single-character coding rules are:
Key-name character = its key + its key + its key
First-level short code = prefix, or first-level short code = second prefix
Second-level short code = prefix + second prefix
Character-forming radical = its key + first stroke + last stroke
Full code = prefix + suffix + phonetic initial
The phrase coding rules are:
Two-character phrase = first character's prefix + first character's suffix + second character's prefix + second character's suffix
Three-character phrase = first character's prefix + second character's prefix + last character's prefix + last character's suffix
Phrase of four or more characters = first character's prefix + second character's prefix + third character's prefix + last character's prefix
2., 26 key-name characters, 26 first-level short codes, 665 second-level short codes and 95 character-forming radicals are defined.
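The three phrase coding rules of claim 3 can be expressed as one function. This is a minimal sketch, assuming each character is already reduced to a (prefix, suffix) pair of key letters; the function name `phrase_code` is illustrative, and the "?" entries are placeholders for values the source does not give. The known letters in the usage lines are read off the sample code table in the description (khyy, ftnk, kwwl).

```python
from typing import List, Tuple

# (prefix, suffix): key of the first radical and key of the last radical.
Char = Tuple[str, str]


def phrase_code(chars: List[Char]) -> str:
    """Build the four-letter phrase code from per-character prefix/suffix keys."""
    n = len(chars)
    if n < 2:
        raise ValueError("a phrase has at least two characters")
    if n == 2:
        # first prefix + first suffix + second prefix + second suffix
        (p1, s1), (p2, s2) = chars
        return p1 + s1 + p2 + s2
    if n == 3:
        # first prefix + second prefix + last prefix + last suffix
        return chars[0][0] + chars[1][0] + chars[2][0] + chars[2][1]
    # four or more: first, second and third prefixes + last prefix
    return chars[0][0] + chars[1][0] + chars[2][0] + chars[-1][0]


two = phrase_code([("k", "h"), ("y", "y")])
three = phrase_code([("f", "?"), ("t", "?"), ("n", "k")])
seven = phrase_code([("k", "?"), ("w", "?"), ("w", "?"), ("?", "?"),
                     ("?", "?"), ("?", "?"), ("l", "?")])
print(two, three, seven)
```

Every rule yields exactly four letters, which is what lets phrase codes occupy a code length of their own, separate from single-character codes.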
4, The phrase dictionary scheme according to claim 1, characterized in that:
1., the dictionary contains more than 68,000 entries in total;
2., two-character phrase entries number more than 43,000;
3., three-character phrase entries number more than 10,000;
4., entries of four or more characters number more than 15,000.
5, The specification document according to claim 1, characterized in that it comprises:
1., an exposition of the development overview of claim 1;
2., an exposition of the functional characteristics of claim 1;
3., an exposition of the technical scheme of claim 1;
4., an exposition of the operating method of claim 1;
5., an exposition of the installation scheme of claim 1;
6., an exposition of the upgrade scheme of claim 1.
6, The character and phrase code dictionary according to claim 1, characterized in that it comprises:
1., a summary of the development overview of claim 1;
2., a summary of the functional characteristics of claim 1;
3., a summary of the technical scheme of claim 1;
4., a summary of the operating method of claim 1;
5., the single-character codes of claim 1, arranged by category;
6., the phrase codes of claim 1, arranged by category.
7, The compatibility scheme according to claim 1, characterized in that:
1., it is compatible with the five-stroke input method;
2., it is compatible with the Chinese character and phrase input method.
8, A window retrieval technique for computer Chinese characters, characterized in that:
1., three-dimensional retrieval: Chinese characters are cross-indexed from three different directions, and each direction is divided into 26 levels according to keys A to Z of the computer keyboard;
2., three-key positioning: at most three code elements determine a specific Chinese character, and the maximum single-character code length is 3.
9, The computer Chinese-character retrieval scheme according to claim 7, characterized in that:
1., the first retrieval dimension is the first radical of the character, the second retrieval dimension is the last radical of the character, and the third retrieval dimension is the first letter of the character's pinyin;
2., the 26 codes of one-dimensional retrieval are all allocated to first-level short codes, the 676 codes of two-dimensional retrieval are allocated to second-level short codes, and the 17,576 codes of three-dimensional retrieval are allocated to full single-character codes;
3., all 6,763 Chinese characters of the GB level-1 and level-2 character sets can be represented with three code elements.
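The code-space sizes in claim 9 follow from 26 keys per dimension, and a code can be mapped to a cell in that space by treating it as a base-26 number. This is a minimal sketch; the names `code_space` and `code_to_index` are illustrative and are not taken from the source.

```python
KEYS = 26        # keys A to Z, one level per key in each dimension
GB_CHARS = 6763  # characters in the GB level-1 and level-2 sets (from the claim)


def code_space(length: int) -> int:
    """Number of distinct codes of exactly `length` keys."""
    return KEYS ** length


def code_to_index(code: str) -> int:
    """Map a 1-3 letter lowercase code to its cell index within its own length."""
    if not 1 <= len(code) <= 3 or not all("a" <= ch <= "z" for ch in code):
        raise ValueError("expected 1 to 3 lowercase letters")
    index = 0
    for ch in code:
        index = index * KEYS + (ord(ch) - ord("a"))
    return index


sizes = [code_space(n) for n in (1, 2, 3)]
print(sizes, GB_CHARS <= sizes[2], code_to_index("zzz"))
```

The three-key space is roughly 2.6 times the size of the character set, which leaves room to assign codes without packing every cell.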
10, A technical scheme for partitioning character and phrase code lengths, characterized in that:
1., three codes per character, four codes per phrase: all single-character codes lie in the region of 3 or fewer code elements, and all phrase codes lie in the region of 4 code elements;
2., the coding region of 3 keys or fewer is allocated entirely to single characters, and the 4-key coding region is allocated entirely to phrases;
3., the character and phrase coding regions are independent of each other yet complementary, and switching between character input and phrase input requires no extra key press.
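The partition in claim 10 means that the length of a code alone decides which table to search, so no mode switch is needed. This is a minimal sketch; the function names and the two small tables are illustrative assumptions, filled with entries from the sample code table in the description.

```python
from typing import Dict, List


def classify(code: str) -> str:
    """Decide from code length alone whether a code names a character or a phrase."""
    if 1 <= len(code) <= 3:
        return "character"
    if len(code) == 4:
        return "phrase"
    raise ValueError("codes are 1 to 4 keys long")


def lookup(code: str,
           characters: Dict[str, List[str]],
           phrases: Dict[str, List[str]]) -> List[str]:
    """Search only the table that matches the code's length."""
    table = characters if classify(code) == "character" else phrases
    return table.get(code, [])


# Hypothetical tables holding a few entries from the sample code table.
characters = {"fg": ["Two"], "dg": ["Three"], "qvj": ["Key"]}
phrases = {"khyy": ["Chinese"], "ykxg": ["Phrase"]}

print(lookup("qvj", characters, phrases), lookup("khyy", characters, phrases))
```

Because the two tables never share a code length, a character can never mask a phrase in the candidate list, or the reverse.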
CN 99101513 1999-01-01 1999-01-01 Chinese phrase enter method Expired - Fee Related CN1109287C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 99101513 CN1109287C (en) 1999-01-01 1999-01-01 Chinese phrase enter method

Publications (2)

Publication Number Publication Date
CN1236914A true CN1236914A (en) 1999-12-01
CN1109287C CN1109287C (en) 2003-05-21

Family

ID=5270491

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 99101513 Expired - Fee Related CN1109287C (en) 1999-01-01 1999-01-01 Chinese phrase enter method

Country Status (1)

Country Link
CN (1) CN1109287C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100397310C (en) * 2006-03-12 2008-06-25 钟明华 Standard Chinese character inputting method
WO2011066757A1 (en) * 2009-12-02 2011-06-09 腾讯科技(深圳)有限公司 Five strokes input system and method
RU2510524C2 (en) * 2009-12-02 2014-03-27 Шэньчжэнь Ши Цзи Гуан Су Информейшн Текнолоджи Ко., Лтд. WuBi INPUT SYSTEM AND METHOD

Also Published As

Publication number Publication date
CN1109287C (en) 2003-05-21

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee