CN1080748A - Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof - Google Patents


Info

Publication number
CN1080748A
Authority
CN
China
Prior art keywords
code
radical
stroke
chinese character
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 93104822
Other languages
Chinese (zh)
Inventor
吴桦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN 93104822
Publication of CN1080748A

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention relates to a new Chinese character information system comprising a complete encoding scheme and a corresponding keyboard. Its key idea is to divide Chinese characters, following their stroke order, into four classes (special characters, two-component characters, three-component characters, and characters with four or more components) and to represent each character by basic stroke codes, radical codes, feature radical codes, and first-and-last-stroke feature codes. The phonetic code scheme matches the way people think, and the shape code scheme can handle characters the user cannot read. Because the codes make full use of the initial consonant and the shape of each character, the method is easy to learn and remember. It can also input simplified and traditional Chinese characters as well as Japanese and Korean Chinese characters, and its duplicate-code rate is low, so it has good application prospects.

Description

Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof
The invention belongs to Chinese character coding technology in the field of information processing. It mainly concerns a method implemented in computers and similar devices, together with the keyboard that the method uses.
With the arrival of the information age, the volume of information to be processed keeps growing, and as the pace of life and work quickens, people demand more and more of Chinese character processing technology. In word processing, office equipment such as computers already has a certain number of users in China, and considerable further growth is expected. Chinese character input, the technology through which Chinese character information is transferred to and processed by a computer, has become an important factor in how efficiently computers can be used. Many input schemes and supporting technologies have therefore been proposed in recent years, both in China and abroad. Reportedly, several dozen of them are in practical use on real machines, and the Patent Office of the People's Republic of China has published more than 200 related patent applications. In summary, the existing methods fall into the following broad categories:
1. Keyboard input methods.
2. Voice-controlled input methods.
3. Auxiliary-device input methods (mouse, handwriting, scanning, etc.).
Voice-controlled and auxiliary-device input methods require more investment in equipment and place strict demands on the input conditions (such as pronunciation or handwriting), so they have certain limitations. A host, a display and a keyboard, by contrast, are the basic configuration that every ordinary computer user has, and keyboard input readily allows touch typing, so keyboard input is the approach that users accept most readily. Most of the related patent applications published so far by the Patent Office of the People's Republic of China concern keyboard input methods.
Keyboard input methods can be further divided into the following classes:
(1) Pure shape-code input methods. These methods split a Chinese character into a number of components and name the components by category. They can input characters that the user cannot read, and their duplicate-code rate is low. But because the component names are largely arbitrary, they take considerable effort to learn and master. For that reason such methods usually come with thick textbooks and training manuals, and proficiency generally requires a fairly long period of training. In practice people tend to follow the maxim that those who do not hold a post need not attend to its affairs, so non-professional typists are reluctant to learn such methods. These methods are therefore used mainly by professional typists and by people who do word processing frequently. The typical example is Wang Yongmin's Wubi Zixing ("five fonts") input method.
(2) Pinyin (phonetic spelling) input methods. These are simple and easy to learn, but they produce too many duplicate codes. The operator must look at the screen frequently to pick out the intended character, so input is slow and the eyes in particular tire easily. In recent years double-spelling methods, which assign initials and finals to keys, have been developed by condensing pinyin, and with additional measures such as supplementary shape codes they have made considerable progress in reducing the duplicate-code rate. However, the key positions of the finals must be memorized by rote, and to reduce duplicate codes and raise input speed the user must also master supporting techniques such as the shape codes. In other words, real proficiency effectively means learning two input methods, a phonetic code and a shape code. Moreover, because sound is the main input means, an operator who does not work with text professionally will have difficulty with characters that he or she cannot read. Even for known characters, the many characters with more than one reading (differing in initial or in final) make input harder. These methods are therefore not very suitable for professional typists.
(3) Mixed input methods. These methods usually select a set of radicals and, as far as possible, name them according to the sound and shape of Chinese characters. Handled well, such a method can combine the advantages of the pinyin and pure shape-code methods; handled badly, it may be both hard to learn and hard to use. Methods of this kind account for a sizeable share of the many encoding schemes, but few have actually been promoted, largely because many of them fail to reconcile ease of learning with a low duplicate-code rate. The more influential methods of this kind at present include Zhang Guofang's "five cross units", Wang Renfang's "front-three-end-one" and Li Xingmin's "four sound shapes". Each has its own merits, but each also has shortcomings. "Five cross units" has very few radicals, which makes actual code splitting harder, and its clockwise code-taking rule does not match the way people write. The radical naming of "front-three-end-one" is close to that of the shape-code methods, and it has a larger number of code elements. "Four sound shapes" uses the initial consonant of the character itself as one code, so that code is useless when the user meets a character that he or she cannot read.
In addition, the shape-code and mixed methods above usually classify characters by structure (top-bottom, left-right, enclosing and so on) when taking codes. This suits most characters, but because many methods give no further rules, in practice a good many characters, such as "fearful" and "events", leave the operator unsure how to split them.
Furthermore, many current Chinese character coding methods cannot handle traditional characters or meet the needs of the international standard ISO-10646, which includes about 20,000 Chinese, Japanese and Korean Chinese characters, so they lack generality.
The object of the invention is to provide a Chinese character keyboard input method, and a practical keyboard for it, that organically combines the pronunciation, the shape and the stroke features of Chinese characters and overcomes the shortcomings of existing Chinese character coding techniques. The method can be mastered by anyone who knows a few hundred basic characters; it is easy to learn, remember and master; it suits simplified and traditional Chinese characters (including Japanese and Korean Chinese characters); it has a low duplicate-code rate; and it suits both high-speed touch typing by professional typists and fast character input by general users.
The design rests on the following reasoning. Chinese characters are pictographic characters built from basic strokes and radicals. If basic strokes and radicals are regarded as the components that form characters, then once these components are classified and each is given a name and a code, any character can be represented by a few codes. The component names should therefore, as far as possible, be tied to the initial consonant of the character itself or to an everyday character, so that they are easy to remember. How many radicals to select and how many codes to use per character are questions that must be weighed together. Some encoding schemes compress the number of basic radicals as far as possible to reduce the memory load, but if this is done badly it tends to make actual code splitting harder. In fact, a Chinese character is formed by first building components from basic strokes and then assembling the components on a two-dimensional plane. There are several hundred basic components, and leaving them out of a table does not make them cease to exist. Most people who input Chinese characters have some education and know at least 500 characters, usually about 3,000 or more, so there is no need to treat them like children just starting primary school. As long as the radicals are named sensibly and the code-taking rules are clear, a somewhat larger number of radicals does not make memorization much harder.
As for how many codes represent each character, three approaches are currently in use. In the first, the number of codes is not fixed: a two-component character takes two codes, a three-component character takes three codes, and so on. The second takes three codes, and the third takes four codes. Judging from the prior art, variable-length and three-code schemes do shorten the average code length of a single character, but they can hardly guarantee a low duplicate-code rate. Chinese character input today relies mainly on word input, with single-character input as a supplement. In word input, a two-character word usually takes two codes from each character; a three-character word takes the first code of each character plus the second code of the first or third character; and a word of four or more characters takes the first code of its first, second, third and last characters. The advantage in average single-character code length gained by variable-length or three-code schemes is therefore not significant, while such schemes hinder touch typing and lead to a higher single-character duplicate-code rate. Four-code schemes, by contrast, can usually achieve a low duplicate-code rate.
Several currently popular input methods illustrate this.
Three-code schemes:
"associating 45-3": duplicate-code rate about 2%, but as many as 45 code elements
"money code": duplicate-code rate 4%, 39 code elements
"two-dimensional three-code": duplicate-code rate 16%, 28 code elements
Four-code schemes:
Wubi Zixing ("five fonts"): duplicate-code rate about 3.9%, 25 code elements
"five cross units": duplicate-code rate 5.5%, 26 code elements
"front-three-end-one": duplicate-code rate about 1.55%, 39 code elements
(Source: "Chinese information" 1990.4 and "PC World" 1989.11.29.)
As can be seen, three-code methods have a higher duplicate-code rate, and lowering it requires more code elements. From an ergonomic standpoint, however, that costs more than it gains. Considering the dexterity of the fingers and fatigue during work, the code elements should be concentrated as far as possible in a small area centered on the thumbs. On the current international QWERTY keyboard, they should lie as close as possible within a semicircular area centered on the letter "B", with as small a radius as possible. Based on this analysis, the invention adopts 26 code elements and, depending on need, offers two schemes for code length: a fixed-length scheme whose full code length is 4, and a variable-length scheme.
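The word-input convention mentioned in this passage (two codes per character for two-character words, and so on) can be sketched in Python. This is only an illustration under stated assumptions, not part of the patent: the function name `phrase_code`, the flag `extra_from_last`, the order in which the letters are concatenated in the three-character case, and the sample codes are all hypothetical.

```python
def phrase_code(char_codes, extra_from_last=False):
    """Build a four-letter word code from per-character codes.

    char_codes: one code string per character, each at least two letters long.
    The sample codes used below are placeholders, not real codes.
    """
    n = len(char_codes)
    if n < 2:
        raise ValueError("a word needs at least two characters")
    if any(len(code) < 2 for code in char_codes):
        raise ValueError("each character code needs at least two letters")
    if n == 2:
        # Two-character word: the first two codes of each character.
        return char_codes[0][:2] + char_codes[1][:2]
    if n == 3:
        # Three-character word: the first code of each character, plus the
        # second code of the first (or, optionally, the third) character.
        # Appending the extra letter at the end is an assumption.
        donor = char_codes[2] if extra_from_last else char_codes[0]
        return "".join(code[0] for code in char_codes) + donor[1]
    # Four or more characters: first code of characters 1, 2, 3 and the last.
    picked = list(char_codes[:3]) + [char_codes[-1]]
    return "".join(code[0] for code in picked)


print(phrase_code(["ABCD", "EFGH"]))
print(phrase_code(["ABCD", "EFGH", "IJKL"]))
print(phrase_code(["ABCD", "EFGH", "IJKL", "MNOP", "QRST"]))
```

In every case the word code comes out four letters long, which is why a shorter single-character code brings little benefit once input is mostly word-based.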
A careful analysis shows that Chinese characters are formed in the following ways:
1. Basic characters formed from basic strokes such as horizontal, vertical, left-falling, right-falling and turning strokes.
For example:
人 习 弓 上 十 丁
Characters of this kind are normally the smallest units, which cannot be split further, and they form the basic components of block characters. They can therefore be called basic characters or single-body characters.
2. Characters built by combining basic characters in two dimensions on a plane.
For example:
日 + 月 = 明    人 + 王 = 全    日 + 月 + 皿 = 盟
3. Characters that combine a radical that has a conventionally accepted name with basic strokes or characters.
For example:
亻 + 二 = 仁    文 + 刂 = 刘
4. Characters that combine a stroke group (a set of basic strokes that is neither a character nor a radical with a conventional name) with basic strokes, basic characters, radicals or other such stroke groups.
For example:
The tops of characters such as 考, 老, 孝, 青, 毒 and 责.
In daily life people usually describe stroke groups of this kind by citing an everyday character.
For example:
老 is the top of 考 (as in "examination") with 匕 (as in "dagger") added below.
者 is the top of 考 with the bottom of the character "lose" added below.
The concrete scheme of the invention was conceived on this basis and is described in detail below:
(1) A Chinese character is regarded as being built up from components one by one, like bricks, in stroke order. The stroke order meant here is the standard one. The state has carried out standardization work in this area; for example, the spoken and written languages working committee in Beijing has edited and published a dictionary of stroke order for everyday characters. In general the main rules are: horizontal before vertical, left-falling before right-falling, top to bottom, left to right, outside before inside, inside before closing the frame, and middle before the two sides. If, under the standard stroke order, the strokes of one component are begun, another component is then written, and the writer afterwards returns to finish the first component, the component whose strokes were begun first still counts as the earlier component. For example, in 或 one horizontal stroke is written first, then 口 and the horizontal stroke below 口, and only then the remaining strokes of 戈. When 或 is split into components it is treated as 戈, 口, 一.
Components include basic strokes, basic characters, common radicals, and unnamed stroke groups.
The invention selects eight basic strokes: horizontal, vertical, left-falling, right-falling, horizontal turn, vertical turn, hook, and bend. Chinese characters actually have more than eight kinds of stroke, but all of them can be assigned to these eight. For example, a rising stroke from lower left to upper right counts as a horizontal stroke; a dot falling from upper right to lower left counts as a left-falling stroke; a dot falling from upper left to lower right counts as a right-falling stroke; a stroke that runs from left to right and then turns (including the 乙 shape) counts as a horizontal turn; a stroke that runs from top to bottom and turns once to the right counts as a vertical turn; a stroke that runs from top to bottom and turns once to the left counts as a hook; and every stroke with two turns, except the 乙 shape, counts as a bend. Taking as the code of each of the eight strokes either the pinyin letter of its pronunciation or the English letter that most resembles its shape is undoubtedly the easiest to remember. The basic strokes, as one part of the components, are listed with their codes in the basic stroke code table shown in Fig. 1.
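The assignment of actual strokes to the eight basic classes can be sketched as a small classifier. This is a minimal sketch under stated assumptions: the `Stroke` descriptor and its fields are hypothetical, and the stroke codes themselves are not modelled because they appear only in Fig. 1.

```python
from dataclasses import dataclass
from enum import Enum


class StrokeClass(Enum):
    HORIZONTAL = "horizontal"
    VERTICAL = "vertical"
    LEFT_FALLING = "left-falling"
    RIGHT_FALLING = "right-falling"
    HORIZONTAL_TURN = "horizontal turn"
    VERTICAL_TURN = "vertical turn"
    HOOK = "hook"
    BEND = "bend"


@dataclass(frozen=True)
class Stroke:
    """Hypothetical stroke descriptor, invented for this illustration."""
    direction: str           # "right", "up-right", "down", "down-left", "down-right"
    turns: int = 0           # number of turns in the stroke
    turn_to: str = ""        # "left" or "right", for a vertical stroke with one turn
    yi_shaped: bool = False  # True for the 乙-shaped stroke


STRAIGHT = {
    "right": StrokeClass.HORIZONTAL,
    "up-right": StrokeClass.HORIZONTAL,       # rising stroke counts as horizontal
    "down": StrokeClass.VERTICAL,
    "down-left": StrokeClass.LEFT_FALLING,    # includes the dot falling to the left
    "down-right": StrokeClass.RIGHT_FALLING,  # includes the dot falling to the right
}


def classify(stroke):
    if stroke.yi_shaped:
        return StrokeClass.HORIZONTAL_TURN    # 乙 shape counts as a horizontal turn
    if stroke.turns >= 2:
        return StrokeClass.BEND               # two turns, 乙 shape excepted
    if stroke.turns == 1:
        if stroke.direction == "right":
            return StrokeClass.HORIZONTAL_TURN
        if stroke.direction == "down" and stroke.turn_to == "right":
            return StrokeClass.VERTICAL_TURN
        if stroke.direction == "down" and stroke.turn_to == "left":
            return StrokeClass.HOOK
        raise ValueError("one-turn stroke not covered by the stated rules")
    return STRAIGHT[stroke.direction]


print(classify(Stroke("up-right")).value)
print(classify(Stroke("down", turns=1, turn_to="left")).value)
```

Testing the 乙 shape before counting turns mirrors the text, which names it as the one exception to the two-turn rule.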
After extensive trial coding and analysis, and guided by the principles of easy code splitting, the lowest possible duplicate-code rate, and codes that are as uniform as possible across simplified and traditional characters, a set of basic characters, dictionary radicals and stroke groups was selected, together with similar-looking characters and stroke groups. These characters, dictionary radicals and stroke groups are collectively called radicals, and they form the radical table shown in Fig. 2. In Fig. 2 the first column holds the radical code, the second the parent radicals, the third the names and mnemonic words of the parent radicals, the fourth the child radicals, and the fifth the traditional Chinese characters and the Chinese characters used in Japan and Korea. Radicals are divided into character radicals and dictionary radicals, and each kind is further divided into basic radicals and complex radicals. A basic radical consists of a basic character, or of a basic dictionary radical or stroke group comparable to a basic character; a complex radical is a combination of basic radicals. The radicals in the table are arranged in English alphabetical order. The letters "A", "E" and "O" correspond exactly to the three finals "a", "e" and "o" that form syllables on their own in pinyin, while the pinyin initials "zh", "sh" and "ch" are replaced by "I" (chosen for its shape), "V" (chosen for its convenient key position, since many radicals fall under it) and "U" (the only remaining non-consonant key) respectively. The first radical listed for each letter is defined as the key-name radical. The two letters after each radical are the stroke codes of its first and last strokes. Except for a few radicals such as 万 and 方, the stroke order follows the standard set by the State Language Work Committee. A digit 1 before the first-and-last-stroke code indicates a character radical, and a digit 2 indicates a dictionary radical. The letter after the first-and-last-stroke code is the feature radical code of the radical. In the radical table, a character generally takes the initial consonant of its pronunciation as its code; a dictionary radical takes the initial of a key word in its name; and an unnamed stroke group takes the initial of a common character that contains it and is named after that character (top of X, side of X, bottom of X, shape of X, frame of X) to aid memory. As the table shows, radicals formed by characters are character radicals, while dictionary radicals, unnamed stroke groups and the radicals in the similar-shape column are dictionary radicals.
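The mapping from a pinyin initial to one of the 26 letter keys can be sketched as follows. This is a minimal sketch of the general rule only; the function name `key_for_syllable` is hypothetical, and it ignores the per-radical exceptions that Fig. 2 and the later adjustments introduce.

```python
# "zh", "sh" and "ch" are mapped to the letters I, V and U, as stated in the text.
DIGRAPH_KEYS = {"zh": "I", "sh": "V", "ch": "U"}


def key_for_syllable(pinyin):
    """Return the key letter for a toneless pinyin syllable such as 'zhong'."""
    syllable = pinyin.strip().lower()
    if not syllable.isalpha():
        raise ValueError("expected a toneless pinyin syllable")
    if syllable[:2] in DIGRAPH_KEYS:
        return DIGRAPH_KEYS[syllable[:2]]
    # Every other syllable, including the zero-initial ones that begin with
    # a, e or o, is keyed by its first letter.
    return syllable[0].upper()


for s in ("zhong", "shan", "chang", "an", "er", "ma"):
    print(s, key_for_syllable(s))
```

No pinyin syllable is spelled with an initial "i", "u" or "v", which is what leaves those three keys free for "zh", "ch" and "sh".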
A few characters, such as "Yu", [Figure 931048222_IMG2], "Jue" and "You", are uncommon. Although they are Chinese characters, the invention does not take their own initial consonant as their code but treats them as unnamed stroke groups, so radicals of this kind are regarded as dictionary radicals.
Most character radicals among the components are basic characters. A basic character, as defined by the invention, is a smallest character that cannot be split further: splitting it again would yield a basic stroke or a stroke group that is not in the radical table of Fig. 2. For example, "bad" is a basic character; splitting it further would yield a basic stroke. Most basic characters are common characters, and their code is simply the initial consonant of their pronunciation.
An unnamed stroke group that stands on its own, and that either has no suitable similar-shaped radical or occurs in only a few characters, takes its first-stroke code as the code of the radical. Fig. 2 gives some examples, namely the radicals marked "*", such as [Figure 931048222_IMG3], " " and [Figure 931048222_IMG4]. At the end of each letter's section in Fig. 2 there is a full-width row listing the basic characters that have no similar-shaped radical.
Because many radicals have "M" or "Y" as their initial, the radicals coded with these two letters were adjusted as necessary to reduce the duplicate-code rate. 衤 (commonly called the "clothing" side radical) is renamed the 袄 ("coat") side radical, with code "A". 月 is given the code "O", suggesting the round moon, but when 月 is the second or a later component it still takes the code "Y". 目 and 酉 are named the 盼 side radical and the 配 side radical respectively, with code "P", but when they are the second or a later component they still take the codes "M" and "Y". 米 takes the code "L" whatever its position.
For the convenience of general users, some radicals have been merged: 七 is merged into 匕, 士 into 土, 夭 into 天, 曰 into 日, and so on.
The radical "yarn" takes the code "J" when it is the last component and "L" otherwise, which makes fault-tolerant input easier.
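The position-dependent codes described above can be sketched as a lookup keyed on the radical and its position in the component sequence. This is a sketch under stated assumptions: the function name `adjusted_code` is hypothetical, "the second or a later component" is read as any position other than the first, and the key "yarn" stands in for the radical that the text names only by that gloss.

```python
# radical -> (code as first component, code in any later position)
FIRST_VS_LATER = {
    "月": ("O", "Y"),
    "目": ("P", "M"),
    "酉": ("P", "Y"),
}

# Radicals whose adjusted code does not depend on position.
FIXED = {"衤": "A", "米": "L"}


def adjusted_code(radical, index, total):
    """Return the code of `radical` at 0-based position `index` of `total` components."""
    if not 0 <= index < total:
        raise ValueError("index must lie within the component sequence")
    if radical in FIRST_VS_LATER:
        first, later = FIRST_VS_LATER[radical]
        return first if index == 0 else later
    if radical in FIXED:
        return FIXED[radical]
    if radical == "yarn":
        # "J" only in the last component position, "L" everywhere else.
        return "J" if index == total - 1 else "L"
    raise KeyError(radical)


print(adjusted_code("月", 0, 2), adjusted_code("月", 1, 2))
print(adjusted_code("yarn", 0, 2), adjusted_code("yarn", 1, 2))
```

Keeping the exceptions in small tables, separate from the general initial-consonant rule, makes it easy to see that only a handful of radicals depart from that rule.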
Variant radicals are numerous and poorly standardized, and some variants with obvious features are not listed in Fig. 2. Examples are 宀, "Epileptic" and "Yi", which contain a short vertical stroke in place of a dot stroke, and "partly", [Figure 931048222_IMG5] and " ", in which a right-falling stroke on the left and a left-falling stroke on the right become a left-falling stroke on the left and a right-falling stroke on the right. Unless the Chinese character library specified by the invention contains both forms of the corresponding character, the first stroke of such a radical is coded with the stroke code specified in Fig. 2.
The first stroke of 火 can be written either as a left-falling or as a right-falling stroke. The invention stipulates that the first stroke is taken as a left-falling stroke when 火 is the first component, and as a right-falling stroke when it is the second or a later component. 九 is coded as a 乙-shaped stroke (code "Y") followed by a left-falling stroke (code "P"); every other 乙-shaped stroke is coded "Z". The invention further stipulates that, in traditional characters, if 儿 has other radicals both before and after it, it is coded "B" like 八 rather than "E". In the traditional-character shape-code scheme, when the component count reaches four or more, 宀 and 儿 are combined and coded as 穴.
(2) According to the actual structure of Chinese characters, the invention divides them into four classes: special characters, two-component characters, three-component characters, and characters with four or more components.
Special characters, as the term is used here, are:
<1> the radicals in Fig. 2 (excluding the characters marked with "#");
<2> a small number of basic characters that do not serve as components, such as "grasping" and "protruding".
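The four classes can be sketched as a simple classification by membership and component count. This is a minimal sketch: the names `CharClass` and `classify_character` are hypothetical, and the set of special characters passed in below is a placeholder, since the real set is defined by Fig. 2.

```python
from enum import Enum


class CharClass(Enum):
    SPECIAL = "special character"
    TWO = "two-component character"
    THREE = "three-component character"
    FOUR_OR_MORE = "character with four or more components"


def classify_character(char, components, special_chars):
    """Classify `char` given its component split and the set of special characters."""
    if char in special_chars:
        return CharClass.SPECIAL
    count = len(components)
    if count < 2:
        raise ValueError("a non-special character needs at least two components")
    if count == 2:
        return CharClass.TWO
    if count == 3:
        return CharClass.THREE
    return CharClass.FOUR_OR_MORE


# Placeholder set of special characters, assumed for illustration only.
special = {"人", "王", "日", "月", "皿"}
print(classify_character("全", ["人", "王"], special).value)
print(classify_character("盟", ["日", "月", "皿"], special).value)
print(classify_character("人", ["人"], special).value)
```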
Characters of two or more components, as the term is used here, are those that can be formed from two or more of the basic strokes or radicals in Fig. 1 and Fig. 2. For dividing a character into components, the invention lays down several specific principles:
<A> Split detached components, split connected components, do not split interpenetrating components
The components of a character are combined in three main ways:
<a> Detached: the components do not touch one another, as in 盟 and "slowly". Characters of this kind are normally split block by block.
<b> Connected: strokes of the components touch one another, as on the right-hand side of characters such as "say", "that" and "strong". Connected components are normally split as well. The invention gives some specific rules for this kind of division so that it is well defined.
<c> Interpenetrating: strokes of the components cross one another, as in "interior" and "car". A unit of this kind is treated as one basic character and is not split further. The invention stipulates that when a character or radical combined with a single dot stroke, or a pair of left-falling and right-falling strokes separated by the central vertical stroke of a basic character or radical, forms a Chinese character, the result is treated as one basic character. For example, 术, 太, 犬, "really" and 半 are each handled as a basic character, that is, as a component. Basic characters of this kind are listed in Fig. 2 as radicals. When a basic stroke that is not a dot is set apart from a character or radical, however, the two are handled as two components. Characters such as "nowadays" and "skill" are handled as radicals. Where it is easy to confuse whether a basic stroke of a character is separate from the radical, the invention makes an explicit ruling and lists the character in Fig. 2.
<B> Complex radicals take priority over basic radicals
Code splitting normally follows the minimum principle, which lowers the number of characters the user must know. But in order to reduce the duplicate-code rate, to stay as consistent as possible with traditional characters, and to allow page-by-page lookup after three codes have been keyed, this principle should not be followed completely. The radical table of Fig. 2 therefore contains some complex radicals, such as [Figure 931048222_IMG6], " ", "suffering" and "opinion". Normally components are split by the minimum principle, that is, until any further split would yield a basic stroke or a stroke group that is not in Fig. 2. But if two of the radicals obtained in this way together form a complex radical in Fig. 2, and the character still has two or more components after they are combined, the largest radical in Fig. 2 is chosen. For example, 京 would be a three-component character if split minimally, with components 亠, 口 and 小. But because Fig. 2 contains the complex radical [Figure 931048222_IMG9], 京 is a two-component character and should be split into [Figure 931048222_IMG9] and 小. Likewise, 短 should be split into the three components 矢, " " and [Figure 931048222_IMG11], and not into the five components " ", "big", "one", "mouth" and " ". In one sentence: take radicals as large as possible and component counts as small as possible.
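The rule "take radicals as large as possible, component counts as small as possible" can be sketched as a merge pass over a minimal split. This is a sketch under stated assumptions: the function name `merge_complex_radicals`, the left-to-right greedy order, and the placeholder name `"亠口"` for the complex radical that appears only as an image are all hypothetical.

```python
def merge_complex_radicals(parts, complex_radicals):
    """Merge adjacent components that together form a complex radical.

    parts: components of a character in stroke order, split by the minimum principle.
    complex_radicals: maps a pair of adjacent components to the complex radical
    they form (a stand-in for the complex radicals listed in Fig. 2).
    A merge is applied only if the character keeps at least two components.
    """
    result = list(parts)
    i = 0
    # Merging shortens the list by one, so at least three components are
    # needed beforehand for at least two to remain afterwards.
    while i < len(result) - 1 and len(result) > 2:
        pair = (result[i], result[i + 1])
        if pair in complex_radicals:
            result[i:i + 2] = [complex_radicals[pair]]
        else:
            i += 1
    return result


table = {("亠", "口"): "亠口"}  # placeholder name for the complex radical
print(merge_complex_radicals(["亠", "口", "小"], table))
print(merge_complex_radicals(["亠", "口"], table))
```

In the second call nothing is merged, because combining the only two components would leave a single component and the character would no longer be a character of two or more components.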
<C 〉. Chinese character root priority is greater than the radicals by which characters are arranged in traditional Chinese dictionaries radical, and radicals by which characters are arranged in traditional Chinese dictionaries radical priority is greater than basic stroke, and " one " word is handled as basic stroke, when priority is identical with regard to preceding not just after.
Specifically, if a stroke can form a radical with the radical before it and can also form a radical with the radical after it, the front-first principle normally applies. For example, the part on the right-hand side of the character "connection" should be split into the two parts "[Figure 931048222_IMG13]" and "big" rather than "Ha" and "sky", because the priority is the same in both cases: each is one dictionary-header radical plus one Chinese character root. The character "also", however, should be split into "Ha" and "opening" rather than "[Figure 931048222_IMG14]" and "European-allies", because the latter turns a combination of one dictionary-header radical and one Chinese character root into a combination of two dictionary-header radicals. In the same way, the character "entirely" should be split into "people" and "king" rather than " " and "soil", because the latter turns a combination of two Chinese character roots into a combination of one dictionary-header radical and one Chinese character root. The division of the radicals in Fig. 2 into Chinese character roots and dictionary-header radicals exists precisely for this purpose. In practice the principle mainly concerns characters built from "Ha", "[Figure 931048222_IMG16]", "people", "[Figure 931048222_IMG17]" and "[Figure 931048222_IMG18]", such as "also", "close", "satisfy", "complete", "food", "gold" and "can". Their number is limited, and once the splitting rules for these characters have been learned there is no further need to consider which class a radical belongs to. Because the character "one" is treated as a basic stroke, the character "illiteracy" should be split into the four parts "Lv", "Mi", "one" and "pig" rather than the four parts "Lv", "Mi", "two" and "[Figure 931048222_IMG19]".
Summarizing the foregoing gives the following tree structure:
Figure 931048222_IMG20
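The priority principle (C) described above can be sketched in a few lines. This is only an illustration: the numeric ranks and the function name are assumptions introduced here, and each candidate split is described just by the classes of its two parts, since the radical table of Fig. 2 is not reproduced in this text.

```python
# Rank of each part class; the numbers are an assumption that only encodes
# "Chinese character root > dictionary-header radical > basic stroke".
PRIORITY = {"character_root": 2, "dictionary_radical": 1, "basic_stroke": 0}


def pick_split(front_split, back_split):
    """Choose between two candidate splits of a character.

    front_split: part classes when the shared stroke joins the preceding radical.
    back_split:  part classes when the shared stroke joins the following radical.
    Returns "front" or "back".
    """
    front_score = sum(PRIORITY[kind] for kind in front_split)
    back_score = sum(PRIORITY[kind] for kind in back_split)
    # Equal priority: "front first", as stated in principle (C).
    return "back" if back_score > front_score else "front"


# "entirely": two character roots beat a header radical plus a character root.
print(pick_split(("dictionary_radical", "character_root"),
                 ("character_root", "character_root")))
# Right-hand part of "connection": both splits rank the same, so front first.
print(pick_split(("dictionary_radical", "character_root"),
                 ("dictionary_radical", "character_root")))
```

Summing the ranks is sufficient for the two-part cases discussed in the text; whether it generalizes to longer splits is not stated in the source.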
The invention provides two schemes, a font code and a sound code. For the font code scheme, in order both to avoid any dependence on the pronunciation of a character and to distinguish characters as far as possible, the invention introduces a first-and-last-stroke feature code, called the feature code for short. The feature code is chosen on the basis of five basic strokes: horizontal, vertical, left-falling, right-falling and fold. The horizontal fold, vertical fold, vertical hook and bend are all counted as the fold stroke. The five possible first strokes combined with the five possible last strokes give 25 feature codes. For each of these 25 combinations, the code of a radical of similar stroke shape in the radical table of Fig. 2 is selected as the feature code, as shown in Fig. 3. Only four entries in the feature code table of Fig. 3 are assigned otherwise: a horizontal first stroke with a vertical last stroke takes "F" (similar in shape); horizontal with right-falling takes "U" (anti-factory); vertical with horizontal takes "O"; and left-falling with right-falling takes "X" (similar in shape).
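The construction of the 25 feature codes can be illustrated as follows. This is a minimal sketch: the stroke names are English labels chosen here, and only the four letter assignments stated in the text (F, U, O, X) are filled in, because the remaining entries of Fig. 3 are not reproduced.

```python
from itertools import product

# The five stroke classes used for the feature code.
BASIC_CLASSES = ("horizontal", "vertical", "left_falling", "right_falling", "fold")
# Strokes that the text says are all counted as "fold".
FOLD_VARIANTS = {"horizontal_fold", "vertical_fold", "vertical_hook", "bend"}

# Only the assignments named explicitly in the text; the rest are in Fig. 3.
STATED_CODES = {
    ("horizontal", "vertical"): "F",
    ("horizontal", "right_falling"): "U",
    ("vertical", "horizontal"): "O",
    ("left_falling", "right_falling"): "X",
}


def stroke_class(stroke):
    """Reduce one of the eight basic strokes to one of the five classes."""
    if stroke in FOLD_VARIANTS:
        return "fold"
    if stroke in BASIC_CLASSES:
        return stroke
    raise ValueError(f"unknown stroke: {stroke}")


def feature_pair(first_stroke, last_stroke):
    """Return the (first, last) class pair that selects a feature code."""
    return stroke_class(first_stroke), stroke_class(last_stroke)


def feature_code(first_stroke, last_stroke):
    """Return the letter if the text states it, otherwise None."""
    return STATED_CODES.get(feature_pair(first_stroke, last_stroke))


ALL_PAIRS = list(product(BASIC_CLASSES, repeat=2))
print(len(ALL_PAIRS))
print(feature_code("left_falling", "right_falling"))
```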
Some radicals are "large" dictionary-header radicals, that is, a large number of characters have one of them as their first part, for example mouth, Rui, Lv, Rolling, Jin and others. If the feature code for such characters were formed from the first stroke of the header radical and the last stroke of the last part, only five combinations would be possible, which increases the likelihood of duplicate codes. The invention therefore defines 26 key-name radicals. When the feature code is used for a character whose first part is a key-name radical, the feature code is formed from the first stroke of the second part and the last stroke of the last part, and the key-name radical is disregarded. The number of feature-code combinations for characters whose first part is a key-name radical thus rises from five to 25. This also reduces the chance of a "collision" between radicals that share a code and a first stroke. In the sound code scheme the first code is the initial of the character, so the feature code need not be considered.
To distinguish two-part characters effectively, and to make basic Chinese characters easy to decompose, the invention introduces the concept of a feature radical. A feature radical is a radical found, in stroke order, inside another radical. How many leading strokes should be taken? Based on how two-part characters are formed, the invention takes at most four, and so proposes the principle "feature radical: at most four strokes; if that fails, decrement". Specifically, at most the first four strokes of a radical are taken in stroke order. If these four strokes form a radical of Fig. 2, the code of that radical is the feature radical code. If four strokes do not form a radical, the first three and then the first two strokes are tried. If neither three nor two strokes form a radical, the code of the first basic stroke is taken as the feature radical code. "Decrement" has a second meaning: if the radical itself has fewer than four strokes, the limit of four is correspondingly reduced to three, two or one. The principle applies equally to basic Chinese characters: the feature radical code of the character "first" is "R", that of "hanging down" is "Q", that of "ending" is "B", and that of "weight" is "P". The invention further specifies that the feature radical code of the four radicals "day", "say", "field" and "eye" is "O".
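The "at most four, otherwise decrement" lookup can be sketched as below. The stroke letters are the basic stroke codes given in the claims (Fig. 1); the radical table passed in is a hypothetical toy stand-in for Fig. 2, and the function name is an assumption. The sketch does not settle whether a prefix equal to the whole radical counts, which the text leaves open.

```python
# Basic stroke codes as listed in the claims (Fig. 1).
STROKE_CODES = {
    "horizontal": "H", "left_falling": "P", "right_falling": "N", "bend": "W",
    "vertical": "I", "horizontal_fold": "Z", "vertical_fold": "L", "vertical_hook": "J",
}


def feature_radical_code(strokes, radical_table):
    """Return the feature radical code for a radical given in stroke order.

    strokes:       list of stroke names in writing order.
    radical_table: maps a tuple of strokes to a radical code (stand-in for Fig. 2).
    """
    if not strokes:
        raise ValueError("a radical needs at least one stroke")
    limit = min(4, len(strokes))          # "at most four", fewer for short radicals
    for length in range(limit, 1, -1):    # try 4, then 3, then 2 strokes
        code = radical_table.get(tuple(strokes[:length]))
        if code is not None:
            return code
    return STROKE_CODES[strokes[0]]       # fall back to the first stroke's code


# Hypothetical table entry, for illustration only.
toy_table = {("horizontal", "horizontal"): "E"}
print(feature_radical_code(["horizontal", "horizontal", "vertical", "horizontal"], toy_table))
print(feature_radical_code(["left_falling", "right_falling"], toy_table))
```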
Based on the radical codes, first-and-last-stroke feature codes and feature radical codes described above, the code-taking rules for the sound code and the font code of the invention are as follows:
(1) Sound code rules:
1. For special characters (the radicals in the radical table of Fig. 2 and basic Chinese characters not listed in Fig. 2), the rule is:
Initial of the character + feature radical code + code of the basic stroke immediately following the feature radical + last stroke code
If only one basic stroke remains after the feature radical has been taken, the fourth code is the feature radical code of that feature radical. If there are not enough basic strokes, the code is padded with the letter "O". For example:
one YHOO, two EHHO, three SEHH, mortar JPIH, grasp BPHN
rich FSIE, inferior BAHI, leather GNII, foot ZKIN, vow VGPN
2. For two-part characters, the rule is:
Initial of the character + first part code + last part code + feature radical code of the last part
For example:
class KYGR, pretend YDYC, such as RNKT, old JIRO, servant PDBI
If the last part is "Chuo", "Yin" or a basic stroke, the fourth code is the feature radical code of the first part.
For example:
this IWZW, disobey WWZE, build JYJE, court TRJQ, dawn DRHO
3. For characters of three or more parts, the rule is:
Initial of the character + first part code + second part code + last part code
For example:
compile BLHC, slow MXRY, win YWKF, fearful JVKE, earn IBCZ
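The sound code rules for two-part characters and for characters of three or more parts can be sketched as plain string assembly. The function and parameter names are assumptions, part codes are taken to be single letters, and the inputs in the usage lines are simply read off example codes listed above (RNKT, IWZW, BLHC); the decomposition itself depends on Fig. 2 and is not modelled.

```python
def sound_code_two_part(initial, first, last, last_feature="",
                        first_feature="", last_is_exception=False):
    """Initial + first part code + last part code + a feature radical code.

    last_is_exception: True when the last part is "Chuo", "Yin" or a basic
    stroke, in which case the first part's feature radical code is used.
    """
    fourth = first_feature if last_is_exception else last_feature
    if not fourth:
        raise ValueError("the feature radical code for the fourth position is missing")
    return initial + first + last + fourth


def sound_code_multi_part(initial, part_codes):
    """Initial + first + second + last part code, for three or more parts."""
    if len(part_codes) < 3:
        raise ValueError("this rule applies to three or more parts")
    return initial + part_codes[0] + part_codes[1] + part_codes[-1]


print(sound_code_two_part("R", "N", "K", last_feature="T"))
print(sound_code_two_part("I", "W", "Z", first_feature="W", last_is_exception=True))
print(sound_code_multi_part("B", ["L", "H", "C"]))
```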
Many Chinese characters have more than one reading. Although the invention disregards the finals of a character, some characters still have more than one possible initial. For such characters the invention specifies that the initial of the common reading is used; if both initials belong to common readings, the letter that comes earlier in the English alphabet is used as the code.
For example:
Bbxw wards off bfbp and takes off bgsl and dig bsba pool ccgu and hide cdzd and watch cfry and scoop up cfxb and collected together czro once
Dgbd bullet dvro pile dytk transfers eyxp to dislike the frfw fofh dried meat gmrs Chinese juniper hdlz that walks back and forth and rams the hres meeting
The capable huhk clam of hrue ijll building ikmh caye ivry falls jjdn towards jefn and separates the self-important kkhw of jmjg and cough
The stubborn plbz rake of the secret mzlo narrow eyes into a slit of the ktpj shell kvbi happy mhbx of card lpln nfyl ptfh Pu pvbk screen
The strange qmdk of qclk eggplant qdkd rides the qmxh qojs circle tfrp of dwelling and carries udie and pass ufsp and mix the urpb spoon
The uwyt poultry vuml vumt that stops contains vwyv and leads wkvt xkxh to frighten xmwu school xpln be that yodd salts down
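The rule for characters with more than one initial can be sketched as follows. The key letters for Zh, Ch and Sh are those given in the claims; comparing the key letters (rather than the pinyin spellings) and falling back to all readings when none is marked common are assumptions of this sketch, as are the function names.

```python
# Key letters for the two-letter initials, as given in the claims.
DIGRAPH_KEYS = {"zh": "I", "ch": "U", "sh": "V"}


def initial_key(pinyin_initial):
    """Map a pinyin initial to its key letter."""
    return DIGRAPH_KEYS.get(pinyin_initial.lower(), pinyin_initial.upper())


def choose_initial(readings):
    """Pick the first code of a character with several readings.

    readings: list of (pinyin_initial, is_common) pairs.
    The common reading wins; among several common readings the key letter
    that comes earlier in the English alphabet is taken.
    """
    common = [initial for initial, is_common in readings if is_common]
    pool = common or [initial for initial, _ in readings]
    return min(initial_key(initial) for initial in pool)


print(choose_initial([("b", True), ("p", True)]))
print(choose_initial([("z", True), ("c", False)]))
```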
(2) Font code rules:
1. For special characters, the rule is the same as for the sound code.
For example:
soil TVHH, scholar VVHH, sky TEPN, die young YPHN, day ROHT
say YOHT, month YTHH, eye MOHH, rice MCIN, tenth Earthly Branch YHIH
2. For two-part characters, the rule is:
First part code + last part code + feature radical code of the last part + first-and-last-stroke feature code
If the last part is "Fu", "Chuo", "heart" or a basic stroke, the third code is the feature radical code of the first part. If there are not enough strokes, the code is padded with the letter "O".
For example:
class YGRD, play YGGS, internal conflict YGHW, expose past misdeeds YGEI, tricky ZHOC
this WZWD, Handan GECF, why IXGX, gadolinium JLGG, dawn RHOO
3. For three-part characters, the rule is:
First part code + second part code + third part code + feature code
For example:
tree MYCU, distinguish XHXI, know YKBD, compile LHCJ
4. For characters of four or more parts, the rule is:
First part code + second part code + second-to-last part code + last part code
For example:
green pepper MVXY, win WKBF, lean BDDY, seat GRRT
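The basic font code rules for two-part, three-part and four-or-more-part characters can be sketched in the same way. Names and the single-letter part codes are assumptions, the additional principles that follow in the text are not modelled, and the sample inputs are read off example codes listed above (YGRD, MYCU, WKBF).

```python
def pad(code, length=4, filler="O"):
    """Pad a short code with "O" up to four letters."""
    return (code + filler * length)[:length]


def font_code(part_codes, feature_code="", last_feature="",
              first_feature="", last_is_exception=False):
    """Assemble a font code from part codes.

    last_is_exception: True when the last part of a two-part character is
    "Fu", "Chuo", "heart" or a basic stroke; the third code is then the
    first part's feature radical code.
    """
    count = len(part_codes)
    if count == 2:
        third = first_feature if last_is_exception else last_feature
        return pad(part_codes[0] + part_codes[1] + third + feature_code)
    if count == 3:
        return pad("".join(part_codes) + feature_code)
    if count >= 4:
        return part_codes[0] + part_codes[1] + part_codes[-2] + part_codes[-1]
    raise ValueError("special characters follow the sound code rule instead")


print(font_code(["Y", "G"], feature_code="D", last_feature="R"))
print(font_code(["M", "Y", "C"], feature_code="U"))
print(font_code(["W", "K", "B", "F"]))
```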
To reduce the duplicate-code rate, the invention adds some supplementary principles to the font code scheme:
(A) A basic stroke that falls in the third code position of a character of four or more parts is disregarded.
For example:
cover CPVX, moral RVXX, like DLRS, Soviet Union CLNS
(B) When the parts of a character include "eight", "Shi" or " " and the number of parts is four or more, and one of these three parts can form a Chinese character together with an adjacent part, the two are combined into one character for code taking, following the principle of taking the larger unit.
For example:
like CPYX, anger XHBU, towards VZYO, suburb LUEI
The characters marked with "#" in Fig. 2 are of this kind.
(C) For a character whose first part is a key-name radical, the feature code is formed from the first stroke of the second part and the last stroke of the last part; for a two-part character, this is the first-and-last-stroke feature code of the second part. For a character whose first part is not a key-name radical, the feature code is formed from the first stroke of the first part and the last stroke of the last part.
When the first radical is "day" and the character has a top-bottom structure, "day" is treated as a key-name radical. For example:
scene RJXD, drought RGEF
"Chuo" is never considered in the feature code.
For example:
this WZWD, encounter JZZE, disobey WZEF, up to RZZZ
(D) When the first radical is "mouth" and the character has a top-bottom structure, the rule is:
First part code + last part code + feature radical code of the first part + feature radical code of the last part
For example:
member KBTT, hang KJTT, slow-witted KMTC
In addition, for a dictionary-header radical in Fig. 2 that is not itself a Chinese character, the code for the character's initial is the letter "O", followed by the initial codes of the three characters of the radical's customary name. For independent stroke groups that are not Chinese characters, namely those marked "*" among the radicals of Fig. 2 and those not listed in Fig. 2, the code for the character's initial is the letter "A".
For example:
Epileptic OBZT Jin OJZP
Figure 931048222_IMG21
OJZX Yi OYAP
Figure 931048222_IMG22
AEII
Figure 931048222_IMG23
APZZ ALHH
Figure 931048222_IMG25
AEII
To suit some users' requirements for single-character touch typing, the invention also provides a variable-length code scheme. A program can convert the four-code (fixed-length) code tables into variable-length code tables. For the few dozen pairs of characters that still share a code in a variable-length table, the fourth code of the less common character is replaced by a substitute code taken in the order "A", "O", "E", which brings the static duplicate-code rate for single characters to zero. For example, in the simplified font code the characters "Asia" and "tenth Earthly Branch" are both coded "yhih". In the fixed-length scheme both remain "yhih"; in the variable-length scheme the fourth code "h" of "tenth Earthly Branch" is replaced by the substitute code "a", so that this character is coded "yhia".
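The substitute-code step can be sketched as follows. This is only an illustration of the replacement order "A", "O", "E"; the function name, the input layout (characters listed most common first) and the check that a substitute code is not already in use are assumptions, since the text does not describe the conversion program.

```python
SUBSTITUTES = "aoe"  # substitute codes, in the order given in the text


def resolve_duplicates(code_table):
    """Give every character a unique code.

    code_table: maps a four-letter code to the characters sharing it,
                most common character first.
    Returns a dict mapping each character to its final code.
    """
    used = set(code_table)
    result = {}
    for code, characters in code_table.items():
        result[characters[0]] = code          # the common character keeps its code
        spare = iter(SUBSTITUTES)
        for character in characters[1:]:
            for letter in spare:
                candidate = code[:3] + letter  # replace only the fourth code
                if candidate not in used:
                    used.add(candidate)
                    result[character] = candidate
                    break
            else:
                raise ValueError(f"no free substitute code for {character!r}")
    return result


print(resolve_duplicates({"yhih": ["Asia", "tenth Earthly Branch"]}))
```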
To suit the Chinese character input method described above, the keyboard should include 23 consonant keys that serve as Chinese Pinyin initials and the three final keys "A", "E" and "O". It should also have a "fuzzy key", an "end key", the number keys 0 to 9, a command key for entering the key mode of the input method of the invention, and the function keys needed for other specific functions such as phrase creation, combined character and phrase input, and separate character or phrase input. Each key may be labelled with an easily recognized letter or symbol. Other character keys and function keys can be added as required. The arrangement of these keys could be specially designed for convenience of operation. However, a considerable number of people are already familiar with the general QWERTY keyboard, and many users' computers are equipped with it. Making the keyboard layout of the invention compatible with the current international keyboard is therefore the best solution. Statistics also show that the keystroke frequencies of the invention fit the QWERTY layout fairly well. In the layout actually recommended by the invention, every key that carries the same label as a key of the existing standard keyboard is placed in the same position as on that keyboard, and keys whose labels differ from those of the standard keyboard are placed in other positions that are easy to operate. Keycap overlays can be fitted to the standard keyboard to help beginners become familiar with it. Fig. 4 is a schematic diagram of the main part of the keyboard layout suggested by the invention.
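The split of the 26 letter keys into 23 consonant keys and three final keys can be checked with a short sketch. The assignments of "I", "U" and "V" to Zh, Ch and Sh are those stated in the claims; the dictionary layout and names are assumptions made for illustration.

```python
import string

FINAL_KEYS = {"A", "E", "O"}                      # keys for the finals A, E, O
DIGRAPH_KEYS = {"I": "Zh", "U": "Ch", "V": "Sh"}  # letters reused for two-letter initials

# Every remaining letter stands for the pinyin initial of the same name.
CONSONANT_KEYS = {
    letter: DIGRAPH_KEYS.get(letter, letter)
    for letter in string.ascii_uppercase
    if letter not in FINAL_KEYS
}

print(len(CONSONANT_KEYS), len(FINAL_KEYS))
print(CONSONANT_KEYS["V"])
```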
Compared with existing Chinese character input methods, the invention has the following notable features:
1. Wide applicability. Eight code tables can be formed from the invention: simplified sound code, simplified font code, traditional sound code and traditional font code, each as a fixed-length (four-code) table and as a variable-length table. The same principles serve for the input of simplified and of traditional characters. The sound code scheme suits general users: they mostly type their own documents, in which there are basically no characters they cannot read, and taking the initial of the character as the first code matches the way people think. The font code scheme suits professional typists: a professional typist cannot necessarily read every character in the document being typed, and the font code scheme bypasses pronunciation. Moreover, given the nature of a typist's work, constantly watching the screen is very tiring. Sometimes the phrase input function has to be switched off and only single-character touch typing is used; the lower the static duplicate-code rate for single characters, the better, since there are then fewer warning beeps and fewer selections to make. In the fixed-length scheme the static duplicate-code rate is about 2.4% for the simplified font code and about 3.8% for the simplified sound code, and about 2.8% for the traditional font code and about 3.8% for the traditional sound code. If the brevity codes of all levels are taken into account, the static duplicate-code rate for single characters is lower still. In the variable-length scheme the static duplicate-code rate for single characters is below 0.6%, and with the substitute codes it is zero. These overall figures are lower than, or close to, those of the Chinese character input schemes currently in common use.
2. The radical names are easy to memorize and master. The radicals selected by the invention are mostly basic Chinese characters and common dictionary radicals. Using as the radical code the initial of the character itself, or the initial of one character in the radical's usual name, is clearly the easiest to remember. Some stroke groups that have no name are given names such as "side of a certain character", "top of a certain character" or "bottom of a certain character", which ties them to a common Chinese character and fits the way people remember. The radicals given in the sub-root column of Fig. 2 resemble the basic radicals in shape, so they are not difficult to memorize. Because the radicals provided cover essentially all character components, decomposition is convenient in practice.
3. The first-and-last-stroke feature code, introduced to avoid any dependence on pronunciation, is tied to the two-stroke radicals of Fig. 2, which removes the need to memorize 25 feature codes by rote and allows general users to master the font code input method easily as well. The concept of the key-name radical, introduced to avoid the problems caused by "large" dictionary-header radicals and by strokes that share a code, further reduces the duplicate-code rate.
4. The introduction of the feature radical allows two-part characters, which make up a considerable share of all Chinese characters, to be distinguished effectively, which reduces both the number of candidate characters displayed and the duplicate-code rate. Statistics for the 6763 Chinese characters of GB2312-80 show that, in the fixed-length scheme, the numbers of candidate characters displayed after three codes have been typed are as follows (the column headings give the number of characters displayed; the entries give the number of three-code groups that display that many characters):
Characters displayed           1     2    3    4   5   6   7  8  9  10  11
Simplified font code        3553  1012  258   66  19   5   1  2  0   0   0
Simplified sound code       3194   865  271  110  38  23  15  8  3   4   2
Traditional font code       3422   961  289   89  22   2   4  2  1   1   1
Traditional sound code      3131   905  268  113  40  26   7  9  4   2   3
As can be seen, after three codes have been typed, about 93% of the characters in the font code scheme appear among no more than three candidates, and about 85% in the sound code scheme. Every character can be selected with the Arabic numerals on the screen without paging. Because the computer can order the candidates by frequency, and because one-, two- and three-code brevity codes are provided, typing three codes followed by the "end key" is generally enough to put the desired character on the screen. Even when a selection on screen is needed, few characters are displayed, so the operator's field of view is more concentrated, selection is faster, and the eyes move less.
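The percentages quoted above can be checked against the statistics table. The sketch below reads each row of the table as eleven counts (an assumption about how the run-together figures split, which is consistent with every row summing to the 6763 characters of GB2312-80); the names are illustrative.

```python
# Number of three-code groups showing 1, 2, ..., 11 candidate characters.
GROUP_COUNTS = {
    "simplified font code":   [3553, 1012, 258, 66, 19, 5, 1, 2, 0, 0, 0],
    "simplified sound code":  [3194, 865, 271, 110, 38, 23, 15, 8, 3, 4, 2],
    "traditional font code":  [3422, 961, 289, 89, 22, 2, 4, 2, 1, 1, 1],
    "traditional sound code": [3131, 905, 268, 113, 40, 26, 7, 9, 4, 2, 3],
}


def total_characters(groups):
    """Characters covered: each group of size k contributes k characters."""
    return sum(size * count for size, count in enumerate(groups, start=1))


def share_within(groups, max_candidates):
    """Fraction of characters shown among at most max_candidates candidates."""
    return total_characters(groups[:max_candidates]) / total_characters(groups)


for name, groups in GROUP_COUNTS.items():
    print(name, total_characters(groups), round(100 * share_within(groups, 3), 1))
```

The simplified rows give roughly 93.9% and 84.8%, in line with the "about 93%" and "about 85%" stated in the text.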
5. Several concepts and principles proposed by the invention, such as the concepts of the basic Chinese character, the Chinese character root and the dictionary-header radical, the principle of splitting parts that are separate or merely connected but not parts whose strokes cross, and the principle of radical priority, make the division of a Chinese character clear and definite, so that the familiar situation of understanding everything while learning and yet running into trouble in use does not arise.
Because the codes of the word-building parts adopted by the invention are mostly the initials of Chinese characters or radicals that people already know, because the first-and-last-stroke feature codes are likewise represented by corresponding radicals, and because the keyboard is compatible with the general international standard keyboard, the method is easy to learn, remember and master. Professional typists using this method can fully reach the input speed of the Chinese character input methods now in wide use. Ordinary users need only understand the content of this specification and of the accompanying figures to input Chinese characters in simplified or traditional form, by sound code or font code, without further practice. Once their keyboard fingering is fluent, they may well reach the input speed of professional typists.
The content of the figures provided by the invention, and some of its principles, can also serve as part of the word-building parts of other Chinese character input methods. For example, after an initial-and-final double-spelling code, two radical codes, or one radical code plus one feature code, can be appended. Either way the duplicate-code rate is effectively reduced.
The invention can be used in Chinese character information processing equipment such as computers of various kinds, electronic Chinese-English typewriters, telex machines and Chinese terminals.
In summary, the invention has good application prospects.

Claims (10)

1. A general Chinese character coding method and keyboard therefor, which combines the pronunciation, the shape and the stroke features of Chinese characters to realize the information transmission of the Chinese characters (simplified and traditional) of China, Japan and Korea, characterized in that basic strokes, Chinese characters, common radicals and some unnamed stroke groups are taken as word-building parts; Chinese characters are divided into four types, namely special characters, two-part characters, three-part characters and characters of four or more parts; code-taking rules are given for the sound code and the font code respectively; a Chinese character is represented by word-building part codes, a feature radical code and a first-and-last-stroke feature code; and the keyboard layout is compatible with the existing general international standard QWERTY keyboard layout, additional keycaps being fitted on the keyboard for ease of use.
2, the method for Chinese character coding as claimed in claim 1 is characterized in that word-building part consists of the following components:
(1) basic stroke: by horizontal stroke, perpendicular, cast aside, press down, cross break, perpendicular folding, lifting-hook, bending eight kinds of strokes constitutes, be included into horizontal stroke by the lower-left to upper right starting writing, cast aside pen by upper right point to left down and be included into the left-falling stroke stroke, press down pen by upper left point and be included into the right-falling stroke stroke to the bottom right, all strokes of rolling over (comprising " second " form of a stroke or a combination of strokes) from left to right then are included into cross break, the stroke of rolling over to right folding one then is included into perpendicular folding from top to bottom, the stroke of rolling over to left folding one then is included into lifting-hook from top to bottom, except " second " form of a stroke or a combination of strokes, the stroke of all folding eighty percent discounts is included into bending, the code of these eight kinds of basic strokes is with the Chinese Pin Yin pseudonym of its pronunciation or use with the alike English alphabet of this stroke shapes and represent, the code that is horizontal stroke is " H ", the code of casting aside stroke is " P ", the code of pressing down stroke is " N ", the code of bending is " W ", get sound for these four, the code of perpendicular stroke is " I ", the code of cross break is " Z ", the code of perpendicular folding is " L ", the code of lifting-hook is " J ", get shape for these four, form basic stroke code table (Fig. 1) with this.
(2) radical: by a collection of Chinese character, radical and unknown stroke group have constituted an etymon list (Fig. 2), row are radical code hurdle to first hurdle in the etymon list, second hurdle is female root hurdle, third column is radical title and mnemonic(al) word hurdle, the 4th hurdle is sub-root hurdle, the 5th hurdle is the complex form of Chinese characters, Japan and Korea character hurdle, the etymon list row is to pressing 26 English alphabet series arrangement, English alphabet " A ", " E ", " O " is the Chinese phonetic alphabet " A ", " E ", the code of " O " three simple or compound vowel of a Chinese syllable, English alphabet " B; C; D; F; G; H, J, K, L, M, N, P, Q, R, S, T, W, X, Y; Z " code for corresponding Chinese Pin Yin pseudonym, English alphabet " I " is the code of Chinese Pin Yin pseudonym " Zh ", English alphabet " U " is the code of Chinese Pin Yin pseudonym " Ch ", English alphabet " V " is the code of Chinese Pin Yin pseudonym " Sh ", it is code that Chinese character in the etymon list adopts the initial consonant of this character pronunciation of Chinese character substantially, the initial consonant that radical commonly used is got a key word in its custom appellation is a code, the initial consonant that certain Chinese characters in common use that comprise this stroke group or Chinese character got in unknown stroke group or some non-common Chinese character is a code and with certain prefix, by certain word, at the bottom of certain word, certain font, certain word frame name is so that memory, radical is divided into basic element of character and complex root, basic element of character is if tear the minimum radical that will split out the stroke group who does not have in basic stroke or the etymon list again open, the radical that complex root is made up of two above basic element of characters, basic element of character and complex root comprise Chinese character root and radicals by which characters are 
arranged in traditional Chinese dictionaries radical again, indicating arabic numeral " 1 " in the table is Chinese character root, the radical and the sub-root that indicate arabic numeral " 2 " are the radicals by which characters are arranged in traditional Chinese dictionaries radical, Chinese character " one " and " second " are treated as basic stroke rather than radical, first female root on corresponding each English alphabet hurdle is defined as the key name radical, to the non-Chinese character radicals radical in Fig. 2 etymon list, its this word pseudonym code is got letter " O ", to the non-Chinese character independence stroke group who does not list among the radical that indicates " * " in Fig. 2 etymon list and Fig. 2, its this word pseudonym code is got letter " A ", and its radical code is got the stroke code of its first stroke.
3. The Chinese character coding method as claimed in claim 1 or 2, characterized in that Chinese characters are divided, following the standard stroke order (horizontal before vertical, left-falling before right-falling, top to bottom, left to right, outside to inside, the inside before the closing stroke, the middle before the two sides; if under the standard stroke order part of one component is written, another component is then written, and the writer then returns to finish the earlier component, the component begun first still counts as the earlier part), into four classes, namely special characters, two-part characters, three-part characters, and characters of four or more parts, the division being made as follows:
(1) the radicals in the radical table, together with some minimal Chinese characters that are not radicals but cannot be split further, such as "grasping" and "protruding", constitute the special characters of the invention;
(2) characters composed of two, three, or four or more basic strokes of Fig. 1 and/or radicals of Fig. 2 constitute the two-part characters, three-part characters, and characters of four or more parts of the invention.
4. The Chinese character coding method as claimed in claim 1, 2 or 3, characterized in that the following principles are applied when a Chinese character is divided into parts:
(1) the principle of splitting what is separate or connected but not what is crossed: when parts are separated from one another, each separate part is treated as one part; when parts are connected, they are split according to the radicals in the radical table; a basic Chinese character whose strokes cross one another is treated as one part;
(2) complex radicals take priority over basic radicals: for a character of two or more parts, if several decompositions are possible, the one with the largest complex radical and the smallest number of parts is used;
(3) Chinese character roots take priority over dictionary-header radicals, and when priorities are equal a stroke goes with the preceding radical rather than the following one: when a basic stroke can form a radical with the radical before it and can also form a radical with the radical after it, a split into two Chinese character roots is preferred to a split into one Chinese character root and one dictionary-header radical, and a split into one Chinese character root and one dictionary-header radical is preferred to a split into two dictionary-header radicals; if the priorities are equal, the front-first principle applies.
5. The Chinese character coding method as claimed in claim 1 or 2, characterized in that feature radicals are used to distinguish Chinese characters, the feature radical being determined by the principle of "at most four, decrementing when insufficient": at most the first four strokes of a radical are taken in writing order; if those four strokes form a radical in the etymon list, the code of that radical is the feature radical code; if the four strokes do not form a radical, the first three strokes and then the first two strokes are tested, in decreasing order, for whether they form a radical in the etymon list; if neither three nor two strokes form a radical, the code of the first basic stroke is taken as the feature radical code; if the radical itself has fewer than four strokes, the "at most four" limit is reduced accordingly to at most three, at most two, and finally a single stroke.
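The "at most four, decrementing when insufficient" procedure of claim 5 can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name, the representation of the etymon list as a dictionary keyed by stroke sequences, and the single-letter stroke names and codes are all hypothetical and are not taken from the patent's Fig. 1 or Fig. 2.

```python
def feature_radical_code(strokes, etymon_list, stroke_codes):
    """Return a feature radical code for a radical given as a stroke sequence.

    strokes      -- basic strokes of the radical, in writing order
    etymon_list  -- hypothetical dict: tuple of strokes -> radical code
    stroke_codes -- hypothetical dict: basic stroke -> stroke code
    """
    if not strokes:
        raise ValueError("a radical must have at least one stroke")
    # Start with four strokes, or fewer if the radical is shorter.
    limit = min(4, len(strokes))
    # Try the longest prefix first, then decrement down to two strokes.
    for n in range(limit, 1, -1):
        prefix = tuple(strokes[:n])
        if prefix in etymon_list:
            return etymon_list[prefix]
    # No multi-stroke prefix is a radical: fall back to the first stroke.
    return stroke_codes[strokes[0]]


# Toy data, invented for illustration only.
toy_etymon_list = {("h", "v", "h"): "T", ("h", "v"): "S"}
toy_stroke_codes = {"h": "1", "v": "2", "p": "3", "n": "4"}

code_a = feature_radical_code(["h", "v", "h", "v", "p"], toy_etymon_list, toy_stroke_codes)
code_b = feature_radical_code(["p", "n"], toy_etymon_list, toy_stroke_codes)
```

With the toy data, the four-stroke prefix of the first radical is not in the table, so the three-stroke prefix is used; the second radical has no matching prefix and falls back to the code of its first stroke.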
6. The Chinese character coding method as claimed in claim 1 or 2, characterized in that a first-and-last-stroke feature code is used to reflect the structural features of a Chinese character; the first-and-last-stroke feature code is based on five kinds of strokes, namely horizontal, vertical, left-falling, right-falling and turning; the horizontal-turn, vertical-turn, rising-hook and bend strokes of the basic stroke code table are all classed as turning strokes; the five kinds of strokes combine with one another to form 25 first-and-last-stroke feature codes (Fig. 3); the code of a radical in the etymon list that has the same first and last strokes is chosen as the feature code, wherein "F" is the code when the first and last strokes are horizontal and vertical, "U" (a reversed "factory") is the code when they are horizontal and right-falling, "O" is the code when they are vertical and horizontal, and "X" (resembling a left-falling stroke crossed with a right-falling stroke) is the code when they are left-falling and right-falling.
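The count of 25 in claim 6 follows from pairing each of the five stroke classes as a first stroke with each of the five as a last stroke. A minimal sketch is given below; the English stroke names and function names are assumptions made for illustration, and only the four letter assignments stated in the claim are included, since the remaining 21 are defined in Fig. 3, which is not reproduced here.

```python
from itertools import product

# The five stroke classes named in claim 6 (English labels are assumptions).
STROKES = ("horizontal", "vertical", "left-falling", "right-falling", "turning")

# Strokes that claim 6 folds into the "turning" class.
TURNING_VARIANTS = {"horizontal-turn", "vertical-turn", "rising-hook", "bend"}

# Only the four codes stated in the claim text; the rest are given in Fig. 3.
KNOWN_CODES = {
    ("horizontal", "vertical"): "F",
    ("horizontal", "right-falling"): "U",
    ("vertical", "horizontal"): "O",
    ("left-falling", "right-falling"): "X",
}


def normalize(stroke):
    """Map a turning-type stroke to the 'turning' class; leave others as-is."""
    return "turning" if stroke in TURNING_VARIANTS else stroke


def first_last_pairs():
    """All ordered (first stroke, last stroke) combinations: 5 x 5 = 25."""
    return list(product(STROKES, repeat=2))


def feature_code(first, last):
    """Return the stated letter code, or None if it is only given in Fig. 3."""
    key = (normalize(first), normalize(last))
    for stroke in key:
        if stroke not in STROKES:
            raise ValueError(f"unknown stroke class: {stroke}")
    return KNOWN_CODES.get(key)
```

The lookup returns `None` for any pair outside the four examples rather than guessing a letter.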
7. The Chinese character coding method as claimed in claim 1, 2, 3, 4, 5 or 6, characterized in that code-taking rules are provided in two modes, a sound-code mode and a shape-code mode, specifically:
(1) Sound-code mode:
<1> Code-taking rule for special characters:
Initial consonant of the character + feature radical code + code of the basic stroke immediately following the feature radical + code of the last stroke
If only basic strokes remain after the feature radical is taken, the fourth code is the feature radical code of the feature radical; if there are too few basic strokes, the code is padded with the letter "O",
<2> Code-taking rule for two-component characters:
Initial consonant of the character + first component code + last component code + feature radical code of the last component
If the last component is "Chuo", "Yin" or a basic stroke, the fourth code is the feature radical code of the first component
<3> Code-taking rule for characters of three or more components:
Initial consonant of the character + first component code + second component code + last component code
For a character with more than one initial consonant, the initial of its common reading is taken as the initial of the character; if both initials belong to common readings, the letter that comes first in the English alphabet is taken as the code,
(2) Shape-code mode:
<1> Code-taking rule for special characters:
Initial consonant of the character + feature radical code + code of the basic stroke immediately following the feature radical + code of the last stroke
If only basic strokes remain after the feature radical is taken, the fourth code is the feature radical code of the feature radical; if there are too few basic strokes, the code is padded with the letter "O",
<2> Code-taking rule for two-component characters:
First component code + last component code + feature radical code of the last component + first-and-last-stroke feature code
If the last component is "Fu", "Chuo", "heart" or a basic stroke, the third code is the feature radical code of the first component; if there are too few strokes, the code is padded with the letter "O",
<3> Code-taking rule for three-component characters:
First component code + second component code + third component code + first-and-last-stroke feature code
<4> Code-taking rule for characters of four or more components:
First component code + second component code + second-to-last component code + last component code
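The main code-taking rules of claim 7 for characters of two or more components can be sketched as simple concatenations. The sketch below is illustrative only: the `Component` type, the function names and the sample codes are hypothetical, the decomposition into components is assumed to have been done already, and the special-character rule, the exceptions for "Chuo", "Yin", "Fu" and "heart", and the "O" padding are deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    code: str          # code of the component (radical or basic stroke)
    feature_code: str  # feature radical code of the component


def sound_code(initial, parts):
    """Sound-code mode for characters of two or more components."""
    if len(parts) < 2:
        raise ValueError("special characters follow a separate rule")
    if len(parts) == 2:
        first, last = parts
        # initial + first + last + feature radical code of the last component
        return initial + first.code + last.code + last.feature_code
    # three or more components: initial + first + second + last
    return initial + parts[0].code + parts[1].code + parts[-1].code


def shape_code(parts, first_last_code):
    """Shape-code mode for characters of two or more components."""
    n = len(parts)
    if n < 2:
        raise ValueError("special characters follow a separate rule")
    if n == 2:
        first, last = parts
        return first.code + last.code + last.feature_code + first_last_code
    if n == 3:
        return "".join(p.code for p in parts) + first_last_code
    # four or more components: first + second + second-to-last + last
    return parts[0].code + parts[1].code + parts[-2].code + parts[-1].code


# Hypothetical components with made-up codes.
parts = [Component(c, c.lower()) for c in "ABCDE"]
two, three = parts[:2], parts[:3]
```

Every code produced this way has four positions, which matches the four-term rules listed in the claim.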
8. The Chinese character coding method as claimed in claim 1, 2, 3, 4, 5, 6 or 7, characterized in that the following principles are applied in the shape-code mode to reduce the rate of duplicate codes among single characters:
(1) a basic stroke is disregarded when it falls at the third code position of a character of four or more components
(2) when a character contains the component "eight", "Shi" or [...] and has four or more components, and one of these three components can form a Chinese character with an adjacent component, it is combined with the adjacent component into one character for code-taking, preferring the larger combination and the preceding component
(3) for a character whose first component is a key-name radical, the feature code is formed from the first stroke of the second component and the last stroke of the last component; for a two-component character, the first-and-last-stroke feature code of the second component is taken; for a character whose first component is not a key-name radical, the feature code is formed from the first stroke of the first component and the last stroke of the last component
(4) when the radical of the first code is "day" and the character has a top-bottom structure, "day" is treated as a key-name radical
(5) "Chuo" is never considered in the feature code
(6) when the radical of the first code is "mouth" and the character has a top-bottom structure, the code-taking rule is:
First component code + last component code + feature radical code of the first component + feature radical code of the last component
9. The keyboard as claimed in claim 1, characterized in that it comprises at least 23 Hanyu Pinyin initial-consonant keys, the three final keys "A", "E" and "O", a "fuzzy input key", an "end key", numeric keys and other function keys; the keys are arranged compatibly with the common international standard QWERTY keyboard, and additional keycaps may be fitted for ease of use (Fig. 4).
10. The Chinese character coding method as claimed in claim 1, 2, 3, 4, 5, 6, 7 or 8, characterized in that its basic strokes, radicals, feature radical codes and first-and-last-stroke feature codes can be combined to form other Chinese character coding methods.
CN 93104822 1992-06-30 1993-05-04 Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof Pending CN1080748A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 93104822 CN1080748A (en) 1992-06-30 1993-05-04 Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN92104819.X 1992-06-30
CN92104819 1992-06-30
CN 93104822 CN1080748A (en) 1992-06-30 1993-05-04 Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof

Publications (1)

Publication Number Publication Date
CN1080748A true CN1080748A (en) 1994-01-12

Family

ID=25742758

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 93104822 Pending CN1080748A (en) 1992-06-30 1993-05-04 Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof

Country Status (1)

Country Link
CN (1) CN1080748A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1086480C (en) * 1995-10-14 2002-06-19 钟诚 Real code coding method for Chinese characters and using keyboard thereof
CN102968188A (en) * 2012-07-04 2013-03-13 张勤发 Bi Sheng code Chinese character input method
CN103440047A (en) * 2013-09-11 2013-12-11 任振敏 Universal code-fetching Chinese character input method
CN103440047B (en) * 2013-09-11 2016-04-13 任振敏 General code fetch input method of Chinese character

Similar Documents

Publication Publication Date Title
CN1023916C (en) Chinese keyboard entry technique with both simplified and original complex form of Chinese character root and its keyboard
CN1043210A (en) Character Root Code Input Method And Apparatus
CN1607491A (en) System and method for Chinese input using a joystick
CN1047447C (en) Computer imput method of figure-sign coding
CN103257720B (en) A kind of input method of Chinese character
CN1080748A (en) Simplified and traditional body sound shape characteristic code input method for Chinese character and keyboard thereof
CN102707809A (en) Component code input method taking national standard component as component base
CN1387109A (en) Numeral (keypad) input method for braille
CN1081004A (en) Chinese-character digital encoding method based on structural strokes order
CN101021753A (en) Chinese character five-stroke fourteen-radicals inputting method on cellphone or computer
CN1103181A (en) Multi-key pressing high-speed Chinese character input method and keyboard
CN1052200A (en) Pronunciation-form-meaning words encode series with compatibility and keyboard
CN85100087A (en) " Chinese coded sound " scheme and its implementation
CN1015751B (en) Input method for computer spelling chinese character
CN1114146C (en) Chinese morpheme code and its computer keyboard input
CN1038888A (en) Pronunciation-form-meaning compatible and character/word combined Chinese coding system and keyboard
CN1694046A (en) Computer coding Chinese character keyboard input method and information code
CN1108776A (en) Qiankun sound track Chinese character input method (QKy)
CN1093182A (en) The sound pen is to code Chinese character input method and keyboard
CN1203388C (en) Double-stroke six-code Chinese character input method
CN1108551C (en) Optimized yinxing code Chinese character system
CN1056007C (en) Codes for inputting Chinese characters
CN103186242B (en) Chinese keyboard
CN1175722A (en) Universal Chinese character input method for computer
CN1008481B (en) Writing-mode input method of chinese-character

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C01 Deemed withdrawal of patent application (patent law 1993)
WD01 Invention patent application deemed withdrawn after publication