CN1173660A - Coordinate codes coding method for computer Chinese characters input - Google Patents

Coordinate codes coding method for computer Chinese characters input Download PDF

Info

Publication number
CN1173660A
CN1173660A CN 96119523 CN96119523A CN1173660A CN 1173660 A CN1173660 A CN 1173660A CN 96119523 CN96119523 CN 96119523 CN 96119523 A CN96119523 A CN 96119523A CN 1173660 A CN1173660 A CN 1173660A
Authority
CN
China
Prior art keywords
type
sign indicating
indicating number
unit
stroke
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 96119523
Other languages
Chinese (zh)
Other versions
CN1054447C (en
Inventor
叶平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN96119523A priority Critical patent/CN1054447C/en
Publication of CN1173660A publication Critical patent/CN1173660A/en
Application granted granted Critical
Publication of CN1054447C publication Critical patent/CN1054447C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A coordinate encode method for typing Chinese characters to computer comprises splitting method with coordinate code without "root set" and encoding mechod with coordinate code. Said splitting method includes 5 non-splitting rules for coordinate code, and Chinese-character splitting basis, relative factors and rules. Said encode method includes encoding and entering information, the correspondent relation between coordinate code and keyboard and encode rules. Its advantages are simple encode method. fast typing, low duplicate rate, and easy mastering it.

Description

Coordinate codes coding method for computer Chinese characters input
The present invention relates to a kind of coding method for Chinese characters input on computer.
Font code is a class computer Chinese input method of present widespread use, and its typical case representative is " the Five-stroke Method ", and " the Five-stroke Method " starts with from font with " spell shape " characteristic of Chinese character rationale as it, sees that shape is known sign indicating number, avoids the pronunciation of Chinese character fully.It has overcome " sound sign indicating number " and has not known the shortcoming that pronunciation or rhotacism just can't correctly be imported, and it is low to have the repetition rate of coding, imports fast advantage.
Font code is encoded by the shape characteristic information that extracts Chinese character, generally all splits Chinese character, and the design philosophy of font code can be summarized as: at first, determine split result---i.e. " the radical collection " of Chinese character based on " certain understanding "; Then, solve " fractionation of Chinese character " problem conversely according to the radical collection, and problems such as coding and input, " radical collection " is the core of font code, and different font codes is the difference of " radical collection " in essence, and the method for splitting of Chinese character is also therefore different.
Weak point is:
Memory is difficult, and its more than 100 radical contains a lot of noncharacter radicals, and do not have regularity, and the memory burden is heavy; Study is difficult, the structure law of Chinese character is an outwardness, the radical collection of the Five-stroke Method also is the major part that Chinese character constitutes undoubtedly, but, the radical collection is defined as unique Hanzi structure unit, remove mechanically rigid fractionation Chinese character with it, just the structure law that has departed from Chinese character, also away from people's writing habit, typical example is exactly " intersecting stroke is split; basic Chinese characters is opened ", so, the deep mother tongue character knowledge of people is not used, and writing habit is not all the year round admitted, not only cause scholastic difficulties, also caused psychological barrier film.
For solving the deficiency of above coding method, purpose of the present invention provides a kind of coordinate codes coding method for computer Chinese characters input, utilize the design feature of Chinese character, set up disassembly principle, do not use " radical collection ", can realize the input of computing machine fast coding, the repetition rate of coding is low, be convenient to memory, ease-to-learn purpose.
Coordinate codes coding method for computer Chinese characters input of the present invention, its content comprises:
1. the coordinate sign indicating number does not have the method for divining by means of characters of " radical collection ": five disassembly principles not; The foundation that Chinese character splits--type unit collection; The correlative factor that Chinese character splits; The fractionation rule of Chinese character.
2. the coding method of coordinate sign indicating number: be used to the information of encoding and importing; The corresponding relation of coordinate sign indicating number and keyboard; Coding rule.
Wherein the coordinate sign indicating number does not have the method for divining by means of characters of " radical collection " and the coding method of coordinate sign indicating number is respectively described below:
Why be the coordinate sign indicating number:
The dot matrix that is distributed in a certain space can be regarded as in Chinese character, for its essence of encode Chinese characters for computer is sought a coordinate exactly, this coordinate Chinese character (repetition rate of coding is low) that should disperse well, and make it to have uniqueness (one yard of a word), as long as can set up such coordinate, it is free establishing sweet thought.
In view of this understanding, this sign indicating number is named as " coordinate sign indicating number ".
One. the basic stroke of Chinese character has five kinds: horizontal stroke, perpendicular, cast aside, press down folding. wherein: " carry " horizontal comprising Perpendicular " the left vertical hook " 亅 that comprises; Right-falling stroke comprise " point ".Array mode between the stroke has three kinds: intersect (1): hand over array mode more to claim to intersect between the stroke each other, as: ten, nine, rich; (2) discrete: array mode separated from one another claims to disperse between the stroke, as: Rui, San, youngster, river, Xiangxi; (3) adhesion: be connected with each other between the stroke but do not hand over array mode more to claim adhesion.
Adhesion divides three kinds again:
A. direct-connected: stroke horizontal, vertical and the folding horizontal, vertical section between the adhesion mode claim direct-connected.As: fourth, defend, the mountain;
B. tiltedly connect: a side of phase adhesion is that the disconnected adhesion mode of left-falling stroke right-falling stroke of casting aside, pressing down or roll over claims tiltedly to connect.As: the people, or not , Ren;
C. end connects: stroke is connected in each other that the adhesion mode of end points claims end to connect; As: protruding, Jiong, factory, several, recessed, mouthful.
Two. five disassembly principles not
Type: the coordinate sign indicating number is called " type " with the assembly of stroke.
Basic model: in the coordinate sign indicating number, the fundamental structural unit of Chinese character is called " basic model ".
The coordinate sign indicating number thinks and Chinese character is reduced into stroke and the method for code fetch is least desirable that it has lost the structural information of Chinese character to greatest extent.The coordinate sign indicating number sums up five not disassembly principles.
1. a stroke does not allow to split into two-section, breaks in two types.Reason: single stroke ought to be complete.
2. the stroke that intersects does not allow to split, as: rich, ten, again, reason: " intersection " is a kind of array mode closely.
3. end stroke does not even allow to split mutually, as bow, and factory, mouthful, protruding.Reason: " end " also is a kind of tight type array mode.
4. do not allow to split into stroke by two Chinese characters that constitute and non-word radical commonly used, as: people, youngster, fourth, Ren, Mi, Fu, Yan, Dao, Bing,  etc.Reason: get by the original function reasoning of stroke.
5. isolated fully by certain unicursal at least, be symmetrically distributed on the structure and contained, two singles are drawn not allow to split out and are formed type.As: flat, cannot split into " Gan Yu Ha "; Wood cannot split into " ten and eight ", reason: philology is pointed out " structure of Chinese character is a kind of modular construction "
These five not disassembly principle protected the structure of Chinese character to a certain extent, but this also is not enough to become a kind of method.
Three. type unit collection
1. the generation of type unit collection:
Philology is pointed out: " Chinese character is made up of combinde rqdical character and single character, and in the Chinese character in early days, combinde rqdical character is made of single character fully ".As seen the basic structural unit of early stage Chinese character is exactly a Chinese character, the single character that promptly can not be split, and the structure law of Chinese character presents completely " character property ".Chinese character develops into today, its structure law also changes again, " but character property " remains its important contents, and this content can be expressed as follows: " Hanzi structure is based on basic Chinese characters and non-word radical commonly used, is aided with the very low parts of numerous frequencies of utilization and constitutes ".
According to " character property " of Hanzi structure rule, in conjunction with the principle of design of " being easy to memory " and the design philosophy of " restriction splits ", the coordinate sign indicating number splits " Chinese character that structure can not split again and non-word radical commonly used " as Chinese character foundation.
Type unit: the coordinate sign indicating number claims " type unit " with Chinese character and the non-word radical commonly used that structure can not split again, and the summation of type unit claims type unit collection.Type unit collection is made up of three parts:
(1) meet five not the Chinese character of disassembly principle and non-word radical commonly used as: ten, nine, mouthful, factory, second, wood, etc.
(2) can not tearing the Chinese character and the non-word radical commonly used of (1) medium-sized unit open, also is type unit, as: non-, year, forever, hold etc.
(3) containing type unit, but do not allow the Chinese character that splits and use non-word radical always in the fractionation rule of coordinate sign indicating number, also is type unit.As: letter, become, fly, the king loses etc.
For GB GB2312 (80) character set, first 334 of total type, wherein Chinese character is 279,55 of radicals commonly used, for details see attached table.
2. the easy memory of type unit collection:
The quantity of the contained type of type unit collection unit is also many, but that memory is got up is very easy, and reason has two: the first, and it has character property, and 279 font units are the simplest Chinese characters of structure, and the overwhelming majority is Chinese characters in common use; Though 55 non-word radicals commonly used are not Chinese characters, because its property commonly used, they are actually a kind of " accurate literal ", and people are not less than Chinese characters in common use to their program of being familiar with, so type unit collection is easy to grasp.Second, it has regularity, type unit has the advantages that structure can not split again, and type unit collection is this type of Chinese character, and the set of non-word radical commonly used, so the discriminating of type unit is very easy, " character property " makes type unit collection be easy to grasp, " regularity " makes type unit collection be easy to difference, and the two makes type unit collection have " easily memory ".
Four. the relation factor that Chinese character splits:
1. the classification of type and character:
Basic model that the coordinate sign indicating number discovers, type is in Chinese character " stability "---both did " " size of ability, relevant with the stroke number that constitutes it, also with stroke between array mode relevant, the coordinate sign indicating number is classified as follows the type in the Chinese character according to stroke number and array mode:
(1) monotype: only have the type of a stroke to claim " monotype ", type unit collection has two monotype type units, and one and second.Character: the stability of monotype is the most weak, has only in particular cases and just can do basic model.
(2) even type: claim " even type " by two types that constitute.As even type unit, the people, youngster, eight, seven, Tou, etc.Character: the character of even type is very special, and Hanzi structure uncertain factor concentrated area is reflected on the body of even type." stability " of idol type occupy between monotype and the moulding, and even type could be subjected to all multifactor influences as basic model.
(3) moulding: the type that is made of three and three above strokes claims " moulding ", moulding is divided into three kinds again according to the array mode between the stroke: 1) positive closo: at least three faces are by horizontal stroke, and closo that the vertical line section constitutes and the stroke that intersects with it claim " positive closo ".As: mouthful, open, usefulness, in, field etc., 2) intersect type: the moulding that contains overlapping relation between stroke claims " intersecting type ".As: wood, very little, rich,, etc.3) accumulation type: only contain adhesion between stroke, the moulding of discrete relationship claims " accumulation type ".As: San, Chuan, Xiangxi, upright, fire etc.Character: moulding " stability " more intense, with moulding unit, they all are basic models generally speaking, have only in particular cases, moulding unit can not be a basic model also.
2. the position of type concerns:
Position relation be meant Chinese character medium-sized between each other position relation, the position relation of Hanzi structure has four kinds: single-relation, upper and lower relation, about relation, internal and external relation, (1) single-relation: promptly Gu Li relation is as the people, seven, ten, greatly.(2) upper and lower relation: between the type position relation that is arranged above and below, as: Lu, Gu, pole, anxious, etc.(3) relation about: the position relation of arranging about being both of type, as: two, woods, leaf, thorough, etc.(4) internal and external relation: be the inside and outside position relation that distributes between the type, as: state, with, the right side, act of violence etc.
About the coordinate sign indicating number is thought, the type in the relation of position, the left and right sides, relatively independent each other, be a kind of coordination, the suitable fractionation; And there is a kind of contact each other in the type in the internal and external relation, and independence is relatively poor comparatively speaking, and fractionation is had certain constraint.
Type with " annexation "
" annexation " is meant the way of contact between the amphitypy, i.e. connected mode between the stroke.Annexation between the type is divided into two classes " disperse " with " adhesion ".Discrete, obviously be the condition that helps splitting.Adhesion according to circumstances can be divided into three kinds of concrete conditions again:
(1) positive closed, two types are if form positive closo, and then the adhesion mode between two types claims " positive closed ".As " field ", " mouth " with " ten " positive closures."
Figure A9611952300101
", "
Figure A9611952300102
" with " mouth " positive closure.
(2) direct-connected: the relation between the amphitypy between the phase adhesion stroke when being direct-connected the relation, is " direct-connected " relation between the amphitypy.As " accounting for ", " " and " mouth ",
(3) tiltedly connect, the relation between the amphitypy between the phase adhesion stroke exists tiltedly to connect when concerning, is exactly " tiltedly connecting " relation between the amphitypy.As " inferior ", " Shen " and " ten " have one direct-connectedly to have one tiltedly to connect, so be tiltedly to connect.
The coordinate sign indicating number thinks that positive closure is an adhesion mode closely between the type; Direct-connected is connected mode more closely, and tiltedly connecting is the most weak adhesion mode.
Five. the rule of divining by means of characters
Based on above-mentioned some understanding to Hanzi structure; just produced the Chinese character fractionation rule that the coordinate sign indicating number shows unique characteristics by analysis; this rule with five not disassembly principle be basic point; with type unit collection is core; and taken into account the classification of type; factors such as position between type relation and annexation, thereby the method for divining by means of characters of the coordinate sign indicating number Chinese character that promptly can disperse fully can be protected the integrality of Hanzi structure again.
The type layer: in the Chinese character, two or more basic models of being made up of same position relation and same connected mode claim the type layer.
Chinese character and type layer that rule 1 is made up of type unit fully, type unit all is basic model, and is removable.
As: political affairs: just, Fan; The mansion: wide, Ren, very little; High: Tou, mouthful, Jiong, mouthful
Rule 2 is torn the not Chinese character and the non-word radical commonly used of removing from mould unit open, and when promptly not having type unit as judgment basis, itself also is type unit, and is non-disconnectable.As non-, million, forever, hold, insect without feet or legs, year etc.
Rule 3: when the first and non-type of type unit type was combined, type unit is basic model not necessarily, and split result concerns that with classification (2) position of (1) type (3) annexation three is relevant.
1. four kinds of connected modes are arranged, between type and the type for positive closure " coordinate sign indicating number regulation:
Rule 3-1: when type and type were positive occluding relation, only to be type removable when first as both sides, and the both is a basic model, otherwise non-disconnectable.As the field, removable is mouth, ten; With: then cannot split.
2. " disperse " and " direct-connected " " tiltedly connects between the type " three kinds of connected modes, the coordinate sign indicating number has following rule:
Rule 3-2: when type unit was moulding unit, as long as the other side is not monotype, promptly detachable, both sides were basic model ".As worm: in, Ye , Myeon, shellfish; Cao: , day.
As seen, moulding unit judges that whether non-type unit type is that the ability of basic model is very strong.
Rule 3-3: when type unit is even type unit, split result will depend on " classification of type, position relation and annexation " three factors ".
(1) when the other side is even type and accumulation type:
A. be adhesion neither internal and external relation the time, both sides all are basic model, and are removable.As: change: Ren, With:
Figure A9611952300111
, the people.
When B. having adhesion situation (direct-connected or tiltedly connect) or internal and external relation, even type unit is the part of basic model, and is non-disconnectable.As: letter (), occasion (factory), the last of the twelve Earthly Branches (Tou), noon (ten), shellfish (people), modern (people), tight (factory) etc. in brief, has a kind of external restraint (or internal and external relation in such cases, or adhesion relation), even type unit just can not self-insurance, becomes the part of basic model
(2) when the other side (positive closo is seen pseudotype described later unit) when intersecting type:
A. tiltedly connect or discrete case under, no matter position relation is how, both sides all are basic model, and are removable.
As: send out, , again; Hurriedly: Bao,
B. under direct-connected situation, up and down, position, left and right sides relation is removable, and both sides are basic model; Inside and outside position relation is non-disconnectable, and type unit is the part of basic model.As: hit, , ; The old man: , again; Skin: non-disconnectable, " again " is the part of font unit.
In brief, the other side is when intersecting type, and a kind of constraint of the external world does not fetter even type unit, and the constraint in two kinds of external worlds just can make both sides become a basic model.
The rule 3-4: " when type unit be single type unit ' one ' and ' second ' time, only when (1) the other side be moulding; (2) discrete each other; (3) be not internal and external relation.When three conditions all satisfied, removable, both sides were basic model.Otherwise it is non-disconnectable." as beautiful: one, Silk:
Figure A9611952300117
, one; Beg, non-disconnectable, is even type; The assistant officer, non-disconnectable, both sides' adhesion.
Six. the expansion of the rule of divining by means of characters
Rule 1~3rd, the rule of divining by means of characters substantially of coordinate sign indicating number can also obtain some other conclusion based on this.
1. single is drawn the condition of doing basic model:
In the coordinate sign indicating number, single is drawn can do basic model, but condition is very strict.
Rule 4:
Single is drawn and is cast aside, right-falling stroke does not allow to do separately basic model; The single picture is horizontal, vertical, folding can be done basic model separately, but will satisfy following three conditions simultaneously:
The one, and adjacent type are not internal and external relation; The 2nd, and adjacent type adhesion; The 3rd. adjacent type is a type unit.
As private, standing grain,
Figure A9611952300118
Hole: , Yin; Day, Shu, day: speech, Tou, one, flatly; Dawn: day, one.
2. about the rule of " "
" one " is very special, promptly is the stroke horizontal stroke, is again type unit one ".There are many words to form with " one " in the Chinese character by certain word, as: the king: one, soil; My god: one, big; : thousand, one; Again: one, slowly; Give birth to: ox, one; Just: one, end; The third: one, in; Inferior: one, already; Go out: one, fire; Soil: ten, one; Door bolt: door, one, the tenth of the twelve Earthly Branches: the west, one or the like.Be the integrality of protection Hanzi structure, rule 5:
When " one " and other type unit were combined, " one " treated so that stroke is horizontal in split process, and split result is not subjected to the influence of its type unit identity, and split result is a basic model, and " one " is type unit; Otherwise, be exactly the stroke horizontal stroke:.As extend, one, day, one; " one " is type unit.Give birth to, non-disconnectable, " one " is stroke.
3. pseudotype unit
The coordinate sign indicating number is divided into the first and non-type of type unit with the type of Chinese character, basic model also is divided into type unit's basic model and non-type unit basic model, and the coordinate sign indicating number is a core with type unit collection, judges whether the first type of non-type is the method for divining by means of characters of basic model, being a kind of intelligentized method, also is unique.Sign indicating number in the past, the split result of Chinese character must be radicals, and radical must be memorize mechanicallyd, and the coordinate sign indicating number does not require the non-type of memory unit, this method for splitting of coordinate sign indicating number, and its advantage is tangible, the first, it need only remember the thing of " character property ", and this has just had the basis of easy memory.(type unit collection also has " regularity ")
The second, the Chinese character basic structural unit that it is admitted be open (type unit also can, non-type unit also can) so, its split result meets the design feature of Chinese character easily.In fact, gain public acceptance at present, literal educational circles come out 600 surplus a word-building part, be reflected in the coordinate sign indicating number, wherein the most frequently used except that small part can continue to split, most important part just is reflected in type unit and concentrates; About 200 frequencies of utilization of another part are low, and the word-building part of character property difference just occurs with the form of non-type unit basic model, and this part does not need to remember in the coordinate sign indicating number exactly.This just the coordinate sign indicating number both met the structure law of Chinese character, easily learn one of reason of easy note again.
Moulding unit is the main body of type unit collection, and it has two features: (1) character property, (2) stability of structure.A kind of type is arranged in the structure of Chinese character, and it does not have a character property (be not Chinese character, be of little use yet), but has the stability of moulding unit, and helpful to splitting some Chinese characters, the coordinate sign indicating number is referred to as " pseudotype unit ", and pseudotype unit has two kinds.
(1) positive closo: positive closo, in conjunction with closely, sharpness of border is easy to differentiate between the stroke, thus the coordinate sign indicating number with it as pseudotype unit.As in the deer "
Figure A9611952300121
", in the leather " ", in the face " ".
(2) moulding unit hands over certain unicursal, connect to form,
By splitting rule, this single is drawn and be can not be split, and the coordinate sign indicating number thinks that this kind of independence should not be lower than the moulding unit in the type simultaneously, so in the pseudotype unit of listing.As: in the chimney
Figure A9611952300124
, old in
Figure A9611952300125
, be
Figure A9611952300126
Rule 6 " pseudotype unit is on Chinese character splits, and its functional equivalent is in moulding unit ".There are 2 points in pseudotype unit with the difference of type unit:
The first, pseudotype unit does not possess character property;
The second, the keyboard entry method difference is seen below.
After introducing pseudotype unit, the means of divining by means of characters of coordinate sign indicating number are more perfect.
As: " examine, face " all do not have type unit, introduce pseudotype unit after, split simple and rationally.
Examine,
Face,
Figure A9611952300133
Figure A9611952300134
4. type layer:
The type layer is the assembly of basic model, is removable; The structural intergrity of simultaneous type layer is also stronger, judges that whether non-type unit type is that the ability of basic model is also stronger,
Rule 7:
When type layer and single picture " horizontal stroke erects, folding " were combined, it was basic model that single is drawn.As: Mai , Ya,
Figure A9611952300135
(head is the type layer) greatly.Department, one, mouthful,
Figure A9611952300136
(
Figure A9611952300137
Be the type layer).
5 special cases
Rule 8:(1) situation of " " and " intersect type type unit " adhesion: when " one " and type unit mutually during adhesion, according to regular 4,5, " one " is non-disconnectable.But, when " one " intersects type type unit " when adhesion becomes non-font because whole non-word, as: in, Xu Zhonghe, so people are easy to identify the crossing type type unit in the integral body, and do like this and also be convenient to code fetch.For taking into account theoretical rigorous and actual demand, the coordinate sign indicating number as special case, and is stipulated as follows this kind situation:
When condition 1) " one " and " intersecting type type unit " adhesion, but be not internal and external relation; 2) both sums are non-fonts, and not with up and down adhesion of the third party, when all being satisfied, ' one ' removable, both sides all are basic models.As: slowly: Chi, , one, wood; : Chi, day, one, very little, surplus:
Figure A9611952300139
, one, wood.
(2) about the preferred version of " "
" one " is type unit after all, so in some occasions, do not violating under the theoretical prerequisite, the fractionation of " " is a kind of preferred version.
As: " penta, one, mouth is " correct for salty splitting into; Split into " the eleventh of the twelve Earthly Branches, mouthful " mistake.Department splits into "
Figure A96119523001310
, one, mouthful " correct, as to split into "
Figure A96119523001311
, mouthful " mistake.Close, split into " , one, mouthful " correct, as to split into " Mouth " mistake.
Rule 9:
When splitting two kinds of situations of Chinese character appearance, split according to following order:
(1) for type unit: " type unit is better than non-type unit; Moulding unit is better than even type unit; Positive type selecting unit is better than merger type unit ".
As " card ", two kinds of situations " upward with fore-telling " and "  is with following " are arranged;  is the merger type unit that foretells, so first kind correct.
And for example: " Gui " has " soil with soil " and " ten with king ";
According to " moulding unit be better than even type unit ", thus first kind correct.
(2) for annexation: " disperse and be better than adhesion; Tiltedly connect be better than direct-connected ".
As: among the Ao "
Figure A9611952300141
", two kinds of situations are arranged:
" with
Figure A9611952300142
" and "
Figure A9611952300143
With ten thousand ";
According to " tiltedly connect be better than direct-connected ", thus first kind correct.
Solved Chinese character and split after this problem, the coding that just can carry out Chinese character with imported, on coding and input element, the coordinate sign indicating number has the advantage of sound sign indicating number; Simply, directly perceived.
In the coordinate sign indicating number, the characteristic information unit of Chinese character is exactly a basic model, type unit collection has 334 type units, add the existence of the first basic model of non-type, the form of basic model is a lot, if with in the past the sign indicating number the same, the form merger of basic model is imported on the key position, will bring very heavy memory burden, the coordinate sign indicating number will not be a good input method of Chinese character.
The coordinate sign indicating number is on the basis of its Chinese character method for splitting, formed the coding theory and the method for the uniqueness of oneself, it is not input " basic model " this characteristic information unit, but the characteristic information of input feature vector information word " basic model ", figuratively speaking, it is " phonetic-stroke code " after Chinese character splits.
Seven. be used to the information of encoding and importing:
The coordinate sign indicating number adopts two kinds of information to encode and imports (1) message breath; (2) preface sign indicating number
1. message ceases:
The message breath, first phonetic alphabet of type unit basic model sign indicating number name, claim the message breath, what is " a sign indicating number name "? type unit divides two kinds, a kind of is font unit, a kind of is radical type unit, and for font unit, the sign indicating number name is exactly this word itself, for radical type unit, the coordinate sign indicating number is done their sign indicating number name according to the characteristics of their popular names with a word, and in fact the message breath is exactly the initial consonant of sign indicating number name or first letter of simple or compound vowel of a Chinese syllable, it does not relate to the Chinese phonetic alphabet " flat; cerebral ", does not relate to " four tones of standard Chinese pronunciation " yet, so whether it is not disturbed by pronunciation accurately.
Sign indicating number name about radical type unit:
Non-word radical commonly used generally all has a popular name sanctified by usage, and its method of being named can reduce three kinds: (1) radical is the part of a certain combinde rqdical character, is that radical is named with this combinde rqdical character, and this is " combinde rqdical character title ".As among Zhao " ", claim by Zhao's word; In the tiger "  ", claim brave prefix; (2) be named according to the feature of radical, this is " shape feature title ", as: Chuan, claim that three turn, San claims three to cast aside; Mouthful, claim square frame.(3) be named (the ancient shape or the distortion that are certain word) according to getting in touch of radical and certain word, this is " same source name ".As Xin and , claim one of the Chinese character components, (being the ancient shape of the heart), Rui claims 3 water, (being the ancient shape of water); Ox with
Figure A9611952300145
Claim by the ox word (distortion of ox).
In the coordinate coding, usually have the radical type unit of " combinde rqdical character title ", its " sign indicating number name " is exactly this combinde rqdical character; Radical type unit with " shape feature title ", its " sign indicating number name " just got the center word of popular name;
Radical type unit with " same source name ", its " sign indicating number name " is exactly its doublet.Specifically see the following form.
Youngster's combinde rqdical character that popular name type codes name letter 01 Tou literal head combinde rqdical character literary composition W 02 Yin of code name sequence number radical type unit of radical type unit builds is built J 03 Zhuang and the other combinde rqdical character of word is done at the bottom of the word combinde rqdical character with J 04 European-allies is done precious B 06 Ji of N 05 Http Baozi top combinde rqdical character and seek the prefix combinde rqdical character and seek the sick B 08  tiger of the sick prefix combinde rqdical character of X 07 Epileptic prefix combinde rqdical character tiger H 09Spring prefix combinde rqdical character spring C 10
Figure A9611952300153
Volume prefix combinde rqdical character volume J 11
Figure A9611952300154
The prefix combinde rqdical character of holding a memorial ceremony for is held a memorial ceremony for J 12 Bo and is stepped on the prefix combinde rqdical character and step on D 13
Figure A9611952300155
The blue or green Q 15 of the blue or green prefix combinde rqdical character of other combinde rqdical character Zhao of Zhao's word Z 14
Figure A9611952300156
The prefix combinde rqdical character is total to G 16 Jiong and rectifies combinde rqdical character with T 17 with word altogether
Figure A9611952300157
Ash prefix combinde rqdical character ash H 18 Bao bag prefix combinde rqdical character bag B 19
Figure A9611952300158
Combinde rqdical character clothing Y 20 at the bottom of the clothing word
Figure A9611952300159
Going out the prefix combinde rqdical character goes out bald T 23  of the bald precious lid shape feature of C 21 Bing WAWQ shape characteristic point D 22 Mi three frame hurdle shape feature frame K 24 mouthfuls of square frame-shaped feature sides F 25 San three and casts aside the shape features and cast aside P 26 Chuan three and turn the shape feature and turn the random L 28 of the random hank knotting shape of G 27 Si feature and adopt the shape feature and adopt the other homology speech of the C 29  shape feature cutter D30 Yan speech word Y31 Dao vertical cutter homology cutter D32 Ha homology of falling the Eight characters eight B33
Figure A9611952300161
Youngster's homology that the perpendicular heart homology heart X37 Chuo of the private S34 Jie of private word homology monaural knife-edge feature ear E35 Fu ears knife-edge feature ear E36 Xin walks is walked the anti-literary composition of the little X39 The-Fan of the little anti-little homology of Z38 and is shown that with the other homology food of the anti-dog homology of source document W41 Quan dog Q42 Cannibals food word S43 Woo mending youngster's homology shows that the other homology ox of S44 ox ox word N45 Zhao pawl prefix homology pawl Z46 Yi clothing mends the fully other homology gold of fixed other homology foot Z49 Jin gold word J50 four or four prefix homologies four S51 Xiangxi homologies fire H of youngster's homology clothing Y47 bamboo prefix homology bamboo Z48 with source document W40 Fan folding literary composition
Rui, Rolling, Lv, the message breath of 4 radical types of Ren unit defines.See below
2. preface sign indicating number:
The coordinate sign indicating number extracts " the shape information " of Hanzi structure by " preface sign indicating number ".Preface sign indicating number: " combination of two strokes clocklike claims the preface sign indicating number ".The coordinate sign indicating number adopts following three kinds of preface sign indicating numbers: (1) sound preface sign indicating number: the first, two two combination of basic model claims " sound preface sign indicating number ".(2) end preface sign indicating number: basic model or type layer, the combination of the first and end stroke of combinde rqdical character claim " end preface sign indicating number ".(3) total order sign indicating number: the total order sign indicating number constitutes by two yards, and first sign indicating number is " the sound preface sign indicating number " of basic model, and inferior sign indicating number is " the end preface sign indicating number " of the surplus portion of basic model.Promptly extract the 1st, 2,3 ends of basic model, stroke coding.
Their using method will be addressed in coding rule.
The feasibility of preface sign indicating number: we know each per capita correctly book go out the Chinese character of oneself not being familiar with, because " order of strokes observed in calligraphy " is general knowledge the most basic in the Chinese character, and the related stroke of preface sign indicating number only is two strokes that the position is special, and the preface sign indicating number has simple property thus.
The meaning of preface sign indicating number: the form of preface sign indicating number is very simple, but is one of indispensable theoretical pillar of coordinate sign indicating number.The first, it has solved the input problem of the first basic model of non-type.The second, it can enter the inside of Hanzi structure, extracts shape information exactly, has guaranteed the diversity and the completeness of coded message.The 3rd, its introducing makes to make the input element of coordinate sign indicating number simple unusually by the coordinate sign indicating number shape data inputting method of sign indicating number employed " input after the merger of characteristic information unit " in the past, convenient, the keyboard content of coordinate sign indicating number is also simpler than the keyboard content of the Two bors d's oeuveres double-tone method in the sound sign indicating number.
The basis of preface the sign indicating number---order of strokes observed in calligraphy:
The order of strokes observed in calligraphy: when writing the Chinese regular script word, the sequencing of starting writing is " order of strokes observed in calligraphy ".The order of strokes observed in calligraphy is the summary that people write experience for a long time, is to form in the practice, and following main rule is arranged:
From top to bottom: three speech beans divide Lu early; From left to right: with leaf river piece shape friend;
Horizontal earlier back is perpendicular: ten cun positive Feng Mu of well; Cast aside afterwards earlier and press down: the people goes into eight chis fire pawl;
The first intermediate and then both sides: little undertaking water also forever; From outside to inside: between flying in moon wind direction;
From the inside to surface: this far builds fierce letter;
Elder generation goes here and there the heart after the main body: the rich string book of Wei volume; Put a little after elder generation's main body: I send out dog prestige dragon;
Elder generation's point point back main body: the justice master is;
Eight. the distortion of type unit and merger
In the coordinate sign indicating number, there is the merger phenomenon in type unit, and merger has two kinds of situations.Both sides' shape difference of merger big as: Xin and , especially with In-particular, then with Nie, this merger is common among the Chinese dictionary, and the reason of its merger is arranged.In the coordinate sign indicating number, only admit situation about listing in the type unit merger table.
2. both sides' structural similarity of merger
Chinese character is a kind of ideographic language, no matter Hanzi structure is complicated and simple, no matter also what of Chinese-character stroke, the profile of Chinese character all is a square, and for keeping the balance of square inside, " basic model " of structure word just can only make some changes---become either large or small, long or flat, to adapt to the requirement of square; Therefore some strokes also do certain change, to avoid the pressure of covering between stroke.
As: in the material " wood-
Figure A9611952300171
"; In the sled "
Figure A9611952300172
-Mao "; In the turtledove "
Figure A9611952300173
-nine ".
(1) " the non-intersection of folding " pen " " even type unit for containing, because the form of " folding " pen is a lot, and its architectural feature point of two pen types of non-intersection is few again, so coordinate sign indicating number regulation: " the even type of the non-intersection unit that contains the folding pen; as long as deformation takes place the folding pen; type unit merger table do not admit again, two types just can not merger, and the type after the variation is a first type of non-type." as: in seeing "
Figure A9611952300174
", be not even type unit " Jiong " just.
(2) to the type unit of remainder: the coordinate sign indicating number is stipulated under following two kinds of situations can natural merger, and does not list type unit and table in.A. congruent type merger: if certain unicursal generation deformation of type unit, but whole structural relation is constant, and the kind of stroke also becomes, and claims that then these two types are that " congruent type " can merger: as again- -; Eight-
Figure A9611952300182
Hair-
Figure A9611952300183
The king-
Figure A9611952300184
Wood-
Figure A9611952300185
B. vertical again pen type merger: direction will be erected by wieling the pen, the perpendicular left-falling stroke and perpendicular section of folding, be called " vertically stroke or line segment ", if two and plural " length " " vertically stroke or line segment " are contained in a type unit, when " vertically stroke or line segment " deformation, but still when being " vertically stroke or line segment ", allow two type merger.
As: get rid of-- With-- Open-
Figure A9611952300188
Well-
Figure A9611952300189
-also; Month-
Figure A96119523001811
Annotate: " moon " with " " in the coordinate sign indicating number, be regular governed, in upper and lower relation, think "
Figure A96119523001813
", and about when concerning, think " moon ", as: bright, friend, stomach, beautiful, vertical again pen type allows merger, is because the architectural feature point of this type is many, change a bit after, still be easy to discern, still close, so the coordinate sign indicating number thinks that they can natural merger.
Other situation: other distortion, only the situation of admitting when type unit merger table can merger, otherwise cannot merger.
Why not together, the type unit of phase merger has: during input, their " sound " information is identical, but their " preface sign indicating number " is with different (congruent figures are constant).
Nine. the keyboard of coordinate sign indicating number
Keyboard is used for importing the coded message of Chinese character, and the keyboard of coordinate sign indicating number is very simple, and its content is less than " the Two bors d's oeuveres double-tone " of sound sign indicating number, the coordinate sign indicating number keyboard synoptic diagram during for details, see the appendix.
Coordinate sign indicating number keyboard is made up of four parts: 1 English alphabet: in order to " sound " information " first phonetic alphabet of imported unit.The English alphabet invariant position.2 preface sign indicating numbers: in order to input " shape " information-preface sign indicating number.Each and every one English alphabet of on the keyboard 25 (N need not) is divided into five districts, corresponding preface sign indicating number the first sum of " horizontal stroke, perpendicular; as to cast aside; as to press down, folding ", each district by " horizontal stroke; perpendicular; cast aside is pressed down, folding " order from the centre to time pen of the corresponding preface sign indicating number of arranged on both sides, such 25 preface code element correspondences 25 English alphabets, constituted " preface sign indicating number keyboard ".Because " preface sign indicating number keyboard " is rich in rule extremely simply again,, grasp easily, 3. the one-level brevity code so need not remember: 26 one-level brevity codes of coordinate sign indicating number, to be formed 5 word to be defined on 26 letter keys, one word, one key during input is in order to improve the speed of individual character input.4.6 individual special type unit: the Lv of type unit, wood, Rui, Rolling, month, the message breath of Ren define, so as to discrete Chinese character, and the minimizing repeated code, defining relation is as follows:
Rui-U; Rolling-l; Lv-A; Wood-V; Ren-O; The moon-P
The corresponding relation of preface sign indicating number and keyboard is as follows:
Preface sign indicating number (the first sum of/time pen) one/one by one/Shu, one/Pie one/Dian one/
Figure A96119523001815
Letter G F D S A
Preface sign indicating number (the first sum of/time pen) Shu/one Shu/Shu Shu/Pie Shu/Dian Shu/
Letter H J K L M preface sign indicating number (the first sum of/time pen) Pie/Pie/| Pie/Pie Pie/Dian Pie/
Figure A9611952300196
Tee R E W Q preface sign indicating number (the first sum of/time pen) Dian/one Dian/Shu Dian/Pie Dian/Dian Dian/
Figure A96119523001913
Letter Y U I O P preface sign indicating number (the first sum of/time pen)
Figure A96119523001914
/ one
Figure A96119523001915
/ |
Figure A96119523001916
/ Pie / Dian /
Figure A96119523001920
Letter b V C X Z
Ten. coding rule:
The single character code rule
According to the quantity of basic model, Chinese character is divided into the monotype word, and dimorphism word, three type-words and many types of word, single character code have following two cardinal rules: 1) press sequential write, extract the 1st, 2,3 last basic models and encode; 2) code fetch for the first time, what type unit got is " sound ", what non-type unit basic model was got is " sound preface sign indicating number "; Information is mended corresponding " preface sign indicating number " inadequately again.
Single character code in two kinds of situation
1. generalized case:
The monotype word: the monotype word is exactly a font unit, and code length is 3, is made of i.e. " sound "+" total order sign indicating number "+space the sound and the total order sign indicating number of font unit.
Illustrate: with in following, " sound " represented with English alphabet, and " preface sign indicating number " is with the letter representation of right shoulder belt *.As: king: WG*H*; People: RW*W*; Long: CT*X*.
Annotate: the coding of (1) radical type unit is identical with the monotype word.As:, QQ*E*; , SQ*C*; 2) contain five basic strokes among the GB character set GB-2312 (80), their sign indicating number name is respectively " one " (one), perpendicular (1) left-falling stroke (Pie) right-falling stroke (Dian) folding (
Figure A96119523001922
), coding method is with the monotype word.
One: YG*G*; (Shu): SJ*J*; (Pie): PE*E* (Dian): NO*O*; ( ): ZZ*Z*
(2). the dimorphism word: the dimorphism code word length is 4, the 1,2 yards, according to stroke order extracts " sound " or " sound preface sign indicating number " of fundamental mode;
The 3rd, 4 yard, according to stroke order extract fundamental mode " end preface sign indicating number " separately.
As two: again, again, YYX*X*; Sign indicating number: stone, horse; SMG*B*; Word: Http, BZP*B*.
(3) three type-words: three type-words: three type-word code lengths are 4, the 1,2,3 yards, according to stroke order extract " sound " or " sound preface sign indicating number " of three basic models; The 4th yard, round at " end preface sign indicating number " of word.As: product: mouthful, mouthful, mouthful, KKKH*; Sit: people, people, soil: RRTT*; Calculate: , order,
Figure A96119523001925
, ZMNR*
(4) many types of word: many types of code word length is 4, according to stroke order extracts " sound " or " sound preface sign indicating number " of the 1st, 2,3 last basic models.
As: defeated: car,
Figure A96119523001926
, month, Dao; CW*PD.
Frequently: end,
Figure A96119523001927
Figure A96119523001928
, shellfish, ZK*D*B
Refreshing: big, DZZZ
Seat: wide, people, people, soil; GRRT
2. special circumstances:
(1) special type unit: the distribution (i.e. 1st yard distribution situation) of Chinese character on the key position is uneven.V, U, I is not the Chinese phonetic alphabet, is preface sign indicating number key position, so the Chinese character on the key position is few; Chinese space on the O.P.A key position also seldom, as the A key, the words that do not include the preface sign indicating number have only " recessed " word, in order to utilize the key position fully, discrete better Chinese character reduces repeated code, and the coordinate sign indicating number is with 6 maximum in Chinese character radicals by which characters are arranged in traditional Chinese dictionaries Rui of type unit, wood, Rolling, Lv, month, Ren, adopt the mode of definition to be placed on above-mentioned 6 key positions, so, their input is also just irrelevant with itself.
Corresponding relation is as follows: Rui-U; Rolling-I; Lv-A; Wood-V; Ren-O; The moon-P.
(2) coding rule of special circumstances
Chinese character on above-mentioned 6 key positions and the K key, the 1st the basic model overwhelming majority is the same, i.e. Rui, Rolling, Lv, wood, Ren, month, mouthful.If by the generalized case code fetch, the 3rd yard of dimorphism word, the 4th yard (end preface sign indicating number) the first sum of function that will lose discrete Chinese character of three type-words, for this reason, the coordinate sign indicating number with first basic model be above-mentioned 6 type units (do not comprise " Chinese character of moon "), as special circumstances, single coding rule that stands.
1) monotype word: (comprising radical) many types of word: coding rule is with general situation.2) dimorphism word: code length is 4, the 1,2 yards, with general situation; The 3rd, 4 yard, get the total order sign indicating number of " inferior basic model " ".
In fact, back trigram be exactly " inferior basic model " " all-key ".3) three type-words: code length is 4, the 1,2,3 yards, with general situation; The 4th yard, get " the end preface sign indicating number " of latter two basic model.
The phrase coding rule:
Individual character extracting code, what get is the characteristic information of basic model, the phrase code fetch is then based on the first letter of pinyin of individual character.Because the phrase input mode is introduced the first letter of pinyin of whole word more, eliminated the end preface sign indicating number of individual character up hill and dale, so the coordinate sign indicating number character property under the phrase mode is more remarkable, code fetch is more directly perceived, fast, this is another rationale that the coordinate sign indicating number can be imported Chinese character fast.
Two-character word: preceding two sign indicating numbers of its all-key got in each word
As: coordinate, RRVY
The people, RW*MB
Three words: the 1st, 2,3 yards is respectively triliteral first letter of pinyin.The 4th yard first sign indicating number of getting last word all-key.
As: bicycle: ZXCC
Chinese herbal medicine: ZCYA
Multi-character words: order extracts the 1st, 2,3, the first letter of pinyin of last word.
As: special economic zone: JJTQ
Painstaking efforts: JKFD
The present invention's advantage: coding method is simple, is easy to realize that computer Chinese-character imports fast, and the repetition rate of coding is low, is convenient to memory, is convenient to study.
Embodiments of the invention:
Example 1: " moon ", " standing grain " word
Month, be to be full of five not font units of disassembly principle, code length is 3, month: YQ*G*
Standing grain contains type unit " wood ", but according to rule " single is drawn to cast aside and cannot be done basic model separately ", so " standing grain " also is that font unit code length is 3, standing grain: HT*L*
Month, the 1st sign indicating number of standing grain two words is their " message breaths ", the 2nd, 3 yard is their " total order sign indicating number ".
Example 2: " institute " word
Two basic models are contained in " institute ", and one is type unit basic model " jin ", a yes-no type unit basic model " ".
Be encoded to institute: E*JT*R*
Example 3: " section " word
Three type unit basic models are contained in " section ", standing grain,
Figure A9611952300211
, ten
Be encoded to: HDSR*, the 4th yard benefit be the end preface sign indicating number R* of whole word.
Example 4 " is climbed " word
Climb, 6 type unit basic models are arranged, for " wood, , , wood, big, hand ".
Coded sequence extracts the 1st, 2,3 ends, basic model coding;
Climb: MZZS.
Example 5 " is examined " word
Examine, split, contain two first basic models of non-type " according to " pseudotype unit "
Figure A9611952300215
", coding is made of the preface sign indicating number fully
Examine: F*A*D*A*

Claims (9)

1. a coordinate codes coding method for computer Chinese characters input is characterized in that this coding method does not have the method for divining by means of characters of " radical collection " by the coordinate sign indicating number and coding method two parts of coordinate sign indicating number are formed, and the method for divining by means of characters that the coordinate sign indicating number does not have " radical collection " is by five disassembly principles not; The foundation that Chinese character splits--type unit collection; The correlative factor that Chinese character splits; The fractionation rule of Chinese character is formed, and the coding method of coordinate sign indicating number is by the information that is used to encode and import; The corresponding relation of coordinate sign indicating number and keyboard: coding rule is formed; Its medium-sized unit collection is made up of three parts:
(1) meets five the not Chinese character and the non-word radicals commonly used of disassembly principle;
(2) can not tearing the Chinese character and the non-word radical commonly used of (1) medium-sized unit open, also is type unit;
(3) containing type unit, but do not allow the Chinese character that splits and use non-word radical always in the fractionation rule of coordinate sign indicating number, also is type unit:
For GB GB2312 (80) character set, first 334 of total type, wherein Chinese character is 279,55 of radicals commonly used;
The correlative factor that Chinese character splits is by the classification and the character of type, and the position of type concerns that the annexation between the type is formed;
The information that wherein is used to encode and imports is made up of " message breath " and " preface sign indicating number ".
2. by the described coding method of claim 1, it is characterized in that described five not disassembly principle be
(1) stroke does not allow to split into two-section, breaks in two types, and reason: single stroke ought to be complete;
(2) stroke that intersects does not allow to split, and reason: " intersection " is a kind of array mode closely;
(3) stroke of Xiang Duanlian does not allow to split, and reason: " end " also is a kind of tight type array mode;
(4) do not allow to split into stroke, reason by two Chinese characters that constitute and non-word radical commonly used: get by the original function reasoning of stroke;
(5) isolated fully by certain unicursal at least, be symmetrically distributed on the structure and contained, two singles draw not allow to split out and form a type, and reason: philology is pointed out " structure of Chinese character is a kind of modular construction ".
3. by the described coding method of claim 1, it is characterized in that the classification of correlative factor one type that Chinese character splits and character are that a type is in Chinese character " stability "---both did " basic model " size of ability, relevant with the stroke number that constitutes it, also with stroke between array mode relevant, the coordinate sign indicating number is classified as follows the type in the Chinese character according to stroke number and array mode:
(1) monotype: only have the type of a stroke to claim " monotype ", type unit collection has two monotype type units, and one and second.Character: the stability of monotype is the most weak, has only in particular cases and just can do basic model;
(2) even type: claim " even type " by two types that constitute;
Character: the character of even type is very special, and Hanzi structure uncertain factor concentrated area is reflected on the body of even type." stability " of idol type occupy between monotype and the moulding, and even type could be subjected to all multifactor influences as basic model,
(3) moulding: the type that is made of three and three above strokes claims " moulding ", moulding is divided into three kinds again according to the array mode between the stroke: 1) positive closo: at least three faces are by horizontal stroke, and closo that the vertical line section constitutes and the stroke that intersects with it claim " positive closo "; 2) intersect type: the moulding that contains overlapping relation between stroke claims " intersecting type ": 3) accumulation type: only contain adhesion between stroke, the moulding of discrete relationship claims " accumulation type ": character: moulding " stability is " more intense, with moulding unit, they all are basic models generally speaking, have only in particular cases, moulding unit can not be a basic model also.
4. by the described coding method of claim 1, it is characterized in that the correlative factor that Chinese character folding divides--the position relation of type, position relation be meant Chinese character medium-sized between each other position relation, the position relation of Hanzi structure has four kinds: single-relation, upper and lower relation, about the relation, internal and external relation is about the coordinate sign indicating number is thought, type in the relation of position, the left and right sides, relatively independent each other, be a kind of coordination, the suitable fractionation; And there is a kind of contact each other in the type in the internal and external relation, and independence is relatively poor comparatively speaking, and fractionation is had certain constraint.
5. by the described coding method of claim 1, it is characterized in that correlative factor--" annexation " between the type that the Chinese character folding divides, " annexation " is meant the way of contact between the amphitypy, it is the connected mode between the stroke, annexation between the type is divided into two classes " disperse " with " adhesion ", discrete, obviously be the condition that helps splitting; Adhesion according to circumstances can be divided into three kinds of concrete conditions again:
(1) positive closed: two types are if form positive closo, and then the adhesion mode between two types claims " positive closed ";
(2) direct-connected: the relation between the amphitypy between the phase adhesion stroke, when being direct-connected the relation, " direct-connected " relation that is both of amphitypy;
(3) tiltedly connect: the relation between the amphitypy between the phase adhesion stroke, exist tiltedly to connect when concerning, between the amphitypy " tiltedly connecting " relation;
The coordinate sign indicating number thinks that positive closure is an adhesion mode closely between the type; Direct-connected is connected mode more closely, and tiltedly connecting is the most weak adhesion mode.
6. by the described coding method of claim 1, it is characterized in that the folding branch rule of Chinese character is
Chinese character and type layer that rule 1 is made up of type unit fully, type unit all is basic model, and is removable;
Rule 2 is torn the not Chinese character and the non-word radical commonly used of removing from mould unit open, and when promptly not having type unit as judgment basis, itself also is type unit, and is non-disconnectable;
Rule 3: when the first and non-type of type unit type was combined, type unit is basic model not necessarily, and split result concerns that with classification (2) position of (1) type (3) annexation three is relevant;
1) " coordinate sign indicating number regulation: regular 3-1: when type and type were positive occluding relation, only removable when both sides are type unit, the both was a basic model, otherwise non-disconnectable four kinds of connected modes to be arranged between type and the type, for positive closure;
2) for type with " disperse " and " direct-connected " " tiltedly connect " three kinds of connected modes, the coordinate sign indicating number has following rule: regular 3-2: when type unit was moulding unit, needing only the other side was not monotype, and promptly detachable, both sides are basic model ";
Rule 3-3: when type unit is even type unit, split result will depend on " classification of type, position relation and annexation " three factors "; 1. when the other side is even type and accumulation type: when adhesion neither internal and external relation, both sides all are basic model, and were removable; When having adhesion situation (direct-connected or tiltedly connect) or internal and external relation, even type unit is the part of basic model, and is non-disconnectable; 2. when the other side when intersecting type: tiltedly connecting or discrete case under, no matter the position relation is how, both sides all are basic models, and are removable; Under direct-connected situation, up and down, position, left and right sides relation is removable, and both sides are basic model; Inside and outside position relation is non-disconnectable, and type unit is the part of basic model;
The rule 3-4: " when type unit be single type unit ' one ' and ' second ' time, only when (1) the other side be moulding; (2) discrete each other; (3) be not internal and external relation, when three conditions all satisfied, removable, both sides were basic model, otherwise non-disconnectable:
Rule 4:
Single is drawn and is cast aside, right-falling stroke does not allow to do separately basic model; The single picture is horizontal, vertical, folding can be done basic model separately, but will satisfy following three conditions simultaneously: be not internal and external relation with adjacent type; With adjacent type adhesion; Adjacent type is a type unit;
Rule 5:
When " one " and other type unit were combined, " one " treated so that stroke is horizontal in split process, and split result is not subjected to the influence of its type unit identity, and split result is a basic model, and " one " is type unit; Otherwise, be exactly the stroke horizontal stroke;
Rule 6:
Pseudotype unit is on Chinese character splits, and its functional equivalent is in moulding unit;
Rule 7:
When type layer and single picture " horizontal stroke erects, folding " were combined, it was basic model that single is drawn;
Rule 8:
When splitting two kinds of situations of Chinese character appearance, split according to following order:
(1) for type unit: " type unit is better than non-type unit; Moulding unit is better than even type unit; Positive type selecting unit is better than merger type unit ";
(2) for annexation: " disperse and be better than adhesion; Tiltedly connect be better than direct-connected ";
7. by the described coding method of claim 1, it is characterized in that being used to encode and import information--be message breath and preface sign indicating number, the message breath is first phonetic alphabet of the first basic model sign indicating number of type name; The preface sign indicating number is the combination of two strokes clocklike, and they are characteristic informations of Hanzi features information word " basic model ".
8. by the described coding method of claim 1, it is characterized in that the coordinate sign indicating number and the corresponding relation of keyboard are that the message breath is corresponding one by one with key letter, preface sign indicating number corresponding relation is as follows: preface sign indicating number (the first sum of/time pen) one/one by one/Shu, one/Pie one/Dian one/
Figure A9611952300053
Letter G F D S A preface sign indicating number (the first sum of/time pen) Shu/one Shu/Shu Shu/Pie Shu/Dian Shu/
Figure A9611952300056
Letter H J K L M preface sign indicating number (the first sum of/time pen) Pie/one Pie/Shu Pie/Pie Pie/Dian Pie/
Figure A9611952300059
Tee R E W Q preface sign indicating number (the first sum of/time pen) Dian/Dian/one | Dian/Pie Dian/Dian Dian/
Figure A96119523000516
Letter Y U I O P preface sign indicating number (the first sum of/time pen)
Figure A96119523000517
/ one / Shu
Figure A96119523000519
/ Pie / Dian
Figure A96119523000523
/ Letter b V C X Z.
9. by the described coding method of claim 1, it is characterized in that coding rule is single character code rule and phrase coding rule,
Wherein single character code rule generalized case is: the monotype code word length is 3, sound and total order sign indicating number by font unit are formed, i.e. " sound "+" total order sign indicating number "+space: the dimorphism code word length is 4, the 1st, 2 yards, according to stroke order extract " sound " or " sound preface sign indicating number " of basic model, the 3rd, 4 yards, according to stroke order extract basic model " end preface sign indicating number " separately; Three type-word code lengths are 4, the 1,2,3 yards, according to stroke order extract " sound " or " sound preface sign indicating number " of three basic models the 4th yard " end preface sign indicating number " that rounds word; Many types of code word length is 4, according to stroke order extracts " sound " or " sound preface sign indicating number " of the 1st, 2,3 last basic models; Special circumstances are: the coordinate sign indicating number is Lv with first basic model, wood, and Rui,, Ren, the Chinese character of mouthful 6 type units are as special circumstances, and list founds coding rule, and monotype word and many types of word code rule are with general situation; The dimorphism code word length is 4, the 1,2 yards, with general situation; The 3rd, 4 yard, get the total order sign indicating number of " inferior basic model " ": three type-word code lengths are 4, the 1, and 2,3 yards with general situation, the 4th yard end preface sign indicating number of getting latter two basic model;
The phrase coding rule:
Two-character word: preceding two sign indicating numbers of its all-key got in each word; Three words: the 1st, 2,3 yards is respectively triliteral first letter of pinyin, the 4th yard first sign indicating number of getting last word all-key; Multi-character words: order extracts the 1st, 2,3, the first letter of pinyin of last word.
CN96119523A 1996-10-31 1996-10-31 Coordinate codes coding method for computer Chinese characters input Expired - Lifetime CN1054447C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN96119523A CN1054447C (en) 1996-10-31 1996-10-31 Coordinate codes coding method for computer Chinese characters input

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN96119523A CN1054447C (en) 1996-10-31 1996-10-31 Coordinate codes coding method for computer Chinese characters input

Publications (2)

Publication Number Publication Date
CN1173660A true CN1173660A (en) 1998-02-18
CN1054447C CN1054447C (en) 2000-07-12

Family

ID=5125774

Family Applications (1)

Application Number Title Priority Date Filing Date
CN96119523A Expired - Lifetime CN1054447C (en) 1996-10-31 1996-10-31 Coordinate codes coding method for computer Chinese characters input

Country Status (1)

Country Link
CN (1) CN1054447C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100388170C (en) * 2002-08-12 2008-05-14 宁绍洲 Universal fast electronic and manual Chinese character processing method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100388170C (en) * 2002-08-12 2008-05-14 宁绍洲 Universal fast electronic and manual Chinese character processing method

Also Published As

Publication number Publication date
CN1054447C (en) 2000-07-12

Similar Documents

Publication Publication Date Title
CN85101817A (en) An zijie type Chinese-character stroke computer code's method and keyboard thereof
CN1019424B (en) High-speed chinese character inputting method using synthetic coding of pronunciations, forms and strokes and keyboard used
CN1173660A (en) Coordinate codes coding method for computer Chinese characters input
CN1086480C (en) Real code coding method for Chinese characters and using keyboard thereof
CN1123819C (en) Chinese character key-position code input method for computer
CN1120408C (en) Chinese-character struture-pronunciation input method for computer
CN1166997C (en) Chinese-character fast input method without splitting
CN1374577A (en) General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1271492C (en) 26104 computer Chinese character
CN1108552C (en) Perfecting method (PHF) for phoenticizing Chinese charaters
CN1309342A (en) Chinese-character pronunciation-shape fuzzy input method for computer
CN1196989C (en) Chinese character pattern schematic input method and keyboard thereof
CN1123820C (en) Chinese-character 'shape-pronunciation' input system
CN1150444C (en) Chinese-character 'letters' input method for computer
CN1841278A (en) Double-code detachment-free high efficiency Chinese character input technology
CN1148635C (en) Chinese-character 'resection code' encode method
CN1455318A (en) Position-stroke Chinese character input method
CN1275128C (en) Chinese character encoding method by employing 100 integrated radicals and computer keyboard thereof
CN1054445C (en) Natural coding method for Chinese characters
CN1357814A (en) Computer Chinese keyboard and its Chinese information inputting and processing method
CN1131295A (en) Sonic code Chinese character input method and its form code scheme
CN87106169A (en) Two-dimensional character code
CN1065740A (en) The on-keyboard of China's light hanzi system and Chinese character and miniature keyboard input
CN1808351A (en) Chinese character input method using initial and etymon to encode for computer
CN1567151A (en) Dictionary code of radicals in Chinese character

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1058613

Country of ref document: HK

CX01 Expiry of patent term

Granted publication date: 20000712

EXPY Termination of patent right or utility model