CN1132366A - Four-stroke sequential syllable Chinese character coding method - Google Patents
Four-stroke sequential syllable Chinese character coding method Download PDFInfo
- Publication number
- CN1132366A CN1132366A CN 95110379 CN95110379A CN1132366A CN 1132366 A CN1132366 A CN 1132366A CN 95110379 CN95110379 CN 95110379 CN 95110379 A CN95110379 A CN 95110379A CN 1132366 A CN1132366 A CN 1132366A
- Authority
- CN
- China
- Prior art keywords
- code
- radical
- stroke
- coding
- suffix
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
A Chinese-character encode method for computer features that the initial and end strokes or roots are used to determine stroke order code, which is then combined with phonetic letters to form a code of single word or phrase. its advantages include clear characteristics, single rule, less memorized amount and low duplicate rate.
Description
The invention relates to a kind of method of computer Chiense character code technology.
Encode Chinese characters for computer at present has multiple scheme, and as phonetic sign indicating number, five-stroke form code etc., these schemes have been utilized features such as the stroke, radical, phonetic, structure of Chinese character, and be social known.But, as the subject matter of its existence of Chinese characters in computer input technology be how to solve standard, standard, efficient, fast, problem such as simple, Yi Xue.So-called standard is meant that coding rule is clear and definite, the problem that does not exist or seldom exist fork to separate; Standard is meant that coding rule should be the most common and the clearest and the most definite according to some, for some feature of Chinese characters that most of people can understand are formulated; Efficiently be embodied on the code length unsuitable long, theoretically, express with existing conventional characters, distinguish each Chinese character and do not have repeated code, generally need three characters, accomplish this point and in fact be difficult to strictness,, just should be considered as efficiently if reach triple bond or be less than three strong words as its average input efficiency of an encoding scheme; Be to reduce repeated code fast as far as possible, improve input speed; Easy to learn is exactly to want coding rule few as far as possible, and memory capacitance is few and be convenient to association, and with reduce special case and irregular processing in particular cases as far as possible.Should still need to study better coding method in order to overcome the above problems simultaneously.
In existing encoding scheme, as the Five-stroke Method, by Chinese character to the coding to split into radical earlier, associate stroke by radical, again by stroke to code.Like this, determine that the coding of a Chinese character need be understood two processes, the one, by the method for splitting of Chinese character to radical, the 2nd, by the define method of radical to code.In these two processes, the user not only will be familiar with a large amount of radicals, and some radical is of little use in addition with the custom inconsistent; Also to be familiar with coding rule, in these coding rules, different method for splitting and code definition method have been taked according to different situations sometimes, be special circumstances, the result of Chan Shenging is that coding rule is complicated, diversified on the one hand like this, allow user, especially beginner feel indeterminate unavoidably or multi-solution is arranged; Be that a large amount of memories has brought inconvenience for study and use on the other hand, if want to reach skill level, people have to spend the plenty of time and go exercise.
Also have some current programmes to adopt some indeterminate or difficult feature of being grasped by the people of Chinese character, if any scheme when the definition radical code, sometimes adopt the foundation that is shaped as of radical, sometimes the phonetic that adopts radical is foundation, this definition is actually adopts double standards or multiple standard, has uncertainty, features such as the tone of the scheme employing Chinese character that has, structure, and these features often are not easy to be grasped by the user, as a lot of people of tone uncertainly; In addition, in using the method for radical, nearly all can't solve radical and code problem one to one as coding basis.
Efficient, fast aspect existing many schemes all reached reasonable target, still, the method that has lacks good operability, for example: though defined simple code words such as two yards, trigram, how to know that it is a brevity code when a certain Chinese character of input.Word? this is still remarkable for the beginner.Address this problem and aspect computer programming, to cooperate, and the program that has has been accomplished this point preferably.
Standardization and standardized problem have solved, and purpose easy to learn has also just realized.
The present invention wishes to reach the simplification coding rule by seeking and utilizing some basic and clear and definite features of Chinese character and these combination of features to carry out individual character and phrase coding, increases the standard degree, reduces the memory composition, improves the purpose of code efficiency.
The present invention is achieved in that prefix, suffix stroke or prefix, the suffix radical that utilizes Chinese character, or carries out individual character or phrase coding in conjunction with phonetic transcriptions of Chinese characters.
When determining prefix, suffix stroke and prefix, suffix radical, utilized the notion of a preface, a preface has clear and definite definition and is familiar with by people in most cases, therefore be feature with prefix, suffix, process and method that Chinese character splits into the radical code have fundamentally been changed, and prefix, suffix feature are given prominence to, and have realized regular simple, the clear and definite standard-required of code.
Because when specifically utilizing prefix, suffix stroke, generally get four, wherein prefix, suffix are respectively got two, add the feature of a preface, phonetic, so the method that the present invention proposes is called four-stroke sequential syllable Chinese character coding method.
As the specific embodiments of said method, can take following steps:
1,,, return into five types as point, horizontal, vertical, left-falling stroke, folding, right-falling stroke, hook etc. with the basic stroke of Chinese character:
2. one, horizontal stroke;
3. Shu is perpendicular;
4. Pie casts aside;
5. ( ∠ Ya
), folding comprises preceding four class strokes other stroke in addition, as perpendicular folding, left-falling stroke folding, horizontal hook, lifting-hook, cross break hook, horizontal crotch etc.
Like this, all Chinese character basic strokes can be representative by above five classes, as the basic stroke type of this encoding scheme.
2, aforementioned five class basic strokes are arranged according to the order of sequence in twos, be assigned to 25 letters strong on, be defined as follows: q: Dian Dian w: Dian one e: Dian Shu r: Dian Pie t: Dian y: a Dian u: i one by one: a Shu o: a Pie p: a a: Shu Dian s: Shu one d: Shu Shu f: Shu Pie g: Shu h: Pie Dian j: Pie one k: Pie Shu l: Pie Pie m: Pie x: Dian c: one v: Shu b: Pie n:
English alphabet " z " usefulness not in definition has purposes in addition in the coding.According to above-mentioned definition, 25 English characters and stroke are arranged and have been formed one-to-one relationship, are called the stroke permutation code, are called for short stroke code.Stroke code is arranged on keyboard in order, easily memory.
3, select a small amount of Chinese radical or radicals by which characters are arranged in traditional Chinese dictionaries as coding characteristic, these radicals or radicals by which characters are arranged in traditional Chinese dictionaries are called the coding radical, and with its be assigned to digital 0-9 and 25 English alphabets except that z strong on, be defined as follows: 1: wood 2, fire 3, soil (scholar) 4, gold 5, Rui 6, day (saying) 7, the moon (
) 8, mountain 9, stone 0, field q, Cannibals w, The-Fan e, youngster r, son (lonely) t, very little y, worm u, horse i, ten o, p, a few a, mouthful s, Xin d, Lv () f, order g, towel h, Quan j, eight k, people i, standing grain m, king x, Yan c, v, b, women n, big again
More than be by letter tactic on keyboard, expression is with the code word root in the bracket.Reduce radical quantity according to above-mentioned definition, and formed one-to-one relationship between coding radical and its code, be called package code.
4, single Chinese character is encoded, its method is:
(1) respectively get two strokes from prefix and suffix, totally four, wherein the first stroke and second 's permutation code constitutes first coding of this word, and pen second from the bottom constitutes second coding of this word with the permutation code of pen last, during four of less thaies by following principle processing:
For the single word, the repetition stroke permutation code of getting this stroke is promptly filled one as first coding, and second coding uses z as complement code then; For two words, first is encoded to the stroke permutation code, and second coding uses z as complement code; For three words, first stroke permutation code that is encoded to the first two, second coding got the 3rd repetition stroke permutation code;
(2) replace first coding of prefix stroke permutation code formation if prefix coding radical is then preferentially got package code, permutation code constitutes second coding if suffix coding radical is then preferentially got package code replacement suffix stroke;
(3) for the Chinese character of coding radical representative, first is encoded to package code, and second coding uses z as complement code;
(4) according to prefix, suffix two true single character codes be called stroke order code, it combines with phonetic, promptly get first, second letter of this word phonetic sign indicating number the 3rd and the 4th coding in order as this word, the phonetic sign indicating number has only one when alphabetical, second replaces with the space, can constitute four complete codings of this word like this, be called the four-stroke sequential syllable sign indicating number, be called for short four-stroke code or preface sound sign indicating number.
Following simplification and modification when the input Pinyin sign indicating number, have been carried out.
1. zh, ch, sh are replaced as one by z, c, s respectively;
2. ang, eng, ing, ong are replaced as one by g;
3. ü is replaced by u.
Owing to the phonetic sign indicating number has only been carried out a small amount of simplification and modification, so do not increase too much memory capacitance.
5, the radical word is encoded:
For part radical (definition is arranged in the character library), can get preceding first and second stroke code by general single character code method, then, the phonetic sign indicating number partly uses two character zz as complement code.
6, phrase being carried out Methods for Coding is to choose the stroke order code of first word of phrase and the stroke order code of suffix word forms the phrase coding.As much as possible two words and multiword phrase be can include, code efficiency and input speed helped improving.
To above coding embodiment, what time followingly specify:
When (1) getting prefix, suffix stroke code or prefix, suffix package code, generally speaking, can not reuse radical or stroke feature, get as " certainly " "
" afterwards, then can not get " order " for second;
To meet independency principle when (2) getting prefix, suffix radical, promptly be surrounded and wherein do not comprise other stroke, and also do not intersect with other stroke with the closed curve radical of will encoding, can not be as the prefix of " always " word as " soil ";
Can adopt brevity code when (3) using the four-stroke sequential syllable coding, promptly under the situation that repeated code do not occur, for some word can only get last, two or three as this word code, and when designing computer programs, can carry out screen and follow the trail of demonstration, do not need special memory, most of Chinese characters can adopt brevity code, improve code efficiency;
(4) this coding method repetition rate of coding is lower, and great majority are two word repeated codes in the repeated code word, like this, the desirable brevity code of everyday character when determining single character code, the word that is of little use still adopts all-key, and promptly four preface sound sign indicating numbers can further reduce the repetition rate of coding.
(5) phonetic, radical and stroke code can adopt lower case without exception.
Adopt the above-mentioned method of Chinese character coding, directly use prefix, suffix stroke permutation code or prefix, suffix package code, avoided coding method being simplified by the split process of Chinese character to radical as coding basis; Adopt prefix, suffix stroke permutation code or prefix, its feature of suffix package code clear and definite, rule is single, has avoided the multi-solution of coding rule; Stroke is arranged and is encoded and formed one-to-one relationship respectively between radical and its code, distributes in order, has reduced radical quantity and memory capacitance; Stroke, a preface, coding radical, phonetic etc. all belong to the Chinese character essential characteristic, and the coding radical all is radical commonly used and radicals by which characters are arranged in traditional Chinese dictionaries, is grasped by people easily, meets daily habits and standardization requirement, additionally do not increase people's learning content, the mode that adopts brevity code to combine with all-key has significantly reduced the repetition rate of coding, has reached higher target, adopt brevity code, phrase coding, and enlarge phrase quantity as far as possible, and having improved code efficiency and input speed, average code efficiency can be below triple bond.
The applicant thinks that above method is compared with current methods, as the Five-stroke Method, mainly contains following difference:
(1) though the forms that two kinds of methods have all adopted stroke and stroke to arrange,, this method is defined as the prefix stroke and the prefix stroke is arranged or suffix stroke and suffix stroke are arranged, and does not have this feature in the Five-stroke Method; The stroke permutation code is directly used in individual character or phrase coding in this method, and is used for the radical classification in the Five-stroke Method; To be used in individual character or the phrase coding be strict corresponding to the stroke permutation code in this method, and be undemanding in the Five-stroke Method.
(2) though two kinds of methods have all adopted the notion of radical,, refer in particular to prefix and suffix radical in this method, and be not to utilize this feature in the Five-stroke Method; Adopt a small amount of radical in this method, and comprised a large amount of radicals in the Five-stroke Method; Radical is corresponding one by one with its code in this method, and the Five-stroke Method usefulness more than one yard, the radical that adopts in this method is relatively more commonly used, standard, has adopted some non-common or non-standard radicals in the Five-stroke Method.
(3) coding method difference, this method are emphasized prefix, suffix feature, stress prefix, suffix stroke feature in particular, and combine with a preface, phonetic, are the main foundations that is different from current methods.Stroke code in individual character and phrase Application in Coding in the highest flight.
(4) improvement of this method has produced positive, outstanding effect.
Above method has been carried out fully open, and people can determine individual character or phrase coding in view of the above, and can design relevant calculation machine program.
Claims (6)
1, about a kind of method of computer Chiense character code, Chinese-character stroke is divided into five kinds of fundamental types, arrange in twos then and form one group, select part Chinese radical and radicals by which characters are arranged in traditional Chinese dictionaries as the coding radical simultaneously, these strokes are arranged and the coding radical is assigned to respectively on the numeral of computer keyboard and the character keys and forms corresponding code, carry out individual character and phrase coding on this basis, the present invention is characterized in: utilize prefix, suffix stroke or prefix, the suffix radical of Chinese character, or individual character and phrase are encoded in conjunction with phonetic transcriptions of Chinese characters.
2, in accordance with the method for claim 1, when utilizing prefix, suffix radical, be respectively to get two strokes from prefix, suffix by a Chinese-character writing preface, totally four, wherein the stroke permutation code of the first two is as a place Chinese character coding, two the stroke permutation code in back is as the another one encode Chinese characters for computer, adopts allonge and complement code mode to handle during four of less thaies.
3, in accordance with the method for claim 1, prefix, suffix radical are chosen from prefix, suffix by a Chinese-character writing preface and the requirement of radical independence, it is characterized in that having selected for use a small amount of radical as the coding radical, and and its code between form one-to-one relationship.
4, in accordance with the method for claim 1, have priority when utilizing prefix, suffix radical, when promptly prefix, suffix run into the coding radical, preferentially get package code and replace the stroke permutation code.
5, in accordance with the method for claim 1, when stroke order code combines with the phonetic sign indicating number, generally select the first two letter of phonetic transcriptions of Chinese characters according to the order of sequence for use, phonetic alphabet can keep true form, and brevity code or modification sign indicating number also can omit.
6, in accordance with the method for claim 1, phrase coding is to utilize first word of phrase and the stroke order code composition of tail word not.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN95110379A CN1100288C (en) | 1995-03-25 | 1995-03-25 | Four-stroke sequential syllable Chinese character coding method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN95110379A CN1100288C (en) | 1995-03-25 | 1995-03-25 | Four-stroke sequential syllable Chinese character coding method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1132366A true CN1132366A (en) | 1996-10-02 |
CN1100288C CN1100288C (en) | 2003-01-29 |
Family
ID=5077773
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN95110379A Expired - Fee Related CN1100288C (en) | 1995-03-25 | 1995-03-25 | Four-stroke sequential syllable Chinese character coding method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1100288C (en) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1031302C (en) * | 1993-05-31 | 1996-03-13 | 王伟 | Associated Chinese Character radical code input method |
CN1097072A (en) * | 1993-06-29 | 1995-01-04 | 曹红海 | Stroke order combined Chinese character coding method and keyboard |
-
1995
- 1995-03-25 CN CN95110379A patent/CN1100288C/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1100288C (en) | 2003-01-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1102714A (en) | Chinese character input method and keyboard based on two strokes and two-stroke symbol | |
CN1132366A (en) | Four-stroke sequential syllable Chinese character coding method | |
CN1177271C (en) | Four-stroke number code input method for characters and words and without duplication code and its keyboard | |
CN1220931C (en) | Sound-shape digital Chinese character input method | |
CN1284066C (en) | Three strokes code method for inputting Chinese characters into computer as well as its keyboard | |
CN1107593A (en) | Chinese-character phonic and shape, longitudinal and latitudinal coding typing method for computer | |
CN1107898C (en) | Double-pair code Chinese character computerized inputting method | |
CN1068444C (en) | Method of Chinese-character coding | |
CN1178344A (en) | Four tone inputting method for Chinese characters | |
CN1125393C (en) | Chinese character encoding and inputting method and keyboard | |
CN2476059Y (en) | Keyboard for Jiang code input method | |
CN1049991C (en) | Chinese character coding method and keyboard thereof | |
CN1042768C (en) | Alphabetic Chinese character input keyboard and input method | |
CN1198199C (en) | Chinese character input method based on English keyboard | |
CN1049056C (en) | Chinese character row structure three-stroke screen display coding method and keyboard thereof | |
CN1049418A (en) | Chinese character keyboard input method for unified code computer | |
CN1060277C (en) | Chinese characters coding and input method for computer using sentences as input unit | |
CN1321924A (en) | Computer chinese character input method and keyboard | |
CN1256453A (en) | Foreign language syllable input method | |
CN1123425A (en) | Shape and phonetic four-code coding method for Chinese character and keyboard thereof | |
CN1109284C (en) | Multi-information code Chinese character input system for computer | |
CN1357815A (en) | Chinese character digital input method | |
CN1151540A (en) | 4-in-one code computer Chinese character coding input method | |
CN1061666A (en) | Microcomputer input chinese characters by voice-literal coding | |
CN1223396A (en) | Computer Chinese character radicals input method and keyboard |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |