CN1373408A - Chinese-character 'root code' input method for computer - Google Patents
Chinese-character 'root code' input method for computer Download PDFInfo
- Publication number
- CN1373408A CN1373408A CN 02104850 CN02104850A CN1373408A CN 1373408 A CN1373408 A CN 1373408A CN 02104850 CN02104850 CN 02104850 CN 02104850 A CN02104850 A CN 02104850A CN 1373408 A CN1373408 A CN 1373408A
- Authority
- CN
- China
- Prior art keywords
- character
- initial consonant
- word
- code
- chinese
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
A Chinese-character "Root code" input method for computer is an efficient "pronunciation-shape" code input method based on word cells and stroke shapes. It is characterized by that the initial consonant of a word and the first stroke are used for coding. its advantages are zero duplicate rate and easy mastering it.
Description
The present invention is " Chinese-character ' root code ' input method for computer ", and to get sound, it is auxilliary getting shape, is a kind of acoustic-visual code Chinese character input method.Domestic existing input method can be divided into " sound sign indicating number ", " font code " and " sound font code " several big classes, and its general characteristics are: " the sound sign indicating number is eager to learn not handy, and font code is handy not eager to learn ".How they, handle well interior together with " the sound font code " of attempting comprehensive the two advantage aspect the contradiction of " difficulty " and " speed ", have also stayed bigger room for improvement.The present invention by precise and appropriate decomposition and the exquisite keyboard layout to the individual character member, has realized the very-high performance of " large vocabulary; many brevity codes word; high tolerance, zero code duplication rate ", the unapproachable performance index of these other input method of Chinese character with the pronunciation-shape encode principle of " initial consonant got in word; shape is got the first sum of ", add very simple and direct fractionation mode and coding method, make this programme " easy to learn, quick practical ", properly solved the contradiction of input method of Chinese character " difficulty " with " speed ".
This input method is the sound font code, 4 yards of maximum code length have integrated easy of phonetic sign indicating number and stroke shape Chinese code distinguishing feature efficiently, by the coding principle of " initial consonant got in word; shape is got the first sum of ", realized the very-high performance of " large vocabulary, many brevity codes word, high tolerance; zero code duplication rate ", make coding natural and tripping, shorten the time of study to greatest extent, alleviate the burden of study.
The existence of at present domestic existing a large amount of various input method of Chinese character presents the situation of so-called " ten thousand yards Pentium ".Scheme is many, can be divided into following three classes substantially:
1. sound sign indicating number.Be represented as spelling yard.
The Chinese phonetic alphabet is meant China's de jure standards Scheme for the Chinese Phonetic Alphabet, adopts 25 English alphabets except that " V " on the standard English keyboard.Under spelling phonetic state, import Chinese character, require to squeeze into phonetic transcriptions of Chinese characters one by one, from shown phonetically similar word, choose needed Chinese character.This scheme is based upon on the basis of the Chinese phonetic alphabet, therefore, has certain Chinese phonetic alphabet basis person to learn and is easy to, but too many because of its repeated code, code selection is wasted time and energy, and speed is very slow, uses rather inconvenience.
2. font code.Be represented as: the Five-stroke Method.
The Five-stroke Method is the input method of Chinese character that Wang Yongmin finds out, and is that China uses the Chinese character entering technique wide, that influence is bigger at present.Though beginning need to learn the radical of memory more, left-hand seat is slower, because it has characteristics such as the repetition rate of coding is lower, can adapt to gradually after the study of process certain hour and the practice, input speed also can progressively improve.
3. sound font code: be represented as: natural code.
Natural code is that of finishing of Zhou Zhinong is auxilliary based on sound with shape, and the excellent functions of attempting the various input schemes of collection is the input method of Chinese character of one.The core of natural code is the phonetic input, has absorbed the keyboard layout of double-spelling Chinese character input method, and the input speech is main, and the part skill of font code has been introduced in the input of individual character.But spelling keyboard layout and font code part memory capacitance are bigger, and system is more numerous and diverse, on top of still need spend many time and efforts.
Purpose of the present invention, be break through the study of computer and use in Chinese character import this " bottleneck ", make input method of Chinese character more simple and practical, properly settle the contradiction of " eager to learn " and " handy " in the Chinese character input.
Technical solution of the present invention is as follows:
1. individual character member: the one, character refers to constitute the minimum Chinese character of single Chinese character, is made of character " wood soil soil " as " osmanthus " word; The 2nd, the form of a stroke or a combination of strokes refers to single or the block of the non-character that combined by single, is made of block " Rui " and character " again " as " Chinese " word;
2. coding principle: be summarised as two, the one, " initial consonant got in word ", promptly sign indicating number headed by its initial consonant got without exception in all individual characters; It is code that all characters in the individual character are also got its initial consonant, as: " osmanthus " word, the 1st yard initial consonant G that gets its this word, the 2nd yard initial consonant M that gets character " wood " word, the 3rd yard initial consonant T that gets character " soil " word, the code of " osmanthus " word is GMT so.The corresponding relation of each initial consonant code and key letter is:
Wherein, initial consonant " zh, ch, sh, n " is incorporated into respectively in the initial consonant " z, c, s, l "; " V " is as query key, in order to inquire about not clear coding; " N " is borrowing for initial consonant of unfamiliar word.
The 2nd, " shape is got the first sum of ", promptly " form of a stroke or a combination of strokes " of all formation Chinese characters carries out code fetch by " casting aside and press down folding " 5 classes its first sum of being divided into anyhow, code is respectively with them identical 5 the first sum of letters " horizontal E, perpendicular I, left-falling stroke A, right-falling stroke U, folding O ", as: " Chinese " word is got individual character initial consonant H for the 1st yard, the 2nd yard code " U " of getting shape " Rui ", the 3rd yard initial consonant Y that gets " again ", " Chinese " word code is HUY like this.
3. disassembly principle has following 3 points:
The one, " retracting by level ": all member is divided into 3 grades in regular turn by the rank height: " character → form of a stroke or a combination of strokes → single ".Single has only the first sum of, is the minimum form in the form of a stroke or a combination of strokes, for sake of convenience, classifies independent one-level as, and in fact belongs to " form of a stroke or a combination of strokes " member.During fractionation, the member that rank is high preferentially splits, but character can not be merely minimum next definite with stroke number, but specifically determines whether character from order of strokes observed in calligraphy direction.Split into as " changing " " the abundant th of Rolling ".
The 2nd, " word little preferential ": for the littler Chinese character that constitutes Chinese character, remove the back have in addition regulation except, should get stroke little as far as possible.Get " upright ten " as " zinc " code fetch for " xals " the right and do not get " suffering ".
The 3rd, " tearing one deck down open ": relation is divided into four layers between the stroke: " loosing → company → friendship → list ".Loosing is meant the distance of having living space between the member stroke, as: " power " word; Even be meant between the member stroke not have space length, as: " suffering " word; Friendship is meant between the member stroke mutually and intersects, as: " rich " word; Singly be meant the one stroke of member of formation, as: " one ".Under tear one deck open, promptly down tear one deck open by four ATM layer relationsATMs orders, having looses breaks, and does not have companys of tearing open of loosing, and does not have to connect and tears friendship open, hands over singulated pen, splits into single as " people " and casts aside and press down, " shellfish " splits into lower frame and " people ", " wealth " splits into " shellfish ".
The 4th, " taking into account custom ": the Chinese character that combines when character and single is during as the member of individual character, and this Chinese character can be torn open little not according to " word is little preferential ", to take into account custom, is convenient to understand.As: " sage " do not split into " again 11 " and splits into " soil "
4. coding method:
The one, single character code, 4 yards of all-key code lengths, first sign indicating number is got the Chinese character initial consonant sign indicating number, and its excess-three code is got " initial workpiece, inferior part, last part " code according to a preface by disassembly principle.Less than adds for 4 yards gets aftermost character, does not get last yard as having, then adding.As: " tree " word is got " wood is very little again " all-key and is: " smyc "; " Chinese " word " Rui again " adds to be got an end pen back all-key and is: " huyu "; " top " word adds and gets the most last character rear portion character for " fourth page or leaf shellfish " for another example, is encoded to " ddyr ".
The 2nd, phrase coding, i.e. the coding of the word string of forming by an above individual character, method is: two words groups are respectively got preceding two yards, and three words groups are got second yard of each prefix coee and last word, and the above phrase of three words is got first three word and is reached the most not prefix coee.As: " one " code fetch " yegm '; " sign indicating number " code fetch " ygmw "; " code inputting method " code fetch " ygmf ".
The 3rd, the brevity code coding: brevity code divides one-level, secondary, and three, method is: the input corresponding number is encoded earlier, adds space short in size key again.As the one-level brevity code: " ", key in " d " and add the space and get final product; The secondary brevity code: " greatly ", key entry " de " adds the space and gets final product.
5. fault-tolerant processing.So-called fault-tolerant processing promptly when individual character is imported, is made corresponding containing to the mistake that might occur and is handled.In have:
Initial consonant is fault-tolerant: zh, ch, sh, n respectively with z, c, s, the l initial consonant merge to use.
Distinguish fault-tolerant: when can not determine its initial consonant code when not knowing the pronunciation of individual character or character, can get corresponding single sign indicating number " EIAUO " according to the first stroke of a Chinese character of this word and make to substitute the sound sign indicating number, also available " N " makes the alternative initial consonant of unfamiliar word;
Multitone is fault-tolerant: polyphone provides the different coding of whole different pronunciations.
The useful effect that is had is compared in invention with background technology:
1. the individual character member is more simple and clear practical:
In the existing font code technology, its member often has only a kind of, i.e. radical, and radical is meticulous because of dividing, and general quantity is bigger, and coding and split the leeway that not have optimization has also increased the difficulty of memory capacitance and study.
The present invention is divided into two kinds of character and the forms of a stroke or a combination of strokes to the individual character member, not only concisely but also easy to perform, has broken away from the constraint of etymon list, and the memory of machinery is become the application of principle, makes to split and encode natural and tripping.
2. coding principle is more scientific and reasonable:
In the existing technology, the sound sign indicating number is only got its initial and the final, and is caused repeated code too much as the phonetic sign indicating number; Font code such as the Five-stroke Method are only got its package code, and each radical is different, and the study amount is excessive, grasp to be difficult for.Sound font code such as natural code, though try hard to integrate the former the eager to learn and handy characteristics of the latter, because on the principle of coding, do not break through, thereby, limited improvement can only be done.
The pronunciation-shape encode principle that the present invention adopts is only with " initial consonant got in word, and shape is got the first sum of " eight words, just summarize the main contents of scheme, break through various loaded down with trivial details things in original input method, made this law taking-up sound sign indicating number at an easy rate, can take out font code at an easy rate again.It is broken through part and is: for getting the part branch, provide unfamiliar word initial consonant " N ", made irrecognizable word also can compile out code easily; For getting the shape part, have same the first sum of " EIAUO " for code by the first sum of the getting of five classes, both make all get the shape member and obtained comprehensive covering, greatly simplified coding method again, make scheme easily learn practicality.
3. disassembly principle is directly perceived more naturally:
In the existing sound font code technology, because member does not have the branch of rank, so there is not the method that retracts by level.When retracting radical, just because of there is not other notion of level, splits and tear open from constant radical without exception, radical is different, brings difficulty to fractionation.And, also easily cause the code figure place very few owing to take to get greatly preferentially, form repeated code.
The present invention retracts by level owing to set up the rank notion of member, does not split out non-one stroke member from intersect stroke, and is therefore very directly perceived, is easy to retract fast, guaranteed the unique correctness that splits and encode.Owing to taked to get little principle of priority, add and improved code taking method, so each individual character can both take out the all-key of 4 codes, this just for distribute scientifically and rationally and adjust brevity codes at different levels, the repeated code that disperses provides great free space.
4. coding method is uniform:
In the existing sound font code technology, must use different coding methods according to different situations, as key name Chinese character, characterized radical, single Chinese character in five different separately coding methods is arranged all, increased the content and the complicated degree of scheme, also increased learner's learning burden.
The present invention's coding need not be distinguished the different situations of coded object, only uses with a kind of method and encodes, and has simplified the content of scheme, has alleviated the burden of study, makes this programme easy to learn, and is quick practical.
5. fault-tolerant technique has obtained system applies:
In the existing sound font code technology, also have some tolerant codes, but amount is few, acts on very little.
The fault-tolerance approach of system is provided among the present invention, has had that initial consonant is fault-tolerant, recognition is fault-tolerant, multitone is fault-tolerant.The system of these fault-tolerant techniques adopts, for typing provides great facility.Particularly distinguish fault-tolerant, the present invention's original creation especially.
Embodiment 1:(single character code)
Lee lmz, good hlz, compare bbb; Body tab, Chinese huy, carry ter; Full qa, down xe, inferior cu, step on do.
Embodiment 2:(phrase coding)
Principle yclw, sound sign indicating number ssms; Input method srfu, a sign indicating number ygms; Chinese people zgrm, ymtq feels proud and elated.
Embodiment 3:(sentence coding) the past people general purpose of making great efforts to study science is the understanding nature; present people Gcqt rama llla ysjx khxu d pbbh midb s lojj zara, xwze d rama then attempt to find out the ways and means of controlling nature and protect the mankind and make the life better.Zb sutk zeg co keza zara d fufu h sada le bahe ralm h gjsy slhd. still, if science improper use, its destructive power will be uncontrollable, thereby Dasr, khxu rlgi saya bedi, tudb phla juha s wefu keza, ykee is very fearful.Science itself is not guilty, and problem mainly is human s jmwu kdpx d.khxu besa s wegs d, and e wmtr zuyx zeye ralm has excessively abused science.gcdg?luya?l?khxu.
Claims (6)
1, a Chinese-character ' root code ' input method for computer with the something in common of existing Chinese-character keyboard input method, is with the English alphabet keys to be code element, 4 yards of maximum code length; This law code fetch is auxilliary based on sound with shape, it is characterized in that: with " character " and " form of a stroke or a combination of strokes " is two class A of geometric unitA that constitute whole Chinese characters, and wherein, " character " is meant the minimum Chinese character that constitutes single Chinese character, is made of character " wood soil soil " as " osmanthus " word; So-called " form of a stroke or a combination of strokes " is the block of one stroke or the non-character that combined by one stroke, as " Chinese " word by block " Rui " and character " again " formation; The code taking method of this law is: 1. initial consonant got in word: promptly sign indicating number headed by its initial consonant got without exception in all individual characters; It is code that all characters in the individual character are also got its initial consonant, as: " osmanthus " word, first yard initial consonant G that gets its this word, second yard initial consonant M that gets character " wood " word, trigram is got the initial consonant T of character " soil " word, and the code of " osmanthus " word is GMT so; 2. shape is got the first sum of: all constitute " form of a stroke or a combination of strokes " of Chinese character and carry out code fetch by " casting aside and press down folding " 5 classes its first sum of being divided into anyhow, code is respectively with them identical 5 the first sum of letters " horizontal E, perpendicular I, left-falling stroke A, right-falling stroke U, folding O ", as: " Chinese " word is got individual character initial consonant H for first yard, second yard code " U " of getting shape " Rui ", trigram is got the initial consonant Y of " again ", and " Chinese " word code is HUY like this.
2, input method according to claim 1 is characterized in that: in " initial consonant got in word " the corresponding to of the initial consonant of getting and keyboard:
The corresponding initial consonant q of key position Q, the corresponding y of corresponding t, Y of corresponding r, T of corresponding e, R of W corresponding w, E, the corresponding s of corresponding a, S of 0 corresponding o, P corresponding p, A and the corresponding l of corresponding k, L of corresponding j, K of corresponding h, J of corresponding g, H of corresponding f, G of sh, D corresponding d, F, N correspondence " unfamiliar word initial consonant ", the corresponding z of Z and the corresponding c of zh, X corresponding x, C and ch, V correspondence " query key ", the corresponding m of B corresponding b, M.
Wherein, initial consonant " zh, ch, sh, n " is incorporated into respectively in the initial consonant " z, c, s, l "; " V " is as query key, in order to inquire about not clear coding; " N " is borrowing for initial consonant of unfamiliar word.
3, input method according to claim 1 is characterized in that: can regard as with " right-falling stroke " pen for the first sum of " form of a stroke or a combination of strokes " with " point " and carry out code fetch for the first sum of " form of a stroke or a combination of strokes ".
4, input method according to claim 1 is characterized in that: when Chinese character was encoded, initial consonant zh, ch, sh, n be respectively with z, c, and s, l is referred to as " initial consonant is fault-tolerant " for borrowing for initial consonant, and for example: " woman " is encoded to " lo ".
5, input method according to claim 1, it is characterized in that: when Chinese character is encoded, to the Chinese character of not knowing pronunciation or character according to its " horizontal, vertical, cast aside, press down, folding " different first stroke of a Chinese character, use " E, I, A, U, O " letter to do to borrow respectively for initial consonant, also available " N " letter is done to borrow for initial consonant, be referred to as " distinguishing fault-tolerant ", for example: " broom " correct coding is " HFF ", and recognition tolerant code is " EFF " or " NFF ".
6, input method according to claim 1, it is characterized in that: when there is the situation of a word multitone in Chinese character, provide whole correct codings of various different pronunciations, be referred to as " multitone is fault-tolerant ", for example: " OK " is encoded to hcc when reading hang, is encoded to xc when reading xing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB021048509A CN1303505C (en) | 2002-02-11 | 2002-02-11 | Chinese-character 'root code' input method for computer |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB021048509A CN1303505C (en) | 2002-02-11 | 2002-02-11 | Chinese-character 'root code' input method for computer |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1373408A true CN1373408A (en) | 2002-10-09 |
CN1303505C CN1303505C (en) | 2007-03-07 |
Family
ID=4740131
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB021048509A Expired - Fee Related CN1303505C (en) | 2002-02-11 | 2002-02-11 | Chinese-character 'root code' input method for computer |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1303505C (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102520808A (en) * | 2011-11-29 | 2012-06-27 | 罗嗣孝 | Head and tail dual stroke Chinese character input method |
CN105446494A (en) * | 2015-10-08 | 2016-03-30 | 应炜强 | Digital soft keyboard shape code character input method |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1266220A (en) * | 1999-03-05 | 2000-09-13 | 司连志 | Phono configurational code Chinese character input method for equipment with digital key |
CN1146776C (en) * | 2000-07-27 | 2004-04-21 | 黄桂清 | Chinese-character sound code input method for computer |
-
2002
- 2002-02-11 CN CNB021048509A patent/CN1303505C/en not_active Expired - Fee Related
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102520808A (en) * | 2011-11-29 | 2012-06-27 | 罗嗣孝 | Head and tail dual stroke Chinese character input method |
CN105446494A (en) * | 2015-10-08 | 2016-03-30 | 应炜强 | Digital soft keyboard shape code character input method |
CN105446494B (en) * | 2015-10-08 | 2019-03-15 | 应炜强 | A kind of digital soft keyboard shape code Chinese character input method |
Also Published As
Publication number | Publication date |
---|---|
CN1303505C (en) | 2007-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1303505C (en) | Chinese-character 'root code' input method for computer | |
CN101046707A (en) | Input method for Chinese character of first pronunciation | |
CN1106146A (en) | Computer input method by computer Chinese-character phonology-tone coding and its keyboard | |
CN1200332C (en) | Chinese character sequence code input scheme | |
CN101063905B (en) | Sound and digital code Chinese-character input method | |
CN1115616C (en) | Method for inputting Yi-nationality characters to computer | |
CN1025540C (en) | Double-combination encoding method by use of initial consonants and vowels of Chinese syllables | |
CN1749933A (en) | Phonetic zone bit input method (letter, symbol key plan) | |
CN1107254C (en) | Positive negative dual-electrode initial-consonant vowel form code input system for Chinese characters | |
CN1119743C (en) | Word code input method | |
CN1207648C (en) | '5-3 code' and its keyboard | |
CN1614539A (en) | Initial consonant and vowel inputting method | |
CN1142474C (en) | Dictionary code Chinese character input method | |
CN1167994C (en) | Input method for Chinese character | |
CN1107594A (en) | Chinese-character computer typing method | |
CN1542593A (en) | Five strokes region shape Chinese input method | |
CN1139024C (en) | Chinese character L-code input system and keyboard | |
CN1147776C (en) | Concave-convex code Chinese character input method | |
CN1151540A (en) | 4-in-one code computer Chinese character coding input method | |
CN1609762A (en) | Binary syllabification | |
CN1098525A (en) | Profile phonetic compound code | |
CN1115050A (en) | Four-stroke character root coding method and its keyboard | |
CN1049565A (en) | Equal-length three-digit coding for chinese pictophonetic characters | |
CN1584796A (en) | Flashing Chinese inputting method | |
CN1074296A (en) | A kind of Chinese phonetic phoneme method of Chinese character coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |