CN101441517A - Double-division three-code input method - Google Patents
Double-division three-code input method Download PDFInfo
- Publication number
- CN101441517A CN101441517A CNA2007101928489A CN200710192848A CN101441517A CN 101441517 A CN101441517 A CN 101441517A CN A2007101928489 A CNA2007101928489 A CN A2007101928489A CN 200710192848 A CN200710192848 A CN 200710192848A CN 101441517 A CN101441517 A CN 101441517A
- Authority
- CN
- China
- Prior art keywords
- parts
- character
- code
- come
- stroke
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The invention relates to a duplex three-code input method, which comprises the following steps: after strokes of a Chinese character are classified into a horizontal line, a vertical line, an inclined line, a spot and a folded line, selecting 70 basic parts with high character forming efficiency, and classifying the basic parts into five areas of the horizontal line, the vertical line, the inclined line, the spot and the folded line; arranging six key positions for the basic parts of the horizontal line, the spot and the vertical line as first strokes, five key positions for the Chinese character taking the inclined line as the first stroke, and three key positions for the basic part of the folded line; and sequentially arraying the key positions in every two lists from left to right, dividing the Chinese characters into mixed characters and single characters, and dividing one mixed word into two words to code the words. During coding, the code length of an optimally stated single character is within three codes at most.
Description
Technical field
The invention belongs to the computer Chinese character input method, just Chinese character coding input method.It is being divided into Chinese character single character and combinde rqdical character, combinde rqdical character is divided into two by one-piece construction, and Hanzi component is arranged on the letter key input Chinese character of encoding according to the first stroke of a Chinese character with per two row, and the individual character code length is at most a trigram, therefore is called double-division three-code input method.
Background technology
The keyboard input is a most popular input method in the present input method of Chinese character.The coding of Chinese character is meant with Chinese character of one group of coded representation.Mainly be divided into sound sign indicating number, font code, phonetic-stroke code three classes input Chinese character.Utilize sound sign indicating number input Chinese character,, use the most extensive because easy to learn.But input speed is unhappy, also has a weakness, and that is exactly that unacquainted Chinese character can't be imported.Though the complicated difficult note of font code can be imported any unacquainted Chinese character, and often very fast.Font code is often sorted out coding with Hanzi component by modes such as pictograph, phonetic and strokes, because stroke is one of greatest invention since the dawn of human civilization, the input method of therefore pressing the stroke classification will surpass the input method of pressing phonetic and pictograph classification, and is often comparatively popular.
The Five-stroke Method is typical case's representative of font code, advantage is that repeated code is few, high input speed, but this input method is only utilized 25 letter key input Chinese characters, and the type frequency of also ignoring each Hanzi component just differs, and firmly Hanzi component is divided into five region and five-positions by the first stroke of a Chinese character, also divided five positions for the Hanzi component of the quite low disassemble head of type frequency, and the quite high first stroke of a Chinese character of type frequency be horizontal, vertical, the point Hanzi component also only got five districts, one or two key position has been wasted in this measure, can cause certain repeated code again.For reducing repeated code, the Five-stroke Method has to the first stroke of a Chinese character is cast aside in the district for the Rolling in the horizontal Hanzi component is incorporated into, Xin and the heart of the first stroke of a Chinese character for point is inserted in the folding district, this is unacceptable fully, because the actual sets word frequency of Rolling, Xin and the heart is higher than the type frequency sum of other parts that come on the same key far away, in fact the key of the row of making Rolling, the Xin and the heart should can be regarded as Heng Qu and Dian Qu, casts aside district and folding district but the Five-stroke Method is included into it firmly.The classification of Hanzi components such as " cars, nine, several " does not meet the rule of dividing by zoning yet.The Five-stroke Method with every district again by second be divided into horizontal, vertical, cast aside, press down, five of foldings, more somewhat forced, be example with perpendicular district, normally roll over for second, be arranged on perpendicular this position key of folding by concentrating, this can bring serious repeated code, therefore has to dispersed arrangement on each key.Unexpectedly have 10 parts not meet so-called position arrangement regulation in 25 key name parts, such rule can not be calculated rule in fact.Also have a bit, the Five-stroke Method has only been used 25 keys, wastes a letter key, if can make full use of 26 keys, then one, two, three brevity code can be more, and input speed also can increase, that is to say should be in addition than the Five-stroke Method input method of Chinese character faster.Other 86 editions the Five-stroke Methods also exist some Hanzi components do not meet spoken and written languages standard, quantity too much, split problem such as inconvenience.
Zheng's sign indicating number improves to some extent to this, 26 keys have been made full use of, the parts compliant, considered the type frequency of Chinese character during by first stroke of a Chinese character subregion, comparatively reasonable, can also exist more than the horizontal district point, erect to distinguish and lacked point, on keyboard, arrange orderly inadequately shortcoming, it is divided into main root, secondary root with Chinese character worse, and except first main root, Hanzi components such as other second main root, bag root, assorted root are all encoded as code name with two letters, although this helps reducing repeated code, but code taking rule is quite numerous and diverse, and the Hanzi component of choosing in addition is also more, and difficult note finds it difficult to learn.
For reducing Hanzi component, the way that at present many input methods all make up in twos by five kinds of basic strokes is encoded.This method is actually the influence that is subjected to the Five-stroke Method, do not fully take into account the situation that the type frequency of the particularly two strokes of Chinese-character stroke differs greatly equally, same surface seems uniform after being aligned on the keyboard, in fact each letter key is uneven in temperature, be easy to generate repeated code, also exist intuitively inadequately in addition, influence problems such as thinking.
For this reason, I have invented double-division three-code input method, and it has only used 70 Hanzi components, and parts are chosen rationally, and the subregion standard is imported advantages such as Chinese character is quick.But its individual character code length has four yards, causes duplication sometimes, and present up-to-date font code input method, Chinese words commonly used is only got the head of warehouse codes, inferior and suffix, and promptly every word limit is got 1 to 3 yard.Change the individual character code length into trigram as Cangjie's input method, Chinese words commonly used is only got head, inferior and suffix, because code taking rule make a big improvement, the repetition rate of coding is still very low.The three-code entering method repetition rate of coding that has is lower than four yards font codes.As seen as long as reasonable in design, there is trigram much of that during single character code.Get trigram, it is just much simple to split Chinese character.
Summary of the invention
Like this, present font code input method or Hanzi component is too much or lack of standardization, subregion is unreasonable, or split difficulty, directly perceived inadequately, influence thinking, or fail to make full use of 26 keys, otherwise the more than trigram of code length, the Chinese character of all failing to accomplish to import quickly and easily.
Relatively standard, subregion are reasonable to the purpose of this invention is to provide a kind of Hanzi component, split simple and direct, and the individual character code length be at most a trigram, import Chinese character fast computer input method for Chinese character be double-division three-code input method.
In order to reach the purpose of double-division three-code input method, the present invention is after various strokes classify as horizontal, vertical, left, points, discount by the regulation of State Language Work Committee with Chinese character, again from " information processing with GB13000.1 character set Hanzi component standard " selected 70 high Hanzi components of type frequency, it is not high that these selected Hanzi components also can comprise several type frequencies, but the similar Hanzi component of homology or similar shape is commonly referred to as the basic element of character.They are referred in five districts of horizontal, vertical, left, points, discount by the first stroke of a Chinese character.Again according to the height of type frequency of Hanzi component in each district decide key position that each district comprised what, calculate through science, the first stroke of a Chinese character is that horizontal stroke, point, perpendicular Hanzi component type frequency are higher, quantity is also more, each get six key positions, the first stroke of a Chinese character is taken second place for the Chinese character frequency of casting aside, get five key positions, the Hanzi component frequency of turning up pen is minimum, only arranges three key positions.Each Hanzi component is all strict to come on the letter key by the first stroke of a Chinese character and compatibility relation, accompanying drawing 1 is seen in distribution, this figure classifies a district as with two on QWERTY keyboard, from left to right classifying Dian Qu, Heng Qu, perpendicular district, left-falling stroke district, folding district as boundary with two successively arranges, has regularity, being the result that year concentrates on studies surplus in the of ten in person, is the creative place of maximum of the present invention.According to classifying arrange Hanzi component on the boundary by first stroke of a Chinese character subregion method as with two, the some district accounts for two row letter keys, six letter keys; Horizontal district accounts for two row letter keys, get six letter keys, perpendicular district accounts for two row letter keys, gets six letter keys, accounts for five letter keys though cast aside the district, get only two row also, folding district only accounts for three letter keys, also two row, thereby reached point, horizontal, vertical, cast aside, folding respectively distinguishes the type frequency difference, shared alphabetical bond number also thereby different but all accounts for the ingenious purposes of two row.Come than the input method that Hanzi component five districts five row are arranged, more regular undoubtedly, more meet the fingering custom, be a kind of huge advance made.Certainly close owing to perpendicular district with the type frequency of casting aside district's Hanzi component, but also the perpendicular district of regulation work accounts for five letters cases, casts aside the district and accounts for six letters cases.In addition also can be with Dian Qu, Heng Qu, the location swap of perpendicular district on keyboard.These all are to distortion of the present invention, still classify the best as with the enforcement that the present invention was lifted.In addition the present invention with point, horizontal, vertical, cast aside, five kinds of basic strokes of folding come respectively on Q, E, T, U, the O key, use corresponding alphabetic coding, owing to all being positioned at row, and only every a letter key, the as regular as clockwork that also seems, easy to learn.
The present invention has adopted the technology that will be divided into two in addition, and combinde rqdical character is divided into stem and surplus portion, again according to stem and surplus portion structure separately, the code length of stem or surplus portion is made to optimize regulation.This radical that makes that a large amount of radicals, particularly stroke are too much or very few and be not in daily use need not come on the key again, thereby the quantity of parts is greatly reduced.Two make code length shorten to trigram, and this also is that the present invention easily learns reason fast.
Again Chinese character is divided into two classes: a class is a single character, and a class is a combinde rqdical character.Single character divides two classes again: a class is itself to be exactly the Chinese character of the basic element of character, it is arranged on the letter key, be called single character in the key, coding rule is: become the code+the first sum of picture code+end stroke code of the word basic element of character, have only one Chinese character just only to get the code of the first sum of picture.Another kind of is the single character that a plurality of basic elements of character combine, it does not appear on the key, be called the outer single character of key, coding rule is to split into several basic elements of character by sequential write, get the code of code+the last basic element of character of code+second basic element of character of first basic element of character, encode, split into the basic element of character after, number of components has been got till the code of all parts less than three.
In combinde rqdical character when coding, will be divided into two this Chinese character by sequential write by one-piece construction, split into two parts, and the part of writing earlier is called first, is called for short stem, after the part write be called second portion, be called for short surplus portion.Coding rule is: stem is got the corresponding code name of first parts and the most last the basic element of character respectively and is encoded, getting the code name of first parts of surplus portion again encodes, when stem has only a basic element of character, can only get 1 yard at most, at this moment desirable 2 yards at most in surplus portion, the corresponding code name of getting first and the most last parts of surplus portion by sequential write is encoded respectively, and surplus portion has only parts to encode with regard to the code name of only getting these parts.
The individual character extracting code rule all will be followed and get big priority principle, promptly to preferentially press the many Hanzi component codings of stroke number, should guarantee to split out big as far as possible Hanzi component, and the number of times that splits to lack at every turn as far as possible, the many parts of stroke not split into the few parts of stroke by sequential write.
Can amplify out a rule according to this rule is exactly that many stroke members are drawn the i.e. five kinds of basic stroke priority encodings of parts than single certainly.Also will take into account in addition intuitively, the square frame shape of avoiding four bandings are closed is taken coding as basic elements of character such as " square frame mouths " apart by sequential write, and this rule is in fact also got big priority principle and amplified out.
A very nerve-wracking situation is arranged when splitting Chinese character, and that runs into exactly how this split when several basic element of character strokes intersected, and at this moment multiple method for splitting is often arranged.I have successfully solved this difficult problem throughout the world finally through concentrating on studies of reaching surplus in the of ten year.Be that convenient character splits, special provision by sequential write and first three and above stroke intersect after write stroke and must take out separately, by single stroke coding, the basic element of character makes an exception.The basic element of character can not split again, should not with in the basic element of character with a plurality of strokes intersect after write stroke and split out coding separately.
Some Chinese character, the basic element of character that they comprise is identical, and just the position difference of the basic element of character for distinguishing the coding of these Chinese characters, makes its not repeated code, must increase font information, is distinguished with distinguish yard.The font of Chinese character can be divided into independent body type and fit type two classes, and fit type accounts for 96% of Chinese character, need fill distinguish yard during fit type Chinese character is not enough trigram.Fit type can be divided into left right model again, go up mo(u)ld bottom half, encirclement type, uses ", ", ". ", "/" expression respectively.Method is: have only its font encoding of filling of two yards.Certainly also available first letter of pinyin is made distinguish yard, does not even consider the font information of Chinese character, participates in coding without distinguish yard.
Utilize input method software, on keyboard, knock the key at certain Chinese character respective coding place and just can import this Chinese character.
Description of drawings
Fig. 1 is double-division three-code input method basic element of character keyboard arrangement figure
Embodiment
Elaborate below in conjunction with preferred embodiment and accompanying drawing.
The present invention when coding to the complete science of the understanding of Chinese-character stroke.Stroke is that the minimum that constitutes the regular script Chinese character pattern connects a unit, the lines that one-time continuous is write as when being writing Chinese characters.Regulation by the State Language Work Committee, when disregarding its weight length only considering the wieling the pen direction of Chinese character, can be divided into five kinds of basic strokes of horizontal, vertical, left, points, discount when promptly only considering its form of a stroke or a combination of strokes, wherein carry and incorporate horizontal stroke into, lifting-hook is incorporated into perpendicular, right-falling stroke is incorporated into a little, and various folding pens are all rolled over, and that is to say that the stroke of other various band turnovers is all rolled over.The State Language Work Committee claims that five kinds of basic strokes are horizontal, vertical, left, points, discount, and tend in person claim horizontal, vertical, cast aside, press down, folding.Because the stroke of point is very short, does not resemble and have certain length other stroke, and the direction of wieling the pen sometimes is sagging, almost with cast aside identical.Certainly the State Language Work Committee regulation claims it a little may is because put into word, and type frequency is higher than right-falling stroke.Owing to be the regulation of State Language Work Committee, have to observe, the State Language Work Committee also can stipulate to claim to press down also to allow in fact.The word-building unit with assembly Chinese word function that Hanzi component is made up of several strokes, Hanzi component have many stroke members and single to draw the branch of parts, and single is drawn parts and is five kinds of basic strokes in the present invention.
Then Chinese character is divided into combinde rqdical character and single character two classes, about combinde rqdical character is meant and has, about, the inside and outside is the Chinese character of investing mechanism, its two parts often have tangible boundary line.About single character is meant and does not have, about, the Chinese character of about structure, its stroke often intersects adhesion, one integrated mass.
Definition of Part of Chinese Characters is identical with the regulation of State Language Work Committee, and it is the geostationary stroke structure that is made of several strokes, can constitute Chinese character after the combination.Hanzi component is pressed nearly more than 600 of " information processing GB13000.1 character set Hanzi component standard " regulations, and this is the cause that the regulation intersection is not torn open.Like this regulation is less rational in fact, it will be argued that when certain stroke and first three to reach with last stroke when crossing, and the stroke of writing after this must split out separately.Form if regulation so, then a large amount of so-called Hanzi components are actually to be pieced together by other several Hanzi components, can get rid of these so-called Hanzi components fully.For reducing memory capacitance, selected 70 high Hanzi components of type frequency from " information processing with GB13000.1 character set Hanzi component standard ", these selected Hanzi components are called the basic element of character in the present invention, and they are referred to five districts of horizontal, vertical, left, points, discount by the first stroke of a Chinese character.The height of the type frequency of the basic element of character in mainly distinguishing again according to each, the number of taking into account the basic element of character decide the key position that comprises in each district what, calculate through science, the first stroke of a Chinese character is that horizontal stroke, point, perpendicular Hanzi component type frequency are higher, quantity is also more, each get six key positions, the first stroke of a Chinese character is taken second place for the Chinese character frequency of casting aside, and gets five key positions, the Hanzi component type frequency of turning up pen is minimum, only arranges three key positions.For ease of memory with take into account fingering operation, and with reference to Xinhua dictionary radicals by which characters are arranged in traditional Chinese dictionaries arrangement regulation, by point, horizontal, vertical, cast aside, the order subregion of folding from left to right is arranged in order.District's parts are come respectively on six letters cases of QWERTY keyboard Far Left two row, promptly come Q, A, Z, W, S, on the X, horizontal district parts are come respectively on six letters cases of the right two row in QWERTY keyboard mid point district, promptly come E, D, C, R, F, on the V, perpendicular district parts are come respectively on six letters cases of the right two row in horizontal district in the QWERTY keyboard, promptly come T, G, B, Y, H, on the N, come respectively on five letter keys that perpendicular the right of distinguishing two is listed as in the QWERTY keyboard casting aside district's parts, promptly come U, J, M, I, on the K, folding district parts come respectively on three letter keys of rightmost two row of QWERTY keyboard.Accompanying drawing 1 is seen in the distribution of each basic element of character on letter key.Specifically, Dian, the Tou in the some district, speech, Yan, parts such as wide come on the Q key, make code with Q; Upright, Epileptic, Ha,
, parts such as Bing come on the A key, make code with A; Parts such as Rui, water come on the Z key, make code with Z; Parts such as fire and Xiangxi come on the W, make code with W; Parts such as Xin, the heart come on the S, make code with S; Parts such as Mi, Http, Chuo, Yi come on the X, make code with X; King, one,
Come on the E Deng parts, make code with E; Parts such as soil come on the D, make code with D; Greatly, parts such as stone come on the C, make code with C; Rolling, parts such as very little come on the R, make code with R; Parts such as wood come to be put on the F, makes code with F; Parts such as worker, seven, Lv come on the V, make code with V; End, parts such as worm, Shu come on the T, make code with T; Parts such as day, Dao come on the G, make code with G; Parts such as little, mountain come on the G, make code with G; Parts such as order, field come on the Y, make code with Y; Parts such as mouth come on the H, make code with H; Parts such as shellfish, Jiong come on the N, make code with N; Parts such as standing grain, Zhu, , The-Fan, Fan, Pie come on the U, make code with U; Parts such as Ren, Ren , Qe come on the J, make code with J; Eight, parts such as Jin, gold come on the M, make code with M; The moon, youngster,
Come on the I Deng parts, make code with I; Bao, , parts such as several come on the K, make code with K; Parts such as second, horse, corpse, the sixth of the twelve Earthly Branches, Fu come on the O, make code with O, and second is represented all folding strokes; Woman, oneself, wait parts to come on the L again, make code with L; Parts such as Si, power, Si come on the L, make code with L; For ease of remembering and following custom, the parts that the individual groups word frequency is not high also are aligned on the key, and they might not be sorted out by the first stroke of a Chinese character, but are referred in the high basic element of character of type frequency by homology and nearly shape., Jin moisture as Rui contains Jin, and contains parts such as bamboo.Be the minimizing repeated code, and be convenient to distinguish repeated code, often have a mind to make the end stroke difference of the basic element of character on each key with distinguish yard.
The code fetch number originally need not special provision, can get complete from the beginning to the end.But Chinese words is mostly complex-shaped loaded down with trivial details, gets one by one entirely, and it is time-consuming to take up one's energy on the contrary, is as good as with hand-written.Optimal code fetch number should be to differentiate all Chinese words, and the reasonable person of the repetition rate of coding.Through further investigation, it is more satisfactory getting trigram.Because the radical of Chinese character has only 200, commonly used has only 30, and 2/3rds combinde rqdical character commanded in these 30 radicals, and 1/3rd combinde rqdical character only commanded in all the other 170 radicals.Be to reduce repeated code, 30 common radicals should preferably come out to be arranged on the letter key, only get one yard, to all the other not too common radicals, because its commander's Chinese character is few, and often less than two, 30, so desirable two yards of this class radical.
The coding rule of single character is in the key: become the code+the first sum of picture code+end stroke code of the word basic element of character, have only one the code of just getting the first stroke." speech " word for example, the code of speech is Q; The first stroke of a Chinese character is a little, and code is Q; The end pen is horizontal, and code is E, and the coding of speech is QQE just." one " word and for example, one code is E, the first sum of is horizontal, code is E, " once " coding be EE.
Another kind of is the single character that a plurality of basic elements of character combine, it does not appear on the key, be called the outer single character of key, coding rule is to split into several basic elements of character by sequential write, get the code of code+the most last basic element of character of code+second basic element of character of first basic element of character, encode, split into the basic element of character after, number of components has been got till all parts less than three.Get head, inferior and suffix when that is to say the outer single character code fetch of key in regular turn.As " just " word, be single character, get code E, I, the Q of horizontal stroke, the moon, point by sequential write, the coding of " just " is EIQ just.
During by the coding of combinde rqdical character, to this Chinese character be divided into two by one-piece construction by sequential write, split into two parts, the part of writing earlier is called first, be called for short stem, the part that promptly comprises by sequential write the first stroke is a stem, and the part that remainder is write promptly is called second portion, is called for short surplus portion.Coding rule is: stem is got the corresponding code name of first parts and the most last the basic element of character respectively and is encoded, getting the code name of first parts of surplus portion again encodes, when stem has only a basic element of character, can only get 1 yard at most, at this moment desirable 2 yards at most in surplus portion, the corresponding code name of getting first and the most last parts of surplus portion by sequential write is encoded respectively, and surplus portion has only parts to encode with regard to the code name of only getting these parts.
Individual skill is arranged when being divided into two, and that is exactly to be divided into two at obvious gap location, is divided into two parts.If the most last pen of many strokes basic element of character be horizontal, the centre below horizontal has perpendicular, also will be divided into two, should many stroke members and the differentiation of other parts, as " walking " word, its stem should be divided into soil, ends, " foot " although two parts do not have the gap, also will be divided into mouth, end two parts too.Although independent point and left-falling stroke may have certain clearance with other parts sometimes, can not be divided into two.
The principle that will hold during fractionation is: press sequential write, all split out the stroke number basic element of character as much as possible at every turn, and to take into account directly perceived, each basic element of character can be by non-intersect fractionation just by non-intersect fractionation, special provision is when running into the stroke that certain stroke three stroke in front or more strokes intersect, this stroke must split out coding separately, but except the basic element of character.Headache is to run into crossing stroke how to split in the input method of Chinese character, and as the first half of " Cao " word, different input methods has different method for splitting, has brought serious inconvenience to the beginner.This special provision has then solved the medium-term and long-term unsolved difficult problem of input method of Chinese character, make " Cao " word the first half the centre two perpendicularly must split into two basic strokes codings separately.Stipulate also that in addition the basic element of character must meet sequential write fully, mustn't insert other stroke, if inserted other stroke in the writing process, then do not become the basic element of character, but except the square frame oral area because " state ", " because of " etc. word in accordance with regulations the finishing touch horizontal stroke must write at last.The input method that has splits into left-falling stroke, worm, Jiong with words such as " Yu ", and this is to violate sequential write, has also increased learning difficulty.
As " volume " word, its stem has only a basic element of character Si, and code is P, and first and the most last basic element of character Dian, Nian can get by sequential write at this moment surplus portion, and code is respectively Q, V, and coding is PQV just.
Combinde rqdical character is if not enough trigram is filled the distinguish yard coding.The coding method of distinguish yard is that left right model, last mo(u)ld bottom half and heterozygous are represented to have only two yards Chinese character with three mutually different punctuation marks such as usefulness ", ", ". " and "/" respectively, needs fill behind the code of these two basic elements of character its font encoding.It may be noted that with left right model, go up mo(u)ld bottom half and heterozygous is used "; " respectively, ". " and "/" coding also be an innovation, because one is simple and easy to note, two come the position of distinguish yard can not appear at first position of encode Chinese characters for computer, do not influence the punctuation mark input.As " Du " word, the first two parts is " wood ", " soil ", and code is F, D, and four yards of less thaies need be filled distinguish yard: be up-down structure, and therefore with ". " coding, being encoded to of " man " " FD. " like this.Single character needn't be mended distinguish yard.
For Chinese characters in common use, as if one, two in the front of only getting its complete coding, fill space bar again, just constituted the I and II brevity code.All more than the input method of 25 keys, the repetition rate of coding is very low again, so the individual character input speed will be hurry up than the input method of 25 keys for the quantity of I and II brevity code in this input method.Words compatibility of the present invention is not because word coding method length difference can produce the duplication problem.
The present invention has a little repeated code, but influence input speed hardly, the people who does not have repeated code for undue pursuit, coding rule also can change into: if stem is the combinde rqdical character of the basic element of character, getting first and second corresponding code name that reaches the most last parts of surplus portion encodes, single character is still got first and second and the most last component coding, if stem is the combinde rqdical character of the non-whole basic element of character, stem is got first and the most last component coding, first component coding is got by surplus portion, that is to say that single character and stem are that the coding rule of combinde rqdical character of the non-whole basic element of character is constant.
For improving input speed, present all kinds of input methods all provide the function of word input, and this input method is also like this, no matter the length of regulation word, its code length all is 4 yards, and the words compatibility.The coding rule of word is: two words, get preceding 2 yards of complete coding of each word respectively.As the coding of word " process ", get " mistake " the first two yard R, X respectively, the first two yard U, the H of " journey ", the coding of process is RXUH just.The 1st yard of complete coding of the first two word, preceding 2 yards of getting triliteral complete coding got respectively in three words.As the coding of word " computing machine ", get meter respectively first yard, the code Q of parts Yan, first yard of calculation is the code U of Bu spare , and the first two of machine yard is parts wood, several code F, K, and the code of " computing machine " is QUFK just.The multi-character words of four words and four above words gets the the 1st, the 2nd, the 3rd and first yard of the complete coding of the last character respectively.The phrase input is the important method that improves input speed.To utilize the phrase input as far as possible.
For the ease of using, also be provided with tolerant code, the Chinese character that import also can appear in the Chinese character to some codings are made mistakes easily when mistake is imported.
Claims (4)
1, a kind of computer input method for Chinese character is a double-division three-code input method, after various strokes classify as horizontal, vertical, left, points, discount by the regulation of State Language Work Committee with Chinese character, it is characterized in that: selected 100 high Hanzi components of type frequency, as the basic element of character, they are referred in five districts of horizontal, vertical, left, points, discount by the first stroke of a Chinese character; Again according to the height of type frequency of the basic element of character in each district decide key position that each district comprises what, through measuring and calculating, the first stroke of a Chinese character is that six key positions each got in horizontal stroke, point, the perpendicular basic element of character, the first stroke of a Chinese character is taken second place for the Chinese character frequency of casting aside, get five key positions, the basic element of character frequency of turning up pen is minimum, only arranges three key positions; From left to right classify the boundary as with per two successively and arrange by point, order subregion horizontal, vertical, that cast aside, roll over; Dian, Tou in the some district, speech, Yan, parts such as wide come on the Q key, make code with Q; , Epileptic, Ha,
, parts such as Bing come on the A key, make code with A; Parts such as Rui, water come on the Z key, make code with Z; Parts such as fire and Xiangxi come on the W, make code with W; Parts such as Xin, the heart come on the S, make code with S; Parts such as Mi, Http, Chuo, Yi come on the X, make code with X; , one, parts such as main come on the E, make code with E; Parts such as soil come on the D, make code with D; Greatly, parts such as stone come on the C, make code with C; Rolling, parts such as very little come on the R, make code with R; Come Deng parts and to put on the F, make code with F; Parts such as worker, seven, Lv come on the V, make code with V; , worm, | wait parts to come on the T, make code with T; Parts such as day, Dao come on the G, make code with G; Parts such as little, mountain come on the G, make code with G; , parts such as field come on the Y, make code with Y; Parts such as mouth come on the H, make code with H; , parts such as door come on the N, make code with N; , parts such as Zhu, , The-Fan, Fan, Pie come on the U, make code with U; Ren, people,
Come on the J Deng parts, make code with J; , parts such as Jin, gold come on the M, make code with M; , youngster,
Come on the I Deng parts, make code with I; Bao,
, parts such as several come on the K, make code with K; , parts such as horse, corpse, the sixth of the twelve Earthly Branches, Fu come on 0, make code with O, second is represented all folding strokes; , oneself, wait parts to come on the L again, make code with L; Parts such as Si, power, Si come on the L, make code with L; Make code with L; The parts that the individual groups word frequency is not high also are aligned on the key, and they might not be sorted out by the first stroke of a Chinese character, but are referred in the high basic element of character of type frequency by homology and nearly shape, and, Xiao Han , Jin moisture as Rui contain Jin, and contain parts such as bamboo;
The coding rule of single character is in the key: the code+the first sum of picture code+end stroke code that becomes the word basic element of character; The coding rule of the outer single character of key is to split into first and second by sequential write to encode with the most last the basic element of character, till the code of having got all parts of three parts of less than; The coding rule of combinde rqdical character is: stem is got the corresponding code name of first parts and the most last the basic element of character respectively and is encoded, getting the code name of first parts of surplus portion again encodes, when stem has only a basic element of character, can only get 1 yard at most, at this moment desirable 2 yards at most in surplus portion, the corresponding code name of getting first and the most last parts of surplus portion by sequential write is encoded respectively, and surplus portion has only parts to encode with regard to the code name of only getting these parts;
Utilize input method software, on keyboard, knock the key at certain Chinese character respective coding place and just can import this Chinese character.
2, double-division three-code input method according to claim 1, it is characterized in that: the coding method of distinguish yard be with left right model, go up mo(u)ld bottom half and this three classes font of encirclement type use respectively three mutually different punctuation marks such as "; ", ". " and "/" expression, have only two yards combinde rqdical character, need to fill its font encoding earlier in these two yards back; Single character needn't be mended distinguish yard.
3, double-division three-code input method according to claim 1, it is characterized in that: the individual character extracting code rule all will be followed and get big priority principle, promptly to preferentially press the many basic element of character codings of stroke number, also to take into account directly perceived, avoid the basic element of character of square frame shape that four bandings are closed to take coding apart by sequential write, special provision by sequential write and first three or more stroke intersect after write stroke and must take out separately, by single basic stroke coding, basic element of character exception.
4, double-division three-code input method according to claim 1 is characterized in that: the coding rule of phrase is: two words, get preceding 2 yards of complete coding of each word respectively; The 1st yard of complete coding of the first two word, preceding 2 yards of getting triliteral complete coding got respectively in three words; Four words and multi-character words get the the 1st, the 2nd, the 3rd and first yard of the complete coding of the last character respectively.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007101928489A CN101441517A (en) | 2007-11-19 | 2007-11-19 | Double-division three-code input method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007101928489A CN101441517A (en) | 2007-11-19 | 2007-11-19 | Double-division three-code input method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101441517A true CN101441517A (en) | 2009-05-27 |
Family
ID=40725975
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2007101928489A Pending CN101441517A (en) | 2007-11-19 | 2007-11-19 | Double-division three-code input method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101441517A (en) |
-
2007
- 2007-11-19 CN CNA2007101928489A patent/CN101441517A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103616960A (en) | Six vowel binary syllabification input method | |
CN102053719B (en) | Input method for Chinese characters | |
CN100498662C (en) | Vowel pinyin Chinese characters input method | |
CN101192102A (en) | 2626-key input method | |
CN101192103A (en) | Split input method | |
CN103744532A (en) | 26 radical root Chinese and English harmonic inputting method | |
CN100381985C (en) | Chinese character (structure code) input method and its device | |
CN101441517A (en) | Double-division three-code input method | |
CN101236457A (en) | Phonetic and stroke Chinese input method | |
CN103207684A (en) | Phonemic letter double-input method | |
CN100501649C (en) | Shape-pronunciation encoding input method of Chinese characters | |
CN102339139A (en) | Three-class five-field input method | |
CN101751134A (en) | Right upper left Chinese character input method | |
CN102053718B (en) | For generating method and the keyboard input devices of Chinese character | |
CN103207685A (en) | T-shaped Chinese character code input method | |
CN102073382A (en) | Stroke, main and auxiliary radical input method | |
CN101470535A (en) | Optimized Chinese character code input method | |
CN1196057C (en) | One-code two-form quick Chinese digital coding input method | |
CN102073383A (en) | Initial component pinyin input method | |
CN102693070A (en) | Method for inputting character by a manner of drawing line | |
CN101458572A (en) | 25-word code Chinese character input method | |
CN101430604A (en) | Chinese character code input method | |
CN100456214C (en) | Chinese document quick-speed input processing technology and keyboard thereof | |
CN102436312A (en) | Three-category five-zone phonetic-configurational code | |
CN101706685A (en) | Chinese character input method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20090527 |