CN104123011B - Chinese character and Chinese phonetic alphabet coding input method - Google Patents

Chinese character and Chinese phonetic alphabet coding input method Download PDF

Info

Publication number
CN104123011B
CN104123011B CN201310637474.2A CN201310637474A CN104123011B CN 104123011 B CN104123011 B CN 104123011B CN 201310637474 A CN201310637474 A CN 201310637474A CN 104123011 B CN104123011 B CN 104123011B
Authority
CN
China
Prior art keywords
input
chinese
code
pinyin
mapping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310637474.2A
Other languages
Chinese (zh)
Other versions
CN104123011A (en
Inventor
韩恒瑞
韩正扬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xuzhou Jienuo Software Technology Co ltd
Original Assignee
Xuzhou Jienuo Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xuzhou Jienuo Software Technology Co ltd filed Critical Xuzhou Jienuo Software Technology Co ltd
Priority to CN201310637474.2A priority Critical patent/CN104123011B/en
Priority claimed from CN 200910149939 external-priority patent/CN101930292B/en
Publication of CN104123011A publication Critical patent/CN104123011A/en
Application granted granted Critical
Publication of CN104123011B publication Critical patent/CN104123011B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention provides a coding input method and application of Chinese characters and Chinese pinyin, which is a comprehensive coding input method of the shape, pronunciation and number of the former Chinese characters and a divisional application of the application, wherein the specific content comprises 5 aspects of double-spelling keyboard input of the Chinese characters and the Chinese pinyin, a Korean code keyboard, character or key position number level mapping, digital coding input of the Chinese characters and the Chinese pinyin, digital 4 tone input of the Chinese characters and the Chinese pinyin and the like, wherein 3 bright spots exist, and the Korean code keyboard can be regarded as a first-choice tool for English input, Chinese character 4 stroke code input, Chinese character double-spelling input and Chinese character and Chinese pinyin 4 tone input, is concise and has complete functions; secondly, a keyboard character and number level mapping method opens a new convenient door for the digital 4 tone input of the subsequent Chinese characters and Chinese pinyin; thirdly, the specific application of the hierarchical mapping method in Chinese character and Chinese pinyin digital coding input creates a shortcut for 4-tone pinyin input, and the input can be performed simultaneously with 4-tone pinyin input, and the input target can be pinyin or Chinese characters, thus being a pioneering invention.

Description

Chinese character and Chinese phonetic alphabet coding input method
Technical Field
The invention is a comprehensive coding method and application of the pictophonetic number of a Chinese character, it is a Chinese character input mainly used for computer, mobile phone, etc., and information exchange, information processing, Chinese character inquiry use comprehensive coding, input method, its core is to two-dimensional graphic characteristic and word-making characteristic to Chinese character, utilize the coded resource as far as possible, have fused the key element such as stroke, form code part, shuangpin, mapping, etc., have carried on the systematic, comprehensive integration, have formed and regarded four strokes as the core with the four-stroke hierarchical code, the keyboard resource utilizes and disposes the rational four-stroke code, input the architecture system, adopt the best choice of 3 codes to GB2312, have reduced 1 key in the whole than five-stroke, etc. input methods; for 70244 Chinese characters of GB18030, a level is added, 4 codes are adopted, and at most, nine repeated codes of 4 codes are obtained from the actual coding sequencing effect, so that the method is undoubtedly a great breakthrough compared with the currently popular 4 code system.
Background
The essence of Chinese character coding is only to establish a relatively stable database, in the past coding method, in order to pursue speed, more than 30 codes are adopted, the aim is to pursue the input effect of 3-key non-coincident codes, at present, 26 letters are promoted to be used, the input of the 3-key non-coincident codes of GB2312 is realized, the process is a long and gradual refining, numerous repetition, optimization and simplification process which goes through decades, the key point of integrating the shapes, the sounds and the numbers is that I find the mapping relation of key position numbers, the pinyin input and the pinyin number input become very simple, the digital input of the mobile phone becomes particularly simple and rapid, a complete comprehensive coding system is formed, and the aspects of easy learning, code length, practicability, speed and the like are comprehensively promoted. The memory of the four-stroke hierarchical shape code is very small, and is as little as about 50 types of coding combinations, thus fundamentally solving the problem of easy learning, and the character can be selected and combined into a position only according to the first stroke, the second stroke or the integral characteristic of a coding part without memorizing the number of the parts, which is the first creation in the field of Chinese character coding and is characterized in that:
1. the classification of the part is essential, for example if the 'big' is a coded part, what code should be used where it can be found. The large character is marked up horizontally, and the code is affirmatively in 9 letters of the 2 nd line; with a left-hand side behind the horizontal, that is the 2 nd letter s, the actual amount of memory is very small.
2. According to the character-making characteristics of Chinese characters, and according to the relative position relationship of 3 components in the character plane and the characteristics of 3-component codes and 3-level codes in form codes, 3-point type nine-palace structure codes are summarized, the optimum cut-in point of 3-code length and speed is found, and the mutual relationship of three points, such as 3 positive angles (general) and 3 negative angles (special) is found, as shown in the claim book table 2, so that the characters are better recorded, and a shortcut is found for code input, in particular for form codes.
The right side is an actual coding example when three codes are taken, and in the nine-palace lattice in the table, at least one example is a three-point structure which is particularly vivid and convenient to memorize, so that the table is named as a nine-palace structure table, and the structural codes are very beneficial to reducing repeated codes when used for inputting the shape codes.
Intermittent plane Wedding anger Support for children
Singing arch Rutting Pinsen Han
Lucky eraser Embrace bundle Remote and simultaneous creeping garden
3. The method fully utilizes the resource to make the character root window display input and the repeated code distinguishing key used as the window display input and the repeated code distinguishing key of the character root, has the function of showing the root, is favorable for improving the input speed, is beneficial to being beneficial to all the things and is certainly popular in the society.
4. The general structure of the shape, tone, and number of the coding system of the korean code is shown in table 1 below:
korean code overall structure configuration list (Table 1)
Figure 100002_1
Disclosure of Invention
Coding input method for Chinese character and Chinese phonetic alphabet
The coding method of four-stroke hierarchical shape code is the core content of claim 1, and comprises two parts of four classification 50 combinations and hierarchical coding methods of shape code components. Coding the Chinese character font with the number of inseparable parts, hundreds of font code parts, 560 according to the specification of GF3001, generally called radicals about 200 (189) according to the classification in a dictionary, and actually, the coding parts have a plurality of uncertainties, such as different coding methods, different number of selected characters, different length of coding codes and the like; the invention avoids the uncertainty, grasps the target with deeper coding, and divides four categories of points, vertical, horizontal and left-falling according to the part, thus the advantage of classification is that the so-called scale benefit is obtained, and the distribution is more uniform. The method divides 26 code letters into four classes according to the proportion of 5: 9: 7, and the four classes are exactly matched with the classification of the components. Aiming at the characters as 'original, or gold, dragon' as big as one coding component, the number of the coding components can be actually not defined, the method adopts a four-stroke hierarchical coding method, the components are classified into about 36 classes according to characteristics, then corresponding codes are determined, and the complicated code determination of hundreds of coding components is replaced, so that the method is the most concise component classification method and is the wonderful point of the invention, the learning is concise, the memory capacity is greatly reduced, and only the same order of magnitude as the universal letter quantity is achieved.
Four strokes are coded by 26 letters in a very concise way, the coincident code rate is very low, the four strokes are used for the ordering of a dictionary, the capacity of the four-stroke number character-checking method is larger, and the four-stroke number character-checking method is simpler and more than the radical character-checking method, namely, the four-stroke hierarchical codes are a very simple and effective coding invention, the four-stroke hierarchical codes are listed, so that the application of the four-stroke hierarchical codes in the aspects of word dictionary compiling, publishing and ordering is clearly emphasized, huge reverberation is generated to the society, the dictionary ordered according to the four-stroke hierarchical coding method is successfully compiled, and the link of stroke character checking is completely deleted, which is an epoch-making progress, and is explained according to two branch requirements of the four strokes and the hierarchy.
The 1.1 four strokes are classified by taking 4 strokes of dot vertical and horizontal left-falling strokes as coding components, the dot strokes occupy 5 code keys of qwerty at the upper row, the vertical strokes occupy 5 code keys of yuoop at the upper row, the horizontal strokes occupy 9 code keys of asdfghJkl at the middle row, the left-falling strokes occupy 7 code keys of zxcvbnm at the lower row, the three rows share 26 keys, the proportion distribution is 5: 9: 7, 5+ 5: 10, 9 and 7 just accord with the design of three rows of a computer keyboard, and the four types of components are convenient to memorize.
1.2, the 26 letters on the keyboard are allocated with four-stroke classification according to 5, 9 and 7, the proportion distribution just meets the actual requirements of the number of components and the use frequency, and the stroke type accounts for 20%, the vertical stroke accounts for 19%, the horizontal stroke accounts for 35% and the left-falling stroke accounts for 26% in the coding sorting statistics of the GB2312 character set (namely, 6763 Chinese characters are input once); the proportion of the 26 keys is respectively 19.2%, 19.2%, 34.6% and 27%, the minimum error is vertical, and 19.2% -19% is 0.2%; the maximum error is the class, and only 27% -26% is 1%, which is very consistent with the proportion of four code letters, which indicates that the use frequency of 26 letters is very similar, and is one of the most prominent successes of the invention.
1.3 the coding parts are reduced to about 36 combinations, which is another outstanding contribution of four strokes, and overcomes the problems of emphasizing the number of the coding parts and judging whether the split is standard in the aspect of Chinese character coding, wherein the four strokes of coding parts are compared with 560 standard parts of GF3001, the number of the coding parts is increased or decreased, the number of the coding parts is 531, and the like, and the corresponding example characters have 'Fu', which is approximately the same as the whole; specific 560 elements of the alphabetical arrangement in this method are exemplified below, without excluding variations:
1.3.1 Point-pen class, total q (u) we (h) rt5 codes:
q (u)1 'point cross-fold' large class, there is the question whether 'door' should be classified separately; the pen is characterized in that the starting pen is a point pen, the rear part of the starting pen has the stroke characteristic of transverse folding, and the total number of the starting pen is 14;
the w bond is also 1 major class, with 13 elements, characterized by 'two points', to which the elements ', ' also belong;
e (h) the key has 2 main categories, one 3 o ' clock, including lifting the prefix, the other right-falling stroke, including the right-falling stroke, as the component ' an ancient type of spoon ' belongs to this category, which is also for balance, there are 10 components;
the r key has 2 categories, one category is 4 points, including 'fire, heart, meter' and the like, the second category is a point skimming category, such as , is, as, states and the like, and 15 components are provided;
the t key has 1 category, one category is point and lifting (point folding), the second category is point and transverse, and 12 parts are classified;
the stylus has 8 major types of 64 parts.
1.3.2 vertical strokes, which share yuoop 5 codes;
the y key has 3 major categories, namely a 'foot' category comprising a nail, a lining, a fruit and the like, a lower opening comprising a lower opening such as an inner opening, a towel, a convex opening and the like, and a 'convex' category comprising 28 parts;
the u (q) key has 3 categories, namely upper opening (vertical folding), such as central, swastika, concave, yun, song, etc., secondly 'day', including't', etc., and thirdly 'mother, un' etc., and has 32 parts;
the (z) key has 2 major types, namely, erecting pens such as upper, mountain, small and the like, and middle and central types such as equal and 29 parts;
the O key has 2 categories, one is a mouth which is a larger component, and the other is a penetrating and fleeing (Shen) category, such as electricity, string, and the like, which has 13 components;
the P key has 3 categories, namely a double vertical category such as industry and non-industry, a multi-opening category such as field and eye, and a four black category such as vessel and 2 nd part of 'Zeng', and 21 parts are used;
the vertical stroke category has 13 major categories, 123 parts.
1.3.3 horizontal stroke, sharing asdfgh (e) jkj/nine codes:
the a key has 2 categories, one category is 'one' category which comprises front, worker and rain, the other category is right opening category, such as Contraband, tooth, tile, drooping, and the like, and the common characteristic is that the transverse and vertical directions are not intersected and 37 components are provided;
the s key has 1 category, is characterized in that the s key is transversely provided with a left-falling part, such as a west part, a large part, a magnolia part, a page part, a hundred part and the like, and is provided with 27 parts;
the d key has 1 large class, is a cross class, and is classified in a simpler way, wherein the constraint is included, and 12 components are provided;
the f key has 1 category which is two horizontal (single vertical) categories, such as dry, , special, soil, etc., and 19 components;
the g key has 2 categories, namely three horizontal keys, such as king, wir, Leishui and the like, and a left opening key, such as and then, Yi, blunt, three, ugly and the like, and has 38 parts;
h (e) there are 3 major types of bond, one is double (multiple) vertical (cross), such as thirty, +, sweet, common upper half, etc., and the other is transverse (double vertical), such as bara, also glance sideways, third doubled rear half, etc., there are 18 parts;
the J bond has 2 major categories, namely J category, such as bow, Cheng and Fu, and secondly leather category, such as brush, which has 37 parts;
there are 2 broad categories of k-bonds, which are transverse rear points, such as go, dog, pan, etc., where 'pan' is referred to as 'point' and where the lift point is coincident with T, there are 18 parts;
the key has 2 categories, 1 is a seven (7) type, such as vehicle, old, second, flying, etc., 2 is a horizontal two-point category, coming, flat, clip, etc., and 29 parts are provided;
the horizontal stroke category has 16 main categories, 235 parts.
1.3.4 classes of left-falling strokes, with z (i) xcvbnm7 codes:
z (i) the key has 1 major category, i.e. left-falling and horizontal category, such as qi, raw, I, character, ox, hand, etc., and has 15 parts;
the x bond has 2 categories, one is left-right (man) category, such as good, man, alpha, mortar, , the other is fork category, such as , , etc. there are 12 components;
the key c has 1 category, is a left-handed and transverse-folded category, such as , , immune, bird, fish and the like, and has 16 parts;
the v key has 1 major category, i.e., the left-turning or right-turning category, such as mai, different portion, Shi, and the upper half of the hair, there are 22 components;
the b bond has 2 major classes, i.e. leftfalling, such as horizontal, ninth, long, vertical, flaky, thousand, cereal, lingual, etc., and white classes, such as inferior, ghost, chimney, invar, etc., as explained herein, 26 parts;
the n key has 3 major categories, namely double-sided or double-sided categories such as grow, insect feet, year, generation and the like, secondly horizontal folding categories such as month, use, lead, volume, generation and the like, and thirdly self-boat categories such as body and the like, and has 30 parts;
the m key has 3 major categories, namely 3 types of left-falling or left-falling, such as , Sichuan, pawn, , the upper half of a star and the like, two types of characters, such as the upper half of an edible, bamboo and alloy, and three types of eight characters, such as the lower two points of income and sharing and the like, and 17 parts;
the cursive class has 13 major classes, 138 parts.
560 parts of four pens and GF3001 are classified, most parts are the same, 560 parts of the four pens have no part, for example, a handwritten part (one eight) in the table 1 belongs to a horizontal-left-falling class, the code is S, and the code of an example character 'long oil' is 'sy'; there are also four strokes which do not exist, such as the No. 521 No. 531 two parts which are split in the four strokes, the example character ' is coded as ' dborzbo ', ' xi ' is coded as ' wbs ', and the like; the coding parts are summarized into about 50 major classes in four strokes, and because edge wiping balls are inevitable in the classification, for example, the difference between two horizontal classes and a horizontal-vertical cross class is that the F key is double horizontal, the H key is double vertical, and the double horizontal and double vertical are edge wiping balls and the like; also as the distinction of the J/K/L bonds in the cross-fold type, the K bonds are of two types, 1 being the cross-point, 2 being the break-point, and the point being associated, e.g. 'again' from the break-point K, the cross-fold of L is similar to '7', 'so' is encoded as 'mk', 'so' is encoded as 'ml', etc.
2. The hierarchical coding method of Chinese characters is divided into four descriptions, namely, the hierarchy and the coding method of Chinese characters; second, 8 encoding rules; thirdly, the coding key description of the four-stroke hierarchical shape codes; and fourthly, the application of the four-stroke hierarchical configuration code.
2.1. Chinese character hierarchy and coding method
The four-stroke hierarchical code coding method is a 4-code-length coding system based on 70244 Chinese characters in GB18030, firstly, corresponding codes are adopted according to the character characteristics, coding components can be strokes, radicals, single-body characters or a plurality of characters, and the four-stroke hierarchical code has obvious flexible characteristics and omission and is very prominent in hierarchy; for complex Chinese characters, a reverse thinking method of taking root codes layer by layer is adopted, 8 coding rules such as priority according to code length and the like are adopted, and the coding of a plurality of character groups is carried out.
2.1. Hierarchy of Chinese characters
a, in the national standard basic character set, most common characters are two or 3 characters, for example, a bag is a double-character, a series of characters such as a full character, a bubble character, a cannon character, a blister character, a cell character, a embracing character and a bract character are 3 characters consisting of a bag and different components, the pronunciation of the Chinese character is similar to that of the bag, the components have different meanings, and the Chinese character has obvious horizontal (side-added) hierarchical characteristics;
b Chinese characters have obvious longitudinal (trend to be complex) features in hierarchy, such as mother, every, sensitive, complex and worm wood, wherein the hierarchy of worm wood is' + "- [ ], i.e. head, tail and tail, i.e. component codes taken in hierarchy are absolutely not arranged in front-to-back order;
c big, dog, pole, you, dragon, long, ridge, dragon, , , etc. illustrate that the rule of selecting parts such as one, two, three and the last is not applicable, the hierarchical coding is just like what key is used to unlock what lock, and is the most applicable method, like the last two characters, the code is the coding rule according to the combination of a plurality of characters when the code is selected, the coding rule is that 'head, tail, head, tail' and 'head, tail', the practical coding is 'tata, ttta', like the character consisting of two reversed 'or' characters, the code is 'ktktktktktktktt'.
2.2. Eight items of coding rules
The total number of Chinese characters in a new version GB18030 reaches 70244, the Chinese characters are increased later, no matter how complex and hierarchical coding methods of the Chinese characters exist, for example, one character in a newly added character set consists of 4 characters of 'western philosopher', the code consists of the first part of the 4 characters, the code is afkx, and for the whole coding system, in order to code more specifications, the following 18 large-size characters have certain representativeness, and eight coding rules are formulated as follows ( , , and ) (only sequence numbers are used for replacement);
1. the code length is preferably specific to the specific measures for reducing the repeated codes of the Chinese characters with few strokes and the Chinese characters with multiple strokes in the same coding system, and means that when a coding component is selected, the component number is close to the code length number (priority), the code length is increased but not allowed, the repeated codes are reduced due to the reduction of coding space, which is more prominent when coding is input, and the priority is only the purpose and is often realized by other rules; for example, for the above 5 words 2, 3, 5, 16, 18, the longevity word on the right side of the light word No. 2 is composed of 6 parts in rows, so as to obtain and balance omission matching, match the word No. 3, 16 with the formation preferentially, match the key point preferentially, match the heavy letter No. 18 with the trade-off 2, and the codes are: the "yfak, vjpi, mtop, sssi, zgb" words.
2. The number of the parts is two, firstly, the use frequency in the splitting of the Chinese characters is quite high, and if the number is too small, the setting value is not necessarily high; another is to see if such settings facilitate ease of input, as opposed to the requirement that the component settings described in GF3001 be discrete, non-detachable. In this method, the single body word is sometimes split into two parts, such as 'heavy' plumb '' for even distribution of codes; sometimes the two parts separated are regarded as one coding means, such as a 'bite', etc., and 'event' is thus classified into G; this facilitates the overall balance of the code components over the 26 letter code allocation. Taking the above-mentioned No. 6 and No. 7 as examples, the two writing methods of Kontaniu are involved, especially the third part of No. 6 and No. 7 is divided into two parts according to the lap joint, the codes of the former part are determined by the code u, and the codes of the latter part are determined by whether the code u exists, if the code u exists, the code u is used in the No. 9, and the codes are determined by the component setting principle; if not, the code is from 'eight' to'm', then the code is based on the code length priority principle and the component setting principle, the code of the 6 th word is vous, and the code of the 7 th word is vous or voum.
3. The sign-in is to strengthen the characteristic of the part, fade the number of the part, enter the number according to the characteristic pair number of the part, thus has reduced the admittance threshold of Chinese character code input, it is about 50 basic principles of part combination to generalize hundreds of parts, have greater flexibility. In the above word example, the first part of the number 1 word is more specific, the stroke is as many as 8, and the method is very concise according to the rule of sign-in, because the initial stroke is 'horizontal', the initial stroke must be in the second line of the keyboard, and the code must be one of 9 letter codes; because of the multiple horizontal lines, the range points to 'G' at a moment, and then the code of the number 1 word is 'gok' can be obtained quickly. Taking the code of 'as' reservoir 'as an example, the first stroke of the first two words is a point, the point is definitely one of' q, w, e, r and t ', the second stroke is a left-falling stroke, the part code points to' r 'immediately, and the codes of the two words are' r 'rr' respectively; the first stroke of the third word is 'left-falling', the part is classified and definitely in the third row of the keyboard, the subsequent '3 o' is pointed out, the part code points to'm' immediately, and the code is 'mbr'; in the same way, the codes of ' rice ' and ' ' are ' cnk ' ' mnk ', and the codes of birds ' and ' ' are ' ca ' ' ba ', which indicates that the codes of the characters are different due to different writing methods and simplifications. Like concave, convex, and part numbered 487 in GF3001, all set the code from the cocking pen in general characteristics, from uy and u, respectively.
4. The total balance is that the longevity word on the right side of the number 2 word has 6 parts, 3 parts are taken as codes in 6, the first part, the middle part and the last part are compared and balanced, wherein, the following 'forming word priority' is also referred to, and finally, 3 relatively suitable parts are taken, and the codes are 'yfak'.
5. The outstanding characteristics are particularly outstanding in ' win ' balance ' and other series Chinese characters, the shellfish, women, sheep and the like which are changed only in the middle are the main point characteristics, and the rest parts can be normalized or double-coded. In the system, the 'death' is set as a first part and the 'mouth' is set as a spare part, and the codes of the three words are respectively: toy, toy and tow. Therefore, in the coding system, the feature component is regarded as the priority of the main point, and the main point is omitted, so that the 'moon' and the like are omitted.
6. The word formation priority is a principle set for meeting the daily habits of the public. The sequential opening of the Kontai Wu is characterized by the advantage of easy character formation, which is especially prominent in the separation of the independent character, such as the character of 'Zhu', for example, the character is separated into 'horizontal separation' and 'un-horizontal separation', preferably, the character can be separated into 'wood', and the like.
7. The interleave 2 is a specific example of a part setting rule, and is 1 rule set for reducing an overlap code for interleave-repeat parts such as ' heavy, vertical, ' ' and the like. This is because not allowing the separation would result in over-concentration and uneven distribution of the components, and would result in more and complicated components; however, multiple splits are not allowed, which causes confusion of split, so that at most, only two component codes can be split by setting a complicated duplicate component, and the same stroke is limited and is not allowed to be repeatedly reflected in the two components. The 3 example word splitting codes are respectively set as: bu, bh, gb; although the strokes of the character No. 9 and the character No. 18 are many, the character No. 9 and the character No. 18 are generally divided into two parts, the 2 nd part applies the principle of splitting 2, the code of the character No. 9 is msy, and the code of the character No. 18 is zgb.
8. Brevity code setting brevity code is a general principle in code input, is widely adopted in a plurality of coding systems, and the specific application of the brevity code is illustrated in code input.
2.3 coding emphasis description of four-stroke hierarchical shape code
The choice of the hierarchical coding is independent of the writing order. For example, the word 'cangue' crossbow 'and the 1 st hierarchical component are' wood, bow, and "" respectively, whereby it can be seen that the hierarchical coding components are determined regardless of the writing order, and the 3 words of codes are 'dho, vkj, and xtq' in turn, from which the order in the codes can be seen, from the writing order.
2.3.1 the characters are sorted in the publication of the dictionary according to four-stroke codes, which is an important component of four-stroke application, and the application range can be expanded to the countries and regions using Chinese character systems such as Korean, Japanese, etc.
2.3.2 there are several unconventional coding and code-fetching settings in the four pens, as specified in (you, cheng, an ancient type of spoon, cun, kou) and so on:
1) win and omit the 3 rd code, omit the principle according to the characteristic, set up women, sheep, etc. as the code that must be chosen; the codes are tov and tow, respectively.
2) The coding method of Yige and Ge in the Chinese characters of Cheng, Er, Wu, Shi, Yue, Tibetan and the like is to make the components singly Yige class from K, and adhere Yige class from K. As in the 2 nd element, the code starts with a pen from the horizontal side and starts with s depending on the specification, the code is based on the premise that the pen is set up, and the first element code at the time of 'becoming' split is s depending on the habit or specification; the code of the latter part in the 'over' word is from a; its full codes are respectively: di-kfy, Chen-sj ', Wu-akl, Carrier-fl', Yue-fia, Tibetan-has, etc.
3) The strokes 'lifting' are regarded as points which are more similar and accord with the custom theory of 3-point water, so that the 'post-processing' of the bean curd block is classified from a transverse point to a transverse point from K in the component classification, and the method is more concise; such as cun or the code of kot, etc.
4) 'an ancient type of spoon' should be hooked vertically according to writing habit, this method regards 'an ancient type of spoon' as right-falling, folding (hooking) from point class e, because the part amount of water of three points is very large, and concentrate on before, make it fold from point class, play a role in balancing single bond burden; the head of the 'than' to be mentioned (i.e. part 43 of the specification) shall cross pen a according to the written order specification, and the head of the 'north' shall cross pen i from the upright pen i, the codes are ae and ie respectively.
5) Lines can sometimes be considered as one part code, e.g. the 3 code length code of a 'balanced' word can be ncs, with 4 code lengths still split into double codes. Some single-body characters, such as concave and convex strokes, are not obvious, but the general image characteristics are obvious, and the codes are respectively appointed by 'shapes', and the codes are respectively from u and y.
6) The character code is very prominent in the large character set, taking the 'original' character as an example, the character code is composed of 3 components of 'factory' white 'small', the character '' is arranged in the GB18030 large character set, obviously, the character is composed of 3 identical 'original' characters, so the character belongs to parallel according to the hierarchical relationship, if 3 codes are taken, the first component 'factory' of the 3 'original' characters is taken, if 4 codes are taken, a tail part code 'small' can be taken again, and the code sssi is coded; further, 4 codes of words such as ', , SHAN, , , and' are etop, tata, wwwg, ttte, and ttta, respectively. Wherein the 4 th code of the first word is 'P' instead of 'N', embodying the requirement priority principle advocated by this law.
7) The following takes 22 words as an example: "one, two, three, four, five, six, seven, eight, nine, ten, hundred, thousand, ten, swastika', swastika, concave, convex, prosperous, juice, complex, worm, ", has a representative hierarchical code, and the codes are "a, aa, aaa, p, a, tm, l, m, b, d, s, b, s, u, j, u, y, hy, xtur, zuzv, hzzv, sssi", respectively.
2.4. Application of four-stroke hierarchical shape code
The total number of Chinese characters in a new version GB18030 is up to 70244, the Chinese characters can be a simplified Chinese character set, a traditional Chinese character set or a combined set with different numbers of the simplified Chinese character set and the traditional Chinese character set, the Chinese characters can be used for keyboard input of Chinese characters, and can also be applied to a word dictionary, the existing dictionary and the dictionary are classified and searched by a method of radical classification, and the existing dictionary and the dictionary are very complicated and complicated, so that the time is very long for use, and four-stroke hierarchical shape code coding is used for 26 English letters like pinyin sequencing used in a Xinhua dictionary, and the advantages are that: the parts are divided into four major categories, and are further classified into about 50 groups, so that the parts are very concise and very convenient to remember; the coding space has a power of 26 to 4, the space is as high as 45 ten thousand (456976), although the code length of the Chinese pinyin is as long as six letters, the change is only about 417 × 4-1668, the repeated codes are as many as hundreds, and the collection of characters in a dictionary is not feasible, such as a Xinhua dictionary, and the Chinese pinyin has more fundamental Chinese characters, such as a dictionary, and the like; according to the sorting of 70244 Chinese characters, the inner code has few repeated codes, and if the Chinese characters are divided into middle, day, Korean and other subsets, the repeated codes are fewer; therefore, the coding method is very practical, has high accuracy and obvious practical value, and is a novel high-level sorting character-checking method. In a dictionary of about ten thousand characters newly compiled by I, the word "" is searched, the word coded as 'djoa' only has one word "", and how good the character searching effect is seen! If the dictionary receives enough words, the 18 words mentioned in page 5 above are searched, and the codes are: according to the codes, corresponding characters can be found, wherein the two characters 6 and 7 are all through-false characters, in the rest 16 characters, only the characters 2, 3 and 17 are selected from the characters 2, 1 has a duplication code, and the rest 13 Chinese characters are all 4-key non-duplication codes and can be directly input, and the average key is about 4.1, so that the actual duplication code rate is very low.
Computer keyboard input method
Further, the four-stroke computer keyboard input method of Chinese characters is one of the main applications of four-stroke hierarchical shape codes, and the classification, induction and coding rules of coding components are basically the same, except that: in order to meet different requirements of Chinese character input, different subsets are designed, different specific settings such as code length, brevity code, structure code and the like are provided, at present, an input subset with 3 code length mainly based on GB2312 and an input subset with 4 code length mainly based on GB13000 are mainly provided, the input speed cannot be said for computer keyboard input, in order to improve the input speed, common characters, non-common characters and cold-avoiding characters can be distinguished, and the specific input subset is set according to specific crowds. The input needs to be explained mainly by 3 points, 1 is the correction of the component setting rule, 2 is the utilization of the symbol key, including the practical application of the structure code, 3 is the setting of the simplified code, and other problems need to be explained due to the special requirements.
GB2312 input subset
The input of Chinese characters is usually directed at common characters, resources are occupied by adding too many rarely-used characters, the input speed is reduced, the waste of resources is caused, the Chinese characters are often conciseness and coexistence in practical application, and in terms of range, an input subset mainly based on GB2312 and an input subset mainly based on simplified characters have similar connotations, and mandatory regulations are difficult to be provided.
1.1 claim 2.1 is directed at the four-stroke input method of the simplified form code, four-stroke input subset taking GB2312 as the main, set up as 3 code length, the code below the 3 part word (including 3 parts), from the part code; such as: the last, four-stroke code is 'sxdtol'; 3 parts above get the level 3 code, such as: frequently, the code is 'zzv'; in the common subset, the number of parts is relatively small, and the frequency of taking the hierarchical coding is low.
The final purpose of Chinese character input is to accurately input a specific Chinese character, which is the uniqueness requirement of input, and the implementation means mainly comprises three types, wherein one type is to select a Chinese character from a prompt window and click to enter the Chinese character, and the method has the defects that the selection needs time, and particularly when repeated codes are many (pinyin input), page turning is needed, and the method is troublesome; secondly, inputting (pinyin) by using a word bar to reduce repeated codes; and thirdly, common characters are set to be input by using brevity codes, the method also uses symbol keys to input the sub-brevity codes, repeated codes are reduced as much as possible, the purpose of inputting 3-key repeated codes is basically achieved, direct input without repeated codes is the best selection of input, and the method contributes to comprehensive display of input performance.
The 1.2 four-stroke hierarchical shape code is used for the application of symbol keys in the Chinese character coding input of a computer, especially plays a role of difficult replacement in the input of 3-key repeated codes of Chinese characters of a GB2312 subset, is an important measure for improving the efficiency by four strokes, and firstly specifically introduces a using method, a comprehensive using effect and then introduces a specific coding input method.
The keyboard is provided with 11 symbol keys, and the basic functions of the symbol keys can not be influenced by using the symbol keys on the premise of entering an input state (clicking the 1 st English code key), namely the use of any function as the symbol keys is not influenced; the method sets the symbol key as the root, brevity code, duplicate code distinguishing key and word selecting key, thereby greatly reducing duplicate code and ensuring extremely low duplicate code rate.
The symbol key is used for setting the sub-brevity code, and the symbol key is used for setting the level 1 brevity code, so that the symbol key has a root showing function and is added with a new function; the 2-level brevity code can reflect the mutual relation of two parts and has double functions of a structure code and an brevity code. Generally, the mutual relationship of two codes exists in 4 setting conditions, namely 1, a head code and a tail code of a single character or split 2 codes; 2. two part codes of left and right sides; 3. an upper part and a lower part; 4. two part codes are delivered, the structure codes are as shown, when certain class 1 is excessive, window prompting is provided, so that compatibility is allowed for reducing repeated codes, and the input speed is improved. The number of the symbol keys is 11, when a single symbol is used, 10 of the symbol keys are used as a root, the symbol keys comprise a symbol (-) special for numbers, the rest symbol (') is specially used for 40 special parts beside the radicals which are rarely used even not used in the process of character input, such as a part of' alpha ', a part of both sides and a part of both sides, and the like, the main system is set to be displayed by using' and the following symbols, and the format is a code plus '+' plus designation symbol, and almost does not occupy letter code resources. For example, the codes are ' n ', ' and ' n ' ' is respectively typed in, ' the ' grow ' word is input, the function can meet the special requirement of inputting the symbols, the normal input resource is not occupied, the normal input speed is not influenced, and the method is a two-in-one selection.
1.3 as an input method, brevity code setting for high frequency words is indispensable, and like other input methods, brevity code setting is not limited by the number of components. The symbol key is used for inputting, and the automatically generated numbers can also serve the purpose of inputting because the number resources are not occupied. In a word, the 3-code length subset taking GB2312 as the main body highlights the characteristics of simplicity and quickness in input, and is more convenient and quicker by adding the input of the vocabulary entry.
Four-stroke encoding input subset of GB18030
The four-stroke input is directed at GB18030, because the set expands the Chinese characters, including many rare characters used in Chinese, Japanese and Korean, the subset is often used in practical application, the current practical GBK subset in China is basically provided with the main characteristics of four-stroke input, the coding rule is basically the same as the above, the difference is that the adjustment can be performed on different practical input subsets, 4-code length is usually selected, compared with the previous input set, the main difference is that a coding level is increased, namely the cost of one code is increased, the coding space is increased by 25 times, and the coding repetition rate is greatly reduced; in addition, the application of the three-point structure code is increased in the utilization of the symbol key.
2.1 adjustment from 3 yards to 4 yards results in the coding of words with 4 parts or less (including 4 parts) taking the level 4 codes from part codes, 4 parts or more words. Because the new version GB18030 character set covers a plurality of rare characters of 'Zhongjapanese and Korean', like Chinese characters consisting of two 'or' and the like, the input and the use of any character cannot be excluded when in speaking and inputting, and certainly, the method of taking the level 4 codes for the Chinese characters is increased; this also pre-represents, the application range of this coding, inputting method has been expanded to the place using Chinese characters, such as ' the country and region of ' day, Korean '; the four-stroke hierarchical shape code coding and inputting method can be designed into various Chinese characters and input subsets according to the requirements and the application range of the actual environment, such as Chinese character coding sets or input sets of three relatively independent countries of China, Japan and Korea, and can be widely applied to actual application environments of various industries, such as science and trade, industry and commerce, teaching and the like.
2.2 application of symbol key in Chinese character input
1. The sign key is applied in Chinese character input, 1-level brevity code and 2-level brevity code have been introduced in 3-code-length brevity code subset, and still be used in the set, when the sign key is used as a second key, namely, after the letter key is typed, 3 use functions are set, firstly, the sign key is used for directly inputting the independent character, and the function of input shunting is achieved, for example, 'Ji' character is input when 'J' is typed, the 'Yi' character is input when 'J' is typed, the 'Ji' character is input when 'J' is typed, and the 'Ji' character is input when 'J'. is typed, and the strokes of the characters are the same, the strokes are also the same, and the shapes are also almost the same, in the shape code input, the sign key is definitely coincident code, at this time, the sign key input is very effective, and the number key is not excluded to select the; secondly, the Chinese character input method is used for inputting brevity codes of common characters, and improves the input efficiency, wherein a sign key '-' is set to be specially used for inputting Chinese numbers, such as inputting 4 numbers of six, seven, eight and nine by using codes T-, L-, M-and B-for example; third, use as, show the root key, namely, take out a symbol key, for example, use 'the' symbol to show the root symbol, when typing any key, such as 'X' key, then type 'the' key, will pop up from the window, ', appoint the input symbol is', ', input symbol', 'will realize the purpose of inputting' so, just so let these unusual characters, can input and avoid the resource environment of the usual input, input with the symbol key, it is the choice of two whole beauty.
2. The third key of the symbol key in Chinese character input is mainly used for distinguishing the function of double-root character structure and also used for inputting second-level brevity codes, and when the symbol key is used as the brevity code, the subsequent fourth key is not restricted by the character structure, and the reference is made to the double-root setting introduction on page 9.
3. The fourth key of the symbol key in Chinese character input is mainly used as a nine-grid structure code (refer to page 11 of the specification) and also can be used as a brevity code, and in a table 2 in the claims, besides the setting of the symbol key, the table also comprises nine numeric codes which are used for setting the structure code in Chinese character input of the mobile phone, the number in the last row in the table is 1 single-body character, 2 double-root character, and the later words are the numbers.
2.3 input method and application of four-stroke hierarchical shape code
2.3.1 the four-stroke hierarchical code coding method according to claim 1, i.e. 50 kinds of related components are distributed and mapped on 26 English letters, then specific coding is performed according to eight basic coding rules, and the design of the symbol keys mentioned in 2.1.1 naturally forms a four-stroke hierarchical code keyboard input method, which is applicable to the whole comprehensive coding character set of GB 18030.
2.3.2 the use of the symbolic key as claimed in claim 2.1. in a 4-code system, the symbolic key is only used in less than 3 Chinese characters due to the change of range and content, and the space key plus selection is a simple input scheme, and the 4 words 1, 9, 12 and 18 are mentioned before (page 5, 18 words) and can be added with symbols to form' gok/, msy/, dbo and zgb; ' the addition of a symbol forms a code with a symbol key input, and other codes are completely the same and show the function of the symbol key.
Three, claim 3 four pen-shaped digital input
The four-stroke digital code is suitable for GB2312 or a method for inputting Chinese characters by using a common character set as a main body and inputting Chinese characters by a mobile phone or a computer, is based on the character-building layers of Chinese characters as claimed in claim 1.2, root codes are selected layer by layer, 3 layer codes are selected no matter the stroke and the number of components of the character, a coding component is directly mapped to 9 number keys, 1-9 numbers are set according to four-stroke proportion of a point 2, a horizontal 3, a left-falling 2 and a vertical 2, a nine-palace structure digital code is added, and a single character is input to have 4 number codes, so that the four-stroke digital code input method integrates the structure of a shape component and the structure of the component into a whole.
1. The coding parts of the four-stroke digital code can be strokes, radicals or single-body characters and the like, and are classified by four strokes; table 3 is a part classification table, i.e. it shows what kind of parts should be used, and what kind of numerical codes should be used, and is now described by taking a stylus as an example, the numerical code in the first row is '1', the set point, right-falling, or start stroke is a point, the part code in the next stroke is '1', and the third column in table 3 is an example of a '1' coded part, such as ', gate', etc., and it should be noted here that the examples of parts are only a few, and the key is the setting of rules in the middle column. The list of mapping the coding component to the specific number key is favorable for determining reliable codes as soon as possible, and in the code table of four-stroke digital codes, only two choices are available except 3 types of horizontal strokes in one type, and the two possibilities are simpler and faster than the trouble and order of strokes of stroke codes, so that the method is a very practical and fast coding method.
2. The four-stroke digital input takes 3 radical codes, and takes three codes for single-body characters, double-radical characters, 3 radical characters and a plurality of (more than 4) characters, 3 codes for 3 radical characters and 3 codes for a plurality of radical characters, which have been explained for many times before, and the important point in the shape codes lies in the single-body characters, double-radical characters and strokes with few strokes. The method comprises the following steps: repeating the single stroke twice to form 3 codes, such as 'one' word, wherein the code is '333', the structural code 1 is added, and the full code is '3331'; the single character is selected from part code, head and tail stroke code or 2-part code (including stroke and part interleaving), such as 'person, thousand', etc., the code is '761, 764', structure code 1, and the whole code is '7611, 7641'; also such as 'plumb, convex, concave', etc.; taking a total root code and a plus-minus 2 (part or head-tail stroke) code, wherein the codes are '7751, 8831 and 9831'; two characters are 'side radical' code (when the hard radical is 'radical component' code), and a 2-split double code of non-radical component is added, such as 'root, code, trial' and other characters, the codes are '456, 353 and 198', and the codes are '4562, 3532 and 1982', and the full codes are '4562, 3532 and 1982'
3. The 4 th digit of the stroke-shaped number is set as a digit structure number, and the application of single or double digits has been described in the preceding paragraphs, and the application of the three-point type nine-palace structure number in the stroke-shaped number according to claim 2 is described here, and the structure number is particularly important here because the stroke-shaped number only takes 3 part numbers.
Three-point nine-palace structure digital code example character table
Intermittent scraping 7 Wedding anger 8 Jien 9
Sing line 4 Rutting 5 Pinyin 6
Lucky eraser 1 Laos inserting bundle 2 Faraway same-drift letter garden 3
It can be seen from this table that nine cells are called nine palaces and are popular calling methods, each cell is represented by 1 number, and it is also doubtful that at least one type in each cell can be split into 3 point types, so this table is named as a nine-palace structure code table, and the numbers in this table are called nine-palace structure numbers. Taking example characters in the table as an example, the four-stroke numerical codes of the 'far article, article and frame' characters are '3313, 8886 and 5849', and the input of the numerical codes is not only few but also simple and clear, and is one of effective means for reducing the repeated codes.
The simple code setting of the four-stroke digital code is to replace 1-3 digital codes with '0'. The first-level brevity code is a high-frequency character which takes the number as the first code, the second-level brevity code is also set by adding '0' to the first two numbers, and the third-level brevity code is input by directly adding '0' to 3 part codes, so that the structure code is omitted.
4. The four-stroke digital input is a highlight of the invention, the setting of brevity codes is removed, the input repeated codes are very few in practice, the maximum number is only 6-7 when the length of 4 codes is full, any 1 Chinese character in GB2312 can be input by 5 keys, the result is a very rare achievement, the seven characters are input, the code is' 8137487182411314512(6 repeated codes are selected 1)5583(3 repeated codes are selected 1), the total number of key strokes (including space keys) is 27, 27/7 is 3.86, the average single character is only 3.86 keys, the learning is very easy, the input is rapid and convenient, and the method can generate extremely deep influence on the civilization and progress of the society.
Further, description of mapping numerical relation of letter key position
Claim 4 explains the mapping relation of letters (key positions) and numbers, which is another highlight of the present invention, and the following claim 6 only describes the specific application of the method in pinyin digital input and Chinese character digital input. For letter mapping, as long as the total number of letters does not exceed 81, the corresponding relation between the letter mapping and two numbers can be realized, such as Russian, Japanese and the like; english letters only account for 26, only account for one third, and can be input by key mapping numbers, namely, each letter input is replaced by two mapping numbers, in some occasions, English names such as Obama are required to be translated into Chinese Oubama, in other occasions, Oubama is required to be translated into English, the method is usually troublesome and irreversible, the mapping relation (the first 3 rows in the following table are used as mapping examples) Obama → 1935213721 → Obama is reversible and is the best, the method is particularly simple, the method can be conveniently converted by Russian letters, Japanese letters and the like, the method is extremely wide in application, two numbers can be used for replacing one letter, the letter coding of any code length is feasible, for example, in the four-stroke hierarchical code with 3 code lengths, mapping codes of 6 numbers can be used for replacing, and a number structure code is added to form the input method of the figure number with 7 code lengths, however, in the korean code system, since there is a more concise input of a 4-symbol long symbol, the height is far less than 7, and thus, it is not adopted.
On the basis of 26 letters, if the calculation is carried out according to 3 times, 3 groups with 78 characters are mapped, and 78 double-digit codes are mapped; or 27 English letters and one virtual letter are used, which is equivalent to 81 mapping numbers in table 4 in claim 4, so that the double spelling input of Chinese characters can be realized. The method uses one of 3, 26 or 27 letter keys as silent double-spelling digital input of spelling, uses another 52 or 54, and also divided into two groups, and combines them according to 2 x 2 to form 4 combinations, and exactly correspondent to 4 tones of spelling, and can make 4-tone spelling digital input of Chinese characters, in particular, the 4-tone digital input also has pure Chinese phonetic alphabet, and its concrete mapping mode and input effect are closely related.
Example table of alphabetic key mapping phonetic digital code
11 12 13 14 15 16 17 18 19
21 22 23 24 25 26 27 28 29
31 32 33 34 35 36 37 38 39
41 42 43 44 45 46 47 48 49
51 52 53 54 55 56 57 58 59
61 62 63 64 65 66 67 68 69
71 72 73 74 75 76 77 78 79
81 82 83 84 85 86 87 88 89
91 92 93 94 95 96 97 98 99
1. In the computer keyboard input of Chinese characters, the concrete mapping mode roughly includes the types of row-column sorting and (group) block sorting; because the nine-nine sorting does not relate to the number '0', firstly, the last letter P in the first row of the keyboard is regarded as the letter in the third row, and the letter P is copied twice to form three groups of nine rows; top 3 behavior first set, as shown in table 4-1 in the claims; the middle 3 line is the second group and the last 3 line is the third group, as shown in table 4-2. Table 4-1 table 4-2 merge that is, as the left table, the first 1 mapping number shows row, the last 1 number shows column, and the representation manner of the so-called determinant is very similar, which is simple and clear when describing the alphanumeric mapping relationship, and is not as easy to remember as the subsequent chunk ordering in practical application.
1.1 the mapping of Table 4-1 can be used for any double-spelling digital input. The double spelling input of Chinese characters is a spelling input method using initial consonants and vowels of Chinese characters, because the initial consonants are only 23, add 1 virtual initial consonant, only 24, less than 26 letters, the vowels are replaced with 26 letters, lean on memorizing the setting of these vowels in the practical use, because the edition is many, numerous and disordered, difficult to have the characteristic, difficult to remember, so use is not common; with the mapping of the letters (key positions) and the numbers, the setting of initial consonants and vowels can be replaced by the numbers to carry out the double-spelling input of the numbers, and the simple and quick Chinese character input can be carried out on the mobile phone only provided with the numeric keyboard, so that the function of the double-spelling digital Chinese character input is highlighted; therefore, the mapping relation has certain universality and wide application.
1.2 the mapping of Table 4-2 can be used for any 4-tone double-spelling code input. Table 4-2 includes two groups of 26 letter mapping numbers, and then two groups of numbers are used to make 2 × 2 combination, i.e. 4 groups of codes are formed according to the settings of 11, 12, 21 and 22, and exactly correspond to 4 tones, so that it can make simple, clear and fast 4-tone double-spelling digital code input on the mobile phone only having digital keyboard, including the input of pinyin and Chinese characters.
1.3 the mapping relation of table 4-1 and table 4-2 can be used for inputting the 4-tone comprehensive double-spelling code without tone, and the two codes are not interfered with each other, and the number '0' can be used for inputting the brevity code.
2. Compared with the table 4-1 and the table 4-2 in the previous clause, particularly in double spelling input, the chunk ordering of the mapping relation of the character and the number, namely the table 4-3 and the table 4-4 in the claim 4, is visual and ordered, convenient to memorize and most concise.
2.1 the codes mapped by the letters of 1-9 of the 26 keys of the table 4-3 can be used for inputting Chinese pinyin or Chinese characters without tones through double-pinyin codes. In the keyboard input of the computer, as shown in the table 4-3, 26 letters are in a large group, and the character is that when the single silent input is carried out, the relative conciseness is relatively realized, namely 3 group number numbers and 3 digit number numbers in each row are the same; if a ' number ' character is input, the pinyin is ' shu ', input in a silent tone, sh → e → 13, u → q → 11, and the input number is 1311, the pinyin ' shu ' or the number of Chinese characters ' can be input.
2.2 the table 4-4 two groups of 26 keys of 1-9 letter mapping digital code, can carry on the double spelling digital code input of Chinese characters of 4 tones, the method is to map the digital code two groups of letters (key position), classify into 1, 2 according to the group number, press 11, 12, 21, 22 four kinds of combinations, and make it correspond to 4 tones, can realize the double spelling 4 phonetic alphabet digital code input of Chinese characters, the benefit is that can realize the input of Chinese characters or Chinese phonetic alphabet of 4 tones with 4 digital codes, used for the mobile phone Chinese character input only with the numeric keyboard, it is convenient to show prominently.
Figure GSB0000191151000000141
The left chart is the mapping relation of the first group in the 9 groups shown in the table 4-4, covers qwe3 letters, maps the front code of the digital code, i.e. the group code is 1, the back code divides into upper and lower two cases, and carries out 2.2 combination (2 system), i.e. the 4 groups of codes are formed according to the setting of 11, 12, 21 and 22, and the double spelling of exactly 4 tones comprises pinyin and Chinese character digital input, etc.; if ' digital ' is input, the phonetic letter is ' shu ', two pronunciations are respectively sh ǔ and sh's, the rule of 3-sound input is ' 21 ', and the digital code is
Figure 100002_5
Sh ǔ → 1914; the rule for 4 sound inputs is '22' and the number is
Figure 100002_6
I.e., sh below → 1917; inputting the numbers 1914(3 tones) 1917(4 tones) can input the pinyin sh ǔ, sh haben 'or the number of Chinese characters'.
2.3 the mapping relation of tables 4-3 and 4-4 can be used for inputting the 4-tone comprehensive double-spelling code without tone, and the two codes are not interfered with each other, and the number '0' can be used for inputting the brevity code.
Table 4-3 table 4-4 is characterized by highlighting that the group codes are the same, i.e. the regular ordering of the next-order codes of the front and back groups is diversified under the precondition that the front-order codes are the same, i.e. the ' other than ' 123, 456, 789 ' in the claims proves that a plurality of choices are available in the double-spelling code input of the chinese characters or pinyin.
3. The Chinese pinyin without tone and 4 tone or the double-pinyin digital input of Chinese characters can be simultaneously realized by three groups of letters (key positions) mapped numbers of 1-9, namely 78 or 81 number code spaces, and the comprehensive pinyin and double-pinyin digital input of Chinese characters can be simultaneously used for inputting without tone and 4 tone without interference because of respective coding spaces and no overlapped part; especially 4 tones pinyin input, as long as two groups of keyboard letters are taken to copy mapping numbers, and then 2 x 2 combination is formed to form 4 tones pinyin input numbers, the Chinese pinyin and Chinese characters can be input with 4 tones, the specific practical value is very high, so that the 4 tones input of Chinese pinyin which is difficult to realize in the letter keyboard input becomes very simple, which is fully embodied in the following claim 6 and is another wonderful highlight of the invention.
Further, Korean code double spelling input
The Chinese characters have 417 pronunciations, and the biggest defect of inputting Chinese characters by pinyin is that repeated codes are very many, and in the coding input subset with only dry characters, the repeated codes are more than one hundred, if the number of characters is increased, the repeated codes are more, and practical pinyin input usually takes entry input as the main part, so that the repeated codes are reduced, and the Chinese character input method becomes a popular Chinese character input method and has complementary effect with the shape code input.
In the Chinese phonetic alphabet, there are 23 initial consonants, more than 30 vowels, when making double spelling input, it can generally set some key outside 23 initial consonants in 26 letters as virtual initial consonant to meet the input of silent initial pinyin, if 1 virtual letter or key is added outside 26 letters, then it adds coding space for digital pinyin input, including setting of vowel and virtual initial consonant and double spelling input of rear virtual key in vowel input, etc., and this kind of double spelling input of rear virtual key is not a complete concept of 'initial consonant' double spelling input.
1. The double spelling input of Chinese characters in pinyin input is a popular input method, and the versions of initial consonants and final consonants of double spelling are set to be many. The coding system adopts special vowel setting, basically arranges 5 blocks of vowels and natural order of letters in sequence, is particularly convenient to memorize, and the most of initials are the same as the setting on a computer keyboard, so that the memory quantity on the whole is very small, the coding system is a very simple and easy pinyin input method, and letter key positions can be used for mapping into digital codes for inputting, so that the letters are faded, the setting relation between the key positions and the coded numbers is strengthened, the setting advantages of the blocks and the sequence of the digital codes are highlighted, a good foundation is laid for Korean code digital double-spelling, and only 5 vowel series and the vowels in the series are regularly arranged after the digital codes are mapped by the keyboard letters, which is the essential difference between the Korean code double-spelling and other double spellings.
2. The Korean keyboard is characterized in that e is moved to h, u is moved to q, i is moved to z, and keys of the 6 letters are exchanged, so that the embarrassment that h is used as e, q is used as u and z is used as l is avoided, the initial consonant setting and the final consonant setting can be unified, and the Korean keyboard is beneficial to people who input Korean codes frequently.
2.1 whether the change has scientificity or not, the answer is positive because the Chinese input is different from the English input and has own characteristics, the keyboard is designed aiming at the use frequency of English input letters, the Chinese phonetic letters are borrowed from the English letters, but the use frequencies of the letters are completely different, some letters occupy important key positions, the use frequencies of u, i and the like are not high, and q and z with higher use frequencies are marginalized, so that the change is certainly changed for people using the keyboard in China and only reasonable basis cannot be found; with the advent of Korean code double spelling, in order to improve the input efficiency of double spelling, it is necessary to replace the disordered double spelling, and the design of the keyboard letter key positions should be changed, and certainly, the agenda schedule should be provided, which has undoubted scientificity.
2.2 in the Korean code input system, the key location is a word with very high repetition rate, the most prominent number code in the mapping relation is determined by the key location, the key location and fingering have close relation, the Korean code keyboard is a keyboard design for Chinese input with very strong practicability of the key location and fingering, it is carried out according to setting e → h, i → z, u → q of the vowel, the positions are exchanged, the letter arrangement of the new keyboard is UWHRTYQZOP, ASDFGEJKL, IXCVBNM, obviously, H, Q, Z all transfers to the high frequency area convenient for keystroke, and adds the setting zh o, ch-v, sh-e of the original initial consonant, z, zh, c, ch, s, sh are presented in the same row, the advantage of convenient memory is not to mention, and the key location where 26 letters are set is located, the key location is arranged according to the series 5 blocks with high order, that is, the 10 keys in the 1 st row are arranged in two series of u7o3, the 9 keys in the 2 nd row are arranged in two series of a5e4, the 7 keys in the 3 rd row are exclusively occupied by the 'i' series, which is the outstanding embodiment of the key position form in the table 5, wherein the u series and the i series have complementary exchange, so that coincident codes appear when pure Chinese pinyin input is carried out; the specific position of the letter in the keyboard is secondary, so that the English keyboard design is adopted, the English input basic pattern is kept, only 6 letters involved in Korean code double spelling input are changed, the practicability of the change is obvious, and keyboard resources can be shared in two input systems.
3. The input keyboard vowel setting is shown in table 5, which is the vowel sorting set by 26 letter key bits, and is described as follows:
3.1 two vowel series in the first row, setting a and e to just account for 9 codes, and completely arranging according to the natural sequence of letters.
3.2 move vowel u to letter q bit key position, this benefit is favorable to the vowel to arrange according to the natural order of the letter, its last vowel is uo, and vowel ou of the o system has symmetrical form, plus vowel 'u', 9 are related to 'u' altogether, so lighten the letter, only say the order, it is particularly convenient for memory.
3.3 in a row o system occupies 3 key positions on the right hand side, and ou, o and ong are sequentially from the original keyboard letter key positions of l, o and p, respectively, wherein the ou setting is for the whole sequencing effect of the row, and natural links of u system and o system are formed, namely, uoouu (you) pong in the 1 st row is formed, and the effect is convenient for memory.
3.4 line 3 sets l on the z key, the arrangement of the series l and the organic association of the iing and the letter n.
3.5 the mutual vowels have iang from ua, uang from ia or iu, and have equal, interchangeable or associative colors; iao follows iang in series from uai or i; lu from the o bond, and uoou u have a feeling of self-integration.
3.6 double spelling input, the vowel added with 'x' in the table 5 indicates that Chinese characters can be directly input without sound, and the actual input lacks a 'none' input information, and needs to virtualize an initial consonant, such as an optional letter a (actually, a key position); or virtual key positions other than 26 letters set in the subsequent pinyin digital input (claim 6), the latter virtual keys are truly virtual without setting specific letters or symbols.
3.7 table 5 shows that, except a, eu ui does not use English letter keys on English keyboard, but introduces the word of key position, neglects and fades letters, thinks that it is only vowels, key positions and numbers, strengthens the mapping and definition of letters and numbers, is especially simple and easy to memorize, and of course, it can be highly unified by Korean keyboard.
4. When the code double spelling enters the double spelling state, namely after the letter key is clicked and before the Chinese character is input, the function of tone screening and page turning can be performed by means of the symbol key.
Further, Korean code Chinese phonetic alphabet digital input and Chinese character digital input
The double-spelling input of Chinese characters has two input levels, 1 is spelling, 2 is Chinese character, and the same is true for the double-spelling digital input of Chinese characters. In most cases, the input of Chinese characters is input, pinyin input is rarely used, the requirement of inputting pinyin is ignored, and in the Korean code double-pinyin digital input, not only the input of Chinese characters but also the input of Chinese pinyin is required, in particular, the input function of Chinese pinyin with 4 tones is required.
1. One of the hanyu pinyin input of korean codes or the digital input form of chinese characters is a digital input method formed on the basis of the alphanumeric mapping of the alphabet keyboard of claim 4, and the initial consonant and vowel setting of table 5 or table 6 in the claims, by setting 26 keys, and pinyin digital input or chinese pinyin digital input is the content claimed in claim 6; the basic conditions and the forming process of the double-spelling digital input of Chinese characters and the pinyin digital input are almost the same, the difference is only the difference of the final input purpose, the two are closely related, and the method has the comprehensive input methods of no tone input, 4 tone input and both of the two.
1.1 adopting the letter keyboard mapping number coding method of claim 4, and setting the initial consonant and the final sound of the pinyin, the 4 number codes can be mapped, and the pinyin, namely Chinese pinyin or pinyin input of Chinese characters is realized; the key position number mapping of the row sorting according to the table 4-1 table 4-2 in the description of universal pinyin input is the simplest and the most clear, and specific examples and experiences can only refer to the subsequent Korean code pinyin number input due to the unknown and uncertain initial consonant and vowel setting so as to understand the universal applicability of the universal pinyin.
1.2 Korean code Pinyin 26 key mapping digital input is to map the initial consonants and the final consonants of Pinyin into 4 digital codes by using the initial consonant and the final consonant of Table 5 or 6 in the claims, and the total number of the initial consonants and the final consonant is 78, so that Pinyin, namely Chinese Pinyin input, or Chinese character Pinyin input is realized; the mapping digit of key digit in claim 4 is selected from one, two or three groups of digit mapping codes, one group of mapping codes can realize the non-tone pinyin input of double pinyin, two groups of mapping codes can be used for the double pinyin input of 4 tones, and three groups of coding modes are the pinyin input of both non-tone and tone double pinyin; because the pinyin digital input and the pinyin digital input of the Chinese characters are different, the difference of the mapping targets is only, or the relationship between the semi-finished product and the finished product is only; the digital input of the pinyin of the Korean codes or the digital input of the Chinese characters refers to the digital input of the key position mapping of the block (group) grouping and sequencing of the tables 4-3 and 4-4, which adopts 4-code length setting, is only different by way of example, and the interchange is also applicable.
1.2.1 tables 4-3 are single-group settings including virtual keys, which completely satisfy the double spelling input of table 5 with only 26 letter keys, or skip (virtual key number 92, select) or delete virtual keys (change mapping number 93 of letter P to 92), at this time, the final input of silent initials can use 3 keys except 23 initial settings in 26 keys as virtual initial consonants for non-initial rhymes input, and table 5 is provided with a key a as a virtual initial key; now take the input pinyin zhong as an example, according to the setting of claim 5, the code of zh is o, the code of ong is P, and then the numbers mapped according to the key position numbers of table 4-3 are 33 and 93, respectively, that is, the input 3393 four numbers are equal to the input pinyin zhong, under the support of software, the pinyin zhong can be displayed, thereby achieving the purpose of input, or the number is used for inputting 'middle' characters, etc.
1.2.2 tables 4-4 are two sets of settings including virtual keys, and as before, skipping is selected by using 26 keys, one-to-one mapping of two sets of key digits is performed according to table 5 of claim 4, table 4 and claim 5 of claim 4, and 4 sets of numeric codes formed by similar 2-system permutation and combination are performed, the tone of which is hidden in 4 numeric codes, which is a typical characteristic of 4-tone input of Han code double spelling, for example, to input Pinyin 'zh ō ng' as 1 sound, the mapping relation of '1-1' as zh → o → 36 and ong → P → 96 as two sets of digits, the combination of the two sets is 3696, and inputting 3696 is equivalent to inputting zh ō ng, and the input of zh ō ng or inputting 'medium' character can be displayed under the support of software; taking the input of er as an example, the second sound is the mapping relation of 1-2, and the mapping relation is mapped into 67 according to table 4, table 5, i.e. J, because the mapping number of the virtual key a is set to 44 in table 4-4 because it is a consonant-free number, and the input 4467 is equivalent to the input of er, and the er is displayed by software, or the Chinese character 'er' can be input, therefore, the tone is hidden in 4 numbers.
1.2.3 the key position number mapping method according to claim 4, the key position setting of initial consonant and final sound of claim 5 or claim 6, choose single group to do the phonetic input of the non-tone from above table 4-3, table 4-4 chooses two groups to do the digital input of the phonetic of 4 tones, it is seen that the input of non-tone and 4 tones of 4 phonetic digital input have their own coding spaces not to interfere with each other, it is these two kinds of inputs that have been taken at the same time to choose the essence of three groups, the process and result are completely the same, it is a Chinese character input method especially suitable for the numeric keyboard (such as the mobile phone) to input the above two examples.
2. The second one of the Chinese phonetic input or Chinese character digital input is 27-key setting, and the tables 4-3 and 4-4 are the simplest setting of block grouping, sorting and mapping, and this has the advantages that the former digit of two digits will not change during no tone or 4 tone input, and the latter digit will only change.
A silent input (the last 1 digit (second, fourth) of tables 4-3 is not other than 1 of 1, 2, 3;
the second and fourth numbers of the 1 st sound (1-1) combination of the 4 tone input (table 4-4) are not other than 1 number in 4, 5 and 6;
the second number of the 2 nd sound (1-2) combination is one of 4, 5, 6, and the fourth number is 1 of 7, 8, 9;
the second number of the 3 rd sound (2-1) combination is one of 7, 8 and 9, and the fourth number is 1 of 4, 5 and 6;
the second and fourth digits of the 4 th sound (2-2) combination are not other than 1 digit of 7, 8 and 9.
If one group is set as a non-tone input, the other two groups are used as 4-tone inputs, and as mentioned above, the relationship with 4 tones is similar to the convention of binary, and the following description is given for the phonetic-numerical code input of Chinese characters:
2.1 Han code double spelling Chinese character input without tone is completed by adding the table 4-3 key position number mapping of claim 4 on the basis of initial and final setting. The method for inputting the 'Chuang' character as claimed in claim, wherein the pinyin chuang is firstly known, the initial consonant is ch, the final sound is uang, the setting is shown in table 5 of claim 5, ch → v, uang → m, and the 4 codes input by the pinyin tone-free number of the 'Chuang' character are 8191 according to table 4-3, v → 81, m → 91; the input 8191 is the pinyin which is mapped to chuang and can be used for inputting chuang, and different from the pinyin input requirement of the previous version, the Chinese character 'creation' is to be found in a plurality of Chinese characters of which the pinyin is chuang, 6 Chinese characters are displayed in a window (trial version) after the Chinese characters are displayed, 1 sound has 'window and sore', 2 sound has 'bed', 3 sound has 'break', 4 sound has 'very sad and creation', and the 'creation' character is one of 6. The input number is 8191, and 8192 in the claims, which is different only from the setting of the final.
2.2 tables 4-4 are two groups of key position digit mapping, for 4 tones Chinese character input or Chinese phonetic input, table 6 is Korean code initial consonant, vowel 27 key setting example table, the group internal code of the front group of tables 4-4 is 4, 5, 6, the rear group is 7, 8, 9, the middle row of each group is public group number, the group number coding is mapping front digit, and it is irrelevant to the existence, absence and input selection of each tone; when inputting Chinese characters or Chinese phonetic 4 tones, the first and third digits are small groups (i.e. mapped first digits), and the difference is that the mapping digits in the second and fourth groups, i.e. in the mapping relationship of key digits of block grouping, the following characteristics are found in the four digits in tables 4-4 of table 43:
according to tables 4-4, two groups of key position number mapping tables are set, initial consonants and vowels of table 6 are set, 4 combined number codes are provided, corresponding to 4 tones of pinyin, and according to the input example of continuous no-tone, the input pinyin is ch ang, and the Chinese character 'bed' character of 2 tones is taken as an example, according to the initial consonant and vowel setting of table 6, namely ch ang → 8498, when the 2 tone number 8498 is input, the pinyin of 2 tones → ch ang can be input, or the 'bed' character 8498 → 'bed' can be input; similarly, if the 'jaywalking' character is input and is in 3 tones, and the number is 8795, the 'jaywalking' character can be directly input without selection; 'create' has two pronunciations, which are 1 sound and 4 sound, the 4-tone mapping input digital code of 1 sound is 8495, the 4-tone digital code is 8798, the 'create' character can be input, which is much better than the input digital code 8191 or 8192 without tone, please note that the 8192, 8495, 8498, 8795, 8798, which exactly accords with the 5 input state digital codes, all have respective coding spaces, and for a certain key position, the following digital code is exactly 1, 4, 7, or 2, 5, 8, and the other key positions are not 123, 456, 789, or 147, 258, 369 in the claims, and the actual meaning of the letter v and the 'virtual' key is diluted, and only lies in the combination of key position and mapping.
When 27 keys are input, the coding space is larger and more convenient, for example, vowel uang is set as key position z (i) in table 5, virtual key position is set in table 6, iao monopolizes one key position in table 6, which is the benefit brought by coding space and has larger maneuverability.
2.3 direct input example of vowel, want to input Chinese character 'er', the spelling is for er, the tone is the second sound, there is no initial consonant, press 4 tone Chinese character spelling digital input, can use table 4-2, table 4-4 to map the digital setting, initial consonant, vowel can use the convention of table 5 or table 6 to set up, use table 4-4 and table 5 as example now, the virtual initial consonant of table 5 is A key, press the 2 tone mapping of virtual initial consonant + vowel, the virtual → A → 44, er → j → 67, then input the number coding of er or Chinese character 'er' is right; 4467; then, the initial consonants and vowels in table 6 are used for setting, the pinyin er is a silent initial input example, or the pinyin number input of Chinese characters is carried out by using vowel + virtual keys, er → j → 64 and virtual → 98, the input number is 6498, namely, the pinyin 'er' or Chinese character 'er' can be input by inputting the number 6498, 5 characters are provided in 6B2312, and 'er' can be input by selecting one character from 5 characters.
2.4 in the pinyin digital input of Chinese characters, the number '0' can be set as an interrupt symbol, and specific common high-frequency characters are input, so that the memory input of brevity codes can bring quick and rich return to the input of the Chinese characters. In the previous section, if the brevity code 6490 is set, 6490 can be directly input to directly input the 'er' character, please note that the brevity code is set to be the plus number '0', and can replace the last 1, two or at most 3 number codes, thereby realizing the purpose of direct input.
Excellent performance of seven-Korean codes
1. The four-stroke parts are distributed in a balanced and reasonable mode, the full code of GB2312 before brevity code setting is counted according to preliminary statistics of a coding database, and the Chinese characters which can be directly input by 3 codes account for 64 percent of the total number; the total key stroke number of the spotlights accounts for the total key stroke number, the code number accounts for the total code number and is respectively 20 percent and 19.2 percent, the vertical strokes account for 19 percent and 19.2 percent, the horizontal strokes account for 35 percent and 34.6 percent, the left-falling strokes account for 26 percent and 27 percent, the error is little, the proportion of the left-falling strokes to the right-falling strokes is very consistent with the proportion of the left-falling strokes to the right-falling strokes to 5: 9: 7 of 26 keys, and the most prominent success point is achieved;
2. the code repetition rate of the four strokes is extremely low, the character set mainly comprising GB2312 can realize 3-key non-repeated input by means of symbol keys;
3. the four-stroke codes for GB18030 are very simple and convenient, and the codes of 70244 Chinese characters are sorted to find that the full-code long coincident codes are 9 at most, the codes are uuu, only 1 ' ' character can be displayed (noted) in the existing input platform, and the coincident codes capture the crown and are related to the ' 120 part (column) setting of GF 3001; 8 secondary coincident codes, the code is nsbi, and the following is 8 groups of 7 coincident codes, taking the code rmoy as an example, only '' one word of a display (GBK) can be input; it can be seen that the duplicate words may change due to the change of the range of the subset of received words, and most of them are in the newly added range of GB18030, which is difficult to be determined.
4. The dictionary is characterized in that four-stroke coding sorting is used for replacing pinyin sorting, the original pinyin sorting is changed into pinyin indexes, and the link of searching characters by radicals and strokes is eliminated. The Chinese characters are spoken into the character patterns, and are coded and sequenced according to the shape codes, a character can have a plurality of pronunciations which are concentrated together, so that the method is particularly favorable for knowing the connotation of multiple pronunciations of the character, and the problem that the character is not known well because the character is easy to find and examined again is solved; i find one example in writing the dictionary, is 'drags' this word, I commonly understand the meaning of 'pulling', this is not wrong either, when finding after the code ordering, this word has 3 pronunciations, wherein the meaning of 'zhu ā i' is 'throws', then the meaning of 'zhu i' is 'pulling', different tones, the meaning is totally opposite, the direction of pulling and throwing is just opposite, this example fully states, classify with the shape code, put together the pronunciation, very favorable to understanding the word, so use four strokes of hierarchical code ordering, offer the great convenience for the search of dictionary, word of dictionary, very swift, because four strokes of encoding only need remember about 50 part combinations of 4 strokes of classification, very concise Chinese, search swift, will create very favorable condition for the study, publishing, of the Chinese character.
5. The four-stroke code (claim 1) is good, the input (claim 2) is good, the application range of the four-stroke code is wide, 70244 Chinese characters of GB18030 are covered, obviously including Chinese, Japanese and Korean, naturally, subsets suitable for the respective ranges can be compiled, convenient pipelines are opened for Chinese characters widely used by people of Chinese, Japanese, Korean and the like, and people of all countries are benefited.
6. The four-stroke shape digital code (claim 3) is only 4 codes long, the repeated code is also only several, any Chinese character can be input by 5 keys (GB2312), and the four-stroke shape digital code is particularly suitable for Chinese character input of communication mobile phones.
7. The features of table 4 of claim 4, which is an invention of alphabetic keyboard mapping numbers, is a key of korean pinyin number input, and also provides a shortcut for converting other double pinyin input into number input, and the difference from korean double pinyin is limited to the setting of virtual initial consonant.
8. The setting of the consonant and vowel of Korean code opens up a broad prospect for the birth of Chinese keyboards, which are keyboards conforming to Chinese language habits, belong to Chinese keyboards, and the true labor is attributed to the series setting of vowels in Korean code double spelling according to plates and the mapping of key position numbers.
9. The Korean code double spelling pinyin (Chinese pinyin) digital input is the basis of Korean code phonetic digital Chinese character input, is a very effective pinyin input method, has temptation and charm which are difficult to replace, can be used for inputting 4 tones of pinyin by using 4 numbers, is difficult to realize by using 26 English letters, is realized by using 4 numbers, and has higher speed and higher efficiency than the pinyin input of the letters, and is a strange achievement.
10. The Korean code double-spelling code Chinese character has simple input and inherent phonetic transcription advantages, and may be used in the development of cellphone communication.
Detailed Description
The Korean code application example is used to prove its excellent performance.
a) The input of the phonetic digital codes is also set to be 4 codes as well as the shape codes, and now, taking the 21 characters of the 'four-stroke hierarchical shape codes of Chinese characters and the digital coding input method of shape and pronunciation' which are input in my previous application as an example, the actual effect of 4-sound digital code input of the phonetic digital code block grouping is deepened, and the 21 characters are translated into the digital codes as follows: chinese character 594, character 7787 selects 4, four 4887 selects 0, pen 8884 selects 7, layer 7669, secondary 7987 selects 3, shape 7589 selects 4, code 9744 selects 4, 6487 turns over selects 4, shape 7589 selects 4, sound 2686 turns over selects 4, number 19, character 7787 selects 4, coding 8575 selects 5, code 9744 selects 4, input 1311 turns over selects 9, input 2717 selects 5454 selects 3, method 5744 selects 2, (note: the old version of the number, character and 'selected' character after the number shows that after the number is input in the existing silent input software, the repeated character needs to be selected, 'page turning over' is needed), 3 times of code length is less than the specified, a single character needs to be input about 5 keys, the digital code input method with the length of 4 codes is known to speak, the efficiency is very high, data coding ordering is found, when the tone-adjusting repeated code is made to be 100, and more than 4 codes are made, and more than 50 tones are made.
b) One section is as follows: the liberation of the forced hierarchy is not only impossible to do the violent revolution, but also impossible to eliminate the national political authorities established by the governing hierarchy that embody such a departure. This is an absolute positive conclusion from a detailed historical analysis of the revolutionary task. The words are input and demonstrated by using simple shape codes, sound codes and sound codes respectively, according to the example single word input statistics, including space keys and page turning keys, the average single word input keystroke effect is as follows:
single input stroke number effect table
Input method Code Total number of keystrokes Chinese character number Average number of key strokes Remarks for note
Simple code 26 199 78 2.55 1.99 key without blank space
Shape number 10 296 78 3.8
Sound code 26 285 78 3.65
Non-tone sound code 9 451 78 5.78
4 sound code 9 383 78 4.91 Full code
4 sound code 10 336 78 4.3 Use brevity code
c) Another example of the following speech is: we can never forget the vigorous achievement they establish for parties and people, can never forget the struggle spirit they cultivate with life, must inherit and develop their excellent quality and highly spirits, and achieve the career life of parties and people is unhappy and struggle. ' 78 characters are now translated into four-stroke codes in the form of single character correspondence as follows: zxqq; asqsvvytr; qjxhxqrlonbox, jlgltwbg; ah, xgvgy, q; asqsvvytr; qjxhxqn, z, mojftoty; bsp, respectively; wdrgyqo, a-rjavrulebovkklxxhqbxkbbbooondxboiirqtorgyqo, xdzafplronbox, jlbgp, z; mojsnr, sp; wdsi.
As seen from the coded translation, the single character has at most 3 codes, and the average single character input keystroke number is as follows: 171 characters excluding space key, 171/78 ═ 2.2 (key); if the space key is included and the space is set to be 0.6, then (171+25)/78 is set to be 2.51 (key), in this example, the statistics of the input software set according to the actual code, the average number of keys hit by a single character is not only related to the selected article, but also inversely related to the number of parts classified, the reduction of the large number of keys inevitably leads to the increase of the number of key hits, and only 0.1 key is added compared with the original specification, and all the keys are selectable.
d) The following is a dictionary example composed by Korean code sequence, highlighting the link of deleting stroke and indexing, and the page numbers of the first 24 surnames in the family names in the dictionary, and also listing the corresponding shape number, double spelling and 4 tone number codes in the table, wherein the number codes include simplified characters and normal characters, and the comprehensive performance of the Korean codes is highlighted. The four-stroke 50-class component combined hierarchical coding sorting is proved to be very simple and convenient.
Korean dictionary and double spelling digit example word list
Figure 100002_3
Note: 1. the dictionary page in the table is a sample of a ten thousand-word Korean dictionary for receiving a word; the number of shapes is the 3-part code plus 1 structure code input method of claim 3 (table 2);
2. the alphabetic code of the korean double-spelling is set with a 36 korean keyboard, and the difference from the general keyboard is that 'uei' 3 alphabets and 'qhz' are exchanged.
3. The Korean code numerals are set to 4 tones, and the mapping relation is shown in the table 4-4 of claim 4.
e) The korean code 4 tone number input mapped by the universal keypad is as follows:
love for father and mother, good new year
qinlai4deba4ba4mama,xinnian2hao4,......zhu4ni3men4shen
14854748465688478847964496447585867859583917897497688565
Healthy body, birthday of birth to the southern mountain! And (c) some.
tijian4kang,shou4bi3nan2shang3!er2mou3。
257467786554873888748649195444679735。
The effect is obviously that only 4 digits are used, namely the number of keystrokes is only 4, which is much less than that of keystrokes using letters, the tones can be distinguished, the input is much simpler than the pinyin input, great convenience is provided for the Chinese character input application of the mobile phone, the mobile phone has good prospect, and the effect is better by adding 0 brevity code.

Claims (4)

1. A Chinese character and Chinese phonetic coding input method is a Korean code phonetic input method which is set on the key position of a computer keyboard according to the initial consonant and the final consonant in the Chinese phonetic scheme, and the single code is added with a virtual key for input, and is characterized in that:
in the pinyin input of Chinese characters or the initial and final double-pinyin input of Chinese pinyin, the input is set as two-key code input, and the initial consonants and the virtual initial consonants are set in an English keyboard, as shown in a chart 1-1:
korean code double spelling initial consonant key position setting table chart 1-1
Figure 1
The 5 vowel series of the Chinese syllable, i, u, o, e, i and u include the 5 vowel series, 5 areas are defined and set in sequence, most of the vowels are arranged according to the natural sequence of English letters, and only the i and u series have few exceptions, as shown in the following chart 1-2:
korean code double spelling vowel key position setting table chart 1-2
Figure 2
In Korean code initial and final double spelling input, the symbol keys are used for tone selection or page turning; at this time, the number keys can still be directly input by selecting the coincident codes.
2. The method for coding and inputting Chinese characters and Pinyin as claimed in claim 1, wherein the mapping relationship is determined by using numeric codes instead of letters as Pinyin input, and further comprising:
the mapping of letters or keyboard keys and numbers has obvious hierarchy, 3 rows of 26 English letters or corresponding keys can be used for mapping once, mapping twice and mapping three times, the mapping just covers 26 letters or keys of a computer keyboard, the mapping is divided into 9 groups, each group has 3 keys, and the mapping is repeated twice to form 81 mappings in total of 9 x 9, the mapping can be used for double spelling input of any Chinese characters or pinyin, 78 mappings are commonly used in practice, the specific mapping is set as good by grouping, and the following graph 2 shows that:
key position digital mapping chart 2
Figure 3
The table is set by a general English keyboard, and the virtual key is a mapping vacancy and can be used or not used.
3. The method for coding input of chinese characters and pinyin and mapping tables of letter keys and digits as claimed in claim 2, further comprising:
in the Chinese character or Chinese phonetic alphabet digital coding input method, as long as using a large group of 26 letters or key position mapping double-digit setting, can be used as phonetic alphabet silent Chinese character input or phonetic alphabet direct input, the upper half of the following table is set for initial consonant digital code, the lower half is set for rhyme digital code, because of the difference of using virtual key, so will not use the virtual key to name 1 type, use the virtual key to name 2 types, and only involve the setting of the vowel, their common characteristics are that the latter digit is only one of 1, 2, 3;
double spelling type 1 silent key position digital setting chart 3-1
Figure 4
In the pinyin non-tone digital input method, in order to fully utilize the mapped information resources and increase the utilization of virtual key positions, the memory difficulty of vowels is reduced, and the vowels are set as shown in the following table:
korean code double spelling 2 type rhyme and mother key position setting table chart 3-2
Figure 5
In the silent input of Chinese characters or Chinese phonetic alphabets, the 0 digit is often used as the brevity code key of common Chinese characters when inputting Chinese numerical codes.
4. The coding input method for Chinese characters and Chinese pinyin according to claim 2, and the mapping table of letter key position number, i.e. the mapping number codes of the chart 1-1, the chart 1-2 and the chart 2, so as to form 4-tone input of Chinese characters or Chinese pinyin, further comprising:
in the four tone input of Chinese characters or Chinese pinyin, two levels of mapping spaces are used, namely, table 4-1 and table 4-2, each table has two mapping number selections of mapping upper and lower, a 2-system mapping relation is exactly formed, the first tone is specially set to be 11, the second tone is 12, the 3 rd tone is 21, the 4 th tone is 22, and even the double-number coding of 4 tones of pinyin is exactly formed, the method is also divided into two categories according to the difference of 'virtual keys' applied to the setting of vowels in the silent tone input, the category 1 does not use virtual keys, the category 2 uses virtual keys, the initial setting is the same, so the initial mapping of the two categories of input is also the same, as shown in the following table 4-1:
double-spelling four-tone initial consonant mapping digital setting table chart 4-1
Figure 6
The setting of vowels input by two types of 4-consonants is different, and the mapping is also different, and the mapping of 1 type of 4-consonants and vowels is shown in the following table 4-2:
double-spelling 1-class four-tone rhyme mother mapping digital setting table chart 4-2
Figure 7
The 1-class 4-tone digital input method is that 2-system codes are selected according to the tones of pinyin, and the tones of 1-class 4-tone input Chinese characters or Chinese pinyin can be displayed by using the two tables and the upper and lower codes in the tables for selection;
2-type 4-sound digital pinyin input is performed by only replacing the graph 4-2 with the graph 4-3 in the same way;
the category 2 vowel settings are shown in tables 4-3 below:
double-spelling 2-class four-tone rhyme mother mapping digital setting table chart 4-3
Figure 8
In the 4 tone input of Chinese characters or Chinese phonetic alphabet, two input schemes have respective advantages and disadvantages, and when the Chinese characters are input by digital codes, the number of '0' is always used as the brevity code key input of the common Chinese characters.
CN201310637474.2A 2009-06-18 2009-06-18 Chinese character and Chinese phonetic alphabet coding input method Active CN104123011B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310637474.2A CN104123011B (en) 2009-06-18 2009-06-18 Chinese character and Chinese phonetic alphabet coding input method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200910149939 CN101930292B (en) 2009-06-18 2009-06-18 Comprehensive coding input method of font, phonetic alphabet and number of Chinese characters and application thereof
CN201310637474.2A CN104123011B (en) 2009-06-18 2009-06-18 Chinese character and Chinese phonetic alphabet coding input method

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN 200910149939 Division CN101930292B (en) 2009-06-18 2009-06-18 Comprehensive coding input method of font, phonetic alphabet and number of Chinese characters and application thereof

Publications (2)

Publication Number Publication Date
CN104123011A CN104123011A (en) 2014-10-29
CN104123011B true CN104123011B (en) 2021-08-13

Family

ID=51768448

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310637474.2A Active CN104123011B (en) 2009-06-18 2009-06-18 Chinese character and Chinese phonetic alphabet coding input method

Country Status (1)

Country Link
CN (1) CN104123011B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107613109B (en) * 2017-08-31 2020-06-09 南京白下高新技术产业园区投资发展有限责任公司 Input method of mobile terminal, mobile terminal and computer storage medium
CN110442246A (en) * 2019-05-07 2019-11-12 佐建明 A kind of Chinese character component input method
CN112307277A (en) * 2020-09-29 2021-02-02 西安赢瑞电子有限公司 Chinese character string matching pre-judging method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN85100094A (en) * 1985-04-01 1986-07-16 清华大学 Phonetic transcriptions of Chinese characters association coding and spelling keyboard
CN1892539A (en) * 2005-07-04 2007-01-10 姜国钧 Beginning-end-stroke and double phonetic-alphabet Chinese-character inputting method
CN101008870A (en) * 2007-01-04 2007-08-01 李志锐 Numeric keypad for dual spelling and input method there therefor
CN101093421A (en) * 2006-06-20 2007-12-26 韩恒瑞 Hierarchy type codes of four stocks of Chinese characters, and digital encoded method for inputting shape and sound
CN101430604A (en) * 2006-12-25 2009-05-13 王治阳 Chinese character code input method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN85100094A (en) * 1985-04-01 1986-07-16 清华大学 Phonetic transcriptions of Chinese characters association coding and spelling keyboard
CN1892539A (en) * 2005-07-04 2007-01-10 姜国钧 Beginning-end-stroke and double phonetic-alphabet Chinese-character inputting method
CN101093421A (en) * 2006-06-20 2007-12-26 韩恒瑞 Hierarchy type codes of four stocks of Chinese characters, and digital encoded method for inputting shape and sound
CN101430604A (en) * 2006-12-25 2009-05-13 王治阳 Chinese character code input method
CN101008870A (en) * 2007-01-04 2007-08-01 李志锐 Numeric keypad for dual spelling and input method there therefor

Also Published As

Publication number Publication date
CN104123011A (en) 2014-10-29

Similar Documents

Publication Publication Date Title
CN100462901C (en) GB phoneticize input method
CN104123011B (en) Chinese character and Chinese phonetic alphabet coding input method
CN101694602B (en) Chinese character input method utilizing Chinese character holographic initial consonant code and final sound code
CN101930292B (en) Comprehensive coding input method of font, phonetic alphabet and number of Chinese characters and application thereof
CN105912139B (en) Method for correspondingly recognizing modular stroke coding Chinese characters
CN102511021B (en) Number-order-code-element keyboard and information input method thereof
CN100545790C (en) Computer Chinese characters information hunt head code input method
KR100655720B1 (en) Alphabet input apparatus in a keypad and method thereof
CN102023717A (en) Three-five initial-subsequent phonetic code and keyboard thereof
CN107256092B (en) Chinese character digital shape code quick input method
CN105607752A (en) Xingyi Chinese character inputting method
CN1057624C (en) Chinese character input method and keyboard design
CN105278697B (en) Combined double-spelling class major-minor code Chinese character, word coded input method and its keyboard
CN101093421A (en) Hierarchy type codes of four stocks of Chinese characters, and digital encoded method for inputting shape and sound
CN101089794A (en) Chinese simple search and characters quickly input
CN101271366A (en) Head and tail double-pin input method and keyboard thereof
CN1125393C (en) Chinese character encoding and inputting method and keyboard
CN105320291B (en) Combined type pronunciation and meaning class major-minor code Chinese character, word coded input method and its keyboard
CN1204487C (en) Chinese character input method based on code of radicals and sound
CN111309159A (en) Chinese character digital retrieval method and device and full-digital Chinese character input method and device
CN102637077A (en) Phonological, calligraphic and tone hybrid coding method for inputting Chinese characters to computer
CN1519686A (en) Method of big and small character elements for inputting Chinese characters
CN105204657B (en) Combined type phonetic class major-minor code Chinese character, word coded input method and its keyboard
CN102902367A (en) Multi-purpose etymon coding, indexing and inputting method
CN1904811B (en) Chinese character encoding input method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant