CN117111752A - New homophonic near-bit Chinese character code input method - Google Patents

New homophonic near-bit Chinese character code input method Download PDF

Info

Publication number
CN117111752A
CN117111752A CN202310081787.8A CN202310081787A CN117111752A CN 117111752 A CN117111752 A CN 117111752A CN 202310081787 A CN202310081787 A CN 202310081787A CN 117111752 A CN117111752 A CN 117111752A
Authority
CN
China
Prior art keywords
code
codes
chinese character
chinese
chinese characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310081787.8A
Other languages
Chinese (zh)
Inventor
王治阳
王亭朝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CN117111752A publication Critical patent/CN117111752A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A Chinese-character encode method for inputting Chinese characters to computer includes such steps as coding with sound code and shape code, and features that the first strokes of the combined Chinese characters except for radicals are the same basic strokes, the multiple strokes with same phonetic initial letters are arranged according to the same-sound near-position method, and the second code is coded by dual-layer scan technique.

Description

New homophonic near-bit Chinese character code input method
Technical Field
The invention belongs to a computer Chinese character coding input method, which is characterized in that after the t-type Chinese character code input method is invented, the simultaneous near-position technology is invented, the left-to-right scanning technology is invented at most twice, the selection and the disjunction of strokes and radicals can be found out unexpectedly, and the positions on a keyboard can be calculated quantitatively, and the keyboard has only unique arrangement as the mathematics, which is an unprecedented innovation of the Chinese character input method, and the Chinese character coding input method is further improved after the Chinese character code input method is subjected to the state that a certain input method company is unwilling to learn for one minute, the learning time of a few minutes is shortened, and a new simple Chinese character input method is integrated, so the Chinese character coding input method is called as a new simultaneous near-position Chinese character code input method, and the keyboard for realizing the input method is also related. In addition, the method can also be called a new homophonic near-position Chinese character code pinyin input method.
Background
The keyboard input method is the most widely used input method in the current Chinese character input method. The keyboard input is divided into three types of sound codes, shape codes and sound-shape codes according to codes.
The phonetic codes are generally based on the pinyin of Chinese characters and are encoded by the pronunciation of the Chinese characters. The shape code is encoded by using the character pattern characteristics of the Chinese characters. The phonetic and shape codes are encoded by utilizing the phonetic features of Chinese characters and the font features of Chinese characters. The sound-shape codes can be divided into two types, namely the sound-shape codes using only the initial consonant and the sound-shape codes using the whole sound-shape codes of the Chinese characters, the initial consonant and the final sound of the Chinese characters are completely used, and the auxiliary codes are arranged at the back of the specified sound-shape codes, so that the thinking is basically not influenced, the thinking is almost the same as the sound-shape codes, the repeated code rate is almost the same as the shape codes, and the Chinese characters can be compatible with the pinyin, and the superiority is increasingly displayed. At present, the sound-shape codes invented by other people tend to have more Chinese character components or higher repeated code rate, and the Chinese character codes invented by the inventor can simply and high-speed input Chinese characters on the basis of innovative coding rules by only using about 21 radicals and 5 basic strokes. The 26 strokes and the 26 English letters are in one-to-one correspondence, and are convenient to display on small screen keys of a mobile phone and the like.
However, after the invention patent is disclosed, many people reflect that people are not used to the horizontal, vertical, skimming, dot (right-falling) and folding respectively, and the code is a, o, e, i, u or H, S, P, D, Z respectively, so that the code is easy to memorize, accords with the brain response, is difficult to determine at a time, and later, the invention is recommended to a certain input method company by a manager of a certain input method department: anything that the user needs to learn, even only a minute, is not acceptable to the user if there is no qualitative breakthrough. Further research has found that the coding of the horizontal, vertical, left-falling, dot (right-falling) and fold with H, S, P, D, Z respectively does not favor word repetition when the first code of the auxiliary code is input, but when the second code of the auxiliary code is input, the word repetition rate is much less than the coding of the horizontal, vertical, left-falling, dot (right-falling) and fold with the vowel E, I, A, O, U respectively due to consonant letters, which is a great finding. This gives me the discretion to encode the transverse, vertical, left-falling, dot (right-falling), and fold with H, S, P, D, Z, respectively. However, the original keyboard layout has to be significantly modified because some basic strokes will be the same as some multi-stroke components, i.e., radicals, with initials or finals, which requires a new method to arrange the multi-stroke components. In addition, when few radicals are selected, only the number of repeated codes can be simply counted, and the frequency number of Chinese characters with repeated codes is not counted through a big data technology, so that the selection of multi-stroke components is optimized and determined. After the multi-stroke part is determined, if the initial of the preferred multi-stroke part is ingeniously used as E, I, A, O, U code on the basis of homophonic near-position technology, the multi-stroke part can be memorized more easily. The remaining few radicals, i.e., the multi-stroke components, are also located according to the homophonic near-position and quantitative calculation.
In order to meet different requirements, the input method which is learned only for a few minutes is integrated on the basis of the new Chinese character codes, only the repeated codes are increased, the key stroke times are increased, and the typing speed is slowed down. This input method is merely auxiliary.
The voice code part adopts full spelling as the simplest, but has the problem of longer code length, mainly that ng in the full spelling has high occurrence frequency and needs to strike the key twice. If ng is represented by a key, the pinyin code length can be shortened. Many input methods are denoted by g, but this easily coincides with the initial g. It is necessary to use more conveniently.
In the previous invention of the inventor, only single word coding is paid attention to, and the importance of phrase repetition is not enough, and the problem of insufficient importance of phrase repetition in the invention of other people is also existed. However, the inventor finds that the word group duplication rate of the Chinese phonetic scheme is higher in the process of inputting the software, and the inventor carries out statistics based on a common word stock of Beijing university and a common word stock of a modern Chinese dictionary, and finds that 57000 syllables of homophones are saved in the whole phonetic system, namely about 7000 syllables of homophones exist in the whole phonetic system, wherein the maximum number of two syllables is about 4800, and the maximum number of three syllables is 2200 or more. With the increase of phrases, homophones are more prominent, so many homophones have a strong influence on the input speed, and many homophone errors frequently occur in actual input. When the number of phrases is enlarged to 68000 common phrases, the five-stroke equal-shape code type input method has a lot of repeated codes, even more than pinyin. One solution is intelligent input, which is just like the declaration of intelligent input methods such as dog hundred degrees, etc., to improve the automatic preference rate of phrases. However, many times, the homonym selection is troublesome. Therefore, it is necessary to invent a simple method for distinguishing homophones. While the method is preferably based on a single word encoded ancillary code.
Disclosure of Invention
Thus, the current Chinese character input method is either nonstandard in Chinese character components or unreasonable in the number of selected Chinese character components; or radicals, i.e. Chinese character components, do not fully consider the word-forming frequency, practical frequency and duplication code rate in commonly used Chinese characters; or the positions of the five basic strokes in the keyboard are unreasonable, so that the words and the codes are easy to cause; or the code length is too long; or the repeated codes are too high, so that the input speed is influenced; or only the initial consonant or the pinyin initial of the Chinese character is utilized; or not intuitive enough; or the code taking rule is not reasonable, and the brain response can be influenced; or whether the code is left and right or not or whether the code is transversely fetched for a while or longitudinally fetched for a while is continuously distinguished during the code fetching; or the arrangement regularity of the Chinese character components on the keyboard is not strong, even has a little effect; or the quantitative calculation is not carried out on the multi-stroke parts, and the selection and the arrangement on the keyboard are abandoned by experience intuitiveness; or the frequency of the radicals is not used as the statistics data of authoritative big data, namely the actual frequency is not considered enough; or a single auxiliary code cannot be compatible with multiple input methods; or there is no good way to distinguish homophones (re-code words). The technical problem of simplicity, rapidness and inconvenience cannot be well solved, and Chinese characters are not convenient and rapid to input.
The invention aims to provide a computer Chinese character coding input method which has reasonable selection layout of Chinese character stroke components, is standard and visual, is simple and easy to learn, has reasonable code taking rule, is simple and quick to input Chinese characters and takes single character and phrase codes into account, namely a new homonymous near-position Chinese character code input method. The input method can be composed of two or more auxiliary code codes, and different requirements are met.
In order to achieve the purpose of the new homophonic near-position Chinese character code input method, the invention provides that the codes of the new homophonic near-position Chinese character code input method consist of two parts of a sound code and an auxiliary code, the auxiliary code can be before the sound code or after the sound code, the auxiliary code is generally provided before the sound code and after the auxiliary code, because the pinyin has more and more superiority, but the homophonic character selection trouble still exists in the pinyin, and the artificial intelligence and search engine technology can not completely solve the problem. The shape part codes in the invention belong to auxiliary codes, and the auxiliary codes are also various. For convenience of distinguishing, the auxiliary codes for inputting by adopting about 21 multi-stroke Chinese character components and five basic strokes are called shape part codes, and the auxiliary codes for inputting by only five basic stroke codes and the pronunciation codes of the initial letters of the pinyin of each radical in the Chinese character are called simple auxiliary codes. The auxiliary code may be direct, i.e. directly following the sound code; or indirectly, namely, after inputting pinyin like a certain input method, inputting auxiliary codes after tab key. The shape part code of the present invention is designed carefully, so that the repeated codes of words can be reduced to the maximum extent, and the direct auxiliary codes are also helpful for improving the speed of inputting Chinese characters.
The phonetic code may be phonetic or phonetic notation. The input method is similar to the phonetic input method, but vowels are expressed in a phonemic mode, and the initials are basically from Latin letters and internationalized rails. Of course, the phonetic codes support the phonetic alphabets of full spelling, double spelling, phonetic alphabet, etc. and may be input via parallel stroke, fast recording, etc.
The shape-encoding portion occupies at most two codes. The invention is generally composed of two letter codes, five basic strokes and about 21 multi-stroke Chinese character components are preferably participated in coding, because the Chinese language commission also refers to the five basic strokes as Chinese character components, the five basic strokes are called single-stroke components in the invention, and other 21 preferable Chinese character components are composed of a plurality of strokes, called multi-stroke components, which are all radicals, so that the radicals are called, or the radicals are called directly. These five basic strokes and 21 multi-stroke parts are collectively called basic parts, and are preferably coded according to basic parts with multiple strokes when shape coding, and are preferably coded according to multi-stroke parts, otherwise, the provision of selecting the multi-stroke parts becomes meaningless. The code fetch rules of the shape part codes are three:
The code fetch rule of the first shape part code is: the method comprises the steps of taking corresponding code codes of the first two basic components according to writing order, or taking corresponding code codes of the first and last basic components of the Chinese character according to writing order, and taking corresponding code codes of only one basic component or taking corresponding code codes of the basic component twice successively when the Chinese character has only one basic component; the combined word is divided into two parts according to the integral structure, the first writing part is a head part, the second writing part is a residual part, and the corresponding codes of the first basic part of the head part and the first basic part of the residual part are respectively taken according to the writing sequence for coding.
This coding rule has a weakness: when the first basic component of each Chinese character is taken into account during the shape part coding, the character shape must be considered, namely, whether the character is a single character or a combined character must be separated, and then two different code taking rules are adopted for coding according to two different character shapes, so that the brain reaction is affected, and whether the combined character is the combined character or not is difficult to judge by some Chinese characters, and sometimes the combined character is difficult to be separated into two parts. The Chinese character coding according to the left-right structure and the Chinese character coding not according to the left-right structure are much easier, because whether one Chinese character is of the left-right structure or not is easy to separate, the left part and the right part of the Chinese character of the left-right structure are provided with gaps, and the Chinese character is easy to separate into two parts according to the gaps. The Chinese characters with the left-middle-right structure generally take the first gap as the reference, and the middle part is divided into the right part, namely the part except the left part of the Chinese characters with the left-middle-right structure calculates the right part.
The second code fetch rule of the shape part code is: the Chinese characters with left and right structures are respectively coded according to the corresponding codes of the first basic component of the Chinese character and the first basic component of the right part of the Chinese character according to the writing sequence; chinese characters with non-left-right structure are obtained according to the writing sequence, the corresponding code codes of the first and last basic components of the Chinese characters are obtained, and only one basic component is obtained, or the corresponding code codes of the basic component are obtained successively for two times. To prevent bypassing patents, or provision is made for: chinese characters with non-left-right structure are coded by taking the corresponding codes of the first and second basic components of the Chinese characters according to the writing sequence, but the provision is easy to increase a large number of repeated codes.
It is to be noted in particular that: the fact that all Chinese characters take the codes of the first two basic components or take the codes of the first two basic components is not specified, because the specification can lead the code taking rule of the shape part code to be simple and easy to remember on the surface, and a large number of repeated codes can be caused in practice, or the cost of adding a large number of multi-stroke components is increased. Why "Chinese characters with left and right structures are coded by the corresponding codes of the first basic component of the left part and the right part according to the writing sequence" can reduce repeated codes? Because Chinese character shape-pronunciation characters are the majority, the left side is often the radical, the right side is the pronunciation side, and the pronunciation side is often a single character representing sound. If the first and last basic elements are taken in the written order as in a general input method, there is a case where the first stroke of the radical is identical to the first stroke of the radical, which brings about a large number of repeated codes. In order to reduce the repeated codes, more radicals are needed to be selected, so that the memory is difficult. That is why "Chinese characters with non-left-right structure" are coded by taking the corresponding codes of the first and last basic components of the Chinese characters in the writing order. "do? The answer is also to reduce radicals, because the first strokes and the last strokes of the pronunciation are often different, for some Chinese characters with the same pronunciation, the left and right structure, the second code takes the first strokes of the pronunciation, the Chinese characters with the structure not left and right, and the second code takes the last strokes of the pronunciation, so that the codes of the two second codes are different, and the repeated codes can be generally avoided well. In addition, if the first two basic components are taken for coding the Chinese characters with the non-left-right structure according to the writing sequence, more repeated codes are easy to be caused, and because the first two basic components of many Chinese characters with the upper, lower and surrounding structures are the same, but the last basic component of the Chinese character is different, the repeated codes can be effectively reduced by taking the last Chinese character component of the second code according to the stroke order. It can be seen that the code fetch rule can effectively reduce repeated codes, so that compared with other input methods invented by other people, the radicals used by the invention are greatly reduced, what double strokes or three strokes are not needed, and the code fetch rule is a kilowatt-hour result, so that the repeated code rate in the commonly used 3775 Chinese characters is very low, and the repeated code rate in the national standard 6763 Chinese characters and the Xinhua dictionary is very low.
However, the coding rule is also to continuously distinguish whether a Chinese character is in a left-right structure or not during coding, and although whether a Chinese character is in a left-right structure or not is clear at a glance, whether a Chinese character is in a left-right structure or not is continuously distinguished during actual input of a long character, and thinking is still troublesome. In this way, when actually taking codes, the third rule of taking codes by the shape part codes, which is a method for thinking about burst inspiration in the first few days of dreams, needs to be used for twenty years: the first code of the shape code is: firstly, taking the code of the first basic component of the Chinese character according to the writing sequence regardless of pseudo-ginseng twenty-one. The second code of the shape part code adopts the code taking rule of burst inspiration when people feel semi-awake and semi-asleep when people feel meditation of me: starting from the right side of the first basic component of the Chinese character, scanning left to right or looking from left to right, if a vertical line can be used for cutting the strokes of the Chinese character, if the Chinese character can be divided into two parts, the Chinese character is in a left-right structure, the right part of the vertical line is the right part of the Chinese character, then the code of the first basic component of the right part of the Chinese character is obtained according to the writing sequence, if the Chinese character can not be divided into two parts by using a vertical line under the condition of not cutting the strokes, the lower half layer or the lower half layer of the Chinese character is scanned left to right, and then the code of the last basic component of the Chinese character according to the writing sequence or the corresponding code of the basic component of the right lower corner of the Chinese character is found out to code. The lower half of the Chinese character is specified to be scanned because the last basic component of the Chinese character is easily found. The lower half layer (or lower layer) of the Chinese character is specified to be scanned from left to right because the scanning direction of the Chinese character is the same as that of the Chinese character with a left-right structure, the scanning direction of the Chinese character is from left to right, and the line direction of the Chinese character is consistent. The method of scanning the Chinese character from left to right twice or scanning the lower half layer or the lower half part of the Chinese character from left to right twice is a great innovation in various input methods of Chinese characters.
The Chinese characters with left and right structures often have obvious gaps and are easy to separate, so that the second code can be scanned from left to right only from the right side of the first basic component of the Chinese character without vertical lines, the gaps of the left and right parts of the whole Chinese character are found, the parts on the right of the gaps are the right parts of the Chinese character, the code codes of the first basic component of the right parts of the Chinese character are taken according to the writing sequence, if the Chinese character does not have the gaps on the left and right, the lower half layer (or the lower half layer or the lower layer part) of the Chinese character is scanned or seen from left to right, and the code codes of the last basic component of the Chinese character according to the writing sequence are found.
Briefly, the first code of the shape code is: the code of the first basic component of the Chinese character is taken according to the writing sequence, namely, the first is taken. When the second code of the shape part code is fetched, the Chinese character is scanned from left to right, if the Chinese character is of a left-right structure, the code of the first basic component of the right part of the Chinese character can be found and fetched according to the writing sequence. If the right part of the Chinese character can not be found, the lower half layer of the Chinese character is scanned from left to right, and the code of the last basic component of the Chinese character in the writing sequence is found. The search of the right lower corner of the Chinese character is not needed to be directly carried out as before, and the thinking is easy to be confused. Briefly, the method of shape encoding the second code is: scanning from left to right, the writing sequence takes the code of the first basic component of the right part of the Chinese character, which is abbreviated as right. If the right part cannot be found, the Chinese character is scanned from the left to the right, and the code of the last basic component of the Chinese character according to the writing sequence is found, namely the code is simply marked as the non-right code, and the last code is taken. In general, the shape part codes the second code can be simply marked as left and right scanning, and the right is right; scanning left and right without taking out the tail right; i.e. not right, and then take the end. The code taking rule of the whole shape part code can be simply recorded as the first right and the last right; or simply the first code is first taken and the second code is not right taken.
Note that when some Chinese characters such as the radicals are "gate" or the lower half of the Chinese characters are "heart, ", "front two strokes of the teacher" and the front three strokes of the "forward" are encountered, they can be regarded as integral parts, and no vertical line segmentation is necessary. The last stroke of the vast majority of Chinese characters is arranged at the lower layer or the lower part of the Chinese characters, when the last Chinese character component in the writing sequence in the Chinese characters is a Chinese character component such as 'Fu, canine, go, yi', and the like, the last basic component is not arranged at the lower half layer of the Chinese characters according to the stroke order, at this time, the second code can be used for coding the code of the last stroke point according to the stroke order, and the last stroke point can be also used for coding after the last stroke point is removed, and the code of the last stroke point can be used for coding, so that the repeated code is hardly influenced, namely the high definition of the error-tolerant code of the invention.
The code-taking rule of the shape part code can show that the Chinese characters with the non-left-right structure are slightly inconvenient to be compared with the Chinese characters with the left-right structure, because the Chinese characters with the left-right structure only need to be scanned once from left to right, and the Chinese characters with the non-left-right structure need to be scanned once from left to right again from the lower half part of the Chinese characters. The present invention has innovated. That is, the Chinese characters with the structure not in the left and right are preferentially simplified codes, even though the common frequency is far less than that of the Chinese characters with the structure in the left and right. That is, when the first codes of the shape codes of the Chinese characters with the non-left and right structures are the same as the first codes of the shape codes of the Chinese characters with the left and right structures, the Chinese characters with the non-left and right structures take the shorthand codes preferentially, and only after the sound codes of the Chinese characters are input, the first codes of the shape codes are input, and the space key is tapped, so that the Chinese characters with the left and right structures can be input. This provision has the advantage that the chinese characters of non-left-right structure, because they are shortcodes, do not have to be scanned once again from the left to the right from the lower half of the chinese character.
Incidentally, when the corresponding code of the last basic component of the Chinese character is coded according to the writing order or the corresponding code of the basic component of the right lower corner of the Chinese character is coded, most Chinese characters are coded identically, but the last basic component of a few Chinese characters is not in the right lower corner, but in other positions, from the view of searching, the basic component of the right lower corner is more convenient, but the right lower corner of some Chinese characters is not obvious, at this time, the corresponding code of the last basic component of the Chinese character is better according to the writing order, and the method of processing by the user is to give error-tolerant codes, namely, the last basic component of the Chinese character is coded according to the writing order or the corresponding code of the basic component of the right lower corner of the Chinese character is taken.
The inventor also found that after the combined word is divided into two, the first stroke of the part of the combined word with homophones and radicals except the radicals is the same kind of basic strokes, which is unexpectedly less, only two hundred pairs, namely the duplicate rate will be very low, and the finding and creative code-fetching rule is that only 5 basic strokes and about 21 basic parts are selected to participate in coding. The components and radicals are strictly different, but the components adopted by the invention are common, so the components are simply called radicals.
In the earliest Chinese character code input method, 28 basic components are selected in the input method which is called right-left upper input method, so that a plurality of basic components are conveniently memorized, but a plurality of radicals with the same pinyin initial letters are adopted, which is according to the initial, which is not according to the initial, and has no clear standard, wherein the radicals with the same homonym are mainly concentrated on's, h, r, y, z, c ', the radicals with the same pinyin initial letters as s are provided with ' fire and grass ' respectively, the radicals with the same pinyin initial letters as h are provided with ' fire and grass ' respectively, the radicals with the same pinyin initial letters as r are provided with ' fire and day ', the radicals with the same pinyin initial letters as y are provided with ' bamboo, foot and head ', and the pinyin initial letters as c ' respectively, and the radicals with the same pinyin initial letters as z or the multiple parts are provided with ' fire and grass '. When the multi-stroke components are arranged in the original Chinese character code input method, the multi-stroke components are arranged according to pinyin or pictographic instead of the stroke numbers and the sequences of horizontal, vertical, skimming, dot and folding. When the pinyin initial letters of the basic components are arranged, repeated codes are avoided. The basic components with the same initial or initial of the other pinyin are arranged in a pictographic mode. The square stroke components of the Chinese characters can be distinguished from western letters after all, and are difficult to be very similar, and the misunderstanding is somewhat strong. In order to avoid homophones, the data are encoded according to the pronunciation of the point, the data are encoded according to the shape like F, other homophones have similar implications for radicals. The inventor realizes the problem in the Chinese character codes of the prior invention, but the method is a novel method for arranging homophonic radicals, namely a homophonic near-position method on a keyboard, through the bitter grope and sudden inspiration of the last ten years. Namely, when encountering a plurality of multi-stroke components with the same initial consonant or pinyin initial, selecting one of the multi-stroke components easy to memorize to encode according to the initial consonant or the pinyin initial, so that the multi-stroke component is called a captain, and the rest multi-stroke components are called members, wherein the positions of the members beside the key positions of the captain are arranged, generally the positions on the left or the right of the key positions, and generally the adjacent key positions are arranged according to the keyboard layout. That is, the basic components of Chinese characters with initial consonants or pinyin initial letters are generally arranged in parallel on the same row of the keyboard and are arranged left and right, and of course, when strokes or other multi-stroke components are encountered, the basic components are separated by the strokes or other multi-stroke components, but are just left or right of the key positions. Therefore, the device is firmly positioned, obviously is very easy to find and memorize, is easier to memorize than the arrangement modes such as shape, strokes, pithy formula and the like, and is a global significant initiative. However, there is no quantitative calculation on what is the captain and what is the team member, and the input method at that time is not a precedent of the quantitative calculation. The same sound means that the pinyin initial consonants or pinyin initial letters are the same, the near positions are relatively close to each other on the keyboard, and many keys are directly adjacent.
In the original Chinese character code, because of skimming and folding are very commonly used, the initial coding of pinyin is inconvenient for keystroke, so five basic strokes, i.e. horizontal strokes, vertical strokes, first strokes and first strokes, are respectively used for representing A, O, E, I, U, but after deep consideration, the initial coding of the horizontal strokes, vertical strokes, skimming strokes, dot strokes and folding strokes is actually better, because simpler and more accordant with thinking habits, the original invention considers that the initial coding of the horizontal strokes, vertical strokes, skimming strokes, dot strokes (right strokes) and folding strokes is not beneficial to word repetition codes when the first codes are coded by H, S, P, D, Z respectively, but when the second codes are coded by the shape parts, the repeated word codes are much less than the initial coding of the horizontal strokes, vertical strokes, skimming strokes, dot strokes (right strokes) and folding strokes by ingenious treatment when the second codes are coded by vowels E, I, A, O, U respectively. The reason is that the original Chinese character codes are coded by vowels E, I, A, O, U respectively due to the horizontal, vertical, skimming, dot (right falling) and folding, and the result is that the second codes coded by the horizontal, vertical, skimming, dot (right falling) and folding at the shape part have high frequency, so that a large number of words are repeated, and in the Chinese character codes of the new improved invention, on the basis of homophonic near-position method arrangement, the shape part of the Chinese character is specially used for coding the first code, but the radicals such as the first code, the second code, the third code and the fourth code are rarely used for coding the final letters E, I, A, O, U of the Chinese characters, so that the repeated codes of words are effectively avoided, and the invention is greatly improved and perfected.
The quantitative analysis is a remarkable progress of the new simultaneous-pronunciation near-position Chinese character code input method relative to the original phoneme simultaneous-pronunciation near-position Chinese character code input method, and through quantitative calculation, 21 Chinese character multi-stroke components are optimized and accurately positioned on a keyboard. The following is a specific explanation: in GB6763 Chinese characters, the character frequency of the Chinese characters in the character radicals of the Chinese characters of the cross, the mouth, the wood, the white, the red, the blue and the white is very high, more than three hundred Chinese characters can be formed, if the strokes are coded, a large number of repeated codes are generated, so that the Chinese characters are selected to be arranged on the keys, and are respectively coded by one letter. The multi-stroke parts or radicals of 'worm, female and month' can also form 250 Chinese characters, the first few strokes of 'worm' are 'mouths', and in order to avoid encoding 'worm' and 'foot' as 'mouths', a large number of repeated codes are caused, and 'worm' and 'foot' are selected and encoded by a certain letter. If the female and the month are coded according to strokes, four and fifty repeated codes are also brought, and the female and the month are also selected and respectively coded by another letter. Radicals , fire, bar, stone, etc. have slightly less word forming capability, about 200 pairs, if they are coded in strokes, wherein multi-stroke part can bring about 40 pairs of codes; the fire energy brings about 40 pairs of weight codes; can bring about 36 pairs of codes; one energy band 41 pairs of codes; the stone and king can bring 35 pairs of codes; the radicals 'day' and 'foot' can respectively bring 40 pairs and 36 pairs of repeated codes; the radicals "the" can bring about approximately 30 pairs, respectively, and the repetition code can bring about more than 20 pairs. The radicals, such as the first, second, third, fourth, fifth, sixth, seventh, etc. are also encoded with one letter according to the size of the avoid-repeated-code capability, the first, second, fifth, sixth, etc. the characters with the radicals being the first, fifth, sixth, etc. are encoded with one letter according to the size of the avoid-repeated-code capability, the first, fifth, sixth, etc. the avoid-repeated-code capability of the vector is close to , the avoid-repeated-code capability is slightly higher than when the characters with the radicals being the second, and the frequency of use of the characters with the radicals being the third, etc. is higher than the frequency of use of the characters with the radicals being the fourth, at this time, the characters with the radicals being the third, etc. are encoded with one letter, and the characters with the fifth, etc. are encoded according to strokes. Thus 21 radicals have been selected, each encoded with a letter. The Chinese characters formed by the radicals 'fish' are often in the form of word groups of 'fish', such as 'carp', 'silver carp', etc., so that the actual use frequency of single characters is very low, and only 21 radicals are selected, therefore, the Chinese characters are slightly abandoned. Other radicals such as  and the vector (cave) and mountain have a large number of repeated codes, when the number of Chinese characters is expanded to the Xinhua dictionary,  can generate 30 pairs of repeated codes, the vector together with the 'cave' can generate 30 pairs of repeated codes, and the mountain can generate nearly 30 pairs of repeated codes. If the number of the radicals is determined according to the number of the heavy codes, the selection and the disjunction become difficult because the numbers of the heavy codes of the radicals , the generated holes and the mountains are close to those of the feet, the stones and the , and the number of the heavy codes generated by the radicals , the grass, the birds and the like is smaller, the can generate 15 heavy codes, the 'grass' can generate 16 heavy codes, the 'bird' can generate 24 heavy codes and the like can be generally omitted.
In order to solve the difficult problems that radicals , closed (holes), mountains, feet, stones and with the number of the repeated codes being 30 are relatively close and difficult to be selected and closed, the inventor takes a year of time to intensively study, finally decides to base on a word frequency table of Beijing university and a word frequency summary of Beijing university and a word frequency table of diced bean 6763 (10 hundred million), statistics , producing holes, mountain, foot, stone, chinese, , shi, , girl, king, day, person ((the bottom of the right)), soil, earth, moon, and white, wherein radicals such as fire, gold, red, insect, wood and the like appear in Chinese character codes, when repeated codes occur, recording the frequency number of the Chinese character and the frequency number of one or more Chinese characters with the repeated codes of the Chinese character, carrying out statistics addition according to the radicals classification to obtain frequency sums, selecting high frequency sums as much as possible, the method can reduce homophone selection and increase speed, takes word frequency total of Beijing university of word and word respectively counted as an example, and is calculated and found that multi-stroke parts such as daily, people (with the name), earth, relative, moon, relative, fire, gold, relative, worm, wood and the like participate in the encoded Chinese characters, if the encoding is carried out according to the strokes, the frequency sum of the Chinese characters with repeated codes and the frequency sum of other Chinese characters with repeated codes are very high, so that the selection is carried out, and the frequency sum of the other Chinese characters with repeated codes is 1209, 1916, 523, 563, 363 and 229. Thus, the code number of the code is equal to 316 thousand, the code number of the code is equal to 502 thousand, the code number of the code of the character of the foot is equal to 502 thousand, the code number of the code of the character of the shape, the code of the radical and the code of the character of the head are equal to each other, the code of the component is recommended to be selected, and the code of the stone is selected, of course, the frequency difference between the code of the character of the foot and the code of the character of the foot is very large, such as the frequency of the component stone can be selected, and the frequency of the component is not selected. Of course, the stone and the foot can be selected simultaneously, only two radicals exist on one key, one radical is not corresponding to one letter, and the display on the mobile phone screen is inconvenient.
For the convenience of memorizing, five basic strokes, king, earth, rare earth, mouth, , , female, wood and other multi-stroke parts are all encoded according to the initial letters of pinyin, the other multi-stroke parts are arranged and encoded according to the same-sound near-position method, and particularly for the convenience of memorizing, the concept of the initial letters of the finals is introduced on the basis of not greatly influencing the repeated codes of words, namely 'people, months, chinese character' are encoded according to the initial letters E, U, I, O of the finals, while the Chinese character is the 'Chinese character' although the sound is the 'Chinese character', and three-point water is read frequently, so that the initial letter A of the pinyin san of 'three' is taken. This facilitates memory, but is also based on quantitative calculations. The following is a specific explanation:
the quantitative analysis and calculation arrangement mapping are also carried out when radicals with the same initial consonant or the same pinyin initial are arranged according to the method of the same sound near position. All the Chinese characters appearing in Xinhua dictionary app are selected. Some radicals have strong character-forming capability and high use frequency, but are unevenly distributed in syllables of initials or finals where 26 letters are located, and in pinyin syllables where some initials and finals are located, the number of Chinese characters where the radicals are located is small, and if the radicals are coded by a specific letter, word repeated codes can be effectively avoided. This principle is the theoretical basis for quantitative calculation.
The near bits are ordered: i.e., Q to P, to a to L, to Z to M, and back to Q on the keyboard. The keyboard is arranged from left to right, from Q to P, then to the keyboard, and from left to right, from A to L. Then to the lower row of the keyboard, from left to right, from Z to M, and then back to the Q key. Because the only multi-stroke parts of the pinyin initials such as horizontal, vertical, skimming, dot, folding and king, earth, tie, mouth, , , girl, wood and the like are specified to be encoded by the pinyin initials, only thirteen stroke radicals are left in arrangement according to the homophonic near-position method. The components have high word-forming frequency, but the initial letters of the pinyin are the same, so that a large number of repeated codes are caused, and the method is a reason that the auxiliary codes of input methods such as a certain input method are difficult to increase the speed and cause failure. Wherein, the first letter of Pinyin of the first letter of the second letter of the first letter is r, y, h, c, z, s. The first letters of the pinyin for the two radicals are the same, and the first letters of the pinyin for the two radicals are arranged on the keyboard according to the left and right adjacent positions so as to facilitate the memory. The following explains the multi-stroke components with the same initial in terms of the sequence c, h, r, s, y, z of the initial.
The initial letters of Chinese phonetic alphabets of the Chinese and the insect are c, and according to the homophonic near-position method, the Chinese phonetic alphabets can be arranged on two adjacent keys of c and v, because v is a final, the Chinese phonetic alphabets are rare, so that only the number of Chinese characters which appear in the initial letters of the Chinese phonetic alphabets c is considered, 3 insects appear, the frequency sum is lower, 11 insects appear, the frequency sum is higher, and because v is taken as the Chinese character prefixThe final, so in order to avoid the duplicate code of the word, so propose using the frequency and lower multi-stroke part, in the Chinese character of common use, appear in the shape and encode the number and frequency sum calculation of the group number of Chinese character of the second code, it is lower that the worm, the higher the level of the loaf. Thus, insects are coded v, c, and may be abbreviated as meadow insects. The Pinyin of (a) is cao, the Pinyin of the insects is chong, and the arrangement of the Pinyin is c-code, and the insects are v-code.
The pinyin initials of the stroke transverse and radical fire are H, and the fire can only be coded by the adjacent G on the left side of the H key because J is arranged for radical fire.
And r of the initial phonetic letters of Rihe and Yuan can be arranged on two keys which are adjacent left and right and are only arranged on e and r according to the synchronous near-position method. From the viewpoint of avoiding word repeated codes, the number and frequency sum of the daily sum and the alpha sum of basic components in Chinese characters with the vowels being ue and ie are counted, and the number and frequency sum of the Chinese characters appearing in the daily sum of the basic components are very low and very close to each other because the number and the frequency sum of the Chinese characters appearing in the daily sum of the basic components are very low, so that the frequency sum or the frequency sum of the Chinese characters in the second codes of the basic components and the daily shape codes are required to be considered, and the E is a vowel. The statistics shows that the number of Chinese characters appearing on the basic component 'day' is 218, the frequency sum is higher, and the number of Chinese characters appearing on the alpha-containing person is only 5, the frequency sum is lower. The number, frequency and consideration of Chinese characters which appear in the shape part and code the second code in the common Chinese characters are also that the multi-stroke part 'day' is higher, the multi-stroke part 'alpha' contains people less, and the frequency of the second code which appears in the vowel is lower when the alpha contains no people. Only the base component with the lower frequency number and the lower frequency number in the shape code second code can be encoded with E, while the base component with the higher frequency number and the higher frequency number in the shape code second code is encoded with the pinyin initials r. And the pinyin is REN, and the initial of the final is just E, so that the memory is convenient.
The initial letters of pinyin of vertical, horizontal, vertical and horizontal are s, and the input methods such as many input methods are s-coded, so that a large number of repeated codes are caused, and the coding fails. From the keyboard layout, the basic stroke verticals are very common, the verticals are of course coded with S-keys, and the I-key, O-key, a-key can be seen as being adjacent to the S-key. The brush, the san may be arranged to the I, O, A key. Therefore, the quantitative calculation is carried out by the inventor through operation research. Of the Chinese characters with the initial of a, 1 Chinese character with the component, the frequency is 5920, 2 Chinese characters with the component, the frequency sum is 64779, so that the Chinese characters with the initial of a are better coded by a, and the Chinese characters with the initial of o are 1, and the Chinese characters with the component are not, so that the coding by o is more proper. In the Chinese characters at the beginning of the pinyin initial letters i, o and a, no Chinese characters exist, the letters o and a are coded by the Chinese characters respectively, and the rest i is coded by comprehensively considering. I is highest, a times is lowest, o is lowest, and from the point of view of the frequency of radicals, i is lowest, and from the point of view of the repetition of the code words, v is highest, the vowels with high frequency are suitable for matching the multi-stroke parts or radicals with low frequency, and the vowels with low frequency are suitable for matching the multi-stroke parts or radicals with high frequency, so that i is suitable for coding, o is suitable for coding, and a is suitable for coding. And the Zhi, zhi and san are just coded by the initials i, o and a respectively, so that the Chinese character can be easily memorized.
Alternatively, the memory may be: the spelling of the character (S) is si, two letters is shou, four letters is shou, and the spelling of the character (S) is shoi, so that the characters can be arranged in the middle row of the keyboard from left to right according to the number of the letters, and the letters are arranged in the order of sound when the number of the letters is the same. The primary, secondary and tertiary amino acids are respectively arranged on the i, o and a bonds and are respectively coded by corresponding letters.
The initial letters of the Pinyin of the moon and the moon are y, and the Chinese phonetic alphabets are arranged on two adjacent keys of y and u according to the synchronous near-position method. From the viewpoint of avoiding word duplication, the frequency or frequency sum of Chinese characters with the basic components of 'moon' sum appearing at vowels of iu or ou needs to be considered. The number of Chinese characters appearing in the prefix of the basic component 'moon' is 2, and the number of Chinese characters appearing in the prefix is 8. The frequency sum (sum of using frequency) of the Chinese characters is also that of the Chinese characters in the month part is relatively low, so that the U code is used for the basic part month, and the y code is suitable for the user. At this time, the input shape part encoding hardly generates the repetition code. In the Chinese characters with initial consonants of y, the number of Chinese characters with the prefix of 'month' and 'month' is 10, the number of Chinese characters with the prefix of 'month' is 15, and the sum of frequency number and even frequency of use is higher, so that the basic component is coded by y, the basic component is coded by u, and u is just the initial of 'month', so that the Chinese characters are easy to memorize. In addition, from the order perspective, the pinyin is yan, the pinyin of the month is yue, and the basic component is y-coded and the basic component is u-coded if the pinyin is arranged from left to right in the order.
The first letter of pinyin for Zhan, chi and Zhu is z, and the stroke is very common, of course denoted by z. According to the simultaneous approach, the feet and the bamboo can only be arranged on the left l, Q and F keys, wherein l and z are respectively positioned on the rightmost side of the second row and the leftmost side of the third row of the keyboard and can be regarded as the approach, and letters on the right side of the lower row z of the keyboard are arranged with radicals, so that the Q key and the F key on the upper row of the keyboard are barely close according to the simultaneous approach rule. The initial L is arranged first because the frequency of the initial L in Chinese is much more common than that of the initial F and Q. And can only appear at the end of Chinese characters, the sum of the frequency numbers is 0. When the initial letter of the pinyin is L, the number of Chinese characters in the initial is 7, the sum of the numbers is 109352, and the number of Chinese characters in the initial is 12, and the sum of the numbers is 16734. It can be seen that neither foot nor bamboo is well suited for encoding with L. From the viewpoint of avoiding word repetition, the word repetition can be made to be 0 by encoding with L. This is a very elegant arrangement. The number of Chinese characters with the prefix being sufficient and the pinyin initial being F is 3, and the sum of frequency numbers is 293; the number of Chinese characters with the prefix of bamboo and the initial of Pinyin of F is 3, and the sum of frequency numbers is 16022. The number of Chinese characters with the prefix of the Chinese character is enough, the number of the pinyin initial is Q, and the sum of the frequency numbers is 7626; the Chinese characters with the prefix of bamboo and the initial of Pinyin of Q have 5 Chinese characters and the sum of frequency numbers is 9664. From the number, the feet and the bamboos are fewer and close on the keys with the initial letters of F and Q, from the frequency sum, the Chinese characters with the initial letters of Q and F are more in bamboo radicals, so that the feet and the bamboos can be selected from either one of the F and the Q, and from the viewpoint of convenience in keystroke, the bamboos are encoded by the F, and the feet are encoded by the Q. Of course, the bamboo is Q-coded and the foot is F-coded. From the memory perspective, the first letter of Pinyin is z, which can only be arranged according to strokes, the first pen of foot is vertical, the first pen of bamboo is prime, the first pen of N is dot, the first pen of N is vertical, prime and dot are arranged in sequence, and the sequence from left to right on the keyboard is Q, F, L, so that the foot, bamboo and Chinese are respectively coded by Q, F, L on Q, F, L keys from left to right according to the first pen of foot, prime and dot respectively. From the similar perspective, the head and tail of the foot are similar to Q, the head part of the foot is also similar to Q in lowercase, the left half part or the right half part of the bamboo is similar to F, the head part or the right half part of the bamboo is also similar to L, and the foot is easy to memorize. Of course, the number of letters and the order of the order can be also selected, the pinyin of the feet is composed of two letters, so the letters are arranged on the leftmost q of the keyboard, the pinyin of the bamboos is zhu, the pinyin of the one-pair bamboos is three letters, the bamboos and the one-pair bamboos are respectively arranged on the f and the l keys from left to right according to the order of the voice, and the corresponding letters are respectively used for coding.
Since the alpha almost only appears at the prefix, the second code is rarely encoded at the shape part, and only 5 alpha of the area appears at the second code, the repeated codes of words can be greatly avoided, and the vowel just the part is E, so that the vowel is easy to memorize. Since the y codes are almost only present in the prefix, the second codes are rarely present in the shape code, and the number of the second codes is not large even if 19 codes are present in the shape code in the traditional Chinese language, the word repetition code can be well avoided. The Chinese character is almost only present at the prefix, the second code is rarely present at the shape part, and the Chinese character of vowel v (u) is rarely present at low frequency, so that the word repeated code can be greatly avoided.
If the alpha is changed to the alpha, r coding is used, and the daily e coding is consistent with the synchronous near-position arrangement. If instead, u is used for coding, y is used for coding, and the code also accords with the synchronous near bit. Is coded by v, and the coding by c is also coincident with the co-sound near bit. Similarly, mapping on the i, o, a bond can be interchanged. But is less suitable from the point of view of encoding word repetition codes.
The Chinese character phonetic alphabet can be uniformly sequenced from left to right based on homophones and the near positions, the phonetic alphabets are uniformly sequenced from left to right based on the number of the phonetic alphabets of the radicals, and the phonetic alphabets are simultaneously sequenced according to the phonetic alphabets, so that the character is coded by r instead of r, E instead of daily, the rest of the character is unchanged, the character and word repeated codes are not reduced, and the character and the near characters are coded by the initial letters of the phonetic alphabets.
If the number of radicals is not considered on the basis of the same sound near position, the radicals are arranged according to the sound sequence, at the moment, the alpha is coded by e, the gamma is coded by daily r, the gamma is coded by i, the gamma is coded by o, the gamma is coded by a, the fire is coded by g, the lambda is coded by c, the v is coded by insects, the bamboo is coded by q, the gamma is coded by f, and the foot is coded by l.
It can be seen that in the input method, the 8 multi-stroke parts of 5 strokes, king, earth, jaw, mouth, , , female and wood are all encoded according to the pinyin initial, the day, the user and the insect are also encoded according to the pinyin initial, so that the total 16 stroke parts are all encoded according to the pinyin initial, only 10 multi-stroke radicals which are actually required to be memorized are arranged according to the method of the homophonic (the pinyin initial) near position (the position on the keyboard is adjacent), and basically the number of the letters of the pinyin of the radicals is the same, the letters are arranged according to the order, the strokes and the shapes are also considered, and the memory is very easy. The five multi-stroke components alpha, month and scale are coded according to the initial letters of the vowels, are arranged according to the homophonic near positions, and are actually arranged according to the homophonic near positions, wherein the 5 components are arranged according to the letter number and English sound sequence on the basis of the homophonic near positions, so that the five multi-stroke components alpha, month and scale are easy to memorize. In order to further shorten the memorizing time, the user also compiles a pithy formula, namely, the basic strokes are captain, the basic strokes are preferentially encoded by the phonetic initial letters, in syllables with the same phonetic initial letters, the date, the Chinese character, the end and the transverse line are captain, the person, the month, the insect and the fire are captain, the soldier are arranged on the left and right adjacent key positions of the captain, the Philippine insect and the Chinese character, the left and right partition walls are arranged on the c and v keys, and the memorizing is facilitated. Folding, folding feet and forming a team with bamboo, wherein the team leader is folding, and the pair of feet and bamboo is teammate. Erects, moves into a team, and moves into the team leader, moves. The coding rule of the shape part coding is simply recorded as' first and last without right, which is easy to memorize. So that a typical person can remember for three or five minutes.
Through optimizing about 21 multi-stroke parts and five basic strokes, creatively prescribes a code-taking rule of a second code of the shape part code, creatively adopts a homophone near-position method to arrange the multi-stroke parts and the basic strokes and creatively carries out quantitative calculation and accurate positioning, the shape part code is simple and easy to memorize, homophones can be effectively distinguished, the duplicate rate in 3500 Chinese characters commonly used and 6763 Chinese characters commonly used in national standard is very low, and the input speed can be compared with input methods such as five-stroke characters. The method solves the difficult problem that any other input method cannot solve, really achieves the advantages of simplicity, intuitiveness, low duplicate rate, high input speed, almost no duplicate rate by adopting the technologies of artificial intelligence, search engines and the like, can be compatible with the most popular pinyin input method or phonetic transcription input method, and is the only ideal perfect Chinese character input method which can be popularized to students in middle and primary schools.
Some radicals are common, but because of the lower repetition code, only more than 10 repetition codes can be reduced, and only 26 key positions are not selected. But if these components are chosen it would be beneficial for some people who pursue typing speed. In the new invention, the selected parts are double coded, namely, the parts can be coded according to strokes or radicals, and the parts are inconvenient to display on small-screen keyboards of mobile phones and the like. These several components are referred to as dual components or virtual components, and may also be referred to as dual radicals or virtual radicals. The virtual component is called because it does not appear on the letter keys of a small screen such as a cell phone, but can be encoded with punctuation keys. That is, the double component can be coded by strokes or punctuation keys. The character forming capability of the fish is strong, and 24 repeated codes can be avoided and are arranged in the row; "on the bond, use"; "code". The mountain, and the grass are arranged in the groups of ", respectively according to the frequency of the radicals. The ","/"keys are used respectively. ","/"are coded, see FIG. 2.". "and", "are the same key, the same code, but are used for clarity. ".
As a variant of the invention, it is also possible to replace the feet or the bamboo with stones, or to arrange the component stones on the L-bond, since the radical on the L-bond only occurs at the end of the letter. However, the L key is not known, some radicals are quite common, the repeated codes are low, only 10 times of repeated codes can be reduced, and only 26 key positions are not selected, but if the parts are selected, people seeking typing speed are facilitated, in the new inventor, a plurality of parts can be subjected to double coding, namely, coding according to strokes or component radicals, and are not displayed on a small-screen keyboard of a mobile phone and the like, and the parts become double parts or virtual parts because the parts do not appear on letter parts of the small-screen of the mobile phone and the like, but can be coded by punctuation marks. For example, 5 multi-stroke parts can be added on the figure 1, and the added multi-stroke parts are not displayed on the screen of the mobile phone, so that the user can keep in mind with high hands. One possible arrangement is that "stone" is encoded with "l" and "fish"; "coding," mountain, , grass are used respectively, ". ","/"is encoded. Fig. 5, fig. 6, fig. 7 and fig. 8 show the arrangement and mapping modes of strokes of other parts on the keyboard. The features of these figures are that individual, relatively infrequently-used, closely-frequency multi-stroke components may be replaced. It should be noted that fig. 3, fig. 4, fig. 5, fig. 6, fig. 7, fig. 8 are only enumerated examples, and are variations of the present invention.
The shape part code is used as an auxiliary code, the word compatibility problem is fully considered in the invention, the artificial intelligence and search engine technology is adopted and used as a direct auxiliary code, and the key such as tab is not needed to be pressed, so that the number of key strokes can be reduced, and the indirect auxiliary code can be used as an indirect auxiliary code. The indirect auxiliary code suggests to press the tab key, and the single-word repetition rate is also high because of more word repetition codes.
Some input method manager thinks that one minute is not willing to learn, and the actual pinyin is to be learned,encoded with v. For a person who manages an input method, the person can learn the Chinese character for one minute, the radical cannot be used for one minute, only the basic strokes can be used, the first code-taking rule of the simple auxiliary code is still unchanged, namely 'first and no right end', and the method for inputting Chinese characters is also provided. The horizontal, vertical, left-falling, dot and fold are respectively coded by the initial letters of the pinyin, namely, after the codes of the first basic strokes of a certain Chinese character are input, the Chinese character is scanned left and right, and if the Chinese character is in a left-right structure, the codes of the first basic strokes of the right part of the Chinese character are taken according to the writing sequence. If there is no right part, the code of the last basic stroke of the Chinese character is taken according to the writing order. The coding is so that the repeated codes can be greatly improved to influence the typing speed, and the most likely Chinese characters to be input can appear in the first few bits of the candidate characters through artificial intelligence and search engine technology, so that the adverse influence of the repeated codes is eliminated to the greatest extent.
When phonetic input is adopted for the phonetic code part, unrecognized Chinese characters can be encountered, so that the invention provides a quick input method based on the shape part coding keyboard arrangement diagram, namely, the shape part coding keyboard arrangement diagram can be selected from one of the accompanying drawings of 1, 2, 3, 4, 5, 6, 7 and 8, and can not be changed once selected. Fig. 1 is generally selected. Taking fig. 1 as an example, when inputting, codes corresponding to the basic strokes and the multi-stroke components of the Chinese character are sequentially input according to the stroke order and the principle of taking the large priority by combining the basic strokes and the multi-stroke components in fig. 1, and the required Chinese character is selected according to the prompt line. Of course, u can be added in front, then the Chinese characters are input in sequence according to the stroke order and combined with the basic strokes and multi-stroke components in the figure 1, and the required Chinese characters are selected according to the prompt lines.
Because the phonetic code part adopts the full spelling, the code length is longer, and the vowel tail ng has room for improvement, the invention creatively uses ng as v, because the vowel is not influencedDenoted by v. So as to shorten the code length and improve the input speed. Meanwhile, people who do not want to express ng by v can still adopt full spelling, so that perfect compatibility is achieved.
Because the phrase is basically input according to the phonetic code, in order to solve the problem that the homonym is serious when the phonetic code is input, the invention refers to the auxiliary code on the basis of the new homonym near-position Chinese character code input method, the trouble of homonym selection can be basically avoided only by inputting the auxiliary codes of the first word and the second word of a homonym or the first code of the shape part code after the encoding of a certain homonym, generally, the trouble of homonym selection can be generally avoided only by inputting the first code of the auxiliary code of the first word of the homonym, one homonym which is most commonly used in a plurality of homonyms can be encoded according to pinyin, the first code of the auxiliary code of the first word can be obtained by the other homonyms, and thus the homonym can be effectively distinguished, and the first code of the auxiliary code of the second word of the homonym can be obtained if the homonym is still heavy. Of course, in some homophones, sometimes an auxiliary code is added, so that different pinyin is changed, for example, the pinyin of a certain homophone is zhidu, and if the first code of the auxiliary code of a certain homophone is i or o, the auxiliary code of a certain homophone is input again into i or o, the auxiliary code of a certain homophone is changed into zhiduo or zhidui. To avoid syllable collision, a somewhat less common homonym may also be selected, which may not be added to the first code of the auxiliary code of its first word. The homonym is generally fully spelled, and the first codes of the auxiliary codes of the first word and the second word of the homonym are input after the fully spelled homonym, so that homonym selection can be better avoided, and the first codes of the auxiliary codes of the first word, the second word and even the third word of the homonym can be input again under the condition of simple spelled, so that the trouble problem of homonym selection can be effectively avoided, but the effect is relatively weak.
Drawings
FIG. 1 is a diagram of one of the arrangement of a shape-coded keyboard
FIG. 2 is a diagram showing a second arrangement of a shape-coded keyboard
FIG. 3 is a third diagram of an arrangement of a shape-coded keyboard
FIG. 4 is a diagram showing a layout of a shape-coded keyboard
FIG. 5 is a diagram showing a layout of a shape-coded keyboard
FIG. 6 is a diagram of a six-part code keyboard arrangement
FIG. 7 is a diagram of a seven-segment code keyboard arrangement
FIG. 8 is an eighth diagram of a shape-coded keyboard arrangement
FIG. 9 is one of the mapping relationship diagrams of the phone letter vowels on the keyboard
FIG. 10 is a second mapping diagram of phone letter finals on a keyboard
Detailed Description
The new homophonic near-bit Chinese character code input method consists of two parts, one part is phonetic code, i.e. pronunciation code or phonetic code, and the other part is shape code, which is the auxiliary code in common input method. The two parts can be the prior sound code and the subsequent shape code when forming the code; the shape code may be preceded and the sound code may be followed. But once selected cannot be changed. To facilitate typing, consistent with thinking, prior pinyin is suggested for full compatibility with pinyin input methods, with shape encoding followed by such methods in the encoding example. The spelling can be full spelling, double spelling, simple spelling or no spelling The complete spelling, namely the standard spelling of a Chinese character, can also adopt the phonetic notation input method, pay attention to the part representing the tone in the phonetic notation input method to be removed, because the shape part code of the invention is much higher than the tone distinguishing and code-repeating capability. Double spelling is inconvenient to arrange and memorize due to 35 vowels, and cannot be popularized all the time. Therefore, in the new invention, a non-professional typist would not be favored for use with double spelling. The input speed is theoretically faster than that of the continental pinyin, the phonetic notation is not represented by Latin, and the vowel is not represented by phonemic. The phonemic letters of the invention have the same coding length and syllable expression mode as phonetic notation, but the initials are represented in a latinized mode, the finals are represented in a phonemic mode, the writing is simple, the phonemic letters are convenient to display on a small screen such as a mobile phone, the code length is shorter than the pinyin, the input speed is faster than the pinyin, and the defects of the phonemic letters and the phonetic alphabet are that if one phonemic letter is used, punctuation keys or numeric keys are needed, and the keystroke of a plurality of punctuation keys or numeric keys is slightly inconvenient. The initial consonant of a single letter of the phoneme letter is the same as the pinyin, the uptongue sounds in the pinyin can be arranged on the keys v, u and i, and because the invention can effectively avoid repeated codes, the uptongue sounds are not separated, and the repeated codes are very low as the same, the invention can be seen as the uptongue sounds which are not separated, namely, zh is coded by z, ch is coded by c and sh is coded by s. The vowels of the phonemic letters are also very simple, the phonemic vowels can be conveniently converted with the vowels in the Chinese phonetic scheme, so long as remember one one (I) a Chinese character of Fang, ka, And F, r respectively represent the letters e, i, a, o, u, n, ng, r of the vowels formed by the Chinese phonetic alphabet scheme, and then the Chinese phonetic alphabet scheme is written. />Or is represented by +..
A mapping relation diagram of the numeric keys of each letter punctuation mark on the english keyboard and the pinyin vowels and the phoneme letter vowels is shown in fig. 9:
in the drawings. The "is" key where ">".
The shape coding will be described in detail below.
The strokes of Chinese characters are classified into five basic strokes of horizontal, vertical, left-falling, dot and folding according to the rule of the Chinese language commission. The strokes are one line which is written once without interruption when writing Chinese characters, and can be classified into five basic strokes of transverse, vertical, skimming, point and folding, wherein the strokes are combined into the transverse, vertical and folding, the vertical hooks are combined into the vertical, the point is combined into the folding, and the rest various strokes with turning are combined into the folding. In order to reduce the repetition code, it is also preferable that about 21 Chinese character components with high word-combining frequency or practical frequency, which are composed of two or more strokes, are arranged on letter keys to participate in coding, because the number of the strokes is two or more, and the Chinese character components are called as multi-stroke components, or radicals, in the invention, so as to be different from single-stroke components, or basic strokes. The multi-stroke component and the single-stroke component are collectively referred to as basic components, sometimes simply referred to as components.
The code fetch rule of the first shape part code is: the individual words are coded by taking the corresponding codes of the first two basic components according to the writing sequence: or the corresponding code of the first or last basic component is taken according to the writing sequence, and when only one basic component is provided, the corresponding code of the basic component is taken only; the combined word is divided into two parts according to the integral structure, the first writing part is a head part, the second writing part is a residual part, and the corresponding codes of the first basic part of the head part and the first basic part of the residual part are respectively taken according to the writing sequence for coding.
In long-term coding research, the inventor long recognized that whether a Chinese character has a left-right structure is clear at a glance, the Chinese character with the left-right structure can be easily divided into two parts at a gap, and the Chinese character with an up-down and surrounding structure can not be easily divided into two parts, or even can not be easily divided into a single character or an up-down structure or a surrounding structure. The division according to whether a Chinese character is left-right structure is the simplest and easy to learn. When the Chinese characters with the left, middle and right structures are encountered, the middle and right parts are calculated as right parts or right parts. Strictly speaking, it is better to divide by whether the right is a word or not after division.
If all Chinese characters are divided into left-right structures and non-left-right structures, the Chinese characters can be encoded, and the Chinese characters can be still encoded by using the codes shown in the accompanying drawings 1 and 9, namely the selected pinyin, basic components and codes are unchanged. The code also consists of pinyin and shape codes. The code fetch rule of the second type of shape part code is: the Chinese characters with left and right structures are coded by the corresponding codes of the first basic component of the writing sequence of the left part and the right part respectively; the Chinese characters with non-left-right structure are obtained according to the writing sequence, the corresponding codes of the first and last basic components of the Chinese characters are obtained, and only one basic component is obtained, and the corresponding codes of the basic component are obtained or the codes of the basic component are obtained twice. At this time, the corresponding codes of the first two basic components of the Chinese character with the non-left-right structure cannot be obtained according to the writing order, because repeated codes are caused, and the corresponding codes of the first basic component and the last basic component of the Chinese character are obtained according to the writing order. Because it is clear whether a Chinese character is in a left-right structure or not, ambiguity is not generated, besides a few Chinese characters such as 'ShunChuan, zhou, er' and the like, the left-right part of the Chinese character in the left-right structure is easy to generate gaps, and only the Chinese character is divided into two parts by a vertical line according to the gaps. The Chinese characters with left and right structures sometimes meet the needs of individual Chinese characters such as 'Chuan', 'Shun', 'Zhou', which are formed by discrete strokes, and are regarded as independent characters, and 'Shun' is characterized in that a Chinese character component is added to the discrete strokes to form a Chinese character, the whole discrete strokes are generally suggested to calculate the left part, and the other Chinese character component calculates the right part, such as 'Shun', 'Chuan' is a left part and 'page' is a right part. Of course, the input method has great fault tolerance, and the left part is the horizontal character and the rest is the right part. In addition, "" cannot be bisected by a vertical line.
In order to reduce unnecessary repetition codes, for a few barycentric words, it may also be provided that the second code of the shape-coding is coded according to the code of the first or last basic element in which the barycenter is located, it being proposed that the code of the first basic element in order of strokes in which the barycenter is located is coded. The gravity center word refers to a special shape Chinese character with radicals in the middle or tail of the Chinese character, such as "winning", "carrying", "glume", "competition", and the like, and the second code of the shape part code can be coded according to the corresponding code of the basic component "woman" where the gravity center is located. Also, for example, a "fluorescent" character, the second code of the shape code may be encoded with the corresponding code of the "fire" of the base member where the center of gravity is located, because the portion of the "fluorescent" character that does not include the "fire" is in fact phonic. The center of gravity of the Chinese character with the left middle right structure and the left part identical to the right part is in the middle part, so that the shape part code second code can be code of the last basic component of the middle part. For example, the "distinctive" character, the two codes for the shape may be the "prime" code for the last basic component of the middle part. The Chinese characters of the bird parts of the left, middle and right structures are usually coded by the shape part or the gravity center of the auxiliary code at the bird part, and the second code is coded according to the gravity center. .
Since the last basic component of a Chinese character is basically at the lower layer of the Chinese character, the point of the upper right corner is the last component except for the Chinese character components such as 'Fu, ge' and the like which are specified in the writing order. Therefore, when the Chinese characters containing Chinese character components such as the Pu and the go are encountered, the point at the upper right corner can be ignored as error-tolerant codes, namely, the 'Pu and the go' respectively take the vertical stroke and the left stroke as the last stroke.
A preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 1. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
and respectively coding the multi-stroke part and the basic stroke with corresponding letters according to the set relation.
A preferred arrangement of 25 multi-stroke elements and five basic strokes on a keyboard is shown in fig. 2. One mapping relation between 25 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
a preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 3. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
/>
a preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 4. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
A preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 5. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
a preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 6. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
a preferred arrangement of 21 multi-stroke parts and five basic strokes on a keyboard is shown in fig. 7. One mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
the individual considers that fig. 7 is also feasible because the repetition rate of the multi-stroke part mountain and fire is relatively high, about 35 pairs are available, the repetition rate of the multi-stroke bamboo and the vector is about 25 pairs, and the repetition rate of the multi-stroke part grass and is only about 15 pairs, so that the multi-stroke part grass and , are also discarded, and the multi-stroke part mountain and fire are selected, but the use frequency of Chinese characters of part and bamboo part is far less than that of Chinese characters of part, and therefore 'y' is selected. At this time, according to the homophonic (homophonic initial) near-position method, the multi-stroke component mountain is arranged on the G key and is coded by G, the multi-stroke component is arranged on the L key mountain and is coded by L, and the code is coded by B.
A preferred arrangement of 25 multi-stroke elements and five basic strokes on a keyboard is shown in fig. 8. One mapping relation between 25 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
some components will change slightly after word formation, the complex shape will change, and must be considered as similar basic components, which are coded with the same letter, such as alpha and man, right and left, right and gold, right and water, right and left, right and hand, and heart, right and , fire and , etc. and are characterized by homology. And the rise is approximated. Some parts of the original input method are such as # -containing people, containing heart, san containing water and white, and the same source is easy to remember, but because of different shapes, some people indicate inconvenient memory, and on the contrary, for the Chinese characters with only complex and simple body distinction such as the Chinese words like the Chinese words of the King and the King, the positions in the Chinese characters are unchanged, and the Chinese characters indicate easy memory. Two fault tolerant codes are proposed, one is: the first place is easy to remember, in the new invention, basic components which are only distinguished by complex shapes are adopted, such as 'alpha' and 'human' are adopted in preference, the front part and water are adopted in preference, the rear part and hands are adopted in preference; and heart preferably use , fire and preferably use fire, and "human" homologous to the symbol, water homologous to the symbol, homologous to the fire, etc. will be compatible in the form of error-tolerant codes, i.e., the "human" may be encoded with the code of "alpha", but is assigned to the error-tolerant code. Similar other so-called homologous multi-stroke components will appear as error-tolerant codes. Another approach is to use the homologous code as a pair, the strokes of "human" as the fault tolerant code of "alpha", and so on. The basic components may also comprise individual components similar to those described above, and are encoded with the same letter. For example, the component "earth" may include "soldiers", and since the two components have only a fraction of stroke length, coding as the same component may be more consistent with the brainstorming habit. The term "unit" is also quite similar to the term "unit", and may be defined to include the unit, or may not be arranged.
The code is fetched according to the second shape part code, so that the problem of whether the Chinese characters with the left-right structure are needed to be distinguished continuously exists. Therefore, the code-taking rule according to the third shape part code is simpler and easier to memorize, and the code-taking rule is adopted in the code example and the code of the figure 1 is adopted. The phonetic code lists the pinyin and the phonetic letters for selection.
Encoding example: for example, the code of "Chinese" is h, the initial consonant is an, the vowel is an, the sound code portion is han, the first basic component of the shape code is a multi-stroke component of a Chinese character according to the writing sequence, the second code is a code of "folding" of the first stroke of the right part of the Chinese character according to the writing sequence, and the code of "folding" is z, so that the code of "Chinese" is "hanaz". The phoneme letter is "h ", and the corresponding position on the keyboard is "h". Thus, the "Chinese" code is "h, az". For example, when the code of the "word" is encoded, the phonetic code part is zi, the first basic component of the "word" according to the order of strokes is a vector, the code is b, the Chinese character is a Chinese character with a non-left-right structure, the code h of the last basic component of the "word" is taken according to the writing sequence, then the code of the "word" is "zibh", and if the phoneme letter is adopted, the phonetic code part is still "zi", thus the code of the "word" is "zibh". If the Chinese character is ' this ' character, it is zhe ' and the shape part code, the first basic component of the Chinese character is ' point ' and the code is'd ' according to the writing order, the last basic component of the Chinese character with non-left and right structure is ' l ' according to the writing order, the shape part code of ' dl ' and the code is ' zhehdl ', because the invention is the uptongue sound, the meaning is not great, the uptongue sound is not readable by the south, the uptongue sound can be removed, and the code is ' zedl '. For another example, the code of "wood" is double-spelled as mu, the Chinese character has only one basic component "wood", the code is m, the shape code of "wood" is "m", and then the code of wood is mu. In order to pursue uniform code length, it is also possible to provide that a Chinese character having only one basic element can take the code of the first stroke or the last stroke or repeat the code of the basic element as the second code of the shape code. This code example is not so specified.
In fig. 9, the number keys are used, and the key is struck in a crossing manner, so that the frequency of p in Chinese is very low because w keys and y keys are empty, word repetition codes can hardly occur when the vowels are arranged on p keys, and the n keys and r keys are also same, so that ei, en, eng, ou, ong is arranged on w, r, y, n, p keys, and at the moment, a mapping relation diagram of each letter punctuation number key, pinyin vowels and phoneme letter vowels on an English keyboard is shown in fig. 10:
in fig. 10, the frequency of the initial consonants k and r is almost the same, and the r key in the figure can be replaced by the k key.
The arrangement of fig. 10 is more regular, i.e. the pinyin initials are divided into an a area, an o area and an e area, and each area is arranged according to the order of a, o, e, i, u, n, ng. The a area has ao, ai, an, ang arranged on four punctuation keys, and the O area has ou, ong, n or p keys, or k or p keys. The e region has ei, en, eng arranged on the w, r and y bonds, respectively. The key-pressing rule is met, and the key-pressing is convenient. Higher frequency vowels are arranged on keys convenient for keystroke, for example, en and ou with higher frequency in Chinese are arranged on r and n keys where the keystroke is convenient for index finger, and e-beginning vowels and o-beginning vowels of other low frequency points are arranged on other keys.
For a small number of lazy people, the shape part coding can also adopt pure strokes, and after the pinyin of a Chinese character is input, codes of the first two basic strokes of the Chinese character are input according to the writing sequence. The code length of the shape part code can be 2 codes or an indefinite code length, namely, all strokes of the Chinese character are taken for coding.
To increase the input speed, shortcodes are designed for words that are frequently used. The simple code is to take the 1, 2 or 3 codes before the complete codes for the Chinese characters, and then to input the Chinese characters according to 1 space key. Since the regular phonetic codes are prior and the shape part codes are later, a plurality of Chinese characters need to input the simplified codes of the Chinese characters, the codes of the single characters are mainly phonetic codes and are assisted by the shape part codes, the shape part codes play the role of auxiliary codes, and common words are enough to input the shape part codes as long as the first codes are input.
As the pinyin of the Chinese character is only four hundred, the two-level brevity codes of the Chinese character are only four hundred, and 729 coding spaces are provided, the brevity code words can be set up for the other three hundred coding spaces, so that the typing speed is further improved. For example, the pinyin of Chinese characters has no form of kian, the double-spelling codes have no form of ky, and k and y are respectively the initial consonants of "ok" and "ok", so ky can be used as "ok" codes. Because the input method is provided with more than three hundred simplified code words, the input speed of the phrase is higher than that of a single word in theory, so that the input speed of the Chinese character can be obviously improved. After the key of the simplified code of a Chinese character or phrase is knocked on the computer, the space key is knocked again, so that the corresponding Chinese character or phrase can be input.
The word input is the most common method for improving the input speed of Chinese characters, because the regular phonetic codes are prior and the shape part codes are after, the word input totally utilizes the phonetic codes to input, the phonetic codes can adopt full spelling or double spelling when the word is input, take Chinese pinyin as an example, and can adopt simple spelling only by inputting the Chinese pinyin of each word, the method is as follows:
a. the two-word is input by taking the pinyin codes of the initial consonant of the first word and the initial consonant and the final vowel of the second word in sequence; such as "encoding" simply spelled as bma.
b. The three words and phrases are sequentially input by taking the initial consonant or the code of the pinyin initial of each word, and then the blank is input; the code of the simplified spelling like "computer" is "jsj". Of course, it is also possible to provide that the first code of the first word and the second word, i.e. the code of the initial consonant, is taken and then the first and second codes of the third word are taken. The first two codes of the first word can be regulated, and the first codes of the second word and the third word, namely the codes of the initial consonants, can be taken.
c. The codes of the first three words and the initial consonants of the last word are sequentially input; if the "science and technology" is four words, the simple spelling code of the initial consonant of each word is "kxjs". Of course, four or more words can be specified, and the pinyin initial or the pronunciation of each word in the phrase is taken for encoding.
The input can be completed by using the new homophonic near-position Chinese character code input method software to strike the key of a corresponding code of a Chinese character or phrase on the computer keyboard. The Chinese characters or phrases without repeated codes and reaching the specified code length are generally specified to be automatically displayed, the space key is required to be pressed for the characters or phrases with repeated codes which are not less than the specified code length are selected according to the prompting line. The invention is compatible with words when the sound code adopts double spelling and the code length is four keys at most and when the sound code adopts full spelling and the code length is variable.
The invention provides that the pinyin is only suitable for common words, and for people with low cultural level or unrecognized Chinese characters, basic strokes and multi-stroke components of the Chinese characters are only required to be input according to the writing sequence on the basis of the arrangement diagram of the shape part coding keyboard in the drawing. Such as "artemisia", so long as the code c, d, h, k, s, z, k for the symbol, dot, horizontal, mouth, vertical, fold, mouth is entered. That is, the full code of "artefact" is cdkszk, taking care that multi-stroke parts take precedence over basic stroke codes. In practice, the first few codes are input, and the first few codes can be selected according to the prompt line. When the method is not independently used as an input method, a leading symbol such as u and the like can be added in front, and then the shape part code is input, for example, the input ucdhkszk is the full code of 'artemisia'. And prompting the row to select Chinese characters.
At present, a plurality of people input Chinese characters by using voice or pinyin, because the homophones of the Chinese characters are more, homophone errors are easy to occur, the input method software provides a powerful homophone modification function, namely enters the homophone modification function, moves a cursor in front of or behind the homophone with the errors, and pays attention to either uniformly prescribing that the cursor is moved in front of the Chinese characters or uniformly prescribing that the cursor is moved behind the Chinese characters. The software automatically recognizes the pronunciation of the Chinese character, the sound code part of the invention is not needed to be input, the complete code of the Chinese character is input only by inputting the shape code, the original Chinese character is automatically replaced by no repeated code, the individual repeated code is selected according to the prompting line, and the selected Chinese character automatically replaces the original Chinese character with the input error.
Coding examples that can substantially eliminate homonym selection trouble: when the word group pinyin is the same, the first codes of the shape part codes of the first word and the second word in the word group are input only after the pinyin. For example, the pinyin dili has five homonyms, the geography is the most commonly used, the first code of the auxiliary code of the first Chinese character can be omitted, so that the geography code is still dili, the first code of the auxiliary code of the first Chinese character 'ground' is utilized as t after the pinyin, and the code is dili. And the land fertility is that the first code t of the auxiliary code of the first Chinese character 'ground' is added after the pinyin, the repeated code is generated with 'utilizing', and the first code z of the auxiliary code of the second Chinese character 'force' is added, and the code is dilitiz. Whetsl is added with the first code h of the auxiliary code of the first Chinese character after the pinyin dili, and the auxiliary code h is coded as dilih. The dilia is dripped, and the first code a of the auxiliary code of the first Chinese character is added after the pinyin dili, so that the Chinese character is encoded into dilia.
To shorten the code length and increase the input speed, ng or v of the pinyin part is represented, for example, the voice code of the building is zhuang, which can be represented as zhuav.
The invention is fully compatible with phonetic input method and phonetic input method, and creatively adopts double-color candidate word technology, i.e. in candidate window, after inputting the letters, the words can appear for selection, the words which do not adopt shape part coding are of a certain color, such as green, the Chinese characters which adopt shape part coding are of another color, such as black, when inputting black several times, the system considers the Chinese character code technology to be understood, and inputs Chinese characters preferentially according to the Chinese character codes, so as to improve the speed.
Error-tolerant codes are also set, and Chinese characters which are easy to be input can be generated when the Chinese characters are input in error. It is noted that letters are not in all cases in the specification, the claims and the drawings of the specification, and that letters are equivalent in case.

Claims (9)

1. A Chinese-character encode keyboard input method for computer features that the strokes of Chinese character are classified as five basic strokes including horizontal, vertical, left-falling, dot and folding, and features that:
(1) The code consists of two parts, one part is phonetic code, namely phonetic code, the other part is auxiliary code, the auxiliary code is divided into shape part code and simple auxiliary code, and the two parts of Chinese character code can be before and after each other, and once the two parts are selected, the two parts cannot be changed, and the general phonetic code is before;
(2) The phonetic codes can adopt full spelling, double spelling, simple spelling or incomplete spelling, and also can adopt Taiwan phonetic notation and phonogram; mapping relation between each letter punctuation number key and phonetic final and phoneme letter final:
the other mapping relation between the numeric keys of each letter punctuation mark and the phonetic final and the phoneme letter final:
(3) The first code fetch rule of the shape part code is: the individual character is coded by the corresponding codes of the first two basic components according to the writing sequence, or the corresponding codes of the first and last basic components according to the writing sequence, when only one basic component is used, the corresponding codes of the basic component are only used, and the corresponding codes of the first and last basic components of the Chinese character can be specified according to the writing sequence;
the combined character is divided into two parts according to the integral structure, the part containing the first stroke of the Chinese character according to the writing sequence is a head part, the rear writing part is a residual part, and the corresponding code codes of the first basic part of the head part and the first basic part of the residual part are respectively taken according to the writing sequence;
The second code fetch rule of the shape part code is: the Chinese characters with left and right structures are coded by the corresponding codes of the first part of the writing sequence of the left part and the right part respectively; the Chinese characters with non-left-right structure are coded according to the corresponding codes of the first and last basic components of the Chinese characters according to the writing sequence, and only one basic component only is coded according to the corresponding code of the basic component or coded according to the basic component successively and secondarily; or provision for: a Chinese character with a non-left-right structure is coded by taking a first basic component of the Chinese character and a basic component of the lower right corner of the Chinese character according to a writing sequence (the lower right corner in the surrounding structure is taken for surrounding);
third rule of shape coding code fetch: the first code of the shape code is: firstly, taking the code of the first basic component of the Chinese character according to the writing sequence regardless of pseudo-ginseng twenty-one; the second code of the shape part code, from the right side of the first basic component of the Chinese character, left to right scanning or looking at, if can use a vertical line under the condition of not cutting the stroke of the Chinese character, can divide the Chinese character into two, the Chinese character is left and right structure, the right part of the vertical line is the right part of the Chinese character, then the code of the first basic component of the right part of the Chinese character is taken according to the writing order to encode, if can not use a vertical line under the condition of not cutting the stroke to divide the Chinese character into two, scan the lower half layer or lower half part of the Chinese character from left to right, find the code of the last basic component of the Chinese character according to the writing order or take the corresponding code of the basic component of the right lower corner of the Chinese character to encode; the Chinese characters with left and right structures often have obvious gaps and are easy to separate, so that the Chinese characters can be separated without vertical lines, the second code only needs to scan from left to right from the right side of the first basic component of the Chinese character, the gaps of the left and right parts of the whole Chinese character are found, the parts on the right of the gaps are the parts on the right of the Chinese character, then the code codes of the first basic component of the parts on the right of the Chinese character are taken according to the writing sequence, if the Chinese character does not have the gaps on the left and right, the lower half layer (or the lower half layer or the lower layer) of the Chinese character is scanned or seen from left to right, and the code codes of the last basic component of the Chinese character according to the writing sequence are found;
Briefly, the first code of the shape code is: taking the code of the first basic component of the Chinese character according to the writing sequence; when the second code of the shape part code is taken, the Chinese character is scanned from left to right, if the Chinese character is of a left-right structure, the code of the first basic component of the right part of the Chinese character can be found out and obtained according to the writing sequence; if the right part can not be found, scanning the lower half layer of the Chinese character from left to right, and finding the code of the last basic component of the Chinese character according to the writing sequence;
(3) When the coding rule of the shape part codes is adopted, five basic strokes and 21 multi-stroke parts are preferably selected to participate in the coding, when the multi-stroke parts are selected, the word-combining frequency and the repeated code rate of the multi-stroke parts in 3755 commonly used Chinese characters are mainly considered, the multi-stroke parts are coded according to the initial letters or the initial consonants, the initial letters of the multi-stroke parts are arranged according to the same-sound near-position method when meeting the initial letters of the multi-stroke parts, and the accurate positioning calculation arrangement is carried out, so that the character-combining capacity of some radicals is very strong, the use frequency is high, but the syllables of the initial letters or the final letters where 26 letters are located are not uniformly distributed, the number of the Chinese characters where the radicals are located is very small in the syllables of the pinyin where some initial letters and the final letters are located, and if the radicals are coded by a certain specific letter, the word repeated codes can be effectively avoided; based on homophonic near-position method arrangement, the shape part of Chinese character is specially used to encode the first code, while the radicals of the second code are rarely used to encode by vowel E, I, A, O, U, according to the capability of avoiding repeated codes, wherein, the Bu, the kou, the wood, the bar, the Yi, the Tu and the Bu are respectively coded by a letter, radical, first, , second, day, fire, foot, stone, king, are also encoded with another letter, respectively; the horizontal, vertical, skimming, dot and folding five basic strokes, and the multi-stroke parts of king, earth, doctor, mouth, , vector, girl, wood and the like are all encoded according to the initial letter of the pinyin, and the other multi-stroke parts are encoded according to the arrangement of the same-sound near-position method;
(4) Taking the general word frequency of Beijing university counted by words respectively as an example, through calculation, it is found that the frequency sum of Chinese characters with repeated codes and the frequency sum of other Chinese characters with repeated codes are high if the Chinese characters are coded by strokes, the frequency sum of Chinese characters with repeated codes is 888, 767, 191, 177, 128 and 59 respectively, and the frequency sum of Chinese characters with repeated codes is 1209, 1916, 523, 563, 363 and 229 respectively if the Chinese characters with repeated codes are coded by strokes; thus, the code (hole is coded by the code), the frequency and the higher, the proposal of the code is carried out, the bamboo and the fire are also carried out, the choice is also recommended, the radical mountain is lower, the choice is not greatly recommended, the stone is judged according to the condition of the foot, the code is coded by the first stroke vertically, the frequency of the own heavy code is about 236 dry, the frequency of the code is 316 kilo when the Chinese characters with the heavy code of the Chinese character containing the foot are calculated, the frequency and the sum of the frequency of the code of the Chinese characters with the shape is 502 kilo when the code is coded by the foot, the frequency and the sum of the radicals are also high, the choice of the components is recommended, and the stone is carried out, of course, the frequency difference between the heavy codes of the Chinese characters of the foot is very large, for example, the frequency of the components of the stone can be selected, and the component is not selected; of course, stones and feet can be selected simultaneously, only two radicals exist on one key, one radical is not corresponding to one letter, and the key is inconvenient to display on a mobile phone screen;
(5) Based on homophonic near-position method arrangement, the shape part of Chinese character is specially used to encode the first code, while the radicals of the second code are rarely used to encode by vowel E, I, A, O, U, according to the capability of avoiding repeated codes, wherein, the Bu, the kou, the wood, the bar, the Yi, the Tu and the Bu are respectively coded by a letter, radical, first, , second, day, fire, foot, stone, king, are also encoded with another letter, respectively; the horizontal, vertical, skimming, dot and folding five basic strokes, and the multi-stroke parts of king, earth, doctor, mouth, , vector, girl, wood and the like are all encoded according to the initial letter of the pinyin, and the other multi-stroke parts are encoded according to the arrangement of the same-sound near-position method;
(6) Wherein, the first letters of Pinyin of the first letter, the second letter, the first month, the second letter, the first fire, the first insect, the second fire, the first foot, the first vertical and the second foot, the first stone and the second stone are the same, and r, y, h, c, z, s respectively; the first letters of the pinyin for the two radicals are the same for the first letters of the pinyin for the two radicals for the first day, the first year, the second year, the first month, the third year, the first fire, the second insect and the third year, for facilitating the memorization, the two keys are arranged on the keyboard together according to the left and right adjacent positions so as to facilitate the memorization;
(7) The initial letters of Chinese phonetic alphabets of the Chinese and the insect are c, and according to the homophonic near-position method, the Chinese phonetic alphabets can be arranged on two adjacent keys of c and v, because v is a final, the Chinese phonetic alphabets are rare, so that only the number of Chinese characters which appear in the initial letters of the Chinese phonetic alphabets c is considered, 3 insects appear, the frequency sum is lower, 11 insects appear, the frequency sum is higher, and because v is taken as the Chinese character prefix The vowels, so in order to avoid the duplicate codes of words, the multi-stroke parts with lower frequency and lower ratio are recommended, and in the common Chinese characters, the number and frequency of the groups of Chinese characters with the second codes appearing in the shape part are calculated, namely, the number of the Chinese characters with the second codes is lower, and the number of the Chinese characters with the second codes appearing in the shape part is higher; thus, insects are coded v, c, and may be abbreviated as meadow insects; the Pinyin of the Chinese character is cao, the Pinyin of the insects is chong, and the Chinese character is coded by c and the insects are coded by v according to the tone sequence;
(8) The spelling initial letters of the strokes of the horizontal and radical fire are H, and the fire can only be coded by the adjacent G on the left side of the H key because J is arranged to the radical fire;
(9) R of the initial phonetic letters of Rihe and Yuan can be arranged on two keys which are adjacent left and right and are only arranged on e and r according to the synchronous near-position method; from the viewpoint of avoiding word repeated codes, counting the number and frequency sum of the daily sum and the alpha of basic components in Chinese characters with the vowels being ue and ie, wherein the number and frequency sum of the Chinese characters appearing in the daily sum of the basic components are less, and the number and frequency sum of the Chinese characters appearing in the Chinese characters are very low and very close, so that the group digital frequency in the second code of the codes of the basic components, the daily shape part and the Chinese characters need to be considered, and the frequency sum or the frequency sum is required to be low because E is a vowel; through statistics, the number of Chinese characters appearing on the basic component 'day' is 218, the frequency sum is higher, and the number of Chinese characters appearing on the alpha containing people is only 5, the frequency sum is lower; the number, the frequency and the consideration of Chinese characters in the shape part code second code in the common Chinese characters are that the multi-stroke part 'day' is higher, and the multi-stroke part 'alpha' contains less people, so that only the frequency and the lower basic part in the shape part code second code can be encoded by E, and the frequency and the higher basic part 'day' in the shape part code second code can be encoded by the pinyin initial r; and the pinyin is REN, and the initial of the final is just E, so that the memory is convenient;
(10) The initial letters of pinyin of vertical, horizontal, vertical and horizontal are S, the basic strokes are very common, the vertical is naturally encoded by an S key, and an I key, an O key and an A key can be regarded as adjacent to the S key; the R, R can be arranged on the I, O, A bond; therefore, the inventor performs quantitative calculation by using operation research; of the Chinese characters with the initial of the pinyin a, 1 Chinese character with the component of the Chinese character, the frequency of the Chinese character with the component of the Chinese character is 5920, 2 Chinese characters with the component of the Chinese character, the frequency of the Chinese character with the component of the Chinese character and the sum of the Chinese characters is 64779, so that the Chinese characters with the initial of the pinyin a are better coded by a, and the Chinese characters with the component of the Chinese character with the initial of the pinyin are 1, and the Chinese characters with the component of the Chinese character with the character are not coded by o, so that the coding by o is more proper; in the Chinese characters at the beginning of the pinyin initial letters i, o and a, no Chinese characters exist, o and a are coded by using a pattern and a code respectively, and the rest i is coded by comprehensively considering; i is highest, a times is highest, o is lowest, and from the point of view of the frequency of radicals, i is lowest, and from the point of view of the repetition of the code words, v is lowest, v is highest, and from the point of view of the repetition of the code words, v is highest, a vowel with high frequency is suitable for matching with a multi-stroke part or radical with low frequency, a vowel with low frequency is suitable for matching with a multi-stroke part or radical with high frequency, so that i is suitable for coding, o is suitable for coding, and a is suitable for coding; the primary letters i, o and a of vowels are just used for coding the primary letters i, o and a of the vowels respectively, so that the vowels, the words and the strokes are easy to memorize;
(11) The Pinyin of the Chinese character is si, two letters is shou, four letters is shou, and the Pinyin of the three letters is shoi, so that the Chinese character can be arranged from left to right on the upper row of the keyboard to the middle row of the keyboard according to the number of the Pinyin letters respectively and according to the sound sequence when the number of the Pinyin letters is the same; arranging the primary, secondary and tertiary amino groups on the i, o and a keys respectively, and coding by corresponding letters respectively;
(12) The initial letters of the Pinyin of the moon and the moon are y, and the Chinese phonetic alphabets are arranged on two adjacent keys of y and u according to the synchronous near-position method; from the viewpoint of avoiding word repetition, the frequency or frequency sum of Chinese characters with the basic components of 'moon' sum appearing at vowels of iu or ou needs to be considered; the number of Chinese characters appearing in the prefix of the basic component 'moon' is 2, and the number of Chinese characters appearing in the prefix of the basic component 'moon' is 8; the frequency sum (sum of using frequency) of the Chinese characters is also that Chinese characters in the month part are relatively low, so that the basic part is u-code in month and y-code in month is relatively proper; at this time, the input shape part code hardly generates repeated codes; in the Chinese characters with initial consonants of y, the number of Chinese characters with the prefix of 'month' and 'month' is 10, the number of Chinese characters with the prefix of 'month' is 15, and the sum of frequency number and even frequency of use is higher, so that the basic component is coded by y, the basic component is coded by u, and u is just the initial of 'month', so that the Chinese characters are easy to memorize; in addition, from the viewpoint of the sound sequence, the pinyin is yan, the pinyin of the month is yue, and if the pinyin is arranged from left to right according to the sound sequence, the basic component is coded by y, and the basic component is coded by u;
(13) The first letters of the pinyin of the folds, the first and the second letters of the pinyin of the feet and the pinyin of the bamboos are z, and the strokes are very common and are naturally represented by z; according to the simultaneous approach, the feet and the bamboo can only be arranged on the left l, Q and F keys, wherein l and z are respectively positioned on the rightmost side of the second row and the leftmost side of the third row of the keyboard and can be regarded as the approach, and letters on the right side of the lower row z of the keyboard are arranged with radicals, so that according to the simultaneous approach rule, the Q key and the F key on the upper row of the keyboard are barely close; because the frequency of the initial consonant L in Chinese is much more common than that of the initial consonants F and Q, the initial phonetic letter L is arranged first; the sum of the frequency numbers of the Chinese characters is 0; when the initial letter of the pinyin is L, the number of Chinese characters on the prefix is 7, the sum of the frequency numbers is 109352, the number of Chinese characters on the prefix is 12, and the sum of the frequency numbers is 16734; it can be seen that neither the feet nor the bamboo are well suited for encoding with L; from the perspective of avoiding word repetition, the word repetition can be 0 by encoding with L; this is a very subtle arrangement; the number of Chinese characters with the prefix being sufficient and the pinyin initial being F is 3, and the sum of frequency numbers is 293; the number of Chinese characters with the prefix of bamboo and the initial of Pinyin as F is 3, and the sum of frequency numbers is 16022; the number of Chinese characters with the prefix of the Chinese character is enough, the number of the pinyin initial is Q, and the sum of the frequency numbers is 7626; the prefix is bamboo, the number of Chinese characters with the initial phonetic letters of Q is 5, and the sum of the frequency numbers is 9664; from the number, the feet and the bamboos are fewer and close on the keys with the initial letters of F and Q, from the frequency sum, the Chinese characters with the initial letters of Q and F are more in bamboo radicals, so that the feet and the bamboos can be selected from either one of the F and the Q, and from the viewpoint of convenience in keystroke, the bamboos are encoded by the F, and the feet are encoded by the Q more appropriately; naturally, the bamboo is coded by Q, and the bamboo is coded by F; from the memory perspective, the first letter of Pinyin is z, which can only be arranged according to strokes, the first pen of foot is vertical, the first pen of bamboo is prime, the first pen of R is dot, which is arranged according to sequence, namely vertical, prime and dot, and the sequence from left to right on the keyboard is Q, F, L, so that the feet, the bamboo and the first pen are respectively coded by Q, F, L on Q, F, L keys from left to right according to the vertical, prime and dot of the first pen; from the shape-like perspective, the head and tail of the foot are shaped like Q, the left half part or the right half part of the bamboo is shaped like F, the case is shaped like L, and the memory is easy; of course, the number of letters and the angle of the order can be also ordered, the pinyin of the feet is composed of a number of letters and an order of the order, and only two letters are arranged on the leftmost q of the keyboard, the pinyin of the bamboos is zhu, the pinyin of the one-pair is three letters, the bamboos and the one-pair are respectively arranged on the f key and the l key from left to right according to the order of the letters, and the corresponding letters are respectively used for coding;
One mapping relation of 21 basic components, five basic strokes and letter keys is set as follows:
the other mapping relation between the 25 multi-stroke components and the five basic strokes and the letters and punctuation marks is set as follows:
the mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
the other mapping relation between the 21 multi-stroke components and the five basic strokes and the letters and punctuation marks is set as follows:
the other mapping relation between the 21 multi-stroke components and the five basic strokes and the letters and punctuation marks is set as follows:
the mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
one mapping relation between 21 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
one mapping relation between 25 multi-stroke components and five basic strokes and letters and punctuation marks is set as follows:
the first input rule of the simple auxiliary code is that five basic strokes of horizontal, vertical, left-falling, point and folding are respectively used as codes by using the initial H, S, P, D, Z of pinyin, after the codes of the first basic stroke of a certain Chinese character are input, the Chinese character is scanned left and right, if the Chinese character is in a left and right structure, the codes of the first basic stroke of the right part of the Chinese character are taken according to the writing sequence for coding; if the right part is not available, the code of the last basic stroke of the Chinese character is taken according to the writing sequence; at this time, through artificial intelligence and search engine technology, the most probable Chinese character to be input appears in the first place of the candidate word;
The second input rule for the simple auxiliary code is: for unrecognized Chinese characters, the codes of basic strokes and multi-stroke components of the Chinese characters are input once according to the writing sequence only according to the mapping relation of the shape part coding keyboard arrangement diagram in the drawing, and then the Chinese characters can be input according to the prompt line selection;
the method for basically avoiding homonym selection is to input the first codes of the auxiliary codes of the first word and the second word of a homonym only after the encoding of a certain homonym, wherein one homonym can only be pinyin.
2. The new simultaneous near-bit Chinese character code input method of claim 1, wherein the method comprises the following steps: when arranging other multi-stroke components which are the same as the first letters of the Chinese phonetic alphabet or the initial consonants of the Chinese phonetic alphabet, the fire, the person and the moon, the same-sound approach method is adopted, namely, the other multi-stroke components which are the same as the first letters of the Chinese phonetic alphabet are arranged at the positions near the key positions where the multi-stroke components are arranged, and because the letter keys of the keyboard are divided into three rows, the letter keys are generally arranged at the left side or the right side of the same row of the multi-stroke components.
3. The new simultaneous near-position Chinese character code input method of claim l, wherein the method is characterized by comprising the following steps: is formed by (a), bu, kou, mu, 2 to, alpha, female (1, ), month, insect, soil, shi the basic components of fire, sun, stone, king, , foot, etc. are all selected from the radicals of Chinese characters.
4. The new simultaneous near-bit Chinese character code input method of claim 1, wherein the method comprises the following steps: the near bits are ordered: namely Q to P, A to L, Z to M on the keyboard, returning to Q, arranging from left to right on the keyboard, arranging from Q to P, arranging in the keyboard, and returning to left to right, wherein from A to L; then to the lower row of the keyboard, from left to right, from Z to M, and then back to the Q key.
5. The new simultaneous near-bit Chinese character code input method of claim 1, wherein the method comprises the following steps: when the first codes of the shape codes of the Chinese characters with the non-left and right structures are the same as the first codes of the shape codes of the Chinese characters with the left and right structures, the Chinese characters with the non-left and right structures take the simple codes preferentially, and only after the sound codes of the Chinese characters are input, the first codes of the shape codes are input, and the space key is knocked, so that the Chinese characters with the left and right structures can be input.
6. The new phoneme simultaneous near-position Chinese character code input method as claimed in claim 1, wherein the method is characterized by comprising the following steps: when the Chinese character is coded by E, because the alpha almost only appears at the prefix, the second code is rarely coded at the shape part, and only 5 alpha of the area are coded at the shape part, the repeated codes of the words can be greatly avoided, and the vowel just the part is E and is easy to memorize; since the y codes are almost only present at the prefix, the second codes are rarely present at the shape part, and the second codes are only present at the shape part in 19 traditional Chinese characters, and the number is not large, the word repetition code can be well avoided; the Chinese character is almost only present at the prefix, the second code is rarely present at the shape part, and the Chinese character of vowel v (u) is rarely present at low frequency, so that the word repeated code can be greatly avoided;
Naturally, if alpha is changed to alpha, r is used for coding, daily e coding is consistent with synchronous near-position arrangement; if the coding is changed into u coding, the coding is y coding, and the coding accords with the synchronous near bit; v, c is also coincident with the co-sound near bit; similarly, mapping on i, o, a bond positions can be interchanged; but from the point of view of the repetition of the encoded words, it is not very suitable;
the Chinese characters can be arranged according to the phonetic order when the number and the number of the pinyin of the radical are the same on the basis of homophones and near positions, so that the Chinese characters are coded by r instead of the Chinese characters, E instead of the Chinese characters, and the rest of the Chinese characters are unchanged, so that the repeated codes of words are not reduced, and the Chinese characters are coded by the initial letters of the pinyin vowels; see fig. 3;
if the number of radicals is not considered on the basis of the same sound near position, the radicals are arranged according to the sound sequence, at the moment, the alpha contains the alpha and is coded by e, the mu is coded by r, the mu is coded by i, the mu is coded by o, the mu is coded by a, the he is coded by g, the he is coded by c, the v is coded by insects, the bamboo is coded by q, the mu is coded by f, and the he is coded by l; see fig. 4;
it can be seen that in the input method, the 8 multi-stroke parts of 5 strokes, king, earth, jaw, mouth, , , female and wood are all encoded according to the pinyin initial, the day, the user and the insect are also encoded according to the pinyin initial, so that the total 16 stroke parts are all encoded according to the pinyin initial, only 10 multi-stroke parts which are actually required to be memorized are arranged according to the method of the homophonic (the pinyin initial) near position (the position on the keyboard is adjacent), and basically the number of the letters of the pinyin of the part is the same, the letters are arranged according to the order, and the strokes and the shapes are also considered, so that the input method is very easy to memorize; wherein, five multi-stroke components alpha, month and 2, and 3 are coded according to the initial letters of vowels, are arranged according to the homophonic near positions, and are actually arranged according to the homophonic near positions, and the 5 are arranged according to the letter number and English sound sequence on the basis of the homophonic near positions, so that the Chinese character strokes are easy to memorize; in order to further shorten the memorizing time, the inventor also compiles a pithy formula, namely, the basic strokes are captain, the basic strokes are preferentially coded by the phonetic initial letters, in syllables with the same phonetic initial letters, the date, the Chinese character, the end and the transverse line are captain, the people, the month, the insect and the fire are captain, the Chinese character and the insect are also arranged on the left and right adjacent key positions of captain, the Chinese character and the insect are in the form of the Chinese character, and the left and right partition walls are arranged on the c and v keys, so that the memorizing is facilitated; folding, forming, foot and bamboo into a team, wherein the team leader is folding, and the foot and bamboo are teammates; erecting, driving and driving into a team, wherein the erecting is a captain, driving and driving; the coding rule of the shape part coding is simply recorded as' first, right and last, and is easy to memorize; so that a person can remember for three or five minutes;
Through optimizing about 21 multi-stroke parts and five basic strokes, creatively prescribing a code-taking rule of a second code of the shape part code, creatively adopting a homophone near-position method to arrange the multi-stroke parts and the basic strokes and creatively carrying out quantitative calculation and accurate positioning, the shape part code is simple and easy to memorize, homophones can be effectively distinguished, the duplicate rate is very low in 3500 Chinese characters commonly used and 6763 Chinese characters commonly used in national standard, and the input speed can be compared with that of input methods such as five-stroke characters and the like; the method solves the difficult problem that any other input method cannot solve, really achieves the advantages of simplicity, intuitiveness, low duplicate rate, high input speed, almost no duplicate rate by adopting the technologies of artificial intelligence, search engines and the like, can be compatible with the most popular pinyin input method or phonetic transcription input method, and is the only ideal perfect Chinese character input method which can be popularized to students in middle and primary schools;
some radicals are common, but because the repeated codes are lower, more than 10 repeated codes can be reduced, and only 26 key positions are not selected; but if these components are chosen, it would be beneficial for some people who pursue typing speed; in the new invention, the selected parts are double coded, namely, the parts can be coded according to strokes and radicals, and the parts are inconvenient to display on small-screen keyboards of mobile phones and the like; these several components are referred to as dual components or virtual components, which may also be referred to as dual radicals or virtual radicals; the virtual component is called as a virtual component because the virtual component does not appear on letter keys of a small screen such as a mobile phone and the like, but can be coded by punctuation mark keys; the double parts can be coded by strokes or punctuation marks; the character forming capability of the fish is strong, and 24 repeated codes can be avoided and are arranged in the row; "on the bond, use"; "coding; the mountain, and the grass are arranged in the groups of ", respectively according to the frequency of the radicals. The ","/"keys are used respectively. ","/"is encoded.
7. The new simultaneous near-position Chinese character code input method of claim l, wherein the method is characterized by comprising the following steps: the input method software provides powerful homophone modification function, i.e. entering homophone modification function, moving cursor in front of or behind the wrong homophone, note that either the cursor is uniformly specified to be moved in front of the Chinese character or the cursor is uniformly specified to be moved behind the Chinese character, at this time, the software automatically recognizes the pronunciation of the Chinese character, the pronunciation code part of the invention is not needed to be input any more, the complete code of the Chinese character is input only by inputting the shape part code, the original Chinese character is automatically replaced without repeated code, and individual repeated code is selected according to the prompting line, and the selected Chinese character automatically replaces the original Chinese character with wrong input.
8. The new simultaneous near-bit Chinese character code input method of claim 1, wherein the method comprises the following steps: the double-color candidate character technology is adopted, namely, after characters are input in a candidate window, words can appear for selection, the words which do not adopt shape part codes are in a certain color, such as green, the Chinese characters which adopt shape part codes, namely, the Chinese characters which adopt Chinese characters codes, are in another color, such as black, and when the Chinese characters are input for several times, the system considers that the Chinese character code technology is understood, and the Chinese characters are input preferentially according to the Chinese character codes, so that the speed is improved.
9. The new simultaneous near-bit Chinese character code input method of claim 1, wherein the method comprises the following steps: after the encoding of a homonym, the trouble of homonym selection can be basically avoided by inputting the first codes of the auxiliary codes of the first word and the second word of the homonym, the trouble of homonym selection can be generally avoided by inputting the first codes of the auxiliary codes of the first word of the homonym, one homonym which is most commonly used in a plurality of homonyms can be encoded only according to pinyin, and the other homonyms can take the first codes of the auxiliary codes of the first word, so that homonyms can be more effectively distinguished, and if the homonym still has a heavy code, the first codes of the auxiliary codes of the second word of the homonym can be taken; of course, in some homonyms, sometimes an auxiliary code is added, so that different pinyin can be obtained, in order to avoid syllable conflict, a certain homonym which is not commonly used can be selected, and the selected homonym can be free of the first code of the auxiliary code of the first word; the homonym is generally fully spelled, and the first codes of the auxiliary codes of the first word and the second word of the homonym are input after the fully spelled homonym, so that homonym selection can be better avoided, and the first codes of the auxiliary codes of the first word, the second word and even the third word of the homonym can be input again under the condition of simple spelled, so that the trouble problem of homonym selection can be effectively avoided, but the effect is relatively weak.
CN202310081787.8A 2022-02-22 2023-01-08 New homophonic near-bit Chinese character code input method Pending CN117111752A (en)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
CN202210189185 2022-02-22
CN202210189185.X 2022-02-22
CN202210919982 2022-07-22
CN202210919982.9 2022-07-22
CN202211315290 2022-10-16
CN202211315290.X 2022-10-16
CN202211661234 2022-11-16
CN202211661234.1 2022-11-16

Publications (1)

Publication Number Publication Date
CN117111752A true CN117111752A (en) 2023-11-24

Family

ID=88804457

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310081787.8A Pending CN117111752A (en) 2022-02-22 2023-01-08 New homophonic near-bit Chinese character code input method

Country Status (1)

Country Link
CN (1) CN117111752A (en)

Similar Documents

Publication Publication Date Title
US8862988B2 (en) Pictorial keyboard with polysemous keys for Chinese character output
TW200900996A (en) Information input method based on Chinese phonetic alphabets
CN100462901C (en) GB phoneticize input method
CN111880667A (en) Phoneme same-tone near-bit common Chinese character code input method
CN103488426A (en) Virtual keyboard based on touch screen and input method
CN103616960A (en) Six vowel binary syllabification input method
WO2013135208A1 (en) Input method for compatible keyboard
CN117111752A (en) New homophonic near-bit Chinese character code input method
CN102511021B (en) Number-order-code-element keyboard and information input method thereof
CN107615218A (en) Sound trail keyboard and input method
CN112783336A (en) New phoneme same-tone near-bit Chinese character code input method
CN111381687A (en) Chinese character font and sound-shape dual-purpose code and platform for cross inputting Chinese characters through sound, shape and sound-shape codes on same platform
CN104238765B (en) Students in middle and primary schools' keyboard marks phonetic code inputting method
CN102141844B (en) Chinese character internal code input method with pronunciations of sound, rhyme and tone
CN112328095B (en) Four-purpose phonetic and shape code Chinese character input method and input platform without using number keys
WO2020087769A1 (en) Phonetic writing input method
CN111611773A (en) Digital coding method for Chinese and foreign languages and its use
CN110955337A (en) Character input method and device
CN101093420A (en) Free mode input method
CN1068947C (en) Shape-sound and shape-shape associated Chinese input method and its keyboard
TW201827989A (en) Syllable string set-based input method using initial, medial, tone, and rhyme, and employing main keyboard and numeric keypad
CN103019399B (en) The mute literary input method of Chinese double-spelling
CN103941882A (en) T-shaped Chinese character code input method
KR20100080576A (en) Inputting method of character
CN110119213A (en) Computer Chinese-character holography hybrid coding input method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication