CN115047980A - Non-split Chinese character input integrated system capable of accurately inputting Chinese characters - Google Patents

Non-split Chinese character input integrated system capable of accurately inputting Chinese characters Download PDF

Info

Publication number
CN115047980A
CN115047980A CN202110257647.2A CN202110257647A CN115047980A CN 115047980 A CN115047980 A CN 115047980A CN 202110257647 A CN202110257647 A CN 202110257647A CN 115047980 A CN115047980 A CN 115047980A
Authority
CN
China
Prior art keywords
code
chinese character
character
coding
stroke
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110257647.2A
Other languages
Chinese (zh)
Inventor
不公告发明人
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202110257647.2A priority Critical patent/CN115047980A/en
Publication of CN115047980A publication Critical patent/CN115047980A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)
  • Input From Keyboards Or The Like (AREA)

Abstract

The present invention relates to a Chinese character input method which accords with Chinese character writing specifications and uses a common English alphabet keyboard or a touch screen virtual soft keyboard to accurately input Chinese characters to electronic equipment such as a computer, a mobile phone, etc. in various coding modes for Chinese character information processing and communication. Based on the basic knowledge of national obligation education, the method creates 11 shape roots with specific meaning and 31 Chinese character components as the coding elements of Chinese character font, determines the coding roots of the Chinese character font by the positioning of the first stroke and the last stroke without splitting the Chinese character, constructs a Chinese character input integrated system with a plurality of input modes by changing the positions of the character pronunciation coding elements and the code elements of the character font coding elements, solves the problems that the existing pinyin input method cannot accurately type, the character font input method is difficult to split and the character roots are difficult to memorize, and realizes the universal Chinese character input method for coding Chinese characters of national standard GB2312-80 font library, UNICOD complex and simplified large character sets by the same coding rule.

Description

Non-split Chinese character input integrated system capable of accurately inputting Chinese characters
One, the technical field
The invention relates to a Chinese character keyboard coding input technology for information processing of electronic equipment such as computers, tablet computers, mobile phones and the like, in particular to a Chinese character standard coding method for inputting Chinese characters to the electronic equipment such as the computers, the tablet computers, the mobile phones and the like in a plurality of coding modes by using a virtual soft keyboard of a common English letter keyboard and a touch screen to process Chinese information and communicate.
Second, background Art
With the rapid increase of social ownership of computers, tablet computers and mobile electronic communication devices and the appearance of virtual keyboards on touch screens of mobile phones, people increasingly use intelligent pinyin to input Chinese characters, but the homophones of the Chinese characters are too many, and pages are inevitably turned to find the characters when inputting single characters, so that the mental effort is consumed, and the input efficiency is reduced. The present intelligent pinyin input method still needs hundreds of auxiliary etymons to reduce the coincident code rate of single characters, is difficult to be widely accepted by people due to large memory capacity, and can not solve the contradiction that a good input method is too difficult to learn and an easy input method is not good to learn for a long time, but urgently needs a simple Chinese character input method which has low learning cost and can accurately type in the fields of financial systems, publishing industries, file management and the like with higher requirements on Chinese character input accuracy.
Third, summary of the invention
The invention aims to provide an integrated system which is simple and can accurately input Chinese characters in various coding modes without converting input modes, and solves the problems that the prior intelligent pinyin input method has too many repeated code characters when single characters are input, has low typing efficiency and can not meet the requirements of accurately inputting Chinese characters in the industries of finance, publishing, archive management and the like. The system integrates multiple Chinese character coding input modes such as pinyin, sound shape, shape sound, full shape and the like to adapt to users with different culture levels and working environments with special requirements on typewriting. If the Chinese character is not familiar with the pinyin or unknown characters can be input by a pure etymon mode, the limitations that the pinyin is difficult to input unknown rare characters and the shape code is difficult to input the rare characters which can not be written are solved. The invention can make the Chinese character input become an ideal Chinese character input method which can be used as soon as learning and is simple and quick, and can accurately input Chinese characters by meeting the requirements of financial systems, publishing industries and file management.
1. Chinese character coding design idea
In order to solve the problems that the single character has too many coincident codes and the font of the font code is difficult to remember in the pinyin input mode, the invention establishes 29 font fonts with specific significance and 13 Chinese character components as 42 coding fonts of the Chinese character font based on the basic knowledge of the national obligation education. The Chinese character coding determines the coding etymon of the Chinese character font by the first stroke and the last stroke, and the Chinese character is not required to be split during typing. The invention forms a plurality of Chinese character coding modes in different combination modes according to the character pronunciation characteristics and the character form characteristics of Chinese characters, selectively integrates the Chinese character coding modes into a whole, and forms a Chinese character input integrated system with a plurality of input modes coexisting.
The Chinese character coding method of the invention is realized as follows: the method comprises the steps of establishing corresponding English letter codes for 42 Chinese character form etymons, Chinese character components and Chinese pinyin initial consonants according to the relation of pictographs and harmonic tones by using 26 English letters of a touch screen virtual soft keyboard of electronic equipment such as a standard keyboard universal for a computer or a tablet personal computer, a mobile phone and the like, inputting Chinese character codes consisting of the initial consonants of the Chinese pinyin and first stroke codes, last stroke codes, identification codes and feature codes of Chinese character patterns into the electronic equipment such as the computer or the mobile phone by typing the English letter codes, and realizing the accurate input of Chinese characters. The font code takes out the form etymons or the Chinese character components by positioning the first stroke and the last stroke, because the font code elements are only 42, the etymons between the head and the tail are mutually separated, the Chinese character splitting and code taking are not needed, the uncertainty caused by the Chinese character splitting is eliminated, the problem that the font codes are difficult to remember and take the font codes is solved, and the coincident code rate index of the Chinese character input is superior to the five-stroke font and the traditional font code input method.
The Chinese character and phrase of the present invention is composed of four coding elements, and the maximum coding length is four codes. The character pronunciation and shape features of Chinese character are the basic coding elements constituting Chinese character coding, and the position of the coding elements is changed to constitute the integrated Chinese character coding and inputting system with several input modes.
2. Chinese character coding element and coding rule
(1) Initial consonant code of Chinese character phonetic alphabet
The initial consonant of the Chinese character pinyin is the first letter of the Chinese pinyin, and the first letter of the vowel is taken as the Chinese character without the initial consonant.
For example: the Chinese characters with double initial letters zh, ch and sh only take the first letters z, c and s.
The 'an' character only has a vowel an without an initial consonant, and the initial letter a of the vowel is used as an initial consonant code.
(2) Chinese character font coding radical
The invention relates to a Chinese character input method, which classifies the coding etymons of Chinese characters by assigning specific meanings to the coding etymons in a mode of describing the stroke features of the etymons of the Chinese characters by characters, integrates a large number of Chinese character etymons which look irregular into 42 coding etymons classified according to morphological features, establishes rational mapping association with 26 English letters according to the relationship of pictographs or harmonious tones and accordingly forms the Chinese character input method with 42 coding etymons. The simplified Chinese character input method is formed by the default of 13 radicals of Chinese character "", "", Yuan, tooth, jade and Zheng and the remaining 29 radicals.
When the radical code of the Chinese character font is taken, the coding radical defined by the invention is determined according to the first stroke, the last stroke and the fixed position at the upper right, and the corresponding letter code is taken out, and the Chinese character is not required to be split in the code taking process.
The first stroke code, the last stroke code, the identification code and the feature code of the Chinese character font are defined as follows:
the first stroke code is the English letter code of the radical formed by the first stroke of the Chinese character or the strokes connected with the first stroke.
The last stroke code is an English letter code of a etymon formed by the last stroke of the Chinese character or the strokes connected with the last stroke of the Chinese character.
The identification code is an English letter code of the highest stroke or etymon at the upper right of the Chinese character.
The characteristic code is an English letter code of the highest stroke or etymon at the upper right after the identification code of the Chinese character is removed.
The English letter codes corresponding to the coding etymons of the Chinese character font are as follows:
Figure BSA0000235717270000021
Figure BSA0000235717270000031
(3) code-fetching rule of Chinese character font
The font code-taking rule is that the first stroke code, the last stroke code, the identification code and the feature code of the Chinese character are defined according to the invention, the radical code consisting of the first stroke, the last stroke and the upper right stroke of the Chinese character is taken out, and the code-taking rule of the font code is as follows:
when the first stroke code, the last stroke code and the identification code are the same stroke or etymon, the alphabetic code of the etymon or the stroke can be repeatedly taken.
For example: the first stroke of the 'wood' is horizontal, the last stroke is right-falling, and the horizontal on the upper right forms the 'wood' etymon, and the first stroke code, the last stroke code and the identification code all take the letter code F of the 'wood' etymon.
And for example, the first stroke code of the Chinese character standing grain is a stroke for 'left falling', but is also a stroke shared by the identification code at the upper right, and the first stroke code and the identification code are both the letter code J for 'left falling'.
And secondly, when the upper half section and the lower half section of one stroke respectively form different etymons with other strokes, the letter codes corresponding to the etymons can be respectively selected.
For example: the folding strokes in the 'car' character are shared by an upper etymon and a lower etymon, and the 'two obliquely crossed' etymon X of the first stroke code and the 'vertically crossed' etymon M of the last stroke code are respectively taken.
For another example, the vertical stroke in the last character is shared by the upper and lower etymons, and the vertical penetration M of the first stroke code and the wooden etymon F of the last stroke code should be respectively taken.
And for example, the folding stroke in the second character is shared by an upper etymon and a lower etymon, the first stroke code is a sharp corner of the upper half section and the last stroke code is a right hook of the lower half section and the letter code is a letter code C.
And thirdly, letter codes of large etymons with more strokes are preferentially taken, and single strokes are directly taken when the large etymons are not formed.
For example: the first stroke code of ' educating ' does not take the single stroke ' point, and the larger part point and the horizontal letter code A of ' high character head ' should be taken; the last stroke code should be the letter code P of the larger etymon moon instead of the single stroke horizontal.
When the prefix is the Chinese character covered by a large square frame (mouth), a cursive word head (+), and a diseased word head (), the identifier at the upper right should be removed before the identifier is taken.
For example: the first stroke code of the treatment is the head of the sick character, the head of the sick character is removed from the identification code at the upper right, and then the letter code K of the oblique angle at the upper right of the character is taken.
The first stroke code of the fen is covered by the careless head, the careless head of the identification code at the upper right is removed, and then the letter code B of the eight at the upper right part of the sub-character is taken.
The first stroke code of the 'garden' is 'big square frame', and the letter code Y of the 'Yuan' etymon is taken after the 'big square frame' is removed from the identification code at the upper right.
3. Sound-shape coding input mode
(1) Chinese character sound-shape coding method
The sound-shape coding of Chinese character is according to the code-fetching rule of Chinese character coding elements, and the complete code of Chinese character is formed from four coding elements of initial consonant, first stroke code, last stroke code and identification code of Chinese character, in which the identification code can be inputted when there is Chinese character with identical code. The code fetch sequence of Chinese character coding can also be expressed as follows:
first letter + first stroke code + last stroke code + identification code of Chinese character pinyin
For example: the yoga character coding process includes firstly taking the initial consonant Y of the character, then taking the first stroke code 'king' etymon code Z, then taking the last stroke code 'carabiner' code I, and finally taking the identification code 'eight' code B on the upper right, wherein the complete code of the yoga is YZIB, and the three codes 'yoga' are unique, so that the identification code B does not need to be input during actual operation.
Under the sound-shape input mode, the Chinese character identification code can be directly replaced by a universal letter code Y or Z, the required Chinese character can be quickly found through fuzzy input, and the code fetching sequence of the Chinese character fuzzy code can also be expressed as follows:
the first letter + the first stroke code + the last stroke code + the universal letter code Y of the pinyin of the Chinese character;
the first letter of the Chinese character pinyin + the first stroke code + the last stroke code + the universal letter code Z;
(2) phrase coding method
The phrase coding of the present invention has various input modes according to different sequences of code-fetching elements.
Three coding modes of the two-character phrase:
firstly, according to the code-fetching rule of said invention for Chinese character coding element firstly fetching first letter of phonetic transcription of every Chinese character in two-character phrase, then sequentially fetching first stroke code of every character in two-character phrase. The code fetch order of a two-word group code can also be expressed as follows:
the first letter 1 of the pinyin, the first letter 2 of the pinyin, the first stroke code 1 and the first stroke code 2;
for example: "beauty" takes the first letter ML of each character pinyin in the two-character phrase, then takes the first-stroke code "eight" code B of "beauty" character, "the first-stroke code" horizontal "code E of" beauty "character, and the code of" beauty "is MLBE.
Secondly, the first letter and the first stroke code of the pinyin of each character in the two-character phrase are sequentially selected according to the code selection rule of the Chinese character coding elements. The code fetch order of a two-word group code can also be expressed as follows:
the first letter 1+ the first stroke code 1+ the first letter 2+ the first stroke code 2 of the pinyin;
for example: "beauty" takes the first letter M of the pinyin of the first character "beauty" of the two-character word group and the code B of the first code "eight", then takes the first letter L of the pinyin of the last character "beauty" and the code E of the first code "horizontal", and the code of "beauty" is MBLE.
And thirdly, according to the code-fetching rule of the character pattern of the Chinese character, sequentially fetching the first stroke code and the last stroke code of each character in the two-character phrase. The code fetch order for a two-word block code can also be expressed as follows:
the first stroke code is 1+ the last stroke code is 1+ the first stroke code is 2+ the last stroke code is 2;
for example: "happiness" takes the first code "two vertically crossed" codes H of first character "happiness" and the last code "two vertically crossed" codes H, then takes the code U of first code "point" of last character "happiness", and takes the code Q of last code "real" letter code "and" happiness "code as HHUQ.
Two coding modes of the three-character phrase:
firstly, according to the code-fetching rule of said invention for Chinese character coding element the phonetic first letter of every character in three-character phrase can be sequentially fetched, then the first stroke code of the invented character can be fetched. The code fetch order of the three-word block code can also be expressed as follows:
the first letter 1 of the pinyin, the first letter 2 of the pinyin, the first letter 3 of the pinyin and the first stroke code 3;
for example: the "new technique" takes the initial XJS of each word in the phrase, followed by the first code "wood" code F of the last word "art", the encoding of the "new technique" being XJSF.
Secondly, according to the code-taking rule of the Chinese character coding elements, the first stroke code of each character in the three-character phrase is sequentially taken, and then the last stroke code of the last character is taken. The code fetch order of the three-word block code can also be expressed as follows:
the first stroke code is 1+ 2+ 3;
for example: the ' computer ' takes a head shape code ' side of language ' code C, ' golden side ' code T, ' wooden side ' code F of each character in the phrase, then takes a tail shape code ' right hook ' code C ' of a last character, and codes of the ' computer ' are CTFC.
Two coding modes of multi-character phrases with more than four characters are as follows:
firstly, the first phonetic letter of the first three characters and the first phonetic letter of the last character in the phrase are sequentially selected according to the code-selecting rule of the Chinese character coding elements of the invention. The code-fetching sequence of the multi-word phrase code can also be expressed as follows:
the first letter of the pinyin 1+ the first letter of the pinyin 2+ the first letter of the pinyin 3+ the first letter of the last pinyin;
for example: the first letter of the pinyin of each character in the four-character phrase is directly taken and coded as MBSS.
The first phonetic letter of the first three characters and the first phonetic letter of the last character in the phrase are taken by the people's republic of China, and are coded as ZHRG.
Secondly, according to the code-taking rule of the Chinese character coding elements, the first stroke codes of the first three characters and the first stroke codes of the last characters in the phrase are sequentially taken. The code-fetching sequence of the multi-word phrase code can also be expressed as follows:
the first stroke code is 1+ 2+ 3+ the last prefix stroke code;
for example: the 'win-loss' directly takes a first stroke code 'splayed' code B, a horizontal stroke code E, a moon code P, an oblique stroke code K and a 'win-loss' code BEPK of each character in the four-character phrase.
The high and new technology industry development area firstly takes a first code 'high head' code A, a 'room' code W, a first code 'right angle' code L and a 'AAWL' code of the first three words in the phrase.
4. Shape-sound coding input mode
The shape-sound coding input mode of the present invention is a coding mode which mainly uses the character form characteristics of Chinese characters and uses the pronunciation of the Chinese characters as auxiliary, and the coding etymon of the character form of the Chinese characters and the initial consonant of the pronunciation of the Chinese characters or the initial consonant of the character-forming radical of the Chinese characters form a Chinese character code, and the Chinese character code has at most four coding elements.
(1) Code-fetching rule of Chinese character component
In order to solve the problem of unknown uncommon character input and establish the concept of Chinese character component formation, the invention has the following promissory meanings to the Chinese character component formation:
the character-forming radicals of the present invention have clear pronunciation and may be used independently as Chinese character radicals without need of correcting stroke. For example: bei, niu, Reo, Chong, Shi, Pi, Shu, Flat, etc. are radicals with pronunciation and may form character independently, and Chinese characters, Chinese characters radicals, Qi, ,
Figure BSA0000235717270000061
A frame,
Figure BSA0000235717270000062
"etc. components that cannot be independently formed into words are not processed as word components.
The first character-forming radical is the largest character-forming radical formed by the first stroke and the subsequent strokes of the Chinese character.
For example: the 'outer' has the largest character head forming character radical 'chapter' and no character tail forming character radical, wherein 'standing and pronunciation' is not the largest forming character radical.
The last character component of the Chinese character is the maximum character component formed by the last stroke of the Chinese character and the previous stroke of the Chinese character.
For example: the reading has the largest character-last character-forming component sold and no character-first character-forming component sold, wherein the head and the buy are not the largest character-forming components.
The 'whip' has the leather of the character-forming components with the largest prefix and the convenient character-forming components with the largest tail, wherein the 'dragging and the more' are not the character-forming components with the largest prefix.
And fourthly, the code taking of the Chinese character radicals takes the initial consonant codes of the character radicals formed at the tail of the character according to position priority, and if the character radicals formed at the tail of the character do not exist, the initial consonant codes of the character radicals formed at the maximum prefix are taken.
For example: the initial consonant C of the character component of the maximum character end forming character is selected as the initial consonant of the 'uniform'.
The single horizontal stroke of 'one' can not be used as a character-forming radical, and Chinese characters without the character head and the character tail-forming radical have the radical initial consonant code of 'V' uniformly.
For example: the simple, the fact, the pharma, the initial consonant code thereof all take V.
(2) Chinese-character 'shape-pronunciation' encode method
The phonographic coding mode is inconvenient for inputting the uncommon word with inaccurate pronunciation, and the phonographic coding mode can simply, conveniently and accurately input the uncommon word.
According to the code-fetching rule of the Chinese character coding elements of the invention, the first stroke code, the last stroke code and the identification code at the upper right of the Chinese character are sequentially fetched, and then the pronunciation of the Chinese character or the first letter of the pronunciation of the maximum character-forming radical is fetched. The Chinese character coding and code-fetching sequence of the pictophonetic input mode can also be expressed as follows:
the first letter of the first stroke code, the last stroke code, the identification code and the pinyin of the Chinese character;
the first stroke code, the last stroke code, the identification code and the first letter of the pinyin beside the character-forming component;
for example: firstly, taking a first stroke code horizontal code E, a last stroke code wood code F and an identification code left-falling code J at the upper right of the crisp, then taking a consonant S of the crisp or a consonant H of a Chinese character side with a character at the end of the crisp, and coding the crisp into EFJS or EFJH;
5. pure font coding input mode
The pure character pattern coding input mode of the invention is to obtain the coding etymon of the Chinese character according to the character pattern characteristics of the Chinese character, and the Chinese character code codes at most four character pattern coding elements.
According to the code-fetching rule of Chinese character font, the first stroke code, last stroke code, identification code and characteristic code of upper right of Chinese character are fetched in sequence. The code fetch sequence of Chinese character coding can also be expressed as follows:
first stroke code + last stroke code + identification code + feature code;
when the characteristic code of Chinese character is selected, the characteristic code is selected after the components of mountain, bird, bamboo, page, rain, gas, , ge, , , vacuum control unit and are removed.
For example: the ' can ' takes a first stroke code ' golden side head ' code T, a tail shape code ' horizontal ' code E and an identification code ' caret head ' code N at the upper right, then the caret head ' is removed, a characteristic code ' empty port ' code 0 at the lower part is taken, and the code of the ' can ' is TENO.
6. Pinyin compression input mode
The double initial of the pinyin compression input mode is compressed into only one initial, ng of the vowels is compressed into g, and the Chinese pinyin coding elements of the compressed Chinese characters do not exceed four codes, so that characters which cannot be written can be input conveniently.
For example: the Pinyin of the 'packaged' Chinese character is zhuang, and the 'packaged' character can be found only by inputting the compressed code ZUAG.
7. Full-spelling and double-spelling auxiliary etymon input mode
The full spelling and double spelling auxiliary etymon input mode of the invention takes the first stroke code and the last stroke code as the auxiliary etymon, when the full spelling or double spelling needs to uniquely determine the first stroke code or the last stroke code of a character, the input mode of the auxiliary etymon is suitable for various layouts, thereby achieving the effect of inputting pinyin single characters without repeated codes. The auxiliary encoding method of the pinyin single character can also be expressed as follows:
chinese character full spelling + first stroke code + last stroke code
Chinese character double spelling + first stroke code + last stroke code
For example: the full spelling of the Chinese character 'Bao' is bao, and the first stroke code 'one-man side' code Y and the last stroke code 'mu' of the character are continuously input after the bao, and the code is BAOYF.
The double spelling of the Chinese character 'De' is de, and the first stroke code 'left falling' code J and the last stroke code 'cun' code K of the character are continuously input after de, and the code is DEJK.
8. Simplified Chinese character input method
The simplified Chinese character input method of the present invention is that the number of coded etymons is reduced in original 42 etymons, the radicals of 13 Chinese characters, namely heart, middle, Shen, vacuum control unit, inch, horse, Nei, and, the radical, element, tooth, jade and positive, are omitted, the remaining 29 etymons are used as coding elements of Chinese character font, the corresponding English letter key positions are kept unchanged to construct a simplified Chinese character input method that the number of etymons is close to that of English letters, one etymon corresponds to one English letter key position, and the simplified Chinese character input method is more convenient to use on mobile phones and tablet computers by combining with a touch screen soft keyboard. The simplified Chinese character input method and the 42 full-etymon input method are suitable for the code-fetching rule and the method of the invention, and the code-fetching rule and the method are simpler and more convenient to use because the number of the coded etymons is reduced. For example: the original phono-shape codes directly take the radical codes of "horse", "Yuan" and "" and are respectively coded as "MEM" and "YYT". In simplified version, since neither "ma" nor "is a radical, it is changed to single stroke and its phono-shape is coded as" MEE "and" YER ".
The simplified Chinese character sound-shape coding code-fetching sequence can be expressed as follows:
initial of Chinese character phonetic alphabet + first stroke code + operation stroke code + identification code
The simplified Chinese character pictophonetic coding and code fetching sequence can be expressed as follows:
the first letter of the first stroke code, the last stroke code, the identification code and the pinyin of the Chinese character;
the first stroke code, the last stroke code, the identification code and the first letter of the pinyin beside the character-forming component;
the simplified Chinese character pure font code-fetching sequence can be expressed as follows:
the first stroke code + the last stroke code + the identification code + the feature code;
9. mobility and interchangeability of key-location radical keyboard layouts
The Chinese character parts and etymons on each key position of the invention can be used as a whole to be exchanged or moved mutually, and the coincident code rate index of the Chinese character input method of the invention is still kept and is not changed. For example, according to the sound support rule, the "moon" etymon of P key is placed on Y key position, the original "single person side, element and tooth" of Y key is placed on D key position, the original "large square frame, middle and Shen" of D key is placed on K key position, the original "oblique angle" of K key is placed on J key position, and the original "left falling" of J key is placed on P key position, i.e. the whole keyboard layout of key position etymon is changed, and the effect of accurate typing by input method is not affected. The characteristic of the invention that the coded radicals can be moved and exchanged integrally is also suitable for rearranging the keyboard layout according to the human engineering principle, and the typing accuracy is kept unchanged while the finger keystroke comfort is improved.
10. Chinese character coding input mode integrated system
The invention can select one of the coding modes to form a complete Chinese character input method, and can integrate several coding modes into a Chinese character input system comprising various input modes. For example, four coding modes of sound-shape coding, shape-sound coding, fuzzy coding and pinyin compression coding are integrated into a Chinese character input method, any one of the coding modes is used for inputting Chinese characters and phrases without switching operation, and Chinese character codes of various input modes can be naturally separated without mutual interference.
11. Advantageous effects
(1) The Chinese character coding method is consistent with the national obligation education content of the primary school Chinese, special learning and training are not needed before the Chinese character coding method is used, only the first stroke and the last stroke of the Chinese character and 42 or 29 coding word roots are needed to be known, the Chinese character can be accurately input by knowing the coding rule, compared with the pinyin and five-stroke input method, the Chinese character coding method is simpler and more accurate to use, the index of the coincident code rate of single characters is far lower than that of the traditional main manifold code input method, and the difficult problem that the input method which is good for a long time is not good for learning is solved.
(2) The invention integrates a plurality of Chinese character input modes into a whole, and can select a proper input mode to type according to own habits and preferences without any conversion operation during typing. The integrated system with multiple Chinese character input modes improves the degree of freedom of typing of the user, and solves the problems that the pinyin cannot input unknown rare characters, and the font codes are difficult to input rare characters which cannot be written, so that the invention has wider universality and practicability.
(3) The 42 coding etymons can cover all Chinese character codes of the existing international universal UNICOD word set, and even if the number of the coding etymons of the Chinese characters is increased, the number of the coding etymons does not need to be increased any more, so that the number of the coding etymons has certain stability and universality, the consistency of the number of the coding etymons and the content is favorably kept when the number of the coding etymons of the Chinese characters is expanded, and the consistency of the coding rule after the input method is upgraded is also kept, so the invention can be used without re-learning and changing the original typing habit, the burden of re-learning is favorably lightened, and the social labor cost is saved.
Description of the drawings
The keyboard layout of 42 etymons is shown in the attached drawing. In order to facilitate memorizing the corresponding positions of the etymon and the key positions, the etymon and the corresponding key positions correspond to the corresponding shape and the harmonic sound, for example, the strokes of 'left falling' are placed on the J key, the roots of 'mouth' are placed on the 0 key, and the roots of 'oblique crossing' are placed on the X key. If the corresponding rule of the etymons is changed, the strokes of 'left falling' can be placed on the P key, the etymons of 'mouth' can be placed on the K key and the etymons of 'oblique crossing' can be placed on the C key in a phonetic support mode. Because the etymons and the key position letters of the invention are mostly in one-to-one correspondence, the change of the corresponding positions of the etymons and the key positions does not affect the original typing accuracy of the input method, but the fingering of the fingers for typing is changed during typing, thereby further improving the harmony of both hands during typing and being more convenient for memorizing the etymons.
Fifth, detailed description of the invention
The invention forms a plurality of Chinese character codes by a plurality of coding elements such as pinyin, character patterns and the like through position change and extraction of code elements with different quantities, even if the same Chinese character or the same phrase can be accurately input into microelectronic equipment such as computers, tablet computers, mobile phones and the like by using a plurality of different codes. For example: the smart's full pinyin code is: CONG, the sound shape code is: CEAB, shape and pronunciation code: EABC, and the shape and sound component codes are EABZ: the pure font code of "clever" is: the EABO can selectively combine various Chinese character codes according to the user requirements and the using environment to form a Chinese character input system integrating pinyin and character pattern input modes, and because the same Chinese character has different coding modes, the various Chinese character codes are naturally separated after being integrated and do not interfere with each other.
The Chinese character input system is manufactured in the form of application software and used for microelectronic equipment such as computers, tablet computers and mobile phones for Chinese information processing, is installed on operating system platforms of computers such as Windows, Linx, iOS and Android and intelligent touch screen mobile phones, and provides a simple, convenient, fast and ideal Chinese character input method integrated by multiple coding modes for users.

Claims (10)

1. A Chinese character coding input method for Chinese information processing, the Chinese character and phrase coding of the method is composed of two parts of Chinese character phonetic elements and font coding elements, and a multi-coding mode Chinese character input integrated system which has a Chinese character sound-shape coding mode, a shape-sound coding mode, a pure font coding mode, a full-spelling font auxiliary coding mode, a double-spelling font auxiliary coding mode, a fuzzy coding mode and a simplified version Chinese character input method is formed according to a Chinese character code-fetching rule, and is characterized in that:
(1) the Chinese character pinyin elements take the first letter of Chinese pinyin as an initial code, Chinese characters without the initial and the first letter of a vowel as a code;
(2) the etymons of the font coding elements are classified by assigning specific meanings to the coded etymons of the Chinese characters in a mode of describing the character form characteristics of the etymons of the Chinese characters by characters, a large number of the Chinese character etymons which look irregular are integrated into 42 coded etymons classified according to morphological characteristics, and the coded etymons are in rational mapping association with 26 letters according to the pictographic or harmonious relation, so that a Chinese character input method with 42 coded etymons is formed, and 13 radicals of 'heart, middle, Shen, vacuum line, cun, horse, and, English, element, tooth, Jade and positive' are defaulted, and the remaining 29 etymons form a simplified version Chinese character input method;
(3) the radical code of the Chinese character font determines the coding radical defined by the invention according to the first stroke, the last stroke and the fixed position at the upper right, and takes out the corresponding letter code, without splitting the Chinese character, the font code of the Chinese character is composed of the first stroke code, the last stroke code, the identification code and the feature code of the Chinese character font, and the agreed significance is as follows:
the first stroke code is the English letter code of the radical formed by the first stroke of the Chinese character or the strokes connected with the first stroke;
the last stroke code is an English letter code of a etymon formed by the last stroke of the Chinese character or the strokes connected with the last stroke;
the identification code is an English letter code of the highest stroke or etymon at the upper right of the Chinese character;
the characteristic code is an English letter code of the highest stroke or etymon at the upper right after the Chinese character identification code is removed;
the English letter codes corresponding to the coding etymons of the Chinese character patterns are as follows:
Figure FSA0000235717260000011
Figure FSA0000235717260000021
(4) the code-fetching rule of the Chinese character font is as follows:
the font code-taking rule is that the first stroke code, the last stroke code, the identification code and the feature code of the Chinese character are defined according to the invention, the radical code consisting of the first stroke, the last stroke and the upper right stroke of the Chinese character is taken out, and the code-taking rule of the font code is as follows:
when the first stroke code, the last stroke code and the identification code are the same stroke or etymon, the etymon or the letter code of the stroke can be repeatedly taken;
when the upper half section and the lower half section of one stroke respectively form different etymons with other strokes, the letter codes corresponding to the etymons can be respectively selected;
preferentially taking letter codes of a large etymon with more strokes, and directly taking single strokes when the large etymon is not formed;
fourthly, when the prefix is a Chinese character covered by a large square frame (oral), a Chinese character head (+), and a Chinese character head (), when the identification code at the upper right is taken, the prefix is removed and then the identification code is taken.
2. The Chinese character coding input method of claim 1, the Chinese character coding of the sound-shape coding mode, characterized in that: the sound-shape coding of Chinese characters is based on the code-fetching rule of Chinese character coding elements, the complete coding of Chinese characters is composed of four coding elements of first letter, first stroke code, last stroke code and identification code of Chinese character phonetic alphabet, in which the identification code is only required to be input when there is Chinese character with same coding, and the code-fetching sequence of Chinese character sound-shape coding can also be expressed as follows:
first letter + first stroke code + last stroke code + identification code of Chinese character pinyin
The fuzzy coding of Chinese characters in the sound-shape input mode is a simple input mode, the identification code of Chinese characters does not need to be subdivided, and can be directly replaced by general letter code Y or Z, the required Chinese characters can be quickly found, and the code-fetching sequence of Chinese character coding can be expressed as follows:
the first letter + the first stroke code + the last stroke code + the universal letter code Y of the Chinese character pinyin;
the first letter + the first stroke code + the last stroke code + the universal letter code Z of the pinyin for the Chinese character.
3. The Chinese character coding input method of claim 1, the Chinese character coding of the pictophonetic coding mode, characterized in that: the character pattern characteristic of Chinese character is used as main, the pronunciation of character is used as auxiliary coding mode, a Chinese character code is formed by coding etymons of Chinese character patterns and the first letter of pronunciation of Chinese character or the first letter of pronunciation of character components of Chinese characters, and the length of Chinese character code is four coding elements at most;
(1) the invention has the convention significance for Chinese character component
The character-forming radicals referred to in the invention mean that the radicals forming the character have definite pronunciation and can be independently used as the radicals of the Chinese characters without correcting the stroke shape;
the first character-forming radical is the maximum character-forming radical consisting of the first stroke of the Chinese character and the subsequent strokes of the Chinese character;
the last character component of the Chinese character is the maximum character component formed by the last stroke of the Chinese character and the previous stroke of the Chinese character;
the code taking of the Chinese character components takes the initial codes of the character components formed at the end of the character according to position priority, and if the character components formed at the end of the character do not exist, the initial codes of the character components formed at the maximum prefix are taken;
fifthly, single horizontal stroke 'one' cannot be used as a character-forming radical, Chinese characters without a prefix and a character-forming radical are not used, and initial consonant codes of the radicals are uniformly V;
(2) chinese-character 'shape-pronunciation' encode method
According to the code-fetching rule of the Chinese character coding elements of the invention, the first stroke code, the last stroke code and the identification code at the upper right of the Chinese character are sequentially fetched, and then the pronunciation of the Chinese character or the initial consonant code of the pronunciation of the most character-forming components are fetched, and the code-fetching sequence of the Chinese character coding in the form-sound input mode can also be expressed as follows:
the first letter of the first stroke code, the last stroke code, the identification code and the pinyin of the Chinese character;
the first stroke code, the last stroke code, the identification code and the first letter of the pinyin beside the character-forming component.
4. The Chinese character coding input method of claim 1, a Chinese character coding in a pure font coding mode, characterized in that: according to the invention, the first stroke code, the last stroke code, the identification code at the upper right and the feature code of the Chinese character are sequentially selected according to the code selecting rule of the Chinese character font, the length of the Chinese character code is at most four font coding elements, and the code selecting sequence of the Chinese character code can also be expressed as follows:
first stroke code + last stroke code + identification code + feature code;
when the characteristic code of Chinese character is taken, the characteristic code is taken after the components of mountain, bird, bamboo, leaf, rain, gas, , ge, , , vacuum control unit, are removed.
5. The Chinese character coding input method of claim 1, the simplified version Chinese character input method is applicable to the Chinese character coding rules and method of the present invention, and is characterized in that: the number of coded etymons is reduced in original 42 etymons, 13 Chinese character components of 'heart, middle, Shen, vacuum control unit, cun, horse, Nami, Onwei, Yu, tooth, Yu and Zheng' are omitted, the rest 29 etymons are used as coding elements of Chinese character font, the corresponding English letter key positions are kept unchanged to construct a simplified version Chinese character input method with the etymon number being close to that of English letters, one etymon corresponds to one English letter key position, and the simplified version Chinese character sound-shape coding code-taking sequence can be expressed as follows:
first letter + first stroke code + last stroke code + identification code of Chinese character pinyin
The simplified Chinese character pictophonetic coding and code fetching sequence can be expressed as follows:
the first letter of the first stroke code, the last stroke code, the identification code and the pinyin of the Chinese character;
the first stroke code, the last stroke code, the identification code and the first letter of the pinyin beside the character-forming component;
the simplified Chinese character pure font coding code-fetching sequence can be expressed as follows:
first stroke code + last stroke code + identification code + feature code.
6. Pinyin compression input mode
The Chinese character coding input method of claim 1, the pinyin compression mode being characterized by: the compression of the double initial consonants is that only one first letter is input, the compression of ng in the final consonants is g, and the Chinese phonetic coding elements of the compressed Chinese characters do not exceed four codes, so that the characters which cannot be written can be conveniently input.
7. The Chinese character coding input method of claim 1, wherein the Chinese character coding is based on the auxiliary mode of full spelling and double spelling, and the method is characterized in that: the first stroke code and the last stroke code of the Chinese character are used as auxiliary etymons, the first stroke code or the last stroke code of the character is continuously input when only one Chinese character needs to be determined after the full spelling or double spelling is input, the auxiliary etymon input mode is suitable for the layout of various double spelling vowels, thereby achieving the effect of inputting single pinyin without repeated codes, and the font auxiliary coding of the full spelling and double spelling Chinese characters can also be expressed as follows:
chinese character full spelling + first stroke code + last stroke code
Double spelling of Chinese characters + first stroke code + last stroke code.
8. The Chinese character coding input method according to claim 1, wherein the phrase codes have a plurality of coding modes according to the position sequence of the coding elements, characterized in that:
(1) three coding modes of the two-character phrase:
firstly, according to the code-fetching rule of Chinese character coding element of said invention firstly fetching first letter of phonetic transcription of every Chinese character in two-character phrase, then sequentially fetching first stroke code of every character in two-character phrase, the code-fetching sequence of two-character phrase code also can be expressed as follows:
the first letter 1 of the pinyin + the first letter 2 of the pinyin + the first code 1+ the first code 2;
secondly, the first letter and the first stroke code of each Chinese character pinyin in the two-character phrase are sequentially selected according to the code selecting rule of the Chinese character coding elements, and the code selecting sequence of the two-character phrase codes can also be expressed as follows:
the first letter 1+ the first stroke code 1+ the first letter 2+ the first stroke code 2 of the pinyin;
thirdly, according to the code-fetching rule of the character pattern of the Chinese character, the first stroke code and the last stroke code of each character in the two-character phrase are sequentially fetched, and the code-fetching sequence of the two-character phrase codes can also be expressed as follows:
the first stroke code 1+ the last stroke code 1+ the first stroke code 2+ the last stroke code 2;
(2) two coding modes of the three-character phrase are as follows:
firstly, according to the code-fetching rule of Chinese character coding elements of said invention, the first letter of every Chinese character phonetic alphabet in three-character phrase is sequentially fetched, then the first stroke code of last character is fetched, and the code-fetching sequence of three-character phrase code also can be expressed as follows:
the first letter 1 of the pinyin + the first letter 2 of the pinyin + the first letter 3 of the pinyin + the first stroke code 3;
secondly, the first stroke code and the last stroke code of each character in the three-character phrase are sequentially selected according to the code selecting rule of the Chinese character coding elements, and the code selecting sequence of the three-character phrase codes can also be expressed as follows:
the first stroke code is 1+ 2+ 3;
(3) two coding modes of multi-character phrases with more than four characters are as follows:
firstly, the first letters of the Pinyin of the first three Chinese characters and the first letter of the Pinyin of the last character in the phrase are sequentially selected according to the code selection rule of the Chinese character coding elements of the invention, and the code selection sequence of the multi-character phrase coding can also be expressed as follows:
the first letter of the pinyin 1+ the first letter of the pinyin 2+ the first letter of the pinyin 3+ the first letter of the last pinyin;
secondly, the first code of the first three characters and the first code of the last character in the phrase are sequentially selected according to the code selecting rule of the Chinese character coding elements, and the code selecting sequence of the multi-character phrase coding can also be expressed as follows:
the first stroke code is 1+ 2+ 3+ the last initial stroke code.
9. The Chinese character coding input method of claim 1, wherein the layout of the key etymons on the keyboard has mobility and interchangeability, characterized in that: the Chinese character components and the etymons on the key positions can be used as a whole to be mutually exchanged or moved, the coincident code rate index and the accurate typing effect of the Chinese character input method are still kept unchanged, the characteristic that the etymons can be integrally moved and exchanged is also suitable for rearranging the keyboard layout of the etymons according to the human engineering principle, the finger keystroke comfortableness is improved, and meanwhile, the typing accuracy is still kept unchanged.
10. The integrated Chinese character input system comprising the Chinese character encoding and inputting method as claimed in claim 1, wherein: the invention can use only one coding mode as a complete Chinese character input method, and can integrate several coding modes into a Chinese character input system comprising multiple input modes, wherein any one coding mode is used for inputting Chinese characters and phrases without switching operation, and the Chinese character codes of various input modes are naturally separated and do not interfere with each other.
CN202110257647.2A 2021-03-09 2021-03-09 Non-split Chinese character input integrated system capable of accurately inputting Chinese characters Pending CN115047980A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110257647.2A CN115047980A (en) 2021-03-09 2021-03-09 Non-split Chinese character input integrated system capable of accurately inputting Chinese characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110257647.2A CN115047980A (en) 2021-03-09 2021-03-09 Non-split Chinese character input integrated system capable of accurately inputting Chinese characters

Publications (1)

Publication Number Publication Date
CN115047980A true CN115047980A (en) 2022-09-13

Family

ID=83156623

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110257647.2A Pending CN115047980A (en) 2021-03-09 2021-03-09 Non-split Chinese character input integrated system capable of accurately inputting Chinese characters

Country Status (1)

Country Link
CN (1) CN115047980A (en)

Similar Documents

Publication Publication Date Title
US5360343A (en) Chinese character coding method using five stroke codes and double phonetic alphabets
KR100962956B1 (en) Input method for optimizing digitize operation code for the world characters information and information processing system thereof
CA2477637C (en) Component-based, adaptive stroke-order system
JP2006127510A (en) Multilingual input method editor for ten-key keyboard
TWI464678B (en) Handwritten input for asian languages
WO2000025197A1 (en) Keyboard input devices, methods and systems
CN111880667A (en) Phoneme same-tone near-bit common Chinese character code input method
CN103616960A (en) Six vowel binary syllabification input method
WO2000043861A1 (en) Method and apparatus for chinese character text input
CN115047980A (en) Non-split Chinese character input integrated system capable of accurately inputting Chinese characters
CN105912139B (en) Method for correspondingly recognizing modular stroke coding Chinese characters
CN107256092B (en) Chinese character digital shape code quick input method
WO2001093180A1 (en) World characters numerical coding input method and thereof its information handling system
CN1032939C (en) Chinese-character coding, English keyboard and single-hand keyboard input
CN108459735A (en) Phonetic double-click touch screen method for inputting pinyin
CN110502128B (en) Chinese character multi-element input method and system
CN101957662B (en) Computer with Chinese character elements as well as cell phone keypad for inputting Chinese characters and input method
CN106325540A (en) Simplified input method of northeast Yunnan sub-dialect Miao language and application of simplified input method
CN101866338B (en) Method for creating Chinese character
CN1063856C (en) Keyboard and method for computer input of character-separated phonetic transcriptions
CN112783336A (en) New phoneme same-tone near-bit Chinese character code input method
WO2020087769A1 (en) Phonetic writing input method
CN105807949A (en) Tibetan input method and system
CN1752899B (en) Chinese language coding and its Chinese character input method and retrieval method
KR20240029703A (en) Keyboard system utilizing multi-pointer input

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination