WO2004036404A1 - Pattern-code chinese character input method with “three patterns association” - Google Patents

Pattern-code chinese character input method with “three patterns association” Download PDF

Info

Publication number
WO2004036404A1
WO2004036404A1 PCT/CN2003/000858 CN0300858W WO2004036404A1 WO 2004036404 A1 WO2004036404 A1 WO 2004036404A1 CN 0300858 W CN0300858 W CN 0300858W WO 2004036404 A1 WO2004036404 A1 WO 2004036404A1
Authority
WO
WIPO (PCT)
Prior art keywords
code
glyph
chinese characters
chinese
type
Prior art date
Application number
PCT/CN2003/000858
Other languages
French (fr)
Chinese (zh)
Inventor
Zongxing Lin
Zongfan Lin
Original Assignee
Zongxing Lin
Zongfan Lin
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zongxing Lin, Zongfan Lin filed Critical Zongxing Lin
Priority to AU2003272866A priority Critical patent/AU2003272866A1/en
Publication of WO2004036404A1 publication Critical patent/WO2004036404A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018Input/output arrangements for oriental characters

Definitions

  • the present invention relates to a word processing method, and more particularly, to a Chinese character input method.
  • the present invention is to overcome the shortcomings in the prior art, and provides a Chinese character input method with low recoding rate, easy to learn, memorable and fast.
  • a Chinese character "three-association" shape code input method includes the following steps:
  • the Chinese character input rules are as follows: taking basic glyph elements as a unit, removing three types of nine-type code points, selecting three glyph elements, and typing in sequence to complete the input of a Chinese character; when the same glyph element is the same, the big priority is given to the glyph element.
  • connecting or intersecting use the method of disconnecting the common strokes up and down or left and right to get the code.
  • the Chinese characters are divided into three types and nine types, and the glyph elements are selected according to the code taking points.
  • the three types are single-line type, double-line type, and three-line type.
  • the nine types and code-taking methods are:
  • "02 type” is a character that can be divided into two glyph elements up and down, followed by one or "R” as the identification code to complete the input of three codes;
  • "03 type” is a character that can be divided into three or more glyph elements up and down. Type in the first, second, and last three glyph elements in sequence;
  • "012 type” is a character font, which can be divided into upper and lower parts, taking the first glyph element in the upper part, taking one glyph element in the lower part, and the last glyph element;
  • "021 type” is an inverted type, which can be divided into upper and lower parts, taking the first glyph element and the last glyph element in the upper two parts respectively;
  • "11 type” is a left and right type. Take one glyph element from left and right, and then add an "R" or "L” identification code to complete the input.
  • 111 type is a Sichuan font, and the first, second, and last three glyph elements are taken from left to right.
  • the coding rules for Chinese character splitting are as follows: The connected glyph elements are split and split; the connected or intersecting glyph elements use the method of disconnecting public strokes up and down or left and right to get the code, and the left and right can not be disconnected; When the glyph elements outside the frame are sufficient, the glyph elements in the frame are not coded; when the glyph elements are sufficient, the corner points are not coded; when the glyph elements are insufficient, the corner points are coded; when there is a frame, the coding is de-framed downwards, first outside the frame and then inside the frame ; Upward deframe coding, first inside the frame and then outside the frame; characters above the two glyphs on the left side, take the first, last code and the last code on the left side; characters with and beside are coded from the left, without stroke order; Characters with discrete Chinese character structure are split naturally.
  • Table 3 shows the correspondence between the glyph elements and the 26 letters on the keyboard when entering both simplified and traditional characters or ancient Chinese characters.
  • This code is based on the six characters of the six characters (ie, pictograph, pictophonetic, understanding, referral, borrowing and re-sending), using the scientific method formed by the history of artificial characters, to extract the short form of the glyph element used for encoding.
  • "Yuan” the code extraction method is designed by the association of sound, form and meaning. Chinese characters are strictly classified, and the code points of each character are fixed. As long as it is encoded according to rules, the encoding of any Chinese character is unique. Rarely seem Instead of the so-called fault tolerance code. Its logic is much higher than all kinds of codes today. The code length is short, and the coverage of words and words is large. Moreover, the code, sound, and meaning of the six books, such as pictograms, are closely connected, so that logic and image are perfectly combined. Learning is fun, not boring. So as to improve the efficiency of learning and use, you can achieve more results with less effort.
  • This coding is based on the philosophical theory of the six books and the method of code selection, using the association of sound, form and meaning.
  • This code selects 210 components from the "Chinese Character Component Specification GF3001-1997", plus 90 innovative components for a total of 300, as the final level of this code.
  • glyph element (referred to as “shape element"), which is used to compile three “shape elements” with a total of 21,003 Chinese characters at home and abroad.
  • the 300 "shape elements" are divided into groups of 26 groups according to the principle of the same family. Based on the association of sound, shape, and meaning, each family shape element is mapped to 26 Chinese Pinyin letters, that is, 26 English letters on the keyboard. relationship. That is to say, the English alphabet is used as the mother of the pinyin of the Chinese character, which constitutes a family of "shape elements" that can be read phonically. Under the accompanying sound, the glyph structural elements required for encoding can be extracted from the Chinese characters. Harmonic sounds indicate keystrokes to achieve fast and easy Chinese character input.
  • Table 1 Table 2 and Table 3 for the glyph elements, the composition of each family, and the correspondence with the 26 letters on the keyboard.
  • the glyph elements shown in Table 1 are used to input the 3000 most commonly used cylinder characters, which can be used in applications such as mobile phones that require simple input.
  • Table 2 is used to input 7000 commonly used Chinese characters.
  • the glyph elements shown in Table 3 are added with traditional and ancient Chinese characters. It is used to input both simplified and traditional Chinese characters or ancient Chinese characters. Words represented by "mouth” in the table, such as “gold” and "rice”, should be split when they are not used alone.
  • Table 4 shows the classification and coding rules of Chinese characters of "three types and nine types”.
  • the "dot" indicates the position where the primitive is extracted when the Chinese character is encoded.
  • Mouth represents a single element; represents any pure single column
  • Three categories refers to single-row, double-row, and three-row.
  • 012 ⁇ is a character font, which can be divided into upper and lower parts, taking one glyph element in the upper part, taking the first glyph element in the lower part and the last glyph element; for example, the word “Salary” is divided into: Ding is coded as NLT, "021" is an inverted product, which can be divided into upper and lower parts. Take one glyph element in the upper part and the last glyph element in the lower part. For example, the "type” character is divided into: 1, 4, and Coded as HZT.
  • "11 type” is a character of "shape element" (one yard) on each side.
  • the word “ding” is divided into: mouth, Ding, and the supplementary identification code “R” or its code is OTR.
  • 111 type is a Sichuan font. Take the first, second, and last three “shape elements” (three yards) from left to right. For example, if the word “tide” is divided into:,, ten, and month, it is encoded as DSM.
  • the all-inclusive type is a single-row type. Press outside the frame and then inside the frame to remove the type. For example, the word “return” is “02 type”. The word “country” is "03 type”.
  • Example Word Split Coding Example Word Split Coding;
  • the frame shape element is sufficient (including corner points), and the shape element inside the frame is not coded
  • the shape element is not enough for corners to be encoded
  • Example word split coding Zhou Mentukou MT 0 plaque ⁇ , door EDM En ⁇ big heart KYD for, force, DU 3 ⁇ 4 small doorway XM 0 door 7 people DIC
  • this code will use commonly used Chinese characters (select high-frequency Chinese characters as much as possible), only take one or two "shape elements" in front of it, and then add a number key or space key to end. As the code of the word, a barrel code is formed. Because this code is a three-code code (three-button end). Therefore, only one or two short codes are set.
  • the first level short code of this code is divided into two levels.
  • the first level is: Each family can arrange the first code of high frequency words plus space bar, a total of 26 short codes.
  • the second-level shortcode refers to the code that takes only the first two "shape elements" of a single character, adds a space key to the end, and selects the frequently used Chinese characters as the second-level shortcode.
  • the first and second short codes are designed with on-screen prompts, no need to remember. After a little practice, professionals can achieve blind typing. (6) Vocabulary coding rules
  • the two-word words used in this code are mostly used in "single words".
  • the first character takes two yards according to the rules, and the second character takes the first yard.
  • the encoding rules for multi-word words are to take the first code of the first, second and last words.
  • this code In order to adapt to a wider range of requirements and facilitate communication at home and abroad, in addition to simplifying the Chinese character system, this code also designs traditional Chinese and ancient Chinese character coding systems. This code compiles a total of 21,003 Chinese characters at home and abroad. The coding rules for traditional and ancient Chinese characters are the same as for tubed characters, but a prescribed end key (or conversion key) is added respectively.
  • This code is a pure form code characterized by spelling "phonetic":
  • This code is based on the shape, and the code used is Hanyu Pinyin. That is, the glyph structural elements such as strokes, radicals, radicals, standard parts and innovative parts of Chinese characters are collectively referred to as glyph elements (referred to as "shape elements"). Through the association of sound, form and meaning, these "shape elements" are regarded as Chinese phonetic alphabets that can be read aloud. The glyph of each Chinese character is coded using only three Chinese Pinyin letters. This is a kind of sound and spell. Not only does it have the same effect as the keyboard input of Pinyin text (such as English, etc.), but it is also faster and more concise.
  • the word “jiang” is divided into three “shape elements” as:,;, ding, one, where ",;” (point Di S n) is associated with D, " ⁇ " is T, and " ⁇ " (Heng Ying 6 ng) association is ⁇ , so its encoding is DTH.
  • the Chinese characters are entered by spelling "Pinyin” because the glyphs of the Chinese characters are ideographic. I ca n’t understand from the South to the North, but I understand everything as soon as I write the character (glyph). That ’s the reason. Chinese characters are not just “writing symbols of the recorded language” like Western pinyin characters. . Chinese characters are also "writing symbols that record people's thoughts, consciousness, and ideas.” For keyboard input, only the glyph of Chinese characters can be accurate Ideographic. Because there are too many homonyms in Chinese characters, only by inputting Chinese characters in a spelling shape, a low repetition rate can be achieved. This code is further than the general shape code: This code is not a simple mechanical spelling, but a "phonetic".
  • this code also has the characteristics of "phonetic” and "shape", it is fundamentally different from the general phonetic code (or phonetic code).
  • the sound and shape of the sound shape code are separated, that is, the spelling sound is firstly added, and then the first or last shape of the word is added to reduce the repetition rate. Due to the unsynchronized sound and shape of such a phonetic code, there is inconsistent thinking in use, and it can not get rid of the same defects as the Chinese phonetic code, that is, unrecognizable characters or inaccurate pronunciation cannot be input. Practice has proved that The input effect is not as good as a simple shape code. This is the fundamental reason why the commonly used form codes are still difficult to learn but still occupy the main market. The tone and shape of this code are synchronized and integrated.
  • the technical way out for entering Chinese characters on a computer keyboard is in shape codes.
  • the way out of the shape code is to solve the problem of learnability and shorten the code length appropriately, and maintain a relatively heavy code rate. Imitation of the shape, sound, and meaning of the ancestors' "six books" is the most effective way to solve easy learning, easy to remember, and memorable. This is determined by the inherent characteristics and profound connotation of square Chinese characters.
  • the coding is based on the pronunciation, shape and meaning. Due to the role of association, mapping effects can be formed between human senses, thinking and glyphs, codes, and key positions. Therefore, when typing Chinese characters on the keyboard, not only can you see and type, but it is also convenient to want to type and listen to type. Therefore, it can meet the needs of various personnel.
  • Associativity is the bond to understand things.
  • you see the word "person” you will think of a human figure standing on two feet.
  • you see the words “say, month” you will think of the sun and moon in the sky.
  • you see the word "fire” you will think of a wild bonfire set up in the wild.
  • you see the word "rain” you will feel drizzle.
  • you see the word “tears” you will feel the tears dripping from your eyes. All this seems primitive, yet it is very modern and scientific.
  • the computer is especially "favorite” these magical Chinese characters, it can accurately and quickly identify it. This shows that it can adapt well to contemporary technology. It is high technology that revitalizes ancient Chinese characters. As long as we use the right method, the technology of inputting Chinese characters on the keyboard will be more popular and better than pinyin (such as English). It is just around the corner.
  • This code adopts the method of disconnecting and dividing a part of Chinese characters with connected and intersecting glyphs, instead of using the method of "shape element" and "extraction". In this way, under the condition that the shape of the glyph is kept intact, the purpose of intuitive and quick code fetching can be achieved.
  • the so-called “break” is only the point at which the two "shape elements” meet at the intersection of Chinese characters, and the shape of the glyph has not changed. This is very important for the intuitiveness of code in shape. Since this encoding does not PT / CN2003 / 000858
  • the "extract" split code is adopted, so it is avoided, which is recognized as the difficulty of learning and using the shape code, thus greatly improving the coding efficiency.
  • Prerequisite speed only look accurate, fast code set, in order to quickly keystrokes ⁇ "disconnect" take the code, change shape due to the shape, do not have to abstract thinking, and create favorable conditions for quick keystrokes. It should be noted that the speed of typing depends not only on the number of keystrokes, but more importantly, whether the code can be determined intuitively and quickly.
  • This code uses three types of nine-character classification of Chinese characters and three-point scattered coding method.
  • this code uses three types of nine-character classification of Chinese characters, for more complex glyphs, the used code points are scattered and code points are used. It does not go deep into the font, so it is relatively simple and intuitive, and it is easy to determine the code point. At the same time, the code points are scattered, so that each independent structural part of the Chinese character can obtain codes, which will inevitably reduce the re-coding rate.
  • the character font of this code (012
  • the code points of Chinese characters of type), inverted type (type 021) and left and right type (type 21) are compared with the examples of commonly used shape codes today:
  • This coded word only uses a maximum of three codes. When the word is less than three yards,
  • W, L, R three letters are used as identification codes. There are no complicated situations such as cross recognition. The other types of single words are all three yards, so there is no need to add an identification code. Therefore, the font recognition of this code is very easy to learn and remember.
  • W, L, R three-letter codes are font identification codes, which is one of the important technical features of this code. ⁇
  • the number of primary and secondary short codes of this code is large, high quality, and no memory is required.
  • This code has 286 primary short codes (one code plus space bar or number key). More than 660 secondary short codes (two codes plus space bar).
  • the first-level shortcodes are all high-frequency words; and almost include the first-level barrel codes of today's shape codes and phonetic codes.
  • the first and second level short codes of this code are designed with screen prompts, so do n’t memorize them. Professionals can touch blindly with a little practice.
  • This code also opens the door for foreigners (especially English learners) to learn and input Chinese characters.
  • foreigners especially English learners
  • the code of water “is W, which coincides with the English WATER.
  • Other Xing Yuan "is not difficult to learn as long as you understand Chinese characters.
  • Foreigners and Southerners speak Mandarin The pronunciation is inaccurate, and it is often difficult to master the ending sound.
  • the first sound is generally no problem.
  • This code spells "phonetic", which is actually only "Jin Pin”. It uses only the first letter, and it is compatible with the pinyin code. In the same way, you can directly use the English keyboard layout of the computer to make it foreign (not mechanically used by general shape codes). Therefore, it is not difficult to learn and use. Therefore, this code will help Chinese and foreign cultural exchanges.
  • this code has superior technical performance, so it is not only easy to use, but also can meet the needs of professionals. Therefore, students, teachers, writers, journalists, secretaries, etc. can learn and master quickly. Dialect areas can also be promoted. As long as you know a little bit of Hanyu Pinyin, it is suitable for all ages and learn to use it.
  • this code uses the "phonetic" method, which is consistent with the Chinese phonetic alphabet education in elementary and middle schools, it is suitable for general promotion in primary and secondary education. In this way, we can achieve: Wen computer coding Starting from the baby, it can popularize and improve the input of Chinese characters on computer keyboards, and lay a solid foundation for the modernization of Chinese information processing.
  • this code can also be extended to the field of compiling common words, dictionaries and electronic dictionaries.
  • Using this code to search dictionaries and dictionaries can be the same as searching English dictionaries, and it is more convenient and quick.
  • the code length is short (a single word uses only three letters), it can basically be done in one step (that is, "seat with a check mark").
  • the dictionary of this code is basically arranged in the way of radicals and radicals. The same and near words are arranged. In one piece. This makes it easy to compare, learn, remember, and deepen understanding.
  • This code includes simplified and traditional Chinese characters, ancient and modern Chinese characters, and Chinese characters at home and abroad. This can not only study and excavate the vast cultural treasure trove of ancient China and conduct ancient text research, but also adapt to the needs of Hong Kong, Macao, Taiwan and Chinese-speaking regions and the growing international information exchange, and serve international friends and overseas Chinese who use Chinese characters in the world!
  • Type 012 public, crime, crown, arrogance, dysfunction, Jia, Ji, Lu Fan, 'comfort, capital, dysfunction, dysfunction, panic, disadvantage, feeling
  • Type 21 ⁇ ⁇ Regulation, brake, daring, quail. Missing, scattered, wading, offering, three types of Sichuan,. ⁇ , ⁇ , ⁇ , ⁇ , invitation, pelican, incubation

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)
  • Controls And Circuits For Display Device (AREA)

Abstract

The present invention discloses a pattern-code Chinese character input method with ' Three Patterns Association ', which intends to provide a Chinese character input method with low repetition rate and highly input speed, as is easy to learn and difficult to forget. The method chooses 300 pattern units which are divided into 26 classes based on pattern according to a classification rule, and creates the relationship between the pattern units in each class and 26 English letter keys on the keyboard with association in pattern, pronunciation, or meaning, to form 'patter units' classes in which homophone can be pronounced. A portion of said 300 pattern units are used for inputting 3000 most frequently used Chinese characters, and a portion of those are used for inputting 7000 frequently used Chinese characters. All of the 300 pattern units are used for inputting simplified Chines characters, traditional Chinese characters and ancient Chinese characters. By classifying Chinese characters into nine types in three classes and employing a code-selection method with the primary pattern units as unit, three pattern units are selected to be input sequentially, wherein, when the pattern units is the same, the larger is selected by preference. The present invention has the advantages of short code length, low repetitive rate, capable of choose pattern units required for coding in Chinese characters with the help of pronunciation, high-speed and easily input of Chinese characters by typing according to the indication of homophone.

Description

汉字 "三形联想" 形码输入法 技术领域  Chinese character "Three-shape association" Shape code input method Technical field
本发明涉及一种文字处理方法, 更具体的说, 是涉及一种汉 字输入方法。  The present invention relates to a word processing method, and more particularly, to a Chinese character input method.
背景技术 Background technique
中文信息处理的现代化, 是国民经济和社会信息化的基础。 计算机键盘输入汉字效率的提高, 是实现中文信息处理现代化的 瓶颈。 因为 "效率就是生命" 已是人们的共识。  The modernization of Chinese information processing is the foundation of national economic and social informatization. The improvement of Chinese keyboard input efficiency is the bottleneck of modernizing Chinese information processing. Because "efficiency is life" has been the consensus of people.
当今虽有上千种计算机键盘输入汉字的编码, 然而至今还没 有一种编码能做到既好学又好用。 也就是说好学的打不快。 所谓 好学的拼音码, 重码多, 难以盲打, 同音字、 生僻字、 语音不统 一等难题无法解决, 效率难以提高。 至于当前常用的形码, 虽然 能打的快, 熟练者打字速度甚至超过英文, 但既难学又易忘, 学 习效率很低。 有人说就像学了一门 "外语" 。 若不经常使用又会 忘掉大半。 因此这种形码, 只能在专业打字员中使用。 如果汉字 计算机的编码不能普及使用, 或使用效率低下, 文件的起草与录 排 不能同步, 即打字要转手, 不能像英文那样, 只要练练指法, 稍加学习, 多数人都会打字, 也就不能说, 汉字的计算机键盘输 入, 已经真正实现了现代化。  Although there are thousands of codes for inputting Chinese characters on computer keyboards today, there is no one that can be both easy to learn and use. In other words, it's hard to learn to beat. The so-called easy-to-learn pinyin codes have many heavy codes and are difficult to type blindly. Problems such as homophones, rare words, and inconsistent pronunciation cannot be solved, and the efficiency is difficult to improve. As for the commonly used shape codes, although they can type fast, and the skilled person can type faster than English, it is difficult to learn and easy to forget, and the learning efficiency is very low. Some people say it is like learning a "foreign language". If you don't use it often, you will forget most of it. Therefore, this form code can only be used by professional typists. If the Chinese character computer coding cannot be widely used, or the efficiency is low, the drafting and recording of documents cannot be synchronized, that is, typing must be changed, and it cannot be like English. As long as you practice fingering and learn a little, most people will type It is said that the computer keyboard input of Chinese characters has been truly modernized.
当今尚有许多 "音形" 、 "形音" 、 "音数" 并举, 及数笔 画的汉字编码。 这些编码, 虽然能好学些, 或者能降低一些重码 率, 但其诸多缺陷, 都难以从根本上提高效率。 此外尚有声控和 手写的输入方法。 声控存在南腔北调难统一, 说错了改也难等问 题。 手写输入汉字, 由于笔画多, 输入慢, 计算机识别难, 效率 很低。 因此, 这些方法只能在特定条件下使用, 难以普及推广, 要想成为计算机输入汉字的主流, 是很困难的事情。  Today, there are still many "phonetic", "phonetic", "phonetic" simultaneous, and Chinese character coding of several strokes. Although these codes can be better learned or can reduce some re-coding rates, many of their shortcomings are difficult to fundamentally improve efficiency. There are also voice input and handwriting input methods. There are problems with voice control that are difficult to unify from the south to the north. Chinese characters are input by hand. Due to the large number of strokes, the input is slow, computer recognition is difficult, and the efficiency is very low. Therefore, these methods can only be used under specific conditions, and it is difficult to popularize them. It is very difficult to become the mainstream of Chinese characters input by computers.
综上所迷, 汉字编码虽然从无到有, 在一定程度上, 满足了 中文信息处理现代化的需求, 但从总体上讲仍效率低下, 还远远 不能满足日益增长的国民经济和社会信息化的需求。 计算机键盘 输入汉字编码方案, 虽然有了量的积累, 仅报请专利的就有 1000 多种, 但汉字计算机编码方案, 更需要有质的飞跃。 汉字编码要 做到: 易学、 难忘、 快速三者兼有, 才能达到要求。 To sum up, although the encoding of Chinese characters is from scratch, to a certain extent, it satisfies The demand for the modernization of Chinese information processing is still inefficient, and it is far from meeting the growing demand for national economic and social informatization. Although the Chinese keyboard coding scheme for computer keyboard input has accumulated, there are more than 1,000 patents that have only been applied for, but the Chinese computer coding scheme requires a qualitative leap. Chinese character coding must be easy to learn, memorable, and fast in order to meet the requirements.
发明内容 Summary of the Invention
本发明是为了克服现有技术中的不足之处, 提供一种重码率 低, 易学、 难忘、 快速的汉字输入方法。  The present invention is to overcome the shortcomings in the prior art, and provides a Chinese character input method with low recoding rate, easy to learn, memorable and fast.
本发明通过下述技术方案实现:  The present invention is achieved through the following technical solutions:
一种汉字 "三形联想" 形码输入法, 包括下述步骤:  A Chinese character "three-association" shape code input method includes the following steps:
( 1 )选择字形元素作为编码的基本部件, 按形以同族原则分 成 26族,并与键盘上的 26个字母对应,其对应关系如表 1所示; (1) Select the glyph element as the basic component of the coding, divide it into 26 families according to the same family principle according to the shape, and correspond to the 26 letters on the keyboard. The corresponding relationship is shown in Table 1.
( 2 )输入汉字; (2) input Chinese characters;
所述汉字输入规则为: 以基本字形元素为单位,祛三类九型的 取码点选取三个字形元素, 依次键入即可完成一个汉字的输入; 同为字形元素时以大优先, 字形元素相连或相交时采用上下或左 右断开公用笔画再取码的方法。  The Chinese character input rules are as follows: taking basic glyph elements as a unit, removing three types of nine-type code points, selecting three glyph elements, and typing in sequence to complete the input of a Chinese character; when the same glyph element is the same, the big priority is given to the glyph element. When connecting or intersecting, use the method of disconnecting the common strokes up and down or left and right to get the code.
在输入汉字时,将汉字分为三类九型,根据取码点选取字形元 素; 所述三类为指单列类、 双列类和三列类; 所述九型及取码方 法为:  When inputting Chinese characters, the Chinese characters are divided into three types and nine types, and the glyph elements are selected according to the code taking points. The three types are single-line type, double-line type, and three-line type. The nine types and code-taking methods are:
"01型" 为成字的字形元素, 输入时在该成字字形元素后边 补两个 "W" 识别码;  "01 型" is a glyph element, and two "W" identification codes are added after the glyph element when entering;
"02型"为上下可拆分成二个字形元素的字,其后补一个 或 "R" 为识别码, 完成三码的输入;  "02 type" is a character that can be divided into two glyph elements up and down, followed by one or "R" as the identification code to complete the input of three codes;
"03型" 为上下可拆分成三个及三个以上字形元素的字, 按 首、 二、 末三个字形元素依次键入;  "03 type" is a character that can be divided into three or more glyph elements up and down. Type in the first, second, and last three glyph elements in sequence;
"012型"为品字型, 可拆分成上部和下部, 取上部第一个字 形元素、 取下部笫一个字形元素和最末一个字形元素; "021型"为倒品型, 可拆分成上部和下部, 分别取上部两部 分的第一个字形元素和下部最末一个字形元素; "012 type" is a character font, which can be divided into upper and lower parts, taking the first glyph element in the upper part, taking one glyph element in the lower part, and the last glyph element; "021 type" is an inverted type, which can be divided into upper and lower parts, taking the first glyph element and the last glyph element in the upper two parts respectively;
"11型" 为左右型, 左右各取一个字形元素, 在其后补一个 "R" 或 "L" 识别码完成输入;  "11 type" is a left and right type. Take one glyph element from left and right, and then add an "R" or "L" identification code to complete the input.
"12型" 为左右型, 取左边一个字形元素, 右边取首、 末二 个字形元素;  "12 type" is left and right, taking one glyph element on the left and the first two glyph elements on the right;
"21型" 为左右型, 取左边首、 末二个字形元素, 取右边一 个字形元素;  "21" is left and right, taking the first two glyph elements on the left and the first glyph element on the right;
"111型" 为川字型, 从左到右依次取首、 二、 末三个字形元 素。  "111 type" is a Sichuan font, and the first, second, and last three glyph elements are taken from left to right.
在输入常用汉字时,键盘上 26个字母与字形元素的对应关系 如表 2所示 ^  When entering common Chinese characters, the correspondence between the 26 letters on the keyboard and the glyph elements is shown in Table 2 ^
输入汉字时, 汉字拆分编码规则如下: 相接的字形元素, 分 解拆分; 字形元素相连或相交采用上下或左右断开公用笔画再取 码的方法, 能上下断开的不左右断开; 框外字形元素足够时, 框 内字形元素不编码; 字形元素足够时, 角点不编码; 字形元素不 足时角点编入码; 有边框时, 向下脱框编码, 先框外后框内; 向 上脱框编码, 先框内后框外; 左旁两字形元素以上的字, 取左旁 的首、末码和字的末码; 带 和 旁的字从左起编码, 不按笔顺; 离散汉字结构的字, 顺其自然拆分。  When inputting Chinese characters, the coding rules for Chinese character splitting are as follows: The connected glyph elements are split and split; the connected or intersecting glyph elements use the method of disconnecting public strokes up and down or left and right to get the code, and the left and right can not be disconnected; When the glyph elements outside the frame are sufficient, the glyph elements in the frame are not coded; when the glyph elements are sufficient, the corner points are not coded; when the glyph elements are insufficient, the corner points are coded; when there is a frame, the coding is de-framed downwards, first outside the frame and then inside the frame ; Upward deframe coding, first inside the frame and then outside the frame; characters above the two glyphs on the left side, take the first, last code and the last code on the left side; characters with and beside are coded from the left, without stroke order; Characters with discrete Chinese character structure are split naturally.
在既输入简体字, 又输入繁体字或古汉字时的字形元素与键 盘上 26个字母的对应关系如表 3所示。  Table 3 shows the correspondence between the glyph elements and the 26 letters on the keyboard when entering both simplified and traditional characters or ancient Chinese characters.
本发明具有下述有益效果:  The invention has the following beneficial effects:
本编码根据六书造字(即象形、 形声、 会意、 指事、 假借和转 注六种形式) 的字理, 采用先人造字历史形成的科学的方法, 提 取用于编码的字形元素简称 "形元" , 取码的方法采用音、 形、 意的联想设计的。汉字经过严格分类,每个字的取码点是固定的。 只要按规则编码, 任何一个汉字的编码都是唯一的。 很少有似是 而非的所谓容错码。 它的逻辑性远远高于当今各种形码。 其码长 较短, 字、 词涵盖面大。 而且, 因取码与象形等六书的形、 音、 意紧密相连,使逻辑与形象完美结合。 故学之有趣, 不枯燥乏味。 从而提高学习和使用效率, 能达到事半功倍的效果。 This code is based on the six characters of the six characters (ie, pictograph, pictophonetic, understanding, referral, borrowing and re-sending), using the scientific method formed by the history of artificial characters, to extract the short form of the glyph element used for encoding. "Yuan", the code extraction method is designed by the association of sound, form and meaning. Chinese characters are strictly classified, and the code points of each character are fixed. As long as it is encoded according to rules, the encoding of any Chinese character is unique. Rarely seem Instead of the so-called fault tolerance code. Its logic is much higher than all kinds of codes today. The code length is short, and the coverage of words and words is large. Moreover, the code, sound, and meaning of the six books, such as pictograms, are closely connected, so that logic and image are perfectly combined. Learning is fun, not boring. So as to improve the efficiency of learning and use, you can achieve more results with less effort.
具体实施方式 detailed description
下面结合具体实施例对本发明作进一步说明。  The present invention is further described below with reference to specific embodiments.
本编码是根据六书造字的字理, 取码的方法釆用音、 形、 意 的联想设计的。 本编码从《汉字部件规范 GF3001-1997》 中选取 210个部件, 另加创新部件 90个共 300个, 作为本编码的末级部 件。 并将其定义为 "字形元素" (简称 "形元" ) , 用以编制总 数 21003个海内外汉字的三个 "形元" 的编码。  This coding is based on the philosophical theory of the six books and the method of code selection, using the association of sound, form and meaning. This code selects 210 components from the "Chinese Character Component Specification GF3001-1997", plus 90 innovative components for a total of 300, as the final level of this code. And define it as "glyph element" (referred to as "shape element"), which is used to compile three "shape elements" with a total of 21,003 Chinese characters at home and abroad.
将 300个 "形元" , 按形以同族原则, 分成 26族的组合, 通 过音、 形、 意的联想, 使各族形元与 26个汉字拼音字母即键盘上 的 26 个英文字母建立映射关系。 即把英文字母当成汉字拼音字 母, 构成谐音能够声读的 "形元" 族。 可在伴读声下, 从汉字中 提取编码所需的字形结构要素一一字形元素。通过谐声指示击键, 达到快易输入汉字的效果。  The 300 "shape elements" are divided into groups of 26 groups according to the principle of the same family. Based on the association of sound, shape, and meaning, each family shape element is mapped to 26 Chinese Pinyin letters, that is, 26 English letters on the keyboard. relationship. That is to say, the English alphabet is used as the mother of the pinyin of the Chinese character, which constitutes a family of "shape elements" that can be read phonically. Under the accompanying sound, the glyph structural elements required for encoding can be extracted from the Chinese characters. Harmonic sounds indicate keystrokes to achieve fast and easy Chinese character input.
字形元素、各族的組成及与键盘上 26个字母的对应关系参见 表 1、 表 2和表 3。 其中表 1所示的字形元素用以输入 3000个最 常用筒体字, 可用于手机等需要简单输入方面的应用。 表 2用于 输入 7000个常用汉字,表 3所示的字形元素增加了繁体字和古汉 字中的字形元素, 用于既输入简体字, 又输入繁体字或古汉字的 情况。 表中用 "口" 表示的字, 如 "金" 、 "米" 等字, 在不作 偏旁单独使用时要拆分。  See Table 1, Table 2 and Table 3 for the glyph elements, the composition of each family, and the correspondence with the 26 letters on the keyboard. The glyph elements shown in Table 1 are used to input the 3000 most commonly used cylinder characters, which can be used in applications such as mobile phones that require simple input. Table 2 is used to input 7000 commonly used Chinese characters. The glyph elements shown in Table 3 are added with traditional and ancient Chinese characters. It is used to input both simplified and traditional Chinese characters or ancient Chinese characters. Words represented by "mouth" in the table, such as "gold" and "rice", should be split when they are not used alone.
为了在学习时容易记忆字形元素, 在本输入法中增加了联想 的方法, 如表二中联想方法所示。  In order to easily memorize glyph elements during learning, the association method is added to this input method, as shown in Table 2.
为了使汉字的拆分更合理, 将汉字结构按照 "三类九型" 分 类。 汉字 "三类九型" 结构分类及取码规则如表 4所示。 表中的 点" 表示汉字编码时提取形元的位置。 In order to make the division of Chinese characters more reasonable, the structure of Chinese characters is classified according to "three types and nine types". Table 4 shows the classification and coding rules of Chinese characters of "three types and nine types". In the table The "dot" indicates the position where the primitive is extracted when the Chinese character is encoded.
表 4中: 口表示单个形元; 表示任何纯单列 In Table 4: Mouth represents a single element; represents any pure single column
Figure imgf000006_0001
Figure imgf000006_0002
表示任何单列类 表示任何类型结构
Figure imgf000006_0001
Figure imgf000006_0002
Represents any single column class represents any type of structure
(一) "三类九型" 分类方法和字形识别码: (1) "Three types and nine types" classification method and glyph identification code:
汉字结构非常复杂, 一般形码只把汉字结构分为上下型、 左 右型和杂合型三种。 因此使许多复杂结构的汉字, 取码点难以确 定, 以致编码时无从下手。 为了更加直观、 快速地确定取码点, 提高编码效率, 本编码将汉字结构分为 "三类九型" 。  The structure of Chinese characters is very complicated. Generally, only the structure of Chinese characters is divided into three types: upper and lower, left and right, and heterozygous. Therefore, it is difficult to determine the code point of many complex Chinese characters, which makes it impossible to start coding. In order to determine the code point more intuitively and quickly, and to improve the coding efficiency, this coding classifies the structure of Chinese characters into "three types and nine types".
"三类" 是指单列类、 双列类和三列类。  "Three categories" refers to single-row, double-row, and three-row.
"九型" 内容如下:  "Nine types" reads as follows:
"01型" 为成字的 "形元" , 故在该成字 "形元" 后边, 补 两个识别码 "WW" 以完成输入。 如 "人" 的编码为 A, 则人字 的输入编码为 AWW。  "01 type" is the character "shape element", so after the character "shape element", add two identification codes "WW" to complete the input. If the code of "person" is A, the input code of herringbone is AWW.
"02型" 为上下可拆分成二 "形元" (二码) 的字。 其后补 "L" 或 "R" 为识别码, 以完成三码的输入。 如 "古" 字拆分为 十、 口, 则 "古" 字的输入码为 SOL。  "02 型" is a character that can be split up and down into two "shape elements" (two yards). Then add "L" or "R" as the identification code to complete the input of the three codes. If the word "gu" is divided into ten and ten, the input code of the word "gu" is SOL.
"03型" 为上下可拆分成三个以及三个以上 "形元" (三码) 的字, 如 "亭" 字, 拆分成: 、 口、 丁。 它的编码为 LOT。  "03 型" is a character that can be split up and down into three or more "shape elements" (three yards), such as the word "ting", which is split into:, mouth, and Ding. Its encoding is LOT.
"012型"为品字型, 可拆分成上部和下部,取上部笫一个字 形元素、 取下部第一个字形元素和最末一个字形元素; 如 "薪" 字拆分为: 、 立、 丁其编码为 NLT, "021型"为倒品型, 可拆分成上部和下部,分别取上部两部 分的笫一个字形元素和下部最末一个字形元素; 如 "型" 字拆分 为: 一、 4、 土其编码为 HZT。 "012 型" is a character font, which can be divided into upper and lower parts, taking one glyph element in the upper part, taking the first glyph element in the lower part and the last glyph element; for example, the word "Salary" is divided into: Ding is coded as NLT, "021" is an inverted product, which can be divided into upper and lower parts. Take one glyph element in the upper part and the last glyph element in the lower part. For example, the "type" character is divided into: 1, 4, and Coded as HZT.
"11型" 为左右各一个 "形元" (一码) 的字。 如 "叮" 字 的拆分为: 口、 丁, 补识别码 "R" 或 则其编码为 OTR。  "11 type" is a character of "shape element" (one yard) on each side. For example, the word "ding" is divided into: mouth, Ding, and the supplementary identification code "R" or its code is OTR.
"12型" 为左边一 "形元" (一码)右边二 "形元" (二码) 的字如 "胸" 这的拆分为: 月、 勺、 则其编码为 MPU。  "12 type" is the left one "shape element" (one yard), the right two "shape element" (two yards), such as "chest". This is divided into: month, spoon, then it is coded as MPU.
"21型" 为左边二 "形元" (二码)右边一 "形元" (一码) 的字, 如 "趁" 字的拆分为: 土、 人、 多则其编码为: TAP。  "21" is the left two "shape elements" (two yards) and the right one is "shape elements" (one yard). For example, the word "while" is divided into: soil, people, and more. Its code is: TAP.
"111型" 为川字型, 从左到右取首、 二、 末三 "形元" (三 码) 。 如 "潮" 字的拆分为: 、 、 十、 月则其编码为 DSM。  "111 type" is a Sichuan font. Take the first, second, and last three "shape elements" (three yards) from left to right. For example, if the word "tide" is divided into:,, ten, and month, it is encoded as DSM.
(二) 字形分类的特殊情况:  (2) Special cases of glyph classification:
① 多排列字如州、 洲等, 属三列类。 按 "111型" 从左到右 取首、 二、 末。 则洲字拆分为: ') 、 、 、 I 其编码为 DDZ。  ① Multiple permutations such as states and continents belong to three columns. Press "111 type" to get the first, second and last from left to right. The continent word is divided into: '),,, I, and its encoding is DDZ.
② 全包型属单列类。按先框外,后框内脱框分型处理。如"回" 字为 "02型" 。 "国" 字为 "03型" 。  ② The all-inclusive type is a single-row type. Press outside the frame and then inside the frame to remove the type. For example, the word "return" is "02 type". The word "country" is "03 type".
③ 上三包、 下三包、左上包、右上包的字属单列类。 如: 风、 凶、 厅、 句等字按 "02型" 处理。 如: 同、 病、 虱等按 "03型" 处理。  ③ The characters of upper three bags, lower three bags, upper left bag and upper right bag belong to a single column. Such as: wind, fierce, hall, sentence and other words are treated as "02". Such as: Same, disease, lice, etc. are treated as "type 03".
④ 左三包、 左下包的字属双列类。 如: 巨、 达等按 "11型" 处理。 如: 医、 过等按 "12型" 处理。 如: 趟、 起等按 "21型" 处理。 上述分类和处理方法。 都是顺其自然, 不用死记。  ④ The characters in the left three bags and the lower left bag belong to the double-rank category. For example: Ju, Da, etc. are treated as "11 type". Such as: medical treatment, treatment, etc. according to "12" treatment. For example: trips, starts, etc. are handled as "21". The above classification and processing methods. Let it be, without rote.
(三) "三形联想" 单字编码原则  (3) "Three-shape association" single-word coding principle
1. 以形取码,按书写顺序,从左到右从上到下,先外后内(有 和: L字旁的字例外) 。  1. Use the shape to take the code, in order of writing, from left to right, from top to bottom, and then inside and outside (with the exception of the words beside and: L).
2. 以基本 "形元" 为单位, 按 "三类九型" 字形结构和取码 点取码。  2. Take the basic "shape element" as the unit, and press the "three types and nine types" glyph structure and code points to get the code.
3. —般按首、 二、 末三个 "形元" 的顺序取码, 最多只取三 码。 3. — generally take the code of the first, second and last three "shape elements" in order, at most only three code.
4. 单体结构拆分, 要拆成码数少, 笔画数多的 "形元 成大优先) 。  4. For the single structure split, it is necessary to disassemble the "elements with the largest number of strokes and the larger number of strokes".
5. 当单字 "形元" 不足三码时, 补上字形识别码。  5. When the single character "shape element" is less than three codes, add the glyph identification code.
6. 有些相交或相连的字, 不成 "形元" , 釆取 "断开 法, 使之成为 "形元" 后来取码。  6. Some intersecting or connected words do not become "shape elements", so they use the "breaking method" to make them "shape elements" and then take the code.
(四) 汉字拆分编码规则及例字 (D) Chinese character split coding rules and examples
1. 相接的 "形元" , 可以分解拆  1. The connected "shape elements" can be disassembled
Figure imgf000008_0001
Figure imgf000008_0001
2. "形元" 相连、 相交可以上下断开公用笔画 (能上下断, 不左右断)  2. "Shape" connected and intersecting can disconnect common strokes up and down (can be broken up and down, not left and right)
例 字 拆 分 编 码 例 字 拆 分 编 码 兼 ;|v BCX 东 七 十 小 CSX 永 、 τ 水 DIW 黑 W 土 '"、 FTD 平 一 小 十 HXS 甲 田 1 ZL 井 NNL 世 NJL 毛 PQJ 韦 * p QUL 车 十 CQL 束 十 中 小 SQX 火 小 人 XAL 中 I 田 I ZKZ 米 小 十 小 XSX 求 十 水 、 SWD 夷 十秀 SQN 吏 十 中 SQC 3. 形元相交, 可以左右断开公用笔画
Figure imgf000009_0001
Example Word Split Coding Example Word Split Coding; | v BCX East Seventy Little CSX Wing, τ Water DIW Black W Soil '", FTD Ping Yi Xiaoshi HXS Koda 1 ZL Well NNL World NJL Mao PQJ Wei * p QUL Car Ten CQL Beam Ten Middle School SQX Fire villain XAL Middle I Tian I ZKZ Mi Xiao ten Xiao XSX Seeking Shishui, SWD Yi Shixiu SQN Li Shizhong SQC 3. Shape elements intersect, you can disconnect common strokes left and right
Figure imgf000009_0001
4. 框外形元足够 (包括角点) , 框内形元不编码
Figure imgf000009_0002
4. The frame shape element is sufficient (including corner points), and the shape element inside the frame is not coded
Figure imgf000009_0002
5. 形元足够角点不编码 5. The shape element is not enough for corners to be encoded
Figure imgf000009_0003
Figure imgf000009_0003
6. 形元不足角编入码
Figure imgf000009_0004
6. Insufficient angle coding
Figure imgf000009_0004
7. 向下脱框编码 (包括向外、 向右) 先框外后框内 例字 拆 分 编 码 例字 拆 分 编 码 周 门 土 口 M T 0 匾 匸 、 门 E D M 恩 □ 大 心 K Y D 为 、 力 、 D U ¾ 尚 小 门 口 X M 0 门 7 人 D I C 7. Down-frame coding (including outward and right) Example word split coding Example word split coding Zhou Mentukou MT 0 plaque 门, door EDM En □ big heart KYD for, force, DU ¾ small doorway XM 0 door 7 people DIC
8. 向上脱框编码, 先框内后框外  8. Up frame decoupling, first inside the frame and then outside the frame
Figure imgf000010_0002
Figure imgf000010_0002
9. 左旁两形元以上的字, 取左旁的首、 末码和末码( "///型"  9. The characters above the two digraphs on the left, take the first, last, and last code on the left ("/// 型"
Figure imgf000010_0003
Figure imgf000010_0003
上述 "///形" 即 "川字形" 的例外如:  The exceptions to the above "/// 形" or "Sichuan-shaped" are:
Figure imgf000010_0001
Figure imgf000010_0001
10. 带 和 旁的字从左起编码 (不按笔顺) 例字一 拆 分 编 码 . 例字 拆 分 编 码  10. The characters with and are coded from the left (not in stroke order). Example one split code. Example word split code
建 争 牛 , Γ E Q 道 _ 目 B F  Jian Zheng Niu, Γ E Q Road _ Head B F
廷 . I P T 糙 α X W 0  Ting. I P T Rough α X W 0
延 Z L I P J 逼 : L一 田 W Η Κ 11. 离散汉字结构的拆分, 要顺其自然 Yan ZLIPJ Force: L Yitian W Η Κ 11. Split the structure of discrete Chinese characters, let it go
Figure imgf000011_0001
Figure imgf000011_0001
12. 特殊规定: 12. Special regulations:
;| 、 儿、 三种形元组合结构, 不属双列类和三列类, 而归 属单列类。  ; |, Er, and three form element combination structures are not classified as double-column or triple-column, but are classified as single-column.
(五) 简码输入  (5) Shortcode input
为了提高输入速度, 本编码将常用汉字 (尽可能选高频使用 汉字) , 只取其前边一个或两个 "形元" , 再加一个数字键或空 格键结束。 作为该字的编码, 以构成筒码。 因本编码是三码的编 码 (三键结束) 。 故只设一、 二级简码。  In order to improve the input speed, this code will use commonly used Chinese characters (select high-frequency Chinese characters as much as possible), only take one or two "shape elements" in front of it, and then add a number key or space key to end. As the code of the word, a barrel code is formed. Because this code is a three-code code (three-button end). Therefore, only one or two short codes are set.
1. 一级简码  Short code
本编码的一级简码分两个层次。 笫一层次是: 每族可安排 1 个高频字的首码加空格键, 共 26个简码。 笫二层次是: 每族 "形 元" 可以安排 10个首码相同的高频使用汉字, 各加一个数字键, 共加 10个数字键, 作为 10个筒码。 本编码共 26族。 故可安排 26 X 10=260个。  The first level short code of this code is divided into two levels. The first level is: Each family can arrange the first code of high frequency words plus space bar, a total of 26 short codes. The second level is: Each family of "form elements" can arrange 10 high-frequency Chinese characters with the same first code, each with a number key, and a total of 10 number keys, as 10 barrel codes. There are 26 families in this code. Therefore, 26 X 10 = 260 can be arranged.
一级简码共 26+260=286个, 参见表 5。  The first-level shortcodes are 26 + 260 = 286, see Table 5.
2. 二级筒码  2. Secondary Barcode
二级简码是指编码时, 只取单字的前两个" 形元" 的代码, 再加一个空格键结束,并挑选使用频率较高的汉字, 为二级简码。 二十六族" 形元" 的代码, 其前两码的组合共计有 26 x 26=676 个二级简码 (其中有 9个空位) , 参见表 6。  The second-level shortcode refers to the code that takes only the first two "shape elements" of a single character, adds a space key to the end, and selects the frequently used Chinese characters as the second-level shortcode. For the codes of the twenty-six families of "shape elements", the combination of the first two codes has a total of 26 x 26 = 676 secondary short codes (including 9 spaces), see Table 6.
3. 一、 二级简码均设计屏幕提示, 无需记忆。 专业人员稍加 练习后, 均可实现盲打。 (六) 词汇编码规则 3. The first and second short codes are designed with on-screen prompts, no need to remember. After a little practice, professionals can achieve blind typing. (6) Vocabulary coding rules
许多编码的方法的实践都证实, 词汇编码输入都可以有效降 低重码率, 且显著缩短码长, 从而大大提高输入速度和效率。 本 编码采用三码的词码输入, 且以三字以上的词为主, 速度和效率 极高。  The practice of many encoding methods has confirmed that the lexical encoding input can effectively reduce the re-coding rate, and significantly reduce the code length, thereby greatly improving the input speed and efficiency. This code uses three-code word code input, and mainly uses words with more than three words, with extremely high speed and efficiency.
1. 双字词的编码规则  Coding rules for double words
本编码的双字词多用在 "独词" 。 第一字按规则取二码, 笫 二字取首码。  The two-word words used in this code are mostly used in "single words". The first character takes two yards according to the rules, and the second character takes the first yard.
例如:  E.g:
Figure imgf000012_0001
Figure imgf000012_0001
3. 多字词的编码规则  3. Multi-word encoding rules
多字词的编码规则是取一、 二、 末字的首码。 例如:  The encoding rules for multi-word words are to take the first code of the first, second and last words. E.g:
例 词 拆 分 编 码 科学技术 禾 ,w 十 P X S 为人民服务 、 人 夂 DAY 科学社会主义 禾 ," 、 PXD 中国人民解放军 中 口 QKG 一个国家两种制度 一 人 广 HAR Example word split coding science and technology Wo, w ten PXS Serving the People, People 夂 DAY Scientific Socialism, ", PXD Chinese People's Liberation Army Zhongkou QKG One Country Two Systems One Person Canton HAR
(七)繁体字和古汉字的编码:  (7) Encoding of traditional and ancient Chinese characters:
为适应更广泛的要求和便于进行海内外交流,本编码除设计简 化汉字系统外, 还设计了繁体汉字和古汉字的编码系统。 本编码 共编制海内外汉字总数为 21003个。 繁体字与古汉字的编码规则 与筒化字相同, 但分别增加一个规定的结束键 (或转换键)  In order to adapt to a wider range of requirements and facilitate communication at home and abroad, in addition to simplifying the Chinese character system, this code also designs traditional Chinese and ancient Chinese character coding systems. This code compiles a total of 21,003 Chinese characters at home and abroad. The coding rules for traditional and ancient Chinese characters are the same as for tubed characters, but a prescribed end key (or conversion key) is added respectively.
1. 繁体字的编码 例 字 拆 分 编 码1. Example of Traditional Chinese Character Encoding
'- 儉 (俭) 人 入. • A A A / 劍 (剑) 人 1 j : A A Z / 輩 (辈) " 十 Z Z S / 齒 (齿) 厂 卜 u . Z Z l) / . '-Thrift (thrift) people enter. • AAA / Sword (sword) person 1 j: AAZ / Generation (generation) "Ten ZZS / Tooth (Tooth) Factory Bu u. ZZ l) /.
2. 古汉字的编码 例 字 拆 分 编 码 .' 佳 1ί A A A // 2. Example of encoding ancient Chinese characters. Character splitting and encoding. '佳 1ί A A A //
-隻. 人人 A A Y // 走 Ί 卜 人 ' I l //  -Only. Renren A A Y // Walk Ί Bu Ren 'I l //
. 4 ^ 1 1 1 // 本编码采用分集设计, "正集" 的编码只设三个字位(正字、 副字、 余字) , 用于打常用汉字和通用汉字。 "副集" 的编码设. 4 ^ 1 1 1 // This encoding adopts diversity design. The encoding of the "positive set" only has three character positions (orthographic, adverb, and remainder), and is used to type common Chinese characters and general Chinese characters. "Subset" encoding settings
5个字位, "余集 " 的编码设 10个字位, 供屏幕上选择。 "副集" 和 "余集" 用于打繁体字、 古汉字和海外汉字。 根据 "国家语委" 提供的汉字使用频率统计资料。 正集汉字, 使用率为 93%、 副字 6.1%、 余字 0.85%。 尚有 0.05%的汉字, 需在副集中查到。 如果 你要的是副字, 继续打入键盘 〈2> 或 〈9〉 即可更改为副字。 如 果你要的是余字, 则需要借助 "提示行" , 出此情况的机率不到 1%。 故操作多次以后, 会很快记住的。 本编码简化汉字体系, 常 用汉字的重码率, 在三码条件下仅 5%左右。 若使用一、 二简码 和以打词码为主, 则接近无重码。 繁体、 古汉字体系则需借助提 示行。 There are 5 characters, and the encoding of the "set" is set to 10 characters for selection on the screen. "Vice set" and "Yu set" are used to type traditional Chinese characters, ancient Chinese characters and overseas Chinese characters. Statistics on the frequency of use of Chinese characters provided by the National Language Commission. The Chinese characters are used in the regular set, with a usage rate of 93%, adverbs 6.1%, and the remaining characters 0.85%. There are still 0.05% Chinese characters, which need to be found in the deputy. If you want an adverb, continue typing <2> or <9> on the keyboard to change it to an adverb. If you want a residual word, you need to use the "prompt line". The probability of this situation is less than 1%. So after many operations, you will quickly remember. This coding simplifies the Chinese character system, and the repetition rate of Chinese characters is often used, which is only about 5% under the three-code condition. If one or two short codes are used and the word code is mainly used, it is close to no heavy code. Traditional and ancient Chinese character systems require the help of hint lines.
本编码的技术特点是:  The technical characteristics of this coding are:
1. 本编码是以拼 "形音" 为特点的纯形码:  1. This code is a pure form code characterized by spelling "phonetic":
本编码是以形取码的, 使用的 码是汉字拼音字母。 即把汉 字的笔画、 偏旁、 部首、 规范部件和创新部件等字形结构要素, 统称为字形元素 (简称 "形元" ) 。 通过音、 形、 意的联想, 把 这些 "形元" 当成可以声读的汉字拼音字母。 每个汉字的字形都 只用三个汉字拼音字母为代码进行编码。 这是一种有声有拼形。 不但与拼音文字 (如英文等) 的键盘输入有异曲同工之效, 而且 更加快速简练。 如: "江" 字拆分成三 "形元" 为: 、;、 丁、 一, 其中 "、; " (点 Di S n )联想为 D, "丁" 为 T, "一" (横 Η 6 ng )联想为 Η, 故其编码为 DTH。  This code is based on the shape, and the code used is Hanyu Pinyin. That is, the glyph structural elements such as strokes, radicals, radicals, standard parts and innovative parts of Chinese characters are collectively referred to as glyph elements (referred to as "shape elements"). Through the association of sound, form and meaning, these "shape elements" are regarded as Chinese phonetic alphabets that can be read aloud. The glyph of each Chinese character is coded using only three Chinese Pinyin letters. This is a kind of sound and spell. Not only does it have the same effect as the keyboard input of Pinyin text (such as English, etc.), but it is also faster and more concise. For example: The word "jiang" is divided into three "shape elements" as:,;, ding, one, where ",;" (point Di S n) is associated with D, "丁" is T, and "一" (Heng Ying 6 ng) association is Η, so its encoding is DTH.
用拼 "形音" 的方法输入汉字, 是因为汉字的字形是表意的。 南腔北调听不懂, 但一写出字来(字形) 就全懂了, 就是这个道 理,汉字不像西方拼音文字那样只是 "记录语言的书写符号" (不 懂字音的意思, 就不认字) 。 汉字还是 "记录人的思维、 意识、 观念的书写符号" 。 对于键盘输入也只有用汉字的字形才能准确 表意。 因为汉字的同音字太多, 所以只有用拼形输入汉字, 才能 实现低重码率。 本编码比一般形码更进一步的是: 本编码不是单 纯的机械拼形, 而是拼 "形音" , 是建立在联想的基础之上, 不 存在死记硬背, 是采用类似汉字拼音码的 "简拼" 的办法进行编 码。 但与拼音码有本质的不同, 拼音码拼的是整个字的音, 而本 编码拼的是 "形元" 的音。 The Chinese characters are entered by spelling "Pinyin" because the glyphs of the Chinese characters are ideographic. I ca n’t understand from the South to the North, but I understand everything as soon as I write the character (glyph). That ’s the reason. Chinese characters are not just “writing symbols of the recorded language” like Western pinyin characters. . Chinese characters are also "writing symbols that record people's thoughts, consciousness, and ideas." For keyboard input, only the glyph of Chinese characters can be accurate Ideographic. Because there are too many homonyms in Chinese characters, only by inputting Chinese characters in a spelling shape, a low repetition rate can be achieved. This code is further than the general shape code: This code is not a simple mechanical spelling, but a "phonetic". It is based on the association, there is no rote memorization, and it is similar to the Chinese phonetic code "Jinpin" method for coding. However, it is fundamentally different from the pinyin code. The pinyin code spells the sound of the entire word, while this code spells the sound of the "shape element".
本编码虽然也具有 "音" 和 "形" 的特点, 但与一般的音形 码 (或形音码)有本质的不同。 一般音形码的音与形是分离的, 即先拼字音, 然后加字的首形或尾形, 以减少重码率。 这样的音 形码由于音与形不同步, 使用上存在思维不连贯, 而且也摆脱不 了与汉字拼音码同样的缺陷, 即不认的字, 或读音不准的字都无 法输入, 实践证明其输入效果不如单纯的形码。 这也是至今常用 形码虽难学但仍占领主要市场的根本原因。 本编码的音与形是同 步的、 一体的。 拼的就是 "形元" 的音, 而不是整个字音。 这种 拼 "形音" 的特点, 从本质上讲仍属于纯形码。 拼 "形音" 虽然 语言无法识别 (听不懂) , 但计算机能够准确无误地识别是什么 字。 因为电子计算机键盘打字, 主要是依靠显示器屏幕上的字形 传输信息和进行操作的。 屏幕上出现的只能是字形, 而不会是语 音和字音。 拼音文字也不例外。 汉字是拼形文字, 汉字的字形最 能准确表意, 这正好与电子计算机的操作特性相吻合。 就是因为 这种客观存在, 决定了方块汉字在电子高科技时代的作用。 键盘 输入只要能定准字形, 就能准确传输信息, 而拼 "形音" 是最有 效的定形手段。 因此用拼 "形音" 定形的方块汉字, 特别能适应 电子高科技时代的需求。 所以, 以形为本的古老汉字的腾飞将在 此一举!  Although this code also has the characteristics of "phonetic" and "shape", it is fundamentally different from the general phonetic code (or phonetic code). Generally, the sound and shape of the sound shape code are separated, that is, the spelling sound is firstly added, and then the first or last shape of the word is added to reduce the repetition rate. Due to the unsynchronized sound and shape of such a phonetic code, there is inconsistent thinking in use, and it can not get rid of the same defects as the Chinese phonetic code, that is, unrecognizable characters or inaccurate pronunciation cannot be input. Practice has proved that The input effect is not as good as a simple shape code. This is the fundamental reason why the commonly used form codes are still difficult to learn but still occupy the main market. The tone and shape of this code are synchronized and integrated. The spelling is the sound of "shape element", not the whole character sound. The characteristics of this spelling of "phonetic" still belong to pure code. Pinyin Although the language cannot be recognized (incomprehensible), the computer can accurately identify what the word is. Because computer keyboard typing, it mainly relies on the glyphs on the display screen to transmit information and perform operations. Only glyphs appear on the screen, not voices and phonetic sounds. Pinyin text is no exception. Chinese characters are spelled characters, and the glyphs of Chinese characters can be represented most accurately, which coincides with the operating characteristics of electronic computers. It is because of this objective existence that determines the role of square Chinese characters in the electronic high-tech era. As long as the keyboard input can accurately align the glyphs, information can be accurately transmitted. Pinyin is the most effective means of setting. Therefore, the square Chinese characters fixed by spelling "phonetic" can meet the needs of the electronic high-tech era. Therefore, the ascension of ancient Chinese characters based on shape will be here!
拼 "形音" 实际上是在字表形态不变的条件下, 使方块汉字 变成 "准拼音" 的文字。 这是多少代人的梦想, 如今将要变成现 实。 这是在计算机条件下实现的梦想。 这种易学的 "准拼音" , 在现代信息传输技术上, 将胜过所有的拼音文字 (如英文等) , 也就是说它比拼音文字传揄信息的效果还要好。 在机械打字机时 代, 无法用拼音字母, 拼成方块汉字。 单个汉字又数量巨大, 结 构复杂。 因此以形为本的汉字, 就无能为力, 一直落后了, 一百 多年。 如今电脑打字, 只要解决了易学性, 输入孤立的 "形元" 就能转换成汉字。 这使古老的汉字活力再现; 使我们有可能用超 越拼音文字的方法, 赶在时代的最前沿! The spelling of "phonetic" actually makes the square Chinese characters "quasi-pinyin" under the condition that the form of the word table is unchanged. This is the dream of many generations, and it will become a reality now. This is a dream realized under computer conditions. This easy-to-learn "quasi-pinyin" In modern information transmission technology, it will outperform all Pinyin text (such as English, etc.), which means that it will perform better than Pinyin text to transmit information. In the era of mechanical typewriters, it was not possible to spell Chinese characters with squares using pinyin letters. The number of individual Chinese characters is huge and the structure is complicated. Therefore, form-based Chinese characters are powerless and have been lagging behind for more than a hundred years. Nowadays, as long as the ease of learning is solved by computer typing, the isolated "shape element" can be converted into Chinese characters. This rejuvenates the ancient Chinese characters; it makes it possible for us to stay ahead of the times by using methods beyond pinyin!
我们认为: 计算机键盘输入汉字, 技术上的出路在于形码。 而形码的出路是解决易学性和适当缩短码长, 且保持较[氐的重码 率。 效仿先人 "六书" 造字的形、 音、 意联想, 是解决易学、 易 记、 难忘的最有效方法。 这是方块汉字固有的特点和深刻内涵所 决定的。  We believe that: The technical way out for entering Chinese characters on a computer keyboard is in shape codes. The way out of the shape code is to solve the problem of learnability and shorten the code length appropriately, and maintain a relatively heavy code rate. Imitation of the shape, sound, and meaning of the ancestors' "six books" is the most effective way to solve easy learning, easy to remember, and memorable. This is determined by the inherent characteristics and profound connotation of square Chinese characters.
本编码是以音、 形、 意联想取码的。 由于联想的作用, 所以 人的感官、 思维与字形、 代码、 键位之间, 可形成映射效应。 因 此, 在键盘输入汉字时, 不但能看打, 还便于想打和听打。 因此, 它能满足多方面人员的需求。  The coding is based on the pronunciation, shape and meaning. Due to the role of association, mapping effects can be formed between human senses, thinking and glyphs, codes, and key positions. Therefore, when typing Chinese characters on the keyboard, not only can you see and type, but it is also convenient to want to type and listen to type. Therefore, it can meet the needs of various personnel.
联想是理解事物的纽带, 它可以由此及彼, 触类旁通。 你见 到 "人" 字, 就会想到两脚站立的人形。 你见到 "曰、 月" 两字, 就会联想到天空的太阳和月亮。 你看到 "火" 字, 就会想到野地 架起燃烧的篝火。 你见到 "雨" 字, 就会感到细雨纷纷。 你见到 "泪" 字就会觉得眼睛(目) 滴下的点点泪珠。 这一切看来似乎 都 4艮原始,然而它又是非常现代与科学。 因为电子计算机特别 "青 睐" 这些神奇的汉字, 能准确而又迅速地识别它。 这说明它能很 好地适应当代科技。 正是高科技使古老的汉字, 煥发了青春。 只 要我们采用的方法得当, 键盘输入汉字的技术, 在普及和提高方 面胜出拼音文字 (如英文等) , 已是指日可待。  Associativity is the bond to understand things. When you see the word "person", you will think of a human figure standing on two feet. When you see the words "say, month", you will think of the sun and moon in the sky. When you see the word "fire", you will think of a wild bonfire set up in the wild. When you see the word "rain", you will feel drizzle. When you see the word "tears", you will feel the tears dripping from your eyes. All this seems primitive, yet it is very modern and scientific. Because the computer is especially "favorite" these magical Chinese characters, it can accurately and quickly identify it. This shows that it can adapt well to contemporary technology. It is high technology that revitalizes ancient Chinese characters. As long as we use the right method, the technology of inputting Chinese characters on the keyboard will be more popular and better than pinyin (such as English). It is just around the corner.
2. 本编码非常易学易记, 其键盘输入技术是先进的: 本编码共有 "形元" 三百个, 数量虽多, 但其中一部分用于 繁体字, 用于常用汉字的只有二百多个。 真正频繁使用的也只有 几十个。 但由于采用形似同族的原则和音、 形、 意联想, 因此都 非常便于记忆。 只要记住一个同族的代表及其代码, 便可记下一 串同族 "形元" 。 一旦记住, 都难以忘掉。 只要稍懂汉字拼音, 许多重点 "形元" 及其代码, 都会永远记住, 能做到过目不忘。 例如使用频率最高的 "形元" "一"横(H 0 ng ) 、 " 1 " 直(Zh ί ) 、 " ) " 撇 (? ) 、 " " 捺(Ν έ ) 、 "、 " 点 (Di δ η )。 上述"形元"都只用它读音的第一个字母为代码。 则 "一" 为 H, " I " 为 Z, " J " 为 P, " " 为 N, "、 " 为 D。 而 且( 、、 "、 ';、 "、、、 心) , 都是 D。 ( J 、;/ 、 、 勺)都是 Po 又如 "十" 为 S、 "土" 为 T, "口" 为 o等等。 这实际上就 是 "筒拼" , 只拼该 "形元" 读音的第一个拼音字母。 正因为这 种拼 "形音" 简单易学, 所以只要稍懂汉字拼音, 或因方言地区 发音不太准者, 都不难学习。 如果已学会打汉字拼音码的, 再学 本编码, 更是轻而易举的事。 因此本编码的普及性是可以预见的 β 至于输入速度, 因码长较短, 重码率低, 输入准确性高, 适合于 盲打。 故其输入速度应该比任何一种键盘文字输入为高 (包括中 外文字) 。 此非戏言! 做这种判断的理由是: 当今常用形码, 其 熟练者的打字速度已超过英文(见于有关报导) , 但本编码的打 字速度, 不是个别人的超出, 而是许多人都会超出。 这是本编码 的技术特点和优势所决定的。 只有这样, 才能说: 汉字的计算机 键盘输入技术, 已屹立于世界民族之林! 2. This code is very easy to learn and remember, and its keyboard input technology is advanced: There are three hundred "shape elements" in this code. Although there are many, some of them are used for Traditional Chinese, there are only more than 200 commonly used Chinese characters. Only a few dozen are used frequently. However, due to the use of similar family principles, sound, form, and meaning association, it is very easy to remember. As long as you remember a representative of the same family and its code, you can remember a series of "shape elements" of the same family. Once remembered, it is hard to forget. As long as you have a little understanding of Chinese pinyin, many important "shape elements" and their codes will always be remembered, and you will never forget them. For example, the most frequently used "shape element""一" horizontal (H 0 ng), "1" Straight (Zh ί), ")" 撇 (?), "" 捺 (Νέ), "," points (Di δ η). The above "shape element" uses only the first letter of its pronunciation as the code. Then "a" is H, "I" is Z, "J" is P, "" is N, ", and" D ". And (,, ",";, ",,, heart) are all D. (J,; /,, spoon) are Po. For example, "ten" is S, "soil" is T, "mouth" is o, and so on. This is actually "tube spelling", only spelling the first pinyin letter of the "shape element" pronunciation. Because this pinyin is easy to learn, as long as you know a little bit about Chinese pinyin or the pronunciation of dialects is not accurate, it is not difficult to learn. If you have learned to type the Chinese phonetic alphabet, it is even easier to learn this coding. Thus the popularity of this coding is predictable β As the input speed, because the shorter the code length, weight code rate, high input accuracy suitable for comfortably. Therefore, its input speed should be higher than any keyboard text input (including Chinese and foreign text). This is not a joke! The reason for making this judgment is: Today's commonly used form codes, the skilled person's typing speed has exceeded English (see related reports), but the typing speed of this code is not exceeded by individual, but many people will exceed. This is determined by the technical characteristics and advantages of this encoding. Only in this way can we say: The computer keyboard input technology of Chinese characters has stood in the forest of the nations of the world!
3. 本编码采用 "断开" 拆分法  3. This code uses the "disconnect" split method
本编码对一部分字形相连、 相交的汉字, 采用断开拆分的方 法, 而不用 "形元" "抽出" 的拆分方法。 这样就能在保持字形 形态完整的条件下, 达到直观, 快易取码的目的。 所谓 "断开" , 只是在汉字中, 两 "形元" 交接处, 设想出断开点, 而字形的形 态并没有改变。 这对以形取码的直观性十分重要。 由于本编码不 P T/CN2003/000858 This code adopts the method of disconnecting and dividing a part of Chinese characters with connected and intersecting glyphs, instead of using the method of "shape element" and "extraction". In this way, under the condition that the shape of the glyph is kept intact, the purpose of intuitive and quick code fetching can be achieved. The so-called "break" is only the point at which the two "shape elements" meet at the intersection of Chinese characters, and the shape of the glyph has not changed. This is very important for the intuitiveness of code in shape. Since this encoding does not PT / CN2003 / 000858
采用 "抽出" 的拆分取码方法, 因此避免了, 被公认为形码的学 习和使用难点, 从而大大的提高了编码效率。 The "extract" split code is adopted, so it is avoided, which is recognized as the difficulty of learning and using the shape code, thus greatly improving the coding efficiency.
现将本编码的断开拆分(断) , 与当今常用形码的抽出拆分  Now the disconnection (break) of this code is separated from the extraction of commonly used shape codes.
. 一 mi
Figure imgf000018_0001
( ) - 4 王 s (柚)
One mi
Figure imgf000018_0001
()-4 Kings (Pomelo)
(摘) 十大 火 ZJ ( 一 人 (Excerpt) Top Ten Fires ZJ (one person
Figure imgf000018_0002
PL 才 日 Z
Figure imgf000018_0003
Figure imgf000018_0002
PL Cairi Z
Figure imgf000018_0003
速度的先决条件: 只有看的准, 定码快, 才能迅速击键 β "断开" 取码, 由于字形形态不变, 不必抽象思维, 为快速击键创造了有 利的条件。 须知, 打字的快慢, 不单取决于击键次数的多少, 更 重要的是, 是否能够直观, 快速地确定打什么码。 Prerequisite speed: only look accurate, fast code set, in order to quickly keystrokes β "disconnect" take the code, change shape due to the shape, do not have to abstract thinking, and create favorable conditions for quick keystrokes. It should be noted that the speed of typing depends not only on the number of keystrokes, but more importantly, whether the code can be determined intuitively and quickly.
4. 本编码采用三类九型的汉字分类和三点分散取码法 由于本编码釆用三类九型的汉字分类, 对较复杂的字形, 釆 用的取码点比较分散, 取码点不深入到字型内部, 因此比较简单 直观, 易于判断取码点。 同时取码点分散, 使汉字各独立结构部 分都能取到码, 这必然会减少重码率。现将本编码的品字型(012 型) 、 倒品型 ( 021型)和左右型 ( 21型) 的汉字取码点与当今 常用形码的例字比较如下: 4. This code uses three types of nine-character classification of Chinese characters and three-point scattered coding method. As this code uses three types of nine-character classification of Chinese characters, for more complex glyphs, the used code points are scattered and code points are used. It does not go deep into the font, so it is relatively simple and intuitive, and it is easy to determine the code point. At the same time, the code points are scattered, so that each independent structural part of the Chinese character can obtain codes, which will inevitably reduce the re-coding rate. The character font of this code (012 The code points of Chinese characters of type), inverted type (type 021) and left and right type (type 21) are compared with the examples of commonly used shape codes today:
( 制品^ . ) 1 i ^ (Product ^.) 1 i ^
Figure imgf000019_0001
± 口 三
Figure imgf000019_0001
± mouth three
Q一 Q one
l 品 ) / Λ (:取 、〕 土 口 口 f 取- / . ± >L  l) / Λ (: take,) soil mouth f take-/. ±> L
十 — 、 <n - ¾  Ten —, <n-¾
一 A (取 έ右 点)
Figure imgf000019_0002
一 口 /j、 止 ) 、 卞 卞 r取 ^三 厂 ^
A (take the right point)
Figure imgf000019_0002
Yikou / j, only), 卞 卞 r take ^ three plants ^
感'  sense'
'本 X X手 ( ¾ ^ ) τ - P l i  '本 X X 手 (¾ ^) τ-P l i
( f、 -角  (f, -angle
( y、(y,
Figure imgf000019_0003
*取 - 从上例可见, 从字型外部三点分散取码与深入字形内部的两 点取码的直观性, 有明显差异。 三点分散取码, 取码点是固定的, 只要考虑取码点上是什么码就行。 不要到字形内部去找。 字形内 部, 由于 "形元" 的交叉连接, 往往难以识别和确定。 因此对较 为复杂的字形, 从字形外部三点分散定点取码, 能达到直观、 快 速、 准确的目的。 写字和画画一样, 必须把外围轮廓勾勒好, 才 能画得像。 因此分散在外围定点取码是定准字型的重要方法。 同 时取码范围分散到各个角落, 必然所取不同的 "形元" 的数量和 差异也扩大了, 当然重码率也就降低了。本编码能在三码条件下, 实现低重码率即得益于此。 由于采用 "三类九型" 分类和三点分 散定点取码法, 使越复杂的汉字, 取码越加简单。 这种化繁为简 的方法, 极大地提高了本编码的键盘输入效率。
Figure imgf000019_0003
* Fetching-As can be seen from the above example, there are obvious differences in the intuitiveness of three-point code scatter from the outside of the font and two-point code scatter from the inside of the font. Three-point scatter code, the code point is fixed, just consider what code is on the code point. Don't look inside the glyph. Inside the glyphs, it is often difficult to identify and determine due to the cross-connection of "shape elements". Therefore, for more complex glyphs, three-point scattered fixed-point fetching from the outside of the glyph can achieve the purpose of being intuitive, fast, and accurate. Writing and drawing are just like drawing. You must outline the outline to draw it. Therefore, scattered fixed-point code fetching at the periphery is an important method for aligning fonts. At the same time, the range of code fetching is spread to every corner, and the number and difference of different "shape elements" that must be taken are also enlarged, and the re-coding rate is also reduced. This coding can benefit from the realization of low repetition rate under the condition of three codes. Because the "three types and nine types" classification and three-point scattered fixed-point code-taking method are used, the more complicated the Chinese character, the easier the code-taking. Simplification This method greatly improves the keyboard input efficiency of this encoding.
5. 本编码的字型识别非常筒单, 不用交叉识别  5. The font recognition of this code is very simple, without cross recognition
当今常用形码, 末笔字型识别是学习难点之一。 当单字不足 四码时, 需要用末笔的横、 竖、 撇、 捺、 折五种笔形和上下型、 左右型、 杂合型三种字型, 进行交叉识别, 共有十五种识别码。 初学者要背复杂的识别码表。 特别是三种字型分类; 有的又很不 明确; 尤其杂合型的分类, 给初学者的学习和判断造成困难。  Today's commonly used shape codes, the last character recognition is one of the learning difficulties. When the single character is less than four yards, cross-recognition is required using the five strokes of the horizontal stroke, vertical stroke, skimming, 捺, and folding of the last stroke, and the upper and lower strokes, left and right strokes, and heterozygous strokes. There are fifteen types of identification codes. Beginners have to memorize complex identification code tables. In particular, the classification of three fonts; some are not clear; especially the classification of heterozygous types makes it difficult for beginners to learn and judge.
本编码的单字, 最多只用三码。 当单字不足三码时, 成字的 This coded word only uses a maximum of three codes. When the word is less than three yards,
"形元" 加 "WW" 补足三码。 如成字 "形元" (01型) "人" 字的代码为 A, 则 "人" 字的编码为 AWW。 上下型 (02 ) 型的 两码字, 不足三码则加 "L" 补足三码。 如 "古" 字的前两码是 SO, 则 "古" 字的编码为 SOL。 左右型 (11型) 的单字不足三 码时加 "R" 补足三码。 如 "汀" 字的前两码为 DT, 则 "汀" 字 的编码为 DTR。只存在这三种情况: 即成字型、上下型和左右型, 不足三码的字。 只用 W、 L、 R三个字母(代码) 为识别码。 不 存在交叉识别等复杂情况。 单字的其他各型, 均够三码, 故不必 再加识别码。 因此, 本编码的字型识别, 非常好学、 好记。 用 W、 L、 R三个字母的代码为字型识别码, 是本编码的重要技术特征 之一 β Add "WW" and "WW" to make up three yards. If the code of the character "shape element" (type 01) "person" is A, then the code of the character "people" is AWW. Up and down type (02) type two code words. If it is less than three codes, add "L" to make up three codes. If the first two codes of the word "gu" are SO, then the code of the word "gu" is SOL. If the left and right characters (type 11) are less than three yards, add "R" to make up the three yards. If the first two codes of the word "ting" are DT, then the code of the word "ting" is DTR. There are only three cases: fonts, up and down, left and right, and characters with less than three codes. Only W, L, R three letters (codes) are used as identification codes. There are no complicated situations such as cross recognition. The other types of single words are all three yards, so there is no need to add an identification code. Therefore, the font recognition of this code is very easy to learn and remember. W, L, R three-letter codes are font identification codes, which is one of the important technical features of this code. Β
6. 本编码的一、 二级简码数量多, 质量高, 不需要记忆 本编码设一级简码 286个 (一码加空格键或数键) 。 二级简 码 660多个(两码加空格键) 。 一级简码都是高频字; 而且几乎 嚢括当今形码和音码的一级筒码。  6. The number of primary and secondary short codes of this code is large, high quality, and no memory is required. This code has 286 primary short codes (one code plus space bar or number key). More than 660 secondary short codes (two codes plus space bar). The first-level shortcodes are all high-frequency words; and almost include the first-level barrel codes of today's shape codes and phonetic codes.
本编码的一、 二级简码在设计上, 采用屏幕提示, 不要死记 硬背。 专业人员稍加练习就能盲打。  The first and second level short codes of this code are designed with screen prompts, so do n’t memorize them. Professionals can touch blindly with a little practice.
7. 本编码非常便于外国人学习和输入汉字  7. This code is very convenient for foreigners to learn and input Chinese characters
本编码也为外国人(特别是学英文的) 学习和输入汉字打开 方便之门。 如: 人、 3、 7、 Π、 口、 厂、 丁、 U等形似的 "形 元",外国人会同样加以联想。 "水"的编码为 W,与英文 WATER 更是不谋而合。 其他 "形元" 只要稍懂汉字, 学习也不难。 外国 人和南方人说普通话, 发音不准, 往往都是在尾音上掌握不好。 第一音一般是没有问题的。 本编码拼 "形音" , 实际上只是 "简 拼" , 只用第一字母, 而且和拼音码一样, 可直接利用计算机英 文的键盘排布, 做到洋为中用 (不是一般形码机械地利用) 。 故 学、 用都不是很难的。 因此本编码将有助于中外文化的交流。 This code also opens the door for foreigners (especially English learners) to learn and input Chinese characters. Such as: People, 3, 7, Π, Mouth, Factory, Ding, U, etc. Yuan ", foreigners will also associate it. The code of" water "is W, which coincides with the English WATER. Other" Xing Yuan "is not difficult to learn as long as you understand Chinese characters. Foreigners and Southerners speak Mandarin The pronunciation is inaccurate, and it is often difficult to master the ending sound. The first sound is generally no problem. This code spells "phonetic", which is actually only "Jin Pin". It uses only the first letter, and it is compatible with the pinyin code. In the same way, you can directly use the English keyboard layout of the computer to make it foreign (not mechanically used by general shape codes). Therefore, it is not difficult to learn and use. Therefore, this code will help Chinese and foreign cultural exchanges.
综上所述, 本编码因具有优越的技术性能, 所以它既便于普 及使用, 又能满足专业人员录排的需要。 因此, 学生、 教师、 作 家、 记者、 文秘人员等, 都能^ ί艮快学习和掌握。 对方言地区也能 普及推广。 只要稍懂汉语拼音, 老少皆宜, 学习使用。  To sum up, this code has superior technical performance, so it is not only easy to use, but also can meet the needs of professionals. Therefore, students, teachers, writers, journalists, secretaries, etc. can learn and master quickly. Dialect areas can also be promoted. As long as you know a little bit of Hanyu Pinyin, it is suitable for all ages and learn to use it.
由于本编码用的是拼 "形音" 的取码法, 与中小学语文的汉 字拼音教育是相吻合的, 所以它适合于中小学教育中普遍推广, 这样就能做到: 汶字计算机编码从娃娃抓起, 能为计算机键盘输 入汉字的普及和提高, 为中文信息处理现代化,打下坚实的基础。  Because this code uses the "phonetic" method, which is consistent with the Chinese phonetic alphabet education in elementary and middle schools, it is suitable for general promotion in primary and secondary education. In this way, we can achieve: Wen computer coding Starting from the baby, it can popularize and improve the input of Chinese characters on computer keyboards, and lay a solid foundation for the modernization of Chinese information processing.
本编码的实用性还可以扩展到编制通用的字、 词典和电子词 典领域。 用本编码查字典、 词典, 可以与查英文词典一样, 而且 更加方便快捷。 因为码长较短 (单字只用三个字母) , 基本可以 做到一步到位 (即 "对号入座" ) 。 不像查汉语拼音字典时, 经 常会带出一连串同音字, 有的甚至还要翻页, 本编码的字典, 基 本是按偏旁、 部首的方式编排的, 形同和形近的字, 编排在一块。 这样就便于对比、 学习、 记忆和加深理解。 它与传统的康熙字典 的编排方式比较接近。 但它又能以 "拼形音" 的方式查找, 因此, 用起来非常方便。 这样, 既保持了传统, 又增加了科技含量。 这 与当今汉语拼音字典, 为了拼音查找方便, 而杂乱无章的编排, 是完全不同的。 这种模仿拼音文字, 字典的编排, 不是很科学的。 是与汉字以形为本的特点相背离的 β 例如: 行(h 4 ng )银行、 行 ( χ ί ng )行走、 行 ( heng )道行、 行(h "g )树行子, 虽然是 同形的字, 但由于发音不同, 就不能编排在一块(分在四处) 。 这就不便于对比、 学习和记忆。 这样的字还很多。 用本編码编制 字典, 就能从根本上克服这个缺陷。 The practicality of this code can also be extended to the field of compiling common words, dictionaries and electronic dictionaries. Using this code to search dictionaries and dictionaries can be the same as searching English dictionaries, and it is more convenient and quick. Because the code length is short (a single word uses only three letters), it can basically be done in one step (that is, "seat with a check mark"). Unlike when looking up a Chinese pinyin dictionary, it often brings out a series of homophones, and some even turn the pages. The dictionary of this code is basically arranged in the way of radicals and radicals. The same and near words are arranged. In one piece. This makes it easy to compare, learn, remember, and deepen understanding. It is close to the traditional arrangement of the Kangxi dictionary. However, it can also be searched in the form of "pinyin", so it is very convenient to use. In this way, both the tradition and the technological content are increased. This is completely different from today's Chinese Pinyin dictionary, which is convenient for the Pinyin search and has a disorderly arrangement. This imitation of pinyin text and dictionary layout is not very scientific. It is β that deviates from the characteristic of Chinese characters. For example: bank (h 4 n g ) bank, bank (χ ί ng) walk, bank (heng) bank, bank (h "g) tree bank, although Characters of the same shape, but due to different pronunciation, they cannot be arranged together (split around). This makes it difficult to compare, learn and remember. There are still many such words. Using this code to compile a dictionary can fundamentally overcome this shortcoming.
本编码包括简繁汉字、 古今汉字和海内外汉字。 这不但可以 研究、 挖掘我国古代浩瀚的文化宝库和进行古文字研究, 还可以 适应港澳台及华语地区和日益增长的国际信息化交流的需要, 为 世界各地使用汉字的国际友人和华侨、 华人服务!  This code includes simplified and traditional Chinese characters, ancient and modern Chinese characters, and Chinese characters at home and abroad. This can not only study and excavate the vast cultural treasure trove of ancient China and conduct ancient text research, but also adapt to the needs of Hong Kong, Macao, Taiwan and Chinese-speaking regions and the growing international information exchange, and serve international friends and overseas Chinese who use Chinese characters in the world!
本编码的重要意义还在于, 它是 "全形码" , 能在保存方块 汉字的前提下, 促进信息处理的现代化。 既然方块汉字的内在规 律已被充分揭示, 已经释放出巨大的潜能, 使古老的汉字, 能够 适应科技飞速发展新时代的需求。 我们为什么不能把方块汉字千 秋万代保存下来, 并加以发扬光大呢? 汉字有其自身发展规律的 特殊性。 汉字不应该全盘西化, 也应该走中国式的发展道路。 这 就是说: 汉字要适当简化, 汉语、 汉字要学用拼音, 但汉字不应 该, 也没必要改成拉丁化的拼音文字。  The significance of this encoding is also that it is a "full-size code", which can promote the modernization of information processing on the premise of saving square Chinese characters. Now that the internal rules of square Chinese characters have been fully revealed, huge potential has been released so that ancient Chinese characters can adapt to the needs of the rapid development of science and technology in the new era. Why can't we save the Chinese characters of the squares, and carry them forward? Chinese characters are peculiar to their own laws of development. Chinese characters should not be completely westernized, and they should also follow the Chinese-style development path. That is to say: Chinese characters should be appropriately simplified, and Chinese and Chinese characters should learn to use Pinyin, but Chinese characters should not be changed to Latinized Pinyin characters.
尽管参照实施例对所公开的涉及一种汉字 "三形联想" 形码 输入法进行了特别描述, 本领域技术人员将能理解, 在不偏离本 发明的范围和精神的情况下, 可以对它进行形式和细节的种种显 而易见的修改。 因此, 以上描述的实施例是说明性的而不是限制 性的, 在不脱离本发明的精神和范围的情况下, 所有的变化和修 改都在本发明的范围之内。 Although the disclosed method involving a Chinese character "three-associative" shape code input method is specifically described with reference to the embodiments, those skilled in the art will understand that without departing from the scope and spirit of the invention, Make obvious changes in form and detail. Therefore, the embodiments described above are illustrative and not restrictive, and all variations and modifications are within the scope of the present invention without departing from the spirit and scope of the present invention.
表 1 Table 1
Figure imgf000023_0001
表 2
Figure imgf000023_0001
Table 2
Figure imgf000024_0001
衮 3
Figure imgf000024_0001
衮 3
Figure imgf000025_0001
表 4 例 字 纯列单列单列倒单品品
Figure imgf000025_0001
Table 4 Example
编码补 WW  Coding Supplement WW
人, 八, 门, 廿, 山, 水, 虫, 七
Figure imgf000026_0001
Man, eight, gate, cricket, mountain, water, bug, seven
Figure imgf000026_0001
02型  02 type
Θ 编码补 L 仑, 并, 文, 火, 木, 句, 凤, 厅 单  Θ code complement L lun, union, text, fire, wood, sentence, phoenix, hall single
列 03型 Ξ Column 03 type Ξ
类 伞, 罩, 米, 荒, 襄, 同, 病, 虱Umbrella, hood, rice, barren, Xiang, with, disease, lice
Figure imgf000026_0002
Figure imgf000026_0002
012型 众, 罪, 冠, 嚣, 纛, 嘉, 籍, 麓 繁,'慰, 资, 鳖, 膂, 攀, 弊, 感Type 012 public, crime, crown, arrogance, dysfunction, Jia, Ji, Lu Fan, 'comfort, capital, dysfunction, dysfunction, panic, disadvantage, feeling
021型Type 021
Figure imgf000026_0003
眺, 北, 相, 认, 仲, 仅, 巨, 达 双列类 1 12型 曰. 1 3 江, 何, 耘, 禧, 驳, 睹, 柏, 医
Figure imgf000026_0003
Looking down, north, phase, recognition, middle, only, giant, up to double-column type 1 12 type. 1 3 Jiang, He, Yun, Jubilee, Refutation, Seeing, Bai, Medical
21型 □ □ 规, 刹, 敢, 鹑. 缺, 散, 趟, 献 三列类 型 川, .撖, 铤, 淋, ¾ίΗ, 邀, 鹕, 孵Type 21 □ □ Regulation, brake, daring, quail. Missing, scattered, wading, offering, three types of Sichuan,. 撖, 铤, 淋, Η, Η, invitation, pelican, incubation
Figure imgf000026_0004
Figure imgf000026_0004
~9Z- 菊 T z ~ 9Z- chrysanthemum T z
X A  X A
案 • 璨 暴 X  Case • Can storm X
m m 班  m m class
m Λ m Λ
m n  m n
& Έ 1¾ 穉 不 ¾ 难 1 m 末 s  & Έ 1¾ 穉 No ¾ difficult 1 m to s
班 镩 ^ . Ή  镩 镩. Ή
# if • 0  # if • 0
, ¾ ¥ 紫 d in 浙 . 剁 o  , ¾ ¥ Purple d in Zhejiang. Chop o
· 眯  · 眯
m w m 庫 . ¾ 颦 ¾ 1 豳 畨 昝 雷 瞢 s m . 田 If  m w m library. ¾ 颦 ¾ 1 豳 昝 昝 lei s m.
m 禺 薪 f 杳 SI a: %. I a m 罄 M . 2 H 袅 萆 . £ O m . 音 s ¾ ' ' Λ 会 m 鲔 m 禺 Salary f 杳 SI a:%. I a m is over M. 2 H 袅 萆. £ O m. Sound s ¾ '' Λ will m 鲔
鼷 ' 翳 猓 a 鼷 '翳 猓 a
车 萆 . D  Car. D
V 擊 每 a  V strike every a
令 Ψ V  Let Ψ V
 group
0 6 8 L 9 ς Ϋ ε l I
Figure imgf000027_0002
Figure imgf000027_0003
Figure imgf000027_0004
Figure imgf000027_0001
S8000/C00ZN3/X3d 1^9£0請 Z OAV -LZ-
0 6 8 L 9 ς ε ε l I
Figure imgf000027_0002
Figure imgf000027_0003
Figure imgf000027_0004
Figure imgf000027_0001
S8000 / C00ZN3 / X3d 1 ^ 9 £ 0 Please Z OAV -LZ-
Figure imgf000028_0001
Figure imgf000028_0001
8S8000/C001M3/X3d 8S8000 / C001M3 / X3d

Claims

L 一种汉字 "三形联想" 形码输入法, 其特征是, 包括下述 步骤: L A Chinese character "three-shaped association" shape code input method, which is characterized by including the following steps:
( 1 )选择字形元素作为编码的基本部件, 按形以同族原则分成 26族, 并与键盘上的 26个字母对应, 其对应关系为:  (1) The glyph element is selected as the basic part of the coding, and is divided into 26 families according to the same family principle according to the shape, and corresponds to the 26 letters on the keyboard. The corresponding relationship is:
A 人、/ (、入、 佳、 A person, / (, into, good,
β A、夕 、、/ w、 、 β A, evening,, / w,,
C 义 、3、七、 1(、 ΐ、弋、 、屮、  C meaning, 3, seven, 1 (, ΐ, 弋,, 屮,
D 、 、 n、 、'"、、 心 、 ί£ ,i、 门  D,, n,, "" ,, heart, i, door
E 3 、丁、 、,、尹 匸、 、尸、区 ...  E 3, Ding, ,,,, Yin,, corpse, district ...
F 目 、旦、且、ππ、四、 tttr、  F head, once, and, ππ, four, tttr,
Q r^> 、 、 rA、 ' 、 > 、骨 '.  Q r ^>,, rA, ',>, bone'.
H ― 、王.、军、 、 '5、 ^ w 、 i¾.  H ―, Wang., Jun, ′ 5, ^ w, i¾.
X 、 7、 "、 、H 、 Ί、 " ,、 V .、  X, 7, ",, H, Ί,", V.,
J L 、 、纟、 、 i、 、 ί 、 i:、 l_、 L/、 匕、 厂  J L,, 纟,, i,, ί, i :, l_, L /, dagger, factory
□、曰、 ca、田、 、 ί、 、尸、曲 丰□, Yue, ca, Tian,, ί,, corpse, Qu Feng
L ^、 、 L ^,,
门、 >Αι rL、 、巾、 :  Door,> Αι rL,, towel,:
N 廿、f 、廿、 、 、·¾·、 、,  N 廿, f, 廿,,,, ¾, ... ,,
D o 、 B 、 P 、 ::£ D o, B, P, :: £
P 、 、夕、 、々、 、 、身、 # 、 ^、台  P,, evening,, 々,,,, body, #, ^, Taiwan
Q 、; ί=、φ、 、丰、声、 . ¾、 、本、  Q ,; ί =, φ,, Feng, sound,. ¾,, this,
R r、 、 /、: f 广、户、 f、 e. R r,, / ,: f Guang, household, f, e.
S +、十、 、氷  S +, ten,, ice
T : t 、士、士、丁、了  T: t, Shi, Shi, Ding, Le
V 山、 、 .LJ、 6、力、†l、卡、  V mountain,, .LJ, 6, force, † l, card,
V 女、 、υ 、夕、.夕 夕、 w ;^、 、; 、氽、 、》、ί^ V Female,, υ, Xi,. Xi Xi, w; ^,,;,,,,》, ί ^
^ -、小、、| 、 、 Ψ 、 、 、.来  ^-, Small,, |,, Ψ,,,.
γ 、 、 、夂、戈 、大、丈.  γ,,, 夂, Ge, Da, Zhang.
. I 、丄、 'J、 、†、 ^ 、  . I, 丄, 'J,, †, ^,
( 2 ) 输入汉字; (2) input Chinese characters;
所述汉字输入规则为: 以基本字形元素为单位, 按三类九型 的取码点选取三个字形元素, 依次键入即可完成一个字的输入; 同为字形元素时以大优先, 字形元素相连或相交时采用上下或左 右断开公用笔画再取码的方法。  The Chinese character input rules are as follows: taking basic glyph elements as the unit, selecting three glyph elements according to the three types of code points, and typing in sequence to complete the input of a character; when the same glyph element is the first, the glyph element takes precedence When connecting or intersecting, use the method of disconnecting the common strokes up and down or left and right to get the code.
2. 根据权利要求 1所迷的汉字 "三形联想"形码输入法, 其 特征是, 在输入汉字时, 将汉字分为三类九型, 根据取码点选取 字形元素; 所迷三类为指单列类、 双列类和三列类; 所述九型及 取码方法为:  2. The Chinese character "three-associative" shape code input method as claimed in claim 1, characterized in that, when inputting Chinese characters, the Chinese characters are divided into three types and nine types, and the glyph elements are selected according to the code point; Refers to single-row type, double-row type, and three-row type; the nine types and code obtaining methods are:
"01型" 为成字的字形元素, 输入时在该成字字形元素后边 补两个 识别码;  "01 型" is a glyph element, and two identification codes are added after the glyph element when input;
"02型"为上下可拆分成二个字形元素的字,其后补一个 "L" 或 "R" 为识别码, 完成三码的输入;  "02 type" is a character that can be divided into two glyph elements up and down, followed by an "L" or "R" as the identification code, and the three-code input is completed;
"03型" 为上下可拆分成三个及三个以上字形元素的字, 按 首、 二、 末三个字形元素依次键入;  "03 type" is a character that can be divided into three or more glyph elements up and down. Type in the first, second, and last three glyph elements in sequence;
"012型"为品字型, 可拆分成上部和下部,取上部第一个字 形元素、 取下部笫一个字形元素和最末一个字形元素;  "012 type" is a character font, which can be split into upper and lower parts, taking the first glyph element in the upper part, taking one glyph element in the lower part and the last glyph element
"021型" 为倒品型, 可拆分成上部和下部, 分别取上部两部 分的第一个字形元素和下部最末一个字形元素;  "021 type" is an inverted type, which can be divided into upper and lower parts, taking the first glyph element and the last glyph element in the upper and lower parts respectively;
"11型" 为左右型, 左右各取一个字形元素, 在其后补一个 "R" 或 "L" 识别码完成输入;  "11 type" is a left and right type. Take one glyph element from left and right, and then add an "R" or "L" identification code to complete the input.
"12型" 为左右型, 取左边一个字形元素, 右边取首、 末二 个字形元素的字; υ τ . "12 type" is left and right, take the left glyph element, the right side take the first and last two glyph elements; υ τ.
"21型" 为左右型, 取左边首、 末二个字形元素, 取右边一 α  "21" is left and right. Take the first and last two glyph elements on the left, and take the right one.
个字形元素的字; Characters of glyphs;
"111型" 为川字型, 从左到右依次取首、 二、 末三个字形元 素。  "111 type" is a Sichuan font, and the first, second, and last three glyph elements are taken from left to right.
3. 根据权利要求 2所述的汉字 "三形联想"形码输入法, 其 特征是,在输入常用汉字时,键盘上 26个字母与字形元素的对应 关系为: The character "three-shaped Legend" shape code input method according to claim 2, wherein, when the input Chinese characters, the keyboard 26 corresponding relationship between the letters and graphic elements are:
A 人 、 (、―入、 佳、.  A person, (, ---, Jia ,.
B A、夕 、 \/、 JL-、 4、  B A, Xi, \ /, JL-, 4,
C 、 、 » " 、弋、六、 、 车、  C,, »», 弋, six,, car,
D 、 、 、 必 门,、  D,,, Beam ,,
E 3 、 n、 ^、 ^ 、尹 、匸、 、尸' .、区、 、 、 、  E 3, n, ^, ^, Yin, 匸,, corpse '., District,,,,
F ,.·目 、且、且、 ΠΠ、、2ξ) 、 d 、 c¾、 、JCE2、  F, ..., head, and, and, ΠΠ ,, 2ξ), d, c¾,, JCE2,
Q 、 、 ^、 Ί b 、'ψ  Q,, ^, Ί b, 'ψ
H ― 、王、? 、 ^ .、~ί 、 ί 、 、 1 、 ί¾、  H ―, Wang ,? , ^., ~ Ί, ί,, 1, ί¾,
I 7·· ,、 "7、 5* Α、 、"L、 J、 ¾、 ¾、  I 7 ·· ,, "7, 5 * Α," "L, J, ¾, ¾,
r L 、 幺、 纟、 ^、 ( ,Χ、 Γ、 匕 、 L、 L, b Γ、《<、 、 J 、 、 .、 i. X 、 r L, 幺, 纟, ^, (, χ, Γ, dagger, L, L, b Γ, "<,, J,,., i. X,
κ 、8、α=ιϋ V-,尸 、曲 Ξ.、 κ, 8, α = ιϋ V-, corpse, Qu Ξ.,
「 、  ",
、— ,  , —,
Μ π .、 r)、^i、 n s ^ 、 、 rm、 、  Μ π., R), ^ i, n s ^,, rm,,,
廿 、廿、卄 、" ^、廿 、 、讲 、母、母、 \-、 、·ί、  廿, 廿, 卄, "^, 廿, 讲, mother, mother, \-,, · ί,
Figure imgf000031_0001
Figure imgf000031_0001
土 、士 :k、丁、丁  Soil, Shi: k, Ding, Ding
山 、 、 u、九、力、 h、卡、 fji、 i¾  Mountain,, u, nine, force, h, card, fji, i¾
0 V 女、 、 、力、夕、少、 、鱼、 0 V Female,,,, force, evening, little,, fish,
W . ; 、 、氷、氽、 、 A、 t、  W.;,,, Ice, 氽,, A, t,
x 4、 、 小、 、i 、 U 、 /i)、、 y 、 ^  x 4,, small,, i, U, / i), y, ^
Y 申 、 、乂、夂、戈、大、丈、 、  Y Shen,, 乂, 夂, Ge, Da, Zhang,,
I 、 J 、 'j、 小卜 H .虫、  I, J, 'j, Xiao Bu H. Worm,
4. 根据权利要求 1至 3中的任一项所述的汉字 "三形联想" 形码输入法, 其特征是, 汉字拆分编码规则如下: 相接的字形元 素, 分解拆分; 字形元素相连或相交采用上下或左右断开公用笔 画再取码的方法, 能上下断开的不左右断开; 框外字形元素足够 时, 框内字形元素不编码; 字形元素足够时, 角点不编码; 字形 元素不足时角点编入码; 有边框时, 向下脱框编码, 先框外后框 内; 向上脱框编码, 先框内后框外; 左旁两字形元素以上的字, 取左旁的首、 末码和字的末码; 带 和 旁的字从左起编码, 不 按笔顺; 离散汉字结构的字, 顺其自然拆分。  4. The Chinese character "three-shaped association" shape code input method according to any one of claims 1 to 3, wherein the Chinese character split coding rules are as follows: connected glyph elements, decomposition and split; glyph elements Connect or intersect using the method of disconnecting the common strokes up and down or left and right to get the code. It can be disconnected up and down and not left and right. When the glyph elements outside the frame are sufficient, the glyph elements in the frame are not encoded. ; When the glyph element is insufficient, the corners are coded; when there is a frame, the code is deframed downwards, first outside the frame and then inside the frame; the code is deframed upwards, first inside the frame and then outside the frame; The first and last code on the left and the last code of the word; the words with and beside are coded from the left without stroke order; the words with discrete Chinese character structure are split naturally.
5. 根据权利要求 4所述的汉字 "三形联想"形码输入法, 其 特征是, 在既输入简体字, 又输入繁体字或古汉字时的字形元素 与键盘上 26个字母的对应关系为:  5. The Chinese character "three-shaped association" shape code input method according to claim 4, characterized in that the correspondence relationship between the glyph elements and the 26 letters on the keyboard when inputting both simplified characters and traditional characters or ancient Chinese characters is: :
A 人 、 、 入、 佳、 食、 金 A person,, into, good, food, gold
B 八 、、/、 、 ~、 、/^、  B eight,, /,, ~,, / ^,
C XL 、 、 . 七、 、t、^·、古.、 ^、车、 Ά  C XL 、 、 七 、 、 t 、 ^ · 、 古. 、 ^ 、 车 、 Ά
D 、 心 、 必 、 门  D, heart, must, door
£ 3 、 ΐ、 匸、〕、戶 、 、 Ε、 、尹、 、 、 ψ  £ 3, ΐ, 匸,], hut,, Ε,, Yin,,, ψ
、^、 、¾、,感.  , ^,, ¾ ,, sense.
P b . 01?、·且、且、皿、 '.Ε?、 ψ、 ΐ&、 、具  P b. 01 ?, · and, and, dish, '.Ε? , Ψ, ΐ &,, with
Q 、 、骨、 L Q,, bone, L
―、3 、 、 、墨 、 1、 、©.、 ^ ' 7 、  ―, 3, 、, ink, 1, 、 ©., ^ '7 、
7、 "、 孓 H 、 H 、 乙、7, ", 孓 H, H, B,
J 5 > > J 5>>
厶 、 、 ^ 、  厶,, ^,
J  J
」 、厂 、 Γ 、  '', Factory, Γ,
D. 、曰 、曰、  D.,,,,,
、戶 、戶、 Τ B SZ几么-2 P、  , Households, households, Τ B SZ How many -2 P,
、 丄 、 . 、  , 丄,.,
S、 、巾、■¾、门、 贝 曲 -、、、 :  S,, towel, ■ ¾, door, shell--,,,:
"
Figure imgf000033_0001
Figure imgf000033_0001
v  v
、 、 、文、 、 、夂、大、大、史 z 'J 卜 ¾、F3  ,,,,,,,,,,,,,,,,, Z z 'J BU ¾, F3
PCT/CN2003/000858 2002-10-16 2003-10-14 Pattern-code chinese character input method with “three patterns association” WO2004036404A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003272866A AU2003272866A1 (en) 2002-10-16 2003-10-14 Pattern-code chinese character input method with "three patterns association"

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN02131465.9A CN1328649C (en) 2002-10-16 2002-10-16 Chinese character 'three-shape association' shape-code input method
CN02131465.9 2002-10-16

Publications (1)

Publication Number Publication Date
WO2004036404A1 true WO2004036404A1 (en) 2004-04-29

Family

ID=29257509

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2003/000858 WO2004036404A1 (en) 2002-10-16 2003-10-14 Pattern-code chinese character input method with “three patterns association”

Country Status (3)

Country Link
CN (1) CN1328649C (en)
AU (1) AU2003272866A1 (en)
WO (1) WO2004036404A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103927022A (en) * 2014-03-21 2014-07-16 江山 Pinyin-form Chinese character input method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01292564A (en) * 1988-05-17 1989-11-24 Liu William Method for boshiamy code input of chinese caracters and keyboard used for the input
CN1057727A (en) * 1990-03-13 1992-01-08 蒋辅文 Phonetic element encoding method
CN1078563A (en) * 1993-05-31 1993-11-17 王伟 Associated Chinese Character radical code input method
CN1142073A (en) * 1995-07-28 1997-02-05 毛良 Pattern-sound code input method for popularizing Chinese characters and key-board thereof

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1110806A (en) * 1994-04-25 1995-10-25 汤建民 Intelligence five-stroke double-spelling code letter-word chain type positioning association input method
CN1247336A (en) * 1998-09-08 2000-03-15 张云衡 Shape-pronunciation code
CN1269542A (en) * 1999-04-05 2000-10-11 深圳市粤丝实业有限公司 Association Chinese character input system
CN1273383A (en) * 2000-06-06 2000-11-15 王敦清 Chinese-character 'shape-pronunciation code' input method for computer

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01292564A (en) * 1988-05-17 1989-11-24 Liu William Method for boshiamy code input of chinese caracters and keyboard used for the input
CN1057727A (en) * 1990-03-13 1992-01-08 蒋辅文 Phonetic element encoding method
CN1078563A (en) * 1993-05-31 1993-11-17 王伟 Associated Chinese Character radical code input method
CN1142073A (en) * 1995-07-28 1997-02-05 毛良 Pattern-sound code input method for popularizing Chinese characters and key-board thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103927022A (en) * 2014-03-21 2014-07-16 江山 Pinyin-form Chinese character input method

Also Published As

Publication number Publication date
AU2003272866A1 (en) 2004-05-04
CN1328649C (en) 2007-07-25
CN1455316A (en) 2003-11-12

Similar Documents

Publication Publication Date Title
TW201407423A (en) Keyboard input method based on sequence of finals
CN102253728B (en) A kind of initial and final double-spelling input method of Chinese character and dummy keyboard thereof
Becker Typing Chinese, Japanese, and Korean
WO2005121993A1 (en) Application system of multidimentional chinese learning
CN103616960A (en) Six vowel binary syllabification input method
WO2013135208A1 (en) Input method for compatible keyboard
CN102053719A (en) Input method for Chinese characters
CN105183183B (en) A kind of sound shape of square seedling text, which combines, rapidly inputs coding and its optimization method
CN103246354A (en) Inputting method for encoding and expressing Chinese characters through common language characters and keyboards of inputting method
CN104820503A (en) Chinese character input system integrating plurality of manners of spelling and encoding character patterns
CN101751139A (en) Encoding scheme for inputting pinyin with tones
WO2004036404A1 (en) Pattern-code chinese character input method with “three patterns association”
CN103207684A (en) Phonemic letter double-input method
CN104536590B (en) Embedded software keyboard system based on West Xia Dynasty&#39;s text sound character roots input method
CN102455785A (en) Method for inputting Chinese characters by computer
CN102023714A (en) Novel pinyin keyboard
CN106325540B (en) A kind of simple stroke input method of Northeast Yunnan, China subdialect seedling text and its application
CN101751134A (en) Right upper left Chinese character input method
CN214670487U (en) BPMF keyboard for computer
CN104620200A (en) Chinese character input method and keyboard
WO2014139060A1 (en) Specified holographic chinese character input method used for teaching and input method system
CN107357445B (en) Input method capable of reading and writing or combining Chinese characters to assist spelling letters
WO2023124573A1 (en) Digital new language and input method therefor, and input tool
CN107066104B (en) Optimum key position of computer keyboard for mutual reading and translation between Chinese and English
KR101384455B1 (en) Computer, electronic dictionary, smartphone keyboard Japanese alphabet arrangement

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP