CN1049727A - Form-pronunciation encoding of chinese characters - Google Patents
Form-pronunciation encoding of chinese characters Download PDFInfo
- Publication number
- CN1049727A CN1049727A CN 90102877 CN90102877A CN1049727A CN 1049727 A CN1049727 A CN 1049727A CN 90102877 CN90102877 CN 90102877 CN 90102877 A CN90102877 A CN 90102877A CN 1049727 A CN1049727 A CN 1049727A
- Authority
- CN
- China
- Prior art keywords
- chinese
- character
- chinese characters
- radical
- pronunciation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The invention discloses a kind of computer Chinese character coding method.The present invention mainly is the initial of getting the whole word Chinese phonetic alphabet of Chinese character, the initial of getting the part or all of Chinese character root Chinese phonetic alphabet by the Chinese-character writing rule constitutes form-pronunciation encoding of Chinese characters successively then, has overcome the shortcoming that existing encode Chinese characters for computer brings by the coded system of phonetic transcriptions of Chinese characters or spell shape.The present invention has easily remembers beginner Yi Xue, the repetition rate of coding is low, input efficiency is high, rule is simple and clear, keyboard need not any note, meet the advantages such as speech habits of Chinese.
Description
The present invention relates to a kind of computer Chinese character coding method.
The modern key of Technology of Chinese Information Processing is the Chinese characters in computer input technology, and the subject matter in the input technology is the coding of Chinese character.Up to now the coding method of Chinese character has reached hundreds of, and can be practical only tens kinds, they exist defective to some extent, will comparatively use below ten kinds encode and The characteristics as follows:
Numbering coding name encoding mode repeated code situation retrieval mode is easily learned easy note degree
It is the most difficult that 01 International Zone bit code numerical coding does not have repeated code calculating retrieval
02 Head-Til stroke Chinese code spell shape coding has the repeated code retrieval of tabling look-up difficult
03 phonetic sign indicating number Pinyin coding has repeated code to table look-up retrieval easily
04 quick Head-Til stroke Chinese code hybrid coding has the repeated code retrieval of tabling look-up difficult
The micro-repeated code retrieval of tabling look-up of encoding of 05 5 graphemic code spell shapes is difficult
06 double spelling code Pinyin coding do not have repeated code table look-up the retrieval easier
The no repeated code of 07 4 jiaos of sign indicating number frameworks coding table look-up retrieve easier
08 fast coding Pinyin coding has the repeated code retrieval of tabling look-up easier
The no repeated code of 09 front-three-end-one sign indicating number spell shape coding is tabled look-up and is retrieved difficulty
10 storehouses lucky sign indicating number spell shape coding has the repeated code retrieval of tabling look-up difficult
The major defect of phonetic sign indicating number class is that the requirement operating personnel pronounce accurately in the big class coding of last table two, and repeated code is too many.The major defect of phonetic shape code class is that coding rule is more, and the beginner is difficult to very fast grasp, so the shortcoming of this two classes encode Chinese characters for computer has hindered the raising of Chinese character input speed and popularizing of they separately to some extent.
The objective of the invention is for overcoming the shortcoming of spell shape, Pinyin coding, provide that a kind of coding rule is few, the repetition rate of coding is low, the form-pronunciation encoding of Chinese characters method easy to learn that input efficiency is high.
Initial of to the effect that getting the whole word Chinese phonetic alphabet of Chinese character of the present invention, press Chinese-character writing then from left to right, the initial that the rule of sealing after from top to bottom, in the elder generation is got part or all of Chinese character root phonetic constitutes the form-pronunciation encoding of Chinese characters that computer Chinese-character is imported successively.
The Chinese Pin Yin initial of Chinese character has 23, they are that a, b, c, d, e, f, g, h, j, k, l, m, n, o, p, q, r, s, t, w, x, y, z(c and ch are merged into one, s and sh are merged into one, z and zh are merged into one), if establishing yardage is three, then the denotable Chinese character number of coding of three initial compositions is NUM:
NUM=23
3=12167()
Therefore represent GB GB2312(80 with form-pronunciation encoding of Chinese characters) totally 6763 Chinese characters are more than sufficient for I and II.
The radical of form-pronunciation encoding of Chinese characters is defined as three classes: basic element of character, distortion radical, stroke radical.
The basic element of character of form-pronunciation encoding of Chinese characters is the simple Chinese character that can be used to organize word.For example: rice, king, stone, gold, fire, soil etc.The constitution principle of basic element of character is that the basic element of character number of a Chinese character is no more than three, can be made up of " day " and " getting " as " ", also can be made up of " day ", " ear ", " again ", but can not resolve into basic element of character more than three again." " word can be trigram, also can be four yards.This form-pronunciation encoding of Chinese characters can have many yards of words.
The distortion radical of form-pronunciation encoding of Chinese characters has two classes, one class is the radical of Chinese character, and its constitution principle is to get the custom call of people to radical, gets the initial r of the herringbone Chinese phonetic alphabet as " Ren ", " Xiangxi " get fire initial h of the word Chinese phonetic alphabet, " Lv " gets the initial c of the Chinese character written in the cursive hand Chinese phonetic alphabet.Another kind of distortion radical is some radicals that are similar to simple Chinese character, as in the beautiful word "
" get the initial r of day (re) word Chinese phonetic alphabet.
The Chinese character component part of form-pronunciation encoding of Chinese characters except basic element of character and distortion radical all is the stroke radical, and the stroke radical has eight, and they are a Dian (dian), horizontal one (hang), and perpendicular Shu (shu) casts aside Pie (pie), presses down
(na), carry Pie (ti), folding
ㄑ, Off (zhe) collude
, Yi, , ㄋ, second (gou).
Form-pronunciation encoding of Chinese characters is three bit codes or four bit codes.When coding surpasses four, only get preceding four.Having only " one " and " second " in the Chinese character is a stroke Chinese character, so its coding is expanded into YHH and YGG.
Major advantage of the present invention is that the initial by the initial of the Chinese phonetic alphabet of the whole word of Chinese character and its radical phonetic constitutes, it does not have c and ch, s and sh, the branch of z and zh cerebral and non-cerebral, thereby do not require that phonetic is accurate, it only requires that operating personnel know that 23 initials of phonetic transcriptions of Chinese characters are promptly passable, because the pronunciation of radical is consistent with Chinese speech pronunciation, do not need other memory, and the element-initial of coding is easily learned easily note.Because the present invention is the pronunciation of getting Chinese character pattern, so its coding rule is very simple, reduced memory capacitance, and the corresponding relation of each memory coding and keyboard is very easy.Shape-pronunciation code is that the initial by the initial of whole Chinese characters phonetic and the radical Chinese phonetic alphabet constitutes, such coded system repetition rate of coding is very low, clear rules, it is more suitable for developing direction-speech recognition system and the natural-sounding understanding system that computer Chinese-character is handled than phonetic shape code.
Form-pronunciation encoding of Chinese characters is a kind of encode Chinese characters for computer that adapts to the computer Chinese-character input, adopt the Chinese character information processing importation of shape-pronunciation code, also can add auxiliary process technology such as association type input, phrase input, tolerant code, the processing of frequency repeated code, once popularization, it can be used in Chinese character information modernization processing procedure widely.
Form-pronunciation encoding of Chinese characters is exemplified below:
The radical shape-pronunciation code that Chinese character can resolve into
The Chinese (Han) Rui (San Dian) is (You) HSDY again
Word (Zi) Http (Bao) (Zi) ZBZ
Beautiful (Yu) GKY of state (Guo) mouthful (Kou)
Lee (Li) wood (Mu) (Zi) LMZ
Li (Li) factory (Cang) lining (Li) LCL
Carp (Li) fish (Yu) lining (Li) LYL
(Shi) Yin (Go) LSG is shown in gift (Li)
Jasmine (Li) Lv (Cao) standing grain (He) Dao (Dao) LCHD
Sharp (Li) LCL of Lv (Cao)
Litchi (Li) Lv (Cao) power (Li) power (Li) LCLL
Official (Li) one (Hen) history (Shi) LHS
Chestnut (Li) west (Xi) wood (Mu) LXM
The LCW of strict (Li) factory (Cang) ten thousand (Wan)
Encourage (Li) strict (Li) power (Li) LLL
Happy (Le) LSL of gravel (Li) stone (Shi)
Go through (Li) factory (Cang) power (Li) LCL
Sharp (Li) standing grain (He) Dao (Dao) LHD
Lisu Nationality (Li) Ren (Ren) trembles (Li) LRL
Example (Li) Ren (Ren) row (Lie) LRL
Sharp (Li) LRL of clever (Li) Ren (Ren)
Sharp (Li) LBL of dysentery (Li) Epileptic (Bin)
Upright (Li) Dian (Dian) (Heng) Dian (Dian) LDHD
Upright (Li) LML of grain (Li) rice (Mi)
Drop (Li) three (San Dian) is gone through (Li) LSDL
Claims (4)
1, a kind of computer Chinese character coding method, shape, sound with Chinese character are coding, it is characterized in that getting the Chinese Pin Yin initial of the whole word of Chinese character, press Chinese-character writing then from left to right, the initial that the rule of sealing after from top to bottom, in the elder generation is got part or all of Chinese character root phonetic constitutes form-pronunciation encoding of Chinese characters successively.
2, form-pronunciation encoding of Chinese characters method according to claim 1 is characterized in that the radical of form-pronunciation encoding of Chinese characters has basic element of character, and basic element of character is the simple Chinese character that can be used to organize word, and the basic element of character number of forming a Chinese character is no more than three.
3, form-pronunciation encoding of Chinese characters method according to claim 1 is characterized in that the radical of form-pronunciation encoding of Chinese characters has the distortion radical, and they are that some are similar to the radical of simple Chinese character and the radical of Chinese character.
4, form-pronunciation encoding of Chinese characters method according to claim 1 is characterized in that the radical of form-pronunciation encoding of Chinese characters has eight stroke radicals, and they are point (dian), horizontal (heng), perpendicular (shu) casts aside (pie), presses down (na), folding (zhe) is carried (ti), colludes (gou).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 90102877 CN1049727A (en) | 1990-06-13 | 1990-06-13 | Form-pronunciation encoding of chinese characters |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 90102877 CN1049727A (en) | 1990-06-13 | 1990-06-13 | Form-pronunciation encoding of chinese characters |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1049727A true CN1049727A (en) | 1991-03-06 |
Family
ID=4877759
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 90102877 Pending CN1049727A (en) | 1990-06-13 | 1990-06-13 | Form-pronunciation encoding of chinese characters |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1049727A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1068126C (en) * | 1995-04-06 | 2001-07-04 | 许保才 | Chinese characters chain-code computer input method and keyboard thereof |
CN103729068A (en) * | 2013-12-27 | 2014-04-16 | 王金学 | Coding input method for pinyin initial letters of Chinese characters and word roots |
-
1990
- 1990-06-13 CN CN 90102877 patent/CN1049727A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1068126C (en) * | 1995-04-06 | 2001-07-04 | 许保才 | Chinese characters chain-code computer input method and keyboard thereof |
CN103729068A (en) * | 2013-12-27 | 2014-04-16 | 王金学 | Coding input method for pinyin initial letters of Chinese characters and word roots |
CN103729068B (en) * | 2013-12-27 | 2017-01-11 | 王金学 | Coding input method for pinyin initial letters of Chinese characters and word roots |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1049727A (en) | Form-pronunciation encoding of chinese characters | |
CN101063905A (en) | Sound and digital code Chinese-character input method | |
CN101046707A (en) | Input method for Chinese character of first pronunciation | |
CN1251436A (en) | Coding method for Chinese characters input on computer | |
CN1052799C (en) | Chinese character coding method and its keyboard | |
CN1106146A (en) | Computer input method by computer Chinese-character phonology-tone coding and its keyboard | |
CN1119742C (en) | Pictophonetic code computer keyboard inputting method for Chinese characters | |
CN1116336A (en) | Substitution type Chinese phonetic character, word input coding method and keyboard thereof | |
CN1099494A (en) | Encoding method for identification Chinese by initial consonant and strokes and keyboard thereof | |
CN1530805A (en) | Chinese character shape inputting system | |
CN1043092C (en) | English input code and its keyboard | |
CN1224188A (en) | Chinese character stroke encoding input method | |
CN1215859A (en) | Radical and phonetic code | |
CN1142629A (en) | Compiling method for nine stroke coding of Chinese characters | |
CN1075644C (en) | Combined code Chinese character unit phonic encoding entering method and keyboard thereof | |
CN1007094B (en) | Fifty word-elements multifunction chinese character input system and its special keyboard | |
CN1174349A (en) | Twenty-nine radical code for Chinese character coding and input | |
CN1049054C (en) | Eight-diagrams code Chinese charactor input method and keyboard thereof | |
CN1029357C (en) | Coding method for Chinese character components and its key board | |
CN1503111A (en) | Four corner number based Chinese character input method and keyboard thereof | |
CN1146023A (en) | Chinese learning code | |
CN1122924A (en) | HLV Chinese character spelling inputting method | |
CN1702606A (en) | Eight stroke input method | |
CN1306238A (en) | Chinese-character strokes input method | |
CN1066928A (en) | Chinese characters decomposing and fixing location coding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
RJ01 | Rejection of invention patent application after publication |