CN1098525A - Profile phonetic compound code - Google Patents
Profile phonetic compound code Download PDFInfo
- Publication number
- CN1098525A CN1098525A CN 94112196 CN94112196A CN1098525A CN 1098525 A CN1098525 A CN 1098525A CN 94112196 CN94112196 CN 94112196 CN 94112196 A CN94112196 A CN 94112196A CN 1098525 A CN1098525 A CN 1098525A
- Authority
- CN
- China
- Prior art keywords
- character
- code
- radical
- radicals
- chinese
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The method of Chinese character coding that a kind of ideophone is compound based on the most basic ideophone method of Chinese characters word-formation, decomposes basic stroke and the radicals by which characters are arranged in traditional Chinese dictionaries that constitute Chinese character, reduce 74 kinds of radicals of 10 big classes, and represent with 0~9 10 number character codes.The present invention encodes to individual character and word, and the former is auxilliary based on numerical code, with character code, and the latter is auxilliary based on character code, with numerical code, and code length is 4.This coding radical decomposes science, and clear rules is easily learned easily note, easy to operate.The words communal space, capacity is huge, and the repetition rate of coding is low, and code length is identical, is convenient to beat soon continuously, touch system.The present invention has stepped a huge step for optimum condition and the top speed that realizes the computer Chinese-character input.
Description
The invention belongs to the Hanzi coding technique field, is a kind of Profile phonetic compound code.
Kind surplus Hanzi coding scheme has reached 400 at present.These schemes can be divided and made three major types: according to the sound sign indicating number of Chinese-character pronunciation coding, and according to the sound sign indicating number of the encoding of Chinese font, and the phonetic-stroke code of taking all factors into consideration word sound and character shape coding.Various encoding schemes all have the chief, and shortage is also respectively arranged, and a hundred flowers blossom in the field of encode Chinese characters for computer for they, and the strange bucket of bucket is gorgeous.
The sound sign indicating number is easily learned easily note, but repetition rate of coding height, input speed is slow., add more than 1,200 only of tone sign because but the Chinese character syllabify has only more than 400 altogether.This just occurs a large amount of repeated code words inevitably.Though font code has overcome the obstacle of repeated code word basically, but on the decomposition font, do intricately.Common people are difficult to grasp its method at short notice, as for various phonetic-stroke codes, though each has overcome the defective of sound sign indicating number, font code in some aspects, but bring many new problems simultaneously, have the also few of extensive applying value.
The objective of the invention is to based on the most basic ideophone method of Chinese characters word-formation, provide a kind of and can overcome the defective that sound code weight code check height, font code font decompose complexity, and clear rules, manipulate ideophone multiple coding method easily.
As everyone knows, Chinese character is a kind of ideograph that is formed by pictograph development, and its word-formation method mainly contains four kinds of pictograph, self-explanatory characters, understanding, ideophones, and the most common with the ideophone method, vitality arranged most.Statistical data is told us, is phonogram more than 90 percent in the Chinese character.Therefore, we can say that the characteristics of phonogram are exactly the basic characteristics of Chinese character.Utilize the characteristics of phonogram that Chinese character is encoded, be to meet actual looks of Chinese character and writing style most, thereby also may be classic encode Chinese characters for computer.
The most tangible characteristics of phonogram are to resolve into the pictographic element of a pictophonetic of table justice and phonetic element of a Chinese pictophonetic character two parts of watch sound easily.Those non-pictophonetic characters, and some have lost the phonogram of original appearance through long term evolution, though the branch of invisible side, the phonetic element of a Chinese pictophonetic character, perhaps the pictographic element of a pictophonetic, phonetic element of a Chinese pictophonetic character difference are not obvious, according to radicals by which characters are arranged in traditional Chinese dictionaries and stroke, often still can resolve into two-part.Utilize the characteristics of phonogram that Chinese character " is divided into two " or " one is divided into three " is the main thought and the main way of this programme.
The present invention will constitute the basic stroke and the radicals by which characters are arranged in traditional Chinese dictionaries decomposition of Chinese character and be generalized into 74 kinds of radicals of ten big classes.And use 0~90 Ah position uncle numeral respectively.All the coding of Chinese character promptly produces on this basis.
The pronunciation of Chinese character is one of Chinese character three elements.When the design encoding scheme, make full use of this information of Chinese-character pronunciation, must bring positive effect new encoding scheme.Profile phonetic compound code has taken into full account this key element of pronunciation of Chinese character to single encode Chinese characters for computer the time, and with it is few as those strokes, font is simple in structure and mostly again are parts of the coding of daily word.Owing to increased the pronunciation information of this part Chinese character, thereby stepped a conclusive step for thoroughly eliminating the repeated code phenomenon.
The present invention is also to often there being word to encode.Because word is different with the individual character form of expression: individual character only is difficult to determine with pronunciation, also will lean on written form ability final clear and definite; And the word above word of double-tone joint particularly relies on pronunciation just can judge.So this programme is main foundation to the word coding with pronunciation, font information then is used for distinguishing the unisonance word.This programme had both absorbed the strong point of general sound sign indicating number to the coding of word, had the characteristic of oneself again, thereby made the coding and the input of word, became very simple and efficient.
Concrete scheme of the present invention is as follows: all basic strokes and the radicals by which characters are arranged in traditional Chinese dictionaries that will constitute Chinese character are decomposed into radical, and reduce totally 74 kinds of 10 big classes.To 10 class radicals respectively with 0~9 totally 10 numerical codes number represent.The name of this 10 class radical is called horizontal, vertical, left, points, discount, mouth, fork, eight, covers, closes, and corresponding numerical code is respectively: 1,2,3,4,5,6,7,8,9,0.74 radicals are divided into main level radical (26) digital stage radical (16), enlarge level radical (32), specifically see following etymon list:
Based on above-mentioned radical and numerical code thereof number, the present invention proposes the coding method of Chinese character individual character and word:
About the Chinese character single character code.Each Chinese character individual character is decomposed by above-mentioned radical, form this character code with corresponding numerical code, for the less individual character of stroke, then be combined into this character code with the numerical code of radical and the character code of this word, yardage is no more than 4.By how much being divided into of individual character radical of following 3 kinds of situations:
(1) is less than the individual character of 4 radicals, the numerical code of getting each radical successively by sequential write, additional letter sign indicating number more at last.Example: mouthful: Jiong,, 91K.In: two, Pie, 22V.
Emerging:
, one, eight, 418X.
(2) equal the individual character of 4 radicals,, get 4 yards on foot by the numerical code that sequential write is got each radical successively.Example:
(3), be divided into 2 kinds of situations more than the individual character of 4 radicals.
1, single character, by the numerical code that sequential write is got preceding 4 radicals successively, all the other need not.
2, combinde rqdical character, the structure word situation of combinde rqdical character are complicated, and according to the structure word situation of combinde rqdical character, this sign indicating number is divided into three kinds of situations with combinde rqdical character to be handled.
(1) " two or two formula ", this class word is made of two parts, each part can resolve into two or more radicals, be called " two or two formula ", its coding method is: earlier individual character " is divided into two ", each several part is resolved into radical again, get the numerical code of forward and backward two-part the first two radical then by sequential write respectively, be combined into 4 yards.Example:
(2) " one or three formula ", this class word is made of two parts, and a preceding part has only a radical, and a back part can be decomposed into the radical more than three, is called " one or three formula ".Its coding method is: earlier individual character " is divided into two ", again a back part is divided into radical, the numerical code of a radical of a part and preceding 3 the radical numerical codes of a back part are combined into 4 yards before getting then.
(3) " three same form ", this class word is made of two parts, and a preceding part can resolve into the radical more than three, and a back part has only a radical, is called " three same form ".Its coding method is: earlier individual character " is divided into two ", a more preceding part is resolved into radical, a back part is radical, and the numerical code of the numerical code of first three radical of a part and a radical of a back part is combined into 4 yards before getting.
About Chinese character phrase coding.The present invention encodes for the everyday expressions of being made up of 2 Chinese characters and 2 above Chinese characters, its basic skills is to form 4 speech sign indicating numbers of this speech with the numerical code of the character code of this speech and relevant radical, specifically divides following 4 kinds of situations by Chinese character (syllable) quantity of word:
1, for disyllabic word, the character code of getting these 2 words is successively earlier got the numerical code of first radical of these 2 words more successively and is formed this speech sign indicating number.Example: writer: ZJ39.
2, for trisyllable, the character code of getting these 3 words is successively earlier got the numerical code of first word first radical again and is formed this speech sign indicating number.For example: computing machine: JSJ4, modernization: XDH1.
3,, get the character code of these 4 words successively and form this speech sign indicating number for quadrisyllable.For example, encode Chinese characters for computer: HZBM advances by leaps and bounds: TFMJ.
4,,, get the character code of preceding 3 words and last word successively and form this speech sign indicating number promptly more than the speech of 4 syllables for polysyllabic word.For example: Indonesia: YDNY.
Above-mentioned so-called character code is meant the consonant letter of this word, and ZH, CH, SH get Z, C, S respectively.
I is capable, u is capable, the capable zero consonant word of u, presses phonetic plan and handles, and i, u are write as Y, W respectively, and u then is rewritten into V.
The order of the above-mentioned radical of mentioning as preceding 2 radicals of first radical etc., is the order determined according to rules for writing.
The Profile phonetic compound code that the present invention proposes is simultaneously according to Chinese character, particularly phonogram font and pronunciation two aspect information, integrated use numerical code and character code, it is auxilliary that individual character is carried out based on shape, with sound, carries out based on sound, with shape to word and is auxilliary coding, is a kind of phonetic-stroke code of novelty.It has following several characteristics:
One, design science is easily learned easily note.This coding decomposes with radical the stroke that constitutes whole Chinese characters and radicals by which characters are arranged in traditional Chinese dictionaries and concludes, and the differentiation of big " class " (10) is clear and definite, the limited amount of " kinds " (74), thereby easily learn easily and remember, easy to use.Brief and concise being convenient to of coding rule and method operated, and general personnel can expertly carry out Chinese character input through short-term study and training.
Two, capacity is huge, and words is shared.This coding is enabled 0~90 numerical code and 26 character codes of A~Z simultaneously, except having simultaneously that numerical code and character code are former and having living space, also has the huge space of numeral and monogram sign indicating number.4 isometric numerical codes can be held whole 6763 Chinese characters of regulation in the national standard " Chinese Character Set Code for Informati (baseset) " more than sufficiently, and 4 isometric character codes and alphanumeric sign indicating number then can hold Modern Chinese basic vocabulary and various special-purpose vocabulary fully.Character code, the speech sign indicating number communal space, the two input mode is identical, and is without gear shift, easy to operate.
Three, the space is wide, eliminates repeated code.Between word and the word, between speech and the speech because space is wide, dispersion is big, thereby the repeated code phenomenon is dropped to bottom line.We can say that the ideal state of one yard one word of this coding distance, one yard one speech has only one step away.
Four, flexible, can letter can expand.To the scheme of this coding, can simplify according to the actual needs of input.For example on the basis of routine coding, work out one-level digital brevity code, two-stage digital brevity code, one-level letter brevity code, secondary letter brevity code respectively, also can work out three stages of digital brevity codes and three grades of alphabetical brevity codes naturally.As special requirement, also can work out special brevity code.Therefore, originally be coded in establishment brevity code aspect special convenience is provided.The also visual actual needs of this coding, a large amount of new sign indicating number, expansion word, speech capacity set.In addition, also can work out different specialties simultaneously with sign indicating number, for special requirement.Therefore originally being coded in the dilatation aspect has great potential.
Five, coding is isometric, makes things convenient for touch system.This coding single character code is isometric numerical code (being serial code), and word is encoded to letter, combination of numbers sign indicating number.Numerical code is more easy to operate than character code, isometricly then helps realizing continuous touch system.
Six, rapidly and efficiently, time saving and energy saving.These above-mentioned all characteristics of encoding are all in order to make great efforts to reach the optimum condition and the highest level of Chinese character for computer input.Skilled day by day along with operative technique catches up with spoken speed and also will become a reality.We also wish: the appearance of Profile phonetic compound code, with changing the difficult backward state of Chinese character for computer input, make great efforts to make it to become relaxation and happiness, make us happy work.
Profile phonetic compound code is the encode Chinese characters for computer extremely widely of a kind of purposes.It also has other multiple use except mainly applying to the Chinese character for computer input.For example, this coding can be used for searching and produce index, is having more superiority aspect the layout dictionary especially.Because the no cumbersome rule of this coding, do not have each transfer process, see that word can go out sign indicating number, yard number page number, simple and easy quick, non-other mode searchings are comparable.Profile phonetic compound code can apply to simple and easy shorthand, need not pass through specialized training, directly with coding instead of part everyday character and everyday expressions, can improve writing speed in very short time.Simultaneously, the writer also can utilize this encoding scheme according to need of work and writing style, special brevity code or the new sign indicating number of interim initiative, very convenient practicality.Profile phonetic compound code can pass through appropriate reconstruction, replaces old-fashioned telegraph code, alleviates the labour intensity of telegram transmitting-receiving greatly, increases work efficiency and service quality.In addition, Profile phonetic compound code all has wide application prospect to scientific research, culture and education, journalism, taking care of books, password formulation, software development aspects.
Claims (3)
1, the compound method of Chinese character coding of a kind of ideophone is characterized in that the basic stroke and the radicals by which characters are arranged in traditional Chinese dictionaries that will constitute Chinese character are decomposed into radical, and radical is reduced totally 74 kinds of 10 big classes, to 10 class radicals respectively with 0~9 totally 10 numerical codes number represent:
2, the method for Chinese character coding according to claim 1 is characterized in that each Chinese character is decomposed by above-mentioned radical, forms this character code with the numerical code and the character code that are no more than 4 sign indicating numbers:
(1),, adds the character code of this individual character at last by the numerical code that sequential write is got each radical successively for the individual character that is less than 4 radicals;
(2), get the numerical code of 4 radicals successively by sequential write for the individual character that equals 4 radicals;
(3), be divided into 2 kinds of situations for individual character more than 4 radicals:
1. single character, the numerical code of getting preceding 4 radicals by sequential write successively;
2. combinde rqdical character is divided into three kinds of situations with it and handles:
(ⅰ) " two or two formula " is decomposed into 2 parts with individual character, again each several part is decomposed into radical, gets the numerical code of forward and backward two-part preceding 2 radicals successively according to sequential write;
(ⅱ) " one or three formula " is decomposed into 2 parts with individual character, the numerical code of a radical of a part before getting earlier, the numerical code of getting preceding 3 radicals of a back part again;
(ⅲ) " three same form " is decomposed into 2 parts with individual character, the numerical code of preceding 3 radicals of a part before getting earlier, the numerical code of getting a radical of a back part again.
3, the method for Chinese character coding according to claim 1 is characterized in that 2 Chinese characters and 2 words that above Chinese character is formed, and forms this speech sign indicating number with the character code of this speech with relevant radical numerical code respectively:
(1) for disyllabic word, the character code of getting these 2 words successively earlier, the numerical code of getting first radical of these 2 words more successively;
(2) for trisyllable, the character code of getting these 3 words successively earlier, the numerical code of getting first radical of first word again;
(3), get the character code of these 4 words successively for quadrisyllable;
(4), get the character code of preceding 3 words and word of thing successively more than tetrasyllabic speech.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN94112196A CN1054930C (en) | 1994-06-08 | 1994-06-08 | Profile phonetic compound code |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN94112196A CN1054930C (en) | 1994-06-08 | 1994-06-08 | Profile phonetic compound code |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1098525A true CN1098525A (en) | 1995-02-08 |
CN1054930C CN1054930C (en) | 2000-07-26 |
Family
ID=5035990
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN94112196A Expired - Fee Related CN1054930C (en) | 1994-06-08 | 1994-06-08 | Profile phonetic compound code |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1054930C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103760988A (en) * | 2013-11-18 | 2014-04-30 | 赵树清 | Syllable Chinese character input method |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1048812C (en) * | 1987-05-22 | 2000-01-26 | 熊正才 | Chinese characters stroke-order and shape-code coding input method |
CN1044717A (en) * | 1989-06-08 | 1990-08-15 | 李伟君 | Chinese character with ten-radicals stroke ordered codes |
CN1027196C (en) * | 1991-12-31 | 1994-12-28 | 丘镇华 | Computer Chinese character digitizing input method and ingenious keyboard therefor |
-
1994
- 1994-06-08 CN CN94112196A patent/CN1054930C/en not_active Expired - Fee Related
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103760988A (en) * | 2013-11-18 | 2014-04-30 | 赵树清 | Syllable Chinese character input method |
CN103760988B (en) * | 2013-11-18 | 2016-08-31 | 赵树清 | A kind of syllable Chinese character input method |
Also Published As
Publication number | Publication date |
---|---|
CN1054930C (en) | 2000-07-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1054930C (en) | Profile phonetic compound code | |
CN101046707A (en) | Input method for Chinese character of first pronunciation | |
CN1106146A (en) | Computer input method by computer Chinese-character phonology-tone coding and its keyboard | |
CN1188771C (en) | Radical form code Chinese character input method and keyboard | |
CN1081811C (en) | Chinese strock pronunciation code encoding input method | |
CN1107254C (en) | Positive negative dual-electrode initial-consonant vowel form code input system for Chinese characters | |
CN1074556C (en) | Chinese character inputting method and keyboard by pronunciation and corner codes | |
CN1108553C (en) | Universal popular voice form Chinese character coding input method | |
CN1080070A (en) | The ideophone position holographic Chinese characters coding | |
CN1146023A (en) | Chinese learning code | |
CN1296807C (en) | Voice-Voice Chinese Character inputting method | |
CN1074296A (en) | A kind of Chinese phonetic phoneme method of Chinese character coding | |
CN1073023A (en) | Chinese character dual-part numeric code input method and keyboard thereof | |
CN1164695A (en) | Chinese character stroke-form numeric coding method | |
CN1339733A (en) | Chinese character Hanyi code input method and keyboard for computer | |
CN1202647A (en) | Phonetic Chinese characters | |
CN1100538A (en) | New spelling Chinese input method and its keyboard design | |
CN1373408A (en) | Chinese-character 'root code' input method for computer | |
CN1584876A (en) | Multidimensional computer coding Chinese studying system | |
CN86107214A (en) | A kind of Chinese word input method and keyboard thereof | |
CN1191340A (en) | Chinese character positive pole and negative pole shape code entering system | |
CN1129825A (en) | Second-generation regional code of Chinese characters for keyboard inputting | |
CN1049418A (en) | Chinese character keyboard input method for unified code computer | |
CN1376968A (en) | Chinese-character 'Three roots' input method for computer | |
CN1375763A (en) | Chinese character encoding method grouping in consonants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |