CN1120694A - improved pictophonetic Chinese characters code - Google Patents
improved pictophonetic Chinese characters code Download PDFInfo
- Publication number
- CN1120694A CN1120694A CN 95111006 CN95111006A CN1120694A CN 1120694 A CN1120694 A CN 1120694A CN 95111006 CN95111006 CN 95111006 CN 95111006 A CN95111006 A CN 95111006A CN 1120694 A CN1120694 A CN 1120694A
- Authority
- CN
- China
- Prior art keywords
- code
- chinese
- word
- sound
- radical
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The improved Chinese-character shape-pronunciation coding having text function features 26 letters, 4-bit length, encoding root based on pronunciation, static decomposing method for root, and use of pronunciation code and stroke code as identification code. Its advantages are easy mastering and memorizing it.
Description
Improved Chinese-character shape-sound code is a kind of font code, is applied to Chinese information processing.It is the improvement to Chinese-character shape-sound code (number of patent application 91108178.x).
The Hanzi coding scheme of the nineties has developed into a new height, and its quantity is above 1000 kinds.Fundamental type is still sound sign indicating number, font code, phonetic-stroke code three classes.With regard to font code, learnability, standardization all increase, and mainly contain following improvement trend:
(1) the fixed sign indicating number rule of radical trend is strict, and the fixed sign indicating number of sound (or partly pressing the fixed sign indicating number of sound) is pressed in more employing.Holographic sign indicating number as Du Bingchan.
(2) split the strictness of rule trend.As the holographic sign indicating number of Du Bingchan strictly according to order of strokes observed in calligraphy code fetch.
(3) simplify identification code, adopt brevity code to disappear heavily.Adopted brevity code to disappear heavily as the five-stroke form code new edition.Simplified and traditional five sign indicating number identification codes are used the end pen instead, save font.The font code scheme that has adopts this character pronunciation (initial consonant or initial) to make identification code, as the configuration code of Chen Aiwen, four Shape-pronunciation codes of Li Xingmin.
(4) adopt pre-prompt facility.Utilize presenting bank to realize that the system designer wish offers user's prompting, as brevity code word, repeated code word, high frequency word etc. by key ground.
(5) trigram one word or most of trigram one word have been realized, as four Shape-pronunciation codes of Li Xingmin.
(6) adopt dynamic self-word creation.
The current computing machine that is faced with is popularized, and middle and primary schools need impart knowledge to students with coding, once, two kinds " learnability ", " rapidity ", font code scheme that " standardization " level is very high become pressing for of society.
The objective of the invention is to improve original Chinese-character shape-sound code, simplify its identification code, achieve the key property of " literal code ".
Design principle of the present invention and basic structure are as follows:
Chinese character is a kind of spell shape literal.The Chinese character modernization is exactly a Chinese character symbolization, just creates a kind of " literal code " for Chinese character.Literal code just is to require in the cataloged procedure with general coding difference, do not allow to exist the instruction means beyond the philology in the symbol transition.Its task is to describe Chinese character as far as possible perfectly, express Chinese character, progressively becomes the national regulation that a kind of and the Chinese phonetic alphabet have par.This coding not only is used for the Chinese character computer input, can also be used for the Chinese character sort retrieval, and the spelling method teaching of literacy instructs the simplification of Chinese character and the unification of simplified and traditional Chinese characters.This coding in use forms the association between " word-sign indicating number " naturally, might realize the reversal of identification of everyday character.This encoding scheme is created according to the requirement of literal code.
This coding adopts English standard keyboard.2~4 of character codes (be used for full encode Chinese characters for computer and also can adopt 2~5).140~190 roman radicals are pressed sound (initial consonant) code fetch, and the zero initial radical adopts a letter of simple or compound vowel of a Chinese syllable: initial or head vowel of a final letter.21 of initial consonants, Ch, Sh, Zh are write as C, S, Z.Cancellation is separated with initial consonant W, Y, and Y transforms into.C, S can take V, W key, and Z and Z share the Z key.Radical " wood " is pressed the legal sign indicating number of pictograph in " 0 " with " ten ".Totally 26 key positions like this.The radical pronunciation is selected in following pronunciation: the 1. modern pronunciation of Chinese characters, and 2. 3. old Chinese phonology is accustomed to sound, 4. the phonetic symbol sound, do not have ready-made pronunciation or heavily need to draft pronunciation because of keeping away, can adopt following method: 1. shape is changeed meaning method, 2. profiling sound method, 3. imitative sound is economized method, and 5. 4. imitative resection is similar to method.(annotate: the modern pronunciation of Chinese characters is meant modern pronunciation.)
A kind of new radical decomposition method of the invention: static decomposition method.Chinese character is the spell shape literal.Through differentiation, improvement in several thousand, numerous Chinese characters have been by some fundamental figures that standardized-be referred to as parts or radical is stacked formed.The first step of coding: radical decomposes, and acts in a diametrically opposite way exactly; As: wood, mouthful (synthesizing) → bundle (decomposition) → wood, mouth.This decomposition method is complied with the literal tradition.It than common be that the dynamic decomposition method of preface is good with the order of strokes observed in calligraphy." order of strokes observed in calligraphy is preface, get big preferential " such decomposition rule can't be handled the array mode of Chinese character and the contradiction between the ways of writing, as " bundle ", " defending ", " must " etc. the decomposition of word.So king's sign indicating number has been stipulated " taking into account directly perceived ", still " intuitively " is a fuzzy concept.And several rules exist side by side, and cause the ambiguity of decomposition.Du sign indicating number splits in strict accordance with the order of strokes observed in calligraphy, so " must " word splits into " point, hook, point, point ", reflected the font style characteristic of " heart " word.The static decomposition method that this programme is created is set up at this drawback.
Radical splits rule and is made up of a prioritization criteria that must follow criterion and five order utilizations.Must follow criterion is " order of strokes observed in calligraphy consistance " criterion, and its order of strokes observed in calligraphy of radical of this regulation decomposition gained must be consistent with the order of strokes observed in calligraphy in the whole word, and promptly the order of strokes observed in calligraphy of radical can be interrupted by other radical in whole word, but can not put upside down.Article five, prioritization criteria is followed successively by: 1. minimum radical is preferential, 2. take off connect preferential, 3. the single radical preferential/non-folding stroke is preferential, 4. the order of strokes observed in calligraphy is preferential continuously, 5. stroke divides preferential earlier.Explain the main points briefly below:
Minimum radical preferentially is in order to obtain less radical, and such radical font is more complete.
Take off and connect preferentially: the stroke group in the Chinese character becomes radical or parts, illustrates to have certain relation between these strokes.Can be divided three classes according to the tightness degree of its relation, 1. relevant: as not link to each other geometrically, and link to each other on the philology, as eight, the heart; 2. link to each other: the stroke contact; 3. intersect: stroke intersects, and intersection point is arranged.Taking off even, preferential implication is that relation is looser between stroke: the relevant and preferential disengagement that links to each other is characterized in that number of hits does not reduce.
The single radical is preferential/and non-folding stroke is preferential: and under the identical condition of the prioritization criteria of going ahead of the rest, with respect to a multiple radical, the single radical is preferential; With respect to the folding pen, non-folding stroke is preferential.
The order of strokes observed in calligraphy is preferential continuously: the order of strokes observed in calligraphy of static decomposition method regulation radical can be interrupted by other radical in whole word, and under the identical condition of the prioritization criteria of going ahead of the rest, the decomposition result that the order of strokes observed in calligraphy of radical is not interrupted should be preferential.
Stroke divides preferential earlier: under the identical condition of the prioritization criteria of going ahead of the rest, the middle stroke that can belong to two radicals should belong to preceding radical.
Radical order after the decomposition is a preface according to the priority of its first sum of picture.
The radical of this coding decomposes two steps of employing to carry out: the first step is decomposed into individual components earlier.This is meant that these parts and adjacent stroke do not have and involves: what is called involves and is meant between adjacent stroke and can forms new radical.8000 roman Chinese characters have 600 individual components, and wherein more than 200 is single radical, and remainder has only more than 300 multiple radical parts to need to decompose.As long as according to decomposition criteria, grasp the decomposition result of these more than 300 parts, so the decomposition of 8000 roman Chinese characters is just driven a light carriage on a familiar road, be swift in response.
The character code composition rule is as follows:
A) two radical words: primary word collection:
Two package code+word sound sign indicating numbers, the code fetch mode of word sound sign indicating number is identical with package code;
Secondary word collection: the first and last of two a package code+inferior radicals stroke shape Chinese code,
The code fetch mode of stroke shape Chinese code is identical with package code.B) three radical words:
Primary word collection: three package codes;
Secondary word collection: three package code+word end stroke shape Chinese codes; C)>=four radical word:
All get head, two, three, last package code.D) radical word:
Mode one: by 3~4 of first phonetic, back stroke mode code fetches,
That is: package code+subsequent sound sign indicating number+head, inferior, last stroke shape Chinese code,
Primary word collection word is got 3 (indivedual 4), and secondary word collection word is got 4 without exception.
J, Q, X, C, S, Z, Zh, Sh, Zh, follow-up simple or compound vowel of a Chinese syllable I can economize,
Y=Yu,Z=Zh。
Mode two: by 3~4 of package code+head, inferior, last stroke shape Chinese code code fetches.
The high frequency word can adopt the one-level brevity code with a representation, and the one-level brevity code is by whole character pronunciation code fetch.In addition, the secondary brevity code with the first two representation of character code can be arranged, with the three of first three representation of character code.
With reference to word code structure, can work out the speech sign indicating number: two words, form by the first two sign indicating number of two words; Three words are made up of the first two yard of 1. lead-in, the first sign indicating number of back two words, or the first two sign indicating number composition of 2. first, the first sign indicating number of secondary word, last word, 3. three of three words word sound sign indicating numbers+"; " the symbol composition.〉=four words are made up of the first sign indicating number of 1. front-three-end-one word, or 2. its word sound sign indicating number composition.Self-word creation can be accustomed to coding by the user.
Repeated code is handled following mode:
1. brevity code method: when two words are heavy mutually, a word is decided to be the brevity code word, another word row code character word preface first place of attaching most importance to, such two words all can enter automatically.
2. the method for raising the price: three code words are heavy mutually, the order word of repeated code group is added a word sound sign indicating number become four code words and disappear heavily behind its all-key.
3. space bar method: when four code words were heavy mutually, the first preface word (speech) of repeated code group can enter when continuing keystroke automatically, and the available space bar of order word (speech) is sent into.
This coding is because its structure by the sound code fetch can provide a kind of new sign indicating number, note coding mode read.Be decomposed into " day, cutter, mouth, fire " four radicals as " photograph " word, can be encoded to " RDKH ", have two kinds to read coding mode like this, 1. read sign indicating number, 2. read sign indicating number by the radical pronunciation by alphabetical pronunciation.The inventor advises adopting the second way.At this moment, reading sign indicating number is: " Ri, Dao, Kou, Huo ".By the pronunciation of radical, the font structure and the code of individual character are closely connected.This helps the note sign indicating number.So also be our study, the memory Chinese character pattern provides a kind of new method-spelling method.Since ancient times, the memory Chinese character pattern has only by seeing and write two kinds of methods (reading is note word sound), and is much more present a kind of as the method that combines Chinese character pattern into syllables that combines English new word into syllables.
From top introduction, superiority of this coding and originality as can be seen.Its design is rigorous, and standardization is strong, complies with the literal tradition.It has five performances such as easy, quick, readable, standard, simplified and traditional body compatibility concurrently, realizes the performance requirement of literal code substantially, can be used as teaching with coding.
As embodiment, GB I and II word collection is encoded, select about 160 of roman radical for use, together with totally 260 of variant, distortion radicals, see " Shape-pronunciation code radical key bitmap " for details, with mnemonic word.
Zero initial radical and single root coding are as follows: yan, yao, ang, yang → A one,
→ I (yi, ti) er, ye → E Shu, → U (shu) yi, yin → I Pie → P (pie)
(dian, na) Yi, Yin → E (zhe) yue, yu, yong → Y (annotate: Rolling → folding → E) for Dian, → A
A face mode was encoded after present embodiment, radical word adopted first phonetic.Three words adopt the first two sign indicating number of lead-in, inferior, last prefix coee mode to encode, and choose words in constant use 6000.Present embodiment has been made commodity software: XSM V3.0 free suspension type input system has functions such as the prompting of giving, inquiry, self-word creation, association.
Shape-pronunciation code radical key bitmap
Claims (9)
1. an improved Chinese-character shape-sound code that is used for Chinese information processing is characterized in that radical by the sound code fetch, adopts the static decomposition method of radical, and adopting word sound sign indicating number, stroke shape Chinese code is identification code, form package code preceding, identification code after four limit for length's sign indicating numbers.
2. improved Chinese-character shape-sound code according to claim 1 is characterized in that the roman radical chooses 140~190 and be advisable.
3. improved Chinese-character shape-sound code according to claim 1 is characterized in that radical by its pronunciation initial consonant code fetch, and the zero initial radical adopts a letter of simple or compound vowel of a Chinese syllable: initial or head vowel of a final letter.
4. according to claim 1,3 described improved Chinese-character shape-sound codes, it is characterized in that 21 of initial consonants, Ch, Sh take V, W key, Z and Zh share the Z key, and cancellation is separated with initial consonant W, Y, and Y transforms into, vowel " O " is pressed the pictograph method as the code of radical " wood " with " ten ", accounts for 26 key positions altogether.
5. improved Chinese-character shape-sound code according to claim 1, it is characterized in that the radical pronunciation the 1. modern pronunciation of Chinese characters 2. old Chinese phonology 3. be accustomed to sound and 4. select in the phonetic symbol sound, do not have ready-made pronunciation or heavily need to draft the following method of adopting of pronunciation because of keeping away: 1. shape is changeed the meaning method, 2. profiling sound method, 3. imitative sound is economized method, 4. 5. imitative resection is similar to method.
6. improved Chinese-character shape-sound code according to claim 1, the static decomposition method that it is characterized in that radical must be followed criterion by one: order of strokes observed in calligraphy conformance criteria, form with the prioritization criteria of five order utilizations: 1. minimum radical is preferential, 2. take off and connect preferentially, 3. the single radical preferential/non-folding stroke is preferential, 4. the order of strokes observed in calligraphy is preferential continuously, and 5. stroke divides preferential earlier.
7. improved Chinese-character shape-sound code according to claim 1 is characterized in that the character code code length is 2~4 (being used for also available 2~5 of full encode Chinese characters for computer), and its word code structure is as follows:
1. double word root word: primary word collection:
Two package code+word sound sign indicating number/two package codes
Secondary word collection: the first and last of two a package code+inferior radicals stroke shape Chinese code
2. three radical words:
Primary word collection: three package codes
Secondary word collection: three package code+word end stroke shape Chinese codes
One, two, three, last package code 3. 〉=four radical word:.
8. according to claim 1,7 described improved Chinese-character shape-sound codes, it is characterized in that the coding of radical word can adopt one of following two kinds of modes:
(1). by 3~4 of first phonetic, back stroke mode code fetches, that is:
Package code+subsequent sound sign indicating number+head, inferior, last stroke shape Chinese code,
J, Q, X, C, S, Z, C, S, the follow-up simple or compound vowel of a Chinese syllable I of Z can economize Y=
(2). package code+head, inferior, last stroke shape Chinese code.
9. improved Chinese-character shape-sound code according to claim 1 with reference to word code structure, can be worked out the speech sign indicating number, and the speech sign indicating number can be made up of the font code of the word of organizing speech, also can be made up of the sound sign indicating number of word.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 95111006 CN1120694A (en) | 1995-04-12 | 1995-04-12 | improved pictophonetic Chinese characters code |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 95111006 CN1120694A (en) | 1995-04-12 | 1995-04-12 | improved pictophonetic Chinese characters code |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1120694A true CN1120694A (en) | 1996-04-17 |
Family
ID=5078330
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 95111006 Pending CN1120694A (en) | 1995-04-12 | 1995-04-12 | improved pictophonetic Chinese characters code |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1120694A (en) |
-
1995
- 1995-04-12 CN CN 95111006 patent/CN1120694A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5360343A (en) | Chinese character coding method using five stroke codes and double phonetic alphabets | |
CN201264430Y (en) | Book capable of retrieving and quick checking via tag system | |
CN1120694A (en) | improved pictophonetic Chinese characters code | |
CN1257444C (en) | Complete pronunciation Chinese input method for computer | |
Joshi et al. | A phonemic code based scheme for effective processing of Indian Languages | |
CN1081353C (en) | Latinized phonetic codes for modern Chinese works | |
CN1405662A (en) | Chinese-character input method of Chinese phonetic alphabet adding with tail code | |
CN106227363A (en) | Accurate encoding of chinese characters on the basis of phonetic and keyboard and input method | |
CN1116336A (en) | Substitution type Chinese phonetic character, word input coding method and keyboard thereof | |
CN101046707A (en) | Input method for Chinese character of first pronunciation | |
CN1200332C (en) | Chinese character sequence code input scheme | |
CN1388430A (en) | Modern Chinese pronunciation input method | |
CN1122913C (en) | Normal encoding input method for Chinese data processing in computer | |
CN1025540C (en) | Double-combination encoding method by use of initial consonants and vowels of Chinese syllables | |
CN1106146A (en) | Computer input method by computer Chinese-character phonology-tone coding and its keyboard | |
CN1048341C (en) | Fuzzy character transtormer | |
CN1202647A (en) | Phonetic Chinese characters | |
CN1673935A (en) | Jiaguwen (inscriptions on bones or tortoise shells of the Shang Dynasty) computer inputting method | |
Pandey | Proposal to encode Dives Akuru in Unicode | |
CN1054930C (en) | Profile phonetic compound code | |
Hussain et al. | PAN localization: A study on collation of languages from developing Asia | |
CN1060363A (en) | Chinese-character shape-sound code | |
Pederson | Systematic phonetics | |
KR0176779B1 (en) | Coding method for hangul character | |
CN86107214A (en) | A kind of Chinese word input method and keyboard thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C01 | Deemed withdrawal of patent application (patent law 1993) | ||
WD01 | Invention patent application deemed withdrawn after publication |