CN1285539A - Chinese character shape symbol input system - Google Patents
Chinese character shape symbol input system Download PDFInfo
- Publication number
- CN1285539A CN1285539A CN 99111733 CN99111733A CN1285539A CN 1285539 A CN1285539 A CN 1285539A CN 99111733 CN99111733 CN 99111733 CN 99111733 A CN99111733 A CN 99111733A CN 1285539 A CN1285539 A CN 1285539A
- Authority
- CN
- China
- Prior art keywords
- character
- chinese character
- pictograph
- word
- parts
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The present invention provides character-looking up method, ordering method, coinaging method and input method for Chinese characters. According to the Chinese character structure and the principle of configuration of Chinese characters it can utilize configuration symbols of Chinese characters to implement configuration symbolic keyboard of Chinese characters. The Chinese character can be converted into character element set and string, said character element can be used as input code element, can be marked on the surface of key and can be formed into correspondent relationship with letter.
Description
The invention belongs to the Chinese information processing field, for Chinese character provides a kind of indexing method, ranking method, method for generating Chinese character, for computing machine provides a kind of Chinese character input method.
Existing technical information sees Chinese patent application " a kind of input method of pictograph and keyboard thereof ", application number: 95110690.2, and it has proposed to adopt the input symbols of character as Chinese character; The Hanzi structure theoretical research relevant can write with reference to me " the character structure of Chinese character " with the present invention, the elementary cell that this article has been discussed physical structure of Chinese characters in detail is a character, and deeply inquired into the layer of structure of Chinese character, provided clear and definite definition and the classification of Chinese character members at different levels, set up stroke, parts, piece spare, the piece group, individual characters etc. have the Chinese character member system of character feature layer by layer, realized the systematicness that Chinese character members at different levels are determined and sorted out, objectivity, the present invention improves last application on this basis, content related to the present invention can must be pointed out that the present invention is not limited by this article with reference to this article.
After numerous domestic and international research of charaters persons analyzes Chinese character pattern, the number of components that obtains just has surprising difference: 105,128,160,166,177,205,250,255,256,297,300,320,344,370,496,500,504,512,588,686, or the like.And the parts that the present invention tentatively finds in 6763 words of GB2310-80 have only about 320 kinds, why have so big difference? reason is many-sided, a chief reason still, different choosing under the criterion, what have has not only selected some parts, also chosen the combination block part, even piece spare combination, what have then is earlier rigid some " preferably " members of having determined, other member has been carried out artificial processing, therefore extensively had the formulation of " human components " and " natural parts " on the coding circle.The present invention wishes not exist between the difficulty or ease of what and memory of member the relation of opposition, preferably can realize the font code scheme that the sound sign indicating number " does not have coding " like that.
The objective of the invention is to by Chinese character is analysed scientifically, knot body configuration principle according to all Chinese characters that comprise letter, numerous two body Chinese characters and Japan, South Korea Chinese character, provide one group of pictograph that is derived from Chinese character itself as basic code name, make Chinese character to convert a string orderly, linear pictograph set to by planar graph in intuitive and convenient ground, for Chinese character information processing provides a kind of maneuverable method.
The invention has the advantages that: it has found the spell shape symbol of Chinese character, can realize the shape-symbol keyboard of Chinese character, and all members are followed identical structural principle and corresponding with pictograph, and memory capacitance is few, easy and simple to handle.
The present invention realizes by the following method:
Character is the pictograph of Chinese character, simple in structure, body standard, less, the easy memorize of number, corresponding the composition member of Chinese character, can be used as the input symbols of Chinese character, can set up the character attribute dictionary of Chinese character according to the body characteristics of Chinese character, can extract the character code that character information is weaved into Chinese character, and character can be set up corresponding relation with the key position, can also be used to marking keyboard, adopt such Word elements keyboard to import Chinese character by knocking the character code.
Character is changed in quality by Chinese character " field ", has reflected the upward 64 kind states of each straight-line segment under difference choice situation of Chinese character " field ", and basic structure has
And the various orientation diagrams of these 19 kinds of structural units.Through arrangement, character is divided into 3 classes, totally 55 kinds:
For according to head and the tail always, not only science but also easily principle decompose all Chinese characters, the present invention chooses the basic building block that parts decompose as Chinese character.
Having only " simple substance " of parts also few in the Chinese character, is " potpourri " that some parts are put together mostly.To from " potpourri ", " simple substance " segregation be come out, at first will understand the composition of " potpourri ".Single combination block part is exactly the simplest " potpourri ", and more complicated Chinese character can resolve into several piece spares earlier, one by one parts is emanated out then.This shows that the parts segregation is to the very natural decomposition of Chinese character, splitting with the parts of general font code is different notions.
Generally speaking, the decomposition of Chinese character only need be followed a criterion: according to the first sum of sequencing segregation of each parts.For example:
Definition according to parts, whole component enumerates of Chinese character are come out to there is no need fully, and the new parts of the unavoidable appearance of the Chinese character of newly making in the future, but for the ease of using, the present invention still sorts out according to its zeroth order character at the parts of 6763 Chinese characters among the GB2310-80, can be referring to subordinate list, other unlisted parts can be according to identical methods analyst.
Parts can be subdivided into two classes according to its character feature:
(1) first this base part of shape parts is close with the character character, can directly get corresponding character code according to its shape, can sort out in view of the above with first form parts.For example:
(2) this base part of parts of deriving is close with the character of deriving, and has multi-level character feature.Have identical zeroth order character with unit's parts of deriving, can sort out in view of the above.For example:
Can describe the structure of parts with character formula (character box), give some instances below.
The character formula of parts is actually a kind of coding of parts, and the character formula set of the whole parts of Chinese character constitutes the character attribute dictionary of Chinese character.
Chinese character is emanated out according to the first sum of priority of each parts behind the parts, imports all or part of element of each parts character formula successively and can import corresponding Chinese character (bracket can omit).For example:
In: ten (mouthfuls Shu) or ten mouthfuls of Shu or ten mouthfuls
Or ten Shu or mouthful Shu
Dash: 20 (mouthfuls Shu) or 20 mouthfuls of Shu or 20 mouthfuls
Or 20 Shu or two mouthfuls of Shu
Generally speaking, Chinese character is imported the zeroth order character of each parts successively and can be imported corresponding Chinese character after being segregated into the combination of parts according to the first sum of priority of each parts.For example:
According to the present invention's statistics, 4.05 parts of the average every word of primary word among the GB GB2310-80,3.26 parts of the average every word of preceding 1000 high frequency words.The mean code length of component coding is shorter, general no more than four yards, therefore for the Chinese character that is no less than four parts, can choose the zeroth order character of each parts, usually only get one, two, three, the zeroth order character of last parts, for the individual character that is less than four parts, except the zeroth order character of choosing each parts, can consider to append the replacement character of the parts of deriving, also can consider to append the font character of individual character as auxiliary symbol.
Imitate rule for improving input, word also can adopt character element code input computing machine, and the code length of all kinds of speech all is no more than four yards, can adopt following method:
A. the coding of two-character word=first word, one or two yards+second word is one or two yards
B. the coding of three words=one yard example of first word one or two yards+second word the one yard+the 3rd word: impulsive force=dash (20) to hit (soil) power (ten)
Feasibility=can (fourth mouth) row (one) property (river)
C. the coding of the above speech of four words=one yard example of first word, one yard+last word of one yard+second word the one yard+the 3rd word: groundless=as not have that (ten) lifes (soil) have (ten) in (doing)
The People's Republic of China (PRC)=in (ten) China (fourth) people
State's (mouth)
D. the single part word can only be got one yard in two-character word three words, also can append auxiliary symbol and supply code length.Example: floating=as to float (three workers) floats (30)
Picture album=picture (field) volume (ten)
Picture album=picture (field) volume (ten Jiong)
Picture album=picture (field) volume (11)
Chinese=in (ten Shu) state (mouth)
The pairing member of character " ten " is more, and to have a single order character at least be the one dimension character for the character formula of member wherein to have 5 classes to derive, and can deriving with one-level respectively according to this situation, " Nian thirty for character
Feng Jing " corresponding corresponding member and as code element, these code elements also can be elected to be the pictograph of Chinese character.For example
Part: Ren (T) ox
Chinese character had oneself one the cover pictograph, so we can design the input Chinese character Chinese keyboard, on the key face, identify pictograph, by the input Chinese character the pictograph code import Chinese character.
By common western language keypad input Chinese character, to set up the corresponding relation between the character string that Chinese character and the Latin alphabet constitute usually, that this corresponding relation requires is directly perceived, nature, simply, and the character input method can reach this requirement.At first, character element code itself is exactly a kind of character string, and character can be identified on the key face as letter fully; Secondly, can also set up a kind of corresponding relation between character and the letter, nearly 55 of characters, letter has only 26.This corresponding relation can not be corresponding one by one, and more impossible is unique.
A kind of scheme is provided below the present invention, and for the people who knows English key face, using Chinese Word elements keyboard input word primitive encoding and beaing letter is the one thing basically.Following corresponding relation set up in character and letter:
Letter " IRPS " does not have corresponding character, can arrange them corresponding with code element " Feng Feng ".
Another kind scheme preferably also is provided below the present invention, and letter and pictograph are set up following corresponding relation:
This scheme all is arranged in the zero dimension character on the key, because the zero dimension parts all are straight-line segments, capitalization " I " also is a straight-line segment, and lowercase " i " also has a point.
The input of the compatible Chinese character pictograph and the Latin alphabet on same keyboard, this is only real Chinese and western languages keyboard, can the compatible Chinese phonetic alphabet with such Chinese characters for keyboard inputting, two kinds of basic skills are arranged:
A kind of method is the shape phonetic input method: pictograph code+note code
For example: thousand=do+QIAN
Another kind method is a tone-form input method: note code+pictograph code
For example: thousand=QIAN+ does
The Hanzi component detail list
Claims (10)
1. a Chinese character pictograph input method is characterized in that adopting the input symbols of the pictograph of Chinese character as Chinese character, and the pictograph correspondence the composition member of Chinese character, and pictograph can be set up corresponding relation with the key position, imports Chinese character by input pictograph code.
3. according to a kind of Chinese character input method of claim 1, it is characterized in that choosing the basic building block of parts as encode Chinese characters for computer, the structure of parts can be expressed with the character formula, Chinese character decomposes according to the first sum of priority of each parts, imports all or part of element of each parts character formula successively and can import corresponding Chinese character.
4. according to claim 1, a kind of Chinese character input method of 2 and 3, it is characterized in that choosing the basic code element of the zeroth order character of each parts as the Chinese character input, for the Chinese character that is no less than four parts, usually only get one, two, three, the zeroth order character of last parts, for the individual character that is less than four parts, can append the replacement character of the parts of deriving, the font character that also can append individual character is as auxiliary symbol.
5. according to claim 1,2, a kind of Chinese character input method of 3 and 4, it is characterized in that word adopts character element code input computing machine: the single part word can only be got one yard in the coding of the coding of the coding of a. two-character word=one or two yards b. three words of first word, one or two yards+second word=one yard above speech of c. four words of first word, one or two yards+second word the one yard+the 3rd word=one yard d. two-character word of first word, one yard+last word of one yard+second word the one yard+the 3rd word, three words, also can append auxiliary symbol and supply code length.
6. according to a kind of Chinese character input method of claim 1 and 3, " European-allies thirty to it is characterized in that increase
Feng Jing " as the pictograph of Chinese character.
7. according to a kind of Chinese character shape-symbol keyboard of claim 1, the pictograph that it is characterized in that adopting Chinese character is as key unit, and the pictograph of sign Chinese character is set up the corresponding relation between the pictograph and the Latin alphabet on the key face, uses common western language keypad input Chinese character.
8. according to a kind of Chinese character input method of claim 1 and 7, it is characterized in that letter and pictograph set up following corresponding relation:
9. according to a kind of Chinese character input method of claim 1 and 7, it is characterized in that letter and pictograph set up following corresponding relation:
10. according to a kind of Chinese character input method of claim 1, it is characterized in that the input of compatible Chinese character pictograph and the Latin alphabet on same keyboard, can compatible Chinese phonetic alphabet input Chinese character with such keyboard, two kinds of basic skills are arranged:
(1) shape phonetic input method: pictograph code+note code
(2) tone-form input method: note code+pictograph code
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 99111733 CN1285539A (en) | 1999-08-20 | 1999-08-20 | Chinese character shape symbol input system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 99111733 CN1285539A (en) | 1999-08-20 | 1999-08-20 | Chinese character shape symbol input system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1285539A true CN1285539A (en) | 2001-02-28 |
Family
ID=5275259
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 99111733 Pending CN1285539A (en) | 1999-08-20 | 1999-08-20 | Chinese character shape symbol input system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1285539A (en) |
-
1999
- 1999-08-20 CN CN 99111733 patent/CN1285539A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1285539A (en) | Chinese character shape symbol input system | |
CN1162767C (en) | Square round classify pictographic code | |
CN1284066C (en) | Three strokes code method for inputting Chinese characters into computer as well as its keyboard | |
CN1062667C (en) | All spelling form guide code Chinese character input system | |
CN1164982C (en) | Yi-code input method for Chinese characters | |
CN1818836A (en) | Fast and convenient inputting method with code number and pictograph | |
CN1142474C (en) | Dictionary code Chinese character input method | |
CN1178121C (en) | Double Chinese character stroke order-radical input system | |
CN1458566A (en) | Chinese character plain code input method | |
CN1068444C (en) | Method of Chinese-character coding | |
CN1043381C (en) | Four-stroke digit look-up method for Chinese characters | |
CN1503111A (en) | Four corner number based Chinese character input method and keyboard thereof | |
CN1052799C (en) | Chinese character coding method and its keyboard | |
CN1047676C (en) | Square-circular code entering method for Chinese characters | |
CN1041231A (en) | Digitally-indexing method for chinese characters | |
CN1124366A (en) | Chinese character natural component coding | |
CN1167296A (en) | Four-stroke Chinese character code | |
CN1710524A (en) | Three-step all-round code Chinese character input method and keyboard thereof | |
CN85105556A (en) | Chinese character outline symbol and grapheme (being parts) sorting and coding method | |
CN1099494A (en) | Chinese character coding method and keyboard by identifying initial consonant and stroke of component | |
CN1360246A (en) | Chinese-charactre digitalized encode and its application | |
CN1302009A (en) | character shape and element code for Chinese character input in computer and its keyboard | |
CN1160243A (en) | Character shape stroke order code Chinese character entering system and keyboard thereof | |
CN1125336A (en) | Chinese character Chinese code computer keyboard input method | |
CN1409191A (en) | Improved 111 Chinese character coding method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |