CN1138198C - Chinese-character encoding method-'Qianli Code' - Google Patents
Chinese-character encoding method-'Qianli Code' Download PDFInfo
- Publication number
- CN1138198C CN1138198C CNB001220918A CN00122091A CN1138198C CN 1138198 C CN1138198 C CN 1138198C CN B001220918 A CNB001220918 A CN B001220918A CN 00122091 A CN00122091 A CN 00122091A CN 1138198 C CN1138198 C CN 1138198C
- Authority
- CN
- China
- Prior art keywords
- stroke
- chinese character
- strokes
- code
- pen
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Abstract
The present invention relates to a coding method for Chinese character picture-sound codes ('smart codes'). The present invention divides strokes into four classes of a vertical stroke, a horizontal stroke, a slanting stroke and a twist stroke according to the coordinate direction for writing the Chinese character strokes. Then, the quantity of the four strokes of each Chinese character is calculated (the quantity is only calculated to six strokes) or a Chinese character code-the smart code of a digital-letter code and a complete letter code is formed for keyboard input by the method that a digit is converted into a letter, a main code is formed according to arrangement in the order of the vertical stroke, the horizontal stroke, the slanting stroke and the twist stroke, and an auxiliary code is a first letter of phonation of the letter.
Description
Technical field under technical field the present invention is the encode Chinese characters for computer computer input method.
The background technology encode Chinese characters for computer, numerous, be divided into font code, sound sign indicating number, Shape-pronunciation code and serial number four big classes basically.Font code and Shape-pronunciation code are that Chinese character is decomposed into several specific radicals by which characters are arranged in traditional Chinese dictionaries or " parts " artificially basically, and give its code name, weave into sign indicating number in proper order according to certain regulation.Remember one, 200 artificial parts of setting and skillfully convert to code name again button get the whole process of Chinese character, training without certain degree is not wield, and need not the coding method remembered at present, often repeated code is too many, and will give all auxiliary codes.Therefore, the present situation of encode Chinese characters for computer is " beating of easily learning is unhappy, and the difficulty of beating is soon remembered ".When searching a Chinese character with traditional radicals by which characters are arranged in traditional Chinese dictionaries in dictionary, need through following 6 steps: position (3) → calculatings radicals by which characters are arranged in traditional Chinese dictionaries of determining these radicals by which characters are arranged in traditional Chinese dictionaries of radicals by which characters are arranged in traditional Chinese dictionaries (1) → calculating part first stroke (2) → find are stroke (4) → find the position (5) → at last of this word to find this word (6) outward.
Summary of the invention the present invention relates to a kind of according to Chinese character being decomposed into the most basic unit one stroke, and according to the coordinate direction that stroke is write be divided into perpendicular, horizontal, oblique, the folding four kinds of forms of a stroke or a combination of strokes, calculate the quantity of these four kinds of forms of a stroke or a combination of strokes again, digital conversion can be become letter simultaneously, be arranged in order into numeral and make primary key, the initial that adds the phonetic of this word is made secondary sign indicating number, forms the coding method of a kind of numeral-character code or full character code, and this sign indicating number is referred to as " a thousand li sign indicating number ".
The invention provides a kind of is the coding method of feature with natural, easy, standard, both need not any memory, understands that at a glance repeated code seldom can be realized beating soon; Looking up the dictionary with this coding method only need 2 step (even you do not know the pronunciation of this word, finding this word also to be easy in identical primary key few in number).The concrete coding situation of 6763 Chinese characters is in the GB baseset: 5257 of gross yards, the having of one yard one word 4006 (accounting for 75.94%) wherein, the having of one yard two word 888 (accounting for 16.83%), one yard three word has 276 (accounting for 5.23%), one yard four word has 75 (accounting for 1.42%), one yard five word has 24 (accounting for 1.46%), and one yard six word has 5 (accounting for 0.095%), and one yard six word has 1 (accounting for 0.019%).
Stroke to more than 10400 Chinese character (polyphone is by a plurality of words) of including in " Xinhua dictionary " (version in 1998, down with) is done rough estimates, and total stroke number is about 123,600, and promptly the stroke number of average each Chinese character is 11.9, promptly near 12 strokes.To the statistics of 3736 Chinese characters in the GB baseset (GB2312-80), total stroke number is 71778 strokes, and promptly the stroke number of average each Chinese character is 10.614, promptly near 10.6 strokes.The present invention finds a kind of Chinese character stroke number that calculates to add the code input method for computor that first letter of pinyin is " a thousand li sign indicating number " of feature from this point.
Specific implementation method 1, the stroke of Chinese character is divided into six kinds of horizontal, vertical, points, left-falling stroke, right-falling stroke, folding usually.The present invention moves towards four kinds of forms of a stroke or a combination of strokes of boil down to the basic strokes of Chinese character when writing on coordinate axis: along the vertical pen of ordinate capwise, along the flat pen of horizontal ordinate level trend, edge oblique pen that straight line (or near straight line) moves towards between the coordinate in length and breadth and writing process change the curved pen of direction.Vertical pen comprises the perpendicular pen of common theory, and flat pen comprises the horizontal pen of common theory, tiltedly pen comprise common theory point, left-falling stroke, press down, carry (these strokes usually are difficult to distinguish), curved pen comprises common theory variously bends, rolls over and collude pen.For the needs of traditional habit, will hang down, put down, tiltedly, curved be called perpendicular, horizontal, oblique, roll over four kinds of forms of a stroke or a combination of strokes.
According to rough estimates to more than 10400 Chinese character in " Xinhua dictionary ", in about 123,600 stroke altogether, perpendicular pen has about 22,200, horizontal pen has about 34,700, oblique pen about 43,500, rolling over pen has about 23,000, that is to say that perpendicular, horizontal, oblique, four kinds of forms of a stroke or a combination of strokes of folding of average each Chinese character are respectively 2.1,3.3,4.2 and 2.2 strokes.According to statistics to 6763 Chinese characters in the GB baseset (GB2312-80), in 71778 stroke altogether, perpendicular pen is 12621 strokes, horizontal pen is 19976 strokes, tiltedly pen is 25366 strokes, folding pen is 13815 strokes, that is to say average each Chinese character perpendicular, horizontal, tiltedly, four kinds of forms of a stroke or a combination of strokes of folding are respectively 1.866,2.954,3.751 and 2.043 strokes or approximate 1.7,3.0,3.8 and 2.0 strokes.
2. the arrangement of stroke is by order rather than order horizontal, vertical routinely, oblique, folding perpendicular, horizontal, oblique, folding.Coming perpendicular pen crosswise, the reason of a front is the frequency that occurs in Chinese character according to these strokes.According to rough estimates, in more than 10400 Chinese characters to " Xinhua dictionary ":
The Chinese character that does not have perpendicular pen has 1345;
The Chinese character that does not have horizontal pen has 513 (wherein 259 appear in the Chinese character that does not have perpendicular pen);
Not having tiltedly, the Chinese character of pen has 368 (wherein having only 25 occurs not having in the Chinese character of perpendicular pen);
The Chinese character that does not have the folding pen has 637,000 (wherein having only 82 appears in the Chinese character that does not have perpendicular pen).In 6763 Chinese characters of GB baseset, above-mentioned Chinese character has 1065,418,275,518 respectively.In the Chinese character of 418 horizontal pens of nothing, have 225 to appear in the Chinese character of perpendicular of nothing; And do not have in the Chinese character of rolling over pen at 518, have only 61 to appear in the Chinese character that does not have perpendicular pen.
By perpendicular, horizontal, tiltedly, during the folding series arrangement, first of front is that zero coding has 1345, the first two is that zero coding has 259.In input during these encodes Chinese characters for computer, zero of front can be saved and be reduced stroke like this, adopt perpendicular, horizontal, tiltedly, the folding sequential encoding can shorten code length to greatest extent, thereby improve Chinese character input speed, looked after people's custom simultaneously again.
3. encode Chinese characters for computer of the present invention is divided into primary key and secondary sign indicating number.Primary key has numerical code.Perpendicular, horizontal, oblique, folding stroke number that the numeral primary key is this Chinese character are arranged in order composition.Numeral can also letter representation, and left hand key f, d, s represent odd number 1,3,5 respectively, and right hand key j, k, l represent even number 2,4,6 (this is also consistent with China traditional " lower-left is upper right " principle) respectively; O is similar to 0, represents 0 with the o key.Constitute alphabetical primary key like this.Secondary sign indicating number is this Chinese character phonetic initial letters, and initial consonant is that ch gets that c, sh get s, zh gets z, and the difficulty that this both can have been avoided cacoepy to bring can shorten code length again.When if this word is the polyphone of different prefixes, a plurality of secondary sign indicating numbers can be arranged, therefore a plurality of different codings are just arranged, during input, available wherein any.
4. calculating perpendicularly, horizontal, tiltedly, during four kinds of strokes of folding, only need be calculated to 6 strokes, surpassing 6 strokes and still do 6 strokes of processing.Along with the further simplification of Chinese character, even can only be calculated to 5 strokes.According to the statistics of more than 10400 Chinese character in " Xinhua dictionary ", perpendicular pen surpasses 6 strokes Chinese character and has only 101, and horizontal pen surpasses 6 strokes Chinese character 747, and tiltedly pen surpasses 6 strokes more, is 1684, and the folding pen surpasses 6 strokes minimum, is 48.In the GB baseset, surpass 6 strokes of Chinese characters perpendicular, horizontal, oblique, folding and have only 21,259,762,10 respectively.For simple, quick, practical, only need be calculated to 6 strokes, surpass 6 strokes, still make 6 strokes of meters.The numeral of primary key has only seven numerals of 0-6, converts letter to and also has only seven letters.Simple to operate.
5. the present invention is applicable to simplified and coding unsimplified Hanzi, and also the simplified character code of available correspondence is added " letter-numerous " word hand over word when the input complex form of Chinese characters.
6. the present invention also is applicable to the coding of speech and phrase, only the major and minor sign indicating number of first (or one of them) Chinese character need be added that at this moment the secondary sign indicating number of back (or other) Chinese character gets final product.
7. the full character code among the present invention is applicable to Chinese character information processing, and numeral-character code both had been applicable to that the retrieval (as the retrieval of dictionary) of Chinese character also was applicable to Chinese character information processing.
The present invention can be illustrated with the following examples:
Embodiment 1:
That Chinese character " class " word has respectively is 2 perpendicular, 3 horizontal strokes, 3 tiltedly (each one of point, left-falling stroke, right-falling strokes), 2 foldings, and pronunciation " ke " is encoded to 2332k, is convertible into the full character code of jddjk.
Embodiment 2:
Chinese character " " word have respectively 0 perpendicular, 0 horizontal stroke, 0 tiltedly, 2 foldings, promptly 0002, and " 0 " of front needn't write, primary key is 2, paying sign indicating number is 1, all-key is 21, is convertible into the full character code of jl.
Embodiment 3:
Chinese character " just " word, perpendicular, horizontal, oblique, broken number is respectively 2,3,0,0, and primary key is 2300, and zero of back can not be saved.Paying sign indicating number is z, and all-key is 2300z, is convertible into the full character code of jdooz.
Embodiment 4:
That Chinese character " jar " word has respectively is perpendicular, horizontal, oblique, folding 8,9,3,3, only need be calculated to 6 during calculating, so primary key is 6633, and all-key is 6633g, is convertible into the full character code of llddg.
Embodiment 5:
Chinese character " gal " pronunciation has three kinds: " ga " (gamma ray), " jia " (Galileo), " qie " (agalloch eaglewood).Therefore, its coding also has three, that is: 2122g, and 2122j, 2122q can convert jfjjg respectively to, jfjjj, the full character code of jfjjq.
As stated above with the numerical code of Chinese character (perhaps convert numeral 1,2,3,4,5,6,0 to alphabetical f respectively, j, d, k, s, l, o) and the first letter of pinyin order impact keyboard and promptly finish the Chinese characters in computer input, need not memory, simple and fast.
Claims (1)
1. a thousand li sign indicating number---a kind of easy encode Chinese characters for computer computer input method is characterized in that taking following input step:
(1) Chinese character is decomposed into the most basic unit---stroke,
Angle from science, Chinese character stroke is a kind of vector, except the branch of length is arranged, the different of direction more arranged, therefore, can determine and classify according to the movement locus of stroke writing process on the planimetric coordinates axle, stroke is erected pen along being classified as of ordinate y direction of principal axis operation in writing process, along the horizontal pen of being classified as of abscissa axis x direction of principal axis operation, basically the merger that does not change traffic direction along operation between x and the y axle is oblique pen, in writing process, change being referred to as of running orbit and roll over pen, simultaneously, adopt the scientific approach of " straight line approaches " and consider traditional classification of Chinese character stroke, these strokes that traditional " point; cast aside; press down; carry " often is difficult to be distinguished are according to above-mentioned rule, in the lump as tiltedly; Various " bend, roll over, collude " merger is the folding pen;
(2) according to above-mentioned rule, calculate the quantity of these four kinds of strokes of Chinese character respectively, become the numerical code primary key according to series arrangement perpendicular, horizontal, oblique, folding, if numeral in front is zero, then can save, shorten code length, reduce stroke, just because of this consideration,, the stroke ordering is perpendicular, horizontal, oblique, folding according to the frequency of Chinese character stroke appearance;
(3) Chinese character phonetic initial letters is combined as secondary sign indicating number and primary key become complete kanji code, why a thousand li sign indicating number adopts initial and without initial consonant, is a difficult point of considering in the Chinese-character pronunciation; Z and zh, s and sh, c and ch usually are not easily distinguishable, and adopt initial can also shorten code length, reduce stroke, kill two birds with one stone;
(4) when calculating four kinds of strokes of Chinese character, every kind of stroke only need be calculated to 6 strokes, if stroke number surpasses 6 then needn't count again, does 6 strokes of processing;
(5) press the numerical code-primary key of Chinese character, and first letter of pinyin-pair sign indicating number, the numerical key of the upper keyboard that uses a computer and letter key order keystroke input computing machine; Simultaneously, a thousand li sign indicating number can also be used as Chinese character index, realizes that Chinese character for computer is handled and Chinese character index is integrated;
(6) personage for the ease of custom alphabetic keypad operation handles literal, numeral 1,2,3,4,5,6,0 can be used f respectively, d, and k, s, l, o replaces, i.e. left hand key f, d, s replace numeral 1,3,5 respectively; Right hand key j, k, l replace numeral 2,4,6 respectively; O replaces 0, has so just become full character code, and these letters are exactly the English alphabet on the computer keyboard, can realize touch system.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB001220918A CN1138198C (en) | 2000-08-27 | 2000-08-27 | Chinese-character encoding method-'Qianli Code' |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB001220918A CN1138198C (en) | 2000-08-27 | 2000-08-27 | Chinese-character encoding method-'Qianli Code' |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1317735A CN1317735A (en) | 2001-10-17 |
CN1138198C true CN1138198C (en) | 2004-02-11 |
Family
ID=4589083
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB001220918A Expired - Fee Related CN1138198C (en) | 2000-08-27 | 2000-08-27 | Chinese-character encoding method-'Qianli Code' |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1138198C (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9012912B2 (en) | 2013-03-13 | 2015-04-21 | Taiwan Semiconductor Manufacturing Company, Ltd. | Wafers, panels, semiconductor devices, and glass treatment methods |
-
2000
- 2000-08-27 CN CNB001220918A patent/CN1138198C/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1317735A (en) | 2001-10-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1138198C (en) | Chinese-character encoding method-'Qianli Code' | |
CN105912139B (en) | Method for correspondingly recognizing modular stroke coding Chinese characters | |
CN1019696B (en) | Eight first sounds (fool) code Chinese character input method | |
CN1022781C (en) | Encoding method of Chinese character strokes | |
CN1027839C (en) | Chinese character encoding input method | |
CN1032986C (en) | Chinese-character stroke order code enter method and its keyboard | |
CN100378624C (en) | Chinese character Yin-Yang bipolar code input system and single-hand input keyboard | |
CN1254122A (en) | Stroke-form and radical mixed Chinese character digital code input method and keyboard | |
CN100511112C (en) | Latin type five-stroke input method for Chinese character | |
CN1460913A (en) | One-code two-form quick Chinese digital coding input method | |
CN1063856C (en) | Keyboard and method for computer input of character-separated phonetic transcriptions | |
CN1164982C (en) | Yi-code input method for Chinese characters | |
CN1241100C (en) | Same sound shape digital code | |
CN1239784A (en) | Chinese number inputting method and keyboard | |
CN1040259C (en) | Two-stroke coding method and two stroke keyboard | |
CN1252555A (en) | Three-Three phonetic code and Three-Three digital code | |
CN1139024C (en) | Chinese character L-code input system and keyboard | |
CN1375763A (en) | Chinese character encoding method grouping in consonants | |
CN1885241A (en) | Chinese character input method capable of reducing candidate characters: phonetic coding and stroke coding | |
CN1149732A (en) | Initial consonant, simple or compound vowel and tone stroke Chinese character coding method | |
CN1095832A (en) | Two code indexing coding method | |
CN1018774B (en) | Chinese-character and symbol encode method based on pattern, pronunciation and symbol and keyboard thereof | |
CN85102161A (en) | The computer " full acoustic encoding " Chinese-character input scheme | |
CN1304075A (en) | Nanural phonetic configuration code computer Chiense character code input method | |
CN1099494A (en) | Encoding method for identification Chinese by initial consonant and strokes and keyboard thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |