CN1138198C - Chinese-character encoding method-'Qianli Code' - Google Patents

Chinese-character encoding method-'Qianli Code' Download PDF

Info

Publication number
CN1138198C
CN1138198C CNB001220918A CN00122091A CN1138198C CN 1138198 C CN1138198 C CN 1138198C CN B001220918 A CNB001220918 A CN B001220918A CN 00122091 A CN00122091 A CN 00122091A CN 1138198 C CN1138198 C CN 1138198C
Authority
CN
China
Prior art keywords
stroke
chinese character
strokes
code
pen
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB001220918A
Other languages
Chinese (zh)
Other versions
CN1317735A (en
Inventor
钟小先
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNB001220918A priority Critical patent/CN1138198C/en
Publication of CN1317735A publication Critical patent/CN1317735A/en
Application granted granted Critical
Publication of CN1138198C publication Critical patent/CN1138198C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Abstract

The present invention relates to a coding method for Chinese character picture-sound codes ('smart codes'). The present invention divides strokes into four classes of a vertical stroke, a horizontal stroke, a slanting stroke and a twist stroke according to the coordinate direction for writing the Chinese character strokes. Then, the quantity of the four strokes of each Chinese character is calculated (the quantity is only calculated to six strokes) or a Chinese character code-the smart code of a digital-letter code and a complete letter code is formed for keyboard input by the method that a digit is converted into a letter, a main code is formed according to arrangement in the order of the vertical stroke, the horizontal stroke, the slanting stroke and the twist stroke, and an auxiliary code is a first letter of phonation of the letter.

Description

A thousand li sign indicating number---a kind of easy encode Chinese characters for computer computer input method
Technical field under technical field the present invention is the encode Chinese characters for computer computer input method.
The background technology encode Chinese characters for computer, numerous, be divided into font code, sound sign indicating number, Shape-pronunciation code and serial number four big classes basically.Font code and Shape-pronunciation code are that Chinese character is decomposed into several specific radicals by which characters are arranged in traditional Chinese dictionaries or " parts " artificially basically, and give its code name, weave into sign indicating number in proper order according to certain regulation.Remember one, 200 artificial parts of setting and skillfully convert to code name again button get the whole process of Chinese character, training without certain degree is not wield, and need not the coding method remembered at present, often repeated code is too many, and will give all auxiliary codes.Therefore, the present situation of encode Chinese characters for computer is " beating of easily learning is unhappy, and the difficulty of beating is soon remembered ".When searching a Chinese character with traditional radicals by which characters are arranged in traditional Chinese dictionaries in dictionary, need through following 6 steps: position (3) → calculatings radicals by which characters are arranged in traditional Chinese dictionaries of determining these radicals by which characters are arranged in traditional Chinese dictionaries of radicals by which characters are arranged in traditional Chinese dictionaries (1) → calculating part first stroke (2) → find are stroke (4) → find the position (5) → at last of this word to find this word (6) outward.
Summary of the invention the present invention relates to a kind of according to Chinese character being decomposed into the most basic unit one stroke, and according to the coordinate direction that stroke is write be divided into perpendicular, horizontal, oblique, the folding four kinds of forms of a stroke or a combination of strokes, calculate the quantity of these four kinds of forms of a stroke or a combination of strokes again, digital conversion can be become letter simultaneously, be arranged in order into numeral and make primary key, the initial that adds the phonetic of this word is made secondary sign indicating number, forms the coding method of a kind of numeral-character code or full character code, and this sign indicating number is referred to as " a thousand li sign indicating number ".
The invention provides a kind of is the coding method of feature with natural, easy, standard, both need not any memory, understands that at a glance repeated code seldom can be realized beating soon; Looking up the dictionary with this coding method only need 2 step (even you do not know the pronunciation of this word, finding this word also to be easy in identical primary key few in number).The concrete coding situation of 6763 Chinese characters is in the GB baseset: 5257 of gross yards, the having of one yard one word 4006 (accounting for 75.94%) wherein, the having of one yard two word 888 (accounting for 16.83%), one yard three word has 276 (accounting for 5.23%), one yard four word has 75 (accounting for 1.42%), one yard five word has 24 (accounting for 1.46%), and one yard six word has 5 (accounting for 0.095%), and one yard six word has 1 (accounting for 0.019%).
Stroke to more than 10400 Chinese character (polyphone is by a plurality of words) of including in " Xinhua dictionary " (version in 1998, down with) is done rough estimates, and total stroke number is about 123,600, and promptly the stroke number of average each Chinese character is 11.9, promptly near 12 strokes.To the statistics of 3736 Chinese characters in the GB baseset (GB2312-80), total stroke number is 71778 strokes, and promptly the stroke number of average each Chinese character is 10.614, promptly near 10.6 strokes.The present invention finds a kind of Chinese character stroke number that calculates to add the code input method for computor that first letter of pinyin is " a thousand li sign indicating number " of feature from this point.
Specific implementation method 1, the stroke of Chinese character is divided into six kinds of horizontal, vertical, points, left-falling stroke, right-falling stroke, folding usually.The present invention moves towards four kinds of forms of a stroke or a combination of strokes of boil down to the basic strokes of Chinese character when writing on coordinate axis: along the vertical pen of ordinate capwise, along the flat pen of horizontal ordinate level trend, edge oblique pen that straight line (or near straight line) moves towards between the coordinate in length and breadth and writing process change the curved pen of direction.Vertical pen comprises the perpendicular pen of common theory, and flat pen comprises the horizontal pen of common theory, tiltedly pen comprise common theory point, left-falling stroke, press down, carry (these strokes usually are difficult to distinguish), curved pen comprises common theory variously bends, rolls over and collude pen.For the needs of traditional habit, will hang down, put down, tiltedly, curved be called perpendicular, horizontal, oblique, roll over four kinds of forms of a stroke or a combination of strokes.
According to rough estimates to more than 10400 Chinese character in " Xinhua dictionary ", in about 123,600 stroke altogether, perpendicular pen has about 22,200, horizontal pen has about 34,700, oblique pen about 43,500, rolling over pen has about 23,000, that is to say that perpendicular, horizontal, oblique, four kinds of forms of a stroke or a combination of strokes of folding of average each Chinese character are respectively 2.1,3.3,4.2 and 2.2 strokes.According to statistics to 6763 Chinese characters in the GB baseset (GB2312-80), in 71778 stroke altogether, perpendicular pen is 12621 strokes, horizontal pen is 19976 strokes, tiltedly pen is 25366 strokes, folding pen is 13815 strokes, that is to say average each Chinese character perpendicular, horizontal, tiltedly, four kinds of forms of a stroke or a combination of strokes of folding are respectively 1.866,2.954,3.751 and 2.043 strokes or approximate 1.7,3.0,3.8 and 2.0 strokes.
2. the arrangement of stroke is by order rather than order horizontal, vertical routinely, oblique, folding perpendicular, horizontal, oblique, folding.Coming perpendicular pen crosswise, the reason of a front is the frequency that occurs in Chinese character according to these strokes.According to rough estimates, in more than 10400 Chinese characters to " Xinhua dictionary ":
The Chinese character that does not have perpendicular pen has 1345;
The Chinese character that does not have horizontal pen has 513 (wherein 259 appear in the Chinese character that does not have perpendicular pen);
Not having tiltedly, the Chinese character of pen has 368 (wherein having only 25 occurs not having in the Chinese character of perpendicular pen);
The Chinese character that does not have the folding pen has 637,000 (wherein having only 82 appears in the Chinese character that does not have perpendicular pen).In 6763 Chinese characters of GB baseset, above-mentioned Chinese character has 1065,418,275,518 respectively.In the Chinese character of 418 horizontal pens of nothing, have 225 to appear in the Chinese character of perpendicular of nothing; And do not have in the Chinese character of rolling over pen at 518, have only 61 to appear in the Chinese character that does not have perpendicular pen.
By perpendicular, horizontal, tiltedly, during the folding series arrangement, first of front is that zero coding has 1345, the first two is that zero coding has 259.In input during these encodes Chinese characters for computer, zero of front can be saved and be reduced stroke like this, adopt perpendicular, horizontal, tiltedly, the folding sequential encoding can shorten code length to greatest extent, thereby improve Chinese character input speed, looked after people's custom simultaneously again.
3. encode Chinese characters for computer of the present invention is divided into primary key and secondary sign indicating number.Primary key has numerical code.Perpendicular, horizontal, oblique, folding stroke number that the numeral primary key is this Chinese character are arranged in order composition.Numeral can also letter representation, and left hand key f, d, s represent odd number 1,3,5 respectively, and right hand key j, k, l represent even number 2,4,6 (this is also consistent with China traditional " lower-left is upper right " principle) respectively; O is similar to 0, represents 0 with the o key.Constitute alphabetical primary key like this.Secondary sign indicating number is this Chinese character phonetic initial letters, and initial consonant is that ch gets that c, sh get s, zh gets z, and the difficulty that this both can have been avoided cacoepy to bring can shorten code length again.When if this word is the polyphone of different prefixes, a plurality of secondary sign indicating numbers can be arranged, therefore a plurality of different codings are just arranged, during input, available wherein any.
4. calculating perpendicularly, horizontal, tiltedly, during four kinds of strokes of folding, only need be calculated to 6 strokes, surpassing 6 strokes and still do 6 strokes of processing.Along with the further simplification of Chinese character, even can only be calculated to 5 strokes.According to the statistics of more than 10400 Chinese character in " Xinhua dictionary ", perpendicular pen surpasses 6 strokes Chinese character and has only 101, and horizontal pen surpasses 6 strokes Chinese character 747, and tiltedly pen surpasses 6 strokes more, is 1684, and the folding pen surpasses 6 strokes minimum, is 48.In the GB baseset, surpass 6 strokes of Chinese characters perpendicular, horizontal, oblique, folding and have only 21,259,762,10 respectively.For simple, quick, practical, only need be calculated to 6 strokes, surpass 6 strokes, still make 6 strokes of meters.The numeral of primary key has only seven numerals of 0-6, converts letter to and also has only seven letters.Simple to operate.
5. the present invention is applicable to simplified and coding unsimplified Hanzi, and also the simplified character code of available correspondence is added " letter-numerous " word hand over word when the input complex form of Chinese characters.
6. the present invention also is applicable to the coding of speech and phrase, only the major and minor sign indicating number of first (or one of them) Chinese character need be added that at this moment the secondary sign indicating number of back (or other) Chinese character gets final product.
7. the full character code among the present invention is applicable to Chinese character information processing, and numeral-character code both had been applicable to that the retrieval (as the retrieval of dictionary) of Chinese character also was applicable to Chinese character information processing.
The present invention can be illustrated with the following examples:
Embodiment 1:
That Chinese character " class " word has respectively is 2 perpendicular, 3 horizontal strokes, 3 tiltedly (each one of point, left-falling stroke, right-falling strokes), 2 foldings, and pronunciation " ke " is encoded to 2332k, is convertible into the full character code of jddjk.
Embodiment 2:
Chinese character " " word have respectively 0 perpendicular, 0 horizontal stroke, 0 tiltedly, 2 foldings, promptly 0002, and " 0 " of front needn't write, primary key is 2, paying sign indicating number is 1, all-key is 21, is convertible into the full character code of jl.
Embodiment 3:
Chinese character " just " word, perpendicular, horizontal, oblique, broken number is respectively 2,3,0,0, and primary key is 2300, and zero of back can not be saved.Paying sign indicating number is z, and all-key is 2300z, is convertible into the full character code of jdooz.
Embodiment 4:
That Chinese character " jar " word has respectively is perpendicular, horizontal, oblique, folding 8,9,3,3, only need be calculated to 6 during calculating, so primary key is 6633, and all-key is 6633g, is convertible into the full character code of llddg.
Embodiment 5:
Chinese character " gal " pronunciation has three kinds: " ga " (gamma ray), " jia " (Galileo), " qie " (agalloch eaglewood).Therefore, its coding also has three, that is: 2122g, and 2122j, 2122q can convert jfjjg respectively to, jfjjj, the full character code of jfjjq.
As stated above with the numerical code of Chinese character (perhaps convert numeral 1,2,3,4,5,6,0 to alphabetical f respectively, j, d, k, s, l, o) and the first letter of pinyin order impact keyboard and promptly finish the Chinese characters in computer input, need not memory, simple and fast.

Claims (1)

1. a thousand li sign indicating number---a kind of easy encode Chinese characters for computer computer input method is characterized in that taking following input step:
(1) Chinese character is decomposed into the most basic unit---stroke,
Angle from science, Chinese character stroke is a kind of vector, except the branch of length is arranged, the different of direction more arranged, therefore, can determine and classify according to the movement locus of stroke writing process on the planimetric coordinates axle, stroke is erected pen along being classified as of ordinate y direction of principal axis operation in writing process, along the horizontal pen of being classified as of abscissa axis x direction of principal axis operation, basically the merger that does not change traffic direction along operation between x and the y axle is oblique pen, in writing process, change being referred to as of running orbit and roll over pen, simultaneously, adopt the scientific approach of " straight line approaches " and consider traditional classification of Chinese character stroke, these strokes that traditional " point; cast aside; press down; carry " often is difficult to be distinguished are according to above-mentioned rule, in the lump as tiltedly; Various " bend, roll over, collude " merger is the folding pen;
(2) according to above-mentioned rule, calculate the quantity of these four kinds of strokes of Chinese character respectively, become the numerical code primary key according to series arrangement perpendicular, horizontal, oblique, folding, if numeral in front is zero, then can save, shorten code length, reduce stroke, just because of this consideration,, the stroke ordering is perpendicular, horizontal, oblique, folding according to the frequency of Chinese character stroke appearance;
(3) Chinese character phonetic initial letters is combined as secondary sign indicating number and primary key become complete kanji code, why a thousand li sign indicating number adopts initial and without initial consonant, is a difficult point of considering in the Chinese-character pronunciation; Z and zh, s and sh, c and ch usually are not easily distinguishable, and adopt initial can also shorten code length, reduce stroke, kill two birds with one stone;
(4) when calculating four kinds of strokes of Chinese character, every kind of stroke only need be calculated to 6 strokes, if stroke number surpasses 6 then needn't count again, does 6 strokes of processing;
(5) press the numerical code-primary key of Chinese character, and first letter of pinyin-pair sign indicating number, the numerical key of the upper keyboard that uses a computer and letter key order keystroke input computing machine; Simultaneously, a thousand li sign indicating number can also be used as Chinese character index, realizes that Chinese character for computer is handled and Chinese character index is integrated;
(6) personage for the ease of custom alphabetic keypad operation handles literal, numeral 1,2,3,4,5,6,0 can be used f respectively, d, and k, s, l, o replaces, i.e. left hand key f, d, s replace numeral 1,3,5 respectively; Right hand key j, k, l replace numeral 2,4,6 respectively; O replaces 0, has so just become full character code, and these letters are exactly the English alphabet on the computer keyboard, can realize touch system.
CNB001220918A 2000-08-27 2000-08-27 Chinese-character encoding method-'Qianli Code' Expired - Fee Related CN1138198C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB001220918A CN1138198C (en) 2000-08-27 2000-08-27 Chinese-character encoding method-'Qianli Code'

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB001220918A CN1138198C (en) 2000-08-27 2000-08-27 Chinese-character encoding method-'Qianli Code'

Publications (2)

Publication Number Publication Date
CN1317735A CN1317735A (en) 2001-10-17
CN1138198C true CN1138198C (en) 2004-02-11

Family

ID=4589083

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB001220918A Expired - Fee Related CN1138198C (en) 2000-08-27 2000-08-27 Chinese-character encoding method-'Qianli Code'

Country Status (1)

Country Link
CN (1) CN1138198C (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9012912B2 (en) 2013-03-13 2015-04-21 Taiwan Semiconductor Manufacturing Company, Ltd. Wafers, panels, semiconductor devices, and glass treatment methods

Also Published As

Publication number Publication date
CN1317735A (en) 2001-10-17

Similar Documents

Publication Publication Date Title
CN1138198C (en) Chinese-character encoding method-'Qianli Code'
CN105912139B (en) Method for correspondingly recognizing modular stroke coding Chinese characters
CN1019696B (en) Eight first sounds (fool) code Chinese character input method
CN1022781C (en) Encoding method of Chinese character strokes
CN1027839C (en) Chinese character encoding input method
CN1032986C (en) Chinese-character stroke order code enter method and its keyboard
CN100378624C (en) Chinese character Yin-Yang bipolar code input system and single-hand input keyboard
CN1254122A (en) Stroke-form and radical mixed Chinese character digital code input method and keyboard
CN100511112C (en) Latin type five-stroke input method for Chinese character
CN1460913A (en) One-code two-form quick Chinese digital coding input method
CN1063856C (en) Keyboard and method for computer input of character-separated phonetic transcriptions
CN1164982C (en) Yi-code input method for Chinese characters
CN1241100C (en) Same sound shape digital code
CN1239784A (en) Chinese number inputting method and keyboard
CN1040259C (en) Two-stroke coding method and two stroke keyboard
CN1252555A (en) Three-Three phonetic code and Three-Three digital code
CN1139024C (en) Chinese character L-code input system and keyboard
CN1375763A (en) Chinese character encoding method grouping in consonants
CN1885241A (en) Chinese character input method capable of reducing candidate characters: phonetic coding and stroke coding
CN1149732A (en) Initial consonant, simple or compound vowel and tone stroke Chinese character coding method
CN1095832A (en) Two code indexing coding method
CN1018774B (en) Chinese-character and symbol encode method based on pattern, pronunciation and symbol and keyboard thereof
CN85102161A (en) The computer " full acoustic encoding " Chinese-character input scheme
CN1304075A (en) Nanural phonetic configuration code computer Chiense character code input method
CN1099494A (en) Encoding method for identification Chinese by initial consonant and strokes and keyboard thereof

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee