CN1159029A - Chinese character input method and product thereof - Google Patents

Chinese character input method and product thereof Download PDF

Info

Publication number
CN1159029A
CN1159029A CN 96120740 CN96120740A CN1159029A CN 1159029 A CN1159029 A CN 1159029A CN 96120740 CN96120740 CN 96120740 CN 96120740 A CN96120740 A CN 96120740A CN 1159029 A CN1159029 A CN 1159029A
Authority
CN
China
Prior art keywords
characters
radicals
traditional chinese
stroke
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 96120740
Other languages
Chinese (zh)
Other versions
CN1089175C (en
Inventor
吕奇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 96120740 priority Critical patent/CN1089175C/en
Publication of CN1159029A publication Critical patent/CN1159029A/en
Application granted granted Critical
Publication of CN1089175C publication Critical patent/CN1089175C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The Chinese character input method consists of 'simplified strokes', 'pronunciation parts', 'numerical codes' and 'spelling symbols' and has the functions of four input methods of form code, pictophonetic code, numerical code and phonetic code. Based on the regular character parts, strokes, stroke order and spelling scheme, the present invention has corresponding character structure classification and encoding method. The said four input methods are used separately and jointly.

Description

Synthetic input method of standard radical and stroke and product
The present invention relates to a kind of input method of Chinese character (hereinafter to be referred as " microcode ") in computer code field.
The characteristics of 1-1 " microcode "
" microcode " input method of Chinese character is to the relevant regulations of stroke, the order of strokes observed in calligraphy, Pinyin rule and the radicals by which characters are arranged in traditional Chinese dictionaries of Chinese character, main with reference to " newly organized pupil dictionary " (nineteen eighty-three version and version in 1992), and " Xinhua dictionary " of " Ci hai " of reference version in 1980 and version in 1993 carries out some corrections and additional, except that special explanation, enumerate no longer one by one.
Standardization is one of " microcode " most important outstanding feature.Work out because " microcode " is the current specifications of abideing by Chinese-character writing and reading, thereby be a kind of low repetition rate of coding input method of Chinese character that need not the memory of any key position (please noting: need not any key position and remember these characteristics and only had " the spelling method " of high repeated code to realize originally); Because the coding criterion of " microcode " and used " exempting from memory " cryptoprinciple, thereby, no matter be new hand or professional typist, use " microcode " all to need not hypermnesia and carry on the back firmly, only need possess the above level of grade in the primary school, just can learn and grasp " microcode " easily.
Be fast " microcode " outstanding feature two.At first, " microcode " has low repetition rate of coding index, for example, when pressing the individual character input, only 183 groups of " letter pen " method repeated codes of font code class, 2.9% the repetition rate of coding (this is the static repeated code statistical number to 6763 encodes Chinese characters for computer of two-stage character library), reach fully as a kind of outstanding input method of Chinese character the repetition rate of coding index that should have; Moreover, be the most familiar Scheme for the Chinese Phonetic Alphabet of students in middle and primary schools and 189 standardization radicals by which characters are arranged in traditional Chinese dictionaries because " microcode " adopt, so the thinking switching rate can accelerate to import greatly the time and improve accuracy rate; In addition,, arrangement was carried out in the individual character of " microcode " ordering and group speech meticulously, very helped further improving input speed according to the high frequency principle of priority.
Three of the outstanding feature of " microcode " is to have comparatively complete compatibility.It is made up of " letter pen ", " by the sound ", " number " and " piecing together symbol " four kinds of input method of Chinese character, the keyboard Chinese-character input method that representative and compatible font code, phonetic-stroke code, number and sound sign indicating number are four types, both can independently use wherein a kind ofly, and also can optionally unite just and use; Both can backward compatible I and II Chinese character base, also can upward-compatible Chinese character large character set and a large amount of phrases; Both the Chinese and the Chinese phonetic alphabet can be imported fast, also foreign language can be imported nimblely.
Thereby, " microcode " can be widely used in the demand of each personage of stratum to encode Chinese characters for computer " easy to be quick ", it can suit, and vast adolescent student is universal to be learnt and the grasp computing machine, also can satisfy the scholar of numerous subjects and the demand that press gang use the microcomputer writing; Can suit computer professional's use also can be satisfied the needs of well-trained secretarial personnel high speed touch system.
1-2 " microcode " is to the some regulations and the classification of font structure
One, stroke:
Stroke is the fundamental element (or part) that constitutes single block character." microcode " merges into five kinds of the most basic strokes with 28 kinds of strokes of the new font of regulation in " newly organized pupil dictionary ": horizontal (one), perpendicular (Shu), left-falling stroke (Pie), point (Dian), folding (second), and (1) lists right-falling stroke in a some stroke; (2) list horizontal stroke in carrying; (3) stroke of buckle is listed in the folding stroke.
Two, the order of strokes observed in calligraphy: the stroke order the when order of strokes observed in calligraphy is writing Chinese characters." microcode " defers to the rules for writing of " horizontal earlier back is perpendicular, casts aside afterwards earlier and presses down, and from top to bottom; from outside to inside, seals the first intermediate and then both sides behind the inside earlier ", and be as the criterion with order of strokes observed in calligraphy of regulation in " newly organized pupil dictionary ", indivedual individual character order of strokes observed in calligraphys are contradictory, revised with reference to " Ci hai ".
Three, word portion: word portion be by one as for dry brush intersect to link to each other and to form, be constitute single Chinese character from or accurate from the basic element of character.
We stipulate: when writing individual character, the first sum of cartoon book of each word portion is write the relative order that order is this word portion.
The individual character that only contains a word portion is whole individual character, and comprises 189 radicals by which characters are arranged in traditional Chinese dictionaries of dictionary defined; The individual character that contains two above word portions is a prose style free from parallelism individual character.
Single Chinese character according to mutually high between its stroke or the radicals by which characters are arranged in traditional Chinese dictionaries, continuous, intersect situation and can be divided into one to several primary word portions or accurate word portion.
(1), is the parts of relatively independent state from both non-intersect also not linked to each other between two or more word portions.
(2) link to each other: have a place or many places stroke to be continuous state between the word portion.
(3) intersect: stroke intersects the part that forms in the individual character, regards a primary word portion without exception as.
By prose style free from parallelism type become the word radicals by which characters are arranged in traditional Chinese dictionaries through simplification and radical " Yan, Cannibals, door, Si, shellfish, see, Woo, Yi, bird ... ", together with " Dao, Bing, Xin, Rui, Chuo, Xiangxi " etc., list the radicals by which characters are arranged in traditional Chinese dictionaries of the stroke that do not link to each other together in.
Four, the grade of word portion is with preferential:
Become the word radicals by which characters are arranged in traditional Chinese dictionaries
The regulation radicals by which characters are arranged in traditional Chinese dictionaries
Non-word radicals by which characters are arranged in traditional Chinese dictionaries standard radicals
The distortion radicals by which characters are arranged in traditional Chinese dictionaries
Non-marking-up portion
Annotate: the word category type of the top that keeps left more, grade is preferential more during coding.
Five, principal part head, secondary radicals by which characters are arranged in traditional Chinese dictionaries and time secondary radicals by which characters are arranged in traditional Chinese dictionaries:
(1) shared relative scale is not simultaneously in corresponding individual character according to each word portion, choose by " from very much not from little " earlier, size identical again by the rule of " from mark (standard radicals) not from non-(non-standard word portion) ", " in the past not from the back " in one of first, last word portion, choose principal part head;
(2) choose principal part head after, again according to above-mentioned rule and preferential according to " first ", " end first " order, next method of " first and last ", " end, end two " order is chosen from spare word portion " secondary radicals by which characters are arranged in traditional Chinese dictionaries " again.
(3) and the like, also can choose " inferior secondary radicals by which characters are arranged in traditional Chinese dictionaries ".For example: major and minor, the inferior secondary radicals by which characters are arranged in traditional Chinese dictionaries of " lotus " be respectively (Nian, Ren, can), major and minor, the inferior secondary radicals by which characters are arranged in traditional Chinese dictionaries of " heat " are respectively (Xiangxi, Rolling, ball).
Annotate 1: when single is drawn as word radicals by which characters are arranged in traditional Chinese dictionaries " " and links to each other with other radicals by which characters are arranged in traditional Chinese dictionaries, should and select major and minor radicals by which characters are arranged in traditional Chinese dictionaries as different word portion's codings, as: day (one, greatly).
Annotate 2: few in number earlier in the middle of the prose style free from parallelism individual character of the order of strokes observed in calligraphy should note (not comprising whole individual character) selection of major and minor radicals by which characters are arranged in traditional Chinese dictionaries, as " territory " (eight, car), " win " (Tou, shellfish).
Annotate 3: indivedual difficult individual characters of distinguishing radicals by which characters are arranged in traditional Chinese dictionaries, as: the major and minor radicals by which characters are arranged in traditional Chinese dictionaries of " a widow " are respectively (woman, end).
Annotate 4: to the individual character of the continuous upper, middle and lower type structure of stroke, getting first and last word portion is major and minor radicals by which characters are arranged in traditional Chinese dictionaries, and for example: the major and minor radicals by which characters are arranged in traditional Chinese dictionaries of " honor " and " table " are respectively (Nian, wood) and (fore-telling, wood).
Annotate 5: the individual character of three structures of loosing is if when being made up of certain independent radicals by which characters are arranged in traditional Chinese dictionaries and another prose style free from parallelism individual character, should get principal part in this prose style free from parallelism individual character and independent radicals by which characters are arranged in traditional Chinese dictionaries as major and minor radicals by which characters are arranged in traditional Chinese dictionaries, as: the major and minor radicals by which characters are arranged in traditional Chinese dictionaries of " case " are (Http, wood).
Six, the division of six kinds of font structures of two classes:
Carried out after the Mathematical Statistics Analysis through us to vast and numerous Hanzi font repeated structure ground, the word portion composition of all single Chinese characters clearly is divided into whole and a prose style free from parallelism two class formations, wherein a prose style free from parallelism can be divided into again " about ", " up and down ", " inside and outside ", " half left side half right ", " half first down " type structure, interior, amount to six kinds of font structures together with one-piece construction; And each prose style free from parallelism structure, can be divided into " two minutes ", " three minutes ", " four minutes " etc. by the number of the contained word of each individual character portion again, i.e. " about two minutes ", " two minutes up and down ", .... this standard with middle and primary schools' current teaching materials implementation conforms to substantially, and is only slightly flexible.So numerous and diverse Chinese character is converted into the numeric structure that is readily appreciated that, by the distribution of being correlated with of 5 districts of " letter pen " corresponding alphabetic keypad of method, finishes Chinese character input thus again.
When using " microcode " to carry out the Chinese character input, the right-hand man works in coordination, so both having kept some comparatively outstanding input method of Chinese character correctly utilizes fingering to improve the advantage of input speed, corresponding font structure again with keyboard position, order of writing strokes is corresponding with the coding input sequence, dictionary enquiring is corresponding with computer search, the literal intension of block character and computing machine are formed " microcode " input method of Chinese character that meets the Modern Chinese standard to letter, digital these corresponding characteristics of quick conversion.
The font structure of single Chinese character can be divided as follows:
Whole: radicals by which characters are arranged in traditional Chinese dictionaries, individual character
A prose style free from parallelism: left and right sides type: left right model (upper left following, left, center, right ...)
Half left right model (half upper left following, half left, center, right ...)
Other type: go up mo(u)ld bottom half (upper, middle and lower, go up about ...)
Half last mo(u)ld bottom half is (about on half upper, middle and lower, half ...)
About type (full encirclement, upward encirclement ...)
Annotate: about structure is that with about half or half key distinction that goes up the mo(u)ld bottom half structure the former is outer three or outer completely encircle structure, and the latter just go up left and right upward, two sides investing mechanism such as lower-left.
Seven, the judgment basis of single Hanzi font structure:
The font structure of single Chinese character should be main judgment basis with principal part head relative position of living in individual character.For example: the principal part head " wood " of " lattice " word is positioned at left part, and then this word is diffusing three structures (or being divided into left and right sides structure roughly) of upper left mo(u)ld bottom half; In like manner, " shuttlecock " is diffusing three structures of half left, center, right type, and " energy ", " reaching " two words then are diffusing four structures of last mo(u)ld bottom half.
The principal feature of the simple technique of writing of 2-1 " microcode "
" letter pen " method is with character pattern input, thereby for any Chinese character that you are unfamiliar with or can not read, all is easy to input, and it belongs to and connects the type that font code is imported, and the user need possess the ABC that word and order of writing strokes looked in radicals by which characters are arranged in traditional Chinese dictionaries." letter pen " method is 2.9% in the static repetition rate of coding of a whole secondary character library.
The maximum code length of " letter pen " method is four yards, and each individual character is keyed in four yards at most, generally can go up screen, not enough four yards word, and benefit is struck space bar and can be gone up screen; If any the repeated code word, can select numerical key according to the prompting of screen below presenting bank, screen in the key entry.
" letter pen " method is provided with the coding prompt facility, from second yard of every word, whenever click query key "/? ", presenting bank can be pointed out subsequent encoding automatically.
One, the definition of stroke and keyboard item:
Stroke: horizontal (one) perpendicular (Shu) casts aside (Pie) point (Dian) folding (second)
Item: 12345
Annotate 1: alphabetic keypad is divided into 5 districts, area code from 1 to 5, the first sum of picture of corresponding individual character basic code or the preceding unicursal of complement code; Each district distributes five keys, key item from 1 to 5, the back unicursal of corresponding basic code or complement code respectively.
Annotate the 2:N key as dedicated array of keys, substitute " mouth ", " mouth " two enclosed type stroke radicals or be used as the enter key position that joined mark transferred in whole individual character.
Annotate 3: the area code item of no matter pressing " simple " method on Master Keyboard is imported or is imported by correspondence is alphabetical, owing to the key position is identical, so input results is consistent.For example: (123435, FWQ), the region code of " letter pen " method is represented in the comma front to machine, and alphabetic coding is represented in the back.
Two, coding constitutes:
(1) prose style free from parallelism individual character: add that by 1-4 base sign indicating number or base sign indicating number complement code constitutes.
(1) base sign indicating number: constitute in twos as area code and item by two corresponding strokes in each word portion respectively, with angle brackets<expression, be designated as<first J〉and<JJ (J represents the substitute symbol of crossing or non-intersect stroke).
(2) complement code:, be designated as [complement code] with square bracket [] expression.When<4 strokes, no matter whether intersect word portion, get end stroke respectively; When 〉=4 strokes, end stroke is still got by non-crossing word portion, intersects word portion and then gets last and hand over stroke.
(2) whole individual character: add that by 1-4 base sign indicating number or base sign indicating number joined mark constitutes.
(1) base sign indicating number: constitute in twos by each stroke of whole individual character, with angle brackets<expression, be designated as<first,<three or four,<five or six,<end two, end.
(2) joined mark: be used for a whole individual character to six-stroke,, be designated as [joined mark] with square bracket [] expression.The choice of joined mark can be divided into four kinds of situations (detailed rules and regulations are detailed to see next section " letter pen " method coding formula with for example)
2-2 " letter pen " method coding formula with for example
One, the individual character of prose style free from parallelism bipartite texture:
(1) diffusing two structures, two base sign indicating numbers:
(1) do not mend: the first J of<stem+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉
As: juice (4412, OF), do (5112, BF).
(2) complement code: the first J of<stem 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[complement code]
(1) just mend (the complement code area code is got preceding word portion, and item is got back word portion):
As: (203414, NWS), from (343444, WWO).
(2) the anti-benefit (the complement code area code is got back word portion, and item is got preceding word portion):
As: Lu (202011, NNG), brother (203551, NQB), day (511341, BDY).
(2) folding is mended (the folding complement code is got the first stroke of a Chinese character of a back bending stroke and moved towards to be area code, and adds item 5):
(1)<the first J of stem 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[folding is mended]:
As: he (325225, NWY), mother (535535, CZQ), beat (151525, AAM), the Chinese (445415, OXA), the Liao Dynasty (554535, ZPQ).
(2)<the first J of stem 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[complement code]+[folding is mended]:
As: cross (15454425, ApOM), pay (32152425, RALM), generation (32152445, RALP), level (55351455, ZQSZ).
(2) diffusing two structures, three base sign indicating numbers:
(1) do not mend:
Stem (4 strokes: the first J of<stem 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries 〉+<the last JJ of portion 〉
End portion (4 strokes:<stem is first 〉+<stem JJ 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉
As: Shanghai (444513, OPD), pin (311512, TAF), heap (123241, FRY), the storehouse (411521, YAH).
(2) complement code:
Stem (4 draw: the first J of<stem 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries 〉+<the last JJ of portion 〉+[complement code]
End portion (4 draw:<stem is first 〉+<stem JJ 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[complement code]
As: group (55251111, ZMGG), strange (13125154, DFBX), morning (21131341, HDDY).
(3) diffusing two structures, four base sign indicating numbers:
<stem is first 〉+<stem JJ 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries 〉+<the last JJ of portion 〉
As: ball (11211544, GHAO), pen (31413115, TYTA).
Two, the individual character of a prose style free from parallelism three separation structures:
(1) do not mend: the first J of<stem+<inferior radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉
As: estimate (321220, RFN), portion (412052, YNV), Hong (115454, GXX), shuttlecock (355254, QVX), gloomy (141414, SSS).
(2) complement code:
(1)<the first J of stem 〉+<inferior radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[major and minor complement code]
As: insert (15313212, ATRF), make (31204524, TNPL), greedy (34452544, WPMP), military (51152114, BAHY).
(2)<the first J of stem 〉+<inferior radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[the secondary complement code of primary and secondary]
As: carry (15211214, AHFS), lock (35242554, QLMX), obtain (12351432, FQSR), deceive (55452212, ZPJF).
Three, the individual character of a prose style free from parallelism four separation structures:
(1) do not mend: the first J of<stem+<inferior radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last two radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last radicals by which characters are arranged in traditional Chinese dictionaries J 〉
As: curtain (12211325, FJDM), the territory (11325134, GRBW), little (33253534, EMQW), win (41202535, YNMQ).
(2) first, last complement code: the first J of<stem 〉+<inferior radicals by which characters are arranged in traditional Chinese dictionaries J 〉+<last two radicals by which characters are arranged in traditional Chinese dictionaries J 〉+[first, last complement code]
Be used for 〉=prose style free from parallelism structure of four word portions, when the number that becomes word radicals by which characters are arranged in traditional Chinese dictionaries (not comprising the distortion radicals by which characters are arranged in traditional Chinese dictionaries) in the individual character becomes even number.
As: liquor-saturated (11413452, GYWV), more (34512544, WBMO), swollen (35121213, QFFD), can (54253554, XMQX)
Four, integrally-built individual character:
(1) do not connect:<first+<three four+<five six+<end two, the end
As: one (51, B), two (11, G), the people (34, W), soil (1251, FB), the river (3222, RJ), art (123444, FWO), with (1551, AB), (5354, CX).
(2) transfer to connect (add 20 or N):<first 〉+<three four+<five six+[company of accent]
As: the scholar (125120, FBN), owe (353420, QWN), youngster (352020, QNN).
(3) folding connects:<first 〉+<three four+[folding connects]
As: power (5315, CA), the woman (533135, CTQ), (5535, ZQ), ball (355415, QXA), and (355455, QXZ).
(4) doubly-linked:<first 〉+<three four+<five six+[joined mark]
As: fourth (1515, AA), ten thousand (133535, DQQ), all (355454, QXX), north (21133535, HDQQ), half (43111212, IGFF), must (45433434, PIWW).
The principal feature of the other method of 3-1 " microcode " sound
" by the sound " method is easy to learn, and optimum is created and drafted manuscript, and its memory capacitance is very little, high input speed.
" by the sound " is owned by France in the type of pressing the comprehensive input of sound shape, and the user will possess the ABC that word and order of writing strokes looked in the Chinese phonetic alphabet, radicals by which characters are arranged in traditional Chinese dictionaries.
The maximum code length of " by the sound " method is four yards, keys in four yards, generally can go up screen, not enough four yards word, and benefit is struck space bar and can be gone up screen; If any the repeated code word, can select numerical key according to the prompting of screen below presenting bank, screen in the key entry.
" by the sound " method is provided with the coding prompt facility, from second yard of every word, whenever click query key "/? ", presenting bank can be pointed out subsequent encoding automatically.
One, sound:
The rule of the relevant Chinese phonetic alphabet in " by the sound " method conforms to substantially with the standard of middle and primary schools existing textbooks.
Principle according to " easily learning fast ", we adopt " piecing together symbol " sound of exempting from memory-type, be the first identical shared letter key of individual character of initial consonant (the female ZH of alliteration, CH, SH Z, C, the S key replaces), and the also shared letter key of the individual character that the first simple or compound vowel of a Chinese syllable is identical (as U, UO, UANG uses the U key to replace), still need memorize mechanically the problem of key position thereby solved other input method of Chinese character owing to merging letters case based on phonetic.By the way be that this method can improve the accuracy of input indirectly to some word sounds not word or some people not up to standard that speaks standard Chinese pronunciation of readability standard.Should be noted that following some:
(1) zero initial individual character, consonant-vowel code are only imported a simple or compound vowel of a Chinese syllable sign indicating number and are got final product, as: the consonant-vowel code in " Ah, goose, Europe " is " A, E, O ";
(2) when the individual character of input tape simple or compound vowel of a Chinese syllable NG, available G replaces NG, is " AG " as the consonant-vowel code of " dirty ";
(3) simple or compound vowel of a Chinese syllable ü replaces with the V key.
Two, word portion:
In " by the sound " method, second ingredient of single character code is " word portion sign indicating number ", and it is that major and minor radicals by which characters are arranged in traditional Chinese dictionaries by individual character constitute.
Word portion sign indicating number is in the coding mark, and stroke code represents that with the area code item of " letter pen " method the radicals by which characters are arranged in traditional Chinese dictionaries sign indicating number is with letter representation.
(1) prose style free from parallelism structure:
(1) during standard radicals<4 strokes, yard get stroke code (wherein modification radicals by which characters are arranged in traditional Chinese dictionaries should by actual stroke code fetch) by the loose base of three above structures of " letter pen " method, as: lift (TA1554).
(2) non-marking-up portion and 13 difficulties are read radicals by which characters are arranged in traditional Chinese dictionaries and are also connect the loose base of three above structures of " letter pen " method and yard get stroke code, as: Tianjin (JI4452).
(3) during standard radicals 〉=4 strokes, the radicals by which characters are arranged in traditional Chinese dictionaries sign indicating number got in the pronunciation that connects radicals by which characters are arranged in traditional Chinese dictionaries (comprising the modification radicals by which characters are arranged in traditional Chinese dictionaries), as: silver (YTJG).
(2) one-piece construction:
(1) generalized case is got stroke code by actual font according to the base sign indicating number of " letter pen " diffusing two structures of method, as: open (KA1132).
(2) band bending stroke and<during 4 strokes, get a base sign indicating number and a stroke code of rolling over joined mark, as very little (CU1525).
Three, radicals by which characters are arranged in traditional Chinese dictionaries:
Radicals by which characters are arranged in traditional Chinese dictionaries are the classes of being divided by the adopting Chinese character form radical.
We arrange the frequency of utilization that 189 radicals by which characters are arranged in traditional Chinese dictionaries of " newly organized pupil dictionary " and " Xinhua dictionary " (" Ci hai " adopt be 250 radicals by which characters are arranged in traditional Chinese dictionaries) connect word, be divided into high frequency radicals by which characters are arranged in traditional Chinese dictionaries, radicals by which characters are arranged in traditional Chinese dictionaries commonly used and the inferior radicals by which characters are arranged in traditional Chinese dictionaries of using, again according to the principle of " high frequency is preferential " and " exempt from remember ", with its together with non-marking-up portion stroke code interior, be divided into stroke code and radicals by which characters are arranged in traditional Chinese dictionaries sign indicating number two classes, be referred to as " word portion sign indicating number ", as second ingredient of single character code.
(1) first kind (stroke code): totally 88, get stroke code by the base sign indicating number of " letter pen " method, wherein:
(1) the having of standard radicals<4 strokes 75 (comprise sealing radicals by which characters are arranged in traditional Chinese dictionaries), as: big, Rui, field;
(2) difficulty is read 13 radicals by which characters are arranged in traditional Chinese dictionaries, comprises the radical and the difficult deserted word of reading difficult note that do not have pronunciation, as: Xiangxi, yarn (reading M ì).
(3) do not belong to the non-marking-up portion of 189 radicals by which characters are arranged in traditional Chinese dictionaries, get stroke code yet by " letter pen " method, as: " giving " (FE1114).
(2) second classes (radicals by which characters are arranged in traditional Chinese dictionaries sign indicating number): totally 99.
The radicals by which characters are arranged in traditional Chinese dictionaries of all 〉=4 strokes (do not comprise difficulty read radicals by which characters are arranged in traditional Chinese dictionaries) are according to first letter of radicals by which characters are arranged in traditional Chinese dictionaries pronunciation initial consonant, as the radicals by which characters are arranged in traditional Chinese dictionaries sign indicating number.As: Epileptic, wood, stone, the moon.
Four, the selection of polyphone:
" by the sound " method is to 3,755 Chinese characters of one-level character library of (GB2312-80), and 3,008 Chinese characters of secondary character library amount to 6,763 Chinese characters, has arranged the selection input of polyphone.Above polyphone read in a word two in the one-level character library, then encode respectively,, generally only choose modal a kind of (, can adopt " piecing together symbol " method to other sound tone) input as being the tone difference as the sound difference; To the secondary character library, except that the polyphone input arranged in more common word, generally only select one to read.For example: input CA3154 or ZA3154, all exportable " length " this word.
3-2 " by the sound " method coding formula with for example
This chapter described " sound " sign indicating number, with angle brackets<expression, " word portion " sign indicating number is with [] expression; Described [front portion] or [rear portion], be meant principal part or secondary portion the two one of, sequential write preceding be [front portion], after be [rear portion], and with the principal part position as the foundation of judging font, as: issue (left right model, the front portion is " eight ", the rear portion is " page or leaf ").
One, prose style free from parallelism type individual character:
(1) just mend:<sound+<rhythm 〉+[front portion]+[rear portion]
The individual character that is used for left and right sides type, as:
Beat (DA1515), change (HU1534), ring (HUW13), border (JT12Y) debates (BTX45), makes (ZAN45), good fortune (FUS51), shuttlecock (JTM54).
Annotate 1: when zero initial, should get:<rhythm 〉+[front portion]+[rear portion], as: Ah (A5212).
(2) the anti-benefit:<sound 〉+<rhythm 〉+[rear portion]+[front portion]
The individual character that is used for other type, as:
Word (ZT5544) has (YOY13), and phoenix (FE5435) is planted (ZAMG), garden (YV1125), curtain (MU2512), preceding (QT2543), device (QTQ20).
Annotate 2: when zero initial, should get:<rhythm 〉+[rear portion]+[front portion], as: grace (EX25).
Two, monolithic devices individual character:
(1) non-intersect:<sound 〉+<rhythm 〉+[first]+[three, last two]
As: go into (RU34), three (SA11), month (YV3511), north (BE2113), white (BE5321).
Annotate 3: when zero initial, should get:<rhythm 〉+[first]+[three, last two], as: youngster (E35), recessed (A2525), and (E1322).
(2) intersect:<sound+<rhythm 〉+[first]+[three, end] are as (YI5125), the people (MI5155), fish (YV3521).Annotate 4: when zero initial, should get:<rhythm 〉+[first]+[three, end], as: ear (E1221).
Annotate 5: can not obtain when handing over by above-mentioned generalized case stroke individually≤6 and word portion that two crossing strokes are arranged, get<sound 〉+<rhythm 〉+[first]+[preceding friendship, end], as: west (XT1231).
Annotate 6: can not obtain when handing over by above-mentioned generalized case stroke individually>6 and word portion that two crossing strokes are arranged, get<sound 〉+<rhythm 〉+[first]+[hand over the back, end], as: two (LI1234), the tenth of the twelve Earthly Branches (YO1251), fiber crops (MA4124), deer (LU4125).
Annotate 7: the microcode upgrade version, can all adopt " by the sound " method and just to mend formula.
The characteristics and the using method of 4-1 " number " method
" microcode " Chinese-character digital code input method belongs to the type of coding by the numeral input.
" number " method is that the numeric structure characteristic of utilizing " letter pen " method to form naturally converts to from a kind of method of numeric keypad input.
One, the characteristics of " number " method:
(1) can be convenient to alphabetic keypad and the unfamiliar user of fingering.After the operation of process some time is familiar with, these users can progressively improve the input speed of use " number " method, and can surpass the speed that they use " microcode " other input method input Chinese character in addition fully.
(2) " number " method is only changed " letter pen " method, this is because be converted to number from " by the sound " method or " piecing together symbol " method, the transfer process of a human thinking how, certainly will influence input speed, the people that particularly utilize " by the sound " to write or draft manuscript, the ABC that has all possessed the Chinese phonetic alphabet and word portion, as long as in addition fingering exercise, the input speed of using " letter pen " method in " microcode " or " by the sound " method all may meet or exceed the touch system of professional typist's high speed.
(3) maximum code length of " number " method is four yards, and each individual character is keyed in four yards at most, generally can go up screen, not enough four yards word, and benefit is struck space bar and can be gone up screen; If any the repeated code word, can select numerical key according to the prompting of screen below presenting bank, screen in the key entry.
" number " method is provided with the coding prompt facility, from second yard of every word, whenever click query key "/? ", presenting bank can be pointed out subsequent encoding automatically.
Two, the input method of " number " method:
By " letter pen " method individual character is encoded earlier, each individual character produces the number of one group of area code and item, respectively with every group of area code and item addition, obtain one group of number newly thus, it is keyed in successively,, can press the numeral of presenting bank prompting again and select to determine if any repeated code.As:
" machine " should be (123435) with " letter pen " method coding, and through 1+2=3,3+4=7, the quick mental arithmetic of 3+5=8 just can obtain the coding (378) of " number " method, with 378 key entries, promptly exportable " machine " this word.
The characteristics and the using method of method that 4-2 " pieces together symbol "
It is owned by France in the type of pressing the input of sound sign indicating number " to piece together symbol ".
" piece together symbol " method is to work out according to the Scheme for the Chinese Phonetic Alphabet that Committee for Reforming the Chinese Written Language announces, it and the performed standard basically identical of the existing textbook of middle and primary schools.
The formulation of " piece together symbol " method, mainly be for:
(1) make things convenient for the user to use when needing Chinese character that input can read can not to write running into, thereby it not as a kind of quick input method, and be a kind of have the dictionary character of actual application value and input method of Chinese character of standby character;
(2) convenient numerous students in middle and primary schools and the user that just learns to speak Chinese utilize phonetic transcription to import Chinese character.
" piece together symbol " method is the Chinese phonetic alphabet according to individual character, with approximate letter of letter of full form from the keyboard standard knock in again the in addition coding of tone.It has following characteristics:
Need not distinguish zero initial when one, importing.
Two, simple or compound vowel of a Chinese syllable " ü " replaces with " V " key, and " NG " ending of a final replaces with " G " key without exception.
Three, piece together symbol method 3755 Chinese characters of one-level character library to (GB2312-80), 3008 Chinese characters of secondary character library amount to 6763 Chinese characters, have arranged the selection input of polyphone.Above individual character read in one-level character library one word two, and what harmonious sounds was different all encodes respectively, and to the secondary character library, the polyphone that all " pupil dictionary " and " Xinhua dictionary " have been taken in is also all encoded respectively, greatly facilitates the user.For example: input CHANG2 or ZHANG3, all exportable " length " this individual character.
Four, each individual character is after the input Pinyin letter, both can be according to the prompting word selection of presenting bank, also the available digital key is keyed in behind the circumflex of this word word selection again: softly (as: a) strike 0, the first (high and level tone, as: ā) strike 1, the second sound (rising tone, as: á) strike 2, the three (go up sound, as: ǎ) strike 3, the fourth sound (falling tone is as à) strikes 4.
Five, less when the number of words of identical sound, when screen prompt row delegation all shows, just can select screen in the key entry according to the numeral of prompting; When delegation does not show, then use "+" (page turning backward), "-" (page turning forward), after this word occurring,, shield in the key entry again according to the numeral of prompting.Because this law has adopted the mode of operation of keying in circumflex, will significantly reduce the workload that page turning is searched than common spelling method, thereby input speed wants fast.

Claims (7)

1. Hanzi coding input method and product, comprising " letter pen ", " by the sound ", " number " and " piecing together symbol " computer Chinese input method of four types, and matching used phrase and foreign language import, and its principal feature is:
(1) adopting standard radicals by which characters are arranged in traditional Chinese dictionaries, stroke, the order of strokes observed in calligraphy and the Scheme for the Chinese Phonetic Alphabet of " newly organized pupil dictionary " and " Xinhua dictionary " is the basis;
(2) adopt the synthetic and corresponding font structure classification of stroke and easily learning fast of forming;
(3) adopt above-mentioned four kinds of coding methods and the character library compatibility that forms.
2. according to the described software of claim 1, disk or other computer product.
3. encode Chinese characters for computer and the search method of working out according to the described standard radicals by which characters are arranged in traditional Chinese dictionaries of claim 1.
4. according to the described font structure classification of claim 1, mainly refer to the two classes differentiation and the judgement of totally six kinds of font structures to Chinese character.
5. according to claim 1 described " simple pen " method, adopt the synthetic corresponding alphabetic keypad area code of stroke, to solve the method that repeated code is remembered and reduced in the key position.
6. according to claim 1 described " by the sound " method, the single-letter of the employing Chinese phonetic alphabet is represented sound, rhythm, and stroke is synthetic represents word portion with readability radicals by which characters are arranged in traditional Chinese dictionaries pronunciation, to solve the method that repeated code is remembered and reduced in the key position.
7. according to claim 1 described " number " method, adopt the synthetic area code that is produced of stroke to convert numerical coding to, from the method for numeric keypad input.
CN 96120740 1996-12-03 1996-12-03 Chinese character input method and product thereof Expired - Fee Related CN1089175C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 96120740 CN1089175C (en) 1996-12-03 1996-12-03 Chinese character input method and product thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 96120740 CN1089175C (en) 1996-12-03 1996-12-03 Chinese character input method and product thereof

Publications (2)

Publication Number Publication Date
CN1159029A true CN1159029A (en) 1997-09-10
CN1089175C CN1089175C (en) 2002-08-14

Family

ID=5126553

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 96120740 Expired - Fee Related CN1089175C (en) 1996-12-03 1996-12-03 Chinese character input method and product thereof

Country Status (1)

Country Link
CN (1) CN1089175C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG111933A1 (en) * 2001-02-27 2005-06-29 Sony Corp Character inputting method and character inputting apparatus

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SG111933A1 (en) * 2001-02-27 2005-06-29 Sony Corp Character inputting method and character inputting apparatus

Also Published As

Publication number Publication date
CN1089175C (en) 2002-08-14

Similar Documents

Publication Publication Date Title
CN1023916C (en) Chinese keyboard entry technique with both simplified and original complex form of Chinese character root and its keyboard
CN1015218B (en) Imput method of word root code and apparatus thereof
CN1159029A (en) Chinese character input method and product thereof
CN1121645C (en) Sound and shape word code Chinese character input method
CN1129058C (en) Chinese character phonetic code and keyboard design
CN1166997C (en) Chinese-character fast input method without splitting
CN1164689A (en) Computer input method for Chinese characters' sound pattern meaning based on word and Chinese-Spanish compatible keyboard
CN85100087A (en) " Chinese coded sound " scheme and its implementation
CN1123819C (en) Chinese character key-position code input method for computer
CN1186976A (en) Computer Chinese character eight-four code input method and key board
CN1108552C (en) Perfecting method (PHF) for phoenticizing Chinese charaters
CN1156744C (en) Chinese-character 'meta-root code' input method
CN1417674A (en) Chinese syllable double reading scheme, Chinese keyboard and information input and processing method
CN1162766C (en) Chinese-character 'pronunciation-shape code' input method and its keyboard profile
CN1058342C (en) Chinese character byte codes and its keyboard of using the same
CN1023669C (en) Wang's code Chinese input method
CN1374577A (en) General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1808351A (en) Chinese character input method using initial and etymon to encode for computer
CN1055434A (en) The pixel input method of character and keyboard thereof
CN1295589C (en) Chinese character input method using etymon-less code
CN1527184A (en) Chinese character input method and keyboard
CN1056357A (en) Chinese character coding input method
CN1172983A (en) Phonetic Chinese word encoding and its keyboard
CN1334499A (en) Fast phonetic-letter input method
CN1379307A (en) Chinese-character universal normalized holographic encode and high-speed input method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee