CN101051246A - Computer keyboard shape code Chinese character code input method - Google Patents
- Publication number: CN101051246A (application CN200610074425A)
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
A shape-code Chinese character input method for a computer keyboard. Chinese characters are split into word beams (components); the word beams are divided into 38 shape categories according to their visual form; and these categories are then merged, on the principles of shape correlation and positional complementarity, into 26 classes, each assigned one related letter as the class code of its word beams. The code-taking method is also disclosed.
Description
1 Technical field
The invention relates to a shape-code Chinese character input method for a computer keyboard.
2 Background art
Shape-code Chinese character input methods for the computer keyboard all split a Chinese character into components (some call them "radicals"; the present invention calls them "word beams"), map the components onto letter keys according to certain rules, and then realize Chinese character input through a code-taking method.
The invention with application number 95104165.7 splits Chinese characters into word beams and represents each word beam with a classification code and an identification code, so that the classification code (primary) and the identification code (auxiliary) together constitute the keyboard input code of a Chinese character. The classification code is an easy-to-remember related letter determined by jointly considering the direction, shape, and stroke order of the word beam and the complementary combination of word beams; the identification code is the first pinyin letter of the word beam or a letter related to its pronunciation. That invention thereby effectively resolves the conflict between reducing memorization and shortening code length in keyboard input codes for Chinese characters. It can be used to compile dictionaries, word books, or other reference works for character lookup, and to build Chinese character input software for keyboard input.
The inventor of the above invention is the present inventor. For various reasons, that invention was not effectively applied in the market, and it leaves much room for improvement. For example, it mainly covers the 6763 Chinese characters of GB2312-80 and lacks the necessary word beams and corresponding key positions for characters outside those 6763. The splitting rules also need to be refined, word beams need to be added or removed, and the classification of word beams needs further optimization and adjustment.
3 Summary of the invention
The present invention is an improvement built on the background art of application number 95104165.7. It comprises three parts: Chinese character splitting, key-position mapping, and the code-taking method.
3.1 Chinese character splitting
Chinese character splitting is an important aspect of shape coding, and the goal of the present invention is to make it intuitive and natural. To that end, one must first understand the relationship between the structure of Chinese characters and the natural perception of the human brain, then summarize the regularities of splitting from splitting practice, formulate splitting rules, and classify the splitting types. This section comprises five parts: methods for handling splitting-related problems, a summary of splitting regularities, the formulation of splitting rules, a classification of splitting types, and the handling of difficult cases.
1 Methods for handling splitting-related problems
(1) The cross-stroke combination problem
The "mouth" in "state" is formed by combining the first stroke, the second stroke, and the last stroke, skipping over the strokes in between; this is a cross-stroke combination. Cross-stroke combination violates the stroke order used in writing Chinese characters. The reason for using it anyway can be seen from the following examples.
State's (Jiong do Dian one by one) witch (Xia everybody one) bundle (flatly
) ... (split strictly according to stroke order)
State (mouthful king Dian) witch (workman people) bundle (wood mouth) ... (split using cross-stroke combination)
It should be understood that writing Chinese characters emphasizes how to arrange the space well and whether the pen tip moves conveniently, whereas splitting Chinese characters for shape coding emphasizes ease of visual discrimination, so that information can be extracted quickly and the burden on the brain is reduced.
Writing a Chinese character and splitting one are two different physiological operations, and the physiological need in each case is to proceed in the most effort-saving way. Correct stroke order satisfies the physiological need of writing; correct splitting satisfies the physiological need of splitting. The two needs coincide in some places and differ in others.
Furthermore, a structure formed by cross-stroke combination is often the firmer one, and it differs greatly in appearance from the structure it passes over, so the two separate easily. For example, in "bundle", the "wood" formed by cross-stroke combination is a branch shape and the "mouth" it passes over is a ring shape, so the visual difference is obvious. In "hanging down", the cross-stroke combination forms
and passes over "Nian"; the former is several horizontal strokes crossed by one vertical stroke, and the latter is one horizontal stroke crossed by several vertical strokes, so the visual difference is equally obvious.
Following visual difference means following what is intuitive and natural. It is necessary to follow stroke order on the whole, but overemphasizing stroke order at the expense of intuitiveness is inadvisable. For this reason, the splitting principles formulated later in this document include both a "writing rule" and a "principle of clarity". Some further examples follow.
The correct split of "again" is "again (a Jiong soil)"; the cross-stroke split "again (king Jiong)" is incorrect. Inside "again", "king" loses its coherence; in particular, the middle horizontal stroke of "king" is firmly confined within the frame by "Jiong", so "king" is no longer intuitive and is hard for the brain to pick out.
The correct split of "genus" is "genus (corpse Pie mouth
)"; the cross-stroke split "genus (corpse Pie worm Jiong)" is incorrect. This is because "worm" is an impure composite structure made up of the ring shape "mouth" and the branch shape
component, so its firmness is poor; once split apart by "Jiong", it completely loses what little integrity it had and is no longer intuitive.
The correct split of "Chi" is "Chi (Cao one worm)"; the cross-stroke split "Chi (Qian
mouth)" is incorrect, because the character is not written that way ("one" and "worm" are separate).
(2) The stroke-order combination problem
In stroke order, a stroke can combine with the strokes before it to form a word beam, or with the strokes after it. For example, the first horizontal stroke in "sheep" can combine with the preceding "Ha" to form " ", or with the following
to form
Two points mainly decide how strokes that occur in sequence should be combined: first, the combination should aid visual segmentation; second, it should minimize the number of word beams produced by the split. How a split aids visual segmentation is covered under the splitting rules. Minimizing the number of word beams (the "minimum principle") is one of the necessary conditions for the split of a character to tend toward a unique result. For example:
Following (Xia Dian), Bian (Tou fore-telling) ... (correct: meets the minimum principle)
Bian (Dian Xia Dian) ... (wrong: violates the minimum principle)
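The minimum principle can be illustrated with a small sketch. It only filters candidate splits by word beam count; it assumes the candidates have already been produced by other means, and the function name `fewest_word_beams` and the labels (the machine-translated glosses used in the examples above) are illustrative, not part of the patent.

```python
from typing import Sequence


def fewest_word_beams(candidates: Sequence[Sequence[str]]) -> list[list[str]]:
    """Return every candidate split that uses the smallest number of word beams."""
    if not candidates:
        return []
    smallest = min(len(split) for split in candidates)
    return [list(split) for split in candidates if len(split) == smallest]


# Candidate splits of "Bian", using the glosses from the examples in the text.
bian_candidates = [
    ["Tou", "fore-telling"],  # marked correct in the text
    ["Dian", "Xia", "Dian"],  # marked wrong: violates the minimum principle
]

print(fewest_word_beams(bian_candidates))
```

The count alone does not always give a unique answer, which is why the text treats the minimum principle as a necessary condition rather than the whole rule.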
(3) The order in which word beams are taken
Word beams are taken from top to bottom, from left to right, from outside to inside, from the center to both sides, and from the whole to the part. Specific examples follow.
From top to bottom: adopted (Dian Qe), (ten again), deep and remote (the one youngest mountain), tail (corpse hair), this (literary composition), formula (shooting a retrievable arrow the worker)
From outside to inside: be stranded (mouthful wood), fork (Dian again) is asked (doorway), district (Fang Qe), tricky (
), the family name (
)
From the centre to both sides: do (power eight), million (youngsters
), ridge (people
Month), (Tou one for rate
Ten)
From integral body to the part: witch (workman people), heavy (
Day), bundle (wood mouth), (king pig Dian) chiseled in village (seven Qian)
The examples above cover most situations; the following examples address specific points.
Order: Broken (one one takes
Jin) , Following (Si one one
), disconnected (rice jin), Death (people )
Sequential write: Broken (one youngest, one one youngest jin) , Following (Si youngest, one one youngest, one ), break (meter jin), Death (people )
In the order of taking, in " Broken "
What get is the order of " " in the sequential write; “ Following " in
What get is the order of " " in the sequential write.When taking " one ", " one " and " " have been combined into
When taking " ", " " and " one " have been combined into
So the result that finally takes as above.
Take in proper order: Oes (sunset is foretold a mouthful an ancient type of spoon), Bing (
Do), (scholar's mouth mouth) Se (cutter Dian ends)
Sequential write: Oes (sunset is foretold a mouthful an ancient type of spoon), Bing (
Do), (scholar's mouth mouth scholar mouth mouth) Se (cutter Dian ends cutter Dian and ends)
"Oes" and "Bing" are taken from top to bottom, while " " and "Se" are taken from left to right, because that is their stroke order.
The examples above show that the order in which word beams are taken depends on the stroke order of the character. In other words, stroke order (the "writing rule") is an important basis for splitting Chinese characters. Some further examples follow.
Take in proper order: formula (shooting a retrievable arrow the worker), military (two end
), F (evildoer people's dagger-axe), (bad soil is non-for Jian
)
Sequential write: a formula (worker
), it is military that (two end
), (evildoer people one is non-for F
), (bad soil is non-for Jian
)
Take in proper order: murder (Qe wood Dian and shoot a retrievable arrow) , Ghost-of-a-child (in vain
The Si dagger-axe), pull out (Rolling Dian again)
Sequential write: murder (Qe wood Dian one worker
) , Ghost-of-a-child is (white
Si flatly
), pull out (Rolling Dian again)
2 Summary of splitting regularities
According to the physiological effect that a word beam's stimulus has on the human brain, word beams can be divided into point fine shape word beams (such as "Ha" in "folder"), branch shape word beams (such as "wood" in "bundle"), and ring shape word beams (such as "mouth" in "bundle"). The three kinds stimulate the brain in completely different ways. The combinations of the three stimulus types are as follows.
Different stimulus types: point fine shape with branch shape; branch shape with ring shape; ring shape with point fine shape
Identical stimulus types: point fine shape with point fine shape; ring shape with ring shape; branch shape with branch shape
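As a quick check that the six combinations listed above are exhaustive, the unordered pairs of the three stimulus types can be enumerated. This is only a sketch; the names `STIMULUS_TYPES` and `stimulus_pairs` are illustrative.

```python
from itertools import combinations_with_replacement

STIMULUS_TYPES = ["point fine shape", "branch shape", "ring shape"]


def stimulus_pairs(types=STIMULUS_TYPES):
    """Split all unordered pairs of stimulus types into (different, identical)."""
    different, identical = [], []
    for first, second in combinations_with_replacement(types, 2):
        (identical if first == second else different).append((first, second))
    return different, identical


different, identical = stimulus_pairs()
for pair in different:
    print("different:", pair)
for pair in identical:
    print("identical:", pair)
```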
(1) Different stimulus types
When word beams of different stimulus types are interwoven, they are easy to tell apart. Specific examples follow.
Examples of point fine shape with branch shape: do (power eight), ball (nine Dian), sheep (Ha
), come (Wei Ha). Here "sheep" involves the stroke-order combination problem. Because "Ha" consists of short strokes, that is, point fine shape, while
consists of long strokes, that is, branch shape, the visual difference between the two is obvious, so "sheep (Ha
)" is the correct split and "sheep (
)" is the wrong split.
The example of branch shape and ring shape as: because of (mouthful big), extend (two say), hundred (
Day), lose (two ) blunt (Ji
), ugly (
Ten), person (Uu day), bundle (wood mouth), wife (ten woman), centre ( is big), narrow-necked earthen jar (noon Qian). Here "hundred" and "narrow-necked earthen jar" involve the stroke-order combination problem. Because
and "noon" are branch shape while "day" and "Qian" are ring shape, and the two kinds of structure differ obviously in appearance, "hundred (
day)" and "narrow-necked earthen jar (noon Qian)" are correct splits, while "hundred (one is white)" and "narrow-necked earthen jar (
mountain)" are wrong splits.
The example of ring shape and the fine shape of point is as all (a few Dian), ovum (
Jie Dian), a rain (towel
), family (Dian corpse).
(2) Identical stimulus types
Among identical stimulus types, point fine shape interwoven with point fine shape is fairly rare. For example: a rain (towel
), be (Dian power Dian).
Ring shape interwoven with ring shape is also uncommon. For example: return (mouthful mouth), huge (Contraband
), electric (Ri Yin), village (seven Qian), goes out (Cao Qian). Branch shape interwoven with branch shape is more complicated. Typical examples follow.
Vow that ( is big), do not have (
Yin) ... (correct: " greatly ",
For intersecting shape, sound construction)
Bi (Ri Nian
) ... (correct: “ Nian ",
For intersecting shape, sound construction)
Surplus (people
), house (Ren Gankou) ... (correct:
" do " and be the shape that links to each other, sound construction)
When word beams of the same stimulus type are interwoven, the split follows this priority: intersecting takes precedence over connected, connected takes precedence over separate, and when both are connected, the one that comes first in stroke order takes precedence.
"Cao" is an intersecting structure and "mountain" is a connected structure, so "goes out (Cao Qian)" is the correct split and "goes out (Qian mountain)" is the wrong split. This is intersecting taking precedence over connected. The second-to-last horizontal stroke in "Bi" can combine with the preceding "Nian" to form " ",
or with the following "ten" to form
. Because intersecting takes precedence over connected, "Bi (Ri Nian
)" is the correct split and "Bi (day
ten)" is the wrong split.
One exception: "Land-Use-and-Requisition (Trucks one Jiong mountain)" is the correct split.
is a connected structure and
is a separate structure, so "surplus (people
)" is the correct split and "surplus (
Pin)" is the wrong split. This is connected taking precedence over separate.
" " is a connected structure and "my god" is also a connected structure, but " " comes first in stroke order, so "vow ( is big)" is the correct split and "vow (Pie days)" is the wrong split. This is the case where both are connected, so the earlier one in stroke order takes precedence.
Cases where both are intersecting or both are separate are few, such as "vertical (
Nian)" and "Si (two or two)".
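The priority among identical stimulus types (intersecting over connected, connected over separate, earlier stroke order when both are connected) can be sketched as a simple ranking. The numeric ranks, the `Option` class, and the `order` values are assumptions made for illustration; the patent states only the ordering. The sketch also breaks ties between two intersecting or two separate options by stroke order, which the text does not specify.

```python
from dataclasses import dataclass

# Larger rank = stronger claim on the shared stroke (numeric values are assumed).
RELATION_RANK = {"intersecting": 3, "connected": 2, "separate": 1}


@dataclass(frozen=True)
class Option:
    label: str      # gloss of the word beam the shared stroke would join
    relation: str   # how the resulting word beam holds together
    order: int      # position in stroke order (smaller = written earlier)


def choose_option(options):
    """Pick the strongest relation; on a tie, pick the earlier one in stroke order."""
    return min(options, key=lambda o: (-RELATION_RANK[o.relation], o.order))


# "goes out": "Cao" is intersecting, "mountain" is connected.
go_out = [Option("Cao", "intersecting", 1), Option("mountain", "connected", 2)]
print(choose_option(go_out).label)

# Both connected: the earlier word beam wins (labels A and B are placeholders).
both_connected = [Option("B", "connected", 2), Option("A", "connected", 1)]
print(choose_option(both_connected).label)
```

The exceptions noted in the text are not modeled here; a real table of splits would have to override this ranking for those characters.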
(3) Firmness of word beam structures
The boundary of a split is determined by the compactness and firmness of the word beam structures. From most firm to least firm, the order is:
single stroke > ring-shaped connected word beam > parallel streamlined separate word beam > intersecting word beam > other connected word beams > other separate word beams
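The firmness ordering above can be written down as an ordered enumeration, so that structures can be compared or sorted. This is a sketch only: the class name, member names, and integer values are assumptions, and the three example classifications are taken from how the text describes "mouth", "two", and "Nian".

```python
from enum import IntEnum


class Firmness(IntEnum):
    """Firmness classes from the text; a larger value means firmer (values assumed)."""
    OTHER_SEPARATE = 1
    OTHER_CONNECTED = 2
    INTERSECTING = 3
    PARALLEL_SEPARATE = 4
    RING_CONNECTED = 5
    SINGLE_STROKE = 6


def firmest_first(structures):
    """Sort (label, Firmness) pairs from most firm to least firm."""
    return sorted(structures, key=lambda item: item[1], reverse=True)


examples = [
    ("Nian", Firmness.INTERSECTING),
    ("mouth", Firmness.RING_CONNECTED),
    ("two", Firmness.PARALLEL_SEPARATE),
]
print([label for label, _ in firmest_first(examples)])
```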
Generally, people treat "second", "ㄑ", and
as one stroke, and treat " ",
" ", and
as two strokes. The boundary between the two cases is indeed rather uncertain. To draw a clear line between a single stroke and multiple strokes, the author suggests treating any line formed from putting the pen down until lifting it as one stroke. With this understanding, it is easy to see why
is not split into "Pie" and " ", and why " " is not split into
and "end".
Ring-shaped connected word beams such as "mouth" and
are second only to single strokes in firmness; many of the examples above involve them, so they are not repeated here. Ring-shaped connected word beams with a horizontal stroke in the middle, such as "day", "moon", and " ", are also fairly firm. Split examples follow:
Electricity (Ri Yin), Yue (day Shu) is by (day Shu), Hui (Si on the ten) ... (correct: " day " is strong construction)
Electricity (mouth seven), Yue (Kou Xia), by (Kou Shang) and, Hui (by Si) ... (mistake: intuitive is not good enough)
Get rid of (month Yin), Yin ( Pie), ugly (
Ten), Books (Jiong Nian), Mao (Jiong two), Dan (Jiong two) ... (correctly)
The split examples above have not yet covered parallel streamlined separate word beams. These mainly include "two", "three",
"San",
and "river". Split examples follow.
Some word beams, such as "standing" and "already", occur frequently, look familiar, and occupy special positions in Chinese characters. Although their members are separate, people perceive them as a tight, firm whole, so their firmness equals or even exceeds that of connected structures. For example:
Hot (upright ten), close (Li Pin), produce (upright Pie), tight (an industry Pie) ... (correct)
One exception: "Qi (
mouth)" is the correct split, because "Qi (upright mouthful Pie)" is not intuitive.
Intersecting word beams, other connected word beams, and other separate word beams have all appeared in the split examples above and are not repeated here.
In addition, a firmer word beam generally resists deformation better. For example, "wood" is a firmly built word beam: in "bundle" the "wood" is deformed, yet people still perceive it strongly as an independent whole. A less firm word beam generally resists deformation less well. For example, the split of "closing" is "closing (
mouth)", while the split of "Ji" is "Ji (people one)". This is because
is a loose, unreinforced structure, so once the position of the horizontal stroke changes, its original appearance is completely lost. Other similar cases are compared below:
Assist (Tou mouth mouth
), Select-currency (upright Yi Kou Xia), dead (bad an ancient type of spoon) ... (correctly)
Assist (
Mouthful
), Select-currency is (upright
Xia), dead (sunset an ancient type of spoon) ... (mistake)
Because
is a loose structure while "bad" is a connected structure, their firmness differs, so they are handled differently when deformed.
(4) Influence of stroke weight
From lightest to heaviest, the strokes are: left-falling, horizontal, vertical, right-falling, folding, dot. A lighter stroke stimulates the brain relatively weakly and is harder for the brain to catch. For example, when typing without looking at the source text, the left-falling stroke above "Zhu" in "very" is light and short and hard for the brain to catch, so the split of "very" is fixed as "very (bad Zhu)" rather than "very (bad Pie not)".
A heavier stroke stimulates the brain relatively strongly and is easier to catch. For example, although the dot in "chisel" occupies little area, it is an embellishment on "pig" and a heavy stroke, so the brain catches it easily; the split of "chisel" is therefore fixed as "chisel (king pig Dian)" rather than "chisel (Wang PHP-Manual)".
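The light-to-heavy stroke order can be captured in a lookup table, as in the following sketch. Only the ordering comes from the text; the integer weights and the helper names `STROKE_WEIGHT` and `is_heavier` are assumptions for illustration.

```python
# Stroke names in the order given in the text, lightest first.
STROKES_LIGHT_TO_HEAVY = [
    "left-falling",
    "horizontal",
    "vertical",
    "right-falling",
    "folding",
    "dot",
]

# Assumed numeric weights: 1 for the lightest stroke up to 6 for the heaviest.
STROKE_WEIGHT = {name: rank for rank, name in enumerate(STROKES_LIGHT_TO_HEAVY, start=1)}


def is_heavier(stroke_a: str, stroke_b: str) -> bool:
    """Return True if stroke_a is heavier (more salient) than stroke_b."""
    return STROKE_WEIGHT[stroke_a] > STROKE_WEIGHT[stroke_b]


# The dot in "chisel" is easy to catch; the left-falling stroke in "very" is not.
print(is_heavier("dot", "left-falling"))
```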
The above summarizes the general regularities of Chinese character splitting from several aspects and angles; this is the prerequisite for formulating the splitting rules.
3 Formulation of splitting rules
(1) Splitting principles
The splitting principles comprise the writing rule, the principle of clarity, and the minimum principle (hereinafter "the three principles"). The writing rule means that a character must, in general, be split according to its stroke order. The principle of clarity means that the split must be intuitive and natural. The minimum principle means that the split must produce the smallest number of word beams.
(2) Splitting rules
In summary: take in order by feel, and do not break up what is tight and firm. "Take in order by feel" means generally following stroke order, in combination with spatial position, and taking as much as possible at each step according to perception. "Do not break up what is tight and firm" means that tight, firm structures are generally not broken apart.
Typical splits that reflect the splitting rules are listed below.
1. grow (Pie
), beautiful (Ha king is big), lung (a month towel) ... (right: as to reflect correct ways of writing, meet writing rule)
Long (
Pie ), beautiful ( ), lung (month Tou towel) ... (mistake: ways of writing is incorrect cause splitting incorrect)
2. hundred (
Day), Chu (Er 亅), complete (people king), this (wood one) ... (right: directly perceived, nature meets principle of clarity)
Hundred (one is white), Chu (fourth), complete (
Soil), originally (
The people) ... (mistake: not directly perceived, awkward, violate principle of clarity)
3. Bian (Tou fore-telling), meeting (
), extend (two days), bundle (wood mouth) ... (right: as to meet minimum principle)
Bian (Dian Xia Dian), meeting (people two Si), extend (one day one), bundle is (flatly
) ... (mistake: violate minimum principle)
4. because of (mouth is big), take advantage of (standing grain
An ancient type of spoon), flat (Gan Ha) ... (right: as to follow sequential write generally, directly perceived, nature)
Because of (Jiong is big by one), take advantage of (thousand
An ancient type of spoon people), flat (Yi Ha ten) ... (mistake: definitely follow sequential write, not directly perceived, loaded down with trivial details).
[Special note] After so lengthy a discussion of Chinese character splitting, the splitting rules turn out to be just these few ordinary statements, which people would know even if they were left unsaid. Some readers may find this puzzling. What the author wishes to stress is that this is precisely the goal we pursue: having no rules is the best rule. The human brain is a stimulus sensor, and the best way to split Chinese characters is one that matches and suits this sensor. If that can be achieved, what need is there for splitting rules that only get in the way?
4 Classification of splitting types
Splitting in the present invention emphasizes taking, not writing. If the splitting examples of the present invention are viewed from the standpoint of taking, many doubts resolve themselves. To help organize this thinking further, the author groups the splitting types into the following 19 kinds.
(1) There is an obvious dividing gap between word beams; split along the gap. For example:
Receive (Jiu The-Fan), portion (upright mouthful Fu), long (Pie
), cross (very little Chuo) mouse (mortar
) family (Dian corpse), total (Ha mouth heart), swallow (twenty
Mouth Xiangxi), surplus (people
)
(2) A word beam with short strokes is paired with a word beam with long strokes; split by the difference in stroke length. For example:
Closed (Ha days), sheep (Ha
), (Ha
Order), beautiful (Ha king is big), southern (Shi Men Ha does), bifurcation (Ha Shu), blue (Ha three), half (Ha
)
(3) One word beam surrounds another; the two come apart naturally. For example:
Because of (mouth is big), fierce (Qe Qian), impossible (Contraband mouth) asks (doorway), ridge (Jiong Qe), deep and remote (youngest one mountain), from (civilian Qian Jiong Si), brain (using civilian Qian), fowl (humane Qian Si)
(4) A later word beam embellishes the preceding word beam and is detached from it; the two come apart naturally. For example:
Fork (Dian again), ovum (
Jie Dian), all (a few Dian), too (big Dian), dog (big Dian), following (Xia Dian), a rain (towel
), be (Dian power Dian) sword (cutter Dian)
(5) A symmetric word beam with short strokes flanks both sides of a word beam with long strokes and is detached from it; the two come apart naturally. For example:
Do (power eight), million (youngsters
), ridge (people
Month), (Tou one for rate
Ten), hold (three
), letter (
Qian), assistant officer (
One), ask (
Dian)
(6) Word beams of two different shape types are interleaved and detached from each other; those of different shape separate naturally, and those of the same shape combine naturally. For example:
Extend (two say) boundary (bending native three fields), witch (workman people), deep pool (Rui
Rice), take advantage of (standing grain
An ancient type of spoon), well-behaved (thousand
An ancient type of spoon)
(7) A word beam with short strokes is embedded in a word beam with long strokes and is detached from it; the two come apart naturally. For example:
Flat (Gan Ha), come (Wei Ha), folder (Fu Ha), (
Ha), the state (
The river), stingy (Tu Ha mouth mouth), golden (Ren Wang Ha)
(8) Two word beams are connected; split at the point of connection. For example:
Open (European-allies), inferior (industry), mutual (one
), straight (ten and), the step (ends
), card (go up and foretell), the Tuan commentary on meaning of different diagrams in The Book Changes (
), ( spreads
), vow that ( is big), the family name (
), (Bao not
), the back (
), the institute (
Jin), pawl (Shu ), do not have (
Yin), chi (corpse ), hundred (
Day), first (
Order), city (Tou towel)
(9) A separate-form word beam touches another word beam; split at the point of contact. For example:
Chu (Er 亅), (Lv Mi two covers in unit (two youngsters)
), the capital (
Little), hot (upright ten), close (Li Pin), occasion (factory two
)
(10) A word beam touches the inside of a ring shape word beam; the ring shape and non-ring shape parts come apart naturally. For example:
Face (
Mouthful
), bent (Kou Nian), ugly (
Ten), minister (Contraband Shu
Shu), go out (Cao Qian), and (
Jiong
), narrow-necked earthen jar (noon Qian)
(11) A word beam touches the edge or the open end of another word beam; split at the joint. For example:
(12) A word beam rests on the slanted stroke of another word beam; split at the slanted stroke. For example:
Filial piety (Uu), old (Uu an ancient type of spoon), examine (Uu one ) and, person (Uu day), name (sunset mouth)
(13) A later word beam embellishes the preceding word beam and intersects it; the embellishing word beam is split off. For example:
Ball (nine Dian) is scolded (jin Dian), chisels (king pig Dian), hurriedly (Bao
Dian), this (wood one) must (heart Pie), and (
)
(14) Two dumbbell-shaped word beams are interlocked; split according to compactness. For example:
(15) A ring shape word beam and a branch shape word beam are interlocked; split the ring shape from the branch shape. For example:
Bundle (wood mouth), official's (zhang mouth), card (wood
), heavy (
Say), thorn (wooden Jiong Dao), jujube (wooden Jiong
), village (seven Qian), thing (
Mouthful ), favour (
Day heart), smooth (longbow) grasps (standing grain )
(16) A cross-shaped word beam is inserted into the middle of a ring shape word beam; the cross-shaped word beam is split off. For example:
Wife (ten woman), capsule (ten mouthfuls of Mi
), 18-hole-golf-course (ten mouthfuls are again), Graduate (ten suns), Cao (ten everyday)
(17) A separate-form word beam intersects another word beam; the two are split apart. For example:
(18) A ring shape or bent word beam intersects another word beam; the two are split apart. For example:
In (mouthful Shu), exempt from (
Mouthful
), volume (
One), red (
Tou), slowly (Jiong soil), centre ( is big), allusion quotation ( eight), (Bing determines
The people), find pleasure in (
Little), tooth (
亅 Pie), error (sunset
Shu), deer is (wide
An ancient type of spoon), get rid of (month Yin), Yu (mortar people), ghost is (white
Si), a kind of monkey mentioned in ancient literature (day Jiong
), black (
Soil Xiangxi), more (one day
), Yin ( Pie), then (
), being subordinate to ( Shui), ( holds concurrently
), respectful ( Shu
Eight), make (
Towel Dao), lung (using a towel), prompt (Rolling one
), Wei (two
), book (
Dian), farming (Mi
), crane (Mi Ren Dian bird), Shen (Rui Mi
), east (
Little), practice (Si
), special (two
Dian), send out ( Pie is Dian again) elder sister (woman
Shu Pie)
(19) A word beam with a horizontal stroke intersects a word beam with a slanted hook; the two are split at the shoulder. For example:
Become (
), plant (building
), I (
), sunlight (Ri Ha king
), it is military that (two end
), cut (native Ren Dian
), (Tu Kou
), get over (soil
)
5 Handling of difficult cases
Intuitive splitting is what we aim for, but it is almost impossible for every Chinese character to be split intuitively. For example, in "pair",
is more intuitive left unsplit as a whole, but in "beans",
leaving it unsplit as a whole is not intuitive. Similarly, in "control", "cave" is more intuitive left unsplit as a whole, but the standalone character "cave" is not intuitive if left unsplit. This is a question of finding a balance point. Balance is maintained overall, and a few locally unsatisfactory cases are hard to avoid. The examples below are all special cases, and they are also the difficult ones.
1. Splitting rules must reflect the natural feel of splitting while also limiting its arbitrariness. However, the limits on which word beams are selected mean this hope cannot be fully realized. For example:
Inner feelings (
Shu
), decline (
One
), quiver (
Mouth day shellfish) ... (correct, but be not easy to expect for the first time)
Inner feelings is (among the Tou
), quiver (Tou returns a day shellfish) ... (mistake, because of " in ", " returning " be not the word beam)
Inner feelings (Tou mouth Shu
), (Tou mouth one declines
), (Tou mouth mouth shellfish) quivers ... (mistake, the minimum principle of violation disassembly principle)
Meeting (
), fowl (humane Qian Si), food (
Ji
) ... (correct, but be not easy to expect for the first time)
Meeting (people two Si), fowl (
Yi Qe Si),
Food (people Dian Ji
) ... (mistake, the minimum principle of violation disassembly principle)
Fortunately, such cases are very few; the above are merely examples.
2. A small number of Chinese characters are written from the center outward to both sides but are split by taking from left to right. For example:
(twenty mouthful of swallow
An ancient type of spoon Xiangxi), ( is from ending Fan) , Qi (Tou cutter bifurcation in the sixth of the twelve Earthly Branches eight for a one-legged monster in fable
) ... (correct writing order)
Swallow (twenty
Mouthful Xiangxi), a one-legged monster in fable (
End order Fan) , Qi (
Cutter Shu
) ... (correct split)
3. A very small number of Chinese characters can be handled with a simpler, more concise split. For example:
Brother (Yu Koukou) ... (the simpler, more concise split)
4. Writing a character the wrong way is a common phenomenon when splitting Chinese characters. For example:
Long (Pie
), beautiful (Ha king is big), lung (a month towel) ... (correct: follows the correct way of writing)
Long (
Pie ), beautiful ( ), lung (month Tou towel) ... (wrong: an incorrect way of writing leads to an incorrect split)
Should such errors be tolerated? No. Tolerating them would not only allow the mistake to persist, it would also make the whole splitting and input system unclear, cause confusion, and ultimately increase the mental burden.
5. A very small number of Chinese characters have such unusual structures that they are hard to distinguish visually, and most people write them wrongly. Errors of this kind can be tolerated. For example:
Awkward (nine
Ware), a word used for translation (nine people
) , Hideaway (
) ... (correct, fault-tolerant way)
Awkward ( Yin
Ware), a word used for translation ( Yin
The people
) , Hideaway (Xia
) ... (correct, normal mode)
For the two characters of "embarrassment", the author tested 10 people (all with a university education or above). Surprisingly, not one wrote them correctly (all of them treated "In-particular" as "nine"). For "beggar", the author also tested 10 people, who copied the character with it placed in front of them; only 1 wrote it correctly at the first attempt.
6. Splits that are not intuitive are listed individually. For example:
Sparrow (
Ren Dian
) , Yang (day ten thousand
), Cong (Ye Ha does again), Shou ( Ya mouth is very little), Is-dated ( Shu
Eight), Fang (side
second) , Huan (
Ren Dian people), Ami (
) , You (
Contraband
), Stay (one
The Contraband field), Shi (wood ten
) , ?(He ) , Suo (Tou one
) , With (
One
Eight)
7. A very small number of pictographic Chinese characters can be handled specially and are listed separately. For example:
Protruding (Shang), recessed (Qian), Tortoises (Pie Yin) , Strider (Kou Ri Yin), Yuan (women Si Si is big), As (Dian power
Xiangxi), be (Zhao
Xiangxi)
3.2 key position mapping
The structural blocks of Chinese characters have many features, and the form artistic conception is a feature that the human brain absorbs and abstracts naturally. The present invention analyses the structural blocks according to this feature and obtains 38 artistic conception categories, then merges them into 26 classes corresponding to the 26 letters according to the principle of related artistic conception and complementary positions. The results are shown in Table 1. The corresponding letter in Table 1 is called the classification code of the word beam; upper-case and lower-case letters refer to the same key.
Table 1 word beam classification codes
The identification code is the first pinyin letter of the word beam; a word beam that has no pronunciation takes a as its identification code. The identification codes of the word beams are listed in Table 2.
Table 2 word beam identification codes
3.3 code taking method
The code taking method of the present invention is as follows. For a single character, the classification codes of its word beams are taken in order; if the code length is less than three, the identification code and branch are added; if it is less than four, a space is added; if it exceeds four, the first three codes and the last code are taken. In a two-character phrase, each character takes its first two codes. In a three-character phrase, the first character takes its first two codes and each of the last two characters takes its last code. For phrases of four or more characters, the last code of each of the first three characters and of the last character is taken. Examples of code taking for single characters and phrases are given in Table 3.
Table 3 code taking examples
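The single-character rule above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `take_code` and its arguments are hypothetical, the identification code is passed in rather than looked up (Table 2 is not reproduced here), and the "branch" mentioned in the translated text is not modelled because its meaning is unclear.

```python
def take_code(class_codes, id_code):
    """Assemble the input code of a single character (illustrative sketch).

    class_codes: classification codes of its word beams, in splitting order.
    id_code:     identification code, appended only when there are fewer
                 than three classification codes (assumed to be supplied
                 by the caller).
    """
    codes = list(class_codes)
    if len(codes) < 3:
        codes.append(id_code)            # short codes are extended by the identification code
    if len(codes) > 4:
        codes = codes[:3] + codes[-1:]   # keep the first three codes and the last code
    code = "".join(codes)
    if len(code) < 4:
        code += " "                      # a space completes codes shorter than four
    return code
```

For instance, a character with five word beams coded `a b c d e` would be entered as `abce`, while one with two word beams `d o` and identification code `v` would be entered as `dov` followed by a space.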
A Chinese character made up of one word beam is called a single-beam word, one made up of two word beams is called a twin-beam word, and one made up of three or more word beams is called a multi-beam word. The following 31 word beams, which occur most frequently in forming characters, are called high word beams ("Gao Ziliang").
The people Fu month
Mountain stone worm Xin soil Rolling day Yan Yan Http Lv mouth corpse Jin Jin wood
Chi Quan Ren king Epileptic Rui woman He Si Si
For a twin-beam word, after the classification codes have been taken, the identification code of the first word beam is taken. If the first word beam is a high word beam, the identification code of the tail word beam is taken instead. If both the first and the tail word beams are high word beams, v is used as the identification code. Alternatively, the identification code of a twin-beam word may be the first pinyin letter of the whole character.
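The choice of identification code for a twin-beam word can be sketched as below. The function name, the `high_beams` set and the `id_of` lookup are hypothetical stand-ins for the list of high word beams and for Table 2; the optional alternative of using the pinyin initial of the whole character is not covered.

```python
def twin_beam_id(first_beam, tail_beam, high_beams, id_of):
    """Pick the identification code of a twin-beam word (illustrative sketch).

    high_beams: set of word beams treated as high word beams (assumed input).
    id_of:      mapping from word beam to its identification code (assumed input).
    """
    first_high = first_beam in high_beams
    tail_high = tail_beam in high_beams
    if first_high and tail_high:
        return "v"                  # both beams are high word beams
    if first_high:
        return id_of[tail_beam]     # skip the high first beam, use the tail beam
    return id_of[first_beam]        # default: identification code of the first beam
```

The effect is that a very frequent word beam is never used to disambiguate when a rarer one is available.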
For common characters the latter part of the code can often be omitted, because the character is already displayed at the front of the candidates; it can then be committed directly with the space bar. To reduce finger movement, any candidate that would be committed with 2 can instead be committed with the period key ".", and any candidate that would be committed with 3 can instead be committed with the "/" key.
For numeric keypads such as mobile phone keypads, the letter codes themselves are unchanged. The correspondence between letters and digit keys follows the key layout of the phone.
4 differences and effect
Compared with the background technology of application number 95104165.7 (hereinafter "the background technology"), the present invention has a more complete system of splitting rules, a more reasonable choice of word beams, a more scientific correspondence between word beams and key letters, a detailed list of identification codes, a scope of application extended to the GBK character set, and more user-friendly code taking rules. Specifically, as regards splitting rules, the background technology gives only a brief general description and a small number of splitting examples; it provides no systematic treatment of complex character structures, does not summarise the splitting rules, does not classify the splitting types systematically, and does not single out the difficult problems in splitting. The present invention provides a complete, systematic treatment of complex character structures, summarises the splitting rules comprehensively, reduces the splitting types to 19, and treats the difficult problems in splitting separately. These are very rare innovations and breakthroughs.
As regards key position mapping, the background technology lists 346 word beams, mainly for the GB2312-80 character set. The present invention removes 3 of them and adds 104, so that it lists 447 word beams, and the scope of application is extended to the GBK character set. For the mapping of classification codes to key letters, the work done by the present invention is mainly extension, supplementation and optimisation. For the mapping of identification codes, the background technology uses only descriptive language and gives no explicit list, whereas the present invention lists them in detail and removes the background technology's rule that a word beam closely resembling a letter takes that letter as its identification code. Although the modifications made on the basis of the background technology may not seem especially conspicuous, they are the only choices that ensure this encoding scheme is an optimal one, and they are very rare innovations and breakthroughs.
As regards the code taking method, the present invention adds an explanation of code taking on numeric keypads such as mobile phone keypads; otherwise there is no difference.
In summary, compared with the background technology and with other keyboard shape codes, the splitting rules, the chosen word beams, the letters assigned to the word beams and the code taking method of the present invention have all reached an undeniable uniqueness, so that the present invention may become the best form of Chinese character shape code. This is the substantive breakthrough and innovation of the present invention. The present invention, the image code, has the remarkable effect of corresponding with the mind and accompanying the spirit, uniting form and spirit, coming to mind naturally and never being forgotten.
5 embodiments
This section sets out the research approach and steps behind the present invention, which should help the reader judge whether the present invention is the optimal form of character shape coding.
5.1 the derivation of classification code
First step: split the Chinese characters according to natural perception, that is, according to the three splitting principles and 19 splitting types described above, then classify the results by form artistic conception. This yields 38 relatively distinct form artistic conception categories; the results are shown in Table 4.
Second step: count the occurrences of each category in the first, second and end positions of Chinese characters. A word beam (or a class of word beams) may appear in the first position of a character, as "mouth" does in "mouth" and "stinging"; in the second position, as "mouth" does in "button" and "turning"; or in the end position, as "mouth" does in "button", "bright" and "melting". The number of characters in which "mouth" appears in the first position is called its first-position occurrence count; the number in which it appears in the second position is its second-position occurrence count; and the number in which it appears in the end position is its end-position occurrence count. The "mouth" in "mouth" is counted only as a first-position occurrence; the "mouth" in "button" is counted both as a second-position and as an end-position occurrence. A position occurrence count divided by the total number of characters in the range studied is called the position occurrence rate.
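The counting convention in the second step can be made concrete with a small sketch. The `decompositions` mapping (character to its ordered list of word beams) is a hypothetical input; the real decompositions come from the splitting rules and are not reproduced here.

```python
from collections import Counter

def position_counts(decompositions):
    """Count first-, second- and end-position occurrences of each word beam.

    decompositions: mapping from a character to the ordered list of its
    word beams (assumed input, each list non-empty).
    """
    first, second, end = Counter(), Counter(), Counter()
    for beams in decompositions.values():
        first[beams[0]] += 1
        if len(beams) >= 2:
            # In a two-beam character the same beam is both second and last,
            # so it is counted in both tallies, as the text describes.
            second[beams[1]] += 1
            end[beams[-1]] += 1
    return first, second, end

def occurrence_rate(count, total_characters):
    """Position occurrence rate: count divided by the size of the studied set."""
    return count / total_characters
```

A single-beam character therefore contributes only to the first-position tally, matching the treatment of the "mouth" in "mouth".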
Take the GB2312-80 character set as an example. This character set has 6763 Chinese characters (hereinafter "the 6763 Chinese characters"), divided by frequency of use into level-1 (common) and level-2 (less common) characters: 3755 level-1 characters and 3008 level-2 characters. If the word beams represented by each of the 26 letters were spread evenly, the ideal occurrence count per letter in the first or second position would be 144 (3755/26) in the 3755 characters, 116 (3008/26) in the 3008 characters, and 260 (144+116) in the 6763 characters.
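The ideal values quoted above follow from dividing each character count evenly over the 26 letters and rounding. A short check, with the helper name `ideal_count` being an illustrative choice:

```python
LETTERS = 26

def ideal_count(num_characters, letters=LETTERS):
    """Occurrences per letter if word beams were spread evenly (rounded)."""
    return round(num_characters / letters)

level1 = ideal_count(3755)   # level-1 characters of GB2312-80
level2 = ideal_count(3008)   # level-2 characters of GB2312-80
combined = level1 + level2   # the text adds the two rounded values
```

Rounding 6763/26 directly gives the same combined figure as adding the two rounded values.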
Columns 3, 4 and 5 of Table 4 give the occurrence counts of the 38 word beam categories in the first, second and end positions in the 6763 Chinese characters.
Table 4 the 38 form artistic conception categories of word beams
Annotate: " soil ", " worker ", " king ", " life ", " just ", " car ", " fish ", " horse ", " bird ", " standing ", " ending ", " already ", " son ",
" and ", " ear ", " " be as left avertence when other, its last horizontal stroke often become and carry (as " "); " wood ", " fork-like farm tool used in ancient China ", " standing grain ", " Bian ", " rice ", " husband ", " fire ", " shellfish ", " literary composition " be as left avertence when other, and it last one is pressed down and often become point (as " machine "); " nine ", " several ", " youngster ",
When other, crotch wherein often becomes carries (such as “ Dove as left avertence for " seven ", " hair ", " " "); " Shui ", " plumage ",
the hook is sometimes omitted (as in "rhinoceros"). These deformed word beams are not listed separately.
Third step: pairing, placement and code assignment. There are more than 26 word beam categories but only 26 letters on the keyboard, so the categories must be paired. The general principle of pairing is that positions are complementary and artistic conceptions are related. The concrete steps are as follows.
1. Categories (1) to (14): their first-position and second-position occurrence counts are either close to the ideal value or not too far from it, so they can be fixed first. Except for (11), each has found a representative letter related to its form artistic conception (listed in the left column).
2. Category (15): the first-position count exceeds the ideal value considerably, while the second-position count is too small. It should be paired with a category whose first-position count is small and whose second-position count is large; (33) to (38) meet this condition. From the point of view of form artistic conception, however, (34), (35), (36) and (37) already have their best pairings (explained in turn below), which leaves (33) and (38). On balance, (38) is more suitable. The three points of (15) correspond to the three vertices of W; (38) has the shape of two hands encircling, and the letter W also suggests two hands encircling. So (15) and (38) can be fixed together, with W as the representative letter.
3. Category (16): the first-position count exceeds the ideal value, the second-position count is close to the ideal value, and the end position still has considerable room. The categories that can fill this room are (33), (34), (35), (36) and (37), but only (36) pairs harmoniously with (16). Moreover, (16) has some correlation with N (it has two verticals), and n is the best representative letter for (36). So (16) and (36) can be fixed together, with N and n as the representative letters.
4. Category (17): the first-position count exceeds the ideal value, and the second and end positions still have some room. Among (33) to (38), only (37) is the best fit. Both have a form that forks and extends downward; representing them with the letter R, whose single leg extends outward, feels comfortable, so they can be fixed.
5. Category (18): the first-position count is comparable to the ideal value, and the second and end positions have large vacancies. Among (30) and (33) to (38), only (34) is the best fit. (18) is related in form to T, and (34) matches the form of t. So (18) and (34) can be fixed together, with T and t as the representative letters.
6. Category (19): the first-position count is comparable to the ideal value, and the second and end positions have vacancies. Among (30) and (33) to (38), after excluding (30), (34), (35), (36), (37) and (38), which have their best pairings (already explained or explained shortly), only (33) remains. (33) has some correlation with the letter Q. So (19) and (33) can be fixed together, with Q as the representative letter.
7. Category (20): the first-position and second-position counts are both close to the ideal value, and the end position has a large vacancy. Among (33) to (38), only (35) pairs harmoniously. (20) has a pot-lid form, which agrees with the form of the letter M; the form of (35), like the character "four", agrees with a closed M. So (20) and (35) can be fixed together, with M as the representative letter.
8. Category (21): the first-position count is close to the ideal value, and the second and end positions have vacancies. Among (30) to (38), only (30) pairs harmoniously. The form of (21) is a horizontal stroke with a stroke hanging down and sweeping to the left, which agrees with the letter F; (30) is a "ten"-shaped cross with a stroke sweeping to the left, which agrees with the letter f. So (21) and (30) can be fixed together, with F and f as the representative letters.
9. Category (22): the first position has a small vacancy, and the second-position and end-position counts are close to the ideal value. Among (28) to (32), only (32) is the best fit. Both are related in form to the letter P. So (22) and (32) can be fixed together, with P as the representative letter.
10. After categories (24) and (29) are paired, the first-position, second-position and end-position counts are all close to the ideal value or not too far from it. (24) matches the form of B, and (29) is consistent with the stroke order of b. So (24) and (29) can be fixed together, with B and b as the representative letters.
11. For (23), (25) and (26), the second-position counts are close to the ideal value, and the first and end positions have vacancies. For (31), the first-position count is larger than the second-position and end-position counts. For (27), the first-position, second-position and end-position counts differ little. For (28), the first-position count is large, the second-position count is small, and the end position is empty.
Generally speaking, the differences among the position occurrence counts of (31), (27) and (28) are not highly significant. Considering position occurrence counts alone, therefore, no strongly convincing pairing scheme can be found. Nor is there any correlation in form artistic conception to speak of among these 6 categories. Considering instead whether the form artistic conceptions conflict, (26) does not conflict with (28), and (25) does not conflict with (27). The pairing of (23) with (31) is still slightly unsatisfactory. Could (31) then be added to, or exchanged with, one of the categories already fixed? A search through all the categories finds only (12) worth considering, and it still does not feel ideal. In the end the pairings are fixed as arranged in Table 4. The left column gives the representative letters: the horizontal stroke in G conveys a horizontal feel; U stands for a rotated U shape; K stands for a tilted K shape.
At this point the 38 artistic conception categories have been merged into 26 classes. Of the 26 letters only C has not been used, and (11) has not yet been assigned a letter, so it goes without saying that (11) is represented by C.
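The outcome of the third step can be recorded as data and checked for consistency. In the sketch below only the pairings and letters stated in the text are filled in; the three pairings for which the text does not say which of G, U and K applies are given `None`, and the letters of categories (1) to (14) other than (11) are left out because Table 4 is not reproduced. All names are illustrative.

```python
# Categories (1)-(14) each form a class of their own.
SINGLES = list(range(1, 15))

# Pairings described in steps 2-11, with the representative letter where stated.
PAIRS = {
    (15, 38): "W", (16, 36): "N", (17, 37): "R", (18, 34): "T",
    (19, 33): "Q", (20, 35): "M", (21, 30): "F", (22, 32): "P",
    (24, 29): "B",
    (26, 28): None, (25, 27): None, (23, 31): None,  # letter not stated per pair
}

def merged_classes():
    """Return every class as a tuple of category numbers."""
    return [(c,) for c in SINGLES] + list(PAIRS)

def covers_all(classes, n_categories=38):
    """True if each category 1..n appears in exactly one class."""
    flat = sorted(c for cls in classes for c in cls)
    return flat == list(range(1, n_categories + 1))
```

Fourteen single-category classes plus twelve pairs give the 26 classes, one per letter, with every one of the 38 categories used exactly once.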
5.2 the screening of word beam
The more often a word beam occurs, the more compact its structure, and the smaller the area it occupies in a character, the more distinct its form artistic conception tends to be, the more readily people treat it as a splitting unit, and the less likely they are to break it up when splitting a character. This is exactly the advantage that shape coding seeks in choosing splitting units. But not every case is so ideal. For example, a person who has never learned any shape code may split "stone" into a branch-shaped part and "mouth", yet split "building" into "Shi Shishi". The former happens because the ring-shaped "mouth" in "stone" differs greatly in appearance from the branch-shaped part; the latter happens because "stone" occupies a small area in "building". On the one hand, natural perception must be followed; on the other hand, the splitting units must also be fixed. The word beams must therefore be screened.
Several representative word beams are selected here, and the analysis of whether to add or drop them illustrates the standards and yardsticks by which word beams are chosen.
1. " Dian Tou dies in classification
Upright
Six literary compositions also
The side
Don't you see " suffering ", " clothing ", " product ", " last of the twelve Earthly Branches ", " profound " extensively ", what reason?
There are three reasons for not listing "suffering". First, few of the 6763 Chinese characters contain "suffering" (suffering is debated the peppery lobe pigtail guilt of distinguishing and taken leave the government official and ward off zinciolate Xin Zi). Second, the area that "suffering" occupies in a character is rather large. Third, when "suffering" is split into "standing" and "ten", its code is df, and df is almost vacant in the code table. The reasons for not listing "clothing", "product", "last of the twelve Earthly Branches" and "profound" are similar.
2. Can "chief of a tribe" be removed from the splitting units?
"Chief of a tribe" occupies a rather large area in a character, and the "mouth" in it is especially conspicuous, so one would like to take it out. But actually splitting "chief of a tribe" proves especially thorny: splitting it into "Ha" and "tenth of the twelve Earthly Branches" is not intuitive, and splitting off one component leaves a remainder for which there is no place. Fortunately, the structure "honor" supplies an additional reason for not splitting "chief of a tribe". The first two codes of "honor" are vj, and only two uncommon characters, "malignant boil" and "disease", begin with vj. If "chief of a tribe" is not split, "honor" and "abiding by" fill exactly this gap and free up valuable vo space (many common characters begin with vo).
3. A person who has never learned any shape code readily splits "capital" into "Tou", "mouth" and "little", but the present invention lists the combined "Tou" and "mouth" structure as a splitting unit. How is this explained?
There are four reasons for listing it as a splitting unit. First, many of the 6763 Chinese characters contain it (whale enjoy prosperous Guo cook who ripe honest exterior feature decline the bright cream of inner feelings booth milli person of outstanding talent quiver sad height strike the capital just the frightened Dun Hao of alcohol do plunder the punt-pole scape shadow that dries in the air and sincerely forgive alpine rush or palm-bark rain cape and groan howl pick and stop the pure original text of the cold cold of the fine jade quail private school Dui granary of having a strong smell and report the awake newly-risen sun white of the high big and heavy stone austere of the vulture Guo Roripa wormwood artemisia ligusticumic Hao rammer einsteinium graceful thin white silk used in ancient China that transmutes of the withered large-leaved dogwood rent of outer coffin rafter purlin Hao swinging Hao of rewarding with food and drink of kicking in the least). Second, the area it occupies in a character is small (about 1/4 of the character). Third, if it were split into "Tou" and "mouth", its code would be do, and do is already crowded in the code table (even without the do formed by "Tou" and "mouth" there are more than 50). Fourth, if it is not split, 28 characters go from a code length of 4 to a code length of 3, and in addition it can form the codes db, dg, dh, dm, dr, du and dy with the structures attached to it, all of which fill vacancies to varying degrees. Similar cases include "stone" and "shellfish".
[Special note]
Of the structures "chief of a tribe", "stone", "west" and "tenth of the twelve Earthly Branches", the author pondered for ten years whether to split the first three. On the basis of these ten years of use, the author considers it better not to split them.
4. "Arrow" is not listed as a splitting unit, while "mistake" and "Zhu" are. How is this explained?
44 of the 6763 Chinese characters contain "arrow" (vow rectify short know the short short of intelligence square besides pheasant Ju family spider dust suffer bunch sound of sighing study doubt the marquis and cure that disease is silly hate the monkey larynx and coagulate Yi watchtower in ancient times an ancient plucked stringed instrument Ei a small bundle of straw, etc. for silkworms to spin cocoons on puncture arrowhead screen wart solid food Gou pig a word used in place name of waiting of hesitating of whistling to a dog). Judging by the area "arrow" occupies in a character and the compactness of its structure, there is a temptation to list it as a splitting unit, but for the following two reasons it is better not to.
First, in the code table associated with the present invention, counting the code qa of "vow ( big)", there are 32 characters whose first two codes are qa. This deviates from the ideal value (10) by 22, which satisfies the requirement that the maximum deviation not exceed 30 (see the article "Discussion of methods for evaluating the merits of computer Chinese character encodings", to be published in the "Journal of Chinese Information Processing").
Second, if "arrow" were listed as a splitting unit, the most natural place for it would be together with "people goes into big day Fu Shi Bo that dies young of fiery shellfish", but that would reduce the clarity and distinctiveness of the artistic conception of that group.
If "mistake" and "Zhu" were not listed as splitting units, they would be split as "mistake (Pie husband)" and "Zhu (Pie is not)", and when typing characters such as "order" and "very" without looking at a source text, capturing them feels noticeably difficult. As single branch-shaped units, "mistake" and "Zhu" each have only one visual focus, and the stimulus formed by a light, short left-falling stroke is too weak to pull the brain's attention away from that focus. It is like seeing or thinking of a person: in that instant the focus is the face, and one does not first notice the person's left ear unless that ear is especially conspicuous. Several years ago the wish to preserve the clarity and distinctiveness of the artistic conception was stronger and this problem was not understood deeply enough, so "mistake" and "Zhu" were not listed as splitting units. At present the view that they should be listed has clearly gained the upper hand.
5. Why is "shellfish" a splitting unit while "opinion" is not?
Nearly 200 of the 6763 Chinese characters contain "shellfish", and in 35 of them "shellfish" is in the first position. If "shellfish" were split into " " and "people", its code would be UA, and 53 (35+18) characters in the code table associated with the present invention would then have UA as their first two codes. This deviates from the ideal value (10) by 43 and seriously exceeds the requirement that the maximum deviation not exceed 30. "Shellfish" should therefore undoubtedly be listed as a splitting unit.
Only 30 of the 6763 Chinese characters contain "opinion", and the only one in which it is in the first position is "opinion" itself. "Opinion" occupies a large area in a character, is structurally loose, occurs rarely and has a composite form artistic conception, so it is not suitable as a splitting unit.
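The deviation test applied to "arrow" (32 characters under qa) and to "shellfish" (53 characters under UA) can be written as a small check. The ideal value of 10 and the limit of 30 are the figures given in the text; the function names are illustrative.

```python
IDEAL = 10          # ideal number of characters sharing the same first two codes
MAX_DEVIATION = 30  # the deviation must not be greater than this

def deviation(count, ideal=IDEAL):
    """How far a two-code prefix is from the ideal number of characters."""
    return abs(count - ideal)

def acceptable(count, ideal=IDEAL, limit=MAX_DEVIATION):
    """True if the prefix stays within the permitted deviation."""
    return deviation(count, ideal) <= limit

qa_ok = acceptable(32)        # "arrow" left unlisted: deviation 22
ua_ok = acceptable(35 + 18)   # "shellfish" if it were split: deviation 43
```

A word beam is a strong candidate for a splitting unit when splitting it would push some two-code prefix past the limit, as in the second case.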
6. The structure concerned occurs in only one basic pattern, "head", and is involved in only 6 of the 6763 Chinese characters (first road Guo a one-legged monster in fable thoroughfare bows). Why is it listed as a splitting unit?
First, the two visual focuses in "head" are very clear: one is this structure and the other is "order". Second, this structure belongs to a distinct form artistic conception. The case for listing it as a splitting unit is therefore very strong.
7. One structure occurs in only two of the 6763 Chinese characters (drag and drag). Can it be removed from the splitting units?
The form artistic conception to which it belongs (shoot a retrievable arrow by the Jian dagger-axe) is very distinct; removing any one member would greatly increase the mental burden. Compare:
R ← Jian dagger-axe is shooted a retrievable arrow (none missing: the thinking and memory load is smallest)
R ← Jian dagger-axe is shooted a retrievable arrow (one missing: the thinking and memory load increases)
R ← Jian dagger-axe is shooted a retrievable arrow (two missing: the thinking and memory load is greater still)
11 of the 6763 Chinese characters contain "north" (the well-behaved swallow in back of the body Ji, north takes advantage of surplus Sheng Bei thoroughbred horse to stick one piece of cloth or paper on top of another). Although "north" is structurally a separated type, it is strongly cohesive and is readily treated as a whole, so there is a strong wish to list it as a splitting unit. The problem is where to place "north" once it is listed. The most natural place would be together with "Xin is non- Sheet Foretell slit bamboo or chopped wood Jiu Zhuang The river Shu industry Zhi Shang Shang", but the fit in artistic conception is still somewhat unsatisfactory. So for the time being "north" is not listed as a splitting unit.
One further structure is compact and by rights should be listed as a splitting unit, but there is no suitable position for it in the mapping table. Only one of the 20902 Chinese characters of the GBK character set, "to face", contains it, and there it is in the end position; once the two preceding word beams have been entered (that is, once h and k have been typed), "to face" is already displayed, so whether this structure is listed as a splitting unit matters little. The scheme of the present invention splits it into "mouth" and "Shu".
5.3 FAQ about code taking
1. The first pinyin letter is a non-artistic-conception feature that costs more mental effort. Why is it still used in code taking?
Where the artistic conception features, which cost little mental effort, are insufficient, other features must be found to supplement them. The first pinyin letter, placed after the classification code as the identification code, is such a supplement. Multi-beam words need no identification code, and most twin-beam words can omit the identification code and be entered in brief-code form. The identification code of a single-beam word occupies the more important second position and cannot be omitted; but because a single-beam word has only one word beam, the stimulus is large in area and long in duration, and the classification code and identification code sit side by side and readily form a single perceptual whole, so the drawback of this feature does not show.
2. Why does the identification code of a twin-beam word not give priority to the first pinyin letter of the whole character?
First, this avoids the trouble of not knowing how the character is pronounced. Second, a word beam occurs more frequently than a whole character; the higher the frequency, the more often the stimulus is repeated and the faster the response naturally becomes.
3. Why is the identification code of a twin-beam word not taken from a fixed position?
If the identification code of a twin-beam word were taken from a fixed position (for example the tail word beam), the 26 high word beams could be abolished and the coding rule for twin-beam words would become simpler. But duplicate codes would then increase. The 26 high word beams may seem hard to remember, but this worry is unnecessary: because high word beams occur so frequently, one generally knows them by feel. Moreover, in actual typing the identification code of a twin-beam word is seldom used, and in case of doubt one can glance at the prompt line.
The specific reasons why the identification code is not taken from a fixed position are as follows.
" Chinese ", " you's " classification code is identical, and the lead-in beam is identical, " ", the classification code of " bundle " is identical, and the lead-in beam is identical, this explanation identification code should not be fixed on the first place.
The classification code of " this ", " mark " is identical, and tail word beam is identical, and this explanation identification code should not be fixed on the tail position.
In " ten " in " leaf ", " gulping down " " my god " more characteristic than " mouth " to the stimulation of human brain, be easier to cause that human brain notes.But their position is sometimes in the first place, and sometimes in the tail position, this explanation identification code should not fixed position.
4. What should be noted when taking codes for phrases?
Although phrases of four or more characters take one code per character, in practice few people can enter Chinese characters this way. The reason is that the speed at which the human brain captures a character shape cannot keep up with the rhythm of one code per character. Therefore, in a three-character phrase the first character takes its first two codes. A four-character phrase such as "undoubtedly", besides being coded as a four-character phrase, has also been split into two two-character phrases, "having no" and "query". Compared with one four-character phrase, two two-character phrases need twice as many keystrokes, but they fit the natural rhythm, the reaction is fast, and typing feels pleasant.
Four-character phrases could also let the first character take its first two codes and the remaining three characters take two codes in total.
5. How is "speech" in traditional Chinese characters handled?
In the GBK character set, "speech" is treated as a word beam only when it appears on the left side of a complete structure and can be changed into "Yan". For example:
Framed (Yan workman people: LIAA) Prison (the big Dian of Quan Yan: SLAD) (" Yan " is the word beam)
Falsely accuse (Yan workman people: LIAA) prison (the big Dian of Quan Yan: SLAD) (" Yan " is the word beam)
Letter (two mouthfuls of Ren Tou: ADGO) (" speech " is not the word beam)
[Special note] The handling of "speech" is a difficult point. How the structure "speech" should be handled for the GBK or BIG5 character set still awaits further observation.
6. How is the code-taking rule adapted to a numeric keypad (such as a mobile phone keypad)?
At present, the alphabetical arrangement of letters on a numeric keypad is completely different from that of the standard computer keyboard. People now use both computers and mobile phones very frequently, yet the letter positions familiar from the computer keyboard cannot be found on the phone, which is a burden on the human brain. The author therefore advocates changing the order of the English letters on the numeric keypad to match the standard computer keyboard.
Entering Chinese characters on a keypad is the same as entering them on a full keyboard. For example, the code for "China" is OHOU; the keys pressed on a phone keypad are 6468, which is also OHOU, because the letters M, N, O are on the "6" key, the letters G, H, I are on the "4" key, and the letters T, U, V are on the "8" key. After 6468 is entered, the screen displays:
1. in Chinese 2. mid-terms 3. totally 4. loyalties 5. dismiss 6. and choke
Press the 0 key and then the 1 key, and "China" is put on screen.
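The letter-to-digit step in this example can be sketched as follows. This is only an illustration under stated assumptions: it uses the conventional phone layout that the text relies on (M, N, O on 6; G, H, I on 4; T, U, V on 8), and the names `KEYPAD`, `to_keypad_digits`, `candidates`, and the sample lexicon are hypothetical. Only the pair "China" / OHOU comes from the text; the other lexicon entries are placeholders.

```python
# Conventional phone keypad layout (assumed; the text confirms only keys 4, 6 and 8).
KEYPAD = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}
LETTER_TO_DIGIT = {
    letter: digit for digit, letters in KEYPAD.items() for letter in letters
}


def to_keypad_digits(code: str) -> str:
    """Translate a letter code such as 'OHOU' into the digit keys to press."""
    return "".join(LETTER_TO_DIGIT[letter] for letter in code.upper())


def candidates(digits: str, lexicon: dict[str, str]) -> list[str]:
    """Return every lexicon entry whose letter code maps to the given digits."""
    return [
        entry for entry, code in lexicon.items()
        if to_keypad_digits(code) == digits
    ]


# Hypothetical lexicon: only "China" -> "OHOU" is taken from the text.
SAMPLE_LEXICON = {
    "China": "OHOU",
    "placeholder-a": "MIMT",  # made-up code that shares the digits 6468
    "placeholder-b": "ABCD",  # made-up code on other keys
}

print(to_keypad_digits("OHOU"))
print(candidates("6468", SAMPLE_LEXICON))
```

Because three or four letters share each digit key, several letter codes collapse onto the same digit string, which is why a numbered candidate list is shown after 6468 is entered.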
Claims (1)
1. A computer keyboard Chinese character shape-code input method, which generally has three components: first, the splitting rules; second, a list or description of the mapping between split units and keyboard letters; third, the code-taking method; the invention being characterized in that:
The splitting rules are a complete set of principles and rules that match the natural perception of the human brain, wherein the principles are the writing rule, the principle of clarity, and the minimum principle, and the rules are that units are taken successively in the order perceived and that closely joined parts are not broken apart;
The mapping relations that split unit and key letter are as follows
Classification code keyboard map letter splits unit
A people goes into the big sky of the fiery shellfish husband that dies young and loses
Tony
H Xin non-Guan
Sheet
Foretell slit bamboo or chopped wood
Jiu Zhuang
The river
Shu industry Zhi Shang Shang is protruding
Tuft-of-hair
The sweet Nian Jing of n Lv Nian European-allies
Thirty generation
Several
Nine
Youngster
Their-registered
O mouth mouth
R Mu Pin not Zhu end fork-like farm tool used in ancient China Jian dagger-axe shoots a retrievable arrow
Fu I
Identification code keyboard map letter splits unit
A Bo
Cao
Ji
Xia
Na
Xi
Guan
Jiu
Zhuang
Shang
Uu
亅
Zhao
Yin
Nian European-allies
Jie
Pin
Pan
Ha Chuan Huan ㄑ
マ Qe
Bian
Ji
Yin
Yen
Tuft-of-hair
Their-registered
Fu I
Jie
For-additional
Swastika Swastika
H standing grain fire one
A few first towel of j township Jin jin well nine mortar Jian of a specified duration are lonely own
Jin Si
The k mouth
The upright power of l six fork-like farm tool used in ancient China Dao
P sheet slit bamboo or chopped wood Mi Pie
The q 7,000 and the chief of a tribe
R day people goes into the ninth of the ten Heavenly Stems
The Shen body is given birth to ten scholar's generation, thirty Si water Shui pig Cannibals Woo Shu San on the s stone corpse mountain Chi Rui Xin three
Four Si the sixth of the twelve Earthly Branches of Xiangxi
Hand
Kan Shi
The uncivilian crow of w king ten thousand do not die by five noons
Y Yan also already second also use by dying young again the tenth of the twelve Earthly Branches and the one shoot a retrievable arrow the Yi Contraband
Yan
The code-taking method is:
For a single character, the classification codes of its word beams are taken in order; a code shorter than 3 codes is extended with the identification code and "branch"; a code shorter than 4 codes is padded with a space; a code longer than 4 codes keeps the first three codes and the last code; in a two-character phrase, each character contributes its first two codes; in a three-character phrase, the first character contributes its first two codes and the last two characters each contribute their last code; in a phrase of four or more characters, the last code of each of the first three characters and of the last character is taken; wherein a Chinese character composed of one word beam is called a single-beam character, a Chinese character composed of two word beams is called a twin-beam character, a Chinese character composed of three or more word beams is called a multi-beam character, and the following 31 word beams, which form characters with high frequency, are called high-frequency word beams:
The people Fu month
Mountain stone worm Xin soil cun day Yan Yan Http Lv mouth corpse Jin Jin wood
Chi Quan Ren king Epileptic Rui woman He Si Si
In a twin-beam character, after the classification codes have been taken, the identification code of the head word beam is taken; if the head word beam is a high-frequency word beam, the identification code of the tail word beam is taken instead; if both the head and the tail word beams are high-frequency word beams, v is used as the identification code; of course, the identification code of a twin-beam character may also be the first pinyin letter of the complete Chinese character;
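The selection rule for the identification code of a twin-beam character can be sketched as below. This is a minimal illustration, not the patent's implementation: the function name, the beam names, and the sample identification codes are all hypothetical placeholders, and the alternative of using the first pinyin letter of the complete character is not modeled.

```python
def twin_beam_identification_code(
    head: str,
    tail: str,
    identification_codes: dict[str, str],
    high_frequency_beams: set[str],
) -> str:
    """Pick the identification code of a character made of two word beams."""
    head_is_high = head in high_frequency_beams
    tail_is_high = tail in high_frequency_beams
    if head_is_high and tail_is_high:
        return "v"  # both beams are high-frequency: fixed code v
    if head_is_high:
        return identification_codes[tail]  # skip the high-frequency head beam
    return identification_codes[head]  # default: the head beam's code


# Hypothetical data for illustration only; not taken from the mapping tables.
SAMPLE_ID_CODES = {"beam_a": "s", "beam_b": "k", "beam_c": "m"}
SAMPLE_HIGH = {"beam_b", "beam_c"}

print(twin_beam_identification_code("beam_a", "beam_b", SAMPLE_ID_CODES, SAMPLE_HIGH))
print(twin_beam_identification_code("beam_b", "beam_a", SAMPLE_ID_CODES, SAMPLE_HIGH))
print(twin_beam_identification_code("beam_b", "beam_c", SAMPLE_ID_CODES, SAMPLE_HIGH))
```

The three calls cover the three branches of the rule: an ordinary head beam, a high-frequency head beam, and two high-frequency beams.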
The latter part of the code of a common character can often be omitted, and the character can be displayed in advance at the front of the candidates, in which case it can be sent directly with the space bar; for convenient finger movement, any candidate that would be put on screen by selecting 2 can instead be chosen with the period key ".", and any candidate that would be put on screen by selecting 3 can instead be chosen with the "/" key;
For a numeric keypad such as a mobile phone keypad, the input code is the key on which the corresponding English letter is printed.
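The length and phrase rules of the code-taking method can be sketched as follows. This is a hedged reading of the machine-translated claim rather than a definitive implementation: it assumes that "last code" means the final letter of a character's code, it leaves out the padding with the identification code, "branch", and the space, and the function names and sample codes are hypothetical. The per-character codes "OH" and "OU" are placeholders chosen so that the two-character result matches the code OHOU that the description gives for "China".

```python
def single_character_code(classification_codes: str) -> str:
    """Keep at most four codes: the first three plus the last one."""
    if len(classification_codes) > 4:
        return classification_codes[:3] + classification_codes[-1]
    return classification_codes


def phrase_code(character_codes: list[str]) -> str:
    """Combine per-character codes into one phrase code."""
    count = len(character_codes)
    if count < 2:
        raise ValueError("a phrase needs at least two characters")
    if count == 2:
        # Each character contributes its first two codes.
        return "".join(code[:2] for code in character_codes)
    if count == 3:
        # First two codes of the first character, last code of the other two.
        first, second, third = character_codes
        return first[:2] + second[-1] + third[-1]
    # Four or more: last code of the first three characters and of the last one.
    chosen = character_codes[:3] + [character_codes[-1]]
    return "".join(code[-1] for code in chosen)


print(single_character_code("ABCDEF"))
print(phrase_code(["OH", "OU"]))
print(phrase_code(["ABCD", "EFGH", "IJKL"]))
```

If the original Chinese claim actually specifies the first code rather than the last code for the longer phrases, only the `[-1]` indices in `phrase_code` would need to change.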
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2006100744252A CN101051246A (en) | 2006-04-08 | 2006-04-08 | Computer keyboard shape code Chinese character code input method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2006100744252A CN101051246A (en) | 2006-04-08 | 2006-04-08 | Computer keyboard shape code Chinese character code input method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101051246A (en) | 2007-10-10 |
Family
ID=38782679
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2006100744252A Pending CN101051246A (en) | 2006-04-08 | 2006-04-08 | Computer keyboard shape code Chinese character code input method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101051246A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102163087A (en) * | 2011-03-29 | 2011-08-24 | 陈长俊 | Chinese character shape code input method |
CN102163087B (en) * | 2011-03-29 | 2013-08-07 | 陈长俊 | Chinese character shape code input method |
CN102880301A (en) * | 2011-07-15 | 2013-01-16 | 孙基寿 | Concept code Chinese character input method and keyboard |
CN103558924A (en) * | 2013-11-04 | 2014-02-05 | 汤仁和 | Chinese character encoding method and input keyboard |
CN103558924B (en) * | 2013-11-04 | 2016-07-20 | 汤仁和 | A kind of method of Chinese character coding |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN85101817A (en) | An zijie type Chinese-character stroke computer code's method and keyboard thereof | |
CN101051246A (en) | Computer keyboard shape code Chinese character code input method | |
CN1900886A (en) | Method for single click and multiple key combining click mixing input Chinese and English and keyboard | |
CN1031302C (en) | Associated Chinese Character radical code input method | |
CN1808355A (en) | Chinese harmonic input method | |
CN1295588C (en) | Chinese inputting method and keyboard thereof | |
CN1116634C (en) | Coding method for Chinese spelling characters and keyboard therefor | |
CN1255713C (en) | Chinese characters input method using font and pronunciation | |
CN1725156A (en) | Chinese character input method and keyboard using said method for input | |
CN1387106A (en) | Chinese-character phonetic letter encoding method and its keyboard | |
CN1128398C (en) | Chinese 'Latin-Chinese code' input system | |
CN1908870A (en) | Method and keyboard for mixed inputting English with single button and multiple buttons | |
CN1038366C (en) | Chinese character input system for computer | |
CN1054447C (en) | Coordinate codes coding method for computer Chinese characters input | |
CN1417674A (en) | Chinese syllable double reading scheme, Chinese keyboard and information input and processing method | |
CN1121646C (en) | Character-writing code Chinese character input method for computer | |
CN1150444C (en) | Chinese-character 'letters' input method for computer | |
CN1065973C (en) | Sound speed code Chinese character input system and its input keyboard | |
CN1124539C (en) | Digitalization Chinese character radicals indexing method for computer input and its special-purpose keyboard | |
CN1145097C (en) | Configuration-and-stroke Chinese character input method and its input device and use | |
CN1148635C (en) | Chinese-character 'resection code' encode method | |
CN1231831C (en) | Trisection digital input method | |
CN1164694A (en) | Chinese character base code keyboard computer input method | |
CN1357814A (en) | Computer Chinese keyboard and its Chinese information inputting and processing method | |
CN1447215A (en) | Method for looking up works by inputting two strokes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication | Open date: 20071010 |