CN1154502A - Method and device for ducation standardized inputting Chinese characters by five stroke - Google Patents

Method and device for ducation standardized inputting Chinese characters by five stroke Download PDF

Info

Publication number
CN1154502A
CN1154502A CN 95105931 CN95105931A CN1154502A CN 1154502 A CN1154502 A CN 1154502A CN 95105931 CN95105931 CN 95105931 CN 95105931 A CN95105931 A CN 95105931A CN 1154502 A CN1154502 A CN 1154502A
Authority
CN
China
Prior art keywords
parts
character
key
input
code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 95105931
Other languages
Chinese (zh)
Inventor
王永民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 95105931 priority Critical patent/CN1154502A/en
Publication of CN1154502A publication Critical patent/CN1154502A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The present invention relates to an educational standard five-stroke font Chinese character input method and its device. It is an improvement on the existent five-stroke font Chinese character input method, and is a complete set of fully-new component system, general rules for division of Chiense characters, coding system and fully-new keyboard design which are formed by using principle of ergonomics and adopting such methods of character source analysis, character element induction, conventional definition and system specification, selecting and using traditional radicals of Chinese characters, creating dicode components and completely cancelling character roots made up by inventor-self.

Description

Educational standards the Five-stroke Method input method of Chinese character and device thereof
The invention belongs to the Comnputer Chinese character technical field of information processing, to the omnibearing innovation and the disruptive technology progress of an existing font code Chinese character computer input technology of the Five-stroke Method that widely uses.
" the Five-stroke Method " Chinese character entering technique of domestic and foreign current, comprise application on April 1st, 1985 Chinese patent, obtained No. 85100837.2 patents of invention that Patent Office of the People's Republic of China is authorized on February 26th, 1992, and under the principle of this technical scheme, adjust existing " the Five-stroke Method " technology that popularizes since in March, 1986 a little through inventor's Wang Yongmin.
" the Five-stroke Method " at home and abroad obtained widely to use, become 90% above computer user use is arranged at present, prevailing Chinese character entering technique at home." the Five-stroke Method " not only makes high-level efficiency input Chinese character become a reality at aspects such as journalism, publishing business, office automations, obtain widespread use in the United Nations, Southeast Asia, and, " the Five-stroke Method " also entering into school and family on a large scale, almost become an indispensable technology of the young civilization employment of China's Contemporary.
The development of Science and Technology is a process, and " the Five-stroke Method " also is a technology that is constantly developed, constantly brings forth new ideas to advanced stage by the primary stage.Large-scale application practice over 10 years, when checking " the Five-stroke Method " prior art is as the science of a pioneering invention, creativeness, practicality, progress along with science, the raising of application level and computer popularize and enter the education of middle and primary schools field, found that " the Five-stroke Method " still exists deficiency even major defect at aspects such as standardization, science, practicality, can not satisfy the strong request of education of middle and primary schools Chinese character entering technique study.This explanation, " the Five-stroke Method " be to the innovation and the development of standardization, high level, practicability direction, still unnecessary, also very urgent especially.
The deficiency and the defective of existing for overcoming " the Five-stroke Method " technology, the inventor was from 1986, concentrated on studies through 10 years, extensively listen to the domestic and international user's of all circles suggestion, particularly learn the expert at the famous spoken and written languages of State Language Work Committee multidigit, the information processing technology expert of scientific and technological circle, under computer expert's the direct guidance, adopt theoretical analysis, the means that repetition test and computer Aided Design combine, finally broken through the framework of existing " the Five-stroke Method " technical scheme from five aspects, determined that this a new generation meets the higher level of educational standards and Chinese-character canonical, the input technology scheme and the device thereof of practicability more.
In view of the present invention was finished in nineteen ninety-five, of the present invention abbreviating as " 95 editions the Five-stroke Method and keyboard " thereof or " 95 editions the Five-stroke Methods ".Get the Chinese Pin Yin initial of " standard " and " five ", promptly constitute title abbreviation alphanumeric codes---GWB of the present invention;
In view of " the Five-stroke Method " code input technology of Wang Yongmin invention is called " king's sign indicating number " in the society over 10 years sanctified by usagely, the another kind of abbreviation of the present invention, be called " 95 king's sign indicating number " or " standard Wangma code " again.
Correlation technique of the present invention " the Five-stroke Method " technical scheme that promptly No. 85100837.2 patent constituted also is current existing " the Five-stroke Method " technology of at home and abroad widely using, below unified be simply referred to as " former scheme ".
For describe just, below the present invention is called for short " 95 editions the Five-stroke Methods ", " GWB scheme " or " GWB ".
According to the present invention, describe employed notion of its input method and input keyboard thereof and technical term and be defined as follows:
1, basic stroke: the defined basic stroke of the present invention has 5 kinds, and stroke form and digital code thereof are:
Horizontal: (one,
Figure A9510593100131
), direction is for from left to right, and digital code is 1;
Perpendicular: (Shu, 亅), direction is for from top to bottom, and digital code is 2;
Cast aside: (Pie), direction is upper right to the lower-left, and digital code is 3;
Point (right-falling stroke): (Dian, _), direction is for left to bottom right, and digital code is 4;
Folding: (second), the stroke of band turnover, digital code is 5;
Wherein folding (second) also comprises 23 kinds of following two big classes:
Clockwise direction:
Figure A9510593100132
_
Figure A9510593100133
Ya
Counter-clockwise direction: _ _ _
Figure A9510593100137
Figure A9510593100138
2, font: refer to the topology figure classification of Chinese character:
Left and right sides arrayer is a left right model, digital code 1
The person of being arranged above and below is last mo(u)ld bottom half, digital code 2
Neither about be not again the person of being arranged above and below, be heterozygous, digital code is 3.
In order to extract font information with discrete repeated code, the repeated code of particularly discrete generation when the CJK10646 large character set encode, font also can further be subdivided into the 4-10 kind, and its code name can be represented it with 0~9 when the time comes.
3, district: refer to a zone on the radical divided by the first sum of stroke kind or the components list or a zone on the keyboard, area code from 1 to 5.
4, position: refer to press in a time feature of parts or each district that an end feature is arranged the some keys in 5 keys, item from 11 to 55.
5, parts: parts specially refer to by the present invention preferably as traditional radicals by which characters are arranged in traditional Chinese dictionaries, stroke structure and the distortion thereof of word-building unit or the stroke structure of " likeness in form " with it.The present invention does not adopt " radical " this notion.When mentioning " radical ", also specially refer to " parts " in the present invention according to custom.When mentioning the order that parts split,, still continue to use " root preface " speech for ease of appellation.All one-tenth word persons are " character formation component " in the parts, and the person is " noncharacter radical " or " not character formation component " not become the word.
6, key name: refer among the present invention each 5 in 5 districts representational first parts on totally 25 key positions.
7, master unit, coordination parts: the representational parts that comprise key name are master unit, and are arranged in after the master unit, the distortion of these parts that bracket with bracket or with it the stroke structure of homology, likeness in form be the coordination parts.
8, parts summary table: whole parts of being chosen among the present invention, by each 5 totally 25 in 5 districts, according to master unit preceding, the part distortion of coordination or likeness in form parts bracket in back available (), the traditional font parts can bracket with (), comprise the form that dicode parts, each single are drawn and drawn the form of a stroke or a combination of strokes, the whole strokes of band " second " class and the high frequency word that are composited by single simultaneously.
9, keyboard layout: refer among the present invention with key name, region-position code, whole parts or part parts sign or the design drawing or the synoptic diagram of design on button.The keyboard of manufacturing according to keyboard layout of the present invention, or use the keyboard that disposes in certain system of the present invention all to be called " GWB keyboard ".
10, high frequency word: refer to be arranged at that go up 25 positions of the present invention (key), every (key) 25 Chinese characters one, the most frequently used.In general, the parts on most high frequency words and the key position, place (key) have certain contact.
11, single character code: i.e. the coding of word, refer to by compiling method of the present invention to be the input code of individual character establishment, this input code has parts decomposition, position, three kinds of modes of letter.
12, dictionary: also be word storehouse, corpus, refer to a set of the Chinese-character words that constitutes by 2 above Chinese characters.How much not limitting of entry, the form with data in system is stored in certain space, and is for future reference standby.
13, speech sign indicating number: refer to by compiling method of the present invention to be the input code of the vocabulary establishment of two above Chinese characters formations, this input code has can distinguish parts decomposition, position, three kinds of modes of letter of using separately.
14, two words: refer to the speech formed by 2 Chinese characters.
15, three words: refer to the speech formed by 3 Chinese characters.
16, four words: refer to the speech formed by 4 Chinese characters.
17, multi-character words: refer to the speech formed by 4 above Chinese characters.
18, parts system: refer to the summation of the parts that go out preferred for this invention and in the present invention compatibility, regularity, hamony by the permutation and combination of multiple goal unified demand branch zoning position.
19, solid size parts: refer to that parts only compile the parts of a sign indicating number.
20, dicode parts: refer to that parts have the parts of two sign indicating numbers, the dicode parts also are " dicode radical " sometimes.
21, addressable part: refer to individual character or word are split become after the parts sequence, take out according to code taking rule again and be used to the parts sequence of encoding and importing, generally form by 2-4 parts according to splitting rule.
22, coding scheme: refer to that Chinese character that the present invention works out splits the coding rule of rule, words and the parts is olation of according to this character set that the present invention was suitable for being set up or the word coding method code table of region-position code mode or alphabetic mode, or the word coding method dictionary.
23, space encoder: the possibility that refers to a certain code length coding.The level Four space refers to whole possibilities of 4 yards, and three grades of spaces refer to whole possibilities of 3 yards, and the secondary space refers to whole possibilities of 2 yards, and the one-level space refers to whole possibilities of 1 yard.In the present invention, the secondary space i.e. 25 * 25=625, and the rest may be inferred by analogy.
24, true form sequence: refer to the region-position code (or character code) of addressable part correspondence is from left to right all listed one group of code sequence that the back forms, if any the dicode parts, then 2 of these dicode parts sign indicating numbers are all listed in this sequence.
25, coding process flow diagram: the operation steps synoptic diagram that refers to according to coding rule individual character or word be carried out among the present invention disassembled coding.
26, identification code: i.e. " last stroke character patten intersection identification code ", refer to work as among the present invention a word and tear not enough 4 parts open, thereby during code length less than 4, must after its coding, add one of being composited by " a last code name " of this word and " font code name " " intersect identification code ".How much can deciding of identification code by the font of selecting for use:
When adopting 4 kinds of fonts, be 5 * 4=20 kind,
When adopting 3 kinds of fonts, be 5 * 3=15 kind,
When adopting 2 kinds of fonts, be 5 * 2=10 kind,
When adopting a kind of font, be 5 * 1=5 kind.
Identification code has three kinds can distinguish the mode of using separately, i.e. parts mode, position mode, alphabetic mode.
27, code length: refer to according to compiling method of the present invention, the length of the input code of compiling out for word, speech, when disregarding space bar, this code length equates with the input stroke of corresponding word, speech in the present invention.
28, Chinese character frequency table: be used for calculation key position load, it is according to being the Hanzi frequency count data that State Language Work Committee Mr. Fu Yonghe waits " contemporary Chinese common word table " (Chinese Press's in June, 1989 version) " character data statistical form in the Chinese Character Set Code for Informati baseset " of writing to be provided.
29, compatibility: refer to several parts exist together a key position, when enjoying one and same coding, the degree of holding mutually can be quantified as the influence to repeated code, i.e. the repeated code number.
30, regularity: the finger part is arranged the rule of dividing the key position.Some the rules that the deviser announces directly can be quantized on each parts, be formed " the rule degree " of these parts.
31, hamony: refer to principle and test method, the key position load distribution situation that calculates according to ergonomics and biomechanics.
32, key position load: when referring to certain input method input Chinese character, each key position stroke is shared percent value in total stroke.
33, static load: when referring to each one of whole 6763 Chinese character among certain input method input GB-2312.80, each key position or a certain parts are hit number of times and are accounted for the percent value of total stroke.
34, dynamic load: when referring to the related whole Chinese character of certain input method input " Chinese character frequency table ", each key position or a certain parts stroke shared percent value in total stroke.
35, key position load diagram: refer to each key position load number percent is marked on the keyboard layout and a chart that facilitates consultation, analyzes, contrasts that forms.
The present invention keeps the following design content of continuing to use " the Five-stroke Method " prior art basically:
(1) 5 kind of stroke: horizontal (one), perpendicular (Shu), cast aside (Pie), point (Dian, _), folding (second) and corresponding digital code 1,2,3,4,5 thereof.
(2) use the keyboard that 25 enter keies are arranged, this keyboard both can be any one keyboard special that contains 25 character keys positions, also can be the western language keyboard of a standard.
(3) 25 keys are divided into 5 districts, 5 positions, every district according to 5 kinds of strokes.
(4) key name, become radical and get ", two, threes', end " four yards word coding method rule at most.
The method for designing of (5) repeated code prompting, tolerant code input.
(6) simple type " five strokes " input method.
Below introduce in detail, after the deficiency of the present invention aspect solving five of former scheme and the defective and these 95 editions new the Five-stroke Method technical schemes of establishment, i.e. GWB scheme breakthroughly.
At first, introduce the deficiency of following five aspects of former scheme existence:
One, former scheme " radical system " lacks standard, does not meet in the education of middle and primary schools standardization requirement about Chinese Character source and character structure rule
1, some nonstandard " radicals " " have been made " in the former scheme certainly.
Have in the former scheme "
Figure A9510593100161
_, _,
Figure A9510593100162
" wait " self-made characters root "." self-made characters root " destroyed the globality of traditional radical and stroke structure.That these " are made " certainly, as to be specifically designed to the computer input " parts ", though in radical merger, minimizing repeated code, played certain effect, but after all with the word source, greatly differ from each other with traditional cognitive custom and teaching norm, therefore thereby be not easy to be accepted by the user, even some expert criticizes and says " the Five-stroke Method has polluted the Chinese character environment ".
2, radical " end " with "
Figure A9510593100163
", " sheep, " all be same word source with " _ ", but in the prior art or the homology root divides at Liang Chu, or splitted into several sections inequality fully.
3, " radical " is unclear, lack of standardization with " non-radical " boundary.Because of the radical system had not been pressed the standardization confirmation request, often make the study user feel " seemingly resembling seemingly not elephant ", when splitting Chinese character dare not " following cutter ", produce " ambiguity " even " polysemy ", seriously influenced accuracy that Chinese character splits and beginner's study schedule.
As: radical " " in reality splits, all tear works " seven " open, but in etymon list, do not indicate " seven " also represent simultaneously "
Figure A9510593100166
"
For another example: the radical mouth with " in fact also represented "
Figure A9510593100167
, with ", but fail in the etymon list to indicate.
Two, former scheme " coding scheme " does not meet in traditional habit and the education of middle and primary schools standardization requirement about Chinese character teaching
1, the radical of a collection of Chinese character " fractionation order " does not meet correct sequential write standard.
As: worn-out: Ha Jiong The-Fan (should be by standard: _ Jiong eight The-Fan)
Swallow: twenty
Figure A95105931001610
Mouth Xiangxi (should be by standard: twenty mouthful
Figure A95105931001611
Xiangxi)
The container made of bamboo, wicker, ratten, etc.: _ White youngster (should be by standard: _ white Youngster)
2, the radical of a collection of Chinese character " split result " does not meet the structure word standard of Chinese character.
As: bundle: flatly (should be by standard: the wood mouth)
From: civilian Qian Jiong Si (should be: Tou Qe Qian Si) by standard
Chu: two _ (should be by standard: a fourth)
Resemble: _
Figure A9510593100172
(should be by standard: _ mouthful )
3, " the fractionation stroke order " of a collection of Chinese character do not meet in the middle and primary schools teaching code requirement about stroke order.
As: side: Dian one Pie (should be: Dian one Pie)
Become: a Pie
Figure A9510593100177
Figure A9510593100178
Dian Pie (should be: a Pie
Figure A95105931001710
Pie Dian)
Not: _ One Pie (should be: _
Figure A95105931001712
Pie one)
Three, former scheme " keyboard Designing " is not because the existence of many " making certainly " radical meets Chinese-character canonical
Individual more than 10 owing to having occurred on the keyboard "
Figure A95105931001713
_,
Figure A95105931001714
_ " etc. " making certainly " symbol, the keyboard Designing that makes former scheme is unfavorable for correct cognition and the use of students in middle and primary schools to Chinese character during as " spell shape group word " input, also is an impediment to universal Chinese character entering technique in educational system naturally.
Four, former scheme " keyboard Designing " is needed improvement badly aspect ergonomics, and the finger load distribution of each key position is more reasonable, keystroke harmony more so that make
Because the inventor only did research qualitatively and had not calculated static state and the dynamic load of respectively arranging key, each finger quantitatively simultaneously for the key position design of radical in history, thereby occurred not matching, inharmonic phenomenon, make former scheme when the input Chinese character, the load distribution of each finger of upper, middle and lower three row's keys and every row's key is unreasonable, makes the operator be easy to fatigue and influences input efficiency.
In addition, because it is unreasonable that radical is chosen, not such as common components such as " mother ", " family names " as integeral part (promptly will take apart), also make former scheme at the more suitable Chinese characters in common use of input, during as " every, sea, plum, paper, the end, low " etc., the phenomenon of " not smoothly " even " other hands " occurs, thereby influence speed, error code rate is raise.
Five, the major defect of former scheme must be imparted knowledge to students to Chinese-character canonical, standard input produces harmful effect
In sum, existing " the Five-stroke Method " chosen at radical, existed more serious deficiency and defective aspect coding rule, coding scheme and the keyboard Designing.Require to compare with standardization, in 6763 Chinese characters of national standard, relate to the Chinese character more than 40% altogether, this is major part or everyday character wherein.If former scheme is continued to promote, and untimely improvement, innovation, particularly after " the Five-stroke Method " enters school and family on a large scale, will inevitably produce and have a strong impact on:
1, be unfavorable for the standard teaching of Chinese character in the middle and primary schools, particularly after computer was popularized, popularizing of former scheme can influence students in middle and primary schools' correctly writing Chinese character conversely.
2, be unfavorable for carrying out of relevant document that State Language Work Committee, State Education Commission are issued with regard to the spoken and written languages standard.
3, not consistent with the requirement of text structure tradition and Chinese character teaching normization because of radical system, fractionation rule, the split result of former scheme, " the Five-stroke Method " just can not on a large scale, socialization enter school and family.
4, be unfavorable for further improving the efficient and the quality of Chinese character input.
5, be unfavorable for realizing from " writing " transition to " writing " this ways of writing with computer with pen.
6, be unfavorable for that China's Chinese character shape code input technology heads for unification.
The objective of the invention is to overcome above cited whole deficiencies and the defective of " the Five-stroke Method " prior art, with component set, parts system, dicode parts and the coded input method thereof of redesign, contain the dicode parts " the root preface is preferential, normative stroke order, as far as possible get big, take into account directly perceived " disassembled coding principle and input keyboard thereof formed one jointly and met Chinese character teaching norm, more science, more practical, the Chinese character input system being convenient in middle and primary schools, apply, the present invention is once omnibearing development and the innovation based on " the Five-stroke Method " prior art.
Below in conjunction with Figure of description, technical scheme of the present invention is introduced in detail, will fundamentally understand characteristics of the present invention, advantage and great advance meaning by following description:
Accompanying drawing 1:GWB parts system
Accompanying drawing 2:GWB parts system is compared with the prior art the variation complete list
The radical that accompanying drawing 3:GWB reduces from existing the Five-stroke Method radical system
Parts and key of living in position thereof that the more existing the Five-stroke Method radical of accompanying drawing 4:GWB system is newly-increased
" the dicode parts " of accompanying drawing 5:GWB and key position thereof distribute
The homology that accompanying drawing 6:GWB determines, likeness in form, deformation component and key position thereof distribute
Accompanying drawing 7:GWB traditional font parts and key position thereof distribute
Accompanying drawing 8:GWB basic stroke and different shape thereof
Accompanying drawing 9:GWB coding rule and coding process flow diagram
Accompanying drawing 10:GWB keyboard (using Qwerty keyboard)
Accompanying drawing 11:GWB keyboard (special-purpose Chinese keyboard)
Accompanying drawing 12:GWB component layouts " rule degree "
Accompanying drawing 13: former scheme radical layout " rule degree "
The static frequency table of accompanying drawing 14:GWB parts
Accompanying drawing 15:GWB part classification table
The parts sequence chart of accompanying drawing 16:GWB master unit beginning
Accompanying drawing 17:GWB indexing unit
Accompanying drawing 18: the GWB input keyboard within the big or middle keyboard
Accompanying drawing 19:GWB5 * 5 microminiature input keyboards
The information handling system of accompanying drawing 20:GWB
Radical, the cancellation of the present invention by reducing former scheme do not meet " making certainly " radical of Chinese-character canonical, as far as possible adopt traditional radicals by which characters are arranged in traditional Chinese dictionaries, design dicode parts, increase measures such as parts, redesign parts position and refinement stroke form, formed one meet Chinese-character canonical be used to encode and the parts system imported (Fig. 1, Fig. 2).
Fig. 1 shows the parts system that the present invention is brand-new;
The detailed fundamental difference that has been listed as parts system of the present invention and prior art of Fig. 2.
(1) the present invention is directed to Chinese Character Modern standard font and design, in order to make the preferred of parts and to determine to meet the structure law of Chinese character and the requirement of standard teaching, special according to requirement and the famous philology expertise of State Language Work Committee to standardization of Chinese characters, with following several principles as theoretical direction:
1. word source analytic approach: analyze through Hanzi component being carried out the word source, according to the method for generating Chinese character of Chinese character choose belong to same word source with master unit parts as " same source block ", and will tightly be listed in after the master unit with source block, as:
" Xin ", "
Figure A9510593100191
" and " heart " homology, selected, and after being listed in " heart ";
"
Figure A9510593100192
", "
Figure A9510593100193
" and " ending " homology, selected, and be listed in " ending " afterwards;
"
Figure A9510593100194
" and " rice " homology, selected, and be listed in " rice " afterwards;
Choose the parts that meet Chinese character word-building rule and teaching norm according to this.
2. grapheme analytic approach: the grapheme analytic approach refers to the word-building law of Chinese character, if choosing fully according to word source analytic approach of parts then may not be applicable to the understanding of modern Chinese character font.As " grain husk " word, from " standing grain ", " just " sound should be divided into " standing grain " and " just " by word source method, presses the grapheme analytic approach, then should be divided into 4 parts such as " an ancient type of spoon standing grain _ shellfishes ".
3. integrated use principle: traditional radicals by which characters are arranged in traditional Chinese dictionaries are selected in choosing of parts as far as possible for use, when traditional radicals by which characters are arranged in traditional Chinese dictionaries are put in order the word parts as not going into to elect as because of the restriction of key position, can not split into the part of several " making certainly ", should look after standardization, and reasonably design is encoded again.
4. be accustomed to principle of engagement: single character " really " fails to press the word source into electing parts as, should split into " Tian Mu " two parts, but by " stroke can not cut off " this custom, " really " can only split into " day wood " two parts, is sanctified by usage.
5. system stipulates: because the restriction of space encoder and the consideration of parts compatibility, the present invention has made the regulation of native system uniqueness to parts: as the design of " dicode parts " and the regulation of adding " identification code ", both guaranteed the standardization of parts, reasonably designed key position and coding again, made system obtain breakthrough aspect " keypad, standardization ".
(2) according to the theoretical direction in (1), the present invention has cancelled following 16 radicals (Fig. 3) of " making certainly " the word-building ability difference and that do not meet Chinese-character canonical in the former scheme:
Jian, shoot a retrievable arrow,
Figure A9510593100202
_, _,
Figure A9510593100206
_,
Part among them is decomposed into parts or stroke respectively in the following manner in the present invention, as:
Jian: a dagger-axe, shoot a retrievable arrow:
Figure A9510593100209
Dian,
Figure A95105931002010
: the Tou mouth
Figure A95105931002011
:
Figure A95105931002012
Dian,
Figure A95105931002013
: Network
(3) according to the theoretical direction of (1), the present invention on the basis of existing the Five-stroke Method radical newly-increased following 25 whole words or traditional radicals by which characters are arranged in traditional Chinese dictionaries as parts (Fig. 4):
_, penta, insect without feet or legs, Cui, family name, mother, gas, slit bamboo or chopped wood, or not fork-like farm tool used in ancient China, the tenth of the twelve Earthly Branches, leather, skin, boat, Niu, Shi, Quan, fish, sheep,
Figure A95105931002015
_, Woo, Yi, blunt,
Figure A95105931002016
After these parts are put into each key vast and numerous calculating is carried out in repeated code, the isoparametric influence of hamony, the present invention designs above-listed preceding 8 parts respectively on following key position:
_---21, penta---13, insect without feet or legs---33, Cui---34, the family name---33,
Female---55, gas---32, slit bamboo or chopped wood---42
(4) according to the theoretical direction of (1), the present invention newly designed with the critical piece homology or likeness in form following 35 parts distortion or that be convenient to associative memory, and be arranged in respectively on the key position shown below (Fig. 6):
????????????????????——12??????
Figure A95105931002018
??????????????——13
Xi---14
Figure A95105931002019
---21
Figure A95105931002020
European-allies---15 days
Figure A95105931002021
---22
Figure A95105931002022
????????????????????——24??????? ??????????????——25
Figure A95105931002024
Month
Figure A95105931002025
With---33 _
Figure A95105931002026
---35
Figure A9510593100211
_
Figure A9510593100212
Shui---43
Figure A9510593100213
---44
Slit bamboo or chopped wood---42 _
Figure A9510593100214
---53
ユ???????????????——51
Figure A9510593100216
An ancient type of spoon seven---55
(5) be used for 11 of the parts of complex form of Chinese characters coding below the present invention has increased newly, when not using these parts, the present invention just becomes the technology of only handling simplified Chinese character; When not using corresponding with it simplified parts when using these parts, the present invention just becomes the technology of only handling the complex form of Chinese characters; When simplified, when the traditional font parts use simultaneously, the present invention just becomes and not only can handle the traditional font but also can handle simplified technology (Fig. 7) simultaneously.
Figure A9510593100217
Trucks Tony Bird
Figure A9510593100219
Fish Yan Door Ma Si
(6) according to the theoretical direction of (1), the present invention easily learns for the fractionation that makes Chinese character is directly perceived, different shape during single picture fractionation that also standard is clear and definite, and in code book, mark intuitively, so that students in middle and primary schools split Chinese character and learn to contrast when coding is imported and use (Fig. 8).
One:
Shu: 亅
Pie:
Figure A95105931002111
_: Dian
Second:
Figure A95105931002113
_
Figure A95105931002114
Ya
Figure A95105931002115
Figure A95105931002116
Figure A95105931002117
_ Yin _
Figure A95105931002121
Figure A95105931002122
(7) the present invention changes the key name of 55 (X) key into " one " by " Si ", changes the key name " " of 51 (N) into " own ", so that appellation and memory.
(8) choose the requirement that meets educational standards according to the theoretical direction and the parts of (1), the parts that word-building ability is low among the present invention, newly-increased traditional radicals by which characters are arranged in traditional Chinese dictionaries, dicode parts and homology, likeness in form, deformation component, except can also in teaching and practical process, doing increase and decrease by a small margin, between whole district and the whole position, can also exchange as integral body in case of necessity.
2. the present invention has creatively proposed the design of " dicode parts ", this design is to solve the whole word parts technological breakthrough of this conspicuous contradiction too much in the keypad input scheme: both guaranteed the integrality of traditional radicals by which characters are arranged in traditional Chinese dictionaries in the parts summary table, reasonably distributed space encoder, improved the design of key face again significantly, make it compliant, practical more.
The present invention cancelled in the current programme "
Figure A9510593100221
_, _, _,
Figure A9510593100224
" wait non-traditional radicals by which characters are arranged in traditional Chinese dictionaries, and with traditional radicals by which characters are arranged in traditional Chinese dictionaries and whole word bundle, the tenth of the twelve Earthly Branches, leather, skin, boat, Niu, Shi, Quan, Yu, Fish, sheep, _, Woo, Yi, blunt,
Figure A9510593100226
Replace Deng " dicode parts ".In order to make the arrangement of these parts on keyboard not only meet regular requirement, allocated code space reasonably again, be unlikely because of these parts on keyboard because of enjoy a key, a sign indicating number " is lorded it over a district ", thereby cause rolling up repeated code, the present invention adopts computer Aided Design, through big quantitative statistics, analysis, test, research, calculating and test repeatedly relatively, determine:
18 traditional radicals by which characters are arranged in traditional Chinese dictionaries such as " leather, Woo, Yi, skin, Quan, Niu " are defined as " dicode parts " (Fig. 5) by its " brush-pen dipped in red ink stroke feature " or " architectural feature of an end stroke " respectively, that is:
The number of " dicode parts " is the amount doesn't matter among the present invention.The present invention proposes and the meaning of design " dicode parts " is:
1. make parts and traditional radicals by which characters are arranged in traditional Chinese dictionaries be consistent, be kept perfectly and do not split, and meet the text structure tradition, not to students in middle and primary schools increase be not familiar with, nonstandard stroke structure, thereby be convenient to study and use, shorten the training time effectively.
2. make keyboard Designing rationally simple and clear more, meet structure word standard, the keystroke of divining by means of characters is consistent with the traditional habit of writing of reading, easy to operate, raise the efficiency.
In addition, according to the method for designing of the present invention about " dicode parts ", in the simplified Chinese character scope in order further to reduce repeated code, the number of " dicode radical " in fact also can increase, and can further will occupy Chinese character topological graph left part and top, group word more (as surpassing 25 Chinese characters), cause that the too many tradition of repeated code " radicals by which characters are arranged in traditional Chinese dictionaries " also is defined as " dicode parts ".For example, can be " dicode parts " with following " radicals by which characters are arranged in traditional Chinese dictionaries " expansion design again according to the method for " dicode parts ":
The dicode parts Fiber crops Deer Black Nose Wind
The position dicode 41 14 ?41 ?55 ?24 ?44 31 15 ?25 ?32
The letter dicode YS ?YX ?LO TA ?MR
According to the present invention, when handling the complex form of Chinese characters, when particularly handling among the CJK10646 Chinese character of 20902 Chinese characters or more big collection, the number of " dicode parts " can also increase again:
The dicode parts Cutter Lice Fish Horse
The position dicode 21 25 25 22 35 44 ?54 ?44
The letter dicode HM MJ QO ?CO
According to the present invention, two sign indicating numbers of " dicode parts " when reducing repeated code and reasonable distribution key position load and become principal contradiction, are considered under the situation of convenient memory simultaneously, and the determining of second sign indicating number under the prerequisite of the form of a stroke or a combination of strokes feature of foundation parts, can compare flexibly.As two sign indicating numbers of " boat ", both can be " 31,33 " (TE), also can be " 31,41 " (TY); Two sign indicating numbers of " sheep " both can be " 42,13 " (UD), also can be " 42,21 " (UH).The former is that the latter then is according to end stroke according to the form of a stroke or a combination of strokes of the stroke group at a place, end.
The dicode parts that the present invention created are new technical schemes, and it is different from " many code words root " in the prior art " Zheng's sign indicating number " fully:
" dicode " of the present invention compares with " many yards " of Zheng's sign indicating number
Item compared " dicode " of the present invention " many yards " of Zheng's sign indicating number Conclusion
The code length of many yards roots 2 sign indicating numbers 2-3 sign indicating number Different
Determine second yard foundation According to an end stroke or an end form of a stroke or a combination of strokes According to an end font or any Different
Determine the foundation of trigram (no third sign indicating number) Arbitrariness Different
The dicode number of components About 18 About 150 Different
3 yards number of components 0 13 Different
The key position is relatively corresponding Example: Woo---PY boat---TE fork-like farm tool used in ancient China---DI Example: Woo---WS boat---PY fork-like farm tool used in ancient China---CK Different
The example code length relatively Very little------D crust---C first---L of F factory Very little------GG crust---YIA first---KIB of DS factory Different
Affiliated coding scheme The GWB coding scheme Zheng's sign indicating number coding scheme Different
" dicode parts " of the present invention when using, on keyboard or in the instructions all the form with complete radical represent, particularly for the learner of the present invention of study use at the very start.
Yet, used the people of former scheme for study, sometimes in order to make former scheme be connected transition as early as possible with the present invention, perhaps convenient for the individual who looks after a part of user, all " dicode parts " of the present invention, still can be torn open by " by force " and make two two parts that include " parts " lack of standardization, and respectively by first yard in its " dicode " with second yard with these two portion arranged on corresponding position (key position), as:
Generally speaking, the coding of the non-standard parts that occur in " by force " split result and key position there is no different with using " dicode parts ".But sometimes in order more reasonably to distribute key position load, or the part of more reasonably second sign indicating number design in the dicode more " not crowded " in whole space encoder, specify a coding also can for non-standard parts " artificially ", just be equivalent to the non-standard parts are designed artificially on certain key position.And according to the needs of space encoder reasonable distribution and parts compatibility, dicode parts can split into 2 kinds even 3 kinds of results according to shape " by force " sometimes artificially, and as above shown in " no " and " boat " in the table, this situation is " system's agreement ".
Like this, just formed " transitional component set ", " transitional coding scheme ", " transitional parts summary table " and a keyboard thereof of being convenient to carry out the transition to " standard fully " from " not exclusively standard ".For ease of carrying out the transition to the present invention from former scheme, from practical angle, this point is also permitted for the present invention.
According to the present invention,, when 4 sign indicating numbers of " true form sequence " less than, should add last stroke character patten identification code (not listing in the following table) in that Chinese character is split extraction " addressable part " afterwards
A, " addressable part " are 2 o'clock:
First parts Second parts Input code
??A ??B ?AB
??BX ?ABX
??AW ??B ?AWB
??BX ?AWBX
B, " addressable part " are 3 o'clock:
First parts Second parts The 3rd parts Input code
??A ??B ??C ?ABC
??CY ?ABCY
??BX ??C ?ABXC
??CY ?ABXY
??AW ??B ??C ?AWBC
??CY ?AWBY
??BX ??C ?AWBC
??CY ?AWBY
C, " addressable part " are 4 o'clock:
First parts Second parts The 3rd parts The most last parts Input code 1 Input code 2
??A ????B ????C ??D ?ABCD ?ABCD
??DZ ?ABCZ
????CY ??D ?ABCD
??DZ ?ABCZ
????BX ????C ??D ?ABXD
??DZ ?ABXZ
????CY ??D ?ABXD
??DZ ?ABXZ
??AW ????B ????C ??D ?AWBD
??DZ ?AWBZ
????CY ??D ?AWBD
??DZ ?AWBZ
????BX ????C ??D ?AWBD
??DZ ?AWBZ
????CY ??D ?AWBD
??DZ ?AWBZ
Input code 2 is that 4 addressable parts are respectively got first yard, and promptly no matter the dicode parts have severally, get ABCD without exception.The advantage of this a kind of code taking method is discrete effective to repeated code, and repeated code is few, and shortcoming is concerning " true form sequence ", is not the order code fetch.So, more than two kinds of code fetch methods when using, only select that it is a kind of.When handling the CJK10646 large character set, be to reduce repeated code, with second method for well.This method also can be used for the situation of three addressable parts sometimes.
Coding rule and dicode parts input method with these dicode parts that form;
3. the present invention is according to normalized requirement, creatively propose " the root preface is preferential, normative stroke order, as far as possible get big, take into account directly perceived " general provisions that split as coding, Chinese character input and Chinese character teaching norm, normalized written are taken into account, with above coding rule, formed new coding scheme (Fig. 9) about the dicode parts.
The stroke component parts, parts constitute Chinese character, and Chinese character constitutes word.Stroke, parts, whole word are that Chinese written language is when the input computer " three levels ".
The present invention is referred to as " parts " with selected word-building unit.Be actually " part " that constitute Chinese character.
The present invention to total standardization guiding theory that Chinese character splits is: Chinese character splits into parts, and parts split into single and draw.
During writing Chinese characters, what people paid attention to is that the order of stroke---order of strokes observed in calligraphy---is a sequential write.
During key feeding character,---root preface---i.e. key entry order that is the order of radical that people pay attention to.
In most of the cases, " order of strokes observed in calligraphy " is on all four in Chinese character with " root preface ".
As: " tree ", " always ", " speech ", " rash " etc., their radical (parts) order is on all four with its stroke order.
Yet when " an encirclement type " reach " intussusception type " Chinese character and split into a string " parts sequence ", " order of strokes observed in calligraphy " just usually conflicted with " root preface ".
As " state ":
The order of strokes observed in calligraphy is: Shu one Dian one of Shu _ one by one
The root preface is: mouthful king Dian
Wherein, last stroke " horizontal stroke " of " state ", by radical " " " band " to first parts.
" bundle " for another example:
The order of strokes observed in calligraphy is: an a Shu _ Shu Pie _
The root preface is: the wood mouth
Wherein, instead the end pen of the 4th " mouth " write as, and has been arrived in first parts by " wood " " band " in " Pie _ " that " mouth " write afterwards second parts by parts " mouth " " band ".
Will be when splitting input according to the root preface, and emphasize normative stroke order in the Chinese character teaching, the two contradiction, in existing font code encoding scheme, comprise in " the Five-stroke Method " prior art, never have the unified rule that to deal with all situations, thereby make the fractionation ubiquity arbitrariness, polysemy, both can't with the teaching " integrating with ", can not improve encoding quality again.
In order to solve above contradiction, the present invention proposes " the root preface is preferential, and normative stroke order is got greatly, takes into account directly perceived " as far as possible and splits general provisions as one.Promptly when conflicting, preferentially press " root preface " code fetch in " root preface " and " order of strokes observed in calligraphy ", when needs splitted into the single picture, then the sequential write in strict accordance with standard carried out.
When Chinese character is split into parts, split into maximum known elements as far as possible; As not forming known maximum part, can split into less known elements; As not forming less parts, then split into single and draw, this promptly " gets big " as far as possible.
When a single character is splitted into grapheme or parts, should both can split into as " certainly " with the parts intuitive that splits out well for preferably tearing method open " _ three ", also can split into " Pie order ", but latter's intuitive is good, so the present invention gets the latter, this promptly " takes into account directly perceived ".
The meaning of this regulation is: can make the fractionation of parts meet the standard and the tradition of Chinese character word-building.Can make the fractionation of stroke meet teaching norm again, be convenient to Chinese character entering technique and enter among middle and primary schools' Chinese character teaching.
According to " the root preface is preferential, normative stroke order, as far as possible get big, take into account directly perceived " the fractionation general provisions, the fractionation code fetch input rule of word of the present invention, speech is as follows:
1. key name is encoded and input method:
According to the present invention, on each key position in one group of parts, optional representational parts are as the key name of this key position, and key name both can have been continued to use in existing the Five-stroke Method technology this key connected and make a call to four times and import the key name Chinese character,
As: standing grain: standing grain standing grain standing grain standing grain, (31 31 31 31, TTTT)
Also can be not according to plaing four times inputs, and with key name as " character formation component " input,
As: standing grain: standing grain Pie one _, (31 31 11 41, TTGY)
2. solid size component coding and input method:
According to the present invention, the solid size parts of non-dicode parts in the character formation component, its input method is:
Component region bit code+the first sum of single sign indicating number+inferior single sign indicating number+end single sign indicating number.
If by 4 sign indicating numbers of above input less than, then will be at front and back complement space bar to finish input.
According to the present invention, the solid size parts can also adopt to split into and get certain several stroke wherein after single is drawn and design input code for it according to the distribute way of definition or semidefinite justice of space encoder.
3. dicode component coding and input method
According to the present invention, the coding of its dicode parts and input method have 4 kinds can distinguish use separately or use wherein several coding input modes simultaneously.
A) directly with 2 sign indicating numbers of dicode parts as input code:
As: leather: 15 12 (AF)
B) first yard in the dicode+second yard+the first sum of single+end single
As: leather: 15 12 11 21 (AFGH)
C) first yard in the dicode+the first sum of single+inferior single+end single
As: leather: 15 11 21 21 (AGHH)
D) at two sign indicating number front prefixing sign indicating numbers of dicode parts, or the back adds suffix code, or prefixing sign indicating number and suffix code form the input code input simultaneously in front and back
As: the dicode of " leather " is 15 12 (AF)
Prefixing: leather: 24 15 12 (LAP)
Add suffix: leather: 15 12 24 (AFL)
While prefixing and suffix code: leather: 24 15 12 24 (LAFL)
Being used for that sign indicating number (letter) of prefix or suffix, can be the vacant code in the space encoder, promptly be unlikely because of add produce repeated code 11~55 (any one sign indicating number among the G~X), prefix code, suffix code also can be same yard.
E) split into the single that is no more than 4 and draw, during 4 of less thaies, add the space bar input
As: leather: a Shu Shu Shu
4. single-character splitting is encoded and input method:
According to the present invention, its disassembled coding and input flow process simplified and unsimplified Hanzi are seen Figure of description 9.
5. word is encoded and input method:
According to the present invention, its word coding and input method that includes the dicode parts is:
A, two-character word: preceding 2 sign indicating numbers of its individual character all-key got in every word, and totally 4 yards,
As: economy: Si ス Rui literary composition (55 54 43 41, XCIY)
B, three words: first yard of its individual character all-key respectively got in first word, second word, add preceding two yards of the 3rd word all-key again, and totally 4 yards,
As computing machine: Yan _ wood several (41 31 14 32, YTSR)
C, four words: first yard of its all-key respectively got in four words, and totally 4 yards,
As: science and technology: standing grain _ Rolling wood (31 43 32 14, TIRS)
D, multi-character words: get first sign indicating number of the all-key of first, second and third and the last word, totally 4 yards,
As: the People's Republic of China (PRC): mouthful Ren population (23 34 34 24, KWWL)
6. single is drawn input method:
According to the present invention, the coding of 5 kinds of single pictures and input method are the key at place even to be made a call to add 2 definitions for twice again:
One: 11 11 24 24 (GGLL)
Shu: 21 21 24 24 (HHLL)
Pie: 31 31 24 24 (TTLL)
Second: 51 51 24 24 (NNLL)
Wherein definitions can be 24 (L) other codings in addition, and can be one also can be 2-3;
7. last pen intersection identification code:
According to the present invention, its " last stroke character patten identification code " can all directly continue to use " 5 kinds of end pen * 3 kinds of fonts " 15 identification codes of meter in existing the Five-stroke Method technology, also can further be classified as a kind ofly, be about to font and further be reduced to " left right model " and " non-left right model " 2 kinds going up mo(u)ld bottom half and heterozygous." last stroke character patten identification code " then has following 10 kinds at this moment:
Left right model: 11 (one, G), 21 (Shu, H), 31 (Pie, T), 41 (_, Y), 51 (second, N) non-left right model: 12 (two, F), 22 (||, J), 32 (
Figure A9510593100291
, R), 42 ( , U), 52 (ㄍ, B)
In addition,, be further simplification " last stroke character patten identification code " according to the present invention,
Can only 1 type-word (left right model) be added identification code in a manner described, but not 1 type-word does not add " identification code ", at this moment, identification code has only 5 kinds:
11 (one, G), 21 (Shu, H), 31 (Pie, T), 41 (_, Y), 51 (second, N).
For the word of 4 sign indicating numbers of less than, this point just equals actually: an end stroke added in the word of all 1 types (left right model), but not its component code then only imported in the word of 1 type (left right model).This situation is the embodiment that simplifies most of " identification code " this creation.
8. high frequency word:
High frequency word on each key position of the present invention both can keep former scheme, also can be selected again according to the frequency of Chinese character.
4. the present invention is according to the parts system and the ergonomics principle that meet Chinese-character canonical, adopt the machine aided means, the word construction frequency of each parts of quantitative Analysis parts system, the practical frequency of each parts, rate of dynamic coincident code, the static load of the static repetition rate of coding and each key position, dynamic load, the method that utilization adjustment component key position and fractionation rule combine, make parts hold the compatibility of a key altogether, the regularity that parts key position distributes, three targets such as the hamony of pointing during keystroke, it is unified to have reached harmony on higher level, formed and met Chinese-character canonical, science, practical input keyboard (Figure 10, Figure 11).
As everyone knows, the design of computer input method for Chinese character and keyboard thereof, so it claims difficulty throughout the world, lid is a brand-new interdisciplinary science that relates to spoken and written languages, information theory, computer science and ergonomics because of it.Have only and all multidisciplinary theories are used simultaneously and created, just might create Chinese character entering technique really scientific, practicability.
After the Five-stroke Method comes out, the inventor to oneself 1978---nineteen ninety-five reaches theory study and the research practice in 16 years and is summed up, creatively proposed meeting under the situation of Chinese-character canonical, guarantee " font code designs three principles " that a font code design has science, practicality, that is:
A. compatibility principle:
When referring to that several parts hold a key altogether, to the influence of the repetition rate of coding, promptly to the coding " uniqueness " influence.Compatibility is good more, and the repeated code that causes is few more.In the research process, " compatibility " usually directly represented with the number of words that is quantified as repeated code;
B. regular principle:
The learnability of finger part alignment placement on keyboard.The alignment placement of better regularity is convenient to memory, grasps easily.In general, regularity is one can be known from experience but very difficult the quantification, thereby is difficult to the soft quota of evaluation.Here, the inventor has created a kind of method, and " regularity " that the parts of deviser's proposition are soon arranged on keyboard is quantified as the numeral of representing relative value.This in the present invention quantized value is:
The arrangement of parts rule Condition Quantized value
1. parts location item rule ??(1) Area code is identical with the first sum of symbol ????4
??(2) Item is identical with a time code name ????4
??(3) Item is identical with an end code name ????2
2. the area code rule of single and compound stroke member Item is identical with the stroke number ????4
3. other associative memory method of parts and and alphabetical getting in touch ??(1) With coordination master unit homology Same parts
??(2) Be similar to the coordination master unit ????2
??(3) It is the distortion of coordination master unit ????2
??(4) The letter of place key is relevant with the parts pronunciation ????2
??(5) The letter of place key and parts plesiomorphism ????2
In view of the above, can calculate " the rule degree " of any one parts." the rule degree " of dicode parts can be by " rule degree " addition of two yard calculating of averaging.
The static state of parts of the present invention " rule degree " mean value is G=7.42 (Figure 12)
The static state of former scheme radical " rule degree " mean value is G=7.26 (Figure 13)
This shows that the present invention obviously improves than the regular of former scheme parts (radical) layout.
" the rule degree " of parts as an important theory index, for a font code design, is the very important data of weighing its study complexity, estimating its quality.A kind of pursuit that project setting is optimized, the regularity of arrangement of parts is improved, and " rule degree " this value is increased.
" rule degree " can be divided into static and dynamic two values.Quiescent value is the rule degree of each parts of only considering that the word construction frequency of parts calculates; Dynamic value then is the rule degree that the quiescent value weighted calculation is gone out according to " Chinese character frequency table ".
C. hamony principle
The correlativity and the hamony of each finger movement when the load distribution of finger finger keystroke, keystroke.
The research of hamony is very complicated an ergonomics and a biomechanics problem.No matter be font code or the design of sound sign indicating number, the research of hamony and application all can produce influence great, essence for input efficiency.
As everyone knows, beat English alphabetic keypad, proved the very low but design that can not change because of " the wood is already made into a boat " of efficient recently.In view of Hanzi input keyboard also not typing at present, so design efficiency height at the very start, promptly the good keyboard of hamony just is significant even far-reaching historical meaning.
In order to make keyboard Designing of the present invention be issued to good hamony in scientific guidance, the present invention reaches a conclusion by a large amount of statistical research and according to the test figure of biomechanics:
1. the single finger of same hand knocked the movement space average out to 0.09 second (refer to together double hit);
2. the different finger tapping movement space of same hand are 0.03 second (hitting with the different finger wheel of hand), and average stroke is singlehanded when singly referring to 3 times;
3. knock be spaced apart 0.02 second (left and right wheels is hit) of motion between the finger of different hands, average stroke is singlehanded when singly referring to 4.5 times.
And, measure the frequency (per minute number of times) as shown in the table that each finger of people knocks continuously by thousands of person-times experiment:
Finger Left hand The right hand
Forefinger ????400 ????420
Middle finger ????360 ????380
Four refer to ????330 ????360
Little finger of toe ????280 ????300
This studies show that the input keyboard that hamony is good, efficient is high should be accomplished:
1. give full play to the function of forefinger, middle finger;
2. to alleviate the load of both hands little finger of toe;
3. avoid one hand singly to refer to keystroke as far as possible;
4. parts fractionation and code Design will realize right-hand man's alternate key stroke as far as possible.
" three principles " is an interactive multiple goal of three.When emphasizing some targets, other two just can be weakened.As the lay special stress on regularity, promptly pay attention to learnability especially, then repeated code must increase, and hamony is inevitable destroyed.Therefore can under the situation of three's overall coordination, same component set be designed many different embodiment of pros and cons choice come.
According to about requirement and above-mentioned " font code designs three principles " of standardization of Chinese characters, the keyboard Designing that the present invention meets Chinese-character canonical is carried out according to following method and is had following characteristics:
(1) adopt the parts system that meets Chinese-character canonical to form the key face layout that meets Chinese-character canonical:
1. eliminated " making certainly " parts "
Figure A9510593100321
_, _,
Figure A9510593100322
_,
Figure A9510593100323
Figure A9510593100325
_,
Figure A9510593100327
" wait the non-standard stroke structure.
2. increased traditional radicals by which characters are arranged in traditional Chinese dictionaries " not, fork-like farm tool used in ancient China, the tenth of the twelve Earthly Branches, leather, skin, boat, Niu, Shi, Quan, Yu, Fish, sheep, _, Woo, Yi, blunt,
Figure A9510593100329
" wait as " dicode radical ".
3. traditional radicals by which characters are arranged in traditional Chinese dictionaries " _, penta, family name, insect without feet or legs, mother, Cui, gas " etc. have been increased as " whole word parts ".
4. with same source block, deformation component or likeness in form parts " Slit bamboo or chopped wood, _, _, " wait design on keyboard.
5. the different shape that will roll over pen " second " is illustrated on the key position or on the keyboard.
Above measure not only makes the parts standardization, key face layout, keyboard Designing standardization, and average " the rule degree " of parts is improved.
According to normalized requirement and practical experience, the suitably increase and decrease of the traditional radicals by which characters are arranged in traditional Chinese dictionaries on the keyboard, dicode parts, same source block, likeness in form parts and deformation component.
To the influence of fingering hamony, design part also distributes its key position when (2) splitting input according to parts, and the hamony marked improvement of finger when making input helps alleviating the tired of keyboarder and improves input speed:
1. about " " and " _ "---alleviate the design of A key load
Do not have in the former scheme " _ ", when running into " _ " input, will be split as 2 parts:
_--- Seven (21 15, HA)
Therefore, all inputs contain the word of " _ ", must hit the A key with the left hand little finger of toe of " low energy ".The present invention has designed " _ " afterwards, and the load of left hand little finger of toe is reduced.And " _ " still be in 21 (H) key, and making the rule degree is 8 still, is not affected;
2. about " mother "---avoid down, in, last three keys of repelling and attacking
Do not have in the former scheme " mother ", run into " mother ", will split into 3 parts:
Female--- One
Figure A9510593100332
(55 11 42, XGU)
And " XGU " is respectively on three row's keys.So must hit three row's keys during input, not only make mistakes easily, and influence speed.
The present invention has improved load distribution after increasing " mother ", makes the input of " every, extra large, quick, numerous, plum " these everyday characters meet the requirement of hamony, begin to fight " smoothly ".
3. about " family name "---avoid the little finger of toe double hit, shift the little finger of toe load:
Do not have in the former scheme " family name ", the word of all containing " family name " will be torn open without exception and be 2 parts with same little finger of toe keystroke:
The family name---
Figure A9510593100333
Figure A9510593100334
(35 15, QA)
After the present invention increases whole word parts " family name ", obviously reduce the load of little finger of toe double hit QA, for words such as everyday characters " low, paper, the end, support, wedding, dusk ", the hamony of finger is greatly improved, thereby improve input speed, reduce wrong keystroke, reduce error code rate.
Though " family name " design is gone up because of " inferior pen does not meet item " makes " rule degree " reduce 4 in " 33 ", and this sacrifice has improved hamony greatly, thereby is worth.
Under the total guiding theory of the present invention, adopt identical method, can also adjust the key position of several parts again according to normalized requirement, and make hamony with compatibility, regular three's benefit-risk balance in do new choice and become new embodiment.As can also " Woo, Yi " being moved on on 44 (O) key, for alleviating A key load, " worker " moved on on 12 (F) key or the like in order to alleviate the load of 45 (P).
(3) fruitful ground of the present invention reasonable distribution the keystroke load of key position, make hamony that important advance be arranged
Obviously on the low side and influence the problem of fingering hamony and input speed in order to solve the average keystroke of 32 (R) keys and 44 (O) key load, to usage frequency and all quite high 3 parts commonly used of word construction frequency---a few, Qe, wide, method with machine aided, by the coding of 348 related words of these 3 parts is constituted, practical frequency and to key position load, the repetition rate of coding, regular parameters such as combined influence, calculate the contrast and the balance, the present invention with these 3 Component Design on following key position:
" several " are moved on to 32 (R) key from 25 (M) key;
Jiang “ Qe " move on to 32 (R) key from 35 (Q) key;
" extensively " moved on to 44 (O) key from 41 (Y) key;
Following form is represented the influence of above design to key position load, repeated code number and regularity quantitatively:
A few, Qe, wide key position design relative consistency, regularity, hamony influence contrast table
Parts Several Qe Extensively
Former scheme key position ??25????M ????35????Q ????41????Y
Key of the present invention position ??32????R ????32????R ????44????O
Static load % Former scheme ??M=2.08 ??R=3.19 ????Q=3.67 ????R=3.19 ????Y=9.39 ????O=0.87
The present invention ??M=1.80 ??R=3.47 ????Q=3.27 ????R=3.59 ????Y=9.13 ????O=1.12
Dynamic load % Former scheme ??M=2.61 ??R=3.88 ????Q=3.15 ????R=3.88 ????Y=5.22 ????O=1.34
The present invention ??M=2.24 ??R=4.24 ????Q=2.65 ????R=4.44 ????Y=5.02 ????O=1.68
Former scheme causes the repeated code number 5 couples of 1-2:2 1-2:3 2-2:0 12 couples of 1-1:2 1-2:9 2-2:1 16 couples of 1-1:6 1-2:8 2-2:2
The present invention causes the repeated code number 6 couples of 1-1:3 1-2:3 2-2:0 4 couples of 1-1:3 1-2:1 2-2:0 7 couples of 1-1:2 1-2:5 2-2:0
The evaluation of repeated code of the present invention Littlely increase 1 pair Subtract 8 pairs Subtract 9 pairs
The present invention is regular to be estimated The rule degree rises 4, the first sum of area code that meets The rule degree keeps 4, and is constant The rule degree falls 4, and inferior pen is not inconsistent with item
The present invention loads and estimates % Quiet R increases 0.88 Q subtracts 0.4 O increases 0.25
Moving R increases 0.36 Q subtracts 0.5 O increases 0.34
Overall merit Important advance Left side little finger of toe load and repeated code all subtract major progress greatly Right nameless load increases, repeated code subtracts greatly, sharp big fraud is little, important advance
More than in the table, the repeated code group is counted the hurdle: 1-1 refers to the repeated code between first-level Chinese characters, and 1-2 refers to the repeated code between primary word and the secondary word; 2-2 refers to the repeated code between the secondary word.
As seen from the above table, moving of parts that relate to large quantities of Chinese characters usually exerts an influence simultaneously to " three principles ".
For more detailed purpose, method and the effect of introducing particularly to parts redesign key position thereof, move on to from 41 (Y) key with parts " extensively " now that the influence to repeated code is an example before and after 44 (O) key, this design is described except that improving key position load, relative consistency has also been made very big contribution (* with the lower word back represents primary word, and * * represents the secondary word):
Caused repeated code when 1. " extensively " is on 41 (Y) key:
1.5959 open grave FYT * * 2327 mill FYT *
2.3602 mushroom AYSD * 6234 Mi AYSD * *
3.8566 pace the KHYC * * 8583 KHYC * * that walks in small steps
4.3550 waste YSSD * 3605 mill YSSD *
5.3473 numb YSSI * 8767 harness YSSI * *
6.3607 magic YSSC * 8765 petty YSSC * *
7.2987 careful YAKG * 6659 hut YAKG * *
8.5105 hawk YWWG * 5863 augury YWWG * *
9.5568 5589 YWWF * of rate YWWF *, 10.2567 wide YYGT * 2329 side YYGT * 11.5527 earnest YYKB * 3210 wide YYKB * 12.3314 honest and clean YUVO * 3911 modest YUVO * 13.3346 Liao YNWE * 3593 wrong YNWE * 14.5887 Kuang YBH * * 5890 a word used in place name YBH * * YVWI * in 15.2493 heptan 6655 an enclosure for storing grain YVWI * * 16.7094 silk floss XYT * * 2336 spin XYT *
Above repeated code has 16 pairs, and the repeated code word has 32
Wherein: primary word has 20, and the secondary word has 12
Wherein: 1-1 repeated code=6,1-2 repeated code=8,2-2 repeated code=2
2. caused repeated code after " extensively " designs on 44 (O) key:
1.3602 mushroom AOSD * 6234 Mi AOSD * *
2.8711 flathead QGOH * 3359 squama QGOH *
3.3550 waste OSSD * 3605 mill OSSD *
4.3473 numb OSSI * 8767 harness OSSI * *
5.3607 magic OSSC * 8765 petty OSSC * *
6.4505 front yard OTFP * 1858 rough OTFP *
7.2493 heptan OVWI * 6655 an enclosure for storing grain OVWI * *
More than new repeated code have 7 pairs, new repeated code word has 14
Wherein: primary word has 9, and the secondary word has 5
Wherein: 1-1 repeated code=2,1-2 repeated code=5,2-2 repeated code=0
Parts " extensively " have related to 117 Chinese characters altogether, " extensively " though displacement sacrificed a part of regularity, moved on to from first two complete " 41 " consistent and had only 44 keys the first sum of and that area code meets with the position, make the rule degree reduce 4, the rank that descended, but this some sacrifice bring be:
A. compatibility is better: make the repeated code sum that caused by " extensively " reduce to 7 pairs from 16 pairs, wherein the repeated code between the primary word has just reduced 4 pairs; And from above listed repeated code as seen, the repeated code between 2 pairs of primary words that still exist after moving " waste, grind " and " front yard, rough " in, having only a word " front yard " is everyday character." uniqueness " when this compatibility for parts is promptly imported, very big contribution beyond doubt;
B. hamony is greatly improved: the static load 0.26% (dynamic load 0.2%) that former cause right hand forefinger (controlling 6 keys altogether) was just laid particular stress on originally arrives on 44 (O) key of right ring finger control, make the too light originally dynamic load of 44 (O) key obviously rise to 1.69% from 1.33%, will obviously improve the hamony of the right hand like this, will alleviate operator's degree of fatigue and improve input efficiency.
This shows, the high parts of strong, the practical frequency of mobile word-building ability, to realize standardization, scientific requirement, with a little less than the mobile word-building ability, parts that practical frequency is low are fundamental differences, the former relates to complicated multi-disciplinary theory and analysis, purport will reach the unification of " multiple goal " or quantitatively contrast weighs the advantages and disadvantages, and the latter is then because the word that involves seldom, is of little use very much is all insignificant to the influence of the harmonious tonality of repeated code.
Certainly, according to the above design of " font code designs three principles ", only be on the basis of GWB parts system, according to one group of embodiment of " three principles ".In like manner, according to identical method, can also on basis of the present invention, continue to change the key position of several parts and become some other embodiment.For example move on to " power " on 53 (V) key or reduce again and increase few parts etc.
(4) the present invention is by research and application to " font code designs three principles ", compatibility in the distribution of parts position, regular restriction have been solved to hamony breakthroughly, realized redistributing of key position load, made hamony design of the present invention reach unprecedented high level:
The present invention has reasonably redistributed the keystroke load of each finger by the preferred and layout of parts and determining of fractionation rule, makes:
1. row's finger load alleviates under the row of going up;
2. among the same row, two ends, the left and right sides particularly load of right-hand man's little finger of toe alleviate;
3. the row of making strides to be discharged to down row's double hit and to stride the load that is discharged to double hit from following row and alleviates.
According to keyboard of the present invention and coding scheme and " Chinese character frequency table ", can calculate the keystroke static load of each key of the present invention, can do following contrast with former scheme and Zheng's sign indicating number:
The present invention Former scheme Zheng's sign indicating number Compare conclusion
Last row's key total load ????42.01 ????41.11 ????32.65 Optimum of the present invention
Middle row's key total load ????42.24 ????42.00 ????41.03 Optimum of the present invention
Under arrange the key total load ????15.75 ????16.90 ????26.32 Optimum of the present invention
In the table, former scheme refers to the Five-stroke Method four editions of current popular.The code book of Zheng's sign indicating number derives from " Zheng's sign indicating number " code book in " Zheng's sign indicating number " " standard form service manual " that easy electronics corporation prints in February, 1993 in Beijing, gets with same computed in software.
Can find out significantly that by above correlation data the present invention is in the superiority aspect the load of key position and outstanding progressive.In three schemes in the table:
1. row's load is the highest in of the present invention, promptly points the load height of lead key (original position), and it is the shortest on average to point stroke, is convenient to keystroke, can raise the efficiency;
2. the key of row down load of the present invention is minimum, and the load of the crooked withdrawal of finger keystroke is alleviated, and can raise the efficiency;
3. row load is owing to moved into a part of load from row down a little more than other two schemes in the present invention, and upwards row stretches and refers to that keystroke contracts than row down and refer to that keystroke efficient is higher.
For the relation of considering compositive frequency of component and key position load quantitatively reaches the purpose that science designs, the present invention has calculated the dynamic and static load (Figure 14) of each parts.
5.GWB component body ties up to the layout on the key position, and following three kinds of modes can be arranged:
(1) simplified mode: only list simplified parts on the key bitmap, at this moment, the present invention is applicable to the situation of input simplified Chinese character, speech;
(2) traditional font mode: when certain parts has simplified, two kinds of forms in traditional font, only list the wherein key bitmap of traditional font form, then only be applicable to the situation of handling the complex form of Chinese characters, speech;
(3) simplified and traditional parallel mode: the mode that simplified, the traditional font of whole parts all are listed on the key bitmap is simplified and traditional parallel mode, can import the simplified Chinese character and the complex form of Chinese characters simultaneously in this way.
When with above (1), can be by software the text of input by the simplified traditional font that converts to; When with above (2), can convert to the text of input simplified by the traditional font by software;
Above function is promptly called " simplified and traditional parallel, compatible exchange ".
6. the present invention is divided into the form of a stroke or a combination of strokes, key name, parts, dicode parts four big classes (Figure 15) with all parts in the parts summary table
The situation of 42 (U) key for example:
This classification designs for the purpose that reaches easy.
7. the parts on the same position of the present invention (key position) are beginning arranged in groups (Figure 16) with the master unit
Among the form of a stroke or a combination of strokes of the present invention from each key position, key name, the parts, respectively select several typical case representatives as master unit, these master units are together with homology with it, be similar to or be convenient to the parts of associative memory, with the key position is unit, is arranged as a sequence of being taken the lead by master unit separately.When tabulating or representing on keyboard, the font size of master unit can be greater than " coordination parts ", " coordination spare ".This method can reduce memory capacitance widely, shortens learning cycle.
For example the situation of the radical on 15 (A) key is:
And the situation of 33 (E) key is:
Figure A9510593100391
8. the present invention can form simplified universal " GWB " Chinese character sort method, Chinese-character index method and indexing unit thereof.
According to the present invention, after Chinese character is encoded, just can set up character library, corpus or library to any Chinese Character Set according to " GWB " of Chinese character coding, and by the compiling method of the present invention retrieval of sorting.This retrieval both can be undertaken by the region-position code (11~55) of radicals by which characters are arranged in traditional Chinese dictionaries or parts, also can press parts corresponding 25 letters (A~Y) on English standard keyboard.Thus, just must produce a kind of " GWB " ranking method of Chinese dictionary, character library, document of practicability and indexing method, descriptor index method by the present invention.After the present invention included education of middle and primary schools in, " GWB " descriptor index method will obtain widespread use.Because " GWB " is fully according to shape coding, so " GWB " ranking method, descriptor index method can be common to the simplified Chinese character and the complex form of Chinese characters.
With this ranking method, descriptor index method by software be used to have input keyboard, the system of character library, display screen, this system promptly becomes the GWB indexing unit, can be widely used in (Figure 15) among books, archives, the information retrieval, the steps include:
(1), on the key position of GWB coding scheme, knocks after the pairing key of the parts position, keyboard is pairing coding of output block or code, the key position at " dicode parts " place wherein, promptly export corresponding " dicode parts " pairing coding or code afterwards knocking corresponding " dicode parts ", this corresponding codes or code, by software be transferred to corresponding character library, word is retrieved in the storehouse;
(2), the input keyboard of GWB with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component and " dicode parts " has the function that simplified and traditional Chinese character exchanges, this exchanges the character library of function corresponding to corresponding simplified and traditional Chinese character, carries out the individual character retrieval of corresponding simplified and traditional Chinese character;
(3), corresponding being set with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component of the region-position code of GWB keyboard reaches character library, the word storehouse that " dicode parts " are associated, this character library, word storehouse are corresponding with the region-position code of the parts of above-mentioned keyboard, and retrieval includes the individual character and the word of above-mentioned parts;
(4), the region-position code of GWB keyboard is corresponding with the instruction of corresponding word, dictionary, show according to pairing above-mentioned traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, the word of likeness in form parts, deformation component and dicode parts, the display screen of speech of including of the region-position code of keyboard device;
9. the present invention is coupled to hardware system with display, main frame, Chinese character data bank, printer, utilization totally 25 key positions distributes Hanzi components to constitute the keyboards that the Five-stroke Method key position distributes what be no less than 25 key positions with each 5 in 5 districts, or utilize similar window type to show, can carry out the system of Chinese character data-searching easily, the steps include:
(1), 5 on totally 25 key positions in 5 districts, with 11,12 ... 54,55 region-position codes as each group parts form and key position parts system one to one with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component and " dicode parts ";
(2), with above-mentioned component body be, Chinese character is split as the parts sequence with the method for splitting of " the root preface is preferential; normative stroke order; get big; take into account directly perceived " as far as possible, get its maximum first, second, third and the parts in front and back as the input coding parts, with the position at this parts place or its code as its input code, 4 yards of less thaies add identification code and the retrieval input method that forms;
(3), with above-mentioned retrieve encoded system, by software, automatically any Chinese character data is carried out the ordering of individual character or word, and utilizes this retrieve encoded parts system and retrieve encoded input method that any Chinese character data is retrieved;
(4), by keyboard, display, printer above-mentioned result for retrieval is edited, shown and exports;
Constitute Chinese character sort and the Chinese character index system thereof that can be used for books, archives, information processing with this.
10. appended " the five stroke input keyboards " of the present invention is on stroke classification, and it is detailed on key position or subordinate list to roll over pen " second "
11. the present invention shows that by the practical frequency of Chinese character repeated code is when a certain Chinese character of input runs into the repeated code word, this word and with it the word of repeated code be presented at from left to right in the presenting bank of screen by its practical frequency, the most frequently used word is presented at high order end, when this word of needs, as long as can being shown to the editor position automatically, the word of this word back in the input file, this word get on.When the word of needs was on position such as the 2nd, the 3rd, the numerical key of available keyboard top was selected.
When 12. Intelligent treatment repeated code of the present invention runs into repeated code when a certain Chinese character of input, can be considered and do not have repeated code, and utilize installed dictionary and Intelligent treatment software in the machine automatically to differentiate the word and the relation of the collocation between the repeated code word of hereinafter input, automatic will can deciding with that word of word-building hereinafter in the middle of the repeated code word or revisal is come.
13. input media of the present invention can be a special Chinese keyboard of making, also can directly continue to use the QWERTY keyboard of a standard, when being the Chinese keyboard of a special manufacturing, both can be a zone (Figure 16) that includes 25 keys within a big keyboard, the middle keyboard, also can be one 5 row, 5 key positions of every row mini keyboard (Figure 17).
14. the present invention is by encoding to simplified and traditional Chinese character, write the Input Software module, use keyboard to computer or other communication apparatus input character information, from character library, retrieve word or the speech that to import by software and system, and word or speech be presented on the screen or directly print, form a cover Chinese character information processing system with this, this system comprises (Figure 18) such as input keyboard, Input Software, operating system, character library, dictionary, main frame, display, printer, terminal and workstations.
15. the present invention is the load module of main contents by input coding, goes for and is transplanted to various computers and information handling system and get on.
16. component set of the present invention, parts system, coding scheme, input keyboard, information handling system, can also handle the Chinese character and the word thereof of CJK10646 large character set or more big collection.
17. the present invention is than the remarkable advantage and the substantive progress of existing the Five-stroke Method:
The present invention has cancelled " making certainly " radical that does not meet Chinese-character canonical, increase and meet traditional radicals by which characters are arranged in traditional Chinese dictionaries of Chinese-character canonical as parts, design meets the dicode parts of Chinese-character canonical, design homology, likeness in form parts, design traditional font parts are clearly rolled over form of a stroke or a combination of strokes attitude, change key name, compare with former scheme, following advantage and good effect arranged:
1.. parts (radical) system meets Chinese-character canonical and compares obviously different with former scheme and have remarkable advantage:
Project Symbol Former scheme The present invention Contrast
Key total ??K ????25 ????25 Identical
The maximum code length of words ??L ????4 ????4 Identical
Radical (parts) sum ??N ???199 ????246 Increase by 23.6%
The dicode component count ??S ????0 ????17
The present invention compares with existing technology, has obtained substantive progress, only just can find out from following table from changing the number of words that splits and encode:
The present invention and former scheme disassembled coding similarities and differences comparison sheet
Project Different numbers Among 3755 primary words Among the 6763 two-stage words
Split different words Number of words 2070 3820
Ratio % ??55.13 ??56.48
The different word of encoding Number of words 1825 2571
Ratio % ??31.53 ??28.56
2.. coding scheme meets Chinese-character canonical, and the progress and the breakthrough of matter are arranged than former scheme.
3.. the regularity of parts branch zoning position is significantly improved, and average " the rule degree " of parts brings up to of the present invention 7.42 from 7.26 of former scheme.
4.. the compatibility of parts has substantive progressive.
As the general knowledge of encode Chinese characters for computer academia, people know, under the certain prerequisite of bond number, parts tear open " broken " more, difficult more generation repeated code; Conversely, parts are " greatly " more, causes repeated code easily.And the present invention individually meets traditional radicals by which characters are arranged in traditional Chinese dictionaries of Chinese character teaching norm as parts current programme being done so to have improved significantly, to have increased more than 20, under the situation that has obtained substantive innovation like this aspect the standardization, total repeated code does not only increase, have on the contrary and obviously reduced by 19 pairs, this is to inquire into a new height, the new high degree that just can not reach repeatedly without long-term conscientiously research.
Item compared Former scheme The present invention Compare conclusion
Total repeated code (to) ????259 ????240 The present invention reduces 19
The gross weight code word ????531 ????497 The present invention reduces 34
The primary word repeated code ????292 ????282 The present invention reduces 10
Secondary word repeated code ????239 ????215 The present invention reduces 24
5.. parts divide zoning position and key position ergonomic design principle, the input keystroke hamony major progress is arranged, can alleviate the typing personnel fatigue, reduce errors, improve input efficiency.Dynamic load contrast of each key position of the present invention and former scheme is as follows, and each key top one row's data is former scheme in the table, and a following row is for of the present invention:
Each key position dynamic load contrast table (%) of the present invention and former scheme
From above contrast as can be seen, 35 (Q) of the left hand pinkie of " incapability " control, the load of 15 (A) obviously descend; 44 (0) keys that load is too light in the former scheme and the load of 32 (R) key then obviously rise, and no matter the present invention is that static load distributes or dynamic load distributes, and all is better than former scheme significantly.In view of the domestic example that quantizes calculation key position load for the design input technology of also not reporting up to the present, the precedent that does not also have calculating unit load and key position to load, so we can say, aspect the hamony of realizing finger keystroke, the present invention has obtained unprecedented progress.
6.. keyboard Designing meets Chinese-character canonical and ergonomics principle, is convenient to popularization and application in middle and primary schools.
The present invention be at home and abroad the Chinese Computer information processing technology develop rapidly, widespread use, rapidly enter teaching, enter under the situation of family and arise at the historic moment.At current " the Five-stroke Method " in the input in Chinese technology under the situation of dominate, the present invention in time and has effectively overcome " the Five-stroke Method " in the problem that aspects such as standardization, scientific, practicability obviously exist, and makes the present invention than current programme a breakthrough development be arranged at aspects such as science, creativeness, standardization, practicability.
The present invention is a system engineering, carried out brand-new creation for the system that comprises main frame, display, input keyboard, printer, character library, dictionary and software etc.: meet the parts system that contains the dicode parts, the dicode coding scheme of educational standards, the foundation of double-code key bit keyboard, and the word corresponding with the coding of dicode coding scheme, the setting of dictionary and the establishment of corresponding system, guaranteed that the present invention is able to smooth enforcement, and reached as what is desired the purpose of being expected---make the Five-stroke Method technological direction standardization, scientific and more practical, higher efficient is arranged, more wide application is arranged and even in middle and primary schools, popularize. These all are the achievements of inventor's creative work over 10 years. This achievement is so that the present invention is that former scheme is compared with existing the Five-stroke Method technology, qualitative leap is arranged, the present invention is become directly to enter the education of middle and primary schools system at home and abroad, realize that normalized Chinese character teaching and normalized Chinese character input teaching combines nearly, add that the present invention effectively processes the advanced of CJK10646 large character set, of will become that worldwide simplified universal, simplified and traditional mutual usefulness, words melt altogether of the present invention is that carry forward Chinese culture is made major contribution, the brand-new technology of huge social effect is arranged.

Claims (5)

1,, and, it is characterized in that according to a kind of educational standards five stroke character type computer Chinese character input method of getting " one, three, three, end " component coding input Chinese character at most at the keyboard layout that is no less than on the key position of 25 keys with each 5 distribution member of 5 districts and input code:
(1) according to Chinese-character canonical teaching and analysiss of word source, grapheme analysis, integrated use, custom agreement, system's regulation etc. preferably, determine the principle of parts, formed one meet Chinese-character canonical, meet teaching norm, simplified and traditionally walk abreast, the set of standardization the Five-stroke Method addressable part of compatible exchange.The set of this addressable part is that the newly-increased whole word that meets Chinese-character canonical or traditional radicals by which characters are arranged in traditional Chinese dictionaries are as parts:
_, penta, insect without feet or legs, Cui, family name, mother, gas, slit bamboo or chopped wood, or not fork-like farm tool used in ancient China, the tenth of the twelve Earthly Branches, leather, skin, boat, Niu, Shi, Quan, fish, sheep , _), Woo, Yi, blunt ( );
(2) in above-mentioned set of reading addressable part, the parts of traditional radicals by which characters are arranged in traditional Chinese dictionaries that will meet Chinese-character canonical are designed to " dicode parts ", the coding that is about to several traditional radicals by which characters are arranged in traditional Chinese dictionaries is coded in the rationality that distributes in the whole space encoder according to its stroke feature and its and is designed to 2 sign indicating numbers, forms " the dicode parts " that meet Chinese-character canonical with this:
Not, fork-like farm tool used in ancient China, the tenth of the twelve Earthly Branches, leather, skin, boat, Niu, Shi, Quan, fish, sheep
Figure A9510593100023
, _), Woo, Yi, blunt ( );
(3) in above-mentioned component set, increase and the coordination parts master unit homology, likeness in form, distortion or that be convenient to associative memory:
Figure A9510593100025
Xi,
Figure A9510593100027
European-allies, day,
Figure A9510593100028
Figure A95105931000211
Month,
Figure A95105931000212
With, Mi, _,
Figure A95105931000214
Shui,
Figure A95105931000215
Slit bamboo or chopped wood, _,
Figure A95105931000217
An ancient type of spoon, seven
Formed the component set that meets Chinese-character canonical with this;
Compatibility when (4) drawing the position according to the parts grouping, and with the first sum of consistent with area code, the inferior pen regularity consistent with item, the hamony three of the load distribution of each position and finger keystroke realizes that multiobject unification is a foundation, the set of above-mentioned parts and other parts is divided into 5 districts, each distinguishes 5 positions, respectively with 11,12,13,54,55 as each the group parts region-position codes, with this of forming meet Chinese-character canonical and with the key position one to one, can carry out the parts system of disassembled coding to simplified and unsimplified Hanzi and CJKl0646 large character set:
The GWB component body ties up to the layout on the key position
Figure A9510593100031
(5) according to " the root preface is preferential, normative stroke order, as far as possible get big, take into account directly perceived " get one, two, three, general provisions that method that four yards of maximum four parts in end, less than are added identification code forms Chinese characters disassembled coding;
Utilize the above-mentioned coding input step of the key position input code of traditional radicals by which characters are arranged in traditional Chinese dictionaries, dicode parts, same source block, likeness in form parts, deformation component and other parts that comprises to be:
A: individual character input
1. itself be the individual character of parts
A, single are drawn: single sign indicating number+single sign indicating number+definitions+definitions (or adding a definitions)
B, key name word: with place key double hit 4 times, or by the solid size person that becomes word input or tear open and make 2-4 single and draw and import
C, solid size parts become word person: region-position code+the first sum of single+inferior single+end single
During 4 yards of less thaies, the blank fill key is as the end of input mark
Or split into 2-4 single and draw
D, dicode become the word, and the person has 5 kinds can use or use simultaneously wherein several coding input modes respectively:
I) directly with 2 sign indicating numbers of dicode parts as input code;
Ii) first yard+second yard+the first sum of single+an end single;
Iii) first yard+the first sum of single+inferior single+end single;
Iv) add suffix code, or prefixing and suffix code form the input code input simultaneously in front and back in the front prefixing sign indicating number or the back of 2 sign indicating numbers of dicode parts;
V) split into the single that is no more than 4 and draw input, during 4 of less thaies, add the space bar input;
2. non-parts individual character
According to the fractionation general provisions of " the root preface is preferential, and normative stroke order is got greatly, takes into account directly perceived " as far as possible, be split into after the sequence into solid size parts or dicode parts without exception
A, when not containing the dicode parts
When 5 parts are above, get one, two, three, the input of last component coding
During 4 parts, get the input of encoding successively of its whole parts
During 2~3 parts, get successively after its whole parts, add " last stroke character patten identification code ", still less than is 4 yards, plays space bar as the end of input mark;
B, when containing the dicode parts, when the above-mentioned dicode parts that meet Chinese-character canonical are participated in fractionation and coding input, carry out according to the following steps:
1. get one, two, three, last parts: promptly splitting rule according to parts, is that unit is split as Chinese character " parts sequence " with parts, gets one, two, three more at most from this " parts sequence ", four parts in end are as " addressable part ";
2. " addressable part " pairing region-position code or English alphabet in the 1. middle quilt of step being got are listed successively, become one " true form sequence ", and during wherein if any the dicode parts, 2 sign indicating numbers of dicode parts all will be listed in;
3. getting at most " one, two, three, end " four sign indicating numbers according to following three kinds of situations from " true form sequence " is input code as the correct coding of this Chinese character;
The code taking rule of three kinds of situations in the following example shown in: wherein with ABCD as the single sign indicating number of " one, two, three, end " parts or first yard, with WXYZ second yard as dicode parts among " addressable part "; When 4 sign indicating numbers of " true form sequence " less than, also should as existing the Five-stroke Method technology, add " last stroke character patten identification code ";
A, " addressable part " are 2 o'clock: First parts Second parts Input code ??A ??B ?AB ??BX ?ABX ??AW ??B ?AWB ??BX ?AWBX
B, " addressable part " are 3 o'clock: First parts Second parts The 3rd parts Input code ??A ??B ??C ?ABC ??CY ?ABCY ??BX ??C ?ABXC ??CY ?ABXY ??AW ??B ??C ?AWBC ??CY ?AWBY ??BX ??C ?AWBC ??CY ?AWBY
C, " addressable part " are 4 o'clock: First parts Second parts The 3rd parts The most last parts Input code 1 Input code 2 ??A ??B ??C ??D ?ABCD ?ABCD ??DZ ?ABCZ ??CY ??D ?ABCD ??DZ ?ABCZ ??BX ??C ??D ?ABXD ??DZ ?ABXZ ??CY ??D ?ABXD ??DZ ?ABXZ ??AW ? ?B ??C ??D ?AWBD ??DZ ?AWBZ ??CY ??D ?AWBD ??DZ ?AWBZ ??BX ??C ??D ?AWBD ??DZ ?AWBZ ??CY ??D ?AWBD ??DZ ?AWBZ
Input code 2 is that 4 " addressable parts " are respectively got first yard, and promptly no matter wherein the dicode parts have severally, get ABCD without exception;
B. word input
No matter form in the individual character structure of vocabulary whether contain the dicode parts, the coding and input method of its word is all the same, promptly all with the all-key of its individual character basis, wherein as code fetch:
1. two-character word
Preceding two sign indicating numbers totally 4 yards inputs of its individual character all-key got in every word
2. three words
Preceding two sign indicating numbers that first yard, the last character that its individual character all-key respectively got in preceding 2 words got its individual character all-key amount to 4 yards inputs
3. four words
First sign indicating number that its individual character all-key got in every word amounts to 4 yards inputs
4. multi-character words
Get first, second, third and first sign indicating number of the last character individual character all-key amount to 4 yards inputs;
It is identical that coded input method and the coding elm of the word that does not conform to the dicode parts that contains the word of dicode parts goes into method, the steps include: the all-key based on individual character, divides 2 words, 3 words, 4 words and multi-character words respectively by above requirement code fetch and input;
With this component set, coding scheme and input method of containing traditional radicals by which characters are arranged in traditional Chinese dictionaries, dicode parts, same source block, likeness in form parts, deformation component that forms;
2, according to dicode word, the Chinese word coding input method of claim 1, it is characterized in that, after the coding input of dicode parts also can be split as the two parts that may contain the non-standard parts, encode respectively and import, at this moment, the non-standard parts that may occur in the fractionation just are equivalent to be on the key position of second sign indicating number indication of dicode parts, or other according to being coded in the rationality that distributes in the space encoder, compatibility according to shape artificially on the key position of appointment, that is:
Not 43---one
Figure A9510593100071
(11 43) fish 11--- One (35 11)
---_ foretell (13 21) Fish 44---
Figure A9510593100073
Xiangxi (35 44)
Fork-like farm tool used in ancient China 43---three
Figure A9510593100074
(13 43) sheep 13 — — Ha _ (42 13)
The tenth of the twelve Earthly Branches 11---one (14 11), west
Figure A9510593100075
13 — — Ha _ (42 13)
Leather 12---twenty
Figure A9510593100076
(15 12) _ 13 — — Ha kings (42 11)
Skin 54---_ (21 54) Woo 41---again _ Dian (45 41)
Boat 41---Pie (31 33) Yi 42--- (45 42)
Pie, Dian (31 41) blunt 33---Ji (53 33)
Shi 33---the people (34 54)
Figure A95105931000712
54---Ji
Figure A95105931000713
(53 54)
Quan 31--- Pie (35 31)
Form transitional component set and the coding scheme thereof that carries out the transition to " standard fully " from " not exclusively standard " with this;
3, a kind of educational standards five stroke character type computer Hanzi input keyboard is characterized in that:
On each 25 key position of 5,5 districts, be set with brand-new addressable part, wherein:
(1) on following key position, increased the whole word that meets Chinese-character canonical or traditional radicals by which characters are arranged in traditional Chinese dictionaries newly as the coding input block:
13.D-penta 21.H---_ 32.R---gas 33.E---insect without feet or legs
34.W---Cui 35.Q---family name 42U---slit bamboo or chopped wood 55.X-mother
(2) on following key position, designed " the dicode parts " that meet Chinese-character canonical:
11.G---13.D---fork-like farm tool used in ancient China 14.S---tenth of the twelve Earthly Branches not
15.A---leather 21.H---skin 31.T---boat, Niu
34.W — — Shi 35.Q---Quan, Yu, Fish
42.U---sheep, _ 45.P---Woo, Yi 53.V---is blunt,
(3) on following key position, designed same source block, likeness in form parts, deformation component:
12.F---
Figure A9510593100083
13.D--- 14.S---Xi
15.A---
Figure A9510593100085
European-allies 21.H---
Figure A9510593100086
22.J---day
24.L---
Figure A9510593100088
25.M--- 33.E---
Figure A95105931000810
Month With
35.Q---_ ---42.U slit bamboo or chopped wood 43.I--- _ Shui
44.O—— ?????????????????????????????????????????51.N——
Figure A95105931000816
ユ????????53.V——_
Figure A95105931000817
55.X--- An ancient type of spoon seven
(4) on following key position, newly designed parts in the former scheme:
32.R---a few Qe
44.O---wide
The key name of (5) 55 (X) key is set at " one ", and the key name of 51 (N) key is set at " own ";
(6) the folding pen " second " on the 51.N key is represented simultaneously:
Constitute simplified Chinese character, the speech that both can import Chinese character with this, also can import the complex form of Chinese characters, speech, and be applicable to the input keyboard that meets Chinese-character canonical of handling the CJK10646 large character set;
4, comprise display, main frame, character library, word storehouse, printer, and utilize what be no less than 25 key positions and totally 25 key positions distribute the keyboard of Hanzi component formation the Five-stroke Method keyboard layouts with 5 in 5 districts, the maximum inputs first, second, third of foundation and the last parts or its coding are finished the individual character of monomer and unsimplified Hanzi and a kind of educational standards five stroke character type computer Chinese character input system of word input, it is characterized in that:
(1) in 5 districts each 5 the following position of the keyboard of totally 25 key positions be on the key position
A, increase meet traditional radicals by which characters are arranged in traditional Chinese dictionaries of Chinese-character canonical as whole word parts, and design is on following position:
_---21, penta---13, insect without feet or legs---33, Cui---34, the family name---33,
Female---55, gas---32, slit bamboo or chopped wood---42
B, in 5 districts 5 the following position of the keyboard of totally 25 key positions be on the key position with same source block, likeness in form parts, deformation component merger on the position at master unit place:
Figure A9510593100091
???????????????????——12????????????????
Figure A9510593100092
?????????????——13
Xi---14
Figure A9510593100093
---21
Figure A9510593100094
European-allies---15 days ---22
Figure A9510593100096
???????????????????——24????????????????? ?????????????——25
Month With---33 _ ---35
_ Shui---43
Figure A95105931000913
---44
Slit bamboo or chopped wood---42 _ ---53
Figure A95105931000915
ユ---51
Figure A95105931000916
An ancient type of spoon seven---55
C, with word construction frequency and all quite high common components " several " the, “ Qe of practical frequency ", " extensively " according to its in the unified design of multiple goal of the compatibility on the keyboard, regularity, hamony three on following key position:
Several---32 (R) Qe---32 (R) are wide---44 (O)
D, " one " is decided to be the key name of 55.Q, " own " is decided to be the key name of 51.N
E, will roll over the pen " second " represent the following form of a stroke or a combination of strokes:
Figure A95105931000917
F, in 5 districts each 5 the following position of the keyboard of totally 25 keys be will meet Chinese-character canonical on the key position " dicode parts " design on corresponding position:
Figure A95105931000918
On the key position of above input coding system, knock after the pairing key of the parts position, keyboard is pairing coding of output block or code, the key position at " dicode parts " place wherein, promptly export corresponding " dicode parts " pairing coding or code afterwards knocking corresponding " dicode parts ", this corresponding codes or code, by software be transferred to corresponding character library, word is retrieved in the storehouse;
(2) above-mentioned input keyboard with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component and " dicode parts " has the function that simplified and traditional Chinese character exchanges, this exchanges the character library of function corresponding to corresponding simplified and traditional Chinese character, carries out the individual character retrieval of corresponding simplified and traditional Chinese character;
(3) be set with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component corresponding with the region-position code of above-mentioned keyboard reaches character library, the word storehouse that " dicode parts " are associated, this character library, word storehouse are corresponding with the region-position code of the parts of above-mentioned keyboard, and retrieval includes the individual character and the word of above-mentioned parts;
(4) corresponding with the region-position code of above-mentioned keyboard with the instruction of corresponding word, dictionary, show according to pairing above-mentioned traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, the word of likeness in form parts, deformation component and dicode parts, the display screen of speech of including of the region-position code of keyboard device;
5, a kind ofly comprise display, main frame, Chinese character data bank, printer, and utilize and be no less than totally 25 key positions the distributing Hanzi components to constitute the keyboard of the Five-stroke Method key position distribution or utilize similar display window to carry out the system of Chinese character data-searching of 25 key positions, it is characterized in that with each 5 in 5 districts:
(1) in 5 districts 5 on totally 25 key positions, with 11,12 ... 54,55 region-position codes as each group parts form and key position parts system one to one with traditional radicals by which characters are arranged in traditional Chinese dictionaries, same source block, likeness in form parts, deformation component and " dicode parts ":
The GWB component body ties up to the layout on the key position
(2) with above-mentioned component body be, Chinese character is split as the parts sequence with the method for splitting of " the root preface is preferential; normative stroke order; get big; take into account directly perceived " as far as possible, get its maximum first, second, third and the parts in front and back as the input coding parts, with the position at this parts place or its code as its input code, 4 yards of less thaies add identification code and the retrieval input method that forms;
(3),, automatically any Chinese character data is carried out individual character or word ordering, and utilize this retrieve encoded parts system and retrieve encoded input method that any Chinese character data is retrieved by software with above-mentioned retrieve encoded system;
(4) by keyboard, display, printer above-mentioned result for retrieval is edited, shown and exports;
Constitute Chinese character sort and the Chinese character index system thereof that can be used for books, archives, information processing with this.
CN 95105931 1995-06-09 1995-06-09 Method and device for ducation standardized inputting Chinese characters by five stroke Pending CN1154502A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 95105931 CN1154502A (en) 1995-06-09 1995-06-09 Method and device for ducation standardized inputting Chinese characters by five stroke

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 95105931 CN1154502A (en) 1995-06-09 1995-06-09 Method and device for ducation standardized inputting Chinese characters by five stroke

Publications (1)

Publication Number Publication Date
CN1154502A true CN1154502A (en) 1997-07-16

Family

ID=5075663

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 95105931 Pending CN1154502A (en) 1995-06-09 1995-06-09 Method and device for ducation standardized inputting Chinese characters by five stroke

Country Status (1)

Country Link
CN (1) CN1154502A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1317631C (en) * 2004-12-31 2007-05-23 魏世勇 Integral pattern-joining Chinese character input method
CN100343789C (en) * 2005-08-18 2007-10-17 姜涛里 A ten-stroke ten-component classification method and Chinese character input method composed therefrom
CN100388169C (en) * 2002-02-07 2008-05-14 杨华纬 Simplified and original complex form mixed Chinese character shape and code three key computer input method
CN100389376C (en) * 2005-09-01 2008-05-21 钱任举 Universal Chinese character input method and virtual keyboard thereof
CN100401238C (en) * 2005-09-23 2008-07-09 英保达股份有限公司 Inputting-unit setting system and method therefor
CN100410855C (en) * 2005-12-31 2008-08-13 联想(北京)有限公司 Method of inputting special information by keyboard
CN101833378A (en) * 2010-04-12 2010-09-15 林海涛 Standard five-stroke input method and keyboard thereof
CN101930298A (en) * 2010-08-16 2010-12-29 胡锡全 Chinese character striding input method

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100388169C (en) * 2002-02-07 2008-05-14 杨华纬 Simplified and original complex form mixed Chinese character shape and code three key computer input method
CN1317631C (en) * 2004-12-31 2007-05-23 魏世勇 Integral pattern-joining Chinese character input method
CN100343789C (en) * 2005-08-18 2007-10-17 姜涛里 A ten-stroke ten-component classification method and Chinese character input method composed therefrom
CN100389376C (en) * 2005-09-01 2008-05-21 钱任举 Universal Chinese character input method and virtual keyboard thereof
CN100401238C (en) * 2005-09-23 2008-07-09 英保达股份有限公司 Inputting-unit setting system and method therefor
CN100410855C (en) * 2005-12-31 2008-08-13 联想(北京)有限公司 Method of inputting special information by keyboard
CN101833378A (en) * 2010-04-12 2010-09-15 林海涛 Standard five-stroke input method and keyboard thereof
CN101833378B (en) * 2010-04-12 2012-09-19 林海涛 Standard five-stroke input method and keyboard thereof
CN101930298A (en) * 2010-08-16 2010-12-29 胡锡全 Chinese character striding input method
CN101930298B (en) * 2010-08-16 2012-09-26 胡锡全 Chinese character striding input method

Similar Documents

Publication Publication Date Title
CN1577229A (en) Method for inputting note string into computer and diction production, and computer and medium thereof
CN85101817A (en) An zijie type Chinese-character stroke computer code's method and keyboard thereof
CN1280748C (en) Speed typing apparatus and method
CN1154502A (en) Method and device for ducation standardized inputting Chinese characters by five stroke
CN1048343C (en) Free combination code Chinese character input method and key board
CN1140865C (en) Super numeral code
CN1089919C (en) Chinese character-splitting coded method and its keyboard for computer
CN1399185A (en) Integral Chinese character input method and its keyboard
CN1025896C (en) New concept Chinese character coding
CN1054695C (en) Computer Chinese character eight-four code input method and key board
CN1026924C (en) Chinese-character sound dissection encode and input method
CN1529219A (en) Language code inputting method
CN1129836C (en) Li Ming multifunctional shape-meaning-class-letter encode technique for inputting Chinese characters
CN1417674A (en) Chinese syllable double reading scheme, Chinese keyboard and information input and processing method
CN1155874C (en) Simplified and unsimplified Chinese characters unified keyboard encode method and its input method
CN1276337C (en) Computer Chinese character coding inputting method
CN1259615C (en) Letter-keyboard and number-keyboard universal inputting method for Chinese character inputting and left-part character-shape identification method
CN1402110A (en) Information input method and use
CN1043209A (en) Computer chinese treatment method
CN1068127C (en) Text data processing method and device
CN1604017A (en) Chinese character characterized location encoding combination input method based on one-key -for-one-character
CN1074842C (en) Simple digital encode scheme for Chinese characters
CN1019527B (en) Character pixel input method and its keyboard
CN1752899A (en) Chinese language coding and its Chinese character input method and retrival method
CN1442780A (en) English quick input method and its keyboard mouse

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C01 Deemed withdrawal of patent application (patent law 1993)
WD01 Invention patent application deemed withdrawn after publication