CN102830809B - Encode method for entering Chinese characters - Google Patents

Encode method for entering Chinese characters Download PDF

Info

Publication number
CN102830809B
CN102830809B CN201110160422.1A CN201110160422A CN102830809B CN 102830809 B CN102830809 B CN 102830809B CN 201110160422 A CN201110160422 A CN 201110160422A CN 102830809 B CN102830809 B CN 102830809B
Authority
CN
China
Prior art keywords
code
chinese character
character
chinese
parts
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201110160422.1A
Other languages
Chinese (zh)
Other versions
CN102830809A (en
Inventor
董为群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wen Hua (beijing) Education Technology Co Ltd
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201110160422.1A priority Critical patent/CN102830809B/en
Publication of CN102830809A publication Critical patent/CN102830809A/en
Application granted granted Critical
Publication of CN102830809B publication Critical patent/CN102830809B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a kind of encode method for entering Chinese characters, it comprises: according to Chinese character separating program, Hanzi structure is split, by Chinese character be divided into that single character, left and right equate or left, center, right equates, left few right many, left many right sides equate less, up and down or upper, middle and lower equates, upper few lower many, upper how under less, two sides encirclement, three bread enclose and completely encircle and special construction, and use respectively a numerical key as its code; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses respectively a numerical key as its code; According to the feature that will input Chinese character, using the code of its Hanzi structure as coding the 1st, using the number of components of Chinese character or stroke number as coding the 2nd, and using the code of the stroke of this Chinese character all parts as coding 3rd~6; By the numerical key on keyboard or soft keyboard by coding input.

Description

Encode method for entering Chinese characters
Technical field
The present invention relates to Chinese information processing technology field, in particular to a kind of encode method for entering Chinese characters.
Background technology
Current encode character for computer roughly can be divided into level Four pattern: the first order is whole word pattern; The second level is to meet countryThe normal parts pattern of standard; The third level is the non-standard parts split mode between normal parts and stroke; The fourth stageIt is stroke pattern.
Wherein whole word pattern is without Chinese character is carried out to any fractionation, and spelling input method and region-position code are exactly the typical case of this patternRepresentative. Current spelling input method is the most popular input method of Chinese character. Its advantage is will input by phonetic, is the most naturalChinese character coding input method. Its weakness be cannot input not can pronunciation Chinese character, because space encoder is too small, cause repeated code too much. MoreSerious is to use for a long time Pinyin Input, can weaken the memory to Chinese character pattern, reduces the writing level of Chinese character, even causes the ChineseWord amnesia.
The problem of normal parts input method is to remember a large amount of parts, and what on July 1st, 2009 start to try is " existingFor commonly used word parts and component names specification " GF0014-2009 specified 514 parts. As far back as the GF3001-issuing before this1997 information processings with GB13000.1 character set Hanzi component regulation and stipulations 560 parts. How hundreds of parts rationally divideCloth, on QWERTY keyboard, is a difficult problem that there is no solution always.
Third level pattern between normal parts and stroke, due to less consideration Chinese character self-law, is not subject to countryStandard constraint, or have a preference for for individual, or being limited to oneself opinion, the various schemes that compare one's strong points with others' weak points emerge in an endless stream. ThisCause just the major reason of Chinese character shape code and phonetic-stroke code " ten thousand yards of Pentium " low-level repetition.
The advantage of stroke pattern is easy to learn easy to remember, can write and will input; Shortcoming is that the input that stroke is many is slow, the weight that stroke is fewCode is many, can not write and can input.
As can be seen here, what China Computer Users was the most scarce still really meets character rule, Chinese character input easy to learn and easy to useMethod. Moreover, both at home and abroad Chinese character teaching also to need really to contribute to become literate, write, look into the Chinese character of word and typewriting defeatedEnter coding.
The effect of encode character for computer is not only to input Chinese character. The most important teaching task of primary school period Chinese course is to knowWord, good encode method for entering Chinese characters should be able to play the effect that other means do not have in character learning, can effectively help to learnRaw learning and memory Chinese character, obviously improves the timeliness of character learning, be conducive to the understanding of student to Chinese character and have deep love for, and allows Chinese-character canonicalPopularization is fulfilled. These just contemporary Chinese character teaching to encode method for entering Chinese characters requirement, if encode character for computerItself be exactly a kind of character learning method of specification, and contribute to solve the large problem that Chinese character finds it difficult to learn, that just can't be better!
Summary of the invention
The invention provides a kind of encode method for entering Chinese characters, in order to help people's learning and memory Chinese character in the time that Chinese character is inputted.
For achieving the above object, the invention provides a kind of encode method for entering Chinese characters, it comprises the following steps: according to Chinese characterDisassembler splits Hanzi structure, by Chinese character be divided into that single character, left and right equate or left, center, right equates, left few right many, leftMany right sides equate less, up and down or upper, middle and lower equates, upper few lower many, upper many under less, two sides encirclement, three bread enclose and completely encircle andSpecial construction, and use respectively a numerical key as its code; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses respectively oneNumerical key is as its code; According to the feature that will input Chinese character, using the code of its Hanzi structure as the 1st that encodes, willThe number of components of Chinese character or stroke number are as the 2nd of coding, and using the code of the stroke of this Chinese character all parts as coding3rd~6; By the numerical key on keyboard or soft keyboard by coding input.
Preferably, in the time that Chinese character is single character, using the stroke number of this single character as its coding the 2nd.
Preferably, horizontal code is 1, and perpendicular code is 2, and the code of slash is 3, and the code of point is 4, and the code of folding is 5, withThe horizontal code that other strokes intersect is 6, and the code of lifting-hook is 7, and the code of the slash of intersecting with other strokes is 8, with otherThe code of drawing the folding intersecting is 9.
Preferably, when Chinese character is single character, the single character that exceedes 4 pictures is got it and front 4 is drawn and encode.
Preferably, in the time that Chinese character comprises 2 parts, encode for first 2 that get respectively its 2 parts; When Chinese character comprises 3When individual parts, first 2 of the 1st and the 3rd parts that get respectively front 2 parts encode; When Chinese character comprise 4 and more thanWhen parts, encode for the 1st that gets respectively front 4 parts; Wherein, when if desired getting the parts of first 2 and only having 1 stroke,While coding, this stroke is repeated 2 times.
Preferably, the identifier finishing in advance as coding by digital " 0 ".
Preferably, in coding, last position with the first letter of pinyin of this Chinese character as this encode character for computer, wherein this spellingSound initial can appear on any position of this encode character for computer.
In above-described embodiment, carry out the input of Chinese character according to the Hanzi structure of Chinese character, parts and stroke, not only can be used asThe Chinese character input method of high efficient and flexible, is widely used in the information terminal apparatus of computer and all kinds of employing numeric keypads; Also canWith with helping people's learning and memory Chinese character, to the character learning in Chinese character teaching, handwriting practicing with look into word and all have larger booster action.
Brief description of the drawings
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existingHave the accompanying drawing of required use in technical description to be briefly described, apparently, the accompanying drawing in the following describes is only thisSome embodiment of invention, for those of ordinary skill in the art, not paying under the prerequisite of creative work, all rightObtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is input method of Chinese character flow chart according to an embodiment of the invention;
Fig. 2 is encode character for computer inputting interface screenshot capture according to an embodiment of the invention;
Fig. 3 is digital according to an embodiment of the invention pure shape code individual character input " Chinese " screenshot capture;
Fig. 4 is digital according to an embodiment of the invention pure shape code individual character input " word " screenshot capture;
Fig. 5 is digital according to an embodiment of the invention pure shape code individual character input " volume " screenshot capture;
Fig. 6 is digital according to an embodiment of the invention pure shape code individual character input " code " screenshot capture;
Fig. 7 is digital according to an embodiment of the invention pure shape code word input " encode character for computer " screenshot capture;
Fig. 8 is the input of shape tone code word according to an embodiment of the invention " encode character for computer " screenshot capture;
Fig. 9 is the input of pure tone code word according to an embodiment of the invention " encode character for computer " screenshot capture;
Figure 10 is the input of trigram loan blend according to an embodiment of the invention " encode character for computer " screenshot capture.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, completeWhole description, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiment. Based onEmbodiment in the present invention, those of ordinary skill in the art are not paying obtain under creative work prerequisite every otherEmbodiment, belongs to the scope of protection of the invention.
The present invention relates to one and meet national standard specification, towards Chinese character teaching, there is digital pure shape, the combination of shape sound and wordThree kinds of forms of female pure tone are without the encode method for entering Chinese characters that switches mixed function. The method, taking digital watch shape as main, is aided with phoneticInitial, therefore be referred to as " number form consonant " input method.
Encode method for entering Chinese characters is the most important entrance that Chinese information enters computer and mobile information terminal apparatus. WithTime be also important means and the instrument of Chinese character teaching, and be the technological approaches that solves Chinese character sort search problem.
The present invention is observing on the basis of national language word related standards and specification completely, wants from Hanzi structure and configurationQuantitative relation between element is started with, and extracts " configuration quantity " this new code element, and direct by 0~90 numeralExpress this code element, be aided with Chinese Pin Yin initial, formed and followed national standard specification, towards Chinese character teaching, toolThere are digital pure shape, three kinds of forms of the combination of shape sound and alphabetical pure tone without the encode method for entering Chinese characters that switches mixed function.
Fig. 1 is encode method for entering Chinese characters flow chart according to an embodiment of the invention. As shown in Figure 1, this input method of Chinese characterComprise the following steps:
S102, splits Hanzi structure according to Chinese character separating program, Chinese character is divided into single character, left and right equate or leftThe middle right side equates, left few right many, left many right sides equate less, up and down or upper, middle and lower equates, upper few lower many, upper how under less, two sides encirclement, threeBread encloses and completely encircle and special construction, and uses respectively a numerical key as its code; By the stroke of Chinese character be divided into horizontal stroke,Perpendicular, slash, point, folding, and use respectively a numerical key as its code;
S104, according to the feature that will input Chinese character, using the code of its Hanzi structure as coding the 1st, by Chinese characterNumber of components or stroke number as coding the 2nd, and using the code of the stroke of this Chinese character all parts as coding the 3rd~6;
S106, by the numerical key on keyboard or soft keyboard by coding input.
In the present embodiment, carry out the input of Chinese character according to the Hanzi structure of Chinese character, parts and stroke, not only can be used as heightImitate Chinese character input method flexibly, be widely used in the information terminal apparatus of computer and all kinds of employing numeric keypads; All rightWith helping people's learning and memory Chinese character, to the character learning in Chinese character teaching, handwriting practicing with look into word and all have larger booster action.
Position relationship according to parts in whole word is classified to Chinese character, meets " modern commonly used word parts and portion completelyPart title specification GF0014-2009 " requirement.
In 6763 Chinese characters of Chinese Character Set Code for Informati-baseset GB2312, the Chinese character of left and right structureNearly 4272, exceed 63%. In line with effectively utilizing space encoder, the design principle that balanced coding distributes as far as possible, by left and rightStructure is subdivided into three subclasses: left (in) right equal, left few right many, left how right few.
The quantity of the Chinese character of up-down structure is also quite a lot of. In 6763 Chinese characters of GB2312, the Chinese character of up-down structure is total1560, account for 23%. Therefore be also divided three classes: upper (in) lower equal, upper few lower many, upper many less lower.
No matter be left and right structure or up-down structure, so-called equating,, with how many, for two parts words, refers to twoQuantitative relation between unit stroke; For three or three Chinese characters with upper-part, refer to the quantity between partsRelation.
About the method for splitting of Hanzi component, and parts many and few in about how to confirm or upper and lower two parts.Main Basis of the present invention " Hanzi component specification " development group is in " about some problems of working out " Hanzi component specification " " (abbreviation portionPart specification) middle relevant regulations.
The disassembler of Chinese character is defined as the " order that Chinese character separating is parts by parts specification. To the Chinese character of hierarchical structureSuccessive has motivation to split, and claims level to split; The Chinese character of planar structure is carried out to disposable have motivation fractionation or unreasonable certificateSplit, claim plane to split. "
The present invention splits and determines parts in left and right structure and up-down structure Chinese character according to the first floor in parts disassemblerDistribute be actually which side few which side is many. For example, " do " by " Ren, ten, mouth, The-Fan " four parts and form. Its first floor split result is" Ren " and " event ", so be " left few right many " structures. " newly " is made up of " vertical, wooden, jin " three parts, and its first floor split result is" parent " and " jin ", therefore be " left how right few " structure. " flower " is made up of " Lv, Ren, an ancient type of spoon " three parts, and its first floor split result is" Lv " and " change ", so be " upper few lower many " structures. " think " to be made up of " wood, order, the heart " three parts, its first floor split result is" phase " and " heart ", therefore be " upper how lower few " structure.
As can be seen here, the present invention has not only comprised number of components information, but also has reflected the standardising process that parts split.
Except left and right and up-down structure, the Chinese character quantity of other structures is fewer comparatively speaking. Therefore represent all two with 8The Chinese character of bread closed structure, comprises that upper left encirclement, upper right encirclement and lower-left surround three kinds; Represent all three sides surrounded Chinese with 9Word, surrounds and three kinds of right encirclements in lower-left comprising the right encirclement in upper left, upper lower-left. Represent completely encircle and other special knots with 0The Chinese character of structure.
How the key of encode character for computer is the huge character set of split amount. The 1st coding of the present invention is to compilingThe decomposition for the first time of code Chinese Character Set, the effect (uniformity) of its decomposition is very large on the impact of the repetition rate of coding and repeated code word similarity,Said method has decomposed a large amount of Chinese characters that are highly gathered in left and right structure and up-down structure region more equably. Number of components withPosition relationship belongs to macroscopical attribute of Chinese character, apparent, rarer ambiguity. The present invention utilizes these yuan usually to describe the ChineseWord, the ingenious number of components of having evaded is too much, splits a series of difficult problems such as lack of standardization, component names is indefinite.
From the embodiment of Fig. 1, can find out the pure shape coding of<Chinese-character digital>: :=<constructive code><quantity code>{<strokeCode>}. Wherein,<constructive code>: :=<single character>|<combinde rqdical character>, code that for example can military order single character is 1,<single character>: :=1;<combinde rqdical character>: :=<left and right structure>|<up-down structure>|<two sides surrounds>|<tri-bread enclose>|<completely encircle and spyDifferent structure>,<left and right structure>: :=<left (in) right equating>|<left few right many>|<left many right few>, order<left (in) right equating>::=2,<left few right many>: :=3,<left many right few>: :=4;<up-down structure>: :=<upper (in) lower equating>|<upper few lower many>|<Upper how lower few>,<upper (in) lower equating>: :=5,<upper few lower many>: :=6,<upper many lower few>: :=7,<two sides surrounds>: :=8,<tri-bread enclose>: :=9,<completely encircle and special construction>: :=0.
In the above-described embodiments, the part count that quantity code is Chinese character, in the time that Chinese character is single character, can be by this single characterStroke number as the 2nd of its coding.<quantity code>: :=<stroke number>|<component count>,<stroke number>: :=1|2|3|4| 5|6|7|8|9 (numeral 9 represents that stroke equals or exceeds 9 pictures),<component count>: :=1|2|3|4|5|6|7|8|9 (numeral 9 tablesShow that component count equals or exceeds 9).
Wherein, the parts in the embodiment of the present invention refer to radicals by which characters are arranged in traditional Chinese dictionaries in " Chinese character radicals table G0011-2009 " and " modern normalWith word parts and component names specification GF0014-2009 " in basic components.
The radicals by which characters are arranged in traditional Chinese dictionaries definition that " Chinese character radicals table " adopts be " a part of parts of structure word in batch " (GB/T12200); " existingFor commonly used word parts and component names specification " definition of the parts followed is: " the structure with assembly Chinese word function being made up of strokeWord unit " (GB/T12200). Most of specification radicals by which characters are arranged in traditional Chinese dictionaries are normative foundation parts. In " Chinese character radicals table " 201 main radicals,Only have 22 and do not belong to the basic components in " modern commonly used word parts and component names specification ". The present invention will belong to the base of radicals by which characters are arranged in traditional Chinese dictionariesPlinth parts are called " radicals by which characters are arranged in traditional Chinese dictionaries parts ", the basic components that are not radicals by which characters are arranged in traditional Chinese dictionaries are called to " non-radicals by which characters are arranged in traditional Chinese dictionaries parts ", will not belong to basic componentsRadicals by which characters are arranged in traditional Chinese dictionaries are called " non-parts radicals by which characters are arranged in traditional Chinese dictionaries ".
Only split Chinese character according to the requirement of radicals by which characters are arranged in traditional Chinese dictionaries and parts specification, just can obtain correct component count. Design is like thisFor structure knowledge of Chinese characters is dissolved in coding and is gone, to consolidate the memory to physical structure of Chinese characters by coding input, keep awayExempt to have faded from memory because use computer the font of Chinese character. Even now does the learning difficulty that can improve coding, but this face justTo the needs of Chinese character teaching, because parts are the crucial assembly units that form a connecting link in adopting Chinese character form, if think correct understanding and noteRecall Chinese character pattern, grasp the relation of font and the meaning of word, just must know Chinese character by which parts assembly is formed.
For combinde rqdical character, at least comprise two parts, therefore quantity code can not be 1. In the present invention, quantity code 1Do not represent the number of components of combinde rqdical character, but combinde rqdical character is looked as a whole, do not split. Also defeated as single characterEnter first four of Chinese character. Like this design in order that, when not knowing how to split, maybe cannot confirm the number of components of certain Chinese character timeAlso can input this Chinese character.
For example, stroke coding is followed national standard, and horizontal code is 1, and perpendicular code is 2, and the code of slash is 3, the generation of pointCode is 4, and the code of folding is 5; In order to break up " anyhow skimming folding " these four kinds of strokes that usage frequency is higher, and in stroke coding aspectThe cross reference that embodies stroke, the horizontal code intersecting with other strokes is 6, the code of lifting-hook (distinguishing with perpendicular) is 7, with itThe code of the slash that his stroke intersects is 8, and the code of the folding intersecting with other strokes is 9.<stroke code>: :=<horizontal stroke>|<perpendicular>|<Skim |<point>|<folding>,<horizontal stroke>: :=1|6,<perpendicular>: :=2|7,<skim: :=3|8,<point>: :=4,<folding>: :=5|9.
For example, when Chinese character is single character, the length of stroke code is determined because of stroke number, exceed 4 picture single characters get its front 4 draw intoRow coding.
For example, in the time that Chinese character comprises 2 parts, encode for first 2 that get respectively its 2 parts; When Chinese character comprises 3When parts, first 2 of the 1st and the 3rd parts that get respectively front 2 parts encode; When Chinese character comprises 4 and with topWhen part, encode for the 1st that gets respectively front 4 parts; Wherein, when if desired getting the parts of first 2 and only having 1 stroke,When coding, this stroke is repeated 2 times.
For example, be used for representing completely encircle structure except first, when coding, digital " 0 " also can be used as coding in advanceThe identifier finishing. The present invention not only can input Chinese character by individual character one by one, also can be used for inputting word. The present invention does not arrange and appointsWhat brevity code, in the time of input word, can finish the coding of current input Chinese character in advance with " 0 ", and enter next Chinese characterInput. The benefit of doing is like this without fixing code length, at any time according to experience in the past and the content of input, regulates arbitrarily codingLength (do not comprise end mark is the shortest can only have one), both can be unrestrained, can efficiently input again.
In order to be applicable to better Chinese character teaching, input method of the present invention is except (the letter below of the pure shape code of numeral described aboveClaim pure shape code) outside, also utilize first letter of pinyin, it is expanded, thereby form " combination of shape sound " and " alphabetical pure tone "Other two kinds of modes (below respectively referred to as " shape tone code " and " pure tone code ").
For example, in coding, last position with the first letter of pinyin of this Chinese character as this encode character for computer, this phonetic lead-inFemale can appearing on any position of this encode character for computer. The shape tone code of the present embodiment is on the basis of pure shape code, adds the ChineseWord phonetic first letter forms, and belongs to taking shape as main a kind of encode method for entering Chinese characters of shape sound combination. When pupil associationAfter the Chinese phonetic alphabet, can input Chinese character by shape tone code, so not only can consolidate learned phonetic, can also further reduceSelection rate, improves input efficiency.
In shape tone code of the present invention, first letter of pinyin is positioned at the last of coding, but does not fix its position. First letter of pinyin hasTwo effects. The first is used for decomposing repeated code, and it two is as end-of-encode mark. First effect is apparent, and secondIndividual effect is of the present invention one large feature.
With regard to code length, Chinese character digital coding can be divided into two kinds of fixing code length and on-fixed code lengths. Fixed-length coding excellentPoint is without end mark, and shortcoming is that existence is the idle bit for the length that gathers together enough in a large number. Although but not fixed-length coding code efficiencyHeight, but between two encodes character for computer, need separator, or need to specify the code length of (for example word input) under particular case.And shape tone code of the present invention only have last position be phonetic alphabet, all the other be all numeral, so these phonetic alphabet are effectiveCode element, has played again the effect of end mark, and can on any position of the 1st to the 6th of coding, key in this phonetic headLetter, last code element finishing in advance as coding.
In the time that first letter of pinyin appears in the first code bit, it is exactly pure tone code of the present invention. Pure tone code only has one, mainBe used for the fast longer word of input.
Shape tone code of the present invention is only used a first letter of pinyin, has just realized thoroughly code length freely, 1 to 7 anyA code bit, all allows to key in first letter of pinyin. The pure shape code that the present invention uses, need to finish coding in advance with extra " 0 ",Shape tone code has realized the code length free degree and the high efficiency perfect unity of coding.
Chinese character number form consonant canonical code input method of the present invention not only can be accomplished " seeing word knowledge code ", for suitable oneDivide Chinese character, can also accomplish " seeing code character learning ".
Below for according to one preferred embodiment of the present invention:
It implements software is a WPF (WindowsPresentationFoundation) program, may operate inUnder the operating system environment of WindowsXP and more highest version. Its major function is the Chinese that user is inputted by QWERTY keyboardWord code conversion becomes Chinese character or word.
Below in conjunction with implementing software and accompanying drawing, the invention will be further described.
Fig. 2 is this enforcement software interface, and the left side is coding input frame, and the right is prepare word choice box. On choice boxSide shows selection result, and the statistical parameter relevant to prepare word.
Fig. 3 to Fig. 6 inputs respectively four words of individual character " encode character for computer " with pure shape code, and wherein " Chinese " and " code " only needs input5 codings, can uniquely determine. The coding of " volume " is mutually overlapping with other words, but first-selected word, therefore select without key.
Fig. 7 is the process with pure shape code input word " encode character for computer ". First Chinese character has used 5 codings (42449) defeatedEnter. When after first input of word, remaining word is mostly inputted without all-key. In the present embodiment, only use dibit encoding(52) can input " word ". And " volume " and " code " only used respectively a coding, just complete the input of whole word. Thus canSee, use method of the present invention, even if only use ten numerals, also can effectively shorten code length, at a high speed input Chinese.
Fig. 8 has shown the process with shape tone code input word " encode character for computer ". When word input, shape tone code can be furtherShorten code length. In the present embodiment, the mean code length of four words only has 1.75.
Fig. 9 has recorded the process of pure tone code input word " encode character for computer ". It is more that pure tone code is mainly used to input number of wordsWord (more than three words). Can find out from the present embodiment, pure tone code is obviously not suitable for inputting individual character and two-character word, but input picture" encode character for computer " so repeatedly word is very efficient. The mean code length of the present embodiment is 1.
Figure 10 has embodied the feature of the present invention " three kinds of modes are mixed without switching ". User can enter according to Chinese character learningDegree, or the degree of awareness to concrete Chinese character pattern and pronunciation, and the experience accumulation of Chinese character input aspect, selects the most applicable oneselfOneself mode, thus the limitation of single input mode avoided.
The different coding mode that the present invention not only provides applicable Chinese character teaching different phase to use has also realized phase simultaneouslyUsing with without switching mutually. From only, with ten numeral input Chinese characters or word (pure shape), to using, numeral and first letter of pinyin are defeatedEnter Chinese character or word (combination of shape sound), then to only inputting word (pure tone) with first letter of pinyin. All can freely select with word with wordSelect.
For example, input word " encode character for computer ", can have distinct methods as shown in table 1 (but non-whole):
Table 1
Wherein, code length=(actual input coding+separator+key choosing symbol) ÷ 4.
Be more than the data of utilizing Chinese words Input Software that the inventor works out voluntarily to obtain, Chinese Character Set used isGB2312, dictionary capacity is more than 200,000 words.
Below feature of the present invention is further illustrated.
Canonical code input method, as its name suggests, must meet " specification ". The specification here refers to that country promulgates execution or examinationThe spoken and written languages of row and encode character for computer input relevant criterion, as shown in table 2.
Table 2
The present invention meets above national standard specification completely.
One of maximum feature of the present invention, is embodied in " with numeral description Chinese character pattern ". Not only with directly reflection of numeralThe stroke of Chinese character and number of components, and the Chinese character structure that the static structure of Chinese character and parts dynamic resolution level etc. is difficult to statementShape feature, embodies by simple digital.
Quantity has reflected the number that Chinese character comprises parts, is the secondary classification to structure. For example, Chinese character is concentrated the mostLeft and right structure is divided into " left (in) right equating ", " left few right many " and " left how less right " three subclasses, cooperation second quantity code, energyEnough better embody the quantity of parts and the position distribution in Chinese character, reflect the various features of Hanzi structure. Establishing like thisMeter can also effectively reduce repeated code keyboard and select rate.
Parts are gross features of Chinese character, and the modular construction of most of Chinese character is very obvious, and its number of components sees sth with half an eye.By simple quantitative relation reflection Chinese character pattern structure, meet the requirement of country to input method ease for use, be also conducive to for the ChineseThe classification memory of word font.
It is another feature of the present invention that bonded block extracts stroke feature. Choose the stroke of multiple parts as far as possible, allow volumeCode be evenly distributed on as far as possible Chinese character at all levels and local go up, make encoded packets contain more Hanzi attribute information, and with yardThe sequential write of order reflection Hanzi component and stroke. These all meet the requirement of teaching-oriented.
In addition, utilize the undefined numerical key of national regulation (6~9), effectively broken up higher the commonly using of usage frequencyStroke, has further reduced repeated code keyboard and has selected rate.
Two of the maximum feature of the present invention is embodied in the design of the long and last coding of free code. If encoded, last position isNumeral is digital pure shape code; If first letter of pinyin and code length are greater than 1, it is shape tone code; If code length shortens to oneAnd be first letter of pinyin, that is exactly pure tone code.
Encode character for computer code length CL excursion of the present invention provides as follows:
Pure shape code: 1≤CL≤6;
Shape tone code: 2≤CL≤7;
Pure tone code: CL=1.
Can be as required, select arbitrarily the 1st of coding to input Chinese character and word to L bit position (L≤maximum code length)Language.
Position, coding end is different with the first effect, but is all of paramount importance code bit. The present invention, in the design of position, end, uses" 0 " is as the end mark of pure shape code, to keep the digital feature of shape code; Last bit code with first letter of pinyin as shape tone codeUnit, both can save the separator between coding, can effectively reduce again the repetition rate of coding. Design Just because of this, just can accomplish threePlant coding without switching, can mix use. People can as required, with word, select to be applicable to arbitrarily the input side of oneself with wordMethod.
The present invention is simple in rule easy to learn, and exception, does not have brevity code, without memory, can accomplish to see word knowledge code completely. HaveQuite a few coding can also be shown in code character learning. Therefore can coordinate Chinese character teaching, synchronous study is used.
Say from the meaning of " character learning ", because the present invention has good level and similar polymerization to the coding of Chinese characterDegree, can be used as character learning method and uses, or is used for assisting student to arrange, sort out, understand and the memory Chinese character of learning.
With regard to " writing ", stroke, parts and the structural information comprising in coding contributes to grasp the font feature of Chinese characterWith component locations relation, be conducive to student's normalized written Chinese character.
Say for " looking into word ", pure shape code of the present invention only uses 10 numerals as code element, adds top-down classification volumeInk recorder system, only needs through the simple process in program, both can generate reasonably Chinese character sequence directly perceived. Ordering rule is simple and clear, symbolClose Chinese character sort from whole word to parts, the basic demand from parts to stroke.
For " typewriting ", because the code element of pure shape code only has 10 numerals, thus can be applicable to arriving greatly desktop computer, littleTo various information terminal input Chinese characters such as mobile phones. Because there is good level and the similar degree of polymerization, can input by turn, withWalk out of word, a lot of Chinese characters can be determined in advance without all-key input.
Visible, it is defeated that the present invention is that one has the standard Chinese character coding of " typewrite, become literate, write and look into word " four binding functionsEnter method.
One of ordinary skill in the art will appreciate that: accompanying drawing is the schematic diagram of an embodiment, the module in accompanying drawing orFlow process might not be that enforcement the present invention is necessary.
One of ordinary skill in the art will appreciate that: all or part of step that realizes said method embodiment can be passed throughThe hardware that programmed instruction is relevant completes, and aforesaid program can be stored in a computer read/write memory medium, this programIn the time carrying out, carry out the step that comprises said method embodiment; And aforesaid storage medium comprises: ROM, RAM, magnetic disc or lightThe various media that can be program code stored such as dish.
Finally it should be noted that: above embodiment only, in order to technical scheme of the present invention to be described, is not intended to limit; AlthoughWith reference to previous embodiment, the present invention is had been described in detail, those of ordinary skill in the art is to be understood that: it still canModify with the technical scheme that previous embodiment is recorded, or part technical characterictic is wherein equal to replacement; AndThese amendments or replacement, do not make the essence of appropriate technical solution depart from spirit and the model of embodiment of the present invention technical schemeEnclose.

Claims (3)

1. an encode method for entering Chinese characters, is characterized in that, comprises the following steps:
According to Chinese character separating program, Hanzi structure is split, the position relationship according to parts in whole word divides Chinese characterClass, by Chinese character be divided into that single character, left and right equate or left, center, right equates, left few right many, left many right sides less, equal or upper, middle and lower phase up and downDeng, upper few lower many, upper many under less, two sides encirclement, three bread enclose and completely encircle and special construction, and use respectively a numerical keyAs its code; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses respectively a numerical key as its code;
According to the feature that will input Chinese character, using the code of its Hanzi structure as coding the 1st, by the number of components of Chinese characterOr stroke number is as the 2nd of described coding, and using the code of the stroke of this Chinese character all parts as described coding the 3rd~6;
By the numerical key on keyboard or soft keyboard by described coding input;
In the time that Chinese character is single character, using the stroke number of this single character as its coding the 2nd.
Horizontal code is 1, and perpendicular code is 2, and the code of slash is 3, and the code of point is 4, and the code of folding is 5, hands over other strokesThe horizontal code of fork is 6, and the code of lifting-hook is 7, and the code of the slash of intersecting with other strokes is 8, the folding intersecting with other strokesCode be 9;
When Chinese character is single character, the single character that exceedes 4 pictures is got it and front 4 is drawn and encode;
In the time that Chinese character comprises 2 parts, encode for first 2 that get respectively its 2 parts;
In the time that Chinese character comprises 3 parts, first 2 of the 1st and the 3rd parts that get respectively front 2 parts encode;
When Chinese character comprises 4 and during with upper-part, encode for the 1st that gets respectively front 4 parts;
Wherein, when if desired getting the parts of first 2 and only having 1 stroke, while coding, this stroke is repeated 2 times.
2. input method of Chinese character according to claim 1, is characterized in that, in any position of the 1st to the 6th of encode character for computerBe set up the first letter of pinyin of keying in this Chinese character, last code element finishing in advance as this encode character for computer.
3. input method of Chinese character according to claim 1, is characterized in that, the mark finishing in advance as coding by digital " 0 "Will symbol.
CN201110160422.1A 2011-06-15 2011-06-15 Encode method for entering Chinese characters Active CN102830809B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110160422.1A CN102830809B (en) 2011-06-15 2011-06-15 Encode method for entering Chinese characters

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110160422.1A CN102830809B (en) 2011-06-15 2011-06-15 Encode method for entering Chinese characters

Publications (2)

Publication Number Publication Date
CN102830809A CN102830809A (en) 2012-12-19
CN102830809B true CN102830809B (en) 2016-05-11

Family

ID=47333975

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110160422.1A Active CN102830809B (en) 2011-06-15 2011-06-15 Encode method for entering Chinese characters

Country Status (1)

Country Link
CN (1) CN102830809B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104978045B (en) * 2015-05-27 2019-07-05 腾讯科技(深圳)有限公司 A kind of Chinese character input method and device
CN105068671B (en) * 2015-06-29 2018-01-05 曾子力 A kind of input method of Chinese character
CN105807947A (en) * 2016-01-11 2016-07-27 金云中 Method for correspondingly identifying modular stroke coded Chinese characters
CN108629046B (en) * 2018-05-14 2023-08-18 平安科技(深圳)有限公司 Field matching method and terminal equipment
CN113900531A (en) * 2021-03-26 2022-01-07 刘跃军 Chinese character phonetic input method with transposition, continuous clicking, sound and shape and less selection
CN113377215A (en) * 2021-06-25 2021-09-10 刘跃军 Chinese-character 'Liulian' input method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204799A (en) * 1998-07-10 1999-01-13 陈澜 Coding method of Chinese character unit stroke numbers
CN1265482A (en) * 2000-04-13 2000-09-06 徐万胥 Digital union code Chinese character input method and its keyboard
CN1336578A (en) * 2001-09-05 2002-02-20 黄建东 Chinese character inputting method based on digital keypad
CN1499357A (en) * 2002-11-01 2004-05-26 ���Ծ Method for lablling united character and word as well as character patterns and character picture

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002358071A (en) * 2001-05-31 2002-12-13 Seiko Epson Corp Character object

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1204799A (en) * 1998-07-10 1999-01-13 陈澜 Coding method of Chinese character unit stroke numbers
CN1265482A (en) * 2000-04-13 2000-09-06 徐万胥 Digital union code Chinese character input method and its keyboard
CN1336578A (en) * 2001-09-05 2002-02-20 黄建东 Chinese character inputting method based on digital keypad
CN1499357A (en) * 2002-11-01 2004-05-26 ���Ծ Method for lablling united character and word as well as character patterns and character picture

Also Published As

Publication number Publication date
CN102830809A (en) 2012-12-19

Similar Documents

Publication Publication Date Title
CN102830809B (en) Encode method for entering Chinese characters
CN101634927B (en) Method and device for displaying candidate items in character input
CN103616960A (en) Six vowel binary syllabification input method
CN104063359A (en) Implementation method for personalized Chinese character word library
CN101118463A (en) Chinese phonetic input method used for digital keyboard
CN102750009B (en) A kind of without switching input method of Chinese character and keyboard
CN101551711A (en) Chinese character coding input method based on structure and primitive
CN105912139B (en) Method for correspondingly recognizing modular stroke coding Chinese characters
CN105487684B (en) The output intent of Chinese-character phonetic letter character and the output device of Chinese-character phonetic letter character
CN100371865C (en) Chinese character input method for number keyboard and corresponding electronic product
CN103207684A (en) Phonemic letter double-input method
WO2008089654A1 (en) Ordering retrieving method of chinese character type, device thereof and an information system
CN100458667C (en) Chinese character five-stroke fourteen-radicals inputting method on cellphone or computer
CN100576149C (en) Chinese character input output method and device
CN103176616A (en) Input method and device for guqin abbreviated character notation characters
CN107256092B (en) Chinese character digital shape code quick input method
CN104267824A (en) Chinese character wubi number digital coding input method
CN207457986U (en) Mobile phone three-stroke digital input method of Chinese character and keyboard
CN100390710C (en) Fast and easy Chinese character input method and keyboard
CN104536590B (en) Embedded software keyboard system based on West Xia Dynasty&#39;s text sound character roots input method
CN102622098B (en) New pictophonetic code Chinese character input method
CN105589574B (en) A kind of Sino-British number mixing character input method based on five first syllable codes
CN1836199B (en) Character inputting method of using word as unit
CN101419505A (en) Free code input method
CN101261564A (en) Dummy keyboard for inputting Chinese characters and operation method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20160713

Address after: 100044, room 17, floor 3, building 34, 2002 South Main Street, Haidian District, Beijing, Zhongguancun

Patentee after: Wen Hua (Beijing) Education Technology Co., Ltd.

Address before: Beijing City, Haidian District Weigongcun street, home of Wei Bohao 5-3-1102

Patentee before: Gao Jingmin

Patentee before: Dong Weiqun