CN102830809A - Chinese character coding input method - Google Patents
Chinese character coding input method Download PDFInfo
- Publication number
- CN102830809A CN102830809A CN2011101604221A CN201110160422A CN102830809A CN 102830809 A CN102830809 A CN 102830809A CN 2011101604221 A CN2011101604221 A CN 2011101604221A CN 201110160422 A CN201110160422 A CN 201110160422A CN 102830809 A CN102830809 A CN 102830809A
- Authority
- CN
- China
- Prior art keywords
- chinese character
- code
- stroke
- chinese
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 55
- 230000008676 import Effects 0.000 claims description 17
- 238000010276 construction Methods 0.000 claims description 7
- 235000008429 bread Nutrition 0.000 claims description 5
- 230000000694 effects Effects 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 238000013461 design Methods 0.000 description 7
- 230000003203 everyday effect Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000008901 benefit Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 230000010365 information processing Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000008450 motivation Effects 0.000 description 2
- 238000006116 polymerization reaction Methods 0.000 description 2
- 208000000044 Amnesia Diseases 0.000 description 1
- 208000031091 Amnestic disease Diseases 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000006986 amnesia Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000005266 casting Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 210000003128 head Anatomy 0.000 description 1
- 210000001747 pupil Anatomy 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Images
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The invention discloses a Chinese character coding input method. The Chinese character coding input method comprises the following steps: dividing a Chinese character structure based on a Chinese character dividing program to obtain a single character structure, an equivalent left and right structure or an equivalent left, middle and right structure, a structure more in right and less in left, a structure more in left and less in right, an equivalent top and bottom structure or an equivalent top, middle and bottom structure, a structure less in top and more in bottom, a structure more in top and less in bottom, a double-face enclosing structure, a three-face enclosing structure, a four-face enclosing structure and a special structure; utilizing numeric keys as the codes of the structures; dividing the strokes of Chinese character into a horizontal stroke, a perpendicular stroke, a leftfalling stroke, a point stroke and a turning stroke, and utilizing the numeric keys as the codes of the strokes; setting the codes of the Chinese character structure as the first position of the code based on the characteristics of the Chinese character to be input, and adopting the number of parts or the number of strokes of the Chinese character as the second position of the code, and adopting the codes of the strokes of each part of the Chinese character as the third to the sixth positions of the code; and finally inputting the codes through the numeric keys on a keyboard or a soft keyboard.
Description
Technical field
The present invention relates to the Chinese information processing technology field, in particular to a kind of encode method for entering Chinese characters.
Background technology
Present encode Chinese characters for computer roughly can be divided into the level Four pattern: the first order is whole word pattern; The second level is the normal parts pattern of National standard; The third level is the non-standard parts split mode between normal parts and stroke; The fourth stage is the stroke pattern.
Wherein whole word pattern need not Chinese character is carried out any fractionation, and spelling input method and region-position code are exactly typical case's representative of this pattern.Current spelling input method is the most popular input method of Chinese character.Its advantage is will import by phonetic, is the most natural Chinese character coding input method.Its weakness be can't import not can pronunciation Chinese character because space encoder is too small, cause repeated code too much.Even more serious is to use the phonetic input for a long time, can weaken the memory to Chinese character pattern, reduces the writing level of Chinese character, even causes the Chinese character amnesia.
The problem of normal parts input method is to need to remember a large amount of parts, and " modern everyday character parts and the component names standard " GF0014-2009 that on July 1st, 2009 began to try has stipulated 514 parts.As far back as before this issue the GF3001-1997 information processing with GB13000.1 character set Hanzi component regulation and stipulation 560 parts.How hundreds of parts rationally are distributed on the QWERTY keyboard, are difficult problems that does not have solution always.
Third level pattern between normal parts and stroke because less consideration Chinese character self-law is not retrained by national standard, or has a preference for from the individual, or is limited to oneself opinion, and the various schemes that compare one's strong points with others' weak points emerge in an endless stream.This causes the major reason of Chinese character shape code and phonetic-stroke code " ten thousand yards Pentium " low-level repetition just.
The advantage of stroke pattern is to be prone to learn well note, can write and will import; Shortcoming is that the many inputs of stroke are slow, and the repeated code that stroke is few is many, can not create and can import.
This shows that China Computer Users is the most scarce still really meets character rule, is prone to learn well the Chinese character input method of usefulness.Moreover, domestic and international Chinese character teaching also needs really to help to become literate, write, look into the input coding for Chinese character of word and typewriting.
The effect of encode Chinese characters for computer not only is to import Chinese character.The most important teaching task of primary school period Chinese course is character learning; Good encode method for entering Chinese characters should be able to play the effect that other means do not have in character learning; Can effectively help student's learning and memory Chinese character; Obviously improve the timeliness of character learning, help the student to the understanding of Chinese character with have deep love for, let the popularization of Chinese-character canonical fulfill.These contemporary just Chinese character teachings, if encode Chinese characters for computer itself is exactly a kind of character learning method of standard and help to solve the big problem that Chinese character finds it difficult to learn to the encode method for entering Chinese characters requirement, and that just can't be better!
Summary of the invention
The present invention provides a kind of encode method for entering Chinese characters, in order to when Chinese character is imported, to help people's learning and memory Chinese character.
For achieving the above object; The invention provides a kind of encode method for entering Chinese characters; It may further comprise the steps: according to the Chinese character disassembler Hanzi structure is split; With Chinese character be divided into single character, about equate the left, center, right equates, few right many, left many right sides, a left side equate less, up and down or go up in equate down, go up few many down, go up many under less, the two sides encirclement, three bread enclose and completely encircle and special construction, and use a numerical key as its code respectively; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses a numerical key respectively as its code; According to import the characteristics of Chinese character, with the code of its Hanzi structure the 1st, with the number of components of Chinese character or stroke number the 2nd, and with the code of the stroke of these each parts of Chinese character the 3rd~6 as coding as coding as coding; Through the input of will encoding of the numerical key on keyboard or the soft keyboard.
Preferable, when Chinese character is single character, with the stroke number of this single character the 2nd as its coding.
Preferable, horizontal code is 1, and perpendicular code is 2, and the code of left-falling stroke is 3; The code of point is 4, and the code of folding is 5, and the code of the horizontal stroke that intersects with other strokes is 6; The code of lifting-hook is 7, and the code of the left-falling stroke that intersects with other strokes is 8, and the code of the folding that intersects with other strokes is 9.
Preferable, when Chinese character is a single character, get its preceding 4 pictures above the single character of 4 pictures and encode.
Preferable, when Chinese character comprises 2 parts, encode for preceding 2 that get its 2 parts respectively; When Chinese character comprised 3 parts, preceding 2 of the 1st and the 3rd parts that get preceding 2 parts respectively encoded; When Chinese character comprises 4 and during with upper-part, encode for the 1st that gets preceding 4 parts respectively; Wherein, if need get preceding 2 parts and have only 1 stroke the time, when then encoding this stroke is repeated 2 times.
Preferable, be used as the identifier that coding finishes in advance with digital " 0 ".
Preferable, in coding, use the first letter of pinyin of this Chinese character last position as this encode Chinese characters for computer, wherein this first letter of pinyin can appear on any position of this encode Chinese characters for computer.
In the foregoing description, carry out the input of Chinese character, not only can be used as the Efficient and Flexible Chinese character input method, be widely used in the information terminal apparatus of computing machine and all kinds of employing numeric keypads according to Hanzi structure, parts and the stroke of Chinese character; Can also be used to helping people's learning and memory Chinese character, to the character learning in the Chinese character teaching, handwriting practicing with look into word bigger booster action is all arranged.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is input method of Chinese character process flow diagram according to an embodiment of the invention;
Fig. 2 is encode Chinese characters for computer inputting interface screenshot capture according to an embodiment of the invention;
Fig. 3 is digital according to an embodiment of the invention pure font code individual character input " Chinese " screenshot capture;
Fig. 4 is digital according to an embodiment of the invention pure font code individual character input " word " screenshot capture;
Fig. 5 is digital according to an embodiment of the invention pure font code individual character input " volume " screenshot capture;
Fig. 6 is digital according to an embodiment of the invention pure font code individual character input " sign indicating number " screenshot capture;
Fig. 7 is digital according to an embodiment of the invention pure font code word input " encode Chinese characters for computer " screenshot capture;
Fig. 8 is the input of shape sound sign indicating number word according to an embodiment of the invention " encode Chinese characters for computer " screenshot capture;
Fig. 9 is the input of pure tone sign indicating number word according to an embodiment of the invention " encode Chinese characters for computer " screenshot capture;
Figure 10 is the input of trigram loan blend according to an embodiment of the invention " encode Chinese characters for computer " screenshot capture.
Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, complete description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not paying the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
The present invention relates to a kind of National standard standard,, have digital pure shape, the combination of shape sound and three kinds of forms of alphabetical pure tone and do not have the encode method for entering Chinese characters that function is used in switching with towards Chinese character teaching.This method is main with digital watch shape, is aided with first letter of pinyin, so be referred to as " number form consonant " input method.
Encode method for entering Chinese characters is the most important inlet that Chinese information gets into computing machine and mobile information terminal apparatus.Also be the important means and the instrument of Chinese character teaching simultaneously, and be the technological approaches that solves the Chinese character sort search problem.
The present invention is observing on the basis of national language literal related standards and standard fully; Start with from the quantitative relation between Hanzi structure and the configuration key element, extract " configuration quantity " this new code element, and directly express this code element with 0~90 numeral; Be aided with Chinese Pin Yin initial; Formed and followed the national standard standard,, had digital pure shape, the combination of shape sound and three kinds of forms of alphabetical pure tone and do not have the encode method for entering Chinese characters that function is used in switching with towards Chinese character teaching.
Fig. 1 is encode method for entering Chinese characters process flow diagram according to an embodiment of the invention.As shown in Figure 1, this input method of Chinese character may further comprise the steps:
S102; According to the Chinese character disassembler Hanzi structure is split; With Chinese character be divided into single character, about equate the left, center, right equates, few right many, left many right sides, a left side equate less, up and down or go up in equate down, go up few many down, go up many under less, the two sides encirclement, three bread enclose and completely encircle and special construction, and use a numerical key as its code respectively; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses a numerical key respectively as its code;
S104, according to import the characteristics of Chinese character, with the code of its Hanzi structure the 1st, with the number of components of Chinese character or stroke number the 2nd, and with the code of the stroke of these each parts of Chinese character the 3rd~6 as coding as coding as coding;
S106 is through the input of will encoding of the numerical key on keyboard or the soft keyboard.
In the present embodiment, carry out the input of Chinese character, not only can be used as the Efficient and Flexible Chinese character input method, be widely used in the information terminal apparatus of computing machine and all kinds of employing numeric keypads according to Hanzi structure, parts and the stroke of Chinese character; Can also be used to helping people's learning and memory Chinese character, to the character learning in the Chinese character teaching, handwriting practicing with look into word bigger booster action is all arranged.
According to the position relation of parts in whole word Chinese character is classified, meet the requirement of " modern everyday character parts and component names standard GF0014-2009 " fully.
In 6763 Chinese characters of Chinese Character Set Code for Informati-baseset GB2312, nearly 4272 of the Chinese characters of left and right sides structure have surpassed 63%.In line with effectively utilizing space encoder, the principle of design that the balanced coding of trying one's best distributes is subdivided into three sub-category with left and right sides structure: a left side (in) right side equates, a left side is few right many, left many right few.
The quantity of the Chinese character of up-down structure is also quite a lot of.In 6763 Chinese characters of GB2312, the Chinese character of up-down structure has 1560, accounts for 23%.So also it is divided three classes: go up (in) equate down, go up few many down, go up and manyly lack down.
No matter be left and right sides structure or up-down structure, so-called equating and what,, refer to two quantitative relations between the unit stroke for two parts words; For three or three Chinese characters, then be the quantitative relation between the finger with upper-part.
About the method for splitting of Hanzi component, and parts many and few in two parts about how confirming or up and down.The present invention is main according to " Hanzi component standard " development group relevant regulations in " about working out some problems of " Hanzi component standard " " (being called for short the parts standard).
The parts standard is defined as the disassembler of Chinese character that " Chinese character is split as the order of parts.Chinese character successive to hierarchical structure has motivation to split, and claims that level splits; Chinese character to planar structure carries out the disposable motivation fractionation or unreasonable according to splitting that has, and claims that the plane splits.”
The present invention according to the first floor in the parts disassembler split to confirm in left and right sides structure and the up-down structure Chinese character component distribution be actually which side few which side is many.For example, "do" from "Ren, ten, mouth, the Fan" four parts.Its first floor split result is "Ren" and "it", so that "left little more than the right" structure." newly " is made up of " upright, wooden, jin " three parts, and its first floor split result is " parent " and " jin ", so be " many right lacking of a left side " structure."Flower" from "Lv, Ren, dagger" three parts, whose first floor split result is "hua" and "technology", it is "much less on the next" structure." think " to be made up of " wood, order, the heart " three parts, its first floor split result is " phase " and " heart ", so be " going up how few down " structure.
This shows that the present invention has not only comprised number of components information, but also reflected the standardising process that parts split.
Except about and up-down structure, the Chinese character quantity of other structures is fewer comparatively speaking.Therefore with 8 represent all two sides investing mechanisms Chinese character, comprise a left side surround, go up right surround and left under three kinds of encirclements; Represent all three sides surrounded Chinese characters with 9, rightly surround, go up that a left side surrounds down and a left side is right down surrounds three kinds comprising upper left.With 0 represent completely encircle and other special constructions Chinese character.
How the key of encode Chinese characters for computer is the huge character set of split amount.The 1st coding of the present invention is to decomposing the first time of encoding Chinese characters collection; The effect of its decomposition (uniformity coefficient) is very big to the influence of the repetition rate of coding and repeated code word similarity, and said method has decomposed a large amount of Chinese characters that highly accumulate in left and right sides structure and up-down structure zone more equably.Number of components and position relation belong to macroscopical attribute of Chinese character, and be obvious, rarer ambiguity.The present invention utilizes these yuan usually to describe Chinese character, and the ingenious number of components of having evaded is too much, splits a series of difficult problems such as lack of standardization, that component names is indefinite.
From the embodiment of Fig. 1, can find out < the pure shape coding of Chinese-character digital >: :=< constructive code>< quantity sign indicating number>{ < stroke code>}.Wherein, < constructive code >: :=< single character>| < combinde rqdical character >, code that for example can the military order single character is 1, i.e. < single character >: :=1; < combinde rqdical character >: :=< left and right sides structure>| <up-down structure>| < two sides encirclement>| < three bread enclose>| < completely encircle and special construction >; < left and right sides structure >: :=left side (in) right equating | < left side is few right many>| < left side is how right few >; Make a left side (in) right equating: :=2; < left side is few right many >: :=3, < left side is how right few >: :=4; <up-down structure >: :=go up (in) equate down | < going up few many down>| < going up how few down >, go up (in) equate down: :=5, < going up few many down >: :=6; < going up how few down >: :=7; < two sides encirclement >: :=8, < three bread enclose >: :=9, < completely encircle and special construction >: :=0.
In the above-described embodiments, the quantity sign indicating number is the part count of Chinese character, when Chinese character is single character, and can be with the stroke number of this single character the 2nd as its coding.I.e. < quantity sign indicating number >: :=< stroke number>| < component count >, and < stroke number >: :=1|2|3|4|5|6|7|8|9 (numeral 9 expression strokes equal or exceed 9 and draw), < component count >: :=1|2|3|4|5|6|7|8|9 (numeral 9 expression component counts equal or exceed 9).
Wherein, the parts in the embodiment of the invention are meant the basic components among radicals by which characters are arranged in traditional Chinese dictionaries and " modern everyday character parts and the component names standard GF0014-2009 " in " Chinese character radicals table G0011-2009 ".
The radicals by which characters are arranged in traditional Chinese dictionaries definition that " Chinese character radicals table " adopts be " a part of parts of structure word in batch " (GB/T12200); The definition of parts that " modern everyday character parts and component names standard " followed is: " word-building unit of being made up of stroke with assembly Chinese word function " (GB/T 12200).Most of standard radicals by which characters are arranged in traditional Chinese dictionaries promptly are the normative foundation parts.Among " Chinese character radicals table " 201 principal part head, have only 22 and do not belong to the basic components in " modern everyday character parts and component names standard ".The basic components that the present invention will belong to radicals by which characters are arranged in traditional Chinese dictionaries are called " radicals by which characters are arranged in traditional Chinese dictionaries parts ", the basic components that are not radicals by which characters are arranged in traditional Chinese dictionaries are called " non-radicals by which characters are arranged in traditional Chinese dictionaries parts ", the radicals by which characters are arranged in traditional Chinese dictionaries that do not belong to basic components are called " non-parts radicals by which characters are arranged in traditional Chinese dictionaries ".
Only the requirement according to radicals by which characters are arranged in traditional Chinese dictionaries and parts standard splits Chinese character, just can obtain correct component count.Design is to go in order to be dissolved into structure knowledge of Chinese characters in the coding like this, so that consolidate the memory to physical structure of Chinese characters through the coding input, avoids having faded from memory because use a computer the font of Chinese character.Even now is done the learning difficulty that can improve coding; But this is just towards the needs of Chinese character teaching; Because parts are the crucial assembly units that form a connecting link in the adopting Chinese character form; If think correct understanding and memory Chinese character pattern, grasp the relation of the font and the meaning of word, just must know Chinese character by which parts assembly is formed.
For combinde rqdical character, comprise two parts at least, so the quantity sign indicating number can not be 1.Among the present invention, quantity sign indicating number 1 is not represented the number of components of combinde rqdical character, but combinde rqdical character is looked as a whole, does not split.Also i.e. input preceding four of Chinese character as single character.Design is in order that when not knowing how to split, also can import this Chinese character in the time of maybe can't confirming the number of components of certain Chinese character like this.
For example, stroke coding is followed national standard, and horizontal code is 1, and perpendicular code is 2, and the code of left-falling stroke is 3, and the code of point is 4, and the code of folding is 5; In order to break up higher " casting aside folding anyhow " these the four kinds of strokes of usage frequency; And embody the cross reference of stroke in the stroke coding aspect; The code of the horizontal stroke that intersects with other strokes is 6; The code of lifting-hook (with perpendicular difference mutually) is 7, and the code of the left-falling stroke that intersects with other strokes is 8, and the code of the folding that intersects with other strokes is 9.I.e. < stroke code >: :=< horizontal stroke>| < perpendicular>| < left-falling stroke>| < point>| < folding >, < horizontal stroke >: :=1|6, < perpendicular >: :=2|7, < left-falling stroke >: :=3|8, < point >: :=4, < folding >: :=5|9.
For example, when Chinese character is a single character, the length of stroke code is decided because of stroke number, surpasses 4 and draws single characters and get it and preceding 4 draw and encode.
For example, when Chinese character comprises 2 parts, encode for preceding 2 that get its 2 parts respectively; When Chinese character comprised 3 parts, preceding 2 of the 1st and the 3rd parts that get preceding 2 parts respectively encoded; When Chinese character comprises 4 and during with upper-part, encode for the 1st that gets preceding 4 parts respectively; Wherein, if need get preceding 2 parts and have only 1 stroke the time, when then encoding this stroke is repeated 2 times.
For example, be used for representing that except first the completely encircle structure, digital " 0 " also can be used as the identifier that coding finishes in advance during coding.The present invention not only can import Chinese character by individual character one by one, also can be used for importing word.The present invention is not provided with any brevity code, when the input word, can use " 0 " to come to finish in advance the coding of current input Chinese character, and get into the input of next Chinese character.The benefit of doing like this is to need not fixedly code length, at any time according in the past the experience and the content of input, regulates the length of encoding (do not comprise end mark weak point can have only one) arbitrarily, both can be unrestrained, can efficiently import again.
In order to be fit to Chinese character teaching better; Input method of the present invention is except the digital pure font code (hereinafter to be referred as pure font code) of above explanation; Also utilize first letter of pinyin; It is expanded, thereby form " combination of shape sound " and " alphabetical pure tone " dual mode (following " shape sound sign indicating number " and " the pure tone sign indicating number " of abbreviating as respectively) in addition.
For example, in coding, use the first letter of pinyin of this Chinese character last position as this encode Chinese characters for computer, this first letter of pinyin can appear on any position of this encode Chinese characters for computer.The shape sound sign indicating number of present embodiment is on the basis of pure font code, adds that first letter of phonetic transcriptions of Chinese characters constitutes, and it is main belonging to shape, a kind of encode method for entering Chinese characters that the shape sound combines.After the pupil has learned the Chinese phonetic alphabet, can import Chinese character with shape sound sign indicating number, so not only can consolidate the phonetic of learning, can also further reduce the repeated code key and select rate, improve input efficiency.
In the shape sound sign indicating number of the present invention, first letter of pinyin is positioned at the last of coding, but unfixing its position.First letter of pinyin has two effects.The first is used for decomposing repeated code, and it two is as the end-of-encode sign.First effect is obvious, and second effect then is of the present invention one big characteristics.
With regard to code length, Chinese character digital coding can be divided into fixedly two kinds of code lengths and on-fixed code length.The advantage of fixed-length coding is to need not end mark, and shortcoming is that existence is the idle bit for the length that gathers together enough in a large number.Though but not the fixed-length coding code efficiency is high, needs separator between two encodes Chinese characters for computer, perhaps need stipulate the code length of (for example word input) under the particular case.And shape sound sign indicating number of the present invention have only last the position be phonetic alphabet; All the other all are numerals; So these phonetic alphabet are effective code element, have played the effect of end mark again; And can on any position of the 1st to the 6th of coding, key in this first letter of pinyin, as last code element of encoding and finishing in advance.
When first letter of pinyin appears on first yard position, be exactly pure tone sign indicating number of the present invention.The pure tone sign indicating number has only one, mainly is used for input fast than long word.
Shape sound sign indicating number of the present invention is only used a first letter of pinyin, has just realized that code length is free completely, in any one yard position of 1 to 7, all allows to key in first letter of pinyin.The pure font code that the present invention uses needs to finish in advance coding with extra " 0 ", and shape sound sign indicating number has then been realized code length degree of freedom and the high efficiency perfect unity of coding.
Chinese character number form consonant standard coded input method of the present invention not only can be accomplished " seeing word knowledge sign indicating number ",, can also accomplish " seeing the sign indicating number character learning " for quite a few Chinese character.
Below for according to one preferred embodiment of the present invention:
It implements software is a WPF (Windows Presentation Foundation) program, may operate in Windows XP and more under the operating system environment of highest version.Its major function is to convert the encode Chinese characters for computer that the user imports through QWERTY keyboard to Chinese character or word.
Below in conjunction with implementing software and accompanying drawing, the present invention is described further.
Fig. 2 is this enforcement software interface, and the left side is the coding input frame, and the right is the prepare word choice box.Above choice box, show selection result, and the statistical parameter relevant with prepare word.
Fig. 3 to Fig. 6 imports four words of individual character " encode Chinese characters for computer " respectively with pure font code, and wherein " Chinese " and " sign indicating number " only needs 5 codings of input, can uniquely confirm.The coding of " volume " is heavy mutually with other words, but first-selected word, so need not the key choosing.
Fig. 7 is the process with pure font code input word " encode Chinese characters for computer ".5 codings (42449) input used in first Chinese character.After first input of word, remaining word mostly need not the all-key input.In the present embodiment, only used dibit encoding (52) can import " word ".And " volume " and " sign indicating number " only used a coding respectively, just accomplished the input of whole word.This shows, use method of the present invention,, also can effectively shorten code length, at a high speed input Chinese even only use ten numerals.
Fig. 8 has shown the process with shape sound sign indicating number input word " encode Chinese characters for computer ".During the word input, shape sound sign indicating number can further shorten code length.The mean code length of four words has only 1.75 in the present embodiment.
Fig. 9 has write down the process of pure tone sign indicating number input word " encode Chinese characters for computer ".The pure tone sign indicating number mainly is used for importing the more word of number of words (more than three words).Can find out that from present embodiment the pure tone sign indicating number obviously is not suitable for importing individual character and two-character word, but the repeatedly speech of input as " encode Chinese characters for computer " is very efficiently.The mean code length of present embodiment is 1.
Figure 10 has embodied the characteristics of the present invention " three kinds of modes do not have switching and use with ".The user can be according to the Chinese character learning progress, perhaps to the degree of awareness of concrete Chinese character pattern and pronunciation, and the experience accumulation of Chinese character input aspect, select most suitable mode, thereby avoid the limitation of single input mode.
The different coding mode that the present invention not only provides suitable Chinese character teaching different phase to use has realized also simultaneously that nothing is each other switched to use with., arrive with numeral and first letter of pinyin input Chinese character or word (combination of shape sound), with ten numeral input Chinese characters or word (pure shape) from only again to only importing word (pure tone) with first letter of pinyin.All can freely select with speech with word.
For example, input word " encode Chinese characters for computer " can have distinct methods as shown in table 1 (but non-whole):
Table 1
Wherein, code length=(actual input coding+separator+key choosing symbol) ÷ 4.
More than be the resulting data of Chinese words Input Software of utilizing the inventor to work out voluntarily, used Chinese Character Set is GB2312, and the dictionary capacity is more than 200,000 speech.
Further specify in the face of characteristics of the present invention down.
The standard coded input method as its name suggests, must meet " standard ".The standard here is meant that country's promulgation is carried out or tentative spoken and written languages and encode Chinese characters for computer input relevant criterion, and is as shown in table 2.
Table 2
The present invention meets above national standard standard fully.
One of maximum characteristics of the present invention are embodied on " describing Chinese character pattern with numeral ".The stroke and the number of components that not only directly reflect Chinese character with numeral, and with the Chinese character configuration characteristic that the static structure of Chinese character and parts dynamic resolution level etc. are difficult to explain, embody through simple digital.
Quantity has reflected that what of parts Chinese character comprise, and is the secondary classification to structure.For example; The left and right sides structure that Chinese character is concentrated the most is divided into " left side (in) right equating ", " left side is few right many " and " many right sides, a left side are lacked " three sub-category; Cooperate second order digit amount sign indicating number, can better embody the quantity of parts and the position distribution in Chinese character, reflect many characteristics of Hanzi structure.Such design can also effectively reduce the repeated code keyboard and select rate.
Parts are macrofeatures of Chinese character, and the modular construction of most of Chinese character is very obvious, and its number of components sees sth with half an eye.With simple quantitative relation reflection Chinese character pattern structure, meet the requirement of country to the input method ease for use, also help classification memory for Chinese character pattern.
It is another characteristics of the present invention that bonded block extracts stroke feature.Choose the stroke of a plurality of parts as far as possible, let coding be evenly distributed on as far as possible on the at all levels and part of Chinese character, make coding comprise more Hanzi attribute information, and reflect the sequential write of Hanzi component and stroke with the sign indicating number preface.These all meet towards the requirement of teaching.
In addition, utilize the undefined numerical key of national regulation (6~9), effectively broken up the higher stroke commonly used of usage frequency, further reduced the repeated code keyboard and selected rate.
Two of the maximum characteristics of the present invention are embodied in the design of free code length and position, end coding.Last position is a numeral if encode, and then is digital pure font code; If first letter of pinyin and code length greater than 1, then are shape sound sign indicating numbers; If code length shortens to one and be first letter of pinyin, that is exactly the pure tone sign indicating number.
Encode Chinese characters for computer code length CL variation range of the present invention is stipulated as follows:
Pure font code: 1≤CL≤6;
Shape sound sign indicating number: 2≤CL≤7;
Pure tone sign indicating number: CL=1.
Can be as required, select for use the 1st of coding to L bit position (L≤maximum code length) to import Chinese character and word arbitrarily.
Position, coding end is different with the first effect, but all is of paramount importance sign indicating number position.The present invention is in the design of position, end, with " 0 " end mark as pure font code, to keep the digital characteristics of font code; With the last bit symbols of first letter of pinyin as shape sound sign indicating number, both can save the separator between the coding, can effectively reduce the repetition rate of coding again.Design Just because of this can accomplish that just three kinds of codings need not to switch, and can mix use.People can with speech, select to be fit to the input method of oneself with word arbitrarily as required.
Rule of the present invention is easy to learn, and exception does not have brevity code, need not memory, can accomplish to see word knowledge sign indicating number fully.There is quite a few coding can also see the sign indicating number character learning.Therefore can cooperate Chinese character teaching, study is synchronously used.
Say from the meaning of " character learning ",, can be used as character learning method and use because the present invention has the good level and the similar degree of polymerization to the coding of Chinese character, perhaps be used for auxiliary student's arrangement, sort out, understand and the memory Chinese character of learning.
With regard to " writing ", the stroke that comprises in the coding, parts and structural information help to grasp the font characteristics and the component locations relation of Chinese character, help student's normalized written Chinese character.
Say that for " looking into word " pure font code of the present invention only uses 10 numerals as code element, add top-down hierarchical coding mechanism, only need, both can generate reasonably Chinese character sequence directly perceived through the simple process on the program.Ordering rule is simple and clear, meets Chinese character sort from putting in order word to parts, the basic demand from parts to the stroke.
For " typewriting " because the code element of pure font code has only 10 numerals, so can be fit to arrive greatly desktop computer, little to various information terminal such as mobile phone the input Chinese character.Because have the good level and the similar degree of polymerization, can import by turn, with walking out of word, a lot of Chinese characters need not the all-key input and can confirm in advance.
It is thus clear that the present invention is the standard Chinese character coded input method that a kind ofly has " typewrite, become literate, write and look into word " four combined function.
One of ordinary skill in the art will appreciate that: accompanying drawing is the synoptic diagram of an embodiment, and module in the accompanying drawing or flow process might not be that embodiment of the present invention is necessary.
One of ordinary skill in the art will appreciate that: all or part of step that realizes said method embodiment can be accomplished through the relevant hardware of programmed instruction; Aforesaid program can be stored in the computer read/write memory medium; This program the step that comprises said method embodiment when carrying out; And aforesaid storage medium comprises: various media that can be program code stored such as ROM, RAM, magnetic disc or CD.
What should explain at last is: above embodiment is only in order to explaining technical scheme of the present invention, but not to its restriction; Although with reference to previous embodiment the present invention has been carried out detailed explanation, those of ordinary skill in the art is to be understood that: it still can be made amendment to the technical scheme that previous embodiment is put down in writing, and perhaps part technical characterictic wherein is equal to replacement; And these are revised or replacement, do not make the spirit and the scope of the essence disengaging embodiment of the invention technical scheme of relevant art scheme.
Claims (7)
1. an encode method for entering Chinese characters is characterized in that, may further comprise the steps:
According to the Chinese character disassembler Hanzi structure is split; With Chinese character be divided into single character, about equate the left, center, right equates, few right many, left many right sides, a left side equate less, up and down or go up in equate down, go up few many down, go up many under less, the two sides encirclement, three bread enclose and completely encircle and special construction, and use a numerical key as its code respectively; The stroke of Chinese character is divided into horizontal, vertical, left, points, discount, and uses a numerical key respectively as its code;
According to import the characteristics of Chinese character, with the code of its Hanzi structure the 1st, with the number of components of Chinese character or stroke number the 2nd, and with the code of the stroke of these each parts of Chinese character the 3rd~6 as said coding as said coding as coding;
Through the numerical key on keyboard or the soft keyboard said coding is imported.
2. input method of Chinese character according to claim 1 is characterized in that, when Chinese character is single character, with as its coding the 2nd of the stroke number of this single character.
3. input method of Chinese character according to claim 1 is characterized in that, horizontal code is 1; Perpendicular code is 2, and the code of left-falling stroke is 3, and the code of point is 4; The code of folding is 5, and the code of the horizontal stroke that intersects with other strokes is 6, and the code of lifting-hook is 7; The code of the left-falling stroke that intersects with other strokes is 8, and the code of the folding that intersects with other strokes is 9.
4. input method of Chinese character according to claim 1 is characterized in that, when Chinese character is a single character, gets its preceding 4 pictures above the single character of 4 pictures and encodes.
5. input method of Chinese character according to claim 1 is characterized in that:
When Chinese character comprises 2 parts, encode for preceding 2 that get its 2 parts respectively;
When Chinese character comprised 3 parts, preceding 2 of the 1st and the 3rd parts that get preceding 2 parts respectively encoded;
When Chinese character comprises 4 and during with upper-part, encode for the 1st that gets preceding 4 parts respectively;
Wherein, if need get preceding 2 parts and have only 1 stroke the time, when then encoding this stroke is repeated 2 times.
6. input method of Chinese character according to claim 1 is characterized in that, is used as the identifier that coding finishes in advance with digital " 0 ".
7. input method of Chinese character according to claim 1 is characterized in that, in said coding, uses the first letter of pinyin of this Chinese character last position as this encode Chinese characters for computer, and wherein this first letter of pinyin can appear on any position of this encode Chinese characters for computer.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110160422.1A CN102830809B (en) | 2011-06-15 | 2011-06-15 | Encode method for entering Chinese characters |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110160422.1A CN102830809B (en) | 2011-06-15 | 2011-06-15 | Encode method for entering Chinese characters |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102830809A true CN102830809A (en) | 2012-12-19 |
CN102830809B CN102830809B (en) | 2016-05-11 |
Family
ID=47333975
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110160422.1A Active CN102830809B (en) | 2011-06-15 | 2011-06-15 | Encode method for entering Chinese characters |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102830809B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104978045A (en) * | 2015-05-27 | 2015-10-14 | 腾讯科技(深圳)有限公司 | Chinese character input method and device |
CN105068671A (en) * | 2015-06-29 | 2015-11-18 | 曾子力 | Chinese character input method |
CN105912139A (en) * | 2016-01-11 | 2016-08-31 | 金云中 | Corresponding recognition method for coding Chinese characters by using modular strokes |
WO2019218473A1 (en) * | 2018-05-14 | 2019-11-21 | 平安科技(深圳)有限公司 | Field matching method and device, terminal device and medium |
CN113377215A (en) * | 2021-06-25 | 2021-09-10 | 刘跃军 | Chinese-character 'Liulian' input method |
CN113900531A (en) * | 2021-03-26 | 2022-01-07 | 刘跃军 | Chinese character phonetic input method with transposition, continuous clicking, sound and shape and less selection |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1204799A (en) * | 1998-07-10 | 1999-01-13 | 陈澜 | Coding method of Chinese character unit stroke numbers |
CN1265482A (en) * | 2000-04-13 | 2000-09-06 | 徐万胥 | Digital union code Chinese character input method and its keyboard |
CN1336578A (en) * | 2001-09-05 | 2002-02-20 | 黄建东 | Chinese character inputting method based on digital keypad |
JP2002358071A (en) * | 2001-05-31 | 2002-12-13 | Seiko Epson Corp | Character object |
CN1499357A (en) * | 2002-11-01 | 2004-05-26 | ���Ծ | Method for lablling united character and word as well as character patterns and character picture |
-
2011
- 2011-06-15 CN CN201110160422.1A patent/CN102830809B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1204799A (en) * | 1998-07-10 | 1999-01-13 | 陈澜 | Coding method of Chinese character unit stroke numbers |
CN1265482A (en) * | 2000-04-13 | 2000-09-06 | 徐万胥 | Digital union code Chinese character input method and its keyboard |
JP2002358071A (en) * | 2001-05-31 | 2002-12-13 | Seiko Epson Corp | Character object |
CN1336578A (en) * | 2001-09-05 | 2002-02-20 | 黄建东 | Chinese character inputting method based on digital keypad |
CN1499357A (en) * | 2002-11-01 | 2004-05-26 | ���Ծ | Method for lablling united character and word as well as character patterns and character picture |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104978045A (en) * | 2015-05-27 | 2015-10-14 | 腾讯科技(深圳)有限公司 | Chinese character input method and device |
CN105068671A (en) * | 2015-06-29 | 2015-11-18 | 曾子力 | Chinese character input method |
CN105068671B (en) * | 2015-06-29 | 2018-01-05 | 曾子力 | A kind of input method of Chinese character |
CN105912139A (en) * | 2016-01-11 | 2016-08-31 | 金云中 | Corresponding recognition method for coding Chinese characters by using modular strokes |
WO2019218473A1 (en) * | 2018-05-14 | 2019-11-21 | 平安科技(深圳)有限公司 | Field matching method and device, terminal device and medium |
CN113900531A (en) * | 2021-03-26 | 2022-01-07 | 刘跃军 | Chinese character phonetic input method with transposition, continuous clicking, sound and shape and less selection |
CN113377215A (en) * | 2021-06-25 | 2021-09-10 | 刘跃军 | Chinese-character 'Liulian' input method |
Also Published As
Publication number | Publication date |
---|---|
CN102830809B (en) | 2016-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102830809A (en) | Chinese character coding input method | |
CN103616960A (en) | Six vowel binary syllabification input method | |
CN101551711A (en) | Chinese character coding input method based on structure and primitive | |
CN100533359C (en) | Oracle spelling and component disintegration and input method | |
CN105912139B (en) | Method for correspondingly recognizing modular stroke coding Chinese characters | |
CN102750009B (en) | A kind of without switching input method of Chinese character and keyboard | |
CN106227363B (en) | Accurate encoding of chinese characters and keyboard and input method on the basis of phonetic | |
CN100458667C (en) | Chinese character five-stroke fourteen-radicals inputting method on cellphone or computer | |
WO2008089654A1 (en) | Ordering retrieving method of chinese character type, device thereof and an information system | |
CN100520685C (en) | Chinese characters pinyin identification code input method | |
CN103176616A (en) | Input method and device for guqin abbreviated character notation characters | |
CN103207684A (en) | Phonemic letter double-input method | |
CN100371865C (en) | Chinese character input method for number keyboard and corresponding electronic product | |
CN107256092B (en) | Chinese character digital shape code quick input method | |
CN100390710C (en) | Fast and easy Chinese character input method and keyboard | |
CN104536590B (en) | Embedded software keyboard system based on West Xia Dynasty's text sound character roots input method | |
CN102177511A (en) | Method of organizing chinese characters | |
CN104267824A (en) | Chinese character wubi number digital coding input method | |
CN85100094A (en) | Phonetic transcriptions of Chinese characters association coding and spelling keyboard | |
CN1836199B (en) | Character inputting method of using word as unit | |
CN1027839C (en) | Chinese character encoding input method | |
CN102750002A (en) | Digital Chinese character inputting method | |
CN105589574B (en) | A kind of Sino-British number mixing character input method based on five first syllable codes | |
CN104731360A (en) | Hierarchical initial coding method | |
CN102637077A (en) | Phonological, calligraphic and tone hybrid coding method for inputting Chinese characters to computer |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160713 Address after: 100044, room 17, floor 3, building 34, 2002 South Main Street, Haidian District, Beijing, Zhongguancun Patentee after: Wen Hua (Beijing) Education Technology Co., Ltd. Address before: Beijing City, Haidian District Weigongcun street, home of Wei Bohao 5-3-1102 Patentee before: Gao Jingmin Patentee before: Dong Weiqun |