CN101470535A - Optimized Chinese character code input method - Google Patents

Publication number
CN101470535A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA200710305327XA
Other languages
Chinese (zh)
Inventor
王治阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Application filed by Individual
Priority to CNA200710305327XA
Publication of CN101470535A

Abstract

Disclosed is a Chinese character coding input method for computers, namely an optimized Chinese character code input method, in which each code consists of a phonetic part and a shape-component part. The encoding rules of the shape-component part are as follows. A single-component character is coded with its first and last basic components in stroke order. A compound character is split into a head part and a remainder: when the head part is a radical, it takes at most two codes, and when it yields only one code, the second code is taken from the first basic component of the remainder in stroke order; when the head part is not a radical, the head part and the remainder each take one code. The method achieves high-speed input with 26 basic components and five basic strokes, thereby overcoming the difficulty of keyboard input of Chinese characters.

Description

Optimized Chinese character code input method
Technical field
The invention belongs to the field of computer Chinese character input methods, that is, methods for entering Chinese text into a computer. Because the earlier method solved the long-unsolved problem of keyboard input by Chinese character coding, it was called the Chinese character code input method. The present invention is an optimization of that method and is therefore called the optimized Chinese character code input method. The invention also relates to the keyboard used to implement this input method.
Background art
Keyboard input is the most technically mature and most widespread of current input methods. Its defining feature is that the characters to be entered are encoded: each Chinese character is represented by a group of codes. A character is entered by pressing the keys that carry its codes, usually 1 to 4 keys per character. By type of coding, keyboard input falls into three classes: phonetic codes, shape codes, and phonetic-shape codes.
Phonetic codes are based on Hanyu Pinyin and encode a character by its pronunciation. Their advantage is convenience: anyone who speaks Mandarin can type with them, and they are easy to learn, so they are the most widely used. Their drawback is that homophones are numerous, so the single-character code collision rate is high and input is slow. In addition, a character the user cannot read cannot be entered directly, and a character the user mispronounces cannot be entered quickly.
Shape codes encode a character by the features of its written form. They overcome the high collision rate and slow input of phonetic codes, but they often use too many character components, which are troublesome to memorize, and splitting a character into components is sometimes troublesome as well. Some shape codes combine the five basic strokes in pairs; they use few components and are easy to memorize, but at the cost of being less intuitive and of breaking the character components apart. Shape codes are said to allow rapid input of characters the user cannot read. For an illiterate user, however, typing speed is still limited: Chinese text has no visible boundaries between words, so such a user cannot tell which characters form a phrase and cannot use phrase input. A shape-code typist therefore still needs to be literate in order to type quickly.
Phonetic-shape codes encode a character using both its pronunciation and its written form. They combine the respective advantages of phonetic codes and shape codes, and are usually fairly simple and easy to learn and remember. Some phonetic-shape codes, such as the "three five-tone" code, have a low code collision rate and an input speed comparable to that of any shape code. Phonetic-shape codes that use the full pinyin also help popularize standard Beijing pronunciation, which is a clear advantage. Their drawback is that the typist must keep switching between thinking about sound and thinking about shape, which is tiring. A practiced typist, of course, has usually memorized the codes and knows the code on seeing the character, so no switching is involved.
It should be noted that phonetic-shape codes which use only the initial of a character must discard the final. This runs against people's habits of thought and makes the sound-to-shape switching problem especially acute. If the initial and the final are both used in full, that is, the full pinyin of the character, and the phonetic part is placed first with the shape-component part after it, the typist rarely needs to switch between sound and shape, and thinking is essentially undisturbed. In ordinary text most content consists of words, and typists normally use phrase input wherever possible, so most content can be entered by pinyin alone. Some common single characters can also be entered by pinyin alone. Where a shape-component code is needed, it is generally enough to enter its first code. That first code is mostly the radical of the character; the number of radicals is limited, common radicals are fewer still, and each generally has a fixed code, so the first code is easy to remember. Characters that really require the second shape-component code are few, and only for these is the shape-component code harder to remember. Because the second code is prompted once the first has been entered, there is little need to memorize it. Such a phonetic-shape code is therefore essentially pinyin-based, with the shape-component code as an auxiliary code, and it barely disturbs the typist's thinking.
Consequently, anyone designing a phonetic-shape code should, to avoid the sound-to-shape switching problem, use the full pinyin of the character wherever possible rather than only the first letter or the initial. Under the influence of dialect, some people cannot pronounce certain characters accurately, but this can be handled by fuzzy-pinyin matching for southern accents, and pinyin input in turn helps popularize Beijing pronunciation. In fact, even a user who does not know the pronunciation of a character can still enter it as long as the shape-component code is known. A phonetic-shape code that places the full pinyin first can also be used purely as a pinyin input method. For these reasons, compared with other phonetic codes, shape codes, and phonetic-shape codes that use only the first pinyin letter, a phonetic-shape code with the full pinyin first increasingly shows its superiority.
Using full pinyin, however, makes the phonetic code long and inconvenient to type. Double pinyin shortens the code considerably, but most existing double pinyin schemes are hard to remember and require rote memorization of mnemonic rhymes. The inventor, Wang Zhiyang, has devised several new double pinyin schemes, referred to as Wang Zhiyang double pinyin, for which a patent application has been filed; they are very easy to learn, need no mnemonic rhymes, and can be learned in a few minutes. The key to designing a phonetic-shape code is therefore a method that is easy to learn and distinguishes homophones effectively, which depends on a simple and reasonable design of the shape-component part. The shape-component codes of existing pinyin-based phonetic-shape codes, however, often use too many character components, or are non-standard or not intuitive enough. How to select as few components as possible while keeping the collision rate as low as possible has remained an unsolved problem.
To address this, the inventor earlier devised the Chinese character code input method, which, following Wang Zhiyang double pinyin, enters the shape-component code using about 28 multi-stroke components and five basic strokes; its components are simple, intuitive, and standard. One shortcoming is that the basic components are arranged by stroke count, which is somewhat hard to remember; arranging them by pinyin or by pictographic resemblance might be easier. In addition, the code-taking rule did not consider the few radicals that appear at the end of a character, so characters ending in Chuo, Fu, or bird produce many duplicate codes. After improvement, the inventor has therefore introduced the optimized Chinese character code input method, in which the components are arranged by pinyin initial or by pictographic resemblance and the code-taking rule is more reasonable.
Summary of the invention
In summary, existing Chinese character input methods suffer from one or more of the following: non-standard components or too many components; codes that are too long; too many duplicate codes, which slows input; use of only the initial or first pinyin letter; insufficient intuitiveness; or unreasonable code-taking rules. None has solved the technical problem of making input both simple and fast, so entering Chinese characters remains inconvenient.
The object of the invention is to provide a computer Chinese character coding input method whose components are standard and intuitive, which is easy to learn, and with which characters can be entered simply and quickly, namely the optimized Chinese character code input method.
To this end, the invention specifies that the code of a character consists of two parts, a phonetic code and a shape-component code. The phonetic part uses Wang Zhiyang double pinyin and occupies two codes. The shape-component part occupies at most two codes. The phonetic part may alternatively use full pinyin, another double pinyin scheme, abbreviated pinyin, a pinyin-based phonetic script, or incomplete pinyin.
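The two-part structure described above can be sketched in a few lines. This is only an illustration, assuming the double pinyin variant (exactly two phonetic keys) and the phonetic-first order that the description recommends later; the function name `full_code` and the sample key strings are hypothetical and do not come from the patent.

```python
def full_code(phonetic: str, shape: str = "") -> str:
    """Join the phonetic part and the shape-component part into one code.

    Assumptions: the phonetic part is double pinyin (exactly two keys) and is
    typed first; the shape-component part is optional and at most two keys.
    """
    if len(phonetic) != 2:
        raise ValueError("double pinyin phonetic part must be exactly two keys")
    if len(shape) > 2:
        raise ValueError("shape-component part is at most two keys")
    return phonetic + shape


# Hypothetical key strings, used only to show the possible code lengths.
print(full_code("vm"))        # phonetic part only: 2 keys
print(full_code("vm", "k"))   # plus the first shape-component code: 3 keys
print(full_code("vm", "kd"))  # plus both shape-component codes: 4 keys
```

A full code is therefore 2 to 4 keys long under these assumptions, which matches the "1 to 4 keys per character" range mentioned in the background only at its upper end.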
The phonetic part should preferably use Wang Zhiyang double pinyin, which divides the finals by first letter into a, o, e, i, and u zones. Within each zone the finals are arranged first by number of letters and then in the order a, o, e, i, u, n, g, so the layout is regular. Alternatively, the number of letters may be ignored and the finals in each zone arranged only in the order a, o, e, i, u, n, g. The only thing that must be memorized is the rule for merging finals onto a shared key. For this it suffices to remember that the multi-letter finals ending in a, the finals ending in ong, and the four-letter final ending in ang are each merged with the corresponding final beginning with i, namely ia, iong, and iang. In addition, ui is placed on v and uo on o; the mnemonic readings for these two are "being surplus" ("being me") and "my nest". See Figures 1 and 2.
The shape-component code consists of two codes. The invention selects five basic strokes and 26 (or about 28) multi-stroke components for coding. These five basic strokes and the multi-stroke components are called basic components, or components for short. All are selected from the radicals of Chinese characters; they are simple, common, and intuitive, and few in number, so they are easy to remember. Because the State Language Commission also classes the five basic strokes as character components, the invention calls the five basic strokes single-stroke components, while the 26 or 28 components composed of several strokes are called multi-stroke components. In shape-component coding, a multi-stroke component takes priority over its individual strokes; otherwise selecting multi-stroke components would be pointless.
The code-taking rules of the shape-component code are as follows. For a single-component character, take in writing order the codes of the first and the last basic component; when the character has only one basic component, take only the code of that component. A compound character is split in two according to its overall structure: the part written first is called the head part (or first part), and the part written afterwards is called the remainder (or rear part). When the head part contains the radical, take in writing order the codes of the first and last basic components of the head part; when the head part has only one basic component, take its code and then the code of the first basic component of the remainder in writing order. When the head part does not contain the radical, take the code of the first basic component of the head part in writing order, and then the code of the first basic component of the radical in the remainder. When taking codes, multi-stroke basic components take priority over single strokes.
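The code-taking rules above can be sketched as a small function. This is a minimal sketch under stated assumptions, not the patent's implementation: the `Decomposition` record, its field names, and the single-letter key codes in the examples are all hypothetical, and the character decomposition (which components, which part holds the radical) is assumed to be supplied by a lookup table that the patent does not list. Where no radical is recorded for the remainder, the sketch falls back to the remainder's first component, which is also an assumption.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Decomposition:
    """Hypothetical record for one character.

    Each list holds the key codes of basic components in writing order,
    with multi-stroke components already preferred over single strokes.
    """
    head: List[str]                                   # whole character if single-component
    remainder: List[str] = field(default_factory=list)  # empty for single-component characters
    head_has_radical: bool = False
    remainder_radical: List[str] = field(default_factory=list)


def shape_code(d: Decomposition) -> str:
    first = d.head[0]
    if not d.remainder:
        # Single-component character: first and last component, or just one.
        return first if len(d.head) == 1 else first + d.head[-1]
    if d.head_has_radical:
        if len(d.head) == 1:
            # Head yields only one code: add the remainder's first component.
            return first + d.remainder[0]
        return first + d.head[-1]
    # Head without radical: one code from the head, one from the radical in
    # the remainder (fallback to the remainder itself is an assumption).
    second_source = d.remainder_radical or d.remainder
    return first + second_source[0]


# Placeholder key codes only; these are not real component assignments.
print(shape_code(Decomposition(head=["a", "b", "c"])))
print(shape_code(Decomposition(head=["s"], remainder=["m", "n"], head_has_radical=True)))
print(shape_code(Decomposition(head=["a", "b"], remainder=["x", "y"], remainder_radical=["y"])))
```

Keeping the decomposition as data and the rule as a pure function makes it easy to swap in the simplified rule or the fault-tolerant variants described later without touching the character table.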
This coding rule is the result of many years of study and a sudden inspiration. To reduce the collision rate, one must first judge whether the head part contains the radical. If it does not, the head part takes only one code, and the radical in the remainder is then coded. If the head part contains the radical, it takes at most two codes. When the radical is one of the selected multi-stroke components, so that only one code can be taken, the first basic component of the remainder is coded as well, which reduces the collision rate. When the head part contains the radical and has more than one basic component, so that two codes can be taken, two codes are taken. Some radicals are thus represented by two codes, which reduces the number of radicals that need a code of their own and keeps the scheme simple and easy to remember. The head part takes the codes of its first and last components rather than its first two because, with the first two components, "field", "shell", and "eye" would receive the same code, causing duplicate codes. A few radicals, such as Chuo, Fu, and bird, usually appear at the end of a character. If the rule simply said that the head part takes two codes whenever it can, without considering whether the head part is the radical, many duplicate codes would result. It is therefore necessary to specify that when the head part is not the radical it takes only one code, and the remainder then takes one code. With the code-taking rule optimized in this way, a very low single-character collision rate is achieved with only 26 basic components and five basic strokes.
Of the 6,763 characters in the national standard (GB) character set, compound characters make up the great majority, about 95%. Compound characters that share both pronunciation and radical are fairly numerous, about six to seven hundred pairs. The radicals Rui, Lv, mouth, wood, Rolling, Jin, Ren, woman, Yan, Xin, moon, worm, soil, Si, fire, Epileptic, , , mountain, stone, day, king, Fu, fish, Woo (including Yi), and standing grain produce relatively many such homophones. To reduce duplicate codes, these radicals are selected and each coded with a letter or other symbol; individual radicals may of course be left out. Because Woo and Yi are different radicals, selecting them removes only about five pairs of duplicate codes in total, so in the optimized method they may also be left out. Some radicals, such as "field", "eye", and "shell", though common, produce only one or two pairs of homophones or none at all, and are therefore not selected.
The inventor also found that, once characters are split in two, among compound characters with the same pronunciation and the same radical, cases in which the non-radical parts begin with the same kind of basic stroke are unexpectedly rare, only a little over 100 pairs. Among these, the components "ten" and "Http" occur most often, with about four or five pairs of duplicate codes; they too may be selected and each coded with another letter or symbol. Because "ten" seldom appears in the head part of a character, whereas "Http" also appears frequently as a radical, the optimized method drops "ten" and selects "Http", coding it with a letter or other symbol; "ten" may of course also be selected.
Components such as Chuo, Fu, bird, and heart often appear at the end of a character. Of these, heart is covered by Xin and Fu is selected, while Chuo and bird, as radicals, usually appear at the end, that is, in the remainder. Since the optimized code-taking rule specifies that a head part that is not the radical takes only one code and the remainder then takes one code, Chuo and bird may also be left out.
In this way 26 multi-stroke components and five basic strokes are selected and laid out on the keyboard. Each of the 26 multi-stroke components is coded with a corresponding letter or punctuation mark. To reduce duplicate codes, some of the five basic strokes may be merged with certain of the 26 multi-stroke components, placed on the same key, and coded with the same letter or symbol. The number 26 is not fixed: there may be fewer or more than 26 multi-stroke components, provided the number is about 26. For example, the components "ten", Chuo, bird, or Woo (including Yi) may also be selected and coded with a letter or punctuation mark.
For ease of memory, the optimized method does not arrange the basic components by stroke count and by the stroke order horizontal, vertical, left-falling, dot, turning. It arranges them by pinyin or by pictographic resemblance, as shown in Figure 3 or Figure 4. The inventor recommends the arrangement of Figure 3, which is ordered mainly by the pinyin letter of each component's radical name, with the few components that share an initial rearranged by pictographic resemblance. Since the arrangement by radical name requires almost no memorization, in practice only the few components that share an initial need to be remembered; because these resemble the shapes of English letters they are learned quickly, and the memory load is very small. Character components and letters do differ, of course, and the resemblance is only partial. Figure 4 arranges the components entirely by pictographic resemblance. Square character components and letters differ, however, and a close resemblance is hard to achieve; the number of components to be memorized by shape is several times larger and the memory load correspondingly several times larger, so the invention does not particularly recommend this arrangement. The basic strokes horizontal, vertical, left-falling, and dot occur frequently in this coding. To reduce duplicate codes they should not share keys with the multi-stroke components; it is more reasonable to place them on punctuation keys and code them with punctuation marks. The turning stroke occurs so rarely that placing it on the same key as a multi-stroke component causes almost no duplicate codes, so it is simply placed by its first pinyin letter.
Thus, by selecting 26 multi-stroke components and five basic strokes and optimizing the code-taking rule, the shape-component code is both simple and easy to remember and able to distinguish homophones effectively, with a very low collision rate. According to the inventor, this solves a problem that other input methods have left unsolved: the method is simple and intuitive, has a very low collision rate, and allows fast input, and the inventor regards it as an ideal Chinese character input method. This is the basic reason for the name Chinese character code input method.
For less educated users, who find it hard to judge whether the head part contains the radical, a simpler alternative code-taking rule is provided: judge whether the remainder is one of the selected multi-stroke components or the radical "Chuo" or "bird". If it is, the head part takes only one code, and the remainder is then coded. If it is not, the head part takes at most two codes; when the head part has only one basic component and so yields only one code, the remainder is then coded. In this case, for ease of memory, "Chuo" and "bird" should be included among the multi-stroke components. To reduce duplicate codes and keep the coding consistent, they are coded by their first stroke: "Chuo" is merged with the dot and "bird" with the left-falling stroke, each coded with a punctuation mark. See Figure 5.
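The simplified rule needs only one question about the remainder, which makes it easy to sketch. The version below is illustrative and rests on assumptions: the set `SELECTED_REMAINDERS` would in practice also hold the names of all selected multi-stroke components (the patent shows them only in the figures), and the punctuation keys "." and "/" standing for the dot key and the left-falling key are hypothetical choices.

```python
from typing import List, Set

# Assumption: names of components that trigger the "head takes one code" branch.
# Only the two radicals named in the text are listed; a real table would add
# the selected multi-stroke components from the keyboard figures.
SELECTED_REMAINDERS: Set[str] = {"Chuo", "bird"}


def simple_shape_code(head: List[str], remainder: List[str],
                      remainder_name: str = "") -> str:
    """Simplified rule: look at the remainder instead of finding the radical.

    `head` and `remainder` hold key codes of basic components in writing order.
    """
    if remainder_name in SELECTED_REMAINDERS or len(head) == 1:
        # One code from the head, then one code from the remainder.
        return head[0] + remainder[0]
    # Otherwise the head supplies both codes: its first and last component.
    return head[0] + head[-1]


# Placeholder key codes; "." stands in for the dot key shared with Chuo.
print(simple_shape_code(["a", "b"], ["."], "Chuo"))
print(simple_shape_code(["a", "b"], ["x", "y"]))
print(simple_shape_code(["a"], ["x", "y"]))
```

The trade-off is that the user never has to identify the radical, at the price of two extra entries ("Chuo" and "bird") in the set of components to remember.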
In a few compound characters both the head part and the tail contain a radical, and in a few others neither appears to contain one. For these the character code uses fault-tolerant coding: the input method software provides a fault-tolerant code-taking rule under which either the head part alone takes two codes, or the head part and the remainder each take one code.
A few characters are hard to classify as compound or single-component. Fault-tolerant coding is provided for these as well: the input method software accepts coding either as a single-component character or as a compound character.
Using the input method software, the user types on the computer keyboard the keys corresponding to the code of a character or phrase to complete the input.
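One way to picture fault-tolerant coding is a code table in which an ambiguous character is registered under every code it may legitimately receive. The sketch below is an assumption about how input method software might do this; the patent does not describe the software's data structures. The function names, the placeholder character names `char_A` and `char_B`, and the key codes are all hypothetical.

```python
from typing import Dict, List, Set


def radical_tolerant_codes(head: List[str], remainder: List[str]) -> Set[str]:
    """Both readings for a character whose radical position is ambiguous."""
    one_each = head[0] + remainder[0]      # head and remainder take one code each
    if len(head) == 1:
        return {one_each}                  # the head cannot supply two codes
    head_only = head[0] + head[-1]         # the head alone takes two codes
    return {head_only, one_each}


def register(table: Dict[str, List[str]], char: str, codes: Set[str]) -> None:
    """Enter the character under every accepted shape-component code."""
    for code in sorted(codes):
        table.setdefault(code, []).append(char)


table: Dict[str, List[str]] = {}
# Ambiguous radical: accept "head takes two" and "one code each".
register(table, "char_A", radical_tolerant_codes(["a", "b"], ["x", "y"]))
# Ambiguous single/compound character: the two codes are supplied directly.
register(table, "char_B", {"ac", "ax"})
print(table)
```

Registering alternatives this way means a tolerant code can collide with another character's normal code, as "ax" does here, so the number of tolerated characters has to stay small for the collision rate to remain low.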
Description of drawings
Figure 1 is the first Wang Zhiyang pinyin keyboard layout diagram.
Figure 2 is the second Wang Zhiyang pinyin keyboard layout diagram.
Figure 3 is the first shape-component code keyboard layout diagram.
Figure 4 is the second shape-component code keyboard layout diagram.
Figure 5 is the third shape-component code keyboard layout diagram.
Figure 6 is the fourth shape-component code keyboard layout diagram.
Figure 7 is the fifth shape-component code keyboard layout diagram.
Figure 8 is the sixth shape-component code keyboard layout diagram.
Embodiment
The character code consists of two parts: the phonetic code (the pinyin, also called the pinyin code) and the shape-component code. Either part may come first, but once chosen the order cannot change. For ease of typing, consistency with the typist's thinking, and full use of the punctuation keys, it is recommended that the pinyin come first and the shape-component code after it; the coding examples use this order. The pinyin may be full pinyin, double pinyin, abbreviated pinyin, or incomplete pinyin. Users who do not wish to learn double pinyin can use full pinyin, that is, the complete pinyin of each character. Abbreviated pinyin here means triple pinyin, in which a syllable is represented by at most three letters. To shorten the code and increase input speed, double pinyin is recommended, preferably Wang Zhiyang double pinyin, which can be learned in a few minutes. Full pinyin and double pinyin can also be mixed: input technology has advanced to the point where the two are compatible without switching input mode. The embodiments use the double pinyin devised by Wang Zhiyang; it can also be made compatible with full pinyin and used as a full-pinyin input method.
Wang Zhiyang double pinyin is a double pinyin input method for computers in which the initials and finals are assigned to keys in a reasonable and highly regular arrangement. It can be used as an input method on its own or as the phonetic part of the optimized Chinese character code.
The technical scheme of Wang Zhiyang double pinyin is characterized as follows:
(1) Single-letter initials are assigned to the keys of the same letters. The retroflex initials ch, sh, and zh are represented by i, u, and v respectively, in phonetic order, for ease of memory; alternatively ch may be represented by u, sh by i, and zh by v. The single vowel ü is still represented by the letter v. Following the complementary distribution of the finals, the key of a single-letter final carries no other final, apart from a final merged onto it by sound or a final that it already covers. Compound finals and nasal finals are also each represented by a single letter. Open-mouth (kaikou) finals are placed on the middle row of the QWERTY keyboard and divided by first letter into a, o, and e zones. Finals beginning with i (qichi) are placed on the upper row, called the i zone. Closed-mouth (hekou) and round-mouth (cuokou) finals are placed on the lower row, called the u zone, which includes the ü zone; see Figure 1. Alternatively, the hekou and cuokou finals may be placed on the upper row as the u zone and the qichi finals on the lower row as the i zone. Within each zone the finals are arranged from left to right by number of letters, and finals with the same number of letters are arranged from left to right in the order a, o, e, i, u, n, g.
(2) Each final is mapped to a letter or punctuation mark as follows:
a = a            b = uai          c = un, ün       d = ai
e = e            f = an           g = ang          h = ou
i = i            j = ong, iong    k = ei           l = en
m = uang, iang   n = uan, üan     o = o, uo
p = ing          q = ie           r = in, er       s = ao        t = iao
u = u            v = ü, ui        w = iu           x = ue, üe
y = ian          z = ua, ia       ; = eng
See Figure 1. Here the retroflex initials ch, sh and zh are represented by i, u and v respectively, following their phonetic order. This double-pinyin keyboard layout is considered the more satisfactory one, so it is the layout used in the coding examples.
Alternatively, the number of letters in a final may be disregarded, and the finals in each zone arranged from left to right purely by letter in the order a, o, e, i, u, n, g.
In that case each final is mapped to a letter or punctuation mark as follows:
a = a            b = uang, iang   c = uan, üan     d = an
e = e            f = ang          g = ao           h = ong, iong
i = i            j = ou           k = ei           l = en
m = un, ün       n = ue, üe       o = o, uo        p = iu
q = ian          r = ie, er       s = ai           t = in
u = u            v = ü, ui        w = iao          x = uai
y = ing          z = ua, ia       ; = eng
The arrangement on the keyboard is shown in Figure 2. To prevent others from designing around the patent, ch is coded with u and sh with i in this layout. Alternatively, finals with the same number of letters may be arranged from left to right in English alphabetical order, or the number of letters may be disregarded and all finals arranged in English alphabetical order. Because Chinese users find the order a, o, e, i, u, n, g easier to remember than English alphabetical order, memorizing the layout in the order a, o, e, i, u, n, g is recommended.
(3) A syllable that has only a final and no initial takes e, o or a as its initial code; alternatively, the first letter of the final may serve as the initial code. The code of the final is then added. The coding examples use e as the initial code. Choosing e has one advantage: because er is placed on the r key, the full-pinyin spelling and the double-pinyin code of er are identical.
(4) The input step for the phonetic-code part of the input method is: using double pinyin, enter in turn the code of the initial and the code of the final of a single character, according to the mappings of initials and finals to keys given above.
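Steps (1) to (4) can be sketched as a small lookup. The table below copies the first final-to-key mapping (Figure 1) and the choice of e for syllables without an initial; the function name and the way a syllable is passed in (already split into initial and final) are assumptions made for illustration.

```python
# Final-to-key mapping of the first layout (Figure 1).
FINAL_KEYS = {
    "a": "a", "uai": "b", "un": "c", "ün": "c", "ai": "d", "e": "e",
    "an": "f", "ang": "g", "ou": "h", "i": "i", "ong": "j", "iong": "j",
    "ei": "k", "en": "l", "uang": "m", "iang": "m", "uan": "n", "üan": "n",
    "o": "o", "uo": "o", "ing": "p", "ie": "q", "in": "r", "er": "r",
    "ao": "s", "iao": "t", "u": "u", "ü": "v", "ui": "v", "iu": "w",
    "ue": "x", "üe": "x", "ian": "y", "ua": "z", "ia": "z", "eng": ";",
}
# Retroflex initials share keys with vowels; other initials use their own letter.
INITIAL_KEYS = {"ch": "i", "sh": "u", "zh": "v"}
ZERO_INITIAL_KEY = "e"  # used when the syllable has no initial

def phonetic_code(initial, final):
    """Return the two-key phonetic code of one syllable."""
    first = ZERO_INITIAL_KEY if initial == "" else INITIAL_KEYS.get(initial, initial)
    return first + FINAL_KEYS[final]

print(phonetic_code("h", "an"))   # hf
print(phonetic_code("f", "eng"))  # f;
print(phonetic_code("", "er"))    # er
```

The three printed codes match the values given in the description for han, feng and er.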
The shape-code part is described in detail below.
Chinese characters can be divided into two classes: single-component characters and compound characters. Through long research on coding I have found that it is easy to tell whether a character has a left-right structure, because the gap in such a character readily divides it in two, whereas characters with a top-bottom or enclosing structure are sometimes hard to divide in two, and it is sometimes even hard to tell whether a character is a single-component character or has a top-bottom or enclosing structure. Dividing characters only by whether they have a left-right structure would be the easiest to learn, but it does not help reduce duplicate codes. In practice, characters whose structure is hard to determine can be handled with tolerant coding, that is, by allowing the same character to be coded differently under different divisions. Characters should therefore be divided into single-component characters and compound characters, which also agrees with traditional thinking. It should be noted that dividing a compound character in two and coding the two parts separately is a long-established technique, to which other inventors have also made significant contributions.
A compound character is split in two according to its overall structure. In stroke order, the part containing the first stroke of the character is called the first part (also the head part), and the rest is called the remaining part (also the rear part). This division is very useful. In some characters with an enclosing structure, such as "or" and "carrying", the enclosing part is written in separate stages in stroke order. Because the part containing the first stroke is defined as the first part and the part not containing it as the remaining part, the first part of "or" is "dagger-axe" and the rest is the remaining part, while the remaining part of "carrying" is "car" and the other parts form the first part.
For a character with a left-middle-right or top-middle-bottom structure, the middle part may be assigned to the remaining part. Alternatively, the middle part may be assigned to the first part, with the rest forming the remaining part; or the middle part may be dropped, so that only the right or bottom part is taken as the remaining part.
A character with a top-middle-bottom or left-middle-right structure may also be divided by a "forms a character first" principle. If there is a division in which both sides form characters, the "both sides form characters" division takes priority; if only one side can form a character, the character is divided by "one side forms a character". For example, "battalion" has a top-middle-bottom structure. If the grass radical alone is taken as the first part, neither side forms a character; if "Lu" is taken as the remaining part, one side forms a character, so "Lu" is taken as the remaining part. Likewise for "case": if the roof radical alone is taken as the first part, neither side forms a character; if "wood" is taken as the remaining part, both sides form characters, so "peace" is taken as the first part and "wood" as the remaining part. Such characters can of course also be handled with tolerant coding.
For a character with a top-middle-bottom or left-middle-right structure, the best method of division relies on the fact that the great majority of compound characters are phono-semantic characters: the character is divided into a semantic part and a phonetic part. For "case", "peace" must then be the first part and "wood" the remaining part. For "battalion", "Lu" is the remaining part and the rest is the first part. Some compound characters are associative compounds, and these can be divided according to their meaning structure.
Following the rules of the State Language Commission, the strokes of Chinese characters are classified into five basic strokes: horizontal, vertical, left-falling, dot and fold. A stroke is a line written without interruption when writing a character. Considering only the direction of the stroke and ignoring its weight and length, all strokes fall into these five basic strokes: the rising stroke is merged into horizontal, the vertical hook into vertical, the right-falling stroke into dot, and all other strokes with a turn into fold. In the present invention the five basic strokes are called single-stroke components. To reduce duplicate codes, 26 stroke structures with a high character frequency or usage frequency, all of them character components specified by the State Language Commission, are also selected and placed on the letter keys to take part in coding; they are called multi-stroke components. Single-stroke components and multi-stroke components are collectively called basic components, or components for short.
The code-taking rule for the shape part is as follows. For a single-component character, take the codes of the first and the last basic component in stroke order; when there is only one basic component, take only its code. A compound character is split in two by its overall structure: the part written first is called the first part (or head part), and the part written afterwards is called the remaining part (or rear part). When the first part contains a radical, take the codes of the first and last basic components of the first part in stroke order; when the first part has only one basic component, take its code and then the code of the first basic component of the remaining part in stroke order. When the first part does not contain a radical, take the code of the first basic component of the first part in stroke order, and then the code of the first basic component of the radical in the remaining part. The rule says that the first part contains a radical, rather than that the first part is a radical, because in a minority of characters the first part contains other components besides the radical: for example, the radical of "grain husk" is the "grain" within its first part, and the first part of "warbler" contains Mi in addition to Nian. The rule takes the radical code of the remaining part, rather than the code of its first component, because in a minority of characters the radical lies in the middle or at the end of the remaining part, as in "winning"; it may of course also be stipulated that the first component of the remaining part is taken. The coding examples follow this rule.
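The code-taking rule can be sketched as follows. This is a minimal illustration rather than the patented implementation: the class and field names are hypothetical, the component-to-key table is a small subset of the preferred arrangement (Figure 3) given later in the description, and the component lists of the remaining parts are my own assumptions about stroke order.

```python
from dataclasses import dataclass
from typing import List, Optional

# Subset of the component-to-key mapping of the preferred arrangement (Figure 3).
COMPONENT_KEYS = {
    "water radical": "d", "roof radical": "g", "soil": "t", "wood": "m",
    "fold": "z", "dot": ";", "horizontal": ",", "vertical": ".",
}

@dataclass
class Compound:
    first: List[str]                          # basic components of the first part, in stroke order
    rest: List[str]                           # basic components of the remaining part, in stroke order
    first_has_radical: bool                   # does the first part contain a radical?
    rest_radical_first: Optional[str] = None  # first basic component of the radical in the remaining part

def single_code(components: List[str]) -> str:
    """Single-component character: first and last component, or just one."""
    keys = [COMPONENT_KEYS[c] for c in components]
    return keys[0] if len(keys) == 1 else keys[0] + keys[-1]

def compound_code(ch: Compound) -> str:
    head = COMPONENT_KEYS[ch.first[0]]
    if ch.first_has_radical:
        if len(ch.first) >= 2:
            return head + COMPONENT_KEYS[ch.first[-1]]  # first and last of the first part
        return head + COMPONENT_KEYS[ch.rest[0]]        # one code each from the two parts
    tail = ch.rest_radical_first or ch.rest[0]          # radical of the remaining part if known
    return head + COMPONENT_KEYS[tail]

chinese = Compound(first=["water radical"], rest=["fold", "dot"], first_has_radical=True)
envelope = Compound(first=["soil", "soil"], rest=["horizontal", "vertical", "dot"], first_has_radical=True)
print(compound_code(chinese))   # dz
print(compound_code(envelope))  # tt
print(single_code(["wood"]))    # m
```

The three results agree with the shape codes the description gives for "Chinese", "envelope" and "wood".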
This coding rule is the result of years of concentrated study and a sudden inspiration. Among the 6763 characters of the GB standard, compound characters form the great majority, about 95%. Homophonic compound characters that share the same radical are numerous, about five to six hundred pairs. Among the radicals that often appear at the head of a character, the water radical, grass radical, mouth, wood, hand radical, metal radical and person radical produce the most homophones: the water radical has more than 60 pairs, and the others thirty to forty pairs each. To reduce duplicate codes, these radicals must be selected and each coded with a letter or other symbol. Radicals such as woman, the speech radical, the heart radical, moon, worm, soil, the silk radical, fire and the sickness radical have only about ten pairs of homophones each; to reduce duplicate codes they are also selected and each coded with a letter or other symbol. Radicals such as the bamboo radical, the foot radical, mountain, stone, sun, king, Fu, fish and grain have only about five pairs of homophones each; to reduce duplicate codes they may also be selected and each coded with a letter or other symbol, although individual radicals among them may be left out. Some radicals, such as "field", "eye" and "shell", are common but have only one or two pairs of homophones, or none at all, and are therefore not selected. I have found that, among homophonic characters sharing a radical, cases in which the parts other than the radical begin with the same basic stroke are unexpectedly few, only a little over 100 pairs. Among these the components "ten" and the roof radical occur often, and the roof radical also often appears in the first part, so the roof radical is selected and coded with a letter or other symbol.
The code-taking rule for the shape part may alternatively be defined as follows. For a single-component character, take the codes of the first and the last basic component in stroke order; when there is only one basic component, take only its code. A compound character is split in two by its overall structure: the part written first is called the first part (or head part), and the part written afterwards is called the remaining part (or rear part). When the first part is a radical that has two or more basic components, take the codes of the first and last basic components of the first part in stroke order. For all other characters, take the code of the first basic component of the first part in stroke order, and then the code of the first basic component of the remaining part. With the same set of basic components, most characters receive the same code under this rule as under the preceding rule.
The 26 multi-stroke components and the five basic strokes are then arranged on the keyboard. The multi-stroke components are generally placed on the letter keys and coded with letters. The five basic strokes may share keys with multi-stroke components, or may be spread onto the punctuation keys and coded with punctuation marks. For ease of memorization, the code of a component is mainly the initial of its pronunciation; to avoid duplicate codes, some components whose initials coincide are instead coded pictographically. The basic strokes horizontal, vertical and left-falling occur with high frequency, so to reduce duplicate codes they should preferably not share a key with multi-stroke components, although this is not excluded. In the coding examples they are placed in order on the three punctuation keys ",", "." and "/" and coded with these three marks, which is the more reasonable choice. The character frequency of the basic strokes dot and fold is relatively low, so they may share keys with multi-stroke components, each coded with a letter. However, because the frequency of dot differs little from that of left-falling, dot may also be placed on another punctuation key and coded with that mark, for example the semicolon ";" or the single quotation mark "'". In the coding examples dot is coded with the semicolon ";". Fold may likewise be placed on another punctuation key and coded with that mark. Coding horizontal, vertical, left-falling and dot with four punctuation marks has a further benefit: it makes full use of the keys on the keyboard, enlarges the code space and lowers the duplicate-code rate, without affecting fingering or the input of punctuation. The 26 multi-stroke components are coded with letters as far as possible.
The preferred arrangement of the 26 multi-stroke components and the five basic strokes on the keyboard is shown in Figure 3. They are mapped to letters and punctuation marks as follows:
a = fish              b = sickness radical     c = grass radical   d = water radical
e = grain             f = hand radical         g = roof radical    h = fire
i = worm              j = metal radical        k = mouth           l = silk radical
m = wood              n = woman                o = sun             p = Fu radical
q = moon              r = person radical       s = stone           t = soil
u = mountain          v = bamboo radical       w = king            x = heart radical
y = speech radical    z = foot radical, fold   ; = dot
, = horizontal        . = vertical             / = left-falling
Each component is coded with the corresponding letter or punctuation mark according to this mapping.
In detail: a resembles a fish, and the top of "fish" also resembles A; b is the initial of the sickness radical; c is the initial of the grass radical; d is the initial of the water radical; e is the final of grain; f resembles the hand radical; g is the initial of the roof radical; h is the initial of fire; i is used because ch is placed on it, and ch is the initial of worm; j is the initial of the metal radical; k is the initial of mouth; l resembles the first stroke of the silk radical; m is the initial of wood; n is the initial of woman; o resembles the outline of the sun; p resembles the Fu radical; q resembles the moon, which is sometimes round and sometimes incomplete, and so serves as the code for moon; r is the initial of the person radical; s is the first letter of the pinyin of stone; t is the initial of soil; u is used because sh is placed on u, and sh is the initial of mountain; v is used because zh is placed on v, and zh is the initial of the bamboo radical; w is the initial of king; x is the initial of the heart radical; y is the initial of the speech radical; z is the initial of the foot radical and of fold. These letters serve as the codes of the corresponding components. ";" is the code for dot, "," the code for horizontal, "." the code for vertical, and "/" the code for left-falling. Using four punctuation marks as the codes for horizontal, vertical, left-falling and dot has two benefits. First, it prevents these four basic strokes from sharing a key, and therefore a code, with a multi-stroke component, which would produce duplicate codes. Second, it enlarges the code space without affecting the input of punctuation.
Alternatively, the components may be arranged pictographically, that is, according to how closely each component resembles an English letter. A preferred arrangement is shown in Figure 4. In that case the 26 multi-stroke components and the five basic strokes are mapped to letters and punctuation marks as follows:
a = person radical    b = sun                  c = roof radical    d = stone
e = mountain          f = hand radical         g = foot            h = grass radical
i = speech radical    j = sickness radical     k = bamboo radical  l = heart radical
m = wood              n = moon                 o = mouth           p = Fu radical
q = worm              r = woman                s = water radical   t = soil
u = fish              v = metal radical        w = silk radical    x = fire
y = grain             z = king, fold           ; = dot
, = horizontal        . = vertical             / = left-falling
Each component is coded with the corresponding letter or punctuation mark according to this mapping. In detail: capital A resembles the person radical; capital B resembles sun; c resembles the roof radical; d resembles stone, with the box at the bottom; capital E resembles mountain; f resembles the hand radical, especially its mirror image; g resembles foot, with the box at the top; capital H resembles the grass radical; i resembles the speech radical; j resembles the sickness radical; k resembles the bamboo radical; l resembles the heart radical; m resembles wood and also a wood of trees, and the initial of wood is also m; n resembles moon; o resembles mouth; p resembles the Fu radical; capital Q resembles worm, a box with a stroke extending out of it; capital R resembles woman; s resembles the water radical and the shape of flowing water, and the initial of the water radical is also s; t resembles soil, and the initial of soil is also t; u resembles fish and a fish bladder, and its pronunciation is also similar; v resembles the top or bottom of the metal radical; w resembles the silk radical; x resembles fire; y resembles grain, whose shape is often like y; z resembles king and is also similar in shape to the fold stroke "second". For ease of memorization, horizontal, vertical, left-falling and dot are placed in order on ",", ".", "/" and ";". They may also be arranged in other ways, for example horizontal on ";", vertical on "/", left-falling on "," and dot on ".", each coded with the corresponding punctuation mark. Arranging the multi-stroke components entirely by resemblance has one benefit: no component needs to be arranged by pronunciation while others are arranged by shape, so the coding principle is consistent, and some users may prefer this approach.
Some basic components change shape when they serve as radicals, but each must be treated as the same basic component and coded with the same letter. Examples are the bamboo radical and bamboo, foot and the foot radical, the person radical and person, the speech radical and speech, the metal radical and gold, the heart radical and heart, fire and Xiangxi, the hand radical and hand, and the water radical and water. This rule applies to all the figures.
For less-educated users, who may find it hard to judge whether the first part contains a radical, another, simpler code-taking rule is provided: judge whether the remaining part is one of the preferred multi-stroke components or one of the radicals "Chuo" and "bird". If it is, the first part takes only the code of one basic component in stroke order, and the remaining part then takes the code of that multi-stroke component. If it is not, the first part takes at most the codes of its first and last basic components in stroke order; when the first part has only one basic component and so yields only one code, the remaining part then takes the code of its first component in stroke order. For ease of memorization, "Chuo" and "bird" should in this case be included among the multi-stroke components. To reduce duplicate codes and keep the coding consistent, they are coded by their first stroke: "Chuo" shares a key with dot and "bird" shares a key with left-falling, each coded with a punctuation mark. See Figure 5, which differs from Figure 3 only in the two additional radicals "Chuo" and "bird".
In this case the multi-stroke components and basic strokes are mapped to letters and punctuation marks as follows:
a = fish              b = sickness radical     c = grass radical   d = water radical
e = grain             f = hand radical         g = roof radical    h = fire
i = worm              j = metal radical        k = mouth           l = silk radical
m = wood              n = woman                o = sun             p = Fu radical
q = moon              r = person radical       s = stone           t = soil
u = mountain          v = bamboo radical       w = king            x = heart radical
y = speech radical    z = foot radical, fold   ; = dot, Chuo
, = horizontal        . = vertical             / = left-falling, bird
Alternatively, because the character frequency of the basic strokes dot and fold is relatively low, they may be arranged by the first letter of their pinyin and coded with d and z respectively. The two components "ten" and "Woo" (which includes "Yi") may also be selected, in which case the arrangement of the multi-stroke components and the five basic strokes on the keyboard is as shown in Figure 6. The method of arrangement is the same as in the Xinhua Dictionary: by number of strokes, and then in the order horizontal, vertical, left-falling, dot, fold. The multi-stroke components and basic strokes are mapped to letters and punctuation marks as follows:
a = roof radical      b = [illegible]          c = Yi, Woo         d = silk radical, dot
e = Fu radical        f = king                 g = wood            h = sun
i = mountain          j = moon                 k = fire            l = stone
m = fish              n = [illegible]          o = heart radical   p = water radical
q = person radical    r = soil                 s = woman           t = hand radical, ten
u = mouth             v = worm                 w = speech radical  x = sickness radical
y = grass radical     z = grain, fold          ; = metal radical
, = horizontal        . = vertical             / = left-falling
Alternatively, the components may be divided by first stroke into five zones (horizontal, vertical, left-falling, dot and fold). Within each zone they are arranged by number of strokes, and components with the same number of strokes are arranged in the order horizontal, vertical, left-falling, dot, fold. A preferred arrangement is shown in Figure 7. The multi-stroke component "ten" is not selected in Figures 7 and 8. In this case the multi-stroke components and basic strokes are mapped to letters and punctuation marks as follows:
a = soil              b = [illegible]          c = metal radical   d = grass radical, dot
e = water radical     f = king                 g = wood            h = stone
i = Fu radical        j = mouth                k = mountain        l = sun
m = foot              n = fish                 o = woman           p = silk radical
q = speech radical    r = roof radical         s = hand radical    t = fire
u = Yi, Woo           v = grain                w = heart radical   x = moon
y = sickness radical  z = person radical, fold ; = worm
, = horizontal        . = vertical             / = left-falling
Alternatively, the components may be divided by first stroke into five zones (horizontal, vertical, left-falling, dot and fold), and within each zone arranged in the order horizontal, vertical, left-falling, dot, fold. A preferred arrangement is shown in Figure 8. Here dot is placed on the ";" key. In this case the multi-stroke components and basic strokes are mapped to letters and punctuation marks as follows:
a = king              b = moon                 c = [illegible]     d = hand radical
e = water radical     f = grass radical        g = wood            h = stone
i = Fu radical        j = mouth                k = sun             l = foot
m = mountain          n = fish                 o = woman           p = silk radical
q = sickness radical  r = fire                 s = soil            t = roof radical
u = Yi, Woo           v = person radical       w = heart radical   x = grain
y = speech radical    z = metal radical, fold  ; = worm, dot
, = horizontal        . = vertical             / = left-falling
A radical-first principle may also be defined for the shape part: first take the code of the character's radical. If the radical consists of just one preferred basic component, take one code, and then take the code of the first component, in stroke order, of what remains after the radical is removed, that is, the first component in stroke order that does not belong to the radical. If the radical contains two or more preferred basic components, take the codes of the first and last components of the radical in stroke order. These are all variations of the present invention. I do not much favor this method, because it sometimes conflicts with stroke order.
As can be seen, shape coding according to Figure 3 and Figure 4 is relatively simple. Because the arrangements of Figure 1 and Figure 3 are the easier ones to remember, the coding examples use Figure 1 and Figure 3. Coding examples: for "Chinese", the initial is h and the final is an, whose code is f, so the phonetic-code part is hf. For the shape part the character is coded as a compound character and split in two by its overall structure: the first part is the water radical and the remaining part is "again". The first part has only one component, the water radical, coded d; the first component of the remaining part, fold, is then taken, coded z. The code of "Chinese" is therefore "hfdz". For "word", the phonetic-code part in double pinyin is zi. The shape part is coded as a compound character: the first part has only one component, the roof radical, coded g; the first component of the remaining part, fold, is then taken, coded z. The code of "word" is "zigz". For "envelope", the full pinyin is feng and the double-pinyin code is f;. In the shape part the first part can supply two codes, for its first and last components, which are the two components "soil" and "soil", coded t and t, so the code of "envelope" is "f;tt". Note that the choice of basic components follows a largest-first principle: the basic component with more strokes is coded in preference. When coding the first part of "envelope", for example, the two "soil" components are taken rather than two horizontal strokes, because "soil" has more strokes than "horizontal". For "wood", the double-pinyin code is mu; it is a single-component character with only one basic component, "wood", coded m, so the code of "wood" is mum.
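The largest-first principle in the "envelope" example can be illustrated with a greedy match over a stroke sequence. This is a simplification I am assuming for illustration: real components are recognised by shape, not just by stroke sequence, and the stroke list for the first part of "envelope" is my own transcription.

```python
# Codes of the five basic strokes and of one multi-stroke component (Figure 3 arrangement).
STROKE_KEYS = {"horizontal": ",", "vertical": ".", "left-falling": "/", "dot": ";", "fold": "z"}
MULTI_STROKE = {("horizontal", "vertical", "horizontal"): "t"}  # "soil"

def component_keys(strokes):
    """Greedily prefer the component with the most strokes at each position."""
    keys, i = [], 0
    patterns = sorted(MULTI_STROKE, key=len, reverse=True)
    while i < len(strokes):
        for pattern in patterns:
            if tuple(strokes[i:i + len(pattern)]) == pattern:
                keys.append(MULTI_STROKE[pattern])
                i += len(pattern)
                break
        else:  # no multi-stroke component starts here: fall back to the single stroke
            keys.append(STROKE_KEYS[strokes[i]])
            i += 1
    return keys

# First part of "envelope": two "soil" components written one after the other.
first_part = ["horizontal", "vertical", "horizontal"] * 2
keys = component_keys(first_part)
shape = keys[0] + keys[-1]  # first and last component of the first part
print("f;" + shape)         # f;tt
```

Without the largest-first preference, the same strokes would yield two horizontal codes (",,") instead of "tt".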
To improve input speed, short codes are provided for high-frequency characters: a short code takes the first 1, 2 or 3 codes of the full code of a common character, followed by one press of the space bar. Because the phonetic code comes first and the shape code follows, the shape code of many characters does not need to be entered in full, so the code of a single character is in practice mainly the phonetic code, supplemented by the shape code.
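A short code is simply a prefix of the full code followed by the space bar. A minimal sketch, with a hypothetical function name:

```python
def short_codes(full_code):
    """Candidate short codes: the first 1, 2 or 3 keys of the full code, each ended by a space."""
    return [full_code[:n] + " " for n in (1, 2, 3) if n <= len(full_code)]

print(short_codes("hfdz"))  # ['h ', 'hf ', 'hfd ']
```

Which prefix is actually assigned to a given character depends on its frequency and on which prefixes are still free, which this sketch does not model.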
The two-key short codes of pinyin cover only about 400 characters, while the two-key code space has 729 codes. The remaining three hundred or so codes can therefore be used for short-code words, which further improves typing speed. For example, no Chinese character has the pinyin kian, so the double-pinyin code ky does not occur, and ky can be used as the code of the word "can", because "k" and "y" are respectively the codes of its two characters "can" and "with". Since more than 300 short-code words are provided and words are in theory faster to enter than single characters, this clearly improves input speed. After pressing the keys of the short code of a character or word, the user presses the space bar to enter the corresponding character or word.
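The figure of 729 is consistent with 27 keys (the 26 letters plus ";") in each of two positions. The text does not spell this derivation out, so the key set below is an assumption; the count of about 400 used codes is taken from the text.

```python
# Assumed key set for two-key codes: 26 letters plus the semicolon.
KEYS = "abcdefghijklmnopqrstuvwxyz;"

code_space = len(KEYS) ** 2          # 27 * 27 two-key combinations
used_by_syllables = 400              # approximate figure stated in the description
free_for_words = code_space - used_by_syllables

print(code_space)      # 729
print(free_for_words)  # 329
```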
Word input is the most common way of improving Chinese input speed. Because the phonetic code comes first and the shape code follows, word input uses only the phonetic code. With Wang Zhiyang double pinyin, the steps of word input are:
A, two words language is got the initial consonant of each word, the code of simple or compound vowel of a Chinese syllable is imported successively; As " coding " code is byma.
B, three-character words and phrases are got the code of the initial consonant of each word and are imported successively, blank fill input again; Code as " computing machine " is " jsj ".Certainly the last sign indicating number that also can stipulate to get first word, second word is the code of initial consonant, gets the first two sign indicating number of the 3rd word again, and less than is just got one yard for two yards.Also can stipulate to get the first two sign indicating number of first word, less than is just got one yard for two yards again, and the last sign indicating number of getting second word, the 3rd word again is the code of initial consonant.
C, four words and above word are got the code of the initial consonant of first three word and the last character and are imported successively; As " science and technology " is four words, and the code of getting the initial consonant of each word is " kxju ".Wherein u is the code of the initial consonant sh of art.And for example " Xinjiang Uygur Autonomous Regions ", coding is got the code " xjwq " of the initial consonant of first three word and the last character " Xinjiang Wei Qu ".
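Rules A to C can be sketched as a small function. This is only an illustration under stated assumptions: each syllable is passed in already encoded as an (initial key, final key) pair, the function name `word_code` is hypothetical, and the space padding of rule B is returned as a trailing space character.

```python
from typing import List, Tuple

# Each syllable is an (initial_key, final_key) pair, already encoded in
# double pinyin, e.g. "bian" -> ("b", "y") and "shu" -> ("u", "u").
Syllable = Tuple[str, str]


def word_code(syllables: List[Syllable]) -> str:
    """Keystrokes for a word, following rules A, B and C of the text."""
    n = len(syllables)
    if n < 2:
        raise ValueError("word input applies to words of two or more characters")
    if n == 2:
        # Rule A: initial and final code of each character.
        return "".join(initial + final for initial, final in syllables)
    if n == 3:
        # Rule B: the three initial codes, padded with a space.
        return "".join(initial for initial, _ in syllables) + " "
    # Rule C: initial codes of the first three characters and the last one.
    picked = syllables[:3] + [syllables[-1]]
    return "".join(initial for initial, _ in picked)


bianma = [("b", "y"), ("m", "a")]
jisuanji = [("j", "i"), ("s", "n"), ("j", "i")]
kexue_jishu = [("k", "e"), ("x", "x"), ("j", "i"), ("u", "u")]
```

With these inputs, `word_code(bianma)` reproduces the "byma" example and `word_code(kexue_jishu)` the "kxju" example from the text.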
Phrase input can still produce duplicate codes: the rate is generally low, but homophonous words do occur. When they do, one simple remedy is context-based intelligent processing. Where that is not possible, a simple technique largely avoids manual homophone selection: after the phonetic code, also enter the first code of the shape code of the first or the last character of the phrase, usually that of the first character. For example, after "uiji" is entered (where "u" is the compressed code of the initial "sh"), the candidates "reality", "deed", "reagent", "century" and "Records of the Historian" are offered. Each candidate is preceded by a digit key and followed by a letter or symbol, which is the first shape code of its first character. With the mapping of accompanying drawing 3, the codes of the components rendered Http, horizontal stroke, Yan, Nian and mouth are "g", ",", "y", "c" and "k" respectively. Striking the corresponding key puts the word on screen directly, with no need to pick among the duplicates by digit key. This simple and practical measure leaves almost no duplicate-code words in practice.
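The selection step for the "uiji" example can be sketched as follows. The candidate table is hypothetical and uses English glosses in place of the Chinese words; pairing each gloss with its shape key by list order follows the word "respectively" in the text and is otherwise an assumption, as is the function name `pick`.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical candidate list for the phonetic code "uiji": each entry is
# (gloss of the word, first shape code of its first character).
CANDIDATES: Dict[str, List[Tuple[str, str]]] = {
    "uiji": [
        ("reality", "g"),
        ("deed", ","),
        ("reagent", "y"),
        ("century", "c"),
        ("Records of the Historian", "k"),
    ],
}


def pick(code: str, key: str) -> Optional[str]:
    """Resolve a duplicate code by digit key (position) or by shape key."""
    candidates = CANDIDATES.get(code, [])
    if key.isdigit():
        index = int(key) - 1  # digit keys are 1-based
        return candidates[index][0] if 0 <= index < len(candidates) else None
    for word, shape_key in candidates:
        if shape_key == key:
            return word
    return None
```

In this sketch `pick("uiji", "c")` and `pick("uiji", "4")` select the same candidate; the shape key simply spares the user from reading off the digit.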
In this way the invention deals with both the duplicate-code characters and the duplicate-code words encountered by pinyin input methods. With the layouts of accompanying drawings 1 and 3, the method can be learned in about ten minutes, and the basics in only a few: for double pinyin it is enough to remember where the a, o, e, i and u zones begin, and input can also rely on the double-pinyin prompt bar; the shape code uses only 26 multi-stroke components and five single-stroke components, and most basic components take the initial of their pinyin as their code. With the layouts of accompanying drawings 2 and 4, twenty to thirty minutes suffice. The applicant presents this as a marked advantage that makes the invention an ideal input method for Chinese characters.
With the optimized Chinese character code input method software, input is completed by striking the keys of the code of a character or phrase on the computer keyboard. A character whose code has no duplicates and has reached the specified code length goes on screen automatically; a character with duplicates is selected from the prompt bar. Characters and words are handled compatibly, and the code length never exceeds 4 codes. The "~" key is a wildcard learning key: when part of a character's code is not known, "~" can be typed in its place to help find the correct code, and the character is then selected from the prompt bar.
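The "~" wildcard lookup might work along the following lines. This is a minimal sketch, assuming that each "~" stands for exactly one unknown code position; the code table and the function name `wildcard_lookup` are hypothetical.

```python
from typing import Dict, List


def wildcard_lookup(pattern: str,
                    table: Dict[str, List[str]],
                    wildcard: str = "~") -> Dict[str, List[str]]:
    """Return every table entry whose code matches the pattern.

    Assumption: each wildcard replaces exactly one code position.
    """
    def matches(code: str) -> bool:
        return len(code) == len(pattern) and all(
            p == wildcard or p == c for p, c in zip(pattern, code)
        )

    return {code: words for code, words in table.items() if matches(code)}


# Hypothetical code table (English glosses stand in for the entries).
TABLE = {
    "byma": ["coding"],
    "bymk": ["entry-1"],
    "kxju": ["science and technology"],
}
```

Here `wildcard_lookup("by~a", TABLE)` returns only the "byma" entry, while `wildcard_lookup("bym~", TABLE)` returns both codes that start with "bym", which the user would then choose between in the prompt bar.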
For ease of use, fault-tolerant codes are also provided: for characters whose codes are easily mistaken, the intended character still appears when the wrong code is entered.
Note that letters in this specification, the claims and the accompanying drawings are case-insensitive.

Claims (9)

1. A computer Chinese character code input method, namely an optimized Chinese character code input method, in which the strokes of Chinese characters are classified, as prescribed by the State Language Work Committee, into five basic strokes (horizontal, vertical, left-falling, dot and fold), characterized in that:
(1) the code of a Chinese character consists of two parts, a phonetic code (the pinyin) and a shape code; either part may come first, but the order, once chosen, cannot be changed;
(2) the pinyin may be entered as full pinyin, double pinyin, abbreviated pinyin or incomplete pinyin;
(3) the code-taking rules of the shape code are as follows. For a single (non-compound) character, take the codes of the first and the last basic components in stroke order; when it has only one basic component, take only that component's code. A compound character is divided in two according to its overall structure: the part written first is called the head (first part), and the part written afterwards is called the remainder (rear part). When the head is a radical, take the codes of the first and last basic components of the head in stroke order; when the head has only one basic component, take in addition the code of the first basic component of the remainder in stroke order. When the head is not a radical, take the code of the first basic component of the head in stroke order, followed by the code of the first basic component of the remainder;
(4) the shape code preferably uses five basic strokes and 26 basic components, which are mapped to letters and punctuation marks, chiefly by pronunciation, as follows:
a = fish   b = Epileptic   c = Lv   d = Rui
e = standing grain   f = Rolling   g = Http   h = fire
i = worm   j = Jin   k = mouth   l = Si
m = wood   n = woman   o = day   p = Fu
q = month   r = Ren   s = stone   t = soil
u = mountain   v = [glyph not reproduced]   w = king   x = Xin
y = Yan   z = [component shown only as an image in the original], fold   ";" = dot
"," = horizontal   "." = vertical   "/" = left-falling
Alternatively, the mapping between multi-stroke components, basic strokes and letters or punctuation marks is set as:
a = Ren   b = day   c = Http   d = stone
e = mountain   f = Rolling   g = foot   h = Lv
i = Yan   j = Epileptic   k = [glyph not reproduced]   l = Xin
m = wood   n = month   o = mouth   p = Fu
q = worm   r = woman   s = Rui   t = soil
u = fish   v = Jin   w = Si   x = fire
y = standing grain   z = king, fold   ";" = dot
"," = horizontal   "." = vertical   "/" = left-falling
Alternatively, the mapping between multi-stroke components, basic strokes and letters or punctuation marks is set as:
a = soil   b = [glyph not reproduced]   c = Jin   d = Lv, dot
e = Rui   f = king   g = wood   h = stone
i = Fu   j = mouth   k = mountain   l = day
m = foot   n = fish   o = woman   p = Si
q = Yan   r = Http   s = Rolling, ten   t = fire
u = Yi, Woo   v = standing grain   w = Xin   x = month
y = Epileptic   z = Ren, fold   ";" = worm
"," = horizontal   "." = vertical   "/" = left-falling
Alternatively, the mapping between multi-stroke components, basic strokes and letters or punctuation marks is set as:
a = Http   b = [glyph not reproduced]   c = Yi, Woo   d = Si, dot
e = Fu   f = king   g = wood   h = day
i = mountain   j = month   k = fire   l = stone
m = fish   n = [glyph not reproduced]   o = Xin   p = Rui
q = Ren   r = soil   s = woman   t = Rolling
u = mouth   v = worm   w = Yan   x = Epileptic
y = Lv   z = standing grain, fold   ";" = Jin
"," = horizontal   "." = vertical   "/" = left-falling
Alternatively, the mapping between multi-stroke components, basic strokes and letters or punctuation marks is set as:
a = king   b = month   c = [glyph not reproduced]   d = Rolling
e = Rui   f = Lv   g = wood   h = stone
i = Fu   j = mouth   k = day   l = foot
m = mountain   n = fish   o = woman   p = Si
q = Epileptic   r = fire   s = soil   t = Http
u = Yi, Woo   v = Ren   w = Xin   x = standing grain
y = Yan   z = Jin, fold   ";" = worm, dot
"," = horizontal   "." = vertical   "/" = left-falling
With the optimized Chinese character code input method software, input is completed by striking the keys of the code of a character or phrase on the computer keyboard.
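The code-taking rule of item (3) can be illustrated with a short sketch. It assumes that a character has already been decomposed by hand into its first part (stem) and remaining part, each given as a list of basic components in stroke order; the component names are the English glosses of this translation, the key assignments are a subset of the first mapping above, and the decompositions used in the examples are hypothetical rather than those of real characters.

```python
from typing import List, Optional

# Subset of the first component-to-key mapping in claim 1
# (English glosses of the components as used in this translation).
COMPONENT_KEYS = {
    "mouth": "k", "wood": "m", "day": "o",
    "soil": "t", "mountain": "u", "fire": "h",
}


def shape_code(head: List[str],
               remainder: Optional[List[str]] = None,
               head_is_radical: bool = False) -> str:
    """Shape code of a character under rule (3).

    `head` and `remainder` list basic components in stroke order;
    `remainder` is None or empty for a single (non-compound) character.
    """
    keys = [COMPONENT_KEYS[c] for c in head]
    if not remainder:
        # Single character: first and last component, or just the one.
        return keys[0] if len(keys) == 1 else keys[0] + keys[-1]
    first_of_remainder = COMPONENT_KEYS[remainder[0]]
    if head_is_radical and len(keys) > 1:
        # Radical head with several components: its first and last.
        return keys[0] + keys[-1]
    # One-component radical head, or non-radical head:
    # first component of the head, then first of the remainder.
    return keys[0] + first_of_remainder
```

For instance, a hypothetical compound whose first part is the one-component radical "mouth" and whose remaining part begins with "day" would receive the code "ko". Either way the shape code has at most two keys, which is consistent with the overall 4-code limit once a two-key phonetic code is added.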
2. The optimized Chinese character code input method according to claim 1, characterized in that: the double pinyin is preferably Wang Zhiyang double pinyin, in which ch, sh and zh are represented, in alphabetical order, by i, u and v respectively, the single final ü is represented by the letter v, and the finals are mapped to letters as follows:
a = a   b = uai   c = un, ün   d = ai
e = e   f = an   g = ang   h = ou
i = i   j = ong, iong   k = ei   l = en
m = uang, iang   n = uan, üan   o = o, uo
p = ing   q = ie   r = in, er   s = ao   t = iao
u = u   v = ü, ui   w = iu   x = ue, üe
y = ian   z = ua, ia   ";" = eng
Alternatively, the number of letters in a final may be disregarded, and the finals in each zone arranged from left to right in the order a, o, e, i, u, n, g of their letters;
the finals are then mapped to letters as follows:
a = a   b = uang, iang   c = uan, üan   d = an
e = e   f = ang   g = ao   h = ong, iong
i = i   j = ou   k = ei   l = en
m = un, ün   n = ue, üe   o = o, uo   p = iu
q = ian   r = ie, er   s = ai   t = in
u = u   v = ü, ui   w = iao   x = uai
y = ing   z = ua, ia   ";" = eng
Alternatively, finals with the same number of letters may be arranged from left to right in English alphabetical order, or all finals may be arranged in English alphabetical order regardless of their number of letters;
for a syllable that has a final but no initial, e, o or a is taken as the initial; alternatively, the first letter of the final may serve as the initial code, followed by the final code. In general, e is used as the initial code.
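The first layout of claim 2 can be written down as a lookup table. The sketch below assumes that a syllable is supplied already split into initial and final, with the final spelled exactly as in the table (ü written explicitly) and y and w treated as ordinary initials; the function name `encode_syllable` is hypothetical, and the zero-initial key defaults to "e", which the claim gives as the general choice.

```python
# Finals of the first Wang Zhiyang double-pinyin layout in claim 2.
FINAL_KEYS = {
    "a": "a", "uai": "b", "un": "c", "ün": "c", "ai": "d",
    "e": "e", "an": "f", "ang": "g", "ou": "h",
    "i": "i", "ong": "j", "iong": "j", "ei": "k", "en": "l",
    "uang": "m", "iang": "m", "uan": "n", "üan": "n",
    "o": "o", "uo": "o", "ing": "p", "ie": "q",
    "in": "r", "er": "r", "ao": "s", "iao": "t",
    "u": "u", "ü": "v", "ui": "v", "iu": "w", "ue": "x", "üe": "x",
    "ian": "y", "ua": "z", "ia": "z", "eng": ";",
}

# Two-letter initials are compressed to one key; others map to themselves.
INITIAL_KEYS = {"ch": "i", "sh": "u", "zh": "v"}


def encode_syllable(initial: str, final: str,
                    zero_initial_key: str = "e") -> str:
    """Two-key double-pinyin code of one syllable.

    `initial` is "" for a syllable without an initial consonant.
    """
    if initial == "":
        first = zero_initial_key
    else:
        first = INITIAL_KEYS.get(initial, initial)
    return first + FINAL_KEYS[final]
```

Encoding "bian" and "ma" this way gives "by" and "ma", which together reproduce the "byma" example in the description; "shi" followed by "ji" gives "uiji".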
3. The optimized Chinese character code input method according to claim 1, characterized in that: radicals under which a great many homophones occur, such as Rui, Lv, mouth, wood, Rolling, Jin and Ren, must be selected as basic components, each coded with a letter or other symbol; radicals under which a fair number of homophones occur, such as woman, Yan, Xin, month, worm, soil, Si, fire and Epileptic, are also to be selected, each coded with a letter or other symbol; radicals under which a few pairs of homophones occur, such as [glyph not reproduced], [component shown only as an image in the original], mountain, stone, day, king, Fu, fish, standing grain, [glyph not reproduced] and Http, may also be selected, each coded with a letter or other symbol; all basic components are selected from the radicals of Chinese characters.
4. The optimized Chinese character code input method according to claim 2, characterized in that: when a character has a top-middle-bottom or left-middle-right structure, it is preferably divided into two parts according to its pictophonetic structure or its associative-compound structure.
5. The optimized Chinese character code input method according to claim 1, characterized in that: for a character with a top-middle-bottom or left-middle-right structure, the middle part is assigned to the remainder; alternatively, it may be stipulated that the middle part is assigned to the head. The division may also follow the principle that character-forming parts take priority: when a character has a top-middle-bottom or left-middle-right structure, if both sides can each form a character, it is divided under the principle "both sides forming characters takes priority"; if only one side can form a character, it is divided under the principle "one side forming a character takes priority".
6. The optimized Chinese character code input method according to claim 1, characterized in that an alternative code-taking rule is: determine whether the remainder is one of the preferred multi-stroke components. If it is, take the code of only one basic component of the head in stroke order, followed by the code of that multi-stroke component for the remainder. If it is not, take at most the codes of the first and last basic components of the head in stroke order; when the head has only one basic component and so yields only one code, take in addition the code of the first component of the remainder in stroke order. In this case, for ease of memorization, "Chuo" and "bird" should be added to the selected multi-stroke components and grouped by their first stroke: "Chuo" is merged with the dot and "bird" with the left-falling stroke, each coded with a punctuation mark. The mapping between multi-stroke components, basic strokes and letters or punctuation marks is then:
a = fish   b = Epileptic   c = Lv   d = Rui
e = standing grain   f = Rolling   g = Http   h = fire
i = worm   j = Jin   k = mouth   l = Si
m = wood   n = woman   o = day   p = Fu
q = month   r = Ren   s = stone   t = soil
u = mountain   v = [glyph not reproduced]   w = king   x = Xin
y = Yan   z = [component shown only as an image in the original], fold   ";" = dot, Chuo
"," = horizontal   "." = vertical   "/" = left-falling, bird
7. The optimized Chinese character code input method according to claim 1, characterized in that the code-taking rules of the shape code may alternatively be defined as follows. For a single character, take the codes of the first and last basic components in stroke order; when it has only one basic component, take only that component's code. A compound character is divided in two according to its overall structure: the part written first is called the head (first part), and the part written afterwards is called the remainder (rear part). When the head is a radical and that radical has two or more basic components, take the codes of the first and last basic components of the head in stroke order. For all other characters, take the code of the first basic component of the head in stroke order, followed by the code of the first basic component of the remainder.
8. The optimized Chinese character code input method according to claim 1, characterized in that: brevity codes are provided for frequently used characters; the brevity code of a commonly used character consists of the first 1, 2 or 3 codes of its full code, followed by one press of the space bar.
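The construction in claim 8 amounts to taking prefixes of the full code and appending a space. A minimal sketch follows; the function name `brevity_codes` and the sample code "abcd" are hypothetical, and the sketch assumes that a brevity code must be strictly shorter than the full code.

```python
from typing import List


def brevity_codes(full_code: str) -> List[str]:
    """Candidate brevity codes: the first 1, 2 or 3 codes plus a space.

    Assumption: a brevity code is strictly shorter than the full code.
    """
    return [full_code[:n] + " " for n in (1, 2, 3) if n < len(full_code)]


candidates = brevity_codes("abcd")
```

Which of these candidates is actually assigned to a given character depends on the code table of the input method, which the claim does not spell out.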
9. The optimized Chinese character code input method according to claim 1, characterized in that the steps of word input are:
for two-character words, enter the initial and final codes of each character in order;
for three-character words, enter the initial code of each character in order, then pad with a space;
for words of four or more characters, enter the initial codes of the first three characters and of the last character in order.
CNA200710305327XA 2007-12-25 2007-12-25 Optimized Chinese character code input method Pending CN101470535A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA200710305327XA CN101470535A (en) 2007-12-25 2007-12-25 Optimized Chinese character code input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA200710305327XA CN101470535A (en) 2007-12-25 2007-12-25 Optimized Chinese character code input method

Publications (1)

Publication Number Publication Date
CN101470535A true CN101470535A (en) 2009-07-01

Family

ID=40828049

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA200710305327XA Pending CN101470535A (en) 2007-12-25 2007-12-25 Optimized Chinese character code input method

Country Status (1)

Country Link
CN (1) CN101470535A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838391A (en) * 2012-11-26 2014-06-04 王治阳 Echoism Chinese character code inputting method
CN103838394A (en) * 2014-03-19 2014-06-04 马前玲 Chinese character input method based on keyboard

Similar Documents

Publication Publication Date Title
CN103616960A (en) Six vowel binary syllabification input method
CN102053719B (en) Input method for Chinese characters
CN101751134B (en) Right upper left Chinese character input method
CN101470535A (en) Optimized Chinese character code input method
CN102799282A (en) Stroke etymon holographic code Chinese character input method
CN102073383A (en) Initial component pinyin input method
CN103207685A (en) T-shaped Chinese character code input method
CN103207684A (en) Phonemic letter double-input method
CN101430604A (en) Chinese character code input method
CN105302330A (en) Combined phonetic and stroke type main and auxiliary code Chinese character and word and phrase coding input method and keyboard adopting method
CN102023718A (en) Initial and final consonant and stroke primary and secondary radical input method
CN101504572A (en) Perfect Chinese character code input method
CN102023717A (en) Three-five initial-subsequent phonetic code and keyboard thereof
CN101561713A (en) Method for inputting standard Chinese character code
CN103777771B (en) Easily prompt speed records serial input method
CN101571750A (en) Standard Chinese character code input method
CN103941882A (en) T-shaped Chinese character code input method
CN106708284A (en) Twenty-component Chinese character code input method
CN102073382A (en) Stroke, main and auxiliary radical input method
CN103838389A (en) Tail point removing Chinese character input method
CN101706685A (en) Chinese character input method
CN107066113A (en) The code inputting method of 20 part individual character two
CN103838391A (en) Echoism Chinese character code inputting method
CN103616961A (en) Phoneme T-shaped Chinese character code input method
CN100456214C (en) Chinese document quick-speed input processing technology and keyboard thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090701