CN1239242A - Phonetic letters method for typing-in phrases - Google Patents

Phonetic letters method for typing-in phrases Download PDF

Info

Publication number
CN1239242A
CN1239242A CN 99104227 CN99104227A CN1239242A CN 1239242 A CN1239242 A CN 1239242A CN 99104227 CN99104227 CN 99104227 CN 99104227 A CN99104227 A CN 99104227A CN 1239242 A CN1239242 A CN 1239242A
Authority
CN
China
Prior art keywords
code
chinese
character
compound vowel
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 99104227
Other languages
Chinese (zh)
Inventor
钟明华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN 99104227 priority Critical patent/CN1239242A/en
Publication of CN1239242A publication Critical patent/CN1239242A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A Chinese-character input system features that phonetic letters are used as primary tying approach, the phonetic letters are use in conjuction with shape, and such techniques are used as 3D locating index, dividing lenger code into shorter segments, architectural recombination and high-frequency word or phrase first. It contains 72000 words (phrases) by itself. Its advantages are short code length, low duplicate rate, easy mastering it, high input speed and high universality.

Description

Phonetic letters method for typing-in phrases
The present invention is a kind of Chinese character input scheme of computer, is under the jurisdiction of the computer utility category, is mainly used in various computing machine Chinese operating systems.
At present, Chinese character pinyin input method (comprising spelling, simplicity, Two bors d's oeuveres double-tone, intelligence phonetic letter etc.) is the very wide Chinese-character input scheme of a kind of range of application, almost each computing machine all is equipped with the spelling input method scheme, its user occupied China computer user near half.These spelling input methods (especially intelligence phonetic letter input method) are based on people's Chinese phonetic alphabet knowledge mostly, can use spelling, simplicity simultaneously, mix Chinese-character input schemes such as assembly, doubles, have characteristics such as flexible, efficient, but in actual use, there is following drawback again to some extent in they:
1, must possess phonetic transcriptions of Chinese characters knowledge accurately, as long as Chinese-character pronunciation is inaccurate, this word promptly can't hit.
2, repeated code is on the high side, has how many unisonance words how many repeated codes are just arranged often.
3, the whole sentence input of intelligence phonetic letter accuracy rate is 95% only, must return again when makeing mistakes word for word and revise.
4, Two bors d's oeuveres double-tone input method is incompatible very widely with use for " doubles " scheme of intelligence phonetic letter, is difficult for being accepted by vast Two bors d's oeuveres double-tone user.
5, input speed is slower.At present all spelling input methods all adopt establishment more than four yards or four yards, exist that coding is too fat to move, complex operation, the slow defective of input, give people study, use and bring certain difficulty.
Purpose of the present invention is to overcome above-mentioned drawback, develops simplyr, more practical, more efficient, and the Chinese-character input scheme of complete compatible Two bors d's oeuveres double-tone input method is to promote the development of China's computing machine cause.
The present invention is a kind of based on phonetic, with font is auxilliary, the Chinese characters keyboard input system of sound shape combination, it is visual in image that it draws all-phonetic input method, the advantage that is easy to learn and use, with the golygram initial consonant in the single-letter key replacement phonetic, simple or compound vowel of a Chinese syllable, with the complicated structure in the font, the integrated use three-dimensional localization techniques, the code length partitioning technique, classification recombinant technique and priority of high frequency technology, by the specific character root of keyboard plan of establishment, the word coding method scheme, the dictionary plan of establishment, operation input scheme and compatible scheme are formed, it is short to have coding, study easily, easy and simple to handle, input is quick, characteristics such as easily difficult integrated finally reach the purpose of simple and fast.
One, technical essential
1, three-dimensional localization.
Three-dimensional localization of the present invention mainly is meant the technology of required individual character being carried out three-dimensional retrieval by window by three different directions.It has that retrieval is quick, accurate positioning, brief characteristics of high efficiency.
Be applied to three-dimensional localization retrieval technique of the present invention, its three-dimensional search direction is: first radical of the initial consonant of phonetic transcriptions of Chinese characters, simple or compound vowel of a Chinese syllable and the font first stroke of a Chinese character, be called for short: initial consonant, simple or compound vowel of a Chinese syllable, prefix.
Each dimension retrieval direction of three-dimensional localization retrieval technique of the present invention is divided into 26 different section levels, to guarantee the accurate and low repetition rate of coding of result for retrieval.
Another characteristics of three-dimensional localization retrieval technique are exactly the triple bond location, promptly only need three keystrokes just can to retrieve out by required individual character.The input method of every employing three-dimensional localization retrieval technique, its code element setting are the trigram establishment, and its individual character maximum code length is 3, and all available triple bond retrieval of 6763 Chinese characters of GB one secondary character library draws, and the repetition rate of coding is extremely low.
We can say: the three-dimensional localization retrieval technique is science in the keyboard Chinese-character input method, the most brief individual character retrieval technique.
2, the code length subregion.
Be applied to code length partitioning technique of the present invention, be characterized in: individual character and phrase are distributed in the coding region of different Baud Lengths, thereby are fundamentally avoiding between the words drawback of repeated code, shielding mutually.
With regard to the present invention, it is characterized in that:
1., trigram one word, four yard one speech: the complete encoding setting of individual character in the coding region of 3 code elements, phrase
Encoding setting is in the coding region of 4 code elements fully.
2., the coding region of 1 code element distributes to the one-level brevity code and the one-level phrase uses jointly, the volume of 2 code elements
The sign indicating number region allocation uses for secondary brevity code and secondary phrase jointly.
3., the word coding method zone was both relatively independent, complemented one another again, and input need not the button switching between the words.
3, the classification reorganization.
The classification recombinant technique is the original creation technology, has following three characteristics:
1., coding classification.According to three-dimensional localization techniques, the present invention coding be divided into three kinds dissimilar: sound sign indicating number, rhythm
Sign indicating number and font code, and each type coding is divided into two kinds of ranks, that is: specific code with represent yard;
2., structural rearrangement.Code element dissimilar, different stage is carried out flexible combination, and formation has different merits
The input scheme structure that can, can adapt to the different levels needs;
3., integrated.Be equipped with the different structure input scheme and merge among common input system, each input in the system
Do not need button to switch between the scheme, have common one-level brevity code, common secondary brevity code, common
Dictionary.Continuity between each input scheme of maintenance system, form a kind of by Yi Jinan, by slow and fast,
Simplified and traditionally take into account, incremental academic environment and operation system.
The range of application of structure rating recombinant technique is extremely wide, as long as redefine code element key position corresponding relation, can generate the brand-new phonetic phrase input scheme of a cover, as intelligent doubles input method, natural double-spelling Chinese character input method etc. immediately.
4, priority of high frequency.
Priority of high frequency promptly is exactly in encode Chinese characters for computer is arranged, and individual character or phrase commonly used are placed on the foremost, ensures that thus everyday character and everyday words can hit indiscriminately.
The priority of high frequency technology has dynamic frequency and static frequency modulation dual mode.What the present invention adopted is static mode of frequency regulation.
According to the priority of high frequency principle, in phrase repeated code of the present invention was arranged, two words were placed on the foremost, and three words are placed on time front, and four words are placed on the 3rd ..., the rest may be inferred in other phrase ordering.
Two, keyboard plan
1, key bit symbols scheme
The keypad code element plan is meant the corresponding scheme of input method code element and computer keyboard position.
Code element is to form the base unit of encode Chinese characters for computer.In encoding scheme, each Chinese character can be regarded as and be made up of different code elements.Principle according to three-dimensional localization techniques and classification reorganization, the present invention is divided into initial consonant, simple or compound vowel of a Chinese syllable, these three kinds of different types of font with its code element, 210 of concrete code elements are set, 24 of initial consonants (comprising zero initial) wherein, 33 of simple or compound vowel of a Chinese syllable, 153 of fonts, and according to feature give these code elements with represent the sign indicating number, with its body key position of corresponding computer keyboard.
1., computer keyboard is divided into five base regions and a special case from " A " to " Z " these 26 key positions
The district;
2., first base region comprises " G, F, D, S, A " five key positions, and area code is " 1 ", each key
The position area code corresponds to " 11,12,13,14,15 " respectively;
3., second base region comprises " H, J, K, L, M " five key positions, and area code is " 2 ", each key
The position area code corresponds to " 21,22,23,24,25 " respectively;
4., the 3rd base region comprises " T, R, E, W, Q " five key positions, and area code is " 3 ", each key
The position area code corresponds to " 31,32,33,34,35 " respectively;
5., the 4th base region comprises " Y, U, I, O, P " five key positions, and area code is " 4 ", each key
The position area code corresponds to " 41,42,43,44,45 " respectively;
6., the 5th base region comprises " N, B, V, C, X " five key positions, and area code is " 5 ", each key
The position area code corresponds to " 51,52,53,54,55 " respectively;
7., " Z " key position is " special case district ", and area code is " 60 ".
The correspondence position of concrete code element of this input method and computer key position is seen accompanying drawing 1.
Code element of the present invention as can be seen exists following corresponding rule with the computer key position from accompanying drawing 1:
1., single-letter initial consonant, single-letter simple or compound vowel of a Chinese syllable are corresponding to respectively its alphabetical key position, place;
2., golygram initial consonant (zh, ch, sh) and zero initial are located at the key position at single-letter simple or compound vowel of a Chinese syllable place;
3., the corresponding relation of golygram simple or compound vowel of a Chinese syllable and key position is:
(1), the golygram simple or compound vowel of a Chinese syllable of " a " beginning is located at " 1 " district;
(2), the golygram simple or compound vowel of a Chinese syllable of " i " beginning is located at " N " " B " key in " 2 " district and " 5 " district
The position;
(3), the golygram simple or compound vowel of a Chinese syllable of " e " beginning is located at " 3 " district;
(4), the golygram simple or compound vowel of a Chinese syllable of " o " beginning is located at " 4 " district;
(5), " u " beginning the golygram simple or compound vowel of a Chinese syllable be located at " 5 " district " V " " C " " X " key position and
The special case district.
4., the corresponding relation of font code element and key position is:
(1), the radical of the horizontal stroke of first stroke of a Chinese character is located at " 1 " district;
(2), the perpendicular radical of drawing the first stroke of a Chinese character is located at " 2 " district;
(3), cast aside the radical of drawing the first stroke of a Chinese character and be located at " 3 " district;
(4), press down the radical of drawing the first stroke of a Chinese character and be located at " 4 " district;
(5), the radical of the folding stroke first stroke of a Chinese character is located at " 5 " district;
(6), seven radicals of " Rolling " " very little " " " " car " " power " " several " " skin " are located at
The special case district.
5., represent the sign indicating number and the corresponding relation of key position to be:
(1), on behalf of sign indicating number, initial consonant, simple or compound vowel of a Chinese syllable be located on first letters case of these phonetics;
(2), on behalf of sign indicating number, font be located on first stroke of key position of its radical first stroke of a Chinese character;
(3), on behalf of sign indicating number, the font in " special case district " be located on " N " key position.
6., inquiry wildcard key is " * " key, and any coding all can be inquired about or wildcard by this key.
2, the classification reorganization scheme
The present invention recombinates to the different stage code element flexibly according to the different levels needs, forms the coding structure of three kinds of different levels:
1., base level---simplicity letters method for typing-in phrases: sign indicating number is organic to be constituted by representing fully.Be characterized in: be based on
Most basic phonetic transcriptions of Chinese characters knowledge, easy to learn, cross the threshold very easily, but repeated code is on the high side.
2., popularize level---Two bors d's oeuveres letters method for typing-in phrases popular edition (popularizing Two bors d's oeuveres): by initial consonant specific code, simple or compound vowel of a Chinese syllable tool
The organic formation of sign indicating number represented in body sign indicating number and font.Be characterized in: be input as the master with phrase, simple and fast, suitable
Answer general user's literal input needs.
3., professional---Two bors d's oeuveres letters method for typing-in phrases professional version (accurately Two bors d's oeuveres): constitute by specific code is organic fully.
Be characterized in: take as the leading factor with phrase, based on individual character, the words combination, it is quick to be fit to the professional
Typing needs.
In this input system, above-mentioned three kinds of input methods are merged in one, have common one-level brevity code, common secondary brevity code and common dictionary, do not need button to switch between each input method, forms one by reach slowly soon, by Yi Jinan, simplified and traditionally take into account, incremental organic whole.
3, the word coding method scheme
The present invention determines the word coding method scheme of oneself according to three-dimensional localization, code length subregion, classification reorganization, four principles of priority of high frequency.
Code symbols: ABCDEFGHIJKLMNOPQRSTUVWXYZ
Select code element: 1234567890
Wildcard inquiry code: *
Individual character maximum code length: 3
Phrase code length: 4
Code identification:
The words sign One-level brevity code: y secondary brevity code: e all-key: q two words: ce2 three words: ce3 four words and the above speech of four words: ca4
The ordering sign Positive sequence: the p backward: n does not sort: 0
The individual character sequence number Span: 1~15 ignores: 0
Sound shape sign The sound sign indicating number: s rhythm sign indicating number: the y font code: x ignores: 0
The code element sequence number Span: 1~4 ignores the code element sequence number and only by its key position: 0
Coding expression:
[words sign]=[ordering sign] [individual character sequence number] [sound shape sign] [code element sequence number]+... [ordering sign] [individual character sequence number] [sound shape sign] [code element sequence number] }
Coding rule:
Coding rule of the present invention is divided into single character code and phrase coding two part, totally 6 rules:
1., single character code
The single character code scheme is divided into one-level brevity code, secondary brevity code, 3 rules of all-key:
(1), one-level brevity code.Only there is the coding of a code element to be referred to as the one-level brevity code, the compatible intelligence of one-level brevity code of the present invention
The energy spelling input method, coding rule is:
One-level brevity code=initial consonant or one-level brevity code=simple or compound vowel of a Chinese syllable
That is: get first letter in the individual character phonetic initial consonant or first letter in the simple or compound vowel of a Chinese syllable.
Expression formula is: y=p0s1 or: y=p0y2
One-level brevity code of the present invention has 26:
Individual g sends out this a of s of d institute of f
But with h with regard to j k l m
I play q by w his t people r and e
Having y to produce u is that i Europe o criticizes p
Your n not b person v from the little x of c
At z
(2), secondary brevity code.The coding of being made up of two code elements is referred to as the secondary brevity code, and coding rule is:
Secondary brevity code=initial consonant+simple or compound vowel of a Chinese syllable
That is: get sound sign indicating number and rhythm sign indicating number in its individual character phonetic.
Expression formula is: e=p080+n0y0
The present invention is provided with 434 of secondary brevity codes, specifically sees accompanying drawing 2:
In the accompanying drawing 2, single character code adds this word column code element by this word code element of being expert at.
(3), all-key.All-key is meant by the individual character of three code element establishments and encodes fully.Each Chinese character all has it to encode fully, and its coding system convention is:
Individual character is encoded=initial consonant+simple or compound vowel of a Chinese syllable+prefix fully
That is: get first radical of positive sequence in initial consonant, simple or compound vowel of a Chinese syllable and the font in the individual character phonetic.
Expression formula is: q=p0s0+p0y0+p1x0
Because structure difference all in each input method in the native system, its complete coding rule is also slightly different:
Simplicity letters method for typing-in phrases individual character all-key rule is:
Individual character encodes fully=and on behalf of sign indicating number+simple or compound vowel of a Chinese syllable, initial consonant represent sign indicating number+prefix to represent sign indicating number
Expression formula is: q=p0s1+p0y1+p0x1
Two bors d's oeuveres letters method for typing-in phrases popular edition individual character all-key is planned to:
Individual character encodes fully=and initial consonant specific code+simple or compound vowel of a Chinese syllable specific code+prefix represents sign indicating number
Expression formula is: q=p0s0+p0y0+p0x1
Two bors d's oeuveres letters method for typing-in phrases professional version individual character all-key rule is:
Individual character is encoded=initial consonant specific code+simple or compound vowel of a Chinese syllable specific code+prefix specific code fully
Expression formula is: q=p0s0+p0y0+p0x0
2., phrase coding
According to the principle of code length subregion, phrase coding of the present invention all is distributed in the scope of four code elements, and it is divided into 3 rules of the above speech of two words, three words, four words and four words.
(1), two words.
Two word coding method rules are:
Two words=lead-in initial consonant+lead-in simple or compound vowel of a Chinese syllable+secondary word initial consonant+secondary word simple or compound vowel of a Chinese syllable.
That is: the initial consonant and the simple or compound vowel of a Chinese syllable of each word got successively in two words.
Expression formula is:
ce2=p1s0+p1y0+p2s0+p2y0
(2), three words.
Three word coding method rules are:
Three words=lead-in initial consonant+secondary word initial consonant+last word initial consonant+last word simple or compound vowel of a Chinese syllable
That is: the initial consonant of each word got successively in three words, adds the simple or compound vowel of a Chinese syllable of last word again.
Expression formula is:
ce3=p1s0+p2s0+p3s0+p3y0
(3), four words and the above speech of four words.
The above Chinese word coding rule of four words and four words is:
The above speech of four words and four words=lead-in initial consonant+secondary word initial consonant+the 3rd word initial consonant+last word initial consonant
That is: four words and the above speech of four words get successively head, inferior, three, the initial consonant of last word.
Expression formula is:
Ca4=p1s0+p2s0+p3s0+n1s04, dictionary plan of establishment dictionary is an important component part of the present invention, it has three Special Significance: 1., improve the text input efficiency; 2., reduce the literal input error; 3., exempt the restriction of individual character repeated code.Dictionary setting of the present invention possesses following properties: 1., rich.Dictionary setting of the present invention is a source with the modern Chinese dictionary, other spoken language of incorporating things of diverse nature,
Slang, common-use words, 72000 of the total clauses and subclauses of dictionary, its sum occupy at present each input method dictionary it
First.2., practicality.Dictionary of the present invention is except including each dictionary, dictionary, the contained standard phrase of dictionary, simultaneously
Include the various non-standard phrases that in practical operation, run into, ensure the phrase coverage rate height of general manuscript
Reach more than 95%.3., novelty.The present invention includes various term words popular in the modern life, to adapt to modern society
The needs of practical operation.According to the classification recombinant technique, this input system dictionary is divided into three kinds of different ranks: 1., and the one-level phrase.It is the one-level phrase that this input system is provided with 130 two words commonly used, and it is encoded to this
First code element that phrase is encoded fully, each 5 of the one-level phrases of each same symbol are respectively with heavy
Coding mode places after the one-level brevity code individual character.2., secondary phrase.It is the secondary phrase that this input system is provided with 2100 two words commonly used, and it is encoded to
The head that this phrase is encoded fully, inferior code element, the secondary phrase of each same symbol each 5, respectively with
The repeated code mode places after the secondary brevity code individual character.3., all-key phrase, i.e. phrase of encoding fully.In this input system, each bar phrase all has it complete
Coding.Wherein two words are 45000,11000 of three words, four words and more than four words
16000 of speech.
5, the input operation scheme
1., individual character input:
(1), one-level brevity code:, click space bar again by hitting respective symbol key position once;
(2), secondary brevity code: by hitting head, inferior code element correspondent button position, click space bar more successively;
(3), all-key: by hitting head, inferior, the first correspondent button of last code position, click space bar more successively.
Wherein, one-level brevity code, secondary brevity code do not have repeated code.All-key input as repeated code occurs, by space bar then lead-in navigate to the screen cursor position automatically; Importing other individual character as need then selects by numerical key.
2., phrase input:
(1), I and II phrase:, press numerical key again and select successively by hitting phrase code element correspondent button position;
(2), the all-key phrase: four yard one speech, successively by hitting phrase code element correspondent button position;
(3), the phrase repeated code is selected: navigate to the screen cursor position automatically by the then the first phrase of space bar; Importing other phrase as need then selects by numerical key.
3., input in full
(1), phrase is input as the master, and individual character is input as auxilliary.
(2), in the phrase input,, be auxilliary with three words and the above speech of four words based on two words and four words.The coverage rate of the present invention's two words and four words is all up to more than 96%.
(3), as the auxiliary individual character input that replenishes part, based on one-level brevity code, secondary brevity code, be input as auxilliary with all-key as far as possible.In actual use, the frequency of utilization of one-level brevity code, secondary brevity code is the highest, and they all do not have repeated code simultaneously, help improving text input speed.
6, compatible scheme
At present, Chinese character pinyin input method (comprising all-phonetic input method, simple phoneticizing (lu's Simple Phoneticizing) input method, Two bors d's oeuveres double-tone input method, intelligence phonetic letter input method) is one of inputting method that China's range of application is the widest, practicality is the strongest, computing machine more than 90% all is equipped with spelling input method, and the computer operation person more than 40% is using spelling input method.The present invention is as a kind of brand-new input method, on the basis of comprehensively adhering to Chinese-character canonical phonetic, as much as possible various spelling input methods (comprise to wherein classic intelligence phonetic letter input method, Two bors d's oeuveres double-tone input method) are carried out compatibility and accept, make every effort to make vast spelling input method user under the prerequisite that only needs study a little, just can grasp the present invention rapidly.
Compare with various spelling input methods, it is specific as follows that code element of the present invention is provided with situation:
1., compare, have identical spelling key position scheme with spelling, simplicity, intelligence phonetic letter input method;
2., compare with Two bors d's oeuveres double-tone input method, code element key of the present invention position scheme is basic identical, but " ing " simple or compound vowel of a Chinese syllable by "; " bond shifting is to " M " key position, " ie " simple or compound vowel of a Chinese syllable by " M " bond shifting to " Q " key position.
3., compare with 8 " font description " sign indicating number of intelligence phonetic letter, " graphemic code " of the present invention is increased to 153, is divided into the radical that Chinese character is expressed in 26 key positions, and be more visual in image; They are summed up as 5 of " one " " Shu " " Pie " " Dian " " second " according to the difference of the first stroke of a Chinese character again and represent sign indicating number, and are more easy to learn.
4., compare with the input method of the single character of intelligence phonetic letter input method " deciding word with speech ", what the present invention adopted is the three-dimensional localization scheme, and trigram one word is more convenient quick.
5., the present invention need not " syllable-dividing mark ", returns the drawback of modification after the also not whole sentence input.
According to above-mentioned principle, the spelling input method user just can grasp immediately, use the present invention with study hardly, and easier to be faster stronger.
1., for intelligence phonetic letter input method (comprising spelling, simplicity) user: only need to key in first letter of initial consonant, simple or compound vowel of a Chinese syllable, key in the first stroke of this word first stroke of a Chinese character again, can hit required Chinese character;
2., for Two bors d's oeuveres double-tone input method user: only need to key in initial consonant, simple or compound vowel of a Chinese syllable correspondent button position, key in this word first stroke of a Chinese character first stroke (or first radical of this word) again, can hit required Chinese character; Key in four code elements according to the phrase coding rule, can hit required phrase.
3., code element wildcard query key of the present invention by "? " bond shifting is to " * " key position, and each spelling input method user all available " * " wildcard key is inquired about, replaced.
In addition, because " font code " among the present invention mainly is meant " prefix " part in the individual character all-key, therefore, in actual use, as long as we adhere to being input as the master with a secondary brevity code, or, just can avoid " font code " fully and pin down with the input of phrase input replacement individual character; Even when the input of individual character all-key was used in last resort, also available " * " wildcard key replaced " prefix ", so, any spelling input method user all can grasp, use this input method immediately.
Inner structure of the present invention is similar to intelligence phonetic letter input method, but all simplicity, Two bors d's oeuveres (doubles) input, all graphemic code arranged, but the present invention to be not equal to be exactly intelligence phonetic letter input method, it is compared with intelligence phonetic letter input method, characteristics are respectively arranged, have both advantages and disadvantages, wherein have many obvious differences:
The contrast project Intelligence phonetic letter input method Phonetic letters method for typing-in phrases
Number of symbols ???????65 ?????210
Application technology The intelligence perception Three-dimensional localization code length subregion
The accurate positioning degree Accurately Accurately
The individual character maximum code length Five yards Trigram
The individual character repeated code Many Few
Dictionary clauses and subclauses (comprising morpheme) Article 60,000, Article 80,000,
Input style The spelling simplicity is mixed and is pieced together doubles The accurate Two bors d's oeuveres of simplicity Two bors d's oeuveres
Concern between each input method Independent separately slightly relevant Incremental combining together
Whether need separator Need Do not need
Whole sentence correct rate for input ???????95% ?????100%
The mode of correcting mistakes Return modification after the whole sentence input Be that mistake promptly changes
This shows that the present invention compares with intelligence phonetic letter input method, except the intelligent perception technology aspect was slightly too late, all other indexs were all won comprehensively, and easier to be faster stronger, raise the efficiency to reach more than 20%.
Four, special advantages
The present invention is a standard, flexible, simple and direct Chinese character entering technique, be based on the most basic Chinese language knowledge and computer code potential, concentrate the strong point of numerous input method of Chinese character, formation is input as the master, is input as auxilliary individual style and advantage with individual character with phrase, build one from the superficial to the deep, study atmosphere from slow to fast and the integral input system that can adapt to the different levels needs.It is compared with various on the market input methods at present, has advantages such as easy, simple to operation, the complete compatible Two bors d's oeuveres double-tone input method of study, market outlook be wide.
1, study is easy:
1., coding is short.The present invention takes the three-dimensional localization retrieval technique, and each Chinese character just can hit with triple bond at most in the GB one secondary character library, easily is people institute learning and mastering;
2., code symbols of the present invention is benchmark with the phonetic and the radical of China's Chinese-character canonical, and is visual in image, meets the spelling rules of Chinese character;
3., keypad code element plan distribution standard of the present invention, each initial consonant, simple or compound vowel of a Chinese syllable, font all have zone, correspondent button position, are provided with even, reasonable, regular strong;
4., trigram one word of the present invention adopts the simplest and the most direct simplicity, Two bors d's oeuveres mode, and the user just can grasp the present invention immediately with study hardly;
5., the present invention abandons the blank character in the whole sentence input fully, alleviates user's learning burden and manipulation strength significantly.
2, input is quick:
1., keystroke is few.Trigram one word of the present invention, add the secondary brevity code, the secondary phrase that self have, huge dictionary (not comprising morpheme) with 72,000 clauses and subclauses, making continuous text input stroke reach minimum degree, is the inputting method that China's current encoder is the shortest, touch potential is minimum, input speed is the fastest;
2., repeated code is low.(1), the code length subregion: individual character of the present invention and phrase are separately positioned on the coding region of different length, and the phrase input is not subjected to pining down of individual character input fully, has fundamentally abandoned the drawback of mutual repeated code between the words; (2), the combination of sound shape: the present invention's one secondary brevity code does not have repeated code, and coding then has the in-line sign indicating number to differentiate sign indicating number as it fully, and its phonetically similar word repetition rate of coding is significantly reduced.
3., the present invention is input as the master with phrase, and individual character is input as auxilliary, and the dictionary setting is a source with the modern Chinese dictionary, incorporate things of diverse nature other spoken language, slang, the couplet written on scrolls and hung on the pillars of a hall, common-use words.Except the standard phrase, include the required various non-standard phrase of practical operation simultaneously, the phrase coverage rate is up to more than 95%, and manuscript almost can use phrase to realize input fast fully in the whole text.
4., accurate positioning.The present invention takes the promptly wrong mode of correcting mistakes that promptly changes, need not resemble the intelligent input method and after whole sentence input, return modification again, but required words is directly navigated on the screen cursor position, use the impassioned and forceful sense of making us rising spontaneously " 3,000 chis that fly down straightly, as if the Silver River were falling from Heaven ".
3, compatible present various spelling input methods
The present invention deeply and carefully analyzes present various spelling input method inner structures; on spelling, simplicity, Two bors d's oeuveres double-tone three big input method angles, do a large amount of compatibilities and accepted work; the knowledge production of each spelling input method user long-term accumulation is adequately protected; almost can expertly use the present invention under the prerequisite need not relearning, and easier to be faster stronger.
Easier---trigram one word, individual character need not whole phonetic alphabet typings, need not to decide word with speech, and the words and phrases input does not have
Need blank character;
Faster---be that mistake promptly changes, the continuous typing of several available phrases of manuscript in the whole text;
Stronger---the dictionary capacity (not comprising monosyllabic word and morpheme) of 70,000 2 thousand clauses and subclauses, raise the efficiency 20%
More than.
4, market outlook are wide:
The trigram establishment technique is adopted in the individual character input among the present invention, the kernel exquisiteness, and committed memory is little, travelling speed is fast, few to hardware requirement, except can easily running on 80386 above PC types, can also install, be used on various electronic notebook, computer learning machine, the digital set-top box.
The present invention and each Chinese operating platform combine together, support the good characteristic that each Chinese operating platform is intrinsic, guarantee that each application software is loading under the situation of the present invention and can normally move.
The present invention is the input method based on the sound typing, and its unique coding structure is very similar to the interface structure of massage voice reading module, voice typing module, and it has prepared solid foundation for functions such as follow-up massage voice reading, voice typings.
Can predict that along with constantly advancing of the wheel of history, computer input method will turn to multi-functional input mode by single input mode, by noiseless turn to sound, and easier to be faster stronger ...Every function of the present invention has met the inexorable trend of this historical development fully, and gains all first chance in this historical trend with great strength and vigour.We can say: it may be the inputting method that China has development prospect and practical value at present most, and it has vast market prospect, is containing huge social benefit and economic benefit, has immeasurable development space.
Its one seat that in numerous input method, wins oneself surely!
Five, the approach of realization
The present invention is as a kind of Chinese-character input scheme, and the realization that each Chinese operating platform is it provides effective instrument, as the Limd of UCDO S, and the Keytooo of TWAY.Input method generator in the simplified Chinese edition Windcws9x of the U.S. Microsoft company series be can yet be regarded as and realized one of a kind of mode of the present invention.By this input method generator, can generate have own individual character, consistent with Windows operating system style, and can give full play to the phonetic letters method for typing-in phrases of Windows operating system good characteristic.
Concrete steps are as follows:
1, create phonetic phrase input system code table source file
1., start Chinese Windows9x system, click START button, point to " program ", " annex ", click " board " again;
2., according to the coding rule that Windows9x series inputting method form and the present invention determine, set up with, TXT is the plain text code table source file of suffix:
(1), set up the phonetic letters method for typing-in phrases code table source file head that meets Chinese Windows9x standard input method:
[Description]
Name=phonetic phrase
MaxCodes=4
MaxElement=1
UsedCodes=abcdefghijklmnopqrstuvwxyz
WildChar=*
NumRules=3
[Rule]
ce2=p11+p12+p21+p22
ce3=p11+p21+p31+p32
ca4=p11+p21+p31+n11
[Text] (2), establishment phonetic letters method for typing-in phrases one-level brevity code source file: this a is b not ... (3), establishment phonetic letters method for typing-in phrases secondary brevity code source file: prick aa and grab ab ... (4), establishment simple phoneticizing (lu's Simple Phoneticizing) input method individual character all-key source file: peace oay presses oat ... (5), double-spelling Chinese character input method individual character all-key source file is popularized in establishment: oah Ah oan ... (6), work out accurate double-spelling Chinese character input method individual character all-key source file: like ose dust osf ... (7), establishment one-level phrase source file: certain y possibility k ... (8), establishment secondary phrase source file: the powerful qhx of love and esteem oada hobby oa ... (9), establishment phrase all-key source file: patriotic osgo love osfu ... agdm ajhm according to plan in accordance with regulations ... the patriotism agzy alfp that distributes according to work ... the ahrg of the agrj People's Republic of China (PRC) of the Chinese People's Liberation Army ... 3., merge, withdraw from, save as Windows system pycz.txt.2,1. entry sorts, and clicks START button, points to " program ", " annex ", clicks " input method generator " again; 2., click " entry ordering " button and open button, double-click " pycz.txt " code table source file; 3., click " ordering " button, when ordering finishes, press " determining " button.3, reject code table source file coding and repeat words 1., click START button, point to " program ", " annex ", click " board " again; 2., open Windows system pycz.txt code table source file; 3., check the code table source file, reject the words that repeats in the same coding; 4., preserve, withdraw from.3, create phonetic phrase input system 1., click START button, point to " program ", " annex ", click " input method generator " again.2., select " establishment input method " label, click " browsing ", select pycz.txt code table source file, insert input method information such as " phonetic phrases ", then click OK.
3., click " conversion " button, generate the codes table file of new pycz.mb.
4., click the Create button, insert version number and organization names.
5., click " user is given " option, click the Browse button again, select icon (.ico file), bitmap (.bmp) and the help file (.hlp file) oneself liked respectively.
6., the click OK button can generate a phonetic phrase input system file (pycz.ime) that has the own individual character of user, is consistent, also can gives full play to the various good characteristics of PWindows9x with PWindows9x Chinese edition style.
7., after the generation input method, whether system will point out and install.After selecting to install, system will install input method automatically.At this moment, newly-generated phonetic phrase input system is promptly added in the Chinese Windows9x system, and the operator just can use input method that this is newly-generated as the input method of using other prepackage.
(attached: the code table inverse conversion of the above-mentioned U.S. PWindows9x of Microsoft company input method generator, can be text with codes table file decompiling of the present invention, can comprehensively retrieve and examine radical setting of the present invention, word coding method, dictionary setting thus.)
(intact in full)

Claims (10)

1, a kind of computword input system, it is characterized in that with three-dimensional localization retrieval technique, code length partitioning technique, classification reorganization, priority of high frequency four directions surface technology be support, accepting scheme by the specific character root of keyboard plan of establishment, word coding method scheme, the dictionary plan of establishment, operation input scheme and compatibility forms, trigram one word, four yard one speech, the combination of sound shape has that coding is short, study is easy, easy and simple to handle, input is quick, the characteristics of accurate positioning.
2, the character root of keyboard scheme according to claim 1 is characterized in that:
1., code element is a standard with Chinese-character canonical phonetic and radical, and 200 of numbers are set;
2., to be divided into " 1,2,3,4,5 " five according to its initial consonant, simple or compound vowel of a Chinese syllable, radicals by which characters are arranged in traditional Chinese dictionaries feature big for code element
District and a special case district, G, F, D, S, the A key position of the corresponding computing machine of difference, H,
J, K, L, M key position, T, R, E, W, Q key position, Y, U, I, O,
P key position, N, B, V, C, X key position and Z key position.
3., single-letter initial consonant, single-letter simple or compound vowel of a Chinese syllable are corresponding to respectively its alphabetical key position, place;
4., golygram initial consonant (zh, ch, sh) and zero initial are located at the key position at single-letter simple or compound vowel of a Chinese syllable place;
5., " a " beginning golygram simple or compound vowel of a Chinese syllable and the corresponding computer keyboard of " horizontal stroke " first stroke of a Chinese character code element " 1 " district;
" i " beginning golygram simple or compound vowel of a Chinese syllable and the corresponding computer keyboard of " erecting " first stroke of a Chinese character code element " 2 " district;
" e " beginning golygram simple or compound vowel of a Chinese syllable and the corresponding computer keyboard of " left-falling stroke " first stroke of a Chinese character code element " 3 " district;
" o " beginning golygram simple or compound vowel of a Chinese syllable and the corresponding computer keyboard of " right-falling stroke " first stroke of a Chinese character code element " 4 " district;
" u " beginning golygram simple or compound vowel of a Chinese syllable and the corresponding computer keyboard of " folding " first stroke of a Chinese character code element " 5 " district;
" un " simple or compound vowel of a Chinese syllable and " Rolling " " very little " " " " car " " power " " skin " " several " code element are right
Answer computer keyboard special case district.
6., the wildcard query key is " * ".
3, the encoding scheme according to claim 1 is characterized in that:
1., encoding scheme is divided into single character code and phrase coding two big class, totally six rules;
The single character code rule is:
One-level brevity code=undisciplined prefix or one-level brevity code=simple or compound vowel of a Chinese syllable
Secondary brevity code=initial consonant+simple or compound vowel of a Chinese syllable
Coding=initial consonant+simple or compound vowel of a Chinese syllable+prefix fully
The phrase coding rule is:
Two words=lead-in initial consonant+lead-in simple or compound vowel of a Chinese syllable+secondary word initial consonant+secondary word simple or compound vowel of a Chinese syllable
Three words=lead-in initial consonant+secondary word initial consonant+last word initial consonant+last word simple or compound vowel of a Chinese syllable
The above speech of four words and four words=lead-in initial consonant+secondary word initial consonant+three word initial consonants+last word initial consonant
2., 26 of one-level brevity codes are set, 434 of secondary brevity codes are set.
4, the dictionary plan of establishment according to claim 1 is characterized in that:
1., the total clauses and subclauses of dictionary are greater than 72000;
2., two words clauses and subclauses are greater than 45000;
3., three words clauses and subclauses are greater than 11000;
4., the above entry order of four words and four words is greater than 16000.
5, the specification file according to claim 1 is characterized in that:
1., to the elaboration of claim 1 general situation of development;
2., to the elaboration of claim 1 functional characteristics;
3., to the elaboration of claim 1 technical scheme;
4., to the elaboration of claim 1 method of operating:
5., to the elaboration of claim 1 mount scheme;
6., to the elaboration of claim 1 upgrading scheme.
7., the single character code of claim 1 is arranged in classification;
8., the phrase coding of claim 1 is arranged in classification.
6, accept scheme according to the compatibility of claim 1, it is characterized in that:
1., compatibility is accepted Two bors d's oeuveres double-tone input method;
2., compatibility is accepted intelligence phonetic letter input method.
7, a kind of retrieval by window technology of computword is characterized in that:
1., three-dimensional search---from three different directions Chinese character is carried out the crossings on different level retrieval by window, each side
To being divided into 26 section levels according to computer keyboard A to Z key position again;
2., triple bond location---individual character retrieval maximum length is 3 code elements.
8, the computword retrieval scheme according to claim 7 is characterized in that:
1., the first dimension retrieval direction is the initial consonant of phonetic transcriptions of Chinese characters, and the second dimension retrieval direction is the rhythm of phonetic transcriptions of Chinese characters
Mother, third dimension retrieval direction is first radical of Hanzi structure;
2., determine concrete Chinese character by three code elements at most, 6763 Chinese characters of GB one secondary character library all can
Represent with three code elements.
9, a kind of technical scheme of dividing computer character input method word coding method length is characterized in that:
1., trigram one word, four yard one speech: the complete encoding setting of individual character in the zone of 3 code elements, phrase
Encoding setting is in the zone of 4 code elements fully;
2., the coding region of 1 key distributes to the one-level brevity code and the one-level phrase uses jointly, 2 key encode zones
Distribute to the secondary brevity code and the secondary phrase uses jointly;
3., the complete coding region of words was both relatively independent, complemented one another again, and the words input need not button and switches.
10, a kind of technical scheme of creating the computer character input method coding structure is characterized in that:
1., coding classification: the computer character input method code element is divided into sound sign indicating number, rhythm sign indicating number, three kinds of inhomogeneities of font code
Type, each type code element are divided into specific code again, represent two kinds of ranks of sign indicating number;
2., structural rearrangement: the code element flexible combination of dissimilar, different stage, form simplicity, popularize two
The encoding scheme of assembly, accurate three different levels of Two bors d's oeuveres;
3., integrated: the encoding scheme of different structure level is merged in common input system, has common
One-level brevity code, secondary brevity code and dictionary, need not button between each encoding scheme and switch, form
By Yi Jinan, by take into account slowly and soon, simplified and traditional, incremental academic environment and operation system.
CN 99104227 1999-04-26 1999-04-26 Phonetic letters method for typing-in phrases Pending CN1239242A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 99104227 CN1239242A (en) 1999-04-26 1999-04-26 Phonetic letters method for typing-in phrases

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 99104227 CN1239242A (en) 1999-04-26 1999-04-26 Phonetic letters method for typing-in phrases

Publications (1)

Publication Number Publication Date
CN1239242A true CN1239242A (en) 1999-12-22

Family

ID=5271573

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 99104227 Pending CN1239242A (en) 1999-04-26 1999-04-26 Phonetic letters method for typing-in phrases

Country Status (1)

Country Link
CN (1) CN1239242A (en)

Similar Documents

Publication Publication Date Title
CN1040276A (en) Simplified and complex character root Chinese character entering technique and keyboard thereof
CN1577229A (en) Method for inputting note string into computer and diction production, and computer and medium thereof
CN1239242A (en) Phonetic letters method for typing-in phrases
CN1048343C (en) Free combination code Chinese character input method and key board
CN1154502A (en) Method and device for ducation standardized inputting Chinese characters by five stroke
CN1241101C (en) Chinese syllable double reading scheme, Chinese keyboard and information input and processing method
CN1050914C (en) Lin code Chinese character input method
CN1358300A (en) OHAI technology user interface
CN1026924C (en) Chinese-character sound dissection encode and input method
CN1109287C (en) Chinese phrase enter method
CN1258037A (en) Chinese keyboard and Chinese-character phonetic code input method
CN1123819C (en) Chinese character key-position code input method for computer
CN1025896C (en) New concept Chinese character coding
CN1303504C (en) 'Letter' input-method for Chinese characters
CN1275732A (en) Chinese character keyboard input system and applied technology thereof
CN1081355C (en) Three-phonetic code Chinese character input method for computer and its keyboard
CN1110806A (en) Intelligence five-stroke double-spelling code letter-word chain type positioning association input method
CN1093654C (en) Structure code Chinese character input method and universal keyboard used thereof
CN1019527B (en) Character pixel input method and its keyboard
CN1220127C (en) 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters
CN1016008B (en) Intelligent processing system of words and phrases of man, xibe, mongol and tuo languages
CN1120408C (en) Chinese-character struture-pronunciation input method for computer
CN1713120A (en) English word root inputting method
CN1128371A (en) Chinese character-splitting coded method and its keyboard for computer
CN86107235A (en) Speech word binary coding input hanzi system and keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication