CN1053976C - Full and double phoneticizing combined type Chinese input method - Google Patents

Full and double phoneticizing combined type Chinese input method Download PDF

Info

Publication number
CN1053976C
CN1053976C CN95121307A CN95121307A CN1053976C CN 1053976 C CN1053976 C CN 1053976C CN 95121307 A CN95121307 A CN 95121307A CN 95121307 A CN95121307 A CN 95121307A CN 1053976 C CN1053976 C CN 1053976C
Authority
CN
China
Prior art keywords
spelling
character
chinese
input
syllable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN95121307A
Other languages
Chinese (zh)
Other versions
CN1152737A (en
Inventor
徐火辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN95121307A priority Critical patent/CN1053976C/en
Publication of CN1152737A publication Critical patent/CN1152737A/en
Application granted granted Critical
Publication of CN1053976C publication Critical patent/CN1053976C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Abstract

The present invention relates to a full and double-spelling combined type Chinese input method which makes a sound code input method taking a keyboard as a basic input device or a sound and shape code input method comprising a sound code make a user freely use the full and double-spelling code as desired in the way of combination to input Chinese without switching or using any identification key, and thereby, the traditional full-spelling input system and the double-spelling input system which are mutually excluded are elaborately coordinated and united, which opens up a new concept and a new way for inputting Chinese by the sound code.

Description

Full-spelling double-spelling is used the type Chinese character input method with
The present invention relates to computing machine input in Chinese technical field, particularly use keyboard mode, based on the Chinese character input method of the various sound sign indicating numbers of the Chinese phonetic alphabet, and the phonetic-stroke code or the shape sound sign indicating number Chinese character coding input method that comprise the sound sign indicating number.
The keyboard-type input is the main means of input characters on the computing machine.Input Chinese (Chinese character) can use Universal English keyboard, or specially designed keyboard special.Input Chinese need be to the Chinese words Chinese word coding on keyboard.Coding has font code and two kinds of citation forms of sound sign indicating number and based on these two kinds of citation forms, derivative various phonetic-stroke codes or shape sound sign indicating number.
In the sound sign indicating number Chinese character input method, the most generally based on the Scheme for the Chinese Phonetic Alphabet design various spelling input methods, their great majority are basic input equipment with Universal English keyboard.
In the prior art based on the sound sign indicating number input in Chinese of the Scheme for the Chinese Phonetic Alphabet, three kinds of basic coding forms are arranged.
First kind is so-called " spelling ", promptly directly uses the Scheme for the Chinese Phonetic Alphabet itself, perhaps only it is done some small changes, imports Chinese; The user impacts the key position by the symbol of Scheme for the Chinese Phonetic Alphabet regulation on keyboard, the input Chinese speech, computing machine is converted to Chinese character, speech or sentence to it, shows in the display prompts district, if repeated code is arranged, then select input with numerical key or cursor movement key or mouse etc.Use the Universal English keyboard input Pinyin that a special processing is arranged, promptly to alternative key of Chinese simple or compound vowel of a Chinese syllable ü regulation.General provision substitutes with v.
Second kind is so-called " simplicity ", and it is the initial consonant or the simple or compound vowel of a Chinese syllable that partly surpass a letter representation in the Scheme for the Chinese Phonetic Alphabet, and promptly the initial consonant of golygram or simple or compound vowel of a Chinese syllable substitute with a letter respectively.Use the alternative key of these regulations when input is Chinese on keyboard, can reduce stroke.For example, regulation substitutes simple or compound vowel of a Chinese syllable ing with alphabetical g, the stroke of this simple or compound vowel of a Chinese syllable can be reduced to a key from triple bond.Otherwise processing and spelling are similar.
The third is so-called " Two bors d's oeuveres ", it is all surpass the initial consonant or the simple or compound vowel of a Chinese syllable of a letter representation in the Scheme for the Chinese Phonetic Alphabet, all regulation represents with a letter that respectively make each Chinese syllable, can be expressed as regular biliteral form: a consonant character adds a rhythm alphabetic character.Do not have the independent syllable of initial consonant for minority,, can stipulate additional " zero initial " key position, make them be transformed into the biliteral form as ang (holding high) etc.
Comprise in the phonetic-stroke code or shape sound sign indicating number that pinyin syllable is encoded that various their Pinyin coding part also can be included into above-mentioned spelling, simplicity and three kinds of forms of Two bors d's oeuveres.The general Two bors d's oeuveres form that adopts as more popular in the market " natural code " input method, is a kind of phonetic-stroke code that comprises double spelling coding.Some phonetic-stroke code or shape sound sign indicating number only use the initial consonant of phonetic, perhaps only use the simple or compound vowel of a Chinese syllable of phonetic.
The input of sound sign indicating number develops into the system that comprises speech input and sentence inputting function also from simple word input system.Yet from the formal investigation of coding, these systems are all based on spelling, simplicity or Two bors d's oeuveres.
Therefore, in the input in Chinese field of sound sign indicating number or the combination of sound shape at present, spelling and Two bors d's oeuveres are two kinds of coding forms that application is the most general and prior aries.
No matter be at the pure tone sign indicating number or in phonetic-stroke code, spelling and Two bors d's oeuveres have relative merits separately.
The advantage of spelling is the most directly perceived, meets the relevant rules policy of national language literal work, and is unified with national knowledge background, unified with the teaching of Chinese pin yin of middle and primary schools Chinese language, so learnability and versatility are best.One of its major defect is that stroke is too many.Syllable at most will be with 6 letter representations in the Scheme for the Chinese Phonetic Alphabet.Phonetic chuang as " bed " word.
The advantage of Two bors d's oeuveres input is that stroke is few and regular, and it is unified with Chinese syllable of two letter representations, has significantly reduced stroke.One of its major defect is simple or compound vowel of a Chinese syllable and the initial consonant of representing owing to golygram, all will use to substitute the key position, therefore needs certain memory capacitance, and is also directly perceived not as spelling in form.Learning difficulty and forgetting rate all are higher than spelling.And, if promote the input in Chinese of computing machine Two bors d's oeuveres, investigate from pedagogy and psychology at primary school period, all must in the infantile psychology cognition, cause spelling and Two bors d's oeuveres obscuring to a certain degree, increase teaching difficulty and burden.
In order to remedy spelling and Two bors d's oeuveres these shortcomings separately, general calculation machine Chinese information processing system generally all is equipped with spelling, double-spelling Chinese character input method (also being equipped with some font code input method) simultaneously, makes the user can select a kind of use in these methods.
Yet, no matter select any phonetic code input method, it all has exclusivity and exclusiveness.Promptly under certain spelling (or Two bors d's oeuveres) input state, if switch without modes such as particular hot key or mouses, if difference key that perhaps need not be specific identifies the character string of leading input or the character string of follow-up input is the spelling coding or is double spelling coding, just can not directly use Two bors d's oeuveres (or spelling).
So if the user feels that under the spelling input state input is too slow, it is impossible wishing Direct Learning and using Two bors d's oeuveres.If the user under the Two bors d's oeuveres input state, does not remember some Two bors d's oeuveres and substitutes key, perhaps forget some alternative key, and wished directly to use spelling, also be impossible; And when the user produces between spelling coding and double spelling coding when obscuring, computing machine is correct searched targets words just, also needs user oneself to search alternative key position.
It is contemplated that thus, if can get up the double-spelling Chinese character input method and the all-phonetic input method coordinating and unifying, mode that need not any manual intervention is switched or is identified, just can make the user under with a kind of input state, freely optionally mix to use spelling coding and double spelling coding input Chinese speech, bear the identification conversion process work of robotization, realization input in Chinese by computer system, above-mentioned Two bors d's oeuveres and spelling shortcoming separately just can be overcome, and advantage just can be compatible.
The purpose of this invention is to provide a kind of computer keyboard input method, it can or use in the phonetic-stroke code or shape sound sign indicating number input mode of sound sign indicating number in sound sign indicating number input mode, the input method combination of prior art spelling coding and double spelling coding is got up with coordination, do not use other to use functional hot key, mouse, the program portrait is selected or the like the switching mode of any extra manual intervention, between spelling input state and Two bors d's oeuveres input state, do not carry out any switching, do not use the special identifier key of any difference spelling coding and double spelling coding yet, character string with the character string that identifies leading input or follow-up input is the spelling coding or is double spelling coding, just can make the user under with a kind of input state, freely optionally mix and use spelling and certain Two bors d's oeuveres input Chinese speech, automatically discern conversion by computer system, realize input in Chinese; In this input method, obscuring of Two bors d's oeuveres and spelling allows, and it does not influence normal and correct input, no longer is a kind of mistake therefore; In a word, it can freely optionally mix and use spelling and certain Two bors d's oeuveres under same state, impacts the combination in any sequence of spelling coding and double spelling coding.Express Chinese syllable.And combine with display.Realize input Chinese words, speech, phrase or sentence on computers, thereby overcome in the existing sound sign indicating number input technology.The spelling input is separated from each other the shortcoming separately that repulsion produces with the Two bors d's oeuveres input.The advantage of the learnability of compatible spelling and the high efficiency of Two bors d's oeuveres is significantly improved existing spelling coding and double spelling coding input technology.
In below narrating, if not otherwise specified, " spelling " this speech.Just be meant directly based on the Scheme for the Chinese Phonetic Alphabet, not to wherein golygram initial consonant and golygram simple or compound vowel of a Chinese syllable adopt the Pinyin coding input method of alternative key position.
Key of the present invention is in computer system the spelling coding to be coordinated mutually with double spelling coding.Purpose of the present invention can be by realizing by the following technical solutions:
Create a kind of full-spelling double-spelling and use the type Chinese character input method with.It is being in the phonetic code input method of basic input equipment or the phonetic-stroke code and form-sound code input method that comprises the sound sign indicating number with Universal English keyboard or other form keyboards, and the input mode combination and the coordination of spelling coding and double spelling coding are got up.By the automatic spelling and the free mixed sequence of double spelling coding that are received of identification and conversion of computer system, and combine the realization input in Chinese with display; The steps necessary of described input method comprises, at first is, directly uses the Scheme for the Chinese Phonetic Alphabet itself to set up a spelling key letter position; In addition, this method also comprises the steps, be exactly,
1) the definition double spelling coding substitutes key position or dedicated array of keys position.Be used for substituting or represent that initial consonant and the simple or compound vowel of a Chinese syllable that those are write with two or more letters, define method are in the Scheme for the Chinese Phonetic Alphabet, the key position character that is used for representing Chinese phonetic alphabet golygram initial consonant in the double spelling coding, be defined into spelling consonant key position character character set inequality on, promptly do not belong to character set { b, c, d, f, g, h, j, k, l, m, n, p, q, r, s, t, w, x, y, z};
2) the key position character that is used for representing the golygram simple or compound vowel of a Chinese syllable in the Chinese phonetic alphabet in the double spelling coding, be defined into first letter (except alphabetical ü) of spelling simple or compound vowel of a Chinese syllable character set inequality on, promptly do not belong to character set a, e, i, o, u}:
3) when character h being defined as the golygram simple or compound vowel of a Chinese syllable and substituting key, it can only substitute the simple or compound vowel of a Chinese syllable that can not occur after flat tongue consonant initial consonant z, c, the s in Chinese phonetic alphabet syllable;
4) the automatic identification represented spelling and the free mixed sequence of double spelling coding in key position of definition in this way of computer program realizes that Chinese full-spelling double-spelling uses input with.
More at large describe, described technical scheme should comprise;
1). according to prior art definition spelling key letter position;
2). the alternative key position or the dedicated array of keys position of definition double spelling coding, substitute or represent that those are in the Scheme for the Chinese Phonetic Alphabet, initial consonant and simple or compound vowel of a Chinese syllable with two or more letter representations, the principle of definition is, key position character being used for substituting or representing the golygram initial consonant of spelling coding in the double spelling coding is defined on the character set of not conflicting with the composite sequence of spelling consonant letter.The key position character that is used for substituting or represents the golygram simple or compound vowel of a Chinese syllable in the spelling coding in the double spelling coding, be defined on the character set of not conflicting with the composite sequence of first letter of spelling simple or compound vowel of a Chinese syllable, thereby can satisfy, with respect to Chinese syllable, in the independent assortment sequence of any possible spelling, double spelling coding character.Between the alternatives sequence of Two bors d's oeuveres and the character string of spelling, do not conflict mutually;
3). above-mentioned 1) with 2) and in defined spelling key position and Two bors d's oeuveres substitutes or the basis of dedicated array of keys position character set on, the user is when using sound sign indicating number input Chinese, do not need to use hot key, mouse, the switching mode of any manual interventions such as program portrait selection, between spelling and Two bors d's oeuveres input state, switch, do not need to use the special identifier key of any difference spelling coding and double spelling coding yet, character string with the character string that identifies leading input or follow-up input is the spelling coding or is double spelling coding, just can be under same input state, Two bors d's oeuveres according to the rules substitutes key position system and spelling key position system, freely optionally mix and impact alternative key position of Two bors d's oeuveres or spelling key position, it is the combination in any sequence of double spelling coding and spelling coding, can beat spelling by initial consonant, Two bors d's oeuveres beaten in simple or compound vowel of a Chinese syllable, perhaps initial consonant is beaten Two bors d's oeuveres, spelling beaten in simple or compound vowel of a Chinese syllable, can be in the input of two or more continuous syllables, equally freely optionally at any initial consonant, simple or compound vowel of a Chinese syllable, or syllable partly beats Two bors d's oeuveres or spelling, imports the syllable of Chinese;
4). 1) with 2) and on the basis of defined spelling key position and double spelling key-position character set, because the character combination sequence of spelling coding is not conflicted mutually with the character combination sequence of double spelling coding, computer program is with 1) with 2) in regulation spelling key position and Two bors d's oeuveres substitutes or dedicated array of keys position system is a foundation, the legal input coding character string that receives from keyboard key-position---no matter they are spelling coded character or double spelling coding character, no matter also they are any array configurations of spelling coding and double spelling coding---convert the character string of the phonetic representation notation of unified inside to, this notation can be exactly a spelling, it also can be certain Two bors d's oeuveres, it can also be intermediary's coding of any one and Chinese phonetic alphabet equivalence, with the character series after this conversion, go the Chinese words of match retrieval, speech, phrase or sentence, thus reach the Chinese purpose of input;
5). by 4) in the computer system narrated the spelling of input is changed with the identification of double spelling coding mixed characters string, be that Automatic Program is carried out fully, its only relies on according to 1) with 2) in the spelling key letter position system of defined and double spelling coding key position system and the transformation rule formulated, without any need for the extra difference spelling coding and the special identifier key of double spelling coding, the character string that does not need these special key positions to identify the character string of leading input or follow-up input is the spelling coding or is double spelling coding, just can automatically all finish spelling is encoded and the identification conversion work of the character string that double spelling coding freely mixes;
6). by 4) in the identification conversion of the computer system narrated character string that the spelling and the double spelling coding of input mixed, it allows a plurality of syllables of input continuously, it is under all unambiguous situations, do not need the user to import syllable splitting information, just can automatically finish the cutting between the syllable, only under human reader can not the ambiguity situation of right area syllabify, just need the user from syllable splitting information of outside input, to offer program cutting syllable.
Technical scheme of the present invention is done further described in detail below in conjunction with two most preferred embodiments.
The part of common core in the description technique scheme at first.
(1), definition spelling key position system and double spelling key-position system.Definition spelling key position system can directly adopt existing mature technologies such as the Scheme for the Chinese Phonetic Alphabet, repeats no more.
The definition Two bors d's oeuveres substitutes key position or dedicated array of keys position (hereinafter to be referred as alternative key position).The method of definition is: being used in the double spelling coding substituted or the alternative key position character of expression golygram initial consonant, be defined into spelling consonant key position character character set inequality on, promptly do not belong to character set (b, c, d, f, g, h, j, k, l, m, n, p, q, r, s, t, w, x, y, z); The key position character that is used for substituting or representing spelling golygram simple or compound vowel of a Chinese syllable in the double spelling coding, be defined into spelling final key position first character (except alphabetical ü) character set inequality on, promptly do not belong to character set (a, e, i, o, u), and guarantee, with respect to Chinese syllable, these characters do not cause conflicting in the identification with the composite sequence of spelling character when making up with consonant character.
The spelling golygram initial consonant sequence one that needs the regulation Two bors d's oeuveres to substitute the key position has 3, and they are:
ch,sh,zh。
The spelling golygram rhythm auxiliary sequence that needs regulation to substitute the key position has 26 altogether, and they are:
ai,an,ang,ao,ei,en,eng,ia,ian,iang,iao,ie,in,ing,iong,iu,ong,ou,ua,uai,uan,uang,ue,ui,un,uo。
In addition, no matter be Two bors d's oeuveres or spelling, all will be to Chinese phonetic alphabet ü specified key position.
This defining principle, with the alternative key-position definition method of prior art double-spelling Chinese character input method, basic identical.The definition of tradition Two bors d's oeuveres substitutes the key position, and utilize following character exactly: in the Scheme for the Chinese Phonetic Alphabet, all consonant characters are not first characters of simple or compound vowel of a Chinese syllable, and first character of all simple or compound vowel of a Chinese syllable is not a consonant character.So, can substitute the golygram initial consonant with first character of simple or compound vowel of a Chinese syllable, can substitute the golygram simple or compound vowel of a Chinese syllable with consonant character.And this has also automatically satisfied above-mentioned Two bors d's oeuveres alternatives and the mutual in actual use principle of not conflicting of the composite sequence of spelling character basically.
In the above-mentioned requirements, the regulation that simple or compound vowel of a Chinese syllable is substituted key has a restriction to note: " not causing conflicting in the identification with the composite sequence of spelling character when making up with consonant character ".This limits pairing situation following surface analysis.
Character h is used as the situation of the alternative key position of golygram simple or compound vowel of a Chinese syllable.According to " inequality " with spelling simple or compound vowel of a Chinese syllable first character this, h is not first character of golygram simple or compound vowel of a Chinese syllable in the Chinese phonetic alphabet, can be used as the alternative key of golygram simple or compound vowel of a Chinese syllable.But, also may produce conflict in actual use.For example, in " natural code " input method, h is defined as alternative simple or compound vowel of a Chinese syllable ang, like this, for second character h among the character string ch that receives, it is h self among the spelling cacuminal ch that program just can not be discerned it, is still representing the Two bors d's oeuveres of simple or compound vowel of a Chinese syllable ang to substitute key, because cang also is a syllable legal in the Chinese.That is to say that it does not satisfy above-mentioned principle of not conflicting, thereby this Two bors d's oeuveres alternative definitions is not suitable for full-spelling double-spelling and uses the type input method with.Yet,, just can avoid this conflict if h is used as the simple or compound vowel of a Chinese syllable that can not occur after instead of flat lingual initial consonant z, c, the s.For example, if h is defined as alternative simple or compound vowel of a Chinese syllable ian, owing in Chinese, do not have cian, sian, these three syllables of zian, so when receiving character string ch, sh or zh, it is second character in the initial consonant with leading alphabetical c, s or z combination that program can be discerned h in ch, sh or these three character strings of zh, constitute three cacuminals in the spelling respectively, and get rid of it is the double spelling coding that substitutes simple or compound vowel of a Chinese syllable ian, because the simple or compound vowel of a Chinese syllable that h substitutes can not occur after z, c, s, this has just satisfied the requirement that does not conflict.
And for example, can stipulate to substitute simple or compound vowel of a Chinese syllable ü and simple or compound vowel of a Chinese syllable ui simultaneously with v, at this moment, any one initial consonant, if it and ü be combined into legal syllables, this pronunciation is promptly arranged in the Chinese, it just can not be combined into legal syllables with ui, does not promptly have this pronunciation in the Chinese; Vice versa.For example, this syllable of n ü is arranged in the Chinese, and do not have this syllable of nui, this syllable of kui is arranged, and do not have this syllable of k ü, or the like.Therefore, substitute simple or compound vowel of a Chinese syllable ü and ui simultaneously, can satisfy the requirement of " not conflicting " with respect to Chinese speech with this key position of V.But, if regulation v substitutes ü and ao simultaneously, will clash, for example, for the character string nV of input, it is syllable n ü that program just can't be discerned it, or syllable nao, this just causes conflict, thereby does not meet the definition requirement of above-mentioned key position.
It is to be noted, h is used as simple or compound vowel of a Chinese syllable substitutes these special circumstances of key except above-mentioned, when the regulation Two bors d's oeuveres substitutes the key position, as long as satisfy aforementioned " inequality " requirement, and satisfy a character and can only substitute a simple or compound vowel of a Chinese syllable or an initial consonant, just can satisfy in actual use the requirement of " not conflicting ".If character substitutes simultaneously or represents the simple or compound vowel of a Chinese syllable that two or more are different, will make a concrete analysis of it and the situation of all initial consonants combinations, whether inspection might produce conflict.Below in the table 1 key position such as j and m be exactly to have substituted two simple or compound vowel of a Chinese syllable simultaneously, by checking that they do not clash in actual use, so be feasible regulation.
Except the principle of above-mentioned " not conflicting ", the define method that Two bors d's oeuveres is substituted the key position limits without any other, defines in the scope of 26 letters that can be on Universal English keyboard, also can use other character definition, because use the character of the non-Chinese phonetic alphabet, just do not conflict with the spelling coding.Therefore also can the design specialized keyboard, golygram initial consonant and the simple or compound vowel of a Chinese syllable in the Chinese phonetic alphabet, a new key position for example, to simple or compound vowel of a Chinese syllable ang, is set up specially in more most of ground or use the dedicated array of keys bit representation fully, or the like.
Below table 1 provide a kind of feasible alternative key position system that uses English universal keyboard.
Table 1, a kind of Two bors d's oeuveres, spelling of can be used for are freely mixed the alternative key mapping table of the Two bors d's oeuveres of importing
Phonetic key position substitutes the key position ai an ang ao ei en eng ia ian iang s d f g r t y z h j
Phonetic key position substitutes the key position iao ie in ing iong iu ong ou ua uai k q l ; m w m p z x
Phonetic key position substitutes the key position uan uang ue ui un uo ü ch sh zh c j b v n o v i u v
The alternative key bitmap of the use Universal English keyboard corresponding with table 1 is seen Figure of description.
Annotate 1.To the golygram simple or compound vowel of a Chinese syllable independence syllable of no initial consonant, and the alphabetical number of syllable is above 2, if import then special provision with double-spelling method: must add " zero initial " a: o in front, and then use alternative key according to rule; When the alphabetical number of syllable is no more than 2, then directly beat the pinyin character of this no initial sounds joint.For example, independent syllable ang, when with the input of Two bors d's oeuveres form, its key position is: of.Syllable an independently, when importing with Two bors d's oeuveres, keystroke position an.This is not clearly expression in table neutralization figure.
Annotate 2: the initial consonant and the simple or compound vowel of a Chinese syllable of the phonetic that those are represented with an English alphabet, identical with the spelling coding in double spelling coding, there is not substitution problem, so unlisted in showing.
Annotate 3: capitalization is represented the key position of Universal English keyboard in the Figure of description, and lowercase is represented the initial consonant or the simple or compound vowel of a Chinese syllable of the replaced Chinese phonetic alphabet.
(2), the character of not conflicting mutually according to the character combination sequence of spelling and Two bors d's oeuveres, and to substitute key position system with the Two bors d's oeuveres of prior regulation be foundation, program can be according to the legal coded character that 1. receives from keyboard, 2. the order of these characters, 3. other possible legal function keys (are not the function keys that input state is switched, not that the leading or successive character string of difference belongs to the spelling coding or the special identifier key of double spelling coding, and be meant under same input state, input method needed some with coding function associated key, for example, tone key, key distinguished in syllable, font code key in selection key of duplicat codes or the phonetic-stroke code or the like), automatically discern double spelling coding and spelling coding, they are converted to the unified Chinese speech notation of the inside of any prior regulation, offer the program search Chinese words, speech, phrase or sentence.
The identification transformation rule is conceptive very simple, and it follows following three total principles:
The first, the identification conversion operations carries out according to the order of (initial consonant → simple or compound vowel of a Chinese syllable), if input method comprises the disyllabic word input, then carries out according to the order of (initial consonant → simple or compound vowel of a Chinese syllable → initial consonant → simple or compound vowel of a Chinese syllable), if comprise the input of linguistic units such as polysyllabic word, the rest may be inferred.Program can be according to the kind of the legal character of input and the actual result of order, identification transformation rule and conversion, judge that current is to be in initial consonant failure translate phase, still be in simple or compound vowel of a Chinese syllable identification translate phase, correctly implement with control identification conversion, control transformation operates in ruly hocketing between initial consonant translate phase and the simple or compound vowel of a Chinese syllable translate phase.
The second, the two corresponding characters composite sequence of spelling and Two bors d's oeuveres does not conflict mutually, and this character is the necessary and sufficient condition of automatic program identification conversion.Its being embodied as on conversion operations:
1. in the identification conversion of first character of initial consonant,, just must be the Two bors d's oeuveres alternatives of initial consonant if run into the consonant character that does not belong to spelling; For example, first consonant character belongs to that (in the time of v), they are not the spelling consonant characters for i, u, so only may be the Two bors d's oeuveres alternatives of initial consonant; This moment just by the alternative key gauge of Two bors d's oeuveres then they be converted to respectively the spelling initial consonant (ch, sh, zh); Here, spelling is not conflicted mutually with the Two bors d's oeuveres character string; If first consonant character legal character that is spelling, it also must be the legal character of Two bors d's oeuveres initial consonant; For example, consonant key position g is a spelling, also is Two bors d's oeuveres; Here, spelling is not conflicted mutually with the character of Two bors d's oeuveres yet; Again because in Two bors d's oeuveres, any initial consonant only shows with a key mapping table, therefore, if receive second consonant character at the initial consonant translate phase, just must be the spelling consonant character; When this situation only occurred in first and second key position and belongs to golygram initial consonant (ch, sh, zh), at this moment, wherein second key position h was identified as second character of spelling initial consonant, and can not be the alternative key position of Two bors d's oeuveres;
2. based on same reason, in the identification conversion of first character of simple or compound vowel of a Chinese syllable,, just can only be the Two bors d's oeuveres alternatives of simple or compound vowel of a Chinese syllable if run into the rhythm alphabetic character that is not spelling, here, both do not conflict mutually at the corresponding characters sequence; If first rhythm alphabetic character is the legal character of spelling, it also must be the legal character of Two bors d's oeuveres simple or compound vowel of a Chinese syllable; Here, both corresponding characters collection do not conflict mutually yet; Again because in Two bors d's oeuveres, any simple or compound vowel of a Chinese syllable only shows with a key mapping table, therefore, if receive second character at the simple or compound vowel of a Chinese syllable translate phase, just must be the rhythm alphabetic character of spelling; Therefore, in simple or compound vowel of a Chinese syllable identification transfer process,, just change according to the rule identification of spelling since second rhythm alphabetic character.
Thus, when the double spelling coding of defining principle freely mixes use with the spelling coding in satisfied (one), fully can be by automatic program identification.
The 3rd, use with in the input scheme at full-spelling double-spelling, judge that input of character string belongs to spelling character, or Two bors d's oeuveres character, and carry out conversion operations, without any ambiguity.Yet, the ambiguity that syllable is distinguished might take place.In the Scheme for the Chinese Phonetic Alphabet self,, just might produce the ambiguity that syllable is separated if carry out word link writing.For example, for character string: piao, it can represent the syllable of " wafing ", or the syllable of representative " fur-lined jacket ", even human reader can not distinguish.So in the Scheme for the Chinese Phonetic Alphabet, in the spelling input in Chinese, stipulated that all additional syllable-dividing mark comes the cutting syllable.If piao represents two syllables in the last example, just must between two syllables, add a blank character.This regulation is used input method with for full-spelling double-spelling and also is suitable for.That is to say that full-spelling double-spelling is used with in the type input method owing to can use spelling, also just nature " successions " the syllable splitting ambiguity phenomenon of spelling self existence.When this ambiguity takes place, need to come artificial syncopation joint with sound insulation key position.For example, the all-phonetic input method of at present popular WPS Chinese information processing system comes the cutting Chinese syllable with space bar.Therefore, use with in the input method, under the situation that ambiguity does not take place at full-spelling double-spelling, automatically discern by computer system, finish syllable splitting, when the ambiguity that human reader also can not judge takes place, the requirement user to syllable splitting, for example uses space bar with other key positions.Must be pointed out simultaneously, this sound insulation key, be not input state to be changed or input coding is carried out distinctiveness identify, and the syllable splitting key of only assisting under same input state, when being used for the polysyllabic word input, it does not change character and the state that full-spelling double-spelling freely mixes input.
Narrate full-spelling double-spelling below more meticulously and use one of key component of type input method with: the identification transfer algorithm of full-spelling double-spelling mixed characters string sequence.Also illustrate transformation rule emphatically for the ease of reading comprehension, for key problem in technology clearly is described fully, the mode that adopts natural language to combine with form, directly perceived popular and accurate whole operating process of at large describing algorithm, help at length explaining the identification meaning that transformation rule had like this and be not only to list rule, so that the technician can deep enough understanding discern the principle of changing, make the technician convert it to various computerese written program easily according to the description of this explanation.
Suppose:
1). the Two bors d's oeuveres of conversion institute foundation substitutes the key scheme, is by table 1 and the alternative key scheme of the Two bors d's oeuveres of accompanying drawing defined; Spelling key position is identical with the Scheme for the Chinese Phonetic Alphabet, and alphabetical ü represents with v; Do not use tone code;
2). the actual speech of the inside after the conversion is expressed the retrieval symbol, also directly uses spelling;
3). this algorithm is a module in the whole Chinese character coding input method system program, this part is changed in the identification of full-spelling double-spelling coding mixed sequence in its special disposal enter key position, it is subjected to the control of master routine, the identification conversion of those and full-spelling double-spelling mixed characters string sequence does not have the operation of direct relation, in master routine and in other modules of master routine control, finish with prior art, repeat no more here;
4). first space bar of input after the simple or compound vowel of a Chinese syllable coded character is the syllable separation key.
Full-spelling double-spelling mixed characters string identification transfer algorithm is described 1. initial consonant translate phase 1SHENGMU_1 initial consonants, first input character identification transformation rule table
(also may be the initial of zero consonant syllable) input of character string a b c d e f g h i j k l m output string a! B c #D e! F g h ch j k l m input of character string n o p q r s t u v w x y z output string n o! P q r s #T sh zh w x y z #
Further operation to output string:
1). keep the initial consonant symbol in the output string, when changing for follow-up simple or compound vowel of a Chinese syllable with reference to use.
2). according to the trailing character of output string, divide following three kinds of situations to handle:
A) if. trailing character is that the initial consonant symbol with leading in the output string is used to retrieve Chinese words.Finish this and take turns the initial consonant conversion, accept next legal coded character, be assumed to be first input character of simple or compound vowel of a Chinese syllable, change and remove to carry out YUNMU_1 from keyboard.(this situation is no matter with respect to spelling or with respect to Two bors d's oeuveres, can confirm the character of first input, is exactly the occasion of the final initial consonant symbol of confirming).
B) if. trailing character is! , the initial consonant symbol is changed to zero, finish this and take turns the initial consonant conversion, change and remove to carry out YUNMU_1.(this situation correspondence first input character be the occasion of zero initial).
C) if. trailing character is #, and the initial consonant symbol with leading in the output string is used to retrieve Chinese words.This takes turns initial consonant conversion end as yet, accepts next legal coded character from keyboard, supposes that it is second character of initial consonant, changes and removes to carry out SHENGMU_2.(this situation is the situation that can't confirm whether consonant character has finally been confirmed, and also promptly the character of first input is the occasion of flat tongue consonant initial consonant).
2. the initial consonant translate phase 2
When being c, s, z, leading initial consonant symbol enters this program SHENGMU_2 initial consonant second input character transformation rule table
(also may be first rhythm alphabetic character that receives behind the initial consonant) input of character string a b c d e f g h i j k l m input of character string h input of character string n o p q r s t u v w x y z; Output string
Further processing to output string:
1) if. output string is h, then with h splicing after leading c or s or z, generate initial consonant symbol ch or sh or zh, be used to retrieve Chinese words, and keep, when changing for follow-up simple or compound vowel of a Chinese syllable with reference to use.Accept next legal character from keyboard, be assumed to be first input character of simple or compound vowel of a Chinese syllable, change and remove to carry out YUNMU_1.
2) if. output string Shi, then this input character is assumed to be first input character of simple or compound vowel of a Chinese syllable, change and remove to carry out YUNMU_1.
Attention: according to the alternative key gauge of table 1. and accompanying drawing then, since second enter key position, semi-colon key is legal key letter position, so comprise the identification to semi-colon key in this transformation rule table.
3. the simple or compound vowel of a Chinese syllable translate phase 1
After the final affirmation of initial consonant conversion, enter this program.
The YUNMU_1 simple or compound vowel of a Chinese syllable first input character transformation rule table input of character string a b c d e f g h i output string a! Ue uan an e! Ang ao ian i! Input of character string j k l m n o p q r output string iang iao in ong un o! Ou ie ei input of character string s t u v w x y z; Output string ai en u! Ui iu uai eng ia ing as follows to the further processing of output string:
1) if. leading initial consonant do not belong to (b, f, m, l, p w), neither not have initial sounds joint, then output string o! Change into uo!
2) if. output string is ui, then
(x), output string changes u into for j, q when leading initial consonant belongs to.
(n, l), output string changes V (v represents simple or compound vowel of a Chinese syllable ü) into when leading initial consonant belongs to.
3) if. leading initial consonant do not belong to (d, j, l, q, x), then output string ia changes ua into.
4) if. leading initial consonant do not belong to (j, l, n, q, x), then output string iang changes uang into.
5) if. leading initial consonant belongs to that (x), then output string ong changes iong into for j, q.
6). the simple or compound vowel of a Chinese syllable symbolic component of front in the output string, offer the retrieval module of program, the retrieval words
7) if. the trailing character Shi of output string, the simple or compound vowel of a Chinese syllable conversion is finally confirmed, finish conversion of epicycle simple or compound vowel of a Chinese syllable and syllable conversion, (it is the situation that Two bors d's oeuveres substitutes key that this correspondence that this identification conversion confirms as to return master routine, restart new one and take turns the syllable conversion, after promptly accepting next legal coded character, change and remove to carry out SHENGMU_1)
8) if. the trailing character of output string is! (this correspondence this identification conversion confirm as be the situation of spelling simple or compound vowel of a Chinese syllable first character), preserve the simple or compound vowel of a Chinese syllable part of this character string, accept next legal coded character from keyboard, as second rhythm alphabetic character of hypothesis, (for example, this rhythm alphabetic character is u after this rhythm alphabetic character in splicing, the fresh character that receives from keyboard is a, then be spliced into ua), change and remove executive routine YUNMU_2.Annotate: 1), 2), 3), 4), 5) in operation all be according to double spelling coding rule and spelling coding rule, analyze in the composite sequence with different leading initial consonants, should then change according to what alternative key gauge.
4. the simple or compound vowel of a Chinese syllable translate phase 2
When leading simple or compound vowel of a Chinese syllable is a, o, e, i during u, enters this program
Conditional operation: at first check second rhythm alphabetic character of the hypothesis that newly receives before changing, if it is the syllable separation key, promptly space bar then finishes this and takes turns the simple or compound vowel of a Chinese syllable conversion, and finishes this and take turns the syllable conversion, returns the master routine master routine.If not space bar, carry out following conversion operations.The spliced transformation rule table of YUNMU_2 simple or compound vowel of a Chinese syllable second input character
(being the transformation rules of preceding two rhythm alphabetic characters as input of character string) input of character string ai an ao ei en ia ie in io output string ai Ao ei en! Ie in! Other output strings of input of character string iu on ou ua ue ui un uo iu ong! Ou ua! Ue ui un uo other *
Further processing to input of character string and output string is as follows:
1) if. leading initial consonant is zero (being that no initial sounds saves), if then input of character string is (of, in the time of oy) (these character strings all are included into " other " in last table), they are converted to respectively (ang, eng), as output string.
2) if. the trailing character of output string is not *, the front simple or compound vowel of a Chinese syllable symbolic component in the output string, offers the retrieval module of program, the retrieval words.
3) if. the trailing character Shi of output string, finish this and take turns simple or compound vowel of a Chinese syllable conversion, and finish this and take turns the syllable conversion, return master routine (to restart new one and take turns the syllable conversion, promptly receive next legal coded character after, change and remove to carry out SHENGMU_1)
4) if. the trailing character of output string is! Preserve the simple or compound vowel of a Chinese syllable part of this character string, accept next legal coded character from keyboard, the 3rd rhythm alphabetic character as hypothesis, (for example, this time the rhythm alphabetic character of output is ua, and the fresh character that receives from keyboard is n after this rhythm alphabetic character in splicing, then be spliced into uan), change and remove executive routine YUNMU_3.
5) if. the trailing character of output string is *, and (first consonant character that second character in this description character string is next syllable) finishes this and take turns simple or compound vowel of a Chinese syllable conversion, and finishes this and take turns the syllable conversion.With second character in the output string, first input character as new syllable, (subsequent operation is equivalent to change and carries out SHENGMU_1 to return master routine, conversion discerned automatically in the syllable of a beginning new round, this operation has also represented program under all possible situation, automatically syllable is carried out cutting).
Specify: (see Table 1 according to Two bors d's oeuveres is substituted the special provision of using the key position.The explanation of back), independent syllable represented in the trigram of no initial consonant,, first keystroke position o. and then beat the alternative key position of simple or compound vowel of a Chinese syllable if according to Two bors d's oeuveres form input.This regulation make no initial consonant independence syllable (ang, Two bors d's oeuveres input form eng) be (of, oy), so have this stage 1). relevant decision operation.
5. the simple or compound vowel of a Chinese syllable translate phase 3
Conditional operation: judge at first that before changing the 3rd rhythm alphabetic character of the hypothesis that newly receives returns the master routine master routine if space bar then finishes this and takes turns the simple or compound vowel of a Chinese syllable conversion, and finishes this and take turns syllable conversion.If not space bar, carry out following operation.The spliced transformation rule table of YUNMU_3 simple or compound vowel of a Chinese syllable the 3rd input character
Other output strings of (being the transformation rule of first three rhythm alphabetic character as input of character string) input of character string ang eng ian iao ing ion ong uai uan ang eng Iao ing ionq! Ong uai uan! Other * are as follows to the further processing of output string:
1) if. the trailing character of output string is not *, the simple or compound vowel of a Chinese syllable symbolic component of front in the output string, offers the retrieval module of program, the retrieval words.
2) if. the trailing character Shi of output string, finish this and take turns simple or compound vowel of a Chinese syllable conversion, and finish this and take turns the syllable conversion, return master routine (to restart new one and take turns the syllable conversion, promptly receive next legal coded character after, change and remove to carry out SHENGMU_1)
3) if. the trailing character of output string is! Preserve the simple or compound vowel of a Chinese syllable part of this character string, accept next legal coded character from keyboard, the 4th rhythm alphabetic character as hypothesis, (for example, this time the rhythm alphabetic character of output is uan, and the fresh character that receives from keyboard is g after this rhythm alphabetic character in splicing, then be spliced into uang), change and remove executive routine YUNMU_4.
4. if the trailing character of output string is *, (the 3rd first consonant character that character is next syllable in this description character string) finishes this and takes turns the simple or compound vowel of a Chinese syllable conversion, and finishes this and take turns the syllable conversion.With the 3rd character in the output string, as new input character, (subsequent operation is equivalent to change and carries out SHENGMU_1, the syllable identification conversion of a beginning new round to return master routine, this operation has represented program under all possible situation, automatically syllable is carried out cutting).
6. the simple or compound vowel of a Chinese syllable translate phase 4
Conditional operation: before changing, at first judge, the hypothesis that newly receives the 4th rhythm alphabetic character if space bar then finishes this and takes turns the simple or compound vowel of a Chinese syllable conversion, and finish this and take turns the syllable conversion, return the master routine master routine.If not space bar, carry out following operation.
The spliced transformation rule table of YUNMU_4 simple or compound vowel of a Chinese syllable the 4th input character
Other * of (being the transformation rules of preceding four rhythm alphabetic characters as input of character string) other output strings of input of character string iang iong uang iang iong uang are as follows to the further processing of output character:
1) if. the trailing character of output string is not *, the front simple or compound vowel of a Chinese syllable symbolic component in the output string, offers the retrieval module of program, the retrieval words.
2) if. the trailing character Shi of output string, finish this and take turns simple or compound vowel of a Chinese syllable conversion, and finish this and take turns the syllable conversion, return master routine (to restart new one and take turns the syllable conversion, promptly receive next legal coded character after, change and remove to carry out SHENGMU_1)
3) if. the trailing character of output string is *, and (the 4th first consonant character that character is next syllable in this description character string) finishes this and take turns simple or compound vowel of a Chinese syllable conversion, and finishes this and take turns the syllable conversion.With the 4th character in the output string, as new input character, (subsequent operation is equivalent to change and carries out SHENGMU_1 to return master routine, conversion discerned automatically in the syllable of a beginning new round, simultaneously, this operation has also represented program under all possible situation, automatically syllable is carried out cutting).
7. polysyllabic conversion
The identification of the whole possible array configuration of the initial consonant of spelling, Two bors d's oeuveres and simple or compound vowel of a Chinese syllable conversion has below intactly been described in the syllable.Multisyllable character string is the sequence that syllable joins in proper order, easily knows according to induction, as long as be fully feasible, be exactly repetitive cycling operation to the conversion of a plurality of syllables, so without any difficulty to the single syllable conversion to the conversion of a syllable.Also as can be seen, judge a simple or compound vowel of a Chinese syllable conversion when program and finally confirm from top algorithm, finish fully, just mean that the conversion of this syllable also finishes fully, the identification transfer process that can prepare to enter next syllable enters cycling.
(3), describe full-spelling double-spelling more than in detail and used whole rules of in the type input method enter key position character string being discerned automatically conversion with.Here again scheme is done some supplementary notes.
1. full-spelling double-spelling is used the type Chinese character coding input method with, and the spelling coding that it adopts and the concrete scheme of double spelling coding can have various ways.According to the double spelling key-position defining principle of being narrated in (), can define many kinds of double spelling key-position schemes.Table 1 and the corresponding listed double spelling key-position system of explanation accompanying drawing only are a kind of in numerous feasible programs.As previously mentioned, as long as satisfy the requirement that does not conflict mutually, any Two bors d's oeuveres scheme all is feasible.To the definition of double spelling coding scheme, only need to satisfy the condition of being narrated in () of not conflicting mutually, without any other restrictions.For example, employed Two bors d's oeuveres substitutes the key position in the Two bors d's oeuveres double-tone input method of at present popular WPS Chinese information processing system, though different with this instructions table 1 defined can be used for full-spelling double-spelling equally and use the type input method with.
2. the spelling coding freely mixes the Chinese character coding input method of using with double spelling coding, can use Universal English keyboard, also can use other forms of keyboard, comprise custom-designed keyboard, only require and comprised spelling key letter position and double spelling coding key position on these keyboards, and they satisfy Two bors d's oeuveres described in () and the in use mutual condition of not conflicting of the symbol sebolic addressing of spelling.
3. above-mentioned conversion identification algorithm only is a kind of in numerous feasible algorithms, also exists many other feasible transfer algorithms, their equivalent equivalences.Here, the key that conversion operations can be implemented when being regulation double spelling key-position and spelling key position, satisfies the condition that its character combination sequence described in () is not conflicted, and this condition is the necessity and the sufficient condition of conversion operations feasibility.
4. to the detailed description of recognizer,, just be equivalent to prove the adequate condition of conversion feasibility in top (two) joint because it has comprised all possible character combination sequence; Above in (one) to particular key position h, thereby be used as the analysis that can cause the example that conflict can't be discerned when substituting simple or compound vowel of a Chinese syllable ang, just be equivalent to prove the necessary condition of conversion feasibility.
5. this full-spelling double-spelling is used the character string identification conversion operations of type input method with, is autonomous self-sustaining, so it can be used for the pure tone sign indicating number, can also can be used to comprise the phonetic-stroke code or the shape sound sign indicating number of chinese-character syllable encode with or without tone.
For example, if use toned sound sign indicating number, so only need be when stipulating these tone key positions, condition satisfied and that full-spelling double-spelling character combination sequence is not conflicted gets final product.For example, four circumflexs of regulation Chinese are represented with numerical key 1,2,3,4 respectively, and be defined in to have imported and import the tone key position after the simple or compound vowel of a Chinese syllable, so obvious, not contradiction is changed in the automatic identification of these tone key position characters and full-spelling double-spelling character string sequence, is actually favourable, because in case receive tone key position character, program just can judge that the input of a simple or compound vowel of a Chinese syllable and a syllable finishes, and can prepare to enter the initial consonant identification translate phase of next syllable.
In like manner, to any phonetic-stroke code or the shape sound sign indicating number that phonetic and font are combined, as long as satisfy regulation to the font code coded character, do not conflict with full-spelling double-spelling key bit pattern sequence in actual use, above-mentioned full-spelling double-spelling freely mixes the identification conversion operations of input, just is applicable to the sound sign indicating number part in this phonetic-stroke code or the shape sound sign indicating number.
6. the automatic identification transformation rule of the spelling of being narrated in (two), double spelling key-position mixed sequence is an independent parts, and it does not have cross influence to other parts of input method.Therefore, other parts of this input method can adopt any prior art to realize.For example, the presenting bank display technique, priority of high frequency and dynamic frequency modulation technology, application of the various theory and technology of natural language understanding or the like is to constitute a complete input method.General method is exactly, after the Dynamic Recognition conversion operations in arbitrary stage is finished, program is just according to the phonetic representation symbol sebolic addressing of changing the up-to-date inside that is obtained, the corresponding Chinese words Candidate Set of retrieval in character word stock, if the words project of coupling is unique, just directly show in the display prompts district, if it is not unique, then according to other technologies such as priority of high frequency technology and natural language understandings, determine the priority of highlight display items display after, on highlight, show, the user impacts the options button of preferred term end key or repeated code item, make to shield on the target item to put in place, finish this and take turns input, or continue input descendant key position.Because the program circuit of these parts can adopt ripe prior art fully, just repeats no more.
7. above-mentioned spelling coding freely mixes in the Chinese character coding input method of using with double spelling coding, the process of its input operation, from input coding key position on the keyboard to retrieving the coupling words, show in display prompts, so that the user can import the operation of target word, can adopt two kinds of different modes: discern conversion and display mode and asynchronous identification conversion and display mode synchronously.
The so-called conversion of identification synchronously display mode, be meant from first enter key position, whenever receive a coded character from keyboard, just identification conversion at that time, the inside retrieval code that just will change the back acquisition at that time offers program search, the words that will successfully mate in the time of just will retrieving at that time shows in the display prompts district that by the mode of certain regulation the user just can use at that time shields the target words that the key input that puts in place is hit.
So-called asynchronous identification conversion shows, be meant that computer program is not whenever to receive the just identification conversion immediately of a coded character, but to wait for that the user has beaten whole coded sequence strings of a syllable or whole coded sequence strings or whole coded sequences of an initial consonant unit or whole coded sequences of a simple or compound vowel of a Chinese syllable unit or the like of a speech, just begin to discern conversion and demonstration.
This instructions embodiment is described, is the mode that identification is synchronously changed and shown, promptly whenever receives a coded character, just identification conversion immediately and demonstration occurrence.Asynchronous identification conversion shows with synchronous identification conversion with demonstration to be compared, unique difference, only be after having received certain element string all, just enter actual identification conversion and display operation, and the rule of its identification conversion operations itself and principle do not change, so as long as identification conversion synchronously is feasible, asynchronous identification conversion also must be feasible.Therefore, this spelling coding and double spelling coding mixing input method can adopt the method for synchronization to carry out, and also can adopt asynchronous system to carry out.The inventor recommends to use synchronous identification conversion display mode.
More than be described in detail full-spelling double-spelling and used the part of the common key in the embodiment of type input method with, narrate other relative sections in two embodiments below, to the identification conversion portion of relevant full-spelling double-spelling coded character composite sequence wherein, because or repetition identical with top narration no longer specifically launches.
1. embodiment 1, and full-spelling double-spelling is used the sound sign indicating number Chinese character coding input method of type with.
A kind of pure tone sign indicating number Chinese character coding input method of using spelling and Two bors d's oeuveres of freely mixing.Satisfy:
1. Two bors d's oeuveres substitutes the key position according to this instructions table 1 and Fig. 1 definition, does not adopt tone code.
2. when the simple or compound vowel of a Chinese syllable conversion of first syllable, if receive first space bar following closely, it then is the syllable separation key individual character repeated code display key of holding concurrently, for example, if after receiving character string pi, and then receive a space bar, just confirm that syllable pi finishes, with all Chinese characters of syllable correspondence therewith, show at presenting bank according to the high frequency principle of priority, and if continue to receive new input character, the initial consonant that then enters second syllable is changed.Otherwise pi also may combine with follow-up input character, still is used as first syllable identification conversion.
3. when the conversion of the simple or compound vowel of a Chinese syllable of second syllable, if receive following closely first space bar, then be the syllable separation key disyllabic word repeated code display key of holding concurrently, its principle of operation is with 2. similar.
4. when the conversion of the simple or compound vowel of a Chinese syllable of the 3rd syllable, if receive following closely first space bar, then be the syllable separation key trisyllable repeated code display key of holding concurrently, its principle of operation is with 2. similar.But whole this taken turns input and left it at that, and also is that this input method is three to a maximum syllable coding number that cuit allowed.Subsequent operation reenters the conversion operations of first syllable of a new cuit.
If 5. receive two space bars continuously, then second space bar is that going up of the first of presenting bank shielded the key that puts in place.
6. the above polysyllabic word of four syllables or four syllables is imported according to the compressed encoding of non-syllable form, and coding rule is to get the initial consonant code of first three syllable and the initial consonant code of ultima.Polysyllabic word shows separately at presenting bank, and specifies with enter key as shielding the key that puts in place in the special use of polysyllabic word, the end input.Like this, the input of polysyllabic word separate with the input of other words part, do not conflict mutually.Simultaneously, the identification of polysyllabic word conversion has only the identification conversion of initial consonant, does not have the identification conversion of simple or compound vowel of a Chinese syllable.
The procedure operation flow process:
From first legal input character, program whenever receives a character, just according to aforementioned spelling, Two bors d's oeuveres identification transformation rule, the code conversion of outside input is become inner speech retrieval sign indicating number, retrieval code sequence according to this inside, in the character word stock of in computer memory device, preestablishing, the matching candidate collection of the Chinese words that retrieval is corresponding, if the project in the set of the words of coupling is unique, if just directly instant playback on presenting bank is not unique, then according to the other technologies of priority of high frequency technology or natural language understanding or the like, determine the priority of display items display, instant playback on presenting bank again, the user can impact the corresponding selection key of preferred term end key or repeated code item, make to shield on the target item and put in place, finish this and take turns input, perhaps continue input descendant key position, take turns end of input up to this.When having repeated code, adopt the processing identical with prior art in the phonetic code input method, show the repeated code item at presenting bank, and with each repeated code item in order distribute digital show that simultaneously the user points out according to presenting bank, select input with numerical key; If presenting bank shows inadequately, then use the key that skips of regulation to continue to search subsequent project.Automatically discern the introduction of transformation rule part according to aforementioned full-spelling double-spelling, program does not cause under the situation of ambiguity at all, can finish syllable automatically and distinguish,, then require the user from the information of keyboard input space bar to provide current syllable to finish if ambiguity takes place.
2. embodiment 2, and full-spelling double-spelling is used the phonetic-stroke code Chinese character coding input method of type with.
A kind of phonetic-stroke code Chinese character coding input method of using spelling and Two bors d's oeuveres of freely mixing.
The general rule of coding is as follows:
First word, second word the 3rd word end word individual character phonetic+font code two-character word phonetic phonetic+font code three words pinyin phonetic phonetics+font code multi-character words initial consonant initial consonant initial consonant initial consonant
Wherein, font code is the first sum of sign indicating number that is drawn into of getting this word.The all possible the first sum of picture of Chinese character is included into five classes: horizontal, vertical, point, left-falling stroke, folding, their key position is respectively first character of the phonetic initial consonant of stroke name separately, promptly horizontal stroke → h is perpendicular → s, point → d, left-falling stroke → p, folding → z.The sound sign indicating number can be used spelling and Two bors d's oeuveres with.
Input method satisfies:
1. Two bors d's oeuveres substitutes the key position according to this instructions table 1 and Fig. 1 definition, does not adopt tone code.
2. adopt the prior art of " quickening input " (to see patent of invention " literal input accelerated method " by the long gradient separations of speech, number of patent application is: 92112716.2), whenever receive an input coding key position, program is after the identification conversion of necessity, retrieval respectively in four character word stocks preestablishing, these four character word stocks are, the individual character storehouse, the double word dictionary, three character word stocks, the multiword dictionary, with the preferential occurrence in each storehouse that retrieves, press individual character at presenting bank, two-character word, three words, the order of multi-character words, from left to right arrange, show simultaneously, and set up respectively corresponding to individual character, two-character word, three words, the upward screen end key separately of multi-character words also promptly has four kinds to go up screen end key positions, the prescriptive procedure of these key positions can be the instructions of 92112716.2 patent of invention referring to above-mentioned application number.
3. when the simple or compound vowel of a Chinese syllable conversion of first syllable, if receive first space bar following closely, it then is the syllable separation key individual character repeated code display key of holding concurrently, for example, if after receiving character string pi, and then receive a space bar, just confirm that syllable pi finishes, with all Chinese characters of syllable correspondence therewith, show at presenting bank according to the high frequency principle of priority, and if continue to receive new input character, the initial consonant that then enters second syllable is changed.Otherwise pi also may combine with follow-up input character, still is used as first syllable identification conversion.
4. when the conversion of the simple or compound vowel of a Chinese syllable of second syllable, if receive following closely first space bar, then be the syllable separation key disyllabic word repeated code display key of holding concurrently, its principle of operation is with 2. similar.
5. when the conversion of the simple or compound vowel of a Chinese syllable of the 3rd syllable, if receive following closely first space bar, then be the syllable separation key trisyllable repeated code display key of holding concurrently, its principle of operation is with 2. similar.
If 6. receive two space bars continuously, then second space bar is that going up of the first of presenting bank shielded the key that puts in place.Promptly finish the first item of presenting bank and put in place and go up the screen operation, this means that this input finishes.
7. after the input of first character (corresponding to the font code key of individual character) of first syllable back,, then be individual character repeated code display key if receive following closely first space bar; Close two-character word, three words and multi-character words retrieval this moment, only at the individual character library searching and show occurrence; After first character (corresponding to the font code key of two-character word) input of second syllable back, if receive first space bar following closely, then be two-character word repeated code display key, close the retrieval of three words and multi-character words this moment, only at the two-character word library searching and show occurrence.
8. after first character input of first syllable back, if receive the legal encodings character of unblank key following closely, promptly this character belongs to initial consonant or simple or compound vowel of a Chinese syllable coding, then closes the individual character retrieval; After first character input of second syllable back, if receive the legal encodings character of unblank key following closely, promptly this character belongs to initial consonant or simple or compound vowel of a Chinese syllable coding, then closes the two-character word retrieval.
9. the polysyllabic word code identification is handled identical with embodiment 1.
The procedure operation flow process has an important difference with embodiment 1: the coded character that program is imported the keyboard that receives logically is to distinguish individual character, two-character word, three words, multi-character words totally four kinds of situations, and identification is changed and retrieved respectively.For example, first character that receives in first syllable back need not change just to be used as the individual character retrieval, because its correspondence the font code of individual character, simultaneously, is used to retrieve two-character word and three words after must changing it again.Other operations are similar to Example 1, repeat no more.
Full-spelling double-spelling is used the type Chinese character coding input method with, in the development of phonetic code input method, and in the development of the phonetic-stroke code that uses tone code or shape tone code, has proposed new ideas and new method, and it has the following advantages.
The first, it has the separately advantage of spelling and Two bors d's oeuveres concurrently, has overcome again spelling and Two bors d's oeuveres shortcoming separately. It is by automatic " bluring " identification translation function of computer-internal, so that the large method of two in the spelling input method: all-phonetic input method and double-spelling Chinese character input method are unified harmoniously. It uses Two bors d's oeuveres for the user learning that uses spelling, and the approach of most convenient is provided, and has greatly optimized the learnability of Two bors d's oeuveres. It no longer is only to rely on external method, and for example the keycap at keyboard stamps the alternative key mapping of Two bors d's oeuveres, ejects Two bors d's oeuveres with hot key and substitute key bitmap etc. on screen, relies on this external method to help the user to learn no longer fully and uses Two bors d's oeuveres. But the program of computer-internal oneself is finished the identification of full-spelling double-spelling key letter bit sequence, significantly reduced the mental operation difficulty of input, for using Two bors d's oeuveres, provide to greatest extent " fault-tolerant " function: no matter be that the user does not remember or temporarily forgets double spelling key-position, or when Two bors d's oeuveres is inputted and the spelling key mapping obscure, do not affect normally carrying out of input, because input method itself just allows arbitrarily composite sequence of full-spelling double-spelling code character. So in this input method, " the obscuring " between spelling and the Two bors d's oeuveres can not affect normally and correctly carrying out of input, therefore " obscuring " no longer is a kind of mistake. It provides the man-machine interaction mode of more friendly cordiality, and computer system is also because allowing human user " obscuring mistake " to occur, still can normally correctly work, and more brings into play intelligentized feature, and " understanding " also seems more.
The second, it is particularly conducive to and popularizes Chinese in middle and primary schools and write computerization, has fundamentally eliminated the side effect of mutually obscuring of using Two bors d's oeuveres to produce on teaching of Chinese pin yin. It Educational Psychology and engineering psychology the application in Chinese character coding input method unite, it is so that in computerized Pinyin Chinese input, can not only mutually coordinate with the teaching of Chinese pin yin of primary school, and can with pedagogy in small step teaching methodology unite, because, it does not require that the student once grasps whole double spelling key-positions, but of a key mapping of a key mapping that can be incremental, or of the several key mappings of several key mappings, etc., grasp a key mapping and just can grasp several key mappings and just can use several key mappings with a key mapping, and do not resemble original spelling input and Two bors d's oeuveres input, be in mutually exclusive state, perhaps all use Two bors d's oeuveres, perhaps the alternative key of Two bors d's oeuveres all can not use, etc., this is to alleviating children's learning burden, safeguard children's physical and mental health, use as early as possible the modern science and technology instrument to improve learning efficiency, can play good effect.
The 3rd, it meets regulations and the policy of the work of national language literal, meets the modern developing direction of Chinese Language, with national knowledge and education background, highly compatible or unification.
The 4th, it is the performance that further improves the tone code Chinese character coding input method, and new starting point, new basis are provided.

Claims (1)

1. a full-spelling double-spelling is used the type Chinese character input method with, it is being in the phonetic code input method of basic input equipment or the phonetic-stroke code and form-sound code input method that comprises the sound sign indicating number with Universal English keyboard or other form keyboards, the input mode combination of spelling coding and double spelling coding is got up with coordination, by the computer system automatically spelling that received of identification and conversion and the free mixed sequence of double spelling coding, and combine with display, realize input in Chinese; The steps necessary of described input method comprises, at first is, directly uses the Scheme for the Chinese Phonetic Alphabet itself to set up a spelling key letter position; The method is characterized in that:
Also comprise the steps, be exactly,
1) the definition double spelling coding substitutes key position or dedicated array of keys position.Be used for substituting or represent that initial consonant and the simple or compound vowel of a Chinese syllable that those are write with two or more letters, define method are in the Scheme for the Chinese Phonetic Alphabet, the key position character that is used for representing Chinese phonetic alphabet golygram initial consonant in the double spelling coding, be defined into spelling consonant key position character character set inequality on, promptly do not belong to character set { b, c, d, f, g, h, j, k, l, m, n, p, q, r, s, t, w, x, y, z };
2) the key position character that is used for representing the golygram simple or compound vowel of a Chinese syllable in the Chinese phonetic alphabet in the double spelling coding, be defined into first letter with the spelling simple or compound vowel of a Chinese syllable, except alphabetical ü, on the character set inequality, promptly do not belong to character set { a, e, i, o, ü };
3) when character h being defined as the golygram simple or compound vowel of a Chinese syllable and substituting key, it can only substitute the simple or compound vowel of a Chinese syllable that can not occur after flat tongue consonant initial consonant z, c, the s in Chinese phonetic alphabet syllable;
4) the automatic identification represented spelling and the free mixed sequence of double spelling coding in key position of definition in this way of computer program realizes that Chinese full-spelling double-spelling uses input with.
CN95121307A 1995-12-23 1995-12-23 Full and double phoneticizing combined type Chinese input method Expired - Fee Related CN1053976C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN95121307A CN1053976C (en) 1995-12-23 1995-12-23 Full and double phoneticizing combined type Chinese input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN95121307A CN1053976C (en) 1995-12-23 1995-12-23 Full and double phoneticizing combined type Chinese input method

Publications (2)

Publication Number Publication Date
CN1152737A CN1152737A (en) 1997-06-25
CN1053976C true CN1053976C (en) 2000-06-28

Family

ID=5082419

Family Applications (1)

Application Number Title Priority Date Filing Date
CN95121307A Expired - Fee Related CN1053976C (en) 1995-12-23 1995-12-23 Full and double phoneticizing combined type Chinese input method

Country Status (1)

Country Link
CN (1) CN1053976C (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1586191A4 (en) * 2003-01-22 2009-03-11 Min-Kyum Kim Apparatus and method for inputting alphabet characters
CN100437441C (en) * 2004-05-31 2008-11-26 诺基亚(中国)投资有限公司 Method and apparatus for inputting Chinese characters and phrases
CN102043473A (en) * 2011-01-15 2011-05-04 靳友鹏 Computer Chinese character input method for realizing fast input and selection of pinyin
CN111310481B (en) * 2020-01-19 2021-05-18 百度在线网络技术(北京)有限公司 Speech translation method, device, computer equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN87100313A (en) * 1987-01-15 1988-07-27 翁天祥 Compatible code and compatible keyboard with Chinese phonetic alphabet compatibility

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN87100313A (en) * 1987-01-15 1988-07-27 翁天祥 Compatible code and compatible keyboard with Chinese phonetic alphabet compatibility

Also Published As

Publication number Publication date
CN1152737A (en) 1997-06-25

Similar Documents

Publication Publication Date Title
CN1205572C (en) Language input architecture for converting one text form to another text form with minimized typographical errors and conversion errors
CN1191514C (en) System and method for processing chinese language text
CN102902660B (en) Chinese phonetics codes spelling and Mixed Pinyin Chinese holographic information processing method
CN87107540A (en) Choose the method and apparatus of storage and demonstration Chinese character
CN1384940A (en) Language input architecture fot converting one text form to another text form with modeless entry
CN1248333A (en) Reduced keyboard disambiguating system
KR20030094632A (en) Method and Apparatus for developing a transfer dictionary used in transfer-based machine translation system
CN101739143B (en) Character inputting method and character inputting system
Huang et al. A new input method for human translators: integrating machine translation effectively and imperceptibly
JP2004220616A (en) Machine translation system for simultaneously displaying and editing three or more parallel translation screens
CN1053976C (en) Full and double phoneticizing combined type Chinese input method
CN1101567C (en) Method and apparatus for Chinese character text input
CN102479078A (en) Chinese programming method for computer by using Chinese phonetic codes
CN115310433A (en) Data enhancement method for Chinese text proofreading
JP5751537B2 (en) International Japanese input system
JP4588657B2 (en) Translation device
CN103246354A (en) Inputting method for encoding and expressing Chinese characters through common language characters and keyboards of inputting method
CN1018205B (en) Chinese voice-digit coding input technique for computer
CN100485590C (en) Chinese character input method
Sinha Computer Processing of Indian Languages and Scripts—Potentialities & Problems
JP4588417B2 (en) Translation device
CN1052200A (en) Pronunciation-form-meaning words encode series with compatibility and keyboard
CN85100087A (en) " Chinese coded sound " scheme and its implementation
JP2009116584A (en) Machine translation device and machine translation program
CN1836226A (en) Method and apparatus for converting characters of non-alphabetic languages

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee