CN1836199B - Character inputting method of using word as unit - Google Patents

Publication number
CN1836199B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2004800234193A
Other languages
Chinese (zh)
Other versions
CN1836199A (en)
Inventor
刘向东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CNB031537553A external-priority patent/CN100362455C/en
Priority claimed from CNA2003101134274A external-priority patent/CN1542594A/en
Priority claimed from CN 200410058195 external-priority patent/CN1737735A/en
Application filed by Individual filed Critical Individual
Publication of CN1836199A publication Critical patent/CN1836199A/en
Application granted granted Critical
Publication of CN1836199B publication Critical patent/CN1836199B/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 Character input methods
    • G06F3/0237 Character input methods using prediction or retrieval techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018 Input/output arrangements for oriental characters

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

Inputting Chinese characters is currently slow and inefficient, and the input state must be switched whenever characters of different kinds, such as Chinese characters, Arabic numerals, punctuation marks, and English letters, are entered together. The present invention therefore provides a general character input method that uses the word (phrase) as its unit. It derives sound codes and tone codes from the Chinese Phonetic Alphabet (pinyin), derives a radical-body pronunciation code from the stroke shapes of Chinese characters, and inputs words of 1, 2, 3, or more than 3 characters according to different rules. This allows pinyin and stroke shapes to be combined in one input, and allows Chinese characters, Arabic numerals, punctuation marks, and English letters to be mixed in one input. With the invention, the average code length is short, the duplicate-code rate is low, and the efficiency is high. The method is easy to learn and easy to use, and the various kinds of characters can be mixed without switching the input state.

Description

Character input method using the word as the unit
Technical field
The present invention relates to a character input method, and in particular to a method that uses a numeric keypad to input characters (including Chinese characters, Arabic numerals, punctuation marks, and English letters) with the word as the unit.
Background technology
Computers are becoming laptop-sized, handheld, and miniaturized, while mobile phones are gradually taking on the full range of computer functions. Embedded devices such as PDAs, set-top boxes, e-books, handheld terminals, household appliances, automotive electronics, automatic teller machines (ATMs), bar-code readers, data collectors, game consoles, karaoke machines, MP3 players, public information terminals, two-way pagers, and landline telephones also urgently need a way to enter all kinds of information conveniently and quickly with numeric keys. This requires a change from existing character input methods, which mainly use letter-key codes, to character input methods that mainly use numeric-key codes.
Pinyin input methods, represented by Intelligent ABC and Microsoft Pinyin, are the mainstream way of entering Chinese characters on a PC. To improve the efficiency of pinyin input, the inventor in 1995 invented a mixed pinyin input technique (Chinese patent ZL95102608.9) based on standard Chinese-language references such as the "Scheme for the Chinese Phonetic Alphabet" and the "Basic Rules for Hanyu Pinyin Orthography". That technique effectively resolved the conflict between ease of learning at the start and efficiency later on, and made computer input of Chinese characters with pinyin codes faster and more convenient. It introduced the concepts of sound code I and sound code II. In the toneless pinyin of a Chinese character, zh, ch, sh, ng, and ü are each replaced by a single symbol (ng by η and ü by v) to form the compressed pinyin code. When the compressed pinyin code has length 1, it is repeated to form the pinyin code; when its length is greater than 1, the compressed pinyin code itself is the pinyin code. The first code of the pinyin code is sound code I, and the rest of the pinyin code after its first code is sound code II.
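The construction of sound code I and sound code II described above can be sketched as follows. This is a minimal illustration, not the patented implementation. The symbols ẑ, ĉ, and ŝ are assumed stand-ins for the single symbols that replace zh, ch, and sh; only η (for ng) and v (for ü) come from the text. The function names are hypothetical.

```python
# Assumed replacement table: η and v come from the text; ẑ, ĉ, ŝ are
# hypothetical stand-ins for the single symbols that replace zh, ch, sh.
COMPRESS = [("zh", "ẑ"), ("ch", "ĉ"), ("sh", "ŝ"), ("ng", "η"), ("ü", "v")]


def compress_pinyin(pinyin):
    """Replace zh, ch, sh, ng, ü in a toneless syllable by single symbols."""
    out = pinyin.lower()
    for letters, symbol in COMPRESS:
        out = out.replace(letters, symbol)
    return out


def pinyin_code(pinyin):
    """A compressed code of length 1 is repeated; otherwise it is kept."""
    compressed = compress_pinyin(pinyin)
    return compressed * 2 if len(compressed) == 1 else compressed


def sound_codes(pinyin):
    """Return (sound code I, sound code II) for one toneless syllable."""
    code = pinyin_code(pinyin)
    return code[0], code[1:]


if __name__ == "__main__":
    for syllable in ("zhang", "wang", "a", "nü"):
        print(syllable, sound_codes(syllable))
```

The input is assumed to be a single toneless syllable, so each replaced letter pair can only appear where the scheme expects it.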
However, on handheld and miniaturized computers and on embedded devices such as mobile phones, set-top boxes, e-books, handheld terminals, household appliances, automotive electronics, automatic teller machines (ATMs), bar-code readers, data collectors, game consoles, karaoke machines, MP3 players, public information terminals, two-way pagers, and landline telephones, Chinese character input is still inefficient. The Chinese input methods most used on mobile phones are T9 from Tegic Communications (United States), iTAP from Motorola, and eZiText from Zi Corporation (Canada). Their pinyin input mainly uses the eight numeric keys 2, 3, 4, 5, 6, 7, 8, and 9 and works by entering single characters first, supplemented by an association function, so its efficiency is far below that of word-based Chinese input methods on a PC.
In these fields, the main reason that current pinyin input methods work character by character is code length. With full codes (without tones), the pinyin code of a one-character word is 1 to 6 codes long, that of a two-character word is 2 to 12 codes long, that of a three-character word is 3 to 18 codes long, and so on. Such codes are long and ambiguous and are inconvenient to enter. Using short codes instead increases the number of duplicate codes among syllables, which in turn also reduces input efficiency.
In addition, many users have not mastered pinyin well and prefer to enter Chinese characters by stroke shape. However, many Chinese characters have a very large number of strokes. The character shown in Figure G2004800234193D00021, for example, has as many as 48 strokes. Stroke input methods therefore have a long average code length, many duplicate codes, and low efficiency.
Furthermore, in input systems for Chinese, English, or any other language, the input state must be switched whenever Arabic numerals, punctuation marks, English letters, and similar information are mixed into the input. This lowers keystroke efficiency, and users find it very inconvenient.
To address these problems, the present invention provides a general character input method that uses the word as the unit. In particular, it solves the difficult problem of how to input Chinese characters word by word. The method applies not only to Chinese input but also to input in many other languages, such as English, Japanese, German, French, and Spanish.
Summary of the invention
The present invention was made in view of the above problems in the prior art. Its purpose is to provide a method that can input characters quickly with a numeric keypad, can mix any characters in the input without switching the input state, and in particular can input Chinese characters quickly using either sound codes or the radical-body method.
To achieve this purpose, the invention provides a character input method using the word as the unit, characterized in that characters are input with a numeric keypad defined as follows:
(the special sound codes, including η, are defined on the two keys 0 and 1, with any two of these sound codes on each key)
Characters, including Chinese characters, are input by the following rules:
(1) The code of a Chinese character = the numeric key for its sound code I + the numeric keys for its sound code II. The code of a numeral is the numeric key for that numeral itself. The code of an English letter is its numeric key according to the keypad definition above. The code of a punctuation mark is the numeric key corresponding to the first code of the code for its pronunciation.
(2) For a word consisting of 1 character, the code is the code of that character, and the word is input by pressing the keys in order.
(3) For a word consisting of 2 characters, the code is the codes of the 2 characters (if longer than N codes, take the first N codes), and the word is input by pressing the keys in order. N is a natural number ≥ 1. If N < 5, the code space is small and there are too many duplicate codes, so N is generally recommended to be 5 or more.
(4) For a word consisting of 3 characters, the code is the first code of the first character's code + the codes of the last two characters (if longer than N codes, take the first N codes), and the word is input by pressing the keys in order.
(5) For a word consisting of more than 3 characters, the code is the first code of every character's code + the remaining codes of the last character's code after its first code (if longer than N codes, take the first N codes), and the word is input by pressing the keys in order.
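Rules (2) to (5) can be sketched as one function that combines per-character codes into a word code. This is only an illustrative sketch under stated assumptions: the function name `word_code` is hypothetical, each character code is assumed to be a non-empty string of digits already obtained by rule (1), and the digit strings in the usage lines are arbitrary sample inputs rather than codes of real words.

```python
from typing import Sequence


def word_code(char_codes: Sequence[str], n: int = 6) -> str:
    """Combine per-character codes into a word code by rules (2) to (5)."""
    if not char_codes or any(not code for code in char_codes):
        raise ValueError("every character needs a non-empty code")
    count = len(char_codes)
    if count == 1:
        # Rule (2): the code of the word is the code of its only character.
        return char_codes[0]
    if count == 2:
        # Rule (3): the codes of both characters.
        full = char_codes[0] + char_codes[1]
    elif count == 3:
        # Rule (4): first code of the first character + codes of the last two.
        full = char_codes[0][0] + char_codes[1] + char_codes[2]
    else:
        # Rule (5): first code of every character + the rest of the last one.
        full = "".join(code[0] for code in char_codes) + char_codes[-1][1:]
    # Codes longer than N are cut to their first N codes.
    return full[:n]


if __name__ == "__main__":
    print(word_code(["68"]))                          # 1 character
    print(word_code(["68", "468"]))                   # 2 characters
    print(word_code(["920", "54", "68"]))             # 3 characters
    print(word_code(["4512", "3143", "1234", "54"]))  # more than 3 characters
```

The default N = 6 follows the recommendation that N be 5 or more; any other value can be passed as `n`.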
In addition, a word consisting of 3 or more characters may be coded as the first code of every character's code + the remaining codes of the last character's code after its first code (if longer than N codes, take the first N codes), and input by pressing the keys in order.
In addition, tone codes are defined on the numeric keypad according to the following relation:
Figure G2004800234193D00041
When a word consists of two or more characters, its last character is a Chinese character, and its code is shorter than N codes, the tone code of the last Chinese character is appended to the code of the word, and the word is input by pressing the keys in order.
In addition, a word consisting of 1 Chinese character may also be input by pressing, in order, the numeric keys for the standard pinyin of that character.
In addition, when sound code I of a Chinese character is the compressed symbol for zh, ch, or sh, the letters zh, ch, or sh may be coded as its sound code I instead; when sound code II of a Chinese character contains η, ng may replace η in the coding of sound code II. The code of the Chinese character = the numeric keys for the coding of its sound code I + the numeric keys for the coding of its sound code II.
In addition, the above-mentioned special sound codes are divided between key 1 and key 0, with η defined on key 0.
In addition, a word consisting of 2 or more characters may be coded as the first code of every character's code + the remaining codes of the last character's code after its first code (if longer than N codes, take the first N codes), and input by pressing the keys in order.
In addition, because many users have not mastered the pinyin of some Chinese characters, the code of a Chinese character need not take its sound-code form and may instead be formed by the following radical-body pronunciation input method:
(a) Take the radical set that contains exactly the five single-stroke radicals horizontal (heng), vertical (shu), left-falling (pie), dot (dian), and turning (zhe), and code them as horizontal 1, vertical 2, left-falling 3, dot 4, turning 5 to form their radical codes.
The radical of every Chinese character is defined as its first stroke. If a Chinese character is itself one of the radicals in the radical set, or is formed by a variant of one of those radicals, the code of the character is the radical code of that radical. For example, " " is coded 1 and "乙" is coded 5.
If a Chinese character has only one stroke, it is a character without a body, such as "一" and "乙"; otherwise it is a character with a body. For a character with a body, the part outside the radical forms its body. For "人", for example, the radical is the left-falling stroke and the body is the dot. The radical takes the radical code and the body takes the body code; the radical code of "人" is 3 and its body code is 4. The radical code plus the body code is the code of the character. The order in which the codes are taken follows the relative position of the radical and the body, and at most N codes are taken; that is, anything after the N-th code obtained by the rules is omitted. The body code is taken as follows: following the writing order of the body, code the body stroke by stroke as horizontal 1, vertical 2, left-falling 3, dot 4, turning 5.
The code formed in this way is the standard code. Every Chinese character has a standard code. For example, when N=8: "一" 1, "乙" 5, "人" 34, "大" 134, "会" 341154, "构" 12343554, "码" 13251551, " " 41431251, and so on.
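The standard code can be sketched as a stroke-to-digit lookup followed by truncation to N. This is a simplified illustration: it assumes the strokes of the whole character are supplied in writing order and already classified into the five stroke types, so the first stroke acts as the radical and the rest as the body. The stroke names and the function name `standard_code` are hypothetical labels.

```python
# Five stroke types and their digits, as given in rule (a).
STROKE_DIGIT = {
    "heng": "1",  # horizontal
    "shu": "2",   # vertical
    "pie": "3",   # left-falling
    "dian": "4",  # dot
    "zhe": "5",   # turning
}


def standard_code(strokes, n=8):
    """Code the strokes in writing order and keep at most n codes."""
    return "".join(STROKE_DIGIT[stroke] for stroke in strokes)[:n]


if __name__ == "__main__":
    # Stroke sequences chosen to match the N=8 examples in the text.
    print(standard_code(["pie", "dian"]))           # code of 人
    print(standard_code(["heng", "pie", "dian"]))   # code of 大
```

Classifying each written stroke into one of the five types is the hard part in practice and is outside this sketch.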
(b) If a Chinese character has a left-right structure, the radical of its standard code is not the whole left or right part, and the whole left or right part can itself form a Chinese character; or if the character has a top-bottom structure, the radical of its standard code is not the whole top or bottom part, and the whole top or bottom part can itself form a Chinese character, then the character has a tolerant code:
i. The part that can form a Chinese character is defined as the virtual radical, and the digits corresponding to its sound code form the virtual radical code of the character. For "构", for example, "木" is taken as the virtual radical, and the virtual radical code is 68 (mu).
ii. The part of the character outside the virtual radical is defined as the virtual body. The virtual body is coded by the following rules, and the result is defined as the virtual body code:
(1) If the virtual body consists of 1 part, and that part is a radical or a Chinese character, take the radical code of that radical, or the digits corresponding to the sound code of that character, as the virtual body code. A body that is both a radical and a Chinese character is treated as a radical. For "构", for example, "勾" is taken as the virtual body, and the virtual body code is 468 (gou).
(2) If the virtual body consists of 1 part that is neither a radical nor a Chinese character, first determine the length of the character's virtual radical code, then take the strokes of the virtual body one by one in writing order, taking at most N minus the length of the virtual radical code, to form the virtual body code. For "市", for example, the virtual radical is "巾", the virtual radical code is 546 with length 3, and the virtual body code is 41.
(3) If the virtual body consists of 2 or more parts, each part contributes one code to the virtual body code: a part that forms a radical contributes the first code of its radical code; a part that does not form a radical but forms a Chinese character contributes the first digit of that character's sound code; any other part contributes the radical code of its first stroke. For " ", for example, the virtual radical is "龍" (long), the virtual body consists of two "龍", and the virtual body code is 55 (ll).
iii. Combine the virtual radical code and the virtual body code of the character to form its tolerant code, on the principle that "the virtual radical code plus the virtual body code is the code of the character; the order in which the codes are taken follows the relative position of the virtual radical and the virtual body; at most N codes are taken".
For example, when N=6: for "构", taking "木" as the virtual radical and "勾" as the virtual body gives the tolerant code 68468 (mugou); taking "勾" as the virtual radical and "木" as the virtual body gives the same result. For "码", taking "石" as the virtual radical and "马" as the virtual body gives the tolerant code 0462; taking "马" as the virtual radical and "石" as the virtual body gives the same result. For "如", taking "女" as the virtual radical and "口" as the virtual body gives the tolerant code 68568 (nvkou); taking "口" as the virtual radical and "女" as the virtual body gives the same result. For "竖", taking "立" as the virtual radical and the part outside "立" as the virtual body gives the tolerant code 225454 (2254li). As a further example, the tolerant code of " " is 54055 (" " meets the definition of a tolerant code and therefore has one: the digits 540 for the sound code of the first "龍", the virtual radical, plus the code 55 of the virtual body "龍龍"), and so on.
These definitions of the standard code and the tolerant code of Chinese characters make full use of the 10 keys on the numeric keypad. Because the code of a word may freely combine standard codes and tolerant codes, the efficiency of shape-code input can approach that of the keypad mixed pinyin input technique.
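The simplest tolerant-code case, where both the virtual radical and the virtual body are whole characters, can be sketched as a letter-to-digit lookup. The keypad table itself is not reproduced in this text, so the layout below is an assumption: the common telephone letter layout (abc on 2 through wxyz on 9) plus η on key 0, which agrees with the worked examples 68 (mu), 468 (gou), and 68568 (nvkou). The compressed symbols for zh, ch, and sh are left out because their keys cannot be confirmed here. The names `KEYS`, `digits`, and `tolerant_code` are hypothetical.

```python
# Assumed keypad: telephone letter layout plus η on key 0.
KEYS = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
    "0": "η",
}
LETTER_TO_DIGIT = {letter: key for key, letters in KEYS.items() for letter in letters}


def digits(sound_code):
    """Map a sound code such as 'mu' or 'waη' to its numeric keys."""
    return "".join(LETTER_TO_DIGIT[letter] for letter in sound_code)


def tolerant_code(first_part, second_part, n=6):
    """Join the sound-code digits of two parts in positional order, max n codes."""
    return (digits(first_part) + digits(second_part))[:n]


if __name__ == "__main__":
    print(tolerant_code("mu", "gou"))  # parts of 构, left part first
    print(tolerant_code("nv", "kou"))  # parts of 如, left part first
```

Cases (2) and (3) of rule ii, where the virtual body is coded by strokes or part by part, need stroke data and are not covered by this sketch.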
To make the above radical-body pronunciation input method easier to learn, the invention may refine it as follows (hereinafter Improvement 1): when the virtual body is itself a Chinese character, the virtual body code is still the digits corresponding to the sound code of that character; in all other cases, the strokes of the virtual body are taken one by one to form the virtual body code. This simplifies how the virtual body code is taken and makes the method easier to learn. For the character shown in Figure G2004800234193D00065, for example, the virtual radical is "龍" (long) and the virtual body consists of two "龍"; under this improvement, when N=6, the virtual body code is formed by the first 3 strokes (414) of the virtual body.
To make the invention still easier to learn, the radical-body pronunciation input method may also be refined as follows (hereinafter Improvement 2): in every case, the strokes of the virtual body are taken one by one to form the virtual body code. This further simplifies how the virtual body code is taken and makes the tolerant code as easy to learn as possible. For example, when N=6, "理" may be coded 112125 (standard code); or 920251 (taking "王" as the virtual radical and "里" as the virtual body, the virtual radical code is 920 and the virtual body code is 251, the digits for the first three strokes of "里"); or 112154 (taking "里" as the virtual radical and "王" as the virtual body, the virtual radical code is 54 and the virtual body code is 1121, the digits for the first four strokes of "王"); and so on.
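The two tolerant codes given for "理" under Improvement 2 can be reproduced with a short sketch. It assumes the virtual radical code (sound-code digits) and the stroke digits of the virtual body are already known, takes at most N minus the radical-code length from the body strokes, and orders the two pieces by their position in the character. The function name and the `radical_first` flag are hypothetical.

```python
def tolerant_code_by_strokes(radical_digits, body_stroke_digits, radical_first, n=6):
    """Virtual radical by sound-code digits, virtual body by stroke digits."""
    # The body contributes at most n minus the radical-code length.
    room = max(n - len(radical_digits), 0)
    body = body_stroke_digits[:room]
    # The order follows the relative position of radical and body.
    return radical_digits + body if radical_first else body + radical_digits


if __name__ == "__main__":
    # 理 with 王 (920) as virtual radical on the left, 里 coded by strokes.
    print(tolerant_code_by_strokes("920", "2511211", radical_first=True))
    # 理 with 里 (54) as virtual radical on the right, 王 coded by strokes.
    print(tolerant_code_by_strokes("54", "1121", radical_first=False))
```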
Under the above coding rules, many Chinese characters have both a standard code and a tolerant code, so the code of a word can take many possible combinations. This occupies a large amount of memory and makes implementation difficult on some embedded devices. The radical-body pronunciation input method and its improvements may therefore be refined further (hereinafter Improvement 3): only the last Chinese character of a word may use a tolerant code. For example, when N=6, "计算机" is coded 431234 ("机" uses the standard code) or 436854 ("机" uses the tolerant code, muji), and so on.
In addition, the above improvement may be refined further (hereinafter Improvement 4): every Chinese character is coded with its standard code only, and tolerant codes are never used. For example, when N=6, "计算机" is coded 431234, and so on.
In addition, Improvement 4 may be refined further (hereinafter Improvement 5): for a word consisting of 2 or more Chinese characters whose code is shorter than N codes, a 0 is appended to its code. For example, when N=6, the full code of "一一" is 110, which no longer duplicates the full code 11 of "二". This effectively reduces the chance that the full code of a one-character word duplicates the full code of a word consisting of 2 or more Chinese characters.
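Improvement 5 can be sketched as a small post-processing step on a word code. The sketch assumes that exactly one 0 is appended, which is what the example 110 suggests; the function name `append_zero` is hypothetical.

```python
def append_zero(code, num_chars, n=6):
    """Append one 0 to a short code of a word with 2 or more characters."""
    if num_chars >= 2 and len(code) < n:
        return code + "0"
    return code


if __name__ == "__main__":
    print(append_zero("11", num_chars=2))  # two-character word with code 11
    print(append_zero("11", num_chars=1))  # one-character word keeps its code
```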
In addition, the method may be refined further (hereinafter Improvement 6): a word consisting of 1 Chinese character may also be coded with the character's tolerant code. For example, when N=6, "机" is coded 123435 (standard code) or 6854 (tolerant code, muji), which speeds up the input of one-character words.
In addition, to further improve the efficiency of inputting one-character words, the invention may refine the method as follows (hereinafter Improvement 7): during input, one-character words also accept sound-code input and standard pinyin input. That is, any word consisting of 1 Chinese character can be input as described above, or by the sound code of that character, or by its standard pinyin. This solves the problem that some users are unsure of the stroke order of some characters and find them hard to input. In a specific implementation, one may choose to accept only standard pinyin, only sound codes, or both. This in effect achieves shape-sound compatible input.
The above improvements can themselves be refined further. For example, it can be stipulated that, in a word composed of 2 or more characters, a Chinese character that has both a standard code and a fault-tolerant code uses only the fault-tolerant code and not the standard code. This effectively reduces the number of word codes and suits embedded systems.
Shape-sound compatible input was mentioned above. The present invention can also realize sound-shape mixed input: during sound-code input, a word composed of 1 Chinese character can also be entered with the radical-body sound-code input method for that character.
In addition, the code of any word may drop its last code, its last two codes, its last three codes, ..., or its last (N-1) codes, always keeping the first code, to form a brevity code of the word.
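As an illustrative sketch only, the brevity-code rule above can be read as "every proper, non-empty prefix of the full code". The function name `brevity_codes` is hypothetical and is not part of the disclosed method.

```python
from typing import List


def brevity_codes(full_code: str) -> List[str]:
    """Return every brevity code of a full code, longest first.

    A brevity code keeps the first code and drops one or more trailing
    codes, so it is a proper, non-empty prefix of the full code.
    """
    return [full_code[:k] for k in range(len(full_code) - 1, 0, -1)]


# Full code "13420" is one of the example codes given in the description.
print(brevity_codes("13420"))
```

A one-code full code has no brevity code under this reading, so the function returns an empty list for it.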
At present there are two popular ways of selecting among duplicate codes. In the first, all duplicates are listed in order and the up, down, left, and right keys move through them; once the desired word is reached, it is chosen with the OK key. This takes many keystrokes and is very inefficient. The second is the long-press technique: following the order in which the duplicates are listed, the duplicate at a given position is chosen by a long press on the corresponding digit, and if the word is not in the current prompt line, the up and down keys turn the page. This improves on the first technique provided the long press is short, but the user must still locate the position of the candidate word, so efficiency remains low. The present invention can use either of these two methods for duplicate selection. In addition, building on the brevity codes above, it proposes a new duplicate-selection method, the three-line prompt function, which allows all words to be entered with full codes and brevity codes mixed and further improves input efficiency. The three-line prompt function works as follows:
For any code entered, the code itself serves as a candidate word (a candidate consisting entirely of digits). For example, after 234567 is entered there is at least one candidate, namely 234567 itself. Each screen of the prompt line shows 3 candidates. For fast input, the 3 candidates are selected with OK, *, and # respectively.
This digit candidate is normally placed second on the first screen of the prompt line. However, whenever the entered code (full code or brevity code) matches no word other than the digit candidate, the digits are shown first and the second and third positions are left empty.
Apart from that case: (a) when the entered code has length 1, the first position shows a Chinese character (full code or brevity code), an English letter, or another character, the second shows the digit, and the third shows a punctuation mark; these are followed by the other candidates whose full code has length 1, and then by all words whose brevity code has length 1. (b) When the entered code is longer than 1 and matches full codes, the first position shows the most frequently used word among the full-code matches, the second shows the digits, and the third shows the most frequently used word among the brevity-code matches (if there is no brevity-code match, the other full-code matches continue there); the remaining full-code duplicates follow, and then all brevity-code duplicates. When the code matches no full code, the first position shows the most frequently used brevity-code match, the second shows the digits, and the other brevity-code matches follow.
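The ordering described in case (b), together with the case in which no word matches, can be sketched as follows. This is a minimal illustration under stated assumptions: the function names `candidate_list`, `screens`, and `select` are hypothetical, the two match lists are assumed to arrive already sorted by descending frequency of use, and case (a) for one-digit codes is not modelled.

```python
def candidate_list(code, full_matches, brevity_matches):
    """Order candidates for an entered code longer than 1 digit.

    full_matches / brevity_matches: words whose full code / brevity code
    equals `code`, most frequently used first (assumed input).
    """
    if full_matches:
        # Third slot: best brevity match, else the next full-code match.
        third = brevity_matches[:1] or full_matches[1:2]
        rest_full = full_matches[1:] if brevity_matches else full_matches[2:]
        return [full_matches[0], code] + third + rest_full + brevity_matches[1:]
    if brevity_matches:
        return [brevity_matches[0], code] + brevity_matches[1:]
    # No word matches: digits first, the other two positions empty.
    return [code, "", ""]


def screens(candidates, per_screen=3):
    """Split the candidates into prompt-line screens of 3, padding the last."""
    pages = [candidates[i:i + per_screen]
             for i in range(0, len(candidates), per_screen)]
    if pages:
        pages[-1] = pages[-1] + [""] * (per_screen - len(pages[-1]))
    return pages


def select(screen, key):
    """Pick position 1, 2, or 3 of the current screen with OK, *, or #."""
    return screen[{"ok": 0, "*": 1, "#": 2}[key]]


first_screen = screens(candidate_list("24", ["fore-telling"], ["Beijing"]))[0]
print(select(first_screen, "#"))
```

Keeping the digit string in a fixed second position is what lets the * key always enter the digits on the first screen, without any mode switch.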
The greatest advantage of the three-line prompt function is that no long press is needed. Whatever screen of the prompt line a candidate is on, and whether it is first, second, or third there, once the user has paged to it with the up and down keys (no paging is needed on the first screen), it can be entered directly with OK, *, or # respectively, which is far more efficient than the long-press technique. For example, if after pressing 24 the first, second, and third candidates of the prompt line are "fore-telling", "24", and "Beijing", then pressing # enters "Beijing", pressing * enters "24", and pressing OK enters "fore-telling". If the desired word is not on the first screen of the prompt line after its code is entered, the user pages with the up and down keys. For example, suppose the desired word is another word whose brevity code can also be 24, but the first screen after pressing 24 shows only "fore-telling", "24", and "Beijing"; paging down will then always reach the desired word.
"Sound code" as used in the present invention means sound code I + sound code II. "Chinese character" as used here means the Chinese character itself.
"Character" as used in the present invention means a character in any character set (such as ISO 10646 or Unicode, GB18030, GBK, GB2312, BIG5, and so on, and their supersets or subsets).
In the character input method of the present invention, in a Chinese input system the code of a punctuation mark is the digit key corresponding to the first code of the sound code of its Chinese pronunciation; in a non-Chinese input system, the code of a punctuation mark is the digit key corresponding to the first letter of the foreign-language word for that punctuation mark. For example, in an English input system the code of the punctuation mark "," is "2", the digit corresponding to "c", the first letter of "comma". Likewise, the code of the punctuation mark "." is "3", the digit corresponding to "d", the first letter of "dot"; ".net" is then encoded as "3638" and ".com" as "3266", and so on. Other punctuation marks follow the same rule.
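The English-system punctuation rule can be sketched as below. This is an illustration under assumptions: the letter layout is taken to be the common telephone keypad (abc on 2, def on 3, and so on), which agrees with the examples in the text but is not confirmed by the keyboard figure, and the names `KEYPAD`, `PUNCT_WORDS`, and `encode_english` are hypothetical. Only the two punctuation names given in the text are included.

```python
# Assumed letter layout: the common telephone keypad.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}
LETTER_TO_DIGIT = {ch: d for d, letters in KEYPAD.items() for ch in letters}

# English names of punctuation marks; only the two given in the text.
PUNCT_WORDS = {",": "comma", ".": "dot"}


def encode_english(text):
    """Encode letters, digits, and known punctuation as keypad digits."""
    out = []
    for ch in text.lower():
        if ch in "0123456789":
            out.append(ch)                       # a digit encodes as itself
        elif ch in LETTER_TO_DIGIT:
            out.append(LETTER_TO_DIGIT[ch])      # a letter encodes as its key
        elif ch in PUNCT_WORDS:
            # A punctuation mark encodes as the key of its name's first letter.
            out.append(LETTER_TO_DIGIT[PUNCT_WORDS[ch][0]])
        else:
            raise ValueError(f"no code defined for {ch!r}")
    return "".join(out)


print(encode_english(".net"), encode_english(".com"))
```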
Embodiment
Implementation method one: the selected keyboard is defined as:
Figure G2004800234193D00101
The coding rule is: N=8, with the coding rule of Improvement 5. That is, the code of a Chinese character is its standard code under the radical-body sound-code input method, and the coding rule for a word is: the first code of the code of every character + the remainder of the last character's code after its first code (if longer than N codes, the first N codes are taken). After a word is encoded by this rule, if it is composed of 2 or more Chinese characters and its code is shorter than N codes, a 0 is appended to its code. Under this rule, once the maximum code length N is fixed, the code of every word is uniquely determined. The method therefore takes little memory and little system overhead, and is easy to implement.
This yields a character input method that takes the word as its unit. For example: "three" is encoded as "111" (standard code); "individual" is encoded as "342" (standard code); the two-character word formed from "three" and "individual" is encoded as "13420" (first code of the standard code of "three" + the standard code of "individual"; the total is shorter than N codes, so a 0 is appended); "computer" is encoded as "43123435" (first code, dot, of the standard code of "meter" + first code, left-falling stroke, of the standard code of "calculation" + the standard code of "machine"); "not to advance is to go back" is encoded as "11251154" (first codes of the standard codes of the first three characters, horizontal, horizontal, vertical, + the first five codes of the standard code of the last Chinese character, "move back"); "Indonesia" is encoded as "34511224" (left-falling, dot, fold, horizontal + the first four codes of the standard code of "Asia"); "People's Republic of China" is encoded as "23351325" (vertical, left-falling, left-falling, fold, horizontal, left-falling + the first 2 codes of the standard code of "state"); "T9" is encoded as "89" ("t"+"9"); "123456" is encoded as "123456"; "Intelligent ABC" is encoded as "35222" ("intelligence" and "energy" each contribute the first code of their standard code, left-falling and fold, + "A" 2, "B" 2, "C" 2); "Mobile" is encoded as "662453" ("M"+"o"+"b"+"i"+"l"+"e"); ":" is encoded as the digit corresponding to the first code of the sound code of "colon"; a further symbol (rendered as a figure) is encoded as "4222"; ":)" is encoded as 65 (colon 6 + bracket 5), and so on.
The above codes of course also have brevity-code forms. For example: the brevity code of "three" is "1" or "11"; the brevity code of the two-character word formed from "three" and "individual" is "1", "13", "134", or "1342"; the brevity code of "computer" is "4", "43", "431", "4312", "43123", "431234", or "4312343"; the brevity code of "not to advance is to go back" is "1", "11", "112", "1125", "11251", "112511", or "1125115"; the brevity code of "Indonesia" is "3", "34", "345", "3451", "34511", "345112", or "3451122"; the brevity code of "People's Republic of China" is "2", "23", "233", "2335", "23351", "233513", or "2335132"; the brevity code of "T9" is "8"; the brevity code of "123456" is "1", "12", "123", "1234", or "12345"; the brevity code of "Intelligent ABC" is "3", "35", "352", or "3522"; the brevity code of "Mobile" is "6", "66", "662", "6624", or "66245"; the brevity code of the symbol in Figure G2004800234193D00113 is "422", "42", or "4"; the brevity code of ":)" is 6, and so on.
In practice, the above coding method uses the three-line prompt function. For example, on pressing 3: the first position shows the most frequently used word among all words whose full code or brevity code is 3, the second shows "3", and the third shows ";", so that non-Chinese-character information such as digits and punctuation marks can be entered quickly. On pressing 24: the first position shows "fore-telling" (the most frequently used full-code match), the second shows "24", and the third shows "Beijing" (the most frequently used brevity-code match); this is more efficient than the T9 new stroke input method of the related art. On pressing 133111: the first position shows "133111", and the second and third positions are empty because there is no corresponding candidate, so the digits are entered directly; and so on.
This implementation method can of course adopt Improvement 4 instead. The only difference is that when a word composed of 2 or more Chinese characters has a code shorter than N codes, no 0 is appended to its code. For example, the two-character word formed from "three" and "individual" is encoded as "1342", with brevity codes "134", "13", and "1".
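The word-coding rule of implementation method one can be sketched as below. This is an illustration only: the function name `word_code` and its `pad` switch are hypothetical, and the per-character standard codes are passed in as digit strings rather than looked up, because the stroke keyboard itself is given only as a figure.

```python
def word_code(char_codes, n=8, pad=True):
    """Encode a word from the standard codes of its characters.

    Rule: first code of every character + remainder of the last character's
    code, cut to the first n codes. With pad=True (Improvement 5), a word of
    2 or more characters whose code is shorter than n gets a single 0
    appended; with pad=False (Improvement 4) nothing is appended.
    """
    heads = "".join(code[0] for code in char_codes)
    code = (heads + char_codes[-1][1:])[:n]
    if pad and len(char_codes) >= 2 and len(code) < n:
        code += "0"
    return code


# "three" = 111 and "individual" = 342 are the standard codes from the text.
print(word_code(["111", "342"]))             # Improvement 5
print(word_code(["111", "342"], pad=False))  # Improvement 4
```

Because only the first code of each non-final character is used, the code of a long word stays bounded by N regardless of how many characters it contains.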
Implementation method two: the keyboard definition is the same as in implementation method one. N=6, with the coding rules of Improvement 5 and Improvement 7. Besides having the advantages of implementation method one, it allows Chinese characters whose stroke order the user is unsure of to be entered directly by standard Chinese pinyin or by sound code.
The difference from implementation method one is that word codes (full codes and brevity codes) longer than 6 codes are cut to their first 6 codes; for example, "People's Republic of China" is encoded as "233513". In addition, words composed of 1 Chinese character also accept standard Chinese pinyin or sound-code input. For example, the character "code" can be entered in the manner of implementation method one, or with its pinyin 62 (ma) or the brevity code 6 of that pinyin; "group" can be entered in the manner of implementation method one, with its pinyin 2264 (bang) or the brevity codes 226, 22, 2, or with its sound code 220 (ba η) or the brevity codes 22, 2; and so on.
Of course, a specific implementation may use only standard Chinese pinyin, only the sound code, or both at once.
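The compatible input of implementation method two amounts to accepting several full codes, and all their brevity codes, for the same one-character word. The sketch below illustrates this with the "group" example. It assumes the common telephone keypad letter layout, which matches the digits quoted in the text; the names `pinyin_digits` and `accepted_codes` are hypothetical, and the sound code 220 is taken from the text as given rather than derived.

```python
# Assumed letter layout: the common telephone keypad.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}
LETTER_TO_DIGIT = {ch: d for d, letters in KEYPAD.items() for ch in letters}


def pinyin_digits(pinyin):
    """Map a toneless pinyin spelling to its keypad digits."""
    return "".join(LETTER_TO_DIGIT[ch] for ch in pinyin.lower())


def accepted_codes(full_codes):
    """Collect every full code and every brevity code (prefix) of each."""
    accepted = set()
    for code in full_codes:
        accepted.update(code[:k] for k in range(1, len(code) + 1))
    return accepted


# "group": pinyin bang, sound code 220 (both from the text).
group_codes = accepted_codes([pinyin_digits("bang"), "220"])
print(sorted(group_codes))
```

In a full implementation the standard stroke code of the character would simply be one more entry in the list passed to `accepted_codes`.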
Implementation method three: the keyboard definition is the same as in implementation method one. N=6, with the coding rules of Improvements 5, 6, and 7; fault-tolerant codes are determined by the basic coding rule.
The other codes are the same as in implementation method two, but words composed of 1 Chinese character also accept fault-tolerant code input. For example, "code" can be entered in the manner of implementation method two, or with its fault-tolerant code 0462 (the virtual radical is "stone" and the virtual body is "yard", giving 0462), and of course with the brevity codes 046, 04, 0 of 0462. Likewise, the character in Figure G2004800234193D00122 can be entered in the manner of implementation method two, or with its fault-tolerant code 56055 (lo η ll) and the brevity codes 5605, 560, 56, 5, and so on.
Because words composed of 1 Chinese character can be entered with fault-tolerant codes, their input efficiency is greatly improved.
Implementation method four: the keyboard definition is the same as in implementation method one. N=6, with the coding rules of Improvements 3 and 7; fault-tolerant codes are determined by the basic coding rule.
The other codes are the same as in implementation method three, but the last Chinese character of a word may be encoded with its fault-tolerant code, and no 0 is appended when the code of a word composed of 2 or more Chinese characters is shorter than N codes. For example, when N=6, "computer" is encoded as 431234 ("machine" uses the standard code) or 436854 ("machine" uses the fault-tolerant code, muji); "machine" is encoded as 123435 (standard code) or 6854 (fault-tolerant code, muji). These codes also all have brevity-code forms: the brevity codes of "computer" are 43123, 4312, 431, 43, 4 or 43685, 4368, 436, and so on; the brevity codes of "machine" are 12343, 1234, 123, 12, 1 or 685, 68, 6, and so on.
Because the last Chinese character of every word can be entered with its fault-tolerant code, word input efficiency is greatly improved.
Implementation method five: the keyboard definition is the same as in implementation method one. N=6. Chinese characters are encoded by the radical-body sound-code input method; the coding rule for a word is the first code of every character's code + the remainder of the last character's code after its first code (if longer than 6 codes, the first 6 codes are taken); Improvement 7 is adopted.
The other codes are the same as in implementation method four, but in a word composed of 2 or more characters, the Chinese characters other than the last may also use fault-tolerant codes, and no 0 is appended even when the total code length of such a word is shorter than N codes. Take "mechanism" as an example: the first code of "machine" is 1 (first code of the standard code) or 6 (first code of the fault-tolerant code), and the first 5 codes of "structure" are 12343 (standard code) or 68468 (fault-tolerant code, mugou), so "mechanism" is encoded as 112343, 612343, 168468, or 668468. Its brevity codes are 11234, 1123, 112, 11, 1, or 61234, 6123, 612, 61, 6, or 16846, 1684, 168, 16, or 66846, 6684, 668, 66, and so on.
In this implementation method, any Chinese character in a word can be encoded with its standard code or with its fault-tolerant code, and the two can be combined freely, which is very convenient for the user. At the same time, because the scheme uses the 10 digit keys for input, it makes full use of the numeric keypad; input efficiency is high and comparable to the input speed of keypad pinyin mixed input techniques.
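The free combination of standard and fault-tolerant codes can be sketched by enumerating every choice per character. The function name `all_word_codes` is hypothetical. For "structure" the text gives only the first five codes of the standard code, so that prefix is used here; with N=6 the result does not depend on the missing tail.

```python
from itertools import product


def all_word_codes(char_variants, n=6):
    """Enumerate the word codes for every per-character choice of code.

    char_variants: one list per character holding its alternative codes
    (standard code, fault-tolerant code). Each combination is encoded as
    first code of every character + remainder of the last character's
    code, cut to the first n codes.
    """
    codes = set()
    for choice in product(*char_variants):
        heads = "".join(code[0] for code in choice)
        codes.add((heads + choice[-1][1:])[:n])
    return sorted(codes)


# "machine": 123435 (standard) or 6854 (fault-tolerant, muji).
# "structure": 12343... (standard, first five codes) or 68468 (mugou).
print(all_word_codes([["123435", "6854"], ["12343", "68468"]]))
```

Restricting each inner list to a single entry reproduces the narrower variants, for example keeping only the fault-tolerant code where one exists.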
In implementation methods three, four, and five, if fault-tolerant codes are instead determined by the coding rule of Improvement 1 or Improvement 2, only the fault-tolerant body codes of some Chinese characters change, and the specific implementation is otherwise essentially the same. For example, after adopting Improvement 1, the codes of "mechanism" in implementation method five and of "machine" and "structure" in implementation methods three, four, and five do not change, but the code of the character in Figure G2004800234193D00141 in implementation methods three, four, and five does: its fault-tolerant body code was 55 and becomes 414. After adopting Improvement 2, the codes of "mechanism" in implementation method five and of "machine" and "structure" in implementation methods three, four, and five all change: "machine" is encoded as 123435 (standard code) or 6835 (fault-tolerant code, mu35), "structure" is encoded as 123435 (standard code) or 683554 (fault-tolerant code, mu3554), and "mechanism" is encoded as 112343, 612343, 168355, or 668355. Their brevity codes change accordingly; for example, the brevity codes of "machine" are 12343, 1234, 123, 12, 1 or 683, 68, 6, and so on.
Implementation method six: implementation method five is restricted as follows: when a Chinese character has both a standard code and a fault-tolerant code, only its fault-tolerant code is used in encoding words composed of 2 or more characters. This greatly reduces the number of codes for words composed of 2 or more characters.
The other codes are the same as in implementation method five, but a Chinese character in a word composed of 2 or more characters uses its standard code only if it has a standard code and no fault-tolerant code; in every other case only the fault-tolerant code is used, not the standard code. For example, "mechanism" is encoded as 668468 ("machine" contributes the first code of its fault-tolerant code, "structure" the first 5 codes of its fault-tolerant code), with brevity codes 66846, 6684, 668, 66, and so on. With this refinement the total number of codes is small and little storage is needed, which suits embedded devices with strict memory constraints.
Implementation method seven: the selected keyboard is defined as:
Figure G2004800234193D00142
The coding rule is: N=6, and Chinese characters use the syllable code. The coding rule for a word is: the first code of the code of every character + the remainder of the last character's code after its first code (if longer than 6 codes, the first 6 codes are taken). In addition, when a word is composed of two or more characters, its last character is a Chinese character, and its code is shorter than N codes, the tone code of the last Chinese character is appended to the code of the word.
This yields a character input method that takes the word as its unit. For example: "three" is encoded as "726" (san); "individual" is encoded as "42" (ge); the two-character word formed from "three" and "individual" is encoded as "7424" (s+ge+4); "computer" is encoded as "57541" (j+s+ji+1); "not to advance is to go back" is encoded as "259884" (b+j+z+tui); "Indonesia" is encoded as "936992" (y+d+n+x+ya); "People's Republic of China" is encoded as "147644"; "T9" is encoded as "89" ("t"+"9"); "123456" is encoded as "123456"; "Intelligent ABC" is encoded as "16222" (Figure G2004800234193D00153); "Mobile" is encoded as "662453" ("M"+"o"+"b"+"i"+"l"+"e"); ":" is encoded as the digit corresponding to the first code of the sound code of "colon"; a further symbol (rendered as a figure) is encoded as "4222"; ":)" is encoded as 65 (colon 6 + bracket 5), and so on.
The above codes all have brevity-code forms. For example: the brevity code of "three" is "72" or "7"; the brevity code of "individual" is "4"; the brevity code of the two-character word formed from "three" and "individual" is "742", "74", or "7"; the brevity code of "computer" is "5754", "575", "57", or "5"; the brevity code of "not to advance is to go back" is "25988", "2598", "259", "25", or "2"; the brevity code of "Indonesia" is "93699", "9369", "936", "93", or "9"; the brevity code of "People's Republic of China" is "14764", "1476", "147", "14", or "1"; the brevity code of "T9" is "8"; the brevity code of "123456" is "12345", "1234", "123", "12", or "1"; the brevity code of "Intelligent ABC" is "1622", "162", "16", or "1"; the brevity code of "Mobile" is "66245", "6624", "662", "66", or "6"; the brevity code of the symbol in Figure G2004800234193D00156 is "422", "42", or "4"; the brevity code of ":)" is "6".
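The syllable-code word rule of implementation method seven, including the appended tone code, can be sketched as follows. This is an illustration under assumptions: the function name `syllable_word_code` is hypothetical, the tone code is assumed to be the tone number itself (consistent with the "+1" and "+4" in the examples, although the tone-key figure is not reproduced here), and the per-character digit codes in the usage lines were derived from the pinyin assuming the common telephone keypad layout.

```python
def syllable_word_code(char_codes, last_tone=None, n=6):
    """Encode a word from the syllable codes of its characters.

    Rule: first code of every character + remainder of the last character's
    code, cut to the first n codes. If the word has 2 or more characters,
    its last character is a Chinese character (last_tone is given), and the
    code is shorter than n, the tone code of the last character is appended.
    """
    heads = "".join(code[0] for code in char_codes)
    code = (heads + char_codes[-1][1:])[:n]
    if len(char_codes) >= 2 and last_tone is not None and len(code) < n:
        code += str(last_tone)
    return code


# "computer": ji, suan, ji (tone 1) -> j + s + ji + 1
print(syllable_word_code(["54", "7826", "54"], last_tone=1))
# "Indonesia": yin, du, ni, xi, ya -> already 6 codes, so no tone is added
print(syllable_word_code(["946", "38", "64", "94", "92"], last_tone=4))
```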
Implementation method eight: the keyboard and coding rule are the same as in implementation method seven, with the added provision that words composed of one Chinese character also accept standard Chinese pinyin input.
The other codes are the same as in implementation method seven, but a word composed of one Chinese character can also be entered with its standard Chinese pinyin. For example, "standard" can be entered by the method of implementation method seven, or with 9486 (zhun) or its brevity codes 948, 94, 9, and so on.
Implementation method nine: the keyboard and coding rule are the same as in implementation method eight, with the added provision that a word composed of 2 characters is entered with the codes of the 2 characters that form it (if longer than N codes, the first N codes are taken).
The other codes are the same as in implementation method eight, but the coding rule for words composed of 2 characters changes. For example, the two-character word formed from "three" and "individual" is encoded as "726434" (san+ge+4), with brevity codes 7, 72, 726, 7264, 72643, and so on. The advantage is that, for all words composed of Chinese characters and encoded by syllable code (not including words composed of 1 Chinese character that are encoded by Chinese pinyin), words composed of 1 Chinese character have codes 2 to 4 codes long, words composed of 2 or 3 Chinese characters have codes 5 to 6 codes long, and all other words have codes exactly 6 codes long. Words composed of 1 Chinese character therefore never collide with the full codes of words composed of 2 or more Chinese characters, which simplifies computer processing.
Implementation method ten: the keyboard and coding rule are the same as in implementation method nine, with the added provision that a word composed of 3 characters is entered with the first code of the first character's code + the codes of the last two characters (if longer than N codes, the first N codes are taken).
The other codes are the same as in implementation method nine, but the coding rule for words composed of 3 characters changes. For example, "computer" is encoded as "578265" (j+suan+j), with brevity codes 5, 57, 578, 5782, 57826, and so on. The advantage is this: "calculating" is encoded as "547826" (ji+suan), with brevity codes 5, 54, 547, 5478, 54782, and "department of computer science" is encoded as "575944" (j+s+j+xi+4), with brevity codes 5, 57, 575, 5759, 57594. The brevity codes of "calculating" and "computer" coincide only at code length 1, and the brevity codes of "department of computer science" and "computer" coincide only at code lengths 1 and 2, so all three words can be entered with the greatest efficiency.
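The length-dependent rules of implementation methods nine and ten can be sketched in one function. This is an illustration under assumptions: the name `word_code_by_length` is hypothetical, the tone code is assumed to be the tone number itself, and the per-character digit codes in the usage lines were derived from the pinyin assuming the common telephone keypad layout.

```python
def word_code_by_length(char_codes, last_tone=None, n=6):
    """Encode a word with a rule chosen by its number of characters.

    1 or 2 characters: the codes of the characters, concatenated.
    3 characters: first code of the first character + codes of the last two.
    4 or more: first code of every character + remainder of the last code.
    The result is cut to the first n codes; a word of 2 or more characters
    that ends in a Chinese character (last_tone given) and is still shorter
    than n gets the tone code of its last character appended.
    """
    count = len(char_codes)
    if count <= 2:
        raw = "".join(char_codes)
    elif count == 3:
        raw = char_codes[0][0] + char_codes[1] + char_codes[2]
    else:
        raw = "".join(code[0] for code in char_codes) + char_codes[-1][1:]
    code = raw[:n]
    if count >= 2 and last_tone is not None and len(code) < n:
        code += str(last_tone)
    return code


print(word_code_by_length(["54", "7826"], last_tone=4))              # "calculating"
print(word_code_by_length(["54", "7826", "54"], last_tone=1))        # "computer"
print(word_code_by_length(["54", "7826", "54", "94"], last_tone=4))  # "department of computer science"
```

Giving short words more of their own syllable codes is what separates the brevity codes of the three related words after the first one or two keystrokes.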
Implementation method eleven: the keyboard and coding rule are the same as in implementation method ten, with the added provision that when the sound code I of a Chinese character is one of the symbols in Figure G2004800234193D00161, zh, ch, or sh may be used as the code of its sound code I; when the sound code II of a Chinese character contains η, ng may replace η in forming the code of sound code II; the code of a Chinese character = the digit keys corresponding to the code of its sound code I + the digit keys corresponding to the code of its sound code II.
The other codes are the same as in implementation method ten, but if a word contains a Chinese character whose standard Chinese pinyin begins with zh, ch, or sh, or ends with ng, the code of the word also has a fault-tolerant form. That is, when the sound code I of a Chinese character is one of the symbols in Figure G2004800234193D00171, zh, ch, or sh may be used as the code of its sound code I; when the sound code II of a Chinese character contains η, ng may replace η in forming the code of sound code II; the code of a Chinese character = the digit keys corresponding to the code of its sound code I + the digit keys corresponding to the code of its sound code II. For example, "preparation" meets this condition: besides input by the method of implementation method ten, it has the fault-tolerant form "948622" (zhun+be); "standardization" can be entered by the method of implementation method ten and also has the fault-tolerant form "294864" (b+zhun+h). These fault-tolerant codes of course also have brevity-code forms, and so on.
Implementation method twelve: the keyboard and coding rule are the same as in implementation method ten, with the added provision that words composed of one Chinese character also accept radical-body sound-code input.
The other codes are the same as in implementation method ten, but a word composed of one Chinese character can be entered not only by the method of implementation method ten but also with its code under the radical-body sound-code input mode. For example, "machine" also accepts its standard code 123435 and its fault-tolerant code 6854 (muji); these codes also have brevity-code forms, and so on.
Compared with the prior art, the present invention has the following notable advantages:
1. Input takes the word as its unit; the average code length and the code duplication rate are low, and input efficiency is high;
2. For Chinese characters, mixed sound and stroke-shape input is possible, which improves input efficiency;
3. The character input method of the present invention has a low learning threshold; the coding is natural and fluent, and it is easy to learn and use;
4. It is highly versatile: without switching the input state, Chinese characters, digits, punctuation marks, English letters, and other characters can be entered mixed together, which greatly improves input efficiency.

Claims (14)

1. A character input method taking the word as its unit, characterized in that characters are entered with a numeric keypad, the numeric keypad being defined as follows:
zh, ch, sh, ng, and ü in the toneless pinyin of a Chinese character are transformed respectively into the symbols of
Figure F2004800234193C00012
η, and v, to form the compressed pinyin code; when the compressed pinyin code has a code length of 1, the compressed pinyin code is repeated to form the pinyin code; when the compressed pinyin code has a code length greater than 1, the compressed pinyin code itself forms the pinyin code; the first code of the pinyin code is defined as sound code I, and the pinyin code other than its first code is defined as sound code II;
the sound codes represented by those symbols and η are defined on the two key positions 0 and 1, any two of these sound codes being defined on each key position;
characters are entered with the word as the unit according to the following rules:
(1) the code of a Chinese character = the digit key corresponding to sound code I of the Chinese character + the digit keys corresponding to sound code II of the Chinese character; the code of a numeric character is the digit key of that character itself; the code of an English letter is the digit key on the above numeric keypad corresponding to that letter; the code of a punctuation mark is the digit key corresponding to the first code of the code of its pronunciation;
(2) if the word is composed of 1 character, its code is the code of that character, and the word is entered by pressing the keys in order;
(3) if the word is composed of 2 characters, it is encoded with the codes of the 2 characters that form it; if longer than N codes, the first N codes are taken, N being a natural number ≥ 1; the word is entered by pressing the keys in order;
(4) if the word is composed of 3 characters, it is encoded with the first code of the first character's code + the codes of the last two characters; if longer than N codes, the first N codes are taken; the word is entered by pressing the keys in order;
(5) if the word is composed of more than 3 characters, it is encoded with the first code of the code of every character + the remainder of the last character's code after its first code; if longer than N codes, the first N codes are taken; the word is entered by pressing the keys in order.
2. The character input method taking the word as its unit according to claim 1, characterized in that, in place of (4) above, if the word is composed of 3 characters, it is encoded with the first code of the code of every character + the remainder of the last character's code after its first code; if longer than N codes, the first N codes are taken; the word is entered by pressing the keys in order.
3. according to claim 1 and 2 is the characters input method of unit with the speech, it is characterized in that: according to following relation, define tone code on numeric keypad:
Be made up of two or more characters when a speech, its last character is a Chinese character, and during the not enough N sign indicating number of the code length of this speech, adds the tone code of last Chinese character behind the coding of this speech, and the order keystroke is imported this speech.
4. The word-based character input method according to claim 3, characterized in that: if a word consists of 1 Chinese character, it may also be entered with the numeric keys corresponding to the standard Hanyu Pinyin of the Chinese character that constitutes the word, by pressing the keys in order.
5. The word-based character input method according to claim 4, characterized in that: the above phonetic code [Figure F2004800234193C00022] is defined on key position 1, and [Figure F2004800234193C00023] η is defined on key position 0.
6. The word-based character input method according to claim 4, characterized in that: when phonetic code I of a Chinese character is [Figure F2004800234193C00024], the spellings zh, ch, sh may be used as the code of its phonetic code I; when phonetic code II of a Chinese character contains η, ng may be used in place of η to form the code of phonetic code II; the code of a Chinese character = the numeric keys corresponding to the code of phonetic code I of the Chinese character + the numeric keys corresponding to the code of phonetic code II of the Chinese character.
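As a rough illustration of the letter-spelled form, the sketch below converts the two phonetic codes of a Chinese character into key presses. It rests on a strong assumption: the keypad layout is not reproduced in this text, so the common 12-key telephone letter assignment (2 = abc through 9 = wxyz) is substituted for it. The names `KEYPAD`, `keys_for` and `hanzi_code` are hypothetical, and letters outside a to z are not handled.

```python
# Assumed layout: the common 12-key telephone letter assignment,
# used here only as a stand-in for the keypad defined in the claims.
KEYPAD = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
          "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}
LETTER_TO_KEY = {ch: key for key, letters in KEYPAD.items() for ch in letters}


def keys_for(spelling):
    """Map each letter of a spelling to its numeric key."""
    return "".join(LETTER_TO_KEY[ch] for ch in spelling.lower())


def hanzi_code(code_i, code_ii):
    """Code of a Chinese character = keys of phonetic code I + keys of phonetic code II."""
    # ng may stand in for η in phonetic code II.
    code_ii = code_ii.replace("η", "ng")
    return keys_for(code_i) + keys_for(code_ii)
```

Under this assumed layout, `hanzi_code("zh", "aη")` and `hanzi_code("zh", "ang")` both give the same key sequence, which is the point of the ng substitution.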
7. The word-based character input method according to claim 4, characterized in that: in place of (3) above, if a word consists of 2 characters, its code is the first code of the code of every character + the remaining codes, other than the first code, of the last character's code; if this is longer than N codes, only the first N codes are taken; the word is entered by pressing the keys in order.
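The alternative rule, applied to 2-character words here and to 3-character words in claim 2, is the same formula that rule (5) uses for longer words. A minimal sketch, with the hypothetical name `first_codes_rule`, an assumed default `n=6`, and placeholder digit strings:

```python
def first_codes_rule(char_codes, n=6):
    """First code of every character + the rest of the last character's code.

    char_codes: list of non-empty digit strings, one per character (hypothetical input).
    n: maximum code length N (assumed default).
    """
    if not char_codes or any(not c for c in char_codes):
        raise ValueError("every character needs a non-empty code")
    code = "".join(c[0] for c in char_codes) + char_codes[-1][1:]
    # Keep only the first N codes.
    return code[:n]
```

For placeholder codes "23" and "45", this variant gives "245", one key shorter than the plain concatenation "2345".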
8. The word-based character input method according to claim 4, characterized in that: during input, a word consisting of one Chinese character may also be entered in a compatible radical-body phonetic code input mode, the radical-body phonetic code input mode being:
(a) take a radical set that contains, and only contains, the five single-stroke radicals horizontal (heng), vertical (shu), left-falling (pie), dot (dian) and turning (zhe), and encode them as horizontal 1, vertical 2, left-falling 3, dot 4, turning 5 to form their radical codes;
The radical of every Chinese character is defined to be the first stroke of that character; if a Chinese character is itself one of the radicals in the defined radical set, or consists of a variant of such a radical, the code of the Chinese character is the radical code corresponding to that radical;
If a Chinese character has only one stroke, it is a Chinese character without a body; otherwise it is a Chinese character with a body, and for a Chinese character with a body, the part other than the radical constitutes the body of the character; the radical takes the radical code and the body takes the body code; the radical code plus the body code is the code of the Chinese character, the order in which the codes are taken follows the relative position of the radical and the body, and at most N codes are taken, that is, whatever follows the N-th code obtained under the rules is omitted; the body code is taken as follows: following the writing order of the body, each stroke of the body is encoded according to the rule horizontal 1, vertical 2, left-falling 3, dot 4, turning 5, forming the body code of the Chinese character;
The code above is the standard code, and every Chinese character has a standard-code form;
(b) if the shape of a Chinese character is of the left-right type, but the radical of its standard code is not the whole part on its left or right, and the whole left or right part can itself constitute a Chinese character; or the shape of a Chinese character is of the top-bottom type, but the radical of its standard code is not the whole part at its top or bottom, and the whole top or bottom part can itself constitute a Chinese character, then the Chinese character has a tolerant code:
i. the above part that can constitute a Chinese character is defined as the virtual radical, and the digits corresponding to its phonetic code form the virtual radical code of the Chinese character;
ii. the part of the Chinese character outside the virtual radical is defined as the virtual body, which is encoded according to the following rules, the result being defined as the virtual body code: (1) if the virtual body consists of 1 part, and that part is a radical or a Chinese character, the radical code of that radical, or the digits corresponding to the phonetic code of that Chinese character, form the virtual body code; a body that is both a radical and a Chinese character is treated as a radical; (2) if the virtual body consists of 1 part, and that part is neither a radical nor a Chinese character, the code length of the virtual radical code of the Chinese character is determined first, and the strokes of the virtual body are then taken one by one in its writing order, taking at most as many codes as the difference between N and the code length of the virtual radical code, to form the virtual body code; (3) if the virtual body consists of 2 or more parts, one code is taken from each part to form the virtual body code of the Chinese character: a part that constitutes a radical takes the first code of the radical code of that radical, a part that does not constitute a radical but constitutes a Chinese character takes the first digit corresponding to the phonetic code of that Chinese character, and in all other cases the radical code corresponding to the first stroke of the part is taken;
iii. the virtual radical code and the virtual body code of the Chinese character are combined according to the principle "the virtual radical code plus the virtual body code is the code of the Chinese character; the order in which the codes are taken follows the relative position of the virtual radical and the virtual body, but at most N codes are taken", forming the tolerant code of the Chinese character.
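The stroke-based parts of this scheme can be sketched as follows. This is a simplified illustration, not the claimed method in full: strokes are given by pinyin names, the radical (the first stroke) is assumed to be taken first in the standard code, only case (2) of the virtual body rules is shown, and all function names, the `radical_first` flag and the default `n=6` are hypothetical.

```python
# Stroke-to-digit rule: horizontal 1, vertical 2, left-falling 3, dot 4, turning 5.
STROKE_DIGIT = {"heng": "1", "shu": "2", "pie": "3", "dian": "4", "zhe": "5"}


def standard_code(strokes, n=6):
    """Standard code: radical code (first stroke) + body code (remaining strokes).

    strokes: stroke names in writing order (hypothetical input format).
    Assumes the radical is taken before the body.
    """
    digits = [STROKE_DIGIT[s] for s in strokes]
    radical_code = digits[0]
    body_code = "".join(digits[1:])  # empty for a one-stroke character
    return (radical_code + body_code)[:n]


def stroke_body_code(body_strokes, virtual_radical_code, n=6):
    """Rule ii(2): take at most N - len(virtual radical code) stroke digits."""
    budget = max(n - len(virtual_radical_code), 0)
    return "".join(STROKE_DIGIT[s] for s in body_strokes)[:budget]


def tolerant_code(virtual_radical_code, virtual_body_code, radical_first=True, n=6):
    """Rule iii: join the two codes in the order of their relative position, at most N codes."""
    if radical_first:
        parts = (virtual_radical_code, virtual_body_code)
    else:
        parts = (virtual_body_code, virtual_radical_code)
    return "".join(parts)[:n]
```

For a two-stroke character written horizontal then vertical, `standard_code(["heng", "shu"])` gives "12". The virtual radical codes passed to the other two functions would come from the phonetic code of the part that forms a Chinese character, which this sketch takes as given.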
9. The word-based character input method according to claim 4, characterized in that: the code of any word may omit, apart from its first code, its last code, or last two codes, or last three codes, ..., or last (N-1) codes, to form a short code of the word.
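Read this way, the short codes of a word are the proper prefixes of its full code, each keeping at least the first code. A minimal sketch, with the hypothetical function name `brevity_codes`:

```python
def brevity_codes(code):
    """Return every short form of a code, longest first.

    Each form drops the last 1, 2, ... codes but always keeps the first code.
    """
    return [code[:k] for k in range(len(code) - 1, 0, -1)]
```

For a placeholder full code "2345", the short codes are "234", "23" and "2"; a one-code word has none.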
10. The word-based character input method according to claim 4, characterized in that: the code of a punctuation mark is the numeric key corresponding to the first code of the phonetic code of its Chinese pronunciation.
11. The word-based character input method according to claim 4, characterized in that: the code of a punctuation mark is the numeric key corresponding to the first letter of the English word for that punctuation mark.
12. The word-based character input method according to claim 4, characterized in that: a three-item prompt is used to select among duplicate codes, namely, for any code entered, the numeric code itself is treated as a candidate, and this candidate is normally placed second on the first screen of the prompt line; in any case, however, when the entered code has no corresponding word other than this numeric candidate, the first position displays the digits and the second and third positions are displayed empty;
Apart from the above case: (a) when the entered code has a length of 1, the first position is a Chinese character, or an English letter, or another character, the second is the digit, and the third is a punctuation mark; other entries whose full code has a length of 1 are placed after these, followed by all words whose short code has a length of 1; (b) when the entered code is longer than 1 and has corresponding full codes, the first position shows the most frequently used word among the full codes, the second shows the digits, and the third shows the most frequently used word among the short codes, or, if there is no corresponding short code, continues with the other full codes; the remaining full-code duplicates are shown next, followed by all short-code duplicates; when the code has no corresponding full code, the first position shows the most frequently used short code, the second shows the digits, and the other short codes are shown afterwards;
Whichever screen of the prompt line a duplicate is on, and whether it is in the first, second or third position, once the corresponding duplicate has been found by paging with the up and down keys (the first screen requires no paging), it can be entered directly with ok, *, # respectively.
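The ordering for codes longer than 1, case (b), together with the no-match case, can be sketched as a function that builds the candidate list and splits it into three-item screens. This is an illustrative reading only: the names `order_candidates`, `pages` and `SELECT_KEYS` are hypothetical, the match lists are assumed to be sorted most frequent first, and case (a) for one-key codes is not modeled.

```python
# Position-to-key assignment for the three items on a screen: ok, *, #.
SELECT_KEYS = ("ok", "*", "#")


def order_candidates(digits, full_matches, brevity_matches):
    """Order candidates for an entered code longer than 1.

    digits: the entered numeric code, itself a candidate.
    full_matches / brevity_matches: words sorted most frequent first (assumed input).
    """
    if not full_matches and not brevity_matches:
        # No word matches: digits first, second and third positions empty.
        return [digits, "", ""]
    if full_matches:
        ordered = [full_matches[0], digits]
        if brevity_matches:
            # Third position: the most frequent short-code word.
            ordered.append(brevity_matches[0])
        # Remaining full-code duplicates, then remaining short-code duplicates.
        return ordered + full_matches[1:] + brevity_matches[1:]
    # No full code: most frequent short code, digits, then the other short codes.
    return [brevity_matches[0], digits] + brevity_matches[1:]


def pages(candidates, size=3):
    """Split the ordered candidates into screens of three items."""
    return [candidates[i:i + size] for i in range(0, len(candidates), size)]
```

On any screen, the item at index 0, 1 or 2 would then be committed with the key at the same index of `SELECT_KEYS`.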
13. The word-based character input method according to claim 4, characterized in that: N = 6.
14. The word-based character input method according to claim 4, characterized in that: 7 ≤ N ≤ 10.
CN2004800234193A 2003-08-20 2004-08-19 Character inputting method of using word as unit Expired - Fee Related CN1836199B (en)

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
CNB031537553A CN100362455C (en) 2003-08-20 2003-08-20 Digitalized Chinese character computer input method using words as unit
CN03153755.3 2003-08-20
CN200310113427.4 2003-11-10
CNA2003101134274A CN1542594A (en) 2003-11-10 2003-11-10 Chinese characters general computer input method using word as unit
CN200410058195.1 2004-08-18
CN 200410058195 CN1737735A (en) 2004-08-18 2004-08-18 Digital keyboard Chinese character input method using word as unit
PCT/CN2004/000967 WO2005043369A1 (en) 2003-08-20 2004-08-19 Character input method based on “phrase” as unit

Publications (2)

Publication Number Publication Date
CN1836199A CN1836199A (en) 2006-09-20
CN1836199B CN1836199B (en) 2010-05-05

Family

ID=34556682

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2004800234193A Expired - Fee Related CN1836199B (en) 2003-08-20 2004-08-19 Character inputting method of using word as unit

Country Status (2)

Country Link
CN (1) CN1836199B (en)
WO (1) WO2005043369A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7395203B2 (en) 2003-07-30 2008-07-01 Tegic Communications, Inc. System and method for disambiguating phonetic input
CN113253853B (en) * 2021-03-29 2023-01-10 周长河 Chinese character input method for computer and mobile phone

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1201177A (en) * 1998-05-21 1998-12-09 王照璐 Pronunciation-font code for Chinese-characters input on computer and input keyboard thereof
CN1050432C (en) * 1995-12-25 2000-03-15 中国中文信息学会 Full-spelling double-spelling normalized code Chinese character enter mode

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1309343A (en) * 2000-02-16 2001-08-22 赵钢 Chinese-character shape-first phonetic letter input method with numeral keypad
CN1306236A (en) * 2000-03-25 2001-08-01 中国科学院长春应用化学研究所 Chinese-character radical-first phonetic letter input method
CN1147779C (en) * 2000-12-15 2004-04-28 戴尔晗 Chinese-character phonetic letters input method using digital codes and its keyboard

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1050432C (en) * 1995-12-25 2000-03-15 中国中文信息学会 Full-spelling double-spelling normalized code Chinese character enter mode
CN1201177A (en) * 1998-05-21 1998-12-09 王照璐 Pronunciation-font code for Chinese-characters input on computer and input keyboard thereof

Also Published As

Publication number Publication date
CN1836199A (en) 2006-09-20
WO2005043369A1 (en) 2005-05-12

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
DD01 Delivery of document by public notice

Addressee: Liu Xiangdong

Document name: Notification of Termination of Patent Right

C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100505

Termination date: 20120819