Daziran Chinese Character input keyboard and input method thereof
The present invention relates to a computer keyboard input device for Chinese characters and an input method thereof, and belongs to the technical field of computer data and information processing.
In earlier double-pinyin (shuangpin) keyboard input schemes for Chinese characters, the double-pinyin coding was mostly developed on the basis of the Chinese phonetic alphabet (Hanyu Pinyin) and was suitable only for users who already knew that alphabet. The present invention adopts a sound-cutting method and redefines the double-pinyin scheme on the basis of the pronunciation of the English letters, thereby freeing it from the constraints that the rules of the Chinese phonetic alphabet placed on earlier double-pinyin schemes. The earlier Natural Code Chinese character input method (hereinafter "Natural Code") offered a shape code as an optional auxiliary code to the double-pinyin code, using the initial of each radical's pronunciation as its code. This largely solved the problem of duplicate codes for single characters under phonetic coding, and keying in the shape code alone also made it possible to look up and input unfamiliar characters. However, the key layout of the shape code gave too much weight to keeping radicals concentrated: to reduce duplicate codes, a number of components were placed on keys unrelated to their pronunciation, and four symbol keys were additionally used to code strokes and some radicals, which made the scheme harder to learn. In addition, the earlier Natural Code scheme did not offer alternative codes for associative-compound characters or for characters whose radical position is not obvious, so code selection was confusing and many users could not master or correctly use the Natural Code shape code.
The State Language Work Committee and the Press and Publication Administration jointly issued the "Hanzi Component Standard of the GB13000.1 Character Set for Information Processing" (hereinafter "the Standard"), which sets out explicit principles for decomposing Chinese characters and a set of basic components, and thereby provides a basis for the computer coding of Chinese characters. The "stroke standard" in the Standard reduces the need for the compatibility codes previously used for irregular strokes, and the "splitting rules" and "basic components" resolve the problem of users with different habits decomposing the same character in different ways.
The object of the invention is to provide, in accordance with the Standard, a new multi-element coding scheme covering double-pinyin, tone, radical and component. The scheme combines several kinds of information about each Chinese character, encodes each of the 20,982 Chinese characters in GBK ("Hanzi Expanded Internal Code Specification") one by one, and allows this information to be entered in a variety of combinations.
The object of the invention is achieved as follows:
A Chinese character input keyboard and input method thereof, in which Chinese characters are encoded according to their pronunciation on a keyboard with 26 English letter keys, characterized in that the method consists of a sound code and a shape code that can be entered in several combinations, the keyboard layout used for the sound code, labeled with Chinese characters, being as shown in Fig. 1:
![Figure 9810173100057](https://patentimages.storage.googleapis.com/4b/c5/45/36cefe4ec888f9/9810173100057.png)
In Fig. 1, the upper left of each cell shows an English letter, the upper right shows the example character for the double-pinyin initial, and the middle and bottom show the example characters for the double-pinyin finals. Fig. 1 was drawn up by exploiting the similarity between the pronunciation of Chinese characters and the pronunciation of the English letters, so that each double-pinyin key agrees with the pronunciation of its letter as far as possible. For example, "e" represents "complying with" whether used as an initial or as a final; "o" likewise represents "Europe" in both roles; "c" represents "west"; "i" represents "love"; "v" represents "micro"; and "y" represents "outward".
Fig. 1 also exploits the visual resemblance between phonetic notation symbols and English letters. For example, "room" is written in phonetic notation with a symbol that resembles "x", so "x" represents "room" whether used as an initial or as a final.
The pronunciation of every Chinese character is divided into two parts, represented by a new initial and a new final, in a manner somewhat similar to the "sound cutting" used in ancient dictionaries. For example, the pronunciation of "essence" is divided into the initial "chicken" and the final "English", and its double-pinyin code is "gy".
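The split described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the dictionary and function names are hypothetical, the characters are identified by the English glosses used in this text, and the tables hold only the key assignments that the worked examples in this description state ("chicken" on "g", "English" on "y", and, from the later example for "good", the initial "with" on "h" and the final "recessed" on "c"). The complete assignment is the one given in Fig. 1.

```python
# Hypothetical partial tables: example character (English gloss) -> letter key.
# Only assignments stated in the worked examples are listed; Fig. 1 holds the rest.
INITIAL_KEY = {"chicken": "g", "with": "h"}
FINAL_KEY = {"English": "y", "recessed": "c"}


def double_pinyin(initial_example: str, final_example: str) -> str:
    """Return the two-letter code for a syllable split into initial and final."""
    return INITIAL_KEY[initial_example] + FINAL_KEY[final_example]


# "essence" is cut into the initial "chicken" and the final "English".
print(double_pinyin("chicken", "English"))  # gy
# "good" is cut into the initial "with" and the final "recessed".
print(double_pinyin("with", "recessed"))    # hc
```

Keeping separate tables for initials and finals reflects the fact that one key can carry both an initial and one or more finals.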
For Chinese characters with a zero initial, the letters "a" and "r", whose pronunciations are close to those syllables, are used as the initial. For example, the double-pinyin of "love" is "ai" and the double-pinyin of "recessed" is "ac"; the remaining example codes, "aa", "af", "ak", "rr", "rn" and "rz", are assigned to other zero-initial characters such as "peace", "hold high", "volume", "grace" and "Ei".
Users accustomed to phonetic notation can instead use the keyboard labeled with phonetic notation symbols, arranged as in Fig. 2.
In this input method, the keyboard layout of the basic components used for the shape code is as shown in Fig. 4:
The keyboard layout of the extended components used for the shape code is as shown in Fig. 5:
The left side of Fig. 4 and Fig. 5 gives the initial of the shape code and the right side gives its final. The components in Fig. 4 are all commonly used basic components, which are easy to understand and need no memorization; the components in Fig. 5 are extended components needed for rarely used characters and for traditional (complex-form) characters. The shape code of a component is its double-pinyin code. Only four groups are special cases that are not coded by double-pinyin: the components "day", "moon" and "order", which depict round objects, are placed on the letter "O"; components including "hand" and "Rolling", whose shapes resemble the English letter, are placed on the letter "F"; components including "Shu", "Ya" and "∠", whose first stroke is a horizontal or vertical stroke, are placed on the letter "A"; and components including "Bing" and "Rui", whose first stroke is a dot, are placed on the letter "D".
In the present invention, the code of a Chinese character combines the double-pinyin of the character, the double-pinyin of its first part and the double-pinyin of its tail piece. When the first part or the tail piece is neither a character-forming component nor a basic component, the code of the first stroke of the first part, or of the last stroke of the tail piece, is taken instead. A code formed on these principles has 7 positions in the following order: positions 1 and 2 are the double-pinyin of the character, position 3 is the tone, positions 4 and 5 are the double-pinyin of the first part, and positions 6 and 7 are the double-pinyin of the tail piece. In operation, any one of the following methods may be used for input:
1. Input using positions 1, 2, 4 and 5;
The double-pinyin of the character's pronunciation is followed by the double-pinyin of the first part. For example, to input "good", the double-pinyin "hc" of "good" and the double-pinyin "nv" of the first part "woman" are combined to form "hcnv", where "h" is the initial "with" of "good", "c" is the final "recessed" of "good", "n" is the initial "slow" of "woman", and "v" is the final "with" of "woman". Similarly, to input "dress", the double-pinyin "yg" of "dress" and the double-pinyin "ee" of the first part "clothing" are combined to form "ygee".
2. Input using positions 1, 2, 4 and 6;
The double-pinyin of the character's pronunciation is followed by the double-pinyin initial of the first part and the double-pinyin initial of the tail piece. For example, to input "good", the double-pinyin "hc" of "good", the initial "n" of the first part "woman" and the initial "z" of "son" are combined to form "hcnz", where "h" is the initial "with" of "good", "c" is the final "recessed" of "good", "n" is the initial "slow" of "woman", and "z" is the initial "certainly" of "son". Similarly, to input "dress", the double-pinyin "yg" of "dress", the initial "e" of the first part "clothing" and the initial "y" of the tail piece "strengthened" are combined to form "ygey".
3. Input using positions 1, 4 and 6;
The double-pinyin initial of the character is followed by the double-pinyin initial of the first part and the double-pinyin initial of the tail piece, giving a three-key input. For example, to input "good", the initial "h" of "good", the initial "n" of the first part "woman" and the initial "z" of "son" are combined to form "hnz", where "h" is the initial "with" of "good", "n" is the initial "slow" of "woman", and "z" is the initial "certainly" of "son". Similarly, to input "dress", the initial "y" of "dress", the initial "e" of the first part "clothing" and the initial "y" of the tail piece "strengthened" are combined to form "yey".
4. Input using the last four positions (4, 5, 6 and 7);
The double-pinyin of the character's first part is followed by the double-pinyin of its tail piece; this method is used to input characters whose pronunciation is unknown. For example, to input "good", the double-pinyin "nv" of its first part "woman" and the double-pinyin "zj" of its tail piece "son" are combined to form "nvzj", where "n" is the initial "slow" of "woman", "v" is the final "v" of "woman", "z" is the initial "certainly" of "son", and "j" is the final "day" of "son". Similarly, to input "dress", the double-pinyin "ee" of the first part "clothing" and the double-pinyin "yg" of the tail piece "strengthened" are combined to form "eeyg".
5. As in method 1, but with the double-pinyin of positions 1, 2 and of positions 4, 5 each converted to full spelling;
For example, to input "good", the double-pinyin of positions 1, 2 and of positions 4, 5 are each converted to full spelling, giving "haonv". Similarly, to input "dress", conversion to full spelling gives "zhuangyi".
6. As in method 2, but with the double-pinyin of positions 1, 2 converted to full spelling;
For example, to input "good", the double-pinyin of positions 1, 2 is converted to full spelling, giving "haonz". Similarly, to input "dress", conversion to full spelling gives "zhuangey".
7. As in method 4, but with the double-pinyin of positions 4, 5 and of positions 6, 7 each converted to full spelling.
For example, to input "good", the double-pinyin of positions 4, 5 and of positions 6, 7 are each converted to full spelling, giving "nvhao". Similarly, to input "dress", conversion to full spelling gives "eezhuang".
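The position-based methods above can be sketched in Python as follows. This is a minimal illustration, not the implementation of the invention: the class `CharCode`, the function `keystrokes` and both tables are hypothetical names; the tone position is filled with the placeholder "?" because this description does not state how the tone is keyed; and the full-spelling table holds only the four pairs that can be read off the worked examples. The sketch covers methods 1 to 6 only.

```python
from dataclasses import dataclass

# 1-based positions of the seven-position code selected by methods 1 to 4.
POSITIONS = {1: (1, 2, 4, 5), 2: (1, 2, 4, 6), 3: (1, 4, 6), 4: (4, 5, 6, 7)}

# Hypothetical partial table: double-pinyin pair -> full spelling,
# limited to the pairs that appear in the worked examples.
FULL_SPELLING = {"hc": "hao", "nv": "nv", "yg": "zhuang", "ee": "yi"}


@dataclass(frozen=True)
class CharCode:
    sound: str  # positions 1-2: double-pinyin of the character
    tone: str   # position 3: tone ("?" is a placeholder, not a real key)
    head: str   # positions 4-5: double-pinyin of the first part
    tail: str   # positions 6-7: double-pinyin of the tail piece

    def full(self) -> str:
        """Return the complete seven-position code."""
        return self.sound + self.tone + self.head + self.tail


def keystrokes(code: CharCode, method: int) -> str:
    """Return the keys typed for one character under input methods 1 to 6."""
    if method in POSITIONS:
        full = code.full()
        return "".join(full[p - 1] for p in POSITIONS[method])
    if method == 5:  # method 1 with both pairs converted to full spelling
        return FULL_SPELLING[code.sound] + FULL_SPELLING[code.head]
    if method == 6:  # method 2 with positions 1-2 converted to full spelling
        return FULL_SPELLING[code.sound] + code.head[0] + code.tail[0]
    raise ValueError(f"method {method} is not covered by this sketch")


GOOD = CharCode(sound="hc", tone="?", head="nv", tail="zj")
DRESS = CharCode(sound="yg", tone="?", head="ee", tail="yg")

for m in range(1, 7):
    print(m, keystrokes(GOOD, m), keystrokes(DRESS, m))
```

Storing the full seven-position code once and deriving every keystroke sequence from it keeps the methods consistent with each other, which is one plausible reading of how a single coding table can serve several input modes.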
When Fig. 4 and Fig. 5 (or Fig. 6 and Fig. 7) were drawn up, the radical of a Chinese character was taken as its first part. The first part is chosen in the same way as in the radical indexes of the "Xinhua Dictionary" and the "Dictionary", so it is readily accepted by users, is convenient for typing while listening or thinking, and is also fairly simple. The tail piece is the largest whole character contained in the remainder of the Chinese character. When an associative-compound character is encountered, or when the splitting of a character is ambiguous, the "top to bottom, left to right" principle need not be followed; the code may be taken in several ways, so that one character may have several codes. If the tail piece is not a character-forming component, the last largest character-forming component, basic component or stroke is chosen.
Users accustomed to Natural Code double-pinyin can replace Fig. 1 with Fig. 3; mapping Fig. 4 and Fig. 5 according to the correspondence between Fig. 3 and Fig. 1 then yields Fig. 6 and Fig. 7, which suit Natural Code users. In Fig. 3, the Natural Code double-pinyin of "hold high" is adjusted from "ag" to "ah"; everything else is consistent with the published Natural Code double-pinyin.
A code formed on the above principles has 7 positions; in actual keyboard input, usually only 4 or fewer of them are chosen and entered in different combinations.
In the present invention, "full spelling" means standard Hanyu Pinyin. "Double-pinyin" means that the initial and the final of a syllable are each represented by one letter. "Basic components" are the minimal components required by the Standard, generally single-element characters and components whose strokes cross or connect. A "character-forming component" is a component that appears as a Chinese character in CJK but is not a basic component. "Splitting rules" are the rules used when decomposing a Chinese character by shape; the Standard requires that a basic component not be split into other components, and where necessary it may be split only into strokes. The "first part" is the front component after a Chinese character is divided into two components. The "tail piece" is the rear component after a Chinese character is divided into two components.
The CJK referred to in the present invention is the national standard GB 13000.1 "CJK Unified Hanzi Coded Character Set", which is fully equivalent to the international standard ISO 10646.1 "Universal Multiple-Octet Coded Character Set (UCS)".
The full name of the GBK referred to in the invention is the "Hanzi Expanded Internal Code Specification". It was formulated to solve the problems of insufficient character coverage and of simplified and traditional characters coexisting in the same plane. It comprises: all Chinese characters and non-Chinese symbols of GB2312-80; the other CJK Chinese characters in GB 13000.1; 52 Chinese characters from the "Simplified Character List" not yet included in GB 13000.1; all 7,000 Chinese characters of the "List of Common Characters in Modern Chinese"; all simplified characters in the "Simplified Character List" and their corresponding traditional characters; radicals and important components from the "Kangxi Dictionary" and "Ci hai" not yet included in GB 13000.1; 13 Chinese character structure symbols; 139 graphic symbols in "BIG5" that are not included in GB2312-80 but exist in ISO 10646.1; the 30 formally included tone-marked phonetic letters together with "a" and "g"; the Chinese character "〇" (zero); the vertical-typesetting punctuation marks encoded in GB 12345-90; 21 Chinese characters selected from the CJK compatibility area of ISO 10646.1/GB 13000.1; and 31 IBM OS/2 special symbols.
By adopting the above scheme, the present invention has the following important features and clear advantages over other existing schemes:
1) Input may use double-pinyin, full spelling, or a mixture of the two, and the double-pinyin or full spelling of a component may be appended after the double-pinyin or full spelling of the character, which solves the duplicate-code problem that arises when single characters are entered by double-pinyin or full spelling alone.
2) The invention splits a Chinese character into two parts, which matches exactly the way phono-semantic and associative-compound characters are formed and conforms to the Standard; it is therefore easy to learn and use, and characters whose pronunciation is unknown can be input. Through the auxiliary code-query function, the user can also find a character's full spelling, tone, splitting method and stroke count and, if a sound card is installed, hear its correct pronunciation.
3) For the shape code, the invention takes the radical as the first part, which is basically consistent with the radical index of the "Xinhua Dictionary"; for associative-compound characters or characters whose radical is hidden, the code may be taken in several ways. This conforms both to the way Chinese characters are formed and to the habits of ordinary users.
4) The invention allows several input modes to coexist and to be switched between during operation. For example, a character whose pronunciation is known can be input by ordinary double-pinyin plus shape code, and a character whose pronunciation is unknown can be input by shape code alone.
5) The shape code of the invention can also be combined with other double-pinyin schemes to produce further input modes that share the characteristics of items 1) to 4) above.
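One way to picture the coexistence of input modes described in item 4) is a single lookup index that holds every keystroke variant of each character. The sketch below is an assumption about how such an index could be built, not a description of the actual software: `ENTRIES`, `variants`, `build_index` and `lookup` are hypothetical names, characters are identified by their English glosses, and only the two worked examples from this description are included.

```python
from collections import defaultdict

# Hypothetical sample data: gloss -> (character double-pinyin,
# first-part double-pinyin, tail-piece double-pinyin), from the worked examples.
ENTRIES = {
    "good": ("hc", "nv", "zj"),
    "dress": ("yg", "ee", "yg"),
}


def variants(sound: str, head: str, tail: str) -> set:
    """Return the keystroke sequences of input methods 1 to 4 for one character."""
    return {
        sound + head,                   # method 1: positions 1, 2, 4, 5
        sound + head[0] + tail[0],      # method 2: positions 1, 2, 4, 6
        sound[0] + head[0] + tail[0],   # method 3: positions 1, 4, 6
        head + tail,                    # method 4: positions 4, 5, 6, 7
    }


def build_index(entries: dict) -> dict:
    """Map every keystroke variant to the set of characters it can produce."""
    index = defaultdict(set)
    for char, parts in entries.items():
        for keys in variants(*parts):
            index[keys].add(char)
    return index


INDEX = build_index(ENTRIES)


def lookup(keys: str) -> list:
    """Return the candidate characters for a typed key sequence, sorted."""
    return sorted(INDEX.get(keys, ()))


print(lookup("hcnv"))  # sound plus shape: ['good']
print(lookup("eeyg"))  # shape only: ['dress']
```

Because all variants live in one index, the same typed sequence is resolved the same way whichever mode the user has in mind; a full system would also need to rank candidates when several characters share a sequence.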
In connection with the design method embodying the invention, the accompanying drawings and tables give the codes constructed on the basis of the Daziran keyboard double-pinyin and the Natural Code double-pinyin.
Description of the drawings:
Fig. 1 is the Daziran keyboard double-pinyin key layout chart labeled with Chinese characters;
Fig. 2 is the Daziran keyboard double-pinyin key layout chart labeled with phonetic notation;
Fig. 3 is the Daziran keyboard Natural Code double-pinyin key layout chart;
Fig. 4 is the definition chart of the basic components of the Daziran keyboard shape code;
Fig. 5 is the definition chart of the extended components of the Daziran keyboard shape code;
Fig. 6 is the definition table of the basic components of the Daziran keyboard (Natural Code double-pinyin) shape code;
Fig. 7 is the definition table of the extended components of the Daziran keyboard (Natural Code double-pinyin) shape code.
Description of the tables:
Table 1 is an excerpt of the Daziran keyboard Chinese character coding table;
Table 2 is the Daziran keyboard coding table of extended radicals;
Table 3 is an excerpt of the Daziran keyboard (Natural Code double-pinyin) Chinese character coding table;
Table 4 is the Daziran keyboard (Natural Code double-pinyin) coding table of extended radicals.
In the Daziran keyboard shape code basic component definition table (Table 1), all the components shown are common components that are easy to understand and need no special memorization. The left column gives the English letter and the right column gives the Chinese character components defined on that letter key; the English letter following each component is the final of that component's pronunciation, and boldface marks special components whose key does not match their double-pinyin.
In the Daziran keyboard shape code extended component definition table (Table 2), the components shown are rarely used and their pronunciations are easily mistaken, so they are listed in a separate table for ease of lookup. The left column gives the English letter and the right column gives the Chinese character components defined on that letter key; the English letter following each component is the final of that component's pronunciation. Boldface has the same meaning as above.
The excerpt of the Daziran keyboard Chinese character coding table (Table 1) contains part of the full set of 20,982 codes; the remaining characters and their codes are omitted from the table for reasons of length. Table 2 is the coding table of extended radicals.
For users accustomed to Natural Code double-pinyin, Table 1 and Table 2 can be mapped according to the Natural Code double-pinyin to obtain Table 3 and Table 4.
Table 1. Excerpt of the Daziran keyboard Chinese character coding table
Table 2. Daziran keyboard coding table of extended radicals
Table 3. Excerpt of the Daziran keyboard (Natural Code double-pinyin) Chinese character coding table
Table 4. Daziran keyboard (Natural Code double-pinyin) coding table of extended radicals