CN101650601A - 52-key mapping ultra-large character set Chinese character input method - Google Patents
52-key mapping ultra-large character set Chinese character input method Download PDFInfo
- Publication number
- CN101650601A CN101650601A CN200810210460A CN200810210460A CN101650601A CN 101650601 A CN101650601 A CN 101650601A CN 200810210460 A CN200810210460 A CN 200810210460A CN 200810210460 A CN200810210460 A CN 200810210460A CN 101650601 A CN101650601 A CN 101650601A
- Authority
- CN
- China
- Prior art keywords
- chinese
- character
- coding
- syllable
- input
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
The invention discloses a 52-key mapping ultra-large character set Chinese character input method, belonging to computer keyboard Chinese character coding input technique. A monosyllable character anddisyllable coding method is adopted to differentiate homophones; an integral coded unit is ensured to contain four essential factors, namely, sound, rhyme, tone and meaning by the way that mutual definition of two syllables in the disyllable reveals the meaning of the code, with the property of brief pinyin Chinese characters; and in ancient Chinese and complex context, monosyllable characters are set to be automatically displayed on a screen, while in modern Chinese context, the two syllable characters are set to be automatically displayed on a screen, thereby ensuring fast and accurate input of Chinese characters in the ultra-large character set.
Description
The invention belongs to the computer keyboard Chinese character coding input technology, especially utilize the Chinese phonetic alphabet to carry out the method for super large character set encode Chinese characters for computer input.
So-called super large character set is meant that the character amount is the character set of GBK (21003 Chinese characters) more than 3 times.At present, known super large character set Chinese character entering technique only " new allusion quotation code inputting method " is a kind of, is used to solve the input problem of upright The Orchid Pavilion " word sea " 65000 words." new allusion quotation code inputting method " adopts interactive graphic interface according to the search condition of features such as the radicals by which characters are arranged in traditional Chinese dictionaries of Chinese character, stroke, the order of strokes observed in calligraphy, stroke number as the input Chinese character, by the characteristics combination of continuous select target Chinese character, finally realizes the Chinese character input by click.Angle from the encode Chinese characters for computer input, " new allusion quotation code inputting method " is a kind of not fixing code length, determines the Chinese character input method of coded character, and input efficiency is very low, but do not knowing under the situation of Chinese-character pronunciation certain value is arranged as a kind of radicals by which characters are arranged in traditional Chinese dictionaries stroke retrieval scheme.
The objective of the invention is to make each Chinese character to have a Latin alphabet coding that comprises sound, rhyme, tone and meaning four elements at least, realize the efficient input of super large character set Chinese character.
The object of the present invention is achieved like this:
(1) algorithm is selected: { unisonance character } ∩ { qualification character }={ upper screen character symbol }
Also promptly, determine upper screen character symbol collection according to unisonance character set and the common factor that limits character set.If make the upper screen character symbol unique, must make specific unisonance character set and specific qualification character set have only a total element.
(2) input method: upper screen character symbol pronunciation+qualification character pronunciation=upper screen character symbol
Modern Chinese has the trend of double-tone jointization, comprises the qualification character so the Modern Chinese linguistic context can be set the upper screen character symbol, can make the efficient of input higher like this.
(3) pronunciation spelling: roman initial consonant+roman simple or compound vowel of a Chinese syllable=high and level tone pronunciation
Roman initial consonant+italic simple or compound vowel of a Chinese syllable=rising tone pronunciation
Italic initial consonant+italic simple or compound vowel of a Chinese syllable=last sound pronunciation
Italic initial consonant+roman simple or compound vowel of a Chinese syllable=falling tone pronunciation
Roman initial consonant+roman simple or compound vowel of a Chinese syllable '=pronunciation softly
The initial consonant and the simple or compound vowel of a Chinese syllable of pronunciation spelling of the present invention, all the key-position letter in computer keyboard distributes represents that the key position of initial consonant and simple or compound vowel of a Chinese syllable distributes and sees also application for a patent for invention instructions " second-generation regional code input method of Chinese character " patent No.: ZL94115551.X and publication application specification " Chinese QWERTY keyboard and replacement input method of " application number: 2006100933678 with sound in the Scheme for the Chinese Phonetic Alphabet and simple or compound vowel of a Chinese syllable.
Owing to adopt such scheme, the theoretical coding of the space encoder unit that " upper screen character symbol pronunciation+qualification character pronunciation " opened up has 524, promptly 7311616.According to the resulting data of actual retrieval Chinese-character pronunciation, the coding unit has 14002, promptly 1960000 approximately.This huge space encoder can be used for the coding of all characters and all disyllabic words in the super large character set.Coding discloses meaning each other by the association of former and later two characters.Coding contains the sound, rhyme, tone and the meaning four elements of character, has a kind of character of " Chinese-character phonetic letter " concisely.
The drawing explanation: Fig. 1 is 422 syllable sound coordination contours, the corresponding relation of initial consonant and key position is represented in stringer, walk crosswise the corresponding relation of expression simple or compound vowel of a Chinese syllable and key position, the stringer initial consonant is with to walk crosswise the simple or compound vowel of a Chinese syllable point of crossing be a syllable, and the upper left corner of figure is high and level tone, rising tone, go up sound, falling tone and the example of five kinds of tone method for expressing softly.Fig. 2 is the distribution in keyboard of roman, tilted letter and the corresponding relation of consonant, vowel and key position, latin alphabet key position the Heavenly Stems, twelve Earthly Branches symbol other sign field bit code coding input symbol for indicating on Chinese character and the key position in " One Thousand Character Primer " to indicate on the external key position.
Elaborate with regard to input method of the present invention below in conjunction with drawings and Examples.
Chinese Pin Yin pseudonym of the present invention, simple or compound vowel of a Chinese syllable are distributed on 26 romans letter and 26 the tilted letter key positions, and initial consonant, simple or compound vowel of a Chinese syllable that letters case of the same name distributes are identical, referring to table 1.
The roman letters case | Tilted letter key position | Initial consonant | Simple or compound vowel of a Chinese syllable | The roman letters case | Tilted letter key position | Initial consonant | Simple or compound vowel of a Chinese syllable |
??A | ??A | ??zh | ??a,üan | ??N | ??N | ??n | ??n,iou(iu) |
??B | ??B | ??b | ??ia,ua | ??O | ??O | The capable zero initial of the capable o of a | ??uo,o,io |
??C | ??C | ??c | ??uan | ??P | ??P | ??p | ??ou,er |
??D | ??D | ??d | ??ao | ??Q | ??Q | ??q,ng | ??ng,ing |
??E | ??E | The capable zero initial of e | ??e,ün | ??R | ??R | ??r | ??en |
??F | ??F | ??f | ??an | ??S | ??S | ??s | ??ai,ê |
??G | ??G | ??g | ??ang | ??T | ??T | ??t | ??eng |
??H | ??H | ??h | ??iang,uang | ??U | ??U | ??sh | ??u |
??I | ??I | ??ch | ??i,ueng | ??V | ??V | M, n, the zero initial of ng | ??üe,??uei(ui) |
??J | ??J | ??j | ??ian | ??W | ??W | The capable zero initial of u | ??ei |
??K | ??K | ??k | ??iao | ??X | ??X | ??x | ??uai,ü |
??L | ??L | ??l | ??in | ??Y | ??Y | The capable zero initial of the capable ü of i | ??ong,iong |
??M | ??M | ??m | ??m,ie | ??Z | ??Z | ??z | ??uen(un) |
Table 1
Initial consonant zh among the table l, ch, sh use single-letter a respectively, i, u represents; The capable zero initial of e refers to all simple or compound vowel of a Chinese syllable e with the e beginning, ê, and ei, en, eng during the er self-syllable, needs to use the e cover; The capable zero initial of the capable o of a refers to all simple or compound vowel of a Chinese syllable a with a or o beginning, ai, and an, ang, ao, o, ou during the ong self-syllable, needs to use the o cover; M, n, the zero initial of ng refers to nasal sound m, n during the ng self-syllable, needs the v cover; The capable zero initial of u refers to all simple or compound vowel of a Chinese syllable u with the u beginning, ua, and uai, uan, uang, uei (ui), uen (un), ueng during the uo self-syllable, needs the w cover; The capable zero initial of the capable ü of i refers to all simple or compound vowel of a Chinese syllable i with i or ü beginning, ia, and ian, iang, iao, ie, in, ing, io, iou (iu), iong, ü, ü an, ü e during ü n self-syllable, needs the y cover; Initial consonant ng is exclusively used in spelling Cantonese " (I) " ng ú (being encoded to QU); When the m in the simple or compound vowel of a Chinese syllable hurdle, n, ng are meant these three sound self-syllables, they are used as simple or compound vowel of a Chinese syllable treat.
The present invention represents that with the positive italic permutation and combination of key-position letter tone is referring to table 2.
Table 2
The present invention is with the pronunciation of a Chinese character of two Latin alphabets spelling, and this pronunciation comprises three key elements of sound, rhyme, tone.Can not " know its meaning " if " listen its sound ", then belong to unisonance people having the same aspiration and interest speech, coding as " interim " and " final " all is QIAY, need further to limit, coding is rewritten into QIAYJJ " interim () ", QIAYJM " final (knot) ", the syllable separator generates automatically, " " and " knot " two words when input not on screen.
A complete coding unit has sound, rhyme, tone and meaning four elements.The present invention carries out double-tone joint coding to the single syllable character, and its purpose makes each character that a complete coding unit is all arranged exactly, has that screen does not need to select on the character of complete coding unit.After the single syllable character double-tone jointization, its coding form is identical with disyllabic word.Here need to solve three technical matterss:
(1) difference of coding spelling: the present invention adopts syllable-dividing mark to address this problem.For example: the coding of " rice, rice " can be " MIFF, MIFF ".
(2) selection of upper screen character symbol: Ancient Chinese and complicated linguistic context monocase are gone up screen automatically, and Modern Chinese double word symbol is gone up screen automatically.
(3) processing of tone-off Chinese character: a pronunciation of using in the pronunciation system is encoded.
The present invention with the input of 52 key positions, takes into account tone in fact in computer keyboard as a kind of coding input element, will cause the tone input incorrect if roman letter and tilted letter selection are wrong.Consider the real standard of super large character set input method of Chinese character user mandarin, the setting of fault tolerance is necessary.The so-called fault tolerance of the present invention is meant roman and the italic selection of not considering letter, points to the union that roman, tilted letter coding sensing subclass are pressed in these letter strictnesses by these letter arrangements.For example: " UGHS " this coding can point to: coded objects such as " Shanghai, injury, commercial circles " if do not consider this coding factor of tone.Fault tolerance often causes a plurality of screens of may going up to be selected, and needs to select to go up screen.
The Heavenly Stems, twelve Earthly Branches key are mainly used in and select or insert function among Fig. 2 of the present invention, partial function illustrated in table 3 describes in detail and sees publication application specification " Chinese QWERTY keyboard and replacement input method of " application number: 2006100933678 (the present invention adjusts the partial function of the Heavenly Stems, twelve Earthly Branches key in the above-mentioned prospectus).
Table 3
Character input example in the table 3: do not hitting under the situation of letter key, directly hit the last of the ten Heavenly stems key or son import " radix point ", " 0-9 ", " comma " and " fullstop " respectively to last of the twelve Earthly Branches key; Hit tilted letter Y, hit second key input " ∈ " again, hit tilted letter Z key, hit again that third key input " " " " etc.---forward character hits the Y key, and the character after leaning on hits the Z key, and then hits character place key.
The input method of different syllabograms or character string is described with table 4 below:
Table 4
Consonant coding input in the table 4 be equivalent to the broken phonetic input in some input methods, but consonant coding of the present invention input has 5 zero initials, makes the initial consonant sum reach 26.The difference of roman, italic also will make coding point to different coded objects.
Now technology path of the present invention is done a brief description:
(1) brings each character in the super large character set into a pronunciation system.The present invention may give each character at least one pronunciation as the pronunciation reference system with 422 syllables that retrieved and five kinds of pronunciations of each syllable.There is the use of generally acknowledged pronunciation to generally acknowledge pronunciation, generally acknowledges pronunciation, in the pronunciation reference system, use a pronunciation.The pronunciation spelling is referring to Fig. 1, table 1 and table 2.
(2) single syllable character double-tone jointization.The present invention with the pronunciation of character in the super large character set as first syllable, one or more related words of selecting this character again are as limiting word, to limit character pronunciation as second syllable, the selection of second syllable should be avoided " double-tone joint repeated code " as far as possible---and a double-tone joint coding has two or more coded objects.
(3) differentiation of unisonance people having the same aspiration and interest disyllabic word or polysyllabic word.The present invention selects to limit word once more and it is broken up out as ultima with its pronunciation according to the key word in disyllabic word or the polysyllabic word.
(4) coded object and hardware are selected.The present invention is the basic coding object with upright The Orchid Pavilion " word sea " 65000 words, selects Chinese QWERTY keyboard to be input hardware.
(5) the coded reference bibliography is selected.The basic bibliography of " Chinese dictionary entry word sound sequence index " and " Chinese big dictionary " conduct that the present invention selects U.S. sinologist professor Mei Weiheng to engage domestic expert to write.
(6) ultimate aim is to pave the technology road for the alphabetizing of block character.A complete coding unit of the present invention is exactly so-called Chinese-character phonetic letter, for example " One World, One Dream." be encoded to
TY·YH?YIGE?UIJM,TY·YH?YIGE?MTXH.。
Claims (2)
1.52 key position super large character set input method of Chinese character is a kind of computer keyboard Chinese character coding input technology, it is characterized in that: single syllable character double-tone joint coding, disclose the meaning of coding in the double-tone joint coding by the mutual qualification of two syllables, each encode Chinese characters for computer all contains sound, rhyme, tone and meaning four elements.
2. 52 key position super large character set input method of Chinese character according to claim 1 by indicating the different keys of the two classes position input initial consonant and the simple or compound vowel of a Chinese syllable of roman and tilted letter, are imported tone automatically in the process of input initial consonant and simple or compound vowel of a Chinese syllable.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810210460A CN101650601A (en) | 2008-08-14 | 2008-08-14 | 52-key mapping ultra-large character set Chinese character input method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200810210460A CN101650601A (en) | 2008-08-14 | 2008-08-14 | 52-key mapping ultra-large character set Chinese character input method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101650601A true CN101650601A (en) | 2010-02-17 |
Family
ID=41672850
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200810210460A Pending CN101650601A (en) | 2008-08-14 | 2008-08-14 | 52-key mapping ultra-large character set Chinese character input method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101650601A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110716651A (en) * | 2019-09-25 | 2020-01-21 | 舒从如 | Artificial intelligence computer keyboard |
-
2008
- 2008-08-14 CN CN200810210460A patent/CN101650601A/en active Pending
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110716651A (en) * | 2019-09-25 | 2020-01-21 | 舒从如 | Artificial intelligence computer keyboard |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107562824B (en) | Text similarity detection method | |
WO2018149250A1 (en) | Chinese character skeleton code input method and system having suggestion screen interface | |
CN101334692A (en) | Link-up phonetic input method | |
CN1027558C (en) | Five-stroke and two-dimension encoding method and keyboard | |
CN104106023A (en) | Input method for compatible keyboard | |
CN101825955A (en) | Eight-final pinyin input method | |
CN100498662C (en) | Vowel pinyin Chinese characters input method | |
CN103616960A (en) | Six vowel binary syllabification input method | |
CN101630309A (en) | Word processing system with fault tolerance function and method | |
CN101650601A (en) | 52-key mapping ultra-large character set Chinese character input method | |
CN100568166C (en) | A kind of character-checking typewriting idem code input method and input media thereof and application | |
CN101055501A (en) | Phonetic input method for Chinese | |
CN106201007A (en) | Integrate phonetic and the Chinese character input system of character shape coding various ways | |
CN101231558A (en) | Oracle spelling and component resolution input method | |
US20070160292A1 (en) | Method of inputting chinese characters | |
KR20110039419A (en) | Chinese character input method adapting for chinese teaching | |
CN105807949B (en) | Tibetan language input method and system | |
CN109766015A (en) | Chinese character Latin code inputting method | |
CN104076939A (en) | Pinyin character scheme | |
CN108459735A (en) | Phonetic double-click touch screen method for inputting pinyin | |
CN1409201A (en) | Computer Yi character input method | |
CN1306240A (en) | Chinese-character 'shape-pronunciation code' input method | |
CN1050206C (en) | Regular Chinese phonetic alphabet Chinese character input method | |
CN110502128B (en) | Chinese character multi-element input method and system | |
CN101216735A (en) | Computer main keyboard phonetic small keyboard assorted stroke symmetric code input method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20100217 |