CN101650601A - 52-key mapping ultra-large character set Chinese character input method - Google Patents

Info

Publication number
CN101650601A
Authority
CN
China
Prior art keywords
chinese
character
coding
syllable
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200810210460A
Other languages
Chinese (zh)
Inventor
舒从如
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to CN200810210460A
Publication of CN101650601A

Abstract

The invention discloses a 52-key Chinese character input method for ultra-large character sets, belonging to the field of computer keyboard Chinese character coding input. Monosyllabic characters are given disyllabic codes in order to distinguish homophones. Because the two syllables of a disyllabic code qualify each other and thereby reveal the meaning of the code, every complete code unit contains four elements (initial, final, tone, and meaning) and has the character of a concise phonetic spelling of Chinese characters. In classical Chinese and complex contexts, the single character is committed to the screen automatically; in modern Chinese contexts, the two-character word is committed automatically. This ensures fast and accurate input of the Chinese characters of an ultra-large character set.

Description

52-key Chinese character input method for ultra-large character sets
The invention belongs to the field of computer keyboard Chinese character coding input, and in particular concerns a method that uses Hanyu Pinyin to encode the Chinese characters of an ultra-large character set for computer input.
An ultra-large character set here means a character set containing more than three times as many characters as GBK (21,003 Chinese characters). At present, the only known input technique for an ultra-large Chinese character set is the "new allusion quotation code inputting method", which addresses the input of the 65,000 characters of the Founder Lanting "Zihai" ("word sea"). That method uses an interactive graphical interface in which features of a Chinese character, such as its radical, strokes, stroke order, and stroke count, serve as search conditions; the user narrows down the target character by successively selecting combinations of features and finally enters it with a mouse click. From the standpoint of coded Chinese character input, the "new allusion quotation code inputting method" has neither a fixed code length nor fixed code characters, and its input efficiency is very low. As a radical-and-stroke retrieval scheme, however, it has some value when the pronunciation of a character is unknown.
The object of the invention is to give every Chinese character at least one Latin-letter code that contains the four elements of initial, final, tone, and meaning, so that the characters of an ultra-large character set can be input efficiently.
The object of the invention is achieved as follows:
(1) Selection algorithm: {homophone characters} ∩ {qualifier characters} = {output characters}
That is, the set of output characters (the characters committed to the screen) is the intersection of the homophone character set and the qualifier character set. For the output character to be unique, the given homophone set and the given qualifier set must share exactly one element.
(2) Input method: pronunciation of the output character + pronunciation of the qualifier character = output character
Modern Chinese tends toward disyllabic words, so in a modern Chinese context the output can be set to include the qualifier character, which makes input more efficient.
(3) Pronunciation spelling: upright initial + upright final = high-level tone (first tone)
Upright initial + italic final = rising tone (second tone)
Italic initial + italic final = falling-rising tone (third tone)
Italic initial + upright final = falling tone (fourth tone)
Upright initial + upright final' = neutral tone
The initials and finals used in the pronunciation spelling are all represented by the letter keys of a computer keyboard. For the assignment of the initials and finals of the Scheme for the Chinese Phonetic Alphabet to keys, see the invention patent specification "second-generation regional code input method of Chinese character" (patent No. ZL94115551.X) and the published application specification "Chinese QWERTY keyboard and replacement input method of" (application No. 2006100933678).
With this scheme, the coding space opened up by "output character pronunciation + qualifier character pronunciation" theoretically contains $52^4 = 7311616$ code units. According to data obtained by actually surveying Chinese character pronunciations, there are about $1400^2 = 1960000$ code units. This huge coding space can accommodate all characters of the ultra-large character set and all disyllabic words. A code reveals its meaning through the association between its first and second characters. It contains the four elements of a character (initial, final, tone, and meaning) and thus has the nature of a concise "phonetic spelling of Chinese characters".
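The two coding-space figures can be checked with a few lines of Python. The sketch below reads the text as follows, which is an interpretation and not something the source states outright: a full code has four letters drawn from 52 keys, and roughly 1,400 distinct toned syllables can fill each of the two syllable slots. The comparison against 65,000 characters uses the character count quoted elsewhere in the description.

```python
KEYS = 52                # 26 upright + 26 italic letter keys
LETTERS_PER_CODE = 4     # two letters per syllable, two syllables per code
TONED_SYLLABLES = 1400   # approximate figure quoted in the description
CHARSET_SIZE = 65_000    # size of the target character set quoted in the text

theoretical_units = KEYS ** LETTERS_PER_CODE
practical_units = TONED_SYLLABLES ** 2

print(theoretical_units)                # 7311616
print(practical_units)                  # 1960000
print(practical_units // CHARSET_SIZE)  # 30 code units per character
```

Even the smaller, practical figure leaves about thirty code units per character, which is the headroom the scheme relies on to cover both single characters and disyllabic words.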
Description of the drawings: Fig. 1 is a chart of the 422 syllables as initial-final combinations. The columns show the correspondence between initials and keys, the rows show the correspondence between finals and keys, and each intersection of a column initial with a row final is a syllable. The upper-left corner of the figure gives examples of how the five tones (high-level, rising, falling-rising, falling, and neutral) are represented. Fig. 2 shows the layout of the upright and italic letters on the keyboard and the correspondence between consonants, vowels, and keys. The Chinese characters from the "Thousand Character Classic" marked beside the Latin letter keys, and the Heavenly Stem and Earthly Branch symbols marked on the keys, are input symbols for other region-position code encodings.
The input method of the invention is described in detail below with reference to the drawings and embodiments.
The Hanyu Pinyin initials and finals of the invention are assigned to 26 upright letter keys and 26 italic letter keys; the upright key and the italic key of the same letter carry the same initial and finals, see Table 1.
| Upright letter key | Italic letter key | Initial | Final |
|---|---|---|---|
| A | A | zh | a, üan |
| B | B | b | ia, ua |
| C | C | c | uan |
| D | D | d | ao |
| E | E | e-row zero initial | e, ün |
| F | F | f | an |
| G | G | g | ang |
| H | H | h | iang, uang |
| I | I | ch | i, ueng |
| J | J | j | ian |
| K | K | k | iao |
| L | L | l | in |
| M | M | m | m, ie |
| N | N | n | n, iou (iu) |
| O | O | a-row and o-row zero initial | uo, o, io |
| P | P | p | ou, er |
| Q | Q | q, ng | ng, ing |
| R | R | r | en |
| S | S | s | ai, ê |
| T | T | t | eng |
| U | U | sh | u |
| V | V | zero initial for m, n, ng | üe, uei (ui) |
| W | W | u-row zero initial | ei |
| X | X | x | uai, ü |
| Y | Y | i-row and ü-row zero initial | ong, iong |
| Z | Z | z | uen (un) |
Table 1
In Table 1, the initials zh, ch, and sh are represented by the single letters a, i, and u respectively. The e-row zero initial covers all finals beginning with e (e, ê, ei, en, eng, er): when one of them forms a syllable on its own, it is prefixed with e. The a-row and o-row zero initial covers all finals beginning with a or o (a, ai, an, ang, ao, o, ou, ong): when one of them forms a syllable on its own, it is prefixed with o. The zero initial for m, n, ng covers the nasals m, n, and ng: when one of them forms a syllable on its own, it is prefixed with v. The u-row zero initial covers all finals beginning with u (u, ua, uai, uan, uang, uei (ui), uen (un), ueng, uo): when one of them forms a syllable on its own, it is prefixed with w. The i-row and ü-row zero initial covers all finals beginning with i or ü (i, ia, ian, iang, iao, ie, in, ing, io, iou (iu), iong, ü, üan, üe, ün): when one of them forms a syllable on its own, it is prefixed with y. The initial ng is used only to spell the Cantonese "(I)" ng ú (encoded as QU). The m, n, and ng in the finals column refer to these three sounds forming syllables on their own, in which case they are treated as finals.
The invention represents tone by combinations of upright and italic key letters; see Table 2.
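To make the key assignment concrete, the following Python sketch transcribes Table 1 into two lookup tables and lists the initial and final readings that a two-letter code could stand for. It is an illustration under stated assumptions: zero initials are represented by an empty string, letters are case-folded so typeface (and therefore tone) is ignored, and no phonotactic filtering is attempted, so keys that carry two finals yield two candidates. The names `INITIALS`, `FINALS`, and `candidates` are hypothetical.

```python
# Transcribed from Table 1. "" stands for a zero initial (an assumption of
# this sketch, not a notation used in the source).
INITIALS = {
    "A": ["zh"], "B": ["b"], "C": ["c"], "D": ["d"], "E": [""], "F": ["f"],
    "G": ["g"], "H": ["h"], "I": ["ch"], "J": ["j"], "K": ["k"], "L": ["l"],
    "M": ["m"], "N": ["n"], "O": [""], "P": ["p"], "Q": ["q", "ng"],
    "R": ["r"], "S": ["s"], "T": ["t"], "U": ["sh"], "V": [""], "W": [""],
    "X": ["x"], "Y": [""], "Z": ["z"],
}
FINALS = {
    "A": ["a", "üan"], "B": ["ia", "ua"], "C": ["uan"], "D": ["ao"],
    "E": ["e", "ün"], "F": ["an"], "G": ["ang"], "H": ["iang", "uang"],
    "I": ["i", "ueng"], "J": ["ian"], "K": ["iao"], "L": ["in"],
    "M": ["m", "ie"], "N": ["n", "iou"], "O": ["uo", "o", "io"],
    "P": ["ou", "er"], "Q": ["ng", "ing"], "R": ["en"], "S": ["ai", "ê"],
    "T": ["eng"], "U": ["u"], "V": ["üe", "uei"], "W": ["ei"],
    "X": ["uai", "ü"], "Y": ["ong", "iong"], "Z": ["uen"],
}


def candidates(code: str) -> list[tuple[str, str]]:
    """List every (initial, final) pair a two-letter code could denote."""
    if len(code) != 2:
        raise ValueError(f"expected a two-letter code, got {code!r}")
    initial_key, final_key = code[0].upper(), code[1].upper()
    return [(i, f) for i in INITIALS[initial_key] for f in FINALS[final_key]]


if __name__ == "__main__":
    print(candidates("UG"))  # [('sh', 'ang')]
    print(candidates("QU"))  # [('q', 'u'), ('ng', 'u')]
```

A real decoder would have to discard combinations that are not valid syllables (for example by consulting the 422-syllable chart of Fig. 1); that step is left out here because the chart itself is not reproduced in the text.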
Figure A20081021046000061
Table 2
The invention spells the pronunciation of a Chinese character with two Latin letters, and this pronunciation comprises three elements: initial, final, and tone. Words whose sound can be heard but whose meaning cannot be told from the sound alone are homophones with identical tones. For example, "interim" and "final" are both coded QIAY (qi-zhong) and need further qualification: the codes are rewritten as QIAYJJ for "interim", with the qualifier syllable JJ (jian), and QIAYJM for "final", with the qualifier syllable JM (jie, "knot"). The syllable separator is generated automatically, and the two qualifier characters are not committed to the screen during input.
A complete code unit has the four elements of initial, final, tone, and meaning. The invention gives monosyllabic characters disyllabic codes precisely so that every character has a complete code unit; a character with a complete code unit can be committed to the screen without any selection step. Once a monosyllabic character has been given a disyllabic code, its code has the same form as that of a disyllabic word. Three technical problems must be solved here:
(1) Distinguishing code spellings: the invention uses a syllable separator mark to solve this problem. For example, the codes for "rice, rice" can be "MIFF, MIFF".
(2) Choice of output: in classical Chinese and complex contexts the single character is committed to the screen automatically; in modern Chinese contexts the two-character word is committed automatically.
(3) Handling of characters without a known pronunciation: they are coded with a pronunciation taken from the pronunciation system.
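The QIAY example is an instance of the selection rule {homophones} ∩ {qualified characters} = {output}. The Python sketch below shows that rule on the two entries named in the text. The two dictionaries are hypothetical stand-ins for a real code table, English glosses stand in for the Chinese words, and the function name `commit_candidates` is invented for this illustration. It also assumes that the first four letters of a code are the word and anything after them is the qualifier syllable.

```python
# Hypothetical index: four-letter code -> entries pronounced that way.
HOMOPHONES = {"QIAY": {"interim", "final"}}

# Hypothetical index: qualifier syllable -> entries that qualifier selects.
QUALIFIED = {"JJ": {"interim"}, "JM": {"final"}}


def commit_candidates(code: str) -> set[str]:
    """Return the entries a typed code can commit to the screen."""
    base, qualifier = code[:4], code[4:]
    homophones = HOMOPHONES.get(base, set())
    if not qualifier:
        # No qualifier typed yet: every homophone is still a candidate.
        return set(homophones)
    # Set intersection: only entries in both sets survive.
    return homophones & QUALIFIED.get(qualifier, set())


if __name__ == "__main__":
    print(sorted(commit_candidates("QIAY")))    # ['final', 'interim']
    print(sorted(commit_candidates("QIAYJM")))  # ['final']
```

When the intersection has exactly one element, the entry can be committed without showing a candidate list, which is the property the description asks of a complete code unit.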
Because the invention inputs with 52 key positions on a computer keyboard, tone is in effect treated as a coding element, and choosing the wrong typeface (upright or italic) for a letter produces the wrong tone. Given the actual Mandarin proficiency of the users of an ultra-large character set input method, a fault-tolerance setting is necessary. Fault tolerance here means ignoring the upright or italic choice of the letters: a letter sequence then points to the union of the subsets that the strictly upright and italic codes with the same letters point to. For example, if the tone element is disregarded, the code "UGHS" can point to coded objects such as "Shanghai", "injury", and "commercial circles". Fault tolerance often yields several candidates, from which the user must select the one to commit to the screen.
The Heavenly Stem and Earthly Branch keys in Fig. 2 are mainly used for selection and for inserting functions. Some of their functions are illustrated in Table 3; for details see the published application specification "Chinese QWERTY keyboard and replacement input method of" (application No. 2006100933678). The present invention adjusts some of the functions that the specification assigns to the Heavenly Stem and Earthly Branch keys.
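The fault-tolerance rule (ignore typeface, return the union) can be sketched as a case-insensitive index. As an assumption made only for plain-text code, uppercase stands for upright keys and lowercase for italic keys. The three tonal spellings in `LEXICON` are illustrative guesses for the three glosses given in the text, not codes taken from the source, and the names `LEXICON`, `TOLERANT_INDEX`, and `lookup` are hypothetical.

```python
from collections import defaultdict

# Hypothetical strict codes (case = typeface = tone) for the three glosses.
LEXICON = {
    "uGhs": ["Shanghai"],
    "UGhS": ["injury"],
    "UGhs": ["commercial circles"],
}


def build_tolerant_index(lexicon):
    """Group entries by their letters only, discarding the typeface."""
    index = defaultdict(list)
    for code, entries in lexicon.items():
        index[code.upper()].extend(entries)
    return index


TOLERANT_INDEX = build_tolerant_index(LEXICON)


def lookup(code: str, tolerant: bool = False) -> list[str]:
    """Strict lookup matches typeface; tolerant lookup ignores it."""
    if tolerant:
        return list(TOLERANT_INDEX.get(code.upper(), []))
    return list(LEXICON.get(code, []))


if __name__ == "__main__":
    print(lookup("uGhs"))                         # ['Shanghai']
    print(sorted(lookup("UGHS", tolerant=True)))  # all three entries
```

The trade-off is visible in the output: the strict code commits one entry directly, while the tolerant lookup returns several and therefore needs a selection step.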
Table 3
Character input examples from Table 3: with no letter key pressed, pressing the gui key (the last of the ten Heavenly Stems) or one of the zi to hai keys (the first to the last of the twelve Earthly Branches) directly inputs the decimal point, the digits 0-9, the comma, and the full stop, respectively. Pressing the italic Y key and then the yi key (the second Heavenly Stem) inputs "∈"; pressing the italic Z key and then the bing key (the third Heavenly Stem) inputs " ", and so on. In general, press the Y key for a character marked in the front position or the Z key for a character marked in the rear position, and then press the key on which the character is located.
The input of words of different syllable counts and of character strings is illustrated with Table 4 below:
Figure A20081021046000081
Table 4
The consonant-code input in Table 4 corresponds to the abbreviated pinyin input found in some input methods, but the consonant-code input of the invention has five zero initials, which brings the total number of initials to 26. The distinction between upright and italic letters also makes codes point to different coded objects.
The technical route of the invention is briefly described as follows:
(1) Bring every character of the ultra-large character set into one pronunciation system. Taking the 422 retrieved syllables and the five tonal readings of each syllable as the pronunciation reference system, the invention can give every character at least one pronunciation. Where a generally accepted pronunciation exists, it is used; where none exists, a pronunciation from the reference system is assigned. For the pronunciation spelling, see Fig. 1, Table 1, and Table 2.
(2) Disyllabic coding of monosyllabic characters. The invention takes the pronunciation of a character in the ultra-large character set as the first syllable, selects one or more words related to that character as qualifier words, and takes the pronunciation of the qualifier character as the second syllable. The choice of the second syllable should avoid "disyllabic repeated codes" as far as possible, that is, a single disyllabic code having two or more coded objects.
(3) Distinguishing disyllabic or polysyllabic words with the same sound and tone. Based on the key character of the disyllabic or polysyllabic word, the invention again selects a qualifier word and uses its pronunciation as the final syllable to tell such words apart.
(4) Choice of coded objects and hardware. The invention takes the 65,000 characters of the Founder Lanting "Zihai" ("word sea") as its basic coded objects and the Chinese QWERTY keyboard as its input hardware.
(5) Choice of reference works for coding. The invention takes as its basic references the "Chinese dictionary entry word sound sequence index", compiled by domestic experts engaged by the American sinologist Professor Mei Weiheng, and the "Chinese big dictionary".
(6) The ultimate aim is to pave a technical road toward the alphabetization of square (Chinese) characters. A complete code unit of the invention is precisely such a phonetic spelling of a Chinese character; for example, "One World, One Dream." is encoded as
TY·YH YIGE UIJM, TY·YH YIGE MTXH.
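Step (2) of the technical route asks that repeated codes (one code with two or more coded objects) be avoided when qualifier syllables are chosen. A minimal Python sketch of such a check follows. The input mapping and the function name `find_repeated_codes` are hypothetical; the sample entries reuse glosses and codes quoted in the description, with typeface ignored.

```python
from collections import defaultdict


def find_repeated_codes(assignments: dict[str, str]) -> dict[str, list[str]]:
    """Return every code that is assigned to two or more coded objects."""
    by_code = defaultdict(list)
    for coded_object, code in assignments.items():
        by_code[code].append(coded_object)
    return {code: objs for code, objs in by_code.items() if len(objs) > 1}


if __name__ == "__main__":
    # Before qualification, "interim" and "final" share the code QIAY.
    before = {"interim": "QIAY", "final": "QIAY", "rice": "MIFF"}
    print(find_repeated_codes(before))  # {'QIAY': ['interim', 'final']}

    # After adding qualifier syllables, no code is shared.
    after = {"interim": "QIAYJJ", "final": "QIAYJM", "rice": "MIFF"}
    print(find_repeated_codes(after))   # {}
```

Running such a check each time a qualifier syllable is assigned would show immediately whether the chosen second syllable still collides with an existing entry.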

Claims (2)

1. A 52-key Chinese character input method for ultra-large character sets, being a computer keyboard Chinese character coding input technique, characterized in that: monosyllabic characters are given disyllabic codes; within a disyllabic code the two syllables qualify each other and thereby reveal the meaning of the code; and every Chinese character code contains the four elements of initial, final, tone, and meaning.
2. The 52-key Chinese character input method for ultra-large character sets according to claim 1, wherein initials and finals are input through two classes of keys, marked with upright letters and italic letters respectively, and the tone is input automatically in the course of inputting the initial and the final.
CN200810210460A 2008-08-14 2008-08-14 52-key mapping ultra-large character set Chinese character input method Pending CN101650601A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200810210460A CN101650601A (en) 2008-08-14 2008-08-14 52-key mapping ultra-large character set Chinese character input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200810210460A CN101650601A (en) 2008-08-14 2008-08-14 52-key mapping ultra-large character set Chinese character input method

Publications (1)

Publication Number Publication Date
CN101650601A true CN101650601A (en) 2010-02-17

Family

ID=41672850

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200810210460A Pending CN101650601A (en) 2008-08-14 2008-08-14 52-key mapping ultra-large character set Chinese character input method

Country Status (1)

Country Link
CN (1) CN101650601A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110716651A (en) * 2019-09-25 2020-01-21 舒从如 Artificial intelligence computer keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20100217