CN1838044A - Chinese spelling, tone and stroke combined input method - Google Patents

Chinese spelling, tone and stroke combined input method Download PDF

Info

Publication number
CN1838044A
CN1838044A CN 200610020399 CN200610020399A CN1838044A CN 1838044 A CN1838044 A CN 1838044A CN 200610020399 CN200610020399 CN 200610020399 CN 200610020399 A CN200610020399 A CN 200610020399A CN 1838044 A CN1838044 A CN 1838044A
Authority
CN
China
Prior art keywords
character
string
chinese
stroke
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200610020399
Other languages
Chinese (zh)
Other versions
CN100399245C (en
Inventor
丁光耀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNB2006100203995A priority Critical patent/CN100399245C/en
Publication of CN1838044A publication Critical patent/CN1838044A/en
Application granted granted Critical
Publication of CN100399245C publication Critical patent/CN100399245C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

This invention relates to a Chinese spelling, tone and stroke combined input method. Wherein, applying Chinese code and special characters to form Chinese code database with Chinese code equal to the first spelling character plus Chinese vowel plus the tone plus the stroke, tone equal to one of the character opposite to one of the five tones in Chinese spelling, and the stroke equal to first stroke character plus the second stroke character plus the final stroke character (making up with the final stroke if total strokes less than three). First inputting Chinese the spelling character, then one or more of strings of vowel, tone, and stroke with any character can be inserted into another string or past; or inputting expression or file content. This invention is easy to study, has high efficiency and strong function, and brings the most flexibility to user.

Description

Chinese pinyin tone and stroke combinatorial input method
Technical field:
The present invention relates to a kind of Chinese Keypad Entry, relate in particular to the Chinese keyboard input method that a kind of pinyin stroke combines.
Background technology:
In the existing Chinese-character keyboard input method, can be divided three classes:
One, based on the input method of font
Input method based on font typically is represented as five character-shape input methods, with radical Chinese character or phrase are encoded, the repetition rate of coding is low, speed is fast, but the phrase amount is limited, and learning difficulty is big, and there is same problem in the input method of other character shape codings, be fit to professional words input personnel and use, be not suitable for ordinary populace and use.
Five stroke input methods adopt well-known 5 basic strokes, simply need not learn, and few with key, be fit to the keypad Chinese character input of mobile phone and so on, but the input code length, input speed is slow, and everyone order of strokes observed in calligraphy is not necessarily unified.
Two, based on the word phonetic input method of phonetic
Because the mass education basis of phonetic itself, be a kind of input method of popularizing the most based on the input method of phonetic, need not learn, along with the increase of dictionary amount, it is perfect that sentence is handled, and speed also improves constantly, and is subjected to liking of more and more masses.Typical case's representative has purple light phonetic, intelligence phonetic letter, Microsoft's phonetic etc.
Pure spelling input method also exists some defectives, at first, is inevitable individual character input, because the phonetically similar word of Chinese character is a lot, so its repetition rate of coding height have to be searched character reminding page by page.Secondly, reduce input code length owing to covet, remove the space and import the phonetic of cutting apart between Chinese character, and need continuous a plurality of phonetics are decomposed into automatically the phonetic of individual character, this is the automatic segmentation technology of phonetic.On the one hand,, there is inevitably input cutting ambiguity in theory, needs the user to get involved just and can avoid mistake owing to there is no initial consonant Chinese character; On the other hand, the automatic segmentation technology has caused also that phonetic is difficult to stroke, tone is organic combines neatly, and can not reduce the repetition rate of coding with simple mode; During automatic segmentation technical finesse words and phrases, require the complete input of simple or compound vowel of a Chinese syllable, lost dirigibility, perhaps simply do not import simple or compound vowel of a Chinese syllable, caused repetition rate of coding raising.
Three, the input method that makes up based on sound shape
These class methods are subdivided into two kinds of methods again:
1, focus on a class input method of minimizing repeated code and input code length, must utilize phonetic and simple font to carry out more secondary coding, during use, the user must encode in hypermnesia, has lost the simplicity of phonetic, makes input become complicated, and difficult note finds it difficult to learn.
2, emphasis point is to reduce repeated code and keeps a class input method of the simplicity of phonetic, generally select for use the phonetic of Chinese character and the most basic stroke to carry out simple code, its maximum characteristics are need not learn or learn simply, and have overcome the repetition rate of coding of individual character input, are fit to the demand of ordinary populace.Mainly there is following shortcoming in existing this class input method: when input Pinyin, can only omit the character of back, need the character of input Pinyin more, sign indicating number is longer, simultaneously the auxiliary input of stroke as phonetic; Phonetic and stroke must be imported by fixing mode, and owing to the adding of stroke encoding has destroyed the intrinsic rule of Pinyin coding and can not utilize the automatic segmentation technology to carry out the automatic decomposition of phonetic; And modes such as necessary artificial input space, the cutting Chinese character.This is to be cost to sacrifice input efficiency, reduces repeated code and improves user-friendliness.In the sentence input, when input " People's Republic of China (PRC) ", the user-friendliness of both of these case below observing: " zhonghuarenmingongheguo ", " zhong hua ren min gong heguo " both of these case, obviously the latter event user-friendliness is better.
Because pinyin stroke is easy to learn, along with informationalized constantly universal and deep, like that based on what the input method of phonetic tone and stroke composite class can be subjected to more and more ordinary populaces what people pressed for is Chinese character coding input method simple to operate, powerful, wieldy.
Summary of the invention:
Purpose of the present invention just provides a kind of Chinese pinyin tone and stroke combinatorial input method, and this kind input method is easy to learn, uses flexible operation, convenience, and input code is short, and the input efficiency height is powerful.
The technical solution adopted for the present invention to solve the technical problems is: a kind of Chinese pinyin tone and stroke combinatorial input method, comprise with the Chinese character code storehouse be pre-stored in the computing machine, character string on keyboard in the input Chinese character code storehouse, then with character string the step that Chinese code database is retrieved, shown and selects of input, be characterized in: the Chinese character code storehouse is the Hanzi font library that is made of encode Chinese characters for computer and special symbol coding, wherein:
Encode Chinese characters for computer=phonetic initial character+simple or compound vowel of a Chinese syllable+tone+stroke,
First character in phonetic initial character=Chinese phonetic alphabet,
Simple or compound vowel of a Chinese syllable in simple or compound vowel of a Chinese syllable=Chinese phonetic alphabet,
In five characters of five tone correspondences in tone=Chinese phonetic alphabet one,
Stroke=first stroke character+second stroked character+last stroked character, the Chinese character that the stroke less than is three is supplied three with last stroke, stroked character be point, horizontal, vertical, cast aside, in five characters of five stroke correspondences of folding one.
Compared with prior art, the invention has the beneficial effects as follows: whole coding is based on spelling scheme of Chinese character, and is easy to learn; Existing phonetic alphabet have tone again in the coding, also have stroke, and therefore the repetition rate of coding of input reduces the input efficiency height greatly; Stroke only select for use point, horizontal, vertical, cast aside, five kinds of basic strokes of folding, make that also the present invention is easy to learn, easy to operate; Three strokes selecting for use are the easiest the first sum of, two and the end pens remembered of people, and need not the middle stroke that be difficult to distinguish and remember order be encoded and imported, this also makes the present invention easy to learn, easy to operate, and three repetition rates of coding that are enough to make Chinese character have guaranteed the input efficiency of Chinese character to without the searching of page turning.Can make input method of the present invention both can import individual character in conjunction with corresponding search method and also can import words and phrases easily, and the file content that has had in the input computing machine, powerful.
The character set that above-mentioned simple or compound vowel of a Chinese syllable, tone, the character that stroke adopted constitute is mutually disjointed.Also promptly do not have common character in three kinds of character set, the present invention can upset the order of three kinds of codings arbitrarily and import when input simple or compound vowel of a Chinese syllable, tone, stroke like this, and computing machine also can accurately decompose three kinds of character set automatically, and recovers correct order.This makes input method of the present invention more flexibly, conveniently, is easy to use.
Five characters of five tone correspondences in the above-mentioned Chinese phonetic alphabet are: y-1, and L-2, c-3, x-4, w-softly; Point in the stroke, five characters horizontal, vertical, that cast aside, roll over five stroke correspondences are: d-point, h-horizontal stroke, s-perpendicular, p-left-falling stroke, z-folding.
This keypad character corresponded manner, the pronunciation first letter of pinyin of tone and stroke title is corresponding with its tone and stroke, alleviated the memory difficulty of study greatly; And also import tone and stroke specially, only need the English character keyboard of standard just passable without collateral key.Method of the present invention is easily learned, the convenient use, adaptability is strong.
Above-mentioned special symbol coding=specific code+classification sign indicating number+symbolic code, wherein:
Specific code=u,
Among classification sign indicating number=b-punctuation mark, s-mathematic sign, x-Greek alphabet, r-set with Japanese alphabet or q-other symbol one;
The character string that the initial character of symbolic code=each syllable of symbol pronunciation is formed, the assumed name symbol then constitutes with the set with Japanese alphabet phonetic symbol for communicating with the eyes.
Because specific code u is different with all first letter of pinyin, therefore when importing, special symbol need not change input mode, easy to operate, the input efficiency height, and the classification sign indicating number is made up of the initial character of the phonetic of the first Chinese character of item name, symbolic code is made up of each syllable initial character of designation pronunciation, easily learns easily note.
Above-mentioned on keyboard during the Chinese character coding set string in the input Chinese character code storehouse, import the phonetic initial character of Chinese character earlier, carry out word selection or import one or more combination in three kinds of character strings of simple or compound vowel of a Chinese syllable, tone, stroke of corresponding Chinese character again, and do not have sequencing when simple or compound vowel of a Chinese syllable, tone, three kinds of character strings combinations of stroke during combination; And the character in any one character string in these three kinds of character strings can insert in another character string; When the input simple or compound vowel of a Chinese syllable was an, en, in, un, ang, eng, ing or ong, the input sequence of rhythm alphabetic character can change.In the step that above-mentioned character string with input is retrieved, shown and selects Chinese code database: also be that input string decomposes at first, it is decomposed into pinyin string, tone string, stroke string to the character string of importing; The concrete steps of decomposing are:
The a step: pinyin string, tone string, stroke string assignment are empty string,
The b step: get the phonetic initial character of input string and be connected to pinyin string,
The c step: if input string has been got the bundle decomposition that finishes, otherwise get the initial character of the residue character of input string,
If this character belongs to the rhythm alphabetic character set, this character is connected to pinyin string,
If this character belongs to the stroked character collection, this character is connected to the stroke string,
If this character belongs to the tone character collection, this character is connected to the tone string,
Repeat the process in this step, got, finish to decompose until input string.
Simple or compound vowel of a Chinese syllable in the pinyin string that obtains after the above-mentioned decomposition step also must carry out a preface adjustment, and the concrete steps of position preface adjustment are:
The a step:, then n is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the n character in the simple or compound vowel of a Chinese syllable string;
The b step: in second step,, g is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the g character in the simple or compound vowel of a Chinese syllable string.
Above imputting Chinese characters and corresponding computing machine decomposition method thereof, make the present invention when the input Chinese character, requirement to memory is extremely low, simple or compound vowel of a Chinese syllable, tone, three kinds of character strings of stroke can remember what what just imports, nor use the pipe order, so the input of input method of the present invention is very random, convenience.
Above-mentioned on keyboard during the Chinese character string in the character string in the input Chinese character code storehouse, the character in simple or compound vowel of a Chinese syllable, tone, the stroked character string can omit arbitrarily, so that only need import a character.
Above-mentioned pinyin string, tone string, stroke string are gone here and there by the retrieval that is linked in sequence into of pinyin string, tone string, stroke string, with this retrieval string Hanzi font library are retrieved then, and the concrete steps of retrieval are as follows:
A step: encode Chinese characters for computer=a get encode Chinese characters for computer in the character library,
The b step: compose: first character of S=encode Chinese characters for computer, first character of P=retrieval string,
The c step: if S or P are the string end mark, change the d step,
If S=P then composes: the character late of S=encode Chinese characters for computer, the character late of P=retrieval string,
Otherwise compose: the character late of S=encode Chinese characters for computer, P is constant,
Repeat this step process;
D step: if P is the string end mark, think retrieval string and encode Chinese characters for computer string coupling, choose the Chinese character of encode Chinese characters for computer string correspondence, otherwise, abandon the Chinese character of this encode Chinese characters for computer,
The e step: repeat a step, finish up to the character library retrieval, and be shown on the screen Chinese character that retrieves selective.
Above this imputting Chinese characters and corresponding character library search method thereof, make when the present invention imports Chinese character, simple or compound vowel of a Chinese syllable, tone, stroked character string are not required that note is complete, can at will default any character, this makes one aspect of the present invention can reduce code length neatly, improves input efficiency; Avoid misspelling and dialect pronunciation mistake, the problem that the Chinese character that may bring can't be imported by ignore character on the other hand; Therefore, it makes the present invention more flexibly, conveniently, arbitrarily, is fit to very much ordinary populace and uses.
Above-mentioned Chinese character code storehouse also includes dictionary, and all the Chinese character single character codes in words and phrases are formed a words and phrases coded strings array successively, and all words and phrases coded strings arrays constitute dictionary; When importing 3 words or the words and phrases more than 3 words on keyboard, behind the first Chinese character of input, remaining Chinese character can omit input arbitrarily, so that only need import a Chinese character more earlier.
Aforesaid way is after importing two or more Chinese character coding set string on the keyboard, computing machine is outside the current Chinese character string that will import is retrieved the Hanzi font library in the Chinese code database, also the Chinese character coding set of two or more that will import polyphone is connected into a retrieval string array, dictionary in the Chinese code database is retrieved, and the concrete steps of dictionary retrieval are as follows:
A step: words and phrases coded strings array=get a words and phrases coded strings array in the dictionary;
The b step: compose: first encode Chinese characters for computer of S2=words and phrases coded strings array,
First encode Chinese characters for computer of P2=retrieval string array;
The c step:, change the g step, otherwise carry out following operation if S2 or P2 are empty string:
The d step: compose: first character of S=S2, first character of P=P2,
The e step: if S or P are the string end mark, change the f step,
If S=P then composes: the character late of S=S2, the character late of P=P2,
Otherwise compose: the character late of S=S2, P is constant,
Repeat this step process;
The f step:, then compose if P is the string end mark: the next encode Chinese characters for computer of S2=words and phrases coded strings array,
The next one retrieval string of P2=retrieval string array,
Otherwise compose: the next encode Chinese characters for computer of 82=words and phrases coded strings array, P2 is constant,
Repeat the c step,
The g step: if P2 is an empty string, thinks and retrieve string array and words and phrases coded strings array coupling, choose the words and phrases of coded strings array correspondence; Otherwise abandon the words and phrases of this words and phrases coded strings array representation,
The h step: repeat a step, finish, be shown on the screen words and phrases that retrieve selective up to the words and phrases library searching.
Such input mode and search method thereof, make the present invention when the input words and phrases, can omit arbitrarily as the Chinese character individual character, when importing the character string of Chinese character certainly, character wherein can omit arbitrarily again, this dual any abridged words and phrases input mode promptly makes input mode very flexible, has reduced the input code length especially greatly, obviously improve Chinese character input efficiency, make function of the present invention very powerful.
Words and phrases in the dictionary in the above-mentioned Chinese character code storehouse also comprise the words and phrases that the filename of tape file name identifier is formed, when the filename of words and phrases of importing and dictionary mates, display screen provides the corresponding prompt symbol, represent that these words and phrases are filenames, after choosing this document name, computing machine reads out full content in the respective file as current input.
Computing machine carries out the above-mentioned dictionary that comprises the filename words and phrases of tape file name identifier when retrieving, and retrieval is shown on the screen words and phrases that retrieve selective after finishing; When if the words and phrases that retrieve are the filename words and phrases of tape file name identifier, then with these words and phrases on display screen the time, also demonstrate the filename identifier simultaneously, if choose this document nominal sentence, then computing machine is read the full content of filename institute respective file as current input.
The input mode of this file content can simply promptly be called in existing file content in the current text, and omitted search, open file, duplicate, a series of manual operationss such as stickup, be very easy to the user, improved input efficiency.
Above-mentioned on keyboard during the input special symbol coded string: input specific code u earlier, import any one of classification sign indicating number, symbolic code again, and the character of classification sign indicating number and symbolic code can omit arbitrarily, so that only need import a character.
Behind the input special symbol coded string, computing machine is formed a retrieval string with the special symbol string of input, with this retrieval string Hanzi font library is retrieved then, and the concrete steps of retrieval are as follows:
The a step: the special symbol that special symbol is encoded=got in the character library is encoded,
The b step: compose: first character of S=special symbol coding, first character of P=retrieval string,
The c step: if S or P are the string end mark, change the d step,
If S=P then composes: the character late of S=special symbol coding, the character late of P=retrieval string,
Otherwise compose: the character late of S=special symbol coding, P is constant,
Repeat this step process;
The d step: if P retrieves string and special symbol coded strings coupling for going here and there end mark, thinking, choose the special symbol of special symbol coded strings correspondence, otherwise, abandon the special symbol that this special symbol is encoded,
The e step: repeat a step, the special symbol retrieval finishes in character library, and is shown on the screen special symbol that retrieves selective.
This makes the also unusual simple and flexible of input of special symbol, and the match retrieval method makes and needn't remember the classification sign indicating number, can import as long as know the division name of special symbol, further reduces the requirement to memory.
The invention will be further described below in conjunction with embodiment.
Embodiment
Embodiment
A kind of embodiment of the present invention is: a kind of Chinese pinyin tone and stroke combinatorial input method, comprise with the Chinese character code storehouse be pre-stored in the computing machine, character string on keyboard in the input Chinese character code storehouse, the step that the character string of input is retrieved, shown and select Chinese code database then.The Chinese character code storehouse is the Hanzi font library that is made of encode Chinese characters for computer and special symbol coding, wherein:
Encode Chinese characters for computer=phonetic initial character+simple or compound vowel of a Chinese syllable+tone+stroke,
First character in phonetic initial character=Chinese phonetic alphabet also is phonetic initial character collection={ a, b, c, d, e, f, g, h, j, k, l, m, n, o, p, q, r, s, t, w, x, y, character among the z};
Simple or compound vowel of a Chinese syllable in simple or compound vowel of a Chinese syllable=Chinese phonetic alphabet also is rhythm alphabetic character set={ a, e, i, u, r, v, o, n, character among the g};
In five characters of five tone correspondences in tone=Chinese phonetic alphabet one,
Stroke=first stroke character+second stroked character+last stroked character, the Chinese character that the stroke less than is three is supplied three with last stroke, stroked character be point, horizontal, vertical, cast aside, in five characters of five stroke correspondences of folding one.
The character set that simple or compound vowel of a Chinese syllable in this example, tone, the character that stroke adopted constitute is mutually disjointed.
More specifically:
Five characters of five tone correspondences in this routine Chinese phonetic alphabet are: y-1, and L-2, c-3, x-4, w-softly; Point in the stroke, five characters horizontal, vertical, that cast aside, roll over five stroke correspondences are: d-point, h-horizontal stroke, s-perpendicular, p-left-falling stroke, z-folding.The d that stroked character is concentrated has represented a little and has pressed down (being called long point) two kinds of basic strokes; The h that stroked character is concentrated has represented horizontal stroke and has chosen (being called horizontal choosing) two kinds of basic strokes; The z that stroked character is concentrated has represented all turnover basic strokes; The c that tone character is concentrated gets " ginseng " first letter of pinyin (chan), because of " ginseng " and " three " likeness in form; The x that tone character is concentrated gets the first letter of pinyin of the Cantonise dialect pronunciation (xi) of " four ".
This routine special symbol coding=specific code+classification sign indicating number+symbolic code, wherein:
Specific code=u,
Among classification sign indicating number=b-punctuation mark, s-mathematic sign, x-Greek alphabet, r-set with Japanese alphabet or q-other symbol one;
The character string that the initial character of symbolic code=each syllable of symbol pronunciation is formed then constitutes with the set with Japanese alphabet phonetic symbol for the set with Japanese alphabet symbol.For example:
Left square bracket ' being encoded to of [' " ubzfkh ";
Being encoded to of Greek alphabet ' α ' " uxaef ";
Being encoded to of mathematic sign ' ∑ ' " ussgm ";
Being encoded to of set with Japanese alphabet ' か ' " urka ";
Being encoded to of other symbol ' ★ ' " uqwjx "
When on keyboard, importing the Chinese character string in the Chinese character code storehouse, import the phonetic initial character of Chinese character earlier, carry out word selection or import one or more combination in three kinds of character strings of simple or compound vowel of a Chinese syllable, tone, stroke of corresponding Chinese character again, and do not have sequencing when simple or compound vowel of a Chinese syllable, tone, three kinds of character strings combinations of stroke during combination; And the intercharacter in any one character string in these three kinds of character strings can insert in another character string; When the input simple or compound vowel of a Chinese syllable was an, en, in, un, ang, eng, ing or ong, the input sequence of rhythm alphabetic character can change.This is because in the simple or compound vowel of a Chinese syllable, because character n and ng always appear at the last of simple or compound vowel of a Chinese syllable, even changed input bit sequence during input, also can be adjusted into correct position preface by straightforward procedure.
This characteristic is significant in any omission input.
For example: import " worker ", phonetic is " gong ", and stroke is " hsh "
During input, " gg " of input Pinyin at first, import " hs " of stroke again, input string becomes " gghs ", if " worker " also do not appear in input prompt, can then continue input " on ", whole input string becomes " gghson ", through algorithm process, can decompose and obtain character string " ggon " and stroke " hs "; Simple or compound vowel of a Chinese syllable position preface is being adjusted, obtained at last: phonetic " gong ", stroke " hs ".
Can also import in the combined crosswise mode, for example: " worker " can first input Pinyin initial " g ", imports " g " of simple or compound vowel of a Chinese syllable again, import " h " of stroke then, import " o " character of simple or compound vowel of a Chinese syllable again, form input string " ggho ", can on screen, select input " worker " word.This example has been omitted tone character, and simple or compound vowel of a Chinese syllable has saved " n " character, and stroke " h " has been inserted between the rhythm alphabetic character " g ", " o " of phonetic, and " g " of simple or compound vowel of a Chinese syllable and " o " put upside down order.
When importing the Chinese character coding set string in the Chinese character code storehouse on keyboard, the character in simple or compound vowel of a Chinese syllable, tone, the stroked character string can omit arbitrarily, so that only need import a character.Certainly in fact also can in this kind character string, omit a character and not import, at this moment, then become the omission of a kind of character string in the described three kinds of character strings of epimere.
This kind omits input arbitrarily two vital role:
One can omit character in simple or compound vowel of a Chinese syllable, tone, three kinds of character strings of stroke arbitrarily by user flexibility ground, reduces the input code length, for example: when the pinyin stroke " chuangdhd " of input " bed ", can import " cugd ", " cagd "
Its two, by any omission, can avoid cerebral mistake, simple or compound vowel of a Chinese syllable to combine mistake or simple or compound vowel of a Chinese syllable dialect mistake into syllables.
The Chinese character code storehouse of the input method that this is routine also includes dictionary, and all the Chinese character single character codes in words and phrases are formed a words and phrases coded strings array successively, and all words and phrases coded strings arrays constitute dictionary; When importing 3 words or the words and phrases more than 3 words on keyboard, behind the first Chinese character of input, remaining Chinese character can omit input arbitrarily, so that only need import a Chinese character more earlier.
For example: the entry of supposing to exist in the Chinese vocabulary bank " The Ministry of Education of the People's Republic of China ("MOE") ".
Only need import random default substring during input: " middle people teaches altogether ", " middle the Republic of China educates "
Random default substring decision method by dual carries out match retrieval to Chinese vocabulary bank, can import this entry, and, can also pass through the random default substring decision method, the coding of each word is carried out the random default input, further reduce code length.
Words and phrases in this routine dictionary also comprise the words and phrases that the filename of tape file name identifier is formed, when the filename of words and phrases of importing and dictionary mates, display screen provides the corresponding prompt symbol, represent that these words and phrases are filenames, after choosing this document name, computing machine reads out full content in the respective file as current input.
On keyboard during the input special symbol coded string: input specific code u earlier, import any one of classification sign indicating number, symbolic code again, and the character of classification sign indicating number and symbolic code can omit arbitrarily, so that only need import a character.For example: input special symbol " [", its pronunciation is " left square bracket ", belongs to the punctuation mark class, complete being encoded to " ubzfkh " only needs input " ufk " only to need get final product, omitted classification code b, z (left side), h (number); Perhaps input " uzk ", omitted classification code b, f (side), h (number); Or the like.
In the step of in the input method character string of input being retrieved, being shown and selects Chinese code database: also be that input string decomposes at first, it is decomposed into pinyin string, tone string, stroke string to the character string of importing; The concrete steps of decomposing are:
The a step: pinyin string, tone string, stroke string assignment are empty string,
The b step: get the phonetic initial character of input string and be connected to pinyin string,
The c step: if input string has been got the bundle decomposition that finishes, otherwise get the initial character of the residue character of input string,
If this character belongs to the rhythm alphabetic character set, this character is connected to pinyin string,
If this character belongs to the stroked character collection, this character is connected to the stroke string,
If this character belongs to the tone character collection, this character is connected to the tone string,
Repeat the process in this step, got, finish to decompose until input string.
Simple or compound vowel of a Chinese syllable in the pinyin string that obtains after the above decomposition step also must carry out a preface adjustment, and the concrete steps of position preface adjustment are:
The a step:, then n is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the n character in the simple or compound vowel of a Chinese syllable string;
The b step: in second step,, g is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the g character in the simple or compound vowel of a Chinese syllable string.
Decomposing the pinyin string, tone string, the stroke string that obtain after the preface adjustment of position also need retrieve again, go here and there by the retrieval that is linked in sequence into of pinyin string, tone string, stroke string earlier during retrieval, with this retrieval string Hanzi font library is retrieved then, the concrete steps of retrieval are as follows:
A step: encode Chinese characters for computer=a get encode Chinese characters for computer in the character library,
The b step: compose: first character of S=encode Chinese characters for computer, first character of P=retrieval string,
The c step: if S or P are the string end mark, change the d step,
If S=P then composes: the character late of S=encode Chinese characters for computer, the character late of P=retrieval string,
Otherwise compose: the character late of S=encode Chinese characters for computer, P is constant,
Repeat this step process;
D step: if P is the string end mark, think retrieval string and encode Chinese characters for computer string coupling, choose the Chinese character of encode Chinese characters for computer string correspondence, otherwise, abandon the Chinese character of this encode Chinese characters for computer,
The e step: repeat a step, finish up to the character library retrieval, and be shown on the screen Chinese character that retrieves selective.
Dictionary retrieval: after importing two or more Chinese character coding set string on the keyboard, computing machine is outside the current Chinese character string that will import is retrieved the Hanzi font library in the Chinese code database, also the Chinese character coding set of two or more that will import polyphone is connected into a retrieval string array, dictionary in the Chinese code database is retrieved, and the concrete steps of dictionary retrieval are as follows:
A step: words and phrases coded strings array=get a words and phrases coded strings array in the dictionary;
The b step: compose: first encode Chinese characters for computer of S2=words and phrases coded strings array,
First encode Chinese characters for computer of P2=retrieval string array;
The c step:, change the g step, otherwise carry out following operation if S2 or P2 are empty string:
The d step: compose: first character of S=S2, first character of P=P2,
The e step: if S or P are the string end mark, change the f step,
If S=P then composes: the character late of S=S2, the character late of P=P2,
Otherwise compose: the character late of S=S2, P is constant,
Repeat this step process;
The f step:, then compose if P is the string end mark: the next encode Chinese characters for computer of S2=words and phrases coded strings array,
The next one retrieval string of P2=retrieval string array,
Otherwise compose: the next encode Chinese characters for computer of S2=words and phrases coded strings array, P2 is constant,
Repeat the c step,
The g step: if P2 is an empty string, thinks and retrieve string array and words and phrases coded strings array coupling, choose the words and phrases of coded strings array correspondence; Otherwise abandon the words and phrases of this words and phrases coded strings array representation,
The h step: repeat a step, finish, be shown on the screen words and phrases that retrieve selective up to the words and phrases library searching.
Words and phrases in this routine dictionary also include the words and phrases of the filename of tape file identifier, therefore, when if the words and phrases that retrieve are the filename words and phrases of tape file name identifier, then with these words and phrases on display screen the time, also demonstrate the filename identifier simultaneously, if choose this words and phrases, then computing machine is read the full content of filename institute respective file as current input.For example:
Suppose to have existed in the words and phrases storehouse entry " fourth so-and-so contact method ※ ", ※ wherein is the filename identifier that this example adopted, and the present invention also can adopt other symbols that are of little use as file identifier when implementing certainly.Existing with " fourth so-and-so contact method " in the literature kit sub-directory of the input method catalogue in the computing machine is the file of filename, theing contents are as follows in the file:
Fourth so-and-so: scape light RDS science and technology share corporate president;
Cell-phone number: 13111111111;
Office telephone: 22222222;
Residence phone: 33333333;
CompanyAddress: PVG
After the method for the present invention that adopts is imported " fourth connection side " arbitrarily, entry " fourth so-and-so contact method ※ " will show
Be shown on the screen, select this entry after, what obtain is that " fourth so-and-so contact method " is the above content in the file of filename.Certainly file identification symbol and the filename identifier in the dictionary that shows also can be inconsistent.
The present invention has taken all factors into consideration the simplicity that Chinese character coding input method should possess, dirigibility, adaptability, principal elements such as sign indicating number is short, the repetition rate of coding is low, speed, dialect processing comprehensively, phonetic, stroke, tone are organically combined together, do not lose the phonetic standard, and design a kind of simple, flexibly, adaptability is strong, need not learn, comprise word, speech, sentence, the input of file plurality of kinds of contents, the Chinese pinyin tone and stroke combinatorial input method of complete function.It has not only inherited the simplicity of phonetic, has overcome the repetition rate of coding problem that the individual character page turning is searched, and also in the selection that reduces code length and phonetic, tone, stroked character input, provides dirigibility to greatest extent to the user.The proposition of literature kit input makes Chinese character coding input method after the sentence input method that Microsoft proposes, and is more further strengthened, brings the user and greatly facilitates.

Claims (15)

1, a kind of Chinese pinyin tone and stroke combinatorial input method, comprise the Chinese character code storehouse is pre-stored in the computing machine, character string on keyboard in the input Chinese character code storehouse, then with character string the step that Chinese code database is retrieved, shown and selects of input, it is characterized in that: described Chinese character code storehouse is the Hanzi font library that is made of encode Chinese characters for computer and special symbol coding, wherein:
Encode Chinese characters for computer=phonetic initial character+simple or compound vowel of a Chinese syllable+tone+stroke,
First character in phonetic initial character=Chinese phonetic alphabet,
Simple or compound vowel of a Chinese syllable in simple or compound vowel of a Chinese syllable=Chinese phonetic alphabet,
In five characters of five tone correspondences in tone=Chinese phonetic alphabet one,
Stroke=first stroke character+second stroked character+last stroked character, the Chinese character that the stroke less than is three is supplied three with last stroke, stroked character be point, horizontal, vertical, cast aside, in five characters of five stroke correspondences of folding one.
2, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 1 is characterized in that: the character set that described simple or compound vowel of a Chinese syllable, tone, the character that stroke adopted constitute is mutually disjointed.
3, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 2, it is characterized in that: five characters of five tone correspondences in the described Chinese phonetic alphabet are: y-1 sound, L-2 sound, c-3 sound, x-4 sound, w-are softly; Point in the stroke, five characters horizontal, vertical, that cast aside, roll over five stroke correspondences are: the d-point, and the h-horizontal stroke, s-is perpendicular, and p-casts aside, the z-folding.
4, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 1 is characterized in that: described special symbol coding=specific code+classification sign indicating number+symbolic code, wherein:
Specific code=u,
Among classification sign indicating number=b-punctuation mark, s-mathematic sign, x-Greek alphabet, r-set with Japanese alphabet or q-other symbol one,
The character string that the initial character of symbolic code=each syllable of symbol pronunciation is formed then constitutes with the set with Japanese alphabet phonetic symbol for the set with Japanese alphabet symbol.
5, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 3, it is characterized in that: described on keyboard during the Chinese character coding set string in the input Chinese character code storehouse, import the phonetic initial character of Chinese character earlier, carry out word selection or import one or more combination in three kinds of character strings of simple or compound vowel of a Chinese syllable, tone, stroke of corresponding Chinese character again, and do not have sequencing when simple or compound vowel of a Chinese syllable, tone, three kinds of character strings combinations of stroke during combination; And the character in any one character string in these three kinds of character strings can insert in another character string; When the input simple or compound vowel of a Chinese syllable was an, en, in, un, ang, eng, ing or ong, the input sequence of rhythm alphabetic character can change.
6, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 5, it is characterized in that: described on keyboard during the Chinese character string in the character string in the input Chinese character code storehouse, character in simple or compound vowel of a Chinese syllable, tone, the stroked character string can omit arbitrarily, so that only need import a character.
7, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 6, it is characterized in that, in the described step that the character string of input is retrieved, shown and selects Chinese code database: also be that input string decomposes at first, it is decomposed into pinyin string, tone string, stroke string to the character string of importing; The concrete steps of decomposing are:
The a step: pinyin string, tone string, stroke string assignment are empty string,
The b step: get the phonetic initial character of input string and be connected to pinyin string,
The c step: if input string has been got the bundle decomposition that finishes, otherwise get the initial character of the residue character of input string,
If this character belongs to the rhythm alphabetic character set, this character is connected to pinyin string,
If this character belongs to the stroked character collection, this character is connected to the stroke string,
If this character belongs to the tone character collection, this character is connected to the tone string,
Repeat the process in this step, got, finish to decompose until input string.
8, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 7 is characterized in that: to the simple or compound vowel of a Chinese syllable in the pinyin string that obtains after the described decomposition step, also must carry out a preface adjustment, the concrete steps of position preface adjustment are:
The a step:, then n is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the n character in the simple or compound vowel of a Chinese syllable string;
The b step: in second step,, g is adjusted to the last of simple or compound vowel of a Chinese syllable string if having the g character in the simple or compound vowel of a Chinese syllable string.
9, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 8, it is characterized in that: described pinyin string, tone string, stroke string are gone here and there by the retrieval that is linked in sequence into of pinyin string, tone string, stroke string, with this retrieval string Hanzi font library is retrieved then, the concrete steps of retrieval are as follows:
A step: encode Chinese characters for computer=a get encode Chinese characters for computer in the character library,
The b step: compose: first character of S=encode Chinese characters for computer, first character of P=retrieval string,
The c step: if S or P are the string end mark, change the d step,
If S=P then composes: the character late of S=encode Chinese characters for computer, the character late of P=retrieval string,
Otherwise compose: the character late of S=encode Chinese characters for computer, P is constant,
Repeat this step process;
D step: if P is the string end mark, think retrieval string and encode Chinese characters for computer string coupling, choose the Chinese character of encode Chinese characters for computer string correspondence, otherwise, abandon the Chinese character of this encode Chinese characters for computer,
The e step: repeat a step, finish up to the character library retrieval, and be shown on the screen Chinese character that retrieves selective.
10, according to claim 2,3,5 or 6 described a kind of Chinese pinyin tone and stroke combinatorial input methods, it is characterized in that: described Chinese character code storehouse also includes dictionary, article one, all the Chinese character single character codes in the words and phrases are formed a words and phrases coded strings array successively, and all words and phrases coded strings arrays constitute dictionary; When importing 3 words or the words and phrases more than 3 words on keyboard, behind the first Chinese character of input, remaining Chinese character can omit input arbitrarily, so that only need import a Chinese character more earlier.
11, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 10, it is characterized in that: after importing two or more Chinese character coding set string on the keyboard, computing machine is outside the current Chinese character string that will import is retrieved the Hanzi font library in the Chinese code database, also the Chinese character coding set of two or more that will import polyphone is connected into a retrieval string array, dictionary in the Chinese code database is retrieved, and the concrete steps of dictionary retrieval are as follows:
A step: words and phrases coded strings array=get a words and phrases coded strings array in the dictionary;
The b step: compose: first encode Chinese characters for computer of S2=words and phrases coded strings array,
First encode Chinese characters for computer of P2=retrieval string array;
The c step:, change the g step, otherwise carry out following operation if S2 or P2 are empty string:
The d step: compose: first character of S=S2, first character of P=P2,
The e step: if S or P are the string end mark, change the f step,
If S=P then composes: the character late of S=S2, the character late of P=P2,
Otherwise compose: the character late of S=S2, P is constant,
Repeat this step process;
The f step:, then compose if P is the string end mark: the next encode Chinese characters for computer of S2=words and phrases coded strings array,
The next one retrieval string of P2=retrieval string array,
Otherwise compose: the next encode Chinese characters for computer of S2=words and phrases coded strings array, P2 is constant,
Repeat the c step,
The g step: if P2 is an empty string, thinks and retrieve string array and words and phrases coded strings array coupling, choose the words and phrases of coded strings array correspondence; Otherwise abandon the words and phrases of this words and phrases coded strings array representation,
The h step: repeat a step, finish, be shown on the screen words and phrases that retrieve selective up to the words and phrases library searching.
12, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 10, it is characterized in that: the words and phrases in the dictionary in the described Chinese character code storehouse also comprise the words and phrases that the filename of tape file name identifier is formed, when the filename of words and phrases of importing and dictionary mates, display screen provides the corresponding prompt symbol, represent that these words and phrases are filenames, after choosing this document name, computing machine reads out full content in the respective file as current input.
13, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 12, it is characterized in that: after importing two or more Chinese character coding set string on the keyboard, computing machine is outside the current Chinese character string that will import is retrieved the Hanzi font library in the Chinese code database, also the Chinese character coding set of two or more that will import polyphone is connected into a retrieval string array, dictionary in the Chinese code database is retrieved, and the concrete steps of dictionary retrieval are as follows:
A step: words and phrases coded strings array=get a words and phrases coded strings array in the dictionary;
The b step: compose: first encode Chinese characters for computer of S2=words and phrases coded strings array,
First encode Chinese characters for computer of P2=retrieval string array;
The c step:, change the g step, otherwise carry out following operation if S2 or P2 are empty string:
The d step: compose: first character of S=S2, first character of P=P2,
The e step: if S or P are the string end mark, change the f step,
If S=P then composes: the character late of S=S2, the character late of P=P2,
Otherwise compose: the character late of S=S2, P is constant,
Repeat this step process;
The f step:, then compose if P is the string end mark: the next encode Chinese characters for computer of S2=words and phrases coded strings array,
The next one retrieval string of P2=retrieval string array,
Otherwise compose: the next encode Chinese characters for computer of S2=words and phrases coded strings array, P2 is constant,
Repeat the c step,
The g step: if P2 is an empty string, thinks and retrieve string array and words and phrases coded strings array coupling, choose the words and phrases of coded strings array correspondence; Otherwise abandon the words and phrases of this words and phrases coded strings array representation,
The h step: repeat a step, finish, be shown on the screen words and phrases that retrieve selective up to the words and phrases library searching; When if the words and phrases that retrieve are the filename words and phrases of tape file name identifier, then with these words and phrases on display screen the time, also demonstrate the filename identifier simultaneously, if choose this words and phrases, then computing machine is read the full content of filename institute respective file as current input.
14, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 4, it is characterized in that, on keyboard during the input special symbol coded string: input specific code u earlier, import any one of classification sign indicating number, symbolic code again, and the character of classification sign indicating number and symbolic code can omit arbitrarily, so that only need import a character.
15, a kind of Chinese pinyin tone and stroke combinatorial input method according to claim 14, it is characterized in that, behind input special symbol coded string on the keyboard, computing machine is formed a retrieval string with the symbol string of input, with this retrieval string Hanzi font library is retrieved, the concrete steps of retrieval are as follows:
The a step: the special symbol that special symbol is encoded=got in the character library is encoded,
The b step: compose: first character of S=special symbol coding, first character of P=retrieval string,
The c step: if S or P are the string end mark, change the d step,
If S=P then composes: the character late of S=special symbol coding, the character late of P=retrieval string,
Otherwise compose: the character late of S=special symbol coding, P is constant,
Repeat this step process;
The d step: if P retrieves string and special symbol coded strings coupling for going here and there end mark, thinking, choose the special symbol of special symbol coded strings correspondence, otherwise, abandon the special symbol that this special symbol is encoded,
The e step: repeat a step, the special symbol retrieval finishes in character library, and is shown on the screen special symbol that retrieves selective.
CNB2006100203995A 2006-03-03 2006-03-03 Chinese spelling, tone and stroke combined input method Expired - Fee Related CN100399245C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006100203995A CN100399245C (en) 2006-03-03 2006-03-03 Chinese spelling, tone and stroke combined input method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006100203995A CN100399245C (en) 2006-03-03 2006-03-03 Chinese spelling, tone and stroke combined input method

Publications (2)

Publication Number Publication Date
CN1838044A true CN1838044A (en) 2006-09-27
CN100399245C CN100399245C (en) 2008-07-02

Family

ID=37015458

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006100203995A Expired - Fee Related CN100399245C (en) 2006-03-03 2006-03-03 Chinese spelling, tone and stroke combined input method

Country Status (1)

Country Link
CN (1) CN100399245C (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103051971A (en) * 2012-12-26 2013-04-17 深圳创维数字技术股份有限公司 Input method and digital television terminal
CN110471540A (en) * 2019-08-27 2019-11-19 杨平 Pinyin stroke input method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1262473A (en) * 1999-02-02 2000-08-09 乔永军 Chinese-caracter input method by phonetic letters with numeral key pad
CN1276560A (en) * 1999-06-08 2000-12-13 施益平 Chinese phonetic-letter input method with telephone keypad configuration

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103051971A (en) * 2012-12-26 2013-04-17 深圳创维数字技术股份有限公司 Input method and digital television terminal
CN110471540A (en) * 2019-08-27 2019-11-19 杨平 Pinyin stroke input method

Also Published As

Publication number Publication date
CN100399245C (en) 2008-07-02

Similar Documents

Publication Publication Date Title
CN1026525C (en) Intellect five strokes double spelling Chinese ideograph code programme
CN1113305C (en) Language processing apparatus and method
CN1648828A (en) System and method for disambiguating phonetic input
CN101067766A (en) Method for cancelling character string in inputting method and word inputting system
CN1993692A (en) A character display system
CN101075262A (en) Method and system for inputting Chinese character by computer
CN101038508A (en) GB phoneticize input method
CN1737739A (en) Tibetan input method based on English keyboard
CN1838044A (en) Chinese spelling, tone and stroke combined input method
CN1186711C (en) Mongol input method
CN1121645C (en) Sound and shape word code Chinese character input method
CN1102768C (en) Chinese character phono configurational code input method for electronic computer
CN1187677C (en) Method for inputting Chinese holophrase into computers by using partial stroke
CN1257445C (en) Chinese-character 'Pronunciation-meaning code' input method
CN1246758C (en) Four-corner code Chinese character input method for computer and keyboard thereof
CN1584809A (en) Inputting method for Chinese code as phonetic Chinese
CN1260530A (en) Chinese character inputting method by shape and sound encode
CN1734404A (en) Phonetic code and recognition phonetic code, database technology, stroke code and numeric stroke code
CN1156744C (en) Chinese-character 'meta-root code' input method
CN1760818A (en) Chinese character input method convenient for selecting coincident codes rapidly
CN1093654C (en) Structural code Chinese character entering method and used general keyboard
CN1228565A (en) Computer file automatic error detection and error correction device and its method
CN1081773A (en) " many recursion associations " Chinese word encoding
CN1120408C (en) Chinese-character struture-pronunciation input method for computer
CN1527184A (en) Chinese character input method and keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080702

Termination date: 20110303