CN1376969A - 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters - Google Patents
'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters Download PDFInfo
- Publication number
- CN1376969A CN1376969A CN 02108826 CN02108826A CN1376969A CN 1376969 A CN1376969 A CN 1376969A CN 02108826 CN02108826 CN 02108826 CN 02108826 A CN02108826 A CN 02108826A CN 1376969 A CN1376969 A CN 1376969A
- Authority
- CN
- China
- Prior art keywords
- chinese
- character
- chinese character
- code
- parts
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Document Processing Apparatus (AREA)
Abstract
A "dual-separation Chinese character", a "dual-separation" input method and the combining fonts are disclosed. Said "dual-separation Chinese character" features that the Chinese character, its phonetic letter and input code are combined and the standard ASCII character set is used to record and transmit Chinese characters. Said combining fonts features that they are the normalized and personalized parts and are combined to form Chinese characters.
Description
The present invention is made up of with a kind of pair of division input method and a kind of Chinese character combined character a kind of two Chinese characters that divide, it utilizes existing Chinese resource and infotech, put into practice spelling of Chinese character at message area, and improve existing Chinese character information technology, belong to the reform of a writing system and Chinese character information technical field.
Between existing infotech and existing reform of a writing system practice, lack a kind of written form of the attribute that possesses skills of compatible and bag (being called for short " compatibility ").Be in particular in: (1) notation is many.Aspect individual's use, now there have been Chinese character, the Chinese phonetic alphabet and three kinds of notations of input coding for Chinese character.The Chinese character and the Chinese phonetic alphabet are not easy to information processing, in the reform of a writing system, will progressively move towards spelling of Chinese character.Input coding for Chinese character fails the reform of a writing system is combined with infotech, is free on outside the Chinese character and the Chinese phonetic alphabet, and is of a great variety.Aspect information processing, literal and input coding disunity, outer sign indicating number and ISN disunity.(2) man-machine not general.Existing Chinese notation is not easy to the people and machine uses jointly.Chinese character, pronunciation can not be represented well in font; The Chinese phonetic alphabet, spelling exists uncertain: input coding for Chinese character, no literal function.(3) illiteracy's inconvenience is used.Illiterate people, in face of existing Chinese notation, inconvenient typing and understanding information; Be not easy to utilize the existing information technology, carry out " self-service eliminating illiteracy " and study.The technical attributes of this literal is that with the fundamental difference of existing Chinese notation it singly is not the mark system of record Chinese, also should be the technical symbol system of transmission Chinese.Existing Chinese notation is not accomplished this point.
There is " three difficulties " in existing Chinese character, is not easy to information processing.Chinese character exists difficulty to read, be difficult to write and difficult note shortcomings such as (being called for short " three difficulties ").Chinese character " three difficulties " for a change, the existing practice are to give the Chinese character marking pronunciation and carry out simplified Chinese characters.On the Chinese character head, mark pronunciation, be not easy to input and composing; At Chinese character back mark pronunciation, what recognize earlier is Chinese character, is not easy to people's (or machine) identification; At Chinese character front mark pronunciation,, confeuse the parimary with secondary like some by existing custom; These mark, and all do not give the meaning of infotech aspect.Simplified Chinese characters, font still can not be represented pronunciation well.In existing information is handled, simplified Chinese characters fail to reduce the on the whole quantity of Chinese character, the continuous simplification of Chinese character is not easy to existing Chinese character information processing.How to utilize existing information technology " simplified Chinese characters ", need a kind of practical exploration form.
This explanation is made following agreement for sake of convenience.Regard known Chinese character sum as 60,000, Chinese characters in common use are regarded 7000 as, and all the other regard " non-common Chinese character " as.Regard the Chinese character that GB GB2312-80 (being called for short GB) includes as Chinese characters in common use (" simplified version "), first-level Chinese characters is wherein regarded " generally can recognize Chinese character " as.So-called " generally can recognize Chinese character " is meant in primary school's scope, is used for (about 3500) Chinese character of the Chinese phonetic alphabet teaching of literacy, is the Chinese character that general crowd can both memorize.With GB GB18030-2000 character set, be called for short GBK.With the double word phrase, regard 16800 (seeing indivedual openly code tables) or 28600 (seeing several open code tables) as." transmission Chinese " in this explanation, is meant that input, output, transmission or the machine intimate to Chinese information handled.
The existing Chinese phonetic alphabet can't be used for writing down and transmission Chinese.The existing Chinese phonetic alphabet has only " Scheme for the Chinese Phonetic Alphabet " (being called for short " scheme ") to have authority, legitimacy.The shortcoming of existing " scheme " is that the words spelling exists uncertainty, the corresponding a plurality of Chinese characters of promptly a kind of spelling with Chinese-character word-phrase.Be in particular in (1) unisonance words inconvenience differentiation, (2) ambiguity words is difficult for difference, (3) inconvenient dialect spelling, and aspects such as classical Chinese are spelt in (4) inconvenience.With the unisonance words is example.Syllable mark accent considered in 416 of the basic syllables of the Chinese phonetic alphabet (according to " Xinhua dictionary "), and 1282 kinds of pronunciations can be arranged.In the GBK scope, each basic syllable on average has 50 phonetically similar words, and every kind of pronunciation on average has 16 phonetically similar words.In 16800 double word phrases, 18% phrase spelling (not marking tone) existence uncertainty is arranged.In " Xinhua dictionary ", basic syllable " ji " has 116 of phonetically similar words, and the phonetically similar word of reading " ji4 " has 40; Basic syllable " yi " has 125 of phonetically similar words, 69 of the phonetically similar words of thought " yi4 ".Other pattern of the Chinese phonetic alphabet, also has the shortcoming of " the words spelling exists uncertain with Chinese-character word-phrase " as " Chinese phonetic script " etc.How to utilize the existing information technology that the Chinese phonetic alphabet is developed continuously and be the spelling of Chinese character literal, also need a kind of practical exploration form.
Existing ASCII character character can not directly be used for writing down and transmission Chinese.Literal in information processing, adopts character code to represent.General in the world character code is an ASCII character.This code, with 1 character of 1 byte representation, 128 kinds of alphabets are called standard A SCII sign indicating number again.The character keys of universal keyboard is corresponding with ASCII character character commonly used.This code gives information processing much convenient.But this standard A SCII code character can not directly be used for writing down and transmission Chinese.Certainly, just more can not use standard A SCII code character and Chinese character (or its shape Yi Tezheng), or Hanzi component (or its shape Yi Tezheng) writes down and transmits Chinese.
Also there is shortcoming in existing kanji code, needs to improve: (1) presses word code, and character is many, and matrix magazine is huge.In the GBK character set, include 2.7 ten thousand of Chinese characters, needed huge matrix magazine support.(2) the code kind is many.Chinese character information processing needs multiple codes such as input code, internal code and transmission code.Wherein input code divides many types again.(3) fail to represent whole Chinese characters.Chinese character " word does not have fixed number ".To not including Chinese character, " inconvenience " is handled.(4) the incompatibility reform of a writing system needs.For using existing Chinese character information technology, before 20 years, just the someone mentions, " existing Chinese character just can not constantly be simplified as before dribs and drabs ".The reform of a writing system is the historical process of continuous development and change.Existing encode Chinese characters for computer can not in time reflect the development and change of this historical process.
Existing Chinese character input method, code table is compiled longer and longer, and corpus is done bigger and bigger; Software function is more and more, makes individual's dynamic role fewer and feweri; The personalization that is unfavorable for Chinese is expressed.For the individual, there is a large amount of redundancy encodings in " code table is fixed "; Extended immobilization is selected speech word, particularly teenager in certain scope, virtually, will cause language to ossify, the individual character forfeiture.The individualized feature that Chinese character uses is a kind of language characteristic of Chinese.Select for use at words, vocabulary accumulation and commonly use aspect such as a formula, people have the characteristics of personality of oneself.The words that the individual uses always is seldom commonly used vocabulary still less.People need be fit to the input method of own characteristics of personality.A good input method for the individual, should be: simple, easily learn, and bear in memory; One yard of one word does not have and repeats; Do not have individual no words: " translation " amount of spell shape code fetch is little, brain and eyes indefatigability; Do not influence the fluency of thinking.Reach this requirement, existing Chinese character input method is still waiting to improve.
Existing input coding for Chinese character, no literal function.Input coding for Chinese character is to aim at Chinese character information processing and the notation worked out, and is of a great variety.The sound sign indicating number is pressed Chinese-character pronunciation coding, and repeated code is many, and is poor with the correspondence of Chinese character, can not use as literal.Font code is pressed Chinese character coding structure, and repeated code is few, and is good with the correspondence of Chinese character, but that Chinese character splits is meticulous, is not easy to " see sign indicating number know word ", and do not have pronunciation, can not use as literal.Phonological and calligraphical syn thesize coding, combine both strong points, Chinese-character pronunciation is arranged, the shape justice feature description of Chinese character is arranged, good with the correspondence of Chinese character, but because of being conceived to the coding input of Chinese character, pronunciation does not partly indicate, be not easy to man-machine reading and word segmentation processing, it is meticulous that Chinese character splits, and is not easy to " see sign indicating number know word ", still can not use as literal.
Existing phonological and calligraphical syn thesize coding is representative with the natural code.It is input as the master with the Two bors d's oeuveres word.Its single character code, 5 yards of all-key code lengths, form is: sound sign indicating number (initial consonant+simple or compound vowel of a Chinese syllable)+font code (adopted portion parts+parts 2+ parts 3).The code of most parts is close with its sounding.But aspect single character code, also have the total shortcoming of font code: it is meticulous that (1) Chinese character splits, and spell shape code fetch amount is big.Character formation component (about 150) is selected for use less, and it is meticulous that Chinese character is split, and spell shape code fetch " translation " amount is bigger.(2) part codes still has certain memory capacitance.The above Chinese character of (3) three parts, structure representation is incomplete.(4) same coding fails to be used for multiple (comprising standard and numeral etc.) keyboard.
Also there is shortcoming in existing Chinese character font: (1) quantity is big.1 Chinese character is arranged, just need 1 type matrix.(2) fail to represent all Chinese characters.Do not include Chinese character, do not have type matrix.(3) Chinese character font of newly making lacks standardization.(4) type matrix " everybody's one ", no individuality formation.
The purpose of this invention is to provide a kind of two Chinese character and a kind of pair of division input method and a kind of combined character of dividing, (1) is that the infotech and the reform of a writing system are put into practice, and a kind of written form of the compatible attribute that possesses skills is provided; (2) progressively solve Chinese character " three difficulties ", make Chinese character simplifiedly, finish at literal self; (3) overcome the shortcoming of the Chinese phonetic alphabet, make the words spelling have determinacy; (4) realize writing down and transmitting Chinese with standard A SCII code character (or with standard A SCII code character and Chinese character or its shape Yi Tezheng, or with Hanzi component or its shape Yi Tezheng horizontally-arranged); (5) improve kanji code, reduce its quantity (or kind), realize the coded representation of all Chinese characters, adapt to reform of a writing system needs; (6) improve Chinese character input method, the personalization that is beneficial to Chinese is expressed; (7) make input coding for Chinese character possess literal function; (8) improve phonological and calligraphical syn thesize coding, make Chinese character split maximization, reduce the difficulty of spell shape code fetch; The realization part codes need not be remembered; The Hanzi structure comprehensive representation is in order to the teaching of literacy; With same coding, be applied to multiple (comprising standard and numeral etc.) keyboard; (9) simplify type matrix, realize that the type matrix of all Chinese characters is represented, make the personalized and new coinage mould standardization of type matrix style.
The object of the present invention is achieved like this:
(1) is the infotech and reform of a writing system practice, a kind of written form of the compatible attribute that possesses skills is provided.Two minutes Chinese characters combine Chinese character, the Chinese phonetic alphabet and input coding for Chinese character together.It is the combination of the Chinese phonetic alphabet and Chinese character (or its shape Yi Tezheng) on form, or is the combination of Hanzi component (or its shape Yi Tezheng), and " input coding for Chinese character " is a kind of pattern of self.It is mutually comprehensive with the function of Chinese character, the Chinese phonetic alphabet and input coding for Chinese character on function.Adopt two full word symbol patterns that divide Chinese character, can realize with standard A SCII character record and transmission Chinese.Adopt two Chinese characters that divide, can simplify the Chinese notation, accomplish general man-machine, utilize existing information technological improvement literal, can be used for information processing, can be used for reform of a writing system practice again, realize compatible in the literal use of infotech and the reform of a writing system.Illiterate person can " contrast " two minutes Chinese-character texts, and " according to sample " entry information is not imported Chinese pronunciation and shape Yi Tezheng " conscious ", under prior art helps, and understanding information; Can also carry out self-service eliminating illiteracy, promptly utilize the existing information technology, learn " conscious " and use two Chinese characters that divide, learn other cultural knowledge.
(2) progressively solve Chinese character " three difficulties ", make Chinese character simplifiedly, finish at literal self.The Chinese character difficulty is read, and just with two pronunciation parts of dividing Chinese character, gives each Chinese character marking pronunciation.Chinese character is difficult to write, difficulty is remembered, just at the shape justice characteristic of two minutes Chinese characters, describes the body of Chinese character with a spot of shape justice feature that is easy to memorize.With these shapes Yi Tezheng horizontally-arranged, both simplified Hanzi structure, simplified writing of Chinese character again.Shape justice feature description when two minutes Chinese characters progressively carries out the transition to full word symbol pattern, just can progressively solve Chinese character " three difficulties ".Two minutes Chinese characters are represented to help characterization, contoured, the symbolism of Hanzi component than the Chinese character of multi-part composition with less shape justice feature: make code with parts " pronunciation ", realize " the unisonance merger " of parts, can reduce the quantity of part codes; Horizontally-arranged with parts is write pattern, can simplify and unify the structure type of Chinese character; Utilize self form evolution rule, can gradually reduce the shape justice feature description of Chinese character; Not increasing existing Chinese character total amount and not influencing under the prerequisite of use,,, play the effect of simplified Chinese characters by self form evolution in literal inside.Divert from one use to another the existing information technology, help the realization of this purpose.As, divert from one use to another " screen prompt ", " code table is counter to be looked into " reaches prior aries such as " word frequency statistics ", make the shape justice feature description of words, existing meticulous all-key comprehensively has practical succinct brevity code again; Brevity code is described, in concrete linguistic context, do not produced the fork meaning; Accomplish that " word source " (incoming road) of simplifying Chinese characters is clear, " simplification " practicality.
(3) overcome the shortcoming of the Chinese phonetic alphabet, make the words spelling have determinacy.Utilize two shape justice feature descriptions of dividing Chinese characters, the font and the meaning of word of Chinese character is described, make the Chinese character of same pronunciation, have different separately shape justice feature descriptions; Again the pronunciation part is combined with shape justice characteristic, just can realize that the words spelling has determinacy with all Chinese-character word-phrases.The determinacy of words spelling provides precondition for the Chinese phonetic alphabet moves towards spelling of Chinese character, also provides convenience for information processing.Simultaneously, can utilize existing Chinese character information technology, as " priority of high frequency ", " use and shift to an earlier date " and " input prompt " etc., spelling of Chinese character put into practice induce, optimization and standard.
(4) realize writing down and transmitting Chinese with standard A SCII code character (or with standard A SCII code character and Chinese character or its shape Yi Tezheng, or with Hanzi component or its shape Yi Tezheng horizontally-arranged).Utilizing two Chinese characters that divide, is the Character Style with its shape justice characteristic spelling (or conversion), just can represent all Chinese characters with standard A SCII code character.Realize just putting into practice spelling of Chinese character in areas of information technology with standard A SCII code character record and transmission Chinese.As the mode that adopts standard A SCII code character to combine with Chinese character or its shape Yi Tezheng, or adopt Hanzi component or its shape Yi Tezheng horizontally-arranged form, and represent all Chinese characters, can obtain two other Apply Styleses that divide Chinese character, realize the diversity of Chinese record and transmission, to satisfy the needs of the reform of a writing system.
(5) improve kanji code, reduce its quantity (or kind), realize the coded representation of all Chinese characters, adapt to reform of a writing system needs.1. adopt the full words symbol patterns of two branch Chinese characters,, will simplify numerous Chinese character " input code ", " input code ", " internal code " unification are standard A SCII sign indicating number, can reduce the kind and the quantity of kanji code with standard A SCII code character record and transmission Chinese.If be output as standard A SCII code character, the matrix magazine capacity can be done very for a short time.If be output as two other patterns that divide Chinese character, the content of internal code, matrix magazine Chinese character part may be defined as encoding Chinese characters, or Chinese characters in common use and parts, or Hanzi component etc.The two kinds of definition in back can reduce the kanji code quantity of (containing type matrix).Output can be converted to patterns (needing corresponding code table and matrix magazine support) such as Chinese character, two minutes Chinese characters (parts horizontally-arranged form) and synthetic Chinese character.2. adopt two other patterns of Chinese character that divide to write down Chinese, can reduce the quantity of input code, the content of internal code, matrix magazine Chinese character part may be defined as encoding Chinese characters, or Chinese characters in common use and parts, or Hanzi component etc.The two kinds of definition in back can reduce the kanji code quantity of (containing type matrix).Output can be patterns such as Chinese character, two minutes Chinese characters (parts horizontally-arranged form) and synthetic Chinese character.3. Chinese character is synthetic by parts, and two minutes Chinese characters have the determinacy of words spelling, for not including Chinese character, also can adopt above-mentioned two kinds of patterns to represent, realize the coded representation of all Chinese characters.Its output may be defined as patterns such as Chinese character, two minutes Chinese characters (comprising parts horizontally-arranged form) and synthetic Chinese character.(4) adopt two Chinese characters that divide, pronunciation with the spelling mark Chinese character of standard, the shape Yi Tezheng of Chinese character is described with character (or character and parts) coding of standard, give birth to the encode Chinese characters for computer of standard, the simplification of Chinese character and the interpolation of word newly, with not being subjected to the restriction of existing coded system, can constantly absorb the spelling of Chinese character achievement, help reform of a writing system practice.
(6) improve Chinese character input method, the personalization that is beneficial to Chinese is expressed.Utilize two determinacy of dividing the Chinese-character word-phrase spelling, realize individual character " one yard of a word ", no repeated code; Select Chinese word coding, without repeated code.Utilize two diversity of dividing Chinese record and transmission Chinese, can select the language record pattern of oneself liking; Utilize the diversity of two division input method coded systems, coded format is self-defined, can select the literal input pattern of oneself liking; Adopt the method for " type selecting, with the sign indicating number word selection " as required, the user can select the type of coding for use according to oneself needs: oneself to select sign indicating number that yard type provides for use, arrange the words of oneself commonly using; When selecting text style, input mode, can also define the way of output of oneself liking.Learn a kind of input method, can on multiple keyboards such as QWERTY keyboard and numeric keypad, use.
(7) make input coding for Chinese character possess literal function.The input coding of two minutes Chinese characters is two full word symbol patterns that divide Chinese character.It has the tone and shape of reading justice characteristic, or only tangible adopted characteristic.It both can be used as existing Chinese character and two input coding that divides Chinese character self, can be used as the literal of record Chinese again.Pronunciation and shape Yi Tezheng describe respectively, are convenient to the man-machine reading and the word segmentation processing of Chinese.
(8) improve phonological and calligraphical syn thesize coding, make Chinese character split maximization, reduce the difficulty of spell shape code fetch; The realization part codes need not be remembered; The Hanzi structure comprehensive representation is in order to the teaching of literacy; With same coding, be applied to multiple (comprising standard and numeral etc.) keyboard.Two sub-units with the coded representation of its pronunciation, know that pronunciation just knows code.With its stroke coded representation of writing, see that stroke just knows code; Pronunciation code and stroke code refer to object flag on keyboard, do not need the memory." Chinese character two minutes ", a Chinese character has only " selecting part " and " remainder ", has realized that Chinese character splits the comprehensive representation of maximization and Hanzi structure, has reduced the difficulty of spell shape code fetch and has helped the teaching of literacy.Because the encode Chinese characters for computer available characters represents that also available digital is represented, makes same encoding scheme, can be applied to multiple (comprising standard and numeral etc.) keyboard.
(9) simplify type matrix, realize that the type matrix of all Chinese characters is represented, make new coinage mould standardization and the personalization of type matrix style.1. Chinese character output (comprising demonstration, printing etc.) is two Chinese character (parts horizontally-arranged) patterns that divide, with a spot of parts type matrix combination, exports all Chinese characters, can simplify type matrix, its Chinese character part can a reserved unit type matrix.2. adopt combined character, can simplify type matrix, its Chinese character part reserved unit type matrix realizes that the type matrix of all Chinese characters is represented, is output as synthetic Chinese character.Chinese character is synthetic by parts.The parts type matrix of code requirement, generated data according to the rules, the Chinese character font of generation standard will be reduced the number of the content of matrix magazine, realizes that the type matrix of all Chinese characters is represented, and can realize newly making the Chinese character font standardization.Generated data can be to press the parts data that parts are described separately, also can be by the whole structured data of describing of textural classification, gives man-machine application with convenient.Baroque parts type matrix can be synthetic with parts type matrix simple in structure.Utilize personalized parts type matrix,, generate personalized combined character, be output as personalized synthetic Chinese character according to the generated data of definition.
Compare prior art, the present invention has following advantage:
1. two minutes Chinese characters combine Chinese character, the Chinese phonetic alphabet and three kinds of notations of input coding for Chinese character together, have simplified the notation of existing Chinese, help saving social resources; The creativeness of two minutes Chinese characters is, makes full use of existing resource, makes the literal attribute that possesses skills, and makes input coding have word attribute; Help self-service study (or eliminating illiteracy).
2. two minutes Chinese characters for Chinese character simplified, provide a technology to realize approach.It makes Chinese character simplified, develops in the literal self structure, does not increase new word.The shape justice feature description of Chinese character, in the spelling of Chinese character process, brief gradually.Particularly two parts horizontally-arranged forms that divide Chinese character make Hanzi structure, and are unified at the very start for left and right sides horizontally-arranged, help writing and memorize.In areas of information technology, put into practice Chinese character simplified.
3. two minutes Chinese characters make the words spelling of the Chinese phonetic alphabet have determinacy; Move towards spelling of Chinese character from the Chinese phonetic alphabet, can develop by the form of self and finish, realize the spelling of Chinese character continuous transition.For the Chinese phonetic alphabet moves towards spelling of Chinese character, provide a technological approaches.
4. the pronunciation of Chinese character can be represented in two minutes Chinese characters, overcomes the shortcoming of the existing Chinese phonetic alphabet, can inherit the strong point that Chinese-character pronunciation shape justice combines again; Both can read, can appreciate again; Realize man-machine sharedly, character learning person and illiterate person are shared.
5. use two Chinese characters that divide, can realize to improve the condition that has Chinese character information processing now with standard A SCII code character (or standard A SCII code character and Chinese character or its shape justice characteristics combination, or Hanzi component or its shape justice characteristics combination) record and transmission Chinese.
6. adopt two Chinese characters that divide, will reduce the quantity (or kind) of kanji code, realize the coded representation of all Chinese characters, make input coding for Chinese character have literal function, Chinese character simplified and new coinage speech will not be subjected to the restriction of existing coded system, help the reform of a writing system.
7. improved existing phonological and calligraphical syn thesize coding.Two division input methods adopt " Chinese character two minutes ", make Chinese character split maximization, have reduced the difficulty of spell shape code fetch; The realization part codes need not be remembered; The Hanzi structure comprehensive representation helps the teaching of literacy; Same coding can be used for multiple (comprising standard and numeral etc.) keyboard.
8. simplify type matrix, realize that the type matrix of all Chinese characters is represented.Adopt two Chinese characters (parts horizontally-arranged form) or Chinese character combined characters of dividing, can simplify to having only the Hanzi component type matrix having Chinese character font now, and can represent all Chinese characters.Particularly adopt the Chinese character combined character, all Chinese characters (comprising new word) can be expressed as the pattern of synthetic Chinese character.The popularization and application of synthetic Chinese character will promote the improvement of Chinese operating system.
9. using two Chinese characters that divide, is rationally diverting from one use to another social resources.In Chinese character information processing, use two Chinese characters that divide, help the record and the transmission of Chinese character information.The input of existing Chinese character is manyed one formality than the phonetic literal, will learn coding before promptly going up machine, wants continuous " translation " to encode behind the last machine.On the basis that the Chinese character memorize is difficult for, must learn cover (or a few cover) abstract code sign.As use these energy, and learn and use two Chinese characters that divide, put into practice spelling of Chinese character, be rationally diverting from one use to another to social resources.
10. two minutes Chinese characters have multiple application form by self form evolution, can adapt to the multiple needs of spelling of Chinese character process, satisfy the real needs of different crowd.
11. adopt the type matrix synthetic method, generate synthetic Chinese character, be the Chinese character information technology, increased new Chinese character output pattern, help the improvement of existing operating system.Both can realize the synthetic standardization of type matrix, also can realize individual type matrix personalization.
12. the input method of two minutes Chinese characters is common to (comprising standard and numeral etc.) multiple keyboard, can save intellectual resources.
13. the improvement Chinese character input method, the personalization that helps Chinese is expressed.One yard of one word selects Chinese word coding, makes Chinese character input " returning Piao returns very ", saves social resources.Personalized text style, personalized input mode, personalized input code table, the personalized way of output, the personalization that helps Chinese is expressed.
Below the invention will be further described.
One, two minutes Chinese characters
Two minutes Chinese characters are the information-based transition literal (proposed projects) of a kind of spelling of Chinese character.On body, it is the combination of the Chinese phonetic alphabet and Chinese character (or its shape Yi Tezheng), or is the combination of Hanzi component (or its shape Yi Tezheng); On function, it combines Chinese character, the Chinese phonetic alphabet and input coding for Chinese character together: on using, it has the diversity of spelling pattern, to adapt to the spelling of Chinese character need of practice; Technically, it is infotech and reform of a writing system practice, and a kind of written form of the compatible attribute that possesses skills is provided.As Chinese character " tree ", its two Chinese characters that divide, have 1. shu4 (tree), 2. shu4 ` set (tree), 3. shu4 ` wood to (tree), 4. shu4 ` mu-dui (tree), 5. shu4 ` wood (tree) ..., 6. ` wood is to multiple patterns such as (trees).These patterns can both be used for Chinese character information processing.So-called " two branch ", one is meant that it can have two parts of the tone and shape of reading Yi Tezheng, two are meant that the main method that it describes shape Yi Tezheng is " Chinese character two minutes ".The title of " two minutes Chinese characters " can be upgraded in the spelling of Chinese character practice.Its principal character is: (1) has pronunciation part and shape justice characteristic, (2) or only tangible adopted characteristic; (3) the words spelling has determinacy with all Chinese characters; (4) adopt standard A SCII code character record and transmission Chinese; (5) or with standard A SCII code character combine with Chinese character (or its shape Yi Tezheng) record and the transmission Chinese; (6) or with Hanzi component (or its shape Yi Tezheng) horizontally-arranged written record and the transmission Chinese; (7) infotech is combined with reform of a writing system practice.Two minutes Chinese characters, as the information-based experimental tool of spelling of Chinese character, it has: (1) opening.On form and content, can absorb, can sublate again.Its each part (comprising list separator) can be accepted or rejected according to putting into practice the needs definition.(2) determinacy.The words spelling, corresponding mutually with Chinese and Chinese character, have the relation of determining.(3) dirigibility.Can and use the different of object according to the application scenario, select (or generating automatically) different practical pattern for use.(4) stability.With certain a part of standardization, as the prerequisite of flexible Application.(5) technical.Spelling of Chinese character is combined with the Chinese character information technology, give technical attributes the Chinese notation.
Two minutes Chinese characters have pronunciation part and shape justice characteristic.The pronunciation part is described Chinese pronunciation, reads for people's (or machine).Shape justice characteristic, the distinguishing characteristics of description and unisonance words (using where necessary), help others (or machine) understood, for people's appreciation.
" the pronunciation part " of two minutes Chinese characters is the direct application of spelling of Chinese character standard.Current, be exactly regulation according to " Scheme for the Chinese Phonetic Alphabet " and " basic principles for Chinese phonetic alphabet ", spelling words and record Chinese.Syllable is spelt, and can adopt spelling, Two bors d's oeuveres and other pattern of the Chinese phonetic alphabet.Spelling is the standard pattern of the Chinese phonetic alphabet.Each letter writes out in the syllable.Two bors d's oeuveres is represented initial consonant, simple or compound vowel of a Chinese syllable or letter (or its combination) with 1 alphanumeric codes, 1 syllable, and maximum 2 letters are simplification patterns of spelling.Other pattern of syllable spelling comprises existing simplicity, or the novel type that may occur from now on.Tone is marked on basic syllable back, with numeral or use letter representation.Here, with numeral " 1,2,3,4 " the expression Chinese four tones of standard Chinese pronunciation.Tone mark in actual applications, also can be omitted.The syllable spelling reduces symbol as far as possible and uses, and adopts basic syllable pattern as far as possible.The Chinese phonetic alphabet has only " Scheme for the Chinese Phonetic Alphabet " to have authority and legitimacy.Adopt the Chinese-character pronunciation of other pattern mark of the Chinese phonetic alphabet, can be converted to the pattern of " scheme "; As wouldn't changing, can be within the specific limits, as the special interim form of two minutes Chinese characters; Further narration is not done in this explanation.The Chinese character of non-Chinese pronunciation as " Japan and Korea S's Chinese character ", can mark local standard pronunciation with the Chinese phonetic alphabet, or not mark pronunciation, only adopts shape justice feature description, uses within the specific limits.
" the shape justice characteristic " of two minutes Chinese characters is to the traditional succession of Chinese character and develop.The effect of shape justice characteristic mainly is to describe in the Chinese unisonance words at font and literalness distinguishing characteristics.These features, form, structure type, parts (or stroke) combination that shows Chinese character and words such as get in touch at the aspect.Describing these features, find out the difference of words and other unisonance words exactly, is to realize that the words spelling has deterministic basic method with all Chinese-character word-phrases.Shape justice characteristic has opening, and promptly the describing method of feature and select for use quantity unrestricted can determine in use have very big dirigibility according to actual needs.This dirigibility is again based on certain standard.The specification description of shape Yi Tezheng, based on font style characteristic, the meaning of word is characterized as auxilliary, quantitatively makes every effort to " number " to the greatest extent in feature.Shape Yi Tezheng selects for use flexibly, is prerequisite ambiguity not occur.Reduce the quantity of feature description as far as possible, make two Chinese characters that divide make great efforts to draw close to the simple Chinese phonetic alphabet.Shape Yi Tezheng selects for use flexibly, also can be by means of the existing information technology.A kind of simple and easy way is: 1. two Chinese characters that divide imported in record automatically; 2. various features is deposited in database table; 3. the two branch Chinese characters that will import and existing record contrast; 4. identical as occurring, provide prompting, and specification description is provided; 5. after confirming, automatic revisal is two branch Chinese characters of (or before) input now.Utilize shape justice characteristic, can also describe some information character phenomenons.As, " xiao ` " :) ", expression " laughing at ".Describe the method for shape Yi Tezheng, mainly contain: word etc. got in Chinese character two minutes, stroke code and connection speech.
Chinese character two minutes is to describe Chinese character according to Chinese character in the feature aspect " font ".It is the Hanzi features describing method, is again the Chinese character method for splitting.It is divided into two parts to Chinese character from the aspects such as version, parts (stroke) combination, literal meaning, logical relation and aesthetic habit of Chinese character.Wherein, version and parts (stroke) combination is two minutes a main thought of Chinese character.With Chinese character two minutes, that selects earlier was " selecting parts " (or be called " selecting part ", be called for short " selecting "), remaining " remainder " (or be called " remainder ", be called for short " residue ") that just be called.The order of choosing is: 1. choose by writing priority (or order of strokes observed in calligraphy); 2. or by the rule of " the one-tenth word is preferential ", " getting big preferential " choose.Two minutes general rule of Chinese character comprises: 1. from can divide, 2. link to each other can divide, 3. become word preferentially, 4. get greatly preferentially and 5. meaning connect and be regardless of.Its rule definition is: 1. from dividing, be meant that structurally there are several parts that are separated from each other in Chinese character, and just can two minutes; 2. link to each other and can divide, be meant that Chinese character structurally can be regarded as by several parts to be formed by connecting, just can two minutes: 3. become word preferential, be meant two schemes that are divided into character formation component of paying the utmost attention to; 4. get greatly preferential, be meant and pay the utmost attention to two parts that are divided into the structure maximum; 5. meaning even is regardless of, and is meant the several discrete stroke that meaning of a passage links to each other, and regards an integral body as, as
" Zhao " do not split.These rules need to take all factors into consideration in concrete the application." selecting " that Chinese character obtained in two minutes and " residue " two parts have pronunciation, with its pronunciation as code, no pronunciation, with its stroke writing as code (or represent with other shape justice feature code).Chinese character was given an example in two minutes: 1. " sweet ", regard " tongue " and " sweet " as; " graupel " regards " rain " and " loosing " as; Pressed structure type two minutes; " sweet ", left and right sides structure; " graupel ", up-down structure.2. " not ", regard as " bow " with
" well ", regard as " two " with
: pressed unit construction two minutes.③ “ Qe ", regard " Pie " and " " as; " ten " regard " one " and " Shu " as; Pressed stroke combination two minutes.4. " in vain ", regard " Pie " and " day " as; " rich " regards " three " and " Shu " as; Pressed stroke and unit construction two minutes.5. micro-, regard as " Chi " with
: " famine ", regard " Lv " Yu “ Dumplings as "; Pressed literal meaning two minutes;
“ Dumplings " regard word as.6. " second ", " second (stroke is arranged) " and " " (no stroke) regarded as in only word "; Pressed logical relation two minutes; The no stroke of available letter " w " expression.7. " mountain " regards " Shu " (selecting part) and " Qian " (remainder) as; Pressed logical relation two minutes; Having or not of stroke, selecting and remaining of parts is a kind of logical relation.8. " rank of nobility " regarded as
With
" device " regarded as
" dog "; Pressed aesthetic habit two minutes; " rank of nobility " is divided into impartial two parts up and down; " device " is without separating into " crying " and " Song ".Chinese character two minutes can overcome the shortcoming that has single-character splitting now.As the micro-word, be example with the natural code, press the parts correspondence, can be split as 5 parts such as " Chi, mountain,, several, The-Fan ", split thinner; Coding is got 3 yards at most, loses 2 parts, and structure representation is incomplete; Part codes needs memory.Chinese character two minutes, with the micro-word be split as " Chi " and
, two parts have realized that Chinese character splits maximization and Hanzi structure comprehensive representation.In Chinese character two minutes, resemble parts such as " Chi ", " mountain ", " ", " several ", " The-Fan ", can both be with pronunciation as code.If necessary, Chinese character can carry out in two minutes step by step.As " good fortune ", can be split as " Woo " Yu “ Bi earlier " two parts; " Bi ", removable again being divided into
With " field " two parts;
, can also be split as " one " and " mouth " two parts; Whole basic elements of character of " good fortune " are: " Woo,, mouth, field ".Chinese character adopted " the one-tenth word is preferential " rule in two minutes, and purpose is that parts can be recognized as far as possible, adopt the pronunciation of parts to make code as far as possible, are beneficial to spelling of Chinese character after Chinese character was split.But the character learning level (how much), there is individual difference.In concrete the application, also to consider, be split as " generally can recognize Chinese character " (first-level Chinese characters among the approximate GB) as far as possible.Resemble " separating ", just do not belong to " generally can recognize Chinese character ".This character formation component has nearly 200 (in the GB scopes) in " Chinese character two minutes ", account for 10% of total number of parts.Can handle like this: 1. keep its pronunciation code, confess that the knowledgeable uses; 2. provide stroke code (or other shape justice feature code), for not knower's use; 3. or to it continue to split, take off the one-level parts as code; 4. in two division input methods, can adopt prior aries such as " Chinese character (and parts) candidates " to solve.In " Chinese character two minutes ", " with reading " phenomenon appears sometimes, and promptly the syllable of parts (or basic syllable) is identical with the syllable (or basic syllable) of Chinese character.For " with reading ", can do following processing: 1. keep the pronunciation code of existing parts, it was carried out next stage two minutes, increase by 1 fresh code again; 2. cast out the pronunciation code of existing parts, it was carried out next stage two minutes, extract 1 fresh code.The pronunciation code of two sub-units can be expressed as the spelling pattern of the Chinese phonetic alphabet, or the Two bors d's oeuveres pattern, or other pattern.To the polyphone in the character formation component, get its " general pronunciation " as code.So-called " general pronunciation " is exactly in the pronunciation of polyphone, current pronunciation: be embodied in, and the frequency of utilization height, group speech record is many; This explanation will be labeled as the pronunciation conduct " general pronunciation " of " " temporarily in " Xinhua dictionary ".The parts of two minutes Chinese characters can be called two sub-units (or two sub-units).Two sub-units are made code with pronunciation, can realize " the unisonance merger " of parts, help Chinese character simplified.
The stroke code is described Chinese character according to " feature " of the stroke writing of Chinese character or parts.It is combined by basic code and condition code.Here enumerate two kinds.One, little stroke code (being called for short " stroke code ") is represented stroke " feature " with 10 codes.(1) " digital code " pattern: 1. basic stroke is divided into " cast aside anyhow to press down to roll over and turn " six kinds; Wherein, for wieling the pen to clockwise commentaries on classics, " turning " is for wieling the pen to counterclockwise rotation " folding ": " folding " and " turning " apportion are to consider that the form of a stroke or a combination of strokes that they comprise is too many; With " 1,3,5,7,9,0 " numerical code expression, be called basic code.2. have or not with stroke and other stroke and intersect, have other stroke to intersect on every " cast aside anyhow and press down " stroke,, be called condition code with " 2,4,6,8 " numerical code expression as " feature ".Both combinations, " 1,2,3,4,5,6,7,8,9,0 ", these ten numbers just are called " digital code " of " cast aside anyhow to press down to roll over and turn ".Press sequential write, give character code with it.As " Na ", the stroke code is " 26 " , “ Myeon ", the stroke code is " 15 ".(2) " alphanumeric codes " pattern: the number in the digital code " 1,2,3,4,5,6,7,8,9,0 " as substituting with letter " g, h, f, j, d, k, s, l, a, m ", promptly obtains " alphanumeric codes ".As top " Na (26) ", alphanumeric codes are " hk ", " Myeon (15) ", and alphanumeric codes are " gd ".Alphanumeric codes relatively are fit to QWERTY keyboard and use.Alphanumeric codes, also available one group of other letter substitutes.Two, big stroke code is represented stroke " feature " with 25 alphanumeric codes.1. basic stroke is divided into " cast aside anyhow and press down folding " five kinds, uses number " 1,2,3,4,5 " expression respectively, be called " basic code "; The stroke here " folding " comprises that the stroke of addressing previously " turns "; 2. according to " commissure " situation of stroke and other stroke, be divided into " solely, first, in, tail, friendship " five kinds of states, use number " 1,2,3,4,5 " expression again respectively, be called " condition code ".These five kinds of states are defined as: " solely ", and not with other stroke commissure mutually; " head ", the first stroke of a Chinese character is connected with other stroke; " in ", the stroke middle part is connected with other stroke; " tail ", the stroke afterbody is connected with other stroke; " friendship ", stroke and other stroke intersect.Basic code and condition code is combined, just obtain 25 kinds big stroke codes.As " youngster ", big stroke code is " 3151 ", " several ", and big stroke code is " 3252 ".Every kind big stroke code, available again 1 letter character is represented.The combination of numbers of big stroke code and corresponding letter and the definition of key position thereof are seen " two minutes Chinese characters define with the keyboard of two division input methods " part.Two kinds of stroke codes are directly read from character stroke, do not need memory; The corresponding pen type of code can indicate on keyboard.
Word got in the connection speech, is to describe Chinese character according to Chinese character in the feature aspect " meaning of word ".Double word phrase commonly used can be formed in 60% of Chinese characters in common use.Average each individual character is associated with 7 double word phrases commonly used at least.These individual characters are used in connection speech mode at ordinary times.Import separately as need, then import the pronunciation of this word earlier,, import the pronunciation of another word again, as shape justice characteristic as the pronunciation part; As get a back word that joins speech, and then adding "~" at the word end, the expression letter economizes.(1) certain word is before phrase, and as " big ", writing " wei3 ` da4 " (" big " of " greatness "): (2) certain word as " close ", is write " mi4 ` yan2~" (" close " of " tightly ") behind phrase.Word got in the connection speech, has " many yards of words " phenomenon, but do not influence the determinacy of understanding and spelling.Because the repetition rate of coding of common phrase is not high, is lower than 20%; The phrase that repeated code is arranged can also increase the length of shape justice feature description.Word got in the connection speech, is applicable to the individual character meaning of word feature description of words commonly used.The word spelling got in the connection speech of individual character, can also form the nestable form of circulation that word got in the connection speech as the shape Yi Tezheng of another words, uses for information processing.
Two minutes Chinese characters can also adopt alternate manner to the description of form meaning of characters feature.(1) structure type.With the structure type of Chinese character shape Yi Tezheng as Chinese character.Structure type of Chinese characters generally has three big classes such as left and right sides structure, up-down structure, heterozygosis (encirclement) structure.It can be respectively with digital " 1,2,3 " expression.In each big class, multiple pattern is arranged again, with its ordering, use respectively again number " 123 ... " expression.As " rosy clouds " word, belong to the 4th kind of pattern in the up-down structure (" 2 "), note is done " 24 ".(2) font style characteristic.There is technicality in the nearly Chinese character of shape (or parts) on the font form, also can be used as the form meaning of characters feature.Have as " mouthful, mouthful " vary in size, " day, say " have long flat branch.These " large and small, long, flat " as shape Yi Tezheng, can be represented with its initial consonant " d, x, c, b " respectively.(3) stroke difference.Identical stroke also has " length " difference in writing, " flat perpendicular " difference.As " soil, scholar, not, end " etc., there is " length " difference of stroke.Can represent with the initial consonant " c, d " of " length ".(4) commissure position.Identical stroke with other stroke commissure mutually, also has various technicality in writing.As the commissure position, the difference of " first, in, tail " is arranged." head " refers to the beginning of stroke; " in ", refer to the middle part of stroke; " tail " refers to the end of stroke.As, " cutter, power ", stroke writing is identical, but commissure position difference, and one is " middle head " commissure, and one is " in " commissure.This " tail among the head " difference can be used phonetic alphabet " s, z, w " expression.(5) stroke segmentation.The big class of identical stroke as " folding ", comprises a lot of concrete strokes.As " Bao,
" wait second of parts, in Chinese character input stroke classification, belong to " folding " class together, but their stroke title difference.The title of available Chinese-character stroke is represented these Hanzi components, and is used as the shape Yi Tezheng of Chinese character." Bao,
", the stroke title is respectively " casting aside the cross break hook ", " casting aside horizontal hook ", " the hard cross break hook of casting aside ".The stroke title can be represented with its initial consonant,, be expressed as " phzg ", as part codes as " casting aside the cross break hook ".(6) parts name.Some structure radicals (parts) do not have title, can give these radicals (parts) unified name, so that import with pronunciation coding (or natural-sounding).As
(by " Ao " word),
(" trembling with fear " prefix) etc., ununified name just is not easy to describe with pronunciation.If it is give pronunciation codes such as " ao ", " han ", just convenient sometimes than stroke code.(6) standard is used.As do not require " splitting maximization ", and can utilize the achievement in research of existing Chinese character, as normal parts, five strokes etc., Chinese character is carried out shape justice feature description.The shape justice feature description of these " other " modes can be used for special circumstances, such as, two nearly parts of shape or identical stroke are made fine description.Code made in available digital or letter; Identical code sign under different prerequisites, is represented different distinguishing characteristicss.
The spelling principle of two minutes Chinese characters.The general pattern of two minutes Chinese characters is: (pronunciation part) ` (shape justice characteristic).Separate (also may be defined as other symbol, or separate) with list separator " ` " between pronunciation and shape Yi Tezheng two parts without symbol.Pronunciation part, front are the Chinese phonetic alphabet of words, and the back is a Chinese language tone.The syllable spelling is in accordance with the regulation of " Scheme for the Chinese Phonetic Alphabet " and " basic principles for Chinese phonetic alphabet ".Shape justice characteristic with Chinese character, Hanzi component or other character, is described the form meaning of characters feature; Separate (or separate, or be defined as other symbol) with list separator "-" between each feature as " (), ,+" etc. without symbol.Shape Yi Tezheng takes, and determines as required.(1) individual character spelling.As " despot " word, pronunciation is " ba4 ", tone code is " 4 ", shape Yi Tezheng gets whole parts, be " rain, leather, the moon " that code is its pronunciation, should be " yu, ge, yue " mutually, two Chinese characters that divide of its all-key pattern (promptly write out tangible adopted feature code) are " ba4 ` yu-ge-yue " (despot), or " ba4 ` yu-(ge-yue) " (despot); Shape Yi Tezheng gets 1 parts, as gets " rain ", and code is its pronunciation " yu ", and two Chinese characters that divide of its brevity code pattern (promptly writing out part shape justice feature code) are " ba4 ` yu " (despot).The shape Yi Tezheng of individual character also can be expressed as Chinese character, Hanzi component or other the Character Style, and as " despot " word, its all-key pattern is " ba4 ` rain (the leather moon) " (despot), and the brevity code pattern is " ba4 ` rain " (despot).(2) phrase spelling.As " advancing ", pronunciation is " qianjin24 ", tone code is " 24 ", shape Yi Tezheng gets whole parts, be " , cut off the feet, well, Chuo " that code is its pronunciation or stroke code, should be " sdg, yue, jing, sas " mutually, two branch Chinese characters of its all-key pattern are " qianjin24 ` sdg-yue-jing-sas " (advancing), or " qianjin24 ` (sdg-yue)-(jing-sas) " (advancing); Shape Yi Tezheng gets 1 parts, as gets " moon ", and code is its pronunciation " yue ", and two branch Chinese characters of its brevity code pattern are " qianjin ` yue " (advancing).The shape Yi Tezheng of phrase also can be expressed as the Hanzi component pattern, and as " advancing ", all-key is " qianjin24 ( cuts off the feet)-(well Chuo) " (advancing), and brevity code also can be regarded " the qianjin ` month " (advancing) as.(3) spelling of two minutes Chinese characters also can be provided by input method.Its easy way is, utilizes existing Chinese character entering technique, sets up two code list of Hanzi that divide, and by input Chinese character or its code, instead looks into two spelling codings that divide Chinese characters, can obtain two spelling codings that divide the multiple pattern of Chinese characters.Can directly read two division input method codings by " two minutes Chinese characters ", as " ba ` yu " (despot) and " qianjin ` yue " (advancing), its Two bors d's oeuveres input coding is " ba ` yu " (despot) and " qmjn ` yt " (advancing), or the omission list separator, be " bayu " (despot) and " qmjnyt " (advancing).The pronunciation part of two minutes Chinese characters can be expressed as " spelling " or " Two bors d's oeuveres " pattern, and other pattern.In the Chinese phonetic alphabet, simple or compound vowel of a Chinese syllable " ü ", when being write as " ü " at needs, available letter " v " replaces.The spelling of two minutes Chinese characters except that general pattern, can also have the multiple pattern of spelling flexibly, to adapt to the spelling of Chinese character need of practice.
The list separator of two minutes Chinese characters and the syllable-dividing mark of the Chinese phonetic alphabet.Both expression symbols can define respectively, also can unified Definition.Definition respectively, the list separator of two minutes Chinese characters is between pronunciation and shape Yi Tezheng, with symbol " ` " expression; Between a plurality of shape Yi Tezheng, with symbol "-" expression; Distinguish mutually with the syllable-dividing mark of the Chinese phonetic alphabet; Also can be defined as other symbol.Unified Definition is with the list separator of the two minutes Chinese characters syllable-dividing mark " ' with the Chinese phonetic alphabet " expression.The definition of letter symbol needs the check of reform of a writing system practice, so the symbol definition of two minutes Chinese characters has dirigibility.
The spelling pattern of two minutes Chinese characters.Chinese character had open structure in two minutes, and front and back two parts can constantly absorb the spelling of Chinese character achievement, can be defined as multiple pattern according to putting into practice needs.The structure of form and the employing of spelling character pressed in two minutes Chinese characters, can be divided into typical-sample, special pattern and simplify pattern.Typical-sample, pronunciation part (containing tone) and shape justice characteristic are complete; The spelling character is a single standard ASCII character character.Special pattern is two special applications of dividing Chinese characters, is different from typical-sample in the employing of the structure of form or spelling character.Simplifying pattern, is that the simplification of typical-sample and special pattern is used.Two minutes Chinese characters by the employing of spelling character, can be divided into full word symbol pattern, character and Chinese character (or parts) assemble pattern, Chinese character (or parts) assemble pattern and digital code pattern.The Character Style, promptly single standard ASCII character character comprises letter, numbers and symbols.Character and Chinese character (or parts) assemble pattern on the basis of the Character Style, has also increased the employing of Chinese character (or parts) character.Chinese character (or parts) assemble pattern is write as with single Chinese character (or parts) character.The digital code pattern is represented two branch Chinese characters with numeral as code, or by other pattern, is converted to through " word/number ".The Character Style here, character and Chinese character (or parts) assemble pattern, Chinese character (or parts) assemble pattern and digital code pattern are different pattern classifications, do not have the relation of inclusion on (character and letter, numeral etc.) literal meaning.The diversity of the two minutes practical patterns of Chinese character is deferred to the form evolution rule of self, does not influence the determinacy of spelling, and practice provides multiple choices to spelling of Chinese character for it, and it gives practical application with bigger dirigibility.The user can select the spelling pattern that is fit to oneself for use according to s own situation, enters the spelling of Chinese character practice.
1. the typical-sample form is: words=(pronunciation (containing tone))+(shape Yi Tezheng)
(1) spelling form Wei4 renmin22 ` ds fuwu24 ` yue (serving the people); Adopted Chinese character two minutes, words is carried out shape justice feature description, the mark tone, " ds " of shape justice part is the stroke code of " people "; " yue ", " clothes " word select the parts pronunciation code of " moon ";
Wei4 renmin22 fuwu24 (serving the people); " the connection speech is used ", mark tone: the approximate Chinese phonetic alphabet.
(2) Two bors d's oeuveres form Wz4 rfmn22 ` ds fuwu24 ` yt (serving the people); " Two bors d's oeuveres ", the deflation spelling pattern of the Chinese phonetic alphabet;
Wz4 rfmn22 fuwu24 (serving the people); Two bors d's oeuveres, " the connection speech is used ".
2. simplify to use the applying flexibly of typical-sample and special pattern, its prerequisite is that certain part must be definite.The pronunciation part, current, can only simple tone; Shape justice characteristic can be as required, brief shape justice feature description.Yi Tezheng gradually reduces when shape, and Chinese character will progressively be drawn close to the Chinese phonetic alphabet in two minutes.When ambiguity not occurring, also can omit list separator.In this instructions, between pronunciation part and shape justice characteristic, the form expression formula of "+" number is arranged, its list separator have " usefulness " and " need not " two kinds of selections; The form expression formula of not having "+" number, its list separator have only " usefulness " or " need not " a kind of selection.Parts in the form (or feature) sequence numbers " n ", expression is taken to the most last 1 parts (or feature)." the connection speech is used " of two minutes Chinese characters is similar to the Chinese phonetic alphabet.Below, be example with the Two bors d's oeuveres pattern, the pronunciation part is tonal not.
(1) (pronunciation)+(shape justice feature code) form is as, Wz rfmn ` ds fuwu ` yt (serving the people).
(2) (pronunciation)+(parts that pronunciation is arranged) form is as, Wz rfmn fuwu ` yt (serving the people).
(3) (pronunciation) form is as, Wz rfmn fuwu (serving the people); Two bors d's oeuveres has only the pronunciation part, the approximate Chinese phonetic alphabet.
(4) (pronunciation) (shape Yi Tezheng) form is as, the rfmn people (people); Omit list separator.
3. special pattern is distinguished unisonance words, the meaning of a word with shape justice characteristic, and the definition part of speech writes down dialect, classical Chinese, gives the unfamiliar word phonetic notation, the emergent expression, or in the use of shape justice characteristic Chinese character, parts or other character, Description of Chinese Character Structure etc.As:
1. (pronunciation)+(Chinese character) form as, yt ` cuts off the feet (cutting off the feet), mi ` Mi (Mi), xnug11 ` new life (new life), or be spelled as yt and cut off the feet (cutting off the feet), mi Mi (Mi), xnug11 new life (new life) omits list separator; Be used for also can be used to carry out the teaching of literacy and popularizing Beijing pronunciation, or improve existing machine (voice) and read to unfamiliar word phonetic notation, classical Chinese spelling.
2. (pronunciation)+(words difference) form as, gsui (public affair), gsui ` formula (formula); , that phrase and other unisonance phrase phase region is other with certain word in the phrase or certain parts as tab-like adopted feature.Make it in linguistic context, ambiguity do not occur.
3. (pronunciation)+(meaning of a word difference) form as, nzxn (heart, at heart), nzxn ` jh (heart, geometrical concept is with " jh " expression " how much "); In shape justice part meaning of a word category is illustrated, phrase and other unisonance phrase phase region is other.
4. (pronunciation)+(part of speech difference) form is as, klgr ` mc (switch, noun is with " mc " expression " noun "), klgr ` dc (switch, verb, usefulness " dc " expression " verb "); Be both " switch ", but the part of speech difference: at shape justice characteristic lex pos.
5. (pronunciation)+(connection speech feature) form is as, xn ` tb (" heart " of " in mind "), or, tb ` xn~(" head " of " in mind "); " word got in the connection speech " defines individual character in the connection speech is used.Get a back word of connection speech, must add a character "~", represent this word.
6. (words pronunciation)+(words meaning) form is as, giga ` bubmiuli (embarrassment); Sometimes, can run into the embarrassment of " can not say and can write ".As, " embarrassment " can not write, if think the meaning of " inconvenience is handled ", just directly adds the Chinese phonetic alphabet (Two bors d's oeuveres) " bubmiuli " of " inconvenience is handled " in shape justice part, plays the emergent effect of expressing.
7. (voice or the phonetic aspect of a dialect)+(note) form is as, gege ` jclo (corner); The phonetic aspect of a dialect, dialect have specific diction.As Chinese character or the inconvenient situation about expressing of the Chinese phonetic alphabet appear, can adopt the way of " the mark voice are at shape justice part mandarin note " to solve.
(8) (pronunciation (mark tone do not mark tone) form as, ziybwhgo4222 ziybwhgo (realm of freedom); Adopt Two bors d's oeuveres to add the connection speech and use, morphology is neat, the approximate again Chinese phonetic alphabet.
(9) ` (parts
1)+(parts
2), or ` (parts
1)+(parts
2)+... + (parts
n) form as, ` Ren (so doing),
Order 2 (emitting), ` Epileptic the third 3 (disease); Having saved the pronunciation part, is two parts horizontally-arranged patterns that divide Chinese character.` (parts 1)+(parts 2).Be two branch literary styles of Chinese character, fit Chinese character is converted to left and right sides structure.Represent that with numeral " 2 " this word was up-down structure originally, represent that with numeral " 3 " this word was hybrid structure originally.Also available
Deng the former Hanzi structure of character representation, as, ` Ren event (doing),
Order
(emitting), ` Epileptic third
(disease).Its full word symbol pattern can be used as the two font codes that divide input of Chinese character.
(10) (pronunciation)+(unit construction)+(structure type), or (pronunciation)+(parts 1+ parts 2+ ... + parts n)+(structure type) form is as, ba ` mouth eight
(" ", left and right sides structure), vi ` mouth
(" only ", up-down structure): Hanzi component and structure type are carried out careful description: the structure character
Deng, or other respective code, can be used as the code of Chinese character font generated data again; This spelling pattern can be used for the synthetic of Chinese character font.In the form, the former can not will to the greatest extent " number " description of parts.
(11) (pronunciation)+(selecting)+(residue)+(structure)+[(pronunciation or stroke code)+(selecting)+(residue)+(structure)]
2+ ... + [(pronunciation or stroke code)+(selecting)+(residue)+(structure)]
NForm is represented two to divide Chinese characters with it, or with it as kanji code; With Chinese character two minutes step by step, fine description.As, " good fortune ", the first order two minutes is regarded " Woo " as with “ Bi ", left and right sides structure, writing
The second level two fens , “ Bi ", regard as
" field ", up-down structure, writing
The third level two minutes,
Regard " one " and " mouth " as, up-down structure, writing
The whole two minutes procedural representations of " good fortune " are: " good fortune "=" Woo "+“ Bi "+
+ [“ Bi "+
+ " field "+
]+[
+ " one "+" mouth "+
].If the parts preferred order is determined, equation the right, " pronunciation or the stroke code " part in two minutes at different levels can be omitted.As, " good fortune "=" Woo "+“ Bi "+
+ [
+ " field "+
]+[" one "+" mouth "+
].
(12) (pronunciation)+(feature 1+ feature 2+ ... + feature n) form has reduced the structure type specification than form (10), but " feature " is more than " parts " intension.Shape Yi Tezheng to Chinese character describes comprehensively, and feature is taken and made every effort to " number " to the greatest extent, can be used for improving existing kanji code.
(13) (pronunciation)+(information character combination) form utilizes two Chinese characters that divide, and describes " literal " phenomenon of some information character combinations.As, " xiao ` " :) ", expression " laughing at ".
(14) the digital code form is represented two Chinese characters that divide with digital code; Or pass through " word/number " conversion by other form and get.Can be used as a kind of digital input coding uses.Also can be used as two machine codes of Chinese character of dividing uses.
Two literary compositions are used with.Two minutes Chinese characters and Chinese character are used (or the Chinese phonetic alphabet and Chinese character use with) with, seem to lack standard, are a kind of complementation and transition actually, are the performances of inheritance and development.As sentence, " yellow he zhi water tian goes up lai." (the water sky of the Yellow River comes up.), seemingly than " the water sky of the Yellow River comes up." brief, the rhythm and pace of moving things is arranged.As phrase, " yellow he " (the Yellow River), " tribute xian " (contribution), " sending out the ` shellfish " (making a good deal of money) and " sending out fen figure qiang " (working with a will to make the country strong) etc. had both been simplified Chinese-character writing, had increased the determinacy of words spelling again.
The use of two minutes Chinese characters.Two minutes Chinese characters, in the pronunciation spelling, the Chinese phonetic alphabet is the same with using, just when ambiguity (uncertain spelling) occurring, just increase shape justice feature description, the unisonance words is distinguished: on shape justice feature description, Chinese character is the same with using, and shape Yi Tezheng is succession and the simplification to adopting Chinese character form.Two minutes Chinese characters and Chinese character have definite corresponding relation, and in use, rule such as Chinese glossary, grammer and rhetoric is constant, can directly use.Only know Chinese character, the person that is ignorant of phonetic can adopt font style characteristic to describe earlier, as select Hanzi component horizontally-arranged pattern for use, enters two Chinese characters that divide.Only can phonetic, the person that fails to see the Chinese character can adopt earlier that the connection speech is used, the connection speech is got modes such as word and mark tone and entered two Chinese characters that divide.Both understood phonetic, the person that knows the Chinese character again, preferably direct selection standard ASCII character the Character Style enters two Chinese characters that divide.Illiterate person also can utilize two Chinese character typing and understanding information of dividing, and carries out self-service study.Under two division input methods help, no matter enter in which way, can both the two Chinese characters that divide of flexible Application.The diversity of the practical pattern of two minutes Chinese characters does not influence the determinacy of spelling.Utilize existing Chinese character information technology, can assist two study and uses that divide Chinese character.(1) input Chinese character is learned with two Chinese characters that divide.Set up a code table for two Chinese characters that divide, utilize the coding inverse check function of existing input method, can obtain two Chinese character spelling codes that divide.(2) take input prompt, help correct two Chinese characters that divide that use.Utilize the prompt facility of existing input method, to the mosaic speech, prompting is given in brevity code application etc.(3) utilize database table, set up individual character word stock, the shape Yi Tezheng of words is pointed out, provide and select for use.
Two minutes Chinese characters, aspect input and output, can realize " multiple pattern input " and. " multiple pattern output ".As, a Chinese character can have multiple input coding pattern; An input coding can be output as two Chinese characters that divide of different patterns.
Self form of two minutes Chinese characters develops.The typical-sample of two minutes Chinese characters is the combination of the Chinese phonetic alphabet and Chinese character (or its shape Yi Tezheng), on the literal form, has own characteristic, and it reads tone and shape justice characteristic, can in use constantly develop.With Chinese character " paste " is example.(1) Chinese character: stick with paste (paste), the special pattern of two minutes Chinese characters can be regarded letter as and economize the pronunciation part.(2) phonetic ` Chinese character: hu2 ` sticks with paste (paste), and shape justice is characterized as whole Chinese character, can regard as to the phonetic annotation of Chinese characters.(3) phonetic ` Hanzi component combination: the hu2 ` Mi-Gu-moon (paste), shape justice is characterized as whole unit constructions of Chinese character.(4) phonetic ` selects parts-remainder: hu2 ` Mi-Hu (paste), shape justice is characterized as two " Chinese character two minutes " patterns that divide Chinese character.(5) phonetic ` parts 1:hu2 ` rice (paste), it is 1 parts that shape Yi Tezheng develops; When this word only selects 1 shape justice feature, in the GB scope, same code word does not appear.(6) phonetic: hu2 (paste), the two minutes another special patterns of Chinese character can be regarded letter as and economize shape justice characteristic; This word does not have shape justice feature description, in the GB scope, more than 20 same code word occur.(7) phonetic is not marked tone: hu (paste), has only basic syllable, can use in no ambiguity linguistic context.(8) basic syllable distortion.With the character spelling metamorphosis of basic syllable, distinguish high frequency unisonance words.As, " recklessly, lake, paste " etc., if necessary, can be out of shape by syllable and represent.Through the evolution of associate (1) to (6), Chinese character " paste " has become phonetic by Chinese character; Equally, through the evolution of associate (6) to (1), phonetic " hu2 " has become Chinese character by phonetic.Shape justice characteristic with (1) to (5) is converted to character code, is exactly two standard A SCII code character patterns that divide Chinese character.To (6), is that the form of shape justice characteristic develops from (1), and the pronunciation of two minutes Chinese characters does not become, and brief constantly the obtaining of adopting Chinese character form conformal Yi Tezheng simplified, and the increase of the determinacy conformal Yi Tezheng of words spelling constantly strengthens; To (8), is that the form of pronunciation part develops from (6), and pronunciation will progressively separate with spelling; Its Practical significance need stand the check of spelling of Chinese character practice.The pronunciation part develops with the form of shape justice characteristic, can carry out simultaneously.Utilize two self form of Chinese characters of dividing to develop, can be the spelling of Chinese character practice, a kind of new thinking is provided.Existing Chinese character information technology can be utilized these characteristics, puts into practice spelling of Chinese character.
Two minutes Chinese characters realize that the words spelling has determinacy with all Chinese-character word-phrases.The major defect of the existing Chinese phonetic alphabet is exactly that the words spelling exists uncertain with Chinese-character word-phrase.For overcoming this shortcoming, its way is: adopt shape justice feature description, make the words of same pronunciation, have different separately feature codes.Describe the not similar shape Yi Tezheng of Chinese character, the simplest way is exactly according to the needs of distinguishing the unisonance words, progressively increases the quantity of shape justice feature description, and it is not duplicated.Pronunciation with all words combines with shape Yi Tezheng again, has just realized that the words spelling has determinacy with all Chinese-character word-phrases.Its concrete effect is as follows: (1) describes the form meaning of characters feature with Chinese character two minutes and stroke code.In the GB scope, do not mark tone, give more than 4800 additional 1 shape Yi Tezheng of (accounting for total number of word more than 67%) everyday character, give more than 1950 additional 2 shape Yi Tezheng of (accounting for total number of word more than 26%) everyday character, give 16 words additional 3 shape Yi Tezheng, just can realize the definite description of whole individual characters.(2) the connection speech is used and is added the font style characteristic description.60% of Chinese characters in common use can join speech to be used, and is about to Chinese characters in common use and forms double word phrase commonly used, is spelled as simple Chinese phonetic alphabet pattern, is applied.This speech is used, and makes the determinacy (not marking tone) of words spelling, by below 20% of individual character application, rises to more than 80%.In 16800 double word phrases, (do not mark tone), have 82% entry can join speech and use, give more than 3000 phrase (account for sum 18%) additional 1 shape Yi Tezheng, give 8 phrases additional 2 shape Yi Tezheng, just can realize the definite description of whole phrases.(3) adopt the mark pronunciation, the way that word got in the connection speech can realize the definite description of Chinese characters in common use.Way is: use basic syllable, determine 416 words; With mark pronunciation and tone, determine 1282 words; Get word with the connection speech, do not mark tone, determine 4357 words; All the other adopt the connection speech to get word, and the mark tone is determined.(4) with (pronunciation)+(Hanzi component combination)+(structure type), or (pronunciation)+(feature 1+ feature 2+ ... + feature n) etc. multiple way can realize the complete description to all Chinese characters.All Chinese characters all are made up of parts (or stroke); Chinese character is split step by step as far as possible, can obtain a series of parts (or stroke) combination; One word is different from its word, just is that its parts (or stroke) combination is different; With these shapes Yi Tezheng, fine description, and symbolism just can be used as the code of all Chinese characters.(5) each component coding of two minutes Chinese characters has unique determinacy.Chinese character had nearly 2,000 two sub-units in two minutes, these parts, and the same with two minutes Chinese characters, additional 1 to 2 shape Yi Tezheng just can realize unique definite description.This is two " are the coding unit with parts " of dividing Chinese character, and application such as input Chinese character and the output of parts horizontally-arranged provide condition.
Two minutes Chinese characters are realized with standard A SCII code character record and transmission Chinese.Two minutes Chinese characters adopt shape justice feature description, realize that the words spelling has determinacy with all Chinese-character word-phrases, for spelling of Chinese character has been created condition.With shape justice feature description, spelling (or conversion) is the Character Style, again with rhythm alphabetic character " ü ", when needs writings " ü ", replaces (also may be defined as other character) with letter " v ", and two minutes Chinese characters have just had standard A SCII code character form.In Chinese character information processing, use two Chinese characters that divide of full word symbol pattern, just realized with international standard A SCII code character record and transmission Chinese.1. this realization will reduce the kind and the quantity of kanji code, 2. realize the coded representation of all Chinese characters, 3. make input coding for Chinese character have the literal function and the 4. favourable reform of a writing system.Between the standard A SCII code character pattern and Chinese character (or Chinese phonetic alphabet) of two minutes Chinese characters, multiple interim form can be arranged.These interim forms are for the spelling of Chinese character practice provides multiple possibility.Such as, " Chinese " word, available standards ASCII character character representation is: han ` ssg (Chinese) is 1.; Also available two other style sheet of Chinese character of dividing is shown: han ` Rui (Chinese) 2., han ` Rui not only (Chinese) 3., the han ` Chinese (Chinese) 4., ` Rui but also (Chinese) 5. wait pattern.
The feasibility of two minutes Chinese characters and legitimacy.Can whether Chinese character has feasibility in two minutes, obtain legitimacy, depended on its practicality.At first, two minutes Chinese characters have practicality.(1) two minutes Chinese characters can corresponding be diverted from one use to another in the place of using the Chinese character or the Chinese phonetic alphabet, can also be used in the place that can not use the Chinese character or the Chinese phonetic alphabet.(as with " xiao ` " :) ", expression " laughing at ".) (2) meeting phonetic, just can use two Chinese characters that divide; Can phonetic, as by counter the looking into of encoding, also can use two Chinese characters that divide.Under existing input method helps, the illiterate pair branch Chinese characters that also can use.(3) " (pronunciation)+(Chinese character) " pattern of two minutes Chinese characters helps popularizing Beijing pronunciation and the spelling of Chinese character practice, has potential social benefit.(4) two minutes Chinese characters meet reform of a writing system direction, are the utilities of spelling of Chinese character, it is integrated application to Chinese character, the Chinese phonetic alphabet and input coding for Chinese character, both having helped reform of a writing system practice, and helped Chinese character information processing again, is taking all factors into consideration of real world applications and longterm planning.(5) utilize the existing information technology, can strengthen two practicality of dividing Chinese character.Seem that two minutes Chinese characters have increased the spelling length of words, in a short time, also increased taking of the paper and the space of a whole page, in fact, this is a kind of long-range reform of a writing system investment, has lasting economic results in society.Utilize " code input ", " input prompt " and " brevity code is used in real time " methods such as (providing brevity code), the application of two branch Chinese characters is all made things convenient for than Chinese character and phonetic according to linguistic context.Secondly, two minutes Chinese characters on form major part legalize.In typical-sample, Chinese character is legal, and the Chinese phonetic alphabet is legal, both are put together, and also should be legal.The Chinese phonetic alphabet exists uncertain in the spelling of minority words, adopts two Chinese characters that divide, and it is carried out shape justice feature description, is to compensate its defect.In other pattern, its pronunciation part is the direct application of the Chinese phonetic alphabet, has legitimacy.In its shape justice characteristic, the Chinese character of employing and most of parts have legitimacy.Stroke code (or other individual code) is not had a legitimacy, but has practicality, in the ordinary course of things, uses few.Shape Yi Tezheng is that its practicality will create conditions for striving for that it legalizes to the replenishing of current specifications.In actual applications, use no or little shape justice feature description as far as possible, two Chinese characters that divide are drawn close to the simple Chinese phonetic alphabet as far as possible.Two minutes Chinese characters in practice process, to using choosing and the specifically spelling of words of pattern, should be carried out the principle of " legal preferential ".
Adopt two Chinese characters that divide, improve kanji code.Make kanji code with two minutes Chinese characters, multiple pattern can be arranged, following several for the practice select for use.(1) (pronunciation)+(feature 1+ feature 2+ ... + feature n) form.Utilize the shape justice characteristics combination (feature is taken and made every effort to " number " to the greatest extent) of Chinese character, Chinese character is carried out fine description.(2) (pronunciation)+(parts 1+ parts 2+ ... + parts n)+(structure type of Chinese characters) form.Utilize the unit construction of Chinese character, Chinese character is carried out fine description.Structure type of Chinese characters has defined the type matrix generated data of Chinese character.It also can be used as the code of Chinese character combined character.(1), (2) two kinds of coded formats, all can encode to all Chinese characters.The kanji code that generates has literal function.Can be by differences such as high frequency word, everyday character, non-common word, the quantity that defined feature is taken.The intension of " feature " is greater than " parts ".Do not consider tone as the pronunciation part, 96% Chinese characters in common use adopt 1 to 2 shape Yi Tezheng, just can realize unique definite description.Adopt the Two bors d's oeuveres pattern, without list separator, 10 yards of maximum code length account for 96% below 7 yards, use 4 features at most, compressed encoding, and mean code length is less than 4.5 yards.(3) ` (parts 1+ parts 2+ ... + parts n) form.Do not have the pronunciation part, shape justice characteristic has only Hanzi component.Adopt the combination of Hanzi component coding, Chinese character is carried out fine description.Can only use a small amount of Hanzi component, Chinese character is expressed as two parts horizontally-arranged patterns that divide Chinese character, carry out input and output.(4) (pronunciation)+(selecting)+(residue)+(structure)+[(pronunciation or stroke code)+(selecting)+(residue)+(structure)]
2+ ... + [(pronunciation or stroke code)+(selecting)+(residue)+(structure)]
NForm; If the parts preferred order determines, " pronunciation or the stroke code " part in the subsequent stages two minutes can be omitted, and is (pronunciation)+(selecting)+(residue)+(structure)+[(selecting)+(residue)+(structure)]
2+ ... + [(selecting)+(residue)+(structure)]
NForm.Adopt two this forms that divide Chinese character, Chinese character was carried out two minutes step by step, and, be expressed as full word symbol pattern, as encode Chinese characters for computer with two fens results.This coding has following characteristics: 1. Chinese character and parts, as long as can also continue two fens, just went down in two minutes always.All structure members and structure type to Chinese character are carried out the all-key description.This description can be regarded " two minutes step by step " from front to back as, can regard " synthetic step by step " back to front as.2. can carry out the brevity code description to the structure member and the structure type of Chinese character.Arbitrary Chinese character when the first order repeated code do not occur after two minutes, did not just enter the second level two minutes; When the second level repeated code do not occur after two minutes, just do not enter the third level two minutes; Circulation splits step by step, till not having a repeated code.This is the simplification that all-key is described.3. can be to the description of encoding of all Chinese characters.One word is different from his word, is the pronunciation difference of Chinese character (and parts), is parts (or stroke) combination and structure difference.With pronunciation, parts and the structure etc. of a word, " number " described to the greatest extent comprehensively, just can avoid identical, realizes the unique definite description of coding to Chinese character.4. this coding can be realized generating complicated Chinese character (or parts) coding with simple Chinese character (or parts) coding.5. the benefit of this coding also is: be convenient to represent Chinese character with the Hanzi component horizontally-arranged; Be convenient to the synthetic Chinese character font of Hanzi component type matrix.The coding of form (4) for example.As, " rosy clouds ", pronunciation are " xia2 ", one-level two is divided into " rain ", " Jia ", up-down structure, writing
" rain ", pronunciation are " yu3 ", can be divided into by secondary two
With
, hybrid structure, writing
, also can no longer divide; " Jia ", pronunciation are " jia3 ", and secondary two is divided into
With
, left and right sides structure, writing
, the stroke code is " 9198 ", also can be expressed as letter " agal ", three grade two is divided into " コ " and " again ", up-down structure, writing
So far, the discreet component of " rosy clouds " finished in whole two minutes; Whole two fens expression formulas step by step of " rosy clouds " are: " rosy clouds "=" xia2 "+" rain "+" Jia "+
+ [" yu3 "+
+
+
]+[" jia3 "+
+
+
]+[" ag "+" コ "+" again "+
].Convert thereof into two letter character patterns that divide Chinese character, wherein character
Be converted to digital code " 24,35,11,21 ", its all-key is: " rosy clouds "=xia2-yu3-jia3-24+ (yu3-gfaj-ssss-35)+(jia3-agfgg-agal-11)+(agal-ag-you4-21); Its brevity code is: " rosy clouds "=xia2-yu3-jia3-24+ (yu3-gfah-ssss-35), or " rosy clouds "=xia2-yu3-jia3-24; " 24,35,11,21 " in the expression formula are structure type shape justice feature code.The code of each parts has unique determinacy.Two minutes kanji codes, if be output as standard A SCII code character, internal code, matrix magazine are identical with the western language code.If be output as other pattern, several situations are then arranged: (1) Chinese character.Internal code, matrix magazine are the same with existing encode Chinese characters for computer.(2) the parts horizontally-arranged pattern of two minutes Chinese characters.Two minutes Chinese characters, parts are in 2000.Adopt the macrostructure parts, be convenient to organize the association of word and coding.Internal code, matrix magazine, its Chinese character part can be less to 2000 kinds.(3) synthetic Chinese character.Under the prerequisite that speed allows, can use synthetic all Chinese characters of the basic element of character (hundreds of is individual).Internal code, matrix magazine, its Chinese character part can arrive the hundreds of kind less.
Adopt two Chinese characters that divide, progressively overcome Chinese character " three difficulties ".Give the Chinese character marking pronunciation, solve " difficulty is read "; Adopting Chinese character form is brief, change version, favourable memorize and writing overcomes that Chinese character " is difficult to write ", " difficult note ".When Chinese character can not accurate recording Chinese, can adopt two Chinese characters that divide to write down Chinese.Realize spelling of Chinese character, could finally solve Chinese character " three difficulties ".Below propose some and put into practice pattern.(1) to single character mark pronunciation, makes its readability.In GB, single character accounts for 4%.Single character is the parts of combinde rqdical character mostly, and is simple relatively on font, writes with memorize relatively easy.Tone can be marked in the mark pronunciation, also can not mark tone.As: 1. yi2 ` razes (name for ancient tribes in the east), yi4 ` Qe (Qe), the mark tone; Or, yi ` smooth (name for ancient tribes in the east), yi ` Qe (Qe), do not mark tone.2. wu ` penta (penta), xu ` the eleventh of the twelve Earthly Branches (the eleventh of the twelve Earthly Branches), and shu ` defends (defending), the nearly sign pronunciation of shape.(2) combinde rqdical character adopts two " pronunciation+shape Yi Tezheng " expressions that divide Chinese character, makes its readability, easily writes, easily remembers.Shape Yi Tezheng, available unit add structure type to be described, and also can only use component representation.The selection of Apply Styles is as the criterion to help overcoming Chinese character " three difficulties ".1. complete 1 (bolt) of suanl ` wood, shuangl ` rain is 2 (frosts) mutually, and xiangl ` factory is 3 (railway carriage or compartments) mutually, the mark tone, structure type (big class) is with 1 numeral, " 1 " expression left and right sides structure, " 2 " expression up-down structure, " 3 " represent heterozygosis (encirclement) structure.2. suan ` wood complete (bolt), shuang ` rain phase (frost), xiang ` factory phase (railway carriage or compartment) is not marked tone, does not mark structure type.3. suan ` wood is complete
(bolt), shuang ` rain phase
(frost), xiang ` factory phase
(railway carriage or compartment), the structure type character representation, relatively more directly perceived.4. ` wood is complete
(bolt), ` rain phase
(frost), ` factory phase
(railway carriage or compartment) do not mark pronunciation.5. ` wood complete (bolt), ` rain phase (frost), ` factory phase (railway carriage or compartment) is not marked pronunciation, does not mark structure type.More than in the various patterns, shape Yi Tezheng adopts Hanzi component to describe, and also can be converted into Chinese phonetic alphabet and describe, and makes it close to spelling of Chinese character.Chinese character among the shape Yi Tezheng (or parts) has pronunciation, represents with the Chinese phonetic alphabet of its pronunciation, no pronunciation, with its stroke coded representation, can finish this conversion.As, 6. suan ` mu-quan (bolt), shuang ` yu-xiang (frost), part codes is listed as entirely.7. suan ` mu (bolt), shuang ` yu (frost) gets the part part codes.8. suan (bolt), shuang (frost) does not use in the linguistic context of not having the fork meaning, at this moment, the two minutes approximate Chinese phonetic alphabet of Chinese character.
Utilize two Chinese characters that divide, the simplified Chinese characters body.The Chinese character of multi-part combination, available two Chinese characters that divide are expressed as pattern: " pronunciation+parts
1+Parts
2++ parts
n" (1).In actual applications, the shape Yi Tezheng of 1 (or 2) parts as certain word only used in Chinese character in two minutes, just can realize this word qualitative expression really.Like this, the expression formula of front (1) will be reduced to: " pronunciation+parts
1(or+parts
2) " (2).From expression formula (2) as can be seen, adopting Chinese character form in use simplified in Chinese character in two minutes.And this simplification does not influence the determinacy of spelling, does not increase the total amount of existing Chinese character yet.In GB, there is 80% Chinese character can be expressed as " pronunciation+parts
1" pattern, and have unique determinacy of spelling.In other words, adopt two Chinese characters that divide, 80% Chinese characters in common use are arranged, use the structure member of half at most, just can normally use.With Hanzi component, represent with its pronunciation code, can also realize " the unisonance merger " of parts, the structrual description of Chinese character is simplified more.In the GB scope, adopt the merger of parts unisonance after, 70% Chinese character is arranged, only with 1 component feature, just can realize that unique determinacy represents, and have only the pronunciation code of parts, created condition to spelling of Chinese character.The shape justice feature description of two minutes Chinese characters, with make Chinese character the body memorize, write, progressively move towards characterization, contoured and symbolism, and then realize spelling of Chinese character.
The transversely arranged pattern of the parts of two minutes Chinese characters.Hanzi component has the certain structure form in the unit construction of Chinese character, three big classes such as left and right sides structure, up-down structure, heterozygosis (encirclement) structure are generally arranged.Each big subdivision has tens kinds of concrete forms.Two minutes Chinese characters are described Chinese character in two fens modes of Chinese character, and the version of existing Chinese character has kind more than 20.As Hanzi component is transversely arranged without exception, can make Chinese character structurally obtain simplifying.This simplification is feasible.Among the GB, the Chinese character of left and right sides structure accounts for 66%, and with its horizontally-arranged, knowledge is recognized unaffected; Hybrid structure accounts for 8%, and behind the 40% employing horizontally-arranged wherein, knowledge is recognized unaffected; Up-down structure accounts for 25%, and 93% contact is a little wherein arranged, and also can conveniently know and recognize; Remainder, the filling structural code is illustrated, still can know and recognize application.In application, horizontally-arranged is as the criterion with practicality.As: 1. left and right sides structure: qin2 ` wood fowl (Qin), qin3 ` Jin
(carving), the qin4 ` Rui heart (oozing); Or writing: ` wood fowl (Qin), ` Jin
(carving), the ` Rui heart (oozing).2. up-down structure: the qing1 ` month 2 (green grass or young crops), qin2 ` Lv jin 2 (celerys), qin2 ` Jue the present 2 (qin); Or writing: the ` month 2 (green grass or young crops), ` Lv jin 2 (celery) , ` Jue the presents 2 (qin), numeral " 2 " expression up-down structure; Or writing: the ` month
(green grass or young crops), ` Lv jin
(celery) , ` Jue the present
(qin) adds the structure character
3. hybrid structure: ` Epileptic third (disease), ` walk oneself (rising), ` is an ancient type of spoon (spoon), does not add structural code, does not also influence knowledge and recognizes; Or writing: ` Epileptic third
(disease), ` walks oneself
(rising), ` is an ancient type of spoon
(spoon) adds the structure character
Deng.
The double spelling code definition of two minutes Chinese characters.In two minutes Chinese characters, the pronunciation of Chinese character or parts can be write as the Two bors d's oeuveres pattern of the Chinese phonetic alphabet.In the Two bors d's oeuveres pattern, initial consonant, simple or compound vowel of a Chinese syllable or letter (or its combination) are represented with 1 alphanumeric codes on the keyboard.Double spelling code should be put into practice the needs definition according to spelling of Chinese character, so that absorb the spelling of Chinese character achievement.This explanation is for the double spelling code definition provides a kind of practice scheme.In this explanation, 1. simple or compound vowel of a Chinese syllable " ü " when being write as " ü " at needs, replaces with letter " v "; 2. simple or compound vowel of a Chinese syllable " ê, er, ueng ", no sound cooperates in mandarin, belongs to zero consonant syllable, separately the definition key position; " ê " uses separately as need, and available characters " e ' " expression; " ueng " if there is sound to cooperate needs, represents with " u-eng " monogram, and with each monogram part, is converted to corresponding double spelling code, as " u-g ", not fettered by existing Two bors d's oeuveres form; Definition character " ng " is for corresponding with phonetic symbol " towering "; If 3. the syllable of new generation is arranged, before not having the definition key position, can adopt the approaching spelling pattern of phoneme, or the approaching spelling pattern of form, be expressed as " x-y ...-z " pattern (1 letter or its combination represented in each character), and be converted to corresponding double spelling code, to deal with needs: 4. zero consonant syllable, " scheme " regulation is followed in the conversion of alliteration " i, u, ü ", the remaining head vowel of a final and rhythm portion are represented with corresponding double spelling code; As, " ian " uses separately, is transformed to " yan ", its double spelling code be " yj " (y-an), rather than " m " is (ian).It specifically is defined as: " A ", represent simple or compound vowel of a Chinese syllable " a "; " B " represents initial consonant " b ", simple or compound vowel of a Chinese syllable " ou "; " C " represents initial consonant " c ", simple or compound vowel of a Chinese syllable " iao "; " D " represents initial consonant " d ", simple or compound vowel of a Chinese syllable " uang, iang "; " E " represents simple or compound vowel of a Chinese syllable " e "; " F " represents initial consonant " f ", simple or compound vowel of a Chinese syllable " en "; " G " represents initial consonant " g ", simple or compound vowel of a Chinese syllable " eng " and letter " ng "; " H " represents initial consonant " h ", simple or compound vowel of a Chinese syllable " ang "; " I " represents initial consonant " ch ", simple or compound vowel of a Chinese syllable " i "; " J " represents initial consonant " j ", simple or compound vowel of a Chinese syllable " an ": " K ", represent initial consonant " k ", simple or compound vowel of a Chinese syllable " ao "; " L " represents initial consonant " l ", simple or compound vowel of a Chinese syllable " ai "; " M " represents initial consonant " m ", simple or compound vowel of a Chinese syllable " ian "; " N " represents initial consonant " n ", simple or compound vowel of a Chinese syllable " in "; " O " represents simple or compound vowel of a Chinese syllable " o, uo "; " P " represents initial consonant " p ", simple or compound vowel of a Chinese syllable " un, vn "; " Q " represents initial consonant " q ", simple or compound vowel of a Chinese syllable " iu "; " R " represents initial consonant " r ", simple or compound vowel of a Chinese syllable " uan, van "; " S " represents initial consonant " s ", simple or compound vowel of a Chinese syllable " iong, ong "; Letter " T " is represented initial consonant " t ", simple or compound vowel of a Chinese syllable " ve "; Letter " U " is represented initial consonant " sh ", simple or compound vowel of a Chinese syllable " u "; " V " represents initial consonant " zh ", simple or compound vowel of a Chinese syllable " ui, v "; " W " represents letter " w ", simple or compound vowel of a Chinese syllable " ua, ia "; " X " represents initial consonant " x ", simple or compound vowel of a Chinese syllable " ie "; " Y " represents letter " y ", simple or compound vowel of a Chinese syllable " uai, ing "; " Z " represents initial consonant " z ", simple or compound vowel of a Chinese syllable " ei ".But Two bors d's oeuveres alphanumeric codes direct mark does not need memory on keyboard.
Chinese character helped Chinese speech input and output The Application of Technology in two minutes.In the Chinese natural phonetic entry, the unisonance words of Chinese is not easily distinguishable, and can have influence on the determinacy of input.The form evolution rule of two minutes Chinese characters has been pointed out the relation of voice and shape Yi Tezheng.As in phonetic entry, auxiliary with words shape Yi Tezheng, can improve the accuracy rate of phonetic entry.Its way can be: 1. in phonetic entry, with the shape Yi Tezheng of keyboard (or pen) input words.2. when phonetic entry, the shape Yi Tezheng of " reading in " words.The shape Yi Tezheng of words is represented with voice.No pronunciation parts can be unified name, or directly read the stroke code.Adopt two Chinese characters that divide, also help the application of voice output technology (as phonetic synthesis, voice reading etc.).The determinacy of two minutes Chinese characters and Chinese (and Chinese character) corresponding relation helps improving the determinacy of Chinese speech output, as solving " a word multitone " etc.The pronunciation part of two minutes Chinese characters is for the voice output The Application of Technology provides convenience.Utilize two pronunciation parts of dividing Chinese characters, can realize that " being unit with the syllable " synthesize Chinese, " is unit with the syllable " reading manuscript.
Utilize two Chinese characters that divide, popularize Beijing pronunciation, the auxiliary teaching of literacy is put into practice spelling of Chinese character.1. utilize two form evolution rules of dividing Chinese character, improve the existing Chinese phonetic alphabet and the teaching of literacy.2. two " (pronunciation) ` (Chinese character) " that divide Chinese character, or " (pronunciation) (Chinese character) " pattern, as " al " (), overcome the phonetic notation inconvenience of existing Chinese character, it is combined as a whole the Chinese character and the Chinese phonetic alphabet, can be used for popularizing Beijing pronunciation and carries out the teaching of literacy.3. two " (pronunciation) ` (parts that divide Chinese character
1Parts
2) " pattern, as " al ` mouth Ah " () is divided into two parts with Chinese character " ", on the basis of strengthening pronunciation, and the memorize of having given prominence to Hanzi structure and parts again.4. two " (pronunciation) ` (parts 1 code-parts 2 codes) " patterns that divide Chinese character, as " al ` kou-a " (), the full word symbol pattern of two minutes Chinese characters helps putting into practice spelling of Chinese character, makes the juvenile understand phonetic, will Chinese (Chinese character) input.Juvenile that can phonetic also can be according to the two Chinese characters that divide of sample input, under voice suggestion, and phonetic, character learning and reading.In the teaching of literacy,, can read two pronunciation parts of dividing Chinese character earlier to failing to see the juvenile of Chinese character; In a large amount of readings, progressively grasp the Chinese character or the Hanzi component of shape justice part; On a large amount of bases of reading, the words composition is write in study.
Utilize the two characteristics of Chinese character aspect literal, input and output of dividing, can be information security technology a kind of new approaches are provided.(1) file encryption.Its characteristics are: 1. the text style of record instruction is new, and can define selection.2. the input code table of literal is special-purpose, can define selection.Keyboard key-position can define.3. font file is special-purpose, can define selection.It is special-purpose being used for the synthetic parts type matrix of Chinese character.4. Shu Chu mode has diversity, and can define selection.(2) credit identification.Utilize the two personalized definable features of Chinese character aspect literal and input and output of dividing, as individual's credit sign.Such as, an envelope Email is got in touch as having no credit with the litigant, does not just have the other side's credit code table, can only be a pile mess code, will be removed automatically, can avoid litigant's the letter of making an uproar scratched and disturb.(3) network security.Networks development needs the standardization of infotech; But the safety of network but needs the personalization and the creditization of infotech.Two minutes Chinese characters can be used as the instrument of putting into practice of this personalization and creditization.(4) prevention and cure of viruses.The appearance of existing virus is all being reminded us at any time, coding and microprogram in the general machine, and it is dangerous greatly to lie dormant.Message pick-up should be prerequisite with credit; Code should be personalized in the machine.The personalized definable characteristic of two minutes Chinese characters can be used as the trial of this respect.Under the support of existing information safety technique, two minutes Chinese characters will provide a kind of new selection for it.
The publication of two minutes Chinese characters with the fundamental difference of existing publication, just is the technical of it.This technical, an illiterate people can use it to carry out machine reading and study.
Two minutes Chinese characters can be the disabled person and provide convenience.The blind person is not easy to " word selection input ", available it to the fine description of shape Yi Tezheng, accurately express Chinese.The deaf-mute, available its characteristic aspect voice annotation are carried out (machine) speech exchange and sign language easily and are expressed.
Two, two division input methods
Two division input methods are two input method and application aspect existing Chinese character input thereof that divide Chinese character self.So the narration of relevant two division input methods comprises two parts, the one, the input of existing Chinese character, the 2nd, two inputs that divide other pattern of Chinese character.Existing Chinese character information processing, quite complete aspect software engineering, here, only the coding characteristic of just two division input methods is narrated.The input coding of two division input methods, can directly read by two minutes Chinese characters (or remove list separator, or compress, with its simplification).As, " sign indicating number " word can have " ma ` shi " (sign indicating number), or " mashi " (sign indicating number), or " ma " different patterns such as (sign indicating numbers).Do not understand two Chinese character persons that divide, can pass through " Chinese character two minutes " yet, extract the input coding of Chinese character.The diversity of the two minutes practical patterns of Chinese character makes two division input methods have multiple coded format, provides multiple choices to practical application, can enter two Chinese character states that divide from different perspectives, realizes Chinese character and two input that divides other pattern of Chinese character.
(1) existing Chinese character input
Existing Chinese character is two special patterns that divide Chinese character.The input method of existing Chinese character is narrated separately, be that the supposition user does not understand two Chinese characters that divide, and mainly narrated from the angle of existing Chinese character input.Here, by the application of " Chinese character two minutes " method, input is illustrated to the Chinese character of two division input methods.For sake of convenience, the content of having addressed in " two minutes Chinese characters " no longer repeats as far as possible.
The coding principle of Chinese character input.According to " Chinese character two minutes " rule, Chinese character is divided into " selecting part " and " remainder " (being called for short " selecting " and " residue ").Every part is with the coded representation of its pronunciation (or stroke).Pronunciation code can be spelling pattern, Two bors d's oeuveres pattern or other pattern of the Chinese phonetic alphabet.The stroke code adopts 10 numerals (or letter) to represent the feature of stroke.Character formation component is selected " generally can recognize Chinese character " for use as far as possible, and the pronunciation code of its polyphone adopts " general pronunciation " expression.As parts and Chinese character " with reading ",, also its " with reading " code can be cast out the fresh code of selecting for use its next stage to split for shortening code.The general format of coding is: encode Chinese characters for computer=(pronunciation)+(selecting part)+(remainder) (the every definable in the right is accepted or rejected).The encode Chinese characters for computer of reading is used for QWERTY keyboard, just uses character representation; Be used for numeric keypad, just use numeral.Between (pronunciation) and (selecting part), between (selecting part) and (remainder), separate, also can separate without symbol with symbol.Single character code, 1. the priority (or order of strokes observed in calligraphy) of writing by Chinese character reads coding; 2. also can read coding by the rule of " the one-tenth word is preferential ", " getting big preferential "; 3. based on general format " coding=(pronunciation)+(selecting)+(residue) ", spelling is encoded; Its pronunciation is the actual pronunciation of Chinese character or phrase; 4. unfamiliar word provides font code; Unfamiliar word font code=(selecting)+(residue).The phrase coding, based on the two-character word group coding, general format is: phrase coding=(pronunciation part)+(parts part)." the pronunciation part " of coding, 1. the double word phrase adopts " sound sound " form; Three words groups adopt " several sound " form; Four words and the above phrase of four words adopt " several " form, the 4th yard, get the code of the most last 1 word; 2. the above phrase of three words also can adopt " sound
1Sound
2Sound
N" form, every word Two bors d's oeuveres, sound is complete, gets 6 words at most, and last 1 word of phrase got in the 6th word." the parts part " of phrase coding used when needed.If needed, can extract code (extract 1 code as every word, or every word extracting a plurality of codes) by the individual character order.Be that coding is given an example below.With GB is the discussion scope, and the pronunciation code of parts is the Two bors d's oeuveres pattern.Individual character in " Chinese character two minutes ", closes word and accounts for more than 96%, and solely the word less than is 200, and only word has only several.(1) closes word code.To close word two minutes, as " despot " word, be split as " rain " and "
" two parts: " despot " word, pronunciation are " ba "; The pronunciation of " rain " is " yu ", "
", row is two minutes again, is " leather " and " moon ", and its code is " ge-yt "; Its all-key is " ba ' yu-ge-yt " (despot), 11 yards of code lengths; After saving list separator, be encoded to " bayugeyt " (despot), 8 yards of code lengths.Parts "
", also available stroke coded representation; The stroke code is got preceding two here, i.e. " hj " (" horizontal stroke " and " erecting " have " intersection "); So another of " despot " word is encoded to " ba ' yu-hj " (despot); After saving list separator, be encoded to " bayuhj " (despot).The compressed code of " despot " word is " bay " (despot), and code length is 3 yards.As " despot " is unfamiliar word, does not know pronunciation, and its font code is " yu-ge-yt " or " yugeyt " (despot), 8 yards or 6 yards of code lengths.(2) only word code.Only word two is divided into " stroke and parts ", or " stroke and stroke combination "; As " in vain " word, be split as " Pie " and " day " two parts; Pronunciation is " bl "; The stroke code of " Pie " is " d ", and the pronunciation code of " day " is " ri "; Its all-key is " bl ' d-ri " (in vain), 7 yards of code lengths; After saving list separator, be encoded to " bldri " (in vain), 5 yards of code lengths.The compressed code of " in vain " word is " bl " (in vain), 2 yards of code lengths.As " in vain " is unfamiliar word, and its font code is " d-ri " (in vain), stroke and unit construction; Or be " dfagg " (in vain), full stroke code.(3) only word code.Only word code, " the having " and " nothing " of seeing its stroke; As " second " word, it is regarded as two parts of " second " (stroke is arranged) and " " (no stroke is with " w " expression); The pronunciation of " second " is " yi "; Its all-key is " yi ' yi-w " (second), 7 yards of code lengths, save list separator after, be encoded to " yiyiw " (second), 5 yards of code lengths.The compressed code of " second " word is " yi " (second), 2 yards of code lengths.(4) two-character word group coding.The two-character word group coding adopts " sound sound " form,, is encoded to " rggs " (manually) as " manually "; If repeated code is arranged, increase by 1 (or 2) part codes: the 1st word is " people ", as gets stroke " Pie ", and code is " d ", and phrase is encoded to " rggs ' d " (manually); If also have repeated code, the 2nd word is " worker ", as gets stroke " ", and code is " g ", and phrase is encoded to " rggs ' dg " (manually).In double word phrase commonly used, get 2 parts (or stroke) code at most, just can guarantee unique determinacy of encode Chinese characters for computer.In the double word phrase, taking for example of part codes also is applicable to other multiword phrase.(5) three words group codings.Three words group codings adopt " several sound " form,, are encoded to " rghu " (man-made lake) as " man-made lake ", and the 3rd word rounds a basic syllable.(6) four words (and more than) the phrase coding.Four words (and more than) phrase, adopt " several " form, the 4th yard, get the last word initial consonant,, be encoded to " rgiy " (rainmaking) as " rainmaking ".The coding of three words (more than reaching) phrase also can adopt " sound
1Sound
2Sound
N" form,, be encoded to " rfgsjdyu " as " rainmaking ".
" Chinese character two minutes " is divided into nearly 2000 parts with the Chinese character in the GB model horse stable.Wherein, nearly 1400 of character formation component, nearly 600 of character non-formation component.In the character formation component, Chinese characters in common use account for 90% (wherein " generally can recognize Chinese character " and account for 90% again), and non-common Chinese character accounts for 10%; Individual surplus the mutiread sound Chinese character 100, account for 5%.In the character non-formation component, the traditional structure parts only account for 16%, and all the other are Chinese character two minutes " remainder ", account for 84%: the traditional structure parts have pronunciation (comprising " Gu is read ") mostly: in " remainder ", what include the pronunciation parts accounts for 50%.In " Chinese character two minutes " parts, the parts that pronunciation arranged or have pronunciation code after treatment have more than 1700, account for 87% of whole parts; About 250 of no pronunciation parts account for 13%.
The pronunciation parts are arranged, make code with its pronunciation.Pronunciation can be represented with spelling, Two bors d's oeuveres or other pattern of the Chinese phonetic alphabet.Polyphone is with " general pronunciation " expression.Single character when marking pronunciation, for reducing repeated code, can increase the stroke code description, and code length can be prepared 6 yards.Non-common Chinese character also can be prepared its stroke code when making code with pronunciation, look into usefulness for the user.No pronunciation parts, available its stroke coded representation: the coding code length can be prepared 6 yards, in concrete form, can only select 2 or 3 yards for use.No pronunciation parts also can be given name, and it can be represented with pronunciation code." residue " parts in the no pronunciation parts include the pronunciation parts mostly, and the code of these " residue " parts represented in available its pronunciation, or add the stroke code in the pronunciation code back, and 3 yards of code lengths are distinguished mutually with " the pronunciation parts are arranged ".All parts, all available stroke coded representation.Get preceding 5 and the last coding of each parts, it determines that rate is 80%; Get preceding 3 codings of each parts, it determines that rate is 16%; Get preceding 2 codings of each parts, it determines that rate is 3.5%.
Encode Chinese characters for computer between the Character Style and the digit style (or the Character Style), can be changed mutually.Abbreviate " word/number " conversion as, or " number/word " conversion, or " word/word " conversion.The mutual conversion of this character and numeral (or character) can realize that same coding is applied to the keyboard of QWERTY keyboard, numeric keypad or other form.After character code is converted to numerical coding, the no repetition rate of coding of encode Chinese characters for computer will descend to some extent.This variation has correlativity with character-coded average stroke.Be character-coded average stroke more near maximum code length, the digitally coded no repetition rate of coding will be high more.So the basic coding with every kind of form is advised in " word/number " conversion, promptly the representativeness of this form is encoded.Numerical coding is converted to character code, does not have this situation.This " word/number " or " number/word " changed, and do not change the input using method of original coding.The keyboard of so-called other form can be two keys, triple bond, or multikey, can realize available coding configuration by " word/number " conversion." word/word " conversion can self-service definition and conversion Two bors d's oeuveres (or other) model code.Its realization program belongs to general technology.
The practical form of Chinese character input.Two division input methods provide multiple encode Chinese characters for computer combination, no matter can realize which kind of pattern can both importing Chinese character with, satisfy various different demands.Encode Chinese characters for computer presents Discrete Distribution, and under same coded format, owing to arrangements such as taking of word preface, part length and list separator are different, its specific coding also just has " similar ".The various coded formats that will narrate below are summaries of certain coding thinking, are the comprehensive narrations of several coding patterns.
1. the general format of the two division input method phonological and calligraphical syn thesize codings of phonological and calligraphical syn thesize coding: single character code=(Chinese-character pronunciation)+(selecting part)+(remainder)." pronunciation " code, Two bors d's oeuveres are 1 or 2 yard, and spelling is 1 to 6 yard; " stroke " code, parts can be taken to 2 or 3 yards, and solely word can be taken to 4 or 5 yards.At the all-key state of general format, pronunciation code generally can only be got the Two bors d's oeuveres pattern, because code length is subjected to the restriction of operating system sometimes.Between " pronunciation " code and " selecting " code, between " selecting " part and " residue " part, additional list separator, pronunciation or unit construction relation with outstanding coding make coding have literal function.
1. coding=(pronunciation) ' (selecting)+(residue) form; " pronunciation " code is got the Two bors d's oeuveres pattern, is 1 or 2 yard; " stroke " code, parts can be taken to 2 yards, and solely word is got 4 or 5 yards at most.Basic coding (not doing any adjustment, representational coding), all-key, the no repetition rate of coding 92% (, all write down 7271, below identical) in the GB scope.7 yards of maximum code length, average keystroke 6.89 times.Characteristics are, use list separator, have given prominence to pronunciation, making coding have literal function.All-key, after " word/number " conversion, the no repetition rate of coding is more than 77%.Its compressed encoding, " is unit with the character " since 1 yard, increases code length gradually, carries out the uniqueness screening, it compressed, and do not have repeated code and handle, and can realize 6 yards of maximum code length, average keystroke 4.41 times.After " word/number " conversion, do not do any adjustment, the no repetition rate of coding will drop to 47%, in the GB scope, lack Practical significance.
2. coding=(pronunciation) (selecting)+(residue) form; 1. compare with form, except that not having list separator " ' ", all the other are identical.Basic coding, all-key do not have repeated code and reach 92%.6 yards of maximum code length, average keystroke 5.89 times.Its compressed encoding, " is unit with the character " compresses it, do not have repeated code and handles, and can realize 5 yards of maximum code length, average keystroke 3.58 times." word/number " conversion, 1. approximate with form.
3. coding=(pronunciation)+(selecting)+(residue) form; " pronunciation " code is got the Two bors d's oeuveres pattern, is 1 or 2 yard; " stroke " code, " selecting " part can be taken to 2 yards, and " residue " part does not limit, and solely word is got 4 to 5 yards at most.Make 8 yards as maximum code length, no repeated code, average keystroke 6.00 times; " word/number " conversion, the no repetition rate of coding is 95%.Make 6 yards as maximum code length, the no repetition rate of coding is 98%, average keystroke 5.95 times; " word/number " conversion, the no repetition rate of coding is 92%.Its " is unit with the character " compressed compressed encoding, no repeated code, average keystroke 3.60 times; " word/number " conversion, in the GB scope, no Practical significance.
4. coding=(pronunciation)+(selecting)+(residue)+(to " residue " two minutes once more) form; " pronunciation " code is got the Two bors d's oeuveres pattern, is 1 or 2 yard; " stroke " code can be taken to 2 yards, and solely word is got 4 to 5 yards at most.The characteristics of coding are: " is unit with parts " compressed: by " pronunciation ", and " selecting ", the order of " residue ", increase code length gradually, increase by 1 or 2 yard: after taking " residue ", still have repeated code at every turn, then " residue " parts are carried out two minutes second time, and compress.The no repeated code of encoding.Maximum 10 yards, have only 9 greater than 8 yards records.Average keystroke 4.47 times.After " word/number " conversion, the no repetition rate of coding is 59%.
5. coding=(pronunciation)+(individual character stroke) form; " pronunciation " code is got the Two bors d's oeuveres pattern, is 1 or 2 yard: " individual character stroke " code, press the order of writing strokes code fetch of individual character, and get 4 or 5 yards at most.The no repetition rate of coding 84%.6 yards of maximum code length, average keystroke 5.98 times.After " word/number " conversion, the no repetition rate of coding is 68%.
6. coding=(pronunciation)+(selecting stroke)+(residue stroke) form; " pronunciation " code is got the Two bors d's oeuveres pattern, is 1 or 2 yard; The stroke code, " selecting " gets 4 yards at most with " residue " every part, and solely word is got 6 yards at most.No repeated code, 10 yards of maximum code length, average keystroke 9.04 times.After " word/number " conversion, the no repetition rate of coding is 98%.The compressed encoding of its " is unit with the character ", no repeated code, 8 yards of maximum code length, average keystroke 4.06 times.After " word/number " conversion, the no repetition rate of coding is 53%.
7. coding=(pronunciation)+(popular font code part codes) form; With the shape Yi Tezheng of Chinese character, describe with the part codes of popular font code; The user who is beneficial to familiar popular font code enters two Chinese character states that divide.As, " pronunciation " combines with " five strokes ".
8. coding=(pronunciation) invisible adopted part is a sound sign indicating number pattern in fact.Directly import the Chinese phonetic alphabet of Chinese character.Its principle is by " basic syllable ", to define word more than 400; By " basic syllable adds tone ", definition word more than 1200; By " word (not marking tone) got in the connection speech ", definition word more than 4300; All the other are by " word (mark tone) got in the connection speech " definition.Meeting phonetic just can use.Can carry out " word/number " conversion.
2. the font code of the two division input methods of font code can Chinese phonetic alphabet person not provided convenience.Its general format is: single character code=(selecting part)+(remainder)." select " and the adopted feature description of the shape of " residue ", can adopt pronunciation code, also can adopt the stroke code.Pronunciation code, as adopting the Two bors d's oeuveres pattern, code length 1 or 2 yards: stroke code, maximum 6 yards of code length.
1. coding=(selecting)+(residue) (pronunciation and stroke code) form; Chinese character is divided into " selecting " and " residue " two parts, every part is with its pronunciation or stroke coded representation.Basic coding, all-key, the no repetition rate of coding 80.47%.Every part is got 3 yards at most.Maximum 6 yards of all-key code length, average keystroke 4.39 times.After " word/number " conversion, the no repetition rate of coding is 52%.Its " is unit with the character " compressed the no repetition rate of coding 86.03%, maximum 5 yards of code length, average keystroke 3.84 times.
2. coding=(selecting)+(residue), (4+6) form; Press the order of writing strokes of parts, take the stroke code, " selecting " part is got 4 yards at most, and " residue " part is got 6 yards at most, is called for short " 4+6 " form.It is 95.88% that basic coding, all-key do not have the repetition rate of coding.Maximum 10 yards of code length, average keystroke 8.31 times.Its " is unit with the character " compressed, and the no repetition rate of coding is 99.46%, maximum 10 yards of code length, average keystroke 5.99 times.Encode by 1 yard increase gradually when its " residue " part, code length with the pass of the no repetition rate of coding is; 6 yards, 64.16%; 7 yards, 85.35%; 8 yards, 95.37%; 9 yards, 98.87%; 10 yards, 99.46%.After " word/number " conversion, the no repetition rate of coding is constant.
3. coding=(selecting)+(residue), (4+6m) form; 2. approximate with form, just last 1 yard of " selecting " and " residue " part, the most last 1 yard of taking stroke writing.Its compressed encoding, the no repetition rate of coding are 99.53%, maximum 10 yards of code length, average keystroke 5.95 times.
4. (5+5) form of coding=(selecting)+(residue); " select " part and get 5 yards at most, " residue " part is got 5 yards at most, is called for short " 5+5 form ".Basic coding, all-key, the no repetition rate of coding 94.51%.10 yards of maximum code length, average keystroke 8.48 times.Its " is unit with the character " compressed, and the no repetition rate of coding is 99.28%, maximum 10 yards of code length, average keystroke 6.23 times.Encode by 1 yard increase gradually when its " residue " part, code length with the pass of the no repetition rate of coding is: 6 yards, and 56.07%:7 sign indicating number, 79.46%; 8 yards, 92.50%; 9 yards, 97.90%; 10 yards, 99.28%.After " word/number " conversion, the no repetition rate of coding is constant.
5. (3+6) form of coding=(selecting)+(residue); " select " part and get 3 yards at most, " residue " part is got 6 yards at most, is called for short " 3+6 " form.Basic coding, all-key, the no repetition rate of coding 87.78%.9 yards of maximum code length, average keystroke 7.77 times.Its " is unit with the character " compressed, and the no repetition rate of coding is 97.83%, maximum 9 yards of code length, average keystroke 5.73 times.Encode by 1 yard increase gradually when its " residue " part, code length with the pass of the no repetition rate of coding is: 6 yards, and 71.48%; 7 yards, 87.68%; 8 yards, 95.14%; 9 yards, 97.83%.After " word/number " conversion, the no repetition rate of coding is constant.
6. (3+3) form of coding=(selecting)+(residue); " select " part and get 3 yards at most, " residue " part is got 3 yards at most, is called for short " 3+3 " form.Its " is unit with the character " compressed, and the no repetition rate of coding is 71.48%, maximum 6 yards of code length, average keystroke 5.39 times.
7. (2+6) form of coding=(selecting)+(residue); " select " part and get 2 yards at most, " residue " part is got 6 yards at most, is called for short " 2+6 " form.Basic coding, all-key, the no repetition rate of coding 72.64%.8 yards of maximum code length, average keystroke 6.90 times.Its " is unit with the character " compressed, and the no repetition rate of coding is 89.38%, maximum 8 yards of code length, average keystroke 5.42 times.Encode by 1 yard increase gradually when its " residue " part, code length with the pass of the no repetition rate of coding is: 5 yards, and 50.58%; 6 yards, 71.21%; 7 yards, 82.97%; 8 yards, 89.38%.After " word/number " conversion, the no repetition rate of coding is constant.
8. (2+4) form of coding=(selecting)+(residue); " select " part and get 2 yards at most, " residue " part is got 4 yards at most, is called for short " 2+4 " form.Its " is unit with the character " compressed, and the no repetition rate of coding is 71.21%, maximum 6 yards of code length.
9. (2+3) form of coding=(selecting)+(residue); " select " part and get 2 yards at most, " residue " part is got 3 yards at most, is called for short " 2+3 " form.Its " is unit with the character " compressed, and the no repetition rate of coding is 50.58%, maximum 5 yards of code length, average keystroke 4.85 times.
10. the whole word stroke input of whole word stroke input coding as the special pattern (promptly selecting stroke and residue stroke pattern) of " Chinese character two minutes ", is the auxiliary pattern of Chinese character input.It according to stroke order reads the input coding of Chinese character with the letter or number code of Chinese-character writing stroke.Its 5+1 form, preceding 5 that get individual character add the most last 1, and basic coding is done compression and is handled, and the no repetition rate of coding is more than 56%, 6 yards of code lengths.Its 3+3 form, get individual character preceding 3 and 3 at end (input difficulty increase), basic coding is done compression and is handled, and the no repetition rate of coding reaches more than 72%, 6 yards of code lengths.Compare with traditional stroke input, increased the alphabetical pattern of stroke code.
(11) ` (unfamiliar word (or parts) stroke code)+(other code of unfamiliar word (or parts)) form; In the input Chinese character, other coding of unfamiliar word (or parts) is provided, look into usefulness for study.Such as, " Tou ", non-common word, pronunciation are " pou3 ", stroke coding is " sgsdgfag ", and it is pressed the stroke input, behind appearance " Tou " word, will show its pronunciation code " pou3 " at the end of coding.Whole being encoded to of " Tou " word " ` sgsdgfag-pou3 ".Restricted as code length, then shorten stroke coding, pronunciation code adopts the Two bors d's oeuveres pattern.Separate with symbol between stroke coding and the pronunciation code.Can add before the coding with symbol " ` ", to distinguish mutually with other coding.
(12) the big big stroke code of stroke code coding is to the further describing of stroke " commissure " feature, and can divert from one use to another in the place of adopting little stroke code, can improve unique determinacy of stroke coding.Can use for the professional.
3. the two screen prompt inputs that divide Chinese character of screen prompt input are concrete application of " Chinese character two minutes " technical characteristic.Chinese character is imported, and the form of " whole word input " was once arranged, and because of its word selection is difficult for, uses wideless.Two division input methods utilize " Chinese character two minutes ", with all Chinese characters of GB scope, are divided into two parts, and every part has only a hundreds of parts, can realize " two branch " input of Chinese character.The application of " screen prompt key " is a prior art.The screen prompt input of two minutes Chinese characters can have two kinds of forms.
1. (pronunciation)+(selecting)+(residue) form; Import the pronunciation code of Chinese character earlier, at this moment, on the prompting key of screen, with " selecting " parts of show candidate; Select " selecting " parts of candidate, and key in the code of this prompting key, in the ordinary course of things, just finished the input of 1 Chinese character; After keying in " selecting " parts, the Chinese character that needs does not appear in " prompt window ", at this moment, and at " residue " parts of pointing out on the key with show candidate; Select " residue " parts of candidate, and key in the code of this prompting key, just finished the input of 1 Chinese character.Be characterized in, simple, easily to learn, the input of most of Chinese characters is only finished with 3 keys.At the phrase input state, utilize the candidate's parts on the prompting key to distinguish with the sign indicating number phrase.
2. (select)+(residue) form; Import the pronunciation or the stroke code of " selecting " parts earlier, at this moment, on the prompting key of screen, with " residue " parts of show candidate; Select " residue " parts of candidate, and key in the code of this prompting key, just finished the input of 1 Chinese character.Part codes can be prepared 6 yards.Its characteristics remain, and are simple, easily learn, but 1. stroke increase than form.At the phrase input state, utilize the candidate's parts on the prompting key to distinguish with the sign indicating number phrase.
4. two-character word group coding Chinese has double-tone trend, and in common phrase, double word phrase quantity is bigger.The two-character word group coding is followed the general format that phrase is encoded: phrase coding=(pronunciation part)+(parts part).
(1) phonological and calligraphical syn thesize coding
1. phrase coding=(phrase pronunciation)+(1 feature got in individual character 1)+(or 1 feature got in individual character 2) form; With 16800 two-character word group codings.The letter pattern, with the basic coding compression, equal-length code can not realize that whole phrase codings have unique determinacy: the average keystroke of every phrase 3.98 times.
2. phrase coding=(phrase pronunciation) form; I.e. " sound sound " form: with 28600 two-character word group codings.The Character Style, 4 yards of maximum code length, the no repetition rate of coding 76%.Digit style, 4 yards of maximum code length, the no repetition rate of coding 24%.
(2) stroke font code coding
Here, with 28600 two-character word group codings.The no repetition rate of coding, digit style is identical with the Character Style.
1. phrase coding=(3 strokes got in individual character 1)+(3 strokes got in individual character 2): maximum 6 yards of all-key, can 11344 phrases of unique definite description, account for 39.66% of 28600 records; Also can make " 3+x " form, promptly individual character 1 is got 3 yards, and individual character 2 increases code length gradually since 1 yard, unfixed-length coding, and carry out the uniqueness screening, can realize that whole phrases do not have repeated code.
2. phrase coding=(4 strokes got in individual character 1)+(4 strokes got in individual character 2); Maximum 8 yards of all-key can 21935 phrases of unique definite description, account for 76.70% of 28600 records.Also can make " 4+x " form, promptly individual character 1 is got 4 yards, and individual character 2 increases code length gradually since 1 yard, unfixed-length coding, and carry out the uniqueness screening, can realize that whole phrases do not have repeated code.
(3) input of connection speech is an example with the double word phrase.Other phrase, by that analogy.
1. (basic syllable 1)+(tone+1)+(basic syllable 2)+(tone 2) pattern; Import the basic syllable of certain word earlier, as do not have this word, import the tone of this word, still do not have this word, the basic syllable of second word of input and the application of this couplet speech as also there not being this phrase, continues to import the tone of second word again; As still there not being this phrase, then " page turning " select speech input.
2. (basic syllable 1)+(basic syllable 2)+(tone 1)+(tone 2) pattern; Import the basic syllable of each Chinese character in the phrase earlier, import the tone of each Chinese character again.As do not have this phrase, then " page turning " select the speech input.
5. another kind of digital input format is: coding=(pronunciation code+pinyin character sequence number) (or+(shape justice feature code+pinyin character sequence number)).Key in the pronunciation part earlier, key in shape justice characteristic again.As sentence " nihk (you are good) ", key in the digital code " 6445 " of " nihk " earlier, key in the order sequence number " 2322 " of " nihk " each character on the button sign then, just obtain the numerical coding " 64452322 " of " nihk ".Shape justice characteristic decides what to use according to the linguistic context needs.Here " order sequence number " is meant that the positional alignment of character on a certain numerical key as numerical key " 1 ", represented character " ab ", and the sequence number of " a " is " 1 ", and the sequence number of " b " is " 2 ".
More than, multiple input coding form, particularly font code to existing Chinese character have carried out detailed narration, can conclude some useful promptings.1. can select a yard type (being coded format) for use according to the quantity of coding words, estimate the no repetition rate of coding roughly, help definite expression of Chinese character input.2. code length has correlativity with the no repetition rate of coding and sign indicating number position (can supply the number of coding).Code length is long more, and the no repetition rate of coding is high more, and the sign indicating number position that provides is many more.This correlativity can be changed in same sign indicating number type, has Practical significance.Utilize above prompting, the user can adopt self-service mode, the Hanzi inputing code table of design personalized.People's the vocabulary of commonly using is not quite similar, and quantity is quite limited; The words frequency of utilization varies with each individual: often the individual character that uses is few: the Hanzi inputing code table that needs to be fit to own individual character; Two division input methods provide the condition of self-service design for the user.
A kind of personalized realization of two division input method codings.1. use database table, constantly collect individual's the words of commonly using at any time, and carry out screening of words statistics and frequency of utilization ordering.2. commonly use the quantity of words according to the individual, the input pattern of liking is selected suitable sign indicating number type.3. utilize existing pair of division input method code table, import the coded data of words.4. the sign indicating number position that utilizes the sign indicating number type to provide, or " the uniqueness setting " of employing Database field, the word coding method that screening imports.5. " input method generator " that utilizes system to provide generates individual words code table.Specifically be exemplified below.In 3500, also be no more than 6000 a later period as your existing individual character of commonly using, commonly use vocabulary and have only several thousand, you can select the 2+3 form of font code for use.It can realize the no repeated code input of 3500 individual characters, 5 yards of maximum code length, average keystroke 4.85 times.It might not cover your individual character of commonly using, and you can carry out " unique determinacy " screening and " no repeated code is handled " of back." no repeated code is handled ", the most easy realization increases shape justice feature description exactly, is applied to special circumstances.Utilize database table, import the words data of font code 2+3 form, carry out " unique determinacy " screening, by the frequency of utilization ordering, derived data generates the code table text.Utilize " input method generator " of existing system, generate personalized input code table.Increased as your the words quantity of commonly using from now on, too high as duplication rate, you can increase the code fetch code length of parts 2, select 2+4 form, 2+5 form or 2+6 form for use, just can satisfy the demand.The method of input is with original the same, and the no repetition rate of coding is guaranteed, and average stroke increases at most 0.60 time.Reduced as your the words quantity of commonly using from now on, have more than is needed so much word coding methods also can be selected the 2+2 form for use conversely.
A kind of child's embodiment.Utilize computing machine, adopt two (pronunciation)+(Chinese character) forms that divide Chinese character, phonetic, character learning and information input are combined, for child before learning provides a kind of intelligence enlightening form.Illiterate child can carry out the computing machine input according to two letter and strokes that divide Chinese character; Under the voice and picture cues of computing machine, study phonetic and Chinese character.Existing two child who divides basis of Chinese character, can organize speech, make sentences and write divergent thinking training such as words under area of computer aided: can be on a large amount of bases of reading, the words composition is write in study.Along with the raising of character learning level, the shape of Chinese character justice feature description progressively rises to parts from stroke; All-key helps character learning, and brevity code helps input.The effect of two minutes Chinese characters allows the child just regard phonetic, Chinese character and information input as the same thing from little exactly, carries and several years ago grasps phonetic, character learning and information input technical ability.
(2) input of two minutes other patterns of Chinese character
Two minutes Chinese characters except existing Chinese character pattern, also have other multiple pattern.These patterns, the spelling character by expression formula reduces four kinds, full word symbol pattern, character and Chinese character (or parts) assemble pattern, Chinese character (or parts) assemble pattern and digital code pattern.Here, its input method is narrated.
1. the two full word symbol pattern inputs that divide Chinese character of full word symbol pattern can be adopted two kinds of methods.The one,, directly import with the ASCII character character, may be output as the full word symbol pattern of Chinese character, two minutes Chinese characters, or other pattern.The 2nd,, with code input (as double spelling code), can reduce stroke.As, phrase " hanzishuru ` you-zi-che " (Chinese character input), has been selected 3 shape Yi Tezheng for use here, and " again, son, car " adopts the input of full word symbol pattern, needs keystroke 21 times.The input of employing code such as with " several " code, only needs keystroke 4 times.The code input needs corresponding code table.
2. character and Chinese character (or parts) assemble pattern also has two kinds of input methods.The one,, character is imported with ASCII character; Chinese character or unit construction are imported with code.The 2nd,, all adopt the code input.The code input needs corresponding code table.As, phrase " fenfatuqiang ` goes all out to make the country strong " (going all out to make the country strong) adopts the code input, such as with " several " code, only needs keystroke 4 times.
3. this pattern of Chinese character (or parts) assemble pattern adopts the code input.Need corresponding code table.As, " ` sends out shellfish " (making a good deal of money), " ` wood order is very little again " (relatively) are adopted the code input, directly input " making a good deal of money ", and the Chinese character input code of " relatively " just can be realized pair input and output that divide Chinese character of this pattern.
4. coding=(pronunciation)+(each several part code)+(type matrix generated data) or this form of coding=(each several part code)+(type matrix generated data) form, directly import the pronunciation and the input each several part code of Chinese character, or only import the each several part code, import the type matrix generated data of Chinese character again, when realizing the Chinese character input, under application help, can also realize with the synthetic Chinese character font of parts type matrix, be output as the synthetic Chinese character pattern of type matrix, or Hanzi component horizontally-arranged pattern.
5. parts are imported each building block with a Chinese character continuously, import continuously with the coding of each parts self, realize the parts code table with lesser amt, are two parts horizontally-arranged patterns that divide Chinese character with the Chinese character input and output.This form can reduce the quantity of Chinese character font.
6. the numerical key code is directly keyed in the digital code input.Carefully do not state.
The input of the list separator of two minutes Chinese characters and the syllable-dividing mark of the Chinese phonetic alphabet.At the Chinese character input state, these symbols as the code element of input coding, be imported them as punctuation mark, can 1. switch to English input state input; Or 2. use prior art its input character is discerned, what distinguish input is code element or punctuation mark, and outfit provides automatically.
The use of two division input methods.The handling characteristics of two division input methods, be exactly, under a coding thinking, provide multiple application choice, do not increase the use difficulty.The application of its typical format, the same with Chinese phonetic alphabet input, just when repeated code occurring, increase shape justice feature description, import its code.Can phonetic, the person that only knows the Chinese character can select suitable font code for use.Only can the phonetic person, the person that fails to see the Chinese character can select suitable sound sign indicating number for use, and can assist and get patterns such as word, the application of connection speech with the connection speech.Illiterate person also can under voice suggestion, understand, learns and use two Chinese character and pair division input methods of dividing with the input of two minutes Chinese-character texts " according to sample ".Utilize " prompting gradually " and modes such as " Chinese character (parts) candidates ", can realize that non-" generally can recognize Chinese character " and unknown parts need not remember.Two division input methods utilize existing software engineering, can realize subsidiary functions such as " commonly using the words statistics ", " dynamic frequency adjustment ", " using in advance ".A kind of easy application of two minutes input codings, be exactly, generate oneself satisfied codes table file, join in the input method supervisory routine of existing operating system.
Three, the output of two minutes Chinese characters
Chinese character can be output as full word symbol pattern in two minutes, Chinese character, and type matrix synthesizes Chinese character, or other pattern of two minutes Chinese characters.Need to be equipped with corresponding input code table, matrix magazine.1. full word symbol pattern is output as standard A SCII code character, and matrix magazine can be accomplished minimum.2. Chinese character can keep the existing way of output constant, and its matrix magazine is constant; Also Chinese characters in common use (or custom field) can be adopted the existing way of output, all the other non-common Chinese characters are output as the synthetic Chinese character of type matrix, can reduce the quantity of matrix magazine.3. the synthetic Chinese character of type matrix is the Chinese character that adopts the parts type matrix synthetic, and font is compared with existing Chinese character, and its parts font has standardized feature, and matrix magazine can be done very for a short time.4. two other patterns that divide Chinese character.Parts horizontally-arranged pattern, the Chinese character part of matrix magazine can have only the type matrix of Hanzi component.The mixed pattern of character and Chinese character (or parts), matrix magazine can define as required.The various output patterns of two minutes Chinese characters can define as required, and are equipped with corresponding Chinese character input code table and type matrix.
Four, combined character
Existing Chinese character font is " pressing word code ", and promptly a type matrix made in a word, and type matrix quantity is big.The output of Chinese character (showing or printing) needs huge matrix magazine support.Chinese character quantity is big, and " word does not have fixed number ", can not (also can not) realize " pressing word code " to all Chinese characters.If carry out " pressing component coding ", promptly parts are made a type matrix, adopt the parts type matrix to synthesize Chinese character font, or adopt basic element of character type matrix to synthesize the complex component type matrix, will reduce the number of the quantity of type matrix, realize new coinage mould standardization and the personalization of type matrix style.This with the synthetic Chinese character font (or parts type matrix) of parts type matrix, be called combined character.Chinese character (or parts) with this combined character output just is called synthetic Chinese character (or compound component).
Combined character is different from " EUDC Editor " in the existing operating system.Seem that both can both generate type matrix, but on function, meaning, method and pattern, have essential distinction.Such as, on the matrix magazine capacity, one is the increase capacity, one is the minimizing capacity.Existing " EUDC Editor " 1. is not the parts type matrix with standard, or personalized parts type matrix, according to the generated data generation Chinese character font of definition; 2. can not realize in application program that Chinese character font is synthetic; 3. can not reduce the quantity of kanji code and type matrix; 4. all Chinese characters (comprise and newly make Chinese character) can not be shown as synthetic Chinese character pattern, or the synthetic Chinese character with personalized style; 5. can not improve existing output (showing or printing) mode; 6. but it can be used as research combined character aid.Below, with Chinese character dot matrix and two Chinese characters that divide, combined character is illustrated.
(1) preparation of parts type matrix.Preparation parts type matrix, the easiest way splits existing Chinese character font exactly.Chinese character is synthetic by parts.With Chinese character font, split by the parts composition, can obtain a series of parts dot patterns.With these dot patterns sort out, arrangement, make its figure maximization, the font standardization just can generate the parts type matrix of standard.Utilize two " Chinese character two minutes " rules of dividing Chinese character, can realize fractionation easily Chinese character font.In the GB scope, Chinese character font " two minutes " will generate about 2000 base part type matrixes.Wherein, become about 1400 classes of word type matrix, about 600 classes of non-word type matrix.Utilize " EUDC Editor " of the prior art, can realize the standardization of non-word type matrix easily, and, deposit font file in automatically with its coding.
(2) type matrix generated data.The generated data of type matrix comprises the 1. parts type matrix that combined character is required, and the 2. feature size of these parts type matrixes and 3. position coordinates.Its general expression formula can be written as: generated data=[parts
1..., parts
N]+[(height, wide)
1, (horizontal stroke, vertical)
1]
1+ ... + [(height, wide)
N, (horizontal stroke, vertical)
N]
NIn the formula, " [parts
1..., parts
N] ", represent the code of required parts; " (height, wide)
N", the size of expression parts type matrix; " (horizontal stroke, vertical)
N", the position of expression parts type matrix; " [(height, wide)
N, (horizontal stroke, vertical)
N]
N", represent the generated data of a certain parts; Can be with parts
1To parts
NGenerated data set, press the Hanzi structure classified description, or use coded representation.Included the generated data of Chinese character, can in existing dot pattern, measure factually.Do not include Chinese character and newly make the generated data of Chinese character, can 1. adopt existing " EUDC Editor ", with standardized characters model word, in dot pattern, measure generated data then earlier; 2. or directly definition component type matrix and generated data thereof.The feature size of parts type matrix and position coordinates can be described by sub-unit, also can be by the structure type of Chinese character, and classification is whole to be described.The former is called parts data; The latter is called structure type data (abbreviation structured data).The theoretical foundation of structured data is the stationarity of Chinese radical structure.Such as, left and right sides structure accounts for more than 60% of Chinese characters in common use, and its concrete pattern in " Chinese character two minutes ", generally is divided into 3 big classes.On feature size, about having plenty of half and half, different about having plenty of.For the individual character of determining, about two parts, position separately and size are fixed.With this 3 class left and right sides structure segmentation, 6 kinds of patterns can also be arranged.According to data, the structure type of Chinese character has tens kinds." Chinese character two minutes ", structure type (in the GB scope) has twenties kinds.These structure type standardization, digitizing, or mix the code of easy note, be used for representing the generated data of certain class Chinese character.The description of generated data, feature size and high wide difference can be used multiplying power (or number percent) office, as, be standard type matrix several times (or a few percents); Position coordinates can be used ratio (as number percent) office, as, the upper left corner of dot matrix is (0%, 0%), the lower right corner is (100%, 100%); Length also can be used coordinate representation.Generated data for example.As, " phase " word, left-right symmetric, the 1st class that belongs to left and right sides structure is (available
Expression); Comprise " wood " and " order " two parts, represent with two full word symbol patterns of Chinese character that divide; Component sizes, if the height and width of normal parts type matrix are defined as 100%, the height and width of these two parts can be defined as (100%, 50%); Component locations, with the upper left corner coordinate representation of parts type matrix, " wood " is defined as (0%, 0%), and " order " is defined as (50%, 0%); The generated data of " phase " word=[mu ' hjds, mu ' faggg]+[(100%, 50%), (0%, 0%)]+[(100%, 50%), (50%, 0%)]; Here " [(100%, 50%), (0%, 0%)]+[(100%, 50%), (50%, 0%)] " is a kind of structured data, can be with character and coded representation: as be expressed as
Or " 11 ", the generated data of " phase " word=[mu ' hjds, mu ' faggg]+[11].Generated data can be defined by the individual, generates personalized type matrix.
(3) generated data takes.The generated data of type matrix can directly be imported from keyboard, also can obtain by tabling look-up in the machine from the two kanji codes that divide of keyboard input.This " table ", the compositive relation of reflection Chinese character and parts comprises data such as contained parts, parts feature size, position coordinates, and coding in the machine of these parts.This " tabling look-up in the machine " can adopt existing compilation (or other) program to realize.In the prior art, Chinese character font is taken like this: with encode Chinese characters for computer, convert machine inner code to, the font search program takes out the corresponding Chinese character type matrix according to given ISN visit character library.Taking of type matrix generated data can be in the following way; 1. input " two minutes Chinese characters " is encoded → table look-up, and obtains the type matrix and the generated data of parts composition and generated data → ISN → each parts of taking-up.2. import " two sub-unit " coding → ISN → taking-up parts type matrix.3. a kind of simple and easy realization is directly measured from existing dot pattern.
(4) the synthetic General Principle of type matrix.1. obtain the parts dot array data.2. obtain the type matrix generated data.3. the dot pattern with each parts zooms to prescribed level.The parts figure need zoom to the high wide requirement of regulation.4. in " blank dot matrix " (being " 0 " entirely), with the position coordinates placement in accordance with regulations of the parts figure behind the convergent-divergent.The dot matrix code of each parts after 5. will placing in accordance with regulations carries out additive operation.6. if " carry " not appear in each line code, or " carry " meet the requirements, and illustrates that the mutual alignment is suitable.7. if " carry " of certain line code is undesirable, then the figure to associated components carries out the coordinate translation test, makes it meet " carry " requirement.Here " carry " is meant that code is the point of " 1 " in the dot matrix, overlap (addition), and overlapping appears in two parts that promptly are separated from each other.8. each parts figure is carried out superposition, realize that type matrix is synthetic.With the method for the synthetic complex component type matrix of basic element of character type matrix, with identical with the method for the synthetic Chinese character font of parts type matrix.Just increased the synthetic link of circulation.Above step can realize by the QBASIC program under non-Chinese environment.
(5) application of combined character.The application of combined character needs to be equipped with corresponding environment for use.1. utilize combined character, improve existing coinage mode, realize new coinage mould standardization, the individual uses (type matrix) personalization.Utilize existing " EUDC Editor ", the parts type matrix of code requirement and standardized generated data can generate the Chinese character font of standard; Adopt personalized parts type matrix and personalized generated data, can generate personalized Chinese character font, use for the individual.The code of new coinage mould, available two " (pronunciation)+(unit construction)+(structured data) " patterns of Chinese character that divide are represented.2. utilize combined character, in application program, realize specific function.Such as, in the QBASIC of English application program, utilize combined character, be implemented in the arbitrary position of screen, the synthetic also synthetic Chinese character of display definition size.Again such as, in specific application program, show encrypt file.3. utilize combined character, improve output (showing or the printing) mode of existing Chinese character.The real meaning of combined character is the way of output of improving existing Chinese character.But, the realization of this purpose.Need the support (not doing further narration herein) of corresponding Chinese character operating system.4. type matrix personalization.Select standardized component (or stroke) type matrix of a certain style of calligraphy for use, with the scheme structure French of Chinese character book skill, about hundred kinds, determine its mutual alignment and feature size, carry out superposition and synthesize, generate type matrix with characteristic feature.Book skill synthesized with type matrix combine, will change the adopting Chinese character form present situation of " everybody's one ".The implementation that it is easy is exactly to utilize a series of personalized type matrixes of " EUDC Editor " generation standby.5. the synthetic input of Chinese character.Utilize combined character, generate a kind of new Chinese character input form.With existing " encode Chinese characters for computer " input, be improved to " component coding " input.Directly import Chinese character basic character components and structure type from keyboard, the amalgamation Chinese character font is for showing and printing and use.Input coding can be used as character code, is used for the storage of text.Such as, need " benevolence " word of input, left and right sides structure, left small and right large, the structure type code be " 12 ", direct input " Ren ", " two " code and structured data " 12 " then then shows and prints to export and synthesize Chinese character " benevolence ".Its input coding can be write " ` rf-er-12 ", as the code of " benevolence " word, and is used for the text storage of " benevolence " word.The synthetic input of Chinese character needs a corresponding operating system, could satisfactory realization.In existing operating system, the synthetic input of Chinese character can be used in specific application program.6. divert from one use to another in existing plate making technology.With existing whole word type matrix, use combined character instead, the literal form is synthetic Chinese character pattern.
Chinese character font is that still " pressing word code " should be determined as required with " parts are synthetic "." by word code " can be combined with " parts are synthetic ".Chinese characters in common use (or commonly using word) adopt " press word code ", non-common Chinese character (or the non-word of commonly using) with newly make Chinese character, adopt " parts are synthetic ", will realization with limited " type matrix ", show and print all Chinese characters.Also can adopt " extreme usage ".The Chinese character font tens kinds of basic element of character type matrixes of only packing into.All combined character is all adopted in the demonstration and the printing of Chinese character.Combined character can generate temporarily, also can store with the back, and the individual is standby.Synthetic Chinese character can adopt two Chinese characters that divide as code, is convenient to text storage.Utilize combined character, on macroscopic view, can realize that the type matrix of all Chinese characters is represented; On concrete the use, only need to be equipped with a spot of individual and commonly use type matrix; To save social resources.The basic element of character of combined character can indicate on the key face of keyboard,
Five, the keyboard definition of two minutes Chinese characters and two division input methods
In the Chinese phonetic alphabet, simple or compound vowel of a Chinese syllable " ü ", when needs were write as character " ü ", available letter " v " replaced.The list separator of two minutes Chinese characters and the syllable-dividing mark of the Chinese phonetic alphabet can define respectively, also can unified Definition.It is defined as respectively: the list separator of two minutes Chinese characters, between pronunciation and shape Yi Tezheng, No. 41 key characters " ` " (the ASCII character value of character is 96) expression with the IBM QWERTY keyboard, between shape Yi Tezheng, represent with No. 12 key characters "-" (the ASCII character value of character is 45) of IBM QWERTY keyboard: or adopt other symbolic representation.The syllable-dividing mark of the Chinese phonetic alphabet is with No. 40 key characters " ` " (the ASCII character value of character is 39) or other character representation of IBM QWERTY keyboard.Its unified Definition is: the list separator of two minutes Chinese characters and the syllable-dividing mark of the Chinese phonetic alphabet, and unified for Chinese phonetic alphabet syllable-dividing mark, with No. 40 key characters " ` " (the ASCII character value of character is 39) or other character representation of IBM QWERTY keyboard.In numeric keypad, for reducing symbol definition, list separator and syllable-dividing mark unification are syllable-dividing mark, with numerical key " 0 " expression.The definition of Chinese punctuation mark, consistent with operating system.
1. the key position of QWERTY keyboard definition
The standard of primary standard keyboard is provided with constant.The definition of spelling code, consistent with original definition of QWERTY keyboard.Here, only narrate the definition of double spelling code and stroke code." XX key (XX) " is the key bit number of IBM QWERTY keyboard, is the ASCII character value of character in the bracket.
(1) the key position of double spelling code definition: No. 16 keys (81), represent initial consonant " q ", simple or compound vowel of a Chinese syllable " iu "; No. 17 keys (87) are represented letter " w ", simple or compound vowel of a Chinese syllable " ua, ia "; No. 18 keys (69) are represented simple or compound vowel of a Chinese syllable " e "; No. 19 keys (82) are represented initial consonant " r ", simple or compound vowel of a Chinese syllable " uan, van "; No. 20 keys (84) are represented initial consonant " t ", simple or compound vowel of a Chinese syllable " ve "; No. 21 keys (89) are represented letter " y ", simple or compound vowel of a Chinese syllable " uai, ing "; No. 22 keys (85) are represented initial consonant " sh ", simple or compound vowel of a Chinese syllable " u "; No. 23 keys (73) are represented initial consonant " ch ", simple or compound vowel of a Chinese syllable " i "; No. 24 keys (79) are represented simple or compound vowel of a Chinese syllable " o, uo "; No. 25 keys (80) are represented initial consonant " p ", simple or compound vowel of a Chinese syllable " un, vn "; No. 30 keys (65) are represented simple or compound vowel of a Chinese syllable " a "; No. 31 keys (83) are represented initial consonant " s ", simple or compound vowel of a Chinese syllable " iong, ong "; No. 32 keys (68) are represented initial consonant " d ", simple or compound vowel of a Chinese syllable " uang, iang "; No. 33 keys (70) are represented initial consonant " f ", simple or compound vowel of a Chinese syllable " en "; No. 34 keys (71) are represented initial consonant " g ", character " eng, ng "; No. 35 keys (72) are represented initial consonant " h ", simple or compound vowel of a Chinese syllable " ang ": No. 36 keys (74), represent initial consonant " j ", simple or compound vowel of a Chinese syllable " an "; No. 37 keys (75) are represented initial consonant " k ", simple or compound vowel of a Chinese syllable " ao ": No. 38 keys (76), represent initial consonant " l ", simple or compound vowel of a Chinese syllable " ai "; No. 44 keys (90) are represented initial consonant " z ", simple or compound vowel of a Chinese syllable " ei "; No. 45 keys (88) are represented initial consonant " x ", simple or compound vowel of a Chinese syllable " ie "; No. 46 keys (67) are represented initial consonant " c ", simple or compound vowel of a Chinese syllable " iao "; No. 47 keys (86) are represented initial consonant " zh ", simple or compound vowel of a Chinese syllable " ui, v "; No. 48 keys (66) are represented initial consonant " b ", simple or compound vowel of a Chinese syllable " ou "; No. 49 keys (78) are represented initial consonant " n ", simple or compound vowel of a Chinese syllable " in "; No. 50 keys (77) are represented initial consonant " m ", simple or compound vowel of a Chinese syllable " ian ";
(2) the key position of stroke code definition: No. 30 keys (65), character " A ", representative " folding "; No. 31 keys (83), character " S ", representative " right-falling stroke "; No. 32 keys (68), character " D ", representative " left-falling stroke "; No. 33 keys (70), character " F ", representative " erecting "; No. 34 keys (71), character " G ", representative " horizontal stroke "; No. 35 keys (72), character " H ", representative " horizontal fork "; No. 36 keys (74), character " J ", representative " perpendicular fork "; No. 37 keys (75), character " K ", representative " is cast aside fork "; No. 38 keys (76), character " L ", representative " is pressed down fork "; No. 50 keys (77), character " M ", representative " turning ".
The initial consonant that double spelling code refers to, simple or compound vowel of a Chinese syllable and letter, the pen type and the symbol of stroke code and sound insulation (and separation) symbol correspondence all indicate on the keycap of QWERTY keyboard, or sign are by keycaps.
2. the key position of numeric keypad definition
The key position definition of Chinese phonetic alphabet, existing national proposed standard.Here, be another kind of definition pattern.Between two kinds of patterns, can carry out " word/number " conversion by basic code table.
(1) Chinese phonetic alphabet:
Numerical key " 1 " is represented " a, the b " of phonetic alphabet; Numerical key " 2 " is represented " c, the d " of phonetic alphabet;
Numerical key " 3 " is represented " e, the f " of phonetic alphabet; Numerical key " 4 " is represented " g, h, the i " of phonetic alphabet;
Numerical key " 5 " is represented " j, k, the l " of phonetic alphabet; Numerical key " 6 " is represented " m, n, the o " of phonetic alphabet;
Numerical key " 7 " is represented " p, q, the r " of phonetic alphabet; Numerical key " 8 " is represented " s, t, the u " of phonetic alphabet;
Numerical key " 9 " is represented " v, w, the x " of phonetic alphabet; Numerical key " 0 " is represented " y, the z " of phonetic alphabet.
(2) double spelling code:
Numerical key " 1 " is represented the initial consonant " b " of double spelling code, simple or compound vowel of a Chinese syllable " a, ou ";
Numerical key " 2 " is represented the initial consonant " c, d " of double spelling code, simple or compound vowel of a Chinese syllable " iao, iang, uang ";
Numerical key " 3 " is represented the initial consonant " f " of double spelling code, simple or compound vowel of a Chinese syllable " e, en ";
Numerical key " 4 " is represented the initial consonant " g, h, ch " of double spelling code, character " eng, ng, ang, i ";
Numerical key " 5 " is represented the initial consonant " j, k, l " of double spelling code, simple or compound vowel of a Chinese syllable " an, ao, ai ";
Numerical key " 6 " is represented the initial consonant " m, n " of double spelling code, simple or compound vowel of a Chinese syllable " ian, in, o, uo ";
Numerical key " 7 " is represented the initial consonant " p, q, r " of double spelling code, simple or compound vowel of a Chinese syllable " un, vn, iu, uan, van ";
Numerical key " 8 " is represented the initial consonant " s, t, sh " of double spelling code, simple or compound vowel of a Chinese syllable " iong, ong, ve, u ";
Numerical key " 9 " is represented the initial consonant " zh, x " of double spelling code, letter " w ", simple or compound vowel of a Chinese syllable " ui, v, ia, ua, ie ";
Numerical key " 0 " is represented the initial consonant " z " of double spelling code, letter " y ", simple or compound vowel of a Chinese syllable " ing, uai, ei ".
(3) stroke code: numerical key " 1 ", representative " horizontal stroke "; Numerical key " 2 ", representative " horizontal fork "; Numerical key " 3 ", representative " erecting "; Numerical key " 4 ", representative " perpendicular fork "; Numerical key " 5 ", representative " left-falling stroke "; Numerical key " 6 ", representative " is cast aside fork "; Numerical key " 7 ", representative " right-falling stroke "; Numerical key " 8 ", representative " is pressed down fork "; Numerical key " 9 ", representative " folding "; Numerical key " 0 ", representative " turning ".
The initial consonant that digital code refers to, simple or compound vowel of a Chinese syllable and letter, the corresponding pen type and the symbol of stroke code and sound insulation (and separation) symbol, sign is on the keycap of keyboard, or sign is by keycap.
3. the digital code combination of big stroke code defines with the key position that the key position of corresponding letter defines big stroke code, and numeric keypad is narrated with QWERTY keyboard.Punctuation mark definition and systems compliant.Sound insulation (and separation) symbol definition, as described above.Five kinds " cast aside and press down folding " to basic stroke anyhow, uses number " 1,2,3,4,5 " expression respectively; " commissure " feature " solely, first, in, tail, friendship " five kinds of states, also use number " 1,2,3,4,5 " expression respectively.Stroke and feature are combined,, form the feature code of stroke as " solely horizontal ", " the perpendicular friendship " etc.Be described below.Stroke feature " solely horizontal " is " 11 " with numeral, is " G " with letter representation; Stroke feature " horizontal first " is ' 12 with numeral ", be " F " with letter representation; Stroke feature " in the horizontal stroke " is " 13 " with numeral, is " D " with letter representation; Stroke feature " horizontal tail " is " 14 " with numeral, is " S " with letter representation; Stroke feature " traversed by " is " 15 " with numeral, is " A " with letter representation; Stroke feature " solely perpendicular " is " 21 " with numeral, is " H " with letter representation; Stroke feature " perpendicular first " is " 22 " with numeral, is " J " with letter representation; Stroke feature " in perpendicular " is " 23 " with numeral, is " K " with letter representation; Stroke feature " perpendicular tail " is " 24 " with numeral, is " L " with letter representation; Stroke feature " the perpendicular friendship " is " 25 " with numeral, is " M " with letter representation; Stroke feature " is cast aside solely ", is " 31 " with numeral, is " T " with letter representation; Stroke feature " is cast aside first ", is " 32 " with numeral, is " R " with letter representation; Stroke feature " in the left-falling stroke " is " 33 " with numeral, is " E " with letter representation; Stroke feature " left-falling stroke tail " is " 34 " with numeral, is " W " with letter representation; Stroke feature " is cast aside and is handed over ", is " 35 " with numeral, is " Q " with letter representation; Stroke feature " is pressed down solely ", is " 41 " with numeral, is " Y " with letter representation; Stroke feature " is pressed down first ", is " 42 " with numeral, is " U " with letter representation; Stroke feature " in the right-falling stroke " is " 43 " with numeral, is " I " with letter representation; Stroke feature " right-falling stroke tail " is " 44 " with numeral, is " O " with letter representation; Stroke feature " is pressed down and is handed over ", is " 45 " with numeral, is " P " with letter representation; Stroke feature " folding solely " is " 51 " with numeral, is " N " with letter representation; Stroke feature " folding is first " is " 52 " with numeral, is " B " with letter representation; Stroke feature " compromise " is " 53 " with numeral, is " V " with letter representation; Stroke feature " folding tail " is " 54 " with numeral, is " C " with letter representation; Stroke feature " folding is handed over " is " 55 " with numeral, is " X " with letter representation.
Big stroke code, corresponding stroke and feature pen type can indicate and need not remember on the keycap of keyboard.
Claims (10)
1. one kind pair is divided Chinese character, Chinese character, the Chinese phonetic alphabet and input coding are combined together, belong to the reform of a writing system and Chinese character information technical field, it is characterized in that: (1) has pronunciation part and shape justice characteristic, is the combination of the Chinese phonetic alphabet and Chinese character (or its shape Yi Tezheng); (2) or only tangible adopted characteristic, be the combination of Hanzi component (or its shape Yi Tezheng); (3) the words spelling has determinacy; (4) adopt standard A SCII code character record and transmission Chinese; (5) or adopt standard A SCII code character with Chinese character (or its shape Yi Tezheng) record with transmit Chinese; (6) or with Hanzi component (or its shape Yi Tezheng) horizontally-arranged written record and transmission Chinese: (7) are put into practice infotech and the reform of a writing system and are combined.
2. two division input method, belong to the reform of a writing system and Chinese character information technical field, it is characterized in that: (1) input coding, directly read by two minutes Chinese characters: (2) or with " Chinese character two minutes ", every part represents with its pronunciation code, or with its stroke coded representation, by " coding=Chinese-character pronunciation+select part+remainder (the every definable choice in the right) " form extraction: (3) or with " type selecting as required; with the sign indicating number word selection ", self-service design; (4) input coding character representation, or use numeral; (5) be applicable to multiple keyboard.
3. combined character, Chinese character (or parts) is output as synthetic Chinese character (or compound component) pattern, belong to the reform of a writing system and Chinese character information technical field, it is characterized in that: the type matrix of Chinese character (or parts), by definition (1) parts generated data, or the structure generated data, (2) use the parts type matrix of standard, or synthesize with personalized parts type matrix.
4. two divide Chinese characters or with claim 1 is described with described pair of division input method of claim 2 or with the commercial publication (comprising its application in CD and related software) of the described combined character realization of claim 3.
5. with described two phonetic entry export technique and the products that divide Chinese character to realize of claim 1.
6. with described two information input and output technology and the products that divide Chinese character or realize of claim 1 with described pair of division input method of claim 2.
7. with described two digital input and output technology and the products that divide Chinese character or realize of claim 1 with described pair of division input method of claim 2.
8. Chinese character input and output technology and the product of realizing with the described Chinese character combined character of claim 3 (comprising its application in press).
9. with described two Chinese character input and output technology and the products that divide Hanzi component (or its shape Yi Tezheng) horizontally-arranged pattern to realize of claim 1.
10. two divide Chinese characters or with claim 1 is described with described pair of division input method of claim 2 or with the information security technology and the product of the described combined character realization of claim 3.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 02108826 CN1220127C (en) | 2001-08-29 | 2002-04-09 | 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN01108790.0 | 2001-08-29 | ||
CN01108790 | 2001-08-29 | ||
CN 02108826 CN1220127C (en) | 2001-08-29 | 2002-04-09 | 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1376969A true CN1376969A (en) | 2002-10-30 |
CN1220127C CN1220127C (en) | 2005-09-21 |
Family
ID=25740326
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 02108826 Expired - Fee Related CN1220127C (en) | 2001-08-29 | 2002-04-09 | 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1220127C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104238762A (en) * | 2013-06-09 | 2014-12-24 | 张家港市赫图阿拉信息技术有限公司 | Input method |
-
2002
- 2002-04-09 CN CN 02108826 patent/CN1220127C/en not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104238762A (en) * | 2013-06-09 | 2014-12-24 | 张家港市赫图阿拉信息技术有限公司 | Input method |
Also Published As
Publication number | Publication date |
---|---|
CN1220127C (en) | 2005-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1023916C (en) | Chinese keyboard entry technique with both simplified and original complex form of Chinese character root and its keyboard | |
CN103995600B (en) | A kind of braille Chinese character converter and its method | |
CN1342276A (en) | Keyboard input devices, methods and systems | |
CN102053719A (en) | Input method for Chinese characters | |
CN1220127C (en) | 'Dual-separation' Chinese characters, 'dual-separation' input method and combined characters | |
CN100337232C (en) | Braille-Chinese contrapositive editing/typesetting system and editing/typesetting method | |
CN1645356A (en) | Multiple dimensional Chinese studying systems | |
CN103777771B (en) | Easily prompt speed records serial input method | |
CN1499357A (en) | Method for lablling united character and word as well as character patterns and character picture | |
CN1129058C (en) | Chinese character phonetic code and keyboard design | |
CN1275732A (en) | Chinese character keyboard input system and applied technology thereof | |
CN85100087A (en) | " Chinese coded sound " scheme and its implementation | |
CN1108552C (en) | Perfecting method (PHF) for phoenticizing Chinese charaters | |
CN1058342C (en) | Chinese character byte codes and its keyboard of using the same | |
CN1455358A (en) | Chinese phonetic alphabet unified scheme, and single phonetic alphabet input and intelligent conversion translation | |
CN1023917C (en) | Method for treating Chinese characters | |
Nederhof | Automatic alignment of hieroglyphs and transliteration | |
CN1114146C (en) | Chinese morpheme code and its computer keyboard input | |
CN1063370A (en) | A kind of Roman character spelling of Chinese characters and suitable input equipment | |
CN1836226A (en) | Method and apparatus for converting characters of non-alphabetic languages | |
KR20000053095A (en) | Method for converting non-phonetic characters into surrogate words for inputting into a computer | |
CN1093654C (en) | Structure code Chinese character input method and universal keyboard used thereof | |
CN101763170A (en) | Holographic chinese character input method | |
CN1055434A (en) | The pixel input method of character and keyboard thereof | |
CN1379307A (en) | Chinese-character universal normalized holographic encode and high-speed input method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20050921 Termination date: 20100409 |