CN105204657A - Combined pinyin type main and auxiliary code Chinese character and word coding input method and keyboard thereof - Google Patents


Info

Publication number
CN105204657A
Authority
CN
China
Prior art keywords
chinese character
code
addressable part
coding
successively
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410288523.0A
Other languages
Chinese (zh)
Other versions
CN105204657B (en)
Inventor
黄振荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Application filed by Individual filed Critical Individual
Priority to CN201410288523.0A priority Critical patent/CN105204657B/en
Publication of CN105204657A publication Critical patent/CN105204657A/en
Application granted granted Critical
Publication of CN105204657B publication Critical patent/CN105204657B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention relates to a combined pinyin-type main and auxiliary code input method for Chinese characters and words. Following the GF3001 specification, 687 coding components are selected and merged into 409 coding component groups, and 31 of them are designated high-frequency coding components. The letter-type main code and auxiliary code 1 of each main-shape coding component are taken from the pinyin letters of its pronunciation or customary name, and the pinyin initial letters of the names of its first and second strokes serve as its letter-type auxiliary codes 2 and 3. Together these form the letter-type coding resources, which are then converted into the corresponding digit-type coding resources according to national standards. From these resources, letter-type and digit-type coding input methods are formed, both with and without the pinyin initial letter of the character, and they can be used in combination, so that characters the user can pronounce and characters the user cannot pronounce can both be input. When the GB18030 character set and a three-level word library are encoded, most codes have no more than 10 duplicates, and the method works well in practice.

Description

Combined pinyin-type main and auxiliary code Chinese character and word coding input method and keyboard thereof
Technical field
The invention relates to a method and a keyboard for the coded input of Chinese characters and words on computers or other devices that process Chinese character information.
Background
Published Chinese character coding input methods generally assign each component that takes part in coding only a single letter, as a sound code or a shape code. My earlier invention ZL03112606.5, a sound-shape-meaning Chinese character coding, introduced sound codes, shape codes and meaning-class codes. It yielded a computer input method with a low rate of duplicate codes in which a character can be input on sight, and it achieved good results. However, it requires the user to understand and memorize the meaning-class codes of more than 400 coding component groups, which is somewhat difficult and takes time.
Object of the invention
The object of the invention is to propose a pinyin-type Chinese character and word coding input method that improves on the sound-shape-meaning coding of ZL03112606.5, so that the method is easier for users to learn, the coding components are arranged more rationally, the rate of duplicate codes is kept lower, and characters and words can be input smoothly.
Summary of the invention
When a Chinese character can be split in several ways, one splitting scheme is determined by applying the following preferences in order: (1) take the split that yields the fewest coding components; (2) take the split in which the coding component with more strokes comes first or, under the alternative scheme, the split in which the coding component with fewer strokes comes first; (3) take the split whose coding component has the starting stroke that comes earlier in the stroke order.
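The three preferences can be read as a lexicographic comparison of candidate splits. The sketch below is a minimal illustration of that reading, not the patent's own procedure. It assumes that each coding component is given as a string of stroke letters (H, S, P, D, Z for horizontal, vertical, left-falling, dot and fold), that rule (2) compares the stroke count of the first component, and that rule (3) compares the first stroke of the first component. The function name `choose_split` and the sample data are hypothetical.

```python
from typing import List, Sequence

# Assumed stroke order: horizontal, vertical, left-falling, dot, fold.
STROKE_RANK = {"H": 0, "S": 1, "P": 2, "D": 3, "Z": 4}


def choose_split(candidates: Sequence[Sequence[str]],
                 more_strokes_first: bool = True) -> List[str]:
    """Pick one split from several candidates.

    Each candidate is a list of components; each component is a string of
    stroke letters (an assumption made for this sketch).
    """
    if not candidates:
        raise ValueError("no candidate splits")

    def preference(split: Sequence[str]):
        first = split[0]
        stroke_count = len(first)
        return (
            len(split),                                   # (1) fewest components
            -stroke_count if more_strokes_first else stroke_count,  # (2)
            STROKE_RANK[first[0]],                        # (3) earlier first stroke
        )

    return list(min(candidates, key=preference))


# Hypothetical candidates for one character.
candidates = [["HS", "PD"], ["H", "SPD"], ["HSP", "D", "Z"]]
print(choose_split(candidates))
print(choose_split(candidates, more_strokes_first=False))
```

The `more_strokes_first` flag switches between the two alternatives of preference (2); a real implementation would fix one of them for the whole code table.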
In addition to meeting the above splitting requirements, the invention provides three methods of splitting a Chinese character.
Scheme one, called the general splitting method: a Chinese character formed from two or more coding components can be divided into two parts, a head and a remainder. In a character with a top-bottom, left-right or enclosing structure, whenever the first coding component stands in an independent position, or the first and the last coding components are both independent coding components, the first coding component is taken as the head, and the coding components that remain after it is removed form the remainder. For example, in the character "base of a fruit", "Lv" is the head and "Supreme Being" is the remainder; in "Country", "mouth" is the head and "or" is the remainder. Further, when the first coding component and one or more other coding components lie on the same level, the last coding component is the remainder, and the combination of the coding components left after the remainder is removed is called a combined head; for example, in "worrying", "autumn" is the combined head. To reduce the rate of duplicate codes, certain combinations of components can also be defined as combined components and treated as a combined head when a character is split; for example, in "win", one part is treated as the combined head and "shellfish", the other part, is treated as the remainder. For characters with a structure that encloses toward the upper right, such as those whose radical is "Chuo" or "Yin", the part other than "Chuo" or "Yin" is treated as the combined head, and "Chuo" or "Yin" is the single remainder.
Scheme two, called the radical splitting method: the radical of a character is taken according to GF0011-2009 "Table of Chinese Character Radicals" and the radicals specified in dictionaries published before that specification, applying the radical-assignment rules of GF0012-2009 "GB13000.1 character set Chinese character radical assignment specification". (1) Take the radical from the left, upper or outer position of the character; if the left and right, upper and lower, or outer and inner parts are all radicals, take only the radical in the left, upper or outer position. (2) If the left or upper part of the character is not a radical but the right or lower part is, take the radical in the right or lower position; for a semi-enclosed character, if the outer part is not a radical but the inner part is, take the inner part. (3) If neither the left and right parts nor the upper and lower parts are radicals, take the radical from the position where it stands, in the order left before right and top before bottom. (4) For a character with a left-right, top-bottom, enclosing or other structure from which no radical can be taken at the above positions, take a single-stroke radical from the position of the character's first stroke. (5) If a radical with fewer strokes and a radical with more strokes overlap at the position where the radical is taken, take the radical with more strokes. The radical taken by these rules always serves as the head when the character is divided into head and remainder. In general, what is left after the head is removed is the remainder. However, for a character that consists of a single coding component and from which a single-stroke radical is taken, the whole coding component serves as the remainder, so that the coding component is kept intact. For example, "weight" is a character consisting of a single coding component; its head is "Pie" and its remainder is "weight" itself. A character consisting of a single coding component that is itself a radical is not split further. In this scheme, the codes of the remainder are still taken from each coding component in the stroke order of the character after the head is removed.
Scheme three, called the phonetic-component splitting method: most Chinese characters are phono-semantic characters, formed from a phonetic component and a semantic component. Characters that share a phonetic component form a character family, and a dictionary compiled by character family is called a phonetic-series dictionary. To divide a character into a phonetic component and a semantic component, "Guangyun Shengxi" ("Phonetic Series of the Guangyun") is taken as the source: the phonetic component it determines is taken as the phonetic part of the character, and what is left after the phonetic component is removed is regarded as the semantic component. If "Guangyun Shengxi" does not determine a phonetic component for a character but a radical can be clearly identified in it, the part left after the radical is removed is regarded as the phonetic component, and the radical is regarded as the semantic component. Certain combinations of coding components also form character families and are therefore also defined as phonetic components; what is left after such a combination is removed is the semantic component. A single coding component is not split further, and the whole component is regarded as the phonetic component. The phonetic component is taken as the first part of the split character and the semantic component as the second part.
The invention uses the letter keyboard or numeric keypad of a computer, or the soft or hard letter keyboard or numeric keypad of a mobile phone or other device that processes Chinese character information, to input Chinese characters and words by code. The implementation steps are as follows:
One, select the coding components
Chinese characters are split, and the coding components that take part in coding are determined, according to the State Language Commission specification GF3001-1997 "Chinese Character Component Standard of GB13000.1 Character Set for Information Processing".
The 560 basic components of GF3001-1997 are selected, together with the 201 main radicals and 100 attached-form radicals of GF0011-2009 "Table of Chinese Character Radicals". Chinese characters and character members that contain some of the non-character basic components among the 560 basic components are then selected: inferior, northern, hurriedly, Cao, spring, list, section, send out, Guan, Kamei, Tortoises, heptan, the last of the twelve Earthly Branches, Pot, China, also, also, with, violet, hold concurrently, can, Lou, exempt from, the fourth of the twelve Earthly Branches, south, capsule, agriculture, abandoned, Pull, its, wife, front, black, Ukraine, not, net, row, Jia, the legendary ruler of great antiquity, the first of the Three August Ones, with, system, 44 in total. For ease of memory, commonly used numeral characters and symbols are also made coding components: one, hundred, six, zero. After duplicates are removed, 687 components in total are selected as the basic coding units of this coding method and are called coding components. According to relations such as a shared character-formation motivation, slightly different written forms, abbreviation, positional variants, or traditional and simplified correspondence, they are merged into 409 coding component groups. The first coding component in each group is called the main-shape coding component. A coding component made up of several basic components, such as "Wind", is treated as a whole as one basic coding unit when coding. Provided the GF3001 specification is not violated, the number of coding components may be increased or decreased by up to 20 percent from these 687; this only slightly affects the rate of duplicate codes and does not change the essence of the coding input method.
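The grouping step can be pictured as a lookup from every coding component to the main-shape coding component of its group. The following is a minimal sketch under stated assumptions: the component names are placeholders rather than entries from the patent's tables, and the helper name `index_groups` is hypothetical.

```python
from typing import Dict, Iterable, Sequence


def index_groups(groups: Iterable[Sequence[str]]) -> Dict[str, str]:
    """Map every component to the main-shape component of its group.

    By convention in this sketch, the first member of each group is the
    main-shape coding component.
    """
    index: Dict[str, str] = {}
    for group in groups:
        if not group:
            raise ValueError("empty component group")
        main_shape = group[0]
        for component in group:
            if component in index:
                raise ValueError(f"{component!r} appears in more than one group")
            index[component] = main_shape
    return index


# Placeholder data: two groups, one with variant forms, one singleton.
groups = [["A1", "A2", "A3"], ["B1"]]
index = index_groups(groups)
print(index["A3"])   # main-shape component of the group containing A3
```

With the full data set, such an index would hold 687 keys and 409 distinct values, which gives a quick consistency check on a code table.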
Two, determine the high-frequency coding components, their main positions and their letter-type main codes
Among the five to six hundred components that form Chinese characters, some thirty-odd components, called radicals, have especially strong character-forming ability; by my count they take part in forming about 40 percent of all Chinese characters. Of these, the invention designates the 31 radicals with the strongest character-forming ability as high-frequency coding components. The coding components that remain among the 687 after the 31 high-frequency coding components are removed are called common coding components.
To reduce the rate of duplicate codes, each letter key is assigned only one high-frequency coding component, or one group of high-frequency coding components that are traditional and simplified forms of each other. The main position of each is also fixed: the main position is the position the high-frequency coding component usually occupies in the characters it forms. Also to reduce the rate of duplicate codes, 12 high-frequency coding components in 9 coding component groups, such as mountain, Rolling, si, Si, the moon, Ren, Mu, Nian, Yan, Yan, do not take the pinyin initial letter of their pronunciation or radical name as their main code; their main codes are assigned by convention instead. The shapes, letter-type main codes and main positions of the 31 high-frequency coding components are shown in Table one.
Table one:
The layout of the 31 high-frequency coding components on the letter keyboard is shown in Figure 1.
The selection of the above 31 high-frequency coding components and of their letter-type main codes may be changed within a range not exceeding 40 percent; this affects only the rate of duplicate codes and does not change the essence of the coding method.
Three, determine the pinyin-type letter-type main code and auxiliary codes 1, 2 and 3 of each coding component, forming the pinyin-type coding resources of the coding method
The 687 selected coding components are merged into 409 coding component groups. The first coding component in each group is called the main-shape coding component, and the main code of every other coding component in the group is the same as that of the main-shape coding component. Each main-shape coding component has a definite pronunciation or customary name. Apart from the high-frequency coding components, whose main codes have already been determined, a main-shape coding component generally takes the pinyin initial letter of its pronunciation or name as its pinyin-type letter-type main code. To reduce the rate of duplicate codes there is a refinement, called the dot-fold-I method: when the pinyin initial letter of the pronunciation of a main-shape coding component is Y, the component takes I as its letter-type main code if its first stroke is a dot (Dian) or a fold (Zhe), and takes Y if its first stroke is a horizontal (Heng), vertical (Shu) or left-falling (Pie) stroke. An alternative, called the dot-fold-Y method, reverses the assignment: a main-shape coding component whose pinyin initial letter is Y still takes Y if its first stroke is a dot (Dian) or a fold (Zhe), and takes I if its first stroke is a horizontal (Heng), vertical (Shu) or left-falling (Pie) stroke. The main-shape coding components of all other common coding components still take the pinyin initial letter of their pronunciation or name as their pinyin-type letter-type main code.
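The main-code rule for common coding components is small enough to state as a function. The sketch below is an illustration only: it assumes the pinyin is given without tone marks, that the first stroke is given as one of the letters H, S, P, D, Z, and it does not cover the 31 high-frequency coding components, whose main codes come from Table one. The function and scheme names are hypothetical.

```python
def main_code(pinyin: str, first_stroke: str, scheme: str = "dot_fold_I") -> str:
    """Letter-type main code of a common main-shape coding component.

    pinyin       : pronunciation or name in pinyin, without tone marks
    first_stroke : one of H, S, P, D, Z (horizontal, vertical,
                   left-falling, dot, fold)
    scheme       : "dot_fold_I" or "dot_fold_Y" (names invented here)
    """
    if not pinyin:
        raise ValueError("empty pinyin")
    if first_stroke not in ("H", "S", "P", "D", "Z"):
        raise ValueError(f"unknown stroke {first_stroke!r}")

    initial = pinyin[0].upper()
    if initial != "Y":
        return initial                      # ordinary case: pinyin initial letter

    dot_or_fold = first_stroke in ("D", "Z")
    if scheme == "dot_fold_I":
        return "I" if dot_or_fold else "Y"
    if scheme == "dot_fold_Y":
        return "Y" if dot_or_fold else "I"
    raise ValueError(f"unknown scheme {scheme!r}")


print(main_code("yi", "D"))                 # Y-initial, first stroke is a dot
print(main_code("yan", "P", "dot_fold_Y"))  # same component class, other scheme
```

Splitting the Y-initial components across two keys is what lowers the number of duplicates; which half moves to I is a free choice that must be fixed once per code table.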
Determination of the pinyin-type letter-type auxiliary code 1 of common and high-frequency coding components: there are two schemes. Scheme one, called the general method: when the initial of the pinyin of the pronunciation or name of the main-shape coding component is a two-letter initial such as zh, ch or sh, the third letter of the pinyin is taken as the letter-type auxiliary code 1; in all other cases the second letter of the pinyin is taken. Scheme two, called the preferred method: the third letter of the pinyin is taken as the letter-type auxiliary code 1 when the initial is j, q or x and the final is a compound final beginning with i, or when the initial is a two-letter initial such as zh, ch or sh, or when the first letter of the pinyin is y, the second letter is i and the final is a nasal final; in all other cases the second letter of the pinyin is taken. Every coding component in a group has the same letter-type auxiliary code 1 as the main-shape coding component of the group.
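Both schemes for auxiliary code 1 reduce to choosing between the second and the third letter of the pinyin. The sketch below shows one possible reading. Two points are assumptions of this sketch and not statements of the patent: a final that begins with i and has more than one letter is treated as a "compound final beginning with i", and a pinyin that begins with yi and ends in n or ng is treated as "y, i and a nasal final". Pinyin too short to supply the requested letter is rejected, since the text does not say how to handle it.

```python
DIGRAPH_INITIALS = ("zh", "ch", "sh")


def _letter(pinyin: str, index: int) -> str:
    # Return the letter at a zero-based index, in upper case.
    if index >= len(pinyin):
        raise ValueError(f"pinyin {pinyin!r} is too short for this rule")
    return pinyin[index].upper()


def aux1_general(pinyin: str) -> str:
    """General method: third letter after zh/ch/sh, otherwise second letter."""
    p = pinyin.lower()
    return _letter(p, 2 if p.startswith(DIGRAPH_INITIALS) else 1)


def aux1_preferred(pinyin: str) -> str:
    """Preferred method, under the assumptions stated in the lead-in."""
    p = pinyin.lower()
    final = p[1:]
    jqx_compound_i = (p[:1] in ("j", "q", "x")
                      and final.startswith("i") and len(final) > 1)
    yi_nasal = p.startswith("yi") and p.endswith(("n", "ng"))
    use_third = p.startswith(DIGRAPH_INITIALS) or jqx_compound_i or yi_nasal
    return _letter(p, 2 if use_third else 1)


for name in ("jia", "ji", "yin", "shan", "mu"):
    print(name, aux1_general(name), aux1_preferred(name))
```

The preferred method skips letters that carry little information (the i after j, q, x or y), which spreads auxiliary code 1 over more letters than the general method does.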
Determination of the pinyin-type letter-type auxiliary codes 2 and 3 of common and high-frequency coding components: by national standard, Chinese characters are formed from five kinds of strokes, horizontal (Heng), vertical (Shu), left-falling (Pie), dot (Dian) and fold (Zhe). The invention represents them by the pinyin initial letters of their names, H, S, P, D and Z respectively. Each coding component takes the letter codes of its first and second strokes as its letter-type auxiliary code 2 and auxiliary code 3. For a coding component with fewer than 2 strokes, the missing second stroke may be represented by any chosen letter; the invention uses the letter V. Alternatively, the letter-type auxiliary code 3 of such a coding component is left empty.
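Auxiliary codes 2 and 3 follow directly from the stroke sequence of a coding component. As a minimal sketch, assuming the strokes are supplied as a string over H, S, P, D, Z (the function name `aux23` is hypothetical):

```python
STROKE_LETTERS = frozenset("HSPDZ")


def aux23(strokes: str, pad: str = "V"):
    """Return (auxiliary code 2, auxiliary code 3) for a component.

    strokes : stroke sequence of the component as letters H, S, P, D, Z
    pad     : letter used when the second stroke is missing; pass "" for
              the alternative scheme in which auxiliary code 3 is left empty
    """
    if not strokes or any(s not in STROKE_LETTERS for s in strokes):
        raise ValueError(f"invalid stroke sequence {strokes!r}")
    aux2 = strokes[0]
    aux3 = strokes[1] if len(strokes) > 1 else pad
    return aux2, aux3


print(aux23("HS"))          # two or more strokes: first and second stroke
print(aux23("Z"))           # single stroke, padded with V
print(aux23("Z", pad=""))   # single stroke, auxiliary code 3 left empty
```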
The letter-type main code and auxiliary codes 1, 2 and 3 of the 687 coding components in the 409 coding component groups, determined by the above methods, are listed in order in Table two. In Table two, auxiliary code 1 is determined by the preferred method; coding components with fewer than 2 strokes are padded with the letter V; and main-shape coding components whose pronunciation has the pinyin initial letter Y use the dot-fold-I method. The pinyin-type main and auxiliary codes in Table two are printed in capital letters for legibility; capitals are equivalent to lowercase letters, and the actual code table uses lowercase letters.
Table two:
Four, convert to obtain the pinyin-type digit-type main code and auxiliary codes 1, 2 and 3 of each coding component, forming the pinyin-type digit-type coding resources
There are two schemes for converting the letter-type coding resources into digit-type coding resources.
Scheme one, called the letter-stroke split conversion scheme: following national standard GB/T 18031-2000 "Information technology: general requirements for Chinese character input with digital keyboards", the five stroke classes that appear in the pinyin-type letter-type auxiliary codes 2 and 3 are converted to digit codes according to the standard's "key assignment of Chinese character strokes", and not according to the digits that correspond to the pinyin initial letters of the stroke names. All other letters, that is, the letter-type main codes, auxiliary code 1 and the pinyin initial letters, are converted according to the letter-to-digit correspondences of the standard's "10-key pinyin letter key assignment" and "8-key pinyin letter key assignment". This yields, respectively, the 10-key and the 8-key pinyin-type digit-type main code, auxiliary codes 1, 2 and 3, and the digit codes of the pinyin initial letters. For the high-frequency coding components, the converted digit-type main and auxiliary codes are then adjusted slightly so that the digit combination of main code and auxiliary code 1 differs from one high-frequency coding component to another within the same type; the details are shown in Table six.
Scheme two, called the all-letter conversion scheme: it differs from scheme one in that, for letter codes converted under the "8-key pinyin letter key assignment", the pinyin initial letter of the left-falling stroke "Pie" is converted to the digit 1, and everything else is unchanged, with strokes still converted in the form of the pinyin initial letter of the stroke name as the standard prescribes. That is, the letter-type main code and auxiliary codes 1, 2 and 3 set above for each coding component are converted into the corresponding 10-key and 8-key pinyin-type digit-type main code and auxiliary codes 1, 2 and 3 solely by the letter-to-digit correspondences of the "10-key pinyin letter key assignment" and "8-key pinyin letter key assignment" of GB/T 18031. For the high-frequency coding components, the converted digit-type codes are again adjusted slightly so that the digit combination of main code and auxiliary code 1 differs from one high-frequency coding component to another within the same type; the details are shown in Table seven.
The "key assignment of Chinese character strokes" is shown in Table three:
Table three:
The "10-key pinyin letter key assignment" is shown in Table four:
Table four:
The "8-key pinyin letter key assignment" is shown in Table five:
Table five:
Under the letter-stroke split conversion scheme, the pinyin-type digit-type main code and auxiliary codes 1, 2 and 3 of the high-frequency coding components are expressed in order by the corresponding digit keys of the numeric keypad. The digit-type main and auxiliary codes of the 31 high-frequency coding components are shown in Table six.
Table six:
Under the all-letter conversion scheme, the pinyin-type digit-type main code and auxiliary codes 1, 2 and 3 of the high-frequency coding components are likewise expressed in order by the corresponding digit keys of the numeric keypad. The digit-type main and auxiliary codes of the 31 high-frequency coding components are shown in Table seven.
Table seven:
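The difference between the two conversion schemes of step Four can be sketched in code. Tables three to five are not reproduced in this text, so the key assignments below are placeholders chosen only so that the example runs: `ASSUMED_STROKE_KEYS` and `ASSUMED_LETTER_KEYS` (a common telephone-keypad letter layout) are assumptions of this sketch and must be replaced by the actual GB/T 18031 assignments. The adjustment applied to high-frequency coding components (Tables six and seven) is not modelled, and all function names are hypothetical.

```python
from typing import Dict, Optional, Tuple

# PLACEHOLDER key assignments (assumptions, not the patent's Tables 3 to 5).
ASSUMED_STROKE_KEYS: Dict[str, str] = {"H": "1", "S": "2", "P": "3", "D": "4", "Z": "5"}
ASSUMED_LETTER_KEYS: Dict[str, str] = {
    letter: digit
    for digit, letters in {
        "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
        "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
    }.items()
    for letter in letters
}

Codes = Tuple[str, str, str, str]  # (main code, aux 1, aux 2, aux 3)


def convert_split(codes: Codes, letter_keys: Dict[str, str],
                  stroke_keys: Dict[str, str]) -> Codes:
    """Letter-stroke split scheme: aux 2 and aux 3 use the stroke keys."""
    main, aux1, aux2, aux3 = codes

    def stroke(ch: str) -> str:
        # A padding letter such as V is not a stroke, so it falls back
        # to the letter assignment.
        return stroke_keys.get(ch, letter_keys[ch])

    return (letter_keys[main], letter_keys[aux1], stroke(aux2), stroke(aux3))


def convert_all_letter(codes: Codes, letter_keys: Dict[str, str],
                       pie_digit: Optional[str] = None) -> Codes:
    """All-letter scheme: every code uses the letter keys.

    pie_digit, when given, overrides the digit for the stroke letter P,
    as the text describes for the 8-key assignment (digit 1).
    """
    main, aux1, aux2, aux3 = codes

    def stroke(ch: str) -> str:
        if ch == "P" and pie_digit is not None:
            return pie_digit
        return letter_keys[ch]

    return (letter_keys[main], letter_keys[aux1], stroke(aux2), stroke(aux3))


codes = ("M", "U", "P", "D")
print(convert_split(codes, ASSUMED_LETTER_KEYS, ASSUMED_STROKE_KEYS))
print(convert_all_letter(codes, ASSUMED_LETTER_KEYS, pie_digit="1"))
```

Passing the key assignments as parameters keeps the conversion logic independent of which of the two letter-to-digit correspondences (10-key or 8-key) is chosen.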
Five, the combined pinyin-type main and auxiliary code Chinese character and word coding input method
Before coding, several choices must be fixed. The splitting scheme is determined first: the pinyin-type radical main and auxiliary code letter-type character coding method uses the radical splitting method of dictionaries, and all other character and word coding methods use the general splitting method; between taking the split in which the component with more strokes comes first and taking the split in which the component with fewer strokes comes first, one is chosen. Of the two schemes for converting letter codes to digit codes, the letter-stroke split conversion scheme and the all-letter conversion scheme, one is chosen. Of the two letter-to-digit correspondences, the "10-key pinyin letter key assignment" and the "8-key pinyin letter key assignment", one is chosen.
The combined pinyin-type main and auxiliary code Chinese character and word coding input method consists of two parts: the combined pinyin-type main and auxiliary code letter-type Chinese character and word coding input method, and the combined pinyin-type main and auxiliary code digit-type Chinese character and word coding input method.
Part one: the combined pinyin-type main and auxiliary code letter-type Chinese character and word coding input method
Using the pinyin-type letter-type coding resources, the combined pinyin-type main and auxiliary code letter-type Chinese character and word coding input method is formed from the following methods: 1. the pinyin-type pinyin-initial main and auxiliary code letter-type Chinese character coding method; 2. the pinyin-type pinyin-initial main and auxiliary code letter-type word and phrase coding method; 3. the pinyin-type non-pinyin-initial main and auxiliary code letter-type Chinese character coding method; 4. the pinyin-type non-pinyin-initial main and auxiliary code word and phrase coding method; 5. the pinyin-type main and auxiliary code letter-type radical Chinese character coding method; 6. the pinyin-type main and auxiliary code letter-type phonetic-series Chinese character coding method.
Combined coding input can be carried out for the 6763 commonly used Chinese characters of GB2312, for common words and phrases (or a large Chinese word collection), for the more than 27,000 Chinese characters of GB18030 or GB13000, including the character set in use in Taiwan, China, the Japanese kanji set or the Korean hanja set, and for large character sets of tens of thousands or even more than 100,000 Chinese characters, in the following way. For commonly used Chinese characters, for example the level-1 character library, or the level-1 and level-2 character libraries, of the 6763 Chinese characters of GB2312, together with the common word library, the pinyin-type pinyin-initial letter-type Chinese character coding method and the pinyin-type pinyin-initial letter-type word coding input method are adopted. For the more than 27,000 Chinese characters of GB18030-2000 or GB13000, including the character set in use in Taiwan, China, or the Japanese kanji set, codes are obtained with the pinyin-type non-pinyin-initial main and auxiliary code letter-type Chinese character coding method, or with the pinyin-type main and auxiliary code letter-type radical Chinese character coding method, or with the pinyin-type main and auxiliary code letter-type phonetic-series Chinese character coding method; common words and phrases can also be input with the pinyin-type non-pinyin-initial letter-type word and phrase coding method.
The codes of the pinyin-initial letter-type Chinese character coding method, the codes of the pinyin-initial letter-type word and phrase coding method, and the codes of the non-pinyin-initial letter-type Chinese character coding method, or of the radical Chinese character coding method, or of the phonetic-series Chinese character coding method, can be combined in the same code table, or listed in different code tables and called by switching. The pinyin-initial letter-type character and word codes and the non-pinyin-initial letter-type character and word codes can likewise be combined in the same code table, or listed in two code tables and called by switching. The radical Chinese character coding method can also be listed separately for use as an electronic radical dictionary, in which entries with the same radical are gathered together; the phonetic-series Chinese character coding method can also be listed separately for use as an electronic phonetic-series dictionary, in which entries with the same phonetic component are gathered together.
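The choice between one combined code table and several tables called by switching can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the names `merge_tables` and `SwitchableTables`, the dictionary layout, and the placeholder codes and entries, none of which are taken from the patent's actual code tables.

```python
from collections import defaultdict
from typing import Dict, List

# Hypothetical layout: a code table maps a code string to its candidate entries.
CodeTable = Dict[str, List[str]]


def merge_tables(*tables: CodeTable) -> CodeTable:
    """Combine several code tables into one; duplicate codes keep all candidates."""
    merged = defaultdict(list)
    for table in tables:
        for code, entries in table.items():
            for entry in entries:
                if entry not in merged[code]:
                    merged[code].append(entry)
    return dict(merged)


class SwitchableTables:
    """Keep the tables separate and look codes up in the currently active one."""

    def __init__(self, tables: List[CodeTable]) -> None:
        self.tables = list(tables)
        self.active = 0

    def switch(self) -> None:
        # Cycle to the next table, as a stand-in for a user-operated switch.
        self.active = (self.active + 1) % len(self.tables)

    def lookup(self, code: str) -> List[str]:
        return self.tables[self.active].get(code, [])


# Placeholder data only; real tables would hold characters and words.
initial_table = {"ab": ["entry-1"], "cd": ["entry-2"]}
non_initial_table = {"ab": ["entry-3"], "ef": ["entry-4"]}
```

In the merged table a code shared by both methods yields several candidates, which corresponds to the duplicate-code selection described in the text; with separate tables the same code is resolved only within the active table.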
The pinyin-type letter-type coding methods for Chinese characters and words are as follows:
In the following description, this convention applies: taking 1 code from a coding component means taking its letter-type main code; taking 2 codes means taking its letter-type main code and auxiliary code 1 in order; taking 3 codes means taking its letter-type main code, auxiliary code 1 and auxiliary code 2 in order; taking 4 codes means taking its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 in order.
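The code-taking convention above amounts to reading a prefix of a four-symbol sequence. The following Python sketch shows this under stated assumptions: the `Component` type and the `take_codes` function are hypothetical names, and the letters in `demo` are placeholders, not codes from the patent's coding resources.

```python
from typing import NamedTuple


class Component(NamedTuple):
    """One coding component: a main code followed by auxiliary codes 1 to 3."""
    main: str
    aux1: str
    aux2: str
    aux3: str


def take_codes(component: Component, n: int) -> str:
    """Take n codes: main code first, then auxiliary codes 1, 2, 3 in order."""
    if not 1 <= n <= 4:
        raise ValueError("n must be between 1 and 4")
    return "".join(component[:n])


# Placeholder letters only.
demo = Component(main="m", aux1="u", aux2="h", aux3="p")
```

For example, `take_codes(demo, 3)` joins the main code, auxiliary code 1 and auxiliary code 2 of the placeholder component.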
(1) The pinyin-type pinyin-initial main and auxiliary code letter-type Chinese character coding method
A, The code length is variable. When character codes share a code table with word codes of greater code length and the code of a Chinese character does not reach the maximum code length, the code can be terminated with the end key, or the character can be chosen with the numbered selection keys shown in the display box; if duplicate codes remain after the end key is pressed, the selection keys are used to choose. Where several encoding schemes exist, one of them is selected unless otherwise specified. These two points also apply to each of the methods below. The pinyin-type letter-type coding resources are adopted;
B, Coding of a Chinese character that is a single coding component:
If the pinyin initial of the Chinese character is the same as the letter-type main code of the coding component: for a high-frequency coding component, the letter-type main code and auxiliary code 1 of the coding component are taken in order; for an ordinary coding component there are two encoding schemes: scheme 1, referred to as the three-code method, takes the letter-type main code, auxiliary code 1 and auxiliary code 2 of the coding component in order; scheme 2, referred to as the four-code method, takes the letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the coding component in order;
If the pinyin initial of the Chinese character differs from the letter-type main code of the coding component and the coding component is a high-frequency coding component, there are five encoding schemes: scheme 1, referred to as the two-code method, takes in order the pinyin initial of the character and the letter-type main code of the high-frequency coding component; scheme 2, referred to as the three-code method, takes in order the pinyin initial of the character and 2 codes of the high-frequency coding component; scheme 3, referred to as the alternative three-code method, takes in order the pinyin initial of the character and auxiliary code 1 and auxiliary code 2 of the high-frequency coding component; scheme 4, referred to as the four-code method, takes in order the pinyin initial of the character and 3 codes of the coding component; scheme 5, referred to as the alternative four-code method, takes in order the pinyin initial of the character and auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the coding component;
If the pinyin initial of the Chinese character differs from the letter-type main code of the coding component, the coding component is an ordinary coding component, and its letter-type main code is not I, there are five encoding schemes: scheme 1, referred to as the auxiliary-two method, takes in order the pinyin initial of the character and the letter-type main code, auxiliary code 1 and auxiliary code 2 of the coding component; scheme 2, referred to as the auxiliary-three method, takes in order the pinyin initial of the character and the letter-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the coding component; scheme 3, referred to as the three-code method, takes in order the pinyin initial of the character and the letter-type auxiliary code 1 and auxiliary code 2 of the coding component; scheme 4, referred to as the four-code method, takes in order the pinyin initial of the character and 3 codes of the coding component; scheme 5, referred to as the replacement four-code method, takes in order the pinyin initial of the character and the letter-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the coding component;
If the pinyin initial of the Chinese character is y and the letter-type main code of the coding component is I, there are three encoding schemes: scheme 1, referred to as the three-code method, takes in order the pinyin initial of the character and auxiliary code 1 and auxiliary code 2 of the coding component; scheme 2, referred to as the four-code method, takes in order the pinyin initial of the character and the letter-type main code, auxiliary code 1 and auxiliary code 2 of the coding component; scheme 3, referred to as the alternative four-code method, takes in order the pinyin initial of the character and the letter-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the coding component;
C, A Chinese character composed of 2 or more coding components can be divided into two parts, the stem and the remaining part;
D, For a Chinese character composed of 2 or more coding components, the letter-type code is determined from the following two parts in order:
Part 1: the initial letter of the pinyin of the Chinese character is taken as the letter-type code of part 1;
Part 2: the codes of the stem and the remaining part of the Chinese character are taken by the following methods:
For a Chinese character composed of 2 coding components: if the stem is a high-frequency coding component in the principal-part position, there are two encoding schemes: scheme 1, referred to as the three-code method, takes in order 1 code from the coding component of the stem and 2 codes from the coding component of the remaining part; scheme 2, referred to as the two-code method, takes in order 1 code from the coding component of the stem and 1 code from the coding component of the remaining part. If the coding component of the stem is an ordinary coding component, there are two encoding schemes: scheme 1, referred to as the one-two method, takes in order 1 code from the coding component of the stem and 2 codes from the coding component of the remaining part; scheme 2, referred to as the two-one method, takes in order 2 codes from the coding component of the stem and 1 code from the coding component of the remaining part;
For a Chinese character composed of 3 or more coding components: if the character has a single stem and the stem is a high-frequency coding component in the principal-part position, 1 code each is taken in order from the 1st, 2nd and last coding components. If the character has a single stem and the stem is an ordinary coding component, there are two encoding schemes: scheme 1, referred to as the first-two method, takes in order 2 codes from the 1st coding component and 1 code from the last coding component; scheme 2, referred to as the first-one method, takes in order 1 code each from the 1st, 2nd and last coding components. If the stem is a compound stem, 2 codes are taken from the compound stem, with two code-taking schemes: scheme 1, referred to as the first-last method, takes in order 1 code each from the 1st and last coding components of the compound stem as the code of the stem; scheme 2, referred to as the first-second method, takes in order 1 code each from the 1st and 2nd coding components of the compound stem as the code of the stem. In a Chinese character whose stem is a compound stem, the remaining part is a single coding component, from which 1 code is taken;
The letter-type codes taken for part 1 and part 2 above are combined in order to form the code of the whole Chinese character;
E, When coding, either English lowercase letters or English uppercase letters are used;
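The two-part procedure for characters with 2 or more coding components can be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the patent's implementation: each coding component is represented as a hypothetical four-letter string (main code followed by auxiliary codes 1 to 3), the function name `encode_character` is invented here, and wherever the text offers several schemes the sketch fixes one of them (1 code plus 2 codes for two components; 2 codes from the 1st component plus 1 code from the last for an ordinary single stem; the 1st and last stem components for a compound stem).

```python
from typing import List


def encode_character(initial: str, stem: List[str], rest: List[str],
                     stem_high_freq: bool = False) -> str:
    """Part 1 is the pinyin initial; part 2 is taken from the stem and remaining part.

    stem and rest hold four-letter placeholder strings, one per coding component.
    A stem with more than one component stands for a compound stem.
    """
    components = stem + rest
    if len(components) < 2:
        raise ValueError("this sketch covers characters with 2 or more components")
    if len(stem) > 1:
        if len(rest) != 1:
            raise ValueError("a compound stem is followed by a single component")
        # Compound stem: 1 code each from its 1st and last components, then 1 code.
        body = stem[0][:1] + stem[-1][:1] + rest[0][:1]
    elif len(components) == 2:
        # Two components: 1 code from the stem, 2 codes from the remaining part.
        body = stem[0][:1] + rest[0][:2]
    elif stem_high_freq:
        # High-frequency single stem: 1 code each from the 1st, 2nd, last component.
        body = components[0][:1] + components[1][:1] + components[-1][:1]
    else:
        # Ordinary single stem: 2 codes from the 1st component, 1 from the last.
        body = components[0][:2] + components[-1][:1]
    return initial + body
```

With the placeholder components `"abcd"`, `"efgh"` and `"ijkl"` and the placeholder initial `"z"`, a two-component character gives `"zaef"` and a three-component character with an ordinary single stem gives `"zabi"`.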
(2) The pinyin-type non-pinyin-initial main and auxiliary code letter-type Chinese character coding method
A, The code length is variable; the pinyin-type letter-type coding resources are adopted;
B, For a Chinese character that is a single coding component: if the coding component is a high-frequency coding component, its letter-type main code and letter-type auxiliary code 1 are taken in order; if it is an ordinary coding component, there are two encoding schemes: scheme 1, referred to as the three-code method, takes its letter-type main code, auxiliary code 1 and auxiliary code 2 in order; scheme 2, referred to as the four-code method, takes its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 in order;
C, A Chinese character composed of 2 or more coding components can be divided into two parts, the stem and the remaining part;
D, For a Chinese character composed of 2 coding components: if the stem is a high-frequency coding component in the principal-part position, there are two encoding schemes: scheme 1, referred to as the three-code method, takes in order 1 code from the coding component of the stem and 2 codes from the coding component of the remaining part; scheme 2, referred to as the four-code method, takes in order 1 code from the coding component of the stem and 3 codes from the coding component of the remaining part. If the coding component of the stem is an ordinary coding component, there are two encoding schemes: scheme 1, referred to as the two-two method, takes in order 2 codes from the coding component of the stem and 2 codes from the coding component of the remaining part; scheme 2, referred to as the one-three method, takes in order 1 code from the coding component of the stem and 3 codes from the coding component of the remaining part;
E, For a Chinese character composed of 3 coding components: if the stem is a single stem and is a high-frequency coding component in the principal-part position, there are two encoding schemes: scheme 1, referred to as the last-two method, takes in order 1 code from the high-frequency coding component of the stem, 1 code from the 1st coding component of the remaining part and 2 codes from the 2nd coding component of the remaining part; scheme 2, referred to as the three-code method, takes in order 1 code from the high-frequency coding component of the stem and 1 code each from the 1st and 2nd coding components of the remaining part. If the stem is a single stem and is an ordinary coding component, there are two encoding schemes: scheme 1, referred to as the first-two method, takes in order 2 codes from the coding component of the stem and 1 code each from the 1st and 2nd coding components of the remaining part; scheme 2, referred to as the last-two method, takes in order 1 code from the coding component of the stem, 1 code from the 1st coding component of the remaining part and 2 codes from the 2nd coding component of the remaining part. If the stem is a compound stem, 1 code each is taken in order from the 1st and 2nd coding components of the compound stem, and 2 codes are taken from the coding component of the remaining part;
F, For a Chinese character composed of 4 or more coding components: if the stem is a single stem and is a high-frequency coding component in the principal-part position, 1 code is taken from the coding component of the stem and 1 code each from the 1st, 2nd and last coding components of the remaining part, in order. If the stem is a single stem and is an ordinary coding component, there are two code-taking schemes: scheme 1, referred to as the first-two method, takes in order 2 codes from the coding component of the stem and 1 code each from the 1st and last coding components of the remaining part; scheme 2, referred to as the one-code method, takes in order 1 code from the coding component of the stem and 1 code each from the 1st, 2nd and last coding components of the remaining part. If the stem is a compound stem, there are two code-taking schemes for the compound stem: the first, referred to as the compound-stem first-second-last code-taking method, takes in order 1 code each from the 1st, 2nd and last coding components of the compound stem and 1 code from the component of the remaining part; the second, referred to as the compound-stem first-second-third code-taking method, takes in order 1 code each from the 1st, 2nd and 3rd coding components of the compound stem and 1 code from the component of the remaining part;
The codes taken from the coding components of the Chinese character are arranged in the order in which the components occur in the composition of the character, forming the code of the whole Chinese character;
G, When coding, either English lowercase letters or English uppercase letters are used;
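The non-pinyin-initial rules for characters of 1 to 3 coding components can be sketched in Python as below. The sketch rests on assumptions: components are hypothetical four-letter strings (main code, then auxiliary codes 1 to 3), `encode_non_initial` is an invented name, only one of the alternative schemes is chosen for each case, and characters of 4 or more components are deliberately left out.

```python
from typing import List


def encode_non_initial(stem: List[str], rest: List[str],
                       stem_high_freq: bool = False) -> str:
    """Encode a character of 1 to 3 components without a pinyin initial.

    stem and rest hold four-letter placeholder strings, one per coding component.
    A two-element stem stands for a compound stem.
    """
    total = len(stem) + len(rest)
    if total == 1:
        # Single component: 2 codes if high-frequency, otherwise 3 codes.
        return stem[0][:2] if stem_high_freq else stem[0][:3]
    if total == 2:
        if stem_high_freq:
            return stem[0][:1] + rest[0][:2]   # 1 code + 2 codes
        return stem[0][:2] + rest[0][:2]       # 2 codes + 2 codes
    if total == 3:
        if len(stem) == 2:
            # Compound stem: 1 code from each stem component, 2 from the rest.
            return stem[0][:1] + stem[1][:1] + rest[0][:2]
        if stem_high_freq:
            return stem[0][:1] + rest[0][:1] + rest[1][:2]   # 1 + 1 + 2
        return stem[0][:2] + rest[0][:1] + rest[1][:1]       # 2 + 1 + 1
    raise NotImplementedError("characters of 4 or more components are not sketched")
```

The order of concatenation follows the order of the components in the character, as the closing rule of this method requires.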
(3) The pinyin-type pinyin-initial main and auxiliary code letter-type Chinese word and phrase coding method
A, The pinyin-type letter-type coding resources are adopted, and the code of a word is taken from the Chinese character codes obtained with the pinyin-type pinyin-initial Chinese character coding method. Taking 1 code from a character in a word means taking the pinyin initial of that character. Taking 2 codes from a character in a word: if the character is a single coding component, the pinyin initial of the character and auxiliary code 1 of the coding component are taken in order; if the character is composed of two or more coding components, the 1st and 2nd code symbols of the pinyin-initial letter-type code of the character are taken in order, that is, the pinyin initial of the character and the letter-type main code of its 1st coding component. Taking 3 codes from a Chinese character means taking the 1st, 2nd and 3rd code symbols of the code of the character in order. The maximum code length of a word code is set to 6; it can also be set to 4 or 5, which does not change the essence of the coding and only affects the duplicate-code rate to some extent;
B, For a word composed of 2 Chinese characters, there are 4 encoding schemes: scheme 1, referred to as the two-two method, takes in order 2 codes each from the 1st and 2nd Chinese characters; scheme 2, referred to as the one-three method, takes in order 1 code from the 1st Chinese character and 3 codes from the 2nd Chinese character; scheme 3, referred to as the three-one method, takes in order 3 codes from the 1st Chinese character and 1 code from the 2nd Chinese character; scheme 4, referred to as the two-three method, takes in order 2 codes from the 1st Chinese character and 3 codes from the 2nd Chinese character. When coding, one of the above schemes can be selected, or two of them at the same time, or three of them at the same time. The schemes can also be mixed: the two-two method is used for all words composed of two Chinese characters, and for words built on characters of especially strong word-forming ability, such as two-character words whose 1st character is "sending out", "no", "greatly" or "going out", a one-three method code is added; for two-character words whose 2nd character is "head", "work" or "heart", a three-one method code is added;
C, For a word composed of 3 Chinese characters, there are four encoding schemes: scheme 1, referred to as the one-one-two method, takes in order 1 code from the 1st Chinese character, 1 code from the 2nd Chinese character and 2 codes from the 3rd Chinese character; scheme 2, referred to as the two-one-one method, takes in order 2 codes from the 1st Chinese character, 1 code from the 2nd Chinese character and 1 code from the 3rd Chinese character; scheme 3, referred to as the one-two-one method, takes in order 1 code from the 1st Chinese character, 2 codes from the 2nd Chinese character and 1 code from the 3rd Chinese character; scheme 4, referred to as the two-one-two method, takes in order 2 codes from the 1st Chinese character, 1 code from the 2nd Chinese character and 2 codes from the 3rd Chinese character. When coding, one of the above schemes can be selected, or two of them, or three of them;
D, For a word composed of 4 Chinese characters, there are three encoding schemes: scheme 1, referred to as the four-code method, takes in order 1 code each from the 1st, 2nd, 3rd and 4th Chinese characters; scheme 2, referred to as the five-code method, takes in order 1 code each from the 1st, 2nd and 3rd Chinese characters and 2 codes from the 4th Chinese character; scheme 3, referred to as the first-two method, takes in order 2 codes from the 1st Chinese character and 1 code each from the 2nd, 3rd and 4th Chinese characters. When coding, one of the above schemes can be selected, or two of them at the same time;
E, For a word composed of 5 Chinese characters, there are three encoding schemes: scheme 1, referred to as the five-code method, takes in order 1 code each from the 1st, 2nd, 3rd, 4th and 5th Chinese characters; scheme 2, referred to as the six-code method, takes in order 1 code each from the 1st, 2nd, 3rd and 4th Chinese characters and 2 codes from the 5th Chinese character; scheme 3, referred to as the first-two method, takes in order 2 codes from the 1st Chinese character and 1 code each from the 2nd, 3rd, 4th and 5th Chinese characters. When coding, one of the above schemes can be selected, or two of them at the same time;
F, For a word composed of 6 or more Chinese characters, there are four encoding schemes: scheme 1, referred to as the sequential-six method, takes in order 1 code each from the 1st, 2nd, 3rd, 4th, 5th and 6th Chinese characters of the word; scheme 2, referred to as the last-six method, takes in order 1 code each from the 1st, 2nd, 3rd, 4th, 5th and last Chinese characters of the word; scheme 3, referred to as the first-two method, takes in order 2 codes from the 1st Chinese character of the word and 1 code each from the 2nd, 3rd, 4th and 5th Chinese characters; scheme 4, referred to as the first-two-last method, takes in order 2 codes from the 1st Chinese character of the word and 1 code each from the 2nd, 3rd, 4th and last Chinese characters;
G, When coding, either English lowercase letters or English uppercase letters are used;
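The word rules of this method reduce to choosing how many codes to take from each character. The Python sketch below illustrates that idea under stated assumptions: each character is given as a hypothetical pair of its already computed pinyin-initial code and, for single-component characters only, the auxiliary code 1 of its component; the names `char_codes`, `encode_word` and `DEFAULT_PATTERNS` are invented; the default patterns pick one scheme per word length; all letters are placeholders.

```python
from typing import Dict, Optional, Sequence, Tuple

# (pinyin-initial code of the character, auxiliary code 1 if single-component)
CharEntry = Tuple[str, Optional[str]]

# One chosen scheme per word length: codes to take from each character in order.
DEFAULT_PATTERNS: Dict[int, Tuple[int, ...]] = {
    2: (2, 2),
    3: (1, 1, 2),
    4: (1, 1, 1, 1),
    5: (1, 1, 1, 1, 1),
}


def char_codes(entry: CharEntry, n: int) -> str:
    """Take n codes from one character of a word."""
    code, aux1 = entry
    if n == 2 and aux1 is not None:
        # Single-component character: pinyin initial plus auxiliary code 1.
        return code[0] + aux1
    # Otherwise the first n code symbols of the character's code.
    return code[:n]


def encode_word(word: Sequence[CharEntry],
                pattern: Optional[Sequence[int]] = None) -> str:
    if len(word) < 2:
        raise ValueError("a word has at least 2 characters")
    if pattern is None:
        # Words of 6 or more characters: 1 code from each of the first six.
        pattern = DEFAULT_PATTERNS.get(len(word), (1,) * 6)
    # zip stops at the shorter sequence, so extra characters are ignored.
    return "".join(char_codes(entry, n) for entry, n in zip(word, pattern))
```

Passing a different pattern such as `(1, 3)` or `(3, 1)` selects another of the listed schemes; none of the patterns used here exceeds the maximum code length of 6.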
(4) The pinyin-type non-pinyin-initial main and auxiliary code letter-type Chinese word and phrase coding method
The codes determined for each Chinese character by the pinyin-type non-pinyin-initial main and auxiliary code letter-type Chinese character coding method are used to implement non-pinyin-initial coding of Chinese words. Taking 1 code from a Chinese character means taking the letter-type main code of the 1st coding component of the character. Taking 2 codes from a Chinese character: if the character is a single coding component, the letter-type main code and auxiliary code 1 of the coding component are taken in order; if the character is composed of two or more coding components, there are two ways: the first, referred to as the first-second method, takes in order the letter-type main codes of the 1st and 2nd coding components of the character; the second, referred to as the first-remainder method, takes in order the letter-type main code of the 1st coding component of the stem of the character and the letter-type main code of the 1st coding component of the remaining part. Taking 3 codes from a Chinese character: if the character is a single coding component, the main code, auxiliary code 1 and auxiliary code 2 of the letter-type code of the coding component are taken in order; if the character has two coding components, 1 code from the 1st coding component and 2 codes from the 2nd coding component are taken in order; if the character has three or more coding components, 1 code each is taken in order from the 1st, 2nd and last coding components. The maximum code length of a word is set to 6. The specific coding methods for words are as follows:
A, For a word composed of 2 Chinese characters, there are three schemes: scheme 1, referred to as the two-two method, takes codes from the characters of the word by the first-second method, that is, 2 codes from the 1st Chinese character and 2 codes from the 2nd Chinese character of the word in order; scheme 2, referred to as the first-three method, takes in order 3 codes from the 1st Chinese character and 1 code from the 2nd Chinese character of the word; scheme 3, referred to as the last-three method, takes in order 1 code from the 1st Chinese character and 3 codes from the 2nd Chinese character of the word;
B, For a word composed of 3 Chinese characters, there are three encoding schemes: scheme 1, referred to as the last-two method, takes in order 1 code each from the 1st and 2nd Chinese characters of the word and 2 codes from the 3rd Chinese character; scheme 2, referred to as the first-two method, takes in order 2 codes from the 1st Chinese character of the word and 1 code each from the 2nd and 3rd Chinese characters; scheme 3, referred to as the second-two method, takes in order 1 code from the 1st Chinese character, 2 codes from the 2nd Chinese character and 1 code from the 3rd Chinese character;
C, For a word composed of 4 Chinese characters, there are two schemes: scheme 1, referred to as the last-two method, takes in order 1 code each from the 1st, 2nd and 3rd Chinese characters of the word and 2 codes from the 4th Chinese character; scheme 2, referred to as the four-code method, takes in order 1 code each from the 1st, 2nd, 3rd and 4th Chinese characters of the word;
D, For a word composed of 5 Chinese characters, 1 code each is taken in order from the 1st, 2nd, 3rd, 4th and 5th Chinese characters of the word;
E, For a word composed of 6 or more Chinese characters, 1 code each is taken in order from the 1st, 2nd, 3rd, 4th, 5th and 6th Chinese characters of the word;
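The rules for taking 1, 2 or 3 codes from a character and assembling a word code can be sketched in Python. This is an illustrative sketch only: each character is a hypothetical list of four-letter component strings (main code, then auxiliary codes 1 to 3), the names `take_from_char` and `encode_word_non_initial` are invented, the 2-code case uses the first-second method, and the default patterns fix one scheme per word length.

```python
from typing import List, Optional, Sequence

Char = List[str]  # one four-letter placeholder string per coding component


def take_from_char(components: Char, n: int) -> str:
    """Take n (1 to 3) codes from one character of a word."""
    if n == 1:
        return components[0][:1]            # main code of the 1st component
    if len(components) == 1:
        return components[0][:n]            # main code plus auxiliary codes
    if n == 2:
        # First-second method: main codes of the 1st and 2nd components.
        return components[0][:1] + components[1][:1]
    if n == 3:
        if len(components) == 2:
            return components[0][:1] + components[1][:2]
        # Three or more components: 1st, 2nd and last, 1 code each.
        return components[0][:1] + components[1][:1] + components[-1][:1]
    raise ValueError("n must be 1, 2 or 3")


DEFAULT_PATTERNS = {2: (2, 2), 3: (1, 1, 2), 4: (1, 1, 1, 2), 5: (1,) * 5}


def encode_word_non_initial(word: Sequence[Char],
                            pattern: Optional[Sequence[int]] = None) -> str:
    if len(word) < 2:
        raise ValueError("a word has at least 2 characters")
    if pattern is None:
        # 6 or more characters: 1 code from each of the first six.
        pattern = DEFAULT_PATTERNS.get(len(word), (1,) * 6)
    return "".join(take_from_char(c, n) for c, n in zip(word, pattern))
```

With a single-component character `["abcd"]` followed by a two-component character `["efgh", "ijkl"]`, the 2 + 2 pattern gives `"abei"` and the 1 + 3 pattern gives `"aeij"`.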
(5) The pinyin-type main and auxiliary code letter-type radical Chinese character coding method
A, The pinyin-type main and auxiliary code letter-type coding resources are adopted; Chinese characters are split by the radical splitting method used in character dictionaries and word dictionaries; the code length is variable;
B, For the coding component that serves as the radical forming the stem of the Chinese character, there are three code-taking schemes: scheme 1, referred to as the two-code method, takes 2 codes from the coding component, that is, its letter-type main code and auxiliary code 1; scheme 2, referred to as the three-code method, takes 3 codes from the coding component, that is, its letter-type main code, auxiliary code 1 and auxiliary code 2; scheme 3, referred to as the four-code method, takes 4 codes from the coding component, that is, its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3;
C, For the coding components that form the remaining part of the Chinese character, there are two code-taking schemes: scheme 1, referred to as the two-code method: if the remaining part is a single coding component, 2 codes are taken from that coding component; if the remaining part has two or more coding components, 1 code each is taken from its 1st and last coding components; scheme 2, referred to as the three-code method: if the remaining part is a single coding component, 3 codes are taken from that coding component; if the remaining part has two coding components, 1 code is taken from the 1st coding component and 2 codes from the 2nd coding component in order; if the remaining part is composed of three or more coding components, 1 code each is taken in order from its 1st, 2nd and last coding components;
D, For a coding component that is a radical forming a stem with no remaining part, there are three code-taking schemes: scheme 1, referred to as the two-code method, takes 2 codes from the coding component, that is, its letter-type main code and auxiliary code 1; scheme 2, referred to as the three-code method, takes 3 codes from the coding component, that is, its letter-type main code, auxiliary code 1 and auxiliary code 2; scheme 3, referred to as the four-code method, takes 4 codes from the coding component, that is, its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3;
E, The codes of the stem and of the remaining part of the Chinese character are combined in order to form the code of the whole Chinese character;
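The radical method can be sketched in Python under the assumption that 2 codes are taken from the radical and 2 codes from the remaining part, which is only one of the combinations the text allows. Components are hypothetical four-letter strings (main code, then auxiliary codes 1 to 3), and `two_code_part` and `encode_by_radical` are names invented for this illustration.

```python
from typing import List


def two_code_part(components: List[str]) -> str:
    """Take 2 codes from a part made of one or more coding components."""
    if not components:
        raise ValueError("the part has no coding components")
    if len(components) == 1:
        return components[0][:2]                      # main code, auxiliary code 1
    return components[0][:1] + components[-1][:1]     # 1st and last, 1 code each


def encode_by_radical(radical: str, rest: List[str]) -> str:
    """Code of the radical (stem) followed by the code of the remaining part."""
    code = radical[:2]
    if rest:
        code += two_code_part(rest)
    return code
```

A character that consists of the radical alone gets only the radical's 2 codes; otherwise the stem code and the remaining-part code are joined in that order.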
(6) The pinyin-type main and auxiliary code letter-type phonetic-series Chinese character coding method
A, The pinyin-type main and auxiliary code letter-type coding resources are adopted; Chinese characters are split by the phonetic-component splitting method; the code length is variable;
B, Codes are taken from the phonetic component by one of three schemes:
Scheme 1, referred to as the two-code method: for a phonetic component that is a single coding component, 2 codes are taken from that coding component; for a phonetic component composed of two or more coding components, 1 code each is taken in order from its 1st and last coding components;
Scheme 2, referred to as the three-code method: for a phonetic component that is a single coding component, 3 codes are taken from that coding component; for a phonetic component composed of two coding components, there are two code-taking schemes: the first, referred to as the first-two method, takes in order 2 codes from its 1st coding component and 1 code from its 2nd coding component; the second, referred to as the first-one method, takes in order 1 code from its 1st coding component and 2 codes from its 2nd coding component; for a phonetic component composed of three or more coding components, 1 code each is taken in order from the 1st, 2nd and last coding components;
Scheme 3, referred to as the four-code method: for a phonetic component that is a single coding component, 4 codes are taken from that coding component; for a phonetic component composed of two coding components, there are three code-taking schemes: the first, referred to as the first-one method, takes in order 1 code from its 1st coding component and 3 codes from its 2nd coding component; the second, referred to as the first-two method, takes in order 2 codes each from its 1st and 2nd coding components; the third, referred to as the first-three method, takes in order 3 codes from the 1st coding component and 1 code from the 2nd coding component; for a phonetic component composed of three coding components, there are two code-taking schemes: the first, referred to as the first-one method, takes in order 1 code each from the 1st and 2nd coding components and 2 codes from the 3rd coding component; the second, referred to as the first-two method, takes in order 2 codes from its 1st coding component and 1 code each from the 2nd and 3rd coding components; for a phonetic component composed of four or more coding components, 1 code each is taken in order from its 1st, 2nd, 3rd and last coding components;
C, Codes are taken from the semantic component by one of three schemes: scheme 1, referred to as the two-code method: if the semantic component is a single coding component, 2 codes are taken from it; if the semantic component is composed of two or more coding components, 1 code each is taken in order from its 1st and last coding components; scheme 2, referred to as the three-code method: if the semantic component is a single coding component, 3 codes are taken from that coding component; if the semantic component is composed of two coding components, 1 code from its 1st coding component and 2 codes from its 2nd coding component are taken in order; if the semantic component is composed of three or more coding components, 1 code each is taken in order from the 1st, 2nd and last coding components; scheme 3, referred to as the one-code method, takes the letter-type main code of the 1st coding component of the semantic component;
D, For a phonetic component with no semantic component, codes are taken by one of three schemes:
Scheme 1, referred to as the two-code method: for a phonetic component that is a single coding component, 2 codes are taken from that coding component; for a phonetic component composed of two or more coding components, 1 code each is taken in order from its 1st and last coding components;
Scheme 2, referred to as the three-code method: for a phonetic component that is a single coding component, 3 codes are taken from that coding component; for a phonetic component composed of two coding components, there are two code-taking schemes: the first, referred to as the first-two method, takes in order 2 codes from its 1st coding component and 1 code from its 2nd coding component; the second, referred to as the first-one method, takes in order 1 code from its 1st coding component and 2 codes from its 2nd coding component; for a phonetic component composed of three or more coding components, 1 code each is taken in order from the 1st, 2nd and last coding components;
Scheme 3, referred to as the four-code method: for a phonetic component that is a single coding component, 4 codes are taken from that coding component; for a phonetic component composed of two coding components, there are three code-taking schemes: the first, referred to as the first-one method, takes in order 1 code from its 1st coding component and 3 codes from its 2nd coding component; the second, referred to as the first-two method, takes in order 2 codes each from its 1st and 2nd coding components; the third, referred to as the first-three method, takes in order 3 codes from the 1st coding component and 1 code from the 2nd coding component; for a phonetic component composed of three coding components, there are two code-taking schemes: the first, referred to as the first-one method, takes in order 1 code each from the 1st and 2nd coding components and 2 codes from the 3rd coding component; the second, referred to as the first-two method, takes in order 2 codes from its 1st coding component and 1 code each from the 2nd and 3rd coding components; for a phonetic component composed of four or more coding components, 1 code each is taken in order from its 1st, 2nd, 3rd and last coding components;
E, The code taken from the phonetic component is placed first and the code taken from the semantic component is placed after it, forming in order the code of the whole Chinese character;
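The phonetic-series method can be sketched in Python for the case where the two-code scheme is chosen for both the phonetic component and the semantic component; the other schemes listed above are not covered. Components are hypothetical four-letter strings (main code, then auxiliary codes 1 to 3), and `two_codes` and `encode_by_phonetic_series` are names invented for this illustration.

```python
from typing import List


def two_codes(components: List[str]) -> str:
    """Two-code scheme for a phonetic or semantic component."""
    if not components:
        raise ValueError("the component list is empty")
    if len(components) == 1:
        return components[0][:2]                      # 2 codes from the one component
    return components[0][:1] + components[-1][:1]     # 1st and last, 1 code each


def encode_by_phonetic_series(phonetic: List[str], semantic: List[str]) -> str:
    """Phonetic-component code first, semantic-component code after it."""
    code = two_codes(phonetic)
    if semantic:
        code += two_codes(semantic)
    return code
```

Because the phonetic-component code always comes first, characters that share a phonetic component share a code prefix, which is what allows them to be gathered together in a phonetic-series listing.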
Part II: The combined pinyin-type main and auxiliary code digit-type Chinese character and word coding input method
The pinyin-type digit-type coding resources are used to implement combined pinyin-type main and auxiliary code digit-type coding input of Chinese characters and words; the code length is variable, and the maximum code length is set to 6;
The combined pinyin-type main and auxiliary code digit-type Chinese character and word coding input methods that use the pinyin-type digit-type coding resources comprise: 1. the main and auxiliary code digit-type Chinese character coding method based on the pinyin initial; 2. the main and auxiliary code digit-type word and phrase coding method based on the pinyin initial; 3. the main and auxiliary code digit-type Chinese character coding method not based on the pinyin initial; 4. the main and auxiliary code digit-type word and phrase coding method not based on the pinyin initial. For coding input of the 6763 commonly used Chinese characters of GB2312, of the more than 27,000 Chinese characters of GB18030 or GB13000, of the character set in general use in Taiwan, China, of the Japanese kanji set or the Korean hanja set, of the common-word sets (or large Chinese word sets), and of large character sets of tens of thousands up to more than 100,000 Chinese characters, a selection can be made from the following combinations to carry out combined coding input. Commonly used Chinese characters, for example the level-1 character library, or the level-1 and level-2 character libraries, of the 6763 GB2312 characters, together with the common-word library, use the digit-type Chinese character and word coding input method based on the pinyin initial. All the Chinese characters of GB2312, up to the more than 27,000 Chinese characters of GB18030-2000 or GB13000, the character set in general use in Taiwan, China, or the Japanese kanji set, use the codes obtained with the main and auxiliary code digit-type Chinese character coding method not based on the pinyin initial. These codes can be organized in the same code table, or arranged in two separate tables that are switched between when called. Common words can also use the digit-type word coding input not based on the pinyin initial. The pinyin-type digit-type Chinese character and word codes based on the pinyin initial and those not based on the pinyin initial can be combined in the same code table, or listed in two code tables that are switched between when called.
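The two ways of organizing the codes described above, one merged code table or two tables that are switched between, can be sketched as follows. This is only an illustration under stated assumptions: the table layout (a dictionary from code string to candidate list), the function names, and the sample codes and candidate labels are all hypothetical and do not come from the patent.

```python
from collections import defaultdict


def build_table(entries):
    """Build a code table mapping each code to its list of candidates."""
    table = defaultdict(list)
    for code, text in entries:
        table[code].append(text)
    return dict(table)


def merge_tables(a, b):
    """Organize two code tables in one table, keeping candidate order."""
    merged = {code: list(cands) for code, cands in a.items()}
    for code, cands in b.items():
        bucket = merged.setdefault(code, [])
        bucket.extend(c for c in cands if c not in bucket)
    return merged


def lookup(code, tables, active):
    """Look a code up in whichever table is currently switched in."""
    return tables[active].get(code, [])


# Placeholder entries; real tables would hold characters and words.
initial_based = build_table([("66", "char-A"), ("7225", "char-B")])
non_initial = build_table([("66", "char-C"), ("4225", "char-B")])

tables = {"initial": initial_based, "non-initial": non_initial}
tables["merged"] = merge_tables(initial_based, non_initial)

print(lookup("66", tables, "initial"))
print(lookup("66", tables, "merged"))
```

With a merged table a code can return more candidates (more repeated codes), whereas separate tables keep each candidate list shorter at the cost of an explicit switch.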
The digit-type Chinese character and word coding methods are as follows:
In the following description it is stipulated that, for a coding component: taking 1 code means taking its digit-type main code; taking 2 codes means taking its digit-type main code and auxiliary code 1 in turn; taking 3 codes means taking its digit-type main code, auxiliary code 1 and auxiliary code 2 in turn; taking 4 codes means taking its digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 in turn;
Where a coding clause below offers several coding schemes, one scheme is selected from among them when a specific coding is implemented;
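The stipulation on taking 1 to 4 codes of a coding component amounts to taking a prefix of the sequence (main code, auxiliary code 1, auxiliary code 2, auxiliary code 3). A minimal sketch, in which the `Component` type, its field names and the sample digit values are assumptions made for illustration:

```python
from typing import NamedTuple


class Component(NamedTuple):
    """Digit-type codes of one coding component (field names are illustrative)."""
    main: str  # digit-type main code
    aux1: str  # auxiliary code 1
    aux2: str  # auxiliary code 2
    aux3: str  # auxiliary code 3


def take_codes(component: Component, n: int) -> str:
    """Take n codes (1..4) of a component: main code, then auxiliary codes in order."""
    if not 1 <= n <= 4:
        raise ValueError("a coding component supplies between 1 and 4 codes")
    return "".join(component[:n])


# Sample values chosen for illustration only.
sample = Component(main="6", aux1="6", aux2="5", aux3="3")
for n in range(1, 5):
    print(n, take_codes(sample, n))
```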
(7) Pinyin-type main and auxiliary code digit-type Chinese character coding method based on the pinyin initial
A. It is determined that the pinyin-type digit-type coding resources are used; the code is taken in two parts, and the codes taken for part 1 and part 2 are then combined in turn into the code of the whole Chinese character;
B. Coding of a Chinese character that is a single coding component. For a Chinese character whose coding component is a high-frequency coding component: if the digit-type code of its pinyin initial is identical to the digit-type main code of that high-frequency component, take 2 codes of the component, namely the digit-type main code and auxiliary code 1 of the character's coding component; if they differ, there are two coding schemes: scheme 1, the pinyin-first method, takes in turn the 1 digit code into which the pinyin initial is converted and 2 codes of the high-frequency component; scheme 2, the substitution method, takes in turn the 1 digit code into which the pinyin initial is converted and the digit-type auxiliary code 1 and auxiliary code 2 of the component. For a Chinese character that is an ordinary coding component: if the digit-type code of its pinyin initial is identical to the digit-type main code of the component, there are two coding schemes: scheme 1, the three-code method, takes 3 codes of the component in turn; scheme 2, the four-code method, takes 4 codes of the component in turn, as the code of the character; if they are not identical, there are three coding schemes: scheme 1, the three-code method, takes in turn the digit code into which the pinyin initial of the character is converted and the digit-type auxiliary code 1 and auxiliary code 2 of the component; scheme 2, the four-code method, takes in turn the digit code into which the pinyin initial of the character is converted and 3 digit codes of the component; scheme 3, the replacement method, takes in turn the digit code into which the pinyin initial of the character is converted and the digit-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the component;
C. For a Chinese character composed of 2 coding components, its digit-type code is determined in the following two parts in turn. Part 1: take the one digit code into which the pinyin initial of the character is converted as the code of part 1. Part 2: take the code as follows. If the 1st coding component is a high-frequency coding component in the principal-part position, there are five coding schemes: scheme 1, the first-one last-two method, takes 1 code of the 1st component and 2 codes of the 2nd component in turn; scheme 2, the first-two last-one method, takes 2 codes of the 1st component and 1 code of the 2nd component in turn; scheme 3, the first-two last-two method, takes 2 codes of the 1st component and 2 codes of the 2nd component in turn; scheme 4, the first-one last-three method, takes 1 code of the 1st component and 3 codes of the 2nd component in turn; scheme 5, the first-two last-three method, takes 2 codes of the 1st component and 3 codes of the 2nd component in turn. If the 1st coding component is an ordinary coding component, there are six coding schemes: scheme 1, the first-two last-one method, takes 2 codes of the 1st component of the character and 1 code of the 2nd component in turn; scheme 2, the first-two last-two method, takes 2 codes of the 1st component and 2 codes of the 2nd component in turn; scheme 3, the first-one last-three method, takes 1 code of the 1st component and 3 codes of the 2nd component in turn; scheme 4, the first-one last-two method, takes 1 code of the 1st component and 2 codes of the 2nd component in turn; scheme 5, the first-three last-two method, takes 3 codes of the 1st component and 2 codes of the 2nd component in turn; scheme 6, the first-three last-one method, takes 3 codes of the 1st component and 1 code of the 2nd component in turn;
D. For a Chinese character composed of 3 coding components, its digit-type code is determined in the following two parts in turn. Part 1: take the one digit code into which the pinyin initial of the character is converted as the code of part 1. Part 2: take the code as follows. If the head part is a single head and the 1st coding component is a high-frequency coding component in the principal-part position, there are four coding schemes: scheme 1, the three-code method, takes 1 code each of the 1st, 2nd and 3rd components in turn; scheme 2, the last-two method, takes 1 code each of the 1st and 2nd components and 2 codes of the 3rd component in turn; scheme 3, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd and 3rd components in turn; scheme 4, the first-two last-two method, takes 2 codes of the 1st component, 1 code of the 2nd component and 2 codes of the 3rd component in turn. If the head part is a single head and the 1st coding component is an ordinary coding component, there are four coding schemes: scheme 1, the last-two method, takes 1 code each of the 1st and 2nd components and 2 codes of the 3rd component in turn; scheme 2, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd and 3rd components in turn; scheme 3, the first-two last-two method, takes 2 codes of the 1st component, 1 code of the 2nd component and 2 codes of the 3rd component in turn; scheme 4, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd and 3rd components in turn. If the head part of the character is a combined head, there are three coding schemes: scheme 1, the remainder-two method, takes 1 code each of the 1st and 2nd components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 2, the first-two method, takes 2 codes of the 1st component, 1 code of the 2nd component and 2 codes of the coding component of the remainder part in turn; scheme 3, the remainder-three method, takes 1 code each of the 1st and 2nd components of the combined head and 3 codes of the coding component of the remainder part in turn;
E. For a Chinese character composed of four coding components, its digit-type code is determined in the following two parts in turn. Part 1: take the one digit code into which the pinyin initial of the character is converted as the code of part 1. Part 2: take the code as follows. If the head part is a single head and the 1st coding component is a high-frequency coding component in the principal-part position, there are three coding schemes: scheme 1, the four-code method, takes 1 code each of the 1st, 2nd, 3rd and 4th components in turn; scheme 2, the last-two method, takes 1 code each of the 1st, 2nd and 3rd components and 2 codes of the 4th component in turn; scheme 3, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd, 3rd and 4th components in turn. If the head part is a single head and the 1st coding component is an ordinary coding component, there are four coding schemes: scheme 1, the last-two method, takes 1 code each of the 1st, 2nd and 3rd components and 2 codes of the 4th component in turn; scheme 2, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd, 3rd and 4th components in turn; scheme 3, the four-code method, takes 1 code each of the 1st, 2nd, 3rd and 4th components in turn; scheme 4, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd and last components in turn. If the head part of the character is a combined head, there are three coding schemes: scheme 1, the remainder-two method, takes 1 code each of the 1st, 2nd and 3rd components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 2, the first-two method, takes 2 codes of the 1st component of the combined head, 1 code each of the 2nd and 3rd components, and 1 code of the coding component of the remainder part in turn; scheme 3, the four-code method, takes 1 code each of the 1st, 2nd and 3rd components of the combined head and 1 code of the coding component of the remainder part in turn;
F. For a Chinese character composed of five or more coding components, its digit-type code is determined in the following two parts in turn:
Part 1: take the one digit code into which the pinyin initial of the character is converted as the code of part 1. Part 2: take the code as follows. If the head part is a single head and the 1st coding component is a high-frequency coding component in the principal-part position, there are three coding schemes: scheme 1, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd, 3rd and last components in turn; scheme 2, the sequential method, takes 1 code each of the 1st, 2nd, 3rd, 4th and 5th components in turn; scheme 3, the take-last method, takes 1 code each of the 1st, 2nd, 3rd, 4th and last components in turn. If the head part is a single head and is an ordinary coding component, there are four coding schemes: scheme 1, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd, 3rd and last components in turn; scheme 2, the sequential method, takes 1 code each of the 1st, 2nd, 3rd, 4th and 5th components in turn; scheme 3, the take-last method, takes 1 code each of the 1st, 2nd, 3rd, 4th and last components in turn; scheme 4, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd and last components in turn. If the head part of the character is a combined head, there are two coding schemes: scheme 1, the first-four method, takes 1 code each of the 1st, 2nd, 3rd and 4th components of the combined head and 1 code of the coding component of the remainder part in turn; scheme 2, the first-three last-one method, takes 1 code each of the 1st, 2nd, 3rd and last components of the combined head and 1 code of the coding component of the remainder part in turn;
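Method (7) can be sketched for three of its cases: a single-component character (clause B), a two-component character (clause C) and a single-head character of five or more components (clause F). This is a hedged illustration only. Each component is assumed to be a four-digit string (main code, then auxiliary codes 1 to 3), the pinyin-initial digit is passed in already converted, and the function names, the English scheme labels and the sample digit strings are hypothetical.

```python
def encode_single(initial, component, high_frequency, scheme=None):
    """Clause B: a character that is a single coding component."""
    same = initial == component[0]
    if high_frequency:
        if same:
            return component[:2]                  # main code + auxiliary code 1
        if scheme == "pinyin-first":
            return initial + component[:2]        # initial digit + 2 codes
        if scheme == "substitution":
            return initial + component[1:3]       # initial digit + aux 1, aux 2
    elif same:
        if scheme == "three-code":
            return component[:3]
        if scheme == "four-code":
            return component[:4]
    else:
        if scheme == "three-code":
            return initial + component[1:3]       # initial digit + aux 1, aux 2
        if scheme == "four-code":
            return initial + component[:3]        # initial digit + 3 codes
        if scheme == "replacement":
            return initial + component[1:4]       # initial digit + aux 1..3
    raise ValueError(f"scheme {scheme!r} does not apply to this case")


# Clause C: (codes of 1st component, codes of 2nd component) per scheme name.
TWO_COMPONENT = {
    "first-one last-two": (1, 2), "first-two last-one": (2, 1),
    "first-two last-two": (2, 2), "first-one last-three": (1, 3),
    "first-two last-three": (2, 3), "first-three last-two": (3, 2),
    "first-three last-one": (3, 1),
}


def encode_two(initial, first, second, scheme):
    """Clause C: initial digit, then the scheme's share of each component."""
    a, b = TWO_COMPONENT[scheme]
    return initial + first[:a] + second[:b]


def encode_five_plus(initial, components, scheme):
    """Clause F, single head: choose which components contribute codes."""
    if len(components) < 5:
        raise ValueError("clause F covers five or more coding components")
    last = len(components) - 1
    picks = {
        "first-two": [(0, 2), (1, 1), (2, 1), (last, 1)],
        "sequential": [(i, 1) for i in range(5)],
        "take-last": [(0, 1), (1, 1), (2, 1), (3, 1), (last, 1)],
    }[scheme]
    return initial + "".join(components[i][:k] for i, k in picks)


parts = ["1000", "2000", "3000", "4000", "5000", "6000"]  # hypothetical digits
print(encode_single("7", "4225", False, "replacement"))
print(encode_two("7", "1234", "5678", "first-three last-two"))
print(encode_five_plus("7", parts, "take-last"))
```

No scheme in this sketch produces more than 6 digits, in line with the stated maximum code length.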
(8) Pinyin-type main and auxiliary code digit-type Chinese character coding method not based on the pinyin initial
A. It is determined that the pinyin-type digit-type coding resources are used;
B. For a Chinese character composed of a single coding component: if the coding component is a high-frequency coding component, take 2 codes of the component in turn; if the coding component is an ordinary coding component, there are two coding schemes: scheme 1, the three-code method, takes 3 codes of the component in turn; scheme 2, the four-code method, takes 4 codes of the component in turn;
C. A Chinese character composed of 2 or more coding components can be divided into two parts, the head part and the remainder part;
D. For a Chinese character composed of 2 coding components. If the 1st coding component is a high-frequency coding component in the principal-part position, there are six coding schemes: scheme 1, the first-two last-two method, takes 2 codes of the 1st component and 2 codes of the 2nd component in turn; scheme 2, the first-two last-three method, takes 2 codes of the 1st component and 3 codes of the 2nd component in turn; scheme 3, the first-two last-four method, takes 2 codes of the 1st component and 4 codes of the 2nd component in turn; scheme 4, the first-one last-three method, takes 1 code of the 1st component and 3 codes of the 2nd component in turn; scheme 5, the first-one last-two method, takes 1 code of the 1st component and 2 codes of the 2nd component in turn; scheme 6, the first-two last-one method, takes 2 codes of the 1st component and 1 code of the 2nd component in turn. If the 1st coding component is an ordinary coding component, there are four coding schemes: scheme 1, the first-three last-two method, takes 3 codes of the 1st component and 2 codes of the 2nd component in turn; scheme 2, the first-three last-three method, takes 3 codes each of the 1st and 2nd components in turn; scheme 3, the first-one last-three method, takes 1 code of the 1st component and 3 codes of the 2nd component in turn; scheme 4, the first-one last-four method, takes 1 code of the 1st component and 4 codes of the 2nd component in turn;
E. For a Chinese character composed of 3 coding components. If the 1st coding component is a single head that is a high-frequency coding component in the principal-part position, there are five coding schemes: scheme 1, the first-two method, takes 2 codes of the 1st component and 1 code each of the 2nd and 3rd components in turn; scheme 2, the first-two last-two method, takes 2 codes of the 1st component, 1 code of the 2nd component and 2 codes of the 3rd component in turn; scheme 3, the first-two last-three method, takes 2 codes of the 1st component, 1 code of the 2nd component and 3 codes of the 3rd component in turn; scheme 4, the first-one last-three method, takes 1 code each of the 1st and 2nd components and 3 codes of the 3rd component in turn; scheme 5, the first-one last-two method, takes 1 code each of the 1st and 2nd components and 2 codes of the 3rd component in turn. If the character has a single head whose 1st coding component is an ordinary coding component, there are four coding schemes: scheme 1, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd and 3rd components in turn; scheme 2, the first-three last-two method, takes 3 codes of the 1st component, 1 code of the 2nd component and 2 codes of the 3rd component in turn; scheme 3, the first-one last-three method, takes 1 code each of the 1st and 2nd components and 3 codes of the 3rd component in turn; scheme 4, the first-one last-two method, takes 1 code each of the 1st and 2nd components and 2 codes of the 3rd component in turn. For a Chinese character whose head part is a combined head, there are three coding schemes: scheme 1, the remainder three-code method, takes 1 code each of the 1st and 2nd components of the combined head and 3 codes of the coding component of the remainder part in turn; scheme 2, the remainder two-code method, takes 1 code each of the 1st and 2nd components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 3, the remainder-four method, takes 1 code each of the 1st and 2nd components of the combined head and 4 codes of the coding component of the remainder part in turn;
F. For a Chinese character composed of four coding components. If the character has a single head whose 1st coding component is a high-frequency coding component in the principal-part position, there are five coding schemes: scheme 1, the first-two last-one method, takes 2 codes of the 1st component of the character and 1 code each of the 2nd, 3rd and 4th components in turn; scheme 2, the first-two last-two method, takes 2 codes of the 1st component, 1 code each of the 2nd and 3rd components, and 2 codes of the 4th component in turn; scheme 3, the first-one last-two method, takes 1 code each of the 1st, 2nd and 3rd components and 2 codes of the 4th component in turn; scheme 4, the first-one last-three method, takes 1 code each of the 1st, 2nd and 3rd components and 3 codes of the 4th component in turn; scheme 5, the four-one method, takes 1 code each of the 1st, 2nd, 3rd and 4th components in turn. If the character has a single head whose 1st coding component is an ordinary coding component, there are two coding schemes: scheme 1, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd, 3rd and 4th components in turn; scheme 2, the first-one last-three method, takes 1 code each of the 1st, 2nd and 3rd components and 3 codes of the 4th component in turn. For a Chinese character with a combined head, there are two coding schemes: scheme 1, the remainder-two method, takes 1 code each of the 1st, 2nd and 3rd components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 2, the remainder-three method, takes 1 code each of the 1st, 2nd and 3rd components of the combined head and 3 codes of the coding component of the remainder part in turn;
G. For a Chinese character composed of five coding components. If the character has a single head whose 1st coding component is a high-frequency coding component in the principal-part position, there are three coding schemes: scheme 1, the first-two method, takes 2 codes of the 1st component of the character and 1 code each of the 2nd, 3rd, 4th and 5th components in turn; scheme 2, the last-two method, takes 1 code each of the 1st, 2nd, 3rd and 4th components and 2 codes of the 5th component in turn; scheme 3, the all-one method, takes 1 code each of the 1st, 2nd, 3rd, 4th and 5th components in turn. If the character has a single head whose 1st coding component is an ordinary coding component, there are three coding schemes: scheme 1, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd, 3rd and last components in turn; scheme 2, the last-two method, takes 1 code each of the 1st, 2nd, 3rd and 4th components and 2 codes of the 5th component in turn; scheme 3, the all-one method, takes 1 code each of the 1st, 2nd, 3rd, 4th and 5th components in turn. For a Chinese character with a combined head, there are two coding schemes: scheme 1, the remainder-two method, takes 1 code each of the 1st, 2nd, 3rd and 4th components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 2, the all-one method, takes 1 code each of the 1st, 2nd, 3rd and 4th components of the combined head and 1 code of the coding component of the remainder part in turn;
H. For a Chinese character composed of six or more coding components. If the character has a single head whose 1st coding component is a high-frequency coding component in the principal-part position, there are two coding schemes: scheme 1, the first-two method, takes 2 codes of the 1st component of the character and 1 code each of the 2nd, 3rd, 4th and last components in turn; scheme 2, the all-one method, takes 1 code each of the 1st, 2nd, 3rd, 4th, 5th and last components of the character in turn. If the character has a single head whose 1st coding component is an ordinary coding component, there are two coding schemes: scheme 1, the first-three method, takes 3 codes of the 1st component and 1 code each of the 2nd, 3rd and last components in turn; scheme 2, the all-one method, takes 1 code each of the 1st, 2nd, 3rd, 4th, 5th and last components in turn. For a Chinese character with a combined head, there are three coding schemes: scheme 1, the remainder-two method, takes 1 code each of the 1st, 2nd, 3rd and last components of the combined head and 2 codes of the coding component of the remainder part in turn; scheme 2, the all-one method, takes 1 code each of the 1st, 2nd, 3rd, 4th and last components of the combined head and 1 code of the coding component of the remainder part in turn; scheme 3, the sequential remainder-two method, takes 1 code each of the 1st, 2nd, 3rd and 4th components of the combined head and 2 codes of the coding component of the remainder part in turn;
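Two cases of method (8) can be sketched as follows: clause B (a single coding component) and the combined-head case of clause H (six or more components). The sketch assumes that each component is a four-digit string (main code, then auxiliary codes 1 to 3), that the combined head is given as a list of at least five components, and that the remainder part is a single component. The function names, scheme labels and digits are hypothetical.

```python
def encode_single_plain(component, high_frequency, scheme="three-code"):
    """Clause B of method (8): no pinyin-initial digit is prepended."""
    if high_frequency:
        return component[:2]      # high-frequency component: 2 codes
    if scheme == "three-code":
        return component[:3]
    if scheme == "four-code":
        return component[:4]
    raise ValueError(f"unknown scheme {scheme!r}")


def encode_combined_head(head, remainder, scheme):
    """Clause H, combined head: main codes from the head, then the remainder part."""
    if len(head) < 5:
        raise ValueError("this sketch expects at least five head components")
    if scheme == "remainder-two":
        chosen, rest = [head[0], head[1], head[2], head[-1]], 2
    elif scheme == "all-one":
        chosen, rest = [head[0], head[1], head[2], head[3], head[-1]], 1
    elif scheme == "sequential remainder-two":
        chosen, rest = head[:4], 2
    else:
        raise ValueError(f"unknown scheme {scheme!r}")
    # One code (the main code) per chosen head component, then the remainder codes.
    return "".join(c[0] for c in chosen) + remainder[:rest]


head = ["1000", "2000", "3000", "4000", "5000"]  # hypothetical head components
remainder = "6789"                               # hypothetical remainder component

print(encode_single_plain("6653", high_frequency=True))
print(encode_combined_head(head, remainder, "remainder-two"))
print(encode_combined_head(head, remainder, "sequential remainder-two"))
```

All three combined-head schemes yield exactly 6 digits here, matching the maximum code length set for Part II.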
(9) Pinyin-type main and auxiliary code digit-type Chinese word and phrase coding method based on the pinyin initial
The pinyin-type main and auxiliary code digit-type coding resources are used to implement coding input of Chinese words and phrases:
Stipulation. Taking 1 code of a Chinese character means taking the 1st digit of the character's pinyin-initial-based main and auxiliary code digit-type character code, that is, the pinyin initial of the character. Taking 2 codes of a Chinese character means taking the 1st and 2nd digits of that character code in turn. Taking 3 codes of a Chinese character is handled in three cases. If the character is a single coding component, there are two ways: way 1, the supplement method: if under the coding scheme the character has only 2 codes, 1 further digit-type code is taken in turn; for example, for the character "female", whose digit-type code on the 8-key layout is 66, taking 3 codes adds the further digit 5, so the 3 codes of "female" are 665; way 2, the as-coded method: a character for which the coding scheme takes only 2 codes still takes 2 codes, and this is regarded as taking 3 codes; for example, "female" still takes only 66; other characters take the 1st, 2nd and 3rd digit-type codes of the character in turn. If the character has a single head, take in turn the 1st and 2nd digit-type codes of the character and the digit-type main code of the character's 2nd coding component. If the character has a combined head, there are two ways: scheme 1, the sequential method, takes in turn the 1st, 2nd and 3rd digits of the combined-head character's pinyin-initial-based main and auxiliary code digit-type character code; scheme 2, the first-remainder method, takes in turn the 1st and 2nd digit-type codes of the character and the digit-type main code of the coding component of the character's remainder part; one of the above ways is selected. Taking 4 codes of a Chinese character: if the character is a single coding component, there are two ways: way 1, the supplement method: take the full pinyin-initial-based main and auxiliary code digit-type code of the character and, if it is shorter than 4 codes, extend it in turn to 4 codes according to its coding scheme; for the 8-key digit-type codes, "female" taken as 4 codes is 6653; if the digit-type code of the pinyin initial is not identical to the digit-type main code of the coding component, the substitution method is used, for example "mountain": 7225, "sheep": 9243, "five": 9815; way 2, the as-coded method: however many codes the adopted character coding scheme takes is the number of codes taken, with no increase; for example, "female" takes only 66, which is also regarded as taking 4 codes. If the character has two coding components, there are two ways: way 1, the supplement method, takes the digit code of its pinyin initial, the digit-type main codes of the 1st and 2nd components, and the digit-type auxiliary code 1 of the 2nd component; way 2, the as-coded method, takes in turn the digit code of its pinyin initial and the digit-type main codes of the 1st and 2nd components. If the character has three or more coding components, take the digit-type code of the pinyin initial and the digit-type main codes of the 1st, 2nd and 3rd components. The specific coding methods are as follows:
A. For a word composed of two Chinese characters, there are three coding schemes: scheme 1, the first-three method, takes 3 codes each of the 1st and 2nd characters in turn; scheme 2, the first-two last-three method, takes 2 codes of the 1st character and 3 codes of the 2nd character in turn; scheme 3, the first-two last-four method, takes 2 codes of the 1st character and 4 codes of the 2nd character in turn;
B. For a word composed of three Chinese characters, there are six coding schemes: scheme 1, the 222 method, takes 2 codes each of the 1st, 2nd and 3rd characters of the word in turn; scheme 2, the 212 method, takes 2 codes of the 1st character, 1 code of the 2nd character and 2 codes of the 3rd character in turn; scheme 3, the 213 method, takes 2 codes of the 1st character, 1 code of the 2nd character and 3 codes of the 3rd character in turn; scheme 4, the first-one last-two method, takes 1 code each of the 1st and 2nd characters and 2 codes of the 3rd character in turn; scheme 5, the first-one last-three method, takes 1 code each of the 1st and 2nd characters and 3 codes of the 3rd character in turn; scheme 6, the first-one last-four method, takes 1 code each of the 1st and 2nd characters and 4 codes of the 3rd character in turn;
C, word for being made up of four Chinese characters: have four kinds of encoding schemes, one of scheme, two methods headed by being referred to as, then get the 1st Chinese-character ' two code ' successively, 2nd, 3rd, each 1 yard of 4th Chinese character, scheme two, two last two methods headed by being referred to as, then get the 1st Chinese-character ' two code ' successively, 2nd, each 1 yard of 3rd Chinese character, 4th Chinese-character ' two code ', scheme three, be referred to as last two methods, then get the 1st successively, 2nd, each 1 yard of 3rd Chinese character, 4th Chinese-character ' two code ', scheme four, be referred to as last three methods, then get the 1st successively, 2nd, each 1 yard of 3rd Chinese character, 4th Chinese character 3 yards, scheme five, be referred to as monic end two methods, then get the 1st successively, 2nd, each 1 yard of 3rd Chinese character, 4th Chinese-character ' two code ',
D, for the word be made up of five Chinese characters: have three kinds of encoding schemes, one of scheme, two methods headed by being referred to as, then get successively the 1st Chinese-character ' two code ', the 2nd, the 3rd, the 4th, the 5th each 1 yard of Chinese character, scheme two, is referred to as last two methods, then get each 1 yard of the 1st, the 2nd, the 3rd, the 4th Chinese character, the 5th Chinese-character ' two code ' successively, scheme three, is referred to as each method, then get each 1 yard of the 1st, the 2nd, the 3rd, the 4th, the 5th Chinese character successively;
E, for the word be made up of more than six or six Chinese characters: have three kinds of encoding schemes, one of scheme, two methods headed by being referred to as, then get each 1 yard of the 1st Chinese-character ' two code ' of word, the 2nd, the 3rd, the 4th, the 5th Chinese character successively, scheme two, be referred to as along a method, then get each 1 yard of the 1st, the 2nd, the 3rd, the 4th, the 5th, the 6th Chinese character of word successively, scheme three, is referred to as a last method, then get the 1st of word the successively, the 2nd, the 3rd, the 4th, the 5th, end each 1 yard an of Chinese character;
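The schemes in A to E all follow one pattern: each names how many codes to take from which characters of the word. A minimal Python sketch of that pattern is given here for illustration. The scheme names are English renderings, the table and function names are hypothetical, and each character is assumed to arrive with its 1-, 2-, 3-, and 4-code forms already worked out by the per-character rules. The 3-code forms 665 and 461 are inferred by splitting the text's example word code 665461 ("woman worker") under the first-three method.

```python
def _run(*counts):
    """Turn per-character code counts into (position, count) pairs, left to right."""
    return tuple(enumerate(counts))

# Word length -> scheme name -> which characters contribute how many codes.
# Key 6 stands for "six or more characters"; position -1 means the last character.
WORD_SCHEMES = {
    2: {"first-three": _run(3, 3),
        "first-two last-three": _run(2, 3),
        "first-two last-four": _run(2, 4)},
    3: {"222": _run(2, 2, 2), "212": _run(2, 1, 2), "213": _run(2, 1, 3),
        "first-one last-two": _run(1, 1, 2),
        "first-one last-three": _run(1, 1, 3),
        "first-one last-four": _run(1, 1, 4)},
    4: {"first-two": _run(2, 1, 1, 1),
        "first-two last-two": _run(2, 1, 1, 2),
        "last-two": _run(1, 1, 1, 2),
        "last-three": _run(1, 1, 1, 3)},
    5: {"first-two": _run(2, 1, 1, 1, 1),
        "last-two": _run(1, 1, 1, 1, 2),
        "one-each": _run(1, 1, 1, 1, 1)},
    6: {"first-two": _run(2, 1, 1, 1, 1),
        "sequential-one": _run(1, 1, 1, 1, 1, 1),
        "last-one": _run(1, 1, 1, 1, 1) + ((-1, 1),)},
}

def word_code(chars, scheme):
    """Build a word code.

    chars: one dict per character, mapping a code count (1..4) to that
           character's code of that length (assumed precomputed).
    scheme: a scheme name valid for the word's length.
    """
    picks = WORD_SCHEMES[min(len(chars), 6)][scheme]
    return "".join(chars[pos][count] for pos, count in picks)

# "woman worker" is a two-character word; its code 665461 comes from the text.
female = {2: "66", 3: "665", 4: "6653"}
worker = {3: "461"}
print(word_code([female, worker], "first-three"))
```

Storing every code length per character, rather than slicing a prefix from the longest code, reflects the fact that the 2-code and 3-code forms of a character are defined by separate rules and need not be prefixes of one another.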
(10) Pinyin-type non-first-letter-based main-and-auxiliary-code digit-type Chinese word coding method
Chinese words are encoded for input using the pinyin-type main-and-auxiliary-code digit-type coding resources:
Pinyin-type non-first-letter-based main-and-auxiliary-code digit-type Chinese word coding method. It is further specified as follows. Taking 1 code for a Chinese character: take the pinyin-type digit-type main code of the character's 1st coding component. Taking 2 codes for a Chinese character: for a character of a single coding component, take in turn the digit-type main code and auxiliary code 1 of that component; for a character with a single head part, take 1 code each from its 1st and 2nd coding components in turn; for a character with a combined head part there are two schemes: scheme one, the first-remainder method, takes in turn the digit-type main code of the 1st coding component and the digit-type main code of the remainder-part coding component; scheme two, the sequential method, takes 1 code each from the 1st and 2nd coding components in turn. Taking 3 codes for a Chinese character: for a character of a single coding component, take in turn the digit-type main code, auxiliary code 1, and auxiliary code 2 of that component; for a single-head-part character of two coding components, take in turn the digit-type main code of the 1st component and the digit-type main code and auxiliary code 1 of the 2nd component; for a single-head-part character of three or more coding components, take 1 code each from the 1st, 2nd, and 3rd components in turn; for a character with a combined head part there are two schemes: scheme one, the first-remainder method, takes 1 code each from the 1st and 2nd coding components of the combined head part and from the remainder-part coding component; scheme two, the sequential method, takes 1 code each from the 1st, 2nd, and 3rd coding components in turn. Taking 4 codes for a Chinese character: for a character of a single coding component, take the digit-type main code, auxiliary code 1, auxiliary code 2, and auxiliary code 3 of that component; for a character of two coding components there are two schemes: scheme one, the one-three method, takes 1 code from the 1st component and 3 codes from the 2nd; scheme two, the two-two method, takes 2 codes from the 1st component and 2 codes from the 2nd; for a single-head-part character of three coding components, take 1 code each from the 1st and 2nd components and 2 codes from the 3rd; for a character of four or more coding components there are two schemes: scheme one, the sequential method, takes 1 code each from the 1st, 2nd, 3rd, and 4th components; scheme two, the sequential-last method, takes 1 code each from the 1st, 2nd, and 3rd components and from the last component; for a combined-head-part character of three coding components, take 1 code each from the 1st and 2nd components of the combined head part and 2 codes from the remainder-part component; for a combined-head-part character of four or more coding components there are two schemes: scheme one, the sequential method, takes 1 code each from the 1st, 2nd, 3rd, and 4th components; scheme two, the sequential-last method, takes 1 code each from the 1st, 2nd, and 3rd components and from the last component. The specific word coding methods are as follows:
A. For a word of two Chinese characters there are three encoding schemes. Scheme one, the first-three method: take 3 codes each from the 1st and 2nd characters in turn. Scheme two, the first-two last-three method: take 2 codes from the 1st character and 3 codes from the 2nd. Scheme three, the first-two last-four method: take 2 codes from the 1st character and 4 codes from the 2nd;
B. For a word of three Chinese characters there are five encoding schemes. Scheme one, the 222 method: take 2 codes each from the 1st, 2nd, and 3rd characters in turn. Scheme two, the 212 method: take 2 codes from the 1st character, 1 code from the 2nd, and 2 codes from the 3rd. Scheme three, the 213 method: take 2 codes from the 1st character, 1 code from the 2nd, and 3 codes from the 3rd. Scheme four, the first-one last-three method: take 1 code each from the 1st and 2nd characters and 3 codes from the 3rd. Scheme five, the first-one last-four method: take 1 code each from the 1st and 2nd characters and 4 codes from the 3rd;
C. For a word of four Chinese characters there are four encoding schemes. Scheme one, the first-two method: take 2 codes from the 1st character and 1 code each from the 2nd, 3rd, and 4th. Scheme two, the first-two last-two method: take 2 codes from the 1st character, 1 code each from the 2nd and 3rd, and 2 codes from the 4th. Scheme three, the last-two method: take 1 code each from the 1st, 2nd, and 3rd characters and 2 codes from the 4th. Scheme four, the last-three method: take 1 code each from the 1st, 2nd, and 3rd characters and 3 codes from the 4th;
D. For a word of five Chinese characters there are three encoding schemes. Scheme one, the first-two method: take 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th, and 5th. Scheme two, the last-two method: take 1 code each from the 1st, 2nd, 3rd, and 4th characters and 2 codes from the 5th. Scheme three, the one-each method: take 1 code each from the 1st through 5th characters in turn;
E. For a word of six or more Chinese characters there are four schemes. Scheme one, the first-two method: take 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th, and 5th. Scheme two, the first-two last-one method: take 2 codes from the 1st character and 1 code each from the 2nd, 3rd, and 4th characters and from the last character. Scheme three, the sequential-one method: take 1 code each from the 1st through 6th characters in turn. Scheme four, the sequential last-one method: take 1 code each from the 1st, 2nd, 3rd, 4th, and 5th characters and from the last character;
To ease input and reduce the duplicate-code rate, some of the symbol keys normally present on the numeric keypad, or combinations of them, can also serve as guide keys for functions such as punctuation marks, English upper- and lower-case letters, end of code, duplicate-code selection, space, numeric values, switching the Chinese character set, and returning to the phone's original letter input mode.
Six, the keyboard adopted
The pinyin-type main-and-auxiliary-code digit-type Chinese character and word coding input methods all use the numeric keypad of a general computer keyboard, the standard keypad of a mobile phone or telephone, or a corresponding soft keyboard;
The pinyin-type main-and-auxiliary-code letter-type Chinese character and word coding input methods can use the letter keys of a general computer keyboard, any of its improved variants, or a corresponding soft keyboard to complete input;
Seven, input operation
Input operation for the digit-type methods: strike the digit keys one digit at a time until the maximum code length of the digit-type code of the character or word is reached; if the code is shorter than the maximum code length, finish it with the end key, or enter the digit shown in front of the desired character or word in the input display box. If there is no duplicate code, input is complete; if there are duplicate codes, press the duplicate-code selection key to complete input;
Input operation for the letter-type methods: strike the letter keys one letter at a time; the code length of a letter-type character is not fixed, and the maximum code length of a word is 6; if the code is shorter than the maximum code length, finish it with the end key, or enter the digit shown in front of the desired character or word in the input display box. If there is no duplicate code, input is complete; if there are duplicate codes, press the duplicate-code selection key to complete input;
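The input operation described above amounts to a small lookup loop: collect keystrokes until the maximum code length or the end key, then commit directly when the code is unique or ask for a selection when it has duplicates. The Python sketch below illustrates this under stated assumptions. The function names, the "#" end key, and the second candidate under code 66 are hypothetical. The codes 66 ("female") and 665461 ("woman worker") are taken from the text.

```python
def enter_code(code_table, keystrokes, max_len=6, end_key="#"):
    """Collect keys until max_len is reached or end_key is struck.

    Returns the code typed and the list of candidates stored under it.
    """
    code = ""
    for key in keystrokes:
        if key == end_key:
            break
        code += key
        if len(code) == max_len:
            break
    return code, code_table.get(code, [])

def commit(candidates, choice=None):
    """Pick the output: automatic when unique, by index when duplicated."""
    if not candidates:
        return None
    if len(candidates) == 1:
        return candidates[0]
    if choice is None:
        raise ValueError("duplicate codes: a selection is required")
    return candidates[choice]

# Illustrative code table; the second entry under "66" is a made-up duplicate.
table = {
    "66": ["female", "(hypothetical duplicate)"],
    "665461": ["woman worker"],
}

code, found = enter_code(table, "665461")   # full length, no end key needed
print(code, commit(found))
code, found = enter_code(table, "66#")      # short code, finished with end key
print(code, commit(found, choice=0))
```

A real input method would update the candidate display after every keystroke; the sketch only models the final commit step.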
Beneficial effects
Compared with existing Chinese character coding techniques, the combined pinyin-type main-and-auxiliary-code Chinese character and word coding input method has significant beneficial effects. It makes full use of knowledge people already have of Chinese characters, Hanyu Pinyin, and strokes to "locate" the standardized coding components of Chinese characters, assigning each a main code and auxiliary codes 1, 2, and 3. The rules are simple and easy to memorize. Because first-letter-based and non-first-letter-based character and word input can be combined in one code table and used without switching, characters and words whose pronunciation the user does not know or is unsure of can be input easily. Moreover, the duplicate-code rate is significantly lower than that of pinyin input methods. The invention further provides a general combined coding method covering letter-type and digit-type, first-letter-based and non-first-letter-based coding, and therefore has the advantages of a clear rationale, wide applicability, ease of learning, and good prospects for adoption. The letter-type form of the combined pinyin-type main-and-auxiliary-code character and word coding input method uses mixed input of characters, words, and brevity codes, with the maximum code length of a character set to 4 and that of a word set to 6. For mixed input of first-letter-based characters, non-first-letter-based characters, and first-letter-based words, all characters of the GB18030 character library, all words of GB/T15732-1995 "Hanzi keyboard inputs with general word collection", and the new words of the 2002 enlarged edition of the "Modern Chinese Dictionary" were compiled into one code table; most codes have no more than 10 duplicates, many have none, and touch typing is possible.
Description of the drawings
Figure 1 is the "schematic diagram of the key arrangement of the letter typing key area" used by the combined pinyin-type main-and-auxiliary-code letter-type Chinese character coding input method. It shows the 31 high-frequency coding components assigned to the letter keys and the main position of each within a Chinese character; the letter on each key is the pinyin-type letter-type main code of that high-frequency coding component;
Figure 2 is the first schematic diagram of the key arrangement of an 8-key numeric keyboard, on which one digit code stands for several letter codes, for the combined pinyin-type main-and-auxiliary-code digit-type Chinese character coding input method;
Figure 3 is the first schematic diagram of the key arrangement of a 10-key numeric keyboard, on which one digit code stands for several letter codes, for the digit-type Chinese character coding input method of the invention;
Figure 4 is the second schematic diagram of the key arrangement of an 8-key numeric keyboard, on which one digit code stands for several letter codes, for the digit-type Chinese character coding input method of the invention;
Figure 5 is the second schematic diagram of the key arrangement of a 10-key numeric keyboard, on which one digit code stands for several letter codes, for the digit-type Chinese character coding input method of the invention.
Detailed description of the embodiments:
When a Chinese character can be split in more than one way, the split whose first coding unit has more strokes is taken; the splitting itself follows the general splitting method. The dot-fold I method is adopted. Letter-type codes are converted to digit-type codes using the letter-stroke differential conversion scheme;
Pinyin-type first-letter-based main-and-auxiliary-code letter-type Chinese character and word coding method. Character coding, for a character of a single coding component: if the character's pinyin first letter is the same as the letter-type main code of the component, then for a high-frequency coding component take in turn the component's letter-type main code and auxiliary code 1, and for a common coding component use the three-code method; if the pinyin first letter differs from the letter-type main code of the component and the component is a high-frequency coding component, use the three-code method; if the pinyin first letter differs from the letter-type main code of the component, the component is a common coding component, and its letter-type main code is not I, also use the three-code method; if the pinyin first letter is Y and the letter-type main code of the component is I, also use the three-code method. For a character of 2 coding components: if the head part is a high-frequency coding component, use the two-code method; if the head part is a common component, use the one-two method. For a character of 3 or more coding components: if the single head part is a common coding component, use the first-two method; if the head part is a combined head part, use the first-last method. Example character codes: female: NU; Day: RI; Mountain: SAA; Bird: NIP; Horse: MAZ; Clothing: YID; Celestial: XPA; Stretch: SPS; Cake: BMBA; Blighted: BMBI; As: BZSU; Serge: BKBS; More: YRDX; Ci: CQIN; Hub: GSCS; Draw a bow to the full: GSGS; Precious: BBYU; Ridge: LARJ; Confused: FLBD; Fen: FLID. Words of 2 characters use the two-two method, words of 3 characters use the two-one-one method, words of 4 characters use the four-code method, words of 5 characters use the five-code method, and words of 6 or more characters use the sequential-six method. Example word codes: the sun: TDYE; Woman worker: NUGO; Work: GOZP; Wage: GOZB; Work post: GOZH; Computing machine: JSJU; Wholeheartedly: YXYY; Technical expertise conference: JSJDH; Bureau of the Legislative Affairs under the State Council: GWYFZJ; Foreign Affairs Office of the State Council: GWYWSB;
Pinyin-type non-first-letter-based main-and-auxiliary-code letter-type Chinese character and word coding input method: the code length is not fixed, and the maximum code length of a word is set to 6;
For a character of a single coding component that is a common coding component, use the three-code method. For a character of 2 coding components: if the head part is a high-frequency coding component in its main position, use the three-code method, taking in turn 1 code from the head-part component and 2 codes from the remainder-part component; if the first component is a common coding component, use the two-two method. Letter-type coding of a character of 3 coding components: if the head part is a single head part and is a high-frequency coding component in its main position, use the last-two method, taking in turn 1 code from the high-frequency head-part component, 1 code from the 1st component of the remainder part, and 2 codes from the 2nd component of the remainder part; if the head part is a single head part and is a common coding component, use the first-two method, taking in turn 2 codes from the head-part component and 1 code each from the 1st and 2nd components of the remainder part. Letter-type coding of a character of 4 or more coding components: if the head part is a single head part and is a common coding component, use the first-two method; if the head part is a combined head part, use the code-taking method that takes the first, second, and last components of the combined head part. Example character codes: female: NU; Bird: NIP; Horse: MAZ; Bin: BHBY; Hub: SMCS; Hu: SMKS; Draw a bow to the full: SMGS; Shell: SMJS; Precious: BAYU; Serge: KBSI; : KBE; E: KHII; Ai: BIAD; Ridge: ARDJ; ling: AARJ; High: AMBZ; Jun: SIMZ; Lift up: BANI; Hall: XAMT; Humming-sound: HHMI; Thoroughbred horse: MABB; Benefit: QABM; Fill: TOME; Meeting: REEM; Cloud: ERMO. Taking 2 codes for a character: for a character of a single coding component, take in turn the letter-type main code and auxiliary code 1 of that component; for a character of two or more coding components, use the first-remainder method, taking in turn the letter-type main code of the 1st coding component of the head part and the letter-type main code of the 1st coding component of the remainder part. Words of 2 characters use the two-two method; words of 3 characters use the last-two method; words of 4 characters use the four-code method. Example word codes: wholeheartedly: YXYI; The sun: DDER; Woman worker: NUGO; Work: GOPZ; Wage: GOBB; Coding: LHSM; Work post: GOHZ; Computer: IZUJ; Technical expertise conference: EUDBR; Bureau of the Legislative Affairs under the State Council: WZEDZS; Foreign Affairs Office of the State Council: WZEXSL;
Pinyin-type first-letter-based main-and-auxiliary-code digit-type Chinese character and word coding input method: the maximum code length is set to 6. Letter-type codes are converted to digit-type codes using the "letter-stroke differential conversion scheme" together with the "8-key pinyin letter key assignment"; the resulting pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2, and auxiliary code 3 of the 31 high-frequency coding components, each expressed by the corresponding digit key of the numeric keypad, are shown in Table 6. Taking 3 codes for a Chinese character: for a character of a single coding component, use the supplement method; for a character with a combined head part, use the first-remainder method, taking in turn the 1st and 2nd digit-type codes of the character and the digit-type main code of the remainder-part coding component. Taking 4 codes for a Chinese character: for a character of a single coding component, use the supplement method; for a character of two coding components, use the supplement method, taking the digit code of the pinyin first letter, the digit-type main codes of the 1st and 2nd coding components, and the digit-type auxiliary code 1 of the 2nd coding component; for a character of three or more coding components, take the digit-type code of the pinyin first letter and the digit-type main codes of the 1st, 2nd, and 3rd coding components;
Coding of a character of a single coding component: for a high-frequency coding component, if the digit-type code of the pinyin first letter differs from the digit-type main code of the component, use the pinyin-first method; for a character that is a common coding component, use the three-code method whether or not the digit-type code of its pinyin first letter matches the digit-type main code of the component. For a character of 2 coding components: if the 1st component is a high-frequency coding component, use the first-two last-two method; if the 1st component is a common coding component, use the first-three last-two method. For a character of 3 coding components: if the head part is a single head part and the 1st component is a high-frequency coding component in its main position, use the first-two method; if the head part is a single head part and the 1st component is a common coding component, use the first-three method; if the head part is a combined head part, use the remainder-two method. For a character of 4 coding components: if the head part is a single head part and the 1st component is a high-frequency coding component in its main position, use the first-two method; if the head part is a single head part and the 1st component is a common coding component, use the first-three method; if the head part is a combined head part, use the remainder-two method. For a character of 5 or more coding components: if the head part is a single head part and the 1st component is a high-frequency coding component in its main position, use the first-two method; if the head part is a single head part and the 1st component is a common coding component, use the first-three method; if the head part is a combined head part, use the first-four method;
Example character codes: female: 66; Mountain: 722; Bird: 643; Precious: 222498; Serge: 25627; High: 522629; Jun: 574369; : 25622; Look at: 53952; Jiangxi: 447942; Hu: 476457;
Word coding: for a word of two Chinese characters, use the first-three method; for a word of three characters, use the 222 method; for a word of four characters, use the first-one last-two method; for a word of five characters, use the one-each method; for a word of six or more characters, use the sequential-one method. Example word codes: wholeheartedly: 99994; The sun: 833937; Woman worker: 665461; Cure: 936974; Computing machine: 547958; Technical expertise conference: 57534; Bureau of the Legislative Affairs under the State Council: 499395; Foreign Affairs Office of the State Council: 499972; Chinese People's Political Consultative Conference: 947699.
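The letter-to-digit conversion behind these examples treats pinyin letters and stroke letters differently: main code and auxiliary code 1 go through the pinyin letter keys, while auxiliary codes 2 and 3 go through the stroke keys. The Python sketch below illustrates this split. Both key tables are assumptions, since the standard's tables are not reproduced in this passage: the letter grouping is the common telephone-keypad grouping, and the stroke digits 1 to 5 are chosen because they reproduce the example "Bird: 643" from its letter code NIP. The per-component adjustments made for high-frequency coding components are not modelled.

```python
# Assumed 8-key pinyin letter grouping (common telephone keypad layout).
KEY_GROUPS = {"2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
              "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ"}
LETTER_TO_DIGIT = {ch: digit for digit, group in KEY_GROUPS.items() for ch in group}

# Assumed stroke keys: H horizontal, S vertical, P left-falling, D dot, Z fold.
STROKE_TO_DIGIT = {"H": "1", "S": "2", "P": "3", "D": "4", "Z": "5"}

def to_digit_code(pinyin_letters, stroke_letters=""):
    """Convert a letter-type code to a digit-type code.

    pinyin_letters: first letter / main code / auxiliary code 1 letters.
    stroke_letters: auxiliary code 2 and 3 letters (stroke names).
    """
    head = "".join(LETTER_TO_DIGIT[c] for c in pinyin_letters.upper())
    tail = "".join(STROKE_TO_DIGIT[c] for c in stroke_letters.upper())
    return head + tail

# Bird: letter code N, I plus first stroke P (left-falling).
print(to_digit_code("NI", "P"))
```

The same letter can therefore map to two digits depending on its role: S as a pinyin letter lands on key 7, while S as the vertical stroke lands on key 2, which is the point of converting the two kinds of letters by separate tables.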

Claims (10)

1. A combined pinyin-type main-and-auxiliary-code Chinese character and word coding input method, which uses the soft or hard letter keyboard of a general-purpose computer, or the soft or hard numeric keyboard of a mobile phone or computer, to input Chinese characters and words, characterized in that:
One, selecting the coding components
Chinese characters are split according to the requirements of the State Language Commission specification GF3001-1997 "information processing GB13000.1 character set Hanzi component specifications", and the coding components that take part in coding are determined;
The 560 basic components of GF3001-1997 "information processing GB13000.1 character set Hanzi component specifications" are selected, together with the 201 main radicals and 100 attached-form radicals of GB0011-2009 "Chinese character radicals tables"; then the following Chinese characters and character members, which contain some of the non-character basic components among the 560 basic components, are selected: inferior, northern, not, Cao, spring, list, section, send out, Guan, Kamei, Tortoises, heptan, the last of the twelve Earthly Branches, Pot, China, also, also, with, violet, hold concurrently, can, Lou, exempt from, the fourth of the twelve Earthly Branches, south, capsule, agriculture, abandoned, Pull, its, wife, front, black, Ukraine, not, net, row, Jia, the legendary ruler of great antiquity, the first of the Three August Ones, with, system, 44 in total. So that, for ease of memorization, all commonly used numeral characters count as coding components, the characters one, hundred, six, and zero are additionally selected. After duplicates are removed, a total of 687 components are selected as the basic coding units of this Chinese character coding method and are called coding components. According to relationships such as shared character-formation rationale, slightly different written forms, abbreviated forms, positional variants, or being traditional and simplified forms of each other, they are merged into 409 coding component groups, and the first coding component in each group is called the dominant-shape coding component. Provided the GF3001 specification is not violated, the number of selected coding components may be increased or decreased by up to 20 percent from these 687; this only slightly affects the duplicate-code rate and does not change the essence of this coding input method;
Two, determining the high-frequency coding components, their main positions, and their letter-type main codes
The 31 radicals with especially strong character-forming ability are designated high-frequency coding components; of the 687 coding components determined by the invention, those remaining after the 31 high-frequency coding components are removed are called common coding components;
Each letter key is assigned only one high-frequency coding component, or one group of components that are traditional and simplified forms of each other, and its main position is fixed at the same time, that is, the position the high-frequency coding component usually occupies within a Chinese character. Among them, the 12 high-frequency coding components of 9 coding component groups, such as mountain, Rolling, si, Si, the moon, Ren, Mu, Nian, Yan, Yan, do not take as their main code the first letter of the pinyin of their pronunciation or radical name; their main codes are assigned by convention instead. The shapes, letter-type main codes, and main positions of the 31 high-frequency coding components are shown in Table 1:
The selection of the above 31 high-frequency coding components and of their letter-type main codes may vary, in number and in main-code assignment, within a range of no more than 40 percent; this affects only the duplicate-code rate and does not change the essence of the coding method;
Three, determining the pinyin-type main code, auxiliary code 1, auxiliary code 2, and auxiliary code 3 of each coding component, forming the pinyin-type coding resources of the Chinese character coding method
The 687 selected coding components are merged into 409 coding component groups; the first coding component in each group is called the dominant-shape coding component, and the main code of every other component in the group is the same as that of the dominant-shape coding component. Each dominant-shape coding component has a definite pronunciation or name. Except for the high-frequency coding components, whose main codes are already fixed, the main code of a dominant-shape coding component is generally the first letter of the pinyin of its pronunciation or name, used as its pinyin-type letter-type main code. The dot-fold I method is adopted: when the first pinyin letter of the pronunciation of a dominant-shape coding component is Y, I is taken as its letter-type main code if its first stroke is a dot (Dian) or a fold (Zhe), and Y is taken if its first stroke is horizontal (Heng), vertical (Shu), or left-falling (Pie). The main code of the dominant-shape coding component of every other common coding component is still the first letter of the pinyin of its pronunciation or name, used as its pinyin-type letter-type main code;
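The main-code rule, including the dot-fold I method, can be written as a short function. The Python sketch below is illustrative only: the function name, the `assigned` parameter for high-frequency components, and the stroke letters H, S, P, D, Z (horizontal, vertical, left-falling, dot, fold) used as input are conventions chosen for this sketch.

```python
def letter_main_code(pinyin, first_stroke, assigned=None):
    """Return the letter-type main code of a dominant-shape coding component.

    pinyin:       pinyin of the component's pronunciation or name.
    first_stroke: one of "H", "S", "P", "D", "Z".
    assigned:     main code fixed in advance (high-frequency components),
                  which overrides the pinyin rule when given.
    """
    if assigned is not None:
        return assigned.upper()
    first_letter = pinyin[0].upper()
    if first_letter == "Y":
        # Dot-fold I method: dot or fold first stroke gives I, otherwise Y.
        return "I" if first_stroke in ("D", "Z") else "Y"
    return first_letter

print(letter_main_code("yi", "D"))    # pinyin starts with Y, first stroke is a dot
print(letter_main_code("yi", "H"))    # pinyin starts with Y, first stroke is horizontal
print(letter_main_code("niao", "P"))  # any other first letter is used as is
```

Splitting the Y components between I and Y by first stroke spreads them over two keys, which is presumably why the rule exists; the text itself does not state the motivation.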
Determination of the pinyin-type letter-type auxiliary code 1 of common and high-frequency coding components, using the preference method: in the pinyin of the pronunciation or name of the dominant-shape coding component, if the initial is j, q, or x and the final is a compound final beginning with i, or the initial is a two-letter initial such as zh, ch, or sh, or the 1st letter is y, the 2nd letter is i, and the final is a nasal final, the 3rd letter of the pinyin is taken as letter-type auxiliary code 1; in all other cases the 2nd letter of the pinyin is taken as letter-type auxiliary code 1. The letter-type auxiliary code 1 of every coding component in a group is the same as that of the group's dominant-shape coding component;
Determination of the pinyin-type letter-type auxiliary codes 2 and 3 of ordinary and high-frequency coding components. According to national regulations, Chinese characters are composed of five kinds of strokes: horizontal (heng), vertical (shu), left-falling (pie), dot (dian) and fold (zhe). The present invention represents them with H, S, P, D and Z respectively, the first pinyin letters of the five stroke names. For each coding component, the letter codes of its 1st and 2nd strokes are taken in order as its letter-type auxiliary code 2 and auxiliary code 3. For a coding component with fewer than 2 strokes, the present invention uses the letter V in place of the missing 2nd stroke.
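The stroke rule above lends itself to a small lookup. The following Python sketch is illustrative only; the names `STROKE_LETTERS` and `auxiliary_codes_2_and_3` are hypothetical, and strokes are passed as their pinyin names.

```python
# Stroke name (pinyin) -> letter code, as listed in the text.
STROKE_LETTERS = {"heng": "H", "shu": "S", "pie": "P", "dian": "D", "zhe": "Z"}
MISSING_STROKE = "V"  # filler for a missing 2nd stroke


def auxiliary_codes_2_and_3(strokes):
    """Return (auxiliary code 2, auxiliary code 3) from a component's stroke order."""
    if not strokes:
        raise ValueError("a coding component needs at least one stroke")
    letters = [STROKE_LETTERS[s] for s in strokes[:2]]
    if len(letters) < 2:
        letters.append(MISSING_STROKE)  # one-stroke component: pad with V
    return letters[0], letters[1]
```

A component written horizontal-then-vertical yields `("H", "S")`, while a one-stroke dot component yields `("D", "V")`.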
The letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the 687 coding components in the 409 coding component groups are determined by the above methods: auxiliary code 1 is determined by the preferential method; a coding component with fewer than 2 strokes is padded with the letter V; and the dot-fold I method is applied where the first pinyin letter of the pronunciation of the dominant-shape coding component is Y. The resulting codes are arranged in order as shown below:
Four, convert to obtain the pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of each coding component, forming the pinyin-type digit-type coding resource
According to national standard GB/T 18031-2000, "Information technology: general requirements for Chinese character input with a digital keyboard", the five stroke classes used in the pinyin-type letter-type auxiliary codes 2 and 3 of the coding components are converted to digit codes according to the "key assignment of Chinese character strokes" in that standard, and not by converting the pinyin initial letter of the stroke name to its corresponding digit. All other letters, namely the letter-type main codes and auxiliary codes 1 of the coding components and the pinyin initial letters, are converted according to the letter-digit correspondence specified in the standard's "10-key layout of Chinese pinyin letter keys" and "8-key layout of Chinese pinyin letter keys". This yields, respectively, the 10-key pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3, and the 8-key pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3, together with the digit code of the pinyin initial letter. For the high-frequency coding components, the converted digit-type main and auxiliary codes are adjusted slightly so that, within each type, the digit combination of main code and auxiliary code 1 is different for every high-frequency coding component; the concrete scheme is shown in Table 6. This conversion scheme is called the letter-stroke differential conversion scheme. The "key assignment of Chinese character strokes" is shown below:
The "10-key layout of Chinese pinyin letter keys" is shown below:
The "8-key layout of Chinese pinyin letter keys" is shown below:
Under the letter-stroke differential conversion scheme, the pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the high-frequency coding components are expressed in order by the corresponding digit keys of the numeric keypad. The digit-type main and auxiliary codes of the 31 high-frequency coding components are arranged as shown below:
Five, combined pinyin-type main-and-auxiliary-code Chinese character and word coding input method
When a Chinese character is decomposed, the pinyin-type main-and-auxiliary-code letter-type radical character coding method uses the radical-based decomposition found in character dictionaries and word dictionaries. All other Chinese character and word coding methods use the general decomposition method, which gives priority to the decomposition that yields coding components with more strokes. Letter-type codes are converted to digit-type codes with the letter-stroke differential conversion scheme, using the letter-digit correspondence specified in the "8-key layout of Chinese pinyin letter keys".
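The idea behind the letter-stroke differential conversion can be illustrated with a short Python sketch: stroke letters go through a stroke table, and every other letter goes through a letter-key table. The two tables below are placeholders assumed purely for illustration (a phone-style letter layout and strokes numbered 1 to 5); they are not taken from this text, whose key-layout tables are not reproduced here. The digit used for the filler letter V is likewise an assumption, and the function name is hypothetical.

```python
# ASSUMED placeholder tables, for illustration only.
STROKE_TO_DIGIT = {"H": "1", "S": "2", "P": "3", "D": "4", "Z": "5",
                   "V": "0"}  # digit for the filler V is an assumption
LETTER_KEYS = {"2": "abc", "3": "def", "4": "ghi", "5": "jkl",
               "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz"}
LETTER_TO_DIGIT = {ch: d for d, letters in LETTER_KEYS.items() for ch in letters}


def to_digit_codes(main, aux1, aux2, aux3):
    """Convert one component's four letter-type codes to digit-type codes.

    Main code and auxiliary code 1 are ordinary letters; auxiliary codes 2
    and 3 are stroke letters and use the stroke table instead.
    """
    return (
        LETTER_TO_DIGIT[main.lower()],
        LETTER_TO_DIGIT[aux1.lower()],
        STROKE_TO_DIGIT[aux2.upper()],
        STROKE_TO_DIGIT[aux3.upper()],
    )
```

With these placeholder tables, the letter "s" used as a main code becomes "7", while the stroke letter "S" used as auxiliary code 2 becomes "2". That difference is the point of the scheme.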
The combined pinyin-type main-and-auxiliary-code Chinese character and word coding input method consists of two parts: the combined pinyin-type main-and-auxiliary-code letter-type Chinese character and word coding input method, and the combined pinyin-type main-and-auxiliary-code digit-type Chinese character and word coding input method. The codes of the two parts are placed in different code tables, which are called by switching.
Part I: combined pinyin-type main-and-auxiliary-code letter-type Chinese character and word coding input method
The combined pinyin-type main-and-auxiliary-code letter-type Chinese character and word coding input method, built on the pinyin-type letter-type coding resource, includes: 1. the pinyin-type pinyin-initial main-and-auxiliary-code letter-type Chinese character coding method; 2. the pinyin-type pinyin-initial main-and-auxiliary-code letter-type word coding method; 3. the pinyin-type non-pinyin-initial main-and-auxiliary-code letter-type Chinese character coding method; 4. the pinyin-type non-pinyin-initial main-and-auxiliary-code letter-type word coding method; 5. the pinyin-type main-and-auxiliary-code letter-type radical Chinese character coding method; 6. the pinyin-type main-and-auxiliary-code letter-type sound-system Chinese character coding method. For the coding input of common Chinese characters, common words, and large character sets such as the more than 27,000 characters of GB18030, combined coding input is used: the pinyin-initial letter-type Chinese character and word coding input methods are used for common characters and words, and the non-pinyin-initial main-and-auxiliary-code letter-type Chinese character coding method is used to obtain codes for the more than 27,000 characters of GB18030. The pinyin-type pinyin-initial letter-type character and word codes and the pinyin-type non-pinyin-initial letter-type character codes are combined in the same code table.
The pinyin-type letter-type Chinese character and word coding methods are as follows:
The following convention is used below. Taking 1 code from a coding component means taking its letter-type main code. Taking 2 codes means taking its letter-type main code and auxiliary code 1 in order. Taking 3 codes means taking its letter-type main code, auxiliary code 1 and auxiliary code 2 in order. Taking 4 codes means taking its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 in order.
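The "take n codes" convention amounts to taking a prefix of a component's four codes. A minimal Python sketch follows; the `Component` type, the function name and the sample values are hypothetical and serve only to illustrate the convention.

```python
from typing import NamedTuple


class Component(NamedTuple):
    """Letter-type codes of one coding component, in their fixed order."""
    main: str
    aux1: str
    aux2: str
    aux3: str


def take_codes(component: Component, n: int) -> str:
    """Take the first n codes (1 to 4) of a coding component."""
    if not 1 <= n <= 4:
        raise ValueError("a coding component supplies between 1 and 4 codes")
    return "".join(component[:n])
```

For a made-up component `Component("m", "u", "h", "s")`, taking 1 code gives "m", taking 2 codes gives "mu", and taking 4 codes gives "muhs".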
(1) Pinyin-type pinyin-initial main-and-auxiliary-code letter-type Chinese character coding method
A, the code length is variable, and the pinyin-type letter-type coding resource is used.
B, coding of a Chinese character that is a single coding component. If the pinyin initial letter of the character is the same as the letter-type main code of the coding component: for a high-frequency coding component, take the letter-type main code and auxiliary code 1 of the component in order; for an ordinary coding component, the four-code method is adopted, taking the letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the component in order. If the pinyin initial letter of the character differs from the letter-type main code and the component is a high-frequency coding component, the substitution three-code method is adopted, taking the pinyin initial letter of the character, then auxiliary code 1 and auxiliary code 2 of the component in order. If the pinyin initial letter differs from the letter-type main code, the component is an ordinary coding component and its letter-type main code is not i, the substitution four-code method is adopted, taking the pinyin initial letter of the character, then the letter-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the component in order. If the pinyin initial letter of the character is y and the letter-type main code of the component is i, the substitution four-code method is likewise adopted, taking the pinyin initial letter of the character, then the letter-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the component in order.
C, a Chinese character composed of 2 or more coding components can be divided into two parts, a head and a remainder.
D, for a Chinese character composed of 2 or more coding components, its letter-type code is determined in the following two parts, in order:
Part 1, take the pinyin initial letter of the character as the letter-type code of part 1.
Part 2, take the codes of the head and the remainder of the character as follows:
For a Chinese character composed of 2 coding components: if the head is a high-frequency coding component in the main-part position, the three-code method is adopted, taking 1 code from the head coding component and then 2 codes from the remainder coding component. If the head coding component is an ordinary coding component, the one-two method is adopted, taking 1 code from the head coding component and then 2 codes from the remainder coding component.
For a Chinese character composed of 3 or more coding components: if the character has a single head and the head is a high-frequency coding component in the main-part position, take 1 code each from the 1st, 2nd and last coding components in order. If the character has a single head and the head is an ordinary coding component, the one-code method is adopted, taking 1 code each from the 1st, 2nd and last coding components in order. If the character has a combined head, the combined head takes 2 codes by the first-last method, that is, 1 code each from the 1st and last coding components of the combined head in order; the remainder is a single coding component and takes 1 code.
The letter-type codes taken in parts 1 and 2 are combined in order into the code of the whole Chinese character.
E, codes are written in lowercase English letters.
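Rule B above (characters that are a single coding component) is essentially a small decision table, sketched here in Python. The function name and all sample codes are hypothetical. The one combination the text does not describe (an ordinary component whose main code is i while the character's pinyin initial is neither i nor y) raises an error instead of guessing.

```python
def encode_single_component(pinyin_initial, main, aux1, aux2, aux3, high_frequency):
    """Letter-type code of a character that is a single coding component."""
    if pinyin_initial == main:
        if high_frequency:
            return main + aux1               # main code + auxiliary code 1
        return main + aux1 + aux2 + aux3     # four-code method
    if high_frequency:
        return pinyin_initial + aux1 + aux2  # substitution three-code method
    if main != "i" or pinyin_initial == "y":
        # substitution four-code method
        return pinyin_initial + aux1 + aux2 + aux3
    raise ValueError("case not described in the text")
```

In every substitution branch the pinyin initial letter replaces the component's main code and the auxiliary codes are kept, which is what lets a reader who knows the pronunciation start typing from the sound.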
(2) Pinyin-type non-pinyin-initial main-and-auxiliary-code letter-type Chinese character coding method
A, the code length is variable, and the pinyin-type letter-type coding resource is used.
B, a Chinese character that is a single coding component: if the component is a high-frequency coding component, take its letter-type main code and letter-type auxiliary code 1 in order. If it is an ordinary coding component, the four-code method is adopted, taking its letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 in order.
C, a Chinese character composed of 2 or more coding components can be divided into two parts, a head and a remainder.
D, for a Chinese character composed of 2 coding components: if the head is a high-frequency coding component in the main-part position, the three-code method is adopted, taking 1 code from the head coding component and then 2 codes from the remainder coding component. If the head coding component is an ordinary coding component, the two-two method is adopted, taking 2 codes from the head coding component and then 2 codes from the remainder coding component.
E, for a Chinese character composed of 3 coding components: if the head is a single head and is a high-frequency coding component in the main-part position, the last-two method is adopted, taking in order 1 code from the high-frequency head component, 1 code from the 1st coding component of the remainder and 2 codes from the 2nd coding component of the remainder. If the head is a single head and is an ordinary coding component, the first-two method is adopted, taking 2 codes from the head coding component and then 1 code each from the 1st and 2nd coding components of the remainder. If the head is a combined head, take 1 code each from the 1st and 2nd coding components of the combined head in order, and 2 codes from the remainder coding component.
F, for a Chinese character composed of 4 or more coding components: if the head is a single head and is a high-frequency coding component in the main-part position, take in order 1 code from the head coding component and 1 code each from the 1st, 2nd and last coding components of the remainder. If the head is a single head and is an ordinary coding component, the first-two method is adopted, taking 2 codes from the head coding component and then 1 code each from the 1st and last coding components of the remainder. If the head is a combined head, the first-second-last method is adopted for the combined head, taking 1 code each from its 1st, 2nd and last coding components in order, and 1 code from the remainder component of the character.
The codes taken from the coding components of the character are arranged in the order in which the components appear in the character, forming the code of the whole Chinese character.
G, codes are written in lowercase English letters.
(3) Pinyin-type pinyin-initial main-and-auxiliary-code letter-type Chinese word coding method
A, the pinyin-type letter-type coding resource is used, and the word code is taken from the character codes obtained with the pinyin-type pinyin-initial Chinese character coding method. The maximum code length of a word code is set to 6.
B, for a word composed of 2 Chinese characters, the two-two method is adopted, taking 2 codes each from the 1st and 2nd characters in order.
C, for a word composed of 3 Chinese characters, the one-one-two method is adopted, taking in order 1 code from the 1st character, 1 code from the 2nd character and 2 codes from the 3rd character.
D, for a word composed of 4 Chinese characters, the four-code method is adopted, taking 1 code each from the 1st, 2nd, 3rd and 4th characters in order.
E, for a word composed of 5 Chinese characters, the five-code method is adopted, taking 1 code each from the 1st, 2nd, 3rd, 4th and 5th characters in order.
F, for a word composed of 6 or more Chinese characters, the sequential-six method is adopted, taking 1 code each from the 1st, 2nd, 3rd, 4th, 5th and 6th characters of the word in order.
G, codes are written in lowercase English letters.
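Rules B to F above reduce to a per-character code count that depends only on the word length. The Python sketch below assumes that "taking k codes from a character" means taking the first k letters of that character's code; the text does not spell this out for this method, and the function name and sample codes are hypothetical.

```python
def word_code(char_codes):
    """Build a letter-type word code from the codes of its characters."""
    n = len(char_codes)
    if n < 2:
        raise ValueError("a word has at least 2 characters")
    if n == 2:
        counts = [2, 2]            # two-two method
    elif n == 3:
        counts = [1, 1, 2]         # one-one-two method
    else:
        counts = [1] * min(n, 6)   # 1 code each, first six characters at most
    # Assumption: k codes of a character = first k letters of its code.
    # zip stops at the shorter list, so characters after the 6th are ignored.
    return "".join(code[:k] for code, k in zip(char_codes, counts)).lower()
```

The result never exceeds 6 letters, in line with the maximum word code length stated in rule A.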
(4) Pinyin-type non-pinyin-initial main-and-auxiliary-code letter-type Chinese word coding method
The non-pinyin-initial coding of Chinese words uses the character codes determined by the pinyin-type non-pinyin-initial main-and-auxiliary-code letter-type Chinese character coding input method. The maximum code length of a word code is set to 6. Taking 2 codes from a Chinese character: if the character is composed of two or more coding components, the first-code method is adopted, that is, the letter-type main codes of the 1st and 2nd coding components of the character are taken in order.
A, for a word composed of 2 Chinese characters, the two-three method is adopted, taking 2 codes from the 1st character of the word and then 3 codes from the 2nd character.
B, for a word composed of 3 Chinese characters, the last-two method is adopted, taking 1 code each from the 1st and 2nd characters of the word and then 2 codes from the 3rd character.
C, for a word composed of 4 Chinese characters, the four-code method is adopted, taking 1 code each from the 1st, 2nd, 3rd and 4th characters of the word in order.
D, for a word composed of 5 Chinese characters, take 1 code each from the 1st, 2nd, 3rd, 4th and 5th characters of the word in order.
E, for a word composed of 6 or more Chinese characters, the sequential-six method is adopted, taking 1 code each from the 1st, 2nd, 3rd, 4th, 5th and 6th characters of the word in order.
G, codes are written in lowercase English letters.
Part II: combined pinyin-type main-and-auxiliary-code digit-type Chinese character and word coding input method
The pinyin-type digit-type coding resource is used to implement combined pinyin-type main-and-auxiliary-code digit-type Chinese character and word coding input. The code length is variable, and the maximum code length is set to 6.
The combined pinyin-type main-and-auxiliary-code digit-type Chinese character and word coding input method, built on the pinyin-type digit-type coding resource, includes: 1. the pinyin-initial-based main-and-auxiliary-code digit-type Chinese character coding method; 2. the pinyin-initial-based main-and-auxiliary-code digit-type word coding method; 3. the non-pinyin-initial-based main-and-auxiliary-code digit-type Chinese character coding method; 4. the non-pinyin-initial-based main-and-auxiliary-code digit-type word coding method. For the coding input of the 6763 common Chinese characters of GB2312, common words (or a large Chinese word set), and large character sets such as the more than 27,000 characters of GB18030, the following combined coding input is used: the pinyin-initial-based digit-type Chinese character and word coding input methods are used for the common character and common word sets, and the codes obtained with the non-pinyin-initial-based main-and-auxiliary-code digit-type Chinese character coding method are used for all characters of GB2312 and of the GB18030-2000 character set. All of these codes are organized in the same code table.
The digit-type Chinese character and word coding methods are as follows:
(5) Pinyin-type pinyin-initial-based main-and-auxiliary-code digit-type Chinese character coding method
A, the pinyin-type digit-type coding resource is used. The code is taken in two parts, and the codes taken in part 1 and part 2 are then combined in order into the code of the whole Chinese character.
B, coding of a Chinese character that is a single coding component. For a character whose coding component is a high-frequency coding component: if the digit code of its pinyin initial letter is the same as the digit-type main code of the high-frequency coding component, take 2 codes from the component, namely its digit-type main code and auxiliary code 1. If they differ, the pinyin-first method is adopted, taking in order the 1 digit code converted from the pinyin initial letter and then 2 codes from the high-frequency coding component. For a character whose coding component is an ordinary coding component: if the digit code of its pinyin initial letter is the same as the digit-type main code of the component, the four-code method is adopted, taking 4 codes from the component in order as the code of the character. If they differ, the substitution method is adopted, taking in order the digit code converted from the pinyin initial letter of the character, then the digit-type auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the component.
C, for a Chinese character composed of 2 coding components, its digit-type code is determined in the following two parts, in order. Part 1: take the one digit code converted from the pinyin initial letter of the character as the code of part 1. Part 2: if the 1st coding component is a high-frequency coding component in the main-part position, the first-two last-three method is adopted, taking 2 codes from the 1st coding component and then 3 codes from the 2nd coding component. If the 1st coding component is an ordinary coding component, the first-three last-two method is adopted, taking 3 codes from the 1st coding component and then 2 codes from the 2nd coding component.
D, for a Chinese character composed of 3 coding components, its digit-type code is determined in the following two parts, in order. Part 1: take the one digit code converted from the pinyin initial letter of the character as the code of part 1. Part 2: if the head is a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two last-two method is adopted, taking in order 2 codes from the 1st coding component, 1 code from the 2nd coding component and 2 codes from the 3rd coding component. If the head is a single head and the 1st coding component is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd and 3rd coding components. If the character has a combined head, the remainder-two method is adopted, taking 1 code each from the 1st and 2nd coding components of the combined head and then 2 codes from the remainder coding component.
E, for a Chinese character composed of four coding components, its digit-type code is determined in the following two parts, in order. Part 1: take the one digit code converted from the pinyin initial letter of the character as the code of part 1. Part 2: if the head is a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two method is adopted, taking 2 codes from the 1st coding component and then 1 code each from the 2nd, 3rd and 4th coding components. If the head is a single head and the 1st coding component is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd and last coding components. If the character has a combined head, the remainder-two method is adopted, taking 1 code each from the 1st, 2nd and 3rd coding components of the combined head and then 2 codes from the remainder coding component.
F, for a Chinese character composed of five or more coding components, its digit-type code is determined in the following two parts, in order:
Part 1: take the one digit code converted from the pinyin initial letter of the character as the code of part 1. Part 2: if the head is a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two method is adopted, taking 2 codes from the 1st coding component and then 1 code each from the 2nd, 3rd and last coding components. If the head is a single head and is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd and last coding components. If the character has a combined head, the front-four method is adopted, taking 1 code each from the 1st, 2nd, 3rd and 4th coding components of the combined head in order and then 1 code from the remainder coding component.
(6) Pinyin-type non-pinyin-initial-based main-and-auxiliary-code digit-type Chinese character coding method
A, the pinyin-type digit-type coding resource is used.
B, for a Chinese character that is a single coding component: if the component is a high-frequency coding component, take 2 codes from it in order. If the component is an ordinary coding component, the four-code method is adopted, taking 4 codes from it in order.
C, a Chinese character composed of 2 or more coding components can be divided into two parts, a head and a remainder.
D, for a Chinese character composed of 2 coding components: if the 1st coding component is a high-frequency coding component in the main-part position, the first-two last-four method is adopted, taking 2 codes from the 1st coding component and then 4 codes from the 2nd coding component. If the 1st coding component is an ordinary coding component, the first-three last-three method is adopted, taking 3 codes from the 1st coding component and then 3 codes from the 2nd coding component.
E, for a Chinese character composed of 3 coding components: if the character has a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two last-three method is adopted, taking in order 2 codes from the 1st coding component, 1 code from the 2nd coding component and 3 codes from the 3rd coding component. If the character has a single head and the 1st coding component is an ordinary coding component, the first-three last-two method is adopted, taking in order 3 codes from the 1st coding component, 1 code from the 2nd coding component and 2 codes from the 3rd coding component. For a character with a combined head, the remainder-four method is adopted, taking 1 code each from the 1st and 2nd coding components of the combined head and then 4 codes from the remainder coding component.
F, for a Chinese character composed of four coding components: if the character has a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two last-two method is adopted, taking in order 2 codes from the 1st coding component, 1 code each from the 2nd and 3rd coding components and 2 codes from the 4th coding component. If the character has a single head and the 1st coding component is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd, 3rd and 4th coding components. For a character with a combined head, the remainder-three method is adopted, taking 1 code each from the 1st, 2nd and 3rd coding components of the combined head and then 3 codes from the remainder coding component.
G, for a Chinese character composed of five coding components: if the character has a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two method is adopted, taking 2 codes from the 1st coding component and then 1 code each from the 2nd, 3rd, 4th and 5th coding components. If the character has a single head and the 1st coding component is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd, 3rd and last coding components. For a character with a combined head, the remainder-two method is adopted, taking 1 code each from the 1st, 2nd, 3rd and 4th coding components of the combined head and then 2 codes from the remainder coding component.
H, for a Chinese character composed of six or more coding components: if the character has a single head and the 1st coding component is a high-frequency coding component in the main-part position, the first-two method is adopted, taking 2 codes from the 1st coding component and then 1 code each from the 2nd, 3rd, 4th and last coding components. If the character has a single head and the 1st coding component is an ordinary coding component, the first-three method is adopted, taking 3 codes from the 1st coding component and then 1 code each from the 2nd, 3rd and last coding components. For a character with a combined head, the sequential remainder-two method is adopted, taking 1 code each from the 1st, 2nd, 3rd and 4th coding components of the combined head and then 2 codes from the remainder coding component.
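Rules B to H of method (6) can be tabulated as "which component contributes how many codes". The Python sketch below is one way to write that table down. The names `FIXED_RULES` and `code_allocation` and the head labels "high", "ordinary" and "combined" are hypothetical, components are indexed from 0, and for combined-head characters the remainder is assumed to be the last coding component.

```python
# (component count, head kind) -> list of (component index, number of codes)
FIXED_RULES = {
    1: {"high": [(0, 2)], "ordinary": [(0, 4)]},
    2: {"high": [(0, 2), (1, 4)], "ordinary": [(0, 3), (1, 3)]},
    3: {"high": [(0, 2), (1, 1), (2, 3)],
        "ordinary": [(0, 3), (1, 1), (2, 2)],
        "combined": [(0, 1), (1, 1), (2, 4)]},
    4: {"high": [(0, 2), (1, 1), (2, 1), (3, 2)],
        "ordinary": [(0, 3), (1, 1), (2, 1), (3, 1)],
        "combined": [(0, 1), (1, 1), (2, 1), (3, 3)]},
    5: {"high": [(0, 2), (1, 1), (2, 1), (3, 1), (4, 1)],
        "ordinary": [(0, 3), (1, 1), (2, 1), (4, 1)],
        "combined": [(0, 1), (1, 1), (2, 1), (3, 1), (4, 2)]},
}


def code_allocation(n_components: int, head: str):
    """Return (component index, code count) pairs in coding order."""
    last = n_components - 1
    if n_components >= 6:
        rules = {
            "high": [(0, 2), (1, 1), (2, 1), (3, 1), (last, 1)],
            "ordinary": [(0, 3), (1, 1), (2, 1), (last, 1)],
            "combined": [(0, 1), (1, 1), (2, 1), (3, 1), (last, 2)],
        }
    else:
        rules = FIXED_RULES.get(n_components, {})
    if head not in rules:
        raise ValueError("no rule for this component count and head kind")
    return rules[head]
```

Written this way, every rule for characters of two or more components adds up to 6 codes, which matches the maximum code length of 6 set for the digit-type methods.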
(7) phonetic class is based on first letter of pinyin major-minor yardage font Chinese word and phrase coding method
Utilize phonetic class major-minor yardage font encoding resource, implement to input Chinese word and phrase coding:
Setting: 1 yard is got to a Chinese character, namely gets the 1st numerical code of the major-minor yardage font encode Chinese characters for computer based on first letter of pinyin of Chinese character, namely get the first letter of pinyin of this Chinese character; 2 yards are got to a Chinese character, namely gets the 1st, the 2nd numerical code of the major-minor yardage font encode Chinese characters for computer based on first letter of pinyin of Chinese character successively; Get 3 yards to a Chinese character, point three kinds of situations get coding: if the Chinese character of single encoded parts, adopt compiling method, according to this Chinese character of encoding scheme only get 2 yards still get 2 yards, be also considered as getting 3 yards; Other then get the 1st, the 2nd, the 3rd number font encoding of this Chinese character successively; If the Chinese character of single stem, then get the 1st of this Chinese character the successively, the numeric type primary key of the 2nd addressable part of the 2nd number font encoding, Chinese character; If the Chinese character of combination stem, adopt sequential method, the 1st, the 2nd, a 3rd sound class of getting combination stem Chinese character is successively encoded based on the numeric type of first letter of pinyin major-minor yardage font encode Chinese characters for computer; Get 4 yards to a Chinese character: if the Chinese character of single encoded parts, adopt compiling method, the encoding scheme of the Chinese character namely adopted, getting several yards is several yards, no longer increases; If the Chinese character of two addressable parts, adopt compiling method, get the numeric type primary key of the numerical code of its first letter of pinyin, the 1st, the 2nd addressable part successively; If the Chinese character of more than three or three addressable parts, then get the numeric type primary key of the numeric type code of first letter of pinyin, the 1st, 
the 2nd, the 3rd addressable part; Concrete coding method is as follows:
A. For a word made up of two Chinese characters: the first-two-last-four method is used, taking in order 2 codes from the 1st character and 4 codes from the 2nd character;
B. For a word made up of three Chinese characters: the 2-1-3 method is used, taking in order 2 codes from the 1st character, 1 code from the 2nd character and 3 codes from the 3rd character;
C. For a word made up of four Chinese characters: the first-two-last-two method is used, taking in order 2 codes from the 1st character, 1 code each from the 2nd and 3rd characters, and 2 codes from the 4th character;
D. For a word made up of five Chinese characters: the first-two method is used, taking in order 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th and 5th characters;
E. For a word made up of six or more Chinese characters: the first-two method is used, taking in order 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th and 5th characters;
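The per-length allocation in rules A to E can be sketched in Python as follows. This is an illustrative sketch rather than part of the claimed method: the function names `code_plan` and `encode_word` and the `take_codes` callback (which stands in for the per-character code-taking rules set out above) are hypothetical.

```python
from typing import Callable


def code_plan(word_length: int) -> list[int]:
    """Number of codes taken from each of the leading characters of a word."""
    if word_length < 2:
        raise ValueError("a word needs at least two characters")
    if word_length == 2:
        return [2, 4]        # rule A: first-two-last-four
    if word_length == 3:
        return [2, 1, 3]     # rule B: 2-1-3
    if word_length == 4:
        return [2, 1, 1, 2]  # rule C: first-two-last-two
    return [2, 1, 1, 1, 1]   # rules D and E: only the first five characters count


def encode_word(word: str, take_codes: Callable[[str, int], str]) -> str:
    """Concatenate the codes of each character according to the plan.

    take_codes(character, n) is a hypothetical callback that returns the
    n codes of one character under the per-character rules.
    """
    plan = code_plan(len(word))
    # zip stops at the end of the plan, so characters after the fifth are skipped
    return "".join(take_codes(ch, n) for ch, n in zip(word, plan))
```

Under these rules every plan allots six codes in total, whatever the length of the word, and characters after the fifth contribute nothing.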
(8) Pinyin-type non-pinyin-initial main-auxiliary-code digit-type Chinese word coding method
The pinyin-type main-auxiliary-code digit-type coding resources are used to encode Chinese words for input:
Pinyin-type non-pinyin-initial main-auxiliary-code digit-type Chinese word coding method: setting: taking 1 code from a Chinese character means taking the pinyin-type digit-type main code of the 1st coding component of the character; taking 2 codes from a Chinese character: if the character is a single coding component, take in order the digit-type main code and auxiliary code 1 of that coding component; if the character has a single stem, take in order 1 code each from the 1st and 2nd coding components of the character; if the character has a combined stem, the first-and-remainder method is used, taking in order the digit-type main code of the 1st coding component and the digit-type main code of the coding component of the remaining part; taking 3 codes from a Chinese character: if the character is a single coding component, take in order the digit-type main code, auxiliary code 1 and auxiliary code 2 of that coding component; if the character has a single stem and is made up of two coding components, take in order the digit-type main code of the 1st coding component and the digit-type main code and auxiliary code 1 of the 2nd coding component; if the character has a single stem and is made up of three or more coding components, take in order 1 code each from the 1st, 2nd and 3rd coding components of the character; if the character has a combined stem, the first-and-remainder method is used, taking in order 1 code each from the 1st and 2nd coding components of the combined stem and from the coding component of the remaining part; taking 4 codes from a Chinese character: for a character that is a single coding component, take the digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of that coding component; for a character of two coding components, the one-three method is used, taking in order 1 code from the 1st coding component and 3 codes from the 2nd coding component; for a character of three coding components whose stem is a single stem, take in order 1 code each from the 1st and 2nd coding components and 2 codes from the 3rd coding component; for a character of four or more coding components, the sequence-and-last method is used, taking in order 1 code each from the 1st, 2nd, 3rd and last coding components; for a combined-stem character of three coding components, take in order 1 code each from the 1st and 2nd coding components of the combined stem and 2 codes from the coding component of the remaining part; for a combined-stem character of four or more coding components, the sequence-and-last method is used, taking in order 1 code each from the 1st, 2nd, 3rd and last coding components; the specific coding method is as follows:
A. For a word made up of two Chinese characters: the first-two-last-four method is used, taking in order 2 codes from the 1st character and 4 codes from the 2nd character;
B. For a word made up of three Chinese characters: the 2-1-3 method is used, taking in order 2 codes from the 1st character, 1 code from the 2nd character and 3 codes from the 3rd character;
C. For a word made up of four Chinese characters: the first-two-last-two method is used, taking in order 2 codes from the 1st character, 1 code each from the 2nd and 3rd characters, and 2 codes from the 4th character;
D. For a word made up of five Chinese characters: the first-two method is used, taking in order 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th and 5th characters;
E. For a word made up of six or more Chinese characters: the first-two method is used, taking in order 2 codes from the 1st character and 1 code each from the 2nd, 3rd, 4th and 5th characters.
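As an illustration of the 4-code case in the setting of method (8), the following Python sketch maps the number of coding components of a character to the number of codes taken from each component. It assumes, hypothetically, that each component is supplied as a 4-symbol string holding its digit-type main code and auxiliary codes 1 to 3 in order, so that taking k codes from a component means taking its first k symbols; `four_code_plan` and `take_four_codes` are illustrative names and the sample digit strings are invented.

```python
def four_code_plan(n_components: int) -> list[tuple[int, int]]:
    """Return (component index, codes taken) pairs for the 4-code case."""
    if n_components < 1:
        raise ValueError("a character has at least one coding component")
    if n_components == 1:
        return [(0, 4)]                  # main code, auxiliary codes 1, 2, 3
    if n_components == 2:
        return [(0, 1), (1, 3)]          # one-three method
    if n_components == 3:
        return [(0, 1), (1, 1), (2, 2)]  # same split for single and combined stems
    # four or more components: 1st, 2nd, 3rd and last, 1 code each
    return [(0, 1), (1, 1), (2, 1), (n_components - 1, 1)]


def take_four_codes(components: list[str]) -> str:
    """components[i] is the (assumed) 4-symbol code string of component i."""
    plan = four_code_plan(len(components))
    return "".join(components[i][:k] for i, k in plan)


# Invented sample data, only to show the shape of the result
print(take_four_codes(["1234", "5678"]))  # 1 code from the 1st, 3 from the 2nd
```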
2. The combined pinyin-type main-auxiliary-code Chinese character and word coding input method according to claim 1, characterized in that, when a Chinese character is decomposed, the decomposition in which the coding component split off first has fewer strokes is chosen.
3. The combined pinyin-type main-auxiliary-code Chinese character and word coding input method according to claim 1, characterized in that letter-type codes are converted to digit-type codes using the "full-letter-type conversion scheme"; for the conversion of letter-type codes to digit-type codes under the "8-key Chinese pinyin letter key-position setting", in the conversion of strokes the pinyin initial letter of the left-falling stroke "Pie" is set to the digit 1 and the others are unchanged, still being converted to digit codes from the pinyin initial letter of the stroke name according to the standard; that is, solely according to the letter-to-digit correspondence of the "10-key Chinese pinyin letter key-position setting" and the "8-key Chinese pinyin letter key-position setting" of the GB/T 18031 standard, the letter-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of each coding component set out above are converted one-to-one into the pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the 10-key method and of the 8-key method respectively; for the high-frequency coding components, the digit-type codes obtained by the above conversion are adjusted slightly so that, within the same type, the digit combinations of the digit-type main code and auxiliary code 1 of the high-frequency coding components all differ from one another;
The pinyin-type digit-type main code, auxiliary code 1, auxiliary code 2 and auxiliary code 3 of the high-frequency coding components under the "full-letter-type conversion scheme" are expressed in order with the corresponding digit keys of the numeric keypad; the pinyin-type digit-type main and auxiliary codes of the 31 high-frequency coding components are arranged in the specific scheme shown below:
4. The combined pinyin-type main-auxiliary-code Chinese character and word coding input method according to claim 1, characterized in that letter-type codes are converted to digit-type codes using the "letter-stroke different-type conversion scheme"; letter-type codes are converted into digit-type codes using the letter-to-digit correspondence specified by the "10-key Chinese pinyin letter key-position setting".
5. The combined pinyin-type main-auxiliary-code Chinese character and word coding input method according to claim 2, characterized in that letter-type codes are converted to digit-type codes using the "full-letter-type conversion scheme"; letter-type codes are converted into digit-type codes using the letter-to-digit correspondence specified by the "8-key Chinese pinyin letter key-position setting".
6. The combined pinyin-type main-auxiliary-code Chinese character and word coding input method according to claim 2, characterized in that letter-type codes are converted to digit-type codes using the "full-letter-type conversion scheme"; letter-type codes are converted into digit-type codes using the letter-to-digit correspondence specified by the "10-key Chinese pinyin letter key-position setting".
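The letter-to-digit conversion described in claims 3 to 6 amounts to a table lookup, which can be sketched in Python as follows. The sketch is only an illustration under stated assumptions: the key table `KEYPAD_8` uses the common telephone-keypad letter grouping as a placeholder and is not copied from GB/T 18031, so the table of the standard should be substituted before any real use; the function name `to_digit_code` and the sample codes are hypothetical. The special case in which the pinyin initial letter of the stroke Pie becomes the digit 1 follows claim 3.

```python
# Placeholder key table (assumption): common telephone-keypad grouping,
# NOT taken from GB/T 18031. Replace with the table of the standard.
KEYPAD_8 = {
    "2": "ABC", "3": "DEF", "4": "GHI", "5": "JKL",
    "6": "MNO", "7": "PQRS", "8": "TUV", "9": "WXYZ",
}
LETTER_TO_DIGIT = {
    letter: digit for digit, letters in KEYPAD_8.items() for letter in letters
}


def to_digit_code(name_letters: str, stroke_initials: str) -> str:
    """Convert a letter-type code to a digit-type code.

    name_letters: letter-type main code and auxiliary code 1 (from the
        pronunciation or name of the coding component).
    stroke_initials: auxiliary codes 2 and 3 (pinyin initials of the
        first and second stroke names).
    """
    digits = [LETTER_TO_DIGIT[c] for c in name_letters.upper()]
    for initial in stroke_initials.upper():
        # Claim 3: the initial of the stroke "Pie" is set to the digit 1;
        # all other stroke initials are converted through the key table.
        digits.append("1" if initial == "P" else LETTER_TO_DIGIT[initial])
    return "".join(digits)


# Invented sample: main code P, auxiliary code 1 I, strokes Pie then Dian
print(to_digit_code("PI", "PD"))
```

Keeping the stroke rule separate from the table lookup means the same function can be reused with a different key table, for example one for the 10-key setting, without touching the special case.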
7. The combined pinyin-type main-auxiliary-code Chinese character and word input method according to any one of claims 1, 2, 3, 4, 5 and 6, characterized in that, for dominant-shape coding components whose pronunciation has the pinyin initial letter Y, the dot-fold Y method is used: among these dominant-shape coding components, those whose first stroke is a dot (Dian) or a fold (Ya) still take Y as their letter-type main code, and those whose first stroke is a horizontal (一), a vertical (Shu) or a left-falling stroke (Pie) take I as their letter-type main code.
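The dot-fold Y rule of claim 7 is a two-way split on the first stroke, which a minimal Python sketch can make explicit. The function name and the English stroke labels used as keys are hypothetical choices made for this illustration.

```python
def main_code_for_y_component(first_stroke: str) -> str:
    """Letter-type main code of a component whose pronunciation starts with Y.

    first_stroke is one of the (illustrative) labels
    "dot", "fold", "horizontal", "vertical", "left-falling".
    """
    if first_stroke in {"dot", "fold"}:
        return "Y"  # dot or fold as first stroke: keep Y
    if first_stroke in {"horizontal", "vertical", "left-falling"}:
        return "I"  # horizontal, vertical or left-falling: use I instead
    raise ValueError(f"unknown first stroke: {first_stroke!r}")
```

Splitting the Y components across two letters in this way reduces the number of components that share the same main code.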
8. The combined pinyin-type main-auxiliary-code Chinese character and word input method according to any one of claims 1, 2, 3, 4, 5, 6 and 7, characterized in that, in the pinyin-type pinyin-initial main-auxiliary-code letter-type Chinese character coding method, a Chinese character made up of 2 or more coding components can be divided into two parts, the stem and the remaining part; for a Chinese character made up of 2 coding components: if the coding component of the stem is an ordinary coding component, the one-three method is used, taking in order 1 code from the coding component of the stem and 3 codes from the coding component of the remaining part; for a Chinese character made up of 3 coding components: if the stem is a single stem and is also an ordinary coding component, the last-two method is used, taking in order 1 code from the coding component of the stem, 1 code from the 1st coding component of the remaining part and 2 codes from the 2nd coding component of the remaining part; for a Chinese character made up of 4 or more coding components: if the stem is a single stem and is also an ordinary coding component, the one-code method is used, taking in order 1 code from the coding component of the stem and 1 code each from the 1st, 2nd and last coding components of the remaining part.
9. The combined pinyin-type main-auxiliary-code Chinese character and word input method according to any one of claims 1, 2, 3, 4, 5, 6, 7 and 8, characterized in that the pinyin-type non-pinyin-initial main-auxiliary-code letter-type Chinese character coding method is replaced by the pinyin-type main-auxiliary-code letter-type radical Chinese character coding method, which is specifically as follows:
A. The pinyin-type main-auxiliary-code letter-type coding resources are used, and Chinese characters are decomposed by the radical decomposition method; the code length is variable;
B. For the coding component of the radical that serves as the stem of a Chinese character, the three-code method is used: every radical coding component takes 3 codes, namely the letter-type main code, auxiliary code 1 and auxiliary code 2 of the coding component of the radical;
C. For the coding components of the remaining part of a Chinese character, the three-code method is used: if the remaining part is a single coding component, take 3 codes from that coding component; if the remaining part has two coding components, take in order 1 code from the 1st coding component and 2 codes from the 2nd coding component; if the remaining part is made up of three or more coding components, take in order 1 code each from its 1st, 2nd and last coding components;
D. For the coding component of a radical stem that has no remaining part, the three-code method is used, taking 3 codes from this coding component, namely its letter-type main code, auxiliary code 1 and auxiliary code 2;
E. The codes of the stem and of the remaining part of the Chinese character are combined in order to form the code of the whole Chinese character.
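Steps A to E of the radical method can be sketched in Python as follows. The sketch is illustrative only: it assumes that each coding component is supplied as a 4-letter string holding its letter-type main code and auxiliary codes 1 to 3 in order, and that taking k codes from a component means taking its first k letters (the claim states this explicitly only for the 3-code case). The names `remainder_plan` and `encode_by_radical` and the sample letter strings are hypothetical.

```python
def remainder_plan(n_components: int) -> list[tuple[int, int]]:
    """(component index, codes taken) pairs for the remaining part (step C)."""
    if n_components == 0:
        return []                        # step D: the radical stands alone
    if n_components == 1:
        return [(0, 3)]
    if n_components == 2:
        return [(0, 1), (1, 2)]
    # three or more components: 1st, 2nd and last, 1 code each
    return [(0, 1), (1, 1), (n_components - 1, 1)]


def encode_by_radical(radical: str, remainder: list[str]) -> str:
    """Combine the radical code and the remainder code in order (step E)."""
    stem_code = radical[:3]              # step B: main code, auxiliary codes 1, 2
    rest_code = "".join(
        remainder[i][:k] for i, k in remainder_plan(len(remainder))
    )
    return stem_code + rest_code


# Invented sample data: a radical plus a two-component remaining part
print(encode_by_radical("ABCD", ["EFGH", "IJKL"]))
```

With these rules a character receives 3 codes when the radical stands alone and 6 codes otherwise, which is one way to read the statement that the code length is variable.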
10. The combined pinyin-type main-auxiliary-code Chinese character and word input method according to any one of claims 1, 2, 3, 4, 5, 6, 7 and 8, characterized in that the pinyin-type non-pinyin-initial main-auxiliary-code letter-type Chinese character coding method is replaced by the pinyin-type main-auxiliary-code letter-type phonetic-series Chinese character coding method, which is specifically as follows:
A. The pinyin-type main-auxiliary-code letter-type coding resources are used, and Chinese characters are decomposed by the phonetic-component decomposition method; the code length is variable;
B. For coding the phonetic component, the four-code method is used: for a phonetic component that is a single coding component, take 4 codes from that coding component; for a phonetic component made up of two coding components, the first-two method is used, taking in order 2 codes each from its 1st and 2nd coding components; for a phonetic component made up of three coding components, the first-one method is used, taking in order 1 code each from the 1st and 2nd coding components and 2 codes from the 3rd coding component; for a phonetic component made up of four or more coding components, take in order 1 code each from its 1st, 2nd, 3rd and last coding components;
C. For coding the semantic component (shape symbol), the two-code method is used: if the semantic component is a single coding component, take 2 codes from it; if the semantic component is made up of two or more coding components, take in order 1 code each from its 1st and last coding components;
D. For coding a phonetic component that has no semantic component, the four-code method is used: for a phonetic component that is a single coding component, take 4 codes from that coding component; for a phonetic component made up of two coding components, the first-two method is used, taking in order 2 codes each from its 1st and 2nd coding components; for a phonetic component made up of three coding components, the first-one method is used, taking in order 1 code each from the 1st and 2nd coding components and 2 codes from the 3rd coding component; for a phonetic component made up of four or more coding components, take in order 1 code each from its 1st, 2nd, 3rd and last coding components;
E. The code taken from the phonetic component comes first and the code taken from the semantic component comes after it, forming in order the code of the whole Chinese character.
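The phonetic-series method of claim 10 can be sketched in Python in the same spirit. The sketch assumes, hypothetically, that each coding component is given as a 4-letter string (letter-type main code followed by auxiliary codes 1 to 3) and that taking k codes means taking the first k letters; the function names and the sample strings are invented for illustration.

```python
def phonetic_plan(n: int) -> list[tuple[int, int]]:
    """Four-code method for the phonetic component (steps B and D)."""
    if n < 1:
        raise ValueError("the phonetic component has at least one coding component")
    if n == 1:
        return [(0, 4)]
    if n == 2:
        return [(0, 2), (1, 2)]
    if n == 3:
        return [(0, 1), (1, 1), (2, 2)]
    return [(0, 1), (1, 1), (2, 1), (n - 1, 1)]  # 1st, 2nd, 3rd and last


def semantic_plan(n: int) -> list[tuple[int, int]]:
    """Two-code method for the semantic component (step C)."""
    if n == 0:
        return []                  # step D: no semantic component
    if n == 1:
        return [(0, 2)]
    return [(0, 1), (n - 1, 1)]    # 1st and last, 1 code each


def encode_phonetic_series(phonetic: list[str], semantic: list[str]) -> str:
    """Phonetic code first, semantic code after it (step E)."""
    head = "".join(phonetic[i][:k] for i, k in phonetic_plan(len(phonetic)))
    tail = "".join(semantic[i][:k] for i, k in semantic_plan(len(semantic)))
    return head + tail


# Invented sample data: two-component phonetic part, one-component semantic part
print(encode_phonetic_series(["ABCD", "EFGH"], ["IJKL"]))
```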
CN201410288523.0A 2014-06-24 2014-06-24 Combined type phonetic class major-minor code Chinese character, word coded input method and its keyboard Expired - Fee Related CN105204657B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410288523.0A CN105204657B (en) 2014-06-24 2014-06-24 Combined type phonetic class major-minor code Chinese character, word coded input method and its keyboard

Publications (2)

Publication Number Publication Date
CN105204657A (en) 2015-12-30
CN105204657B (en) 2018-02-23

Family

ID=54952390

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410288523.0A Expired - Fee Related CN105204657B (en) 2014-06-24 2014-06-24 Combined type phonetic class major-minor code Chinese character, word coded input method and its keyboard

Country Status (1)

Country Link
CN (1) CN105204657B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1515988A (en) * 2003-01-01 2004-07-28 黄振荣 Phonetic, form and meaning Chinese character code input method
CN1604017A (en) * 2003-09-29 2005-04-06 刘君度 Chinese character characterized location encoding combination input method based on one-key -for-one-character
US20050185849A1 (en) * 2004-02-16 2005-08-25 Yongmin Wang Six-Code-Element Method of Numerically Encoding Chinese Characters And Its Keyboard

Also Published As

Publication number Publication date
CN105204657B (en) 2018-02-23

Similar Documents

Publication Publication Date Title
CN105302330A (en) Combined phonetic and stroke type main and auxiliary code Chinese character and word and phrase coding input method and keyboard adopting method
CN102750000A (en) Binary syllabification input method
CN105278697B (en) Combined double-spelling class major-minor code Chinese character, word coded input method and its keyboard
CN101551711A (en) Chinese character coding input method based on structure and primitive
CN101488057B (en) Combined coding technique
CN105204657A (en) Combined pinyin type main and auxiliary code Chinese character and word coding input method and keyboard thereof
CN105320291A (en) Combined pronunciation and meaning type main and auxiliary code Chinese character and word and expression coding inputting method and keyboard thereof
CN104133560B (en) Double class major-minor code Chinese characters of combined type, word coded input method and its keyboard
CN102253726A (en) Method for inputting Chinese word digital strokes of computer and keyboard technology
CN102511021A (en) Number-order-code-element keyboard and information input method thereof
CN100361057C (en) Chinese character input method using small keyboard of computer keyboard
CN1908870B (en) Method and keyboard for mixed inputting English and Chinese characters with single button and multiple buttons
CN106959764B (en) A kind of code input method facilitating correct writing Chinese characters
CN102073382A (en) Stroke, main and auxiliary radical input method
CN102375558A (en) Computer Chinese character rapid-code five-stroke input method
CN104133556B (en) Double-stroke type main and auxiliary code letter type radical dictionary and sonic dictionary Chinese character coding input method and keyboard adopting method
CN113253853B (en) Chinese character input method for computer and mobile phone
CN104238765B (en) Students in middle and primary schools' keyboard marks phonetic code inputting method
CN101470535A (en) Optimized Chinese character code input method
CN1204487C (en) Chinese character input method based on code of radicals and sound
CN106325540A (en) Simplified input method of northeast Yunnan sub-dialect Miao language and application of simplified input method
CN1125393C (en) Chinese character encoding and inputting method and keyboard
CN105278696A (en) Sound-stroke main-auxiliary code letter type Chinese character encoding input method for radical dictionary and acoustic system dictionary, and keyboard thereof
CN1063856C (en) Keyboard and method for computer input of character-separated phonetic transcriptions
CN105320290A (en) Pronunciation and meaning type main and auxiliary code letter radical dictionary and sonic system dictionary Chinese character encoding input method and keyboard thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180223

Termination date: 20180624
