CN1268690A - Dictionary code Chinese character input method - Google Patents
Dictionary code Chinese character input method
- Publication number
- CN1268690A (application number CN 00112270)
- Authority
- CN
- China
- Prior art keywords
- word
- press
- code
- letter key
- Chinese character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Document Processing Apparatus (AREA)
Abstract
The dictionary code Chinese character input method is also a Chinese character retrieval method. Each Chinese character is decomposed into its structural components, each component is assigned a code, and the component codes together form the code of the character, which is at most four codes long.
Description
The present invention relates to a computer input method for Chinese characters that also serves as a retrieval (lookup) method for Chinese characters, specifically a dictionary code Chinese character input method.
Many computer input methods for Chinese characters exist in the prior art. Their designs mainly aim at high input speed, ease of learning, and a low rate of duplicate codes. They largely ignore how an input method relates to the way primary and secondary school students learn Chinese characters, and how it relates to the arrangement and lookup of characters in dictionaries. In particular, most schemes that encode characters by splitting their structure use ad hoc splitting rules that often conflict with the character standards issued by the relevant national authorities, which harms people's grasp of character structure and stroke order. As computers come into widespread use, prolonged "writing" with a keyboard weakens the ability to write by hand, and many people who work at computers find that they "pick up the pen and forget the character". This is especially harmful to the many primary and secondary school students who are still learning to write. Computer education should start in childhood, yet long-term keyboard input conflicts with learning Chinese characters, and existing input methods do not resolve this conflict.
In addition, students learn dictionary lookup methods at the same time as they learn input methods, but current input methods are unrelated to dictionary lookup. With the lookup methods of existing dictionaries, finding the meaning of a character by its radical takes three separate lookups, which is very inconvenient. All of this adds to students' burden. For adults, too, looking up a character in a dictionary is tedious, which reduces people's interest in learning and using characters.
The task of the present invention is to overcome the shortcomings of prior-art Chinese character input and retrieval methods by providing a character coding method that offers high input speed, is easy to learn, has a low rate of duplicate codes, and conforms to national standards, with coding rules identical to the rules by which primary and secondary school students learn characters. The components of a character and their codes can be "spelled out", in the same way that people customarily say "bow-long Zhang; wood-child Li" (弓长张，木子李), so that the code can be read off from the character. Students who learn this input method thereby reinforce their memory and grasp of character structure. The coding method can also be used to arrange the characters in a dictionary, which makes dictionary lookup easier and organically combines three things students must learn: the characters themselves, keyboard input, and dictionary lookup.
The method is based on the language standard GF 3001-1997, "Chinese Character Component Standard of the GB 13000.1 Character Set for Information Processing", issued by the State Language Commission in 1997 and implemented in 1998, and on the "Stroke Order Standard of Commonly Used Modern Chinese Characters", issued and implemented by the State Language Commission in 1997. Decomposition under this method never reverses the stroke order. The choice of components and the splitting rules follow these standards, fit the nature of Chinese characters, and match people's basic understanding of them.
The technical solution rests on the following principle: first decompose the Chinese character into its components, then encode the components, and finally take the component codes as the code of the character (at most four codes). The details are as follows:
One. Components of Chinese characters
A Chinese character can be decomposed into strokes or components. These are classified and encoded as follows:
1. A component that is itself a character (a character-forming component) takes the initial letter of its pinyin as its code. For example, the component "person" (人, ren) is a character, so its code is R. A component that is not itself a character takes the pinyin initial of the character in its name that reflects its main feature. For example, the component 亻 is named "single-person side"; its feature character is "person" (人), whose pinyin initial is R, so the code of 亻 is R.
2. Components are divided into two classes, crossable and non-crossable, 219 components (groups) in total, coded with 21 letters. See "Chinese character component coding table 1" (mainly for character input) and "Chinese character component coding table 2" (mainly for character indexing) for details. There are 9 crossable components (groups); the strokes of any of these 9 components may cross those of another. They are: [component glyphs not reproduced]
Among them:
the fold stroke, which covers every folded stroke except the lifting hook, is coded Z;
the character-forming component "mouth" (口) is coded K;
the character-forming component "order" (variant "saying") is coded R.
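The coding rule for a single component can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the table `FEATURE_PINYIN` is a hypothetical four-entry excerpt rather than the patent's coding table, and the glyphs 人, 亻, 口 and 川 are our reading of the translated examples ("person", "single-person side", "mouth", "river").

```python
# Hypothetical excerpt of a component table. Each entry maps a component to the
# pinyin of the character that determines its code: the component itself if it
# is a character, otherwise the feature character named in the component's name.
FEATURE_PINYIN = {
    "人": "ren",    # character-forming component, read "ren"
    "亻": "ren",    # "single-person side": its feature character is 人
    "口": "kou",    # character-forming component "mouth"
    "川": "chuan",  # character-forming component "river"
}


def component_code(component: str) -> str:
    """Return the one-letter code: the upper-cased pinyin initial."""
    return FEATURE_PINYIN[component][0].upper()


for part in ("人", "亻", "口", "川"):
    print(part, component_code(part))
```

Storing the pinyin rather than the letter keeps the derivation of each code visible; a real table would simply list the 219 components with their letters.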
There are 210 non-crossable components (groups). The strokes of such a component cannot cross those of any other component. For example, "person" (人) is non-crossable, so the character "inside" (内) cannot be split into 冂 and "person"; it must instead be split with crossable components into 冂, "Pie" (the left-falling stroke) and a dot, coded KPD. The dot and the right-falling stroke ("Dian") are treated as attached to another component rather than as crossing it. For example, the character "scold" (斥) should be split into "jin" (斤) plus "Dian"; the character "history" (史) should be split into [glyphs not reproduced].
Two. Splitting Chinese characters
1. A single-stroke character needs no splitting; its code is the code of its one component. For example, "one" (一) is a single horizontal stroke, coded H, and "second" (乙) is a single fold stroke, coded Z.
2. A multi-stroke character must be split into two or more components. Where adjacent components have crossing strokes, the split uses the 9 crossable components. When a multi-stroke character can be split in several ways, the following priorities apply, in order:
a. No reversed stroke order between adjacent components (a split must never require writing the character out of stroke order).
b. Larger component first (of two adjacent components, prefer the split that yields the larger component).
c. Larger character-forming component first (when the largest components have the same stroke count, prefer the split whose character-forming component is larger).
d. Earlier position first (when the largest components have the same stroke count, and the character-forming components have the same stroke count or no component forms a character, prefer the split in which the largest component appears first).
3. The general procedure is to start from the largest components and look them up. If the largest component is not in the "dictionary code component coding table", split it into smaller components. Where a component has strokes crossing an adjacent component, split preferentially with the nine crossable components. For example, "suitable" (顺) is first decomposed into "river" (川) and "page" (页). According to component coding table 1 or 2, "river" is coded C, but "page" is not in the table, so "page" is split further into "Myeon" and "shellfish" (贝), coded M and B respectively. The character thus decomposes into three components and is coded CMB. As another example, "well" (井) consists of crossable components; split with the 9 crossable components, it becomes "two, Pie, Shu" and is coded HPS.
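The general procedure in item 3 is a recursive table lookup, sketched below under stated assumptions. `CODE_TABLE` and `SPLITS` are hypothetical excerpts holding only the 顺 example from the text; the splits are given as data, so the priority rules a to d are not implemented here. The string "Myeon" stands in for the component the translation calls "Myeon", whose glyph is not recoverable from this text.

```python
from typing import Dict, List

# Hypothetical excerpt of the component coding table (component -> code).
CODE_TABLE: Dict[str, str] = {"川": "C", "Myeon": "M", "贝": "B"}

# Hypothetical split data: how a glyph that is NOT in the table divides into
# smaller pieces, listed in stroke order.
SPLITS: Dict[str, List[str]] = {
    "顺": ["川", "页"],
    "页": ["Myeon", "贝"],
}


def decompose(glyph: str) -> List[str]:
    """Split a glyph until every piece is a component in CODE_TABLE."""
    if glyph in CODE_TABLE:
        return [glyph]
    parts: List[str] = []
    for piece in SPLITS[glyph]:  # KeyError means no split data is available
        parts.extend(decompose(piece))
    return parts


def component_codes(char: str) -> str:
    """Concatenate the codes of a character's components in stroke order."""
    return "".join(CODE_TABLE[part] for part in decompose(char))


print(decompose("顺"))        # components in stroke order
print(component_codes("顺"))  # their concatenated codes
```

Keeping the splits as data separates the lookup logic from the linguistic judgement about which split is correct.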
Three. Computer coding of Chinese characters
The code of a character is formed from the codes of its 1st, 2nd, 3rd and last components, so the maximum code length is four. When the scheme is used as a retrieval method, a character need not have four codes. When it is used as an input method and a character has fewer than four component codes, the first letter of the character's pinyin is appended. If the code is still shorter than four after that, or if the pinyin letter cannot be supplied because the pronunciation is unknown, the space bar is pressed. For example, the component code of "open" (张, zhang) is GC; appending its pinyin initial Z gives the dictionary code GCZ. Likewise, the computer code of "one" (一) is HY.
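The rule above can be sketched as a small function. This is an illustrative sketch, not the patent's implementation: the names `character_code` and `keystrokes` are ours, the component codes are passed in as a ready-made string, and the code "ABCDE" is an invented five-component example.

```python
from typing import Optional

MAX_CODES = 4  # maximum code length stated in the text


def character_code(component_codes: str, pinyin: Optional[str] = None) -> str:
    """Build a character code from its component codes (in stroke order).

    With more than four components, keep the 1st, 2nd, 3rd and last codes.
    In input mode (pinyin given), pad a short code with the pinyin initial.
    In retrieval mode (pinyin omitted), a short code is left as it is.
    """
    if len(component_codes) > MAX_CODES:
        code = component_codes[:3] + component_codes[-1]
    else:
        code = component_codes
    if pinyin and len(code) < MAX_CODES:
        code += pinyin[0].upper()
    return code


def keystrokes(code: str) -> str:
    """Keys typed for one character: the code, then a space if it is short."""
    return code if len(code) >= MAX_CODES else code + " "


print(character_code("GC", "zhang"))  # the 张 example from the text
print(character_code("H", "yi"))      # the 一 example from the text
print(character_code("ABCDE"))        # invented five-component code
```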
The scheme also provides a fuzzy key, V, which acts as a wildcard key.
The rate of duplicate codes in this scheme is very low. When duplicates do occur, the candidate characters are listed on screen and the user presses the number key shown next to the desired character.
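A candidate lookup with the wildcard key and number-key selection might look like the sketch below. The text does not define the exact matching semantics of V, so treating it as "matches any single code letter" is an assumption. The lexicon is hypothetical: the first three codes come from examples in the text, while "char-1" and "char-2" are placeholders invented to show a duplicate code.

```python
from typing import List, Tuple

# Hypothetical lexicon of (code, character) pairs in on-screen display order.
LEXICON: List[Tuple[str, str]] = [
    ("GCZ", "张"),
    ("HY", "一"),
    ("KWDH", "国"),
    ("ABCD", "char-1"),  # placeholder entries sharing one code
    ("ABCD", "char-2"),
]


def matches(pattern: str, code: str) -> bool:
    """Assumed semantics: V in the typed pattern matches any single letter."""
    return len(pattern) == len(code) and all(
        p == "V" or p == c for p, c in zip(pattern, code)
    )


def candidates(pattern: str) -> List[str]:
    """Characters whose code matches the typed pattern, in display order."""
    return [char for code, char in LEXICON if matches(pattern, code)]


def pick(cands: List[str], digit: int) -> str:
    """Resolve duplicate codes: number key 1 selects the first candidate."""
    if not 1 <= digit <= len(cands):
        raise ValueError("no candidate with that number")
    return cands[digit - 1]


print(candidates("GCV"))
print(pick(candidates("ABCD"), 2))
```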
Four. Phrase coding
Two-character phrase: the first two codes of each character.
Three-character phrase: the first code of each of the first two characters, then the first two codes of the last character.
Four-character phrase: the first code of each character.
Phrase of more than four characters: the first codes of the first three characters, then the first code of the last character.
"Chinese character component coding table 1" and "Chinese character component coding table 2" of this scheme are appended after the embodiments of this specification.
Following the above principles, the invention is operated on a standard computer keyboard as follows. A dictionary code Chinese character input method, the steps of which are:
A. Following the standard stroke order for writing the character, press in turn the letter keys for the codes of its 1st, 2nd, 3rd and last components.
B. For a character with fewer than four component codes, then press the key for the first letter of its pinyin, or the V key; if the code is still shorter than four, press the space bar afterwards.
C. When duplicate codes occur, press the number key.
The steps for entering a phrase are:
Two-character phrase: press the letter key for the first code of the first character,
press the letter key for the second code of the first character,
press the letter key for the first code of the second character,
press the letter key for the second code of the second character.
Three-character phrase: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the second code of the third character.
Four-character phrase: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the first code of the fourth character.
Phrase of more than four characters: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the first code of the last character.
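The four phrase rules can be sketched as one function that takes the per-character codes in order. This is an illustration only: `phrase_code` is our name, and the code lists in the example reuse character codes quoted in the text (GCZ, KWDH, PHSD, HY, CMB) purely as sample data, without implying that those characters form real phrases.

```python
from typing import List


def phrase_code(char_codes: List[str]) -> str:
    """Combine per-character codes into a phrase code of up to four letters."""
    n = len(char_codes)
    if n < 2:
        raise ValueError("a phrase needs at least two characters")
    if n == 2:
        # first two codes of each character
        return char_codes[0][:2] + char_codes[1][:2]
    if n == 3:
        # first code of characters 1 and 2, first two codes of character 3
        return char_codes[0][0] + char_codes[1][0] + char_codes[2][:2]
    # four or more: first codes of the first three characters and of the last
    return "".join(code[0] for code in char_codes[:3]) + char_codes[-1][0]


print(phrase_code(["GCZ", "KWDH"]))
print(phrase_code(["GCZ", "KWDH", "PHSD"]))
print(phrase_code(["GCZ", "KWDH", "PHSD", "HY", "CMB"]))
```

The four-character rule and the longer-phrase rule coincide for exactly four characters, which is why a single branch covers both.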
The dictionary code of the present invention overcomes the shortcomings of existing Chinese character input methods. It offers high input speed, is easy to learn, has a low rate of duplicate codes, and its coding rules are identical to the rules by which primary and secondary school students learn characters. Used in schools, it lets students reinforce their memory and grasp of character structure while they learn the input method. The coding method can also be used to arrange the characters in a dictionary, making lookup easier and combining the study of characters, of keyboard input, and of dictionary lookup, so that these methods become more systematic and easier for students to master, and students' learning burden is reduced.
Embodiment 1: Written in standard stroke order, the character "thing" (事) splits into four crossable components, "one" (一), "mouth" (口), [glyph not reproduced] and "Shu" (丨), and is coded HKXS.
Embodiment 2: The correct split of "ten thousand" (万) is "one" (一) plus [glyph not reproduced], coded HB, rather than "factory" (厂) plus [glyph not reproduced], or "one" plus [glyph not reproduced] plus "Pie".
Embodiment 3: The character "beautiful" (美) should be split into "Ha, king (王), big (大)", coded DWD, rather than into "soil (土), big".
Embodiment 5: The character "OK" (行) should be split into "Chi (彳), one (一), fourth (丁)", coded RYD, rather than "Chi, two, Shu".
Embodiment 6: The character "state" (国) should be split into "冂, king (王), Dian, one (一)", coded KWDH, rather than "mouth (口), king, Dian".
Embodiment 7: The character "justice" (义) should be split into "Dian, ㄨ", coded DY, rather than "ㄨ, Dian".
Embodiment 9: The character "die" (亡) should be split into "Tou" plus [glyph not reproduced], coded WZ, rather than "Dian" plus [glyph not reproduced].
Embodiment 10: To compile a dictionary with the dictionary code of the present invention as its index, simply arrange the character index or the main body of a traditional dictionary in ascending (A-Z) order of dictionary code. For example, the code of "I" (我) is PHSD, so the character is found at PHSD.
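Embodiment 10 amounts to sorting entries by code and searching the sorted list. A minimal sketch follows, assuming retrieval-mode codes (no pinyin padding) and a hypothetical five-entry dictionary whose codes are taken from examples in the text; `build_index` and `lookup` are our names.

```python
import bisect
from typing import List, Tuple

Entry = Tuple[str, str]  # (dictionary code, character)


def build_index(entries: List[Entry]) -> List[Entry]:
    """Arrange entries in ascending (A-Z) order of dictionary code."""
    return sorted(entries)


def lookup(index: List[Entry], code: str) -> List[str]:
    """Binary-search the sorted index for every character filed under code."""
    codes = [entry_code for entry_code, _ in index]
    lo = bisect.bisect_left(codes, code)
    hi = bisect.bisect_right(codes, code)
    return [char for _, char in index[lo:hi]]


# Hypothetical entries using retrieval-mode codes quoted in the text.
index = build_index(
    [("PHSD", "我"), ("H", "一"), ("GC", "张"), ("CMB", "顺"), ("KWDH", "国")]
)
print([code for code, _ in index])
print(lookup(index, "PHSD"))
```

A printed dictionary works the same way by eye: the reader scans alphabetically to the code and finds the character there in a single lookup.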
Chinese character component coding table (1)
Note 1: The fold stroke covers every folded stroke except the lifting hook (亅).
Note 2: In this table, a component that is itself a character is coded with the first letter of that character's pinyin. A component that is not itself a character is coded with the pinyin initial of the character in its name that reflects its main feature. For example, 亻 is named "single-person side"; its feature character is "person" (人), whose pinyin initial is r, so the code of 亻 is r.
Note 3: Components in parentheses are similar variant forms and share the same code.
Claims (2)
1. A dictionary code Chinese character input method, the steps of which are:
A. following the standard stroke order for writing the character, press in turn the letter keys for the codes of its 1st, 2nd, 3rd and last components,
B. for a character with fewer than four component codes, then press the key for the first letter of its pinyin, or the V key; if the code is still shorter than four, press the space bar afterwards,
C. when duplicate codes occur, press the number key.
2. The dictionary code Chinese character input method according to claim 1, wherein the steps for entering a phrase are:
Two-character phrase: press the letter key for the first code of the first character,
press the letter key for the second code of the first character,
press the letter key for the first code of the second character,
press the letter key for the second code of the second character,
Three-character phrase: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the second code of the third character,
Four-character phrase: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the first code of the fourth character,
Phrase of more than four characters: press the letter key for the first code of the first character,
press the letter key for the first code of the second character,
press the letter key for the first code of the third character,
press the letter key for the first code of the last character.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB001122703A CN1142474C (en) | 2000-05-11 | 2000-05-11 | Dictionary code Chinese character input method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB001122703A CN1142474C (en) | 2000-05-11 | 2000-05-11 | Dictionary code Chinese character input method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1268690A | 2000-10-04 |
CN1142474C | 2004-03-17 |
Family
ID=4582142
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB001122703A Expired - Fee Related CN1142474C (en) | 2000-05-11 | 2000-05-11 | Dictionary code Chinese character input method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1142474C (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106349745A (en) * | 2016-08-25 | 2017-01-25 | 浙江劲光实业股份有限公司 | Activated azo dye and preparation method and application thereof |
Also Published As
Publication number | Publication date |
---|---|
CN1142474C (en) | 2004-03-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1142474C (en) | Dictionary code Chinese character input method | |
CN1136496C (en) | Simplified spelling-touching screen mouse chinese character input method | |
CN1035083C (en) | Word-oriented Chinese character typing device | |
CN1054219C (en) | Substitution type Chinese phonetic character, word input coding method and keyboard thereof | |
CN1025540C (en) | Double-combination encoding method by use of initial consonants and vowels of Chinese syllables | |
CN1147780C (en) | Three-stroke digital code Chinese character input method and keyboard | |
CN1275127C (en) | Chinese characters input method according to stroke sequence and keyboard thereof | |
CN1074146C (en) | Scheme for inputting Chinese characters | |
CN1178121C (en) | Double Chinese character stroke order-radical input system | |
CN1673935A (en) | Jiaguwen (inscriptions on bones or tortoise shells of the Shang Dynasty) computer inputting method | |
CN1328282A (en) | Chinese-character 'Natural code', input method | |
CN1458566A (en) | Chinese character plain code input method | |
CN1049418A (en) | Chinese character keyboard input method for unified code computer | |
CN1245678C (en) | Chinese character input method using phoneticizing and complement code | |
CN86103506A (en) | " a key diadic " keyboard and China and foreign countries' characters rapid input method | |
CN1160243A (en) | Character shape stroke order code Chinese character entering system and keyboard thereof | |
CN1046402A (en) | Shape note Chinese character, symbolic coding and keyboard thereof | |
CN1749026A (en) | Check-up method for digital dictionary | |
CN1794150A (en) | Phonetic Chinese charater input method | |
CN1567155A (en) | Input method of characters and words in common use based on soft keyboard | |
CN1081523A (en) | Dual spelling Chinese words coding method and keyboard thereof | |
CN1485715A (en) | Method of encoding for Chinese characters using sound and font information | |
CN1173661A (en) | Computer input method of Yuanma codes Chinese characters | |
CN1210299A (en) | Phonetical unit codes for Chinese characters | |
CN1567158A (en) | Phonetic-stroke-ordering Chinese characters input keyboard and input method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |