CN1162767C - Square round classify pictographic code - Google Patents

Square round classify pictographic code Download PDF

Info

Publication number
CN1162767C
CN1162767C CNB021317135A CN02131713A CN1162767C CN 1162767 C CN1162767 C CN 1162767C CN B021317135 A CNB021317135 A CN B021317135A CN 02131713 A CN02131713 A CN 02131713A CN 1162767 C CN1162767 C CN 1162767C
Authority
CN
China
Prior art keywords
code
word
radical
exception
chinese character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB021317135A
Other languages
Chinese (zh)
Other versions
CN1414457A (en
Inventor
汪世龙
万利云
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CNB021317135A priority Critical patent/CN1162767C/en
Publication of CN1414457A publication Critical patent/CN1414457A/en
Application granted granted Critical
Publication of CN1162767C publication Critical patent/CN1162767C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The present invention discloses a square round classification pictographic code which comprises: etymons in Chinese character structures, which frequently appear, and basic strokes are used as the code elements of Chinese character codes, and the code elements are skillfully classified into a plurality of classes according to the form similarity and the meaning similarity; form similarity corresponding association and meaning similarity classification association are carried out to the forms of the uppercase letters of 21 English letters, the lowercase letters of 5 English letters and the Chinese characters of Arabic numerals from 2 to 10 and the classified etymons, and therefore, association memory is basically adopted by all keyboard etymons. But each of 9 number keys from 2 to 10 only uses the form thereof as a pictographic classification symbol, and when the 9 number keys are coded, English letter keys namely W to P under the number keys are stroked, so that when Chinese character codes are input, only 26 English letter keys are used as input keys. The present invention is a form code input technology in which etymons are disassembled according to the character form, and the code scheme has the advantages of small memory amount, less coincident codes, big vocabulary and unique coding method; both professional typists and unprofessional typists can achieve quick input.

Description

The square round classify pictographic code input method of Chinese character
One, invention technical field:
The invention belongs to the Technology of Chinese Information Processing field, specifically relate to a kind of computer Chinese input method.
Two, background technology:
Present Chinese computer input method, be called for short encode Chinese characters for computer, reached the hundreds of kind, but actual drop into use have only tens kinds, with regard at present still with regard to the coding of using, be divided three classes: the first kind is the phonetic sign indicating number: well-known, there is 100% repeated code in the phonetic sign indicating number, also need page turning, can't import because of cacoepy and unacquainted word, input speed is slower.Second class is a pronunciation-form combination code: pronunciation-form combination code is to be based upon on the basis of phonetic sign indicating number, reduced repeated code, progressive to some extent, but it also exists some shortcomings of phonetic sign indicating number, the shortcoming that also has the memory of font code radical distribution needs simultaneously, more than 30 key of the phonetic-stroke code that has is difficult to improve input speed, and the people of employing is also less.The 3rd class is a font code: font code is a class maximum in the current encoder scheme, numerous font codes all is that Chinese character root is placed on the different keyboards, what have has used more than 30 key, the character root of keyboard memory capacitance is quite big, with regard to several font codes commonly used, more than 200 radical all arranged basically, be distributed on the different keyboards, mostly be by memorize mechanicalling, and coding method, repeated code identification regulation is more, and the thinking degree is bigger, and the word difficulty is torn difficult the branch open, exist the diversity of divining by means of characters, memory capacitance is bigger, and it is 30 several perhaps to use keyboard to reach, if a period of time does not use, easily forget, so it only is fit to the professional typist that has a good memory.Have several font codes also to adopt the method corresponding with the English alphabet pictograph, but since many opposite sex of Chinese character root be difficult to corresponding one by one, so a lot of radical still by memory by force, and some pictographic code used more than 30 key, also is not easy to improve input speed.
Three, summary of the invention:
The object of the present invention is to provide minimum, the easy study of a kind of memory capacitance, the radical number is few, repeated code is few, coding rule is simple, it is directly perceived to divine by means of characters, the square round classify pictographic code input method of Chinese character of high input speed.
For achieving the above object, square round classify pictographic code input method of Chinese character of the present invention is to select the higher radical of the frequency of occurrences in the Hanzi structure and the basic stroke code element as encode Chinese characters for computer for use, and be divided into some classes according to the appearance similar purpose is close, utilize the English alphabet of computer keyboard to have radian, and Chinese character is square, at first the English alphabet shape is had an area of conversion, after the conversion with the capitalization of 24 English alphabets, 5 English alphabet small letters, the profile of 2-0 arabic numeral Chinese character is carried out the approximate corresponding classification association of sorting out and being close in meaning of profile with the radical that we are classified; 9 numerical keys in the keyboard of computing machine the top 2 to 0 only utilize profile to sort out symbol as pictograph, corresponding English alphabet keys W---P below utilizing it in when input coding has only used 26 English alphabet keys as enter key when encode Chinese characters for computer is imported; When encoding, adopt word lead-in to associate the prompting input synchronously;
The coding rule and the method for described square round classify pictographic code input method of Chinese character are:
(1), 26 high frequency words are only got one yard and are added space bar and finish; 26 key name characterized radicals are hit this key earlier and are hit A once and add space bar and finish; Characterized radical is hit this key earlier once without exception on all the other keys, and the stroke end to end of getting this word then adds space bar to be finished;
(2), except that 26 high frequency words and key name characterized radical, other word gets three end one according to its radical that comprises without exception, has only two or three radicals, adds space bar; According to sequential write, form successively according to radical during code fetch, big sign indicating number is preferential, and promptly a word can be torn two yards open and not get trigram, can tear trigram open and not get four yards, get according to character root of keyboard when yardage is identical big preferential, but in the outer earlier back of pen clip structure and full shaped as frame structure;
(3), two singles draw to intersect, the shaped as frame structure do not take apart without exception;
(4), structure of ganging up from top to bottom more than two, it is the F class, the horizontal above and perpendicular structure of ganging up of two perpendicular pens of two perpendicular pens of ganging up, it is the H class, get such condition code F, H during code fetch without exception earlier, get the folding pen and the shaped as frame feature code of being gone here and there in the structure again, other no knuckle stroke omits without exception; So F, H class formation add the condition code coding by the class sign indicating number without exception;
(5), the shaped as frame structure lower end that can not be split and other stroke radical intersect, and just do not gang up frame, regards as overlapping without exception;
(6), two code words, just the high frequency word adds A and becomes the two-character word of being formed headed by two code words, lead-in is got two yard second word and is got sign indicating number end to end; Form the above speech of three words headed by two code words, lead-in is got the first sign indicating number of getting last two words of this speech after two yards again; Lead-in is only got a trigram when trigram or the Chinese word coding formed headed by the word more than four yards, and the head sign indicating number of getting this speech the last character again gets final product;
(7), when the punctuation mark that has on the keyboard, arabic numeral input, the keystroke method is with English state;
The key name of described keyboard and the corresponding relation of Chinese character root are as follows:
Figure C0213171300061
The specific design principle of square round classify pictographic code input method of Chinese character of the present invention is:
1, radical selects for use the higher parts of the frequency of occurrences in basic stroke, radical and the Hanzi structure with Chinese character as code symbols, and, purpose similar according to the radical shape is close is divided into some classes, so that associative memory, have only several radicals to belong to memory by force, memory capacitance is minimum, memory easily, the radical number is few.
2, have radian substantially according to the computer keyboard English letter, and Chinese character root seldom has radian, many squarelys, so at pictograph to having an area of conversion at once, promptly circular letter is regarded as square, wholecircle shape see help square, semicircle is regarded en shape etc. as, utilize like this Chinese literary style that keyboard English capitalization (except that D, G, N, Y, Z) adds English lower case e, i, k, n, t and arabic numeral 2-10 promptly two, three, four, five, six, seven, eight, nine, ten shape and our sorted radical to carry out pictograph corresponding; It is H, D, N that the phonetic association is adopted in horizontal stroke, point, three basic strokes of right-falling stroke; The implication of Z, G two key utilizations English zoo, go is defined as animal class radical and relevant radicals by which characters are arranged in traditional Chinese dictionaries or the radical of travelling outdoors, to reach associative memory, like this coding rule simple, divine by means of characters intuitively, high input speed.
Though 3, this encoding scheme has been utilized 210 the profile glyph as code symbols or radical, does not use the 2-0 key when encode Chinese characters for computer is imported, but hits its pairing English key in below, i.e. two-W, three-E, four-R, five-T, six-y, seven-U, eight-I, nine-O, ten-p only uses 26 keys to get final product like this when encode Chinese characters for computer is imported, and accelerates input speed greatly.
4, the coding method uniqueness of this encoding scheme speech, convenient and practical in order to accomplish, no matter several words, when coding, adopt association's prompting input of this first word of speech, the i.e. two-character word of forming headed by two code words, sign indicating number end to end got in second word after lead-in was got two yards, and the above speech of three words, lead-in is got the first sign indicating number of getting last two words after two yards again; The head sign indicating number that trigram is got this speech the last character then only got in the prefix word of being formed headed by the word more than the trigram.
The coding rule and the method for square round classify pictographic code input method of Chinese character are:
1,26 high frequency words are only got one yard and are added space bar and finish; 26 key name characterized radicals are hit this key earlier and are hit A once and add space bar and finish; Characterized radical is hit this key earlier once without exception on all the other keys, and the stroke end to end of getting this word then adds space bar to be finished.
2, this encoding scheme is except that 26 high frequency words and key name characterized radical, and other word is got three end one according to its radical that comprises without exception, has only getting entirely of two or three radicals to add space bar.According to sequential write, form successively according to radical during code fetch, big sign indicating number is preferential, and promptly a word can be torn two yards open and not get trigram, can tear trigram open and not get four yards, get according to character root of keyboard when yardage is identical big preferential, but in the outer earlier back of pen clip structure and full shaped as frame structure.
3, two singles draw to intersect, the shaped as frame structure do not take apart without exception.
4, structure (F class) of ganging up from top to bottom more than two, the horizontal above and perpendicular structure (H class) of ganging up of two perpendicular pens of two perpendicular pens of ganging up, get such condition code F, H during code fetch without exception earlier, get the folding pen and the shaped as frame feature code of being gone here and there in the structure again, other no knuckle stroke omits without exception.So F, H class formation add the condition code coding by the class sign indicating number without exception.
5, the shaped as frame structure lower end that can not be split and other stroke radical intersect (not ganging up frame) to be regarded as overlapping without exception.
6, the two-character word formed headed by (the high frequency word adds A and becomes two code words) of two code words, lead-in are got two yard second word and are got sign indicating number end to end; Form the above speech of three words headed by two code words, lead-in is got the first sign indicating number of getting last two words of this speech after two yards again; Lead-in is only got a trigram when trigram or the Chinese word coding formed headed by the word more than four yards, and the head sign indicating number of getting this speech the last character again gets final product.
7, during the input of the punctuation mark that has on the keyboard, arabic numeral, the keystroke method is with English state.
The square round classify pictographic code input method of Chinese character has following characteristics:
1, this encoding scheme radical select for use comparison rationally, the simple, intuitive of divining by means of characters and meet relevant national standard substantially.
2, this encoding scheme repeated code about 5%-6% when being untreated about 0.5%, is less through repeated code after the simple process in the scheme that memory is not handled by force.
3, this encoding scheme has also increased the Chinese character that uses on name collected in over thousands of the Xinhua dictionary, place name, the animals and plants name except that having collected the collected whole Chinese characters of GB80 GB character library.
4, this encoding scheme radical sorting technique uniqueness, have good association understand memory function and use a period of time after the prolonged characteristics of not forgetting.Memory capacitance is little, is fit to amateur keyboarder and imports Chinese character fast.
5, the method for associating the prompting input with first word code of this speech is synchronously adopted in the input of this encoding scheme speech, has improved the frequency of utilization and the input speed of speech.Have a large vocabulary, except that having collected more than 40,000 vocabulary that " national language liberal normalization and standard a collection of selected materials " taken in, also selected nearly ten thousand of every profession and trade everyday expressions and phrases for use.
6, the keyboard arrangement of this encoding scheme makes the mentality of designing basically identical of the frequency of utilization and the QWERTY keyboard of English alphabet keys, i.e. the use of home key is higher than key position up and down, and forefinger, middle finger, the third finger, little finger of toe frequency of utilization when input file is successively decreased successively.
Four, description of drawings:
Accompanying drawing 1 is the corresponding relation figure of key name on the Chinese character root of square round classify pictographic code input method of Chinese character of the present invention and the computer keyboard;
Five, embodiment:
As shown in Figure 1, the square round classify pictographic code input method of Chinese character is just as foregoing design philosophy, utilize the shape of the Chinese literary style of the capitalization of 21 letters of computer keyboard, 5 alphabetical small letters, nine numerals of 2-10 to do the classification glyph of Chinese character root, amount to 35 glyphs, be respectively: Q, W, E, e, R, T, t, U, I, i, O, P, A, S, F, H, J, K, k, L, X, C, V, B, n, M, two, three, four, five, six, seven, eight, nine, ten; The phonetic association is adopted in three basic stroke points, horizontal stroke, right-falling strokes, and D is that point, H press down for horizontal, N; The Z key is association with the English implication of zoo zoo, is that animal name or the radicals by which characters are arranged in traditional Chinese dictionaries that are mainly used in animal name (deinsectization is outer) all are placed on the Z key with radical itself.The G key also is that the meaning of the English implication of utilizing go " where " is defined as the travelling key, and the related radical of status of action is done associative memory the vehicles of travelling outdoors and when going out.This adds the higher characterized radical of seven type frequencies as hypermnesia, promptly in Q key thousand, D key, L key wood, the C key power again, V key cun.This arrangement mode is mainly considered the minimizing repeated code, but two---10 font symbols do not hit numerical key when carrying out the Chinese character Chinese word coding, and hit corresponding W---the P key in below, and this sample encoding scheme has only been used 26 English key-positions when carrying out word coding method.
This programme Chinese character split and coding method specifically:
1, single Chinese character (remove the high frequency word, the key name characterized radical is outer) is generally got two to four yards, gets one yard at a trigram end more than four yards word, and the space bar that adds that less than is four yards finishes.As: " to " (CV), " sun " (PB), " crystalline substance " (BBB), micro-(AWHR).
2, high frequency word one key adds the space bar input.Be encoded to B as "Yes" and add space bar.The key name word is the higher characterized radical of the frequency of utilization on this key, is encoded to hit this key and add A once and finish with space bar.Other characterized radical is to hit the stroke end to end that this key adds this word once again to finish with space bar.As: " order " (QA), " ear " (QHH).Not characterized radical and radicals by which characters are arranged in traditional Chinese dictionaries hit this key without exception and add VV once and get final product, as: Ren (AVV).
3, sign indicating number is preferential greatly during individual character extracting code, can get two yards and not get trigram, can get trigram and not get four yards.Yardage is identical gets greatly preferentially.
As: " master " is encoded to DU, can not be taken as DT or DGH etc.
4, two intersect and state's word frame, a mouthful word frame are not taken apart.As: " person " is encoded to PXB, can not be taken as UJB." because of " be encoded to QK, can not be taken as NKH.
5, shaped as frame lower end and other stroke intersect regard as overlapping.
As: " lining " (BU), " really " (BL), " first " (BI).
6, get class sign indicating number F earlier during a perpendicular structured coding of ganging up (F class) more than two, as the structure of go here and there has and roll over and the shaped as frame feature, get again and roll over and the feature code of frame.
As: " Rolling " (F), " string " (FOO), " Wei " (FS), " thing " (FOE)
7, horizontal string is got class sign indicating number H earlier more than two and during two perpendicular string (H class) code fetches, and other method is identical with the F class.
As: " Lv " (H),
Figure C0213171300101
(H), " not " (HS)
8, the two-character word formed headed by (the high frequency word adds A and becomes two code words) of two code words, lead-in are got two yard second word and are got sign indicating number end to end; Form the above speech of three words headed by two code words, lead-in is got the first sign indicating number of getting last two words of this speech after two yards again.
As: " country " (QANM), " National Copyright Administration of the People's Republic of China " (QALP)
" differentiation " (CXIN), " regional cooperation " (CXIA)
9, lead-in is only got a trigram when trigram or the Chinese word coding formed headed by the word more than four yards, and the head sign indicating number of getting this speech the last character again gets final product.
As: " taste " (OOOO), " having good character and fine scholarship " (OOOA), " win triumph " (CONM)
10, this encoding scheme repeated code disposal route: two yards, behind a word that is of little use, add A during three code word repeated codes, a trigram only can be got in word commonly used during four code word repeated codes and make three, select input with numerical key during two word repeated codes that minority is of little use.
As: " member " (OR), " " (ORA), " happiness " (UOK), " praising " (UOKO)

Claims (1)

1, a kind of square round classify pictographic code input method of Chinese character, it is characterized in that: select higher radical of the frequency of occurrences in the Hanzi structure and basic stroke code element for use as encode Chinese characters for computer, and be divided into some classes according to the appearance similar purpose is close, utilize the English alphabet of computer keyboard to have radian, and Chinese character is square, at first the English alphabet shape is had an area of conversion, after the conversion with the capitalization of 24 English alphabets, 5 English alphabet small letters, the profile of 2-0 arabic numeral Chinese character is carried out the approximate corresponding classification association of sorting out and being close in meaning of profile with the radical that we are classified; 9 numerical keys in the keyboard of computing machine the top 2 to 0 only utilize profile to sort out symbol as pictograph, corresponding English alphabet keys W---P below utilizing it in when input coding has only used 26 English alphabet keys as enter key when encode Chinese characters for computer is imported; When encoding, adopt word lead-in to associate the prompting input synchronously;
The coding rule and the method for described square round classify pictographic code input method of Chinese character are:
(1), 26 high frequency words are only got one yard and are added space bar and finish; 26 key name characterized radicals are hit this key earlier and are hit A once and add space bar and finish; Characterized radical is hit this key earlier once without exception on all the other keys, and the stroke end to end of getting this word then adds space bar to be finished;
(2), except that 26 high frequency words and key name characterized radical, other word gets three end one according to its radical that comprises without exception, has only two or three radicals, adds space bar; According to sequential write, form successively according to radical during code fetch, big sign indicating number is preferential, and promptly a word can be torn two yards open and not get trigram, can tear trigram open and not get four yards, get according to character root of keyboard when yardage is identical big preferential, but in the outer earlier back of pen clip structure and full shaped as frame structure;
(3), two singles draw to intersect, the shaped as frame structure do not take apart without exception;
(4), structure of ganging up from top to bottom more than two, it is the F class, the horizontal above and perpendicular structure of ganging up of two perpendicular pens of two perpendicular pens of ganging up, it is the H class, get such condition code F, H during code fetch without exception earlier, get the folding pen and the shaped as frame feature code of being gone here and there in the structure again, other no knuckle stroke omits without exception, and F, H class formation add the condition code coding by the class sign indicating number without exception;
(5), the shaped as frame structure lower end that can not be split and other stroke radical intersect, and just do not gang up frame, regards as overlapping without exception;
(6), two code words, just the high frequency word adds A and becomes the two-character word of being formed headed by two code words, lead-in is got two yard second word and is got sign indicating number end to end; Form the above speech of three words headed by two code words, lead-in is got the first sign indicating number of getting last two words of this speech after two yards again; Lead-in is only got a trigram when trigram or the Chinese word coding formed headed by the word more than four yards, and the head sign indicating number of getting this speech the last character again gets final product;
(7), when the punctuation mark that has on the keyboard, arabic numeral input, the keystroke method is with English state; The key name of described keyboard and the corresponding relation of Chinese character root are as follows:
Figure C021317130003C1
CNB021317135A 2002-05-20 2002-09-03 Square round classify pictographic code Expired - Fee Related CN1162767C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB021317135A CN1162767C (en) 2002-05-20 2002-09-03 Square round classify pictographic code

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN02113024.8 2002-05-20
CN02113024 2002-05-20
CNB021317135A CN1162767C (en) 2002-05-20 2002-09-03 Square round classify pictographic code

Publications (2)

Publication Number Publication Date
CN1414457A CN1414457A (en) 2003-04-30
CN1162767C true CN1162767C (en) 2004-08-18

Family

ID=25741119

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021317135A Expired - Fee Related CN1162767C (en) 2002-05-20 2002-09-03 Square round classify pictographic code

Country Status (1)

Country Link
CN (1) CN1162767C (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102073386B (en) * 2010-11-09 2012-07-11 浙江理工大学 Chinese character computer input method for pictographic and ideographic classified radicals
CN103135788B (en) * 2013-03-18 2016-07-06 姜大林 A kind of Chinese character four-quadrant meaning shape input method
CN107300988A (en) * 2017-09-04 2017-10-27 张新伟 The pictograph letter method of Chinese character coding and its computer Chinese input with keyboard

Also Published As

Publication number Publication date
CN1414457A (en) 2003-04-30

Similar Documents

Publication Publication Date Title
CN1162767C (en) Square round classify pictographic code
CN1262473A (en) Chinese-caracter input method by phonetic letters with numeral key pad
CN1116335A (en) Chinese character screen-writing input system
WO2008089654A1 (en) Ordering retrieving method of chinese character type, device thereof and an information system
CN1196057C (en) One-code two-form quick Chinese digital coding input method
CN1274883A (en) Simplified spelling-touching screen mouse Chinese character input method
CN1349157A (en) Digital configuration code Chinese character input method
CN1072785A (en) Irrational rank-numeral synthetic coding method and keyboard thereof
CN1177271C (en) Four-stroke number code input method for characters and words and without duplication code and its keyboard
CN1043381C (en) Four-stroke digit look-up method for Chinese characters
CN1348127A (en) Precise alphabetic writing input method via common digit keyboard
CN1178344A (en) Four tone inputting method for Chinese characters
CN1138197C (en) 10-stroke shape-pronunciation input method
CN1378122A (en) Yi-code input method for Chinese characters
CN1458566A (en) Chinese character plain code input method
CN1744015A (en) Phonetic-code fast input method
CN1328282A (en) Chinese-character 'Natural code', input method
CN1514338A (en) Chinese character computer/cell phone input integrated code
CN2518148Y (en) On-line identification braille board
CN1365038A (en) Ten-stroke phonetic and configurational code input method
CN1068444C (en) Method of Chinese-character coding
CN1039512C (en) Single stroke input method and keyboard thereof
CN1042768C (en) Alphabetic Chinese character input keyboard and input method
CN1245678C (en) Chinese character input method using phoneticizing and complement code
CN1396511A (en) Microcomputer operating system with correspondance between key code and window position number and its Chinese-character'9-code' input method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040818