CN1040103A - Irrational number digital coding and keyboard thereof - Google Patents

Irrational number digital coding and keyboard thereof Download PDF

Info

Publication number
CN1040103A
CN1040103A CN 89107074 CN89107074A CN1040103A CN 1040103 A CN1040103 A CN 1040103A CN 89107074 CN89107074 CN 89107074 CN 89107074 A CN89107074 A CN 89107074A CN 1040103 A CN1040103 A CN 1040103A
Authority
CN
China
Prior art keywords
phrase
chinese character
word
code
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 89107074
Other languages
Chinese (zh)
Inventor
肖水清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HUADI TECHNICAL INFORMATION SERVICES GUILIN CITY
Original Assignee
HUADI TECHNICAL INFORMATION SERVICES GUILIN CITY
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HUADI TECHNICAL INFORMATION SERVICES GUILIN CITY filed Critical HUADI TECHNICAL INFORMATION SERVICES GUILIN CITY
Priority to CN 89107074 priority Critical patent/CN1040103A/en
Publication of CN1040103A publication Critical patent/CN1040103A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

A kind of Chinese character input method and keyboard Designing are utilized QWERTY keyboard, can import 6763 GB I and II Chinese characters and 75600 phrases.By word commonly used more, stroke more less, the easy more principle of keystroke is to encode Chinese characters for computer.Humorous sound and digital code by phrase is encoded to phrase.Average every word keystroke 2.36 keys of Chinese character, average every word keystroke 1 key of phrase, Chinese character and phrase all do not have repeated code.Available area bit code input whole Chinese characters of GB2312-80 and graphical symbol are realized on IBM-PC/XT and compatible computer under this method state, at internal memory 512k, have only on the low end computer of a floppy disk and all can move this method.

Description

Irrational number digital coding and keyboard thereof
The present invention relates to a kind of robot calculator Chinese character input method and keyboard Designing.
The robot calculator Chinese character input method has kind more than 400 at present, kind more than 100 is arranged through Patent Office of the People's Republic of China is disclosed, and these Chinese character input methods can be divided into four big classes:
One, sound sign indicating number: promptly encode according to the pronunciation of Chinese character.As the Chinese phonetic alphabet, initial and final double-spelling sign indicating number etc.The shortcoming of sound sign indicating number is: repetition rate of coding height, code length are long, more than average every word 4 keys, do not observe word commonly used more, stroke more less, the easy more principle of keystroke.
Two, font code: promptly encode, as " optimizing the Five-stroke Method compiling method and keyboard thereof " (seeing CN85100837A) according to the font of Chinese character.The shortcoming of font code is: the rule that has a pack to divide Chinese character to encode, find it difficult to learn, and repeated code is arranged, code length is longer, and mean code length 2.6 keys of the Five-stroke Method are not exclusively observed word commonly used more, and stroke is few more, the easy more principle of keystroke.
Three, pronunciation-form combination code: promptly encode according to the pronunciation and the font of Chinese character simultaneously, as " natural code " (Chinese software design specialist Zhou Zhinong.In June, 1989), the shortcoming of phonetic-stroke code is: should learn the coding rule of sound sign indicating number, learn the coding rule of font code again, find it difficult to learn, have repeated code, code length longer, the average keystroke of every word is more than 3 times.Do not observe word commonly used more, stroke is few more, the easy more principle of keystroke.
Four, number: as region-position code, telegraph code.Its shortcoming is: code length is longer, and every word 4 keys are grasped with general learning method is difficult, do not observe word commonly used more, and stroke is few more, the easy more principle of keystroke.
Zhou Zhinong expert is in global function second generation Chinese character input software " natural code " handbook of publishing in June, 1989, and say when speaking of the development of character coding input technology: pure Irrational rank method did not occur in China.The method is by the Chinese character frequency of occurrences, with the most frequently used Chinese character layout on best keyboard position, and the coding also the shortest.Though this method memory difficulty is very big, if through for a long time on top of after, its input speed will be the fastest in theory.
Purpose of the present invention is to provide a kind of Irrational rank Hanzi coding scheme.The characteristics of this programme: no repeated code, code length is the shortest, observes word commonly used more fully, and stroke is few more, and the easy more principle of keystroke has abundant phrase, learns easily, uses easily.
The present invention includes the encoding scheme of Chinese character, the encoding scheme of phrase, the design three part particular contents of keyboard.The present invention with arabic numeral 0~9, opening bracket arabic numeral (0)~(15), close the bracket arabic numeral
Figure 891070745_IMG2
~9. Chinese character and phrase are encoded.
One, the coding of Chinese character: according to tens of staff of national Chinese character information processing system engineering from year July in September, 1974 to 1985 to 86 books, 104 periodicals and 7075 pieces of papers, the statistics that adds up to 21657039 word language materials, Chinese character according to the ordering successively from big to small of its frequency of utilization, is encoded top 256 the most frequently used words (frequency of utilization reaches 0.612) with two opening bracket arabic numeral (0) (0)~(15) (15).As (0) (0), (0) (1), (0) (2) is, (0) (3) exist, and (4) (0).With the frequency of utilization sequence number is that 1000 everyday characters (frequency of utilization is 0.328) of 257~1256 are closed the bracket arabic numeral with three
Figure 891070745_IMG3
Figure 891070745_IMG4
Figure 891070745_IMG5
~9. encode.As
Figure 891070745_IMG7
Figure 891070745_IMG8
Two,
Figure 891070745_IMG10
1. change,
Figure 891070745_IMG11
It is 2. other,
Figure 891070745_IMG13
Figure 891070745_IMG14
3. make,
Figure 891070745_IMG15
4. cut.More than the frequency of utilization of 1256 everyday characters reach 0.940.The present invention is called the basic Chinese characters coding.The coding of following Chinese character is called the expansion encode Chinese characters for computer.
Two key words, last position is
Figure 891070745_IMG17
~9., back one is (0)~(15), totally 160 Chinese characters.
Two key words, last position is (0)~(15), back one is
Figure 891070745_IMG18
~9., totally 160 Chinese characters.
Two key words, last position is 0~9, back one is
Figure 891070745_IMG19
~9., totally 100 Chinese characters.
Totally 420 of the two key words that expand can be user-defined professional everyday characters.Also can be the common characters of frequency sequence number 1257~1676, frequency of utilization be 0.029, as (0) spore,
Figure 891070745_IMG21
It is (1) vigorous, (2) alliance, (3) buy, (4) poplar.
Totally 1676 of primary word and expansion two key words, frequency of utilization is 0.969.
Triple bond word, front two are 0~9, and back one is
Figure 891070745_IMG25
~9., totally 1000.
The triple bond word, the first two position is 0~9, back one is (0)~(15), totally 1600.
The triple bond word, last position is 0~9, back two is (0)~(15), totally 2560.
Totally 5160 of the triple bond words that expands are for the frequency of utilization sequence number is 1677 later non-common words.
Totally 676 of two key words of basic Chinese characters and expansion Chinese character, 6160 of triple bond words add up to 6836, can encode to whole 6763 Chinese characters of GB2312-80.
Two, the coding of phrase.
The present invention designs the homophonic map table of Chinese character-numeral, sees accompanying drawing one.All Chinese characters are grouped into 0~90 arabic numeral respectively, phrase is encoded.Four of phrase code lengths.If 3300 of basic phrases expand 46300 of phrases.
(1) coding of basic phrase.Be divided into two words, three words, four words, the above speech of five words.
1, two words: first is identification code , (0) or (10), second and third is that the 00~99, four of the humorous sound and digital code of each Chinese character of two-character word is
Figure 891070745_IMG27
, (0) or (10), totally 300.
2, three words: first is identification code
Figure 891070745_IMG28
, second to four is the homophonic yardage 000~999 of three words, totally 1000.
3, four words: first is identification code (0), and second to four is triliteral humorous sound and digital code 000~999 before four words, totally 1000.
4, the above speech of five words: first is identification code (10), and second to four is triliteral humorous sound and digital code 000~999 before the above speech of five words, totally 1000.
Basic phrase mainly is the function word of using always, inferior notional word for using always.
(2), the coding that expands phrase is similar with the coding of basic phrase, is divided into two words, three words, four words, the above speech of five words equally.
1, two words:
(1), noun: last position be identification code 1.~9., two, three of centres are that the 00~99, four of the humorous sound and digital code of two-character word is identification code
Figure 891070745_IMG29
-9..Totally 9900.
(2), verb: last position is identification code (1)~(9), and middle two, three is that the 00~99, four of the humorous sound and digital code of two-character word is identification code (0)~(9), totally 9900.
(3), the speech except that noun, verb: last position is that the two or three of identification code (11)~(15) is the humorous sound and digital code 00~99 of two-character word, and the 4th is identification code (10)~(15).Totally 3500.
2, three words: last position be identification code 1.-9., back three is totally 9000 of the humorous sound and digital codes 000~999 of three words.
3, four words: last position is identification code (1)-(9), and back three is triliteral humorous sound and digital code 000-999 before four words, totally 9000.
4, the above speech of five words, last is identification code (11)~(15), back three is triliteral humorous sound and digital code 000~999 before the above speech of five words, totally 5000.
Expanding phrase can set according to different specialties.The coding method of identification code is: phrase commonly used comes the front, and identification code numerical value is little.For two words, the primary identification code of first conversion, the 4th identification code got
Figure 891070745_IMG30
, (0), (10).
The above-mentioned encoding scheme that the present invention proposes, Chinese character and phrase all do not have repeated code, and totally 49600 of basic phrase and expansion phrases can satisfy the general requirement of each specialty to the phrase amount fully.If feel that still the phrase amount is not enough, can increase 26000 of phrases newly, the code length of newly-increased phrase still is four, first to the 3rd is that the 000~999, four of the humorous sound and digital code of phrase is identification code ~9., (0)~(15).
This programme has been implemented on IBM-PC/XT and the compatible computer thereof and has moved, under this programme input state, and need not any conversion available area bit code input whole 6763 Chinese characters of GB2312-80 and 692 graphical symbols.
Three, keyboard Designing
The present invention is designed to 26 English letter keyboards
Figure 891070745_IMG32
~9., (0)~(15), 26 parenthesized arabic numeral.Because the cryptoprinciple of Chinese character of the present invention is: word commonly used more, the numerical value of its coding is more little, is 0.038 as the frequency of utilization of (0) (0) " ", and the frequency of utilization of (9) (9) " good " is: 0.002.The phrase identification code also is that phrase numerical value commonly used more is more little, so the design of keyboard will be arranged in fractional value the position of easy keystroke, i.e. the middle part of keyboard.Consider that common people's right hand is more flexible than left hand, so keyboard Designing of the present invention is as follows: open with 26 English letter keyboard Y, H, B branch, up 10 keys therefrom to the right side are
Figure 891070745_IMG33
, 2., 4., 6., 8. therefrom arrive a left side for 1., 3., 5., 7., 9..Middle 9 keys of row therefrom arrive right be (0), (2), (4), (6), therefrom arrive a left side and are (1), (3), (5), (7), (9).Descending 7 keys therefrom arrive a left side and are (10), (8), therefrom arrive a left side and are (11), (12), (13), (14), (15).See accompanying drawing 2.
Encoding scheme of the present invention, the rule in above-mentioned keyboard is clearly, for example,
1. basic two key words are two key combinations of middle line unit and following line unit (0)~(15).
2. basic triple bond word is to go up line unit
Figure 891070745_IMG34
~9. three key combinations.
3. the double word noun is two quadruple linkage combinations of going up line unit and two-digit key.
4. the double word verb is the quadruple linkage combination of two middle line units and following line unit (8) and two-digit key.
5. two following line units of other two-character word (except (8)) make up with the quadruple linkage of two-digit key.
6. three words are quadruple linkage combinations of going up line unit and three numerical keys.
7. four words are quadruple linkage combinations of a middle line unit and following line unit (8) and three numerical keys.
8. the above speech of five words is the quadruple linkage combination of a following line unit (except (8)) and three numerical keys.
This programme realizes on the IBM-PC machine that the inventor adopts the imago association method to come memory Chinese character coding, and memory rate is very fast, can remember the coding of more than 100 Chinese character at every turn, and memory and profound, is difficult for forgetting.
The basic skills of imago association is:
1, the memory of encode Chinese characters for computer: the number of Chinese character is converted into one two words or three words according to the partials table, then this word and Chinese character is constituted the imago association, and this association is drawn as picture, thereby the coding of Chinese character is remembered.Example:
-(0) (0) necktie: my necktie ((0) (0) is converted to necktie)
Be-(0) (2) ape man: this is an ape man
2, the memory of phrase coding: according to homophonic map table identification code is converted into word, then should partials speech and the association of phrase formation imago.Example:
2. 450-(two) waiter: two waiters.
3. 337-(three) taxi: three taxis.
Description of drawings:
Fig. 1, the homophonic map table of Chinese-character digital.
Fig. 2, keyboard layout. each word comprises the phonetically similar word and the four tones of standard Chinese pronunciation among Fig. 1.As the goblin, shake, ladle out, want;
The Chinese character that " si " expression You this sound Zu becomes is such as " silk " " temple " etc.; " s-" expression this sound of You is Yu the Zi that other sound is combined into, such as " sweeping " " gloomy ";
Flat tongue consonant " rope " comprises that cerebral " says ".
You point of the present invention
1, the present invention is that word is taken as the leading factor take Zi as the basis, high frequency Zi, and the encoding scheme of word You elder generation, and character word stock is all open, revises character word stock for the Yong family of different majors. It is present state-of-the-art Hanzi coding scheme. Zheng said such as Zhou Zhinong Zhuan family, Zhe class encoding scheme is the Chinese-character input scheme of the second generation.
2, Zi of the present invention, word all do not have the Chong code, can touch system. Zhe is except the numbers such as region-position code, telegraph code, and other encoding scheme institute is irrealizable.
3, the average every Zi keystroke of individual character of the present invention is 2.36 times, and the average every Zi keystroke of phrase 1 time is the shortest Hanzi coding scheme of code length.
4, everyday character of the present invention is arranged in the section of keyboard, and keystroke is easy, and Zun keeps Yue Zi commonly used fully, and the number of times Yue of keystroke is few, the easy Yuan Ze of keystroke Yue.
5, this coding does not have the such complicated Chinese character Zhe of a cover of pictographic code to divide Yuan Ze, need not remember Zi root, radical etc. Learn easily, easily Yong.
6, under Zai this programme input state, the whole Chinese characters of available area bit code input GB2312-80. To a large amount of non-common Zi as expanding totally 5087 of triple bond Zi, frequency of utilization only 0.031. Can encode, the Zhe sample can be saved the calculator memory space, thereby Yun is capable on can the Zai internal memory littler low end computer of this programme, promotes easily, but Zhi needs the floppy disk of a 360K to deposit this cover coded system with regard to Zhu, but calculator memory Zhi wants just capable this coded system of Yun of You 512K.
7, Yun imago association method can be remembered rapidly the encode character for computer of this programme and phrase coding.

Claims (1)

1, a kind of Chinese character input method and keyboard Designing, can utilize 10 alpha-numeric keys, 26 English alphabet keys to import the Chinese character input method of Chinese character and phrase, feature of the present invention is: according to the frequency of utilization of Chinese character, by word stroke commonly used more more less, keystroke is easy more, with two keys or triple bond Chinese character is encoded, humorous sound and digital code according to phrase is encoded to phrase, under this method input state, available area bit code input whole Chinese characters of GB2312-80 and graphical symbol are designed to 26 arabic numeral keyboards with 26 English letter keyboards.
CN 89107074 1989-09-07 1989-09-07 Irrational number digital coding and keyboard thereof Pending CN1040103A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 89107074 CN1040103A (en) 1989-09-07 1989-09-07 Irrational number digital coding and keyboard thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 89107074 CN1040103A (en) 1989-09-07 1989-09-07 Irrational number digital coding and keyboard thereof

Publications (1)

Publication Number Publication Date
CN1040103A true CN1040103A (en) 1990-02-28

Family

ID=4857029

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 89107074 Pending CN1040103A (en) 1989-09-07 1989-09-07 Irrational number digital coding and keyboard thereof

Country Status (1)

Country Link
CN (1) CN1040103A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1091530C (en) * 1994-09-02 2002-09-25 舒从如 Second-generation regional code of Chinese characters for keyboard inputting
CN100403238C (en) * 2001-08-28 2008-07-16 徐惠才 English numeral codes

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1091530C (en) * 1994-09-02 2002-09-25 舒从如 Second-generation regional code of Chinese characters for keyboard inputting
CN100403238C (en) * 2001-08-28 2008-07-16 徐惠才 English numeral codes

Similar Documents

Publication Publication Date Title
CN1376965A (en) Small keyboard layout for inputting letters
CN85100837A (en) Optimize the Five-stroke Method compiling method and keyboard thereof
CN1129837C (en) Mounting device for universal Chinese phonetic alphabet keyboard
CN1097766C (en) Chinese-character 5-key input method
CN1040103A (en) Irrational number digital coding and keyboard thereof
CN1119739C (en) Chinese-character 5-stroke digital input method with keyboard of computer and its keyboard
CN1033476C (en) Multiple-language digital coding method and its keyboard
CN1081353C (en) Latinized phonetic codes for modern Chinese works
CN1091895C (en) Computer Chinese input scheme based on the Chinese phonetic alphabet
CN1148637C (en) Precise alphabetic writing input method via common digit keyboard
CN87100555A (en) Double stroke-order Chinese character input scheme of computer and keyboard thereof
CN1257444C (en) Complete pronunciation Chinese input method for computer
CN1472626A (en) Intelligent embedded character inputting method and device
CN1059746C (en) Computer phonetic Chinese characters input method
CN1191702C (en) Chinese Character input method of simplified keyboard
CN1147780C (en) Three-stroke digital code Chinese character input method and keyboard
CN1116336A (en) Substitution type Chinese phonetic character, word input coding method and keyboard thereof
CN101034319A (en) Chinese character input method and special-purpose keyboard thereof
CN1030867C (en) Phoneme simple code input method
CN1388430A (en) Modern Chinese pronunciation input method
CN1208711C (en) Digital English inputting method and keyboard thereof
CN1248014A (en) Computer Chinese input method of component first and last code and its keyboard
CN1341884A (en) Chinese language input method
CN1652069A (en) Phonetic digital code input method
CN1110811A (en) Dictionery indexing coding input method and its Chinese and Occidental language key-board

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C01 Deemed withdrawal of patent application (patent law 1993)
WD01 Invention patent application deemed withdrawn after publication