CN86107235A - Speech word binary coding input hanzi system and keyboard - Google Patents

Speech word binary coding input hanzi system and keyboard Download PDF

Info

Publication number
CN86107235A
CN86107235A CN86107235.9A CN86107235A CN86107235A CN 86107235 A CN86107235 A CN 86107235A CN 86107235 A CN86107235 A CN 86107235A CN 86107235 A CN86107235 A CN 86107235A
Authority
CN
China
Prior art keywords
code
character
word
input
initial consonant
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN86107235.9A
Other languages
Chinese (zh)
Other versions
CN1006251B (en
Inventor
栗兴民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HANDAN CITY BRANCH OF DEMOCRACY PROMOTION ASSOCIATION OF CHINA
Original Assignee
HANDAN CITY BRANCH OF DEMOCRACY PROMOTION ASSOCIATION OF CHINA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HANDAN CITY BRANCH OF DEMOCRACY PROMOTION ASSOCIATION OF CHINA filed Critical HANDAN CITY BRANCH OF DEMOCRACY PROMOTION ASSOCIATION OF CHINA
Priority to CN86107235.9A priority Critical patent/CN1006251B/en
Publication of CN86107235A publication Critical patent/CN86107235A/en
Publication of CN1006251B publication Critical patent/CN1006251B/en
Expired legal-status Critical Current

Links

Images

Abstract

Speech word binary coding input hanzi system and keyboard thereof belong to the computer Chinese information treatmenting technology field, and keyboard is a specialized equipment of implementing this technology.The key problem in technology of Chinese information processing is the Chinese character input, and encode Chinese characters for computer is Chinese character input " bottleneck ".The present invention adopts with the method for sound for shape and association, and basic character is defined on the keyboard, has reduced the rote memory to character.Adopt formula coding sanctified by usage, easy note eager to learn.A large amount of phrase codings that adopt are imported, and mean code length is 2.3 keys/word, and input speed can reach 150 words per minutes.Thereby become a kind of Chinese information processing technology scheme of desirable practicality.

Description

Speech word binary coding input hanzi system belongs to the computer Chinese information treatmenting technology field, and keyboard is a specialized equipment of implementing this technology.
The world today has entered the epoch that an informationization develops rapidly, and information engineering becomes one of three big pillars of modern science.According to China's national situation, in the information of numerous and complicated vastness, mainly be Chinese information.We will carry out Four Modernizations construction, realize office automation, print publishing modernization, library and information retrieval robotization, production and business management modernization, relate to Chinese information invariably.Thereby solving the Chinese information processing technology problem has become the task of top priority.
So-called Chinese information processing should comprise China various nationalities' language information processing.But, in various nationalities' language, most widely used is Chinese, so so-called Chinese information processing mainly is meant Chinese information processing, more precisely says it mainly is Chinese character information processing here.
Because Chinese character quantity is various, complex structure is so in Technology of Chinese Information Processing, key is the input technology problem of Chinese character.Chinese character is input to computing machine (or claiming computer) three kinds of modes are arranged.That is: speech recognition.Three kinds of input modes of Figure recognition and keyboard.According to present circumstances, speech recognition and Figure recognition input only are in the development test stage, also are far from being and apply, mainly by the keyboard mode input.Though keyboard has large, medium and small three kinds of models, because big keyboard and middle keyboard equipment are big, investment is many, so also be not easy to promote.So, solving the approach of Chinese character input at present, the main keypad that just leans on has been encoded this narrow passage of input.The synonym that " bottleneck " is decided to be approximately both at home and abroad " encode Chinese characters for computer " in recent years self-evidently, image and exactly understand critical role and the effect of encode Chinese characters for computer in Technology of Chinese Information Processing.
About encode Chinese characters for computer academic research, China's starting is slower, but development rapidly.Developed more than 400 scheme in recent years, the existing kind more than 50 of last machine operation.Influence bigger having: " the Five-stroke Method " scheme of the Wang Yongmin slip-stick artist of Henan Province computing center invention; " the configuration code method " of the Li Jinkai lecturer of Beijing Normal University invention; The Qian Weichang professor of Shanghai Polytechnic Univ presides over " HPX Chinese character spelling " scheme of development " macroscopical font " scheme and Beijing Li Huiqin slip-stick artist of Ministry of Water Resources and Power Industry scientific research institution invention.Wherein, the key technical indexes of " the Five-stroke Method " scheme: mean code length L=2.8 key/word; Input speed: Sj=130 words per minute.Its input speed is also faster than foreign language input, in the evaluation meeting expert consistent think reached international most advanced level.But, also also having weak point, that is exactly that this scheme is not easy to grasp, and crosses one section after grasping and bring back to life easily again.
The objective of the invention is: work out that masses are easy to that acceptance, easy note eager to learn, mean code length bond number are few, high input speed and Hanzi keyboard coding input scheme accurately.
The objective of the invention is to reach like this: two kinds of code elements of a kind of employing speech and word coding is proposed, the computer Chinese information processing hanzi system that constitutes by two kinds of input methods and for realize that this system designs with sound for shape, sound shape compatibility, be convenient to the binary input keyboard of association.
This system constitutes like this: use GB2312-80 region-position code and GB1988-80 international code to make machine internal information permutation code, speech word binary pronunciation-shape encode input method (be called for short " CZ-III) and two kinds of input methods of character ideophone coded input method (being called for short " character ") by exploitation constitute a complete Hanzi keyboard code input system that establishes one's own system; concrete structure is seen Fig. 1, and Fig. 2 is seen in its relevant procedures connection.
One of difficult point of encode Chinese characters for computer is to the character memory of (also claiming radical or parts).Numerous in the past schemes all adopt rigid definition, lack inherent contact, and its rote memory amount is very big.In order to alleviate the rote memory amount to character, keyboard of the present invention is to adopt following method design:
1. the basic character that independent names will be arranged is defined on the key position at its title first syllable initial consonant place according to initial consonant definition figure (being Fig. 3);
2. the basic character that will not have independent names to title of being convenient to association of its definition, is defined on the key position at association's title first syllable initial consonant place according to character title associative graph (being Fig. 4) earlier then;
3. in order to reduce some alphabetical quantity of information, with some basic character that traditional title arranged for example " rain " (rain prefix), " door " (door word frame) etc. redefine and be " rain " (mist prefix), " door " (asking the word frame), be defined in respectively then on " A " and " W " key;
With the non-word character beyond the basic character (as "
Figure 86107235_IMG1
) according to first stroke of a Chinese character form of a stroke or a combination of strokes code definition figure (being Fig. 5), be defined on its first stroke of a Chinese character form of a stroke or a combination of strokes code (" I ") key.
Thereby constituted with sound for shape, sound shape compatibility, be convenient to the binary input keyboard of association, i.e. Fig. 6.
About the structure type of Chinese character, be encode Chinese characters for computer difficult point two." the Five-stroke Method " scheme reduces four kinds of fonts to the structure topology figure of Chinese character.Hanzi structure is so complicated, and structure type is concluded fewly more, and the practical font of each class institute subsumption is just many more, should use just difficult more.The present invention reduces ten classes moderately, and each class subsumption again is several, counts 21 kinds of mould figure, and every kind of mould figure connects the position structural region and marked serial number (being Fig. 7).Two kinds of input methods of native system, for the coding of individual character, font code all is by the structure position code fetch, each position limit is got one yard, and regulation is first, from a left side, a last position (comprising two character individual characters) code fetch is from hanging down from the right side from height for inferior position code fetch.
Aspect the input formula, native system adopts the guiding input.So-called guiding input, hit preamble code exactly after, only show with the word or the speech of sign indicating number earlier at presenting bank, hit option code again, just finish input.Guiding for individual character is the technology that " phonetic ", " form of a stroke or a combination of strokes " input methods are used already, and the feature of native system is that words all is the guiding input.Preamble code realizes that with letter key option code realizes with numerical key.Wherein " O " usefulness " space " key is realized.The same sign indicating number speech that guides is arranged by frequency reducing, arranges by first stroke of a Chinese character form of a stroke or a combination of strokes digital code with the sign indicating number individual character.That is to say: option code is the end mark of single word code, represents form of a stroke or a combination of strokes information by first stroke of a Chinese character form of a stroke or a combination of strokes code definition figure (Fig. 7) again.
The maximum difficult point of encode Chinese characters for computer is coding principle, develop the coding principle that a kind of masses are easy to accept, and need go to draw from the formula of masses' description Chinese character by words sanctified by usage.We see gratifiedly: be printed on the compartment of " YZ " on the train, i.e. " hard seat " compartment; The compartment that is printed on " RZ " is " soft seat " compartment; The compartment that is printed on " RW " is " soft sleeper " compartment; The compartment that is printed on " XL " is " luggage " compartment.Be printed on " HB " on the passenger vehicle that Shijiazhuang, Hebei produces and promptly represent " Hebei ".Be printed on " GB " on the national standard book cover and promptly represent " GB "; " HBXW " expression " Hebei news " of Hebei TV station.Realizing the rule of a phrase coding from above-mentioned these examples, be called " phrase sound preface compiling method ", promptly is the initial consonant series arrangement of each syllable of phrase coding the very strong best phrase codings of the public readability that are easy to accept exactly.
For some long machine-operated titles, popular also have a simple rule sanctified by usage, and for example: State Council " electronics development office " abbreviates " electricity shakes and does " as." financial accounting " abbreviates " financial accounting " as." five advocatings, four points of beauty and three aspects of love " abbreviate " five-four-three " as; " Chinese secretary speciality " abbreviates " in secret specialty " as; " business administration specialty " abbreviates " enterprise management specialty " or the like as.We realize simply rule of a phrase again from these examples, are called " phrase sound preface is omitted compiling method ".After just the initial consonant of phrase part syllable being omitted, sequential encoding again.
Above-mentioned rule sanctified by usage becomes theoretical foundation of the present invention.The present invention is about the switching of speech and word input, and without function key, and directly control with the figure place of preamble code: a bit code or three bit codes are individual character, and two bit codes and four bit codes are phrase.Coding rule is as follows:
One, phrase coding rule:
1. two codings:
For two syllable high frequency words, with two alphabetic codings.The initial consonant that first letter is first syllable (or first letter of zero consonant syllable, as follows).Second letter is the initial consonant of second syllable.Each organizes preamble code, can guide 10 group of two syllable sequence word group in unison.The group of sequence word in unison that is directed is out arranged by frequency reducing.Numerical key with correspondence is selected input.Comprise that this class phrase mean code length of options button is 1.5 keys/word.For example:
Figure 86107235_IMG2
(annotate: the preamble code that shows on screen is a lowercase, and the two syllable phrases that are directed equal less than 10 groups).
2. four codings:
2.1 the coding of the general phrase of two syllables: four lexicographic order codings of lead-in metacode that the general phrase of two syllables is added two syllables with the initial consonant of two syllables are made preamble code.Like this, hit the initial consonant of two syllables earlier, be guided out two syllable high frequency phrases.If wherein there is not the phrase that will import, mean code length 2 keys/word general two syllable phrases promptly appear, in the lead-in metacode of then hitting two syllables.For example:
The phrase that preamble code is directed
I g 0 China 1 this 2 look after in 3 totally 5 directly perceived
6 are responsible for 7 grand 8 regular 9 preciousnesses
Igdu 0 subjectivity
2.2, the coding of triphone phrase: the triphone phrase with one, two, trisyllabic initial consonant adds that a letter " O " supplies four, sequential encoding.Because make the word of initial consonant with " O " few, it is used in encoded tail, both made it meet the phrase code type, be again the identifier of " triphone phrase ".Mean code length 1.33 keys/word.For example:
The phrase that preamble code is directed
The g u d o O Communist Party
G M d o O Kuomintang
V u l O O throughput rate 1 yield-power
┆ ┆ ┆ ┆
Di Jiedijiedijie mends
Add for one two three
Sound vowel vowel mother " O "
(annotate: if preamble code satisfies four and do not have sequence word group in unison, then need not hit options button screen in " jumpings " automatically, finish input.)
2.3, the coding of quadrisyllable group: the quadrisyllable group is with the initial consonant sequential encoding of each syllable.Mean code length 1 key/word.For example:
The phrase that preamble code is directed
I g r m O Chinese people
J v w m O spiritual civilization 1 is well known
┆ ┆ ┆ ┆
Di Jiedijiedijiedijie
One two three four tones of standard Chinese pronunciation
Sound vowel vowel vowel mother
2.4, the coding of pentasyllable phrase: the above phrase of pentasyllable so adopt " phrase sound preface omission compiling method ", saves the initial consonant of second syllable because its syllable number has surpassed the phrase code type figure place of regulation, with one, three, four, the initial consonant sequential encoding of pentasyllable.Mean code length 0.8 key/word.For example:
The phrase that preamble code is directed
The i g u d O Chinese Communist Party
W m f w O serves the people
The s x d h O Four Modernizations
┆ ┆ ┆ ┆
Di Jiedijiedijiedijie
One three four tones of standard Chinese pronunciation five
Sound vowel vowel vowel mother
2.5, the coding of hexasyllable phrase: the hexasyllable phrase saves two. tetrasyllabic initial consonant, with one, three, five, hexasyllabic initial consonant sequential encoding makes preamble code.Mean code length 0.67 key/word.For example:
The phrase that preamble code is directed
I r y h O People's Bank of China
The i n y h O Agricutural Bank of China
1 National Industrial and Commercial Bank of China of i g y h O Bank of China
┆ ┆ ┆ ┆
Di Jiedijiedijiedijie
One three five six
Sound vowel vowel vowel mother
2.6, the coding of seven tunes joints phrase: seven tunes joint phrase saves two. four. hexasyllabic initial consonant, with one, three, five, the initial consonant sequential encoding that saves of seven tunes makes preamble code, mean code length 0.57 key/OK.For example:
The phrase that preamble code is directed
The i r g g O People's Republic of China (PRC)
The i r j j O Chinese People's Liberation Army
W s c a O five advocatings, four points of beauty and three aspects of love
┆ ┆ ┆ ┆
Di Jiedijiedijiedijie
One three five seven tunes
Sound vowel vowel vowel mother
2.7, the coding of multisyllable phrase: it is the multisyllable phrase that seven tunes save above phrase, for the multisyllable phrase without exception with one, three, five, the consonant coding of last syllable, its mean code length is less than 0.5 key/word.For example:
Preamble code is directed phrase
I i x h O Chinese Information Processing Society of China
The i g i t O Communist Youth League of China
I g m h O China Council for the Promotion of International Trade (CCPIT)
See Fig. 8 for details about the phrase pronunciation-shape encode.
Two. single character code:
The present invention is for the coding principle of individual character, also is that the formula of drawing public description individual characters sanctified by usage is formulated.Such as: when people describe " opening " word of surname Zhang, its formula is: " bow-length-open "; When describing qualified " closing " word, its formula is: " people--mouth-close ".According to such formula, the present invention has developed two kinds of compiling methods.A kind of is from sound, adds the font information coding of word, is called " pronunciation-shape encode method "; Another kind is from shape, and with the font information coding at each position, three persons of font less than are called " ideophone compiling method " with the initial consonant polishing of this word.
1, pronunciation-shape encode method: the basic formula of pronunciation-shape encode method is: " this word initial consonant-lead-in metacode-end of file character code ".Specifically be divided into two grades:
1.1, the high frequency word: the individual character that applicating frequency is high is called the high frequency word.For the high frequency word, only use initial consonant one bit code of " this word " to make preamble code.Each preamble code is bootable to go out 10 high frequency words.But, what wherein use letter " O " guiding is not the high frequency word, but 10 punctuation marks commonly used.Arrange by its first stroke of a Chinese character form of a stroke or a combination of strokes digital code with a sign indicating number high frequency word, with its code sign indicating number that elects, this type of individual character, mean code length are 2 keys/word simultaneously.For example:
Preamble code is directed individual character
I 0 this 1 positive 2 account among 3 heavy 4 Zhao 56789 palms
B 0 is by 13 white 4 limits 5 eight 67 to 889 half, 2 north not
1.2, general single character: with " this word " initial consonant, prefix form of a stroke or a combination of strokes code and suffix form of a stroke or a combination of strokes code tri-bit encoding are made preamble code for general single character.Add option code, mean code length 4 keys/word.For example:
The individual character that preamble code is directed
B p X 3 grasps
5 one-tenth of u t o
┆ ┆ ┆ ┆
This font font choosing
Prefix was selected for tail generation
Female pen sign indicating number pen sign indicating number sign indicating number
1.3, general combinde rqdical character: for the initial consonant of general combinde rqdical character with " this word ".The lead-in metacode.End of file character code tri-bit encoding is made preamble code, with the prefix form of a stroke or a combination of strokes code sign indicating number that elects.Mean code length 4 keys/word.For example:
The individual character that preamble code is directed
V v g 4
X k c 1 shape
┆ ┆ ┆ ┆
This head for tail for ┆
Word word word
The choosing of female first code element sign indicating number
Select
Sign indicating number
Pronunciation-shape encode about individual character sees Fig. 8 for details.
2, ideophone compiling method: the ideophone compiling method is chosen the font code of each structure position from the font information of Chinese character, and three persons of less than add the initial consonant of " this word ", supply three.The concrete third gear of dividing.
2.1 high frequency word: said here high frequency word is from the higher word of conformal analysis applicating frequency.Only use lead-in metacode (single character is only used prefix form of a stroke or a combination of strokes code) to make preamble code for the high frequency word.Also select with prefix form of a stroke or a combination of strokes code, mean code length 2 keys/word, for example:
The individual character that preamble code is directed
B's 3
Among the t 5
O 6 states
┆ ┆
Lead-in metacode prefix form of a stroke or a combination of strokes code (corresponding numerical code)
2.2 inferior high frequency word: make preamble code for inferior high frequency word with the code (or code of the single character prefix and the suffix form of a stroke or a combination of strokes) of first and second two characters, with the prefix form of a stroke or a combination of strokes code sign indicating number that elects.For example:
The individual character that preamble code is directed
N z 4 is good
3 autumns of h h
T x 5 Zhu
┆ ┆ ┆
First generation is for logarithm
Word answered in the word word
The key of unit's code element sign indicating number
2.3, general two character combinde rqdical characters: for general two character combinde rqdical characters, make preamble code, select with prefix form of a stroke or a combination of strokes code with the consonant coding that two character codes are added " this word ".Mean code length 4 keys/word.For example:
The individual character that preamble code is directed
N v x 4 surnames
X v x 9 property
┆ ┆ ┆
First for tail for this logarithm
Word word word
Sound is answered word
The code element sign indicating number mother of unit
Key
2.4, general multiword unit combinde rqdical character: be called multiword unit combinde rqdical character more than three characters.For multiword unit combinde rqdical character,, respectively get a character code from each position according to Hanzi structure mould figure (Fig. 7) position of marking.From a left side, last position code fetch is from hanging down from the right side from height for preceding two position code fetches.For example:
The combinde rqdical character that preamble code is directed
0 one of l k e
M n f 9 numbers
R f x 3 is numerous
See Fig. 9 for details about individual character ideophone coding.
Three. fuzzy input method:
For above-mentioned two kinds of input methods, the system software support all can be adopted " fuzzy input ".So-called " bluring " promptly is confused about some information." the fuzzy input " of native system design must know first bit code, that is: " this word " initial consonant or lead-in metacode.If know " this word " initial consonant, with regard to the fuzzy input of employing sound shape; Just adopt the fuzzy input of ideophone if know the lead-in metacode.Three kinds of fuzzy form are respectively arranged.
1. sound shape is blured input form:
The normal pronunciation-shape encode of sound=VVG()
Sound=V? G(second bit code is fuzzy)
Sound=VV? (the 3rd bit code is fuzzy)
Sound=V? (the 2nd, three bit codes are fuzzy)
2. ideophone blurs input form:
The normal shape coding of shape=KCX()
Shape=K? X(is fuzzy to second bit code)
Shape=KC? (fuzzy) to the 3rd bit code
Shape=K? (fuzzy) to the 2nd, three bit codes
Four. repeated code is handled:
Two kinds of input methods of native system exploitation all have repeated code, though the repetition rate of coding is not high, must handle.System software supports, hit option code after, if repeated code is arranged, do not import, but show once more at reminding window, and report to the police by frequency.Hit options button again, just finish input.
Comprehensive above-mentioned four kinds of input methods constitute a complete Chinese character input system that shows unique characteristics.Reference system operational flowchart (being Figure 10) can be finished integrated application.In input process, if word or speech that understanding will be imported just adopt " binary " input; If be not familiar with the word that to import, then can use " character " input instead; If some information in two kinds of input methods is had fuzzy, as long as know the initial consonant or the lead-in metacode of " this word ", promptly available " fuzzy input method " input.Below in conjunction with 12 Sixth Plenary Session of the Party Central Committee communique ending passages, carry out the coding simulation test.
Original text: Xinhua News Agency Beijing September 20
Binary coding: XHVOBJOJ4Y7E1V4
The 12 committee of central authorities of the news Chinese Communist Party on the eight
B8R6YZUOI GUDD8V4E1VYJ7I W
The member can point out by the 6th plenary session communique:
YHD8L0LQCOQT2HY3GB7IUO:
The whole Party and army and people of all nationalities are called in plenary session
“QH4HI1QD7QJRPH3QG0GZ6RM
, study hard and carry out that " Central Committee of the Communist Party of China is about society
O,RI0XX0H3GU4LV5《IGIYGYOVH
The resolution of doctrine spiritual civilization construction guilding principle " heavily fortified point
IYJVWMOJVUYID1FI1D3JYLY》,J
Holding construction of socialist material and spirit civilization grabs together
U1.VHIYWIWMH3JVWM0JVUYAQ1VII
, with the excellence of modernization construction and overall restructuring
I5,Y2XDH0JVUYH3QM1GG0D3YY8
Achievement is met the 13 whole nation of party
UJI,YJ6DKE0D3D8V4C1LQC0QG0
Congressional holding.”
DB0DH0D3IK1。”
More than count 137 words, wherein the pentasyllable phrase occurs twice, and the quadrisyllable group occurs six times, and the triphone phrase twice, two syllable phrase occurs and occurs 33 times, and the high frequency individual character occurs 20 times, and general individual character only occurs six times.Demonstrated fully with phrase and be input as the master.Comprise options button, shared 219 keys, mean code length are 1.6 keys/word.To hang down than system's mean code length 2.3 keys/word.
This system is applicable to the electric word computer of various models. intelligent chinese-English typewriter, teletype writer. and the electronics film titler that Chinese terminal and TV, film making are used.
The present invention compared with prior art has the following advantages:
1, for shape, the speech word binary coding input keyboard that sound shape method compatible and association designs is convenient to association, has alleviated the rote memory amount to character with sound in employing.
2, adopt the formula coding of public description Chinese character by words sanctified by usage, be easy to accept easy note eager to learn.
3, employing is input as the master with the phrase coding, and single character code is input as auxilliary, the binary input, and the measure of walking on two legs makes mean code length reach L=2.3 key/word, makes input speed reach the Sj=150 words per minute.
4, adopting the guiding input, with the prefix form of a stroke or a combination of strokes code key that elects, is the end mark of individual character, represents form of a stroke or a combination of strokes information again.The specialty operator can be by the rule touch system, and general operation person can rely on and guide keystroke to select input, takes into account and popularizes and raising, kills two birds with one stone.
5, novelty of the present invention is that the speech word binary coding theory that is proposed is to propose for the first time both at home and abroad, has filled up China and foreign countries' Chinese information processing research speech word binary coding and has imported this blank; Its creativeness is the words preamble code type control that the switch application of speech word binary input is specific: one or three is individual character.Two or four is phrase; Its practicality is that being input as main measure with the phrase coding meets Modern Chinese language application reality.
The shortcoming of native system is that committed memory is many, accounts for 300K.So, realize that best way of the present invention is to make Chinese Card, can vacate more internal memory like that and move other software, the special benefit that system's performance Chinese character is handled.
Description of drawings:
Fig. 1-system architecture diagram
Fig. 2-system's relevant procedures connection layout
KD-keyboard input driver
CIP1-region-position code loading routine
CIP2-GB loading routine
CIP3-character sign indicating number loading routine
CIP4-CZ-II sign indicating number loading routine
The TE-edit routine
The DD-display driving software
Fig. 3-initial consonant definition figure
Fig. 4-character title associative graph
Fig. 5-form of a stroke or a combination of strokes code figure
Fig. 6-binary input keyboard figure
Fig. 7-Chinese character position structure mould figure
Fig. 8-speech word binary pronunciation-shape encode-look at table
Fig. 9-character ideophone coding complete list
Figure 10-system operation process flow diagram
4. after the preceding revisal of the capable revisal of file name page or leaf
Instructions 2 17 (be called for short " the CZ-II) (being called for short " CZ-II ")
7 19 0.57 keys/row 0.57 key/word
13 7 achievement achievements
8 UJI UJ1
13 16 electric word computer robot calculator

Claims (10)

1, a kind of Comnputer Chinese character system and keyboard thereof, it is characterized in that adopting computer Chinese information processing hanzi system that two kinds of code element coded input methods of speech word constitute and for realize that this system designs with sound for shape, the binary input keyboard of sound shape compatibility.
2, hanzi system according to claim 1, it is characterized in that using GB2312-80 region-position code and GB1988-80 GB and make machine internal information permutation code, speech word binary pronunciation-shape encode input method (being called for short " binary ") and two kinds of complete hanzi systems that input method constitutes of character ideophone coded input method (being called for short " character ") by exploitation, its software configuration is seen Fig. 1, and its relevant procedures connect sees Fig. 2.
3, hanzi system according to claim 1 and keyboard thereof, it is characterized in that to have the basic character of independent names according to initial consonant definition figure (being Fig. 3), the basic character that will not have independent names is according to character title associative graph (being Fig. 4), with the non-word character beyond the basic character according to first stroke of a Chinese character form of a stroke or a combination of strokes formula code definition figure (being Fig. 5), application is with the method for sound for shape and association, be defined in respectively on 26 keys, constitute the binary input keyboard of realizing speech word binary coding input hanzi system scheme, i.e. a Fig. 6.
4, hanzi system according to claim 1 and 2, to it is characterized in that Chinese character be ten class mould figure according to the position inductive structure and marked position serial number (being Fig. 7), two kinds of compiling methods of native system all are according to position structure code fetch, each structure position limit is got a character, from a left side, tail portion tail code fetch is from hanging down from the right side from height for preceding two position code fetches.
5, hanzi system according to claim 1 and 2 is characterized in that the figure place control of the switching of speech word binary input by preamble code, and preamble code is two or four and is phrase that adopt with the phrase coding and be input as the master, single character code is input as auxilliary binary input.
6, hanzi system according to claim 1 and 2 is characterized in that adopting the guiding input, hit preamble code after, show with the code word speech at presenting bank, after hitting option code, just finish input, preamble code is realized with letter key, option code realizes with numerical key, wherein, " O " usefulness " space " key realizes, with the sign indicating number individual character and in unison the sequence word group point out by frequency reducing, in the individual character input, option code promptly is an end mark.
7, hanzi system according to claim 1 and 2, it is characterized in that speech word binary pronunciation-shape encode input method encodes to phrase employing sound preface, (zero consonant syllable replaces with its first letter promptly to use the initial consonant of each syllable, as follows) sequential encoding, multisyllable (more than the pentasyllable) phrase is adopted omission sound preface coding, it is sequential encoding again behind the initial consonant of clipped syllable, in summary, the phrase coding divides two and four two grades, two syllable speech are with the initial consonant sequential encoding of two syllables, the triphone phrase is with one, two, trisyllabic initial consonant is filled one " 0 " four lexicographic orders coding that gathers together enough again, the quadrisyllable group is with one, two, three, four, the initial consonant sequential encoding of syllable, the pentasyllable phrase is with one, three, four, the initial consonant sequential encoding of pentasyllable, the above phrase of hexasyllable is with one, three, five, the initial consonant sequential encoding of end syllable, a speech word binary pronunciation-shape encode general chart is seen Fig. 8.
8, hanzi system according to claim 1 and 2, it is characterized in that speech word binary pronunciation-shape encode input method adopts pronunciation-shape encode to individual character, the individual character pronunciation-shape encode also divides two grades, with a guiding be the high frequency word, with three bit codes guiding be general word, the initial consonant of high frequency word usefulness " this word " is made preamble code, with the numerical key sign indicating number that elects, wherein, with ten punctuation marks commonly used of letter " O " guiding, general word adds the lead-in metacode with the initial consonant of " this word ".Three letters of end of file character code are compiled and are made preamble code, and general single character is added prefix form of a stroke or a combination of strokes code and three alphabetic codings of suffix form of a stroke or a combination of strokes code with the initial consonant of " this word ", sees Fig. 8.
9, hanzi system according to claim 1 and 2, it is characterized in that character ideophone coded input method adopts the ideophone coding to individual character, divide third gear five levels, the high frequency word is made preamble code with lead-in unit (single character with this word initial consonant) code, as option code, two character high frequency words are made preamble code with the code of two characters, with the numerical key sign indicating number that elects, general single character " this word " initial consonant, prefix form of a stroke or a combination of strokes code, three character element codes of suffix form of a stroke or a combination of strokes code are made preamble code, with the numeral sign indicating number that elects, general two character combinde rqdical characters add that with the code of two characters " this word " initial consonant makes preamble code, select with numerical key, the combinde rqdical character of three structure positions is got the code of three characters respectively and is made preamble code from three positions, see Fig. 9 for details.
10, hanzi system according to claim 1 and 2, it is characterized in that concrete operations can realize by system operation process flow diagram (being Figure 10), applicable to large, medium and small and microcomputer (or claiming computer), also be applicable to the electronics film titler of using in intelligent chinese-English typewriter, teletype writer, Chinese terminal and TV, the film making.
CN86107235.9A 1986-10-19 1986-10-19 Chinese character system and keyboard for tow-element word code input Expired CN1006251B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN86107235.9A CN1006251B (en) 1986-10-19 1986-10-19 Chinese character system and keyboard for tow-element word code input

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN86107235.9A CN1006251B (en) 1986-10-19 1986-10-19 Chinese character system and keyboard for tow-element word code input

Publications (2)

Publication Number Publication Date
CN86107235A true CN86107235A (en) 1988-04-27
CN1006251B CN1006251B (en) 1989-12-27

Family

ID=4803530

Family Applications (1)

Application Number Title Priority Date Filing Date
CN86107235.9A Expired CN1006251B (en) 1986-10-19 1986-10-19 Chinese character system and keyboard for tow-element word code input

Country Status (1)

Country Link
CN (1) CN1006251B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1068688C (en) * 1994-10-05 2001-07-18 吴胜远 Literal information processing method and apparatus

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1068688C (en) * 1994-10-05 2001-07-18 吴胜远 Literal information processing method and apparatus

Also Published As

Publication number Publication date
CN1006251B (en) 1989-12-27

Similar Documents

Publication Publication Date Title
CN1113305C (en) Language processing apparatus and method
CN1026525C (en) Intellect five strokes double spelling Chinese ideograph code programme
CN1648828A (en) System and method for disambiguating phonetic input
CN1040276A (en) Simplified and complex character root Chinese character entering technique and keyboard thereof
CN86107235A (en) Speech word binary coding input hanzi system and keyboard
CN1387109A (en) Numeral (keypad) input method for braille
CN1607492A (en) Digital electronic device and bopomofo input method using the same
CN1123819C (en) Chinese character key-position code input method for computer
CN1230726C (en) Chinese-character digital code input method for computer
CN1317906A (en) Integrated system for imputting digitalized English in information processing of mobile communication and computer
CN1187677C (en) Method for inputting Chinese holophrase into computers by using partial stroke
CN1257445C (en) Chinese-character 'Pronunciation-meaning code' input method
CN1374577A (en) General Chinese character input method suitable for letter keyboard and digital keyboard in computer and its keyboard
CN1818837A (en) Chinese character inputting method of normalizing applied Chinese phonetic alphabet scheme
CN86107210A (en) Chinese communication scheme
CN1453692A (en) Intelligent input processing method for pictophonetic Chinese character input
CN1196989C (en) Chinese character pattern schematic input method and keyboard thereof
CN1062797A (en) character input keyboard and method
CN85103869A (en) The multilingual processor
CN1218217A (en) Chinese character codes for computer and inputting method therefor
CN1547093A (en) Chinese character pen type and key-position code input method for computer
CN1092186A (en) Numerically controlled bearing code for Chinese character and input method
CN1120408C (en) Chinese-character struture-pronunciation input method for computer
CN1190206A (en) Easy-to-learn Chinese spelling key input scheme and easy-to-learn Chinese character input method
CN1093654C (en) Structural code Chinese character entering method and used general keyboard

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C13 Decision
GR02 Examined patent application
C14 Grant of patent or utility model
GR01 Patent grant
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: Hebei Handan City 4 Hospital No. 25 building 3 unit 3

Applicant after: Li Xingmin

Address before: No. 86 Zhonghua North Street, Hebei, Handan

Applicant before: Handan City Branch of the Democracy Promotion Association of China

COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: HANDAN BRANCH OF THE DEMOCRACY PROMOTION ASSOCIATION OF CHINA TO: LI XINGMIN

C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee