CN1035568A - The input processing method of Chinese - Google Patents

The input processing method of Chinese Download PDF

Info

Publication number
CN1035568A
CN1035568A CN 88100306 CN88100306A CN1035568A CN 1035568 A CN1035568 A CN 1035568A CN 88100306 CN88100306 CN 88100306 CN 88100306 A CN88100306 A CN 88100306A CN 1035568 A CN1035568 A CN 1035568A
Authority
CN
China
Prior art keywords
chinese
mark
word
phonetic
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 88100306
Other languages
Chinese (zh)
Inventor
李约瑟
羽深邦男
石和田四郎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsui Toatsu Chemicals Inc
Original Assignee
Mitsui Toatsu Chemicals Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsui Toatsu Chemicals Inc filed Critical Mitsui Toatsu Chemicals Inc
Publication of CN1035568A publication Critical patent/CN1035568A/en
Pending legal-status Critical Current

Links

Images

Abstract

The present invention relates to a kind of Chinese language input processing method.This method is omitted marker word by the alphabetic character key with phonetic labelling method or Roman capitals Chinese is imported, the corresponding one by one Chinese character mark of Roman capitals mark ellipsis Chinese dictionary in being stored in the storer that is provided with in the Chinese input system in advance, the omission text transform of input is treated as the Chinese of completeness, its advantage has great practical value for making Chinese language input processing method and both realized rapidly and hope accurately.

Description

The input processing method of Chinese
The present invention is the input processing method of relevant Chinese, and it is relevant to phonetic mark language and ellipsis to say so in more detail, Chinese is imported, carried out then the Chinese language input processing method of conversion process by the alphabetic character key.
Chinese is made up of numerous Chinese characters, and these Chinese characters are as then very bothering with inputs such as keyboards.For example also can consider only to prepare to be equivalent to the key of the quantity of Chinese character, but not have practical value because of bond number too expands.Therefore, can adopt Chinese character is divided into left side (being called " partially ") and right-hand part (being called " side "), or be divided into the first half (being called " hat ") and Lower Half (being called " pin "), reach the mode that Chinese character is imported in combination that " side " or " hat " reach " pin " by " partially ".Yet, this kind mode also because of must have numerous " partially " keys, " side " key etc. makes efficient extremely low.
Therefore, just begin promptly to utilize in the following way phonetic labelling method with alphabetic flag Chinese pronunciation, and by the mode of alphabetic character key with Roman capitals mark pattern input Chinese character.Rise and fall because of only not differentiating, so represent to rise and fall with tone with the phonetic labelling method again.Circumflex has four kinds, is called " four tones of standard Chinese pronunciation ".The tone of Chinese character is (what have is not) that regulation is arranged, and the common regulation of phonetic mark must be enclosed these four tones of standard Chinese pronunciation.The four tones of standard Chinese pronunciation are defined by the following stated mode.
One: advance with high tone level.Represent with [-].
Two: rise to high pitch by bass.Represent with [/].
Three: being transferred to bass, also gone up again by high pitch is high pitch.Represent with [∨].
The four tones of standard Chinese pronunciation: reduce to bass by high pitch.With [] expression.
Below, represent with example.For example use " ma " of phonetic mark, as add that tone can tell following Chinese character.
M ā (one): mother (mother)
M á (two): fiber crops (numbness)
M ǎ (three): horse (horse)
M à (four tones of standard Chinese pronunciation): scolding (abuse)
As the labelling method of the four tones of standard Chinese pronunciation, the method for the numeral of adopting is arranged also except that above-mentioned symbol.
According to above explanation,, the phonetic labelling method of 1. not paying the four tones of standard Chinese pronunciation is arranged as the phonetic labelling method; 2. pay the phonetic labelling method of the four tones of standard Chinese pronunciation; 3. use the phonetic labelling method of the numeral four tones of standard Chinese pronunciation etc.
Using the phonetic labelling method occasion do not pay the four tones of standard Chinese pronunciation, during retrieval output because of many homonyms corresponding (with reference to aforementioned " ma ") are arranged, so will be specific go out needed Chinese character, operate loaded down with trivial details, cost a lot of times.Therefore, the processing of the four tones of standard Chinese pronunciation is problems in phonetic mark input mode, and how rationally and efficiently to be handled is a technical problem so far always.
The problem that exists during in addition, with the direct mark Chinese of phonetic is to have that the tab character number increases and shortcoming that treatment effeciency is reduced.For example: open the phonetic mark mode of once having reported on the clear 61-20176 communique with the numeral four tones of standard Chinese pronunciation the spy.If in this manner, for example to Chinese " everlasting ", carry out the words of mark with the phonetic mark mode of the numeral four tones of standard Chinese pronunciation, then be " wan4 gu3 chang2 quing1 ", the key operation increased frequency that the Chinese character that input is made up of four words is required, the treatment effeciency significance difference.
Given this, the objective of the invention is to, adopt common alphabetic character key,, realizes a kind of both rapid and exactly Chinese language input processing method processing, high efficiency imported in Chinese to reduce the key operation number of times that input Chinese is used.
The present invention who addresses the above problem has and is characterised in that, will be 2 that make according to following rule 1 and rule, make Roman capitals mark ellipsis and the corresponding one by one Chinese character mark of the corresponding Roman capitals mark of Chinese character mark ellipsis Chinese dictionary be stored in (step 1.) in the storer set in the Chinese input system in advance; When with the phonetic labelling method Chinese being imported, import (step 2.) according to following regular 1 and regular 2 by the alphabetic character key; The ignore character conversion process of input is become the Chinese (step 3.) of completeness with reference to the corresponding one by one Chinese character mark of aforementioned Roman capitals mark ellipsis Chinese dictionary.
Fig. 1 is the process flow diagram of the principle of expression the inventive method.
Easy the phonetic mark input method and the Chinese idiom ellipsis of paying the four tones of standard Chinese pronunciation are made up, thereby processing imported in Chinese.
Embodiment
Below be elaborated with regard to embodiments of the invention.At first, rule 1 is illustrated.
At this, if the special word literal 2. of order rule 1 and even symbol adopt " j ", special word literal 4. and even symbol adopt " h ", and aforesaid phonetic mark " ma " is adopted rule 1, and be then as follows.
Ma(one): mother
Maj(two): fiber crops
Maa(three): horse
The mah(four tones of standard Chinese pronunciation): scolding
Rule 2 is illustrated for example with that.The above Chinese idioms of aforesaid three words " everlasting " as with paying the input of four tones of standard Chinese pronunciation phonetic labelling method, then are " w à n g ǔ ch á ng q ū ing ".Now it is adopted the present invention.Because of becoming " wanh ", application rule 1 shows the four tones of standard Chinese pronunciation according to rule 2, the 1 words " ten thousand ".The 2nd word begins, and promptly " ancient green for a long time " then only adopts the first initial of phonetic mark, is " gcq ".The whole input just with " wanh gcq " gone.Key operation number of times (hereinafter referred to as the touching number) when utilizing the phonetic labelling method input of paying the four tones of standard Chinese pronunciation is 17 times, and is relative therewith, the touching number when utilizing the inventive method input only 7 times, and number of operations has reduced significantly as can be known.
Begin only to make the corresponding reason of first initial of each word to describe at this to the 2nd word in the rule 2.Adopt aforementioned four-tone tone symbol, though can distinguish Chinese character to a certain extent, word of a word is how not easy to identify because of homonym again.Ma(one only for example) pairing Chinese character just has a lot of words such as " smearing ", " ant ", " fiber crops ", " mill ", " mother ".For the everyday words of forming by two words, how not easy to identify when not having the tone symbol because of homophone with the phonetic mark, go up then very easily identification of circumflex as paying.
On the other hand, in the everyday words that three words above (comprising 3 words) are formed, the homophony language has just reduced widely, but when Chinese input system etc. is imported as alphabet all being used the input of phonetic labelling method, then the touching number will increase.Thereby, reduce for making the touching number, only expected the method for importing with first character of the phonetic mark of word.For example " people " are as representing then to be " r é n m with the phonetic mark
Figure 881003069_IMG2
N ", only get its first initial and be expressed as " rm ".Yet, adopt this mark rule that homophone is increased, retrieval is become very bother.
For this reason, the present invention for 2 word everyday words (remittance of 2 words) all import according to the phonetic mark with interior, occasion for the everyday words more than 3 words (the above Chinese idiom of 3 words), its first initial word is imported with the phonetic mark, the 2nd word begins the phonetic mark ellipsis (Roman capitals mark ellipsis) imported with first initial, thereby makes it identification easily and the touching number reduces.
Because so the Chinese of input is through simple significantly, as directly exporting then interrogatory like this.Thereby, the present invention will according to aforesaid regular 1 and the Roman capitals marks of rule 2 inputs be for conversion into the dictionary (the corresponding one by one Chinese character and words dictionary of Roman capitals mark ellipsis) that the Chinese of complete shape uses and be arranged in the Chinese input system, make the simple mark that is transfused to be for conversion into the Chinese of completeness.In case after being for conversion into the Chinese of completeness, just this Chinese can being exported according to required purpose, or be appended processing.
Fig. 2 is the pie graph that an example of the Chinese input system that the inventive method uses is implemented in expression.Among the figure, 1 is the keyboard of being made up of alphabetic character key and other operating key; 2 for accepting from the CPU(CPU (central processing unit) of carrying out various controls after the input of keyboard 1).3 is the storer that is connected, stores various information with CPU2, and above-mentioned Roman capitals mark ellipsis one corresponding Chinese character mark dictionary 3a is included in its inside.4 are the CRT of expression by the various information of CPU2 output, and 5 is that 6 is the output unit of output transform result to being transformed into the treatment circuit that completeness Chinese is handled as required.As output unit 6, can use for example printing equipment.
Below the action of the system that constitutes is thus described.The operator omits the Chinese that input mode will import and imports from keyboard 1 by deferring to above-mentioned phonetic regular 1, rule 2.The Roman capitals mark ellipsis that CPU2 comes in input and storer 3 interior Roman capitals mark ellipsis one corresponding Chinese character mark dictionary 3a contrast, and read corresponding Chinese character.Finish thus Roman capitals mark ellipsis to the conversion process of corresponding Chinese character.Transformation results is that the cache content that for example is stored in the memory buffer (not shown) set in the CPU2 is presented at CRT4.Number of times carries out this operation repeatedly as required, just can finish the Chinese input and handle.So, express the Chinese of input among the CRT4.
For the occasion of the Chinese that CRT is represented by output unit 6 printouts, with the content of the impact damper in the CPU2 directly to output unit 6 printouts.That is to say that such structure is exactly the word processor of Chinese.If when its word processor as Chinese is worked, also be necessary to make it to increase and a kind of the article that is stored in the memory buffer in the CPU2 such as changed and even correct at editting function, as distribute to the operating key that is attached in the keyboard 1 with various functions, with additional or change some softwares, just can adapt to.
In the treatment circuit 5, can carry out for example handling, perhaps handle for be connected the interface that needs with computing machine for be connected the modified tone that needs with communicating circuit.When linking to each other, become Chinese and pass civilian communication device, just can import the computer operation of Chinese as linking to each other with computing machine with order circuit.
According to the present invention, because keyboard 1 can be with common letter key, so can use the ASC II QWERTY keyboard that also has operating key outside 26 letters.
Below, more specifically the present invention is illustrated.
For following Chinese article being imported, carry out the Chinese character conversion and export to handle being illustrated by the present invention.
<Chinese 〉
The Silk Road has connected east, become on traffic main artery between east and west, has promoted cultural exchanges between east and west, commercial trade and friendly exchanges with West Asia, Europe.
In recent years, the Silk Road is reopened in the common decision of China and some friendly countries.This to the friendship between the developing china and the people of various countries, be bound to make new contribution.
With the information of letter input,, then as follows as representing above-mentioned article with the mark of the phonetic band four tones of standard Chinese pronunciation.
<phonetic band four tones of standard Chinese pronunciation mark 〉
s
Figure 881003069_IMG3
chóuzh lù bǎ dōngfāng gēn x
Figure 881003069_IMG5
yà,ōuzhōu liánx
Figure 881003069_IMG6
q lái,chéngle dōngx
Figure 881003069_IMG8
fāng de jiāotōng yàodào,cùj
Figure 881003069_IMG9
n le dōngx fāng de wénhuà jiāoliú,tōngshāng màoy
Figure 881003069_IMG11
hé yǒuhǎo wǎnglái.
j
Figure 881003069_IMG12
nniánlái,zhōngguó hé y xiē yǒuhaǒ guójiā gòngtóng juéd
Figure 881003069_IMG14
ng chóngx n kāifàng s chóuzh
Figure 881003069_IMG17
lù.zhèdu
Figure 881003069_IMG18
fāzhǎn zhōngguō rénm
Figure 881003069_IMG19
n hé gèguó rénm n zh
Figure 881003069_IMG21
jiān de yǒuy
Figure 881003069_IMG22
,y d
Figure 881003069_IMG24
ng hu zuòchū x
Figure 881003069_IMG26
nde gòngxiàn.
As according to common phonetic Roman capitals mark during with the input of this mark its touching number be 437 times.Then represent identical article with phonetic mark ellipsis of the present invention, then as follows.
<phonetic of the present invention omits mark 〉
siczl baa dongfang gen xiyah,ouzhou lianjxql chengjle dongxf d jiaotyd,cuhjinh l dongxf d wenjhjl,tongsmy hej yoouhwl.
jinhnl,zhongguoj hej yixie yoouhgj gonghtjd chongjxkf siczl,zhehduih fazhaan zhonggrm hej gehgrm zhijian d yoouyih,yidh zuochu xind gonghxianh.
Touching number in the case is 221 times, is about 1/2 when importing according to phonetic Roman capitals mark, and visible the present invention improves efficient greatly.
Fig. 3 pays the contrast figure of four tones of standard Chinese pronunciation mark and phonetic omission mark for Chinese character mark, the phonetic of the used Chinese idiom of present embodiment.Can think and collect the information shown in this figure in a large number among the corresponding one by one Chinese character mark of the Roman capitals mark ellipsis dictionary 3a shown in Fig. 2.At this moment, the capacity of the corresponding one by one Chinese character mark of Roman capitals mark ellipsis dictionary 3a is if any 10~120,000 words, and is then enough for common application target.
As above describe in detail, according to the present invention, by easy pair four tones of standard Chinese pronunciation phonetic mark input method and Chinese idiom ellipsis is combined, can make and utilize common alphabetic character key that required key operation number of times minimizing imported in Chinese, to realize rapidly and to import exactly and handle Chinese language input processing method Chinese, that efficient is high that great practical function is arranged.
The simple declaration of accompanying drawing drawing:
Fig. 1 is the block diagram of the principle of expression the inventive method;
Fig. 2 be for implement the inventive method for the pie graph of an example of Chinese input system;
Fig. 3 pays four tones of standard Chinese pronunciation mark for Chinese character mark, the phonetic of Chinese idiom and phonetic omits the corresponding corresponding diagram of mark.
1 ... keyboard
2……CPU
3 ... storer 3a ... Roman capitals mark ellipsis
Corresponding one by one Chinese character mark dictionary
4……CRT
5 ... treatment circuit
6 ... output unit

Claims (1)

1, a kind of Chinese language input processing method, it is characterized in that this Chinese language input processing method will according to following regular 1 and rule be 2 that make, make Roman capitals mark ellipsis and the corresponding one by one Chinese character mark of the corresponding Roman capitals mark of Chinese mark ellipsis Chinese dictionary be stored in (step 1.) in the storer set in the Chinese input system in advance; When with the phonetic labelling method Chinese being imported by the alphabetic character key, according to following regular 1 and rule 2 import (step 2.), the omission text transform that will import with reference to the corresponding one by one Chinese character mark of aforementioned Roman capitals mark ellipsis Chinese dictionary is treated as the Chinese (step is 3.) of completeness;
(rule 1)
As follows the vocabulary of the vocabulary of 1 word and 2 words is paid and is imported after adding the four tones of standard Chinese pronunciation:
1. to word summation tone sound symbol not;
2. pay at two word suffixes and add specific literal and even symbol;
3. at the overlapping initial vowel of third tone word suffix;
4. pay at four tones of standard Chinese pronunciation word suffix and add specific literal and even symbol different with 2. the time;
(rule 2)
The Chinese idiom of 3 words above (comprising 3 words), the 1st word by 1 pair of rule add the four tones of standard Chinese pronunciation, the 2nd word rises and only makes first word of each word corresponding.
CN 88100306 1987-07-31 1988-01-20 The input processing method of Chinese Pending CN1035568A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP192086/87 1987-07-31
JP62192086A JP2624484B2 (en) 1987-07-31 1987-07-31 Chinese input processing method

Publications (1)

Publication Number Publication Date
CN1035568A true CN1035568A (en) 1989-09-13

Family

ID=16285414

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 88100306 Pending CN1035568A (en) 1987-07-31 1988-01-20 The input processing method of Chinese

Country Status (2)

Country Link
JP (1) JP2624484B2 (en)
CN (1) CN1035568A (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5539911A (en) 1991-07-08 1996-07-23 Seiko Epson Corporation High-performance, superscalar-based computer system with out-of-order instruction execution
CH689424A5 (en) * 1994-05-06 1999-04-15 Clariant Finance Bvi Ltd A fiber-reactive disazo.
CN103838393B (en) * 2014-03-03 2017-10-13 万仁芳 Hanzi structure number character learning input method

Also Published As

Publication number Publication date
JPS6436366A (en) 1989-02-07
JP2624484B2 (en) 1997-06-25

Similar Documents

Publication Publication Date Title
US6003049A (en) Data handling and transmission systems employing binary bit-patterns based on a sequence of standard decomposed strokes of ideographic characters
WO2003104963A1 (en) Input method for optimizing digitize operation code for the world characters information and information processing system thereof
CN102867049A (en) Chinese PINYIN quick word segmentation method based on word search tree
CN1739082A (en) Apparatus and method for enabling unicode input in legacy operating systems
CN1786884A (en) Apparatus for improving key identification accuracy in terminal of requiring multi-keys and method thereof
CN1035568A (en) The input processing method of Chinese
CN1290886A (en) Method system and computer program products for optimum byte and character processing
CN87103761A (en) Chinese character stroke order and shape code input method
CN1054219C (en) Substitution type Chinese phonetic character, word input coding method and keyboard thereof
GB2071018A (en) Improvements in method and apparatus for information processing
CN1035083C (en) Word-oriented Chinese character typing device
EP1221082B1 (en) Use of english phonetics to write non-roman characters
CN1079562A (en) Kinds of words digital encode method and keyboard thereof
CN1026626C (en) Plane keyboard with seperated pages having seperate Chinese characters for inputting
CN1144142C (en) Method for processing duplicate kay words in dual-language dictionary
CN1040278A (en) The multilingual terminological data bank of Chinese character system implementation method
JPH06282566A (en) Information processor
CN85104831A (en) Box-like convenient Chinese character compiling method of head-abdomen-tail number code harmony simple or compound vowel of a Chinese syllable code character and multi-function Chinese character input medium keybiard
CN1099627C (en) System for processing multi-double-bit group word with code page
CN1063946A (en) " Chinese communication scheme " abbreviated spelling computer input method and keyboard
CN86107029A (en) The input in Chinese device
Wang et al. LITREF-a microcomputer based information retrieval system supporting stroke diagnosis, design, and development
CN86102418A (en) Chinese syllable processor and Chinese syllable disposal route
CN1248091C (en) Method for shearing documentation in complex and simplified Chinese-character inputting method
CN2518148Y (en) On-line identification braille board

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication