CN86100118A - The method and apparatus of input Chinese character on an end device - Google Patents

The method and apparatus of input Chinese character on an end device Download PDF

Info

Publication number
CN86100118A
CN86100118A CN198686100118A CN86100118A CN86100118A CN 86100118 A CN86100118 A CN 86100118A CN 198686100118 A CN198686100118 A CN 198686100118A CN 86100118 A CN86100118 A CN 86100118A CN 86100118 A CN86100118 A CN 86100118A
Authority
CN
China
Prior art keywords
syllable
input
identifier
speech
symbol
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN198686100118A
Other languages
Chinese (zh)
Other versions
CN1003193B (en
Inventor
乔基姆·海因策尔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Siemens AG
Original Assignee
Siemens AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Siemens AG filed Critical Siemens AG
Publication of CN86100118A publication Critical patent/CN86100118A/en
Publication of CN1003193B publication Critical patent/CN1003193B/en
Expired legal-status Critical Current

Links

Images

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B41PRINTING; LINING MACHINES; TYPEWRITERS; STAMPS
    • B41JTYPEWRITERS; SELECTIVE PRINTING MECHANISMS, i.e. MECHANISMS PRINTING OTHERWISE THAN FROM A FORME; CORRECTION OF TYPOGRAPHICAL ERRORS
    • B41J3/00Typewriters or selective printing or marking mechanisms characterised by the purpose for which they are constructed
    • B41J3/01Typewriters or selective printing or marking mechanisms characterised by the purpose for which they are constructed for special character, e.g. for Chinese characters or barcodes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018Input/output arrangements for oriental characters

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

Make the speech realization input Chinese character that phonetic alphabet are formed by means of the Latin alphabet.Chinese word is made up of syllable, and each in these syllables changes into Chinese character more again.Speech is the word row that one or more characters are formed, and also is the arrangement that one or more syllables are formed thus.When utilizing phonetic alphabet to form the speech input with these letter row be stored in the middle syllable of storer (SP) and compare and automatically it is divided by syllable, if here its letter both can enroll in the syllable of observing, always they are enrolled in this syllable of observing in the time of can enrolling in the next syllable again.In order to handle unceasingly, syllable has been disposed the syllable identifier and each speech is compiled into a univocality ordered series of numbers of being made up of the syllable identifier.

Description

The method and apparatus of input Chinese character on an end device
The present invention relates to the method for input Chinese character on an end device, the Chinese word that on this end device, utilizes the Latin alphabet to form by Chinese character as a kind of pinyin character input.Relate to a kind of equipment that is used to implement this method in addition.
Well-knownly be: utilize keyboard input Chinese character by means of the speech of forming with the phonetic symbol that can typewrite.Utilize this phonetic symbol to import a kind of syllable similar by the arrangement of the Latin alphabet to diacritic.Chinese word is made up of syllable, and each syllable is made up of a first sound and a last or end syllable.Formed about 410 different syllables thus, each is formed by in one in 25 kinds of first sounds that may occur and the 34 kinds of last or end syllables one in them.Each syllable utilizes a character translation in a plurality of characters (unisonance character) with this syllable corresponding stored to become Chinese words.Therefore speech is word row that one or several character forms or the arrangement of the syllable of being made up of one or more syllables.
Can imagine, import these Chinese characters with keyboard, this keyboard divides does two layers of setting, has comprised first sound and comprised last or end syllable in ground floor in the second layer.Print word by one and can spontaneously realize, because each first sound and last or end syllable are always alternately imported to another conversion of printing word.In general, the division of speech and speech is to utilize to hit space bar and form, and speech is divided by syllable and had no problem, and just weaves into a syllable because whenever hit twice key.
If Chinese character uses the phonetic symbol that can typewrite to import by means of the international keyboard with Latin alphabet, then the simple division of carrying out speech with single syllable can not directly be accomplished, because first sound is made up of zero to two letters, last or end syllable is made up of one to four letter.
The present invention is based on following task: promptly determine a kind of method and apparatus, by means of them the Chinese word of the phonetic symbol input that application can print with keyboard can automatically be divided by single syllable.
This invention task that relates to above-mentioned technical method realizes in the following manner according to the present invention: promptly ought use an international keyboard (TA) with Latin alphabet to import after each phonetic alphabet, the letter row of input are compared with the syllable of storage in a storer (SP), can automatically speech be opened by syllabification, here, if letter not only can enroll in the syllable of observing but also can enroll in the next syllable time, then always it is organized in the syllable of observing.
Has such advantage according to method of the present invention: existing international keyboard promptly not can be used to import Chinese character as any change, and the service fee that this input mode needs is low.
For fear of the polysemy that is produced with the syllabification speech, may occur introducing segmentation symbol on the position of polysemy at each can head it off.For example, these segmentation symbol is-symbols " ' " or symbol "-".For the word processing that continues, each syllable is distributed a syllable identifier, and speech is compiled into the identifier ordered series of numbers of a univocality.
These syllable identifiers are that the syllable that arranges in alphabetical order formation relatively is numbered according to method of the present invention.
The effective equipment that is used to implement said method has following feature: the storer that is provided with all common syllables of storage therein; And control module, it every imported a letter after, whether the syllable that is about to store in itself and the storer compares, can be discerned as single syllable to univocality with these letters of verification, otherwise always continue to observe letter subsequently, its objective is for by the syllabification speech.
In order to realize the present invention, control module has such function: promptly ought import after the segmentation symbol, this control module is about to the ultima of input just and the letter of back is separated.
In order to handle further, in storer, distributed the syllable distinguishing mark at syllable, and the syllable distinguishing mark under control module is read from storer when having determined the single syllable that meaning exists.
The syllable identifier of storing in storer is corresponding with the syllable that arranges in alphabetical order appearance.
Below incite somebody to action the equipment that at length explain method of the present invention with reference to the accompanying drawings and implement this method.Its accompanying drawing is:
Fig. 1: implement the equipment block diagram that the inventive method is used.
Fig. 2: by the description of Chinese character to three speech.
Comprised a keyboard TA in the equipment shown in Fig. 1, it is actually in order to the key of the input Latin alphabet and the international keyboard that constitutes in order to the key of input digit.This keyboard TA and a control module ST link, and preferably include one or several microprocessor in this control module.This control module ST links with a storer SP again, have in this storer 410 with letter representation and with the Chinese syllable of binary digital encoding.Also set up the syllable identifier that distributes with respect to syllable in addition in this storer SP, they are corresponding with the syllable that arranges in alphabetical order formation.This control module ST and then link with a display unit AE who comprises image display screen BS again, and also can link with a printer DR.Last control module ST also links with a processing unit VE, the syllable identifier is sent to this processing unit, it is an end device with the joining text system of above-mentioned part, print or pen recorder for one, computing machine or with the joining long distance communication end device of toll cable FL.Specifically in processing unit VE, also be provided with and be used to carry out some devices of a row syllable being translated into Chinese character, because always there are a plurality of Chinese characters to dispose correspondingly concerning each syllable, the pronunciation of these Chinese characters is identical or they are to represent with same phonetic alphabet at least.
By means of keyboard TA, Chinese word just can utilize the phonetic alphabet input, and control module ST automatically divides the speech of input and these syllables is distributed the syllable identifier in the single syllable mode by means of storer SP, and these syllable identifiers send processing unit VE to by control module more then.In order to carry out control operation, the speech of these inputs and/or syllable and/or syllable identifier and corresponding Chinese character all can be exported on image display screen BS and printer DR.
Each syllable is made up of one of one of 25 kinds of first sounds that may occur and 34 kinds of last or end syllables that may occur, and first sound may be made up of zero to two letters, and last or end syllable may be by one to four alphabetical composition.The pairing syllable of this speech under the situation of having imported speech just can automatically be tried to achieve by means of control module ST and storer SP, there, the letter row that always will import always compare with the syllable that is stored among the storer SP at every turn, if and univocality soon also be that this syllable identifier that is stored in this storer provides out when identifying a single syllable, again this syllable identifier is delivered to processing unit VE, the letter of being imported, if they may layout be in the syllable of observing at this both, might enroll again in the next syllable, at this moment always they are programmed in this syllable of observing and go.
The syllable of storing in storer SP is that in alphabetical order arrangement stores, and each syllable layout is risen among the ordered series of numbers of arranging from 1 to 410 at one.The syllable identifier is illustrated on the table 1 briefly corresponding to the allocation table of syllable, and table 2 then is to represent the allocation table of syllable corresponding to the syllable identifier conversely.(table vides infra)
If, for example by phonetic alphabet input Chinese word: " babaocai ", control module ST will be behind input alphabet b, whether verification has possessed a complete syllable, because the allocation table letter " b " according to table 1 and table 2 can not constitute a syllable, whether can be identified as a syllable in the coverlet free burial ground for the destitute so after having imported second letter " a ", carry out verification again.What now still cannot determine in this case, because though syllable " ba " is the syllable with identifier 6 storage, also might be following other letter thereafter, so that it may only be to belong to syllable " bai ", " ban ", the part of " bang " or " bao ".This control module ST just can discern after next letter " b " is imported again, because " bab " this syllable do not occur in the allocation table of storer SP, has so just obtained first syllable and has been " ba ".This syllable promptly is flagged as complete syllable and syllable identifier 6 is gone up in its configuration.Whether when behind the 3rd letter " b " the back next letter of input " a ", carrying out verification again is a complete syllable, also must wait for the input of next letter in this case.Behind the next letter of input " o ", it is a complete syllable " bao " that this control module promptly identifies this again, because other syllable that does not start with " bao " in the allocation table of storer SP.Therefore this syllable also provides out with syllable identifier 10 as complete syllable and according to the allocation table of table 1 or table 2.Behind the next letter of input " c ", can not identify a syllable, still can not identify to univocality a syllable behind the input letter " a " subsequently again, because it might be related to syllable: " ca ", " cai ", " can ", " cang " or " cao ".After just thinking to have imported last letter " i ", but just identify to univocality syllable " cai ", and correspondingly in allocation table, disposed identifier 23.
Syllable is to correspond to that lexicographic order is arranged and quilt volume on number in the ordered series of numbers of arranging from 1 to 410 rising in the allocation table of table 1.And what insert is last or end syllable in first row in the allocation table of table 2, and they begin with vowel, shown in the secondary series is being the syllable identifier of subordinate.First the row in the expression then be first sound, all syllables all be begin with these first sounds and following thereafter first row in the expression the last or end syllable part.What provide is the syllable identifier corresponding with that syllable on the joining with row of being expert at, this syllable promptly be begin with the first sound in first row and with the last or end syllable that provides in the one's own profession as the syllable that finishes.
On the joining of row and row, represent that by numerical value O this syllable is non-existent in the table 2.For example there are not syllable " bc " or " cei ", do not have syllable " m ü e " yet.And " z ü e ".The syllable that when input speech " babaocai " and speech " banama ", is occurred and under the syllable identifier always in the scope of table 1 and table 2 solid line institute frame, make to have sign.
When importing, speech " banama " has similar mode: at first be input alphabet " b " input alphabet " a " then to the input of speech " babaocai ".Because after input two letters " ba ", can't identify syllable to univocality, just think after letter " n " input, just can identify complete syllable and come and export a syllable identifier 8 to processing unit.Syllable below after the next letter of input " a " again can not the identification of univocality ground.After just thinking to have imported letter " m ",,, and it has been disposed syllable identifier 1 so letter " a " can be identified as a complete syllable because in storer SP, do not have syllable " am ".Syllable " ma " is identified and to its configuration syllable identifier 191 behind space between two letters of input " ma " and subsequently speech in the corresponding way.
As what can know by inference from the allocation table of table 1: speech " banama " might be divided into that to have identifier be 6,210 and 191 syllable " ba ", and " na ", " ma " in this case will be with speech: " ba ' nama " or " ba-nama " input.Because the method according to this invention, a letter, if it not only can belong to this in observed syllable but also can belong to next syllable, so always it is enrolled in this syllable in observed, if can exist the threat of another kind of layout syllable the time, will import special segmentation symbol.This segmentation symbol for example is: symbol " ' " or symbol "-", therefore divide in this univocality of syllable of in particular cases also may accomplishing.
Compiling from the syllable to the Chinese character can't realize now immediately, because any one syllable of representing with phonetic alphabet is corresponding with several Chinese characters.For example just exist 18 operational Chinese characters, reach " cai " for syllable " bao " and exist 17 and 11 operational Chinese characters respectively corresponding to " ba ".From these corresponding Chinese characters, select at this moment to utilize processing unit VE automatically to finish, as another part application for patent of submitting simultaneously with present patent application ... described in, used a vocabulary for this reason, utilize its identifier to store 30 therein, 000 Chinese word and can at every turn determining here: which compilation symbol what syllable at this moment was related is, for this symbol and then be assembled into the ordered series of numbers of a unified 4-digit number, this ordered series of numbers has promptly indicated corresponding Chinese character.Even if on the image display screen BS of display unit AE or on printer DR with demonstrating the Chinese character corresponding with importing speech.Text transmission originally can utilize with the joining toll cable FL of this processing unit VE and transmit the four figures ordered series of numbers of compiling Chinese character or transmit the syllable identifier, here the prerequisite that transmits the syllable identifier is: in receiving element, Chinese character to the compilation of syllable identifier utilizes the speech storer just can accomplish.
Described speech " babaocai " in Fig. 2, " banama " reaches " ba ' nama " and the syllable identifier of their institute's subordinates and the Chinese character that utilizes the speech storer to try to achieve.

Claims (8)

1, the method of input Chinese character on an end device, on this end device, utilize the Latin alphabet to make the Chinese word that the phonetic symbol input is made up of Chinese character, it is characterized in that: after utilizing international keyboard to import each phonetic alphabet with Latin alphabet, the letter row of having imported are made comparisons with the syllable of the middle storage of storer (SP), Chinese word will automatically be divided with single syllable and come, if its letter both can enroll in the syllable of observing therein, in the time of can enrolling next syllable again, always they are programmed in each syllable under observation and go.
2, according to the method for claim 1, it is characterized in that: imported a segmentation symbol of two inter-syllables when with phonetic alphabet input speech after, then the syllable of front is promptly indicated into complete syllable.
3, according to the method for claim 2, it is characterized in that: utilize symbol " ' " and symbol "-" as segmentation symbol.
4, according to the method for a claim in the claim 1 to 3, it is characterized in that: the syllable that identifies for each univocality disposes a last syllable identifier, and this speech is compiled into a univocality ordered series of numbers of being made up of the syllable identifier.
5, according to the method for claim 4, it is characterized in that: the syllable identifier is that the syllable of arranging formation in alphabetical order distributes in proper order.
6, enforcement of rights requires 1 equipment, it is characterized in that: be provided with storer (SP), in this storer, stored all common syllables: and be provided with a control module (ST), it behind symbol of every input by with storer (SP) in the syllable of storage compare verification: whether can identify to univocality a syllable, otherwise each symbol of observing again thereafter is intended to the syllabification speech.
7, according to the equipment of claim 6, it is characterized in that: control module (ST) demarcates at the letter that is about to the final syllable of input just and input subsequently after the segmentation symbol input.
8, according to the equipment of claim 6 and 7, it is characterized in that: in storer (SP), stored the syllable identifier, the syllable of identifier dispensing that has only a meaning in these syllable identifiers, and the syllable identifier of correspondence is sent to processing unit (VE) according to the speech of input.
CN86100118A 1985-02-15 1986-01-10 Method and apparatus for dividing chinese words into single syllable Expired CN1003193B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19853505291 DE3505291A1 (en) 1985-02-15 1985-02-15 Method and arrangement for inputting Chinese characters into a terminal
DEP3505291.0 1985-02-15

Publications (2)

Publication Number Publication Date
CN86100118A true CN86100118A (en) 1986-08-13
CN1003193B CN1003193B (en) 1989-02-01

Family

ID=6262666

Family Applications (1)

Application Number Title Priority Date Filing Date
CN86100118A Expired CN1003193B (en) 1985-02-15 1986-01-10 Method and apparatus for dividing chinese words into single syllable

Country Status (3)

Country Link
JP (1) JPS61193258A (en)
CN (1) CN1003193B (en)
DE (1) DE3505291A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5175803A (en) * 1985-06-14 1992-12-29 Yeh Victor C Method and apparatus for data processing and word processing in Chinese using a phonetic Chinese language
EP0271619A1 (en) * 1986-12-15 1988-06-22 Yeh, Victor Chang-ming Phonetic encoding method for Chinese ideograms, and apparatus therefor

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2033633B (en) * 1978-10-03 1982-05-19 Pok Fun Ng Ideographic coding
GB2060231B (en) * 1979-10-12 1983-11-23 Int Telecommunications Adminis Easy and flexible method of imputting ideogram-type language characters into a computer system
DE3214362A1 (en) * 1982-04-20 1983-10-20 Olympia Werke Ag CIRCUIT ARRANGEMENT IN WRITING OR SIMILAR MACHINES WITH A LARGE CHARACTER OF CHARACTERS

Also Published As

Publication number Publication date
JPS61193258A (en) 1986-08-27
CN1003193B (en) 1989-02-01
DE3505291C2 (en) 1988-09-08
DE3505291A1 (en) 1986-08-21

Similar Documents

Publication Publication Date Title
US4505602A (en) Method for encoding ideographic characters
US4193114A (en) Ticket-issuing system
US5175803A (en) Method and apparatus for data processing and word processing in Chinese using a phonetic Chinese language
CA2168133A1 (en) Device and method for displaying the title of pieces of music
US4611995A (en) Electronic language learning machine
GB2283598A (en) Data entry workstation
CA1279128C (en) Means and method for electronic coding of ideographic characters
CN86101871A (en) The method of selection and reproducing language characters
NL7907353A (en) IDIOGRAPHIC CODING.
CN86100118A (en) The method and apparatus of input Chinese character on an end device
KR100284847B1 (en) Dictionary of an Alphabetic Foreign Language
US7734571B2 (en) Method for processing sensor data within a particle stream by a KStore
CN1149463C (en) Method of inputting characters through combination of numberic keys
US4294550A (en) Ideographic typewriter
US5529496A (en) Method and device for teaching reading of a foreign language based on chinese characters
EP0271619A1 (en) Phonetic encoding method for Chinese ideograms, and apparatus therefor
CN1019425B (en) Chinese input system and its key board
KR940007932B1 (en) Method and apparatus for processing ideographic characters
CN1074552C (en) User's action recording device
JP2812218B2 (en) Data search device and data search method
GB2100899A (en) Encoding ideographic characters
CN1036846A (en) Chinese character processing device
US3727327A (en) Coded filing and retrieval system
CN1116336A (en) Substitution type Chinese phonetic character, word input coding method and keyboard thereof
KR100454806B1 (en) New Multi-purpose Visual-Language System Based On Braille

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C13 Decision
GR02 Examined patent application
C14 Grant of patent or utility model
GR01 Patent grant
C19 Lapse of patent right due to non-payment of the annual fee
CF01 Termination of patent right due to non-payment of annual fee