CN1153355A

CN1153355A - Multilingual file processing device

Info

Publication number: CN1153355A
Application number: CN95119230A
Authority: CN
Inventors: 郭俊桔
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1994-11-11
Filing date: 1995-11-10
Publication date: 1997-07-02
Anticipated expiration: 2015-11-10
Also published as: JPH08137885A; CN1097244C

Abstract

This invention enables a word processor for one language to be used for another language. An input language process part reads in a character input program for the language to be processed at present from a language process storage part according to an inputted language flag. An input code decomposition part decomposes the inputted character code into a text code and a language indication code. An editing process part edits the decomposed text code and language indication code at the same time. An output code composition part composes a corresponding character code of the language indication code and text code. A character output part outputs a corresponding font with the composed character code and language indication code.

Description

Multilingual file processing device

The present invention relates to multilingual file processing device, relate in particular to the multilingual file processing device of Chinese and Japanese dual-purpose.

From now on, handle when multi-lingual at document handling apparatus newly developed, as character code, (for example: Unicode standard or ISO10646) situation may be how to adopt corresponding multi-lingual code train.But corresponding multi-lingual code train just just forms standard recently.Therefore, document handling apparatus originally, no matter be commercially available Japan word processor and Chinese words processor, the improved goods of still will develop from now on, selling does not adopt the multi-lingual code train of this standard, and adopts the code train that mode is arranged earlier that generally knows that.So, the character code series that has earlier, for example have: with the English of 1 byte performance half-angle, the American Standard Code for Information Interchange series of numeral, with JIS code train, the SHIFT JIS code train of 2 bytes performance japanese type, and the BIG5 code train, the IBM5550 code train that show Chinese text with 2 bytes.

In addition, in the multi-lingual occasion of input, document handling apparatus just must have the equipment of the various language of input in order to read word processing program repeatedly from supplementary storage.In this respect, it has formed according to the category of language that should handle, rapid, that high-level efficiency reads word processing program structure and mode.

For example have the spy to open to deliver in the flat 1-213744 communique as this multi-lingual signal conditioning package that has earlier.Even this device is to be referred to make multi-lingual file, in fact a kind of specific language has occupied the major part that makes file, and other Languages is not only few, and the situation that its kind also is restricted is developed.

Below, this device is described.

Figure 10 is a structural drawing.In this figure, the 100th, the Language Processing information memory cell, this unit is stored in the literal loading routine of various countries' language in the auxiliary storage medium (for example hard disk).The 101st, system language process information administrative unit, the literal loading routine of 2 national languages of the interim storage in this unit.The 102nd, Language Processing information registering unit, this unit is retrieved in Language Processing information memory cell 100, and the literal loading routine of various countries' language is stored in the system language process information administrative unit 101.103 are to use the language storage unit, and identifying information (language indication information) and the last language indication information of registering which country language the literal loading routine of storing in the administrative unit is, promptly implicit language indication information handled in this unit storage representation system language.The 104th, the language indication information registration unit, this unit is to Language Processing information registering unit 102 output language indication informations, in Language Processing information memory cell 100, carry out the retrieval of literal loading routine, the literal loading routine that retrieval is drawn is registered in the system language process information administrative unit 102, meanwhile language indication information is registered in to use in the language storage unit 103.The 105th, various countries' language is mixed carry out the text-editing unit that literal is imported.The 106th, the language indication information input block of input language indication information.

Be input as example with file English and that Japanese mixes below, the workflow that example is arranged earlier is described.

Suppose at first to have carried out English input.Language Processing information registering unit 102 is retrieved in Language Processing information memory cell 100, takes out the English words loading routine, and is stored in the system language process information administrative unit 101.The identifying information that language indication information registration unit 104 will have been imported (owing to be English, institute thinks " E ") is registered in and uses in the language storage unit 103.In this occasion, owing to only registered English, so the implicit language indication information of representing register content at last is English.Like this, Ying Wen file has just become the object that will make.Then, suppose that literal importer (file is grown up) wants to import Japanese.From language indication information input block 106 language indication information that is used for identifying Japanese (owing to be Japanese, institute thinks " J ") is re-entered.

Language indication information registration unit 104 is because the language identification information of having imported is J, so, this language indication information is outputed in the Language Processing information registering unit 102, in indication update system Language Processing information, language indication information J appended to be registered in use in the language storage unit 103.Like this, just in using language storage unit 103, E and these 2 language indication information of J have been stored.Also have, in last registration, also registered (so what import later is Japanese file) as the implicit language indication information of J.

Simultaneously, Language Processing information registering unit 102 is according to the language indication information of having indicated, retrieval and read the japanese type loading routine from Language Processing information memory cell 100, and it is stored in the system language process information administrative unit 101.Consequently, English words loading routine and japanese type loading routine have been stored in the system language process information administrative unit 101.

If, just can reduce the number of times of from the auxiliary storage medium, reading and writing the literal loading routine repeatedly according to the above-mentioned the sort of multi-lingual signal conditioning package that has earlier.Therefore, can improve the efficient of multi-lingual information processing.

But, if the above-mentioned document handling apparatus that has is earlier diverted in the Chinese language processing device, will produce following such problem:

The 1st, though Japanese is the same with Chinese, all be 2 byte series,, employed number of words approximately has only 7000.So, generally have 13 bits just enough, and 3 remaining bits (specifically, being exactly last bit byte and the MSB that descends bit byte etc.) for example, except the expression font is the attributes such as thick word or space, also are used for other purposes.On the other hand, Chinese words is 13053 words, less than 2 times of Japan word.So Japanese and Chinese can not use common character code structure.Yet the application program in the document handling apparatus (as software for editing) is relevant with the structure of character code.Specifically, the Japanese application program will represent that the bit of above-mentioned attribute is used for editor, then can not in Chinese.And this means can not share application in Chinese and Japanese.So, if intactly handle Chinese and Japanese simultaneously, this multilingual file processing device then is impossible (if only be designated as reference, so, why current Japan word processor can handle English, be because the English literal that uses is considerably less, so code system does not repeat yet).

The 2nd, the kind that can't store this language when the category of language of the literal that the device that has earlier just uses during the storage input, file in processing makes, so, when exporting the file of the multi-lingual mixing of describing with Japanese such as Chinese primer, all difficulties have taken place.

The 3rd, the device that has earlier uses different character code structures to Chinese with Japanese, so, when diverting file device, produced various less important problems.For example, when the Japan word processor is used as the Chinese words processor, conversely, when the Chinese words processor is used as the Japan word processor, must make again and relevant application programs such as address register, the program of resembling of making video recording, graphic package, in line printing.

So, need many man-hour (though also depending on content, about 12 man months, 2000～3000 man-hours), resource and funds etc.

In order to solve above-mentioned problem, the inventive features of claim 1 is: storage unit, editing and processing unit, output code synthesis unit, text font storage unit and the literal output control unit of Language Processing storage unit, input language processing unit, input code resolving cell, text code and the language indication code that have the input block that is made of QWERTY keyboard etc., character code structural table, is made of internal storage (ROM) etc.

The QWERTY keyboard input language sign and the pronunciation symbol of input block;

The character code structural table has japanese type has been distributed a code plane, Chinese text distributed at least the character code hurdle of 2 code plane, according to the appointment on code plane and character code hurdle, decision Japanese or Chinese both one of and the literal of this language;

The pronunciation symbol (comprising a plurality of reading symbols) that the storage of the internal storage of Language Processing storage unit will have been imported is transformed to the Chinese text loading routine of this Chinese text (comprise many literal and article, and be not limited to Chinese character) and the pronunciation symbol that will import is transformed to this japanese type japanese type loading routine of (comprising the statement that is mixed with assumed name) (storage, available state in advance);

The input language processing unit is according to the language flag of having been imported by input block, judgement is read in from the Language Processing storage unit is in Chinese text loading routine and the japanese type loading routine which side, read in this side's program, and the pronunciation symbol that will import is transformed into and this language corresponding character code;

The input code resolving cell decomposes (a kind of conversion to the character code that the conversion of input language processing unit produces, comprise according to language the state that character code is constant) for specify in should be specific on the character code structural table the character code hurdle in the text code of literal and the language indication code on appointment codes plane;

Text code and language indication code that the cell stores of text codes and language indication code has been decomposed by the input code resolving cell;

The editing and processing unit insert, during file editor such as deletion, word attribute conversion, each unit relevant with this editor can be handled text code and the language indication code of storing in text code and the language indication code storage unit simultaneously;

The output code synthesis unit is according to text code and the language indication code stored in text code and the language indication code storage unit, and is synthetic this Chinese text code in the input reading symbol or japanese type code;

The text font storage unit is exported then by the character code of the font (letter etc.) that is registered in Chinese text in the character code structural table and japanese type with this language is mapped;

The literal output control unit after the font of the Chinese text of taking-up correspondence or japanese type, outputs on print unit and the display unit from the character script storage unit according to the synthetic result of output code synthesis unit.

The inventive features of claim 2 is: the character code structural table has the character code hurdle of Japanese; The input code resolving cell has the input code Japanese code resolver that the character code that the conversion of input language processing unit is produced is decomposed into language indication code and japanese type code; The storage unit of text code and language indication code has the japanese type code memory of japanese type code as the text code storage.

The inventive features of claim 3 is: storage unit, editing and processing unit, output code synthesis unit, text font storage unit and literal output control unit with input block, character code structural table, Language Processing storage unit, input language processing unit, input code resolving cell, text code and language indication code

Input block input language sign and pronunciation symbol;

The character code structural table has number of words according to this state's language to the multi-lingual character code hurdle that distributes at least one code plane (Code Plane), according to the appointment on code plane and character code hurdle, and the kind and the literal thereof of decision various countries language;

That use and the pronunciation symbol that imported of various countries' language that Language Processing cell stores handle is corresponding with the character code structural table is transformed into the kinds of words loading routine of using with various countries' language of the corresponding literal of this state's language, and (pronunciation symbol is a literal, when becoming English etc., in fact also have unwanted, so in fact this loading routine comprises does not have and the dual-purpose both of these case);

The input language processing unit reads in the literal loading routine that this state's language is used according to the language flag of input block input from the Language Processing storage unit, the pronunciation symbol of having imported is transformed to and the national language corresponding character code of having imported;

The character code that the input code resolving cell produces input language processing unit conversion (as previously described, also comprising the not occasion of conversion of essence) be decomposed into specify in should be specific on the character code structural table the character code hurdle in the text code of literal and the language indication code on appointment codes plane;

The editing and processing unit insert, during file editor such as deletion, word attribute conversion, each unit relevant with this editor can be handled text code and the language indication code of storing in text codes and the language indication code storage unit simultaneously;

The output code synthesis unit is according to text code and the language indication code stored in text code and the language indication code storage unit, synthetic with this language pronunciation symbol corresponding character code in the input language sign;

The text font storage unit is exported then by the font that is registered in the various literal of multilingual in the character code structural table and the character code of this language are mapped;

The literal output control unit takes out and exports the text font of this corresponding state's language according to the synthetic result of output code synthesis unit from the text font storage unit.

According to above-mentioned formation, in the invention of claim 1, plan the language flag and the pronunciation symbol of the file that makes etc. from input block input by the user.The document code structural table distributes a code plane to japanese type in the character code hurdle, Chinese character is distributed 2 code plane at least, and the indication by code plane and character code hurdle just can determine literal.The Language Processing storage unit is stored the Chinese text loading routine and the japanese type loading routine that be transformed into the reading symbol from the input block input this Chinese or japanese type in advance.The input language processing unit is according to the language flag by the input block input, judgement is read in from the Language Processing storage unit is any in Chinese text loading routine and the japanese type loading routine, the reading symbol imported and the literal reading symbol that constitutes statement is transformed to the character code of this language.The character code that the input code resolving cell produces the conversion of input language processing unit is decomposed into the text code of the character code hurdle literal of specifying specific characters such as configuration file and the language indication code on appointment codes plane.Text code and language indication code that the cell stores of text code and language indication code has been decomposed by the input code resolving cell.This moment, also the character code of handlebar kinds of words gathered the conversion situation of storage later on.The editing and processing unit is inserting, during file editor such as deletion, word attribute conversion, can handle text code and the language indication code of storing in text code and the language indication code storage unit simultaneously in each unit relevant with this editor.The output code synthesis unit is according to text code and the language indication code stored in text code and the language indication code storage unit, and is synthetic the code of this Chinese text of pronunciation symbol of input or japanese type.The text font storage unit has character script ROM etc., by being mapped being registered in the Chinese text code in the character code structural table and the font of japanese type code and the character code of this language, is exported then.The literal output control unit takes out from the character script storage unit according to the synthetic result of output code synthesis unit, and the Chinese text of output correspondence or the font of japanese type.

In the invention of claim 2: the character code structural table has the character code hurdle of Japanese.Input code Japanese code resolver in the input code resolving cell is decomposed into language indication code and japanese type code to the character code that the conversion of input language processing unit produces.Japanese type code memory in text code and the language indication code storage unit is stored the japanese type code as text code.

In the invention of claim item 3: from input block input language sign and pronunciation symbol.The character code structural table has number of words according to this state's language to the multi-lingual character code hurdle that distributes at least one code plane (Code Plane), according to the appointment on code plane and character code hurdle, can determine the kind and the literal thereof of various countries' language.That use and the reading symbol that will import of various countries' language that Language Processing cell stores handle is corresponding with the character code structural table is transformed into the kinds of words loading routine of using with various countries' language of the corresponding literal of this state's language.The input language processing unit is according to the language flag of input block input, reads in this (comprising dual-purpose) literal loading routine from the Language Processing storage unit, and the pronunciation symbol of having imported is transformed to this state's language corresponding character code.The character code that the input code resolving cell produces above-mentioned input language processing unit conversion be decomposed into specify in should be specific on the character code structural table the character code hurdle in the text code of literal and the language indication code on appointment codes plane.Text code that the storage unit of text code and language indication code has been decomposed the input code resolving cell and language indication code store after being transformed into literal in independent literal or word and the article.The editing and processing unit insert, during file editor such as deletion, word attribute conversion, each unit relevant with this editor can be handled text code and the language indication code of storing in text code and the language indication code storage unit simultaneously.The output code synthesis unit is this language synthetic with character code identical reading symbol corresponding character in the language flag of text code of storing from text codes and language indication code storage unit and language indication code input.The font storage unit is mapped the font of the multilingual various literal of registering by prior storage and use font generator program and the character code of this language on the character code structural table, just can export.The literal output control unit takes out and exports the text font of this corresponding state's language according to the synthetic result of output code synthesis unit from the text font storage unit.

Fig. 1 is the structural drawing of one embodiment of the invention;

Fig. 2 is the workflow diagram of input language processing unit in the foregoing description;

Fig. 3 is the workflow diagram of input code resolving cell in the foregoing description;

Fig. 4 is the workflow diagram of the foregoing description inediting processing unit;

Fig. 5 is the workflow diagram of output code synthesis unit in the foregoing description;

Fig. 6 is the figure of Chinese text, japanese type code plane content in expression the foregoing description;

Fig. 7 is the memory cell of text code and language indication code in the foregoing description word language indication storage, that imported code;

Fig. 8 is the output result who handles example in the foregoing description;

Fig. 9 is the figure of language flag and language codes in expression the foregoing description;

Figure 10 is the structural drawing of the multilingual file processing device that has earlier;

Figure 11 is the structural drawing of general Japan word processor;

Figure 12 is the language flag of expansion and the concept map of language codes one example.

Embodiment

Before explanation embodiment, at first explain the term usage of this instructions.

(1) there is not the difference of odd number and plural number in principle in the noun etc. of Japanese and Chinese.So, be written as the occasion of " literal ", " character code " in this manual, short of special note " text strings ", " literal ", the occasion of so existing odd number also has the occasion of plural number.Further, " literal " in the articles such as " text code and the language indication code of storage literal ", " specifying the text code and the appointment codes plane of character code hurdle literal " also is the notion that comprises " article ", " file " or " constituting the various literal of article ", " the various literal in the file ".

(2) " pronunciation symbol " is the word phonemic notation etc. that is not limited to Chinese, but comprises that also " phonographys " such as " letter ", " assumed names " reach the notion of " phonemic language " etc., and " literal " is not limited to " phonography that Chinese character is such ", also comprise "." wait mark, " assumed name " and " Chinese idiom " formed by a plurality of Chinese characters etc.

(3) corresponding relation between " pronunciation symbol " and " literal " also is not limited to 1 pair 1 relation as the corresponding assumed name of Chinese character " sea " " ラ body " " かぃ ".

The following describes technology as prerequisite of the present invention.

(1) wants to realize showing, printing the output of Chinese text etc., just need carry out to a certain degree repacking, additional equipment etc. with the Japan word processor.Specifically, according to the font driver, with reference to character code, required font synthetic (processing mode of Taiwan Dyna font) or adopt the method (required storer be said method half) of the array mode of storage radical (radical), according to the font generator program, generate various fonts (Taiwan literary composition ancient cooking vessel font processing mode) etc.

(2) has required formation unit such as performance selects the desired literal of importer from the pronunciation symbol corresponding character function or learning functionality etc.

But these are all irrelevant with aim of the present invention, so, omit its explanation.

Below, according to embodiment the present invention is described.

Figure 11 is the structural drawing of general Japan word processor, by this word processor being carried out necessary minimal transformation and optional equipment, just can bring into play the action effect relevant with the present invention.In this figure, the 111st, the input block of input pronunciation symbol or encoding function symbol.The 112nd, the Language Processing storage unit of storage japanese type loading routine and English words loading routine.The 113rd, with reference to the code kind imported, by the Language Processing memory cell selecting behind the loading routine, pronunciation symbol is transformed to the input language processing unit of japanese type code.The 115th, carry out file editor's editing and processing unit.The 116th, store the above character code storage unit of word code and slider position of file among the current Production Editor.The 117th, take out corresponding font according to character code after, with the font that taken out literal output control unit to output unit 118 outputs.The 118th, the output unit of output character has printer, CRT etc.

Below with above Japan word processor as can handle Japanese and Chinese multi-lingual word processor, be example with its function of being brought into play, basic thought of the present invention is described.

In order to make the character code structure unanimity of various language in the word processor, below, the structure of expression Chinese text code in the IBM5550 code is described.

As shown in Figure 6, on the superincumbent basis character code of Chinese is divided into 2 code plane (CP1 hurdle).Here why being divided into 2 code plane is because as foregoing, it approximately is the literal of 2 times of Japaneses that Chinese has used.In this figure, transverse axis (b2) provides is following bit byte in the character code, and what the longitudinal axis (b1) provided is the last bit byte of character code.Moreover, the character code of the JIS code that the numeric representation in the table is corresponding with Japanese SHIFT JIS.Also have, CP1 and CP2 represent last bit byte appointment in plane 1 (CP1 hurdle) and plane 2 (CP2 hurdle) of Chinese text code.

That is to say, when the JIS code with Japanese adopts as character code, when character code 2121 that occurs showing and language codes 100 (illustrating), represent that this literal is equivalent to 2121 codes in the Japanese JIS code train by Fig. 9 with 16 systems.But, when language codes is 010, represent this literal be equivalent to Chinese text code plane 2 (CP2 hurdle) go up D040 (D represent with 16 systems represent 13) literal of code.

Equally, when language codes is 001, represent that this literal is 8140 codes on the Chinese text code plane 1 (CP1 hurdle).

Find out that by above according to the edit routine of Japanese word processor, this character code is Japanese JIS code itself fully.But, by using code plane, although character code is the JIS code of Japanese,, can specific Chinese text, and then the reading symbol of having imported can be transformed to Chinese text etc.So,, also can use even the file editor of Japanese does not make an amendment.Perhaps, also be easy to carry out the establishment, conversion etc. of new procedures.

Below, Fig. 1 illustrates structure relevant with the present invention, that can handle the multilingual file processing device embodiment of Japanese and Chinese.In this figure, the 11st, input block.The 12nd, the input language processing unit.The 13rd, the input code resolving cell.The 14th, the editing and processing unit.The 15th, the output code synthesis unit.The 16th, the storage unit of text code and language indication code.The 17th, output unit.The 18th, the literal output control unit.The 19th, the Language Processing storage unit.

The following describes effect, structure of each unit etc.

Input block 11 has keyboard etc.Three kinds of literal loading routines such as Language Processing storage unit 19 storage japanese types, English words, Chinese text.Input language processing unit 12 is with reference to the language flag by input block 11 inputs, read in the corresponding character loading routine from Language Processing storage unit 19 after, the reading symbol of having imported is transformed to character code.Input code resolving cell 13 is decomposed into language indication code and text code according to the relation between Chinese text code plane and japanese type code plane with character code.The storage unit 16 of editing and processing unit 14 cross reference file codes and language indication code is carried out editing and processing.The slider position of the language indication code of the text code of master file, correspondence and appointment input Chinese words position during storage unit 16 storages of text code and language indication code are made.Output code synthesis unit 15 is according to the relation of Chinese text code plane and japanese type code plane, and is with reference to language indication code, synthetic the character code from the spoken and written languages of text code and language indication code input.Literal output control unit 18 utilizes output code to take out corresponding font with reference to language indication code, to output unit 17 outputs.Output unit 17 has monitor and printer etc., exports its font.

Fig. 2 illustrates the workflow of input language processing unit 12 in the present embodiment.

Below with reference to this figure its work is described.

(S21) by input block 11 input language sign and pronunciation symbols.

(S22) judge whether language flag is the Chinese character language.If the Chinese character language just proceeds to (S23).If not the Chinese character language, just proceed to (S25).

(S23), from Language Processing storage unit 19, read in the corresponding character loading routine with reference to language flag.Above-mentioned literal loading routine is Japanese ideogram conversion program, Chinese character conversion program etc. for example.

(S24) pronunciation symbol that will import enters (S25) after being transformed to literal.

(S25) behind output language sign and the literal, end process.Also have, in above-mentioned (S22),, just directly enter (S25) if not the Chinese character language, after output language sign and the text strings, end process.

Fig. 3 illustrates the workflow of input code resolving cell 13 in the present embodiment.

Below with reference to this figure its work is described.

(S31) by after input language processing unit 12 input language signs and the literal, proceed to (S32).

(S32) with reference to language flag, select corresponding decomposing program, enter (S33) then.

(S33) judge whether language flag is the editting function symbol.If the editting function symbol just finishes its processing.If not the editting function symbol, just proceed to (S34).

(S34) character code of a literal of taking-up from the character code of having imported.

(S35) character code with above-mentioned taking-up is decomposed into language indication code and text codes.

(S36) judgement has or not also untreated character code.If have, just turn back to above-mentioned (S34).If no, just proceed to (S37).

(S37) behind output text code and the language indication code, end process.

Fig. 4 illustrates the workflow of present embodiment inediting processing unit 14.

Below with reference to this figure its work is described.

(S41) by input code resolving cell 13 input language signs, text code, classical Chinese indication code.

(S42) from the storage unit 16 of text code and language indication code, read in the document code of file among the editor, the language indication code corresponding, the slider position that expression becomes the text point of current edit object with it.

(S43) judge whether language flag is the editting function symbol.If the editting function symbol just proceeds to (S45).If not the editting function symbol, just proceed to (S44).

(S44) with reference to slider position, the text code that will import and language indication code are inserted in the text code and the language indication code corresponding with it among the editor respectively, proceed to (S46) then.

(S45) according to the indication of functional symbol,, the file among the editor is carried out editing and processing with reference to editting function and slider position.

(S46) after the storage unit 16 that text code in the file in will editing and the language indication code corresponding with it are stored in text code and language indication code, enter (S47).

(S47) text code that will export and language indication code thereof output to output unit.

Fig. 5 is the workflow of output code synthesis unit 15 in the present embodiment.

The following describes the work shown in this figure.

(S51) by editing and processing unit 14 input text code and the language indication codes corresponding with it.

(S52) text code and the language indication code corresponding in literal of taking-up with it.

(S53), select the character code synthesis program with reference to language codes.

(S54), the text code in the literal is transformed into character code according to above-mentioned synthesis program.Replace the text code in the former literal.

(S55) judgement has or not also untreated text code.If also have, just turn back to above-mentioned (S52); If no, just proceed to (S57).

(S56) behind output character code and the language indication code corresponding with it, end process.

Present embodiment multilingual file processing device about constituting as above the following describes its work.

At first, each unit of this device all is set at original state.

By input block 11 with " da4Iu4cceng1 " ruan3ti3 " uei2 " pronunciation symbol and language flag LC=1 (like that, this refers to shown in Fig. 9 (b), the literal of importer's intention be the traditional Chinese word 1) be input on the language processing unit 12.

According to the value " 1 " of above-mentioned LC, from Language Processing storage unit 19, read in the Chinese character conversion program of traditional Chinese word.

Above-mentioned Chinese character conversion program is transformed to above-mentioned reading symbol the " character code of Da Lu Said " Soft Body " As ".

In input code resolving cell 13, with reference to above-mentioned language flag LC and character code, character code is decomposed into language indication code LPS={001,001,001,001,001,001,001,001} and text code TCS={374b, 5674 ....

To deliver to editing and processing unit 14 by the text code TCS and the language indication code LPS of above-mentioned input code resolving cell 13 outputs.In editing and processing unit 14, owing to do not store any content in the storage unit 16 of the text code of this moment and language indication code, so directly the value with above-mentioned LPS and TCS is stored in text code and the language indication code storage unit 16.At this moment, slider position is in the end of writing of this paper.So,, export above-mentioned LPS and TCS for transformation results is outputed to monitor.

In output code synthesis unit 15, with reference to language indication code and text code,, select suitable synthesis program, by the synthetic character code of text code according to language indication code by 14 outputs of editing and processing unit.

In literal output control unit 18, with reference to character code and the language codes corresponding, take out appropriate font with it, output to output unit 17.By output unit 17 font of taking out is shown on monitor.

Then, the language flag " 2 " of expression simplified Chinese character word is outputed on the input language processing unit 12 by input block 100.At this moment, it is 2 that the value of language flag LC is updated to, and the content of pronunciation symbol also becomes " " ruan3jian4 " ".

At this moment, in input language processing unit 12,, from Language Processing storage unit 19, read in the Chinese character conversion program of simplified Chinese character word by the reference language flag.

The pronunciation symbol that above-mentioned Chinese character conversion program will have been imported is transformed to the character code string of " software ".

In input code resolving cell 13,, character code is decomposed into language indication code LPS={011,011 at the reference language flag, 011,011} and text codes TCS={6624,706f, 5880, after the 6624}, above-mentioned LPS and TCS are delivered to editing and processing unit 14.

In editing and processing unit 14, because the language flag of having imported is not the editting function symbol, so, will be with reference to the text code in the storer 16 of text code and language indication code, language indication code and slider position store above-mentioned text code, language indication code in the storer 16 of text code and language indication code.At this moment, the language indication code in the storer 16 of text code and language indication code is " 001,001,001,001,001,001; 001,001,001,001,001,001 " (12 of beginning shown in Fig. 7), and text code is 374b, 5674 ..., 6624,706f, 5880,6624.

Then, in output code synthesis unit 15, by the synthetic character code of text code, language indication code of editing and processing unit 14 outputs.

In literal output control unit 18, according to above-mentioned character code and the language codes corresponding with it, take out the simplified Chinese character complex form of Chinese characters respectively after, output to output unit 17.At this moment, literal demonstration " Da Lu Said " Soft Body " As " software " on the monitor) ".If with the file of Japanese and English (half-angle) input equivalent, go up demonstration, just become shown in Figure 8 at output unit 17 (monitor).The text code of this moment and the content of the language indication code in the language indication code storage unit are shown in Fig. 7.

In this figure, illustrate 30 and 1 constitute add up to 41 language indication codes.And, initial 8 " 001 " expressions " greatly ", " " " etc. traditional Chinese word (among Fig. 9 " complex form of Chinese characters 1).4 " 011 " expression simplified Chinese character words such as " parts " of back.5 " 00 " expressions ", " of back, traditional Chinese words such as " days ".8 " 100 " expressions " " of back ", japanese types such as " ソ ".5 " 001 " expressions ", " of back, traditional Chinese words such as " English ".10 " 000 " expressions " " of back ", English words (ASCII) such as " S ".Last one " 001 " expression traditional Chinese word ".」

By above explanation as can be known, the Japan word processor can be used as the Chinese words processor, and then also can be used as the multilingual file processing device use.

Abovely the present invention has been described according to embodiment, still self-evident, the invention is not restricted to the foregoing description.That is to say that the present invention also comprises following content.

(1) bit number of language codes can suitably increase and decrease according to the language number of process object.

(2) about the appointment of code plane, can be according to the category of language change of process object.

(3) because language indication code is the data that constitute by 0 and 1, so, can other conclusion, still, it is first-class to store disk on the basis that keeps corresponding relation between literal, at this moment, for example adopts the technology with the compression of Huffman methods such as (Hoffman).

(4) synthetic about the decomposition of input code and output code, reference table not, and be to use computing formula to carry out.

(5) as shown in figure 12, expand language codes and language flag a little, just can handle Chinese, Japanese, English, Korea's literary composition, Arabic, Europe simultaneously is language (language such as English, moral, Russia are few because of the literal number of words, can be placed on fully in the japanese type code hurdle).

Perhaps, make code plane 1,2,3, also divert and use English, the German of German text, Russian corresponding to English, German, Russian.

Also have, in the occasion of English digital, because pronunciation symbol is exactly the literal that this language forms, so the literal loading routine is actually unwanted.

(6) owing to reasons such as manufacturings, can a requisite inscape of the present invention (essential condition, part) physically, mechanically as a plurality of parts etc., conversely, a plurality of inscapes can be carried out integratedly, perhaps suitably they be combined.

(7) transform existing word processor etc., perhaps new procedures is included, as multilingual file processing device with function of the present invention.

(8) output can output on medium and the PERCOM peripheral communication circuit.Equally, input also can be finished by medium and PERCOM peripheral communication circuit.

(9) the Japanese file is main, and for the ease of those also making of not such ordinary use Chinese file, short of special instructions are just pressed Japanese and handled.

(10) other distortion of carrying out in the scope that does not change aim of the present invention have, and replace also untapped in an embodiment " implicit language indication information " etc. with language flag and pronunciation.

(invention effect)

As described above, the multilingual file processing device relevant with the present invention is complete Removed the problem points that has earlier. Specifically, obtained following effect.

(1) both be the document handling apparatus that does not use multi-lingual corresponding code train, also easy Use as multilingual file processing device. For example, although what do the information in the application program Some modifications still, can major part intactly utilize its code structure and use journey Order, so, can be with the Japan word processor as the word processor of processing Japanese and Chinese both sides.

(2) do not need again to make the related edit program. So, the file of any language Treating apparatus can be rapidly, expeditiously as multilingual file processing device. For example, exist Above-mentioned Japanese processing machine is also processed the occasion of Chinese, owing to decomposing program, synthesis program etc. can Divert, reduce about 1/3 required man-hour.

(3) by using the language codes plane, the code structure of various language can be identical. The institute With, be convenient in the future system extension, the software development when improving.

Because above effect, practical function of the present invention is very big.

Claims

1. the document handling apparatus of Chinese and Japanese dual-purpose is characterized in that: storage unit, editing and processing unit, output code synthesis unit, text font storage unit and literal output control unit with input block, character code structural table, Language Processing storage unit, input language processing unit, input code resolving cell, text code and language indication code;

Input block input language sign and pronunciation symbol;

The character code structural table has japanese type has been distributed a code plane (CodePlane), Chinese text distributed at least the character code hurdle of 2 code plane, according to the appointment on code plane and character code hurdle, decision Japanese or Chinese both one of and the literal of language;

The pronunciation symbol that the Language Processing cell stores will have been imported is transformed to the Chinese text loading routine of this Chinese text and the pronunciation symbol that will import is transformed to the japanese type loading routine of this japanese type;

The input language processing unit is according to the language flag of having been imported by described input block, in Chinese text loading routine and the japanese type loading routine which side according to what read in the described Language Processing storage unit is, read in this side's program, and the pronunciation symbol that will import is transformed into and this language corresponding character code;

The character code that the input code resolving cell produces described input language processing unit conversion be decomposed into specify in should be specific on the described character code structural table the character code hurdle in the text code of literal and the language indication code on appointment codes plane;

Text code and language indication code that the cell stores of text code and language indication code has been decomposed by described input code resolving cell;

The editing and processing unit insert, during file editor such as deletion, word attribute conversion, each unit relevant with this editor can be handled text code and the language indication code of storing in described text code and the language indication code storage unit simultaneously;

The output code synthesis unit is according to text code and the language indication code stored in described text code and the language indication code storage unit, and is synthetic this Chinese text code in the input pronunciation symbol or japanese type code;

The text font storage unit is exported then by being mapped being registered in the Chinese text in the described character code structural table and the font of japanese type and the character code of this language;

The literal output control unit takes out from described text font storage unit and the Chinese text of output correspondence or the font of japanese type according to the synthetic result of described output code synthesis unit.

2. by the document handling apparatus of described Chinese of claim 1 and Japanese dual-purpose, it is characterized in that: described character code structural table has japanese type code hurdle;

Described input code resolving cell has the input code Japanese code resolver that the character code that the conversion of input language processing unit is produced is decomposed into language indication code and japanese type code;

The storage unit of described text code and language indication code has the japanese type code memory of japanese type code as the text code storage.

3. multinational document handling apparatus, it is characterized in that having storage unit, editing and processing unit, output code synthesis unit, text font storage unit and the literal output control unit of input block, character code structural table, Language Processing storage unit, input language processing unit, input code resolving cell, text code and language indication code;

Input block input language sign and pronunciation symbol;

The character code structural table has number of words according to this state's language for multi-lingual, the character code hurdle that has distributed at least one code plane (Code Plane), according to the appointment between code plane and character code hurdle, the kind and the literal thereof of decision various countries language;

That use and the pronunciation symbol that imported of various countries' language that Language Processing cell stores handle is corresponding with described character code structural table is transformed into the kinds of words loading routine of using with various countries' language of the corresponding literal of this state's language;

The input language processing unit reads in the literal loading routine that this state's language is used according to the language flag of described input block input from described Language Processing storage unit, the pronunciation symbol of having imported is transformed to this state's language corresponding character code;

The character code that the input code resolving cell produces the conversion of described input language processing unit be decomposed into specify in should be specific on the described character code structural table the character code hurdle in the text code of literal and the language indication code on appointment codes plane;

Text code and language indication code that the cell stores of text code and language indication code has been decomposed by the input code resolving cell;

The output code synthesis unit is according to text code and the language indication code stored in described text code and the language indication code storage unit, with input language sign in the character code of the pronunciation symbol corresponding character identical with this language synthetic;

The font storage unit is exported then by the font that is registered in the various literal of multilingual in the described character code structural table and the character code of this language are mapped;

The literal output control unit takes out and exports the text font of this corresponding state's language according to the synthetic result of described output code synthesis unit from described text font storage unit.