JPH03127264A - Mechanical translation system for sentence including parentheses - Google Patents

Mechanical translation system for sentence including parentheses

Info

Publication number
JPH03127264A
JPH03127264A JP1266557A JP26655789A JPH03127264A JP H03127264 A JPH03127264 A JP H03127264A JP 1266557 A JP1266557 A JP 1266557A JP 26655789 A JP26655789 A JP 26655789A JP H03127264 A JPH03127264 A JP H03127264A
Authority
JP
Japan
Prior art keywords
parentheses
internal structure
sentence
analysis
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1266557A
Other languages
Japanese (ja)
Inventor
Yasuhiro Takayama
泰博 高山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP1266557A priority Critical patent/JPH03127264A/en
Publication of JPH03127264A publication Critical patent/JPH03127264A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE:To obtain a proper translation to an input original sentence including the parentheses with the natural extension of a conventional mechanical translation system by holding the information on the parentheses in an internal structure and scattering the processes of the parentheses to the stages of a translating process. CONSTITUTION:A mechanical translating device consists of an input device 20, an input control part 21, an editing control part 22, a display control part 23, a display device 24, a translation system 25, a text storage part 26, a print control part 27, a printer 28, and a dictionary system 29. A secondary process is provided at each stage of a translating process in response to the process for parentheses. Thus, the information on the parentheses are held as the attributes contained in an internal structure. Thus, it is possible to accurately analyze the roles of parentheses against a whole sentence. In such a constitution, natural translation is obtained to an original sentence including parentheses with the natural extension of a conventional mechanical translation system.

Description

【発明の詳細な説明】 [産業上の利用分野] この発明は、従来の機械翻訳方式を拡張して、括弧付き
の文を含む原文量に対して自然な訳文を得ることができ
るようにした機械翻訳方式に関するものである。
[Detailed Description of the Invention] [Field of Industrial Application] This invention expands the conventional machine translation method to make it possible to obtain natural translated sentences for a large amount of original text including parenthesized sentences. It is related to machine translation methods.

[従来の技術] 計算機を利用して、ある言語を他の言語に翻訳する種々
の機械翻訳装置が開発されてきており、これらの機械翻
訳装置においては、通常の文法では取り扱うことが困難
な文法外の言語現象を処理する様々な対策がなされれて
いる。
[Prior Art] Various machine translation devices have been developed that use computers to translate one language into another. Various measures have been taken to deal with external linguistic phenomena.

例えば、第4図は特開昭63−136262号に示され
ている従来の機械翻訳装置における括弧記号の解析処理
と生成処理の図であり、第5図は特開昭63−1362
68号に示されている従来の機械翻訳装置における引用
記号の解析処理と生成処理の図である。また、第6図は
特開昭63−113668号における引用符号類に対す
る処理および内部構造の図である。
For example, FIG. 4 is a diagram showing the parsing process and generation process for parentheses in a conventional machine translation device shown in Japanese Patent Application Laid-Open No. 63-136262, and FIG.
68 is a diagram of the citation symbol analysis process and generation process in the conventional machine translation device shown in No. 68. Moreover, FIG. 6 is a diagram of the processing and internal structure for reference symbols in Japanese Patent Application Laid-Open No. 113668/1983.

第4図(a)においては、辞書引き形態素解析部の初め
に、文字ポインタを先頭の単語に設定(40)L、括弧
を単語として扱い、単語が左括弧であるかどうかを判定
(41)L、もし単語が左括弧であった場合には、右括
弧までの単語数を数え、1つ前の単語数に加え(42)
、括弧でなければ辞書引き処理(43)を行いながら、
処理(41)から処理(43)を繰り返す。
In FIG. 4(a), at the beginning of the dictionary lookup morphological analysis section, the character pointer is set to the first word (40)L, the parentheses are treated as words, and it is determined whether the word is a left parenthesis (41). L, if the word is a left parenthesis, count the number of words up to the right parenthesis and add it to the number of the previous word (42)
, if there are no parentheses, while performing dictionary lookup processing (43),
Processing (41) to processing (43) are repeated.

第4図(b)においては、生成部で1つの単語について
生成処理(45)を行い、訳語をバッファにストア(4
6)L、この時、単語が左括弧であ、るかどうかを判定
(47)L、単語が左括弧でなければ、通常の生成処理
(45)を続け、単語が左括弧の場合には、右括弧まで
の単語を訳語バッファにストア(48)する。以上の処
理を生成する単語が終了(49)するまで繰り返す。
In FIG. 4(b), the generation unit performs generation processing (45) for one word, and stores the translated word in a buffer (45).
6) L, at this time, determine whether the word is a left parenthesis (47) L, if the word is not a left parenthesis, continue the normal generation process (45), and if the word is a left parenthesis, , stores the words up to the right parenthesis in the translation word buffer (48). The above process is repeated until the word to be generated is completed (49).

第5図において、辞書引き形態素解析部の前に、文字ポ
インタを0に設定(50)L、開きの引用記号があるか
判定(52)L、もし開きの引用記号があった場合には
、その単語位置を記憶(53)し、閉じ引用記号をサー
チ(54)L、閉じの引用記号があるかどうか判定(5
5)L、閉じの弓用記号があれば、その単語位置を記憶
(5,6)L、処理(51)に戻る。判定(52)で開
きの引用記号がなければ開きと閉じの引用記号単語位置
に従って引用記号を消去(57)する。判定(55)で
閉じの引用記号がなければ、開きの単語位置をクリア(
58)する。処理(57)または処理(58)が終わる
と、辞書引き処理(59)、形態素解析(60)、変換
処理(62)を引用記号を含まない文の場合と同様に適
用する。生成部においては、生成する句の単語位置と単
語位置記憶(53)、(56)で記憶した単語位置と一
致するかどうかを判定(63)L、一致した場合には、
フラグをONに設定(64)L、開きの引用記号を生+
1i!(65)する。開き引用記号生成処理(65)が
終了するか、判定(63)で単語位置が−致しなければ
句の生成(66)を行って、フラグがONか否かを判定
(67)L、フラグがONならば、閉じの引用記号を生
成(68)する。閉じ引用記号生成処理(68)が終了
するか、処理(68)でフラグがONでなければ、全て
の句を生成したかを判定(69)L、全ての句を生成し
た場合には、処理を終了する。判定(69)で、まだ全
ての句を生成していなかった場合には、フラグをOFF
に設定(70)して判定(63)から繰り返す。
In FIG. 5, before the dictionary lookup morphological analysis section, the character pointer is set to 0 (50) L, it is determined whether there is an open quotation mark (52) L, and if there is an open quotation mark, Memorize the word position (53), search for the closing quotation mark (54)L, and determine whether there is a closing quotation mark (5
5) L, if there is a closed bow symbol, remember the word position (5, 6) L, return to process (51). If it is determined (52) that there is no opening quotation symbol, the quotation symbol is deleted according to the opening and closing quotation symbol word positions (57). If there is no closing quotation mark in judgment (55), clear the opening word position (
58) Do. When processing (57) or processing (58) is completed, dictionary lookup processing (59), morphological analysis (60), and conversion processing (62) are applied in the same way as for sentences that do not include quotation marks. In the generation unit, it is determined whether the word position of the phrase to be generated matches the word position stored in word position storage (53) and (56) (63)L, and if they match,
Set the flag to ON (64) L, open quotation symbol +
1i! (65) Do. If the open quotation mark generation process (65) is completed or the word position does not match in the judgment (63), a phrase is generated (66), and it is judged whether the flag is ON or not (67). If ON, a closing quotation symbol is generated (68). If the closing quotation symbol generation process (68) is completed or the flag is not ON in the process (68), determine whether all phrases have been generated (69)L. If all phrases have been generated, the process end. In judgment (69), if all phrases have not been generated yet, turn off the flag.
(70) and repeat from judgment (63).

第6図(a)は、英日翻訳における引用符号類に対する
処理過程の図である。英文が入力されると、入力部(7
1)、前編集部(72)、形態素解析部(73)、引用
符号類処理部(74)、構文解析I部(75)、構文解
析■部(76)、構造変換部(77)、訳文生成部(7
8)、後編集部(79)、出力部(80)を経て、日本
文を出力する。制御部(83)は、解析ルール(82)
に基づいて、単語辞書(81)を検索しながら処理の流
れを制御し、操作表示部(87)は操作者から本装置に
与える様々な指示および本装置から操作者に対する様々
な指示を可視表示する。第6図(b)は、第6図(a)
の処理の際に、括弧類の情報を保持するための内部構造
の図であり、文の情報(84)、ブロックの情報(85
)、単語の情報(86)というように解析の際に得た種
々の情報を別々に保持する。この従来の方式では、形態
素解析部(73)と構文解析1部(75)との間に引用
符号類処理部(74)を設け、引用符号類をブロックの
情報(85)の中に位置情報として保持することにより
、括弧類の翻訳を可能にしようとしている。
FIG. 6(a) is a diagram of the processing process for quotation marks in English-Japanese translation. When English text is input, the input section (7
1), pre-editing section (72), morphological analysis section (73), quotation marks processing section (74), syntax analysis section I section (75), syntax analysis section (76), structure conversion section (77), translation section Generation part (7
8), a post-editing section (79), and an output section (80) to output Japanese text. The control unit (83) analyzes the analysis rules (82)
The operation display unit (87) visually displays various instructions given to the device by the operator and various instructions from the device to the operator. do. Figure 6(b) is the same as Figure 6(a).
This is a diagram of the internal structure for holding bracket information during processing, including sentence information (84) and block information (85).
), word information (86), and other various information obtained during analysis are separately held. In this conventional method, a quotation mark processing unit (74) is provided between the morphological analysis unit (73) and the first syntactic analysis unit (75), and the quotation marks are stored in the block information (85) as position information. By keeping it as , we are trying to make it possible to translate the brackets.

この様に、従来の方式では、形態素解析の前、辞書引き
部の内部、または形態素解析と構文解析との間のいずれ
かに、括弧類に対する特別な一処理過程を設け、括弧類
の情報を単語また位置情報として保持することによって
、括弧類を含む文の翻訳を可能にしようとしていた。
In this way, in conventional methods, a special processing process for brackets is provided either before morphological analysis, inside the dictionary lookup section, or between morphological analysis and syntactic analysis, and information on brackets is stored. The idea was to enable translation of sentences containing parentheses by storing them as word or position information.

[発明が解決しようとする課題] この様な従来の機械翻訳装置における括弧類に対する処
理手段では、処理過程の内部で括弧類を扱う部分がごく
一部の特別な処理に限られ、また、括弧類の情報を独立
した単語として取り扱うか、または、文を解析して得た
他の情報とは別に保持していたため、処理の柔軟性がな
く、翻訳方式の構成を複雑にしているので、文中におけ
る複雑な括弧類の使用法に対処するには十分でなく、翻
訳方式を拡張するためには、再び特殊な対処を必要とす
ることから機械翻訳装置の設計者および保守者にとって
は極めて煩わしいことであるという問題点があった。
[Problems to be Solved by the Invention] In the processing means for parentheses in such conventional machine translation devices, the part that handles parentheses within the processing process is limited to a few special processes, and Because similar information is treated as an independent word or stored separately from other information obtained by analyzing sentences, processing is inflexible and the structure of the translation method is complicated. It is not sufficient to deal with the complex usage of parentheses in There was a problem that.

この発明は上記のような問題点を解決するためになされ
たもので、括弧類の情報を内部構造に保持して、括弧類
に対する処理を翻訳過程の各段階に分散して持つことに
よって、従来の機械翻訳方式の自然な拡張で、括弧類を
含む入力原文章に対しても適切な訳文を得ることができ
ると共に、将来の翻訳方式の拡張に対して処理の柔軟性
および処理過程全体の等偏性を維持した機械翻訳方式を
得ることを目的とするものである。
This invention was made in order to solve the above-mentioned problems, and by retaining bracket information in an internal structure and distributing the processing for brackets at each stage of the translation process, This is a natural extension of the machine translation method, and it is possible to obtain an appropriate translation even for the input source text that contains parentheses, and it also improves processing flexibility and the overall processing process for future expansion of the translation method. The purpose is to obtain a machine translation method that maintains bias.

[課題を解決するための手段] この発明に係わる機械翻訳方式は、入力された原文章か
ら1つの文を切り出す手段と、その1つの文の中に括弧
が含まれている場合に左括弧と右括弧との対を検出し、
その使用目的を判定する手段と、文中の括弧以外の文字
に対して文字の種類に関する情報を付与する手段と、括
弧の内側の文字を独立して解析し内部構造に置き換え、
括弧の外側に位置する他の未解析文字の並びと混在させ
て保持する手段と、括弧の外側に位置する文字の並びを
、文の構成要素ごとに解析して内部構造に置き換える手
段と、文の構成要素ごとの内部構造の間の関係を解析す
る構文解析手段と、その副処理過程として前記括弧の内
側の文字に対する内部構造と括弧の外側に位置する他の
内部構造の並びとの関係を解析する手段と、解析された
原文の内部構造に対して訳文の内部構造に変換する変換
処理と、その副処理として括弧の内側の文字に対する内
部構造を変換する手段と、訳文の内部構造から訳文を生
成する手段と、その副処理として変換された括弧の内側
の文字に対する内部構造から括弧付きで訳文を生成する
手段とを備えたものである。
[Means for Solving the Problem] The machine translation method according to the present invention includes a means for cutting out one sentence from an input original sentence, and a means for cutting out one sentence from an input original sentence, and a means for cutting out a left parenthesis when a parenthesis is included in the one sentence. Detect pair with right parenthesis,
a means for determining the purpose of use; a means for providing information regarding the type of characters to characters other than the parentheses in a sentence; a means for independently analyzing the characters inside the parentheses and replacing them with an internal structure;
A means for storing the character sequences located outside the parentheses in a mixed manner with other unparsed character sequences, a means for parsing the character sequences located outside the parentheses for each component of the sentence and replacing it with an internal structure, A syntactic analysis means for analyzing the relationship between the internal structures of each constituent element, and as a subprocess thereof, the relationship between the internal structure for the characters inside the parentheses and the arrangement of other internal structures located outside the parentheses. A means for analyzing, a conversion process for converting the internal structure of the analyzed original text into the internal structure of the target text, a means for converting the internal structure for characters inside the parentheses as a sub-process, and a means for converting the internal structure of the target text into the target text. and, as a subprocess thereof, a means for generating a translated sentence with parentheses from the internal structure of the converted characters inside the parentheses.

[作用コ この発明における機械翻訳方式は括弧類を処理するため
の複数の処理過程を備え、内部構造の中に括弧類に関す
る情報を保持することにより、入力態文章中の括弧類を
含む文を翻訳可能にし、かつ、翻訳処理全体の拡張可能
性を維持することが可能になる。
[Operations] The machine translation method in this invention has multiple processing steps for processing brackets, and by retaining information regarding brackets in the internal structure, it can translate sentences containing brackets in input text. It becomes possible to make translation possible and maintain expandability of the entire translation process.

[実施例コ 以下、この発明の一実施例を図について説明する。第1
図は、この発明に係わる機械翻訳方式の処理過程の一例
である。第1図において、(1)は入力された原文章か
ら1つの文を切り出す手段、(2)はこの切り出し手段
(1)で切り出された1つの文の中の各文字に対して文
字の種類に関する情報を付与すると同時に、括弧が含ま
れている場合に左括弧と右括弧との対を検出し、括弧の
使用目的を判定し、括弧の使用目的に応じた情報を付与
する手段、(3)は本格的な解析に先立ち、記号列など
の文法外の処理を行う手段、(4)は括弧の内側の文字
を解析し内部構造に置き換え、括弧の外側に位置する他
の未解析の文字の並びと混在させて保持する手段、(5
)は括弧の外側に位置する文字の並びを文の構成要素ご
とに解析して内部構造に置き換える手段、(6)は括弧
白文字解析手段(4)で解析して得た括弧類の内側の内
部構造と括弧類の外側にあった単語の内部構造との関係
を解析する括弧機能解析第1手段、(7)は構文解析に
先立ち、並列範囲の同定や構文要素の局所的な解析を行
う手段、(8)は括弧の内側の内部構造と他の構文要素
との関係付けを行う括弧機能解析第1手段、(9)は文
の構成要素ごとの内部構造の間の関係を解析する構文解
析手段、(10)は構文解析手段(9)の副処理過程と
して前記括弧の内側の文字に対する内部構造と括弧の外
側に位置する他の内部構造の並びとの関係を解析する括
弧機能解析第■手段、(11)は構文解析した内部構造
を補正する解析後処理手段、(12)は解析後処理手段
(11)までの処理で得た内部構造をさらに中間的な内
部構造に置き換える変換前処理手段、(■3)は解析さ
れた原文の内部構造に対して訳文の内部構造に変換する
構造変換処理手段、(14)は括弧の内側の文字に対す
る内部構造を独立して変換する構造変換処理手段(13
)の副手段である括弧内変換手段、(15)は変換され
た訳文の内部構造を訳文の言語らしい構造に変換する生
成前処理手段、(16)は訳文の内部構造から訳文の構
文を生成する構文生成処理手段、(17)は構文生成処
理手段(16)において変換された括弧内の内側の文字
に対する内部構造から括弧付きで訳文を生成する括弧内
生酸処理手段、(18)は訳文中の各単語に関する形態
的な処理を行う形態素生成処理手段、(19)は訳文の
最終的な補正を行う生成後処理手段である。
[Example 1] An example of the present invention will be described below with reference to the drawings. 1st
The figure shows an example of the processing process of the machine translation method according to the present invention. In Figure 1, (1) is a means for cutting out one sentence from the input original text, and (2) is the type of character for each character in one sentence cut out by this cutting means (1). At the same time, when parentheses are included, a means for detecting a pair of a left parenthesis and a right parenthesis, determining the purpose of use of the parentheses, and adding information corresponding to the purpose of use of the parentheses, (3) ) is a means to perform ungrammatical processing such as symbol strings before full-scale analysis, and (4) is a means to analyze the characters inside the parentheses and replace them with the internal structure, and other unparsed characters located outside the parentheses. (5)
) is a means to analyze the sequence of characters located outside the parentheses for each component of the sentence and replace it with an internal structure, and (6) is a method for analyzing the character sequence located outside the parentheses by using the parenthesis white character analysis means (4). The first method of parenthesis function analysis, which analyzes the relationship between the internal structure and the internal structure of words outside the parentheses, involves identifying parallel ranges and local analysis of syntactic elements prior to syntactic analysis. means, (8) is the first means of parenthesis function analysis that correlates the internal structure inside the parentheses with other syntactic elements, and (9) is a syntax that analyzes the relationships between the internal structures of each constituent element of the sentence. The analysis means (10) is a sub-process of the syntax analysis means (9), which is a parenthesis function analysis step that analyzes the relationship between the internal structure of the characters inside the parentheses and the arrangement of other internal structures located outside the parentheses. ■ Means, (11) is a post-analysis processing means that corrects the parsed internal structure, and (12) is a post-analysis processing means before conversion that further replaces the internal structure obtained through processing up to (11) with an intermediate internal structure. Processing means, (■3) is a structure conversion processing means that converts the internal structure of the analyzed original text into the internal structure of the translated text, and (14) is a structure conversion that independently converts the internal structure of the characters inside the parentheses. Processing means (13
), (15) is a generation pre-processing means that converts the internal structure of the translated text into a structure suitable for the language of the translated text, and (16) is a means for generating the syntax of the translated text from the internal structure of the translated text. (17) is a parenthetical acid processing means for generating a translated sentence with parentheses from the internal structure of the characters inside the parentheses converted by the syntax generating processing means (16); (18) is a parenthetical acid processing means for generating a translated sentence with parentheses; Morphological generation processing means (19) performs morphological processing on each word in the translation, and post-generation processing means (19) performs final correction of the translated text.

第2図は上記実施例に係わる機械翻訳方式を実現するた
めの機械翻訳装置の構成を示すブロック図である。第2
図に示すように、この機械翻訳装置は、キーボード、マ
ウス、OCR(光学式文字読取装置)などの入力装置(
20)と、この人力装置(20)を制御する入力制御部
(21)と、翻訳作業全体を管理し、原文や訳文の編集
処理を制御する編集制御部(22)と、原文や訳文など
の表示を制御する表示制御部(23)と、この表示制御
部(23)から指示された内容を表示する表示装置(2
4)と、翻訳処理を行う翻訳手段としての翻訳系(25
)と、原文や訳文を格納するテキスト記憶部(26)と
、原文や訳文などの印刷を制御する印刷制御部(27)
と、この印刷制御部(27)から指示された内容を印刷
する印刷装置(28)と、翻訳処理に必要な辞書情報が
格納してあり、それらの辞書情報を検索する機能を有す
る辞書システム(29)とから構成されている。
FIG. 2 is a block diagram showing the configuration of a machine translation device for realizing the machine translation method according to the above embodiment. Second
As shown in the figure, this machine translation device uses input devices such as a keyboard, mouse, and OCR (optical character reader).
20), an input control unit (21) that controls this human-powered device (20), an editing control unit (22) that manages the entire translation work and controls the editing process of the original text and translated text, and A display control unit (23) that controls display, and a display device (2) that displays content instructed by the display control unit (23).
4) and a translation system (25
), a text storage unit (26) that stores the original text and translated text, and a print control unit (27) that controls printing of the original text and translated text, etc.
, a printing device (28) that prints the content instructed by the print control unit (27), and a dictionary system (28) that stores dictionary information necessary for translation processing and has a function to search this dictionary information. 29).

第3図は、この実施例に係わる処理による内部構造を示
す図である。この実施例の翻訳方式における内部構造は
、(36)に示すようなノードとリンクとから構成され
、各ノードやリンクは属性名=属性値の集まりからなる
ものとする。(30)は文字切り出し手段(1)によっ
て、テキスト記憶部(26)の中の原文章から切り出し
て、翻訳系(25)へ送る原文、(31)は原文(30
)を1文字ごとに分解して文字コードのリスト形式で保
持したもの、(32)は字種情報解析手段(2)によっ
て付与したリスト(31)の各文字コードに対応する字
種情報、(33)は括弧内解析手段(4)によって得た
、括弧の内側の文字を解析した内部構造(34)と括弧
の外側に位置する他の未解析の文字コードの並びと混在
させて保持しているリスト、(35)は形態素解析手段
(6)の処理が終了した状態の文の構成要素ごとの内部
構造のリスト、(36)は構文解析手段(9)の処理が
終了した状態の内部構造、(37)は構文変換処理手段
(13)の処理が終了状態の内部構造、(38)は括弧
内変換手段(14)によって変換された内部構造の部分
、(39)は訳文である。
FIG. 3 is a diagram showing an internal structure resulting from processing related to this embodiment. The internal structure of the translation method of this embodiment is composed of nodes and links as shown in (36), and each node and link is composed of a collection of attribute name=attribute value. (30) is the original text that is cut out from the original text in the text storage unit (26) by the character segmentation means (1) and sent to the translation system (25); (31) is the original text (30
) is broken down character by character and held in the form of a list of character codes, (32) is the character type information corresponding to each character code in the list (31) given by the character type information analysis means (2), ( 33) is maintained by mixing the internal structure (34) obtained by analyzing the characters inside the parentheses with the sequence of other unanalyzed character codes located outside the parentheses, obtained by the parentheses analysis means (4). (35) is a list of the internal structure of each constituent element of the sentence after the processing of the morphological analysis means (6) has been completed, and (36) is the internal structure after the processing of the syntactic analysis means (9) has been completed. , (37) is the internal structure after the processing of the syntax conversion processing means (13) has been completed, (38) is the part of the internal structure converted by the conversion means (14) in parentheses, and (39) is the translated text.

次に、上記実施例の動作について図を参照しながら説明
する。第1図において、文切り出し手段(1)によって
、テキスト記憶部(26)の中の原文章から1つの文の
中の文字コードのリスト(31)に対して各文字の種類
に関する情報を文字と対応するリスト形式(32)で付
与すると同時に、文字の中に左括弧を検出すると、対応
する右括弧を探して、左括弧と右括弧とが1文の中の対
になって出現する場合には、(32)に示すように括弧
であるという字種1pやrpを付与し、左右が対応して
いない場合には、記号しての字種を付与する。これによ
り、文中での括弧の使用目的に応じた字種を付与するこ
とが可能となる。解析前処理手段(3)は原文中に記号
列や特定の表現が含まれる場合には、本格的な解析に先
立ち、記号列などの文法外の処理を行う。
Next, the operation of the above embodiment will be explained with reference to the drawings. In FIG. 1, information regarding the type of each character is extracted from the original text in the text storage unit (26) to a list (31) of character codes in one sentence by the sentence extraction means (1). At the same time as adding the corresponding list format (32), when a left parenthesis is detected in a character, the corresponding right parenthesis is searched, and if a left parenthesis and a right parenthesis appear as a pair in one sentence, As shown in (32), the character types 1p and rp are assigned as parentheses, and when the left and right sides do not correspond, the character type as symbols is assigned. This makes it possible to assign character types depending on the purpose of parentheses in a sentence. If a symbol string or a specific expression is included in the original text, the pre-analysis processing means (3) performs extra-grammatical processing of the symbol string or the like prior to full-scale analysis.

次に、解析前処理手段(3)は、括弧の内側の文字に対
して、手段(5)から手段(11)までの解析手段を適
用して解析し内部構造(34)を作り、内部構造(34
)と括弧の外側に位置する他の未解析の文字の並びとを
混在させて、(35)に示すようなリスト形式で保持す
る。括弧内の内部構造(34)の−容土のノードには、
quote=paren等の属性を付与して、後の処理
で括弧内の内部構造であることを判定することが可能な
ように情報を設定しておく。
Next, the analysis pre-processing means (3) applies the analysis means from means (5) to means (11) to analyze the characters inside the parentheses, creates an internal structure (34), and generates an internal structure. (34
) and other unparsed character sequences located outside the parentheses are mixed and held in a list format as shown in (35). In the internal structure (34) in parentheses, the −containing soil node has
By giving an attribute such as quote=paren, information is set so that it is possible to determine that it is an internal structure within parentheses in later processing.

形態素解析手段(5)は形態素解析文法規則によってリ
スト(35)に示すように、括弧の外側に位置する文字
コードの並びを辞書システム(29)を検索しながら、
文の構成要素ごとに属性Cat、auxwordなどか
らなる内部構造に置き換える。この時、括弧機能解析第
■部(6)は、括弧の内側の文字コードの並びを解析し
て得た内部構造(34)と括弧の外側に位置する文の構
成要素ごとの内部構造との関係を解析し、括弧内の内部
構造(34)を支配するごとリンクで接続して、そのリ
ンクに5yn=paren等の属性を付与する。
The morphological analysis means (5) uses the morphological analysis grammar rules to search the dictionary system (29) for the sequence of character codes located outside the parentheses, as shown in the list (35).
Each component of the sentence is replaced with an internal structure consisting of attributes Cat, auxword, etc. At this time, the parenthesis function analysis part (6) compares the internal structure (34) obtained by analyzing the sequence of character codes inside the parentheses with the internal structure of each component of the sentence located outside the parentheses. The relationship is analyzed, and each controlling internal structure (34) in parentheses is connected with a link, and an attribute such as 5yn=paren is given to the link.

構文解析前処理手段(7)は、構文解析手段(9)の処
理に先立ち、並列範囲の同定や構文要素の局所的な解析
を行うと同時に、括弧機能解析第■部(8)と副処理と
して呼び出し、括弧機能解析第■手段(6)で処理し切
れなかった、括弧の内側の文字に対する内部構造と括弧
の外側に位置する他の内部構造の並びとの関係を解析し
て内部構造の属性に反映する。構文解析手段(9)は翻
訳系(25)が持つ構文解析文法規則にしたがって文の
構成要素ごとの内部構造の間を解析し、解析して得た情
報をノードやリンクの属性syn。
Prior to the processing of the syntactic analysis means (9), the syntactic analysis preprocessing means (7) identifies parallel ranges and locally analyzes syntactic elements, and at the same time performs parenthesis function analysis part (8) and subprocessing. , and analyze the relationship between the internal structure for the characters inside the parentheses and the sequence of other internal structures located outside the parentheses, which could not be processed in the parentheses function analysis method (6). Reflect in attributes. The syntactic analysis means (9) analyzes the internal structure of each constituent element of the sentence according to the syntactic analysis grammar rules of the translation system (25), and uses the information obtained through the analysis as attributes syn of nodes and links.

semとして保持する。その際に、括弧機能解析第■手
段(10)を副処理として呼び出し、括弧の内側の文字
に対する内部構造(34)と括弧の外側に位置する他の
内部構造との関係を、さらに深く解析して内部構造の属
性として格納する。解析後処理手段(11)は構文解析
した内部構造を補正し、また、変換前処理手段(12)
は解析後処理手段(11)までの解析で得た内部構造を
さらに中間的な内部構造に置き換える。
Save as sem. At that time, the parenthesis function analysis means (10) is called as a sub-process to further deeply analyze the relationship between the internal structure (34) for the characters inside the parentheses and other internal structures located outside the parentheses. and stored as an attribute of the internal structure. The post-analysis processing means (11) corrects the parsed internal structure, and the pre-conversion processing means (12)
further replaces the internal structure obtained by the analysis up to the post-analysis processing means (11) with an intermediate internal structure.

構造変換処理手段(13)は解析された原文の内部構造
(36)に対して訳文の内部構造(37)に変換する。
The structure conversion processing means (13) converts the internal structure (36) of the analyzed original text into the internal structure (37) of the translated text.

その際に、括弧内変換手段(14)を副処理として呼び
出し、括弧の内側の文字に対する内部構造(34)を独
立して変換し、内部構造(38)を得る。
At this time, the parenthesized conversion means (14) is called as a sub-process, and the internal structure (34) for the characters inside the parentheses is independently converted to obtain the internal structure (38).

さらに構文生成処理手段(16)は訳文の内部構造から
訳文の構文を生威し、括弧内生酸処理手段(17)は構
文生成処理手段(16)から副処理として呼び出され、
括弧内変換手段(14)において変換された括弧の内側
の文字に対する内部構造から括弧付きで訳文を生成し、
形態素生成処理手段(18)は訳文中の各単語に関する
形態的な処理を行い、生成後処理手段(19)は訳文の
最終的な補正を行い、訳文(39)を生成する。
Further, the syntax generation processing means (16) generates the syntax of the translated text from the internal structure of the translated text, and the parenthetical acid processing means (17) is called as a subprocess by the syntax generation processing means (16),
Generating a translated sentence with parentheses from the internal structure of the characters inside the parentheses converted in the parentheses conversion means (14),
The morpheme generation processing means (18) performs morphological processing on each word in the translated text, and the post-generation processing means (19) performs final correction of the translated text to generate a translated text (39).

以上によって、従来の翻訳過程の自然な拡張により、括
弧類を含む原文章を訳文章に適切に翻訳することが可能
である。
As described above, it is possible to appropriately translate an original text including parentheses into a translated text by a natural extension of the conventional translation process.

なお、上記実施例では機械翻訳における括弧類の処理方
式について説明したが、一般の文章処理方式における括
弧類や記号の扱いに関する場合に、本方式を用いてもよ
く、上記実施例と同様の効果を奏する。
Although the above example describes a method for processing parentheses in machine translation, this method may also be used in cases related to the handling of parentheses and symbols in general text processing methods, and the same effect as in the above example can be achieved. play.

[発明の効果コ 以上のように、この発明によれば翻訳過程の各段階で括
弧類の処理に対応する副処理過程を備え、括弧類に関す
る情報を内部構造中の属性として保持することにより、
文中における括弧類が文全体に占める役割を適格に分析
できるので、従来の機械翻訳手段の自然な拡張によって
、翻訳システムの等偏性を維持したまま、括弧類を含む
原文章を括弧類を含む訳文章に翻訳することが可能とな
る。
[Effects of the Invention] As described above, according to the present invention, by providing a sub-processing process corresponding to the processing of brackets at each stage of the translation process, and retaining information regarding the brackets as an attribute in the internal structure,
Since it is possible to properly analyze the role that parentheses play in a sentence as a whole, it is a natural extension of conventional machine translation methods to convert original sentences that contain parentheses to include parentheses while maintaining the equivalence of the translation system. It becomes possible to translate into a translated text.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図はこの発明の一実施例に係わる機械翻訳方式によ
る基本流れ図、第2図はこの発明の一実施例を実現する
ための機械翻訳装置の構成図、第3図はこの発明の一実
施例の処理過程の内部構造を示す図、第4図、第5図、
第6図はいずれも従来の機械翻訳方式を説明するための
図である。 図中、(1)は1文切り出し処理手段、(2)は字他ね
情報解析手段、(3)は解析前処理手段、(4)は括弧
内解析手段、(5)は形態素解析手段、(6)は括弧機
能解析第1手段、(7)は構文解析前処理手段、(8)
は括弧機能解析第1手段、(9)は構文解析手段、(1
0)は括弧機能解析第1手段、(11)は解析後処理手
段、(12)は変換前処理手段、(13)は構造変換処
理手段、(14)は括弧内変換手段、(15)は生成前
処理手段、(16)は構文生成処理手段、(17)は括
弧的生成処理手段、(18)は形態素生成処理手段、(
19)は生成後処理手段である。 なお、図中同一符号は同一または相当部分を示す。
Fig. 1 is a basic flowchart of a machine translation system according to an embodiment of the present invention, Fig. 2 is a block diagram of a machine translation device for realizing an embodiment of the invention, and Fig. 3 is an implementation of the invention. Diagrams showing the internal structure of the example processing process, FIGS. 4 and 5,
FIG. 6 is a diagram for explaining the conventional machine translation method. In the figure, (1) is a single sentence extraction processing means, (2) is a character and other information analysis means, (3) is an analysis pre-processing means, (4) is an analysis means in parentheses, (5) is a morphological analysis means, (6) is the first means of bracket function analysis, (7) is the syntax analysis preprocessing means, and (8)
is the first means of bracket function analysis, (9) is the syntax analysis means, (1
0) is the first means of parenthesis functional analysis, (11) is the post-analysis processing means, (12) is the pre-conversion processing means, (13) is the structural conversion processing means, (14) is the conversion means within the parentheses, and (15) is the Generation pre-processing means, (16) syntax generation processing means, (17) parenthetical generation processing means, (18) morpheme generation processing means, (
19) is a post-generation processing means. Note that the same reference numerals in the figures indicate the same or corresponding parts.

Claims (1)

【特許請求の範囲】[Claims] 原文章から訳文章を得る機械翻訳装置において、入力さ
れた原文章から1つの文を切り出す手段と、この1つの
文の中に括弧が含まれている場合に左括弧と右括弧との
対を検出し、その括弧が本来の意味で括弧として使用さ
れているか、或いは記号として使用されているかを判定
し、その情報を付与すると同時に、文中の括弧以外の文
字に対して同様に文字の種類に関する情報を付与する手
段と、括弧の内側の文字を解析し内部構造に置き換え、
括弧の外側に位置する他の未解析文字の並びと混在させ
て保持する手段と、括弧の外側に位置する文字の並びを
、文の構成要素ごとに解析して内部構造に置き換える手
段と、文の構成要素ごとの内部構造の間の関係を解析す
る構文解析手段と、この構造解析手段の副処理過程とし
て前記括弧の内側の文字に対する内部構造と括弧の外側
に位置する他の内部構造の並びとの関係を解析する手段
と、解析された原文の内部構造に対して訳文の内部構造
に変換する変換処理手段と、この変換処理手段の副処理
として括弧の内側の文字に対する内部構造を変換する手
段と、訳文の内部構造から訳文を生成する手段と、この
訳文生成手段の副処理として変換された括弧の内側の文
字に対する内部構造から括弧付きで訳文を生成する手段
とを備えたことを特徴とする括弧付き文の機械翻訳方式
In a machine translation device that obtains a translated text from an original text, there is a means for cutting out one sentence from an input original text, and a means for cutting out a pair of left and right parentheses when this one sentence contains parentheses. Detect and determine whether the parentheses are used as parentheses in their original meaning or as symbols, and at the same time provide information regarding the type of characters for characters other than parentheses in the sentence. A means of adding information, parsing the characters inside the parentheses and replacing them with the internal structure,
A means for storing the character sequences located outside the parentheses in a mixed manner with other unparsed character sequences, a means for parsing the character sequences located outside the parentheses for each component of the sentence and replacing it with an internal structure, A syntax analysis means for analyzing the relationship between the internal structures of each constituent element, and as a sub-processing process of this structure analysis means, an arrangement of the internal structure for the characters inside the parentheses and other internal structures located outside the parentheses. a conversion processing means for converting the internal structure of the analyzed original text into the internal structure of the translated text, and a subprocess of this conversion processing means that converts the internal structure of the characters inside the parentheses. A means for generating a translated sentence from the internal structure of the translated sentence, and a means for generating a translated sentence with parentheses from the internal structure for the converted characters inside the parentheses as a sub-process of the translated sentence generating means. A machine translation method for parenthesized sentences.
JP1266557A 1989-10-13 1989-10-13 Mechanical translation system for sentence including parentheses Pending JPH03127264A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1266557A JPH03127264A (en) 1989-10-13 1989-10-13 Mechanical translation system for sentence including parentheses

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1266557A JPH03127264A (en) 1989-10-13 1989-10-13 Mechanical translation system for sentence including parentheses

Publications (1)

Publication Number Publication Date
JPH03127264A true JPH03127264A (en) 1991-05-30

Family

ID=17432496

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1266557A Pending JPH03127264A (en) 1989-10-13 1989-10-13 Mechanical translation system for sentence including parentheses

Country Status (1)

Country Link
JP (1) JPH03127264A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06259470A (en) * 1993-03-08 1994-09-16 Nec Corp Machine translation device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06259470A (en) * 1993-03-08 1994-09-16 Nec Corp Machine translation device

Similar Documents

Publication Publication Date Title
JP2654001B2 (en) Machine translation method
US20050137853A1 (en) Machine translation
US4833611A (en) Machine translation system
JPH02112068A (en) System for simply displaying text
JPH03127264A (en) Mechanical translation system for sentence including parentheses
JPS6180362A (en) Translation system
JP2632806B2 (en) Language analyzer
JPS6190269A (en) Translation system
JPS6344276A (en) Automatic generator for generated syntax
JPS62163174A (en) Mechanical translating device
JP2000259631A (en) Machine translation proofreading device
JPS63136260A (en) Processing system for production of sentence of mechanical translation system
JP3197110B2 (en) Natural language analyzer and machine translator
JP3233800B2 (en) Machine translation equipment
JP2875927B2 (en) Machine translation equipment
JP2687334B2 (en) Machine translation equipment
JPH0550778B2 (en)
JPH07122876B2 (en) Machine translation device
JPH06348465A (en) Program preparation supporting device
JPS6190268A (en) Translation system
JPH0546370A (en) Program generating device
JPH04112366A (en) Natural language sentence analyzing device
JPH03216761A (en) Machine translation device
JPS6244872A (en) Machine translator
JPH01112368A (en) Machine translator