JPH0844741A

JPH0844741A - Device and method for analyzing tense of japanese sentence

Info

Publication number: JPH0844741A
Application number: JP6295054A
Authority: JP
Inventors: Eiji Takeishi; 英二武石
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1994-05-25
Filing date: 1994-11-29
Publication date: 1996-02-16

Abstract

PURPOSE:To analyze the omitted tense of a TAIGEN (the sustentives) termination sentence by analyzing the tense by utilizing the interaction between the tense in a subordinate clause and the tense in a main clause. CONSTITUTION:A TAIGEN termination extraction part 100 checks whether the final word of the sentence ending paragraph of an inputted document excluding a punctuation mark ',' is a noun or not and decides a TAIGEN termination sentence when there is the final word terminating with the noun. When the TAIGEN termination extraction part 110 extracts the TAIGEN termination sentence, a time expression extraction part 120 collates syntax analytic result information on the sentence to be processed with a time expression pattern table 140 and outputs all time expressions together with its classification information. A predicate sentence tense analysis part 130 applies a verb predicate sentence time analysis part 131 for a TAIGEN termination sentence decided as a verb predicate sentence or a noun sentence tense analysis part 132 for a TAIGEN termination sentence decided as a noun predicate sentence to output the result. Further, an analysis is taken by using context information as well.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、日本語文時制解析装置
及び日本語文時制解析方法に係り、特に入力された日本
語文の文末文節が名詞で終止する体言止め文である場合
に、その省略された時制を解析する日本語文時制解析装
置及び日本語文時制解析方法に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a Japanese sentence tense analysis device and a Japanese sentence tense analysis method, and in particular, when the sentence ending clause of an input Japanese sentence is a noun sentence ending sentence, it is omitted. The present invention relates to a Japanese sentence tense analysis device and a Japanese sentence tense analysis method for analyzing tense.

【０００２】[0002]

【従来の技術】従来の文書処理システムにおける日本語
文解析技術として、体言止め表現の抽出、時制の解析、
省略された表現の補完については次のような技術が検討
されている。従来の第一の方法として、体言止め表現の
抽出技術では、構文解析の一部として、文中に「が
（は）」格と「を」格が存在する場合、その文内のサ変
名詞による体言止めの品詞は動詞であるという、動詞を
修飾する格要素の意味的な制約を利用し、着目した体言
（名詞）の前方の格要素が当該体言を修飾するか否かを
判定し、その結果を利用して体言止めとその品詞を抽出
する技術が検討されている（鍬田他、「サ変名詞＋読点
の品詞判別処理」、情報処理学会第４２回全国大会、１
９９１年）。2. Description of the Related Art Japanese sentence analysis technology in a conventional document processing system includes extraction of wording expressions, tense analysis,
The following techniques are being considered for supplementing omitted expressions. As the first conventional method, in the technique of extracting wording-stop expressions, if there are "ga (wa)" and "wo" cases in the sentence as part of the syntactic analysis, the wording of the sahen noun in the sentence Using the semantic constraint of the case element that modifies the verb that the part of speech of the stop is a verb, it is determined whether or not the case element in front of the observed word (noun) modifies the word. Techniques for extracting word-of-speech and its part-of-speech by using are being investigated (Kudada et al., "Sahen noun + part-of-speech discrimination of reading point", 42nd National Convention of IPSJ, 1
991).

【０００３】また従来の第二の方法として、時制を解析
する技術では、主に機械翻訳システムなどにおいて、時
制を表す表現（「た」や「ている」など）が文中に出現
することを前提として、それらが「過去」を表すのか
「現在完了」を表すのかなどを解析する技術が検討され
ている（白井他：「新聞記事翻訳における時制の訳出に
ついて」、電子情報通信学会秋期全国大会、１９９０
年）。As a second conventional method, in the technique of analyzing tenses, it is premised that expressions (such as "ta" and "teru") representing tenses appear in sentences, mainly in machine translation systems and the like. As a result, a technology for analyzing whether they represent "past" or "currently completed" is being studied (Shirai et al .: "Translation of tense in newspaper article translation", IEICE Autumn National Convention, 1990
Year).

【０００４】従来の第三の方法として、日本語文におい
て省略された表現として、例えば「申込み期限を教えて
下さい」「もう、締め切りました」において、”締め切
りました”の前の「申込みは」が省略されている。この
ような省略された表現を補完する技術として、述語の格
要素を対象とし、従属節の種類などを文内情報、接続詞
や前後の文との語彙の結束性などの文脈情報を用いて、
省略された表現を推定する技術が検討されている（工藤
他：「対話翻訳システムのための文脈処理機構とその性
能評価」、情報処理学会論文誌第３３巻第２号、１９９
２年）。As a third conventional method, as an abbreviation in Japanese sentences, for example, "Please tell me the deadline for application" and "Already closed", "Application" before "deadline" Is omitted. As a technique that complements such abbreviated expressions, targeting case elements of predicates, using contextual information such as the type of subordinate clauses, in-sentence information, connectives and vocabulary cohesion with preceding and following sentences,
Techniques for estimating omitted expressions have been studied (Kudo et al .: "Context Processing Mechanism for Dialogue Translation Systems and Its Performance Evaluation", IPSJ Transactions, Vol. 33, No. 2, 199).
2 years).

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記従
来技術による文書処理システムでは、体言止め文が頻出
する新聞記事などの日本語文章を処理する場合に以下の
ような問題がある。上記従来の第一の方法のように、体
言止め表現を抽出する技術により体言止めを正しく抽出
し、構文解析処理の精度を向上させることは可能である
が、機械翻訳システムなどの文書処理応用システムで必
要となる時制情報を抽出することはできない。即ち、外
国語に日本語から翻訳する場合には、時制表現が曖昧な
まま翻訳することは不可能であるため、時制情報の抽出
は必須となる。However, the above-described document processing system according to the prior art has the following problems when processing a Japanese sentence such as a newspaper article in which a false stop sentence frequently appears. Like the first conventional method described above, it is possible to correctly extract the wording stop by the technology for extracting the wording stop expression and improve the accuracy of the syntactic analysis process. However, a document processing application system such as a machine translation system. It is not possible to extract the tense information needed in. That is, when translating from Japanese into a foreign language, it is impossible to translate the tense expression while it is ambiguous. Therefore, it is essential to extract the tense information.

【０００６】また、従来の第二の方法に示す時制解析の
技術は、時制を表す表現が出現することを前提としてい
るため、体言止め文のように時制を表す表現が省略され
た文は処理の対象とならず、時制を解析することはでき
ない。更に従来の第三の方法のように、省略された表現
を補完する技術でも、体言止め文における時制の補完に
ついては検討の対象とされていないため、この技術を利
用して省略された時制表現を補完することはできない。
また、文内情報を利用した体言止め文の時制解析方法
は、１文内の時間表現や述語表現の情報に基づいて、体
言止め文の時制を解析するものであるため、文内にキー
となる表現が出現せず、前後の文を見ないとその時制が
認定できないような文に対しては正しく解析できない。Further, since the technique of tense analysis shown in the second conventional method is based on the premise that an expression representing tense appears, a sentence in which the expression representing tense is omitted, such as an improper sentence, is processed. You cannot analyze the tense because it is not subject to. Furthermore, even with the technique of complementing omitted expressions like the third method in the past, the supplementation of tenses in word-stop sentences is not subject to consideration, so tense expressions omitted using this technique are not considered. Cannot be complemented.
In addition, the tense analysis method of a word stop sentence that uses in-sentence information analyzes the tense of the word stop sentence based on the information of the time expression and the predicate expression in one sentence. Cannot be correctly parsed for sentences whose tense cannot be recognized unless the preceding and following sentences appear.

【０００７】本発明は、上記の点に鑑みなされたもの
で、文中に時間を表す表現が存在しない場合においても
体言止め文の省略された時制の解析を可能にすること
と、文中に出現する時間を表す表現とその種類、文末の
体言の種類、複文における従属節と主節との関係を解析
し、従来解析することができなかった体言止め文の省略
された時制を解析するための日本語文時制解析装置及び
日本語文時制解析方法を提供することを目的とする。The present invention has been made in view of the above points, and enables the analysis of the tense in which a word-stop sentence is omitted even when there is no expression representing time in the sentence, and appears in the sentence. Japan for analyzing time expressions and their types, end-sentence type, and relations between subordinate clauses and main clauses in compound sentences, and abbreviated tenses of word-stop sentences that could not be analyzed in the past. An object is to provide a language sentence tense analysis device and a Japanese sentence tense analysis method.

【０００８】また、本発明の更なる目的は、文内情報を
用いた体言止めの時制の解析方法を補完し、前後の文の
時制を参照すると共に、当該体言止め文と後文との述語
の並列性や動作の推移性を解析することによって、文内
に時制を指示する表現を含まない体言止め文の時制を解
析することが可能な日本語文時制解析装置及び日本語文
時制解析方法を提供することを目的とする。A further object of the present invention is to supplement the analysis method of the tense of the sentence stop using the in-sentence information, refer to the tense of the sentences before and after, and to predicate the sentence stop sentence and the subsequent sentence. We provide a Japanese sentence tense analysis device and a Japanese sentence tense analysis method that can analyze the tense of a body stop sentence that does not include an expression that indicates tense in the sentence by analyzing the parallelism and transitivity of movements. The purpose is to do.

【０００９】[0009]

【課題を解決するための手段】図１は、本発明の原理構
成図である。本発明は、日本語文章を入力して処理する
日本語文処理システムにおける日本語文章の時制を解析
する日本語文時制解析装置において、処理対象の日本語
文章を構文解析した結果が入力されると、日本語文章の
文末文節が名詞で終止している体言止め文の省略されて
いる時制を文末文節の性質より解析する第１の解析手段
１００と、第１の解析手段１００に加えて、文脈情報に
より解析する第２の解析手段８００を有する。FIG. 1 is a block diagram showing the principle of the present invention. The present invention, in a Japanese sentence tense analysis device for analyzing the tense of a Japanese sentence in a Japanese sentence processing system for inputting and processing a Japanese sentence, when the result of parsing the Japanese sentence to be processed is input, In addition to the first analysis unit 100 and the first analysis unit 100, which analyzes the tense in which a sentence-fixed sentence in which a sentence ending sentence of a Japanese sentence ends with a noun is omitted, in addition to the first analyzing unit 100, context information It has the 2nd analysis means 800 analyzed by.

【００１０】上記の第１の解析手段１００は、日本語文
章から体言止め文を抽出する抽出手段１１０と、体言止
め文の文末文節である体言止め文節が、動詞述語文また
は名詞述語文であるかを判定する述語判定手段１１５
と、述語判定手段１１５により判定された動詞述語文ま
たは名詞述語文毎に、日本語文章の構文構造を解析し、
体言止め文の時制解析結果を出力する構文パターンを利
用した時制解析手段１３０とを有する。In the first analysis means 100, the extraction means 110 for extracting a word stop sentence from a Japanese sentence and the word stop phrase that is the last sentence clause of the word stop sentence is a verb predicate sentence or a noun predicate sentence. Predicate determination means 115 for determining
And for each verb predicate sentence or noun predicate sentence judged by the predicate judging means 115, analyze the syntactic structure of the Japanese sentence,
And a tense analysis means 130 using a syntactical pattern that outputs a tense analysis result of an injunction sentence.

【００１１】また、上記の構文パターンを利用した時制
解析手段１３０は、時間を表す表現パターンを格納した
時間表現パターン記憶手段と、抽出手段１１０で抽出さ
れた体言止め文中に出現する時間表現を抽出し、時間表
現パターン記憶手段と照合して、時間表現が絶対的また
は相対的な時制を表現するかの分類情報を出力する時間
表現抽出手段とを有する。Further, the tense analysis means 130 utilizing the above-mentioned syntactical pattern extracts the time expression pattern storage means for storing the expression pattern representing time and the time expression appearing in the wording stop sentence extracted by the extraction means 110. And a time expression extraction means for comparing with the time expression pattern storage means and outputting classification information indicating whether the time expression expresses an absolute or relative tense.

【００１２】また、上記の述語判定手段１１５は、体言
止め文節の名詞に係る全ての文節中の、副詞、時詞、接
続助詞を含む文節や動詞の格要素となる文節の有無によ
り、体言止め文節が動詞述語文として使用されている
か、名詞述語文として使用されているかを判定する判定
手段を有する。また、上記の構文パターンを利用した時
制解析手段１３０は、過去を表す名詞を格納した既然名
詞記憶手段と、述語判定手段１１５において体言止め文
節が動詞述語文であると判定された場合に、体言止め文
節の名詞と既然名詞記憶手段との照合結果に基づいた体
言止め文の構文構造、体言止め文節の種類、時間表現及
び分類情報を解析することにより、体言止め文節の省略
されている時制を解析する動詞述語文時制解析手段と、
述語判定手段１１５において体言止め文節が名詞述語文
であると判定された場合に、体言止め文の構文構造、時
間表現及び分類情報を解析することにより、体言止め文
節の省略されている時制を解析する名詞述語文時制解析
手段とを有する。The predicate determination means 115 determines whether or not there is a phrase that includes an adverb, a noun, a connecting particle, or a phrase that is a case element of a verb in all the phrases related to the noun of the phrase to stop the sentence. It has a judgment means which judges whether the clause is used as a verb predicate sentence or a noun predicate sentence. Further, the tense analysis means 130 using the above-mentioned syntactic pattern, when the noun storage means that stores nouns representing the past and the predicate determination means 115 determine that the noun phrase is a verb predicate sentence, the noun sentence By analyzing the syntactic structure of the noun phrase, the type of noun phrase, the time expression, and the classification information based on the matching result of the noun of the noun phrase and the existing noun storage means, the tense in which the noun phrase is omitted is analyzed. Verb predicate sentence tense analysis means to analyze,
When the predicate determination unit 115 determines that the nomination phrase is a noun predicate sentence, the tense in which the nomination phrase is omitted is analyzed by analyzing the syntactic structure, time expression, and classification information of the nomination phrase. And a noun predicate sentence tense analyzing means.

【００１３】さらに、上記の第２の解析手段８００は、
上記の第１の解析手段の抽出手段１１０、述語判定手段
１１５、構文パターンを利用した時制解析手段１３０に
加えて、文章単位の情報を保持しておく文脈情報記憶手
段３００と、体言止め文とその後文の構文パターンから
体言止めの時制を解析するための規則を記述した時制解
析規則５００と、第１の解析手段１００より取得した体
言止め文の解析結果と文脈情報記憶手段３００及び時制
解析規則５００とを照合し、体言止め文と次文の述語の
並列性や動詞の推移性を基に体言止め文の時制を解析す
る時制解析手段に相当する時制解析手段４００とを更に
有する。Further, the above-mentioned second analysis means 800 is
In addition to the extraction unit 110 of the first analysis unit, the predicate determination unit 115, and the tense analysis unit 130 using a syntactic pattern, a context information storage unit 300 that holds information in sentence units, and a wording sentence After that, a tense analysis rule 500 describing a rule for analyzing the tense of the word stop from the syntax pattern of the sentence, the analysis result of the word stop sentence acquired from the first analysis means 100, the context information storage means 300, and the tense analysis rule. It further includes a tense analysis unit 400 corresponding to a tense analysis unit that collates 500 and analyzes the tense of the noun sentence based on the parallelism of the predicate of the noun sentence and the predicate of the next sentence and the transitivity of the verb.

【００１４】本発明は、日本語文章を入力して処理する
文書処理システムにおける日本語文章の時制を解析する
日本語文時制解析方法において、処理対象の日本語文章
を構文解析した結果が入力されると、日本語文章中の文
末文節が名詞で終止している体言止め文の省略されてい
る時制を文末文節の性質により解析する。図２は、本発
明の原理を説明するためのフローチャートである。According to the present invention, in a Japanese sentence tense analysis method for analyzing the tense of a Japanese sentence in a document processing system for inputting and processing a Japanese sentence, the result of parsing the Japanese sentence to be processed is inputted. And, the tense in which the sentence stop sentence in which the sentence end sentence in the Japanese sentence ends with a noun is omitted is analyzed by the property of the sentence end sentence. FIG. 2 is a flow chart for explaining the principle of the present invention.

【００１５】また、本発明の日本語文時制解析装置は、
日本語文章の構文解析結果が入力されると（ステップ
１）、日本語文章から文末文節が名詞で終止している体
言止め文を抽出し（ステップ２）、体言止め文の文末文
節である体言止め文節が、動詞述語文または名詞述語文
であるかを判定し（ステップ３）、判定された動詞述語
文または名詞述語文毎に、体言止め文の構文構造などを
解析し（ステップ４）、体言止め文の時制解析結果を出
力する（ステップ５）。Further, the Japanese sentence tense analysis device of the present invention is
When the syntax analysis result of a Japanese sentence is input (step 1), a sentence stop sentence in which the sentence ending phrase ends with a noun is extracted from the Japanese sentence (step 2), and the sentence sentence that is the sentence ending sentence clause is extracted. It is determined whether the stop clause is a verb predicate sentence or a noun predicate sentence (step 3), and for each determined verb predicate sentence or noun predicate sentence, the syntactic structure of the verbal stop sentence is analyzed (step 4), The tense analysis result of the improper sentence is output (step 5).

【００１６】また、体言止め抽出後に、体言止め文中に
出現する時間表現を抽出し、所定のパターンと照合する
ことにより、時間表現が絶対的または相対的な時制を表
現するかの分類情報を出力する。また、抽出された体言
止め文が動詞述語文か名詞述語文であるかを判定する際
に体言止め文節の名詞に係る全ての文節の中の、副詞、
時詞、接続助詞を含む文節や動詞の格要素となる文節の
有無により判定する。Further, after extracting the wording stop, the time expression appearing in the word stop sentence is extracted and collated with a predetermined pattern to output the classification information indicating whether the time expression expresses an absolute or relative tense. To do. In addition, when determining whether the extracted noun sentence is a verb predicate sentence or a noun predicate sentence, an adverb in all the phrases related to the noun of the noun sentence phrase,
Judgment is made based on the presence or absence of bunsetsu including verbs and connective particles, and bunsetsu that is a case element of verbs.

【００１７】また、体言止め文節が動詞述語文であると
判定された場合に、体言止め文節の名詞と過去のことを
表す名詞を格納した既然名詞記憶手段とを照合し、同一
の名詞が登録されているかを解析し、体言止め文の構文
構造と体言止め文中の時間表現及び分類情報の解析によ
り、体言止め文節の省略されている時制を解析し、一
方、体言止め文が名詞述語文であると判定された場合
に、体言止め文の構文構造と時間表現及び分類情報の解
析により、体言止め文節の省略されている時制を解析す
る。Further, when it is determined that the noun phrase is a verb predicate sentence, the noun of the noun phrase is compared with the existing noun storage means that stores a noun representing the past, and the same noun is registered. Is analyzed, and by analyzing the syntactic structure of the word stop sentence and the time expression and classification information in the word stop sentence, the tense in which the word stop clause is omitted is analyzed, while the word stop sentence is a noun predicate sentence. When it is determined that there is the sentence stop sentence, the tense in which the sentence stop sentence is omitted is analyzed by analyzing the syntactic structure of the sentence stop sentence, the time expression, and the classification information.

【００１８】上記の日本語文時制解析方法において、日
本語文章内に時制を指示する表現を含まないために時制
が解析不能の場合に（ステップ６）、体言止め文の前後
の文の時制を参照し（ステップ７）、体言止め文と体言
止め文の後文との述語の並列性や動作の推移性を解析す
ることにより体言止め文の時制を解析する（ステップ
８）。In the above Japanese sentence tense analysis method, when the tense cannot be parsed because the Japanese sentence does not include an expression that indicates the tense (step 6), refer to the tenses before and after the sentence stop sentence. Then, the tense of the word stop sentence is analyzed by analyzing the parallelism of the predicates and the transitivity of the action between the word stop sentence and the subsequent sentence of the word stop sentence (step 8).

【００１９】即ち、日本語文の構文解析結果が入力さ
れ、構文解析結果を文脈情報として入力順に保持してお
き、日本語文に体言止め文が含まれるとき、まず体言止
め文内の時間表現や構文パターンを利用して体言止め文
の時制を解析し、体言止め文内の情報で解析不能である
場合には、さらにその体言止め文の前後の文の構文情報
を用いて時制を解析するルールを参照して体言止め文の
時制を決定するものである。That is, when the syntactic analysis result of a Japanese sentence is input, the syntactic analysis result is held as context information in the order of input, and when the Japanese sentence includes a wording sentence, first the time expression and the syntax in the wording sentence are written. If a pattern is used to analyze the tense of a word stop sentence, and if the information in the word stop sentence cannot be analyzed, a rule to analyze the tense using the syntactic information of the sentences before and after the word stop sentence is used. The tense of the sentence stop sentence is determined by reference.

【００２０】[0020]

【作用】請求項１の発明は、体言止めの文末文節の性質
（動詞・名詞）に基づいて文中に省略されている体言止
めの時制を判定し、さらに、入力された文脈も参照する
ため、文節の性質と文章の前後関係の両面からの解析が
可能となる。請求項２の発明は、文末文節の性質により
解析する場合に、その体言止め文の文末文節が動詞述語
または名詞述語で終止しているかを判定することによ
り、その後の体言止めの省略された時制の解析を述語の
性質毎に振り分けて解析する。これにより、動詞述語及
び名詞述語独特の表現を区別して解析することが可能と
なる。According to the invention of claim 1, since the tense of the word stop which is omitted in the sentence is determined based on the property (verb / noun) of the sentence ending clause of the word stop, and the input context is also referred to. It is possible to analyze from both aspects of bunsetsu character and context of sentences. According to the invention of claim 2, when the sentence ending clause is analyzed based on the property of the sentence ending clause, it is determined whether the sentence ending clause of the sentence ending sentence ends with a verb predicate or a noun predicate, so that the subsequent sentence tense is omitted. The analysis of is distributed according to the property of the predicate. This makes it possible to distinguish and analyze expressions unique to verb predicates and noun predicates.

【００２１】請求項３及び請求項５の発明は、体言止め
文に出現する時間表現を相対的な表現であっても取得
し、時間表現パターンと照合することにより、時制を解
析することが可能である。これにより、主節の時制だけ
でなく従属節の時制をも考慮した、即ち従属節の時制と
主節の時制の基準の違いを考慮して、省略された体言止
めの時制を解析することができる。According to the inventions of claims 3 and 5, it is possible to analyze the tense by acquiring the time expression appearing in the injunction sentence even if it is a relative expression and matching it with the time expression pattern. Is. As a result, it is possible to analyze the abbreviated tense of tense in consideration of not only the tense of the main clause but also the tense of the subordinate clause, that is, considering the difference between the tense of the dependent clause and the tense of the main clause. it can.

【００２２】請求項４の発明は、体言止め文節を動詞述
語文か名詞述語文に分け、各々の構文構造毎に解析する
ため、物や事柄に関して言及する表現と動作に関して言
及する表現とを区別して解析することが可能である。請
求項５の発明は、抽出された体言止め文が名詞述語文で
ある場合には、抽出された体言止め文が複文か単文かを
判定し、さらに従属節の時制、時間表現が主節、従属節
のいずれを修飾するかを判定する構文解析、時間表現の
パターン解析を行うことにより体言止め文の時制を解析
する。さらに、抽出された体言止め文節が動詞述語文で
ある場合には、当該体言止め文節と同じ名詞が登録され
ている既然名詞記憶手段に存在するかを照合し、当該す
るものが存在すれば当該名詞の時制を引用して当該体言
止め文節の時制を解析することができる。According to the fourth aspect of the invention, since the wording clause is divided into a verb predicate sentence or a noun predicate sentence and analyzed for each syntactic structure, an expression that refers to an object or a matter and an expression that refers to an action are separated. It is possible to analyze separately. According to the fifth aspect of the invention, when the extracted wording sentence is a noun predicate sentence, it is determined whether the extracted wording sentence is a compound sentence or a simple sentence, and the tense and the time expression of the subordinate clause are the main clauses. We analyze the tense of a word-stop sentence by performing syntactic analysis to determine which subordinate clause is modified and pattern analysis of time expression. Furthermore, when the extracted wording clause is a verb predicate sentence, it is checked whether the same noun as the wording clause is present in the registered noun storage means, and if the corresponding word exists, the The tense of the noun phrase can be analyzed by quoting the tense of the noun.

【００２３】上記の請求項１から請求項５の解析装置
は、文内の情報を利用して時制を解析したが、請求項６
の発明は、文内情報で解析が不能である場合に、入力さ
れた構文解析結果の文脈を参照して解析する。処理対象
の日本語文章を構文解析された結果が入力されると、解
析制御手段により入力された構文情報を文脈情報記憶手
段に登録すると共に、体言止め文を含む場合に、文脈情
報記憶手段の内容を参照して、体言止め文の前後の文の
時制を取得して、当該体言止め文の時制を判定する。The analysis device according to any one of claims 1 to 5 uses the information in the sentence to analyze the tense.
According to the invention, when the in-sentence information cannot be analyzed, the sentence is analyzed by referring to the context of the input syntactic analysis result. When the result of the syntactic analysis of the Japanese sentence to be processed is input, the syntactic information input by the analysis control means is registered in the context information storage means, and if the sentence stop sentence is included, the context information storage means By referring to the contents, the tenses of the sentences before and after the sentence stop sentence are acquired, and the tense of the sentence stop sentence is determined.

【００２４】請求項７〜１１の発明は、１つの文内情報
を利用して時制を解析するものであり、主節と従属節の
時制の関連性により文中に時制を表現するキーワードが
なくとも時制の解析が可能となる。請求項１２の発明
は、請求項７から１１の発明のように、文中において時
制を解析するものではなく、入力された複数の構文解析
の結果により体言止めの前後の構文との文脈の関連性に
より時制の解析を行うため、文中の情報から時制を解析
できない場合に有効である。According to the inventions of claims 7 to 11, the tense is analyzed using one in-sentence information, and even if there is no keyword expressing the tense in the sentence due to the relation between the tense of the main clause and the subordinate clause. It enables tense analysis. According to the invention of claim 12, unlike the inventions of claims 7 to 11, the tense is not analyzed in the sentence, but the relevance of the context with the syntax before and after the wording is stopped according to the results of a plurality of input syntactic analyzes. Since the tense is analyzed by, it is effective when the tense cannot be analyzed from the information in the sentence.

【００２５】請求項１３の発明は、体言止め文の前文の
構文情報と時制解析規則を用いて時制の解析ができない
場合に、次文の構文情報と時制解析規則に記述された各
記憶の制約条件との照合を行うことによって体言止め文
の時制を解析するものである。このように、１つの文内
の情報に基づいて時制解析を行うのみならず、体言止め
文の前後の文脈により時制表現が省略されている文の時
制を解析することが可能となる。According to the thirteenth aspect of the present invention, when the tense analysis cannot be performed using the syntactic information of the preceding sentence and the tense analysis rule of the impersonation sentence, the constraint of each memory described in the syntactic information of the next sentence and the tense analysis rule. By analyzing the tense of the word-stop sentence by comparing with the condition. In this way, it becomes possible not only to analyze the tense based on the information in one sentence, but also to analyze the tense of a sentence in which the tense expression is omitted due to the contexts before and after the word stop sentence.

【００２６】[0026]

【実施例】以下、図面と共に本発明の実施例を詳細に説
明する。［第１の実施例］本装置へ入力される日本語文章は、構
文解析処理済みのものとする。構文解析は漢字と仮名で
べた書きされた日本語文の単語単位への分割を行う形態
素解析の処理結果を入力として、日本語文を構成する文
節の修飾、被修飾の関係を解析するとともに、その修飾
関係の種類も認定する処理である。通常、日本語文書処
理システムには、形態素解析及び構文解析の処理機能が
具備されており、その結果が入力されるものとする。Embodiments of the present invention will now be described in detail with reference to the drawings. [First Embodiment] It is assumed that the Japanese sentence input to this device has been subjected to a syntactic analysis process. The syntactic analysis uses the result of morphological analysis, which divides a Japanese sentence written in Kanji and Kana into word units, and analyzes the relations between the clauses and the modifiers that compose the Japanese sentence. This is a process of recognizing the type of relationship. Generally, a Japanese document processing system is provided with processing functions of morphological analysis and syntactic analysis, and the results thereof are input.

【００２７】図３は、本発明の第１の実施例のシステム
構成図である。同図に示すシステムは、文内情報利用型
時制解析部１００、時間表現パターンテーブル１４０、
既然名詞テーブル１５０及び解析制御部２００より構成
される。図４は、本発明の第１の実施例の日本語文時制
解析装置の構成を示す。同図に示す日本語文時制解析装
置は、図３における文内情報利用型時制解析部１００に
対応する。FIG. 3 is a system configuration diagram of the first embodiment of the present invention. The system shown in the figure includes an in-sentence information use type tense analysis unit 100, a time expression pattern table 140,
It is composed of a preexisting noun table 150 and an analysis control unit 200. FIG. 4 shows the configuration of the Japanese sentence tense analyzing apparatus according to the first embodiment of the present invention. The Japanese sentence tense analysis device shown in the figure corresponds to the sentence information utilizing type tense analysis unit 100 in FIG.

【００２８】日本語文時制解析装置は、日本語文章が入
力されると、当該入力文より文末文節が名詞で終止して
いる体言止め文を抽出する体言止め抽出部１１０、体言
止め文が動詞述語文か名詞述語文かを判定する述語判定
部１１５、入力文中の時間表現とその種類を出力する時
間表現抽出部１２０、述語判定部１１５の判定結果に基
いて、動詞述語文または、名詞述語文のそれぞれの体言
止め文節の省略された時制を解析し、出力する述語文時
制解析部１３０、時を表す表現パターンとその分類情報
を有する時間表現パターンテーブル１４０、過去のこと
を表す名詞を格納した既然名詞テーブル１５０から構成
される。The Japanese sentence tense analysis device, when a Japanese sentence is input, extracts a sentence stop sentence in which the sentence ending clause ends with a noun from the input sentence. A predicate determination unit 115 that determines whether a sentence or a noun predicate sentence, a time expression extraction unit 120 that outputs the time expression in the input sentence and its type, and a verb predicate sentence or a noun predicate sentence based on the determination result of the predicate determination unit 115. The predicate sentence tense analysis unit 130 that analyzes and outputs the abbreviated tense of each wording stop clause, the time expression pattern table 140 having the expression pattern representing time and its classification information, and the noun representing the past are stored. It is composed of the noun table 150.

【００２９】体言止め抽出部１１０は、入力された文章
より句点「。」を除いた文末文節の最終単語が名詞か否
かをチェックし、名詞で終止している最終単語があれば
体言止め文と判定する。述語判定部１１５は、体言止め
文と判定された場合には、最終文節を修飾する全ての文
節を対象として、副詞や時詞を含む文節があるか、接続
助詞を含む文節や動詞の格要素となる文節があるかをチ
ェックし、あれば当該体言止め文を動詞述語文、なけれ
ば名詞述語文と判定する。The verbal stop extraction unit 110 checks whether or not the final word of the sentence ending clause excluding the punctuation "." From the input sentence is a noun, and if there is a final word ending with a noun, the verbal stop sentence is detected. To determine. When the predicate determination unit 115 determines that the sentence is a sentence stop sentence, the predicate determination unit 115 targets all the clauses that modify the final clause and includes a clause that includes an adverb or a verb, or a case element that includes a clause or a verb. It is determined whether there is a clause that becomes, and if there is, the subject stop sentence is determined to be a verb predicate sentence, and if not, it is determined to be a noun predicate sentence.

【００３０】時間表現抽出部１２０は、体言止め抽出部
１１０において、体言止め文が抽出された場合に、処理
対象文の構文解析結果情報と時間表現パターンテーブル
１４０とを照合し、全ての時間表現をその分類情報とと
もに出力する。述語文時制解析部１３０は、述語判定部
１１５において、動詞述語文と判定された体言止め文に
対しては動詞述語文時制解析部１３１を、名詞述語文と
判定された体言止め文に対しては名詞述語文時制解析部
１３２を適用し、その結果を出力する。The time expression extraction unit 120 collates the syntactic analysis result information of the processing target sentence with the time expression pattern table 140 when the word expression stop extraction unit 110 extracts a word expression stop sentence, and all time expressions are expressed. Is output together with the classification information. The predicate sentence tense analysis unit 130 uses the verb predicate sentence tense analysis unit 131 for the noun sentence that is determined to be a verb predicate sentence by the predicate determination unit 115, and the noun sentence sentence that is determined to be a noun predicate sentence. Applies the noun predicate sentence tense analysis unit 132 and outputs the result.

【００３１】動詞述語文時制解析部１３１は、動詞述語
文と判定された体言止め文に対して、既然名詞テーブル
１５０、構文解析結果情報及び時間表現抽出結果を用い
て、その構文構造と時間表現の出現パターンから、体言
止め文節の省略された時制を判定する。名詞述語文時制
解析部１３２は、名詞述語文と判定された体言止め文に
対して、構文解析結果情報及び時間表現抽出結果を用い
てその構文構造と、体言止め文が時間について陳述され
た文であるかの解析及び時間表現の出現パターンから、
体言止め文節の省略された時制を判定する。The verb predicate sentence tense analysis unit 131 uses the noun table 150, the syntactic analysis result information, and the time expression extraction result for the verbal predicate sentence determined to be a verbal predicate sentence. From the appearance pattern of, the tense in which the wording stop clause is omitted is determined. The noun predicate sentence tense analysis unit 132 uses the syntactic analysis result information and the time expression extraction result for the noun predicate sentence determined as a noun predicate sentence, and a sentence in which the noun predicate sentence is described with respect to time. From the analysis of whether or not and the appearance pattern of the time expression,
Judge the tense in which the word-stop clause is omitted.

【００３２】図５は、本発明の第１の実施例の体言止め
抽出部及び述語判定部の動作のフローチャートを示す。（ステップ１１０）体言止め抽出部１００は、入力さ
れた日本語文章より句点を除く最終文節の単語が名詞か
否かを判定し、名詞であれば、体言止め文として抽出す
る。一方、名詞以外であれば当該日本語文章は体言止め
文ではないため、ステップ１５０に移行する。FIG. 5 shows a flow chart of the operation of the wording stop extraction unit and the predicate determination unit of the first embodiment of the present invention. (Step 110) The word stop extractor 100 determines whether or not the word in the final phrase excluding the punctuation from the input Japanese sentence is a noun, and if the word is a noun, it is extracted as a word stop sentence. On the other hand, if it is other than a noun, the Japanese sentence is not a sentence stop sentence, so the process proceeds to step 150.

【００３３】（ステップ１２０）体言止め抽出部１１
０により、最終文節の単語が名詞であると判定された場
合には、述語判定部１１５は、当該名詞がサ変型名詞で
あるかを判定し、サ変型名詞以外である場合にはステッ
プ１６０に移行する。（ステップ１３０）最終文節の単語がサ変型名詞であ
る場合に、抽出された体言止め文中の文末文節に係る文
節の中に、副詞、時詞があるかを判定し、存在する場合
にはステップ１７０に移行する。(Step 120) The phrase stop extracting unit 11
If 0 determines that the word in the final phrase is a noun, the predicate determining unit 115 determines whether the noun is a sa variant noun, and if it is not a sa variant no, proceeds to step 160. Transition. (Step 130) If the word in the final phrase is a sa variant noun, it is determined whether or not there is an adverb or a verb in the phrase related to the end phrase in the extracted sentence stop sentence. Move to 170.

【００３４】（ステップ１４０）上記ステップ１３０
において、文末文節に係る文節の中に副詞、時詞が存在
していない場合には、文末文節に係る文節の中に、文節
末単語が副助詞または格助詞「から」「で」「へ」
「と」「に」「より」「を」、または接続助詞のものが
あるかを判定する。ここで副助詞、格助詞、接続助詞の
何れもない場合にはステップ１６０に移行し、何れかが
存在する場合にはステップ１７０へ移行する。(Step 140) Step 130
In, when there is no adverb or verb in the phrase related to the end-of-sentence phrase, the end-of-sentence word is a sub-particle or a case particle “from”, “de”, or “to” in the phrase related to the end-of-sentence phrase.
It is determined whether there is a "to", "ni", "yori", "wo", or a connection particle. If none of the auxiliary particle, the case particle, or the connecting particle is present, the process proceeds to step 160, and if any of them exists, the process proceeds to step 170.

【００３５】（ステップ１５０）上記のステップ１１
０において、最終単語が名詞以外の品詞である場合に
は、体言止め文には該当しない文章であるとの判定結果
を出力する。（ステップ１６０）上記ステップ１２０において、述
語判定部１１５は、抽出された体言止め文の最終単語が
サ変型名詞でない場合、及び、ステップ１４０におい
て、文末文節に係る文節中の文節末単語が副助詞、格助
詞、接続助詞であるものが何れも存在していない場合に
は、名詞述語文の体言止め文であると判定する。(Step 150) Step 11 above
In 0, when the final word is a part of speech other than a noun, the determination result that the sentence is not a sentence stop sentence is output. (Step 160) In the above step 120, the predicate determination unit 115 determines that the final word of the extracted sentence-fixing sentence is not a SA variant noun, and in step 140, the phrase ending word in the phrase related to the sentence ending phrase is a sub particle. If none of the case particles and the connecting particles are present, it is determined that the noun predicate sentence is a sentence stop sentence.

【００３６】（ステップ１７０）上記ステップ１３０
において、文末文節に係る文節の中に副詞、時詞が存在
する場合、及び、ステップ１４０において、文末文節に
係る文節中の文節末単語が副助詞、格助詞、接続助詞で
あるものが何れか存在している場合には、動詞述語文の
体言止め文であると判定する。次に、入力された日本語
文章が体言止め文である場合に、時間表現抽出部１２０
は、当該体言止め文の省略された時制を解析するために
使用する情報として、当該体言止め文中から時間を表す
表現とその分類パターンを、時間表現パターンテーブル
１４０を参照して抽出する。(Step 170) The above step 130
In the case where there is an adverb or a verb in the phrase related to the end-of-sentence phrase, and in step 140, the end-of-sentence word in the phrase related to the end-of-sentence phrase is any of a sub-particle, a case particle, and a connective particle. If it exists, it is determined to be a verbal predicate sentence stop sentence. Next, when the input Japanese sentence is a sentence stop sentence, the time expression extraction unit 120
Refers to the time expression pattern table 140 and extracts, as information used to analyze the omitted tense of the wording sentence, an expression representing time and its classification pattern from the wording sentence.

【００３７】図６は、本発明の第１の実施例の時間表現
パターンテーブル１４０の概念図を示す。時間表現パタ
ーンテーブル１４０は、時間を表す典型的な表現を単語
連鎖パターンの形式で格納したテーブルであり、各時間
表現パターンと、その表現パターンが過去を表すか未来
を表すか、及び「昨年」「明日」のように現在を基準に
した相対的な時を表すのか、西暦などのように絶対的な
時を表すのかといった分類情報から構成されている。FIG. 6 is a conceptual diagram of the time expression pattern table 140 according to the first embodiment of this invention. The time expression pattern table 140 is a table that stores typical expressions representing time in the form of a word chain pattern, and each time expression pattern, whether the expression pattern represents the past or the future, and “last year”. It is composed of classification information such as whether it represents a relative time based on the present such as "tomorrow" or an absolute time such as the Christian era.

【００３９】次に、上記述語判定部１１５が当該体言止
め文を動詞述語文と判定した場合に、体言止め文の省略
された時制を解析する述語文時制解析部１３０の動作に
ついて説明する。まず、動詞述語文時制解析部１３１で
使用する既然名詞テーブル１５０について説明する。Next, the operation of the predicate sentence tense analysis unit 130 for analyzing the tense in which the infix word is omitted when the above descriptive word determination unit 115 determines that the injunction sentence is a verb predicate sentence will be described. First, the existing noun table 150 used by the verb predicate sentence tense analysis unit 131 will be described.

【００４０】図７は、本発明の第１の実施例の既然名詞
テーブルの概念図を示す。既然名詞テーブル１５０は、
「指摘」や「歴任」のように体言止めとして用いられる
場合、つねに既然（過去）を表す名詞をリストにしたテ
ーブルであり、動詞述語文時制解析部１３１において、
体言止め文の文末文節の名詞と比較することにより体言
止めの省略された時制の解析を可能にしている。FIG. 7 is a conceptual diagram of the noun table in advance according to the first embodiment of the present invention. The noun table 150 already exists
In the case of being used as a word stop such as "point out" or "replacement", it is a table that lists nouns that always represent the past (past). In the verb predicate sentence tense analysis unit 131,
By comparing it with the noun of the end sentence of the word-stop sentence, it is possible to analyze the tense in which word-stop is omitted.

【００４１】図８は、本発明の第１の実施例の動詞述語
文時制解析部の動作を示すフローチャートを示す。（ステップ３１１）述語判定部１１５が、体言止め文
が動詞述語文であると判定すると、動詞述語文時制解析
部１３１は、時間表現抽出部１２０により抽出された当
該体言止め文中の時間表現の中に、文末文節を修飾し、
過去を表すものがあるかを判定し、存在する場合にはス
テップ３１６へ移行する。FIG. 8 is a flow chart showing the operation of the verb predicate sentence tense analysis unit of the first embodiment of the present invention. (Step 311) When the predicate determination unit 115 determines that the verbal predicate sentence is a verb predicate sentence, the verb predicate sentence tense analysis unit 131 includes the temporal expressions in the verbal predicate sentence extracted by the time expression extraction unit 120. To modify the end clause,
It is determined whether or not there is something that represents the past, and if it exists, the process proceeds to step 316.

【００４２】（ステップ３１２）ステップ３１１によ
り、当該体言止め文中の時間表現の中に、文末文節を修
飾し、過去を表すものが存在しない場合には、動詞述語
文時制解析部１３１は、文末の名詞と同一の名詞が既然
名詞テーブル１５０に登録されているかを判定し、登録
されている場合にはステップ３１６へ移行する。（ステップ３１３）ステップ３１２により、当該体言
止め文の文末の名詞と同一の名詞が既然名詞テーブル１
５０に登録されていない場合には、動詞述語文時制解析
部１３１は、体言止め文が複文であるか単文であるかを
判定し、単文である場合にはステップ３１７へ移行す
る。(Step 312) In step 311, the verb predicate sentence tense analysis unit 131 modifies the sentence end phrase in the time expression in the sentence stop sentence, and when there is no thing indicating the past, the verb predicate sentence tense analysis unit 131 It is determined whether the same noun as the noun is already registered in the noun table 150, and if it is registered, the process proceeds to step 316. (Step 313) By step 312, the same noun as the noun at the end of the sentence stop sentence is already in the noun table 1.
If it is not registered in 50, the verb predicate sentence tense analysis unit 131 determines whether the wording stop sentence is a compound sentence or a simple sentence, and if it is a simple sentence, the process proceeds to step 317.

【００４３】（ステップ３１４）上記ステップ３１３
において、動詞述語文時制解析部１３１が当該体言止め
文が複文であると判定した場合には、体言止め文の従属
節の述語が現在形であり、上記時間表現抽出部１２０に
おいて抽出された時間表現の中に従属節の述語を修飾
し、かつ絶対過去を表すものがあるかを判定し、存在す
る場合にはステップ３１６へ移行する。(Step 314) Above Step 313
In the case where the verb predicate sentence tense analysis unit 131 determines that the incomprehension sentence is a compound sentence, the predicate of the subordinate clause of the incomprehension sentence is the present tense and the time extracted by the time expression extraction unit 120. It is judged whether or not there is an expression that modifies the predicate of the subordinate clause and that represents the absolute past, and if there is, the process proceeds to step 316.

【００４４】（ステップ３１５）上記ステップ３１４
により、動詞述語文時制解析部１３１が、体言止め文の
従属節の述語が過去形であり、上記時間表現抽出部１２
０において抽出された時間表現の中に従属節の述語を修
飾し、かつ絶対過去を表すものがないと判定した場合に
は、当該体言止め文の従属節の述語が過去形であり、上
記時間表現抽出部１２０において抽出された時間表現の
中に従属節の述語を修飾し、かつ相対未来を表すものが
あるかを判定し、存在しない場合にはステップ３１７
へ、存在する場合にはステップ３１６へ移行する。(Step 315) The above step 314
As a result, the verb predicate sentence tense analysis unit 131 determines that the predicate of the subordinate clause of the noun sentence is the past tense, and the time expression extraction unit 12
If the predicate of the subordinate clause is modified in the time expressions extracted in 0 and there is no representation of the absolute past, then the predicate of the subordinate clause of the said word stop sentence is the past tense, and the time The expression extracting unit 120 determines whether or not the time expressions extracted are those that modify the predicate of the subordinate clause and represent the relative future. If they do not exist, step 317.
If it exists, the process proceeds to step 316.

【００４５】（ステップ３１６）上記ステップ３１１
において、抽出された時間表現の中に文末文節を修飾
し、過去を表すものが存在する場合、または、ステップ
３１２において、文末の名詞と同一の名詞が既然名詞テ
ーブル１５０に登録されている場合、または、ステップ
３１４において、体言止め文の従属節の述語が現在形で
あり、抽出された時間表現の中に従属節の述語を修飾
し、かつ絶対過去を表すものが存在する場合、又は、上
記ステップ３１５において、体言止め文の従属節の述語
が過去形であり、抽出された時間表現の中に従属節の述
語を修飾し、かつ相対未来を表すものがある場合には、
動詞述語文時制解析部１３１は、当該体言止め文の省略
された時制は過去であると判定する。(Step 316) The above step 311
In the case where a sentence end phrase is modified in the extracted time expressions to represent the past, or when the same noun as the sentence end noun is already registered in the noun table 150 in step 312, Alternatively, in step 314, if the predicate of the subordinate clause of the improper sentence is the present tense, and there is an expression that modifies the predicate of the subordinate clause and represents the absolute past in the extracted time expression, or In step 315, when the predicate of the subordinate clause of the impersonation sentence is the past tense, and the extracted temporal expressions modify the predicate of the subordinate clause and represent the relative future,
The verb predicate sentence tense analysis unit 131 determines that the omitted tense of the wording sentence is past.

【００４６】（ステップ３１７）上記ステップ３１３
において、体言止め文が単文である場合、または、記ス
テップ３１５において、当該体言止め文の従属節の述語
が過去形であり、抽出された時間表現の中に従属節の述
語を修飾し、かつ相対未来を表すものが存在しない場合
には、動詞述語文時制解析部１３１は、当該体言止め文
の省略された時制は現在であると判定する。(Step 317) The above step 313
In step 315, the predicate of the subordinate clause of the substantive sentence is a past tense, and the predicate of the subordinate clause is modified in the extracted time expression, and When there is no one that represents the relative future, the verb predicate sentence tense analysis unit 131 determines that the omitted tense of the instant word stop sentence is the present.

【００４７】図９は、本発明の第１の実施例の名詞述語
文時制解析部の動作を示すフローチャートを示す。（ステップ３２１）名詞述語文時制解析部１３２は、
通語判定部１１５により、当該体言止め文が名詞述語文
と判定された場合、上記時間表現抽出部１２０により抽
出された当該体言止め文中の時間表現の中に、過去を表
すものがあるかを判定し、過去を表すものが存在しない
場合にはステップ３２４へ移行する。FIG. 9 is a flow chart showing the operation of the noun predicate sentence tense analysis unit of the first embodiment of the present invention. (Step 321) The noun predicate sentence tense analysis unit 132
When the language determination unit 115 determines that the wording stop sentence is a noun predicate sentence, whether the time expressions in the wording stop sentence extracted by the time expression extraction unit 120 include a past expression. If it is determined that there is nothing that represents the past, the process proceeds to step 324.

【００４８】（ステップ３２２）名詞述語文時制解析
部１３２は、当該体言止め文中の時間表現の中に過去を
表すものが存在している場合には、その体言止め文が時
間について陳述した文であるかを判定し、時間について
陳述した文である場合にはステップ３２４へ移行する。
ここで、時間について陳述した文とは、Ｘが時間表現で
あり、Ａ、Ｂ、＊を任意の表現とした時、「Ａは／が
＊Ｘ。」「＊Ｘは／がＢ。」のパターンの文
のことである。(Step 322) The noun predicate sentence tense analysis unit 132, if there is a word representing the past in the time expressions in the word stop sentence, the word stop sentence describes the time. It is determined whether there is a statement, and if the statement is a statement of time, the process proceeds to step 324.
Here, the statement that states time is X is a time expression, and when A, B, and * are arbitrary expressions, "A is / ga * X.""* X is / ga B." It is a sentence of a pattern.

【００４９】（ステップ３２３）上記ステップ３２２
において、当該体言止め文が時間について陳述した文で
ない場合には、当該体言止め文が補文標識「の」または
「こと」を有する複文であり、その従属節の述語が過去
形であるかを判定する。ここで、当該体言止め文が補文
標識「の」または「こと」を有する複文でありその従属
節の述語が過去形である場合にはステップ３２４へ移行
する。(Step 323) The above step 322
In, if the sentence-fixing sentence is not a statement that states time, then whether the sentence-fixing sentence is a compound sentence with the complement sentence indicator “of” or “thing” and whether the predicate of the subordinate clause is the past tense judge. Here, if the sentence stop sentence is a compound sentence having the complement sentence indicator "no" or "koto" and the predicate of the subordinate clause is the past tense, the process proceeds to step 324.

【００５０】（ステップ３２４）名詞述語文時制解析
部１３２は、上記ステップ３２１において、当該体言止
め文中の時間表現の中に過去を表すものがない場合、ま
た、上記ステップ３２２において、当該体言止め文が時
間について陳述した文である場合、上記ステップ３２３
において、当該体言止め文が補文標識「の」または「こ
と」を有する複文であり、その従属節の述語が過去形で
ある場合に、当該体言止め文の省略された時制は「現
在」と判定する。(Step 324) The noun predicate sentence tense analysis unit 132 determines in the above step 321 that the time expression in the sentence stop sentence does not indicate the past, and in the step 322, the sentence stop sentence. Is a statement stating time, step 323 above
In, in the case where the injunction sentence is a compound sentence having the complement sentence indicator “of” or “koto” and the predicate of the subordinate clause is the past tense, the omitted tense of the injunction sentence is “present”. judge.

【００５１】（ステップ３２５）上記ステップ３２３
において、当該体言止め文が補文標識「の」、「こと」
を有する複文でないか、従属節の述語が過去形でない場
合には、当該体言止め文の省略された時制は「過去」と
判定する。以下に、４つの体言止め文の例について、時
制解析の具体例を示す。文１「第２次世界大戦の終戦は１９４５年。」文２「北の湖が引退したのは、本人の決断。」文３「首相はインフレには繋がらないと強調。」文４「軍部は１８９０年までに返還される領土に軍事
工場を建設。」〔第１の例〕（文１）入力する日本語文章として「文１」を使用する場合、体
言止め抽出部１１０は、図１０（Ａ）に示す構文解析の
結果情報が入力される。例えば（（”第２次世界大戦
の” ”終戦は” ［名詞−の−名詞］） ”１９４５
年。” ［名詞−は−名詞］）の形式で入力する。(Step 325) The above step 323.
In the above, the injunction sentence is the supplementary sign "no", "koto"
If the sentence is not a compound sentence or the predicate of the subordinate clause is not the past tense, the abbreviated tense of the sentence stop sentence is determined to be “past”. The following is a specific example of tense analysis for four examples of improper sentences. Sentence 1 "The end of World War II was 1945." Sentence 2 "It was his decision to retire Lake of the North." Sentence 3 "The prime minister emphasizes that it will not lead to inflation." Sentence 4 "The military is 1890. Construction of a military factory on the territory returned by year. ”[First example] (Sentence 1) When“ Sentence 1 ”is used as the Japanese sentence to be input, the wording stop extraction unit 110 displays the sentence shown in FIG. The result information of the syntax analysis shown in () is input. For example, (("The end of World War II" is "[noun-no-noun])" 1945
Year. "[Noun-wa-noun]).

【００５２】まず、体言止め抽出部１１０は文末文節の
句点を除く最終単語「１９４５年」が名詞かどうかをチ
ェックする（ステップ１１０）と名詞であるため、体言
止め文であるし、最終単語「１９４５年」を含む文を体
言止め文として抽出する。次に、述語判定部１１５は、
当該名詞がサ変型名詞かどうかをチェックする（ステッ
プ１２０）。しかし、サ変型名詞ではないため名詞述語
文と判定する（ステップ１６０）。First, the verbal stop extraction unit 110 checks whether the final word "1945" excluding the punctuation in the sentence ending clause is a noun (step 110). Since it is a noun, it is a verbal stop sentence and the final word " A sentence including “1945” is extracted as a wording stop sentence. Next, the predicate determination unit 115
It is checked whether or not the noun is a satypical noun (step 120). However, since it is not a satypical noun, it is judged as a noun predicate sentence (step 160).

【００５３】次に、時間表現抽出部１２０は、「文１」
中の時間表現及びその分類情報を、構文解析結果情報と
時間表現パターンテーブル１４０とを照合し、図１１
（Ａ）に示す、パターン“数詞＋「年」”の時間表現
「１９４５年」を発見し、その分類情報として「絶対過
去」を取得する。「文１」は述語判定部１１５におい
て、名詞述語文と判定されたため、名詞述語文時制解析
部１３２にて時制が解析される。Next, the time expression extraction unit 120 causes the "sentence 1".
The time expression and the classification information therein are compared with the syntactic analysis result information and the time expression pattern table 140, and FIG.
The time expression "1945" of the pattern "numerical +" year "" shown in (A) is discovered, and "absolute past" is acquired as the classification information. Since "sentence 1" is determined to be a noun predicate sentence by the predicate determination unit 115, the tense is analyzed by the noun predicate sentence tense analysis unit 132.

【００５４】名詞述語文時制解析部１３２は、「文１」
に対して、時間表現抽出部１２０で「１９４５年」が過
去を表す時間表現であるとの判定結果を取得し、次いで
時間について陳述した文であるかをチェックし（ステッ
プ３２２）、時間について陳述した文であると判定され
たため、図１２（Ａ）に示すように、体言止めの省略さ
れた時制は「現在」と判定する（ステップ３２４）。The noun predicate sentence tense analysis unit 132 determines that "sentence 1"
On the other hand, the time expression extraction unit 120 obtains a determination result that "1945" is a time expression representing the past, and then checks whether or not the statement is a statement of time (step 322). Since it is determined that the sentence is a sentence that has been written, as shown in FIG. 12A, the tense in which the wording stop is omitted is determined to be "present" (step 324).

【００５５】〔第２の例〕（文２）入力する日本語文章として「文２」を使用する場合、体
言止め抽出部１１０には図１０（Ｂ）に示す構文解析の
結果情報が入力される。例えば、（（（”北の湖が”
”引退した” ［ガ格］） ”のは” ［補
文］）（”本人の” ”決断。” ［名詞−の−名
詞］）［名詞−は−名詞］）の形式で入力される。[Second Example] (Sentence 2) When "Sentence 2" is used as the Japanese sentence to be input, the syntactic analysis result information shown in FIG. It For example, ((("North Lake is"
"Retired" [ga case]) "is entered in the form of" [supplementary sentence]) ("the person's""decision." [Noun-noun-noun]) [noun-is-noun]).

【００５６】まず、体言止め抽出部１１０は、文末文節
の句点を除く最終単語「決断」は名詞と判定する。次
に、述語判定部１１５は、上記ステップ１２０ではサ変
型名詞と判定する。次に、文末文節を修飾する文節中に
副詞、時詞があるかをチェックし（ステップ１３０）、
「文２」には存在しないので、文末文節に係る文節の中
に文節末単語が副助詞または格助詞または接続助詞のも
のがあるかをチェックする（ステップ１４０）。しか
し、それらは存在しないので名詞述語文と判定する（ス
テップ１６０）。First, the wording stop extracting unit 110 determines that the final word "decision" excluding the punctuation in the sentence end clause is a noun. Next, the predicate determination unit 115 determines that it is a sa variant noun in step 120. Next, it is checked whether or not there is an adverb or a verb in the phrase that modifies the end phrase (step 130),
Since it does not exist in "Sentence 2", it is checked whether or not the end-of-sentence word is a sub-particle, a case particle, or a connecting particle among the clauses related to the end-of-sentence clause (step 140). However, since they do not exist, it is determined as a noun predicate sentence (step 160).

【００５７】次に、時間表現抽出部１２０は、「文２」
中の時間表現及びその分類情報を得るため、構文解析結
果情報と時間表現パターンテーブル１４０とを照合する
が、「文２」には時間表現が存在しないので何も抽出し
ない。「文２」は体言止め抽出部１１０で名詞述語文と
判定しているので、処理を述語文時制解析部１３０の名
詞述語文時制解析部１３２に移す。Next, the time expression extraction unit 120 causes the "sentence 2"
The syntactic analysis result information is collated with the time expression pattern table 140 to obtain the time expression and the classification information therein, but nothing is extracted because the "sentence 2" has no time expression. Since the “sentence 2” is determined to be a noun predicate sentence by the impersonation stopping extraction unit 110, the processing is transferred to the noun predicate sentence tense analysis unit 132 of the predicate sentence tense analysis unit 130.

【００５８】名詞述語文時制解析部１３２は、「文２」
には時間表現がなく（ステップ３２１）ので、「引退し
たのは」は「の」を伴う過去形の従属節なので（ステッ
プ３２３）、体言止めの時制は「現在」と判定する（ス
テップ３２４）。〔第３の例〕（文３）入力する日本語文章として「文３」を使用する場合、体
言止め抽出部１１０はは図１０（Ｃ）に示す構文解析の
結果情報が入力される。The noun predicate sentence tense analysis unit 132 determines that "sentence 2".
Since there is no time expression in (step 321), since "has retired" is a subordinate clause of the past tense accompanied by "no" (step 323), the word tense is judged to be "present" (step 324). . [Third example] (Sentence 3) When "Sentence 3" is used as the Japanese sentence to be input, the wording stop extraction unit 110 receives the result information of the syntax analysis shown in FIG.

【００５９】例えば、（”首相は” （（”インフレに
は” ”繋がらないと” ［格関係（ニ）］）”強
調。” ［引用］）［格関係（ガ）］）の形式で入力
する。For example, input in the form of "(prime minister)"(("" not linked to inflation "" [case relation (d)]] "emphasized." [Quote] [case relation (ga)]) To do.

【００６０】まず、体言止め抽出部１１０は、上記ステ
ップ１１０で文末文節の句点を除く最終単語「強調」は
名詞と判定する。述語判定部１１５は、当該体言止め文
はサ変型名詞と判定する。次に、述語判定部１１５は、
上記ステップ１３０により時詞も副詞もないと判定する
が、上記ステップ１４０で接続助詞を含む文節「繋がら
ないと」があると判定し、動詞述語文と判定する（ステ
ップ１７０）。First, the wording stop extracting unit 110 determines that the final word "emphasis" excluding the punctuation in the sentence ending clause is a noun in step 110. The predicate determination unit 115 determines that the wording stop sentence is a sa variant noun. Next, the predicate determination unit 115
Although it is determined in the above step 130 that there is no verb or adverb, it is determined in the above step 140 that there is a clause "when it is not connected" including the connective particle, and it is determined to be a verb predicate sentence (step 170).

【００６１】次に、時間表現抽出部１２０は、「文３」
中の時間表現及びその分類情報を得るため、構文解析結
果情報と時間表現パターンテーブル１４０とを照合する
が、「文３」には時間表現が存在しないので何も抽出し
ない。「文３」は体言止め抽出部１１０で動詞述語文と
判定しているので、処理を述語文時制解析部１３０の動
詞述語文時制解析部１３１に移す。Next, the time expression extraction unit 120 causes the "sentence 3".
The syntactic analysis result information is collated with the time expression pattern table 140 in order to obtain the time expression and the classification information therein, but nothing is extracted because "sentence 3" has no time expression. Since “statement 3” is determined to be a verb predicate sentence by the verbal stop extraction unit 110, the processing is transferred to the verb predicate sentence tense analysis unit 131 of the predicate sentence tense analysis unit 130.

【００６２】動詞述語文時制解析部１３１は、時間表現
抽出部１２０で抽出した時間表現の中に文末文節を修飾
し、過去を表すものがあるかをチェックする（ステップ
３１１）が、「文３」には時間表現が存在しないので、
文末名詞と同一の名詞が既然名詞テーブル１５０に登録
されているかをチェックする（ステップ３１２）。「文
３」の文末名詞「強調」は既然名詞テーブル１５０に登
録されているので、図１２（Ｃ）に示すように体言止め
の省略された時制は「現在」と判定する（ステップ３１
６）。The verb predicate sentence tense analysis unit 131 modifies the sentence end clause in the time expressions extracted by the time expression extraction unit 120 and checks whether there is any expression indicating the past (step 311). Since there is no time expression in ",
It is checked whether the same noun as the end-of-sentence noun is already registered in the noun table 150 (step 312). Since the sentence end noun "emphasized" of "sentence 3" is already registered in the noun table 150, the tense in which the wording is omitted is determined to be "present" as shown in FIG. 12C (step 31).
6).

【００６３】〔第４の例〕（文４）入力する日本語文章として「文４」を使用する場合、体
言止め抽出部１１０は、図１０（Ｄ）に示す構文解析の
結果情報が入力される。例えば、（”軍部は”
（（（”１８９０年までに” ”返還する” ［格関
係（ニ）］） ”領土に” ［用言−体言］）（”軍事
工場を” ”建設。” ［格関係（ヲ）］）［格関係
（ニ）］））の形式で入力される。[Fourth example] (Sentence 4) When "Sentence 4" is used as the Japanese sentence to be input, the wording stop extracting unit 110 receives the result information of the syntax analysis shown in FIG. It For example, ("Military is"
((("By 1890""Return" [Category (D)]) "On the territory" [Dictionary-Synopsis]) ("Military factory""Construction." [Category (Wo)]) [Case relationship (d)])).

【００６４】まず、体言止め抽出部１１０は、文末文節
の句点を除く最終単語「建設」は名詞と判定する。述語
判定部１１５は、当該体言止め文がサ変型名詞と判定す
る。次に、述語判定部１１５は、上記ステップ１３０に
より時詞も副詞もないと判定されるが、上記ステップ１
４０で格要素なる文節「軍事工場を」があると判定さ
れ、動詞述語文と判定する（ステップ１７０）。First, the wording stop extracting unit 110 determines that the final word "construction" excluding the punctuation in the sentence end clause is a noun. The predicate determination unit 115 determines that the wording stop sentence is a sa variant noun. Next, the predicate determination unit 115 determines that there is no time and adverb in the above step 130, but the above step 1
It is determined that there is a case clause "military factory" in 40, and it is determined to be a verb predicate sentence (step 170).

【００６５】次に、時間表現抽出部１２０は、「文４」
中の時間表現及びその分類情報を、構文解析結果情報と
時間表現パターンテーブル１４０とを照合することで、
図１１（Ｂ）に示す、パターン”数詞＋「年」”の時間
表現「１８９０年」を発見し、その分類情報として「絶
対過去」を取得する。「文４」は体言止め抽出部１１０
で動詞述語文と判定しているので、処理を述語文時制解
析部１３０の動詞述語文時制解析部１３１に移す。Next, the time expression extraction unit 120 causes the "sentence 4".
By comparing the time expression and its classification information in the syntax analysis result information with the time expression pattern table 140,
The time expression "1890" of the pattern "numerical +" year "" shown in FIG. 11B is discovered, and "absolute past" is acquired as the classification information. “Sentence 4” is the word stop extractor 110
Since it is determined to be a verb predicate sentence, the processing is transferred to the verb predicate sentence tense analysis unit 131 of the predicate sentence tense analysis unit 130.

【００６６】動詞述語文時制解析部１３１は、上記ステ
ップ３１１で過去を表す時間表現「１８９０年」は存在
するが文末文節を修飾してなく、上記ステップ３１２に
おいても「建設」は既然名詞テーブル１５０に登録され
ていないと判定する。さらに、「文４」は複文と判定さ
れ、従属節の述語が現在形であり、抽出した時間表現の
中に従属節の述語を修飾し、かつ絶対過去を表すものが
あるかとチェックする（ステップ３１４）。本例では、
従属節「１８９０年までに返還する」が「現在」形で、
時間表現「１８９０年」は絶対過去であり従属節の述語
を修飾しているので、図１２（Ｄ）に示すように、体言
止めの省略された時制は「過去」と判定する（ステップ
３１６）。The verb predicate sentence tense analysis unit 131 has the time expression "1890" representing the past in the step 311, but does not modify the sentence ending clause, and the "construction" also exists in the noun table 150 in the step 312. It is determined that it is not registered in. Furthermore, "statement 4" is determined to be a compound sentence, the predicate of the subordinate clause is the present tense, and it is checked whether or not the extracted time expression modifies the predicate of the subordinate clause and represents the absolute past (step 314). In this example,
The subordinate clause "Return by 1890" is in the "present" form,
Since the time expression "1890" is an absolute past and modifies the predicate of the subordinate clause, the tense in which the wording is omitted is determined to be "past" as shown in FIG. 12 (D) (step 316). .

【００６７】また、本実施例では動詞述語文時制解析部
１３１と名詞述語文時制解析部１３２を述語文時制解析
部１３０として扱ったが、それぞれを分けて扱うことも
可能である。動詞述語文と名詞述語文とでは異なる言語
現象を見せることが多く、言語の分析、研究においては
両者を別の枠組みで扱うことが多い。［第２の実施例］上記第１の実施例では、文内情報を利
用して時制を解析したが、以下に示す第２の実施例で
は、文脈に基づいて時制を解析する例を示す。Further, in this embodiment, the verb predicate sentence tense analysis unit 131 and the noun predicate sentence tense analysis unit 132 are treated as the predicate sentence tense analysis unit 130, but they may be treated separately. Verb predicate sentences and noun predicate sentences often show different linguistic phenomena, and in language analysis and research, they are often treated in different frameworks. [Second Embodiment] In the first embodiment, the tense is analyzed using the in-sentence information, but in the second embodiment described below, an example of analyzing the tense based on the context is shown.

【００６８】図１３は、本発明の第２の実施例のシステ
ム構成示す。同図に示す日本語文時制解析装置は、上記
第１の実施例で説明した文内情報利用型時制解析部１０
０、文脈情報スタック３００、時制解析部４００、時制
解析ルールテーブル５００及び解析制御部２００より構
成される。文脈情報スタック３００は、入力された文章
単位の構文解析結果情報を保持するテーブルである。FIG. 13 shows the system configuration of the second embodiment of the present invention. The Japanese sentence tense analysis device shown in the figure is a tense analysis unit 10 using in-sentence information explained in the first embodiment.
0, context information stack 300, tense analysis unit 400, tense analysis rule table 500, and analysis control unit 200. The context information stack 300 is a table that holds the input sentence-by-sentence syntax analysis result information.

【００６９】時制解析ルールテーブル５００は、体言止
め文の構文パターンとその後文の構文パターンを制約条
件として、体言止め文の時制を解析する規則を記述した
テーブルである。図１４は、本発明の第２の実施例の時
制解析ルールテーブルの概念図を示す。同図に示すよう
に、時制解析ルールテーブル５００は、ルールＩＤ５１
０、体言止め文制約条件５２０、後文制約条件５３０及
び、時制５４０の各項目より構成される。The tense analysis rule table 500 is a table in which a rule for analyzing the tense of a word stop sentence is described with the syntax pattern of the word stop sentence and the syntax pattern of the subsequent sentence as constraints. FIG. 14 is a conceptual diagram of the tense analysis rule table according to the second embodiment of this invention. As shown in the figure, the tense analysis rule table 500 has a rule ID 51
0, an incomprehension sentence constraint 520, a postscript constraint 530, and a tense 540.

【００７０】文内情報利用型時制解析部１００は、１文
単位に体言止め文かどうかを判定すると共に、体言止め
文の場合には、文内の時間情報や体言止めに用いられた
名詞の文法的、意味的な特性を利用して、その時制を解
析するものである。解析制御部２００は、構文解析済の
日本語文章が入力されるものとする。構文解析は、漢字
と仮名でべた書きされた日本語文を単語単位に分割する
形態素解析の処理結果を入力として、日本語文を構成す
る文節の修飾、被修飾の関係を解析すると共に、それが
どのような修飾関係かを認定する処理である。通常、日
本語文書処理システムは、形態素解析及び構文解析の処
理機能を有する。The in-sentence information utilization type tense analysis unit 100 determines whether or not the sentence is a sentence stop sentence on a sentence-by-sentence basis. In the case of a sentence stop sentence, the time information in the sentence or the noun used for the sentence stop is determined. It uses literary and semantic characteristics to analyze the tense. The analysis control unit 200 is assumed to be input with a syntax-analyzed Japanese sentence. The syntactic analysis uses the result of morphological analysis, which divides a Japanese sentence written in Kanji and Kana into word units, as an input and analyzes the relation between the modifiers and the modifiers of the clauses that compose the Japanese sentence. This is the process of recognizing whether such a modification relationship exists. Generally, a Japanese document processing system has processing functions of morphological analysis and syntactic analysis.

【００７１】時制解析部４００は、文脈情報スタック３
００から取得した体言止め文の前後の文の時制情報、時
制解析ルールテーブル５００から取得した体言止め文と
次文との並列性の解析を行い、体言止め文が「現在」を
表しているか、「過去」を表しているかを解析する。次
に、本実施例の概要の動作を説明する。図１５は、本発
明の第２の実施例の概要動作を示すフローチャートであ
る。The tense analysis unit 400 uses the context information stack 3
Whether the tense information before and after the sentence stop sentence acquired from 00 and the parallelism between the sentence stop sentence acquired from the tense analysis rule table 500 and the next sentence are analyzed, and whether the sentence stop sentence represents the “present”, Analyze whether it represents the "past". Next, the operation of the outline of this embodiment will be described. FIG. 15 is a flow chart showing the outline operation of the second embodiment of the present invention.

【００７２】文脈情報スタック３００に文書単位の文脈
がスタックされる（ステップ５０１）。解析制御部２０
０は、構文解析された結果が入力され、文脈情報スタッ
ク３００にスタックされている情報を１文づつ文内情報
利用型時制解析部１００に転送する（ステップ５０
２）。文内情報利用型時制解析部１００は、体言止め文
に用いられた名詞の文法的、及び意味的なものを解析す
る。しかし、文中に時制を解析するためのキーとなる表
現がない場合には、時制は不定となる。従って、文内情
報利用型時制解析部１００は、解析の結果、体言止めが
あり、かつ時制が不定である場合には（ステップ５０
３、Ｙｅｓ）、時制解析部４００に解析結果を転送する
（ステップ５０４）。時制解析部４００は、文脈情報ス
タック３００の内容を更新する（ステップ５０５）。Document-based contexts are stacked on the context information stack 300 (step 501). Analysis control unit 20
In the case of 0, the result of the syntax analysis is input, and the information stacked in the context information stack 300 is transferred one sentence at a time to the sentence information use type tense analysis unit 100 (step 50).
2). The in-sentence information utilization type tense analysis unit 100 analyzes the grammatical and semantic meanings of the nouns used in the body stop sentences. However, if there is no key expression in the sentence to analyze the tense, the tense is indefinite. Therefore, the in-sentence information utilization type tense analysis unit 100, if the result of the analysis indicates that the wording is stopped and the tense is indefinite (step 50).
(3, Yes), the analysis result is transferred to the tense analysis unit 400 (step 504). The tense analysis unit 400 updates the contents of the context information stack 300 (step 505).

【００７３】次に、時制解析部４００の動作を説明す
る。以下の説明では、文内情報利用型時制解析部１００
より解析結果が転送されているものとして説明する。図
１６は、本発明の第２の実施例の時制解析部の動作を示
すフローチャートである。時制解析部４００は、入力さ
れた文が最終文であるかを判定し（ステップ５１１）、
最終文の場合にはステップ５１７に移行する。最終文で
ない場合には、文脈情報スタック３００から体言止め文
及び前後の文の解析結果を取得する（ステップ５１
２）。時制解析部４００は、入力された文が第１文であ
るかを判定し、第１文の場合には、読みとばし、ステッ
プ５１５の処理を行う。第１文以外であれば（ステップ
５１３、Ｎｏ）、前文の時制と後文の時制を比較し、双
方の時制が同一の場合には、ステップ５１６に移行し
（ステップ５１４、Yes)、異なる場合には（ステップ５
１４、No）、体言止め文及び後文の解析結果をキーとし
て時制解析ルールテーブル５００を検索し、時制解析結
果を取得する（ステップ５１５）。また、前文と後文の
時制が同一の場合には、体言止め文の時制は前文及び後
文の時制とする（ステップ５１６）。Next, the operation of the tense analysis unit 400 will be described. In the following description, the sentence information use type tense analysis unit 100 will be described.
The description will be made assuming that the analysis result is transferred. FIG. 16 is a flow chart showing the operation of the tense analysis unit of the second exemplary embodiment of the present invention. The tense analysis unit 400 determines whether the input sentence is the final sentence (step 511),
If it is the final sentence, the process proceeds to step 517. If it is not the final sentence, the parsing result and the analysis results of the preceding and following sentences are acquired from the context information stack 300 (step 51).
2). The tense analysis unit 400 determines whether the input sentence is the first sentence, and if it is the first sentence, skips the reading and performs the process of step 515. If it is not the first sentence (step 513, No), the tense of the preceding sentence and the tense of the latter sentence are compared. If both tenses are the same, the process proceeds to step 516 (Yes in step 514), and if they are different. (Step 5
No. 14, No.), the tense analysis rule table 500 is searched by using the analysis results of the injunction sentence and the after sentence as keys, and the tense analysis result is acquired (step 515). If the tense of the preceding sentence and the tense of the latter sentence are the same, the tense of the sentence stop sentence is the tense of the preceding sentence and the latter sentence (step 516).

【００７４】これにより、解析制御部２００は、取得し
た情報「体言止め時制＝過去」に基づいて文脈情報スタ
ック３００を更新する。〔第１の例〕づづいて、上記の動作を以下の例に適応さ
せた例を示す。処理対象の日本語文章を『政府は米の緊
急輸入を発表した。米作は全国的に記録的な大凶作。食
糧庁では年内の国内需要をまかなうには最低２００万ト
ンの輸入が必要と予測する。』まず、解析制御部２００には、上記の文を構文解析
した結果が入力され、文脈情報スタック３００には図１
７に示す情報がスタックされる。同図に示す例では、第
１文の『政府は米の緊急輸入を発表した』について、主
題は“政府”、時制は“過去”としてスタックされる。As a result, the analysis control unit 200 updates the context information stack 300 based on the acquired information "comprehension tense = past". [First Example] Next, an example in which the above operation is applied to the following example will be described. The Japanese sentence to be processed is "The government has announced an emergency import of rice. Rice crops are record-breaking crops nationwide. The Food Agency estimates that at least 2 million tons of imports will be needed to meet domestic demand within the year. First, the result of syntax analysis of the above sentence is input to the analysis control unit 200, and the context information stack 300 is displayed in FIG.
The information shown in 7 is stacked. In the example shown in the figure, in the first sentence, “The government has announced an emergency import of rice”, the subject is “government” and the tense is “past”.

【００７５】第２文の『米作は全国的に記録的な大凶
作。』について、主題は“米作は”、時制は“不定”と
してスタックされる。第３文の『食料庁では年内の国内
受領を賄うには最低２００万トンの輸入が必要と予測す
る』について、主題は“食糧庁では”、時制は“現在”
としてスタックされる。The second sentence, "American crops are record-breaking crops nationwide. ”, The subject is stacked as“ rice production ”and the tense as“ indefinite ”. Regarding the third sentence, "The Food Agency predicts that at least 2 million tons of imports will be required to cover domestic receipts within the year," the subject is "at the Food Agency," and the tense is "present."
Is stacked as.

【００７６】解析制御部２００は、文脈情報スタッ
ク３００にスタックされている情報を１文づつ文内情報
利用型時制解析部１００に転送する。文内情報利用型時制解析部１００は、転送された各
情報を解析し、図１８に示す解析結果を出力する。同図
の例では、文内情報利用型時制解析部１００は、第１文
について主題は“政府は”、時制を“過去”とし、第２
文については、主題は“米作は”、時制は不定とし、文
章が体言止め文であり、名詞述語文であるとし、第３文
について主題は、“食糧庁では”、時制を“現在”とし
ている。The analysis control unit 200 transfers the information stacked in the context information stack 300 to the in-sentence information use type tense analysis unit 100 one by one. The in-sentence information use type tense analysis unit 100 analyzes each transferred information and outputs an analysis result shown in FIG. In the example of the figure, the in-sentence information use type tense analysis unit 100 sets the subject of the first sentence as “government” and the tense as “past”.
Regarding the sentence, the subject is "rice production", the tense is indefinite, the sentence is a noun sentence, and it is a noun predicate sentence. For the third sentence, the subject is "at the Food Agency" and the tense is "present". I am trying.

【００７７】において、第２文については、体言
止め文であり、名詞述語文であることが解析されている
が、第２文の文中には時制を解析するためのキーとなる
表現がないため、時制は決定されない。そこで、解析制
御部２００は、体言止め文の情報を時制解析部４００に
転送する。時制解析部４００は、転送された文が第２文である
ので、文脈情報スタック３００より前文である第１文と
後文の第３文を取得して、時制を比較する。この場合の
第１文の時制は“過去”であり、第３文の時制は“現
在”であることから、体言止め文（第２文）と後文（第
３文）の情報をキーとして、時制解析テーブル５００を
検索する。In regard to the second sentence, it is analyzed that it is a noun sentence sentence and a noun predicate sentence, but there is no key expression for analyzing tense in the sentence of the second sentence. , The tense is not decided. Then, the analysis control unit 200 transfers the information of the wording stop sentence to the tense analysis unit 400. Since the transferred sentence is the second sentence, the tense analysis unit 400 acquires the first sentence, which is the preceding sentence, and the third sentence, which is the latter sentence, from the context information stack 300, and compares the tenses. In this case, the tense of the first sentence is "past", and the tense of the third sentence is "present". Therefore, the information of the sentence stop sentence (second sentence) and the latter sentence (third sentence) is used as a key. , The tense analysis table 500 is searched.

【００７８】この場合、体言止め文が名詞述語文
で、構文が動詞述語文であることから、規則Ｍ２にマッ
チして、「体言止め文の時制＝前文の時制」となる結果
が得られ、前文の時制が“過去”であるから、“体言止
め文の時制＝過去”となる。〔第２の例〕以下の第２の例について説明する。『世銀
は１７年ぶりにイスラエルへの融資を再開した。イスラ
エルはこれを道路設備、ダム建設などの公共事業に活
用。約２０年続いた内乱からの脱却を目指す。』まず、解析制御部２００には、上記の文を構文解析
した結果が入力され、文脈情報スタック３００には図１
９に示す情報がスタックされる。同図に示す例では、第
１文の『世銀は１７年ぶりにイスラエルへの有しを再開
した。』について、主題は“世銀は”、時制は“過去”
としてスタックされる。In this case, since the verbal stop sentence is a noun predicate sentence and the syntax is a verb predicate sentence, a result that “the tense of the verbal stop sentence = the tense of the preceding sentence” is obtained by matching the rule M2. Since the tense of the preceding sentence is "past", the tense of the wording sentence is "past". [Second Example] The following second example will be described. “The World Bank resumed lending to Israel for the first time in 17 years. Israel uses this for public works such as road facilities and dam construction. Aiming to break free from the civil war that lasted for about 20 years. First, the result of syntax analysis of the above sentence is input to the analysis control unit 200, and the context information stack 300 is displayed in FIG.
The information shown in 9 is stacked. In the example shown in the figure, the first sentence "The World Bank resumed its ownership in Israel for the first time in 17 years. ”, The subject is“ World Bank ”and the tense is“ past ”
Is stacked as.

【００７９】第２文の『イスラエルはこれを道路設備、
ダム建設などの公共事業に活用。』について、主題は
“イスラエルは”、時制は“不定”としてスタックされ
る。第３文の『約２０年続いた内乱からの脱却を目指
す。』について、時制は“現在”としてスタックされ
る。解析制御部２００は、文脈情報スタック３００にス
タックされている情報を１文づつ文内情報利用型時制解
析部１００に転送する。The second sentence, "Israel uses this as road equipment,
Used for public works such as dam construction. , The subject is stacked as “Israel is” and the tense is “indefinite”. The third sentence, "Aiming to escape from the civil war that lasted for about 20 years. ”, The tense is stacked as“ present ”. The analysis control unit 200 transfers the information stacked in the context information stack 300 one sentence at a time to the in-sentence information utilization type tense analysis unit 100.

【００８０】文内情報利用型時制解析部１００は、
転送された各情報を解析し、図２０に示す解析結果を出
力する。同図の例では、文内情報利用型時制解析部１０
０は、第１文について主題は“世銀は”、時制を“過
去”とし、第２文については、主題は“イスラエル
は”、時制は“不定”とし、文章が体言止め文であり、
動詞述語文であるとし、第３文について時制を“現在”
としている。The sentence information use type tense analysis unit 100
Each transferred information is analyzed and the analysis result shown in FIG. 20 is output. In the example of the figure, the in-sentence information utilization type tense analysis unit 10
0 is the subject of the first sentence is “World Bank”, the tense is “past”, and the second sentence is the subject of “Israel” and the tense is “indefinite”.
Assume that it is a verb predicate sentence, and the tense is “present” for the third sentence.
And

【００８１】において、第２文については、体言
止め文であり、動詞述語文であることが解析されるが、
第２の文中には時制を解析するためのキーとなる表現が
ないため、時制は決定されず、体言止めの情報は時制解
析部４００に転送される。時制解析部４００は、体言止め文が第１文でも最終
文でもないので、文脈情報スタック３００から前文（第
１文）と後文（第３文）の情報を取得して、その時制を
比較する。前文（第１文）が“過去”で後文（第３文）
が“現在”であることから、体言止め文と後文の情報を
キーとして時制解析ルールテーブル５００を検索する。In regard to the second sentence, it is analyzed that it is a verbal predicate sentence and a verb predicate sentence.
Since there is no key expression for analyzing the tense in the second sentence, the tense is not determined, and the wording stop information is transferred to the tense analysis unit 400. The tense analysis unit 400 obtains the information of the preceding sentence (first sentence) and the succeeding sentence (third sentence) from the context information stack 300, because the improper sentence stopping sentence is neither the first sentence nor the final sentence, and compares the tenses. To do. Preamble (first sentence) is "past" and post sentence (third sentence)
Is “present”, the tense analysis rule table 500 is searched using the information of the wording stop sentence and the after sentence as a key.

【００８２】この場合、体言止め文が動詞述語文で
「イスラエルは」という主題表現を持ち、後文（第３
文）も動詞述語文であることから、規則Ｄ４にマッチし
て、「体言止めの時制＝後文の時制」となる結果が得ら
れ、後文の時制が「現在」であるから「体言止め文の時
制＝現在」となる。なお、本発明は、上記の第１の実施
例及び第２の実施例に限定されることなく、特許請求の
範囲内で種々、変更・応用が可能である。In this case, the injunction sentence is a verb predicate sentence and has a subject expression of “Israel”, and
(Sentence) is also a verb predicate sentence, it matches rule D4 and obtains the result that "tense tense of postponement = tense of postscript", and the tense of postscript is "present" The sentence tense = present ". The present invention is not limited to the above-mentioned first and second embodiments, and various modifications and applications are possible within the scope of the claims.

【００８３】[0083]

【発明の効果】上述のように、本発明によれば、従属節
中の時制と主節（文末）の時制の相互作用を利用して時
制を解析することができるため、文中に時間を表す表現
がない場合においても体言止め文の省略された時制の解
析が可能となる。また、従属節中の絶対的な時間を表す
表現と相対的な時間を表す表現を区別した上で時制解析
を行うので、従属節と主節の時制の基準の違いを考慮し
た解析が可能となり、従来解析することができなかった
体言止め文の省略された時制を精度よく解析することが
可能となる。As described above, according to the present invention, since the tense can be analyzed by utilizing the interaction between the tense in the subordinate clause and the tense in the main clause (end of sentence), the time is expressed in the sentence. Even when there is no expression, it is possible to analyze the tense in which the wording sentence is omitted. In addition, since the tense analysis is performed after distinguishing between the expressions that represent absolute time and the expressions that represent relative time in the subordinate clause, it is possible to analyze in consideration of the difference in the tense standard between the subordinate clause and the main clause. , It becomes possible to analyze the tense in which the wording sentence was omitted, which could not be analyzed in the past.

【００８４】さらに、体言止め文だけでなく、前後の文
の情報を用いて、解析するため、体言止め文内に時制を
示唆する表現がない場合にもその時制を解析することが
できる。また、前後の文の情報をとして、時制だけでな
く、体言止め文との動作の並列性や、時間的推移性も考
慮しているので、体言止め文の前文と後文の時制が異な
るときにも文脈を把握して適切に解析することが可能と
なる。Furthermore, since the analysis is performed using not only the sentence stop sentence but also the information of the sentences before and after, the tense can be analyzed even when there is no expression suggesting the tense in the sentence stop sentence. In addition, since the information of the sentences before and after is taken into consideration not only the tense but also the parallelism of the action with the word stop sentence and the temporal transitivity, when the tense of the sentence before and after the word stop sentence is different. Also, it becomes possible to grasp the context and analyze it appropriately.

[Brief description of drawings]

【図１】本発明の原理構成図である。FIG. 1 is a principle configuration diagram of the present invention.

【図２】本発明の原理を説明するための図である。FIG. 2 is a diagram for explaining the principle of the present invention.

【図３】本発明の第１の実施例のシステム構成図であ
る。FIG. 3 is a system configuration diagram of a first embodiment of the present invention.

【図４】本発明の第１の実施例の日本語文時制解析装置
の構成図である。FIG. 4 is a configuration diagram of a Japanese sentence tense analysis device according to the first embodiment of the present invention.

【図５】本発明の第１の実施例の体言止め抽出部及び述
語判定部の動作を示すフローチャートである。FIG. 5 is a flowchart showing the operation of the wording stop extraction unit and the predicate determination unit according to the first embodiment of this invention.

【図６】本発明の第１の実施例の時間表現パターンテー
ブルの概念図である。FIG. 6 is a conceptual diagram of a time expression pattern table according to the first embodiment of this invention.

【図７】本発明の第１の実施例の既然名詞テーブルの概
念図である。FIG. 7 is a conceptual diagram of an existing noun table according to the first embodiment of this invention.

【図８】本発明の第１の実施例の動詞述語文時制解析部
の動作を示すフローチャートである。FIG. 8 is a flowchart showing an operation of a verb predicate sentence tense analysis unit of the first exemplary embodiment of the present invention.

【図９】本発明の第１の実施例の名詞述語文時制解析部
の動作を示すフローチャートである。FIG. 9 is a flowchart showing an operation of a noun predicate sentence tense analysis unit according to the first embodiment of this invention.

【図１０】本発明の第１の実施例の入力用日本語文章の
構造図である。FIG. 10 is a structural diagram of an input Japanese sentence according to the first embodiment of this invention.

【図１１】本発明の第１の実施例の時間表現抽出結果の
概念図である。FIG. 11 is a conceptual diagram of a time expression extraction result according to the first embodiment of this invention.

【図１２】本発明の第１の実施例の動詞述語文時制解析
部及び名詞述語文時制解析部の処理を示す図である。FIG. 12 is a diagram showing processing of a verb predicate sentence tense analysis unit and a noun predicate sentence tense analysis unit according to the first embodiment of this invention.

【図１３】本発明の第２の実施例のシステム構成図であ
る。FIG. 13 is a system configuration diagram of a second embodiment of the present invention.

【図１４】本発明の第２の実施例の時制解析ルールテー
ブルの例を示す図である。FIG. 14 is a diagram showing an example of a tense analysis rule table according to the second embodiment of the present invention.

【図１５】本発明の第２の実施例の概要動作を示すフロ
ーチャートである。FIG. 15 is a flowchart showing a schematic operation of the second embodiment of the present invention.

【図１６】本発明の第２の実施例の時制解析部の動作の
フローチャートである。FIG. 16 is a flowchart of the operation of the tense analysis unit of the second example of the present invention.

【図１７】本発明の第２の実施例の解析制御部による解
析結果（その１）である。FIG. 17 is an analysis result (No. 1) by the analysis control unit according to the second embodiment of the present invention.

【図１８】本発明の第２の実施例の文内情報利用型時制
解析部の解析結果（その１）である。FIG. 18 is an analysis result (No. 1) of the sentence information utilizing type tense analysis unit according to the second embodiment of the present invention.

【図１９】本発明の第２の実施例の解析制御部による解
析結果（その２）である。FIG. 19 is an analysis result (No. 2) by the analysis control unit according to the second embodiment of the present invention.

【図２０】本発明の第２の実施例の文内情報利用型時制
解析部の解析結果（その２）である。FIG. 20 is an analysis result (No. 2) of the sentence information utilizing type tense analysis unit according to the second embodiment of the present invention.

[Explanation of symbols]

１００第１の解析手段、文内情報利用型時制解析部１１０抽出手段、体言止め抽出部１１５述語判定手段、述語判定部１２０時間表現抽出部１３０構文パターンを利用した時制解析手段、述語文
時制解析部１３１動詞述語文時制解析手段、動詞述語文時制解析
部１３２名詞述語文時制解析手段、名詞述語文時制解析
部１４０時間表現パターンテーブル１５０既然名詞テーブル２００解析制御部３００文脈情報記憶手段、文脈情報スタック４００時制解析部５００時制解析規則、時制解析ルールテーブル５１０ルールＩＤ５２０体言止め文制約条件５３０後文制約条件５４０時制８００第２の解析手段100 first analysis means, in-sentence information utilization type tense analysis section 110 extraction means, word stop extraction section 115 predicate determination means, predicate determination section 120 temporal expression extraction section 130 tense analysis means using syntactic patterns, predicate sentence tense analysis Part 131 Verb predicate sentence tense analysis means, Verb predicate sentence tense analysis part 132 Noun predicate sentence tense analysis means, Noun predicate sentence tense analysis part 140 Temporal expression pattern table 150 Preexisting noun table 200 Analysis control part 300 Context information storage means, context information Stack 400 Temporal analysis unit 500 Temporal analysis rule, tense analysis rule table 510 Rule ID 520 Indicative sentence constraint condition 530 Postscript constraint condition 540 Temporal 800 Second analysis means

Claims

[Claims]

1. A Japanese sentence tense analysis device for analyzing the tense of a Japanese sentence in a Japanese sentence processing system for inputting and processing a Japanese sentence, wherein the sentence ending clause of the Japanese sentence ends with a noun. The present invention is characterized by having a first analyzing means for analyzing a tense in which a stop sentence is omitted based on the property of the sentence ending clause, and a second analyzing means for analyzing by context information in addition to the first analyzing means. Japanese sentence tense analyzer.

2. The first analyzing means extracts the sentence stop sentence from the Japanese sentence, and the sentence stop sentence that is the last sentence clause of the sentence stop sentence is a verb predicate sentence or a noun predicate sentence. A predicate determining means for determining whether or not, and for each of the verb predicate sentence or the noun predicate sentence determined by the predicate determining means, analyzes the syntactic structure of the Japanese sentence,
A Japanese sentence tense analysis device, comprising: a tense analysis means utilizing a syntactic pattern for outputting a tense analysis result of the intelligence sentence.

3. A tense analysis unit using the syntactic pattern extracts a time expression pattern storage unit storing an expression pattern representing time, and a time expression appearing in the word stop sentence extracted by the extraction unit. 3. The time expression extraction means for collating with the time expression pattern storage means and outputting classification information as to whether the time expression expresses an absolute or relative tense. Japanese sentence tense analyzer.

4. The predescriptive word determination means is an adverb in all clauses related to the noun of the wording stop clause,
A judgment means for deciding whether the phrase stop phrase is used as a verb predicate sentence or a noun predicate sentence depending on the presence or absence of a phrase including a verb, a connecting particle or a verb case element. The Japanese sentence tense analyzer described in item 2.

5. The tense analysis means using the syntactic pattern, wherein the noun storage means stores a noun representing the past, and the predescription word determination means determines that the noun phrase is a verb predicate sentence. In, by analyzing the syntactic structure of the noun phrase, the type of nom phrase, the time expression and the classification information based on the matching result of the noun of the noun phrase and the existing noun storage means, Verb predicate sentence tense analysis means for analyzing the tense in which the word stop clause is omitted, and the syntactic structure of the word stop sentence when the word stop clause is determined to be a noun predicate sentence by the predescription word determination means The noun predicate sentence tense analyzing means for analyzing the tense in which the wording clause is omitted by analyzing the time expression and the classification information. Language sentence tense analyzer.

6. The second analysis means is a context information storage means for holding information in sentence units, and a rule for analyzing the tense of the word stop from the syntactic pattern of the word stop sentence and the subsequent sentence. And the tense analysis rule described above, the analysis result of the sentence stop sentence acquired by the first analysis unit is collated with the context information storage unit and the tense analysis rule, and the predicate of the sentence stop sentence and the next sentence The Japanese sentence tense analyzing apparatus according to claim 1, further comprising: tense analyzing means for analyzing the tense of the sentence stop sentence based on parallelism and transitivity of movement.

7. A Japanese sentence tense analysis method for analyzing the tense of a Japanese sentence in a document processing system for inputting and processing a Japanese sentence, wherein the sentence ending sentence in the Japanese sentence ends with a noun. A Japanese sentence tense analysis method characterized by analyzing a tense in which a stop sentence is omitted according to the property of the last sentence clause.

8. A nomination sentence in which the last sentence clause ends with a noun is extracted from the Japanese sentence, and whether the nomination sentence clause, which is the last sentence clause of the last sentence sentence, is a verb predicate sentence or a noun predicate sentence. 8. The Japanese sentence tense analysis method according to claim 7, further comprising: analyzing the syntactic structure of the verbal stop sentence for each of the verb predicate sentence and the noun predicate sentence, and outputting the result of tense analysis of the verbal predicate sentence. .

9. A classification of whether the time expression expresses an absolute or relative tense by extracting a time expression appearing in the word stop sentence after the extraction of the word stop and comparing it with a predetermined pattern. The Japanese sentence tense analysis method according to claim 8, which outputs information.

10. A phrase which is a case element of a verb or a verb including adverbs, verbs, connecting particles, among all the phrases related to the noun of the grammatical stop phrase when determining the type of the grammatical stop phrase. 9. The Japanese sentence tense analysis method according to claim 8, wherein it is determined whether the incomplete phrase is used as a verb predicate sentence or a noun predicate sentence depending on the presence or absence of.

11. A pre-existing noun memory that stores a noun of the noun phrase and a noun representing the past when it is determined that the noun phrase is a verb predicate sentence when the tense of the predescription word sentence is analyzed. Based on the syntactical structure of the body language stop sentence, the time expression in the body language stop sentence and the classification information, the body language stop clause is omitted by collating the means and analyzing whether the same noun is registered. The tense is analyzed, and when it is determined that the wording sentence is a noun predicate sentence, the wording clause is omitted based on the syntactic structure of the word sentence stopping sentence, the time expression, and the classification information. The Japanese sentence tense analysis method according to claim 8, wherein the tense is analyzed.

12. The Japanese sentence tense analysis method according to any one of claims 7 to 11, wherein a tense that cannot be analyzed because the Japanese sentence does not include an expression indicating a tense in the Japanese sentence is used as a tense before and after the sentence stopping sentence. The Japanese sentence tense analysis characterized by analyzing the tense of the above word stop sentence by analyzing the parallelism of the predicates and the transitivity of the actions of the above word stop sentence and the after sentence of the above word stop sentence Method.

13. The syntactic analysis result of the Japanese sentence is input, the syntactic analysis result is held as context information in the order of input, and when the Japanese sentence includes a wording sentence sentence, sentences before and after the word sentence sentence sentence are included. The tense of is analyzed by referring to the context information, and when it is not parseable, the syntactic information of the sentence stop sentence and the sentence after the sentence stop and the rule for analyzing the tense are matched and matched. 13. The Japanese sentence tense analysis method according to claim 12, wherein the tense is the tense of the sentence stop sentence.