JPH01287776A

JPH01287776A - Syntax analyzing method

Info

Publication number: JPH01287776A
Application number: JP63117265A
Authority: JP
Inventors: Yuji Sugano; 祐司菅野; Kenji Nagao; 健司長尾; Kenichi Ueda; 謙一上田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1988-05-13
Filing date: 1988-05-13
Publication date: 1989-11-20

Abstract

PURPOSE:To reduce the output of an unintentional interpretation by deforming morpheme analytic results into a group of morphemes which have plural start positions and end positions and analyzing the group, and obtaining a syntax analytic result. CONSTITUTION:Respective morphemes in plural morpheme sequences generated as a result of the morpheme analysis of an input sentence 1 are given virtual positions representing the conjunction relations among the morphemes and morphemes which share the virtual positions and 'part of speech' used for syntax analysis are put in one group. A set 5 of groups of morphemes obtained as mentioned above has its syntax analyzed by a parallel type syntax analyzing means 6 to obtain syntax analysis results. Consequently, the order of plural interpretations is easily determined and the output of an unintentional interpretation as a 1st interpretation is reduced.

Description

【発明の詳細な説明】産業上の利用分野本発明はワードプロセッサー等で利用される。[Detailed description of the invention] Industrial applications The present invention is used in word processors and the like.

入力された文の文構造や意味を解析する構文解析方法に
関するものである。It relates to a syntactic analysis method that analyzes the sentence structure and meaning of input sentences.

従来の技術以下１文章として日本語を用いたものを例にとって説明
する。2. Description of the Prior Art The following describes an example using Japanese as one sentence.

機械翻訳システムや文書構成システム等１日本語文章の
文構造や意味を理解して動作する計算機応用システムで
は、日本語文の構文構造を解析する必要がある。Computer-applied systems such as machine translation systems and document composition systems that operate by understanding the sentence structure and meaning of a single Japanese sentence need to analyze the syntactic structure of the Japanese sentence.

従来計算言語学の分野では、そのような文解析の手法が
研究されており１例えば「講座現代の言語第７巻・言語
の機械処理」（長尾真編、三省堂刊）や、「日本語情報
処理」（長尾真監修、電子通信学会発行）などの文献に
は、これまでに開発された従来の文解析手法のうちの代
表的なものが紹介されている。しかし、これらの中で述
べられている構文解析手法は、いずれも、１つの形態素
解析結果を受けと２て、その構文構造を解析するもので
ある。Conventionally, in the field of computational linguistics, such sentence analysis methods have been researched. Literature such as ``Processing'' (edited by Makoto Nagao, published by the Institute of Electronics and Communication Engineers) introduces representative conventional sentence analysis methods developed to date. However, all of the syntactic analysis methods described in these methods receive a single morphological analysis result and analyze its syntactic structure.

以下、第２図を参照して従来の構文解析方法について説
明する。The conventional syntax analysis method will be described below with reference to FIG.

第２図は従来の構文解析方法の一実施例を示すブロック
図である。第２図において、１はべた書きされた漢字か
な混じり表記の入力文、２は形態素の認定に用いる電子
化辞書、３は、電子化辞書２を検索して、与えられた漢
字かな混じり文の文字列から、形態素の候補を認定する
辞書検索手段、４は入力文１を受は取り、辞書検索手段
３を使用して、可能な形態素列を１つ１つ求め、出力す
る直列型形態素解析手段、５は直列型形態素解析手段４
の出力する個々の形態素列、６は形態素列５を１個受は
取って可能な構文構造をすべて求める並列型構文解析手
段、７は並列構文解析手段６が１つの形態素列５に対し
て求めた、全ての構文木（ツリー）の列、８は次の解釈
を求めるために。FIG. 2 is a block diagram showing one embodiment of a conventional syntax analysis method. In Figure 2, 1 is an input sentence written in solid kanji/kana mixed notation, 2 is an electronic dictionary used for morpheme recognition, and 3 is a search for the electronic dictionary 2 to find the given kanji/kana mixed sentence. Dictionary search means 4 identifies morpheme candidates from a character string; 4 receives input sentence 1; uses dictionary search means 3 to find and output possible morpheme sequences one by one; serial morphological analysis; 5 is a serial morphological analysis means 4
The individual morpheme sequences outputted by , 6 is a parallel syntactic analysis means that receives one morpheme sequence 5 and calculates all possible syntactic structures, and 7 is the parallel syntax analysis means 6 that calculates all possible syntactic structures for one morpheme sequence 5 In addition, column 8 of all syntax trees is used to obtain the following interpretation.

直列型形態素解析手段４に対し、入力文１の、次の形態
素解析結果を求めて出力するよう指示する次解釈提示指
令である。This is a next interpretation presentation command that instructs the serial morphological analysis means 4 to obtain and output the next morphological analysis result of the input sentence 1.

μ上のような構成において、臥下その動作について説明
する。In a configuration like the one on μ, the operation of lying down will be explained.

べた書きされた漢字かな混じり表記の入力文１は、形態
素解析手段４に読み込まれる。直列型形態素解析手段４
は辞書検索手段３によって、入力文１０部分文字列に表
記が一致する形態素を検索し、隣接する形態素相互の文
法的な接続可否を判断し、形態素列５として適切である
と思われるものを系統的に求め、解が１個見つかると、
その結果を形態素列５として出力して処理を中止する。An input sentence 1 written in solid letters with a mixture of kanji and kana is read into the morphological analysis means 4. Serial type morphological analysis means 4
The dictionary search means 3 searches for a morpheme whose notation matches the input sentence 10 partial string, determines whether adjacent morphemes can be grammatically connected to each other, and systematically selects morphemes that are considered to be appropriate as the morpheme string 5. When one solution is found,
The result is output as morpheme sequence 5 and the process is stopped.

形態素列５が出力されると、並列型構文解析手段６が起
動される。並列型構文解析手段６は、形態素列５を読み
込み、構文規則にしたがって、形態素列５の全ての可能
な構文構造を求める。When the morpheme sequence 5 is output, the parallel parsing means 6 is activated. The parallel syntactic analysis means 6 reads the morpheme sequence 5 and finds all possible syntactic structures of the morpheme sequence 5 according to syntactic rules.

もし、構文構造の数が０個、すなわち、形態素列５は、
構文規則上、文として解釈できない場合には、並列型構
文解析手段６は直列型形態素解析手段４へ、次の形態素
解析結果を出力するよう指令を出す。指令を受けた直列
型形態素解析手段４は、中止していた処理を再開し、次
の結果を求めて形態素列５として出力する。形態素解析
結果が残っており、構文解析が失敗する限り、上記のよ
うに、形態素解析結果を１個ずつ次々と取り出しては、
構文解析を試みる。という動作を繰り返す。If the number of syntactic structures is 0, that is, the morpheme sequence 5 is
If it cannot be interpreted as a sentence due to syntactic rules, the parallel syntactic analysis means 6 issues a command to the serial morphological analysis means 4 to output the next morphological analysis result. Upon receiving the command, the serial morphological analysis means 4 resumes the suspended process, obtains the next result, and outputs it as a morpheme sequence 5. As long as the morphological analysis results remain and syntactic analysis fails, take out the morphological analysis results one by one as described above.
Attempt to parse. Repeat this action.

その過程で、もし、構文構造が１個以上求まった場合に
は、並列型構文解析手段６は求まった全ての構文構造を
、構文木の列７として出力し、動作を中止する。一方、
前記の繰り返しの過程で、もう、これμ上、形態素解析
結果はない（全ての解釈を尽くした）場合には、他には
、構文構造がない旨を外部に通知して、処理を終了する
。この他、この構文解析結果を利用する外部の装置から
、他の解釈を提示するための要求が、次解釈提示指令８
として入力された場合には、直列型形態素解析手段４に
１次の形態素解析結果を出力する指令が出され、前記の
繰り返し動作を行って１次の構文木の列７を求める。従
って、入力文１の全ての解釈を１次解釈提示指令８を何
度か出すことにより。In the process, if one or more syntactic structures are found, the parallel syntactic analysis means 6 outputs all the found syntactic structures as a sequence 7 of the syntax tree and stops the operation. on the other hand,
In the process of repeating the above, if there are no more morphological analysis results on this μ (all interpretations have been exhausted), notify the outside that there is no other syntactic structure and terminate the process. . In addition, a request to present another interpretation from an external device that utilizes this parsing result is issued by the Next Interpretation Presentation Command 8.
, a command is issued to the serial morphological analysis means 4 to output the result of the first-order morphological analysis, and the above-described repetitive operation is performed to obtain the sequence 7 of the first-order syntax tree. Therefore, all interpretations of the input sentence 1 are issued by issuing the primary interpretation presentation command 8 several times.

求めることが出来る。You can ask for it.

発明が解決しようとする課題しかし、μ上のような方法では、個々の形態素解析結果
毎に構文木の列を求めるため、個々の形態素解析結果の
中で、最良の構文木を求めることは比較的容易だが、複
数個の形態素解析結果の中から、構文・意味構造を加味
して、最もよい構文木をｎコ（ｎ≧１）求める。という
処理は記憶容量、処理効率の両面で、実現が困難である
という課題があった。また、多くの応用システムでは。Problems to be Solved by the Invention However, in the method described above, a sequence of syntactic trees is obtained for each individual morphological analysis result, so it is difficult to obtain the best syntactic tree among the individual morphological analysis results. It is easy to do this, but from among multiple morphological analysis results, the best n parse trees (n≧1) are determined by taking syntactic and semantic structures into consideration. This process is difficult to implement in terms of both storage capacity and processing efficiency. Also, in many application systems.

構文木の列７が１つ求ま・った時点で、その列の中に、
筆者（あるいは話者）の意図した解釈があるものと仮定
してμ後の処理を進めるが、この場合には、直列型形態
素解析手段４が、最初に出力する形態素列５が、入力文
１を受は取った後で最初に出力する形態素列５の精度が
高いことが前提条件となるが、形態素解析の段階では、
複数の解釈に対して、精度の高い順序付けをすることは
困難であり（そのような高精度の順序付けのためには。When one column 7 of the syntax tree is found, in that column,
The processing after μ is performed on the assumption that there is an interpretation intended by the author (or the speaker). The prerequisite for Uke is that the first morpheme sequence 5 output after receiving is highly accurate, but at the stage of morphological analysis,
It is difficult to provide a highly accurate ordering for multiple interpretations (for such a highly accurate ordering.

構文・意味解析処理が必要となる）、筆者（あるいは話
者）の意図した解釈ではない解釈がμ後の処理に用いら
れてしまうという課題があった。これらの課題は、分か
ち書きの習慣のない日本語などの言語において、特に重
要なものとなる。(requires syntactic/semantic analysis processing), and there was a problem in that an interpretation that was not the author's (or speaker's) intention was used in the post-μ processing. These issues are particularly important in languages such as Japanese, which do not have the custom of separating text.

第１乃至第３の本発明は上記の課題を解決するもので、
複数の形態素解析結果を複数の位置を持つ形態素の組の
集合に変形し、この集合を従来の並列型の構文解析手法
または、それを改良した方法によって構文解析を行うこ
とにより、複数の解釈の間の順序付けが容易で、意図し
ない解釈を第１の解釈として出力することの少ない構文
解析方法を提供するものである。The first to third present inventions solve the above problems,
By transforming multiple morphological analysis results into a set of morpheme pairs with multiple positions and parsing this set using a conventional parallel parsing method or an improved method, multiple interpretations can be obtained. The object of the present invention is to provide a syntax analysis method in which the ordering between the two is easy and an unintended interpretation is less likely to be output as the first interpretation.

課題を解決するための手段第１の本発明は、入力文の形態素解析の結果生ずる複数
の形態素解析結果を、複数の開始位置と。Means for Solving the Problems A first aspect of the present invention is to analyze a plurality of morphological analysis results generated as a result of morphological analysis of an input sentence with a plurality of starting positions.

複数の終了位置を持つ形態素の組の集合に変形し、前記
形態素の組の集合を構文解析手段によって解析して、前
記複数の形態素解析結果に対する構文解析結果を得るこ
とにより、上記目的を達成するものである。The above object is achieved by transforming the morphemes into a set of morpheme sets having a plurality of end positions, analyzing the set of morpheme sets by a syntactic analysis means, and obtaining a syntactic analysis result for the plurality of morphological analysis results. It is something.

第２の本発明は、第１の本発明における形態素解析結果
の各々が、形態情報、統語情報および意味情報の付加さ
れた日本語の形態素の列であることにより、上記目的を
達成するものである。The second invention achieves the above object by each of the morphological analysis results in the first invention being a sequence of Japanese morphemes to which morphological information, syntactic information, and semantic information are added. be.

第３の本発明は、第１の本発明における形態素解析結果
が、形態情報、統語情報及び意味情報の付加された形態
素をノードとし、形態素間の接続可能性をアークとする
グラフであることにより。The third aspect of the present invention is that the morphological analysis result in the first aspect of the present invention is a graph in which morphemes to which morphological information, syntactic information, and semantic information are added are nodes, and connections between the morphemes are arcs. .

上記目的を達成するものである。This aims to achieve the above objectives.

作　　　　用第１乃至第３の本発明は、上記構成において、例えば入
力文を形態素解析した結果生じる複数の形態素列の各々
の形態素に、形態素間の接続関係を表現する仮想的な位
置を付与し、この仮想的位置と、構文解析で用いる「品
詞」とを共有する形態素を、１つの形態素の組としてま
とめる。こうしてできた形態素の組の集まり（集合）が
、並列型の構文解析手段で構文解析され、複数の構文解
析結果が得られる。なおこれらの構文解析結果を求める
過程において、形態素、構文及び意味を考慮して、結果
に順序を付けることもできる。Effects The first to third aspects of the present invention, in the above configuration, assign a virtual position representing a connection relationship between the morphemes to each morpheme of a plurality of morpheme sequences generated as a result of morphological analysis of an input sentence, for example. , morphemes that share this virtual position and the "part of speech" used in syntactic analysis are grouped together as one morpheme set. The collection (set) of morphemes thus created is parsed by a parallel parsing means, and a plurality of parsing results are obtained. Note that in the process of obtaining these syntactic analysis results, the results can be ordered in consideration of morphemes, syntax, and meanings.

実施例 μ下、本発明の一実施例について１図面を参照しながら
説明する。なお、−例としてべた書きされた漢字かな混
じり表記の日本語文を１文節数最少法の考え方を用いて
形態素解析して、最少文節数をもつ形態素列を全て求め
、これらの形態素列を変形し、分脈自由文法の形式の構
文規則と、並列型の構文解析手段によって、変形した形
態素の組の集合を構文解析し、複数の構文木が埋め込ま
れた解析衣を出力する場合の構文解析方法に付いて述べ
る。EMBODIMENT MU Below, an embodiment of the present invention will be described with reference to one drawing. As an example, we can morphemely analyze a Japanese sentence written in solid letters with a mixture of kanji and kana using the concept of the minimum number of clauses, find all morpheme sequences with the minimum number of clauses, and transform these morpheme sequences. , a syntactic analysis method for parsing a set of deformed morpheme pairs using syntactic rules in the form of a free grammar and a parallel syntactic analysis means, and outputting a parse in which multiple parse trees are embedded. I will explain it accordingly.

第１図は本発明の一実施例における構文解析方法を実施
する装置を中心とするブロック図である。FIG. 1 is a block diagram mainly showing an apparatus for implementing a syntax analysis method according to an embodiment of the present invention.

図において、１１は解析すべき入力文であり、１２は形
態素の表記、読み及び品詞を納めた電子化辞書、１３は
電子化辞書１２を参照して、指定された漢字かな混じり
文字列の最左部分列と１表記カー合致するものを全て検
索する辞書検索手段、１４は形態素列中に含まれる文節
の個数が最少であるような解釈（形態素列）を全て求め
て出力する形態素解析手段、１５は形態素解析手段１４
によって出力される、表記、読み及び品詞の情報を持つ
形態素列の集合であり、１６は形態素列の集合１５の各
々の形態素列の各形態素間の接続可否を表現する複数の
開始位置及び複数の終了位置を算出し、この２種の位置
情報を形態素列の集合１５中の各形態素に付与する位置
情報付与手段、１７は位置情報付与手段１６が出力した
位置情報の付与された形態素列の集合を受は取って１位
置情報及び品詞が同一である全ての形態素をまとめて組
にする形態素まとめ上げ手段、１８は形態素まとめ上げ
手段１７によって出力される形態素の組がいくつか集ま
った、形態素の組の集合であり５１９は形態素の組の集
合１８を受は敗って１分脈自由文法の形式で書かれた構
文規則と、構文規則毎に設けられた優先度算出規則とに
よって、構文的に見て最も確からしい構文構造を予め指
定した個数だけ求める並列型構文解析手段、２０は並列
型構文解析手段１９によって作成され、出力される複数
の構文構造及びそれに伴う形態素列が格納された解析衣
である。なお、並列型構文解析手段１９としては、アー
リー（ｇａｒｌａｙ）の方法や、コーク・カサミ・ヤン
ガー（Ｃｏｃｋｅ−Ｋａｓａｍｉ　−Ｙｏｕｎｇｅｒ　
）の方法などの並列型の構文解析手法を、複数位置情報
を含む入力に対応できるようにしたものを用いてもよい
し、あるいは本実施例のような「グラフ」的な入力を扱
うのに特に適した独自の手法を用いてもよい。In the figure, 11 is an input sentence to be analyzed, 12 is an electronic dictionary containing the notation, pronunciation, and part of speech of morphemes, and 13 is an electronic dictionary containing the notation, pronunciation, and part of speech of morphemes. 14 is a dictionary search means for searching for all the ones that match the left subsequence with a single notation; 14 is a morphological analysis means for finding and outputting all interpretations (morpheme sequences) such that the number of phrases included in the morpheme sequence is the minimum; 15 is a morphological analysis means 14
16 is a set of morpheme sequences having information on orthography, pronunciation, and part of speech output by a position information adding means for calculating the end position and adding these two types of position information to each morpheme in the set 15 of morpheme sequences; 17 is a set of morpheme sequences to which position information outputted by the position information adding means 16 has been added; A morpheme grouping means 18 takes all the morphemes that have the same position information and the same part of speech and groups them together; 519 is a set of morpheme pairs, and 519 is syntactically defined by the syntactic rules written in the form of a one-minute free grammar and the priority calculation rules established for each syntactic rule. Parallel parsing means 20 calculates a predetermined number of most probable syntactic structures, and reference numeral 20 denotes a parsing garment in which a plurality of syntactic structures and associated morpheme sequences created and output by the parallel parsing means 19 are stored. It is. The parallel parsing means 19 may be the Garlay method or the Cocke-Kasami-Younger method.
) can be used to handle input that includes multiple positional information, or a parallel parsing method such as the method described in Specially suited proprietary techniques may also be used.

μ上のような構成において、以下、「くるまでほこをは
こぶ」という文が、入力文１１として与えられた場合を
例にとって解析動作の概略を説明する。In the configuration shown above, an outline of the analysis operation will be explained below, taking as an example the case where the sentence "Kuruma hoko wo hakobu" is given as the input sentence 11.

入力文１１は、　形態素解析手段１４に読み込まれ１文
頭から文末へ向かって、系統的な辞書検索が辞書検索半
没１３を使用して行われ、隣接する形態素カー文法的に
接続し得るかどうかが、内部にある接続テーブルを用い
て行われる。こうして求まったいくつかの形態素列の中
から１文節（分脈自由文法によって形式的に定義される
ものとする。）の個数が最も少ないものが、形態素列の
集合１５として全て出力される。今の場合、形態素列の
集合１５は。The input sentence 11 is read into the morphological analysis means 14, and a systematic dictionary search is performed from the beginning of the sentence to the end using the dictionary search half-input 13 to determine whether adjacent morphemes can be grammatically connected. is done using an internal connection table. Among the several morpheme sequences thus determined, those with the smallest number of clauses (formally defined by a segmental free grammar) are all output as a set 15 of morpheme sequences. In this case, the set of morpheme sequences 15 is.

（「車」「で」「箱」「を」「運ぶ」）。(``car'', ``de'', ``box'', ``wo'', ``carry'').

（「車」「では」「子」「を」「運ぶ」）、（「来る」
「まで」「箱」「を」「運ぶ」）、（「来る」「まで」
「は」「子」「を」「運ぶ」）、（「来る」「間」「で
」「箱」「を」「運ぶ」）。('car', 'de', 'child', 'wo', 'carry'), ('come'
"until""box""to""carry"),("come""until"
``ha'',``child'',``wo'',``carry''),(``come'',``between'',``de'',``box'',``wo'',``carry'').

（「来る」「間」「では」「子」「を」「運ぶ」）。(``come'', ``between'', ``then'', ``child'', ``wo'', ``carry'').

（「繰る」「まで」「箱」「を」「運ぶ」）。(``carry'', ``until'', ``box'', ``wo'', ``carry'').

等１文節数が３個であるような多くの形態素列の集合と
なる。位置情報付与手段１６は、この形態素列の集合１
５を受は取り、各形態素列を先頭から同時に走査し、形
態素間の接続関係を表現する開始位置、終了位置を、各
形態素に付与する。例えば「来る」という形態素の場合
には、「まで」、「間」の２種類の形態素が後続する。This is a set of many morpheme sequences such that the number of clauses is 3. The position information adding means 16 uses this morpheme sequence set 1
5, each morpheme sequence is simultaneously scanned from the beginning, and a start position and an end position expressing the connection relationship between the morphemes are given to each morpheme. For example, in the case of the morpheme "kuru", two types of morphemes follow: "to" and "ma".

入力文の上では「来る」と同一部分列「くる」を共有す
る別の形態素として、「繰る」、「拘侠」の２つの形態
素カーあり、「繰る」には「まで」と「間」が。In the input sentence, there are two other morphemes that share the same substring ``kuru'' with ``kuru'': ``kuru'' and ``keikikyo'', and ``kuru'' has ``ma'' and ``ma''. but.

「拘偵」には「まで」が、それぞれ後続する。従って、
「来る」、「繰る」の開始位置は０、終了立置は１及び
２となり、「拘偵」の開始位置はＯ１終了位置は２．「
間」の開始位置は１．「まで」の開始位置は２に、それ
ぞれ定まる。この位置情報により、「来る」及び「繰る
」には「間」が後続するが、「拘傳」には「間」が後続
しないことが表現されている。このようにして１位置を
付与された形態素列の集合１５が、形態素まとめ上げ手
段１７へ渡される。　形態素まとめ上げ手段１７は、受
は取った形態素列の集合中の、開始立置。``Kantei'' is followed by ``to''. Therefore,
The starting position of ``Kuru'' and ``Kuru'' is 0, the ending position is 1 and 2, and the starting position of ``Kakei'' is O1, and the ending position is 2. "
The starting position of "between" is 1. The starting position of "until" is determined to be 2. This position information expresses that ``kuru'' and ``karu'' are followed by ``ma'', but ``kokuden'' is not followed by ``ma''. The set 15 of morpheme sequences assigned one position in this manner is passed to the morpheme grouping means 17. The morpheme grouping means 17 sets the starting position in the set of morpheme sequences taken.

終了立置及び品詞を共有する全ての形態素を形態素の組
としてまとめ上げ、μ後の構文・意味解析処理では１つ
の形態素として処理を行なう。例えば、先の例では、「
来る」と「繰る」は、共に。All morphemes that share a final position and a part of speech are grouped together as a morpheme set, and processed as one morpheme in the syntactic/semantic analysis process after μ. For example, in the previous example, "
"come" and "kari" are both used together.

開始位置がＯ１終了立置ｈｔ１及び２１品詞が「動詞」
であり、［：（０）、（１、２）、動詞、（「来る」、
「繰る」）〕のような形態素の組になる。Starting position is O1 Ending position ht1 and 21 Part of speech is "verb"
and [: (0), (1, 2), verb, ('come',
It becomes a set of morphemes like ``kuru'')].

このような組を要素とする集合が、形態素の組の集合１
８として出力される。並列型構文解析手段１９は、この
形態素の組の集合１８を１通常の意味の（すなわち入力
文の文字の）位置を複数個持つ形態素の列であると見な
し、内蔵された分脈自由文法の形の構文規則に従い、形
態素の組の中のまとめ上げられた清報（上記の″（「来
る」、「繰る」）”など）等をもとに優先度算出規則を
利用して、最も確からしい構文溝造を、予め指定された
個数だけ求めて、解析表２０として出力する。A set containing such pairs as elements is the set 1 of morpheme pairs.
Output as 8. The parallel syntactic analysis means 19 considers this set of morpheme pairs 18 to be a sequence of morphemes that have multiple positions of normal meaning (that is, the character of the input sentence), and uses the built-in form of the free branch grammar. According to the syntactic rules of , the most probable is determined by using the priority calculation rules based on the information grouped together in the set of morphemes (e.g., ``(come'', ``carry'')'' etc.). A predetermined number of syntactic groove structures are determined and outputted as an analysis table 20.

なお、μ上の説明では日本語を使用した場合について説
明したが、英語、ドイツ語、中国語等。In addition, in the explanation above, we explained the case where Japanese was used, but English, German, Chinese, etc.

その他の各種言語にも適用できる。It can also be applied to various other languages.

発明の効果 μ上の説明のように、本発明によれば、従来の構文解析
方法に比べ、形態素解析で生じる曖昧さをも考慮した複
数の解釈の間の比較・順序付けが容易であり、意図しな
い解釈を第１の解釈として出力することが少ないという
長所がある。特に入力文がべた書きされた。かな表記の
日本語である場合には、形態素解析で生じる曖昧さが多
く、且つ各々の形態素解析結果の間の順序付けが形態素
解析の段階では困難な場合が多いため、特に本発明は有
益である。Effects of the Invention As explained above, according to the present invention, compared to conventional syntactic analysis methods, it is easier to compare and order multiple interpretations, taking into account the ambiguity that occurs in morphological analysis, and it is possible to This has the advantage that an interpretation that does not occur is rarely output as the first interpretation. In particular, the input text was written all over the place. The present invention is particularly useful in the case of Japanese written in kana notation, since there are many ambiguities that arise in morphological analysis, and it is often difficult to order the results of morphological analysis at the stage of morphological analysis. .

[Brief explanation of the drawing]

第１図は本発明による構文解析方法を実施するだめの装
置を中心とする構成を示すブロック図。第２図は従来の構文解析方法を実施するための装置を中
心とする構成を示すブロック図である。１１・・・入力文、１２・・・電子化辞書４１３・・辞
書検索手段、１４・・・形態素解析手段、１５・・形態
素列の集合、１６・位置情報付与手段、１７・・・形態
素まとめ上げ手段、１８・・・形態素の組の集合、１９
・・・並列型構文解析手段、２０・・・解析衣。代理人の氏名　弁理士　中　尾　敏　男　ほか１名第１
図FIG. 1 is a block diagram showing the configuration mainly of an apparatus for implementing the syntax analysis method according to the present invention. FIG. 2 is a block diagram showing a configuration centered on a device for implementing a conventional syntax analysis method. DESCRIPTION OF SYMBOLS 11... Input sentence, 12... Electronic dictionary 413... Dictionary search means, 14... Morphological analysis means, 15... Set of morpheme sequences, 16. Position information adding means, 17... Morphological summary Raising means, 18... Set of morpheme pairs, 19
...Parallel parsing means, 20...Analysis clothes. Name of agent: Patent attorney Toshio Nakao and 1 other person No. 1
figure

Claims

[Claims]

(1) Transform a plurality of morphological analysis results resulting from morphological analysis of an input sentence into a set of morpheme pairs having a plurality of start positions and a plurality of end positions, and convert the set of morpheme pairs by a syntactic analysis means. A syntactic analysis method, characterized in that the syntax analysis results are obtained for the plurality of morphological analysis results.

(2) The syntactic analysis method according to claim 1, wherein each of the morphological analysis results is a sequence of Japanese morphemes to which morphological information, syntactic information, and semantic information are added.

(3) The syntactic analysis method according to claim 1, wherein the morphological analysis result is a graph in which nodes are morphemes to which morphological information, syntactic information, and semantic information have been added, and arcs are connections between morphemes.