JP3605735B2

JP3605735B2 - Natural language semantic analysis processor

Info

Publication number: JP3605735B2
Application number: JP07835595A
Authority: JP
Inventors: 隆男松原; 昌記西谷; 元之伊藤; 秀憲青沢; 朗高木
Original assignee: 株式会社Ｃｓｋ
Priority date: 1995-03-10
Filing date: 1995-03-10
Publication date: 2004-12-22
Anticipated expiration: 2019-12-22
Also published as: JPH08249334A

Description

【０００１】
【産業上の利用分野】
本発明は、入力文が持つ情報を順次文脈情報として蓄積し意味解釈を行う装置、特に、カーナビゲーションの対話処理等において、文脈情報が自然に蓄積できるよう意味解釈を行う自然言語の意味解析処理装置に関するものである。
【０００２】
【従来の技術】
従来、対話に関しては、その意図や話題の認識・利用法に焦点をあてた研究が多くなされている（例えば、竹下敦：「対話のインタラクション構造を用いた話題の認識」、情報処理学会自然言語処理研究会８７−１０（１９９２））。しかし、現状ではそれらの認識の前提となる文脈の構築技術に関して、まだ安定した技術があるとは言い難い。
【０００３】
文脈構築上の問題点として、意味解釈の役割は、入力文の意味を、文体に依存せずに正しく認識し、文脈に整理統合し、問題解決に必要な情報を抽出することにある。また、文解釈の結果構築される「文脈」は、その後の問題解決プロセスを安定に起動できるような均一な形式を持たなければならない。一方、実際の文章を見ると、ある内容が、一文で与えられたり複数文で与えられたりするといった文体上のいわゆる「ぶれ」がしばしば現れる。また、一文内に限ってみても、意味解釈処理の入力となる構文解析木には、ノードの種類や枝ぶりに、相当の多様性が存在するので、意味的には同じ内容の文が、異なった構文構造で表されることがある。
【０００４】
この場合に同じ内容を異なった構造で表現することを許すと、意味のつき合わせや統合を安定に行なうことは不可能である。文内容を意味的に統合し、一つの文脈を構築していくためには、文体や構文解析木の構造的多様性を吸収してやる必要がある。そのためには、（１）意味表現が、文の複雑さや意味内容に依存して変動するような構造を持たない、（２）文の意味解釈結果が、文脈の場の中で、意味内容毎に定まった場所に位置付けられる、ことが必要である。
【０００５】
従来の自然言語処理の手法では、入力文の内容を論理式や動詞フレーム・意味ネット等の形式で整理するのが一般的であった。
【０００６】
【発明が解決しようとする課題】
しかしながら、例えば、動詞フレームには、述語＋表層格（または深層格）という内部構造が存在し、動詞及び格名のバリエーションにより、表現形式の異なる同義表現が多数存在してしまう。例えば、同じ出発地が東京を表す文においても「東京を出発する」あるいは「東京から行く」等の表現の多様性が存在する。意味ネット等の場合でも同様の問題がある。それらの表現は、極論すれば、構文解析木を同程度の複雑さを持つ別種の構造に形式変換したものに過ぎず、構文解析木と同様、表現の多様性の問題を抱えており、文脈の蓄積という観点からみた場合、意味表現としては適切とは言えなかった。
【０００７】
そこで本発明は、特に、例えば、カーナビゲーションのための対話を題材として、文脈情報を自然に蓄積していける意味解析手法を開発した自然言語の意味解析処理装置を提供することを目的とする。
【０００８】
【課題を解決するための手段】
上記目的を達成するために本発明の自然言語の意味解析処理装置は、入力される文の構文解析木を得る構文解析部と、
次のａ）〜ｅ）の処理を行う意味解析部と
ａ）前記構文解析部で解析された構文解析木に含まれる各節を単位として、その部分構造木を［述語＋格助詞句］の枝の集合に分解する、
ｂ）前記ａ）で得られた各枝の「格助詞句」に相当する「格助詞＋格助詞につく名詞」に対して「格助詞」と「格助詞につく名詞」との間に、「格助詞」によって指定される、「格助詞につく名詞」のクラスを表す「属性名詞」と「という」とを補完して［述語＋格助詞＋属性名詞＋「という」＋格助詞につく名詞］に変換する、
ｃ）さらに前記ｂ）で変換された「という」を「〜に等しいところの」と関係節構造に分解して［述語＋格助詞＋属性名詞＋関係代名詞＋格助詞「は」＋「等しい」＋格助詞「に」＋格助詞につく名詞］と同義変形した後、
属性名詞と関係代名詞を反転させて「等しい」を主動詞化し、［述語＋格助詞＋関係代名詞＋属性名詞＋格助詞「は」＋「等しい」＋格助詞「に」＋格助詞につく名詞］と変換する、
ｄ）前記ｃ）で変換された結果を、［＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞＝＜格助詞につく名詞＞］と形式化した後、左辺である［＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞］を、述語が指定する現象概念の属性に対応する名詞一語に置き換えてパラメータとし、右辺である＜格助詞につく名詞＞をパラメータ値とすると共に、「パラメータパラメータ値」対の文脈情報として蓄積する、
ｅ）さらに、前記ｄ）の処理において蓄積される「パラメータパラメータ値」対を、それ以前に前記ｄ）の処理によりすでに蓄積された「パラメータパラメータ値」対と照合し、該照合した結果に基づきパラメータの属性により定まる文脈情報の一定の位置への書き込み処理を行い蓄積する、
前記構文解析部と意味解析部とを統括制御し入力文の取り込みと出力文の出力の制御を行う制御部とを備えたものである。
【０００９】
地図データに基づいて地図情報操作を行う地図情報操作部をさらに備え、前記意味解析部において、地図上の実体を指す名詞や移動現象に言及する節が現れた場合には、地図情報操作部を介して地図データを参照し、名詞のレファレントの同定や移動過程のシミュレーションを行うことが、時空間的情報が絡む場合に、現象の流れや空間的なつながりを補完でき連続性の取り扱いの問題に部分的に対処できる点で好ましい。
【００１０】
前記制御部は、文の入出力を制御するとともに、前記構文解析部、意味解析部を統括制御する対話制御部と、制御ルールを格納する制御ルール部と、問題解決ルールを格納する問題解決ルール部とを有することが、カーナビゲーションのための対話システムを構築できる点で好ましい。
【００１１】
【作用】
本発明は、構文解析部で解析された構文解析木を、意味解析部で節単位に分割して入力し［述語＋格助詞句］の枝毎に分析し現象ごとにユニークに定まるパラメータ値対の形に整理することで、述語による枝の変動を吸収し、構造を均一化することができ、文脈の蓄積を可能とする。
【００１２】
【実施例】
以下、本発明を図示の一実施例により具体的に説明する。図１は本発明の実施例の自然言語の意味解析処理装置の構成を示すブロック図、図２は本発明の自然言語の意味解析処理装置による節内容のモデルの現象概念を示す図、図３は本発明の自然言語の意味解析処理装置による文脈のモデル例を示す図である。
【００１３】
まず、本発明の問題解決のための基本的手法について説明する。文を構成する命題レベルの基本単位は節であるので、入力文の意味解釈は節を単位とするのが自然である。この場合に、節レベルの表現の構造上の多様性の要因は、（ａ）述語・名詞を修飾する枝の組合せタイプや数の多様性、及び（ｂ）述語・名詞の意味の多様性である。構文解析木もしくは、それと同種の構造上の多様性を内包する意味表現では、前記の（ａ）及び（ｂ）を原因とする構造の多様性から逃れることは不可能なので、まず、節を〔述語＋格助詞句〕という枝の集合に分解し、各枝をそれぞれ他の枝とは独立に解釈することを考える。これにより述語の解釈は複数回行うことになるが、枝を個別に解釈するためにそのことは容認する。また、実際には、各枝の間に何らかの意味的従属関係が存在することもある。このパラメータ間に依存関係が存在する場合には、後述するようにそのパラメータに対し、その依存関係に従って他のパラメータの値を参照または算出するような付加手続きを与えることにするが、これについての詳細は後述する。
【００１４】
しかし、節を枝に分解したところで、前記の（ｂ）の述語・名詞の意味の多様性の問題は残る。そこで、各枝［述語＋格助詞＋格助詞につく名詞］を［＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞＝＜格助詞につく名詞＞］と同義変換し、さらに、この＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞を、現象によって一意に定まり、述語に依存しない名詞一語、例えば「〜から行く」であれば、それを「出発地」とするなどで表す解釈方法を導入する。これにより、述語による枝の変動を吸収し、構造を均一化することができる。ここで、［］は、記述の対象とする部分構文木の範囲もしくはまとまりを表す。また、＜＞は、句または節のまとまりを表す。＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞と言う表現において、内側の＜＞は連体修飾節のまとまりを、また外側の＜＞は属性名詞をヘッドとする名詞句のまとまりを表す。＝の左辺は、「等しい」の主格成分を、右辺は、補格成分を表す。
【００１５】
本発明では、この「述語＋格＋属性名詞」を表す名詞のことを、パラメータと呼び、その格につく名詞のことをパラメータ値と呼ぶ。パラメータ間に依存関係が存在する場合には、そのパラメータに対し、その依存関係に従って他のパラメータ値を参照または算出するような付加手続きを与えることにした。すなわち、本発明では、節の内容を図２に示すように、構造的に最大限に単純化したフラットな構造で表すことができる。ここで、節という単位を表すために、パラメータ値群を囲む述語に相当する枠を用意したが、この枠は文脈情報としては本質的なものではない。このような意味表現形式では、「東京を出発する」の文も「東京から行く」の文も、共に「出発地」＝「東京」と整理することができる。また、これらの各パラメータの、文脈の場の中での位置づけをあらかじめ全て決定しておくことにより、入力されてきた節の各枝の情報を文脈のどこに書き込めばよいかは一意に定まることになる。以上により、前述の（１）意味表現が、文の複雑さや意味内容に依存して変動するような構造を持たない、また（２）文の意味解釈結果が、文脈の場の中で、意味内容毎に定まった場所に位置付けられる、という条件が満たされ、節の内容が文脈上の特定の位置に安定に書き込まれることにより、文脈情報が蓄積されることになる。
【００１６】
例えば、「居平交差点を左折、檜枝岐川沿いに沼田街道を南西に走り、会津高原の八総から約１時間で会津最奥の檜枝岐温泉に出る。」という文の場合、図３に示すような文脈モデルが形成されることになる。
【００１７】
上記では、時空間的情報が絡む問題については無視してきたが、実際の文章では、例えば「東京を出て、宇都宮を通り、日光へ行きたい。」のように、出来事の時空間的な断面が断片的に述べられるだけで、統合に際して、その間の現象の流れや空間的つながりを補完しなければならないことがある。こうした時空間的連続性の取り扱いの問題を一般的に解決することは、現状では困難であるが、本発明では、空間的情報については、道路・領域・施設情報等を保持した地図情報を持つことで、部分的に対処することにする。移動や存在に関する文が入力された場合には地図上でシミュレーションやレファレントの検索を行なうなどして、統合を試みる。前述の例文「東京を出て、宇都宮を通り、日光へ行きたい」の場合、シミュレーションにより東京から日光に至る経路が求められ、それに基づいて、一つの移動現象として統合されることになる。
【００１８】
次に、本発明の自然言語の意味解析処理装置をナビゲーション用対話システムとして構成した実施例について説明する。
【００１９】
図１において、本実施例のナビゲーション用対話システムは、文を入力するための入力装置１及び文を出力したり地図情報を出力する出力装置２と、入力文の構文解析と意味解析を行う言語処理部１０と、地図情報の管理を行う地図管理部２０と、入力装置１、出力装置２、言語処理部１０及び地図管理部２０を総括制御する制御部３０とから構成されている。
【００２０】
上記入力装置１は、制御部３０の制御のもとに文を取り込む装置であり、例えば、音声認識により文を入力し形態素解析処理を施す部分である。また、出力装置２は、制御部３０の制御のもとに文あるいは地図情報を出力する部分であり、例えば、文を音声合成により出力したり、あるいはディスプレイに文あるいは地図を表示する部分である。上記言語処理部１０は、構文解析するための文法解析辞書１２に基づいて入力装置１で入力された文の構文を解析し解析木を得る構文解析部１１と、節単位に分割されて入力される解析木を意味情報辞書１４に基づいて述語＋格助詞句の枝ごとに分析し、パラメータ値対の形に整理し、地図上の実体を指す名詞や移動現象に言及する節が現れた場合には、地図管理部２０の地図情報を参照し、名詞のレファレントの同定や移動過程のシミュレーションを行い、各パラメーター値対の内容を文脈の場に次々と照合、またはパラメータにより定められた文脈情報上の一定の場所に書き込む処理を行う（重ね書きする）意味解析部１３とからなる。上記地図管理部２０は、地図情報を格納する地図データ２２に基づいて地図情報操作を行う地図情報操作部２１からなる。また上記制御部３０は、入力装置１、出力装置２、構文解析部１１、意味解析部１３及び地図情報操作部２１を統括制御し、制御のためのルールを起動するための制御ルール部３２及び問題解決ルールを起動するための問題解決ルール部３３に基づいて対話処理により入力文を取り込み出力文や地図情報を出力制御する対話制御部３１からなる。
【００２１】
次に、本実施例による自然言語の意味解析処理装置により、述語による枝の変動を吸収し、構造を均一化する例について説明する。
【００２２】
図４は第１実施例の例文１の構文木（依存構造）を示す図、図５は同じ第１実施例の例文２の構文木（依存構造）を示す図、図６は第１実施例の例文１の構文木をパラメータ値対の形に整理する例を示す図である。
【００２３】
この実施例１において、例文１は「東京から宇都宮まで東北自動車道で行く。」、例文２は「東京を出発、東北自動車道を使って、宇都宮に行く。」で、同等の意味を表している。これら２つの例文１及び例文２の依存構造木は、それぞれ図４及び５に示すように、使われる述語（言及の単位）、述語毎の格の違いのために、互いに異なっている。従って、このデータからでは、直接２つの文の意味的同等性を判断することは困難である。これらの表層構造上の違いを吸収するために、述語＋格助詞句毎に、図６のように等価変形を施し、均一な構造で、依存構造木の各枝の内容を整理する。この様な等価変形については、例えば、「高木・伊東著：「自然言語の処理」、丸善、１９８７」が参照される。
【００２４】
以下、図示の第１実施例として、以下の例文について、各枝［述語＋格助詞＋格助詞につく名詞］を［＜＜述語＋格助詞＋関係代名詞＞＋属性名詞＞＝＜格助詞につく名詞＞］と同義変換し、述語による枝の変動を吸収し、構造を均一化する方法について説明する。
【００２５】
今、例文１の「東京から宇都宮まで東北自動車道で行く。」という入力文は、構文解析部１１で図４に示す依存構造木に解析され、節単位に分割して意味解析部１３に送られる。意味解析部１３は、この構造木を、［述語＋格助詞句］の枝毎に分析して図６に示すように「東京から行く」の文章を［出発地＝東京］の形に整理する。なお、「等しい（である）」の主格名詞句の解釈による名詞化については、上記文献で妥当性を保証されているので、実際に依存構造の変形処理を行うことはせず、述語の意味素性毎に、それぞれが受け得る連用修飾成分（格、属性など）と、解釈によって得られるべき名詞との対応関係を意味情報辞書１４に記憶しておき、機械的に変換処理を行う。図８は、このような連用修飾成分と述語意味素性とによって、解釈結果として得られる名詞の例である。なお、述語が断定を表す「（で）ある」の場合は、任意の現象又は実体のクラス、パラメータ値対等に対する話者の判断を表わし、このため現象を直接言及する他の述語のようにそこから直接パラメータを抽出することはできない。「（で）ある」が現象のパラメータ値に対する判断を表わす場合には、主格につく名詞は現象のパラメータにマッピングされるべき概念を表わし、補格につく名詞がその値を表わす。その際、主格名詞には一般的な名詞が用いられることが多く、パラメータ名と直接は対応付けられないことが多い。例えば、「料金は、７００円である。」の「料金」と「移動」のパラメータとして用意されている対応するパラメータ「局所移動所要料金」は、直接マッチしない。したがって、この場合には、別途対応関係を定義しておく。
【００２６】
他の格助詞句についても同様の手法で意味解析部１３により意味解析され、「宇都宮まで行く」 → ［目的地＝宇都宮］
「東北自動車道で行く」→ ［使用道路＝東北自動車道］
と整理される。このとき、東京、宇都宮、東北自動車道などの地図上の実体を示す名詞や移動現象に言及する節が現れた場合には、地図管理部２０の地図情報操作部２１により地図データ２２が参照されて名詞のレファレントの同定や移動過程のシミュレーションが行われる。
【００２７】
このように整理することにより、例文１の「東京から宇都宮まで東北自動車道で行く。」の解釈は図７に示すように、非常にフラットな形でまとめられる。
【００２８】
次に、例文２の「東京を出発、東北自動車道を使って、宇都宮に行く。」の場合も、図５に示す構造木を、例文１と同じような形式に各節毎に整理し、図９に示すようにまとめられ、さらに、得られた各節の解釈結果が整理されていく。この場合に、構造が均一でかつパラメータの位置が現象毎に文脈の場の中でユニークに定まるので、係り受け構造や意味内容に依らず、容易に文脈情報上の一定の定められた場所に書き込む処理を行って（重ね書きして）ゆくことができる。この結果として、例文１の解釈結果と、例文２の解釈結果とは全く同じになる。このように、本発明実施例では、均一なデータ構造の上に情報を一定の定められた場所に書き込む処理を行う（重ね書きする）ことにより、文章全体の意味を、文脈に適切な形で蓄積してゆくことができる。
【００２９】
前述したように、本発明の対話システムでは、節の内容を図２のように、構造的に最大限に単純化したフラットな構造で表すことができ、この意味表現形式では、「東京を出発する」も「東京から行く」も、共に「出発地」＝「東京」と整理することができる。また、これらの各パラメータの、文脈の場の中での位置づけをあらかじめ全て決定しておくことにより、入力されてきた節の各枝の情報を文脈のどこに書き込めばよいかは一意に定められることは前述した。
【００３０】
上記の第１実施例の例文１、２においては、時空間的情報が絡む問題については無視してきたが、実際の文章では、例えば「東京を出て、宇都宮を通り、日光へ行きたい。」のように、出来事の時空間的な断面が断片的に述べられるだけで、統合に際して、その間の現象の流れや空間的つながりを補完しなければならないことがあるが、こうした時空間的連続性の取り扱いの問題を一般的に解決することは現状では困難である。本発明の対話システム装置では、空間的情報については、地図管理部２０の地図情報操作部２１と地図データ２２を備えることにより、道路・領域・施設情報等を保持した地図情報を持つことで、部分的に対処することができる。すなわち、移動や存在に関する文が入力された場合には、地図情報操作部２１で地図上でシミュレーションやレファレントの検索を行なうなどして、統合が試みられる。例えば、前述の「東京を出て、宇都宮を通り、日光へ行きたい。」の例文の場合、シミュレーションにより東京から日光に至る経路が求められ、それに基づいて、一つの移動現象として統合される。そこで、意味解析の蓄積と利用を行う。
【００３１】
前記例文２の「東京を出発、東北自動車道を使って、宇都宮に行く」の解釈の例につい上述したが、本発明の意味解析／意味表現手法を用いると、一文でどこまで述べるか、どういう文体で述べるか、といった、文章の表層構造的なぶれの影響を受けずに、安定な意味解釈結果を得ることができる。また、この解釈モデルでは、あらかじめ、１つの現象内に存在する全パラメータを決定しておくことにより、情報がその意味に従って、常に一定の位置に格納されるようになっている（どういうパラメータについて述べているかで、格納位置が決まる）ので、入力文の意味解析結果を、同じ形式で記述された同種の文脈情報上、容易に照合または書き込む処理を行う（重ね書きする）ことができると共に、以後の何らかの問題解決処理でも、処理に必要なデータをスムーズに収集することが可能である。ただし、例えば、ルートの記述を行う際に、ルートの断片の説明を繰り返して、全体として全ルートの説明を行うという形の記述が行われる。この場合、ルートの断片を表わすために用いられるパラメータは、複数回繰り返して現れることになり、文脈中でパラメータの位置を一意に同定できないことになる。このような場合には、（１）入力文の解釈中の節から得られた他のパラメータ値で、すでに文脈中に位置付けられているものがあれば、それを含むルート断片中の対応するパラメータに解釈結果の値をセットする、（２）直前に話題になった（パラメータ値がセットされた）ルート断片中の対応するパラメータに値を代入する、あるいは（３）直前に話題になったルート断片中の対応するパラメータの値がすでに埋まっている場合には、次のルート断片中の対応するパラメータに値を代入する、等の処理を行う。
【００３２】
また、同様な理由により、ある述語の連用修飾成分が、ルート全体を規定するパラメータを言及しているのか、ルートの断片を規定するパラメータを言及しているのかのあいまい性が生じる場合がある。このような場合には、例えば、入力文解釈時には、とりあえずルートの断片を規定するパラメータを言及しているものとしておき、その上で文脈情報と照合し、地図情報を参照しつつ必要に応じてルート全体を規定するパラメータに解釈しなおす等の処理が必要となる。
【００３３】
一例を挙げると、「東京から宇都宮に行く。」
「宇都宮から日光に行く。」
という２文を解釈する場合には、例えば、以下のような処理を行う。まず、第１文の解釈により、局所出発地＝東京局所到着地＝宇都宮とする。この段階で文脈と照合すると、文脈には何も書かれていないので、上のパラメータ、パラメータ値をそのまま第１のルート断片の記述として文脈に代入すると共に、これがルート全体のパラメータを規定している可能性を考えて、ルート全体を規定するパラメータ「出発地」「目的地」に対しても、出発地＝東京目的地＝宇都宮を代入する。次に、第２文の解析により、局所出発地＝宇都宮局所到着地＝日光を得る。これを文脈と照合すると、ルートの断片がすでに記録されており、またその局所到着地と第２文から得られた局所出発地の値が一致するので、第２文は相続く２番目のルートの断片に関する記述を与えるものであると判断し、対応する文脈中のパラメータに値を代入する。それと同時にルート全体を規定するパラメータについても「日光」がルート全体の目的地である可能性を考えて、目的地＝日光と、値を更新する。その後、ルートを記述する文の入力がこれで終了したことが判明した段階で、目的地＝日光が確定する。また、例えば、第３文として、「東京から日光まで３時間かかる。」
が入力されると、意味解析結果として、局所出発地＝東京局所到着地＝日光局所移動所要時間＝３時間を得る。これを文脈と照合すると、「東京」「日光」がルート全体の「出発地」「目的地」であることが判明するので、「局所移動所要時間」はルート全体の「移動所要時間」に解釈し直されるべきであると判断し、文脈中で「移動所要時間」パラメータに、移動所要時間＝３時間を代入する。ちなみに、第１文の後に「２時間かかる。」が入力された場合は、局所出発地、局所到着地の指定がないので、直前に言及されたルート断片について述べていると判断し、第１のルート断片中の「局所移動所要時間」に「２時間」を代入する。なお、入力文が疑問文の場合にも、同様にして文脈情報と照合してパラメータのあいまい性を解消しつつ意味解析が行われるが、平叙文の場合と異なり、疑問文の内容が文脈情報上に書き込まれる処理が行われる（重ね書きされる）ことはない（疑問文中の代名詞または省略語成分が文脈情報により照応、補完されることはある）。
【００３４】
次に、以上のような考え方に基づき、第２実施例として以下の例文３について実施した結果を説明する。
「東京を８時に出る。まず東北自動車道で宇都宮に向かう。１００分程で宇都宮に着く。宇都宮から日光までは日光宇都宮道路を利用しよう。料金は７７０円だ。距離は約２５ｋｍ、約２０分の行程である。１０時頃には日光に到着するだろう。」
という一連の文章が入力された場合には、前記第１実施例と同様にして図１０〜図１２のような文脈が構築される。ただし、前述のように「料金」「距離」は、あらかじめ定義されているテーブルにより、「局所移動所要料金」「局所移動距離」に各々対応付けられるものとする。「行程」は「移動」と解釈され、述語の「移動する」と対応して各連体修飾成分が「移動」のパラメータに変換される。「約２０分の」は、単位名詞「分」を手がかりにして「局所移動所要時間」パラメータの値にセットされる。
【００３５】
以上の結果、例えば、「東京から日光まで何分で行くか？」という疑問文が入力された場合も、非常に単純なアルゴリズムで答えることができる。すなわち、この疑問文の内容は、すでに入力された上述の文により構築された文脈情報との照合を行った後、最終的に図１３に示す解釈結果が得られる。この形にすることによって、文脈中の「移動所要時間」が問い合わせの対象となっていることがわかるので、移動所要時間を求めるための問題解決ルール部３３の手続きを起動する。移動所要時間を求めるためには、各「局所移動所要時間」を合算すればよい。すなわち、上例の場合に、東京−宇都宮間の局所移動所要時間１００分と、宇都宮−日光間の局所移動所要時間２０分を足して、尋ねられている移動所要時間が「１２０分」と求まるのでそれをもとに、疑問文に応答することが可能になる。
【００３６】
このように、情報の格納位置がその意味によって定まるようになっているために、問題解決に必要な適当な手続きを同定したり、問題解決に必要なデータを収集したりすることが安定にできる。
【００３７】
以上のとおり本発明の対話システム装置では、例えば、図１４に示すような対話処理を行うことができる。すなわち、ユーザが「Ａ観光地に行きたい。」の問いかけに対し、システムは「経由地を指定して下さい。」と指示する。この指示に対しユーザが「Ｌ交差点。」と指示すると、システムは「はい、わかりました。」と答える。次に、ユーザが「朝６時に出発して、午後５時に到着したい。」の問いかけに対し、システムは「はい、わかりました。」と答える。そして、ユーザが「道を教えてくれ。」の問いかけに対し、システムは「道路は１個あります。まず、国道１号を北東へ進行し、Ｅ交差点まで行って下さい。次に、国道１号を北へ進行し、Ｈ交差点まで行って下さい。最後に、国道１号を北東へ進行して下さい。」と答える。この対話システム装置においては、ユーザは入力装置１から音声で入力し、システムは出力装置２から音声合成による音声で出力されるか、あるいはディスプレイに表示される。
【００３８】
上記の枠組に従い、実験データとして市販の観光ドライブガイドブックである（１）「車で行って遊んで泊まる「日光・那須・塩原」、旺文社（１９９３）」及び（２）「ブルーガイド・ドライブ「日光・鬼怒川・那須」、実業之日本社（１９９４）」に登場する「移動」に関連する例文３８２文について実施したところ、その７割程度の文に関しては、この枠組で適切な文脈モデルを構成できることを確認した。
【００３９】
本発明の好適な実施例について説明したが、本発明の精神を逸脱しない範囲内において種々の改良及び変更をなし得ることはもちろんである。
【００４０】
【発明の効果】
以上説明したように本発明の自然言語の意味解析処理装置では、構文解析部で解析された構文解析木を、意味解析部で節単位に分割して入力し［述語＋格助詞句］の枝毎に分析しパラメータ値対の形に整理することで、述語による枝の変動を吸収し、構造を均一化することができ、文脈の蓄積が可能になる。
【図面の簡単な説明】
【図１】本発明の自然言語の意味解析処理装置の構成を示すブロック図である。
【図２】本発明の自然言語の意味解析処理装置による節内容のモデルの現象概念を示す図である。
【図３】本発明の自然言語の意味解析処理装置による文脈のモデル例を示す図である。
【図４】本発明の自然言語の意味解析処理装置による第１実施例の例文１の構文木を示す図である。
【図５】本発明の自然言語の意味解析処理装置による第１実施例の例文２の構文木を示す図である。
【図６】本発明の自然言語の意味解析処理装置による第１実施例の例文１の構文木をパラメータ値対の形の整理を示す図である。
【図７】本発明の自然言語の意味解析処理装置による第１実施例の例文１の意味解析結果及び文脈情報を示す図である。
【図８】本発明の自然言語の意味解析処理装置による連用修飾成分と述語意味素性とによって解釈結果として得られる名詞の例を示す図である。
【図９】本発明の自然言語の意味解析処理装置による第１実施例の例文２の意味解析結果及び文脈情報を示す図である。
【図１０】本発明の自然言語の意味解析処理装置による第２実施例の例文３の意味解析結果及び文脈情報を示す図である。
【図１１】本発明の自然言語の意味解析処理装置による第２実施例の例文３の意味解析結果及び文脈情報を示す図である。
【図１２】本発明の自然言語の意味解析処理装置による第２実施例の例文３の意味解析結果及び文脈情報を示す図である。
【図１３】本発明の自然言語の意味解析処理装置による疑問文の意味解析結果を示す図である。
【図１４】本発明の自然言語の意味解析処理装置による対話処理例を示す図である。
【符号の説明】
１入力装置
２出力装置
１０言語処理部
１１構文解析部
１２文法解析辞書
１３意味解析部
１４意味情報辞書
２０地図管理部
２１地図情報操作部
２２地図データ
３０制御部
３１対話制御部
３２制御ルール部
３３問題解決ルール部[0001]
[Industrial applications]
The present invention relates to a device that sequentially stores information of an input sentence as context information and performs a semantic interpretation, and in particular, a natural language semantic analysis process that performs a semantic interpretation so that context information can be naturally stored in a car navigation interactive process or the like. It concerns the device.
[0002]
[Prior art]
In the past, many studies on dialogue have focused on the intent and topic recognition and use methods (eg, Atsushi Takeshita: "Recognition of topics using interaction structure of dialogue", IPSJ Natural Language Processing Research Group 87-10 (1992)). However, at present, it is hard to say that there is still a stable technology for the context construction technology that is the premise of such recognition.
[0003]
As a problem in context construction, the role of semantic interpretation is to correctly recognize the meaning of the input sentence without depending on the style, organize and integrate it into the context, and extract information necessary for problem solving. The "context" constructed as a result of the sentence interpretation must have a uniform format so that the subsequent problem solving process can be started stably. On the other hand, when looking at actual sentences, a so-called “blurring” often appears in a stylistic style such that a given content is given in one sentence or a plurality of sentences. Even within one sentence, the parse tree that is the input of the semantic interpretation processing has considerable diversity in the types of nodes and branching, so sentences with the same semantic content are May be represented by a different syntactic structure.
[0004]
In this case, if the same contents are allowed to be represented by different structures, it is impossible to stably associate and integrate meanings. In order to integrate sentence contents semantically and construct one context, it is necessary to absorb the structural diversity of stylistic styles and parsing trees. For this purpose, (1) the semantic expression does not have a structure that fluctuates depending on the complexity and the semantic content of the sentence, and (2) the semantic interpretation result of the sentence is expressed in the context of the context. It is necessary to be positioned in a fixed place.
[0005]
In the conventional natural language processing method, the contents of an input sentence are generally arranged in the form of a logical expression, a verb frame, a semantic net, or the like.
[0006]
[Problems to be solved by the invention]
However, for example, a verb frame has an internal structure of a predicate + a surface case (or a deep case), and there are many synonymous expressions having different expression forms due to variations in verbs and case names. For example, even in a sentence where the same departure place represents Tokyo, there is a variety of expressions such as “depart from Tokyo” or “go from Tokyo”. There is a similar problem in the case of a semantic net or the like. These expressions are, in extreme terms, just a formal transformation of the parse tree into another kind of structure with similar complexity, and like the parse tree, have the problem of expression diversity, From the viewpoint of accumulation of words, it was not appropriate as a semantic expression.
[0007]
Therefore, an object of the present invention is, in particular, to provide a natural language semantic analysis processing device that has developed a semantic analysis method capable of naturally storing context information using, for example, a conversation for car navigation as a theme.
[0008]
[Means for Solving the Problems]
In order to achieve the above object, a natural language semantic analysis processing device of the present invention includes a parsing unit that obtains a parse tree of an input sentence,
A semantic analysis unit that performs the following processes a) to e):
a) Analyzed by the parsing unitWasDecompose the substructure tree into a set of branches of [predicate + case particle phrase] for each clause included in the parse tree,
b) the aboveFor "case particle + noun attached to case particle" corresponding to "case particle phrase" of each branch obtained in a)Case Particles and Nouns Attached to Case ParticlesWhenIn between, "attribute noun" and "sai", which indicate the class of "noun attached to case particle", specified by "case particle"WhenTo be converted to [predicate + case particle + attribute noun + "name" + noun attached to case particle],
c) FurtherConverted in b) aboveDecompose "is" into a relative clause structure of "equal to" and convert it into [predicate + case particle + attribute noun + relation pronoun + case particle "ha" + "equal" + case particle "ni" + case particle. Noun]
Invert attribute nouns and relative pronouns to make "equal" the main verb, [predicate + case particle + relation pronoun + attribute noun + case particle "ha" + "equal" + case particle "ni" + noun attached to case particle ]
d)Result converted in c) aboveAfter formalizing as <<<< predicate + case particle + relational pronoun >> + attribute noun> = <noun attached to case particle>]On the left side[<< predicate + case particle + relative pronoun> + attribute noun>]The noun corresponding to the attribute of the phenomenon concept specified by the predicate is replaced with one noun as a parameter, and the right-hand side <noun attached to the case particle> is used as a parameter value, and is stored as context information of a “parameter parameter value” pair.
e) Further, The "parameter parameter value" pair accumulated in the process d) is compared with the "parameter parameter value" pair previously accumulated by the process d), and the attribute of the parameter is determined based on the collation result. The context information determined byWrite processDo and accumulate,
A control unit for controlling the syntactic analysis unit and the semantic analysis unit and controlling the input sentence acquisition and the output sentence output.
[0009]
It further includes a map information operation unit that performs map information operation based on map data, and in the semantic analysis unit, when a noun pointing to an entity on a map or a clause that refers to a movement phenomenon appears, the map information operation unit is Identifying noun referents and simulating the movement process by referring to map data via a computer can complement the flow of phenomena and spatial connections when spatio-temporal information is involved, leading to the problem of handling continuity. This is preferable because it can be partially addressed.
[0010]
The control unit controls input and output of a sentence, and also controls the syntax analysis unit and the semantic analysis unit, a dialogue control unit that stores control rules, and a problem solving rule that stores problem solving rules. It is preferable to have a unit in that a dialog system for car navigation can be constructed.
[0011]
[Action]
According to the present invention, a parse tree analyzed by a parsing unit is divided into clause units by a semantic analyzing unit, input and analyzed for each branch of [predicate + case particle phrase], and a parameter value pair uniquely determined for each phenomenon is determined. By organizing in the form of, the variation of the branch caused by the predicate can be absorbed, the structure can be made uniform, and the context can be accumulated.
[0012]
【Example】
Hereinafter, the present invention will be described in detail with reference to an embodiment shown in the drawings. FIG. 1 is a block diagram showing a configuration of a natural language semantic analysis processing device according to an embodiment of the present invention. FIG. 2 is a diagram showing a concept of a phenomenon of a clause content model by the natural language semantic analysis processing device of the present invention. FIG. 3 is a diagram showing an example of a context model by the natural language semantic analysis processing device of the present invention.
[0013]
First, a basic method for solving the problem of the present invention will be described. Since the basic unit of the proposition level that constitutes a sentence is a clause, it is natural that the meaning of the input sentence is interpreted in units of clauses. In this case, the factors of structural diversity of clause-level expressions are (a) diversity of combinations and numbers of branches that modify predicates and nouns, and (b) diversity of meanings of predicates and nouns. is there. In the case of a parse tree or a semantic expression containing structural diversity of the same type, it is impossible to escape from the structural diversity caused by the above (a) and (b). [Predicate + case particle phrase]], and consider that each branch is interpreted independently of the other branches. As a result, the predicate is interpreted a plurality of times, but this is acceptable because the branches are interpreted individually. Also, in practice, there may be some semantic dependency between the branches. If there is a dependency between the parameters, an additional procedure for referring to or calculating the value of another parameter according to the dependency will be given to the parameter as described later. Details will be described later.
[0014]
However, when the clause is decomposed into branches, the problem of the diversity of the meanings of the predicates and nouns in (b) remains. Therefore, each branch [predicate + caseParticle+ CaseParticleNoun to [<<<Predicate + caseParticle+Related Pronouns>+ Attribute noun>=<CaseParticleThe noun>], And<<<Predicate + caseParticle + Relational Pronoun>+ Attribute noun>Is uniquely determined by the phenomenon, and a noun word that does not depend on the predicate, for example, if “goes from”, is interpreted as “departure place”. This makes it possible to absorb the variation of the branch caused by the predicate and to make the structure uniform.Here, [] indicates the range or unit of the partial syntax tree to be described. <> Represents a unit of a phrase or a clause. In the expression << predicate + case particle + relational pronoun + attribute noun>, the inner <> represents a group of adnominal modifier clauses, and the outer <> represents a group of noun phrases headed by an attribute noun. The left side of = represents the nominative component of "equals", and the right side represents the qualification component.
[0015]
In the present invention, a noun representing this "predicate + case + attribute noun" is called a parameter, and a noun attached to the case is called a parameter value. When there is a dependency between parameters, an additional procedure for referring to or calculating another parameter value according to the dependency is provided for the parameter. That is, according to the present invention, the contents of the node can be represented by a flat structure that is structurally simplified as shown in FIG. Here, a frame corresponding to a predicate surrounding a parameter value group is prepared to represent a unit called a clause, but this frame is not essential as context information. In such a semantic expression format, both the sentence “Departing from Tokyo” and the sentence “Going from Tokyo” can be arranged as “Departure place” = “Tokyo”. In addition, by determining the position of each of these parameters in the context field in advance, it is possible to uniquely determine where in the context information of each branch of the input clause should be written. Become. As described above, (1) the semantic expression does not have a structure that fluctuates depending on the complexity and semantic content of the sentence, and (2) the semantic interpretation result of the sentence is expressed in the context of the context. The condition that the content is positioned at a fixed place for each content is satisfied, and the context information is accumulated by stably writing the content of the clause at a specific position in the context.
[0016]
For example, in the case of the sentence "Turn left at the Ihei intersection, drive southwest along the Numata highway along the Hinoemata river, and go to the innermost Hinoemata hot spring in Aizu in about an hour from Yaso on the Aizu plateau." A context model as shown is formed.
[0017]
In the above, the problem involving spatiotemporal information has been ignored, but in actual sentences, the spatiotemporal section of the event, such as "I want to leave Tokyo, pass Utsunomiya and go to Nikko." May be described fragmentarily, and in integration, it may be necessary to supplement the flow of phenomena and spatial connections between them. Although it is currently difficult to solve such a problem of handling spatiotemporal continuity in general, in the present invention, spatial information has map information holding road, area, facility information, and the like. I will deal with it in part. If a sentence related to movement or presence is input, a simulation or referent search is performed on the map to attempt integration. In the case of the above-mentioned example sentence "I want to leave Tokyo and go to Utsunomiya and go to Nikko", a route from Tokyo to Nikko is obtained by simulation, and based on that, it is integrated as one movement phenomenon.
[0018]
Next, an embodiment in which the natural language semantic analysis processing device of the present invention is configured as a navigation interactive system will be described.
[0019]
In FIG. 1, a navigation interactive system according to the present embodiment includes an input device 1 for inputting a sentence, an output device 2 for outputting a sentence and outputting map information, and a language for performing syntax analysis and semantic analysis of the input sentence. It comprises a processing unit 10, a map management unit 20 for managing map information, and a control unit 30 for controlling the input device 1, the output device 2, the language processing unit 10 and the map management unit 20 as a whole.
[0020]
The input device 1 is a device that takes in a sentence under the control of the control unit 30, and is, for example, a part that inputs a sentence by voice recognition and performs a morphological analysis process. The output device 2 is a part that outputs a sentence or map information under the control of the control unit 30. For example, the output device 2 is a part that outputs a sentence by voice synthesis or that displays a sentence or a map on a display. . The language processing unit 10 analyzes a syntax of a sentence input by the input device 1 based on a grammar analysis dictionary 12 for syntax analysis, and obtains a parse tree. Is analyzed for each branch of the predicate + case particle phrase based on the semantic information dictionary 14 and arranged in the form of parameter value pairs, and a noun pointing to an entity on a map or a clause referring to a movement phenomenon appears. In reference to the map information of the map management unit 20, identification of referents of nouns and simulation of the movement process are performed, and the contents of each parameter value pair are collated one after another in a context field, or context information defined by parameters is used. The semantic analysis unit 13 performs a process of writing (overwriting) in the above-mentioned fixed location. The map management unit 20 includes a map information operation unit 21 that performs map information operation based on map data 22 that stores map information. The control unit 30 controls the input device 1, the output device 2, the syntax analysis unit 11, the semantic analysis unit 13, and the map information operation unit 21, and controls a control rule unit 32 for activating a control rule. An interactive control unit 31 fetches an input sentence by interactive processing based on a problem solving rule unit 33 for activating a problem solving rule and controls output of an output sentence and map information.
[0021]
Next, a description will be given of an example in which the natural language semantic analysis processing device according to the present embodiment absorbs variations in branches caused by predicates and makes the structure uniform.
[0022]
FIG. 4 is a diagram showing a syntax tree (dependency structure) of an example sentence 1 of the first embodiment, FIG. 5 is a diagram showing a syntax tree (dependency structure) of an example sentence 2 of the same first embodiment, and FIG. 6 is a first embodiment. FIG. 9 is a diagram showing an example of organizing the syntax tree of Example sentence 1 into a parameter value pair.
[0023]
In the first embodiment, the example sentence 1 is equivalent to "go from Tokyo to Utsunomiya on the Tohoku Expressway." The example sentence 2 is "depart from Tokyo and use the Tohoku Expressway to go to Utsunomiya." I have. As shown in FIGS. 4 and 5, the dependency structure trees of these two example sentences 1 and 2 are different from each other due to differences in predicates (units of reference) used and cases of the predicates. Therefore, it is difficult to directly determine the semantic equivalence of two sentences from this data. In order to absorb these differences in the surface structure, an equivalent transformation is performed for each predicate + case particle phrase as shown in FIG. 6, and the contents of each branch of the dependency structure tree are arranged in a uniform structure. For such an equivalent deformation, reference is made, for example, to Takagi and Ito: "Processing of Natural Language", Maruzen, 1987.
[0024]
Hereinafter, as a first embodiment shown in the drawings, the following example sentences are used for each branch.[Predicate + caseParticle+ CaseParticleThe noun]To[<<Predicate + caseParticle+Related Pronouns>+ Attribute noun>=<CaseParticleThe noun>]A method of performing a synonymous transformation to absorb variations of branches caused by predicates and making the structure uniform will be described.
[0025]
Now, the input sentence “Go from Tokyo to Utsunomiya on the Tohoku Expressway” in the example sentence 1 is analyzed by the parsing unit 11 into a dependency structure tree shown in FIG. Can be The semantic analysis unit 13 analyzes this structural tree for each branch of [predicate + case particle phrase] and arranges the sentence “go from Tokyo” into the form of “departure place = Tokyo” as shown in FIG. . As for the nounization by interpretation of the noun phrase "equal (is)", the validity is guaranteed in the above-mentioned document, so the dependency structure is not actually transformed, and the meaning of the predicate is not considered. For each feature, the corresponding relationship between the consecutive modification components (case, attribute, etc.) that can be received and the noun to be obtained by interpretation is stored in the semantic information dictionary 14, and the conversion process is performed mechanically. FIG. 8 is an example of a noun obtained as an interpretation result by such a combined modifier component and a predicate semantic feature. If the predicate is "(is)", which indicates an assertion, it indicates the speaker's judgment on an arbitrary phenomenon or the class of an entity, parameter value, etc., and therefore, as in other predicates that directly refer to the phenomenon, Parameters cannot be extracted directly from. When "(is)" represents a judgment on the parameter value of a phenomenon, the noun attached to the nominative indicates the concept to be mapped to the parameter of the phenomenon, and the noun attached to the nominative indicates the value. At this time, a general noun is often used as the nominative noun, and is often not directly associated with the parameter name. For example, the "charge is 700 yen." "Charge" and the corresponding parameter "local transfer required charge" prepared as the parameter of "movement" do not directly match. Therefore, in this case, a correspondence is separately defined.
[0026]
The other case particle phrases are semantically analyzed by the semantic analysis unit 13 in the same manner, and “go to Utsunomiya” → [destination = Utsunomiya]
"Go by Tohoku Expressway" → [Road used = Tohoku Expressway]
It is arranged with. At this time, if a noun indicating an entity on a map such as Tokyo, Utsunomiya, or the Tohoku Expressway or a clause referring to a movement phenomenon appears, the map data 22 is referred to by the map information operation unit 21 of the map management unit 20. Identification of referents in nouns and movement processessimulationIs performed.
[0027]
By arranging in this manner, the interpretation of the example sentence 1 "Go from Tokyo to Utsunomiya on the Tohoku Expressway" can be summarized in a very flat form as shown in FIG.
[0028]
Next, in the case of the example sentence 2 "Leave Tokyo and use the Tohoku Expressway to go to Utsunomiya.", The structural tree shown in FIG. 5 is arranged for each section in a format similar to that of the example sentence 1. The results are summarized as shown in FIG. 9, and the obtained interpretation results of each section are further organized. In this case, since the structure is uniform and the position of the parameter is uniquely determined in the context field for each phenomenon, it can easily be placed in a fixed place on the context information regardless of the dependency structure and semantic content. The writing process can be performed (overwriting). As a result, the interpretation result of the example sentence 1 is exactly the same as the interpretation result of the example sentence 2. As described above, in the embodiment of the present invention, the processing of writing information in a predetermined place on a uniform data structure is performed (overwritten), so that the meaning of the entire sentence can be changed in a form appropriate for the context. It can accumulate.
[0029]
As described above, in the dialogue system of the present invention, the contents of a clause can be represented by a flat structure that is structurally simplified as much as possible, as shown in FIG. Both "do" and "go from Tokyo" can be arranged as "departure point" = "Tokyo". In addition, by determining in advance all the positions of these parameters in the context field, it is possible to uniquely determine where in the context information of each branch of the input clause should be written. Has been described above.
[0030]
In the example sentences 1 and 2 of the first embodiment described above, the problem involving spatiotemporal information has been ignored, but in actual sentences, for example, "I want to leave Tokyo and pass Utsunomiya to Nikko." As mentioned above, the spatio-temporal cross section of an event may be described only in a fragmentary manner, and in integration, it may be necessary to complement the flow of phenomena and the spatial connection between them. It is currently difficult to solve handling problems in general. In the interactive system device of the present invention, the spatial information is provided with the map information operation unit 21 of the map management unit 20 and the map data 22 to have the map information holding the road / area / facility information. Partially addressed. That is, when a sentence relating to movement or presence is input, integration is attempted by, for example, performing a simulation or referent search on the map using the map information operation unit 21. For example, in the case of the above-mentioned example sentence "I want to leave Tokyo and pass through Utsunomiya to Nikko.", A route from Tokyo to Nikko is obtained by simulation, and based on that, it is integrated as one movement phenomenon. Therefore, the semantic analysis is accumulated and used.
[0031]
Although the example of the interpretation of the example sentence 2 "Leaving Tokyo and going to Utsunomiya using the Tohoku Expressway" has been described above, if the semantic analysis / semantic expression method of the present invention is used, what style is described in one sentence? It is possible to obtain a stable semantic interpretation result without being affected by the fluctuation of the surface structure of the sentence such as described in (1). Further, in this interpretation model, by determining all parameters existing in one phenomenon in advance, information is always stored in a fixed position according to its meaning (what parameters are described). Determines the storage position), so that the semantic analysis result of the input sentence can be easily collated or written (overwritten) on the same type of context information described in the same format, and It is possible to smoothly collect data necessary for processing even in some problem solving processing. However, for example, when a route is described, a description is given in a form of repeating the description of the route fragments and describing the entire route as a whole. In this case, the parameter used to represent the root fragment will appear repeatedly multiple times, and the position of the parameter cannot be uniquely identified in the context. In such a case, (1) If any other parameter value obtained from the clause in the interpretation of the input sentence is already positioned in the context, the corresponding parameter in the root fragment containing it is included. Set the value of the interpretation result to (2) substitute the value for the corresponding parameter in the route fragment that was just talked about (the parameter value was set), or (3) the route that was talked just before If the value of the corresponding parameter in the fragment is already filled, processing such as substituting the value for the corresponding parameter in the next route fragment is performed.
[0032]
Further, for the same reason, there may be an ambiguity as to whether the repeated modification component of a certain predicate refers to a parameter that defines the entire route or a parameter that defines a fragment of the route. In such a case, for example, at the time of interpreting the input sentence, it is assumed that a parameter that specifies a fragment of the route is referred to for the time being, and then compared with the context information, and as necessary while referring to the map information. Processing such as re-interpreting the parameters as defining the entire route is required.
[0033]
For example, "Go from Tokyo to Utsunomiya."
"Go to Nikko from Utsunomiya."
When interpreting the two sentences, for example, the following processing is performed. First, based on the interpretation of the first sentence, the local departure place = the Tokyo local arrival place = Utsunomiya. When comparing with the context at this stage, since nothing is written in the context, the above parameters and parameter values are directly substituted into the context as the description of the first route fragment, and this defines the parameters of the entire route. In consideration of the possibility that the route is present, the departure place = Tokyo destination = Utsunomiya is also substituted for the parameters “departure place” and “destination” that define the entire route. Next, the local sentence = Utsunomiya local destination = Nikko is obtained by the analysis of the second sentence. Matching this with the context, the second sentence is the second successive route because the fragment of the route has already been recorded and its local destination and the value of the local departure from the second sentence match. And assigns a value to a parameter in the corresponding context. At the same time, for the parameter that defines the entire route, the value is updated to “destination = Nikko” considering that “Nikko” may be the destination of the entire route. Thereafter, when it is determined that the input of the sentence describing the route has been completed, the destination = Nikko is determined. Also, for example, as the third sentence, "It takes three hours from Tokyo to Nikko."
Is input, the local departure place = the Tokyo local destination = the Nikko local movement required time = 3 hours as the semantic analysis result. When this is compared with the context, it is found that "Tokyo" and "Nikko" are the "departure point" and "destination" of the entire route, so "local travel time" is interpreted as "travel time" of the entire route It is determined that the time is to be changed, and the time required for movement = 3 hours is substituted for the “time required for movement” parameter in the context. By the way, if "takes 2 hours" is input after the first sentence, since the local departure place and the local destination are not specified, it is determined that the route fragment mentioned immediately before is described. "2 hours" is substituted for the "local movement required time" in the route fragment of the above. In the case where the input sentence is a question sentence, the semantic analysis is performed in the same manner as in the case of the declarative sentence, while the ambiguity of the parameter is resolved by collating with the context information. The processing written above is not performed (overwritten) (the pronoun or abbreviation component in the question sentence may be adapted and complemented by the context information).
[0034]
Next, based on the above-described concept, a result of the following example sentence 3 as the second embodiment will be described.
"Leave Tokyo at 8am. First, head to Utsunomiya on the Tohoku Expressway. You will reach Utsunomiya in about 100 minutes. From Utsunomiya to Nikko, use the Nikko Utsunomiya Road. The fare is 770 yen. Distance is about 25km, about 20 minutes It will arrive at Nikko around 10 o'clock. "
Is input, a context as shown in FIGS. 10 to 12 is constructed in the same manner as in the first embodiment. However, as described above, the “fee” and the “distance” are respectively associated with the “local travel required fee” and the “local travel distance” by a predefined table. The “stroke” is interpreted as “move”, and each union modifying component is converted into a “move” parameter corresponding to the “move” predicate. “About 20 minutes” is set to the value of the “local movement required time” parameter using the unit noun “minute” as a clue.
[0035]
As a result of the above, for example, even when a question sentence “How long does it take to go from Tokyo to Nikko?” Can be answered with a very simple algorithm. That is, the content of this question sentence is collated with the context information constructed by the previously input sentence, and finally, the interpretation result shown in FIG. 13 is obtained. By adopting this form, it can be seen that the “time required for movement” in the context is an object of the inquiry, and the procedure of the problem solving rule unit 33 for obtaining the required time required for movement is started. In order to obtain the required travel time, the “local travel required time” may be summed. That is, in the case of the above example, the required required travel time is obtained as "120 minutes" by adding the required local travel time between Tokyo and Utsunomiya for 100 minutes and the required local travel time between Utsunomiya and Nikko for 20 minutes. Therefore, it becomes possible to answer the question based on it.
[0036]
As described above, since the storage location of the information is determined by its meaning, it is possible to stably identify an appropriate procedure necessary for solving the problem and collect data necessary for solving the problem. .
[0037]
As described above, the dialogue system apparatus of the present invention can perform a dialogue process as shown in FIG. 14, for example. In other words, in response to the user's question "I want to go to the sightseeing spot A." When the user instructs "L intersection" in response to this instruction, the system answers "Yes, I understand." Next, when the user asks "I want to depart at 6:00 am and arrive at 5:00 pm", the system answers "Yes, I understand." Then, when the user asks "Tell me the road.", The system responds, "There is one road. First, proceed northeast on National Road No. 1 and go to the E intersection. Next, National Road No. 1 To the north and go to the H-intersection. Finally, follow National Route 1 to the northeast. " In this interactive system device, a user inputs by voice from the input device 1, and the system is output by voice from the output device 2 by voice synthesis or displayed on a display.
[0038]
According to the above framework, it is a commercially available sightseeing drive guidebook as experimental data (1) "Nikko / Nasu / Shiobara, go and drive and stay overnight, Obunsha (1993)" and (2) "Blue Guide Drive" Nikko / Kinugawa / Nasu "and" 382 "related to" movement "appearing in" Jitsugyo no Nihonsha "(1994). About 70% of the sentences were composed of an appropriate context model using this framework. I confirmed that I can do it.
[0039]
Although the preferred embodiment of the present invention has been described, various modifications and changes can be made without departing from the spirit of the present invention.
[0040]
【The invention's effect】
As described above, in the natural language semantic analysis processing apparatus of the present invention, the parse tree analyzed by the syntactic analysis unit is divided and input into clause units by the semantic analysis unit, and the [predicate + case particle phrase] branch is input. By analyzing each parameter and arranging them in the form of parameter value pairs, variations in branches due to predicates can be absorbed, the structure can be made uniform, and context can be accumulated.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a configuration of a natural language semantic analysis processing device of the present invention.
FIG. 2 is a diagram showing a concept of a phenomenon of a model of a clause content by the natural language semantic analysis processing device of the present invention.
FIG. 3 is a diagram illustrating an example of a context model by the natural language semantic analysis processing device of the present invention.
FIG. 4 is a diagram showing a syntax tree of an example sentence 1 of the first embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 5 is a diagram showing a syntax tree of an example sentence 2 of the first embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 6 is a diagram showing the arrangement of the syntax tree of the example sentence 1 of the first embodiment in the form of parameter value pairs by the natural language semantic analysis processing device of the present invention.
FIG. 7 is a diagram showing a semantic analysis result and context information of an example sentence 1 of the first embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 8 is a diagram showing an example of a noun obtained as an interpretation result by a combined modifier component and a predicate semantic feature by the natural language semantic analysis processing device of the present invention.
FIG. 9 is a diagram showing a semantic analysis result and context information of an example sentence 2 of the first embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 10 is a diagram illustrating a semantic analysis result and context information of an example sentence 3 of the second embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 11 is a diagram illustrating a semantic analysis result and context information of an example sentence 3 of the second embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 12 is a diagram illustrating a semantic analysis result and context information of an example sentence 3 of the second embodiment by the natural language semantic analysis processing device of the present invention.
FIG. 13 is a diagram showing a result of semantic analysis of a question sentence by the natural language semantic analysis processing device of the present invention.
FIG. 14 is a diagram illustrating an example of an interactive process performed by the natural language semantic analysis processing device of the present invention.
[Explanation of symbols]
1 Input device
2 Output device
10 Language processing unit
11 Syntax analyzer
12 grammar analysis dictionary
13 Semantic analysis part
14 semantic information dictionary
20 Map Management Department
21 Map information operation unit
22 Map data
30 control unit
31 Dialogue control unit
32 Control Rule Section
33 Problem Solving Rule Division

Claims

A parser for obtaining a parse tree of an input sentence;
The next a) to e units sections included in the semantic analysis unit and a) parse tree is analyzed by the syntax analysis unit performs a process of), the partial structure tree of the predicate + case particle Clause Decompose into a set of branches,
b) corresponds to the "case particles Clause" of each branch obtained by said a) for the "nouns attached to case particles + case particle" and "case particle" between "nouns attached to case particle", is specified by the "case particle", get to represent a class of "noun to get the case particle" and "attribute noun" to complement the "that" [predicate + case particle + attribute noun + "of" + case particle Noun],
c) Further, the word "" converted in the above b) is decomposed into a relational clause structure of "equal to", and the relation [predicate + case particle + attribute noun + relation pronoun + case particle "ha" + "equal" + Case particle "ni" + noun attached to case particle]
Invert attribute nouns and relative pronouns to make "equal" the main verb, [predicate + case particle + relation pronoun + attribute noun + case particle "ha" + "equal" + case particle "ni" + noun attached to case particle ]
d) The result converted in c) is formalized as <<<< predicate + case particle + relational pronoun >> + attribute noun >> = <noun attached to case particle>], and then the left-hand side is <<<< predicate + The case particle + relative pronoun> + attribute noun>] is replaced with a single noun corresponding to the attribute of the phenomenon concept specified by the predicate, and is used as a parameter. The right-hand side <noun attached to the case particle> is used as a parameter value. Store as context information of "parameter parameter value" pair,
e) Further , the "parameter parameter value" pair accumulated in the process d) is compared with the "parameter parameter value" pair previously accumulated by the process d), and based on the result of the collation. Writes and accumulates the context information determined by the attribute of the parameter to a certain position ,
A natural language semantic analysis device, comprising: a control unit that controls the syntax analysis unit and the semantic analysis unit and controls the input sentence capture and the output sentence output.

It further includes a map information operation unit that performs map information operation based on the map data. 2. The natural language semantic analysis processing apparatus according to claim 1, wherein the reference of the noun is identified and the movement process is simulated by referring to the map data via the interface.

The control unit controls input and output of a sentence, and also controls the syntax analysis unit and the semantic analysis unit. The natural language semantic analysis processing device according to claim 1 or 2, further comprising a unit.