JP2017107276A

JP2017107276A - Information processing device, information processing method, and program

Info

Publication number: JP2017107276A
Application number: JP2015238366A
Authority: JP
Inventors: 塚原　裕史; Yasushi Tsukahara; 裕史塚原; 慶内海; Kei Uchiumi; 弘利岩崎; Hirotoshi Iwasaki
Original assignee: Denso IT Laboratory Inc
Current assignee: Denso IT Laboratory Inc
Priority date: 2015-12-07
Filing date: 2015-12-07
Publication date: 2017-06-15

Abstract

PROBLEM TO BE SOLVED: To perform language processing in association of an object existing in an environment with the content of an operation instruction, for even a sentence in which a subject or an object, etc., are omitted.SOLUTION: An information processing device 1 comprises: a dictionary database 10 in which the spatial relative relation of a word and an object co-occurring in a verb representing an operation content is stored; a detection unit 2 for acquiring the data of spatial relative relation of an external object; an input unit 3 for accepting an operation command; a spatial semantic structure analysis unit 4 for referring to the dictionary database 10 and finding the hierarchical structure of the operation command including an omitted word; and a plurality of partial models connected by a hierarchical structure. Each of the partial models is provided with a probability model generation unit 5 for generating a probability model having a first variable relating to a word, a second variable group relating to the spatial relative relation of an object, and a third variable relating to the extent of correspondence of the object to the word, and a collation unit 6 for determining the spatial relative relation of an object having a largest probability value from the data of spatial relative relation detected by the detection unit 2.SELECTED DRAWING: Figure 1

Description

本発明は、自然言語処理を行う情報処理装置に関し、特に、空間内にある物体（オブジェクト）に関する操作指示内容を解析する情報処理装置に関する。 The present invention relates to an information processing apparatus that performs natural language processing, and more particularly, to an information processing apparatus that analyzes operation instruction contents related to an object (object) in a space.

従来から、ロボットに対して、空間内にある物体に関する操作指示を音声によって与えることが知られていた（特許文献１）。しかし、上記した従来技術では、物体の空間的な意味を抽出することはないため、物体間の相対位置の関係性や操作内容と物体との関係性を扱うことができない。 Conventionally, it has been known that an operation instruction related to an object in a space is given to a robot by voice (Patent Document 1). However, since the above-described conventional technique does not extract the spatial meaning of the object, it cannot handle the relationship between the relative positions of the objects and the relationship between the operation contents and the objects.

これに対し、空間的な意味情報を階層的な木構造で表現し、自然言語文からその中にある空間的な意味構造を確率的な手法によって抽出する技術についての先行研究がある（非特許文献１、非特許文献２）。 On the other hand, there is a previous research on a technique that expresses spatial semantic information in a hierarchical tree structure and extracts the spatial semantic structure in it from a natural language sentence by a probabilistic method (non-patented). Document 1, Non-Patent Document 2).

特開２０１１−１７０７８９号公報JP 2011-170789 A

T. Koller他「Towards Understanding Natural Language Directions」Proceedings of the 5th ACM/IEEE International Conference on Human-robot InteractionT. Koller et al. `` Towards Understanding Natural Language Directions '' Proceedings of the 5th ACM / IEEE International Conference on Human-robot Interaction S.Tellex他「Understanding Natural Language Commandsfor Robotic Navigation and Mobile Manipulation」In Proceedings of the National Conference on Artificial Intelligence (AAAI 2011).S. Tellex et al. `` Understanding Natural Language Commands for Robotic Navigation and Mobile Manipulation '' In Proceedings of the National Conference on Artificial Intelligence (AAAI 2011).

しかし、普段の会話においては、主語や目的語等が省略されることもしばしばであるが、非特許文献１，２では、このような省略に対する解決手段を提供していない。特に、日本語の場合には、ゼロ代名詞と呼ばれる主語や目的語の省略が許されるため、こうした課題を解決することが求められている。 However, in usual conversations, the subject, the object, and the like are often omitted, but Non-Patent Documents 1 and 2 do not provide means for solving such omission. In particular, in the case of Japanese, omission of subjects and objects called zero pronouns is allowed, so there is a need to solve these problems.

また、非特許文献１では、環境は建物内のような静的なものを想定している。非特許文献１に記載された技術では、静的な環境の中で、あらかじめ操作内容をロボットに教示しておく必要があるため、ドライブの環境のように動的に変化していく状況には適用できなかった。 In Non-Patent Document 1, it is assumed that the environment is static like a building. In the technique described in Non-Patent Document 1, since it is necessary to teach the operation contents to the robot in a static environment in advance, there is a situation where it changes dynamically like the drive environment. It was not applicable.

例えば、自動運転などにおいて、ドライバが言葉で運転指示を出す環境を考えると、環境は常に動的に変化するため、非特許文献１に記載された技術を適用することはできず、もし適用できるとしても、静的でかつ既知という非常に限られた環境内でしか適用できないという問題がある。 For example, when considering an environment in which a driver gives driving instructions in words in automatic driving or the like, the environment always changes dynamically, so the technique described in Non-Patent Document 1 cannot be applied and can be applied. However, there is a problem that it can be applied only in a very limited environment that is static and known.

そこで、本発明は、主語や目的語等が省略された文であっても、環境内に存在する物体と操作指示内容とを適切に関連付けて言語処理を行う情報処理装置を提供することを目的とする。 Accordingly, an object of the present invention is to provide an information processing apparatus that performs linguistic processing by appropriately associating an object existing in an environment with the content of an operation instruction even if the subject, object, etc. are omitted. And

本発明の情報処理装置は、操作内容を表す動詞に共起する文言およびオブジェクトの空間的相対関係を記憶した辞書データベースと、外部空間に存在するオブジェクトに関する空間的相対関係のデータを取得する検出部と、ユーザから操作命令文の入力を受け付ける入力部と、前記入力部にて入力された操作命令文に含まれる文言の階層構造を求めるとともに、前記辞書データベースを参照して前記操作命令文において省略されている文言も含めた階層構造を求める空間的意味構造解析部と、前記階層構造でもって接続された複数の部分モデルを有し、各部分モデルの確信度の関数として確信度が求まる確信度モデルであって、前記各部分モデルが、前記文言に関する第１の変数、オブジェクトの空間的相対関係に関する第２の変数群、前記オブジェクトが前記文言に対応する度合いに関する第３の変数を有する確信度モデルを生成する確信度モデル生成部と、前記検出部にて検出されたオブジェクトの空間的相対関係のデータを前記第２の変数群に適用して前記複数の部分モデルの確信度の関数として求められる確信度を計算し、前記確信度が最大となるオブジェクトの空間的相対関係を決定する照合部と、前記操作命令文の解析結果を出力する出力部とを備える。ここで、前記確信度モデルは、確率モデルであってもよいし、ニューラルネットワーク、ＳＶＭ（Support Vector Machine）等の識別モデルであってもよい。 An information processing apparatus according to the present invention includes a dictionary database storing words and co-occurrence of objects coexisting in a verb representing operation content, and a detection unit that acquires data of spatial relations related to objects existing in external space And an input unit that accepts an input of an operation command sentence from a user, and a hierarchical structure of words included in the operation command sentence input in the input unit, and is omitted in the operation command sentence with reference to the dictionary database A certainty factor that has a spatial semantic structure analysis unit that obtains a hierarchical structure including the wording and a plurality of partial models connected by the hierarchical structure, and the certainty factor is obtained as a function of the certainty factor of each partial model Each of the partial models includes a first variable relating to the wording, a second variable group relating to a spatial relative relation of the objects, A confidence model generation unit that generates a confidence model having a third variable related to the degree to which the object corresponds to the wording, and data on the spatial relative relationship between the objects detected by the detection unit A collation unit that is applied to a group to calculate a certainty factor that is obtained as a function of the certainty factor of the plurality of partial models, and determines a spatial relative relationship of an object that maximizes the certainty factor; and an analysis of the operation command statement And an output unit for outputting the result. Here, the certainty model may be a probability model or an identification model such as a neural network or SVM (Support Vector Machine).

このように、辞書データベースを参照して操作命令文において省略されている文言も含めた階層構造を求めることで、外部オブジェクトの空間的相対関係の情報を用いて、省略されているであろう文言の候補を求めることができる。ここで、「文言」は、動詞、名詞句などであり、名詞プラス接置詞であってもよい。そして、検出された外部オブジェクトを第２の変数群に順次代入して、確信度モデルの確信度が最大となるオブジェクトの空間的相対関係を決定することで、主語や目的語等が省略された文であっても、環境内に存在するオブジェクトと操作命令とを適切に関連付けて言語処理を行うことができる。また、第２の変数群に代入する値が、外部に存在するオブジェクトによって絞り込まれるので、計算処理を軽減することができる。 In this way, by referring to the dictionary database and obtaining the hierarchical structure including the words omitted in the operation command sentence, the words that would be omitted using the information on the spatial relative relationship of the external object. Candidates can be sought. Here, the “word” is a verb, a noun phrase, or the like, and may be a noun plus infix. Then, the subject or object is omitted by sequentially substituting the detected external object into the second variable group and determining the spatial relative relationship of the object having the maximum certainty factor of the certainty factor model. Even in the case of a sentence, language processing can be performed by appropriately associating an object existing in the environment with an operation instruction. In addition, since the values to be assigned to the second variable group are narrowed down by objects existing outside, the calculation process can be reduced.

本発明の情報処理装置において、前記検出部は、検出されたオブジェクトの位置を地図情報に基づいて、自車両を基準とするローカル座標系に座標変換してもよい。 In the information processing apparatus of the present invention, the detection unit may perform coordinate conversion of the position of the detected object into a local coordinate system based on the host vehicle based on the map information.

このようにローカル座標系に座標変換することで、オブジェクトの位置を扱い易くなり、計算処理を軽減できる。 By converting the coordinates to the local coordinate system in this way, it becomes easier to handle the position of the object, and the calculation process can be reduced.

本発明の情報処理装置において、前記検出部は、オブジェクトの空間的位置関係の時系列データを取得し、オブジェクトの状態が変化している場合には、過去に取得した現在とは異なるオブジェクトの状態を過去オブジェクトとして検出し、前記空間的意味構造解析部は、過去のオブジェクトに関する文言が含まれている場合には、現在のオブジェクトに関する文言と過去のオブジェクトに関する文言とを区別して階層構造を求めてもよい。 In the information processing apparatus of the present invention, the detection unit acquires time-series data of the spatial positional relationship of the object, and when the state of the object is changed, the state of the object different from the current acquired in the past And the spatial semantic structure analysis unit obtains a hierarchical structure by distinguishing a word related to the current object and a word related to the past object when the word related to the past object is included. Also good.

また、本発明の情報処理装置において、前記検出部は、オブジェクトの空間的位置関係の時系列データを取得し、オブジェクトの状態が変化している場合には、予測される未来のオブジェクトの状態を未来オブジェクトとして検出し、前記空間的意味構造解析部は、未来のオブジェクトに関する文言が含まれている場合には、現在のオブジェクトに関する文言と未来のオブジェクトに関する文言とを区別して階層構造を求めてもよい。 In the information processing apparatus according to the present invention, the detection unit obtains time-series data of the spatial positional relationship of the object, and when the state of the object is changed, the state of the future object to be predicted is obtained. The spatial semantic structure analysis unit may detect a hierarchical structure by distinguishing a word related to the current object and a word related to the future object when the word related to the future object is included. Good.

このように外部環境内において状態が変化しているオブジェクトを過去オブジェクトまたは未来オブジェクトとして扱うことにより、操作命令文に過去や未来のオブジェクトに関する命令が含まれているときであっても、適切に言語処理を行うことができる。 By treating an object whose state has changed in the external environment as a past object or future object in this way, even if the operation command statement includes a command related to a past or future object, the language can be appropriately Processing can be performed.

本発明の情報処理装置は、操作内容を表す動詞に共起する文言およびオブジェクトの空間的相対関係を記憶した辞書データベースと、外部空間に存在するオブジェクトに関する空間的相対関係の時系列データを取得し、オブジェクトの状態が変化している場合には、過去に取得した現在とは異なるオブジェクトの状態を過去オブジェクトとして検出する検出部と、ユーザから操作命令文の入力を受け付ける入力部と、前記辞書データベースを参照して、前記入力部にて入力された操作命令文の解析を行い、過去のオブジェクトに関する文言が含まれている場合には、現在のオブジェクトに関する文言と過去のオブジェクトに関する文言とを区別して階層構造を求める空間的意味構造解析部と、前記階層構造でもって接続された複数の部分モデルを有し、各部分モデルの確信度の関数として確信度が求まる確信度モデルであって、前記各部分モデルが、前記文言に関する第１の変数、オブジェクトの空間的相対関係に関する第２の変数群、前記オブジェクトが前記文言に対応する度合いに関する第３の変数を有する確信度モデルを生成する確信度モデル生成部と、前記検出部にて検出されたオブジェクトの空間的相対関係のデータを前記第２の変数群に適用して前記複数の部分モデルの確信度の関数として求められる確信度を計算し、前記確信度が最大となるオブジェクトの空間的相対関係を決定する照合部と、前記操作命令文の解析結果を出力する出力部とを備える。 An information processing apparatus according to the present invention acquires a dictionary database storing words and co-occurrence of objects that co-occur on verbs representing operation contents, and time-series data of spatial relative relationships related to objects existing in an external space. A detection unit that detects, as a past object, a state of an object different from the current acquired in the past when the state of the object has changed, an input unit that receives an input of an operation command sentence from a user, and the dictionary database Referring to, the operation command sentence input in the input unit is analyzed, and when a word related to a past object is included, a word related to a current object is distinguished from a word related to a past object. A spatial semantic structure analysis unit for obtaining a hierarchical structure, and a plurality of partial models connected by the hierarchical structure. A certainty factor model in which the certainty factor is obtained as a function of the certainty factor of each partial model, wherein each partial model includes a first variable relating to the wording and a second variable group relating to a spatial relative relation of the objects. , A confidence model generation unit that generates a confidence model having a third variable related to the degree to which the object corresponds to the wording, and data of spatial relative relations of the objects detected by the detection unit. A collation unit that is applied to a group of variables to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, and determines a spatial relative relation of an object that has the maximum certainty factor, and the operation command statement And an output unit for outputting the analysis result.

また、本発明の情報処理装置は、操作内容を表す動詞に共起する文言およびオブジェクトの空間的相対関係を記憶した辞書データベースと、外部空間に存在するオブジェクトに関する空間的相対関係の時系列データを取得し、オブジェクトの状態が変化している場合には、予測される未来のオブジェクトの状態を未来オブジェクトとして検出する検出部と、ユーザから操作命令文の入力を受け付ける入力部と、前記辞書データベースを参照して、前記入力部にて入力された操作命令文の解析を行い、未来のオブジェクトに関する文言が含まれている場合には、現在のオブジェクトに関する文言と未来のオブジェクトに関する文言とを区別して階層構造を求める空間的意味構造解析部と、前記階層構造でもって接続された複数の部分モデルを有し、各部分モデルの確信度の関数として確信度が求まる確信度モデルであって、前記各部分モデルが、前記文言に関する第１の変数、オブジェクトの空間的相対関係に関する第２の変数群、前記オブジェクトが前記文言に対応する度合いに関する第３の変数を有する確信度モデルを生成する確信度モデル生成部と、前記検出部にて検出されたオブジェクトの空間的相対関係のデータを前記第２の変数群に適用して前記複数の部分モデルの確信度の関数として求められる確信度を計算し、前記確信度が最大となるオブジェクトの空間的相対関係を決定する照合部と、前記操作命令文の解析結果を出力する出力部とを備える。 In addition, the information processing apparatus of the present invention includes a dictionary database that stores words and co-occurrence of objects that co-occur on verbs representing operation contents, and time-series data of spatial relationships of objects that exist in external space. And when the state of the object is changed, a detection unit that detects the predicted state of the future object as a future object, an input unit that receives an input of an operation command from the user, and the dictionary database Referring to the analysis of the operation command sentence input in the input unit, if the word about the future object is included, the word about the current object and the word about the future object are distinguished It has a spatial semantic structure analysis unit for determining the structure and a plurality of partial models connected by the hierarchical structure. A certainty factor model in which the certainty factor is obtained as a function of the certainty factor of each partial model, wherein each partial model includes a first variable relating to the wording, a second variable group relating to a spatial relative relation of objects, and the object A confidence model generation unit that generates a confidence model having a third variable relating to the degree corresponding to the wording, and data of the spatial relative relationship between the objects detected by the detection unit. A collation unit that calculates the certainty factor obtained as a function of the certainty factor of the plurality of partial models and determines the spatial relative relationship of the object having the maximum certainty factor, and the analysis result of the operation command statement An output unit.

このように検出部では、空間的相対関係の時系列データに基づいて、オブジェクトの状態が変化している場合に、過去オブジェクト、未来オブジェクトを検出する一方で、空間的意味構造解析部では、操作命令文を解析して、現在のオブジェクトと、過去オブジェクト又は未来オブジェクトを区別して階層構造を求めるので、確信度の計算においては、過去オブジェクトを過去オブジェクトの部分モデルに、未来オブジェクトを未来オブジェクトの部分モデルに適用して、適切な言語処理解析を行える。 As described above, the detection unit detects past objects and future objects when the state of the object is changed based on the time series data of the spatial relative relationship, while the spatial semantic structure analysis unit detects the operation. Analyzing the statement to distinguish the current object from the past object or future object to obtain the hierarchical structure, so in the calculation of confidence, the past object is used as a partial model of the past object, and the future object is used as a part of the future object. Appropriate language processing analysis can be applied to the model.

本発明の情報処理方法は、上記情報処理装置が実行する方法であり、本発明のプログラムは、上記情報処理装置が実行する方法を実現するためのプログラムである。 The information processing method of the present invention is a method executed by the information processing apparatus, and the program of the present invention is a program for realizing the method executed by the information processing apparatus.

本発明は、主語や目的語等が省略された文であっても、環境内に存在する物体と操作指示内容とを適切に関連付けて言語処理を行うことができる。 The present invention can perform language processing by appropriately associating an object existing in the environment with the contents of an operation instruction even if the subject, object, etc. are omitted.

実施の形態の情報処理装置の構成を示す図である。It is a figure which shows the structure of the information processing apparatus of embodiment. 辞書データベースに記憶されたデータの例を示す図である。It is a figure which shows the example of the data memorize | stored in the dictionary database. 運転シーンの例を示す図である。It is a figure which shows the example of a driving scene. 操作命令文の構造解析結果の例を示す図である。It is a figure which shows the example of the structure analysis result of an operation command sentence. 確率モデルの例を示す図である。It is a figure which shows the example of a probability model. 情報処理装置が外部オブジェクトの特徴量のデータを取得する動作を示す図である。It is a figure which shows the operation | movement in which information processing apparatus acquires the data of the feature-value of an external object. ドライバから操作命令がなされたときにその内容を解析する動作を示す図である。It is a figure which shows the operation | movement which analyzes the content, when the operation command is made from the driver. （ａ）は駐車場の例を示す図である。（ｂ）駐車場に設定した座標系を示す図である。（ｃ）は座標系にオブジェクトの基本属性とオプション属性を関連付けたプライマリ環境オブジェクトの例を示す図である。(A) is a figure showing an example of a parking lot. (B) It is a figure which shows the coordinate system set to the parking lot. (C) is a figure which shows the example of the primary environment object which linked | related the basic attribute of the object and the option attribute to the coordinate system. 自車位置から駐車スペースまでのルートを規定したセカンダリ環境オブジェクトの例を示す図である。It is a figure which shows the example of the secondary environmental object which prescribed | regulated the route from the own vehicle position to a parking space.

以下、本発明の実施の形態の情報処理装置について図面を参照しながら説明する。本実施の形態では、情報処理装置は自動車に搭載される装置であることを前提とし、自動車の運転シーンにおいて、ドライバが指示した操作命令を解析する。 Hereinafter, an information processing apparatus according to an embodiment of the present invention will be described with reference to the drawings. In the present embodiment, it is assumed that the information processing apparatus is an apparatus mounted on an automobile, and an operation command instructed by the driver is analyzed in the driving scene of the automobile.

図１は、実施の形態の情報処理装置１の構成を示す図である。情報処理装置１は、外部の環境に関するデータを検出する検出部２と、ドライバからの操作命令を入力する入力部３と、入力された操作命令文の階層構造を解析する空間的意味構造解析部４と、階層構造に基づいて操作命令文の確率モデルを生成する確率モデル生成部５と、確率モデルに対して検出されたオブジェクトの空間的相対関係のデータを代入して確率値を求め、操作命令に係るオブジェクトを照合する照合部６と、照合部６での照合結果に基づいて操作命令の解析結果を出力する出力部７とを備えている。情報処理装置１は、ハードウェアとしては、電子制御ユニット（ＥＣＵ）によって構成される。このようなＥＣＵを制御するためのプログラムも本発明の範囲に含まれる。 FIG. 1 is a diagram illustrating a configuration of an information processing apparatus 1 according to the embodiment. The information processing apparatus 1 includes a detection unit 2 that detects data related to an external environment, an input unit 3 that inputs an operation command from a driver, and a spatial semantic structure analysis unit that analyzes a hierarchical structure of the input operation command sentence 4, a probability model generation unit 5 that generates a probability model of an operation command sentence based on the hierarchical structure, and obtains a probability value by substituting the data of the spatial relative relationship of the detected object with respect to the probability model, A collation unit 6 that collates objects related to the command, and an output unit 7 that outputs an analysis result of the operation command based on the collation result in the collation unit 6 are provided. The information processing apparatus 1 includes an electronic control unit (ECU) as hardware. Such a program for controlling the ECU is also included in the scope of the present invention.

情報処理装置１には、カメラ１１と測位装置１２が接続されている。カメラ１１は、車両の周辺を撮像し、撮像により得られた画像データを検出部２に送信する。測位装置１２は、自車両の位置を測位する装置であって、例えば、ＧＰＳ受信機でＧＰＳ衛星からの信号を受信して測位してもよいし、ジャイロ・加速度センサなどから得られるデータに基づいて現在位置を計算してもよい。 A camera 11 and a positioning device 12 are connected to the information processing apparatus 1. The camera 11 images the periphery of the vehicle and transmits image data obtained by the imaging to the detection unit 2. The positioning device 12 is a device that measures the position of the host vehicle. For example, the positioning device 12 may receive a signal from a GPS satellite with a GPS receiver and may measure the position, or may be based on data obtained from a gyro / acceleration sensor or the like. The current position may be calculated.

検出部２は、カメラ１１にて撮影されたカメラ画像を解析し、画像から、他車両、歩行者、路側物、駐車スペース、道路ペイント等のオブジェクトを抽出し、抽出したオブジェクトの空間的相対関係のデータを取得する。ここで、空間的相対関係のデータとは、空間に存在するオブジェクトどうしの位置関係であり、例えば、オブジェクトＡとオブジェクトＢが隣接しているとか、オブジェクトＡとオブジェクトＢの距離が○○ｍとか、オブジェクトＡはオブジェクトＢとオブジェクトＣとの間に存在する等の関係である。 The detection unit 2 analyzes the camera image captured by the camera 11, extracts objects such as other vehicles, pedestrians, roadside objects, parking spaces, and road paints from the image, and the spatial relative relationship of the extracted objects. Get the data. Here, the spatial relative relationship data is a positional relationship between objects existing in the space. For example, the object A and the object B are adjacent to each other, or the distance between the object A and the object B is OOm. The object A has a relationship such as existing between the object B and the object C.

検出部２は、オブジェクトの空間的相対関係のみならず、オブジェクトの属性のデータを取得してもよい。オブジェクトの属性とは、例えば、オブジェクトのタイプ、大きさ、色等である。検出部は、オブジェクトの空間的相対関係及び属性を環境データベース（以下、「環境ＤＢ」という）９に記憶する。本明細書では、オブジェクトの空間的相対関係と属性を合わせて、オブジェクトの特徴量という。 The detection unit 2 may acquire not only the spatial relative relationship between the objects but also the attribute data of the objects. The object attributes are, for example, the type, size, color, etc. of the object. The detection unit stores the spatial relative relationship and attributes of the objects in an environment database (hereinafter referred to as “environment DB”) 9. In this specification, the spatial relative relationship and attributes of an object are combined and referred to as an object feature amount.

検出部２は、地図データベース（以下、「地図ＤＢ」という）８に接続されており、地図ＤＢ８から車両の現在位置周辺の地図データを読み出すことができる。検出部２は、カメラ画像から得られたオブジェクトの位置を、自車両を基準とする座標系に変換する。 The detection unit 2 is connected to a map database (hereinafter referred to as “map DB”) 8 and can read map data around the current position of the vehicle from the map DB 8. The detection unit 2 converts the position of the object obtained from the camera image into a coordinate system based on the host vehicle.

また、検出部２は、オブジェクトのデータを経時的に検出し、例えば、オブジェクトが動いている等のようにオブジェクトの状態に変化があった場合には、環境ＤＢ９に過去オブジェクトとして記憶する。オブジェクトが、一定時間以上、同じ場所に滞留した場合には、その点に来た時刻、その場所から動いた時刻で、時系列をセグメンテーションし、３つの異なる過去オブジェクトに分割してもよいし、初期時刻から滞留していた場合には、滞留していた間と、動いた時刻の２つの過去オブジェクトに分割してもよい。 Further, the detection unit 2 detects the data of the object over time, and stores the data in the environment DB 9 as a past object when the state of the object has changed, for example, the object is moving. If an object stays in the same place for more than a certain time, the time series may be segmented and divided into three different past objects according to the time of arrival at that point and the time of movement from that place, In the case of staying from the initial time, it may be divided into two past objects of staying time and moving time.

また、検出部２は、過去から現在までのオブジェクトの動きから、オブジェクトの将来の動きを予測し、環境ＤＢ９に未来オブジェクトとして記憶する。未来オブジェクトは所定の周期で、更新される。 Further, the detection unit 2 predicts the future movement of the object from the movement of the object from the past to the present, and stores it as a future object in the environment DB 9. Future objects are updated at a predetermined cycle.

例えば、駐車スペースに停車していた車両（オブジェクト）が出て行く場合には、現在移動中の車両が現在オブジェクトであり、過去に取得した駐車スペースに停車していた車両が過去オブジェクトである。また、現在の車両の動きから、車両が駐車場の出口に向かっていると予測できるときには、将来、駐車場の出口にいるであろう車両が未来オブジェクトである。このように、過去オブジェクト及び未来オブジェクトは、現在は存在しない仮想的なオブジェクトである。検出部２が検出して環境ＤＢ９に記憶されるオブジェクトの特徴量のデータの詳細は後述する。 For example, when a vehicle (object) parked in the parking space leaves, the vehicle currently moving is the current object, and the vehicle parked in the parking space acquired in the past is the past object. Further, when it can be predicted from the current movement of the vehicle that the vehicle is heading toward the exit of the parking lot, the vehicle that will be at the exit of the parking lot in the future is the future object. Thus, the past object and the future object are virtual objects that do not exist at present. Details of the feature amount data of the object detected by the detection unit 2 and stored in the environment DB 9 will be described later.

入力部３は、音声を入力するマイクと音声認識部によって構成される。入力部３は、ドライバが発した音声をテキストデータに変換して空間的意味構造解析部４に渡す。 The input unit 3 includes a microphone that inputs voice and a voice recognition unit. The input unit 3 converts the voice uttered by the driver into text data and passes it to the spatial semantic structure analysis unit 4.

空間的意味構造解析部４は、入力部３より入力された操作命令文を単独で実質的な意味を有する形態素に分割し、分割された形態素どうしの階層構造を解析する。なお、入力された操作命令文が複文である場合には、単文に分割した上で、それぞれの単文の階層構造を解析する。 The spatial semantic structure analysis unit 4 divides the operation command sentence input from the input unit 3 into morphemes having substantial meanings alone, and analyzes the hierarchical structure of the divided morphemes. If the input operation command sentence is a compound sentence, it is divided into simple sentences and the hierarchical structure of each simple sentence is analyzed.

空間的意味構造解析部４は、「そこ」「あそこ」等の照応詞に対応する指示対象を特定する。また、日本語の場合には、聞き手が容易に推測できる語は頻繁に省略されるが、このようないわゆる「ゼロ代名詞」の文においては、空間的意味構造解析部４は、省略されている語が隠れていることを解析する。 The spatial semantic structure analysis unit 4 identifies an instruction target corresponding to an anaphor such as “there” or “that”. In the case of Japanese, words that can be easily guessed by the listener are frequently omitted, but in such a so-called “zero pronoun” sentence, the spatial semantic structure analysis unit 4 is omitted. Analyze that the word is hidden.

また、空間的意味構造解析部４は、時制（現在、過去、未来）のモダリティ解析を行い、操作命令文に含まれているオブジェクトが、過去オブジェクト、（現在）オブジェクト、未来オブジェクトのいずれであるかを決める。例えば、「赤い車のあったところに停めて」という操作命令文の場合には、「赤い車」は、過去オブジェクトであると判定する。「前の車が停まったら、その隣りに駐車して」という操作命令文の場合には、「前の車」は、未来オブジェクトであると判定する。 Further, the spatial semantic structure analysis unit 4 performs a tense (current, past, future) modality analysis, and the object included in the operation command statement is any of a past object, a (current) object, or a future object. Decide what. For example, in the case of an operation command sentence “stop where there was a red car”, it is determined that “red car” is a past object. In the case of an operation command sentence “when the previous car stops, park next to it”, it is determined that “the previous car” is a future object.

図２は、辞書データベース（以下、「辞書ＤＢ」という）１０に記憶されたデータの例を示す図である。辞書ＤＢ１０は、運転シーンで用いられる動詞に関連付けて、その運転シーンで当該動詞と共起する名詞句のデータが記憶されていると共に、動詞と共起する特徴量（空間的相対関係／属性）のデータが記憶されている。例えば、「停める」という動詞に関連付けて、当該動詞に共起する名詞句として「駐車スペースに」「路側に」「入口の近くに」が記憶されている。 FIG. 2 is a diagram showing an example of data stored in a dictionary database (hereinafter referred to as “dictionary DB”) 10. The dictionary DB 10 stores data of noun phrases that co-occur with the verb in the driving scene in association with the verb used in the driving scene, and features (spatial relative relationship / attribute) co-occurring with the verb. Is stored. For example, “in the parking space”, “on the roadside”, and “near the entrance” are stored as noun phrases co-occurring with the verb “stop”.

また、例えば、「停める」「駐車スペースに」との文言に関連付けて、「駐車スペース：Ｔｒｕｅ前方：Ｔｒｕｅ距離：［１，５］上にある：｛｝」という特徴量が記憶されている。これは、操作文の中に「停める」に共起する「駐車スペース」という文言がある場合には、「駐車スペース」が「前方」に存在し（Ｔｒｕｅ）、その距離は１〜５ｍである（Ｔｒｕｅ）という空間的な特徴量が共起していることを示している。 Further, for example, in association with the words “stop” and “in the parking space”, the feature quantity “parking space: True forward: True distance: [1, 5] is on: {}” is stored. This is because, if there is a phrase “parking space” that co-occurs in “stop” in the operation statement, “parking space” exists “true” (True), and the distance is 1 to 5 m. This indicates that a spatial feature amount (True) co-occurs.

動詞に共起する名詞句は、運転シーンで用いられる大量の操作命令文を教師データとして学習を行うことによって求める。また、特徴量のデータについても、運転シーンを撮影した大量の映像データを解析し、ドライバが発した操作命令文に含まれる動詞に共起する車の動きを検出して、辞書ＤＢ１０を生成する。 A noun phrase that co-occurs in a verb is obtained by learning a large amount of operation command sentences used in a driving scene as teacher data. Also, with respect to the feature amount data, a large amount of video data obtained by photographing a driving scene is analyzed, and the movement of the car co-occurring with the verb included in the operation command sentence issued by the driver is detected, and the dictionary DB 10 is generated. .

なお、辞書ＤＢ１０は、運転シーンに応じて、複数の辞書ＤＢを備えてもよい。例えば、駐車場で用いられる辞書ＤＢ、市街地で用いられる辞書ＤＢ、高速道路で用いられる辞書ＤＢというように複数の辞書ＤＢを有することにより、それぞれの運転シーンで用いられる文言を適切に解析できる。 The dictionary DB 10 may include a plurality of dictionary DBs depending on the driving scene. For example, by having a plurality of dictionary DBs such as a dictionary DB used in a parking lot, a dictionary DB used in an urban area, and a dictionary DB used on a highway, words used in each driving scene can be analyzed appropriately.

確率モデル生成部５は、空間的意味構造解析部４にて解析された操作命令文の階層構造に基づいて、操作命令文の確率モデルを生成する。確率モデルは、複数のノードが階層構造でもって接続された構成を有しており、各ノードは、操作命令文にかかる文言を第１の確率変数λ、オブジェクトの特徴量のデータを第２の確率変数γ、文言とオブジェクトの特徴量が対応する度合いに関する第３の確率変数φを有する。 The probability model generation unit 5 generates a probability model of the operation command sentence based on the hierarchical structure of the operation command sentence analyzed by the spatial semantic structure analysis unit 4. The probabilistic model has a configuration in which a plurality of nodes are connected in a hierarchical structure, and each node uses a first random variable λ as a word for an operation command sentence, and second data as a feature quantity of an object. It has a random variable γ and a third random variable φ relating to the degree to which the wording corresponds to the feature quantity of the object.

後述する図５では、図の便宜上、各ノードが有する第２の確率変数γを一つずつとしているが、実際には、一つの文言に対しては複数の特徴量が存在するのが通常なので、一つのノードは複数の第２の確率変数λを有する。第３の確率変数φは、第１の確率変数λの値と第２の確率変数γの値とが対応しているときに「１」（Ｔｒｕｅ）、対応していないときに「０」（Ｆａｌｓｅ）をとる。第１の確率変数λ、第２の確率変数γ、第３の確率変数φを有するノードが「部分モデル」を構成し、部分モデルが集まって確率モデルが構成されている。確率モデルの確率値は、各部分モデルの確率値の関数として求まる。ここでは、各部分モデルの確率値の積が確率モデル全体の確率値として求まる。 In FIG. 5, which will be described later, for convenience of illustration, each node has one second random variable γ. However, in practice, there are usually a plurality of feature quantities for one wording. , One node has a plurality of second random variables λ. The third random variable φ is “1” (True) when the value of the first random variable λ corresponds to the value of the second random variable γ, and “0” ( False). Nodes having the first random variable λ, the second random variable γ, and the third random variable φ constitute a “partial model”, and the partial models are assembled to form a probability model. The probability value of the probability model is obtained as a function of the probability value of each partial model. Here, the product of the probability values of the partial models is obtained as the probability value of the entire probability model.

本実施の形態では、第１の確率変数λ、第２の確率変数γに対応する特徴量を求めることを目的としているので、確率変数φには「１」（Ｔｒｕｅ）を代入しておく。なお、辞書ＤＢ１０を学習する際には、確率変数φを変数として用いる。 In the present embodiment, since the object is to obtain feature quantities corresponding to the first random variable λ and the second random variable γ, “1” (True) is substituted for the random variable φ. Note that the probability variable φ is used as a variable when learning the dictionary DB 10.

図３は、以下に説明する運転シーンの例を示す図である。このシーンにおいて、ドライバから「自販機の横の赤い車が出て行ったら、そこに停めて」という操作命令が発せられた場合を考える。図３は、便宜上、白黒で記載しているが、駐車場に停車している３台の車両のうち、車体を黒く記載している２台の車両は赤い車であるとする。なお、点線で示した車両Ｃ１は、過去に駐車スペースに駐車していたが、現在は駐車スペースから出たことを示している。この例において、ドライバの操作命令文は、赤い車Ｃ１が出て行った後に、赤い車Ｃ１が停まっていた場所Ｐ１に停めることを意図しているが、赤い車Ｃ１が出て行った後に、赤い車Ｃ１は存在しないため、赤い車Ｃ２の隣りに停めるという命令であると誤判定する可能性があった。 FIG. 3 is a diagram illustrating an example of a driving scene described below. In this scene, let us consider a case where an operation instruction is issued from the driver: “If a red car next to the vending machine goes out, stop there”. Although FIG. 3 is shown in black and white for the sake of convenience, it is assumed that of the three vehicles parked at the parking lot, the two vehicles with the vehicle body shown in black are red vehicles. In addition, although the vehicle C1 shown with the dotted line has parked in the parking space in the past, it has shown that it came out of the parking space now. In this example, the operation instruction of the driver is intended to stop at the place P1 where the red car C1 has stopped after the red car C1 leaves, but after the red car C1 leaves Since the red car C1 does not exist, there is a possibility that it is erroneously determined that the instruction is to stop next to the red car C2.

図４は、「自販機の横の赤い車が出て行ったら、そこに停めて」という操作命令文の空間的意味構造解析結果を示す図である。「自販機の横の赤い車が出て行ったら、そこに停めて」という文は、「自販機の横の赤い車が出て行ったら」というランドマークＬと、「停めて」という動詞Ｖに分解される。ここで、「自販機の横の赤い車が出て行ったら」というランドマークＬは空間的記述句（Spatial Description Clause）であり、「自販機の横」というランドマークＬと、「出て行く」という動詞Ｖと、「赤い車」というオブジェクトＦに分解される。さらに、「自販機の横」というランドマークＬは空間的記述句であり、「自販機」というランドマークＬと、「横」という空間的関係ＳＲと、「ｐ赤い車」というオブジェクトＦに分解される。図３に示すシーンにおいて「ｐ赤い車」は現存しないが、赤い車が過去に自販機の横の駐車スペースＰ１に存在していたことを示す仮想的なオブジェクト（過去オブジェクト）である。 FIG. 4 is a diagram illustrating a spatial semantic structure analysis result of an operation command sentence “If a red car beside the vending machine goes out, stop there”. The sentence “When the red car next to the vending machine goes out, stop there” is broken down into the landmark L “When the red car next to the vending machine goes out” and the verb V “Stop” Is done. Here, the landmark L "When the red car next to the vending machine goes out" is a Spatial Description Clause, and the landmark L "next to the vending machine" and "Go out" It is broken down into a verb V and an object F called “red car”. Further, the landmark L “next to the vending machine” is a spatial descriptive phrase, and is decomposed into a landmark L “vending machine”, a spatial relationship SR “next”, and an object F “p red car”. . In the scene shown in FIG. 3, “p red car” does not exist, but is a virtual object (past object) indicating that a red car has existed in the parking space P1 next to the vending machine in the past.

このように空間的意味構造解析部４は操作命令文を解析して、図４に示すような階層構造を求める。また、空間的意味構造解析部４は、「自販機の横の赤い車が出て行ったら、そこに停めて」という操作命令文のモダリティ解析を行い、仮定法の表現が含まれていることから、照応先に対応する過去オブジェクトを生成している。なお、仮定法の表現の場合に必ず過去オブジェクトというわけではなく、未来オブジェクトのこともあり得る。操作命令文のモダリティ解析は、公知の技術を用いて行うことが可能である。 As described above, the spatial semantic structure analysis unit 4 analyzes the operation command sentence to obtain a hierarchical structure as shown in FIG. In addition, the spatial semantic structure analysis unit 4 analyzes the modality of the operation command sentence “If the red car next to the vending machine goes out, stop there”, and the expression of the assumption method is included. The past object corresponding to the anaphora is generated. In the case of the assumption method expression, the object is not necessarily a past object, but may be a future object. The modality analysis of the operation command statement can be performed using a known technique.

図５は、図４に示す空間的意味構造解析結果に基づいて確率モデルを生成した例を示す図である。確率モデル生成部５は、図５に示すように過去オブジェクトξに対応するノードを含む確率モデルを生成する。各ノードは確率値を表わし、確率モデル全体の確率値は、各ノードの確率値を乗じることにより求めることができる。 FIG. 5 is a diagram showing an example in which a probability model is generated based on the spatial semantic structure analysis result shown in FIG. The probability model generation unit 5 generates a probability model including a node corresponding to the past object ξ as shown in FIG. Each node represents a probability value, and the probability value of the entire probability model can be obtained by multiplying the probability value of each node.

照合部６は、確率モデルが有する確率変数に対して値を代入することで、確率モデル全体の確率値を計算し、確率モデル全体の確率値を最大にする確率変数を求める。照合部６は、検出した特徴量のデータを確率モデルに順次代入し、そのときの部分モデルの確率値を求める。第２の確率変数γに代入する値は、検出部２にて検出された特徴量のデータである。具体的には、空間的意味構造解析部４により、辞書ＤＢ１０に基づいて空間的意味構造解析を行う際に、操作命令文の文言に共起する特徴量のうち、共起関係が強い特徴量を候補として記憶しておき、照合を行う際には、記憶された特徴量のデータを用いる。 The matching unit 6 calculates a probability value of the entire probability model by substituting a value for the random variable included in the probability model, and obtains a random variable that maximizes the probability value of the entire probability model. The matching unit 6 sequentially substitutes the detected feature data into the probability model, and obtains the probability value of the partial model at that time. The value to be substituted for the second random variable γ is feature value data detected by the detection unit 2. Specifically, when the spatial semantic structure analysis unit 4 performs the spatial semantic structure analysis based on the dictionary DB 10, among the feature quantities that co-occur in the words of the operation command sentence, the feature quantities having a strong co-occurrence relationship. Are stored as candidates, and the stored feature value data is used when collation is performed.

また、操作命令文において、指示詞やゼロ代名詞等によって文言が省略されている場合には、確率モデルの当該対象のノードの第１の確率変数λに対して文言を代入することで、部分モデルの確率値を求める。この際、第１の確率変数λに代入する値は、空間的意味構造解析部４が辞書ＤＢ１０に基づいて空間的意味構造解析を行って、文言が省略されていることを解析したときに、操作命令文の文言に共起する文言のうち、共起関係が強い文言を候補として記憶しておき、照合を行う際には、記憶された文言のデータを用いる。 In addition, in the operation command sentence, when a word is omitted by a directive, zero pronoun, or the like, the partial model is substituted by substituting the word into the first random variable λ of the target node of the probability model. Find the probability value of. At this time, the value to be substituted into the first random variable λ is obtained when the spatial semantic structure analysis unit 4 performs spatial semantic structure analysis based on the dictionary DB 10 and analyzes that the wording is omitted. Among the words that co-occur with the words of the operation command sentence, words having a strong co-occurrence relationship are stored as candidates, and the stored word data is used when collation is performed.

照合部６は、部分モデルの確率値の積をとって、確率モデル全体の確率値を計算する。これにより、操作命令文において指示されたオブジェクトや省略された文言を適切に決定することができる。 The matching unit 6 calculates the probability value of the entire probability model by taking the product of the probability values of the partial models. Thereby, the object instruct | indicated in the operation command sentence and the abbreviation text can be determined appropriately.

出力部７は、照合部６にて求められたオブジェクトのデータに基づいて、操作命令文の解析結果を出力する。すなわち、照合部６にて対応するオブジェクトを確定することにより、操作命令文が外部空間内のオブジェクトに対してどのような操作を指示しているのかの解析結果が得られるので、その結果を出力する。 The output unit 7 outputs the analysis result of the operation command sentence based on the object data obtained by the collation unit 6. In other words, by confirming the corresponding object in the collation unit 6, an analysis result indicating what operation the instruction statement instructs the object in the external space can be obtained, and the result is output. To do.

図６及び図７は、本実施の形態の情報処理装置１の動作を示すフローチャートである。図６は、情報処理装置１が外部オブジェクトの特徴量のデータを取得する動作を示し、図７は、ドライバから操作命令がなされたときにその内容を解析する動作を示す。図６に示す動作と図７に示す動作は並行して行われる。具体的には、図６に示す特徴量のデータの取得は常時行われており、ドライバから操作命令がなされたときに、図６に示す特徴量のデータの取得を行いつつ、図７に示す動作が行われる。 6 and 7 are flowcharts showing the operation of the information processing apparatus 1 according to the present embodiment. FIG. 6 shows an operation in which the information processing apparatus 1 acquires feature amount data of an external object, and FIG. 7 shows an operation in which the contents are analyzed when an operation command is issued from the driver. The operation shown in FIG. 6 and the operation shown in FIG. 7 are performed in parallel. Specifically, the feature amount data shown in FIG. 6 is always acquired. When an operation instruction is issued from the driver, the feature amount data shown in FIG. Operation is performed.

まず、外部オブジェクトの特徴量のデータの取得について説明する。図６に示すように、情報処理装置１は、カメラ１１にて撮影した画像を処理して外部オブジェクトを検出し（Ｓ１０）、検出した外部オブジェクトの位置を、自車両を基準としたローカル座標系に座標変換する（Ｓ１１）。ローカル座標系は、例えば、自車両を基準とし、自車両の進行方向を縦軸、進行方向を直交する方向を横軸とし、車両１台の大きさ、あるいはその半分を一目盛とする座標系である。そして、情報処理装置１は、ローカル座標系に変換された特徴量のデータを環境ＤＢ９に記憶する。 First, acquisition of feature amount data of an external object will be described. As illustrated in FIG. 6, the information processing apparatus 1 processes an image captured by the camera 11 to detect an external object (S10), and uses the position of the detected external object as a reference to the local vehicle coordinate system. (S11). The local coordinate system is, for example, a coordinate system in which the own vehicle is a reference, the traveling direction of the own vehicle is the vertical axis, the direction orthogonal to the traveling direction is the horizontal axis, and the size of one vehicle or a half thereof is a scale. It is. Then, the information processing apparatus 1 stores the feature value data converted into the local coordinate system in the environment DB 9.

ここで、オブジェクトの特徴量について一例を挙げて説明する。ただし、特徴量の規定の仕方は、以下に説明する方式に限定されるものではない。
オブジェクトには、環境自体を規定するプライマリ環境オブジェクトと、環境の中での位置関係を規定するセカンダリ環境オブジェクトがある。プライマリ環境オブジェクトは、すべてのオブジェクトが備える属性である基本属性（ＩＤ、座標）と、一部のオブジェクトが備えるオプション属性（タイプ、色、大きさ等）を有する。 Here, an example is given and demonstrated about the feature-value of an object. However, the method of defining the feature amount is not limited to the method described below.
The objects include a primary environment object that defines the environment itself and a secondary environment object that defines a positional relationship in the environment. The primary environment object has basic attributes (ID, coordinates) that are attributes included in all objects, and optional attributes (type, color, size, etc.) included in some objects.

プライマリ環境オブジェクトの例としては、道路セグメント、駐車スペース、ランドマーク、自車、道路ペイント、他車、自動販売機、フェンス、道路ランプ、ポール、交通標識等があり、これらのローカル座標の位置を示すデータによって規定される。 Examples of primary environmental objects include road segments, parking spaces, landmarks, own vehicles, road paints, other vehicles, vending machines, fences, road lamps, poles, traffic signs, etc. Defined by the data shown.

また、プライマリ環境オブジェクトには、座標を持たないデータとして、イベントのデータがある。例えば、駐車、前進、後退、目標に向かって進む、左折、右折、車線変更等である。 The primary environment object includes event data as data having no coordinates. For example, parking, moving forward, moving backward, moving toward the target, turning left, turning right, changing lanes, and the like.

セカンダリ環境オブジェクトも、基本属性（ＩＤ、プライマリ環境オブジェクトの組）と、オプション属性（タイプ等）を有する。セカンダリ環境オブジェクトの例としては、空き駐車スペース、位置関係（上、隣、間、正面、背面、横切る、右、左）、順序、駐車ルートがある。 The secondary environment object also has basic attributes (ID, a set of primary environment objects) and optional attributes (type, etc.). Examples of secondary environmental objects include empty parking spaces, positional relationships (upper, next, middle, front, back, cross, right, left), order, and parking route.

図８（ａ）は駐車場の例を示す図、図８（ｂ）は図８（ａ）の駐車場に設定した座標系を示す図、図８（ｃ）は座標系にオブジェクトの基本属性とオプション属性を関連付けたプライマリ環境オブジェクトの例を示す図である。図８（ｃ）に示すように、ローカル座標系に関連付けて、駐車スペース、レーン、トラック、小トラック、グレイバン等のオブジェクトの特徴量のデータを有する。 8A is a diagram showing an example of a parking lot, FIG. 8B is a diagram showing a coordinate system set in the parking lot in FIG. 8A, and FIG. 8C is a basic attribute of an object in the coordinate system. It is a figure which shows the example of the primary environment object which linked | related and the option attribute. As shown in FIG. 8C, data of feature quantities of objects such as parking spaces, lanes, trucks, small trucks, and gray bangs are associated with the local coordinate system.

図９は、自車位置から駐車スペースまでのルートを規定したセカンダリ環境オブジェクトの例を示す図である。例えば、「２」では、座標（２，０）から（１，０）（１，１）（１，２）を通って、座標（０，２）にある駐車スペースに入れるルートを規定している。 FIG. 9 is a diagram illustrating an example of a secondary environment object that defines a route from the vehicle position to the parking space. For example, “2” defines a route from coordinates (2,0) through (1,0) (1,1) (1,2) to the parking space at coordinates (0,2). Yes.

図８及び図９に示すように、ローカル座標系に、上で定義したプライマリ環境オブジェクト及びセカンダリ環境オブジェクトを与えることで、特徴量のデータを規定できる。以上、特徴量のデータについて説明した。情報処理装置１は、常時、外部環境をセンシングして外部オブジェクトを検出し、検出した外部オブジェクトの位置をローカル座標系に変換して、上記した特徴量のデータを抽出して記憶する。 As shown in FIGS. 8 and 9, feature amount data can be defined by giving the primary environment object and the secondary environment object defined above to the local coordinate system. The feature amount data has been described above. The information processing apparatus 1 always senses the external environment to detect an external object, converts the position of the detected external object to the local coordinate system, and extracts and stores the above-described feature amount data.

図７を参照して、ドライバから操作命令文が入力されたときの動作について説明する。ドライバが音声により操作命令を行うと、情報処理装置１は、入力部３にて操作命令文の入力を受け付ける（Ｓ２０）。情報処理装置１は、入力された操作命令文を形態素に分割し、照応解析を行って、操作命令文に省略されている指示詞やゼロ代名詞を補う（Ｓ２１）。 With reference to FIG. 7, an operation when an operation command statement is input from the driver will be described. When the driver issues an operation command by voice, the information processing apparatus 1 accepts an input of an operation command sentence at the input unit 3 (S20). The information processing apparatus 1 divides the input operation command sentence into morphemes, performs anaphora analysis, and supplements the directives and zero pronouns omitted in the operation command sentence (S21).

続いて、情報処理装置１は、空間的意味構造解析を行い、分割により得られた形態素どうしの階層構造を解析し（Ｓ２２）、その解析結果に基づいて空間的意味を抽出し（Ｓ２３）、例えば図４に示すような階層構造を求める。次に、情報処理装置１は、操作命令文の階層構造に基づいて、例えば図５に示すような確率モデルを生成する（Ｓ２４）。 Subsequently, the information processing apparatus 1 performs a spatial semantic structure analysis, analyzes the hierarchical structure of the morphemes obtained by the division (S22), extracts the spatial meaning based on the analysis result (S23), For example, a hierarchical structure as shown in FIG. 4 is obtained. Next, the information processing apparatus 1 generates a probability model as shown in FIG. 5, for example, based on the hierarchical structure of the operation command sentence (S24).

次に、情報処理装置１は、環境ＤＢ９から読み出したオブジェクトを確率モデルに順次代入して、確率モデル全体の確率値を求め、確率値が最大となったときに代入したオブジェクトを、操作命令文に対応するオブジェクトとして求める照合処理を行う（Ｓ２５）。そして、情報処理装置１は、操作命令文の解析結果を出力する（Ｓ２６）。以上、本実施の形態の情報処理装置１の構成及び動作について説明した。 Next, the information processing apparatus 1 sequentially substitutes the object read from the environment DB 9 into the probability model, obtains the probability value of the entire probability model, and designates the object substituted when the probability value is maximized as the operation command statement. The matching process for obtaining the object corresponding to is performed (S25). Then, the information processing apparatus 1 outputs the analysis result of the operation command sentence (S26). The configuration and operation of the information processing apparatus 1 according to the present embodiment have been described above.

本実施の形態では、空間的意味構造解析部４は、省略されている文言があることを解析し、これを前提とした確率モデルを生成する。これにより、操作命令文において省略された言葉を補う処理を適切に行うことができ、モニターなどの操作を扱うことなく、直感的に空間的な操作についての指示を与えることが可能になる。特に、日本語の場合には、英語と異なりゼロ代名詞と呼ばれる主語や目的語の省略が許されるため、まずそれらを特定しないと対応付け処理ができなかったが、本実施の形態によれば、適切な対応付け処理を行える。 In the present embodiment, the spatial semantic structure analysis unit 4 analyzes that there are omitted words and generates a probability model based on this. As a result, it is possible to appropriately perform processing that compensates for words omitted in the operation command sentence, and it is possible to intuitively give instructions for spatial operations without handling operations such as a monitor. In particular, in the case of Japanese, unlike English, omission of the subject and object called zero pronouns is allowed, so the matching process could not be performed without specifying them first, but according to this embodiment, Appropriate association processing can be performed.

また、本実施の形態では、空間的意味構造解析部４が、時制（現在、過去、未来）のモダリティ解析を行い、操作命令文に含まれているオブジェクトが、過去オブジェクト、（現在）オブジェクト、未来オブジェクトのいずれであるかを決める。その一方で、検出部２は、環境内のオブジェクトを検出した結果から、現在は存在しない仮想的な過去オブジェクト、未来オブジェクトのデータを記憶しておくので、操作命令文が過去又は未来のオブジェクトに関する操作を含んでいる場合であっても、適切に対応付け処理を行うことができる。これにより、環境が動的に変化する場合であっても、環境内のオブジェクトと操作命令文とを適切に関連付けて言語処理を行うことができる。また、確率変数に代入するのが、外部に存在するオブジェクトに絞り込まれるので、探索処理を軽減することができる。 Further, in the present embodiment, the spatial semantic structure analysis unit 4 performs a tense (current, past, future) modality analysis, and the objects included in the operation command statements are past objects, (current) objects, Decide which of the future objects. On the other hand, since the detection unit 2 stores data of virtual past objects and future objects that do not currently exist based on the result of detecting the objects in the environment, the operation command statement relates to past or future objects. Even when the operation is included, the association process can be appropriately performed. Thereby, even when the environment changes dynamically, the language processing can be performed by appropriately associating the objects in the environment with the operation command statements. Moreover, since the assignment to the random variable is narrowed down to the objects existing outside, the search process can be reduced.

以上、本発明の情報処理装置について、実施の形態を挙げて詳細に説明したが、本発明の情報処理装置は上記した実施の形態に限定されるものではない。
上記した実施の形態では、辞書ＤＢ１０には、動詞に関連付けて、その動詞に共起する名詞句のデータを記憶する例を挙げたが、接置詞によって分類した名詞を関連付けてもよい。名詞に接置詞の情報を付加することで、名詞の表層格（ガ格、ハ格、ヲ格、ニ格、カラ格、ヘ格、ト格、ヨリ格、デ格等）に応じて、動詞と名詞の共起関係の強いデータベースを生成できる。また、必ずしも、表層格の情報ではなく、辞書ＤＢは、動作主格、対象格、場所格などの深層格の情報によって、共起関係を規定してもよい。 The information processing apparatus of the present invention has been described in detail with reference to the embodiment. However, the information processing apparatus of the present invention is not limited to the above-described embodiment.
In the above-described embodiment, an example in which data of noun phrases co-occurring with a verb is stored in the dictionary DB 10 in association with a verb has been described, but a noun classified by an infix may be associated. By adding infix information to nouns, verbs can be used according to the surface case of the noun (ga, ha, wo, ni, kara, het, to, yori, de, etc.) A database with strong co-occurrence of nouns can be generated. In addition, the dictionary DB may define the co-occurrence relationship based on deep case information such as an action case, a target case, and a place case, not necessarily information on the surface case.

上記した実施の形態では、文言とオブジェクトとの対応付けの確信度を計算するモデルとして、確率モデルを用いる例を挙げて説明したが、確信度モデルは確率モデルに限定されず、ニューラルネットワークの識別モデルや、ＳＶＭを用いてもよい。すなわち、各部分モデルをニューラルネットワークやＳＶＭで構成しておき、確信度モデル全体の確信度が各部分モデルの確信度の積として求まるように構成する。 In the above-described embodiment, an example using a probability model has been described as a model for calculating the certainty of correspondence between a word and an object. However, the certainty model is not limited to the probability model, and the neural network is identified. A model or SVM may be used. That is, each partial model is configured by a neural network or SVM, and the reliability of the entire certainty model is determined as a product of the certainty of each partial model.

上記した実施の形態では、空間的意味構造解析により操作命令文において省略された文言を求める処理と、過去オブジェクトや未来オブジェクトを用いて確率モデルを生成する処理の両方を含んだ例を挙げたが、上記２つの処理は必ずセットでなければならないというわけではなく、いずれか片方のみを実装した装置も本発明の範囲に含まれる。 In the embodiment described above, an example including both processing for obtaining a sentence omitted in an operation command sentence by spatial semantic structure analysis and processing for generating a probability model using a past object and a future object is given. The above two processes do not necessarily have to be set, and an apparatus in which only one of them is mounted is also included in the scope of the present invention.

本発明は、主語や目的語等が省略された文であっても、環境内に存在する物体と操作指示内容とを適切に関連付けて言語処理を行うことができ、言語解析処理に有用である。 INDUSTRIAL APPLICABILITY The present invention can perform language processing by appropriately associating an object existing in the environment with the contents of an operation instruction even if the subject, object, etc. are omitted, and is useful for language analysis processing. .

１情報処理装置
２検出部
３入力部
４空間的意味構造解析部
５確率モデル生成部
６照合部
７出力部
８地図ＤＢ
９環境ＤＢ
１０辞書ＤＢ
１１カメラ
１２測位装置
DESCRIPTION OF SYMBOLS 1 Information processing apparatus 2 Detection part 3 Input part 4 Spatial semantic structure analysis part 5 Probabilistic model generation part 6 Collation part 7 Output part 8 Map DB
9 Environment DB
10 Dictionary DB
11 Camera 12 Positioning device

Claims

A dictionary database that stores the words that co-occur on verbs representing the operation content and the spatial relative relationships of the objects;
A detection unit that acquires data of a spatial relative relationship with respect to an object existing in the external space;
An input unit for receiving an input of an operation command from a user;
Spatial semantic structure for obtaining a hierarchical structure of words included in an operation command sentence input in the input unit and for obtaining a hierarchical structure including words omitted in the operation command sentence with reference to the dictionary database An analysis unit;
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, A certainty factor model generating unit that generates a certainty factor model having a second variable group related to a spatial relative relationship between objects, a third variable related to the degree to which the object corresponds to the wording,
Applying the spatial relative relationship data of the objects detected by the detection unit to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, the certainty factor is A collation that determines the spatial relative relationship of the largest object,
An output unit for outputting an analysis result of the operation command statement;
An information processing apparatus comprising:

The information processing apparatus according to claim 1, wherein the certainty model is a probability model.

The information processing apparatus according to claim 1, wherein the detection unit performs coordinate conversion of the position of the detected object into a local coordinate system based on the host vehicle based on map information.

The detection unit acquires time-series data of the spatial positional relationship of the object, and when the state of the object has changed, detects the state of the object different from the current acquired in the past as a past object,
4. The spatial semantic structure analysis unit according to any one of claims 1 to 3, wherein, when a word relating to a past object is included, the hierarchical structure is determined by distinguishing a word relating to a current object and a word relating to a past object. The information processing apparatus described in 1.

The detection unit obtains time-series data of the spatial positional relationship of the object, and when the state of the object is changed, detects the state of the predicted future object as a future object,
5. The spatial semantic structure analysis unit according to any one of claims 1 to 4, wherein, when a word regarding a future object is included, the hierarchical structure is determined by distinguishing a word regarding a current object and a word regarding a future object. The information processing apparatus described in 1.

A dictionary database that stores the words that co-occur on verbs representing the operation content and the spatial relative relationships of the objects;
A detection unit that acquires time-series data of spatial relations related to objects existing in the external space, and detects the state of an object different from the current acquired in the past as a past object when the state of the object has changed When,
An input unit for receiving an input of an operation command from a user;
Referring to the dictionary database, the operation command sentence input in the input unit is analyzed, and when a word related to a past object is included, a word related to a current object and a word related to a past object; Spatial semantic structure analysis unit that determines the hierarchical structure by distinguishing
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, A certainty factor model generating unit that generates a certainty factor model having a second variable group related to a spatial relative relationship between objects, a third variable related to the degree to which the object corresponds to the wording,
Applying the spatial relative relationship data of the objects detected by the detection unit to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, the certainty factor is A collation that determines the spatial relative relationship of the largest object,
An output unit for outputting an analysis result of the operation command statement;
An information processing apparatus comprising:

A dictionary database that stores the words that co-occur on verbs representing the operation content and the spatial relative relationships of the objects;
A time series data of a spatial relative relationship with respect to an object existing in the external space, and when the state of the object is changed, a detection unit that detects a predicted state of the future object as a future object;
An input unit for receiving an input of an operation command from a user;
By referring to the dictionary database, analyzing the operation command sentence input in the input unit, and when a word regarding a future object is included, a word regarding a current object and a word regarding a future object Spatial semantic structure analysis unit that determines the hierarchical structure by distinguishing
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, A certainty factor model generating unit that generates a certainty factor model having a second variable group related to a spatial relative relationship between objects, a third variable related to the degree to which the object corresponds to the wording,
Applying the spatial relative relationship data of the objects detected by the detection unit to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, the certainty factor is A collation that determines the spatial relative relationship of the largest object,
An output unit for outputting an analysis result of the operation command statement;
An information processing apparatus comprising:

An information processing method for performing language processing of an operation command statement for an object existing in an external space,
An information processing device acquiring spatial relative data regarding an object existing in external space;
The information processing apparatus receives an input of an operation command sentence from a user;
The information processing apparatus obtains a hierarchical structure of words included in an input operation command sentence, and refers to a dictionary database that stores words that co-occur in a verb representing operation contents and a spatial relative relationship between objects. A step of obtaining a hierarchical structure including words omitted in the operation command sentence;
The information processing apparatus includes a plurality of partial models connected in the hierarchical structure, and a certainty factor model in which a certainty factor is obtained as a function of the certainty factor of each partial model, wherein each partial model includes the wording Generating a confidence model having a first variable relating to, a second variable group relating to a spatial relative relationship of objects, and a third variable relating to a degree to which the object corresponds to the wording;
The information processing apparatus calculates the certainty factor obtained as a function of the certainty factor of the plurality of partial models by applying the spatial relative relationship data of the detected object to the second variable group, and the certainty factor Determining the spatial relative relationship of the object for which
The information processing apparatus outputting an analysis result of the operation command statement;
An information processing method comprising:

An information processing method for performing language processing of an operation command statement for an object existing in an external space,
When the information processing device acquires time-series data of spatial relations regarding objects existing in the external space and the state of the object has changed, the state of the object different from the current acquired in the past is displayed as the past object. Detecting as a
The information processing apparatus receives an input of an operation command sentence from a user;
The information processing apparatus refers to a dictionary database that stores words and words that co-occur on verbs representing operation contents and a spatial relative relationship between objects, analyzes an operation command sentence input by a user, and relates to past objects If wording is included, distinguishing the wording for the current object and the wording for the past object to determine the hierarchical structure;
The information processing apparatus includes a plurality of partial models connected in the hierarchical structure, and a certainty factor model in which a certainty factor is obtained as a function of the certainty factor of each partial model, wherein each partial model includes the wording Generating a confidence model having a first variable relating to, a second variable group relating to a spatial relative relationship of objects, and a third variable relating to a degree to which the object corresponds to the wording;
The information processing apparatus calculates the certainty factor obtained as a function of the certainty factor of the plurality of partial models by applying the spatial relative relationship data of the detected object to the second variable group, and the certainty factor Determining the spatial relative relationship of the object for which
The information processing apparatus outputting an analysis result of the operation command statement;
An information processing method comprising:

An information processing method for performing language processing of an operation command statement for an object existing in an external space,
The information processing apparatus acquires time-series data of spatial relations regarding objects existing in the external space, and detects the predicted future object state as a future object when the state of the object has changed. Steps,
The information processing apparatus receives an input of an operation command sentence from a user;
The information processing apparatus refers to a dictionary database that stores words that co-occur on verbs representing operation contents and spatial relations of objects, analyzes an operation command sentence input by a user, and relates to a future object. If wording is included, distinguishing the wording for the current object and the wording for the future object to determine the hierarchical structure;
The information processing apparatus includes a plurality of partial models connected in the hierarchical structure, and a certainty factor model in which a certainty factor is obtained as a function of the certainty factor of each partial model, wherein each partial model includes the wording Generating a confidence model having a first variable relating to, a second variable group relating to a spatial relative relationship of objects, and a third variable relating to a degree to which the object corresponds to the wording;
The information processing apparatus calculates the certainty factor obtained as a function of the certainty factor of the plurality of partial models by applying the spatial relative relationship data of the object detected by the detection unit to the second variable group. And determining a spatial relative relationship of the object having the maximum certainty factor,
The information processing apparatus outputting an analysis result of the operation command statement;
An information processing method comprising:

A program for performing language processing of operation statements for objects existing in an external space,
Obtaining spatial relative data for objects present in external space;
A step of accepting an input of an operation command from a user;
The hierarchical structure of the words included in the input operation instruction sentence is obtained, and the operation instruction sentence is omitted in the operation instruction sentence with reference to the dictionary database storing the words that co-occur in the verb representing the operation contents and the spatial relative relationship of the objects. A step to obtain a hierarchical structure including
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, Generating a confidence model having a second variable group relating to a spatial relative relationship of objects, a third variable relating to a degree to which the object corresponds to the wording;
Applying the spatial relative relationship data of the detected object to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, and for the object having the maximum certainty factor Determining a spatial relative relationship;
Outputting an analysis result of the operation command statement;
A program that executes

A program for performing language processing of operation statements for objects existing in an external space,
Acquiring time-series data of spatial relations regarding objects existing in the external space, and detecting a state of an object different from the current acquired in the past as a past object if the state of the object has changed; ,
A step of accepting an input of an operation command from a user;
Referring to the dictionary database that stores the words that co-occur in the verb representing the operation contents and the spatial relative relationship of the objects, the operation command sentence input by the user is analyzed, and the words related to the past objects are included. In some cases, distinguishing the wording about the current object from the wording about the past object to obtain a hierarchical structure,
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, Generating a confidence model having a second variable group relating to a spatial relative relationship of objects, a third variable relating to a degree to which the object corresponds to the wording;
Applying the spatial relative relationship data of the detected object to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, and for the object having the maximum certainty factor Determining a spatial relative relationship;
Outputting an analysis result of the operation command statement;
A program that executes

A program for performing language processing of operation statements for objects existing in an external space,
Obtaining time-series data of a spatial relative relationship with respect to an object existing in the external space, and detecting a predicted future object state as a future object when the state of the object is changed;
A step of accepting an input of an operation command from a user;
Referring to a dictionary database that stores words that co-occur on verbs that represent operations and spatial relations of objects, it analyzes the operation commands entered by the user and includes words about future objects In some cases, determining the hierarchical structure by distinguishing the language of the current object from the language of the future object,
A certainty factor model having a plurality of partial models connected in the hierarchical structure, the certainty factor being obtained as a function of the certainty factor of each partial model, wherein each partial model is a first variable related to the wording, Generating a confidence model having a second variable group relating to a spatial relative relationship of objects, a third variable relating to a degree to which the object corresponds to the wording;
Applying the spatial relative relationship data of the objects detected by the detection unit to the second variable group to calculate a certainty factor obtained as a function of the certainty factor of the plurality of partial models, the certainty factor is Determining the largest spatial relationship of the objects;
Outputting an analysis result of the operation command statement;
A program that executes