JP2019008499A

JP2019008499A - Solution search processing device

Info

Publication number: JP2019008499A
Application number: JP2017122715A
Authority: JP
Inventors: 洋介藤原; Yosuke Fujiwara; 由泰高橋; Yoshiyasu Takahashi; 江端　智一; Tomokazu Ebata; 智一江端
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2017-06-23
Filing date: 2017-06-23
Publication date: 2019-01-17
Anticipated expiration: 2037-06-23
Also published as: JP6925179B2

Abstract

To facilitate the analysis of a solution process of a constraint satisfying solution by explicitly outputting constraint that causes the decision of a solution of a decision variable and information of other solutions that cause constraint propagation of the solution.SOLUTION: A solution search processing device includes: a search step-classified constraint propagation state extraction unit for calculating a changing selectable domain in selecting a value of a decision variable in each search step; a constraint propagation execution unit for executing constraint propagation; a prediction model learning unit for learning a prediction model with the value selected in each step as teacher data on the basis of a result calculated by the search step-classified constraint propagation state extraction unit, updating the domain according to the selected value, and performing learning; a value prediction unit for predicting a value of a decision variable to be selected by the prediction model, updating the domain according to the value selected by the constraint propagation execution unit, and performing search processing; and a cause explanation unit for outputting information of a selectable domain to be a cause in the case of predicting a value of a specified decision variable.SELECTED DRAWING: Figure 2

Description

本発明は、解探索処理装置に係り、大規模な制約プログラミングにより制約充足解を探索する問題において、最適解を求める過程を解析するのに好適な解探索処理装置に関する。 The present invention relates to a solution search processing device, and more particularly to a solution search processing device suitable for analyzing a process for obtaining an optimal solution in a problem of searching for a constraint satisfaction solution by large-scale constraint programming.

制約プログラミングにより制約充足解を探索する問題の応用として、鉄道や資源配置、工場の生産計画などの産業分野における資源の管理や計画の業務を対象とする場合がある。 As an application of the problem of searching for a constraint satisfaction solution by constraint programming, there are cases in which resource management and planning operations in industrial fields such as railways, resource allocation, and factory production planning are targeted.

対象とする業務としては、例えば、鉄道運行管理業務では、通常時には予め定められた列車の運行計画（ダイヤ）に基づいて列車を走行させることが要求されているが、運行当日にダイヤ乱れが発生した場合には、列車運行に支障がないように計画を修正しなければならない。鉄道輸送に必要な計画は列車のダイヤのほかに、ダイヤ上の列車に車両の割り当て計画を定めた車両運用情報や、乗務員の割り当て計画を定めた乗務員運用情報がある。運行当日にダイヤ乱れが発生した場合には、ダイヤの修正に応じて車両運用情報や乗務員運用情報の修正が行われる。 For example, in the railway operation management business, it is required to run a train based on a predetermined train operation plan (diagram) at normal times. If this happens, the plan must be revised so that there is no hindrance to train operation. In addition to train schedules, plans necessary for rail transport include vehicle operation information that defines a vehicle allocation plan for trains on the diagram and crew operation information that defines a crew allocation plan. In the event that a timetable disruption occurs on the day of operation, the vehicle operation information and the crew operation information are corrected according to the timetable correction.

また、例えば、資源配置計画業務では、資源の入出荷によって日次で変動する資源の在庫量に応じて、容量制限のある資源の配置場所に資源を配置する日次計画を立案することが求められる。このとき、出荷のための資源の加工を決められた日時に決められた場所で行う制約や日当たりでの資源を移動させる手段の容量制限の制約など多くの制約を遵守しながら、なおかつ前日の計画とはなるべく変更をしないように日次計画を立案する必要がある。 In addition, for example, in the resource allocation planning business, it is required to develop a daily plan for allocating resources at the resource allocation location where capacity is limited according to the resource inventory that fluctuates daily due to the receipt and shipment of resources. It is done. At this time, while complying with many restrictions such as restrictions on the processing of resources for shipment at the specified date and time and restrictions on the capacity of the means for moving resources per day, the plan for the previous day However, it is necessary to make a daily plan so as not to change as much as possible.

上記のような計画作成業務では、大規模な制約充足問題の解を導出しなければならず、従来では熟練したオペレータが手動で行っていた。しかしながら、近年では熟練したオペレータの退職に伴い、上記の業務をシステムで代替しようというニーズが顕在化している。熟練者のオペレータの業務を代替するシステムにおいてはオペレータが立案した計画と同等に実用的な制約充足解を実用的な時間内に求解することが求められている。 In the planning work as described above, a solution to a large-scale constraint satisfaction problem has to be derived, and conventionally it has been performed manually by a skilled operator. However, in recent years, with the retirement of skilled operators, the need to replace the above work with a system has become apparent. In a system that substitutes the work of a skilled operator, it is required to find a practical constraint satisfaction solution within a practical time as much as a plan formulated by the operator.

これまでも、実用的な制約充足解の求解を容易化する技術は提案されている。例えば、特許文献１には、ユーザにより決定変数のドメイン変更などの新たな要求が追加されるたびに、過去の問題解決事例において採用された解と比較し、同一の解が採用された頻度に基づいて解候補の評価値を求め、固定化された制約と追加の要求を満たす中で最も評価値の高い解を出力する技術が記載されている。 Until now, a technique for facilitating the solution of a practical constraint satisfaction solution has been proposed. For example, in Patent Document 1, each time a new request such as a domain change of a decision variable is added by a user, the frequency is the frequency at which the same solution is adopted compared to the solution adopted in past problem solving cases. A technique is described in which an evaluation value of a solution candidate is obtained based on this, and a solution having the highest evaluation value among the fixed constraints and additional requirements is output.

また、非特許文献１には、ツリー探索を効率的に行うプログラミングパラダイムの一つとして、制約プログラミングという手法が開示されている。 Non-Patent Document 1 discloses a technique called constraint programming as one of programming paradigms for efficiently performing a tree search.

特開２００３−９９２５９号公報JP 2003-99259 A

Marc Vilain、Henry Kautz、Constraint Propagation Algorithms for Temporal Reasoning、Aaai、1986、pp377-382Marc Vilain, Henry Kautz, Constraint Propagation Algorithms for Temporal Reasoning, Aaai, 1986, pp377-382

特許文献１に記載の技術は、ユーザが入力した決定変数のドメインの変更に関する要求に対して、過去の問題解決事例において採用された解と比較し、適切な解を出力することができる。ここで、ドメインとは決定変数の取りうる値の範囲をいう。確かに、特許文献１に記載の技術は、小規模な制約充足問題で全探索が終了する場合には過去事例に基づいて望まれる解が出力されるため有効である。しかしながら、制約や決定変数の数が多い大規模な制約充足問題では問題設定の状況に応じて制約充足解の集合を探索すること自体が困難となる。大規模な制約充足問題で決定変数をノードとして決定変数の値をエッジとしたツリー探索を行う場合には全探索を実用的な時間内に行うことは不可能であり、実用時間内に探索できる探索ステップ数内で求解できるように、決定変数のドメイン変更だけではなく、制約式の変更に応じて、適切な探索ルールを設定する必要がある。 The technique described in Patent Literature 1 can output an appropriate solution by comparing the solution inputted in the past problem solving case with the request regarding the change of the domain of the decision variable input by the user. Here, the domain means a range of values that the decision variable can take. Certainly, the technique described in Patent Document 1 is effective because a desired solution is output based on past cases when the full search is completed due to a small-scale constraint satisfaction problem. However, in a large-scale constraint satisfaction problem with a large number of constraints and decision variables, it is difficult to search for a set of constraint satisfaction solutions depending on the problem setting situation. When performing a tree search with decision variables as nodes and decision variable values as edges in a large-scale constraint satisfaction problem, it is impossible to perform a full search within a practical time, and it is possible to search within a practical time. Appropriate search rules need to be set not only according to the domain change of the decision variable but also according to the change in the constraint expression so that the solution can be obtained within the number of search steps.

また、非特許文献１に記載されているような制約プログラミングでは、ある決定変数のドメインが制約式を介して他の決定変数のドメインを縮小させる影響を制約伝播という演算で特定する。制約伝播により、制約式を介した、互いの決定変数のドメインの影響を考慮し、不要な探索範囲を早めにカットすることにより、効率的に探索領域の絞込みを行う。しかしながら、制約プログラミングにおける解探索においても、探索木の深さ方向の効率化はなされているが、探索木の分岐のうち、どの分岐を優先的に探索するかなど幅方向の効率化はなされていない。そのため、制約プログラミングにおいても、制約式の変更に応じて、実用時間内に制約充足解がみつかるように幅方向にも動的な探索をすることが必要である。 Further, in constraint programming as described in Non-Patent Document 1, the influence that a domain of a certain decision variable reduces the domain of another decision variable via a constraint expression is specified by an operation called constraint propagation. By constraining propagation, the influence of domains of decision variables via the constraint equation is taken into account, and unnecessary search ranges are cut early, thereby efficiently narrowing search areas. However, even in solution search in constraint programming, efficiency in the depth direction of the search tree has been achieved, but efficiency in the width direction has been achieved, such as which branch is searched preferentially among the branches of the search tree. Absent. Therefore, also in constraint programming, it is necessary to perform a dynamic search in the width direction so that a constraint satisfaction solution can be found within a practical time according to the change of the constraint equation.

本発明の目的は、制約プログラミングによって解探索を実施する大規模な制約充足問題において、決定変数の解の決定に原因となった制約やその解の制約伝播の原因となった他の解の情報を明示的に出力することにより、制約充足解の求解の過程の解析、デバッグを容易し、もって求解を容易にする解探索処理装置を提供することにある。 It is an object of the present invention to provide information on constraints that cause determination of a decision variable solution and other solutions that cause constraint propagation of the solution in a large-scale constraint satisfaction problem in which a solution search is performed by constraint programming. It is an object of the present invention to provide a solution search processing device that facilitates the analysis and debugging of the process of finding the constraint satisfaction solution by explicitly outputting.

本発明の解探索処理装置の構成は、好ましくは、制約充足問題の求解処理を実行する解探索処理装置であって、過去の出力解を入力とし、解導出過程の各探索ステップで決定変数の値を選択する際に変化する選択可能なドメインを計算する探索ステップ別制約伝播状況抽出部と、制約伝播を実行する制約伝播実行部と、探索ステップ別制約伝播状況抽出部で計算した結果に基づいて、各ステップでの値選択前の決定変数のドメインを入力とし、各ステップで選択された値を教師データとして予測モデルを学習し、制約伝播実行部により選択した値に応じてドメインを更新して、学習を繰り返し実行する予測モデル学習部と、各探索ステップで値選択前の選択可能な決定変数のドメインを入力として、予測モデル学習部で学習した予測モデルで選択するべき決定変数の値を予測し、制約伝播実行部により選択した値に応じてドメインを更新して、決定変数の解の探索処理を繰り返し実行する値予測部とを有するようにしたものである。 The configuration of the solution search processing device of the present invention is preferably a solution search processing device that executes a solution processing of a constraint satisfaction problem, and receives a past output solution as an input, and determines a decision variable at each search step of the solution derivation process. Based on the results calculated by the constraint propagation status extraction unit by search step that calculates selectable domains that change when a value is selected, the constraint propagation execution unit that executes constraint propagation, and the constraint propagation status extraction unit by search step Then, the domain of the decision variable before value selection at each step is input, the prediction model is learned using the value selected at each step as teacher data, and the domain is updated according to the value selected by the constraint propagation execution unit. A prediction model learning unit that repeatedly performs learning, and a prediction model learned by the prediction model learning unit using the domain of selectable decision variables before value selection at each search step as input. A value predicting unit that predicts the value of the decision variable to be selected, updates the domain according to the value selected by the constraint propagation execution unit, and repeatedly executes the search processing for the solution of the decision variable is there.

また、上記解探索処理装置の構成において、さらに、予測モデル学習部で学習した予測モデルに基づき、原因説明の対象となる決定変数の状態を入力とし、特定した決定変数の値を予測した場合の原因となる選択可能なドメインの情報を出力する原因説明部を有するようにしたものである。 Further, in the configuration of the solution search processing device, based on the prediction model learned by the prediction model learning unit, the state of the decision variable to be the cause explanation target is input, and the value of the specified decision variable is predicted A cause explanation section for outputting information on selectable domains causing the cause is provided.

本発明によれば、制約プログラミングによって解探索を実施する大規模な制約充足問題において、決定変数の解の決定に原因となった制約やその解の制約伝播の原因となった他の解の情報を明示的に出力することにより、制約充足解の求解の過程の解析、デバッグを容易にし、もって求解を容易にする解探索処理装置を提供することができる。 According to the present invention, in a large-scale constraint satisfaction problem in which a solution search is performed by constraint programming, information on the constraint causing the determination of the solution of the decision variable and other solutions causing the constraint propagation of the solution By explicitly outputting, it is possible to provide a solution search processing device that facilitates the analysis and debugging of the solution process of the constraint satisfaction solution, thereby facilitating the solution.

実施形態１に係る解探索処理装置のハードウェア・ソフトウェア構成図である。2 is a hardware / software configuration diagram of a solution search processing device according to Embodiment 1. FIG. 実施形態１に係る解探索処理装置の処理の全体概要を示す図である。It is a figure which shows the whole process outline | summary of the solution search processing apparatus which concerns on Embodiment 1. FIG. 探索ステップ別制約伝播状況抽出部１２０の処理を示すフローチャートである。It is a flowchart which shows the process of the constraint propagation condition extraction part 120 according to search step. 過去の探索解の一例を示す図である。It is a figure which shows an example of the past search solution. ステップ別探索情報を示す図である。It is a figure which shows the search information according to step. 探索ステップ２での各決定変数の値選択状況を表す行列である。It is a matrix showing the value selection situation of each decision variable in search step 2. 探索ステップ２での各決定変数の選択可能なドメインを表す行列である。It is a matrix showing the selectable domain of each decision variable in search step 2. 予測モデル学習部の教師データの例を示す図である。It is a figure which shows the example of the teacher data of a prediction model learning part. 予測モデル学習部の処理を示すフローチャートである。It is a flowchart which shows the process of a prediction model learning part. ニューラルネットワークの概念を説明する図である。It is a figure explaining the concept of a neural network. 値予測部１２３と制約伝播実行部１２２の詳細な処理を示すフローチャートである。It is a flowchart which shows the detailed process of the value estimation part 123 and the constraint propagation execution part 122. 予測モデルとして入力される値選択状況を表す行列の例を示す図である。It is a figure which shows the example of the matrix showing the value selection condition input as a prediction model. 予測モデルとして入力される選択可能なドメイン状況を表す行列の例を示す図である。It is a figure which shows the example of the matrix showing the domain condition which can be selected input as a prediction model. 予測モデルから出力される決定変数を表す行列の例を示す図であるIt is a figure which shows the example of the matrix showing the decision variable output from a prediction model 実施形態１の原因説明部１２４の詳細な処理を示すフローチャートである。5 is a flowchart illustrating detailed processing of a cause explanation unit 124 according to the first embodiment. 予測モデルから出力される決定変数を表す行列の例を示す図である。It is a figure which shows the example of the matrix showing the decision variable output from a prediction model. 原因説明の対象とする値選択状況Ｓ_ｋを表す図である。It is a figure showing value selection situation _{Sk made} into the object of cause explanation. 図１６と図１７の行列を原因説明部１２４の入力として、出力される選択可能なドメイン状況Ｄ_ｋを表す図である。FIG. 18 is a diagram illustrating selectable domain situations _Dk that are output with the matrices of FIGS. 16 and 17 as inputs of the cause explanation unit 124. 原因説明の対象となっている決定変数の値を選択するために選択可能となっているべきドメインの表示を示す図である。It is a figure which shows the display of the domain which should be selectable in order to select the value of the decision variable used as the object of cause explanation. 実施形態２に係る解探索処理装置のハードウェア・ソフトウェア構成図である。FIG. 5 is a hardware / software configuration diagram of a solution search processing device according to a second embodiment. 実施形態２に係る解探索処理装置と処理の全体概要を示す図である。It is a figure which shows the solution search processing apparatus which concerns on Embodiment 2, and the whole outline | summary of a process. 制約伝播実行部１２２の詳細な処理を示すフローチャートである。5 is a flowchart showing detailed processing of a constraint propagation execution unit 122. 制約伝播実行結果１１３の例を示す図である。It is a figure which shows the example of the constraint propagation execution result. 実施形態２の原因説明部１２４の詳細な処理を示すフローチャートである。10 is a flowchart illustrating detailed processing of a cause explanation unit 124 according to the second embodiment. 原因説明部の出力画面の例を示す図である（その一）。It is a figure which shows the example of the output screen of a cause description part (the 1). 原因説明部の出力画面の例を示す図である（その二）。It is a figure which shows the example of the output screen of a cause description part (the 2). 原因説明部の出力画面の例を示す図である（その三）。It is a figure which shows the example of the output screen of a cause description part (the 3). 鉄道運行における原因推定結果の出力例を示す図である（その一）。It is a figure which shows the example of an output of the cause estimation result in railroad operation (the 1). 鉄道運行における原因推定結果の出力例を示す図である（その二）。It is a figure which shows the example of an output of the cause estimation result in railroad operation (the 2).

以下、本発明に係る各実施形態を、図１ないし図２９について説明する。 Hereinafter, each embodiment according to the present invention will be described with reference to FIGS.

〔実施形態１〕
以下、本発明の実施形態１を、図１ないし図１９を用いて説明する。 Embodiment 1
Hereinafter, Embodiment 1 of the present invention will be described with reference to FIGS.

先ず、図１を用いて実施形態１に係る解探索処理装置のハードウェア・ソフトウェア構成について説明する。
解探索処理装置は、鉄道における車両や乗務員の再割り当てや資源配置計画を自動で行うための装置であり、図１に示されるように、表示部１０１、入力部１０２、ＣＰＵ１０３、通信部１０４、記憶部１０７、メモリ１０５から構成される一般的な情報処理装置で実現される。解探索処理装置のハードウェアとして用いられる情報処理装置は、デスクトップコンピュータ、ラップトップコンピュータ、タブレットやサーバ装置であってもよい。また、解探索処理装置の情報処理装置は、ネットワーク１００を介して他の情報処理装置と通信可能である。 First, the hardware / software configuration of the solution search processing apparatus according to the first embodiment will be described with reference to FIG.
The solution search processing device is a device for automatically reallocating vehicles and crew members and resource allocation planning in a railway. As shown in FIG. 1, a display unit 101, an input unit 102, a CPU 103, a communication unit 104, This is realized by a general information processing apparatus including the storage unit 107 and the memory 105. The information processing device used as the hardware of the solution search processing device may be a desktop computer, a laptop computer, a tablet, or a server device. Further, the information processing apparatus of the solution search processing apparatus can communicate with other information processing apparatuses via the network 100.

記憶部１０７には、過去の探索情報１１０、ステップ別探索情報１１１、現在の探索情報１１２が格納されている。過去の探索情報１１０は、過去の解と制約式と決定変数のドメインの初期状態で構成される情報である。ステップ別探索情報１１１は、過去の解探索におけると各ステップの決定変数ごとの選択可能なドメインとそのステップで解導出のために選択した決定変数の値との関係を示す情報である。現在の探索情報１１２は、制約式と決定変数のドメインの初期状態で構成される情報であり、問題の解決する対象としている元データである。 The storage unit 107 stores past search information 110, step-by-step search information 111, and current search information 112. The past search information 110 is information including past solutions, constraint equations, and initial states of decision variable domains. The step-specific search information 111 is information indicating a relationship between a selectable domain for each decision variable in each solution search and a value of the decision variable selected for solution derivation at that step. The current search information 112 is information composed of the constraint equation and the initial state of the domain of the decision variable, and is the original data to be solved.

また、メモリ１０５には、解探索処理装置の各機能を実行するプログラム１０６が記憶されており、ＣＰＵ１０３に実行されることにより機能が実現される。解探索処理装置は、プログラム１０６を実行することにより、探索ステップ別制約伝播状況抽出部１２０、予測モデル学習部１２１、制約伝播実行部１２２、値予測部１２３、原因説明部１２４の各機能部の機能を実行する。 The memory 105 stores a program 106 for executing each function of the solution search processing device, and the function is realized by being executed by the CPU 103. By executing the program 106, the solution search processing device performs a search step-specific constraint propagation status extraction unit 120, a prediction model learning unit 121, a constraint propagation execution unit 122, a value prediction unit 123, and a cause explanation unit 124. Perform the function.

次に、図２を用いて解探索処理装置の処理の全体概要について説明する。 Next, an overall outline of the processing of the solution search processing device will be described with reference to FIG.

図２は、実施形態１に係る解探索処理装置の処理の全体概要を示す図である。
解探索処理装置においては、先ず、探索ステップ別制約伝播状況抽出部１２０において過去の探索情報１１０を受け取り、各探索ステップごとに選択した決定変数の値と決定変数の選択によって変化する選択可能なドメインとの関係を導出し、ステップ別探索情報１１１に保存する。ここで、決定変数とは、問題の対象として、その値を決定するべき変数である。また、ドメインとは、決定変数が取りうる値の範囲（定義域）である。 FIG. 2 is a diagram illustrating an overall outline of processing performed by the solution search processing apparatus according to the first embodiment.
In the solution search processing device, first, the search information 110 for each search step receives the past search information 110, and the value of the decision variable selected for each search step and the selectable domain that changes depending on the selection of the decision variable. And is stored in the step-by-step search information 111. Here, the decision variable is a variable whose value is to be determined as a target of the problem. A domain is a range (definition area) of values that a decision variable can take.

次に、予測モデル学習部１２１は、ステップ別探索情報１１１における選択可能なドメインを入力とし、選択した決定変数の値を出力とする教師データによって、ニューラルネットワークの予測モデルを学習する。 Next, the predictive model learning unit 121 learns a predictive model of the neural network by using teacher data having the selectable domain in the step-by-step search information 111 as an input and outputting the value of the selected decision variable as an output.

制約伝播実行部１２２は、各ステップで決定変数の値を選択するごとに制約を介して縮小する他の決定変数の選択可能なドメインの変化を制約伝播によって算出し、決定変数の値選択後の選択可能なドメインの情報を更新する。更新したドメインの情報は予測モデル学習部１２１にフィードバックされる。制約伝播とは、既に説明したように、ある決定変数の値が定まることによりドメインが変化し、制約式を介して他の決定変数のドメインを変化させる事象を算出する演算である。 The constraint propagation execution unit 122 calculates the change of selectable domains of other decision variables that are reduced through the constraint each time a value of the decision variable is selected in each step by constraint propagation. Update selectable domain information. The updated domain information is fed back to the prediction model learning unit 121. As described above, the constraint propagation is an operation for calculating an event in which the domain changes when the value of a certain decision variable is determined and the domain of another decision variable is changed via a constraint equation.

次に、値予測部１２３は、学習したモデルに基づいて、現在の探索の各ステップで選択可能なドメインを入力として選択するべき決定変数の値を予測し、出力する。その後、学習時での処理と同様に、制約伝播実行部１２２により、決定変数の値選択後の選択可能なドメインの情報を更新し、選択した決定変数の値の情報と更新した値選択後の選択可能なドメインの情報を値予測部１２３にフィードバックする。 Next, based on the learned model, the value predicting unit 123 predicts and outputs the value of the decision variable to be selected with the domain that can be selected at each step of the current search as an input. After that, as in the process at the time of learning, the constraint propagation execution unit 122 updates the information of the selectable domain after selecting the value of the decision variable, and the information of the value of the selected decision variable and the updated value after selection Information on selectable domains is fed back to the value prediction unit 123.

原因説明部１２４は、予測モデル学習部１２１で学習したニューラルネットワークの予測モデルを受け取り、値予測部で値の予測に影響する選択可能なドメインを推定するために、選択した決定変数の値をニューラルネットワークの出力部に設定し、設定した値を逆算させた場合に発生する選択可能なドメインの重みベクトルを計算することにより、決定変数の値選択に影響する選択可能なドメインを導出する。 The cause explanation unit 124 receives the neural network prediction model learned by the prediction model learning unit 121, and the value prediction unit estimates the value of the selected decision variable in order to estimate a selectable domain that affects the value prediction. A selectable domain that influences the selection of the value of the decision variable is derived by calculating a weight vector of the selectable domain that is generated when the set value is calculated backward in the network output unit.

次に、図３ないし図１９を用いて実施形態１に係る解探索処理装置の詳細な処理について説明する。 Next, detailed processing of the solution search processing device according to the first embodiment will be described with reference to FIGS. 3 to 19.

以下では、本実施形態の解探索処理装置の処理の詳細を、日次での資源配置計画を例にして説明する。日次での資源配置計画は下記のように設定されるものとする。 In the following, details of the processing of the solution search processing device of the present embodiment will be described by taking a daily resource allocation plan as an example. The daily resource allocation plan shall be set as follows.

資源Ａ、Ｂ、Ｃ、Ｄが存在していたときに、それぞれの資源の重量をａ、ｂ、ｃ、ｄとする。あるエリア１に資源Ａ、Ｂ、Ｃ、Ｄを配置する場合を１、配置しない場合を０とした場合に、資源Ａ、Ｂ、Ｃ、Ｄについてエリア１に資源を配置するか否かを示す決定変数ｙ_１ｈ（ｈ＝１，２，３，４）は、以下のものとする。
ｙ_１１∈{０，１}
ｙ_１２∈{０，１}
ｙ_１３∈{０，１}
ｙ_１４∈{０，１}
このとき、制約の代表例として、エリア１における容量制限ｚ_１に関する容量制約が以下のように定められるものとする。
ａｙ_１１＋ｂｙ_１２＋ｃｙ_１３＋ｄｙ_１４＜ｚ_１
他のエリア２、３、４についても、同様に資源を配置するか否かを示す決定変数ｙ_１ｈ、ｙ_２ｈ、ｙ_３ｈ、ｙ_４ｈにより管理する。 When resources A, B, C, and D exist, the weight of each resource is set to a, b, c, and d. Indicates whether to allocate resources in area 1 for resources A, B, C, and D when 1 is set for resources A, B, C, and D in area 1 and 0 is not set. The decision variable y _1h (h = 1, 2, 3, 4) is as follows.
y ₁₁ ∈ {0,1}
y ₁₂ ∈ {0,1}
y ₁₃ ∈ {0,1}
y ₁₄ ∈ {0,1}
At this time, as a typical example of the constraint, it is assumed that the capacity constraint regarding the capacity limit z ₁ in the area 1 is determined as follows.
ay ₁₁ + by ₁₂ + cy ₁₃ + dy ₁₄ <z ₁
The other areas 2, 3, and 4 are similarly managed by decision variables y _1h , y _2h , y _3h , and y _4h that indicate whether or not resources are to be allocated.

また、資源Ａ、Ｂ、Ｃ、Ｄについてどのエリアに配置するかを示す決定変数ｘ_ｈ（ｈ＝１，２，３，４）とし、以下の通り、エリア番号をドメインとする。
ｘ_１∈{１，２，３，４}
ｘ_２∈{１，２，３，４}
ｘ_３∈{１，２，３，４}
ｘ_４∈{１，２，３，４}
決定変数ｙ_１ｈとｘ_ｈについては、
ｙ_１ｈ＝１
ならば
ｘ_ｈ＝ｌ
という関係が成立するものとする。 Further, a decision variable x _h (h = 1, 2, 3, 4) indicating in which area the resources A, B, C, and D are arranged is set, and the area number is a domain as follows.
x ₁ ∈ {1, 2, 3, 4}
x ₂ ∈ {1, 2, 3, 4}
x ₃ ∈ {1, 2, 3, 4}
x ₄ ∈ {1, 2, 3, 4}
For the decision variables y _1h and x _h ,
y _1h = 1
Then x _h = l
This relationship shall be established.

以下では、このモデルによって各部の詳細な処理について説明する。 Hereinafter, detailed processing of each unit will be described using this model.

先ず、図３ないし図７を用いて探索ステップ別制約伝播状況抽出部１２０の処理と制約伝播実行部１２２の処理の詳細を説明する。
ステップ別探索情報１１１は、探索ステップｋでの各決定変数の値選択状況と選択可能なドメインで構成され、以下の行列で定義される。
Ｓ_ｋ：探索ステップｋでの各決定変数の値選択状況を表す行列
Ｄ_ｋ：探索ステップｋでの各決定変数の選択可能なドメインを表す行列
行列Ｓ_ｋ、Ｄ_ｋは、行が決定変数ｘ_ｈで、列がその決定変数ｘ_ｈのドメインを示している。Ｓ_ｋの探索ステップｋ＝０での各要素の初期値は０である。 First, the details of the processing of the search step-specific constraint propagation status extraction unit 120 and the processing of the constraint propagation execution unit 122 will be described with reference to FIGS.
The search information 111 by step is composed of the value selection status of each decision variable at the search step k and selectable domains, and is defined by the following matrix.
S _k : Matrix representing value selection status of each decision variable at search step k D _k : Matrix representing selectable domain of each decision variable at search step k Matrix S _k and D _k are rows whose decision variable x in _h, the column indicates the domain of the decision variables x _h. The initial value of each element in the search step k = 0 of S _k is 0.

Ｓ_ｋについては、探索ステップｋで
ｘ_ｈ＝ｌ
となっている決定変数ｘ_ｈの行のドメインｌの列に１を選択するものとする。 For S _k , x _h = l in search step k
It shall select one in the column of the row of domains l decision variable x _h which is a.

Ｄ_ｋについては、探索ステップｋ＝０での各要素の初期値は決定変数の初期状態で選択可能となっている決定変数ｘ_ｈのドメインｌを１とし、選択不能となっているものを０としたものである。探索ステップｋ≠０については、Ｓ_ｋの状態での他の決定変数からの制約伝播により、選択可能となった決定変数ｘ_ｈの行のドメインｌの列を１と更新し、選択不能となったものを０と更新する。 For D _k , the initial value of each element at the search step k = 0 is set to 1 in the domain l of the decision variable x _h that can be selected in the initial state of the decision variable, and 0 that is not selectable. It is what. For the search step k ≠ 0, the column of the domain l in the row of the decision variable x _h that has become selectable is updated to 1 due to constraint propagation from other decision variables in the state of S _k , and becomes unselectable. Update the value to 0.

探索ステップ２でのＳ_ｋ、Ｄ_ｋの例は、それぞれ図６、図７に示されるようになる。 Examples of S _k and D _{k in} the search step 2 are as shown in FIGS. 6 and 7, respectively.

図４は、過去の決定変数ｘ_ｈ（ｈ＝１，２，３，４）の一例を示したものである。そして、図４に示される（ｘ_１，ｘ_２，ｘ_３，ｘ_４）＝（２，２，４，３）の内で、（ｘ_１，ｘ_２）＝（２，２）まで選択した状況での決定変数の値選択状況を、図６に示されるＳ_２で、制約伝播により探索ステップｋ＝２で選択可能なドメインを、図７に示されるＤ_２で示したものである。 FIG. 4 shows an example of a past decision variable x _h (h = 1, 2, 3, 4). Then, (x ₁ , x ₂ ) = (2, 2) selected from (x ₁ , x ₂ , x ₃ , x ₄ ) = ( ₂ , ₂ , ₄ , ₃ ) shown in FIG. The decision variable value selection situation in the situation is indicated by S ₂ shown in FIG. 6, and the domain that can be selected in the search step k = 2 by constraint propagation is indicated by D ₂ shown in FIG.

以上を前提として、図３に示される処理を説明する。探索ステップ別制約伝播状況抽出部１２０は、先ず、処理で過去の探索情報１１０を取得し、Ｓ_ｋとＤ_ｋを初期化する（Ｓ１３０）。なお、過去の探索情報１１０は、過去の解と制約式と決定変数のドメインの初期状態で構成される情報である。この初期化においては、Ｓ_ｋについては、零行列に初期化するが、Ｄ_ｋについては決定変数のドメインの初期状態（すなわち、各決定変数が初期状態において選択可能なドメイン）を反映する。 Based on the above, the process shown in FIG. 3 will be described. First, the search step-specific constraint propagation status extraction unit 120 acquires past search information 110 by processing, and initializes S _k and D _k (S130). The past search information 110 is information including past solutions, constraint equations, and initial states of decision variable domains. In this initialization, S _k is initialized to a zero matrix, but D _k reflects the initial state of the domain of the decision variable (that is, the domain in which each decision variable can be selected in the initial state).

次に、取得した解のうち決定変数の値を一つ選択して、Ｓ_ｋを更新する（Ｓ１３１）。次に、更新したＳ_ｋの状態に基づいて、制約伝播実行部１２２により制約伝播を実施して、Ｄ_ｋを更新する（Ｓ１３２）。その後、最後の決定変数までＳ１３１、Ｓ１３２の処理を実施したかを判定し（Ｓ１３３）、実施していない決定変数がある場合（Ｓ１３３：ＮＯ）には、次の決定変数へと進み、Ｓ１３１、Ｓ１３２の処理を繰り返えし、最後の決定変数までＳ１３１、Ｓ１３２の処理を終了する（Ｓ１３３：ＹＥＳ）と、図５に示されるように、それぞれのステップ番号１４１について、値選択状況１４２に対応するＳ_ｋと選択可能なドメイン１４３に対応するＤ_ｋとを取得できる。 Next, one value of the decision variable is selected from the obtained solutions, and S _k is updated (S131). Next, based on the updated state of S _k , constraint propagation is performed by the constraint propagation execution unit 122 to update D _k (S 132). Thereafter, it is determined whether the processing of S131 and S132 has been performed up to the last decision variable (S133). If there is a decision variable that has not been implemented (S133: NO), the process proceeds to the next decision variable, and S131, When the process of S132 is repeated and the processes of S131 and S132 are ended up to the final decision variable (S133: YES), as shown in FIG. 5, each step number 141 corresponds to the value selection situation 142. S _k to be performed and D _k corresponding to the selectable domain 143 can be acquired.

次に、図８ないし図１０を用いて予測モデル学習部１２１の詳細な処理について説明する。 Next, detailed processing of the prediction model learning unit 121 will be described with reference to FIGS.

先ず、予測モデル学習部１２１は、探索ステップ別制約伝播状況抽出部１２０において作成した値選択状況Ｓ_ｋと選択可能なドメイン状況Ｄ_ｋをステップ別探索情報１１１から予測モデル学習の入力の教師データとして読み込む（Ｓ１７０）。また、値選択状況１６１のＳ_ｋ−１と選択可能なドメイン１６２のＤ_ｋ−１に対応させて、Ｓ_ｋで選択した決定変数の値を、値選択結果１６３として抽出し、予測モデル学習の出力の教師データとして設定する。そして、設定した教師データに基づいて予測モデルを機械学習によって学習する（Ｓ１７１）。機械学習モデルの例としては、上記のデータを取り扱うことができる畳み込みニューラルネットワークを用いる。学習した予測モデルについては値予測部１２３と原因説明部１２４に出力する。 First, the prediction model learning unit 121 uses the value selection situation S _k and the selectable domain situation D _k created by the constraint propagation situation extraction unit 120 by search step as teacher data for input of prediction model learning from the search information 111 by step. Read (S170). Further, in correspondence to the _{D k-1} of _{S k-1} and selectable domain 162 value selection status 161, the value of the decision variable selected in _{S k,} and extracted as the value selection result 163, the predictive model learning Set as output teacher data. Then, the prediction model is learned by machine learning based on the set teacher data (S171). As an example of the machine learning model, a convolutional neural network that can handle the above data is used. The learned prediction model is output to the value prediction unit 123 and the cause explanation unit 124.

ここで、畳み込みニューラルネットワークとは、中間層としての複数の畳み込み層（フィルタ部）を有するニューラルネットワークである。フィルタ部ではフィルタ係数（重みベクトル）が設定される。畳み込み層は、入力されたデータにフィルタリングを行う。以上の処理で、畳み込みニューラルネットワークは、画像データなどの２次元のデータを入力し、中間層によりフィルタ係数に基づくフィルタ処理を行って、出力層により識別の判定結果や認識結果などを出力する。 Here, the convolutional neural network is a neural network having a plurality of convolutional layers (filter units) as intermediate layers. In the filter unit, a filter coefficient (weight vector) is set. The convolution layer performs filtering on the input data. With the above processing, the convolutional neural network inputs two-dimensional data such as image data, performs filtering processing based on the filter coefficient by the intermediate layer, and outputs the identification determination result, the recognition result, and the like by the output layer.

本実施形態でのニューラルネットワークにおいては、図１０に示されるように、各ノードでの値選択状況Ｓ_ｋ、選択可能なドメイン状況Ｄ_ｋの入力、決定変数として出力されるＶ_ｋを教師データとして、ノードにおけるｗｅｉｇｈｔを学習する。ここで、Ｓ_ｋのｗｅｉｇｈｔを表す行列をＷ_ｋ、Ｄ_ｋのｗｅｉｇｈｔを表す行列をＷ′_ｋとすると、以下の式が成立する。
Ｖ_ｋ＝Ｓ_ｋＷ_ｋ＋Ｄ_ｋＷ′_ｋ
次に、図１１を用いて値予測部１２３と制約伝播実行部１２２の詳細な処理について説明する。 In the neural network according to the present embodiment, as shown in FIG. 10, the value selection status S _k at each node, the selectable domain status D _k , and V _k output as a decision variable are used as teacher data. , Learns the weight at the node. Here, when the matrix representing the weight of S _k is W _k and the matrix representing the weight of D _k is W ′ _k , the following equation is established.
_{_{_{_{V k = S k W k +}}}} D k W 'k
Next, detailed processing of the value prediction unit 123 and the constraint propagation execution unit 122 will be described with reference to FIG.

値予測部１２３は、先ず、現在の探索情報１１２を取得し、Ｓ_ｋとＤ_ｋを初期化する（Ｓ１１８）。なお、現在の探索情報１１２は制約式と決定変数のドメインの初期状態で構成される情報である。この初期化については、Ｓ_ｋについては零行列として初期化されており、Ｄ_ｋについては決定変数のドメインの初期状態を反映するものとする。次に、Ｓ_ｋとＤ_ｋとを入力として予測モデル学習部１２１で作成した予測モデルの示すｗｅｉｇｈｔによって選択するべき決定変数の値を予測する（Ｓ１８１）。 First, the value predicting unit 123 acquires the current search information 112 and initializes S _k and D _k (S118). Note that the current search information 112 is information including the constraint equation and the initial state of the domain of the decision variable. As for this initialization, S _k is initialized as a zero matrix, and D _k is assumed to reflect the initial state of the domain of the decision variable. Next, the value of the decision variable to be selected is predicted by the weight indicated by the prediction model created by the prediction model learning unit 121 using S _k and D _k as inputs (S181).

次に、Ｓ１８１で予測した決定変数の値を選択し、Ｓ_ｋ＋１を更新する（Ｓ１８２）。次に、更新したＳ_ｋ＋１に基づいて、制約伝播実行部１２２により制約伝播を実行し、Ｄ_ｋ＋１を更新する（Ｓ１８３）。次に、最後の決定変数まで値を設定できたか否かを判定し（Ｓ１８４）、最後の決定変数まで値を設定していないときには（（Ｓ１８４：ＮＯ）、処理で次の決定変数に移行し（Ｓ１８５）、以下、Ｓ１８１〜Ｓ１８３を繰り返す。 Next, the value of the decision variable predicted in S181 is selected, and S _{k + 1} is updated (S182). Next, based on the updated S _{k + 1} , the constraint propagation execution unit 122 executes constraint propagation and updates D _{k + 1} (S183). Next, it is determined whether or not the value has been set up to the last decision variable (S184), and when the value has not been set up to the last decision variable (S184: NO), the process proceeds to the next decision variable. (S185) Hereinafter, S181 to S183 are repeated.

次に、図１２ないし図１４により、探索ステップｋ＝１での、Ｓ１８１の処理の入出力結果の一例を説明する。 Next, an example of the input / output result of the process of S181 at the search step k = 1 will be described with reference to FIGS.

ここで、学習した予測モデルを用いて予測する際に、予測モデルの入力は図１２のＤ_ｋ、図１３のＳ_ｋで、出力は、図１４のＶ_ｋである。Ｖ_ｋは選択するべき決定変数の値を示す予測モデルの出力となる行列である。予測モデルの入力が図１２のＤ_ｋ、図１３のＳ_ｋで与えられたとき、学習した予測モデルで予測される選択するべき決定変数の値を示す出力部の行列Ｖ_ｋは、図１４のように、Ｄ_ｋとＳ_ｋ同様に行が決定変数、列がドメインのデータテーブルで示すことができる。Ｖ_ｋのうち、行列の要素の値が高い順に決定変数とその値を選択し、探索過程でバックトラックなどが生じて、別の決定変数の値を選択する必要がある場合には、出力部の行列の要素の値が次に高い値を選択する。図１４に示される例では、探索ステップｋ＝２においては、Ｖ_ｋの値のうち、最も高い値は、行ｘ_２、列１の要素ある０．７の値なので、ｘ_２の値として、ドメイン１を優先的に選択する。 Here, when the prediction using the learned prediction model, D _k of the input of the predictive model _12, with S _k in FIG. 13, the output is V _k in FIG. V _k is a matrix serving as an output of the prediction model indicating the value of the decision variable to be selected. When the input of the prediction model is given by D _k in FIG. 12 and S _k in FIG. 13, the matrix V _{k of the} output unit indicating the value of the decision variable to be selected predicted by the learned prediction model is shown in FIG. Thus, like D _k and S _k , the row can be represented by the decision variable and the column can be represented by the domain data table. When a decision variable and its value are selected in descending order of the values of the matrix elements in V _k and a backtracking or the like occurs in the search process, it is necessary to select another decision variable value. The next highest element value of the matrix is selected. In the example shown in FIG. 14, in the search step k = 2, of the value of _{V k,} highest value, row _{x 2,} since the value of the element in 0.7 column 1, as the value of _{x 2,} Domain 1 is preferentially selected.

次に、図１５ないし図１９を用いて原因説明部１２４の詳細な処理について説明する。 Next, detailed processing of the cause explanation unit 124 will be described with reference to FIGS. 15 to 19.

原因説明部１２４は、出力解を取得し、取得した出力解について、原因説明の対象とする決定変数の値を選択し、決定変数を表す行列Ｖ_ｋを、予測モデルの出力として設定する（Ｓ１９０）。 The cause explanation unit 124 obtains an output solution, selects a value of a decision variable to be a cause explanation target for the obtained output solution, and sets a matrix V _k representing the decision variable as an output of the prediction model (S190). ).

次に、Ｓ１９０の処理で選択した決定変数の値を探索した際の値選択状況をＳ_ｋから取得し、予測モデルの入力として設定する（Ｓ１９１）。 Then, the value selection status at the time of searching the value of the selected decision variable in the processing of S190 obtained from the S _k, is set as the input of the predictive model (S191).

次に、Ｓ１９２の処理では、決定変数を表す行列Ｖ_ｋの値を逆算させて、選択可能なドメイン状況Ｄ_ｋの値を算出する。最後に、Ｓ１９３の処理では、算出した選択可能なドメイン状況Ｄ_ｋの値を比較し、Ｄ_ｋが高い値の要素の列を、値選択に関係する選択可能なドメインとして抽出し、値選択の原因として出力する。 Next, in the processing of S192, the value of the matrix V _k representing the decision variable is calculated backward to calculate the value of the selectable domain situation D _k . Finally, in the process of S193, the value of the calculated selectable domain situation _Dk is compared, and a column of elements having a high value of _Dk is extracted as a selectable domain related to the value selection. Output as the cause.

以下、図１６ないし図１９を用いて、原因説明部１２４の入出力の例について説明する。図１６、図１７の例では、原因説明部１２４において、ｘ_１に２が選択されているときに、ｘ_２に１が選択された原因説明を実施することを意味している。図１８の例は、図１６、図１７の行列を、原因説明部１２４の入力とし、予測モデルを逆算して選択可能なドメイン状況Ｄ_ｋを求めた場合の計算結果の例を示している。畳み込みニューラルネットの場合は、オートエンコーダー（G. E. Hinton、 R. R. Salakhutdinov、 Reducing the Dimensionality of Data with Neural Networks、 Science、 Vol313、2006、 pp504-506）で用いられているデコーディングの演算同様に、重みベクトルの転置行列を用いることにより、選択可能なドメイン状況Ｄ_ｋを逆算することができる。選択可能なドメイン状況Ｄ_ｋの各要素の値で０より大きな値となっているものは、原因説明の対象となっている決定変数の値を予測するために選択可能となっているべきドメインである。このドメインを原因説明の対象となっている決定変数の値を選択した原因として出力する。 Hereinafter, input / output examples of the cause explanation unit 124 will be described with reference to FIGS. 16 to 19. In the examples of FIGS. 16 and 17, when the cause explanation unit 124 selects 2 for x ₁ , it means that explanation of the cause with 1 selected for x ₂ is performed. The example of FIG. 18 shows an example of a calculation result when the matrix of FIGS. 16 and 17 is used as the input of the cause explanation unit 124, and a selectable domain situation _Dk is obtained by back-calculating the prediction model. In the case of a convolutional neural network, as with the decoding operation used in the auto encoder (GE Hinton, RR Salakhutdinov, Reducing the Dimensionality of Data with Neural Networks, Science, Vol313, 2006, pp504-506), By using the transposed matrix, the selectable domain situation D _k can be calculated back. The value of each element of the selectable domain situation _Dk that is greater than 0 is a domain that should be selectable in order to predict the value of the decision variable that is the subject of the cause explanation. is there. This domain is output as the cause of selection of the value of the decision variable that is the subject of cause explanation.

図１９は、その出力画面例で塗りつぶされている部分が原因説明の対象となっている決定変数の値を選択するために選択可能となっているべきドメインである。ｘ_１に２が選択されているときに、ｘ_２に１が選択されるためには、図１８の結果によると、ｘ_４について１と３が選択可能となっている必要があり、ｘ_４について１と３が選択可能となっていることが値選択の原因となっていることがわかる。 FIG. 19 shows a domain that should be selectable in order to select the value of the decision variable whose cause is described in the part that is filled in the output screen example. When the x ₁ 2 is selected, for 1 is selected x _2, according to the results of FIG. 18, it is necessary for x ₄ 1 and 3 are selectable, x ₄ It can be seen that the selection of values 1 and 3 is the cause of value selection.

〔実施形態２〕
以下、本発明の実施形態２を、図２０ないし図２７を用いて説明する。 [Embodiment 2]
Hereinafter, Embodiment 2 of the present invention will be described with reference to FIGS.

実施形態１では、原因説明部１２４において特定の決定変数の値を選択した原因となる選択可能なドメインを抽出して提示したが、本実施形態の解探索処理装置では、原因説明部の出力例として、選択可能なドメインだけではなく、選択可能なドメインに対して制約伝播によって変更を与えた制約および決定変数とその値選択状況を示すようにする。 In the first embodiment, the cause explanation unit 124 extracts and presents a selectable domain that causes the selection of the value of a specific decision variable. However, in the solution search processing device of this embodiment, an output example of the cause explanation unit In addition to the selectable domains, the constraints and decision variables that have been changed by constraint propagation for the selectable domains and the value selection status thereof are shown.

先ず、図２０および図２１を用いて、実施形態２に係る解探索処理装置のハードウェア・ソフトウェア構成と処理の全体概要について説明する。 First, the hardware / software configuration of the solution search processing apparatus according to the second embodiment and the overall outline of the processing will be described with reference to FIGS. 20 and 21. FIG.

実施形態２に係る解探索処理装置では、図２０に示されるように、実施形態１の図１に示す解探索処理装置の記憶部１０７に格納される情報に加えて、制約伝播実行結果１１３が格納される。 In the solution search processing device according to the second embodiment, as shown in FIG. 20, in addition to the information stored in the storage unit 107 of the solution search processing device shown in FIG. Stored.

図２１に示されるように、制約伝播実行結果１１３には制約伝播実行部１２２から受け取った制約伝播の実行結果が格納され、原因説明部１２４に出力される。 As shown in FIG. 21, the constraint propagation execution result 113 stores the constraint propagation execution result received from the constraint propagation execution unit 122 and outputs the result to the cause explanation unit 124.

次に、図２２ないし図２７を用いて実施形態２に係る解探索処理装置の詳細な処理について説明する。 Next, detailed processing of the solution search processing device according to the second embodiment will be described with reference to FIGS. 22 to 27.

先ず、図２２および図２３を用いて制約伝播実行部１２２の詳細な処理の詳細について説明する。
制約伝播実行部１２２は、先ず、選択する決定変数の値を値予測部１２３から受け取り、制約伝播を実行する（Ｓ２１０）。制約伝播実行部１２２は、次に、制約伝播実行結果１１３として、探索ステップごとに選択した決定変数の値を図２３に示す選択状況２０１の欄に、選択可能なドメインを選択可能なドメイン２０３の欄に、変更を与えた制約を制約２０２の欄に、選択可能なドメインが変化した決定変数を選択可能なドメインが変化した決定変数２０４の欄にそれぞれ記憶する（Ｓ２１１）。 First, details of the detailed processing of the constraint propagation execution unit 122 will be described with reference to FIGS. 22 and 23.
The constraint propagation execution unit 122 first receives the value of the decision variable to be selected from the value prediction unit 123, and executes constraint propagation (S210). Next, the constraint propagation execution unit 122 sets the value of the decision variable selected for each search step as the constraint propagation execution result 113 in the column of the selection status 201 shown in FIG. In the column, the changed constraint is stored in the column of constraint 202, and the decision variable whose selectable domain is changed is stored in the column of decision variable 204 whose selectable domain is changed (S211).

例えば、図２３に示した制約伝播実行結果１１３の例では、値選択状況２０１の欄のｘ_１＝２の決定変数の値選択については、対応する選択可能なドメイン２０３の欄に、Ｄ_１が記憶されており、選択可能なドメインが変化した決定変数２０４の欄には、ｘ_２、ｘ_４記憶されている。また、選択可能なドメインが変化した決定変数２０４に対応する制約伝播を引き起こした制約２０２の欄については、制約Ａが記憶されている。日次での資源配置計画問題では、この制約２０２の欄に記憶される制約Ａは、容量制限に関する制約などを意味する。 For example, in the example of the constraint propagation execution result 113 shown in FIG. 23, regarding the value selection of the decision variable of x ₁ = 2 in the column of the value selection status 201, D ₁ is added to the column of the corresponding selectable domain 203. X ₂ and x _{4 are} stored in the column of the decision variable 204 that has been stored and the selectable domain has changed. Further, the constraint A is stored in the column of the constraint 202 that caused the constraint propagation corresponding to the decision variable 204 whose domain that can be selected has changed. In the daily resource allocation planning problem, the constraint A stored in the column of the constraint 202 means a constraint related to capacity limitation.

なお、値選択状況２０１と制約２０２との対応は１対多対応、制約２０２と選択可能なドメインが変化した決定変数との対応は１対多対応となりえる。すなわち、あるステップで、一つの値を選択した場合でも、関連する制約、選択可能なドメインが変化した決定変数の数は複数あるうるということである。 Note that the correspondence between the value selection status 201 and the constraint 202 can be a one-to-many correspondence, and the correspondence between the constraint 202 and a decision variable whose selectable domain has changed can be a one-to-many correspondence. That is, even if one value is selected in a certain step, there may be a plurality of decision variables whose associated constraints and selectable domains have changed.

次に、図２４を用いて実施形態２の原因説明部１２４の詳細な処理について説明する。 Next, detailed processing of the cause explanation unit 124 of the second embodiment will be described with reference to FIG.

本実施形態の説明では、主に実施形態１の図１５との相違点について説明する。
原因説明部１２４は、Ｓ１９２の処理の後に、算出した予測モデルの入力部のＤ_ｋの値から、値選択の原因となっている選択可能なドメインに対応する決定変数を抽出する（Ｓ１９４）。次に、原因説明部１２４は、抽出した決定変数の選択可能なドメインが変化した探索ステップを抽出し（Ｓ１９５）、制約伝播実行結果１１３により選択可能なドメインが変化した決定変数２０４を抽出し、抽出した選択可能なドメインが変化した決定変数に対応する制約２０２欄に記載されている制約と値選択状況２０１に記載されている値選択状況を参照して、選択可能なドメインが変化した原因として出力する（Ｓ１９６）。 In the description of the present embodiment, differences from FIG. 15 of the first embodiment will be mainly described.
After the process of S192, the cause explanation unit 124 extracts a decision variable corresponding to the selectable domain that causes the value selection from the value of _{Dk in} the input unit of the calculated prediction model (S194). Next, the cause explanation unit 124 extracts a search step in which the selectable domain of the extracted decision variable has changed (S195), extracts the decision variable 204 in which the selectable domain has changed according to the constraint propagation execution result 113, By referring to the constraint described in the constraint 202 column corresponding to the decision variable in which the extracted selectable domain has changed and the value selection status described in the value selection status 201, the reason why the selectable domain has changed is as follows. It outputs (S196).

次に、図２５ないし図２７を用いて実施形態２の原因説明部１２４の詳細な処理を、図２３に示す制約伝播実行結果１１３の例について適用した場合の具体例を説明する。 Next, a specific example when the detailed processing of the cause explanation unit 124 of the second embodiment is applied to the example of the constraint propagation execution result 113 shown in FIG. 23 will be described with reference to FIGS.

例えば、実施形態１の図１８のようにｘ_３＝４を予測した原因として、ｘ_４について１と３が選択可能なドメインであったことが推定されたとする。この場合、原因となった選択可能なドメインに対応する決定変数はｘ_４である。ここで、図２３に示される制約伝播実行結果よりｘ_４の選択可能なドメインが変化した制約伝播は、ｘ_３＝４以前のものであり、ｘ_１＝２を選択した場合の制約Ａによるものであると推定できる。 For example, suppose that it is estimated that 1 and 3 were selectable domains for x ₄ as a cause of predicting x ₃ = 4 as shown in FIG. In this case, the decision variable corresponding to a selectable domain that caused is x _4. Here, the constraint propagation in which the selectable domain of x ₄ has changed from the constraint propagation execution result shown in FIG. 23 is before x ₃ = 4, and is due to constraint A when x ₁ = 2 is selected. It can be estimated that.

図２５ないし図２７は、そのときの原因推定結果の画面（図１の解探索処理装置の表示部１０１）への出力例を示したものである。図２５の例では、原因推定の対象となる決定変数とその値がｘ_３＝４の場合、原因となる制約が図２３の制約２０２の欄に記載の制約Ａ、原因となる決定変数の選択がｘ_１＝２となることをテーブル形式で表示している。 25 to 27 show examples of output to the screen of the cause estimation result at that time (display unit 101 of the solution search processing device in FIG. 1). In the example of FIG. 25, when the decision variable to be the cause of the cause estimation and its value is x ₃ = 4, the cause constraint is the constraint A described in the column of the constraint 202 in FIG. X ₁ = 2 is displayed in a table format.

図２６の例では、図２５に表示されている情報を、決定変数の選択をノードで表現したツリー表記で表したものであり、原因推定対象となるノードおよび原因となる決定変数の選択に相当するノードを黒塗りで表記している。原因となる決定変数の選択に相当するノードについては、選択可能な決定変数のドメインに影響を与えた制約伝播を行った制約名も表記する。なお、ツリーの規模が大規模になる場合は、関係のないノードの表記は省略してもよい。 In the example of FIG. 26, the information displayed in FIG. 25 is represented by a tree notation in which selection of a decision variable is represented by a node, which corresponds to selection of a node that is a cause estimation target and a decision variable that is a cause. The nodes to be printed are shown in black. For the node corresponding to the selection of the causal decision variable, the name of the constraint that propagated the constraint that affected the domain of the selectable decision variable is also described. In addition, when the scale of a tree becomes large, the notation of unrelated nodes may be omitted.

また、原因推定結果の出力例として、図２７のように原因推定対象と原因の関係のみに焦点をあてたロジックツリーで表示してもよい。このロジックツリーでは、矢印の根元が原因となる決定変数の選択であり、矢印の先が原因推定対象であることを示している。 Further, as an output example of the cause estimation result, as shown in FIG. 27, a logic tree focusing only on the relationship between the cause estimation target and the cause may be displayed. In this logic tree, the root of the arrow is the selection of the decision variable causing the cause, and the tip of the arrow indicates the cause estimation target.

〔その他の実施形態〕
以上、本発明の各実施形態について具体的に説明したが、本発明はこれに限定されるものではなく、その要旨を逸脱しない範囲で種々修正可能である。 [Other Embodiments]
As mentioned above, although each embodiment of this invention was described concretely, this invention is not limited to this, In the range which does not deviate from the summary, various corrections are possible.

実施形態２では、原因推定結果の出力例として，図２５のような探索ツリー、図２７のような原因推定対象と原因の関係のみに焦点をあてたロジックツリーで表示する例を示したが、鉄道運行管理業務や資源配置計画業務などリソース割当業務の作業画面に応じて、原因となる決定変数や制約の表示を各業務のリソースやリソースの運用制約で表示してもよい。 In the second embodiment, as an output example of the cause estimation result, an example is shown in which a search tree as shown in FIG. 25 and a logic tree focusing only on the relationship between the cause estimation object and the cause as shown in FIG. 27 are displayed. Depending on the work screen of the resource allocation task such as the railway operation management task and the resource allocation planning task, the cause variable and the constraint display may be displayed as the resource and resource operation constraint of each task.

ここで、図２８および図２９を用いて鉄道運行における原因推定結果の出力例を説明する。 Here, an output example of the cause estimation result in the railway operation will be described with reference to FIGS. 28 and 29.

鉄道運行における鉄道用語においては、「車両」とは、鉄道車両のハードウェアであり、「列車」とは、ダイヤグラム上の走行（ある駅を何時何分に出発して、他の駅に何時何分に到着）を含む概念である。ここで、列車を決定変数、車両をドメインとする解探索処理問題を考える。制約は、駅の場所、運用時刻である。このときの、鉄道運行における原因推定結果の出力例としては、図２８に示すように、テーブル形式の原因、結果の因果関係を対比して示してもよいし、図２９に示すように、ダイヤグラムの形式で表示してもよい。 In railway terms in railway operation, “vehicle” refers to the hardware of a railway vehicle, and “train” refers to traveling on the diagram (departing from one station at what time and what time to another station) It is a concept that includes (arrival in minutes). Here, consider a solution search processing problem in which a train is a decision variable and a vehicle is a domain. The restrictions are station location and operation time. As an output example of the cause estimation result in the railway operation at this time, as shown in FIG. 28, the cause of the table format and the causal relationship of the result may be shown in comparison, or as shown in FIG. It may be displayed in the form of

上記実施形態では、機械学習を行う予測モデルに畳み込みニューラルネットワークを用いたが、決定変数の値選択状況Ｓ_ｋと選択可能なドメインＤ_ｋを入力として設定可能で、選択するべき決定変数の候補を予測するものであれば、ナイーブベイズ分類器など、その他のモデルであってもよい。 In the above embodiment, a convolutional neural network is used as a prediction model for machine learning. However, a decision variable value selection situation S _k and a selectable domain D _k can be set as inputs, and decision variable candidates to be selected are selected. Other models, such as a naive Bayes classifier, may be used as long as they predict.

また、上記実施形態では、機械学習を行う予測モデルの入力として、決定変数の値選択状況Ｓ_ｋと選択可能なドメインＤ_ｋを想定したが、他にも、初期状態の選択可能なドメインＤ₀を導入するなど、選択可能なドメインの形態について種々のバリエーションが考えられる。 Further, in the above embodiment, the decision variable value selection status S _k and the selectable domain D _k are assumed as the input of the prediction model for performing machine learning. However, the selectable domain D _{0 in the} initial state is also available. Various variations are possible for the form of selectable domains, such as introducing.

上記実施形態では、選択可能なドメインＤ_ｋについては、ツリー探索の過程でたどってもよい分岐の数を反映させるために、探索ステップｋ≠０の場合、Ｓ_ｋの状態での他の決定変数からの制約伝播により、選択可能となった決定変数ｘ_ｈの行のドメインｌの列を１と更新し、選択不能となったものを０と更新するとしたが、この例に限定する必要はなく、例えば、他の決定変数からの制約伝播という限定を解除して、Ｓ_ｋの状態での自己を含めた決定変数からの制約伝播により、選択可能となった決定変数ｘ_ｈの行のドメインｌの列を１と更新し、選択不能となったものを０と更新するとしてもよいし、探索済みの決定変数の値については選択可能だとしても、０ないし−１などの値で表記してもよい。 In the above embodiment, for the selectable domain D _k , in order to reflect the number of branches that may be followed in the course of the tree search, if the search step k ≠ 0, other decision variables in the state of S _k Although the domain l column in the row of the decision variable x _h that can be selected is updated to 1 and the one that cannot be selected is updated to 0 by the constraint propagation from the above, it is not necessary to limit to this example. For example, the domain l of the row of the decision variable x _h that can be selected by the constraint propagation from the decision variable including the self in the state of S _k by removing the restriction of constraint propagation from another decision variable It may be possible to update the column of 1 to 1 and to update the value that has become unselectable to 0, and even if the value of the searched decision variable is selectable, it is expressed by a value such as 0 to -1. Also good.

上記実施形態では，解が算出された場合に原因説明部１２４を利用する実施形態を示したが，解が算出されない場合で，途中まで解探索が進んでいる場合にも原因説明部１２４をデバッガと活用し，解探索途中の段階で選択されている決定変数の値についても原因説明を行ってもよい。 In the above embodiment, the cause explanation unit 124 is used when the solution is calculated. However, even when the solution is not calculated and the solution search progresses halfway, the cause explanation unit 124 is used as the debugger. The cause of the value of the decision variable selected during the solution search may be explained.

〔まとめ〕
以上の各実施形態から明らかなように、本発明によれば、制約プログラミングによって解探索を実施する大規模な制約充足問題の中で決定変数だけでなく制約式の変更が与える影響を把握して、動的に選択するべき決定変数の値を変更し、実用上望まれる制約充足解の求解を容易化するように、決定変数の原因となる制約や他の決定変数の選択など明示的に示すことにより、デバッグの手段と問題解決の示唆を与え、問題の解決者に問題解決を容易化するための情報を明示的に表示するツールを提供することができる。 [Summary]
As is clear from the above embodiments, according to the present invention, in the large-scale constraint satisfaction problem in which solution search is performed by constraint programming, the influence of not only the decision variable but also the change of the constraint equation is grasped. Explicitly indicate the constraints that cause the decision variable and other decision variables so that the decision variable value to be selected dynamically can be changed to facilitate the solution of the constraint satisfaction solution desired in practice. Accordingly, it is possible to provide a tool for explicitly displaying information for facilitating problem solving to a problem solver by giving means for debugging and problem solving suggestions.

１００…ネットワーク
１０１…表示部
１０２…入力部
１０３…ＣＰＵ
１０４…通信部
１０５…メモリ
１０６…プログラム
１０７…記憶部
１１０…過去の探索情報
１１１…ステップ別探索情報
１１２…現在の探索情報
１１３…制約伝播実行結果
１２０…探索ステップ別制約伝播状況抽出部
１２１…予測モデル学習部
１２２…制約伝播実行部
１２３…値予測部
１２４…原因説明部 DESCRIPTION OF SYMBOLS 100 ... Network 101 ... Display part 102 ... Input part 103 ... CPU
104 ... Communication unit 105 ... Memory 106 ... Program 107 ... Storage unit 110 ... Past search information 111 ... Step-specific search information 112 ... Current search information 113 ... Constraint propagation execution result 120 ... Search step-specific constraint propagation status extraction unit 121 ... Prediction model learning unit 122 ... constraint propagation execution unit 123 ... value prediction unit 124 ... cause explanation unit

Claims

A solution search processing device that executes a solution process for a constraint satisfaction problem,
Constraint propagation state extraction unit for each search step that takes a past output solution as an input and calculates a selectable domain that changes when selecting the value of the decision variable in each search step of the solution derivation process;
Based on the results calculated by the constraint propagation execution unit that executes constraint propagation and the constraint propagation status extraction unit by search step, the value of the decision variable before the value selection at each step is input, and the value selected at each step A prediction model learning unit that learns the prediction model as teacher data, updates the domain according to the value selected by the constraint propagation execution unit, and repeatedly executes the learning,
The domain of selectable decision variables before value selection in each search step is used as an input, the value of the decision variable to be selected in the prediction model learned in the prediction model learning unit is predicted, and the value selected by the constraint propagation execution unit And a value prediction unit that updates the domain in response and repeatedly executes the search process for the solution of the decision variable.

Furthermore, based on the prediction model learned by the prediction model learning unit, the state of the decision variable that is the target of the cause explanation is input, and information on selectable domains that cause the value of the specified decision variable is predicted. The solution search processing apparatus according to claim 1, further comprising a cause explanation unit for outputting.

The predictive model learning unit and the value predicting unit perform machine learning based on a machine learning model that predicts a candidate of a decision variable to be selected using a value selection situation of a decision variable and a selectable domain as an input. The solution search processing device according to claim 1.

4. The solution search processing device according to claim 3, wherein the machine learning model is a convolutional neural network.

The cause explanation unit presents values and constraints of other decision variables that cause the value of the specific decision variable to be predicted by the value prediction unit for the value of the decision variable selected during the calculated solution or solution search The solution search processing device according to claim 1, wherein:

6. The value of the decision variable specified by the value prediction unit, the value of the other decision variable that causes the prediction, and the constraint are displayed on a tree in which selection of the decision variable is expressed by a node. The solution search processing device described.

The value of the decision variable identified by the value prediction unit, the value and constraint of the other decision variable that causes the prediction, and the cause that predicted the value of the decision variable and the result are displayed on the logic tree. The solution search processing device according to claim 5, wherein:

6. The solution search processing device according to claim 5, wherein a cause and a result are displayed on a diagram with a vehicle as a decision variable and a train as a domain.