JP2019087273A

JP2019087273A - Optimization apparatus and optimization apparatus control method

Info

Publication number: JP2019087273A
Application number: JP2019001947A
Authority: JP
Inventors: ▲高▼津　求; 求 ▲高▼津; Motomu Takatsu
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-01-09
Filing date: 2019-01-09
Publication date: 2019-06-06

Abstract

To reduce computation time without deteriorating convergence properties.SOLUTION: When a transition controller 20 probabilistically determines whether to accept any one of a plurality of state transitions with reference to a relative relationship between energy change value {-ΔE} and thermal excitation energy, based on temperature value T, energy change value {-ΔE}, and random number value, the transition controller 20 adds an offset value y to the energy change value {-ΔE} and controls the offset value y in the local solution in which the energy is minimized so as to be larger than the case where the energy is not the minimum.SELECTED DRAWING: Figure 1

Description

本発明は、最適化装置及び最適化装置の制御方法に関する。 The present invention relates to an optimization device and a control method of the optimization device.

現在の社会ではあらゆる分野で情報処理が行われている。これらの情報処理はコンピュータ等の演算装置を用いて行われており、様々なデータを演算、加工し、意味のある結果を得ることにより、予測、決定、制御等が行われる。これらの情報処理の１つの分野として最適化処理があり重要な分野となっている。例えばある処理を行う場合に必要な資源やコストを最小化したり、その効果を最大化する解を求める問題等である。これらの問題が非常に重要であるのは明らかであろう。 In the present society, information processing is performed in every field. The information processing is performed using an arithmetic device such as a computer, and prediction, determination, control and the like are performed by computing and processing various data and obtaining meaningful results. Optimization processing is an important field as one of the fields of information processing. For example, there are problems such as minimizing resources and costs necessary for performing a certain process, and finding a solution that maximizes the effect. It will be clear that these issues are of great importance.

最適化問題の代表的なものとして線形計画問題がある。これは複数の連続変数の線形和で表される評価関数を、線形和で表される制約条件の下で最大化または最小化する変数の値を求めるものであり、製品の生産計画等様々な分野で利用されている。この線形計画問題には単体法や内点法といった優れた解法が知られており、何十万以上の変数を持つ問題でも効率的に解くことができる。 The linear programming problem is a representative of optimization problems. This is to find the value of a variable that maximizes or minimizes an evaluation function represented by a linear sum of a plurality of continuous variables under a constraint represented by a linear sum. It is used in the field. This linear programming problem is known to have excellent solution methods such as simplex method and interior point method, and can efficiently solve problems with hundreds of thousands of variables.

一方最適化問題には、変数が連続値では無く離散的な値を取るものも多く知られている。例えば、複数の都市を順番に回り元に戻るときの最短経路を求める巡回セールスマン問題や、ナップザックに異なる品物を詰めるときその価値の和が最大となるような組み合わせを求めるナップザック問題等が挙げられる。このような問題は、離散最適化問題、組合せ最適化問題等と呼ばれ、最適解を得るのが非常に難しいことが知られている。 On the other hand, many optimization problems are known in which variables take discrete values instead of continuous values. For example, there may be a traveling salesman problem for finding the shortest route when going back through multiple cities in order, and a knapsack problem for finding a combination that maximizes the sum of the values when packing different items into knapsack etc. . Such problems are called discrete optimization problems, combinatorial optimization problems, etc., and it is known that it is very difficult to obtain an optimal solution.

離散最適化問題を解くのが難しい最大の原因は、各変数が離散値しか取れないため、評価関数が改善される方向に変数を連続的に変化させることで最適解に到達させるという手法が使えないことである。そして本来の最適値を与える変数の値（最適解、大域解）以外に、局所的に評価関数の極値を与える値（極小（大）解、局所解）が非常に多数存在することである。このため最適解を確実に得るにはしらみつぶしのような方法を取らざるを得ず、計算時間が非常に長くなる。離散最適化問題には計算量理論でＮＰ（Non-deterministic Polynomial）困難問題と呼ばれる、最適解を求めるための計算時間が問題の大きさ（すなわち変数の数）に対して指数的に増加すると予想される問題が多い。上記巡回セールスマン問題やナップザック問題もＮＰ困難問題である。 The biggest cause of difficulty in solving discrete optimization problems is that each variable can only take discrete values, so you can use the method of changing the variables continuously in the direction in which the evaluation function is improved to reach the optimal solution There is no such thing. And there are very many values (minimum (large) solutions, local solutions) which locally give extrema of the evaluation function in addition to the values of the variables which give the original optimum value (optimum solution, global solution) . For this reason, it is necessary to take a method like shredding to obtain an optimal solution surely, and the calculation time becomes very long. For discrete optimization problems, which is called NP (Non-deterministic Polynomial) hard problem in computational complexity theory, it is expected that the calculation time for finding the optimal solution will increase exponentially with the size of the problem (ie the number of variables) There are many problems. The traveling salesman problem and the knapsack problem are also NP difficult problems.

以上述べたように、離散最適化問題の最適解を確実に求めることは非常に困難である。このため実用上重要な離散最適化問題にはその問題に固有な性質を利用した解法が考え出されている。上記のように多くの離散最適化問題では厳密解を得るには指数関数的に増大する計算時間がかかると予想されているため、実用的な解法の多くは近似解法であり、最適解ではないものの評価関数の値が最適値に近い値となる解を得ることができるものである。 As described above, it is very difficult to reliably obtain the optimal solution of the discrete optimization problem. Therefore, for discrete optimization problems that are important for practical use, solutions using properties inherent to the problems have been devised. As mentioned above, many discrete optimization problems are expected to take exponentially increasing computation time to obtain an exact solution, so many practical solutions are approximate solutions and not optimal solutions. It is possible to obtain a solution in which the value of the evaluation function of something is close to the optimum value.

これらの問題に特化した近似解法に対して、問題の性質を用いることなく解くため広範囲な問題を扱える近似解法も知られている。これらはメタヒューリスティックな解法とよばれ、疑似焼き鈍し法（シミュレーテッド・アニーリング法、ＳＡ法）、遺伝的アルゴリズム、ニューラルネットワーク等が挙げられる。これらの方法は、問題の性質をうまく利用した解法よりは効率が悪い可能性があるが、厳密解を得る解法よりは高速に解を得ることが期待できる。 With respect to approximate solutions specific to these problems, approximate solutions that can handle a wide range of problems are also known in order to solve without using the nature of the problems. These are called metaheuristic solutions, and include simulated annealing (simulated annealing, SA), genetic algorithms, neural networks, and the like. Although these methods may be less efficient than solutions that take advantage of the nature of the problem, they can be expected to obtain solutions faster than solutions that obtain exact solutions.

本発明はこのうち疑似焼き鈍し法に関するものである。
疑似焼き鈍し法はモンテカルロ法の一種であり、乱数値を用いて確率的に解を求める方法である。以下では最適化したい評価関数の値を最小化する問題を例に説明し、評価関数の値をエネルギーと呼ぶことにする。最大化の場合は、評価関数の符号を変えればよい。 Among the above, the present invention relates to a pseudo annealing method.
The pseudo-annealing method is a kind of Monte Carlo method, and is a method of stochastically obtaining a solution using random number values. In the following, the problem of minimizing the value of the evaluation function to be optimized will be described as an example, and the value of the evaluation function will be called energy. In the case of maximization, the sign of the evaluation function may be changed.

各変数に離散値の１つを代入した初期状態からはじめ、現在の状態（変数の値の組み合わせ）から、それに近い状態（例えば１つの変数だけ変化させた状態）を選び、その状態遷移を考える。その状態遷移に対するエネルギーの変化を計算し、その値に応じてその状態遷移を採択して状態を変化させるか、採択せずに元の状態を保つかを確率的に決める。エネルギーが下がる場合の採択確率をエネルギーが上がる場合より大きく選ぶと、平均的にはエネルギーが下がる方向に状態変化が起こり、時間の経過とともにより適切な状態へ状態遷移することが期待できる。そして最終的には最適解または最適値に近いエネルギーを与える近似解を得られる可能性がある。もし、これを決定論的にエネルギーが下がる場合に採択、上がる場合に不採択とすれば、エネルギーの変化は時間に対して広義単調減少となるが、局所解に到達したらそれ以上変化が起こらなくなってしまう。上記のように離散最適化問題には非常に多数の局所解が存在するために、状態が、ほとんど確実にあまり最適値に近くない局所解に捕まってしまう。したがって、採択するかどうかを確率的に決定することが重要である。 Starting from the initial state in which one of the discrete values is substituted for each variable, select a state close to that (for example, a state in which only one variable is changed) from the current state (combination of variable values) and consider the state transition . The energy change for the state transition is calculated, and the state transition is adopted according to the value to stochastically decide whether to change the state or to keep the original state without adopting it. If the adoption probability when energy falls is selected to be larger than when energy rises, a state change occurs in the direction of energy decrease on average, and it can be expected that a state transition to a more appropriate state will occur with the passage of time. Finally, it may be possible to obtain an optimal solution or an approximate solution giving an energy close to the optimal value. If this is adopted when energy drops deterministically, if it is rejected, energy change will be monotonically decreasing in a broad sense with respect to time, but no more changes will occur once the local solution is reached It will As described above, because there are a large number of local solutions in the discrete optimization problem, the state is almost certainly caught by local solutions that are not close to the optimal value. Therefore, it is important to determine probabilistically whether to adopt.

疑似焼き鈍し法においては、状態遷移の採択（許容）確率を次のように決めれば、時刻（反復回数）無限大の極限で状態が最適解に到達することが証明されている。
（１）状態遷移に伴うエネルギー変化（エネルギー減少）値（−ΔＥ）に対して、その状態遷移の許容確率ｐを次の何れかの関数ｆ（）により決める。 In the pseudo-annealing method, it has been proved that the state reaches the optimum solution at the limit of time (the number of iterations) infinity if the adoption (permissive) probability of the state transition is determined as follows.
(1) For the energy change (energy decrease) value (−ΔE) accompanying the state transition, the allowable probability p of the state transition is determined by any one of the following functions f ().

ここでＴは温度値と呼ばれるパラメータで次のように変化させる。
（２）温度値Ｔを次式で表されるように反復回数ｔに対数的に減少させる。 Here, T is a parameter called a temperature value and is changed as follows.
(2) The temperature value T is logarithmically reduced to the number of iterations t as expressed by the following equation.

ここでＴ_０は初期温度値であり問題に応じて十分大きくとることが望ましい。
（１）の式で表される許容確率を用いた場合、十分な反復後に定常状態に達したとすると、各状態の占有確率は熱力学における熱平衡状態に対するボルツマン分布にしたがう。そして、高い温度から徐々に下げていくとエネルギーの低い状態の占有確率が増加するため、十分温度が下がるとエネルギーの低い状態が得られるはずである。この様子が材料を焼き鈍したときの状態変化とよく似ているため、この方法は疑似焼き鈍し法と呼ばれるのである。このとき、エネルギーが上がる状態遷移が確率的に起こることは、物理学における熱励起に相当する。 Here, T ₀ is an initial temperature value, and it is desirable to take it sufficiently large according to the problem.
If the steady state is reached after sufficient iterations using the permissible probability expressed by the equation (1), the occupancy probability of each state follows the Boltzmann distribution for the thermal equilibrium state in thermodynamics. And since the occupancy probability of the low energy state increases when the temperature is gradually lowered from the high temperature, the low energy state should be obtained when the temperature is sufficiently lowered. This method is called pseudo-annealing because it is similar to the state change when the material is annealed. At this time, the stochastic occurrence of a state transition in which energy rises corresponds to thermal excitation in physics.

上記のように疑似焼き鈍し法では、反復回数を無限に取れば最適解が得られるが、現実には有限の反復回数で解を得る必要があるため、最適解を確実に求めることはできない。また上の式では温度の下がり方が非常にゆっくりであるため、有限時間では十分に温度が下がらない。したがって実際の疑似焼き鈍し法では対数的な温度変化ではなくより早く温度を下げることが多い。 As described above, in the pseudo-annealing method, an optimum solution can be obtained if the number of iterations is taken infinitely, but in reality it is necessary to obtain a solution with a limited number of iterations, so the optimum solution can not be determined reliably. In the above equation, the temperature drops very slowly, so the temperature does not drop sufficiently in a finite time. Therefore, the actual pseudo-annealing method often lowers the temperature faster rather than logarithmic temperature change.

図１３に疑似焼き鈍し法による最適化装置の概念的構成を示す。ただし、下記説明では、状態遷移の候補を複数発生させる場合についても述べているが、本来の基本的な疑似焼き鈍し法は遷移候補を１つずつ発生させるものである。 FIG. 13 shows a conceptual configuration of the optimization apparatus by the pseudo annealing method. However, in the following description, although the case where a plurality of state transition candidates are generated is described, the basic basic pseudo-annealing method generates one transition candidate one by one.

最適化装置１０には、まず現在の状態Ｓ（複数の状態変数の値）を保持する状態保持部１１がある。また、複数の状態変数の値の何れかが変化することによる現在の状態Ｓからの状態遷移が起こった場合の、各状態遷移のエネルギー変化値｛−ΔＥ_ｉ｝を計算するエネルギー計算部１２がある。そして、最適化装置１０には、温度値Ｔを制御する温度制御部１３、状態変化を制御するための遷移制御部１４がある。 The optimizing device 10 includes a state holding unit 11 which holds the current state S (values of a plurality of state variables). In addition, the energy calculating unit 12 calculates the energy change value {−ΔE _i } of each state transition when the state transition from the current state S occurs due to any of the values of the plurality of state variables changing. is there. The optimization apparatus 10 includes a temperature control unit 13 that controls the temperature value T, and a transition control unit 14 that controls a state change.

遷移制御部１４は、温度値Ｔとエネルギー変化値｛−ΔＥ_ｉ｝と乱数値とに基づいて、エネルギー変化値｛−ΔＥ_ｉ｝と熱励起エネルギーとの相対関係によって複数の状態遷移の何れかを受け入れるか否かを確率的に決定するものである。 The transition control unit 14 selects one of a plurality of state transitions according to the relative relationship between the energy change value {−ΔE _i } and the thermal excitation energy based on the temperature value T, the energy change value {−ΔE _i }, and the random value. It decides probabilistically whether to accept or not.

遷移制御部１４をさらに細分化すると、遷移制御部１４は、状態遷移の候補を発生する候補発生部１４ａ、各候補に対して、そのエネルギー変化値｛−ΔＥ_ｉ｝と温度値Ｔから状態遷移を許可するかどうかを確率的に決定するための可否判定部１４ｂを有する。さらに、可となった候補から採用される候補を決定する遷移決定部１４ｃ、及び、確率変数を発生させるための乱数発生部１４ｄを有する。 When the transition control unit 14 is further subdivided, the transition control unit 14 generates state transition candidates from the energy change value {−ΔE _i } and the temperature value T with respect to each candidate. And the possibility determination unit 14b for determining whether to permit or not. Furthermore, it has a transition determination unit 14c that determines a candidate to be adopted from the accepted candidates, and a random number generation unit 14d for generating a random variable.

一回の反復における動作は次のようなものである。まず、候補発生部１４ａは、状態保持部１１に保持された現在の状態Ｓから次の状態への状態遷移の候補（候補番号｛Ｎｉ｝）を１つまたは複数発生する。エネルギー計算部１２は、現在の状態Ｓと状態遷移の候補を用いて候補に挙げられた各状態遷移に対するエネルギー変化値｛−ΔＥ_ｉ｝を計算する。可否判定部１４ｂは、温度制御部１３で発生した温度値Ｔと乱数発生部１４ｄで生成した確率変数（乱数値）を用い、各状態遷移のエネルギー変化値｛−ΔＥ_ｉ｝に応じて、上記（１）の式の許容確率でその状態遷移を許容する。そして、可否判定部１４ｂは、各状態遷移の可否｛ｆｉ｝を出力する。許容された状態遷移が複数ある場合には、遷移決定部１４ｃは、乱数値を用いてランダムにそのうちの１つを選択する。そして、遷移決定部１４ｃは、選択した状態遷移の遷移番号Ｎと、遷移可否ｆを出力する。許容された状態遷移が存在した場合、採択された状態遷移に応じて状態保持部１１に記憶された状態変数の値が更新される。 The operation in one iteration is as follows. First, the candidate generating unit 14a generates one or more candidates (candidate numbers {Ni}) of state transition from the current state S held in the state holding unit 11 to the next state. The energy calculating unit 12 calculates an energy change value {−ΔE _i } for each state transition listed as a candidate using the current state S and the state transition candidate. The availability determination unit 14b uses the temperature value T generated by the temperature control unit 13 and the random variable (random number value) generated by the random number generation unit 14d, according to the energy change value {−ΔE _i } of each state transition. The state transition is permitted by the allowable probability of the equation (1). Then, the availability determination unit 14b outputs availability {fi} of each state transition. When there are a plurality of permitted state transitions, the transition determination unit 14 c randomly selects one of them using a random number value. Then, the transition determination unit 14 c outputs the transition number N of the selected state transition and the transition availability f. If there is an allowed state transition, the value of the state variable stored in the state holding unit 11 is updated according to the adopted state transition.

初期状態から始めて、温度制御部１３で温度値を下げながら上記反復を繰り返し、一定の反復回数に達したり、エネルギーが一定の値を下回る等の終了判定条件が満たされたとき、動作が終了する。最適化装置１０が出力する答えは終了時の状態である。ただし、実際には有限の反復回数では温度値が０にならないため、終了時においても状態の占有率はボルツマン分布等で表される分布を持っており、必ずしも最適値やよい解になっているとは限らない。したがって、反復の途中でこれまでに得られたエネルギーが最低の状態を保持し、最後にそれを出力するのが現実的な解法となる。 Starting from the initial state, the above-mentioned repetition is repeated while lowering the temperature value by the temperature control unit 13, and the operation is ended when an end determination condition such as reaching a certain number of repetitions or energy falls below a certain value is satisfied. . The answer that the optimization device 10 outputs is the state at the end. However, in reality, the temperature value does not become 0 with a limited number of iterations, and the occupancy rate of the state has a distribution represented by Boltzmann distribution etc. even at the end, and it must be an optimal value or a good solution. There is no limit. Therefore, it is a realistic solution to hold the lowest energy obtained so far during the iteration and finally output it.

図１４は候補を１つずつ発生させる通常の疑似焼き鈍し法における遷移制御部、特に可否判定部のために必要な演算部分の構成例の回路レベルのブロック図である。
遷移制御部１４は、乱数発生回路１４ｂ１、セレクタ１４ｂ２、ノイズテーブル１４ｂ３、乗算器１４ｂ４、比較器１４ｂ５を有する。 FIG. 14 is a circuit level block diagram of a configuration example of an operation portion required for a transition control unit, particularly an availability determination unit in a normal pseudo annealing method in which candidates are generated one by one.
The transition control unit 14 includes a random number generation circuit 14 b 1, a selector 14 b 2, a noise table 14 b 3, a multiplier 14 b 4, and a comparator 14 b 5.

セレクタ１４ｂ２は、各状態遷移の候補に対して計算されたエネルギー変化値｛−ΔＥ_ｉ｝のうち、乱数発生回路１４ｂ１が生成した乱数値である遷移番号Ｎに対応するものを選択して出力する。 The selector 14b2 selects and outputs the energy change value {−ΔE _i } calculated for each state transition candidate that corresponds to the transition number N which is a random number value generated by the random number generation circuit 14b1. .

ノイズテーブル１４ｂ３の機能については後述する。ノイズテーブル１４ｂ３として、例えば、ＲＡＭ（Random Access Memory）、フラッシュメモリ等のメモリを用いることができる。 The function of the noise table 14b3 will be described later. As the noise table 14b3, for example, a memory such as a random access memory (RAM) or a flash memory can be used.

乗算器１４ｂ４は、ノイズテーブル１４ｂ３が出力する値と、温度値Ｔとを乗算した積（前述した熱励起エネルギーに相当する）を出力する。
比較器１４ｂ５は、乗算器１４ｂ４が出力した乗算結果と、セレクタ１４ｂ２が選択したエネルギー変化値である−ΔＥとを比較した比較結果を遷移可否ｆとして出力する。 The multiplier 14b4 outputs a product (corresponding to the above-mentioned thermal excitation energy) obtained by multiplying the temperature value T by the value output from the noise table 14b3.
The comparator 14b5 outputs a comparison result obtained by comparing the multiplication result output from the multiplier 14b4 with -ΔE, which is the energy change value selected by the selector 14b2, as the transition possibility f.

図１４に示されている遷移制御部１４は、基本的に前述した機能をそのまま実装するものであるが、（１）の式で表される許容確率で状態遷移を許容するメカニズムについてはこれまで説明していないのでこれを補足する。 The transition control unit 14 shown in FIG. 14 basically implements the above-described function as it is, but the mechanism for permitting state transition with the allowance probability represented by the equation (1) has been described so far. I will supplement this as it is not explained.

許容確率ｐで１を、（１−ｐ）で０を出力する回路は、２つの入力Ａ，Ｂを持ち、Ａ＞Ｂのとき１を出力し、Ａ＜Ｂのとき０を出力する比較器の入力Ａに許容確率ｐを、入力Ｂに区間［０，１）の値をとる一様乱数を入力することで実現することができる。したがってこの比較器の入力Ａに、エネルギー変化値と温度値Ｔにより（１）の式を用いて計算される許容確率ｐの値を入力すれば、上記の機能を実現することができる。 A comparator that outputs 1 with tolerance probability p and 0 with (1-p) has two inputs A and B, outputs 1 when A> B, and outputs 0 when A <B The tolerance probability p can be realized by inputting into the input A and the uniform random number taking the value of the interval [0, 1) into the input B. Therefore, the above function can be realized by inputting into the input A of this comparator the value of the allowance probability p calculated using the equation (1) from the energy change value and the temperature value T.

すなわちｆを（１）の式で用いる関数、ｕを区間［０，１）の値をとる一様乱数とするとき、ｆ（ΔＥ／Ｔ）がｕより大きいとき１を出力する回路で、上記の機能を実現できる。 That is, a circuit that outputs 1 when f (ΔE / T) is larger than u, where f is a function used in the equation of (1) and u is a uniform random number taking values in the interval [0, 1), Can realize the function of

このままでもよいのであるが、次のような変形を行っても同じ機能が実現できる。２つの数に同じ単調増加関数を作用させても大小関係は変化しない。したがって比較器の２つの入力に同じ単調増加関数を作用させても出力は変わらない。この単調増加関数としてｆの逆関数ｆ^−１を採用すると、−ΔＥ／Ｔがｆ^−１（ｕ）より大きいとき１を出力する回路でよいことがわかる。さらに温度値Ｔが正であることから−ΔＥがＴｆ^−１（ｕ）より大きいとき１を出力する回路でよい。図１４中のノイズテーブル１４ｂ３はこの逆関数ｆ^−１（ｕ）を実現するための変換テーブルであり、区間［０，１）を離散化した入力に対して次の関数の値を出力するテーブルである。 This may be left as it is, but the same function can be realized even if the following modification is made. Even if the same monotonically increasing function is applied to two numbers, the magnitude relationship does not change. Therefore, the output does not change even if the same monotonically increasing function is applied to the two inputs of the comparator. When the inverse function f ⁻¹ of f is adopted as the monotonically increasing function, it is understood that a circuit that outputs 1 is acceptable when −ΔE / T is larger than f ⁻¹ (u). Furthermore, since the temperature value T is positive, it may be a circuit that outputs 1 when -ΔE is larger than Tf ^-1 (u). The noise table 14b3 in FIG. 14 is a conversion table for realizing the inverse function f ⁻¹ (u), and is a table for outputting the value of the following function to the input obtained by discretizing the interval [0, 1) It is.

遷移制御部１４には、判定結果等を保持するラッチやそのタイミングを発生するステートマシン等も存在するが、図１４では図示を簡単にするため省略されている。
図１５は、従来例における遷移制御部１４の動作フローである。動作フローは、１つの状態遷移を候補として選ぶステップ（Ｓ１）、その状態遷移に対するエネルギー変化値と温度値と乱数値の積の比較で状態遷移の可否を決定するステップ（Ｓ２）、状態遷移が可ならばその状態遷移を採用し、否ならば不採用とするステップ（Ｓ３）を有する。 The transition control unit 14 includes a latch for holding the determination result and the like, a state machine for generating the timing thereof, and the like, but these are omitted in FIG. 14 for the sake of simplicity.
FIG. 15 is an operation flow of the transition control unit 14 in the conventional example. The operation flow is a step of selecting one state transition as a candidate (S1), a step of determining availability of state transition by comparing an energy change value with respect to the state transition, a product of a temperature value and a random number value (S2). If yes, the state transition is adopted, and if not, there is a step (S3) of disapproving.

上記の説明からある程度想像できると思われるが、疑似焼き鈍し法は汎用的で非常に魅力的ではあるが、温度をゆっくり下げる必要があるため計算時間が比較的長くなってしまうという問題がある。さらにその温度の下げ方を問題に合わせて適切に調節することが難しいという問題もある。これは図１６を用いて次のように説明することができる。 From the above explanation, it can be imagined to some extent that the pseudo annealing method is general purpose and very attractive, but there is a problem that the calculation time becomes relatively long because the temperature needs to be lowered slowly. Furthermore, there is also a problem that it is difficult to properly adjust how to lower the temperature according to the problem. This can be described as follows using FIG.

初期値から最適解や近似解に至る状態遷移の経路には近似度の良くない局所解が多数存在する。これらの局所解から十分早く脱出するには、十分な熱励起が可能な高い温度が必要となる。しかし高い温度ではボルツマン分布におけるエネルギーの広がりが大きいため、最適解やエネルギーの低いよい近似解（以下ではよい解と呼ぶ）と、エネルギーの比較的高い近似度の悪い局所解（以下悪い解と呼ぶ）の占有確率の差が小さい。このため局所解を速く脱出できても行く先は多数ある悪い解に分散されてしまい、よい解にたどり着く確率は非常に小さい。よい解の占有確率を増やすには、悪い解とのエネルギー差に比べ、熱励起のエネルギーが十分に小さくなるような低温が必要である。しかしこの場合熱励起のエネルギーが小さいため、経路の途中のエネルギーの山を越えることができる確率が非常に低くなってしまい、状態変化がほとんど起こらない。したがって、ある程度山を越えることができ、占有確率に少し差のつけられる中間温度をゆっくりと経過させることで、徐々によい解の占有確率を増やしてゆく必要がある。もし温度の下げ方が遅すぎると有限時間ではあまり温度が下がらないため、最終的によい解の占有確率が上がらない。逆に速く下げすぎると、局所解を脱出する前に温度が下がってしまい、悪い解に捕まったままになってしまう。したがって温度が下がるほどその変化の割合を十分小さくし、その温度におけるボルツマン分布に近づくまで十分待たなければならない。 There are many local solutions with poor degree of approximation in the path of the state transition from the initial value to the optimal solution and the approximate solution. In order to escape from these local solutions quickly enough, a high temperature is needed to allow sufficient thermal excitation. However, since the spread of energy in the Boltzmann distribution is large at high temperatures, an optimal solution or a low-approximate good approximate solution (hereinafter referred to as a good solution) and a local solution with a relatively high approximation of energy (hereinafter referred to as a bad solution) The difference in occupancy probability of) is small. For this reason, even if the local solution can be escaped quickly, the destinations to go are dispersed to many bad solutions, and the probability of reaching a good solution is very small. In order to increase the occupancy probability of a good solution, it is necessary to have a low temperature such that the energy of thermal excitation is sufficiently smaller than the energy difference with a bad solution. However, in this case, since the energy of the thermal excitation is small, the probability of being able to cross the energy mountain in the middle of the path becomes very low, and almost no state change occurs. Therefore, it is necessary to gradually increase the occupancy probability of a good solution by slowly progressing an intermediate temperature which can go over a mountain to some extent and which is slightly different from the occupancy probability. If the method of lowering the temperature is too slow, the temperature does not decrease so much in finite time, so the probability of occupancy of a good solution eventually does not increase. On the other hand, if it is dropped too fast, the temperature drops before it escapes the local solution, and it remains trapped by the bad solution. Therefore, as the temperature decreases, the rate of change must be sufficiently reduced, and sufficient waiting must be made to approach the Boltzmann distribution at that temperature.

このように本来の疑似焼き鈍し法では、温度による熱励起だけで局所解からの脱出を図っているため、温度をゆっくり下げる必要があるとともに、それを問題に応じて適切に調節する必要があるという問題がある。 Thus, in the original pseudo-annealing method, it is necessary to slowly lower the temperature since it is intended to escape from the local solution only by thermal excitation by the temperature, and it is necessary to adjust it appropriately according to the problem. There's a problem.

この問題に対して局所解に捕まってしまう問題を温度の調節以外の方法により緩和することが考えられる。例えば、特許文献１，２は、温度の制御方式や評価関数を動的に変更することにより、特許文献３は、状態遷移先である近傍の発生方法を動的に変化させることにより、計算の初期には広範囲の検索、末期には狭い範囲で高精度の検索を行い、計算時間の短縮を図るものである。 It is possible to alleviate the problem of being caught by the local solution for this problem by methods other than temperature control. For example, Patent Documents 1 and 2 dynamically change the temperature control method and the evaluation function, and Patent Document 3 dynamically changes the generation method of the neighborhood which is the state transition destination. In the early stage, a wide range of search is performed, and in the final stage, a high-accuracy search is performed in a narrow range to reduce calculation time.

これらは、複数の関数を動的に取り換えたり、探索の進み方を把握するために統計を取る等の比較的複雑な演算が必要である。できればもっと簡便で汎用的な方法で計算時間の短縮を可能にすることが望ましい。 These require relatively complex operations such as dynamically replacing a plurality of functions and taking statistics to grasp how a search proceeds. If possible, it is desirable to make it possible to reduce the calculation time in a simpler and more general method.

特開平６−１９５０７号公報Japanese Patent Application Laid-Open No. 6-19507 特開平９−３４９５１号公報Unexamined-Japanese-Patent No. 9-34951 gazette 特開平１０−２９３７５６号公報Japanese Patent Application Laid-Open No. 10-293756

H.Zhu et.al., “A Boltzmann Machine with Non-rejective Move,”IEICE Trans. Fundamentals vol.E85-A, No.6. pp.1229-1235, June 2002H. Zhu et. Al., “A Boltzmann Machine with Non-rejective Move,” IEICE Trans. Fundamentals vol. E85-A, No. 6. pp. 1229-1235, June 2002

上記のように局所解の脱出に長い時間がかかってしまうことが疑似焼き鈍し法の計算時間が長くなる大きな要因である。したがって局所解の脱出を促進する方法があれば、計算時間を大幅に短縮することが可能になると期待される。しかし、ただ単に局所解から脱出させるだけでは必ずしも計算時間が早くなるとは限らない。上記のように悪い解が非常に多数存在するため、ある局所解からランダムに放り出したとしても、周りの悪い解にまた捕まってしまうだけである。単に脱出させるだけではなく、よりよい状態に状態遷移するように脱出させることが望ましい。 As described above, taking a long time to escape the local solution is a major factor that increases the calculation time of the pseudo annealing method. Therefore, if there is a method to promote the escape of the local solution, it is expected that the calculation time can be significantly shortened. However, simply escaping from the local solution does not always make the computation time faster. As described above, there are a large number of bad solutions, so even if they are thrown out randomly from a certain local solution, they will only get caught by the bad solutions around them. It is desirable not only to let it escape, but to let it transition to a better state.

よい方向に進むように脱出させるためのヒントは、上記の疑似焼き鈍し法の収束定理にある。この定理はメトロポリス法またはギブス法の状態遷移確率にしたがって状態遷移の可否を決定して行けばよい方向へ進むことを示している。 A hint for getting out in a better direction is in the convergence theory of pseudo annealing above. This theorem indicates that it proceeds in a direction that can be determined by determining whether or not state transition is possible according to the state transition probability of the Metropolis method or Gibbs method.

局所解では状態遷移の確率は非常に小さいため遷移候補の選択は何度も行われ、その後の状態遷移の分岐比はメトロポリス法またはギブス法の遷移確率に比例する。したがって、各状態遷移の許容確率の相対比を保ったままその絶対値を増大することができれば、各状態遷移の分岐比が保たれるため、収束性に悪影響を及ぼすことなく局所解での滞在時間を短縮することが可能となり、計算時間の短縮が可能となる。 In the local solution, since the probability of state transition is very small, selection of transition candidates is repeated many times, and the branching ratio of the state transition after that is proportional to the transition probability of the Metropolis method or Gibbs method. Therefore, if the absolute value can be increased while maintaining the relative ratio of the permissible probability of each state transition, the branching ratio of each state transition is maintained, and therefore the stay at the local solution does not adversely affect the convergence. The time can be shortened, and the calculation time can be shortened.

本発明が解決しようとする課題は、評価関数や状態遷移発生方法等を動的に変化させることなく、収束性を損なうことなく局所解からの脱出を促進するための手段を得ることであり、より具体的には、局所解における各状態遷移の許容確率の相対比を保ったままその絶対値を増大する手段を得ることである。 The problem to be solved by the present invention is to obtain means for promoting the escape from the local solution without changing the evaluation function and the state transition generation method, etc. dynamically and without losing the convergence. More specifically, it is to obtain means for increasing the absolute value while maintaining the relative ratio of the permissible probability of each state transition in the local solution.

非特許文献１はこのような手法の１つである。この文献には記述の誤りがあるものの適切な修正を行えば、上記課題を解決することができる。この手法では、全ての状態遷移に対する許容確率を計算し、その値と指数分布を持つ乱数値の比が最大となる許容確率を採用することで、元の許容確率に比例した割合で採択する状態遷移を選ぶことができる。この方法は、許容確率の相対比を保つとともに、元の状態に留まる確率を０にすることができるため、非常に有効である。しかし、許容確率の計算や乱数値の発生における演算量が大きいという問題がある。 Non-Patent Document 1 is one such method. The above-mentioned problems can be solved by appropriately correcting what is described in this document. In this method, the acceptance probability for all state transitions is calculated, and the adoption probability is adopted in proportion to the original allowance probability by adopting the allowance probability that the ratio of the value to the random value having the exponential distribution becomes maximum. You can choose the transition. This method is very effective because it can keep the probability of staying in the original state as well as maintaining the relative ratio of the allowance probability. However, there is a problem that the calculation amount in the calculation of the allowable probability and the generation of the random number is large.

本発明はより簡便な方法で同様の効果、すなわち、収束性を損なうことなく計算時間を短縮できるという効果を得ることを目的とする。 An object of the present invention is to obtain the same effect in a simpler method, that is, the effect that the calculation time can be shortened without losing the convergence.

１つの実施態様では、最適化装置は、エネルギーを表す評価関数に含まれる複数の状態変数の値をそれぞれ保持する状態保持部と、前記複数の状態変数の値の何れかが変化することに応じて状態遷移が起こる場合、前記エネルギーの変化値を複数の状態遷移のそれぞれに対して計算するエネルギー計算部と、温度を示す温度値を制御する温度制御部と、前記温度値と前記変化値と乱数値とに基づいて、前記変化値と熱励起エネルギーとの相対関係によって前記複数の状態遷移の何れかを受け入れるか否かを確率的に決定する際に、前記変化値にオフセット値を加えるとともに、前記エネルギーが極小となる局所解における前記オフセット値を、前記エネルギーが極小ではない場合と比較して大きくなるように制御する遷移制御部と、を有する。 In one embodiment, the optimization device receives a state holding unit that holds values of a plurality of state variables included in an evaluation function representing energy, and changes in any of the values of the plurality of state variables. When a state transition occurs, an energy calculation unit that calculates the change value of the energy for each of a plurality of state transitions, a temperature control unit that controls a temperature value indicating temperature, the temperature value, and the change value An offset value is added to the change value when it is determined probabilistically whether or not to accept any of the plurality of state transitions based on the relative relationship between the change value and the thermal excitation energy based on a random value and And a transition control unit configured to control the offset value in the local solution in which the energy is minimized so as to be larger than that in the case where the energy is not minimized.

また、１つの実施形態では、最適化装置の制御方法が提供される。 Also, in one embodiment, a method of controlling an optimization device is provided.

一つの側面では、本発明は、収束性を損なうことなく計算時間を短縮できる。 In one aspect, the present invention can reduce computation time without compromising convergence.

本発明における疑似焼き鈍し法の遷移制御部の構成例を示す図である。It is a figure which shows the structural example of the transition control part of the pseudo | simulation annealing method in this invention. 本発明における遷移制御部の動作フローを示す図である。It is a figure which shows the operation | movement flow of the transition control part in this invention. 第１の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。It is a figure which shows an example of a circuit structure of the transition control part in the optimization apparatus of 1st Embodiment. パルス信号の発生の状態遷移の一例を示す状態遷移図である。It is a state transition diagram which shows an example of the state transition of generation | occurrence | production of a pulse signal. パルス信号を発生する論理回路の真理値表の一例を示す図である。It is a figure which shows an example of the truth value table of the logic circuit which generate | occur | produces a pulse signal. パルス信号を発生するステートマシンの一例を示す図である。It is a figure which shows an example of the state machine which generate | occur | produces a pulse signal. 図３の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。It is a figure which shows an example of the software simulation result of the pseudo | simulation annealing method implement | achieved using the transition control part of FIG. 第２の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。It is a figure which shows an example of a circuit structure of the transition control part in the optimization apparatus of 2nd Embodiment. 図８の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。It is a figure which shows an example of the software-simulation result of the pseudo | simulation annealing method implement | achieved using the transition control part of FIG. 図８の遷移制御部を用いた最適化装置の一例を示す図である。It is a figure which shows an example of the optimization apparatus using the transition control part of FIG. 第３の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。It is a figure which shows an example of a circuit structure of the transition control part in the optimization apparatus of 3rd Embodiment. 図１１の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。It is a figure which shows an example of the software-simulation result of the pseudo | simulation annealing method implement | achieved using the transition control part of FIG. 疑似焼き鈍し法による最適化装置の概念的構成を示す図である。It is a figure which shows the notional structure of the optimization apparatus by a pseudo | simulation annealing method. 従来例における遷移制御部、特に可否判定部のために必要な演算部分の構成例の回路レベルのブロック図である。It is a block diagram of the circuit level of the example of composition of the operation part required for the transition control part in a prior art example, especially an availability judgment part. 従来例における遷移制御部の動作フローを示す図である。It is a figure which shows the operation | movement flow of the transition control part in a prior art example. 疑似乱数法における状態の占有確率の概念を示す図である。It is a figure which shows the concept of the occupancy probability of the state in a pseudorandom number method.

図１に本発明で提案する局所解からの脱出を促進する機能を備える疑似焼き鈍し法の遷移制御部の構成例を示す。図１４に示した遷移制御部１４と同じ要素については同一符号が付されている。 FIG. 1 shows an example of the configuration of the transition control unit of the pseudo annealing method provided with the function of promoting escape from the local solution proposed in the present invention. The same components as those of the transition control unit 14 shown in FIG. 14 are denoted by the same reference numerals.

図１に示されているように、遷移制御部２０は、図１３に示した可否判定部１４ｂの機能を実現する回路部分に追加された、オフセット加算回路２１とオフセット制御回路２２とを有する。その他の部分は図１４に示した遷移制御部１４と同じである。 As shown in FIG. 1, the transition control unit 20 has an offset addition circuit 21 and an offset control circuit 22 added to the circuit portion that realizes the function of the availability determination unit 14b shown in FIG. The other parts are the same as the transition control unit 14 shown in FIG.

オフセット加算回路２１は、状態遷移に伴うエネルギー変化値（−ΔＥ）にオフセット値ｙを加えるオフセット加算回路として機能する。図１の回路の例では、オフセット加算回路２１は、減算器２１ａである。このため、図１の例では、エネルギー変化値（−ΔＥ）にオフセット値ｙを加える代わりに、比較対象である温度値Ｔと乱数値の積Ｔｆ^−１（ｕ）（熱励起エネルギーに相当する）からオフセット値ｙを減ずる構成となっているがどちらでも同じである。 The offset addition circuit 21 functions as an offset addition circuit that adds the offset value y to the energy change value (−ΔE) accompanying the state transition. In the example of the circuit of FIG. 1, the offset addition circuit 21 is a subtractor 21a. Therefore, in the example of FIG. 1, instead of adding the offset value y to the energy change value (−ΔE), the product Tf ⁻¹ (u) (the thermal excitation energy corresponds to the product of the temperature value T to be compared and the random value). And the offset value y is reduced, but the same is true for either.

オフセット制御回路２２は、局所解（エネルギーが極小となる解）におけるオフセット値ｙを、局所解ではないときに比べて大きくなるように制御する。図１の例では、オフセット制御回路２２は、リセット端子Ｒを有する累算器２２ａである。累算器２２ａは、リセット端子Ｒに入力される遷移可否ｆが、状態遷移を許容することを示すとき（つまり状態遷移が生じるとき）には、オフセット値ｙを０にする。また、累算器２２ａは、入力端子と、クロック端子を有する。累算器２２ａは、遷移可否ｆが、状態遷移を許容しないことを示すとき（つまり状態遷移が生じないとき）には、クロック端子に図示しないパルス信号が入力される度に、オフセット値ｙに入力端子に入力されるオフセット増分値Δｙを加えていく。 The offset control circuit 22 controls the offset value y in the local solution (solution in which the energy is minimized) to be larger than that in the case where it is not the local solution. In the example of FIG. 1, the offset control circuit 22 is an accumulator 22a having a reset terminal R. The accumulator 22a sets the offset value y to 0 when the transition availability f input to the reset terminal R indicates that state transition is permitted (that is, when state transition occurs). The accumulator 22a also has an input terminal and a clock terminal. The accumulator 22a outputs an offset value y every time a pulse signal (not shown) is input to the clock terminal when the transition availability f indicates that the state transition is not permitted (that is, when the state transition does not occur). The offset increment value Δy input to the input terminal is added.

なお、図示しないパルス信号は、例えば、後述するステートマシンによって供給される。オフセット増分値Δｙは、例えば、図示しないレジスタに記憶されている。
このような遷移制御部２０は、セレクタ１４ｂ２により選択されたエネルギー変化値（−ΔＥ）に累算器２２ａに保持されているオフセット値ｙを加えた和である−ΔＥ＋ｙが温度値Ｔと乱数値の積Ｔｆ^−１（ｕ）よりも大きいときその状態遷移を許容する。 A pulse signal (not shown) is supplied by, for example, a state machine described later. The offset increment value Δy is stored, for example, in a register not shown.
Such a transition control unit 20 adds the offset value y held in the accumulator 22 a to the energy change value (−ΔE) selected by the selector 14 b 2 −ΔE + y is a temperature value T and a random number value The state transition is allowed when it is larger than the product Tf ⁻¹ (u) of

そして累算器２２ａは、オフセット値ｙを次のように変化する。もし許容された状態遷移が存在し状態遷移が生じたときは、累算器２２ａは、オフセット値ｙを、０にリセットする。もし許容された遷移が存在せず状態遷移が起こらなかったときは、累算器２２ａは、オフセット増分値Δｙだけオフセット値ｙを増加する。 Then, the accumulator 22a changes the offset value y as follows. If there is an allowed state transition and a state transition occurs, the accumulator 22a resets the offset value y to zero. If there is no permitted transition and no state transition occurs, the accumulator 22a increases the offset value y by the offset increment value Δy.

図２にこの状態遷移の可否判定のための動作フローをまとめる。
動作フローは、１つの状態遷移を候補として選ぶステップ（Ｓ１０）、その状態遷移に対するエネルギー変化値（−ΔＥ）とオフセット値ｙとの和と、温度値Ｔと乱数値の積の比較で状態遷移の可否を決定するステップ（Ｓ１１）を有する。さらに、動作フローは、状態遷移が可ならばその状態遷移を採用し、オフセット値ｙをクリアし、否ならば不採用とし、オフセット値ｙを増加するステップ（Ｓ１２）を有する。 FIG. 2 summarizes the operation flow for determining whether this state transition is possible.
The operation flow is a step of selecting one state transition as a candidate (S10), comparing the sum of the energy change value (-.DELTA.E) and the offset value y for the state transition, and comparing the product of the temperature value T and the random value Of determining whether or not to Furthermore, the operation flow has a step (S12) of adopting the state transition if the state transition is possible, clearing the offset value y, and disapproving the state value, if not, (S12).

このほかの動作は通常の疑似焼き鈍し法と同じでよい。
以下上記のようなオフセット加算回路２１とオフセット制御回路２２を有する遷移制御部２０による効果を説明する。 Other operations may be the same as the normal pseudo annealing method.
The effects of the transition control unit 20 having the offset addition circuit 21 and the offset control circuit 22 as described above will be described below.

現在の状態が局所解に捕まってなかなか脱出できない状態にあるとき、全ての状態遷移に対するエネルギー変化値は大きな正の値である。このときの各状態遷移に対する許容確率はメトロポリス法であってもギブス法であっても、以下の式４−１，４−２に示すように、ほぼ指数関数で表される。 When the current state is caught by the local solution and can not escape easily, the energy change value for all state transitions is a large positive value. The allowable probability for each state transition at this time is approximately expressed by an exponential function as shown in the following equations 4-1 and 4-2, regardless of whether it is the Metropolis method or the Gibbs method.

全ての状態遷移の可否判定において、エネルギー変化値｛−ΔＥ_ｉ｝にオフセット値ｙを加えて判定を行うとすると、全ての状態遷移の許容確率は以下の式５のようになり、全ての状態遷移の許容確率が同じ倍率ｅ^ｙ／Ｔで大きくなることがわかる。 Assuming that the offset value y is added to the energy change value {−ΔE _i } and the determination is performed in all the state transition determinations, the allowable probabilities of all the state transitions are as shown in Equation 5 below, and all states It can be seen that the transition probability of probability increases at the same scaling factor e ^{y / T.}

前述のように、全ての状態遷移の許容確率の相対比を保ったまま許容確率の絶対値を増大することができれば、その後の状態遷移の分岐比を変化させることなく、局所解での滞在時間を短縮することができる。そのため、オフセット値ｙを用いることで局所解からの脱出促進が期待できる。しかしこのオフセット値ｙを適切に制御しなければ、加速効果が十分ではなかったり、収束性を悪化させてしまったりする可能性がある。 As described above, if it is possible to increase the absolute value of the permissible probability while maintaining the relative ratio of the permissible probability of all the state transitions, the residence time in the local solution without changing the branching ratio of the subsequent state transitions Can be shortened. Therefore, by using the offset value y, promotion of escape from the local solution can be expected. However, if the offset value y is not properly controlled, the acceleration effect may not be sufficient or the convergence may be deteriorated.

まず、現在の状態が局所解でないときには、エネルギーの下がる状態遷移があるため、遷移確率は指数関数では近似できない。このためオフセット値ｙがあると分岐比を変えてしまう。このため局所解でないときは、オフセット値ｙは０であるか十分小さいことが望ましい。 First, when the current state is not a local solution, the transition probability can not be approximated by an exponential function because there is a state transition in which the energy decreases. Therefore, if there is an offset value y, the branching ratio is changed. For this reason, when not a local solution, it is desirable that the offset value y be 0 or sufficiently small.

また現在の状態が局所解であるときのオフセット値ｙが一定の値であると加速効果はあるものの必ずしも十分でない。状態遷移に伴うエネルギーの増加が大きいものばかりであるとオフセット値ｙを与えても遷移確率は非常に小さいままである。オフセット値ｙを与えてもなかなか局所解を脱出できない場合には、さらに大きなオフセット値ｙを用いることが望ましい。 If the offset value y when the current state is a local solution is a constant value, although there is an acceleration effect, it is not always sufficient. The transition probability remains very small even if the offset value y is given if the increase in energy accompanying the state transition is only large. If a local solution can not be easily escaped even if the offset value y is given, it is desirable to use a larger offset value y.

これを解決するため、図１のオフセット制御回路２２は、状態遷移が起こらないときオフセット値ｙを少しずつ増やし、状態遷移が起こった場合に、オフセット値ｙを０にリセットする構成となっている。 In order to solve this, the offset control circuit 22 of FIG. 1 is configured to gradually increase the offset value y when no state transition occurs, and reset the offset value y to 0 when a state transition occurs. .

状態が局所解に留まっていると次第にオフセット値ｙが大きくなるため、いつかは必ず脱出することができる。また、状態が局所解でないときは状態遷移に伴うリセットが頻繁に起こるためオフセット値ｙは０または小さい値であり、分岐比に大きな影響を及ぼさないようにすることが可能となる。 If the state remains in the local solution, the offset value y gradually increases, so it can always escape someday. In addition, when the state is not a local solution, the reset accompanying the state transition frequently occurs, so the offset value y is 0 or a small value, and it is possible to prevent the branch ratio from being greatly affected.

オフセット増分値Δｙも適切に選ぶことが望ましい。オフセット増分値Δｙを大きくした方が局所解から速く脱出できる。しかしあまり大きくすると、局所解でないときも必ずしも毎回状態遷移が起こるとは限らないためオフセット値ｙの影響を受ける可能性がある。また、局所解においても比較的エネルギーの増加が少なく許容確率が高くなるべき状態遷移が候補に挙がる前にオフセット値ｙが大きくなってしまい、分岐比が正しい値からずれてしまう可能性がある。分岐比に大きな影響を及ぼさないためには、局所解における平均滞在時間が局所解でないときの平均滞在時間の数倍程度になるようにするのがよいと思われる。 It is desirable to properly select the offset increment value Δy. If the offset increment value Δy is increased, the local solution can be quickly exited. However, if it is too large, the state transition does not necessarily occur every time even if it is not a local solution, and therefore it may be affected by the offset value y. In addition, even in the local solution, the offset value y may become large before a state transition that should increase the energy relatively little and have a high allowable probability as a candidate, and the branch ratio may deviate from the correct value. In order not to greatly affect the branching ratio, it seems to be preferable to make the average stay time in the local solution be several times the average stay time when it is not the local solution.

以上のことからオフセット増分値Δｙを適切に選べば、収束性に悪影響を及ぼすことなく局所解での滞在時間を短縮することが可能となり、最適化の計算時間の短縮が可能となることがわかる。 From the above, it can be understood that if the offset increment value Δy is appropriately selected, the residence time in the local solution can be shortened without adversely affecting the convergence, and the calculation time of the optimization can be shortened. .

この効果のソフトウェアシミュレーションによる検証については、以下に示す実施の形態とともに後述する。
ところで、本発明は上記のように疑似焼き鈍し法を実現する図１３の遷移制御部１４、さらにいえばその中でも可否判定部１４ｂに上記のような新たな機能ブロックを加えることにより、計算時間の短縮を図るものである。その他の部分には何ら変更を加えなくてよい。したがって、現在の状態に対して許されうる状態遷移の集合や、状態遷移に伴うエネルギーの変化を与える関数形やその計算方法等にはまったく依存せずに、本発明を適用することができる。したがって、これらの部分の具体的回路構成法については詳しく説明しない。 The verification by software simulation of this effect will be described later together with the embodiment shown below.
By the way, the present invention shortens the calculation time by adding the above-described new functional block to the transition control unit 14 of FIG. 13 which realizes the pseudo annealing method as described above, and more specifically, the availability determination unit 14b. The There is no need to change the other parts. Therefore, the present invention can be applied without any dependence on the set of state transitions that can be permitted for the current state, the function form giving the change of energy accompanying the state transition, the calculation method thereof, and the like. Therefore, the specific circuit configuration method of these parts will not be described in detail.

ただし、以下で最適化するエネルギーがイジングモデルで表される場合の疑似焼き鈍し法について、また、それとほとんど等価であるボルツマンマシンにおける最適化において、遷移候補の発生及び状態遷移に伴うエネルギーの変化の計算法について簡単に説明する。 However, with regard to the pseudo-annealing method in the case where the energy to be optimized is expressed by the Ising model below, and in the optimization in the Boltzmann machine which is almost equivalent to it, calculation of the change of energy accompanying transition occurrence and state transition I will explain the law briefly.

イジングモデルは、お互いに相互作用を行うＮ個のスピンからなる系を表すモデルであり、各スピンｓ_ｉは±１の２値をとる。系のエネルギーは、以下の式６で表される。 The Ising model is a model representing a system of N spins interacting with each other, and each spin s _i has a binary value of ± 1. The energy of the system is represented by the following equation 6.

式６において、Ｊ_ｉ，ｊは、スピンｓ_ｉとスピンｓ_ｊ間の相互作用係数を示し、ｈ_ｉは、系のバイアス値である外部磁場係数を示す。
現在の状態から次の状態への状態遷移の候補は、１つのスピンの反転であり、Ｎ通り存在する。したがって遷移候補としては反転する１つのスピン番号または複数のスピンの番号の集合を発生させればよい。 In Equation 6, J _{i, j} represents an interaction coefficient between spin s _i and spin s _j , and h _i represents an external magnetic field coefficient which is a bias value of the system.
Candidates for state transition from the current state to the next state are inversions of one spin, and there are N ways. Therefore, as a transition candidate, a set of spin numbers or a plurality of spin numbers to be inverted may be generated.

そしてｉ番目のスピン反転に伴うエネルギーの変化は、以下の式７で表される。 And the change of the energy accompanying the i-th spin inversion is represented by the following formula 7.

ここで、以下の式８のＦ_ｉは、ローカルフィールド（局所場）値と呼ばれ、各スピンの反転によるエネルギー変化の割合を表している。 Here, F _i in the following equation 8 is called a local field value, and represents the rate of energy change due to the inversion of each spin.

状態遷移を許容するかどうかはエネルギーの変化で決まるため、基本的にはエネルギーそのものを計算せずにローカルフィールド値からエネルギーの変化を計算すれば十分である。出力として得られた最低エネルギーに対する状態を用いる場合には、ローカルフィールド値からエネルギーの変化を計算しそれを累算してゆくことでエネルギーを求めることができる。 Since whether to allow state transition is determined by the change in energy, it is basically sufficient to calculate the change in energy from the local field value without calculating the energy itself. When using the state for the lowest energy obtained as the output, the energy can be determined by calculating the change of the energy from the local field value and accumulating it.

さらに、 further,

であるから、ローカルフィールド値を行列演算により毎回計算し直す必要はなく、状態遷移にともなって反転のあったスピンによる変化分だけ加算すればよい。
また、ニューラルネットワークに用いられるボルツマンマシンは、状態変数が（０，１）の２値をとることを除いてイジングモデルの疑似焼き鈍し法と同じである。このためほとんど同様の構成とすることができる。エネルギー、エネルギーの変化値、ローカルフィールド値を表す式は、以下の式１０、式１１、式１２のようになる。 Therefore, it is not necessary to recalculate the local field values by matrix operation every time, and it is sufficient to add only the change due to the spin that has been inverted along with the state transition.
Also, the Boltzmann machine used for the neural network is the same as the pseudo annealing method of the Ising model except that the state variable takes two values of (0, 1). Therefore, almost the same configuration can be made. The equations representing energy, energy change value, and local field value are as in the following Equation 10, Equation 11, and Equation 12.

なお、ボルツマンマシンではイジングモデルのスピンに相当するものをニューロンと呼ぶことが多いが簡単のため以下ではスピンと呼ぶ。
したがって、図１３に示した状態保持部１１は、Ｎ個のスピンの値を保持するＮビットレジスタと加算器、排他的論理和等の比較的簡単な演算回路を用いて構成することができる。 In Boltzmann machines, what is equivalent to the spin of Ising model is often called a neuron, but for simplicity it is called a spin below.
Therefore, the state holding unit 11 shown in FIG. 13 can be configured using an N-bit register holding values of N spins, an adder, and a relatively simple arithmetic circuit such as an exclusive OR.

上記のようにイジングモデルを用いた疑似焼き鈍し法とボルツマンマシンを用いた疑似焼き鈍し法は同等であり、お互いに相互変換できるので、以下では論理回路の０、１と対応の付けやすいボルツマンマシンを想定して説明を行う。 As described above, since the pseudo annealing method using the Ising model and the pseudo annealing method using the Boltzmann machine are equivalent and can be mutually converted, in the following, it is assumed that the Boltzmann machine which easily corresponds to 0 or 1 of the logic circuit. To explain.

なおボルツマンマシン（及びイジングモデルの疑似焼き鈍し）においては、状態遷移に伴い変化する状態変数は１つだけであり、それに対するエネルギー変化値はローカルフィールド値を用いて予め計算しておくことができる。したがって以下の実施の形態では予め計算しておいたエネルギー変化値を遷移候補の発生に応じて選択する形式の実装を例に説明している。しかしながら、ボルツマンマシンでないときは、複数の状態変数が変化する遷移を考える場合もあるため、遷移候補の発生後に必要なエネルギー変化値を計算するような実装が有利になる場合もある。 In the Boltzmann machine (and pseudo annealing of Ising model), there is only one state variable that changes with the state transition, and the energy change value for that can be calculated in advance using the local field value. Therefore, in the following embodiment, an implementation in which the energy change value calculated in advance is selected according to the generation of the transition candidate is described as an example. However, when it is not a Boltzmann machine, since there may be cases in which transitions in which a plurality of state variables change are considered, an implementation that calculates an energy change value necessary after the occurrence of a transition candidate may be advantageous.

以下、ボルツマンマシンを想定した最適化装置の３つの実施の形態を説明する。
（第１の実施の形態）
図３は、第１の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。図１に示した遷移制御部２０と同じ要素については同一符号が付されている。図３の遷移制御部２０ａは、基本的に図１の遷移制御部２０と同じであるが、図３では、累算器２２ａの構成について少し具体的に示されている。 Hereinafter, three embodiments of an optimization apparatus assuming a Boltzmann machine will be described.
First Embodiment
FIG. 3 is a diagram showing an example of a circuit configuration of the transition control unit in the optimization device of the first embodiment. The same elements as those of the transition control unit 20 shown in FIG. 1 are denoted by the same reference numerals. The transition control unit 20a of FIG. 3 is basically the same as the transition control unit 20 of FIG. 1, but in FIG. 3 the configuration of the accumulator 22a is shown a little more concretely.

累算器２２ａは、加算器２２ａ１、セレクタ２２ａ２、レジスタ２２ａ３を有する。
加算器２２ａ１は、オフセット増分値Δｙと、レジスタ２２ａ３が出力するオフセット値ｙとを加算した和を出力する。 The accumulator 22a includes an adder 22a1, a selector 22a2, and a register 22a3.
The adder 22a1 outputs a sum obtained by adding the offset increment value Δy and the offset value y output from the register 22a3.

セレクタ２２ａ２は、遷移可否ｆが状態遷移を許容することを示すとき、０を選択して出力し、遷移可否ｆが状態遷移を許容しないことを示すとき、加算器２２ａ１が出力する加算結果を選択して出力する。 The selector 22a2 selects and outputs 0 when the transition availability f indicates that the state transition is permitted, and selects the addition result output from the adder 22a1 when the transition availability f indicates that the state transition is not permitted. Output.

レジスタ２２ａ３は、クロック端子に供給されるパルス信号に同期して、セレクタ２２ａ２が出力する値を取り込み、オフセット値ｙとして出力する。
加算器２２ａ１とレジスタ２２ａ３のビット幅は、適切に設定される。ビット幅は、エネルギー変化値（−ΔＥ）のビット幅と同程度でよい。例えば相互作用係数のビット幅を１６、スピン数を１０２４とした場合、エネルギー変化値（−ΔＥ）は最大２７ビットとなるのでこのビット幅を用いれば十分である。実際にはこれより少なくても十分である場合がほとんどである。ノイズテーブル１４ｂ３の出力のビット幅もエネルギー変化値（−ΔＥ）のビット幅と同程度以下でよい。 The register 22a3 takes in the value output from the selector 22a2 in synchronization with the pulse signal supplied to the clock terminal, and outputs it as an offset value y.
The bit widths of the adder 22a1 and the register 22a3 are appropriately set. The bit width may be about the same as the bit width of the energy change value (−ΔE). For example, assuming that the bit width of the interaction coefficient is 16 and the number of spins is 1024, the energy change value (-.DELTA.E) is 27 bits at maximum, so it is sufficient to use this bit width. In practice, it is almost always sufficient if there is less than this. The bit width of the output of the noise table 14b3 may be equal to or less than the bit width of the energy change value (-.DELTA.E).

レジスタ２２ａ３のクロック端子に供給されるパルス信号は、回路動作における反復動作をコントロールするステートマシンより供給され、１回の反復における状態遷移の可否が確定した後に、一度だけアクティブになるように制御される。 The pulse signal supplied to the clock terminal of the register 22a3 is supplied from a state machine that controls repetitive operations in the circuit operation, and is controlled to become active only once after determining whether the state transition in one repetitive operation is possible. Ru.

可否判定とその後に続く各パラメータの更新に必要なクロック信号のサイクル数は可否判定結果に依存して変化するため、パルス信号もこのサイクル数に合うように発生させる必要がある。 Since the number of cycles of the clock signal necessary for the determination of availability and the subsequent updating of each parameter changes depending on the result of availability determination, it is necessary to generate a pulse signal so as to match the number of cycles.

以下では、状態の更新があった場合は５サイクル、なかった場合は１サイクルで次の反復に入る場合を例としてパルス信号の発生方法の説明を行う。
図４は、パルス信号の発生の状態遷移の一例を示す状態遷移図である。 In the following, a method of generating a pulse signal will be described by taking as an example the case of entering the next repetition in 5 cycles if there is an update of the state and 1 cycle if there is not.
FIG. 4 is a state transition diagram showing an example of state transition of pulse signal generation.

図４に示すように、０〜４の５つの状態間で、遷移が行われる。状態０のとき、遷移可否ｆが０である場合、パルス信号が発生される。この場合、状態０からの遷移は行われない。状態０のとき、遷移可否ｆが１であると、状態１に遷移する。図４において、Ｄ．Ｃ．は、ドントケアを示している。つまり、状態１からは、遷移可否ｆの値によらずクロック信号ＣＬＫに同期して、状態２、状態３、状態４へと遷移し、状態０へと戻る。そして状態４から状態０に戻る際に、パルス信号が発生される。 As shown in FIG. 4, a transition is made between five states of 0-4. At the time of state 0, if the transition possibility f is 0, a pulse signal is generated. In this case, transition from state 0 is not performed. When the transition possibility f is 1 at the state 0, the transition to the state 1 is made. In FIG. C. Indicates don't care. That is, from state 1, regardless of the value of transition possibility f, the state transitions to states 2, 3 and 4 in synchronization with the clock signal CLK, and the state returns to state 0. Then, upon returning from state 4 to state 0, a pulse signal is generated.

このような状態遷移を実現するためのステートマシンは、以下の真理値表を満たす回路とすればよい。
図５は、パルス信号を発生する論理回路の真理値表の一例を示す図である。 A state machine for realizing such state transition may be a circuit satisfying the following truth table.
FIG. 5 is a diagram showing an example of a truth table of a logic circuit that generates a pulse signal.

また、図６は、パルス信号を発生するステートマシンの一例を示す図である。
ステートマシン５０は、３ビットフリップフロップ５１、インクリメント回路５２、ＡＮＤ回路５３、セレクタ５４、ＡＮＤ回路５５，５６を有している。図５の真理値表は、各状態の３ビットフリップフロップ５１の出力値Ｑ１，Ｑ２，Ｑ３と、入力値Ｄ１，Ｄ２，Ｄ３の関係を示すものである。 FIG. 6 is a diagram showing an example of a state machine that generates a pulse signal.
The state machine 50 has a 3-bit flip flop 51, an increment circuit 52, an AND circuit 53, a selector 54, and AND circuits 55 and 56. The truth table of FIG. 5 shows the relationship between the output values Q1, Q2 and Q3 of the 3-bit flip-flop 51 in each state and the input values D1, D2 and D3.

３ビットフリップフロップ５１には、インクリメント回路５２が出力する３ビットの値のうち、上位２ビット（［ｄ０：ｄ１］）と、セレクタ５４が出力する値が、入力値Ｄ１〜Ｄ３として供給される。３ビットフリップフロップ５１は、クロック信号ＣＬＫに同期したタイミングで、入力値Ｄ１〜Ｄ３を取り込み、出力値Ｑ１〜Ｑ３として出力する。 Of the 3-bit values output from the increment circuit 52, the upper 2 bits ([d0: d1]) of the 3-bit flip-flop 51 and the values output from the selector 54 are supplied as input values D1 to D3. . The 3-bit flip-flop 51 takes in the input values D1 to D3 at timings synchronized with the clock signal CLK, and outputs them as output values Q1 to Q3.

インクリメント回路５２は、３ビットフリップフロップ５１が出力する３ビットの出力値Ｑ１〜Ｑ３を＋１する。例えば、出力値Ｑ１〜Ｑ３が、“００１”（つまりＱ１＝Ｑ２＝０、Ｑ３＝１）である場合、インクリメント回路５２は、“０１０”を出力する。 The increment circuit 52 adds 1 to the 3-bit output values Q1 to Q3 output from the 3-bit flip-flop 51. For example, when the output values Q1 to Q3 are "001" (that is, Q1 = Q2 = 0, Q3 = 1), the increment circuit 52 outputs "010".

ＡＮＤ回路５３は、出力値Ｑ１〜Ｑ３の各ビットの論理レベルを反転した値を入力し、それらの論理積を出力値として出力する。
セレクタ５４の一方の入力端子には、インクリメント回路５２が出力する３ビットの値の最下位ビット（ｄ２）が供給され、他方の入力端子には、遷移可否ｆが供給される。そして、セレクタ５４は、ＡＮＤ回路５３の出力値が１であれば、遷移可否ｆを出力し、ＡＮＤ回路５３の出力値が０であれば、ｄ３を出力する。 The AND circuit 53 inputs a value obtained by inverting the logic level of each bit of the output values Q1 to Q3 and outputs the logical product of them as an output value.
The least significant bit (d2) of the 3-bit value output from the increment circuit 52 is supplied to one input terminal of the selector 54, and the transition availability f is supplied to the other input terminal. Then, the selector 54 outputs the transition availability f if the output value of the AND circuit 53 is 1, and outputs d3 if the output value of the AND circuit 53 is 0.

ＡＮＤ回路５５は、出力値Ｑ１〜Ｑ３の３ビット（［ｑ１：ｑ３］）の各ビットの論理レベルを反転した値を入力し、それらの論理積を出力値として出力する。
ＡＮＤ回路５６は、クロック信号ＣＬＫと、ＡＮＤ回路５５が出力する出力値との論理積を、パルス信号として出力する。 The AND circuit 55 inputs a value obtained by inverting the logic level of each bit of the output values Q1 to Q3 ([q1: q3]), and outputs the logical product of them as an output value.
The AND circuit 56 outputs a logical product of the clock signal CLK and the output value output from the AND circuit 55 as a pulse signal.

以上のようなステートマシン５０でパルス信号を生成することができる。
以下第１の実施の形態の最適化装置の動作例を説明する。
乱数発生回路１４ｂ１は、前述した各反復において状態遷移の候補の番号（遷移番号Ｎ）を乱数値により１つずつ発生する。セレクタ１４ｂ２は、その状態遷移に伴うエネルギー変化値（−ΔＥ）を選択して出力する。また、一様乱数である乱数値に基づきノイズテーブル１４ｂ３による変換を行って得られた値に、乗算器１４ｂ４が温度値Ｔを乗算することによりメトロポリス法またはギブス法における熱励起エネルギーを生成する。そして、減算器２１ａは、熱励起エネルギーから累算器２２ａが出力するオフセット値ｙを減ずる。比較器１４ｂ５は、減算器２１ａが出力する減算結果と、セレクタ１４ｂ２が選択して出力したエネルギー変化値（−ΔＥ）とを比較することで状態遷移の可否を決定する。 The pulse signal can be generated by the state machine 50 as described above.
An operation example of the optimization apparatus according to the first embodiment will be described below.
The random number generation circuit 14b1 generates the number of the state transition candidate (transition number N) one by one according to the random number value in each repetition described above. The selector 14 b 2 selects and outputs the energy change value (−ΔE) accompanying the state transition. Further, the multiplier 14b4 multiplies the temperature value T by the value obtained by performing conversion by the noise table 14b3 based on random numbers which are uniform random numbers, thereby generating thermal excitation energy in the metropolis method or Gibbs method. . Then, the subtractor 21a subtracts the offset value y output from the accumulator 22a from the thermal excitation energy. The comparator 14b5 compares the subtraction result output from the subtractor 21a with the energy change value (-ΔE) selected and output by the selector 14b2 to determine whether the state transition is possible.

オフセット値ｙは、累算器２２ａにより、状態遷移が採用されたとき０にリセットされ、状態遷移が採用されず現在の状態に留まるときオフセット増分値Δｙ増分が加算される。これにより、現在の状態における滞在時間に対してオフセット値ｙが単調増加するよう制御される。 The offset value y is reset to 0 by the accumulator 22a when a state transition is adopted, and the offset increment value Δy increment is added when the state transition is not adopted and remains in the current state. Thus, the offset value y is controlled to monotonically increase with respect to the staying time in the current state.

オフセット増分値Δｙを決める目安は以下のように与えられる。
前述のように、収束性に悪影響を及ぼすことなく加速効果を得るには、局所解の滞在時間が、局所解でない場合の数倍程度になるようにオフセット増分値Δｙを選ぶのがよいと考えられる。本実施の形態のように各反復において状態遷移の候補が１つ発生する場合、各状態遷移が候補に挙がる確率は、全ての状態遷移の数の逆数となる。このことを考慮すると、オフセット増分値Δｙは、滞在時間が全ての状態遷移の数の数倍程度になったときオフセット値ｙが局所解からの脱出に必要な山の高さのエネルギーになるように定めるのがよいと考えられる。 A standard for determining the offset increment value Δy is given as follows.
As mentioned above, in order to obtain the acceleration effect without adversely affecting the convergence, it is better to select the offset increment value Δy so that the residence time of the local solution will be several times that of the non-local solution. Be When one state transition candidate occurs in each iteration as in the present embodiment, the probability that each state transition is a candidate is the inverse of the number of all state transitions. Taking this into consideration, the offset increment value Δy is such that the offset value y becomes the energy of the height of the mountain necessary to escape from the local solution when the staying time becomes several times the number of all state transitions. It is considered that it is better to

図７は図３の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。最適化する問題は３２都市の巡回セールスマン問題をイジングモデル（ボルツマンマシン）により定式化したものである。横軸は反復回数、縦軸は最適解が得られた割合（正答率（％））を表している。結果６０は、図３の遷移制御部２０ａを用いたときの、反復回数と正答率との関係を示し、結果６１は、図１４に示した遷移制御部１４を用いたときの、反復回数と正答率との関係を示す。 FIG. 7 is a diagram showing an example of a software simulation result of the pseudo annealing method realized by using the transition control unit of FIG. The problem to be optimized is the traveling salesman problem of 32 cities formulated by Ising model (Boltzmann machine). The horizontal axis represents the number of iterations, and the vertical axis represents the rate at which the optimal solution was obtained (percent correct answer (%)). The result 60 shows the relationship between the number of iterations and the correct answer rate when the transition control unit 20a of FIG. 3 is used, and the result 61 is the number of iterations when the transition control unit 14 shown in FIG. 14 is used. Indicates the relationship with the correct answer rate.

図７から第１の実施の形態の遷移制御部２０ａを用いた場合のほうが、遷移制御部１４を用いた場合よりも速く正解に達することがわかる。以下の式１３で表される９９％の確率で正答が得られる反復回数Ｎ_９９で比べると遷移制御部１４を用いた場合では４．３×１０^１０、第１の実施の形態の遷移制御部２０ａを用いた場合では７．７×１０^９であり、約５倍高速化されていることが示された。 It can be seen from FIG. 7 that the correct answer is reached more quickly in the case of using the transition control unit 20 a of the first embodiment than in the case of using the transition control unit 14. The transition control unit according to the first embodiment is 4.3 × 10 ¹⁰ in the case where the transition control unit 14 is used in comparison with the number of iterations N ₉₉ by which the correct answer is obtained with the probability of 99% expressed by the following equation 13: In the case of using 20a, it was 7.7 × 10 ⁹ and it was shown that the speed was improved about five times.

ただし、式１３において、ｎは反復回数で、η（ｎ）はその回数での正答率である。
（第２の実施の形態）
図８は第２の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。なお、図８では、乱数値を発生する回路については図示が省略されている。 However, in Equation 13, n is the number of iterations, and η (n) is the correct answer rate at that number.
Second Embodiment
FIG. 8 is a diagram showing an example of a circuit configuration of the transition control unit in the optimization device of the second embodiment. In FIG. 8, the circuit that generates random numbers is not shown.

以下、図８の遷移制御部２０ｂは各ビット反転（スピンの値の変化）を全て状態遷移の候補とするものとして説明するが、各ビット反転の一部のみを状態遷移の候補とすることも可能である。また、以下の説明では、熱励起のために用いる乱数値を、各遷移候補に対して独立とするが、いくつかの状態遷移の候補に対して共通としてもよい。 Hereinafter, the transition control unit 20b in FIG. 8 is described as all bit inversions (changes in spin value) as state transition candidates, but it is also possible to use only a part of each bit inversion as state transition candidates. It is possible. In the following description, random numbers used for thermal excitation are independent for each transition candidate, but may be common for several state transition candidates.

遷移制御部２０ｂは、図３の遷移制御部２０ａと同様に累算器２２ａを有している他、熱励起エネルギー生成部７０、減算器７１、比較器７２、セレクタ７３を有する。
熱励起エネルギー生成部７０は、遷移候補ごとに独立の乱数値｛ｕｉ｝を、前述した逆関数ｆ^−１（ｕ）の値に変換するノイズテーブル（記憶部）を有する。さらに熱励起エネルギー生成部７０は、ノイズテーブルが出力する値に温度値Ｔを乗算した積を、メトロポリス法またはギブス法における熱励起エネルギーとして出力する。 The transition control unit 20 b includes an accumulator 22 a as in the case of the transition control unit 20 a of FIG. 3, and further includes a thermal excitation energy generation unit 70, a subtractor 71, a comparator 72, and a selector 73.
The thermal excitation energy generation unit 70 has a noise table (storage unit) that converts the independent random number value {ui} into the value of the above-described inverse function f ⁻¹ (u) for each transition candidate. Furthermore, the thermal excitation energy generation unit 70 outputs a product of the value output from the noise table and the temperature value T as thermal excitation energy in the Metropolis method or Gibbs method.

減算器７１は、遷移候補ごとに生成された熱励起エネルギーから、累算器２２ａが出力するオフセット値ｙを減ずる。
比較器７２は、減算器７１が出力する各減算結果と、エネルギー変化値｛−ΔＥ_ｉ｝とを比較することで各状態遷移の可否を示す遷移可否｛ｆｉ｝を出力する。なお、この比較器７２の動作は、複数の状態遷移のそれぞれに対して計算されたエネルギー変化値｛−ΔＥ_ｉ｝とオフセット値ｙとの和のそれぞれと、複数の乗算（熱励起エネルギー）とのそれぞれとの比較結果を出力することに相当する。 The subtractor 71 subtracts the offset value y output from the accumulator 22a from the thermal excitation energy generated for each transition candidate.
The comparator 72 compares transition results of the subtractor 71 with the energy change value {−ΔE _i } to output transition availability {fi} indicating availability of each state transition. Note that the operation of the comparator 72 is as follows: each of the sum of the energy change value {−ΔE _i } calculated for each of the plurality of state transitions and the offset value y; and a plurality of multiplications (thermal excitation energy) It corresponds to outputting the comparison result with each of.

セレクタ７３は、遷移可否｛ｆｉ｝に基づいて、許容された状態遷移が複数存在するときは、乱数値を用いてその中から１つをランダムに選択する。そして、セレクタ７３は、選択した状態遷移の候補の番号（遷移番号Ｎ）を出力するとともに、遷移可否ｆとして１を出力する。状態遷移が生じないときには、遷移可否ｆは０となる。 If there are a plurality of permitted state transitions, the selector 73 randomly selects one of the state transitions using random number values based on the transition availability {fi}. Then, the selector 73 outputs the number (transition number N) of the selected state transition candidate, and outputs 1 as the transition possibility f. When no state transition occurs, the transition availability f is zero.

以下第２の実施の形態の最適化装置の動作例を説明する。
前述した各反復において、熱励起エネルギー生成部７０は、状態遷移の候補の数と等しい独立な一様乱数である乱数値｛ｕｉ｝を受け、ノイズテーブルを用いて変換を行う。そして熱励起エネルギー生成部７０は、変換で得られた値に共通の温度値Ｔを乗算することにより、メトロポリス法またはギブス法における熱励起エネルギーを生成する。 An operation example of the optimization device of the second embodiment will be described below.
In each iteration described above, the thermal excitation energy generation unit 70 receives random value {ui} which is an independent uniform random number equal to the number of state transition candidates, and performs conversion using a noise table. Then, the thermal excitation energy generation unit 70 generates thermal excitation energy in the metropolis method or Gibbs method by multiplying the value obtained by the conversion by the common temperature value T.

遷移候補ごとに生成された熱励起エネルギーから、減算器７１によって、累算器２２ａが出力するオフセット値ｙが減ぜられ、比較器７２で、減算器７１が出力する各減算結果と、エネルギー変化値｛−ΔＥ_ｉ｝とが比較される。比較器７２は、比較結果に基づいて、各状態遷移の可否を示す遷移可否｛ｆｉ｝を出力する。許容された状態遷移が複数存在する時は、セレクタ７３は、乱数値を用いてその中から１つをランダムに選択する。 From the thermal excitation energy generated for each transition candidate, the subtractor 71 subtracts the offset value y output from the accumulator 22a, and the comparator 72 reduces each subtraction result output from the subtractor 71, and changes in energy The value {−ΔE _i } is compared. The comparator 72 outputs transition availability {fi} indicating availability of each state transition based on the comparison result. When there are a plurality of permitted state transitions, the selector 73 randomly selects one from among them using random number values.

オフセット値ｙは、許容された状態遷移が存在し状態が変化するとき（遷移可否ｆが１のとき）、累算器２２ａによって０にリセットされる。候補となった状態遷移が全て許容されず現在の状態に留まるとき（遷移可否ｆが０のとき）、累算器２２ａは、オフセット値ｙにオフセット増分値Δｙを加算することで、現在の状態における滞在時間に対してオフセット値ｙが単調増加するよう制御する。 The offset value y is reset to 0 by the accumulator 22a when the permitted state transition exists and the state changes (when the transition possibility f is 1). When all the candidate state transitions are not permitted and remain in the current state (when transition possibility f is 0), the accumulator 22a adds the offset increment value Δy to the offset value y to obtain the current state. The offset value y is controlled to increase monotonically with respect to the staying time in

全ての状態遷移が候補として挙げられ、局所解でないときほぼ１回の反復で状態遷移が起こることを考慮すると、オフセット増分値Δｙは、滞在時間が数回程度になったとき局所解からの脱出に必要なエネルギーになるように定めるのがよいと考えられる。 The offset increment value Δy escapes from the local solution when the residence time becomes several times, considering that all the state transitions are candidates and the state transition occurs in almost one iteration when not a local solution. It is considered good to set the energy necessary for

図９は図８の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。最適化する問題は３２都市の巡回セールマン問題をイジングモデル（ボルツマンマシン）により定式化したものである。横軸は反復回数、縦軸は最適解が得られた割合（正答率（％））を表している。結果６０ａは、図８の遷移制御部２０ｂを用いたときの、反復回数と正答率との関係を示し、結果６１ａは、遷移制御部２０ｂから、減算器７１と累算器２２ａを除いたときの、反復回数と正答率との関係を示す。 FIG. 9 is a view showing an example of a software simulation result of the pseudo annealing method realized by using the transition control unit of FIG. The problem to be optimized is formulated by using the Ising model (Boltzmann machine) for the traveling saleman problem in 32 cities. The horizontal axis represents the number of iterations, and the vertical axis represents the rate at which the optimal solution was obtained (percent correct answer (%)). The result 60a shows the relationship between the number of iterations and the correct answer rate when the transition control unit 20b of FIG. 8 is used, and the result 61a is when the subtractor 71 and the accumulator 22a are removed from the transition control unit 20b. Shows the relationship between the number of iterations and the correct answer rate.

図９から第２の実施の形態の遷移制御部２０ｂを用いた場合のほうが、減算器７１と累算器２２ａがない場合よりも速く正解に達することがわかる。９９％の確率で正答が得られる反復回数Ｎ_９９で比べると減算器７１と累算器２２ａがない場合では５．３×１０^７、第２の実施の形態の遷移制御部２０ｂを用いた場合では１．１×１０^７であり、約５倍高速化されていることが示された。 It can be seen that the correct solution is reached faster in the case of using the transition control unit 20b of the second embodiment than in the case of not having the subtractor 71 and the accumulator 22a in FIG. 5.3 × 10 ⁷ is in the absence of accumulator 22a and the subtractor 71 compared with the number of iterations _{N 99} the resulting correct 99% probability, the case of using the transition control portion 20b of the second embodiment It was 1.1 × 10 ⁷ and was shown to be about 5 times faster.

以下、図８の遷移制御部２０ｂを用いた最適化装置の一例を説明する。
図１０は、図８の遷移制御部を用いた最適化装置の一例を示す図である。
最適化装置８０は、エネルギー計算部８１ａ１，…，８１ａｉ，…，８１ａｎ、遷移制御部８２、状態更新部８３を有している。 Hereinafter, an example of the optimization apparatus using the transition control unit 20b of FIG. 8 will be described.
FIG. 10 is a diagram showing an example of an optimization apparatus using the transition control unit of FIG.
The optimization device 80 includes energy calculation units 81a1, ..., 81ai, ..., 81an, a transition control unit 82, and a state update unit 83.

エネルギー計算部８１ａ１〜８１ａｎは、図１３に示したエネルギー計算部１２の一例であり、エネルギー変化値（−ΔＥ_１，…，−ΔＥ_ｉ，…，−ΔＥ_ｎ（前述の｛−ΔＥ_ｉ｝に相当））を計算し、出力する。 Energy calculation unit 81a1~81an is an example of an energy calculation unit 12 shown in FIG. 13, the energy change value _{_{(-ΔE 1, ..., -ΔE i}} , ..., -ΔE n ( the aforementioned {-ΔE _i} Equivalent)) to calculate and output.

例えば、エネルギー計算部８１ａｉは、レジスタ８１ｂ、セレクタ８１ｃ，８１ｄ、乗算器８１ｅ、加算器８１ｆ、レジスタ８１ｇ、セレクタ８１ｈ、乗算器８１ｉを有している。 For example, the energy calculation unit 81 ai includes a register 81 b, selectors 81 c and 81 d, a multiplier 81 e, an adder 81 f, a register 81 g, a selector 81 h, and a multiplier 81 i.

レジスタ８１ｂは、前述の式１０等における相互作用係数Ｊ_ｉ，１，Ｊ_ｉ，２，…，Ｊ_ｉ，ｎを格納する。
なお、相互作用係数Ｊ_ｉ，１〜Ｊ_ｉ，ｎは、例えば、最適化装置８０内の図示しない制御装置または、最適化装置８０の外部の装置により、計算対象の問題に応じて予め計算され、レジスタ８１ｂに格納される。なお、上記のような相互作用係数Ｊ_ｉ，１〜Ｊ_ｉ，ｎは、ＲＡＭ等のメモリに格納されてもよい。 Register 81b, the interaction coefficients _{_{J i, 1, J i,}} 2 in equation 10 like the foregoing, _{..., J i,} stores _n.
The interaction coefficients J _{i, 1 to} J _{i, n} are calculated in advance according to the problem to be calculated, for example, by a control device (not shown) in the optimization device 80 or a device outside the optimization device 80. , And stored in the register 81 b. Note that the interaction coefficients J _{i, 1 to} J _{i, n} as described above may be stored in a memory such as a RAM.

セレクタ８１ｃは、遷移制御部８２が出力する遷移番号Ｎに基づき、レジスタ８１ｂに格納されている相互作用係数Ｊ_ｉ，１〜Ｊ_ｉ，ｎのうち１つを選択して出力する。
例えば、Ｎ＝ｎがセレクタ８１ｃに入力されたとき、セレクタ８１ｃは、相互作用係数Ｊ_ｉ，ｎを選択する。 The selector 81 c selects and outputs one of the interaction coefficients J _{i, 1 to} J _{i, n} stored in the register 81 b based on the transition number N output from the transition control unit 82.
For example, when N = n is input to the selector 81 c, the selector 81 c selects the interaction coefficient J _{i, n} .

セレクタ８１ｄは、式１１の１−２ｓ_ｉの演算を実現するものであり、状態更新部８３が出力する更新後のスピンｓ_Ｎの値に基づき、１または−１を選択して出力する。更新後の値が０のときには、セレクタ８１ｄは、−１を選択して出力し、更新後の値が１のときには、セレクタ８１ｄは、１を選択して出力する。 The selector 81 d implements the calculation of 1−2s _i of Expression 11, and selects and outputs 1 or −1 based on the value of the updated spin s _N output from the state updating unit 83. When the updated value is 0, the selector 81d selects and outputs -1, and when the updated value is 1, the selector 81d selects and outputs 1.

乗算器８１ｅは、セレクタ８１ｃが出力する相互作用係数と、セレクタ８１ｄが出力する値とを乗算した積を出力する。
加算器８１ｆは、乗算器８１ｅが出力する乗算結果と、レジスタ８１ｇに格納されている値とを加算した和を出力する。 The multiplier 81 e outputs a product obtained by multiplying the interaction coefficient output from the selector 81 c and the value output from the selector 81 d.
The adder 81 f outputs the sum of the multiplication result output from the multiplier 81 e and the value stored in the register 81 g.

レジスタ８１ｇは、図示しないクロック信号に同期して、加算器８１ｆが出力する値を取り込む。レジスタ８１ｇは、例えば、フリップフロップである。なお、レジスタ８１ｇに格納される値が、式１２におけるローカルフィールド値Ｆ_ｉである。 The register 81g takes in the value output from the adder 81f in synchronization with a clock signal (not shown). The register 81 g is, for example, a flip flop. The value stored in the register 81 g is the local field value F _i in Equation 12.

セレクタ８１ｈは、変化後のスピンｓ_ｉの値が、０のとき１を出力し、１のとき−１を出力する。セレクタ８１ｈの出力は、式１１の１−２ｓ_ｉに相当する。
乗算器８１ｉは、レジスタ８１ｇが出力するローカルフィールド値Ｆ_ｉとセレクタ８１ｈが出力する値とを乗算した積をエネルギー変化値（−ΔＥ_ｉ）として出力する。 The selector 81 _h outputs 1 when the value of the spin s _i after change is 0, and outputs −1 when it is 1. The output of the selector 81h corresponds to 1-2s _i of Equation 11.
The multiplier 81i outputs a product obtained by multiplying the local field value F _i output from the register 81 g and the value output from the selector 81 _h as an energy change value (−ΔE _i ).

遷移制御部８２は、回路部８２ａ１，…，８２ａｉ，…，８２ａｎ、セレクタ８２ｂ、オフセット制御回路８２ｃを有している。
回路部８２ａ１〜８２ａｎは、図８に示した遷移制御部２０ｂの熱励起エネルギー生成部７０、減算器７１、比較器７２の機能を、状態遷移の候補ごとに分割して行うものであり、セレクタ８２ｂは、図８に示したセレクタ７３に相当する。また、オフセット制御回路８２ｃは、図８に示した累算器２２ａに相当する。 The transition control unit 82 includes circuit units 82a1, ..., 82ai, ..., 82an, a selector 82b, and an offset control circuit 82c.
The circuit units 82a1 to 82an divide the functions of the thermal excitation energy generation unit 70, the subtractor 71, and the comparator 72 of the transition control unit 20b shown in FIG. 82 b corresponds to the selector 73 shown in FIG. The offset control circuit 82c corresponds to the accumulator 22a shown in FIG.

したがって、遷移制御部８２は、図８に示した遷移制御部２０ｂと同様の動作を行う。
状態更新部８３は、図１３に示した状態保持部１１の機能を有し、遷移制御部１４が出力する遷移可否ｆと遷移番号Ｎに基づき、保持されているスピンｓ_１〜ｓ_ｎの値を更新して、その値の組み合わせ（Ｓｔａｔｅ）を出力する。また、状態更新部８３は、更新後のスピンの値（図１１の例ではｓ_Ｎと表記されている）を出力する。 Therefore, transition control unit 82 performs the same operation as transition control unit 20b shown in FIG.
The state update unit 83 has the function of the state holding unit 11 shown in FIG. 13, and based on the transition availability f and the transition number N output from the transition control unit 14, the values of the retained spins s _{1 to} s _n And output the combination (State) of the values. The state updating unit 83 outputs the spin updated values (labeled s _N in the example of FIG. 11).

第２の実施の形態の遷移制御部２０ｂ，８２は、上記のような最適化装置８０に適用可能である。
（第３の実施の形態）
図１１は第３の実施の形態の最適化装置における遷移制御部の回路構成の一例を示す図である。なお、図１１では、乱数値を発生する回路については図示が省略されている。また、図８に示した遷移制御部２０ｂと同じ要素については同一符号が付されている。 The transition control units 20b and 82 according to the second embodiment are applicable to the optimization apparatus 80 as described above.
Third Embodiment
FIG. 11 is a diagram showing an example of a circuit configuration of the transition control unit in the optimization device of the third embodiment. In FIG. 11, the circuit for generating the random number is not shown. The same elements as those of the transition control unit 20b shown in FIG. 8 are denoted by the same reference numerals.

以下、図１１の遷移制御部２０ｃは各ビット反転（スピンの値の変化）を全て状態遷移の候補とするものとして説明するが、各ビット反転の一部のみを状態遷移の候補とすることも可能である。 Hereinafter, the transition control unit 20 c in FIG. 11 is described as all bit inversions (changes in spin value) as state transition candidates, but it is also possible to use only a part of each bit inversion as state transition candidates. It is possible.

遷移制御部２０ｃは、図３の遷移制御部２０ａと同様に累算器２２ａを有している他、熱励起エネルギー生成部７０ａ、減算器７１ａ、比較器７２ａ、セレクタ７３を有する。
熱励起エネルギー生成部７０ａは、各遷移候補に対して共通の乱数値ｕ（一様乱数）を、前述した逆関数ｆ^−１（ｕ）の値に変換するノイズテーブルを有し、その値に温度値Ｔを乗算した積を、メトロポリス法またはギブス法における熱励起エネルギーとして出力する。 Similar to the transition control unit 20a of FIG. 3, the transition control unit 20c includes an accumulator 22a, and further includes a thermal excitation energy generation unit 70a, a subtractor 71a, a comparator 72a, and a selector 73.
The thermal excitation energy generation unit 70a has a noise table for converting the common random number value u (uniform random number) for each transition candidate into the value of the inverse function f ⁻¹ (u) described above, and The product multiplied by the temperature value T is output as thermal excitation energy in the Metropolis method or Gibbs method.

減算器７１ａは、熱励起エネルギーから、全ての状態遷移の候補に共通なオフセット値ｙを減ずる。
比較器７２ａは、減算器７１が出力する減算結果と、各状態遷移によるエネルギー変化値｛−ΔＥ_ｉ｝とを比較することで各状態遷移の可否を示す遷移可否｛ｆｉ｝を出力する。 The subtractor 71a subtracts the offset value y common to all state transition candidates from the thermal excitation energy.
The comparator 72a compares the subtraction result output from the subtractor 71 with the energy change value {−ΔE _i } due to each state transition to output transition availability {fi} indicating whether each state transition is possible.

セレクタ７３は、遷移可否｛ｆｉ｝に基づいて、許容された状態遷移が複数存在するときは、乱数を用いてその中から１つをランダムに選択する。そして、セレクタ７３は、選択した状態遷移の候補の番号（遷移番号Ｎ）を出力するとともに、遷移可否ｆとして１を出力する。状態遷移が生じないときには、遷移可否ｆは０となる。 If there are a plurality of permitted state transitions, the selector 73 randomly selects one of the state transitions using random numbers based on the transition availability {fi}. Then, the selector 73 outputs the number (transition number N) of the selected state transition candidate, and outputs 1 as the transition possibility f. When no state transition occurs, the transition availability f is zero.

以下第３の実施の形態の最適化装置の動作例を説明する。
前述した各反復において、熱励起エネルギー生成部７０ａは、各ビット反転に共通な一様乱数である乱数値ｕを受け、ノイズテーブルを用いて変換を行う。そして熱励起エネルギー生成部７０ａは、変換で得られた値に温度値Ｔを乗算することにより、メトロポリス法またはギブス法における熱励起エネルギーを生成する。 An operation example of the optimization device of the third embodiment will be described below.
In each iteration described above, the thermal excitation energy generation unit 70a receives a random value u which is a uniform random number common to each bit inversion, and performs conversion using a noise table. Then, the thermal excitation energy generating unit 70a generates thermal excitation energy in the metropolis method or Gibbs method by multiplying the temperature value T by the value obtained by the conversion.

生成された熱励起エネルギーから、減算器７１ａによって、累算器２２ａが出力するオフセット値ｙが減ぜられ、比較器７２ａで、減算器７１ａが出力する減算結果と、エネルギー変化値｛−ΔＥ_ｉ｝とが比較される。比較器７２ａは、比較結果に基づいて、各状態遷移の状態遷移の可否を示す遷移可否｛ｆｉ｝を出力する。許容された状態遷移が複数存在する時は、セレクタ７３は、乱数値を用いてその中から１つをランダムに選択する。 The offset value y output from the accumulator 22a is reduced by the subtractor 71a from the generated thermal excitation energy, and the subtraction result output from the subtractor 71a and the energy change value {-ΔE _i are reduced by the comparator 72a. } Is compared. The comparator 72a outputs transition availability {fi} indicating availability of the state transition of each state transition based on the comparison result. When there are a plurality of permitted state transitions, the selector 73 randomly selects one from among them using random number values.

オフセット値ｙは、第２の実施の形態の遷移制御部２０ｂと同様に制御される。
図１２は図１１の遷移制御部を用いて実現される疑似焼き鈍し法のソフトウェアシミュレーション結果の一例を示す図である。最適化する問題は３２都市の巡回セールマン問題をイジングモデル（ボルツマンマシン）により定式化したものである。横軸は反復回数、縦軸は最適解が得られた割合（正答率（％））を表している。結果６０ｂは、図１１の遷移制御部２０ｃを用いたときの、反復回数と正答率との関係を示し、結果６１ｂは、遷移制御部２０ｃから、減算器７１ａと累算器２２ａを除いたときの、反復回数と正答率との関係を示す。 The offset value y is controlled in the same manner as the transition control unit 20b of the second embodiment.
FIG. 12 is a diagram showing an example of a software simulation result of the pseudo annealing method realized by using the transition control unit of FIG. The problem to be optimized is formulated by using the Ising model (Boltzmann machine) for the traveling saleman problem in 32 cities. The horizontal axis represents the number of iterations, and the vertical axis represents the rate at which the optimal solution was obtained (percent correct answer (%)). The result 60b shows the relationship between the number of iterations and the correct answer rate when the transition control unit 20c of FIG. 11 is used, and the result 61b is when the subtractor 71a and the accumulator 22a are removed from the transition control unit 20c. Shows the relationship between the number of iterations and the correct answer rate.

図１２から第３の実施の形態の遷移制御部２０ｃを用いた場合のほうが、減算器７１ａと累算器２２ａがない場合よりも速く正解に達することがわかる。９９％の確率で正答が得られる反復回数Ｎ_９９で比べると減算器７１ａと累算器２２ａがない場合では３．４×１０^７、第３の実施の形態の遷移制御部２０ｃを用いた場合では１．０×１０^７であり、約３倍高速化されていることが示された。 It can be seen that the correct solution is reached more quickly in the case where the transition control unit 20c of the third embodiment is used from FIG. 12 than in the case where the subtractor 71a and the accumulator 22a are not provided. When the subtractor 71a and the accumulator 22a do not have the number of iterations N ₉₉ at which the correct answer is obtained with a probability of 99%, 3.4 × 10 ⁷ in the absence of the accumulator 22a, the transition control unit 20c of the third embodiment is used. In this case, it was 1.0 × 10 ⁷ and it was shown to be about 3 times faster.

また、第３の実施の形態の遷移制御部２０ｃでは、各状態遷移で共通の乱数値ｕを用いるため、第２の実施の形態の遷移制御部２０ｂよりも、回路面積を削減できる。
以上、実施の形態に基づき、本発明の最適化装置及び最適化装置の制御方法の一観点について説明してきたが、これらは一例にすぎず、上記の記載に限定されるものではない。 Further, in the transition control unit 20c of the third embodiment, the circuit area can be reduced as compared with the transition control unit 20b of the second embodiment since the common random number u is used in each state transition.
As mentioned above, although one aspect of a control method of an optimization device and an optimization device of the present invention was explained based on an embodiment, these are only examples and are not limited to the above-mentioned statement.

１４ｂ１乱数発生回路
１４ｂ２セレクタ
１４ｂ３ノイズテーブル
１４ｂ４乗算器
１４ｂ５比較器
２０遷移制御部
２１オフセット加算回路
２１ａ減算器
２２オフセット制御回路
２２ａ累算器 14b1 random number generation circuit 14b2 selector 14b3 noise table 14b4 multiplier 14b5 comparator 20 transition control unit 21 offset addition circuit 21a subtractor 22 offset control circuit 22a accumulator

Claims

A state holding unit for holding values of a plurality of state variables included in an evaluation function representing energy;
An energy calculator configured to calculate a change value of the energy for each of a plurality of state transitions when a state transition occurs in response to any of the values of the plurality of state variables changing;
A temperature control unit that controls a temperature value indicating the temperature;
In probabilistically determining whether to accept any of the plurality of state transitions based on the temperature value, the change value, and the random value based on the relative relationship between the change value and the thermal excitation energy. A transition control unit that adds an offset value to the change value and controls the offset value in the local solution in which the energy is minimized so as to be larger than that in the case where the energy is not minimized;
An optimization apparatus characterized by having.

The transition control unit has an offset control circuit,
The offset control circuit sets the offset value to 0 when accepting any of the plurality of state transitions, and increments the offset value every first period when not accepting any of the plurality of state transitions. And monotonously increasing the offset value with respect to the staying time of the current state represented by the values of the plurality of state variables,
The optimization device according to claim 1, characterized in that:

The offset control circuit further includes an accumulator having a reset terminal, and the accumulator sets the offset value to 0 when receiving a first signal indicating that any one of the plurality of state transitions is accepted. An offset increment value is added to the offset value upon receiving a second signal indicating that none of the plurality of state transitions is accepted.
The optimization device according to claim 2, characterized in that:

The accumulator further has a clock terminal, and adds the offset increment value to the offset value each time a pulse signal from a state machine is input to the clock terminal.
The optimization device according to claim 3, characterized in that:

The transition control unit
A selector that selects one of the change values calculated for each of the plurality of state transitions in accordance with the random number value;
A storage unit that outputs a value of an inverse function of a function indicating an allowance probability of the plurality of state transitions represented by the metropolis method or the Gibbs method according to the random value;
A multiplier for outputting the thermal excitation energy represented by a product of the value of the inverse function and the temperature value;
A state transition corresponding to the change value selected by the selector, represented by a value corresponding to a comparison result of the sum of the change value selected by the selector and the offset value and the thermal excitation energy A comparator that outputs the determination result as to whether to accept or not;
The optimization apparatus according to any one of claims 1 to 4, characterized in that

The transition control unit
Outputs a plurality of inverse values of a function indicating an allowable probability of the plurality of state transitions represented by the metropolis method or the Gibbs method according to the random number values independent of one another for each of the plurality of state transitions A storage unit to
A multiplier for outputting the thermal excitation energy represented by a plurality of products obtained by multiplying each of the plurality of values by the temperature value;
A plurality of values corresponding to comparison results between each of a plurality of sums obtained by adding the change value calculated for each of the plurality of state transitions and the offset value, and each of the plurality of products A comparator for outputting a plurality of determination results as to whether or not each of the plurality of state transitions is accepted;
A selector that selects any one state transition when there are a plurality of state transitions to be accepted among the plurality of state transitions based on the plurality of determination results;
The optimization apparatus according to any one of claims 1 to 4, characterized in that

The transition control unit
A memory for outputting a value of an inverse function of a function indicating an allowable probability of the plurality of state transitions represented by the metropolis method or the Gibbs method according to the random number value common to all of the plurality of state transitions Department,
A multiplier for outputting the thermal excitation energy represented by a product of the value of the inverse function and the temperature value;
Represented by a plurality of values corresponding to a comparison result of each of a plurality of sums obtained by adding the change value calculated for each of the plurality of state transitions and the offset value, and the product A comparator that outputs a plurality of determination results as to whether or not each of a plurality of state transitions is accepted;
A selector that receives one of the plurality of determination results and selects one of the state transitions to be accepted when there are a plurality of state transitions to be accepted among the plurality of state transitions;
The optimization apparatus according to any one of claims 1 to 4, characterized in that

The transition control unit
A plurality of inverse functions of a function indicating an allowable probability of the plurality of state transitions represented by Metropolis method or Gibbs method according to the random number value common to two or more state transitions among the plurality of state transitions A storage unit that outputs the value of
A multiplier for outputting the thermal excitation energy represented by a plurality of products obtained by multiplying each of the plurality of values by the temperature value;
A plurality of values corresponding to comparison results between each of a plurality of sums obtained by adding the change value calculated for each of the plurality of state transitions and the offset value, and each of the plurality of products A comparator for outputting a plurality of determination results as to whether or not each of the plurality of state transitions is accepted;
A selector that selects one of the accepted state transitions when there are multiple accepted state transitions among the plurality of state transitions based on the plurality of determination results;
The optimization apparatus according to any one of claims 1 to 4, characterized in that

In the control method of the optimization device,
A state holding unit included in the optimization device holds values of a plurality of state variables included in an evaluation function representing energy, respectively.
When a state transition occurs in response to a change in any of the values of the plurality of state variables, an energy calculation unit included in the optimization device calculates the change value of the energy for each of the plurality of state transitions. And
A temperature control unit of the optimization device controls a temperature value indicating a temperature;
Whether the transition control unit included in the optimization device accepts any of the plurality of state transitions according to the relative relationship between the change value and the thermal excitation energy based on the temperature value, the change value, and a random value In addition, an offset value is added to the change value, and the offset value in the local solution in which the energy is minimized is controlled to be larger than that in the case where the energy is not minimized. Do,
And controlling the optimization device.