JP7375426B2

JP7375426B2 - Robot control arithmetic processing FPGA and data bus width determination method for arithmetic processing FPGA

Info

Publication number: JP7375426B2
Application number: JP2019177060A
Authority: JP
Inventors: 大介川瀬
Original assignee: Denso Wave Inc
Current assignee: Denso Wave Inc
Priority date: 2019-09-27
Filing date: 2019-09-27
Publication date: 2023-11-08
Anticipated expiration: 2039-09-27
Also published as: JP2021056609A

Description

本発明は、可動部を有するロボットを制御するための演算処理を行うＦＰＧＡ，及びそのＦＰＧＡが記憶部にアクセスするためのデータバス幅を決定する方法に関する。 The present invention relates to an FPGA that performs arithmetic processing to control a robot having a movable part, and a method for determining a data bus width for the FPGA to access a storage part.

ロボットの実働時や教示時又は動作シミュレーション時に、ロボットと例えば設備のような障害物とが衝突するか否かを判定する処理を行う技術がある。この衝突判定処理の演算は非常に負荷が高いため、特に衝突判定処理をリアルタイムで演算するには、高性能な演算処理機能を有したＣＰＵを用いる必要がある。 There is a technology that performs processing to determine whether or not a robot will collide with an obstacle such as equipment during actual operation, teaching, or motion simulation of the robot. Since the computation of this collision determination process is very heavy, it is necessary to use a CPU having high-performance arithmetic processing functions, especially in order to perform the collision determination process in real time.

特開２０１３－１８４２４２号公報Japanese Patent Application Publication No. 2013-184242

しかしながら、高性能なＣＰＵは当然に高価であるため、製品のコストアップを抑える観点からは、安価なＣＰＵを用いても衝突判定処理のリアルタイム演算が可能になることが望ましい。そのためには、ＦＰＧＡ（Field Programmable Gate Array）を併用してＦＰＧＡに衝突判定処理を実行させ、ＣＰＵは判定結果のみを取得する構成が想定される。 However, since a high-performance CPU is naturally expensive, from the viewpoint of suppressing an increase in product costs, it is desirable to be able to perform collision determination processing in real time even with an inexpensive CPU. To this end, a configuration is assumed in which an FPGA (Field Programmable Gate Array) is used in conjunction with the FPGA to execute the collision determination process, and the CPU acquires only the determination result.

ＦＰＧＡが衝突判定処理を行うには、判定対象となる物体の位置及び形状等を表すモデルデータをメモリから読み出す必要があるが、判定処理を高速に実行するためには、各モデルデータを１回のメモリアクセスで読み出せるようにデータバス幅を決定することが望ましい。従来、上記のようにデータバス幅を決定するための具体的な指標は、提示されていなかった。 In order for the FPGA to perform collision determination processing, it is necessary to read model data representing the position and shape of the object to be determined from memory, but in order to execute the determination processing at high speed, each model data must be read once. It is desirable to determine the data bus width so that it can be read by memory access. Conventionally, no specific index has been presented for determining the data bus width as described above.

本発明は、上記実情に鑑みてなされたものであり、その目的は、ＣＰＵの演算処理負荷を低減して、衝突判定処理のリアルタイム演算を行うことを可能にするロボット制御の演算処理用ＦＰＧＡ，及び演算処理用ＦＰＧＡのデータバス幅決定方法を提供することにある。 The present invention has been made in view of the above-mentioned circumstances, and its purpose is to provide an FPGA for arithmetic processing for robot control, which reduces the arithmetic processing load on the CPU and makes it possible to perform real-time calculations for collision determination processing. Another object of the present invention is to provide a data bus width determination method for an FPGA for arithmetic processing.

請求項１記載のロボット制御の演算処理用ＦＰＧＡによれば、ロボットの可動部と障害物との衝突判定処理要求が入力されると、可動部の少なくとも一部の形状等と障害物の少なくとも一部の形状等とをそれぞれ表したモデルのデータが記憶されている記憶部より双方のモデルのデータを読み出して双方の衝突判定を行い、その判定結果を出力する際に、衝突判定に係る演算の少なくとも一部を並列処理するパイプラインを構成するようにロジックがコンフィギュレーションされる。尚、「形状等」は、物体の形状に加えて、位置，大きさを含むものとする。 According to the FPGA for arithmetic processing of robot control according to claim 1, when a request for collision determination processing between a movable part of the robot and an obstacle is input, the shape of at least a part of the movable part and the shape of at least one part of the obstacle are input. The data of both models is read out from the storage unit that stores the data of the models representing the shapes of the parts, etc., collision determination is made between both, and when the determination results are output, the calculations related to the collision determination are performed. Logic is configured to form a pipeline that processes at least in part in parallel. Note that "shape, etc." includes the position and size of an object in addition to its shape.

このように構成すれば、ＣＰＵがＦＰＧＡに対して衝突判定処理要求を入力すると、ＦＰＧＡは、双方のモデルのデータを記憶部より読み出して双方の衝突判定を行い、その判定結果をＣＰＵに出力する、というシステムを構成できる。したがって、比較的安価なＣＰＵとＦＰＧＡとの組み合わせによって、ＣＰＵはＦＰＧＡに衝突判定を行わせながらロボットのその他の制御を処理することが可能になる。また、衝突判定の演算は処理負荷が高いが、その演算の少なくとも一部を並列処理するパイプラインを構成するようにＦＰＧＡをコンフィギュレーションすることで、演算を効率的に行うことが可能になり、ＣＰＵが要求する時間内に衝突判定を実行させることができる。 With this configuration, when the CPU inputs a collision determination processing request to the FPGA, the FPGA reads the data of both models from the storage unit, performs collision determination for both, and outputs the determination result to the CPU. It is possible to configure a system called . Therefore, by combining a relatively inexpensive CPU and FPGA, the CPU can process other controls of the robot while having the FPGA perform collision determination. In addition, collision determination calculations have a high processing load, but by configuring the FPGA to form a pipeline that processes at least part of the calculations in parallel, it is possible to perform the calculations efficiently. Collision determination can be executed within the time required by the CPU.

そして、ロボットの最大リンク長をＬ［ｍｍ］，許容判定誤差をＥ［ｍｍ］，モデルの形状等を複数のオブジェクトに分割した数をＤ，ｘの小数点以下を整数に切り上げる関数をｆ_ＲＵ（ｘ），最小限必要となるデータバスのビット幅ＤＢ_Ｗ０とすると、そのビット幅ＤＢ_Ｗ０を以下の（７）式により決定し、 Then, the maximum link length of the robot is L [mm], the allowable judgment error is E [mm], the number of divisions of the model shape etc. into multiple objects is D, and the function that rounds up the decimal point of x to an integer is f _RU ( x), Assuming that the minimum required bit width of the data bus is DB _W 0, the bit width DB _W 0 is determined by the following equation (7),

記憶部に接続されるデータバスのビット幅ＤＢ_Ｗ１を、ビット幅ＤＢ_Ｗ０以上に設定する。

The bit width DB _W 1 of the data bus connected to the storage section is set to be greater than or equal to the bit width DB _W 0.

前記ビット幅ＤＢ_Ｗ０は、各モデルの位置を表すのに必要なデータのビット数や、各モデルの大きさを、複数のオブジェクトに分割することも考慮して必要になるデータのビット数も考慮して決定される。したがって、ＦＰＧＡと記憶部とを接続するデータバスのビット幅ＤＢ_Ｗ１をビット幅ＤＢ_Ｗ０以上に設定すれば、ＦＰＧＡは、モデルデータを１回のアクセスによって記憶部より確実に読み出すことができる。 The bit width DB _W 0 includes the number of data bits required to represent the position of each model, and the number of data bits required in consideration of dividing the size of each model into multiple objects. Determined by consideration. Therefore, if the bit width DB _W 1 of the data bus connecting the FPGA and the storage section is set to the bit width DB _W 0 or more, the FPGA can reliably read model data from the storage section with one access. .

請求項２記載のロボット制御の演算処理用ＦＰＧＡによれば、モデルの位置を表すデータのビット数をＮ_Ｐとし、符号ビット及びクォータニオンを用いる際のビット幅ＤＢ_Ｗ２とすると、ビット幅ＤＢ_Ｗ２を以下の（８）式により決定し、
ＤＢ_Ｗ２＝ＤＢ_Ｗ０＋４（Ｎ_Ｐ＋１） …（８）
記憶部に接続されるデータバスのビット幅ＤＢ_Ｗ３をビット幅ＤＢ_Ｗ２以上に設定する。このように、ビット幅ＤＢ_Ｗ０に４（Ｎ_Ｐ＋１）ビット分を追加することで、モデルデータに符号及びクォータニオンも含めることができるので、データバスのビット幅ＤＢ_Ｗ３をビット幅ＤＢ_Ｗ２以上に設定すれば、ＦＰＧＡは、衝突判定処理をより高速に実行できる。 According to the FPGA for arithmetic processing of robot control according to claim 2, if the number of bits of data representing the position of the model is N _P and the bit width DB _W when using a sign bit and a quaternion is 2, then the bit width DB _W 2 is determined by the following formula (8),
DB _W 2 = DB _W 0 + 4 ( _NP + 1) ... (8)
The bit width DB _W 3 of the data bus connected to the storage section is set to the bit width DB _W 2 or more. In this way, by adding 4 (N _P +1) bits to the bit width DB _W 0, the code and quaternion can also be included in the model data, so the bit width DB _W 3 of the data bus can be reduced to the bit width DB _W If set to 2 or more, the FPGA can execute collision determination processing faster.

請求項３記載のロボット制御の演算処理用ＦＰＧＡによれば、双方のモデルを何れも直方体モデルとし、衝突判定は１５方向の分離軸を用いて行う。直方体モデルは、衝突判定において一般的に使用されるモデルであり、その場合、衝突判定は１５方向の分離軸を用いて行うことになる。この１５方向の分離軸を用いた衝突判定に係る演算の少なくとも一部をパイプラインにて並列処理することで、直方体モデル同士の衝突判定を効率的に行うことができる。 According to the robot control arithmetic processing FPGA according to the third aspect, both models are rectangular parallelepiped models, and collision determination is performed using separation axes in 15 directions. The rectangular parallelepiped model is a model commonly used in collision determination, and in that case, collision determination is performed using separation axes in 15 directions. By processing at least part of the calculations related to collision determination using the separation axes in 15 directions in parallel in a pipeline, collision determination between rectangular parallelepiped models can be efficiently performed.

請求項４記載のロボット制御の演算処理用ＦＰＧＡによれば、パイプラインを具体的には、記憶部よりモデルデータを読み出すロードステージ，双方の直方体モデルを同一の座標系に変換する座標変換ステージ，双方の直方体モデルの特定の軸方向又はそれらの外積により分離軸を求める方向演算ステージ，双方の直方体モデルを分離軸に投影して、各モデルの像が重なるか否かを判定する分離軸判定ステージ，分離軸判定ステージにおける全ての分離軸についての判定結果に基づき衝突判定を行う衝突判定ステージとで構成する。このように構成すれば、モデルデータの読み出しから衝突判定までの各処理を、パイプラインの各ステージによって効率的に行うことができる。 According to the FPGA for arithmetic processing of robot control according to claim 4, the pipeline specifically includes a load stage for reading model data from a storage section, a coordinate conversion stage for converting both rectangular parallelepiped models into the same coordinate system, A direction calculation stage that determines the separation axis from a specific axial direction of both rectangular parallelepiped models or their cross product, and a separation axis determination stage that projects both rectangular parallelepiped models onto the separation axis and determines whether the images of each model overlap. , and a collision determination stage that performs collision determination based on the determination results for all separated axes in the separation axis determination stage. With this configuration, each process from reading model data to collision determination can be efficiently performed by each stage of the pipeline.

第１実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the first embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. メモリロードステージに係る処理手順を示すフローチャートFlowchart showing the processing procedure related to the memory load stage ２つの物体の形状をそれぞれ直方体モデルとして扱う場合に行う分離軸判定を説明する図A diagram explaining the separation axis determination performed when the shapes of two objects are treated as rectangular parallelepiped models. ハードウェア構成の一例を示す図（その１）Diagram showing an example of hardware configuration (Part 1) ハードウェア構成の一例を示す図（その２）Diagram showing an example of hardware configuration (Part 2) ハードウェア構成の一例を示す図（その３）Diagram showing an example of hardware configuration (Part 3) ＣＰＵとＦＰＧＡとの間で行われる処理・演算タイミングの概要を示す図Diagram showing an overview of processing/calculation timing performed between the CPU and FPGA 第２実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is a second embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第３実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is a third embodiment, and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第４実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the fourth embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第５実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the fifth embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第６実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the sixth embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第７実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the seventh embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第８実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the eighth embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA. 第９実施形態であり、ＦＰＧＡの内部ロジックで構成されるパイプライン構造を示す図This is the ninth embodiment and is a diagram showing a pipeline structure composed of internal logic of an FPGA.

（第１実施形態）
以下、第１実施形態について図１から図５を参照して説明する。本実施形態は、ＦＰＧＡの内部ロジックをどのように定義してコンフィギュレーションするかという点と、ＦＰＧＡがモデルデータを読み出すためのメモリバス幅をどのように設定するか、という点に特徴がある。したがって、外部的なハードウェア構成は、一般的なＣＰＵ１とＦＰＧＡ２との組合せである。 (First embodiment)
The first embodiment will be described below with reference to FIGS. 1 to 5. This embodiment is characterized by how the internal logic of the FPGA is defined and configured, and how the memory bus width for the FPGA to read model data is set. Therefore, the external hardware configuration is a general combination of CPU1 and FPGA2.

例えば図４Ａに示すように、ＣＰＵ１は、自身の制御プログラムがＲＯＭやハードディスク等から転送されると共に、ワークエリアとして使用されるＲＡＭ３にアクセスし、ＦＰＧＡ２は、衝突判定用のモデルデータがハードディスク等から転送されるＲＡＭ４にアクセスする構成である。図４Ｂは、ＦＰＧＡ２の内部にＲＡＭ４が組み込まれている構成である。図４Ｃは、ＣＰＵ１，ＦＰＧＡ２及びＲＡＭ４がＳｏＣ（System on Chip）５として構成されており、ＲＡＭ３がＳｏＣ５に外付けされた構成である。尚、ＳｏＣ５内のＲＡＭ４の容量が十分に確保できる場合には外付けのＲＡＭ３を除いて、ＣＰＵ１もＲＡＭ４にアクセスする構成を採用しても良い。 For example, as shown in FIG. 4A, the CPU 1 has its own control program transferred from the ROM, hard disk, etc., and accesses the RAM 3 used as a work area, and the FPGA 2 receives model data for collision determination from the hard disk, etc. The configuration is such that the RAM 4 to be transferred is accessed. FIG. 4B shows a configuration in which the RAM 4 is incorporated inside the FPGA 2. FIG. 4C shows a configuration in which the CPU 1, FPGA 2, and RAM 4 are configured as an SoC (System on Chip) 5, and the RAM 3 is externally attached to the SoC 5. Note that if the capacity of the RAM 4 in the SoC 5 can be sufficiently secured, a configuration may be adopted in which the CPU 1 also accesses the RAM 4, excluding the external RAM 3.

図５は、本実施形態におけるＣＰＵ１とＦＰＧＡ２との間で行われる処理シーケンスを示している。ＣＰＵ１は、電源が投入されて起動すると、ＦＰＧＡ２に対してコンフィギュレーションを指示する。コンフィギュレーションの指示は、ＦＰＧＡ２に対するリセットの解除でも良い。するとＦＰＧＡ２はコンフィギュレーションプログラムを例えばＰＲＯＭやフラッシュＲＯＭ等から読み出してコンフィギュレーションを実行し、完了するとＣＰＵ１に対してコンフィギュレーション完了を通知する。それ以降、ＣＰＵ２はＦＰＧＡ２に対して衝突判定処理の開始の指示を入力すると、ＦＰＧＡ２は衝突判定処理演算を行う。衝突判定処理演算が完了すると、その完了及び判定結果をＣＰＵ１に通知することを繰り返す。ＣＰＵ１は、その間にロボットの制御に関する他の演算処理を実行する。 FIG. 5 shows a processing sequence performed between the CPU 1 and the FPGA 2 in this embodiment. When the CPU 1 is powered on and started, it instructs the FPGA 2 to configure. The configuration instruction may also be a release of reset for the FPGA 2. Then, the FPGA 2 reads the configuration program from, for example, a PROM or a flash ROM, executes the configuration, and upon completion, notifies the CPU 1 of the completion of the configuration. After that, when the CPU 2 inputs an instruction to start the collision determination process to the FPGA 2, the FPGA 2 performs a collision determination process calculation. When the collision determination processing calculation is completed, the CPU 1 is repeatedly notified of the completion and the determination result. During this time, the CPU 1 executes other arithmetic processing related to robot control.

一例として、ＣＰＵ１の動作クロック周波数は１Ｇ～２ＧＨｚ程度，ＦＰＧＡ２の動作クロック周波数は５００ＭＨｚ程度で、衝突判定処理の実行間隔は、例えば１ｍｓ程度である。 As an example, the operating clock frequency of the CPU 1 is about 1 GHz to 2 GHz, the operating clock frequency of the FPGA 2 is about 500 MHz, and the execution interval of the collision determination process is, for example, about 1 ms.

図３は、ロボットの可動部であるアームの形状と、例えば工場内における各設備等を障害物とした際に、その障害物の形状とをそれぞれ直方体モデルとして扱う場合に行う分離軸判定を説明するものである。２つの物体を物体Ａ，Ｂとする。物体Ａの各方向ベクトルを、図中に示すＸＡ，ＹＡ，ＺＡとし、物体Ｂの各方向ベクトルをＸＢ，ＹＢ，ＺＢとする。そして、判定軸Ｖを方向ベクトルＸＡとする。また、
ＬＡ＝［ＬＡＸＬＡＹＬＡＺ］，ＰＡ＝［ＰＡＸＰＡＹＰＡＺ］
ＬＢ＝［ＬＢＸＬＢＹＬＢＺ］，ＰＢ＝［ＰＢＸＰＢＹＰＢＺ］
とする。 Figure 3 explains the separation axis determination performed when the shape of the arm, which is the movable part of the robot, and the shape of the obstacle, for example, each piece of equipment in a factory, are treated as a rectangular parallelepiped model. It is something to do. Let two objects be objects A and B. Let the directional vectors of object A be XA, YA, and ZA shown in the figure, and let the directional vectors of object B be XB, YB, and ZB. Then, the determination axis V is set as the direction vector XA. Also,
LA=[LAX LAY LAZ], PA=[PAX PAY PAZ]
LB=[LBX LBY LBZ], PB=[PBX PBY PBZ]
shall be.

以下の判定式
（｜ＬＡ・Ｖ｜－｜ＬＢ・Ｖ｜）／２＜｜（ＰＡ－ＰＢ）・Ｖ｜ …（１）
が成立すれば、判定軸Ｖは分離軸であり、物体Ａと物体Ｂとは衝突しない。一方、上記の条件が成立しなければ判定軸Ｖは分離軸ではなく、物体Ａと物体Ｂとは衝突することになる。Ｖ＝ＸＡの場合、判定式（１）は以下のようになる。
（ＬＡＸ＋ＬＸＢ’）／２＜｜ＰＡＸ－ＰＢＸ｜ …（２） The following judgment formula (|LA・V|-|LB・V|)/2<|(PA-PB)・V|...(1)
If this holds true, the determination axis V is a separation axis, and objects A and B do not collide. On the other hand, if the above conditions are not met, the determination axis V is not the separation axis, and objects A and B will collide. In the case of V=XA, the determination formula (1) becomes as follows.
(LAX+LXB')/2<|PAX-PBX|...(2)

そして、直方体モデルの分離軸判定は、以下の１５軸を判定軸として行う。
・物体Ａの各方向ベクトル：ＸＡ，ＹＡ，ＺＡ
・物体Ｂの各方向ベクトル：ＸＢ，ＹＢ，ＺＢ
・物体Ａ，Ｂの各方向ベクトルの外積：ＸＡ×ＸＢ，ＸＡ×ＹＢ，ＸＡ×ＺＢ
ＹＡ×ＸＢ，ＹＡ×ＹＢ，ＹＡ×ＺＢ，ＺＡ×ＸＢ，ＺＡ×ＹＢ，ＺＡ×ＺＢ
そして、これら１５軸の判定軸に分離軸が１つも存在しなければ、物体Ａと物体Ｂとは衝突していることになる。 Separation axis determination for the rectangular parallelepiped model is performed using the following 15 axes as determination axes.
・Each direction vector of object A: XA, YA, ZA
・Each direction vector of object B: XB, YB, ZB
- Cross product of each direction vector of objects A and B: XA×XB, XA×YB, XA×ZB
YA×XB, YA×YB, YA×ZB, ZA×XB, ZA×YB, ZA×ZB
If there is no separation axis among these 15 judgment axes, it means that object A and object B have collided with each other.

図１に示すように、本実施形態のＦＰＧＡ２は、５ステージで５段のパイプラインを構成するようにロジックがコンフィギュレーションされる。パイプラインの各ステージは、以下のようになる
Ｌ：メモリロードステージ
Ｔ：座標変換ステージ
Ｄ：方向演算ステージ
Ｓ１５：分離軸判定ステージ
Ｒ：衝突判定ステージ As shown in FIG. 1, the logic of the FPGA 2 of this embodiment is configured to form a five-stage pipeline. Each stage of the pipeline is as follows: L: Memory load stage T: Coordinate transformation stage D: Direction calculation stage S15: Separation axis determination stage R: Collision determination stage

＜メモリロードステージ＞
ＲＡＭ４から各モデルデータの形状を、複数に分割したオブジェクト単位で読み込む。
＜座標変換ステージ＞
双方の直方体モデルを同一の座標系に変換する。
＜方向演算ステージ＞
双方の直方体モデルの特定の軸方向又はそれらの外積により分離軸を求める。
＜分離軸判定ステージ＞
双方の直方体モデルを分離軸に投影して、各モデルの像が重なるか否かを判定する。ここで、１５方向の分離軸について並列処理を行う。
＜衝突判定ステージ＞
分離軸判定ステージにおける全ての分離軸についての判定結果に基づき、衝突判定を行う。 <Memory load stage>
The shape of each model data is read from the RAM 4 in units of objects divided into a plurality of parts.
<Coordinate transformation stage>
Convert both rectangular parallelepiped models to the same coordinate system.
<Direction calculation stage>
The separation axis is determined by specific axial directions of both rectangular parallelepiped models or their cross product.
<Separation axis judgment stage>
Both rectangular parallelepiped models are projected onto the separation axis, and it is determined whether the images of each model overlap. Here, parallel processing is performed on separation axes in 15 directions.
<Collision judgment stage>
Collision determination is performed based on the determination results for all the separated axes in the separated axis determination stage.

図２は、メモリロードステージにおいて、各モデルのデータをオブジェクト毎に読み込む場合の挙動を示すフローチャートである。先ず、オブジェクト情報が格納されたメモリＭ，つまりＲＡＭ４から、オブジェクト番号［１］を判定用のレジスタＭＡにロードする（Ｓ１）。ａ，ｂは、それぞれ物体Ａ，Ｂの判定対象オブジェクト番号が格納されるポインタであり、ポインタａ，ｂにそれぞれ「２」が格納される（Ｓ２，Ｓ３）。 FIG. 2 is a flowchart showing the behavior when data of each model is read for each object in the memory load stage. First, the object number [1] is loaded from the memory M in which object information is stored, that is, the RAM 4, into the determination register MA (S1). A and b are pointers in which the determination target object numbers of objects A and B are stored, respectively, and "2" is stored in pointers a and b, respectively (S2, S3).

次に、ポインタｂがオブジェクト数Ｎ以下か否かを判断し（Ｓ４）、Ｎ以下であれば（ＹＥＳ）判定用のレジスタＭＢにＭ［ｂ］の内容をロードする（Ｓ５）。そして、ポインタｂをインクリメントしてから（Ｓ６）ステップＳ４に戻る。ステップＳ４において、ポインタｂがオブジェクト数Ｎを超えると（ＮＯ）、ポインタｂがオブジェクト数Ｎ未満か否かを判断する（Ｓ７）。Ｎ未満であれば（ＹＥＳ）、Ａ側と同様に、レジスタＭＡにＭ［ａ］の内容をロードして（Ｓ８）ポインタａをインクリメントする（Ｓ９）。それから、ポインタａの内容をポインタｂに格納すると、ステップＳ４に戻る。ステップＳ７において、ポインタａがオブジェクト数Ｎに達すると（ＮＯ）終了となる。 Next, it is determined whether the pointer b is less than or equal to the number of objects N (S4), and if it is less than or equal to N (YES), the contents of M[b] are loaded into the determination register MB (S5). Then, after incrementing the pointer b (S6), the process returns to step S4. In step S4, if the pointer b exceeds the number N of objects (NO), it is determined whether the pointer b is less than the number N of objects (S7). If it is less than N (YES), similarly to the A side, the contents of M[a] are loaded into the register MA (S8) and the pointer a is incremented (S9). Then, after storing the contents of pointer a into pointer b, the process returns to step S4. In step S7, when the pointer a reaches the number of objects N (NO), the process ends.

上記の処理では、物体Ｂの衝突判定モデルを先に更新し、それが最後まで行ったら、Ｂを最初に戻して他方Ａを次の衝突判定モデルに更新しているが、Ａ，Ｂの関係は逆でも良い。 In the above process, the collision detection model for object B is updated first, and when it reaches the end, B is returned to the beginning and the other A is updated to the next collision detection model, but the relationship between A and B is The opposite may also be true.

例えば、Ａを最初に戻してからＢが更新される前の判定は、衝突なしとして無視するか、演算を実施しなくても良い。ただし、同じ組み合わせの衝突判定演算を２回行うだけなので、そのまま演算しても良い。その場合、各周期の最初の衝突判定時に、前回の位置関係でのオブジェクトが残ってしまう可能性がある。しかし、この過検出による影響は無視しても問題ない程度なので、そのまま演算しても良い。また、最初の衝突判定モデルを別のレジスタに退避しておくことで、Ｂを次の衝突判定モデルに更新している間に、レジスタからＡを最初に戻すこともできる。 For example, a determination made after A is returned to the beginning and before B is updated may be ignored as no collision, or no calculation may be performed. However, since the collision determination calculation for the same combination is only performed twice, the calculation may be performed as is. In that case, there is a possibility that objects in the previous positional relationship may remain at the time of the first collision determination in each cycle. However, since the influence of this over-detection can be ignored without any problem, the calculation may be performed as is. Furthermore, by saving the first collision determination model in another register, A can be returned to the beginning from the register while B is being updated to the next collision determination model.

図１に示すメモリロードステージの「Ｌ１，２」，「Ｌ１，３」，…，「Ｌ１，Ｎ」等は、図２のステップＳ１，Ｓ５，Ｓ８の処理に対応しており、最後の「Ｌ（Ｎ－１），Ｎ」を実行した後にステップＳ７で「ＮＯ」と判断することになる。このように、ＦＰＧＡ２が衝突判定を５ステージ・５段のパイプラインにより処理することで、演算を効率的に実行できる。 "L1, 2", "L1, 3", ..., "L1, N", etc. of the memory load stage shown in FIG. 1 correspond to the processing of steps S1, S5, S8 in FIG. After executing "L(N-1), N", a "NO" determination is made in step S7. In this way, the FPGA 2 processes collision determination using a five-stage, five-stage pipeline, allowing efficient execution of calculations.

ここで、図１に示すように、パイプライン処理を円滑に実行するためには、メモリロードステージにおいて、必要なサイズ，つまりビット数のデータを１回で読み込む必要がある。具体的に、ＦＰＧＡ２及びＲＡＭ４間でデータを１回で転送するために、どれ位のデータバス幅が必要になるかを検討する。 Here, as shown in FIG. 1, in order to smoothly execute pipeline processing, it is necessary to read data of a required size, that is, the number of bits, at one time in the memory load stage. Specifically, we will consider how much data bus width is required to transfer data between the FPGA 2 and the RAM 4 at one time.

ロボットの全リンクのうち最大のリンク長をＬ［ｍｍ］，許容判定誤差をＥ［ｍｍ］，モデルの分割数をＤとする。この分割数Ｄは、上述したオブジェクト数Ｎに等しい。各モデルの位置を表すには、少なくとも Assume that the maximum link length among all the links of the robot is L [mm], the allowable judgment error is E [mm], and the number of model divisions is D. This number of divisions D is equal to the number of objects N described above. To represent the position of each model, at least

ビットは必要となる。但し、右辺の関数ｆ_ＲＵ（ｘ）は、ｘの小数点以下を整数に切り上げる関数である。また、各モデルの大きさは、分割により細分化されることを考慮すると、少なくとも bits are required. However, the function f _RU (x) on the right side is a function that rounds up the decimal part of x to an integer. Also, considering that the size of each model is subdivided by division, the size of each model is at least

ビットは必要となる。以上を合計すると、少なくとも bits are required. In total, at least

ビットのメモリ幅が必要となる。また、切り上げ関数をまとめることで近似して Requires memory width of bits. Also, by combining round-up functions, we can approximate

ビットのメモリ幅以上のメモリ幅が必要と評価しても問題ない。 There is no problem in evaluating that a memory width greater than the memory width of bits is required.

例えばリンク長が５００［ｍｍ］で許容判定誤差を１［ｍｍ］，分割数を１６とすると、Ｎ_Ｐ＝９，Ｎ_Ｓ＝８となり、ＤＢ_Ｗ０＝３×９＋３×８＝５１となるから、ビット幅ＤＢ_Ｗ１として５１ビット以上のメモリが必要ということになる。例えばＤＢ_Ｗ１を５２ビット等にする。つまり、一般的に使用されることが多い３２ビットのバス幅では、一度のメモリロードで衝突判定モデルのロード完了しないことを意味する。 For example, if the link length is 500 [mm], the allowable judgment error is 1 [mm], and the number of divisions is 16, then N _P = 9, N _S = 8, and DB _W 0 = 3 x 9 + 3 x 8 = 51. , a memory with a bit width of DB _W 1 of 51 bits or more is required. For example, DB _W 1 is set to 52 bits. In other words, with the commonly used 32-bit bus width, loading the collision determination model cannot be completed with one memory load.

ＦＰＧＡ２に衝突判定モデルが読み込まれた後、それを使用した衝突判定結果が出力されるまでの間に、（５）又は（６）式で求められるビット数以上のメモリをロードすれば良い。また、パイプラインが、オーバーラップして処理可能な複数の演算ブロックにより構成されている場合には、各ブロックのうち最長の演算時間の間に、（５）又は（６）式で求められるビット数以上のメモリをロードすれば良い。 After the collision determination model is loaded into the FPGA 2 and until the collision determination result using the model is output, it is sufficient to load memory with a number of bits greater than or equal to that determined by equation (5) or (6). In addition, if the pipeline is composed of multiple calculation blocks that can be processed in an overlapping manner, the bits determined by equation (5) or (6) during the longest calculation time among each block. All you have to do is load more memory than that number.

尚、上記の式ではモデル位置に符号ビットを含めていないが、各モデルの座標系の原点を適切に設定することで、例えば、各軸のそれぞれ最も小さい箇所を原点とすれば、位置を正方向のみで表現できるためである。勿論、全体で３ビット増えるが、位置にそれぞれ符号ビットを付加しても良い。 Although the above formula does not include the sign bit in the model position, by appropriately setting the origin of the coordinate system of each model, for example, if the smallest point of each axis is set as the origin, the position can be corrected. This is because it can be expressed only by direction. Of course, the total number increases by 3 bits, but a sign bit may be added to each position.

また、衝突検出モデルの方向成分は、それが属する座標系，例えば、アームの１軸目リンクを構成する衝突検出モデルであれば、１軸目リンクの座標系と同一としても問題ないため、上記の式には方向成分を含めていない。
方向成分を含めることを考えると、例えばクォータニオンをｑ＝｛ｗ,（ｘ,ｙ,ｚ）｝で表記し、単位クォータニオンであることを利用すれば、例えば
ｗ＝√｛１－（ｘ^２＋ｙ^２＋ｚ^２）｝
としてｗを求めることで、１要素はメモリからロードせずとも計算で求めることができる。したがって、３要素で表現できる。 In addition, the direction component of the collision detection model can be the same as the coordinate system to which it belongs, for example, the coordinate system of the first axis link if it is a collision detection model that constitutes the first axis link of the arm. The expression does not include the directional component.
Considering the inclusion of the directional component, for example, if we write the quaternion as q={w,(x,y,z)} and use the fact that it is a unit quaternion, we can write, for example, w=√{1-(x ² + y ² +z ² )}
By finding w as , one element can be found by calculation without loading it from memory. Therefore, it can be expressed with three elements.

モデル位置と同等のビット数に符号ビットを加えると、１要素当たり最低でも
Ｎ_Ｒ＝Ｎ_Ｐ＋１ビットは必要となる。勿論、クォータニオンの４要素を全てモデルに持たせても良いし、方向成分を９要素からなる３×３の回転行列で持たせても良い。要素数を増やせば、座標変換や方向演算に必要な演算処理を低減できる。特に、３要素だけでは残りの１要素を計算で導出する必要があり、その際には平方根の計算などの回路規模が比較的の大きくなる演算が必要となるから、メモリ帯域を広げた方が全体の回路規模が小さくなる場合もある。そのため、最も適しているのはクォータニオンの４要素全てを持たせる構成となる。 Adding the sign bit to the number of bits equivalent to the model position, at least N _R =N _P +1 bits are required per element. Of course, the model may have all four elements of the quaternion, or the direction component may be provided as a 3×3 rotation matrix consisting of nine elements. By increasing the number of elements, the calculation processing required for coordinate transformation and direction calculation can be reduced. In particular, if there are only three elements, it is necessary to derive the remaining one element by calculation, which requires calculations such as square root calculations that require a relatively large circuit size, so it is better to widen the memory bandwidth. The overall circuit scale may also be reduced. Therefore, the most suitable configuration is one that has all four quaternion elements.

ただし、演算時間については、残りの１要素を演算する回転要素補間演算部を新たにパイプラインのステージ，つまり演算ブロックとして追加することで、衝突検出処理全体としては演算時間の増加の影響を抑えることができる。この場合、各モデル間の判定ではなく、全判定で１回の演算分しか増加しない。 However, regarding the calculation time, by adding a rotating element interpolation calculation unit that calculates the remaining one element as a new pipeline stage, that is, a calculation block, the impact of the increase in calculation time on the collision detection process as a whole can be suppressed. be able to. In this case, the amount of calculation increases by only one time for all the determinations, not for the determination between each model.

先に提示した具体例の場合、Ｎ_Ｐ＝９であるからＮ_Ｒ＝１０となり、回転成分を４要素とすれば、ＤＢ_Ｗ２＝５１＋４×１０＝９１ビット以上のメモリ幅が必要となる。つまり、ビット幅ＤＢ_Ｗ２は、
ＤＢ_Ｗ２＝ＤＢ_Ｗ０＋４（Ｎ_Ｐ＋１） …（８）
で決定される。この場合、６４ビットのメモリ幅，又はこれにＥＣＣ (Error Check and Correct：エラー訂正機能)機能の付与を想定した７２ビットのメモリ幅でも、一回のメモリロードで衝突判定モデルのロードは完了しない。本実施形態では、９１ビット以上のメモリ幅に対応して、例えば９６ビットのデータバス構成を採用している。この９６ビットはビット幅ＤＢ_Ｗ３の一例である。 In the case of the specific example presented earlier, since N _P =9, N _R =10, and if the rotation component is 4 elements, a memory width of DB _W 2 = 51 + 4 x 10 = 91 bits or more is required. In other words, the bit width DB _W 2 is
DB _W 2 = DB _W 0 + 4 ( _NP + 1) ... (8)
determined by In this case, even with a 64-bit memory width or a 72-bit memory width with an ECC (Error Check and Correct) function added, loading the collision detection model will not be completed in one memory load. . In this embodiment, a 96-bit data bus configuration, for example, is adopted in response to a memory width of 91 bits or more. This 96 bits is an example of the bit width DB _W 3.

尚、ここまでの検討は、衝突判定演算を固定小数点で実行することを前提としている。固定小数点とする利点は、必要精度を最小限の回路規模で実現できることである。これに対して、ＣＰＵで演算していた内容と同じ結果となることが要求される場合、浮動小数点で演算しても良い。この場合、方向成分を含めない場合は３２×６＝１９２ビット、方向成分を含めた場合，４要素とすれば、３２×１０＝３２０ビットが必要となり、要求されるメモリ幅は更に大きくなる。 Note that the discussion up to this point is based on the assumption that the collision determination calculation is performed using a fixed point. The advantage of using fixed-point numbers is that the required accuracy can be achieved with a minimum circuit scale. On the other hand, if it is required to obtain the same result as the content calculated by the CPU, the calculation may be performed using floating point numbers. In this case, if the direction component is not included, 32×6=192 bits are required, and if the direction component is included, 32×10=320 bits are required for four elements, and the required memory width becomes even larger.

以上のように本実施形態によれば、ＦＰＧＡ２は、ＣＰＵ１より、ロボットのアームと障害物との衝突判定処理要求が入力されると、アームの少なくとも一部の形状等と障害物の少なくとも一部の形状等とをそれぞれ表したモデルのデータが記憶されているＲＡＭ４より各モデルのデータを読み出して双方の衝突判定を行い、その判定結果をＣＰＵ１に出力する。そして、ＦＰＧＡ２は、衝突判定に係る演算の少なくとも一部を並列処理するパイプラインを構成するようにロジックがコンフィギュレーションされる。 As described above, according to the present embodiment, when the CPU 1 inputs a request for processing for determining a collision between the arm of the robot and an obstacle, the FPGA 2 determines the shape of at least a portion of the arm and the shape of at least a portion of the obstacle. The data of each model is read out from the RAM 4 in which data representing the shape, etc. of each model is stored, a collision determination is made between the two, and the determination result is output to the CPU 1. The logic of the FPGA 2 is configured to form a pipeline that processes at least a portion of calculations related to collision determination in parallel.

これにより、ＣＰＵ１がＦＰＧＡ２に対して衝突判定処理要求を入力すると、ＦＰＧＡ２が双方のモデルのデータをＲＡＭ４より読み出して衝突判定を行い、その判定結果をＣＰＵ１に出力する、というシステムを構成できる。したがって、比較的安価なＣＰＵ１とＦＰＧＡ２との組み合わせにより、ＣＰＵ１はＦＰＧＡ２に衝突判定を行わせながらロボットのその他の制御を処理することが可能になる。また、衝突判定の演算は処理負荷が高いが、その演算の少なくとも一部を並列処理するパイプラインを構成するようにＦＰＧＡ２をコンフィギュレーションすることで、演算を効率的に行うことが可能になり、ＣＰＵ１が要求する時間内に衝突判定を実行させることができる。 Thereby, when the CPU 1 inputs a collision determination processing request to the FPGA 2, a system can be constructed in which the FPGA 2 reads data of both models from the RAM 4, performs collision determination, and outputs the determination result to the CPU 1. Therefore, by combining the relatively inexpensive CPU 1 and FPGA 2, the CPU 1 can process other controls of the robot while having the FPGA 2 perform collision determination. In addition, collision determination calculations have a high processing load, but by configuring the FPGA 2 to form a pipeline that processes at least part of the calculations in parallel, it is possible to perform the calculations efficiently. Collision determination can be executed within the time required by the CPU 1.

この場合、双方のモデルを何れも直方体モデルとし、衝突判定に１５方向の分離軸を用いて行う。直方体モデルは、衝突判定において一般的に使用されるモデルであるから、１５方向の分離軸を用いた衝突判定に係る演算の少なくとも一部をパイプラインにて並列処理することで、直方体モデル同士の衝突判定を効率的に行うことができる。 In this case, both models are rectangular parallelepiped models, and collision determination is performed using separation axes in 15 directions. The rectangular parallelepiped model is a model commonly used in collision detection, so by processing at least a part of the calculations related to collision determination using separation axes in 15 directions in parallel in the pipeline, it is possible to improve the relationship between the rectangular parallelepiped models. Collision determination can be performed efficiently.

また、ＦＰＧＡ２のパイプラインを具体的には、ＲＡＭ４よりモデルデータを読み出すロードステージ，双方の直方体モデルを同一の座標系に変換する座標変換ステージ，双方の直方体モデルの特定の軸方向又はそれらの外積により分離軸を求める方向演算ステージ，双方の直方体モデルを分離軸に投影して、各モデルの像が重なるか否かを判定する分離軸判定ステージ，分離軸判定ステージにおける全ての分離軸についての判定結果に基づき衝突判定を行う衝突判定ステージとで構成する。これにより、モデルデータの読み出しから衝突判定までの各処理を、パイプラインの各ステージによって効率的に行うことができる。 Specifically, the pipeline of FPGA2 includes a load stage that reads model data from RAM4, a coordinate conversion stage that transforms both rectangular parallelepiped models into the same coordinate system, a specific axis direction of both rectangular parallelepiped models, or their outer product. A direction calculation stage that calculates the separation axis by , a separation axis judgment stage that projects both rectangular parallelepiped models onto the separation axis and determines whether the images of each model overlap, and a judgment about all separation axes in the separation axis judgment stage. It consists of a collision determination stage that performs collision determination based on the results. This allows each stage of the pipeline to efficiently perform each process from reading model data to collision determination.

そして、最小限必要となるデータバスのビット幅ＤＢ_Ｗ０とすると、そのビット幅ＤＢ_Ｗ０を（６）式により決定し、ＲＡＭ４に接続されるデータバスのビット幅ＤＢ_Ｗ１をビット幅ＤＢ_Ｗ０以上に設定する。このビット幅ＤＢ_Ｗ０は、各モデルの位置を表すのに必要なデータのビット数や、各モデルの大きさを、複数のオブジェクトに分割することも考慮して必要になるデータのビット数も考慮して決定されている。したがって、データバスのビット幅ＤＢ_Ｗ１をビット幅ＤＢ_Ｗ０以上に設定することで、ＦＰＧＡ２は、モデルデータを１回のアクセスによってＲＡＭ４より確実に読み出すことができる。 Then, assuming that the minimum required bit width of the data bus is DB _W 0, the bit width DB _W 0 is determined by equation (6), and the bit width DB _W 1 of the data bus connected to the RAM 4 is determined as the bit width DB. _W Set to 0 or higher. This bit width DB _W 0 also includes the number of data bits required to represent the position of each model, and the number of data bits required to take into account the size of each model to be divided into multiple objects. It has been decided with consideration. Therefore, by setting the bit width DB _W 1 of the data bus to be greater than or equal to the bit width DB _W 0, the FPGA 2 can reliably read model data from the RAM 4 with one access.

更に、ビット幅ＤＢ_Ｗ２を（８）式により決定し、ＲＡＭ４に接続されるデータバスのビット幅ＤＢ_Ｗ３をビット幅ＤＢ_Ｗ２以上に設定することで、モデルデータに符号及びクォータニオンも含めることができるので、ＦＰＧＡ２は、衝突判定処理をより高速に実行できる。 Furthermore, by determining the bit width DB _W 2 using equation (8) and setting the bit width DB _W 3 of the data bus connected to the RAM 4 to be greater than or equal to the bit width DB _W 2, the code and quaternion can also be included in the model data. Therefore, the FPGA 2 can execute collision determination processing at higher speed.

（第２実施形態）
以下、第１実施形態と同一部分には同一符号を付して説明を省略し、異なる部分について説明する。図６に示すように、第２実施形態では、ＦＰＧＡ２のパイプラインを、６ステージ・３段構成とするようにロジックをコンフィギュレーションする。第１実施形態と異なるステージは、以下になる。
ＬＡ：第１メモリロードステージ
ＬＢ：第２メモリロードステージ
Ｄ／Ｓ９Ａ：方向演算ステージ（分離軸判定の一部を並列処理）
Ｓ９Ｂ：分離軸判定ステージ (Second embodiment)
Hereinafter, parts that are the same as those in the first embodiment are given the same reference numerals and explanations will be omitted, and different parts will be explained. As shown in FIG. 6, in the second embodiment, the logic is configured so that the pipeline of the FPGA 2 has a six-stage, three-stage configuration. The stages different from the first embodiment are as follows.
LA: 1st memory load stage LB: 2nd memory load stage D/S9A: Direction calculation stage (parallel processing of part of separation axis determination)
S9B: Separation axis judgment stage

＜第１メモリロードステージ＞
ＲＡＭ４から物体Ａのモデルデータの形状等を、複数に分割したオブジェクト単位で読み込む。
＜第２メモリロードステージ＞
ＲＡＭ４から物体Ｂのモデルデータの形状等を、複数に分割したオブジェクト単位で読み込む。すなわち、第２実施形態では、パイプライン１段の処理で物体Ａ，Ｂのモデルデータを順次読み込む。
＜方向演算ステージ＞
分離軸を求めると共に、・物体Ａの各方向ベクトルＸＡ，ＹＡ，ＺＡ及び物体Ｂの各方向ベクトルＸＢ，ＹＢ，ＺＢの分離軸判定を６並列処理する。これが第１処理に相当する。
＜分離軸判定ステージ＞
残り９方向の外積演算を含む分離軸判定について並列処理を行う。これが第２処理に相当する。 <1st memory load stage>
The shape and the like of the model data of object A are read from the RAM 4 in units of objects divided into a plurality of parts.
<Second memory load stage>
The shape and the like of the model data of object B are read from the RAM 4 in units of objects divided into a plurality of parts. That is, in the second embodiment, model data of objects A and B are sequentially read in one stage of pipeline processing.
<Direction calculation stage>
In addition to finding the separation axes, the separation axis determination of each directional vector XA, YA, ZA of object A and each directional vector XB, YB, ZB of object B is processed six times in parallel. This corresponds to the first process.
<Separation axis judgment stage>
Parallel processing is performed on separation axis determination including cross product calculations in the remaining nine directions. This corresponds to the second process.

尚、方向演算ステージで並列処理する６方向の分離軸判定を「Ｓ９Ａ」としているのは、分離軸判定ステージで使用する「Ｓ９Ｂ」と同じ演算ブロックを使用していることによる。したがって、これらは並列に実行できない。 The reason why the 6-direction separation axis determination that is processed in parallel in the direction calculation stage is set to "S9A" is that the same calculation block as "S9B" used in the separation axis determination stage is used. Therefore, they cannot be executed in parallel.

以上のように第２実施形態によれば、メモリロードステージを、物体Ａの直方体モデルのデータを読み出す第１ステージと、物体Ｂの直方体モデルのデータを読み出す第２ステージとで構成する。これにより、データバス幅の制約等により、１回のメモリアクセスで双方のモデルデータを同時に読み出すことができず、メモリアクセスを２回行わざるを得ない場合にも対応させることができる。 As described above, according to the second embodiment, the memory load stage is composed of a first stage that reads data of a rectangular parallelepiped model of object A, and a second stage that reads data of a rectangular parallelepiped model of object B. This makes it possible to cope with the case where both model data cannot be read out simultaneously in one memory access due to data bus width constraints, etc., and the memory access must be performed twice.

そして、１５方向の分離軸のうち、物体Ａ，Ｂの方向ベクトルのみからなる分離軸判定を並列処理する第１処理と、残りの分離軸判定を並列処理する第２処理とに分別する。すなわち、第１処理は外積演算が無い分離軸判定となり、第２処理は外積演算を含む分離軸判定となる。これにより、方向演算ステージにおいて第１処理を並列処理することが可能になり、分離軸判定ステージでは第２処理を並列処理すれば良くなる。 Then, among the 15 separation axes, the process is divided into a first process in which the separation axis judgment consisting of only the direction vectors of objects A and B is processed in parallel, and a second process in which the remaining separation axis judgments are processed in parallel. That is, the first process is a separation axis determination without a cross product calculation, and the second process is a separation axis determination including a cross product calculation. This makes it possible to process the first process in parallel in the direction calculation stage, and only needs to process the second process in parallel in the separation axis determination stage.

このように構成すれば、メモリロードステージを第１及び第２ステージに分けたことに伴い、分離軸判定の並列処理の一部を方向演算ステージで行うようにすると、ＦＰＧＡ２内の各演算ブロックの稼働率が向上する。したがって、ＦＰＧＡ２内の論理回路リソースの使用量と演算速度とのバランスが良好になる。更に、ＦＰＧＡ２を、分離軸判定の第１処理と第２処理とを、共通の演算ブロックで実行するようにコンフィギュレーションするので、論理回路リソースをより効率的に使用できる。 With this configuration, if the memory load stage is divided into the first and second stages, and part of the parallel processing for separation axis determination is performed in the direction calculation stage, each calculation block in the FPGA 2 will be Operation rate improves. Therefore, there is a good balance between the amount of logic circuit resources used in the FPGA 2 and the calculation speed. Furthermore, since the FPGA 2 is configured to execute the first and second processing of separation axis determination using a common calculation block, logic circuit resources can be used more efficiently.

その他、パイプライン１段の処理で物体Ａ，Ｂのモデルデータを順次読み込む構成としては、
・メモリを複数持ち、それらに少なくとも一部同じ衝突判定モデルデータを格納する。例えばメモリＡ，Ｂがある場合、最初に判定されるロボットや柵などはメモリＡでしか使用しないので、メモリＢに格納しなくても良く、最後に判定されるロボットや柵などはメモリＢでしか使用しないのでメモリＡに格納しなくても良い。つまり、メモリＡ，Ｂに格納されるデータを完全に同一にする必要はない。
・判定対象のロボット毎と、柵などの障害物とを、それぞれ違うメモリに格納する。
・デュアルポートメモリを使用して、同じタイミングで両方の衝突判定モデルを同一のメモリからロードする。
等がある。 Other configurations that sequentially read the model data of objects A and B in one stage of pipeline processing include:
- Have multiple memories and store at least some of the same collision judgment model data in them. For example, if there are memories A and B, the robots and fences that are judged first are used only in memory A, so they do not need to be stored in memory B, and the robots and fences that are judged last are stored in memory B. There is no need to store it in memory A since it is only used. In other words, it is not necessary that the data stored in memories A and B be completely the same.
- Store each robot to be judged and obstacles such as fences in separate memories.
・Use dual-port memory to load both collision detection models from the same memory at the same time.
etc.

（第３～第９実施形態）
第３～第９実施形態は、パイプライン構成のバリエーションを示す。ＦＰＧＡの内部ロジックのリソースによっては、１ステージで並列に処理できる処理数に制約があることも想定される。そのような場合にも対応できるように、以下にバリエーションを示す。 (Third to Ninth Embodiments)
The third to ninth embodiments show variations in pipeline configuration. Depending on the internal logic resources of the FPGA, it is assumed that there is a limit to the number of processes that can be processed in parallel in one stage. To accommodate such cases, variations are shown below.

図７に示す第３実施形態は、第１実施形態の分離軸判定ステージを、第１ステージ：Ｓ８Ａと第２ステージ：Ｓ８Ｂとに分けて６ステージ・３段構成とした場合である。例えば「Ｓ８Ａ」は８並列処理，「Ｓ８Ｂ」は７並列処理とするが、これらは同じ演算ブロックを使用する。 The third embodiment shown in FIG. 7 is a case in which the separation axis determination stage of the first embodiment is divided into a first stage: S8A and a second stage: S8B, and configured into six stages and three stages. For example, "S8A" is 8-parallel processing, and "S8B" is 7-parallel processing, but they use the same calculation block.

図８に示す第４実施形態は、第１実施形態の分離軸判定ステージを、第１ステージ：Ｓ５Ａ，第２ステージ：Ｓ５Ｂ，第３ステージ：Ｓ５Ｃとに分けて７ステージ・３段構成とした場合である。「Ｓ５Ａ」～「Ｓ５Ｃ」はそれぞれ５並列処理を行い、何れも同じ演算ブロックを使用する。 In the fourth embodiment shown in FIG. 8, the separation axis determination stage of the first embodiment is divided into a first stage: S5A, a second stage: S5B, and a third stage: S5C, and has a seven-stage, three-stage configuration. This is the case. "S5A" to "S5C" each perform 5 parallel processing and use the same calculation block.

図９に示す第５実施形態は、第４実施形態の分離軸判定第１ステージ：Ｓ５Ａを、方向演算ステージに組み込んで並列処理することで、６ステージ・２段構成とした場合である。この場合、「Ｓ５Ａ」では、第２実施形態と同様に、物体Ａの各方向ベクトルＸＡ，ＹＡ，ＺＡ及び物体Ｂの各方向ベクトルＸＢ，ＹＢ，ＺＢの内から５つを選択して並列処理する。 The fifth embodiment shown in FIG. 9 is a case where the separation axis determination first stage S5A of the fourth embodiment is incorporated into the direction calculation stage and processed in parallel, resulting in a six-stage, two-stage configuration. In this case, in "S5A", five of the directional vectors XA, YA, ZA of object A and the directional vectors XB, YB, ZB of object B are selected and processed in parallel, as in the second embodiment. do.

図１０に示す第６実施形態は、第１実施形態の分離軸判定ステージを、第１ステージ：Ｓ３Ａ～第５ステージ：Ｓ３Ｅに分けて９ステージ・２段構成とした場合である。「Ｓ３Ａ」～「Ｓ３Ｅ」はそれぞれ３並列処理を行い、何れも同じ演算ブロックを使用する。 The sixth embodiment shown in FIG. 10 is a case where the separation axis determination stage of the first embodiment is divided into a first stage: S3A to a fifth stage: S3E, and configured into nine stages and two stages. "S3A" to "S3E" each perform three parallel processes and use the same calculation block.

図１１に示す第７実施形態は、第５実施形態の分離軸判定第１ステージ：Ｓ３Ａを、方向演算ステージに組み込んで並列処理することで、８ステージ・２段構成とした場合である。この場合、「Ｓ３Ａ」では、物体Ａの各方向ベクトルＸＡ，ＹＡ，ＺＡ及び物体Ｂの各方向ベクトルＸＢ，ＹＢ，ＺＢの内から３つを選択して並列処理する。 The seventh embodiment shown in FIG. 11 is a case in which the first separated axis determination stage S3A of the fifth embodiment is incorporated into the direction calculation stage and processed in parallel, resulting in an eight-stage, two-stage configuration. In this case, in "S3A", three of the directional vectors XA, YA, ZA of object A and the directional vectors XB, YB, ZB of object B are selected and processed in parallel.

図１２に示す第８実施形態は、第１実施形態の分離軸判定ステージを、第１ステージ：Ｓ２Ａ～第８ステージ：Ｓ２Ｈに分けて１２ステージ・２段構成とした場合である。「Ｓ２Ａ」～「Ｓ２Ｇ」はそれぞれ２並列処理を行い、「Ｓ２Ｈ」は単独処理を行う。これらは何れも同じ演算ブロックを使用する。 The eighth embodiment shown in FIG. 12 is a case where the separation axis determination stage of the first embodiment is divided into 12 stages, 2 stages, from the first stage: S2A to the eighth stage: S2H. "S2A" to "S2G" each perform two parallel processes, and "S2H" performs single processing. All of these use the same calculation block.

図１３に示す第９実施形態は、第８実施形態の分離軸判定第１ステージ：Ｓ２Ａを、方向演算ステージに組み込んで並列処理することで、１１ステージ・２段構成とした場合である。この場合、「Ｓ２Ａ」では、物体Ａの各方向ベクトルＸＡ，ＹＡ，ＺＡ及び物体Ｂの各方向ベクトルＸＢ，ＹＢ，ＺＢの内から２つを選択して並列処理する。 The ninth embodiment shown in FIG. 13 is a case where the first separated axis determination stage S2A of the eighth embodiment is incorporated into the direction calculation stage and processed in parallel, resulting in an 11-stage, two-stage configuration. In this case, in "S2A", two of the directional vectors XA, YA, ZA of object A and the directional vectors XB, YB, ZB of object B are selected and processed in parallel.

本発明は上記した、又は図面に記載した実施形態にのみ限定されるものではなく、以下のような変形又は拡張が可能である。
ＣＰＵ及びＦＰＧＡの動作クロック周波数は、個別の設計に応じて適宜変更すれば良い。
ＤＢ_Ｗ０～ＤＢ_Ｗ３の具体数値についても、個別の設計に応じて適宜設定すれば良い。
衝突判定を行う対象モデルは、直方体モデルに限らない。したがって、分離軸判定を行う方向数も「１５」に限らない。 The present invention is not limited to the embodiments described above or illustrated in the drawings, but can be modified or expanded as described below.
The operating clock frequencies of the CPU and FPGA may be changed as appropriate depending on individual designs.
The specific numerical values of DB _W 0 to DB _W 3 may also be set as appropriate depending on the individual design.
The target model for collision determination is not limited to a rectangular parallelepiped model. Therefore, the number of directions in which separation axis determination is performed is not limited to "15".

図面中、１はＣＰＵ、２はＦＰＧＡ、４はＲＡＭを示す。 In the drawings, 1 represents a CPU, 2 represents an FPGA, and 4 represents a RAM.

Claims

When a request for collision determination processing between a movable part of the robot and an obstacle is input, data of a model representing the shape, etc. of at least a part of the movable part and the shape, etc. of at least a part of the obstacle is stored. In order to read the data of both models from the storage unit in which the data is stored, perform a collision determination for both, and output the determination result, a pipeline is configured to process at least a part of the calculations related to the collision determination in parallel. The logic is configured and
The maximum link length among all the links of the robot is L [mm], the allowable judgment error is E [mm], the number of divisions of the model shape etc. into multiple objects is D, and the function that rounds up the decimal point of x to an integer is f _RU (x) and the minimum required data bus bit width DB _W 0, the bit width DB _W 0 is determined by the following formula,

An FPGA for arithmetic processing for robot control, wherein a bit width DB _W 1 of a data bus connected to the storage section is set to be greater than or equal to the bit width DB _W 0.

If the number of bits of data representing the position of the model is N _P , and the bit width when using sign bits and quaternions is DB _W 2, then the bit width DB _W 2 is expressed as the following formula: DB _W 2=DB _W 0+4(N _P +1)
determined by
2. The FPGA for robot control arithmetic processing according to claim 1, wherein a bit width DB _W 3 of a data bus connected to said storage section is set to be greater than said bit width DB _W 2.

Both of the above models are rectangular parallelepiped models,
3. The FPGA for arithmetic processing of robot control according to claim 1, wherein said collision determination is performed using separation axes in 15 directions.

The pipeline is
a load stage for reading the data from the storage unit;
a coordinate transformation stage that transforms both rectangular parallelepiped models into the same coordinate system;
a direction calculation stage for determining a separation axis by a specific axis direction of both of the rectangular parallelepiped models or their cross product;
a separation axis determination stage that projects both of the rectangular parallelepiped models onto the separation axis and determines whether images of each model overlap;
4. The FPGA for arithmetic processing of robot control according to claim 3, further comprising a collision determination stage that performs collision determination based on the determination results for all the separation axes in the separation axis determination stage.

When a request for collision determination processing between a movable part of the robot and an obstacle is input, data of a model representing the shape, etc. of at least a part of the movable part and the shape, etc. of at least a part of the obstacle is stored. It reads the data of both models from the storage unit that is installed, performs a collision judgment for both, and outputs the judgment result.
A data bus width determination method applied to a robot control arithmetic processing FPGA in which logic is configured to form a pipeline that processes at least a part of the calculations related to collision determination in parallel, the method comprising:
The maximum link length among all the links of the robot is L [mm], the allowable judgment error is E [mm], the number of divisions of the model shape etc. into multiple objects is D, and the function that rounds up the decimal point of x to an integer is f _RU (x) and the minimum required data bus bit width DB _W 0, the bit width DB _W 0 is determined by the following formula,

A data bus width determining method for an FPGA for arithmetic processing in which a bit width DB _W 1 of a data bus connected to the storage unit is set to be equal to or greater than the bit width DB _W 0.

If the number of bits of data representing the position of the model is N _P and the bit width when using sign bits and quaternions is DB _W 2, then the bit width DB _W 2 can be expressed as the following formula: DB _W 2=DB _W 0+4(N _P +1)
determined by
6. The data bus width determination method for an arithmetic processing FPGA according to claim 5, wherein the bit width DB _W 3 of the data bus connected to the storage section is set to be equal to or larger than the bit width DB _W 2.