JP2009134635A - Multiprocessor system

Info

Publication number: JP2009134635A
Application number: JP2007311409A
Authority: JP
Inventors: Yuichiro Shima; 雄一郎島
Original assignee: Renesas Technology Corp
Current assignee: Renesas Technology Corp
Priority date: 2007-11-30
Filing date: 2007-11-30
Publication date: 2009-06-18

Abstract

PROBLEM TO BE SOLVED: To provide a multiprocessor system that increases the efficiency of data transfer between a plurality of processors.

SOLUTION: In a configuration having, for example, a microcomputer [1] MC1 with its system memory MEM1 and a microcomputer [2] MC2 with its system memory MEM2, a data relay device PCIC is disposed on the data transfer path from MEM1 to MEM2. PCIC includes a first buffer BFA and a second buffer BFB, which are accessed alternately when a large volume of data is transferred. For example, while a DMA controller DMAC1 in MC1 stores data into BFB, a DMA controller DMAC2 in MC2 reads data out of BFA (S63), and DMAC1 then stores the subsequent data into the emptied BFA (S64).

COPYRIGHT: (C)2009, JPO&INPIT

Description

The present invention relates to a multiprocessor system, and more particularly to a technique that is useful when applied to a multiprocessor system in which the processors transfer data to one another using their respective DMA controllers (Direct Memory Access Controllers).

For example, one known method of transferring data between two processors is to place a shared memory on the bus that connects them. When data is transferred by this method, the two processors arbitrate for the right to use the bus, one processor writes data to the shared memory, and the other processor reads the data from the shared memory.

The present inventor examined the multiprocessor system technique described above and found the following.

FIG. 12 is a block diagram showing an example of the configuration of a conventional multiprocessor system studied as a premise of the present invention. In the multiprocessor system shown in FIG. 12, a processor [1] CPU1 and a processor [2] CPU2 are connected by a common local bus LBUS, and a transfer buffer BF is connected to this LBUS. For example, when data is transferred from the system memory MEM1 for CPU1 to the system memory MEM2 for CPU2, the data in MEM1 is written to the transfer buffer BF via CPU1, and the data in BF is then written to MEM2 via CPU2.

To perform this operation, CPU1 and CPU2 each need a bus arbitration function. For example, before writing to the buffer BF, CPU1 must secure the right to use LBUS by outputting a bus request signal BREQ to CPU2 and receiving a bus request acknowledge signal BACK from CPU2 in reply. Likewise, before reading from BF, CPU2 must secure the right to use LBUS by the reverse procedure.

With this configuration, therefore, the resources of the processors are consumed during the transfer, and when, for example, data is divided and transferred in pieces, a handshake (BREQ and BACK) is required for each piece. Furthermore, CPU2 is kept waiting while CPU1 is writing data to BF. As a result, there are concerns about more complicated processing procedures, lower transfer efficiency, and lower processing efficiency of the processors.

The present invention has been made in view of the above. One object of the present invention is to provide a multiprocessor system capable of improving the efficiency of data transfer between a plurality of processors. Another object of the present invention is to provide a multiprocessor system capable of improving the efficiency of data transfer between a plurality of processors with a small circuit scale. These and other objects and novel features of the present invention will become apparent from the description in this specification and the accompanying drawings.

In a multiprocessor system according to an embodiment of the present invention, data transfer between a first system memory on the first processor side and a second system memory on the second processor side is performed using a first data transfer control unit (typically a DMA controller) on the first processor side and a second data transfer control unit on the second processor side. The system is characterized in that a data relay unit, which includes a data buffer containing N (N ≥ 2) storage areas, is provided on the data transfer path between the first system memory and the second system memory.

For example, when data is transferred from the first system memory to the second system memory, the first data transfer control unit controls the transfer from the first system memory to the data buffer of the data relay unit, and the second data transfer control unit controls the transfer from the data buffer to the second system memory. Data from the first system memory is stored in the N storage areas in order from the first storage area to the Nth storage area, and after the Nth storage area has been written, storage returns to the first storage area. In parallel with this, the data stored in the N storage areas is read out in order from the first storage area to the Nth storage area, and after the Nth storage area has been read, reading returns to the first storage area. Each block that is read out is transferred to the second system memory.

During this operation, each of the N storage areas is monitored to determine whether it has been written (full state) or read (empty state). When the target storage area is empty, the data relay unit enables data transfer by the first data transfer control unit (that is, writing to the storage area); when the target storage area is full, it enables data transfer by the second data transfer control unit (that is, reading from the storage area).
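The circular use of N storage areas and the empty/full gating described above can be sketched as a small software model. This is only an illustration under stated assumptions: the class `RelayBuffer` and the function `relay` are hypothetical names, and the sequential loop stands in for hardware in which the write side and the read side run concurrently.

```python
class RelayBuffer:
    """Software model of a data buffer with N storage areas used in rotation."""

    def __init__(self, n_areas=2):
        if n_areas < 2:
            raise ValueError("the description requires N >= 2 storage areas")
        self.areas = [None] * n_areas
        self.full = [False] * n_areas  # False = empty (read), True = full (written)
        self.wr = 0  # index of the storage area the writer targets next
        self.rd = 0  # index of the storage area the reader targets next

    def write_enabled(self):
        # Writing is enabled only while the target storage area is empty.
        return not self.full[self.wr]

    def read_enabled(self):
        # Reading is enabled only while the target storage area is full.
        return self.full[self.rd]

    def write(self, block):
        if not self.write_enabled():
            raise RuntimeError("target storage area is still full")
        self.areas[self.wr] = block
        self.full[self.wr] = True
        self.wr = (self.wr + 1) % len(self.areas)  # wrap from area N to area 1

    def read(self):
        if not self.read_enabled():
            raise RuntimeError("target storage area is still empty")
        block = self.areas[self.rd]
        self.full[self.rd] = False
        self.rd = (self.rd + 1) % len(self.areas)  # wrap from area N to area 1
        return block


def relay(blocks, n_areas=2):
    """Move all blocks through the buffer and return them in arrival order."""
    buf = RelayBuffer(n_areas)
    pending = list(blocks)
    received = []
    while pending or buf.read_enabled():
        if pending and buf.write_enabled():
            buf.write(pending.pop(0))
        if buf.read_enabled():
            received.append(buf.read())
    return received
```

Because both sides visit the storage areas in the same fixed order, the reader sees the blocks in the order they were written, even though the buffer holds far fewer blocks than the whole transfer.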

With this configuration, when a large amount of data is transferred, for example, a write operation to one storage area and a read operation from another storage area can proceed in parallel, so high data transfer efficiency can be achieved. Because DMA controllers or the like are used, the processors' resources are not consumed, and a drop in their processing efficiency can be suppressed. Furthermore, because the N storage areas are used in rotation, a large data transfer can be carried out with a data buffer whose capacity is smaller than the amount of data transferred, which makes it possible to reduce the circuit scale. The minimum configuration uses two storage areas, which are written and read alternately.

A multiprocessor system according to an embodiment of the present invention is also characterized in that it has two such data buffers, one for data transfer from the first system memory to the second system memory and one for data transfer from the second system memory to the first system memory. In this case, if bidirectional data transfer is set up using, for example, the multiple channels of the first and second data transfer control units, the transfers in both directions can proceed in parallel without intervention by either processor along the way. By contrast, if a single data buffer were made bidirectional, the circuit would become more complicated and processor intervention would be needed partway through a bidirectional transfer. If two bidirectional data buffers were provided, there would be concerns about circuit complexity and increased circuit scale. Providing two unidirectional data buffers, as described above, therefore allows efficient bidirectional data transfer with a small circuit scale.

Using a multiprocessor system according to an embodiment of the present invention makes it possible to improve the efficiency of data transfer between a plurality of processors, and to do so with a small circuit scale.

Embodiments of the present invention are described in detail below with reference to the drawings. Throughout the drawings used to describe the embodiments, the same members are in principle denoted by the same reference symbols, and repeated description of them is omitted. Where convenient, the following description is divided into multiple sections or embodiments; unless explicitly stated otherwise, these are not unrelated to one another, and one is a modification, a detailed account, or a supplementary explanation of part or all of another. When the following embodiments refer to the number of elements or the like (including counts, numerical values, quantities, and ranges), the reference is not limited to that specific number, which may be exceeded or undershot, unless explicitly stated otherwise or unless the number is clearly limited in principle.

Likewise, the constituent elements (including element steps) of the following embodiments are not necessarily essential, unless explicitly stated otherwise or unless they are clearly essential in principle. Similarly, references to the shape, positional relationship, and the like of constituent elements include shapes and the like that substantially approximate or resemble them, unless explicitly stated otherwise or unless this is clearly not the case in principle. The same applies to the numerical values and ranges mentioned above.

First, a multiprocessor system studied as a comparative example in the course of examining the present invention is described. FIG. 1 is a block diagram showing an example of the configuration of this comparative multiprocessor system. In the multiprocessor system shown in FIG. 1, a microcomputer [1] MC1 and a system memory MEM1 for MC1 are connected to a local bus [1] LBUS1, and a microcomputer [2] MC2 and a system memory MEM2 for MC2 are connected to a local bus [2] LBUS2.

The multiprocessor system of FIG. 1 further includes a dual-port memory DPRAM, with LBUS1 connected to one port and LBUS2 to the other. The microcomputer [1] MC1 includes a processor [1] CPU1 and a DMA controller [1] DMAC1, and the microcomputer [2] MC2 includes a processor [2] CPU2 and a DMA controller [2] DMAC2. For example, to transfer data from MEM1 to MEM2, DMAC1 reads the desired data from MEM1 and writes it to DPRAM, and DMAC2 reads this data from DPRAM and writes it to MEM2. Unlike the case of FIG. 12, the resources of CPU1 and CPU2 are not consumed during the transfer, so the processing efficiency of the processors can be maintained to some extent.

In this case, however, the maximum amount of data that can be transferred per activation of the DMA controller DMAC is the capacity of DPRAM. To transfer a large amount of data, therefore, one of two approaches is needed: (1) increase the capacity of DPRAM, or (2) keep the capacity of DPRAM small and activate the DMA controller DMAC multiple times. Approach (1) naturally increases the circuit scale.
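The cost of approach (2) can be made concrete with a short calculation. The function name `dmac_activations` and the 128-byte DPRAM capacity used in the usage note are illustrative assumptions, not values taken from the comparative example.

```python
def dmac_activations(total_bytes, dpram_bytes):
    """Number of DMAC activations needed when each activation moves at most
    one DPRAM's worth of data (ceiling division)."""
    if total_bytes < 0 or dpram_bytes <= 0:
        raise ValueError("sizes must be non-negative, and the DPRAM capacity positive")
    return -(-total_bytes // dpram_bytes)
```

For instance, under the assumed 128-byte capacity, `dmac_activations(320, 128)` evaluates to 3, so software on both processors would have to coordinate three separate start-ups for one 320-byte transfer.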

With approach (2), on the other hand, the burden on the software that controls CPU1 and CPU2 increases. For example, before CPU1 activates DMAC1 for the Nth time, it must confirm that the operation started by CPU2's (N-1)th activation of DMAC2 has completed. To that end, the software on CPU2 must, for example, assert an interrupt signal [A] or the like when it receives notification of completion from DMAC2 (by an interrupt signal or the like), and the software on CPU1 must detect the assertion of interrupt signal [A] and then activate DMAC1. This complicates the software, and executing such processing may lower the data transfer efficiency and the processing efficiency of the processors. The embodiments described below are intended to solve these problems as well.

(Embodiment 1)
FIG. 2 is a block diagram showing an example of the configuration of a multiprocessor system according to Embodiment 1 of the present invention. The multiprocessor system shown in FIG. 2 includes a microcomputer [1] MC1 connected to a local bus [1] LBUS1, a microcomputer [2] MC2 connected to a local bus [2] LBUS2, a system memory MEM1 for MC1, a system memory MEM2 for MC2, and so on. The microcomputer [1] MC1 includes a processor [1] CPU1 and a DMA controller [1] DMAC1, and the microcomputer [2] MC2 includes a processor [2] CPU2 and a DMA controller [2] DMAC2. The local buses LBUS1 and LBUS2 are external buses connected to the outside of the microcomputers MC1 and MC2, respectively. Each of the microcomputers MC1 and MC2 also includes an internal bus (not shown) that connects its processor and DMA controller.

The system memory MEM1 is connected to the microcomputer [1] MC1 by a memory bus MBUS1, and the system memory MEM2 is connected to the microcomputer [2] MC2 by a memory bus MBUS2. As is widely known, once the processor CPU has set the address of the transfer source device (memory address, I/O address, etc.), the address of the transfer destination device, the transfer data size, and so on, and has then issued a start command, the DMA controller (data transfer control unit) DMAC transfers data according to those settings without going through the CPU. A device with DREQ (data transfer request signal) can also be specified as the transfer source or destination. In that case, when the DMAC receives DREQ from the device, it accesses the device by sending it the data transfer acknowledge signal DACK. A DMAC normally has multiple channels, and the settings described above can be made independently for each channel.

The main feature of the multiprocessor system of FIG. 2 is that, in addition to the above, it includes a data relay device PCIC connected to LBUS1 and LBUS2. FIG. 3 is a block diagram showing a detailed configuration example of the data relay device PCIC in FIG. 2. The data relay device PCIC shown in FIG. 3 includes a circuit section for channel 0, which handles data transfer from LBUS1 to LBUS2, and a circuit section for channel 1, which handles data transfer from LBUS2 to LBUS1.

The circuit section for channel 0 includes a write control unit WCTL0 connected to LBUS1, a read control unit RCTL0 connected to LBUS2, a buffer [A] BFA0 and a buffer [B] BFB0 that have, for example, dual ports, and a counter unit CUNT0. The circuit section for channel 1 includes a read control unit RCTL1 connected to LBUS1, a write control unit WCTL1 connected to LBUS2, a buffer [A] BFA1 and a buffer [B] BFB1 that have, for example, dual ports, and a counter unit CUNT1. The buffers BFA0, BFB0, BFA1, and BFB1 are implemented, for example, as independent storage circuits so that any of them can be accessed simultaneously. Although not limited to this, each buffer is assumed here to have a capacity of 128 bytes.

When the write control unit WCTL0 (WCTL1) receives a write command from outside, it receives write data from the local bus LBUS1 (LBUS2) and writes that data into buffer [A] BFA0 or buffer [B] BFB0 (BFA1 or BFB1) through one of the buffer's ports. WCTL0 (WCTL1) also recognizes the states of BFA0 and BFB0 (BFA1 and BFB1) from the control signals from the counter unit CUNT0 (CUNT1) and, depending on those states, sends the data transfer request signal DREQ0 (DREQ1) to the outside. DMAC1 (DMAC2) in FIG. 2 responds to DREQ0 (DREQ1) by sending the data transfer request acknowledge signal DACK0 (DACK1); here, DACK0 (DACK1) serves as the external write command.

When the read control unit RCTL0 (RCTL1) receives a read command from outside, it reads data from buffer [A] BFA0 or buffer [B] BFB0 (BFA1 or BFB1) through the buffer's other port and sends the read data to the local bus LBUS2 (LBUS1). RCTL0 (RCTL1) also recognizes the states of BFA0 and BFB0 (BFA1 and BFB1) from the control signals from the counter unit CUNT0 (CUNT1) and, depending on those states, sends the data transfer request signal DREQ0 (DREQ1) to the outside. DMAC2 (DMAC1) in FIG. 2 responds to DREQ0 (DREQ1) by sending DACK0 (DACK1); here, DACK0 (DACK1) serves as the external read command.

FIG. 4 is a block diagram showing a detailed configuration example of the counter units CUNT0 and CUNT1 in FIG. 3. The counter unit (buffer management unit) CUNT shown in FIG. 4 includes a write counter WCNT, a read counter RCNT, a state machine SM, and so on. The write counter WCNT counts write pulses from the write control unit WCTL and sends the control signal (wr_end) to the state machine SM when the count is an integer multiple of 128 bytes, the capacity of each buffer, or when the entire transfer is complete (when the count reaches a preset value). A write pulse is generated each time WCTL executes one write cycle to buffer [A] BFA or buffer [B] BFB. The preset value of WCNT is set to the size of one transfer; in the example of FIG. 4, it is set to 320 bytes.

The read counter RCNT counts read pulses from the read control unit RCTL and sends the control signal (rd_end) to the state machine SM when the count is an integer multiple of 128 bytes, the capacity of each buffer, or when the entire transfer is complete (when the count reaches a preset value). A read pulse is generated each time RCTL executes one read cycle from buffer [A] BFA or buffer [B] BFB. The preset value of RCNT is likewise set to the size of one transfer.
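The behavior of the write and read counters can be sketched as follows. The description does not state how many bytes one access cycle moves, so `bytes_per_cycle` (defaulting to 4) is an assumption, as are the names `TransferCounter` and `end_signal_counts`. The same model serves for WCNT (wr_end) and RCNT (rd_end).

```python
class TransferCounter:
    """Model of WCNT or RCNT: counts access pulses and reports the end signal."""

    def __init__(self, transfer_size, buffer_size=128, bytes_per_cycle=4):
        if buffer_size % bytes_per_cycle or transfer_size % bytes_per_cycle:
            raise ValueError("sizes must be multiples of bytes_per_cycle in this model")
        self.transfer_size = transfer_size    # preset value: size of one transfer
        self.buffer_size = buffer_size        # capacity of each buffer (128 bytes)
        self.bytes_per_cycle = bytes_per_cycle  # assumed bus width per access cycle
        self.count = 0

    def pulse(self):
        """Count one access cycle; return True when wr_end / rd_end is sent."""
        self.count += self.bytes_per_cycle
        at_buffer_boundary = self.count % self.buffer_size == 0
        transfer_done = self.count >= self.transfer_size
        return at_buffer_boundary or transfer_done


def end_signal_counts(transfer_size, buffer_size=128, bytes_per_cycle=4):
    """Return the byte counts at which the end signal is sent during one transfer."""
    counter = TransferCounter(transfer_size, buffer_size, bytes_per_cycle)
    fired = []
    while counter.count < transfer_size:
        if counter.pulse():
            fired.append(counter.count)
    return fired
```

With the 320-byte setting used in the description, `end_signal_counts(320)` yields `[128, 256, 320]`: two full-buffer boundaries, followed by the end of the transfer partway through a buffer.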

The state machine SM starts in the empty state (EMP) and, on receiving the control signal (wr_end) in this state, transitions to the full state (NotEMP). Conversely, on receiving the control signal (rd_end) in the full state (NotEMP), it transitions to the empty state (EMP). The empty state (EMP) or full state (NotEMP) is determined separately for each of the buffers [A] BFA and [B] BFB. In every cycle, the state machine SM reports to the read control unit RCTL and the write control unit WCTL whether each of the buffers [A] BFA and [B] BFB is in the empty state (EMP) or the full state (NotEMP).
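The per-buffer state transitions described above amount to a two-state machine. The sketch below is a minimal software rendering; the names `BufState` and `next_state` are hypothetical, and signals that arrive in a state where the description defines no transition are assumed to leave the state unchanged.

```python
from enum import Enum


class BufState(Enum):
    EMP = "EMP"         # empty state (initial state)
    NOT_EMP = "NotEMP"  # full state


def next_state(state, wr_end=False, rd_end=False):
    """Return the next state of one buffer given the counter control signals."""
    if state is BufState.EMP and wr_end:
        return BufState.NOT_EMP  # a completed write fills the buffer
    if state is BufState.NOT_EMP and rd_end:
        return BufState.EMP      # a completed read empties the buffer
    return state                 # assumption: all other inputs are ignored
```

One such state would be kept for buffer [A] and another for buffer [B], matching the statement that the determination is made per buffer.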

FIG. 5 shows an example of the operation of the read control unit RCTL and the write control unit WCTL in FIG. 4: (a) illustrates the operation of the read control unit, and (b) illustrates the operation of the write control unit. As shown in FIG. 5(a), the read control unit RCTL receives the output signals from the state machine SM and is disabled only when buffer [A] BFA and buffer [B] BFB are both in the empty state (EMP); otherwise it is enabled. As shown in FIG. 5(b), the write control unit WCTL receives the output signals from the state machine SM and is disabled only when buffer [A] BFA and buffer [B] BFB are both in the full state (NotEMP); otherwise it is enabled.
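The enable rules of FIG. 5 reduce to two Boolean expressions. The following sketch tabulates them for all four buffer-state combinations; the function names are hypothetical, and each buffer state is represented simply as a flag that is `True` for EMP and `False` for NotEMP.

```python
def read_enabled(a_empty, b_empty):
    # RCTL is disabled only when both buffers are empty.
    return not (a_empty and b_empty)


def write_enabled(a_empty, b_empty):
    # WCTL is disabled only when both buffers are full (NotEMP).
    return a_empty or b_empty


def enable_table():
    """Return rows of (BFA state, BFB state, read enabled, write enabled)."""
    rows = []
    for a_empty in (True, False):
        for b_empty in (True, False):
            rows.append((
                "EMP" if a_empty else "NotEMP",
                "EMP" if b_empty else "NotEMP",
                read_enabled(a_empty, b_empty),
                write_enabled(a_empty, b_empty),
            ))
    return rows
```

Whenever one buffer is empty and the other is full, both control units are enabled, which is the condition that lets a write to one buffer overlap a read from the other.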

The full state (NotEMP) means either that the buffer is completely filled (FULL) or that the buffer holds valid data amounting to less than its 128-byte capacity (HALF). The HALF state is used, as described above, when the count values of the write counter WCNT and the read counter RCNT are less than 128 bytes.

FIG. 6 is an explanatory diagram showing an example of an outline of the operation of the multiprocessor system of FIG. 2. The case considered here is the transfer of data from the system memory MEM1 for MC1 in FIG. 2 to the system memory MEM2 for MC2, which uses the circuit section for channel 0 in the data relay device PCIC of FIG. 3. The local buses LBUS1 and LBUS2 are assumed to operate at the same frequency.

In S61 of FIG. 6, CPU1 first sets MEM1 as the transfer source and PCIC as the transfer destination for DMAC1, CPU2 sets PCIC as the transfer source and MEM2 as the transfer destination for DMAC2, and each sets the transfer size, here 320 bytes. CPU1 and CPU2 then activate DMAC1 and DMAC2. Buffer [A] BFA and buffer [B] BFB are each 128 bytes in size. In the initial state, BFA and BFB are both empty (EMP), so the data transfer request signal DREQ to DMAC1 is enabled and DREQ to DMAC2 is disabled.

Next, in S62, DMAC1, with its DREQ enabled, writes data from MEM1 into buffer [A] BFA through, for example, port [1]. DMAC2, with its DREQ disabled, remains in a standby state. Once 128 bytes of data have been transferred to BFA, BFA enters the full state (FULL), and DREQ to DMAC2 transitions to enabled.

Then, in S63, DMAC1 writes data from MEM1 into buffer [B] BFB through port [1]. Meanwhile DMAC2, with its DREQ now enabled, reads the data in buffer [A] BFA through port [2] and transfers it to MEM2. Once 128 bytes (256 bytes in total) have been transferred to BFB, BFB enters the full state (FULL). BFA, which was in the full state (FULL), transitions to the empty state (EMP) once DMAC2 has transferred its 128 bytes. As a result, DREQ on both the DMAC1 side and the DMAC2 side remains enabled.

Then, in S64, DMAC1 writes data from MEM1 through port [1] into buffer [A] BFA, which became empty (EMP) in S63. Meanwhile DMAC2 reads the data in buffer [B] BFB through port [2] and transfers it to MEM2. Once 64 bytes (320 bytes in total) have been transferred to BFA, BFA enters the full state (HALF). BFB, which was in the full state (FULL), transitions to the empty state (EMP) once DMAC2 has transferred its 128 bytes. The transfer on the DMAC1 side is now complete, and DMAC1 notifies CPU1 of the completion of the data transfer using an interrupt signal or the like.

Finally, in S65, DMAC2 reads the data in buffer [A] BFA through port [2] and transfers it to MEM2. Once the 64 bytes have been transferred out of BFA, BFA transitions from half full (HALF) to empty (EMP). The transfer on the DMAC2 side is now also complete, and DMAC2 notifies CPU2 of the completion of the data transfer by an interrupt signal or the like.
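The sequence S62 to S65 can be reproduced with a small step-level model. The following Python sketch is illustrative only and is not part of the disclosure: the function name `relay_steps` and the idea of treating each step as one atomic buffer fill and one atomic buffer drain are assumptions made for the example. It tracks the cumulative bytes written by DMAC1, the cumulative bytes read by DMAC2, and the fill level of BFA and BFB after each step.

```python
BUF_SIZE = 128  # bytes per buffer (BFA, BFB), as in the description


def relay_steps(total_bytes, buf_size=BUF_SIZE):
    """Step-level model of the two-buffer alternation (hypothetical helper).

    In each step the writer (DMAC1) fills one empty buffer while the reader
    (DMAC2) drains a buffer that was filled in an earlier step.
    Returns one tuple per step:
    (bytes written so far, bytes read so far, BFA level, BFB level).
    """
    level = [0, 0]   # bytes held in BFA, BFB
    w = r = 0        # buffer index used next by the writer / reader
    written = read = 0
    steps = []
    while read < total_bytes:
        # Decide both actions from the state at the start of the step,
        # because the two DMACs run concurrently.
        can_read = level[r] > 0
        can_write = written < total_bytes and level[w] == 0
        if can_read:
            read += level[r]
            level[r] = 0          # buffer becomes EMP
            r ^= 1
        if can_write:
            chunk = min(buf_size, total_bytes - written)
            level[w] = chunk      # FULL, or partly filled on the last chunk
            written += chunk
            w ^= 1
        steps.append((written, read, level[0], level[1]))
    return steps


if __name__ == "__main__":
    for step, row in zip(("S62", "S63", "S64", "S65"), relay_steps(320)):
        print(step, row)
```

For a 320-byte transfer the model takes four steps, with the writer finishing one step before the reader, which corresponds to DMAC1 completing in S64 and DMAC2 completing in S65.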

FIG. 7 is an explanatory diagram showing an example of the logical address assignment of the buffers BFA0, BFB0, BFA1, and BFB1 in the data relay device PCIC of FIG. 3. As shown in FIG. 7, for example, the 32 MByte range from base address + h'0000000 to base address + h'1FFFFFF is the apparent address space of the channel 0 buffer, and the 32 MByte range from base address + h'2000000 to base address + h'3FFFFFF is the apparent address space of the channel 1 buffer. Here, 32 MByte is, for example, the maximum data size that the DMA controller DMAC can transfer continuously; if a different DMA controller is adopted, a data size suited to the application can be selected.

Of the channel 0 address space, the only buffers that physically exist are the 128-byte buffer [A] BFA0 at h'0000000 to h'000007F and the 128-byte buffer [B] BFB0 that immediately follows it. Beyond these, shadows of buffer [A] and shadows of buffer [B], which do not physically exist, alternate continuously. Likewise, of the channel 1 address space, the only buffers that physically exist are the 128-byte buffer [A] BFA1 at h'2000000 to h'200007F and the 128-byte buffer [B] BFB1 that immediately follows it. Beyond these, shadows of buffer [A] and shadows of buffer [B], which do not physically exist, alternate continuously.

In other words, although only 256 bytes are physically provided for the channel 0 buffer and 256 bytes for the channel 1 buffer, the two-way alternating operation of buffer [A] and buffer [B] described with reference to FIG. 6 allows a continuous data transfer of 32 MByte on each channel. As FIG. 6 shows, the principle is not limited to 32 MByte; larger amounts of data can of course also be transferred continuously.

Because the DMA controller DMAC is normally specified to access a contiguous address space, immediately after accessing, for example, h'00000FF in buffer [B], it accesses h'0000100, which is a shadow of buffer [A]. This is only the apparent access; the address actually accessed inside the data relay device PCIC is h'0000000 in buffer [A]. This two-way alternating scheme is easy to implement: for example, when an address h'xxxxxyy is input, only the last two digits "yy" are used for the access.
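The address folding described above can be illustrated with a short Python sketch. It is an assumption-laden example rather than the circuit of the embodiment: the function name `decode` and the returned tuple layout are hypothetical, and offsets are taken relative to the base address. Only the low 8 bits (the last two hexadecimal digits) select the physical location, so every 256-byte shadow maps back onto the same BFA/BFB pair.

```python
BUF_SIZE = 0x80            # 128 bytes per physical buffer
PAIR_SIZE = 2 * BUF_SIZE   # BFA + BFB = 256 bytes per channel
CHANNEL_SPAN = 0x2000000   # 32 MByte apparent window per channel
NUM_CHANNELS = 2


def decode(offset):
    """Map an apparent offset (relative to the base address) to
    (channel, physical buffer name, byte index inside that buffer).

    Hypothetical helper for illustration only.
    """
    if not 0 <= offset < NUM_CHANNELS * CHANNEL_SPAN:
        raise ValueError("offset outside the apparent buffer windows")
    channel = offset // CHANNEL_SPAN
    low = offset & (PAIR_SIZE - 1)          # keep only the digits "yy"
    name = "BFA" if low < BUF_SIZE else "BFB"
    return channel, f"{name}{channel}", low & (BUF_SIZE - 1)


if __name__ == "__main__":
    for off in (0x00000FF, 0x0000100, 0x2000080):
        print(hex(off), decode(off))
```

With these sizes each 32 MByte window contains 32 MByte / 256 Byte = 131072 images of the physical buffer pair, of which only the first is backed by real storage.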

Thus, using the multiprocessor system of FIG. 2 makes it possible to (1) achieve high data transfer efficiency, (2) improve the processing efficiency of the processors during data transfer, and (3) improve data transfer efficiency or processor processing efficiency with a small circuit scale. For example, compared with the configuration example of FIG. 12, there is no need to give each processor CPU a bus arbitration function and have it perform arbitration, and while data is being transferred out of one system memory MEM, data can also be transferred into the other system memory MEM. Compared with the configuration example of FIG. 1, a small-capacity buffer BF can carry a continuous data transfer that exceeds its capacity. Moreover, because no intervention by the processor CPU (such as operating the DMAC through CPU software) is needed during the data transfer, the desired program processing by the CPU and the data transfer between the system memories can run efficiently in parallel.

In the description of FIG. 3 and elsewhere, the buffers BFA0, BFB0, BFA1, and BFB1 are dual-port memories; however, as the description of FIG. 6 shows, a write command and a read command are never issued to the same buffer at the same time. Each buffer therefore needs no arbitration function for simultaneous access, which allows each buffer to be made smaller. Furthermore, when port [1] and port [2] operate at the same frequency, or when the frequency difference can be tolerated, each buffer can instead be a single-port memory, which allows it to be made smaller still.

FIG. 8 is an operation explanatory diagram that supplements FIG. 6. FIG. 6 assumed that the local buses LBUS1 and LBUS2 operate at the same frequency; here, LBUS1 (that is, port [1] of the buffer BF) is assumed to operate at twice the frequency of LBUS2 (that is, port [2] of BF).

In this case, for example, at t = t2 in FIG. 8, when the write operation from the port [1] side makes buffer [A] BFA full (FULL), the data transfer request signal DREQ on the port [2] side transitions to enabled, and the port [2] side starts reading BFA. Then, at t = t4, the write operation from the port [1] side makes buffer [B] BFB full (FULL), but the read of BFA from the port [2] side is only half complete. BFA and BFB are therefore both full (FULL), and the DREQ on the port [1] side transitions to disabled.

At t = t5, because the DREQ on the port [1] side is disabled, the DMAC that writes from the port [1] side is in standby, and only the read operation by the DMAC on the port [2] side proceeds. Then, at t = t6, when the read of BFA from the port [2] side completes, BFA transitions to empty (EMP) and the DREQ on the port [1] side transitions to enabled. At t = t7, the DMAC on the port [1] side resumes writing to BFA.

As in the interval t = t4 to t6, when the local buses LBUS1 and LBUS2 operate at different frequencies, the DMAC on the faster local bus may enter a standby state. However, because this is handled in hardware by the DMAC monitoring DREQ, no processing by the processor CPU is involved. In this embodiment the standby state arises because the buffers BF alternate two ways; it can be avoided to some extent by, for example, alternating three ways with three buffers BFA, BFB, and BFC, or N ways with N (N > 3) buffers. Doing so, of course, increases the buffer capacity and requires additional control circuitry, so from that standpoint the two-way alternation is preferable.
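The standby behavior of FIG. 8 and the effect of adding buffers can be explored with a tick-level model. The Python sketch below is a simplified illustration under stated assumptions: the function name `simulate`, the per-tick byte rates (64 bytes on port [1], 32 bytes on port [2]), and the rule that a buffer is handed to the reader only once it is full or holds the last data are choices made for the example; the description itself only fixes the 2:1 frequency ratio and the 128-byte buffer size.

```python
def simulate(total, n_bufs=2, buf_size=128, write_rate=64, read_rate=32):
    """Tick-level model of N-way buffer alternation (hypothetical helper).

    write_rate / read_rate are bytes moved per tick on port [1] / port [2].
    Returns (ticks until the reader has drained everything,
             ticks the writer spent waiting for an empty buffer).
    """
    level = [0] * n_bufs       # bytes currently held in each buffer
    ready = [False] * n_bufs   # True while a buffer is complete and being drained
    w = r = 0                  # buffer the writer / reader works on
    written = read = ticks = stalls = 0
    while read < total:
        ticks += 1
        # Both ports act on the state seen at the start of the tick.
        has_data = written < total
        can_write = has_data and not ready[w]
        can_read = ready[r]
        if has_data and not can_write:
            stalls += 1        # writer-side DREQ disabled: next buffer not yet empty
        if can_read:
            n = min(read_rate, level[r])
            level[r] -= n
            read += n
            if level[r] == 0:  # buffer drained (EMP): move to the next one
                ready[r] = False
                r = (r + 1) % n_bufs
        if can_write:
            n = min(write_rate, buf_size - level[w], total - written)
            level[w] += n
            written += n
            if level[w] == buf_size or written == total:
                ready[w] = True   # FULL (or last data): hand over to the reader
                w = (w + 1) % n_bufs
    return ticks, stalls


if __name__ == "__main__":
    print("2 buffers:", simulate(384, n_bufs=2))
    print("3 buffers:", simulate(384, n_bufs=3))
```

In this model a 384-byte transfer with two buffers makes the writer wait for two ticks, matching the t4 to t6 interval, while a third buffer removes the wait for that transfer length. Because the reader is the slower side, the total transfer time is the same in both cases; a longer transfer would eventually stall the writer even with three buffers, which is consistent with the statement that the standby state is avoided only to some extent.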

FIG. 9 is an explanatory diagram showing an operation example of the data relay device PCIC of FIG. 3. As described above, the PCIC of FIG. 3 has channel 0, which handles transfers from the local bus LBUS1 to LBUS2, and channel 1, which handles transfers from LBUS2 to LBUS1. Therefore, as shown in FIG. 9, the transfer of storage area AR11 in system memory MEM1 to storage area AR21 in system memory MEM2 and the transfer of storage area AR22 in MEM2 to storage area AR12 in MEM1 can, for example, be executed in parallel.

In this case, for example, channel 0 of DMAC1 is set with AR11 as the transfer source and the PCIC channel 0 buffer as the transfer destination, and channel 1 of DMAC1 is set with the PCIC channel 1 buffer as the transfer source and AR12 as the transfer destination. Channel 0 of DMAC2, in turn, is set with the PCIC channel 0 buffer as the transfer source and AR21 as the transfer destination, and channel 1 of DMAC2 is set with AR22 as the transfer source and the PCIC channel 1 buffer as the transfer destination.
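The four channel settings can be written down as a small table and checked for consistency. The Python sketch below is purely illustrative: the `ChannelSetup` record, the labels `PCIC.ch0` and `PCIC.ch1`, and the helper `end_to_end_routes` are hypothetical names, and no real DMAC register layout is implied. It pairs the DMAC channel that writes each PCIC buffer with the one that reads it, giving the memory-to-memory route each PCIC channel carries.

```python
from typing import NamedTuple


class ChannelSetup(NamedTuple):
    dmac: str
    channel: int
    source: str
    destination: str


# Settings as described for FIG. 9 (buffer labels are hypothetical).
SETUPS = [
    ChannelSetup("DMAC1", 0, "AR11", "PCIC.ch0"),
    ChannelSetup("DMAC1", 1, "PCIC.ch1", "AR12"),
    ChannelSetup("DMAC2", 0, "PCIC.ch0", "AR21"),
    ChannelSetup("DMAC2", 1, "AR22", "PCIC.ch1"),
]


def end_to_end_routes(setups):
    """Pair the writer and the reader of each PCIC channel buffer and
    return {buffer: (source area, destination area)}."""
    buffers = sorted({s.destination for s in setups
                      if s.destination.startswith("PCIC")})
    routes = {}
    for buf in buffers:
        writer = next(s for s in setups if s.destination == buf)
        reader = next(s for s in setups if s.source == buf)
        routes[buf] = (writer.source, reader.destination)
    return routes


if __name__ == "__main__":
    for buf, (src, dst) in end_to_end_routes(SETUPS).items():
        print(f"{buf}: {src} -> {dst}")
```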

After these settings are made, when CPU1 and CPU2 start DMAC1 and DMAC2, the bidirectional data transfer described above proceeds without CPU involvement along the way. Although not limited to this, when the channels of DMAC1 and DMAC2 operate in the so-called round-robin mode (in which a channel drops to the lowest priority when the transfer of one transfer unit finishes), each DMAC alternates in time between write operations and read operations on the PCIC.
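The round-robin behavior mentioned above can be sketched as a simple queue rotation. This Python example is a simplification under explicit assumptions: the function name `round_robin` is hypothetical, every channel is treated as always ready (DREQ gating by the PCIC is ignored), and a "transfer unit" is an abstract count rather than a specific burst size.

```python
from collections import deque


def round_robin(pending_units):
    """Return the order in which channels are served when a channel drops to
    the lowest priority after each transfer unit (hypothetical helper).

    pending_units maps a channel label to its number of remaining units.
    """
    remaining = dict(pending_units)   # do not modify the caller's dict
    order = deque(remaining)          # front of the queue = highest priority
    served = []
    while order:
        ch = order.popleft()
        served.append(ch)
        remaining[ch] -= 1
        if remaining[ch] > 0:
            order.append(ch)          # back of the queue = lowest priority
    return served


if __name__ == "__main__":
    # DMAC1: channel 0 writes to the PCIC, channel 1 reads from it.
    print(round_robin({"ch0 (write)": 3, "ch1 (read)": 3}))
```

With equal amounts pending on both channels, the served order strictly alternates between the write channel and the read channel, which is the time-series alternation the paragraph describes.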

Such bidirectional data transfer is useful, for example, when running a program in which data processed by CPU1 is processed by CPU2 and the data processed by CPU2 is then processed by CPU1. In addition, as detailed later, dedicating one of the two channels to LBUS1 → LBUS2 data transfer and the other to LBUS2 → LBUS1 data transfer, as in the PCIC of FIG. 3, makes efficient bidirectional data transfer possible with a small circuit scale or a simple configuration.

As described above, the representative effect of using the multiprocessor system of Embodiment 1 is that data transfer efficiency between a plurality of processors can be improved. Moreover, this improvement can be achieved with a small circuit scale.

(Embodiment 2)
FIG. 10 is a block diagram showing a detailed configuration example of the data relay device PCIC of FIG. 2 in a multiprocessor system according to Embodiment 2 of the present invention. Like FIG. 3, the data relay device PCIC shown in FIG. 10 includes buffer [A] BFA, buffer [B] BFB, and a counter unit CUNT; unlike FIG. 3, it has a read/write control unit RWCTLa on the local bus LBUS1 side and a read/write control unit RWCTLb on the local bus LBUS2 side. That is, unlike FIG. 3, a single channel can transfer data in both directions.

For example, when the DMACs are set for data transfer from LBUS1 to LBUS2, the read/write control unit RWCTLa operates as the write control unit WCTL described above and RWCTLb operates as the read control unit RCTL described above. Conversely, when the DMACs are set for data transfer from LBUS2 to LBUS1, RWCTLa operates as the read control unit RCTL and RWCTLb operates as the write control unit WCTL.

Even with this configuration, the various effects described in Embodiment 1 can be obtained to some extent. However, each read/write control unit RWCTLa, RWCTLb must switch control signals such as read enable and write enable for the buffer BF according to the transfer direction, and must also switch the logic of the data transfer request signal DREQ sent to the DMA controller DMAC. Many signal selection circuits are therefore needed, which increases the circuit scale and complicates control; in addition, if LBUS1 and LBUS2 operate at different frequencies, synchronization may in some cases be needed along with the signal selection.

Furthermore, in terms of operation, when the configuration example of FIG. 10 performs the bidirectional data transfer described with reference to FIG. 9, storage area AR11 of MEM1 is first transferred to storage area AR21 of MEM2, and only then, after each CPU reconfigures and restarts its DMAC, is storage area AR22 of MEM2 transferred to storage area AR12 of MEM1. Data transfer efficiency and CPU processing efficiency are therefore lower than in the case of FIG. 3. If the configuration example of FIG. 10 is provided for two channels, data transfer efficiency and CPU processing efficiency equivalent to those of FIG. 3 can be secured; however, as noted above, this raises concerns about increased circuit scale and more complicated control. Also, starting two channels simultaneously and using both for data transfer from LBUS1 to LBUS2, for example, offers little practical benefit.

From this standpoint, therefore, the configuration example of FIG. 3 is preferable. The configuration example of FIG. 10 is useful when, for example, the number of channels available in the DMA controller DMAC is limited (for example, to one).

(Embodiment 3)
FIG. 11 is a schematic diagram showing an example of the configuration of a multiprocessor system according to Embodiment 3 of the present invention. The multiprocessor system shown in FIG. 11 consists of a personal computer PC and, for example, an image processing board BD connected to a PCI Express slot of the PC. The PC includes a processor CPU1, a system memory MEM1, a DMA controller DMAC1, a controller CTL1, and the like, and can communicate with the board BD from CTL1 through the slot.

The board BD carries a microcomputer MC that includes a processor CPU2 and a DMA controller DMAC2, a system memory MEM2 for the microcomputer, the data relay device PCIC described in Embodiments 1 and 2, a controller CTL2, and the like. The PCIC is connected to the microcomputer MC by the local bus LBUS1 and to the controller CTL2 by the local bus LBUS2. CTL2 has, for example, a PCI Express communication function, and the board BD can communicate with the PC through CTL2. Although not limited to these values, LBUS1 operates at, for example, 66 MHz and LBUS2 at, for example, 50 MHz.

In such a multiprocessor system, for example, relatively large image data processed by CPU1 of the PC is sent to the board BD, the transferred image data is processed by the microcomputer MC on the board BD, and the result is returned to the PC; this sequence is repeated. How far the data transfer efficiency between system memory MEM1 in the PC and system memory MEM2 on the board BD can be raised is therefore important in determining image processing performance. In recent years, with the spread of mobile terminals and the like, miniaturization of the board BD has also become an important factor. Mounting the data relay device PCIC described above on the board BD helps meet both of these requirements.

The invention made by the present inventor has been described above in concrete terms based on the embodiments; however, the present invention is not limited to those embodiments, and various modifications can be made without departing from its gist.

For example, FIG. 2 shows a configuration that uses microcomputers, but a microcomputer is of course not required; the processor CPU and the DMA controller DMAC may be separate chips, as in a personal computer. Likewise, although FIG. 2 shows the system memory MEM as external memory, it may of course be memory built into the microcomputer. That is, it suffices that the processor CPU, the DMA controller DMAC, and the system memory MEM are connected by a single bus, or by a plurality of buses via a bus controller or the like, that the CPU can configure and start the DMAC, and that the DMAC can access the MEM.

The multiprocessor system according to the present embodiments is widely applicable to multiprocessor systems in general that include a plurality of processors and a plurality of DMA controllers, typified by image processing systems in which the processors perform the desired processing while frequently exchanging data with one another.

FIG. 1 is a block diagram showing an example of the configuration of a comparative-example multiprocessor system studied as a premise of the present invention.
FIG. 2 is a block diagram showing an example of the configuration of a multiprocessor system according to Embodiment 1 of the present invention.
FIG. 3 is a block diagram showing a detailed configuration example of the data relay device in FIG. 2.
FIG. 4 is a block diagram showing a detailed configuration example of the counter unit in FIG. 3.
FIG. 5 shows an operation example of the read control unit and the write control unit in FIG. 4, where (a) is an operation explanatory diagram of the read control unit and (b) is an operation explanatory diagram of the write control unit.
FIG. 6 is an explanatory diagram showing an example of the operation outline of the multiprocessor system of FIG. 2.
FIG. 7 is an explanatory diagram showing an example of the logical address assignment of each buffer in the data relay device of FIG. 3.
FIG. 8 is an operation explanatory diagram supplementing FIG. 6.
FIG. 9 is an explanatory diagram showing an operation example of the data relay device of FIG. 3.
FIG. 10 is a block diagram showing a detailed configuration example of the data relay device of FIG. 2 in a multiprocessor system according to Embodiment 2 of the present invention.
FIG. 11 is a schematic diagram showing an example of the configuration of a multiprocessor system according to Embodiment 3 of the present invention.
FIG. 12 is a block diagram showing an example of the configuration of a prior-art multiprocessor system studied as a premise of the present invention.

Explanation of symbols

MC Microcomputer
CPU Processor
DMAC DMA controller
MEM System memory
LBUS Local bus
DPRAM Dual-port memory
PCIC Data relay device
MBUS Memory bus
WCTL Write control unit
RCTL Read control unit
BF, BFA, BFB Buffer
CUNT Counter unit
WCNT Write counter
RCNT Read counter
SM State machine
RWCTL Read/write control unit
CTL Controller
PC Personal computer
BD Board

Claims

A first bus and a second bus;
A first processor and a second processor;
A first memory accessed by the first processor;
A first data transfer control unit that, according to settings from the first processor, accesses the first memory without going through the first processor and controls transfer of the read data to the first bus or transfer of write data from the first bus to the first memory;
A second memory accessed by the second processor;
A second data transfer control unit that, according to settings from the second processor, accesses the second memory without going through the second processor and controls transfer of the read data to the second bus or transfer of write data from the second bus to the second memory;
A data relay unit including a data buffer connected to the first bus and the second bus and including N (N ≧ 2) storage areas;
The data relay unit includes:
A buffer management unit that manages whether each of the N storage areas is full, corresponding to having been written, or empty, corresponding to having been read;
A write control unit that outputs an enable signal to the first or second data transfer control unit according to whether the N storage areas are empty, and sequentially writes the data transferred from the first memory to the first bus, or the data transferred from the second memory to the second bus, into the N storage areas in order from the first storage area to the Nth storage area; and
A read control unit that outputs an enable signal to the second or first data transfer control unit according to whether the N storage areas are full, sequentially reads the data written in the N storage areas in order from the first storage area to the Nth storage area, and transfers the read data to the second bus or the first bus,
The write control unit, when writing to the Nth storage area is complete and the first storage area is empty, returns to the first storage area and writes the data that follows the data in the Nth storage area, and
The read control unit, when reading of the Nth storage area is complete and the first storage area is full, returns to the first storage area and reads.

The multiprocessor system of claim 1, wherein
The data relay unit includes:
First and second data buffers serving as the data buffer;
First and second buffer management units serving as the buffer management unit;
First and second write control units serving as the write control unit; and
First and second read control units serving as the read control unit;
The first buffer management unit manages the N storage areas of the first data buffer;
The second buffer management unit manages the N storage areas of the second data buffer;
The first write control unit outputs an enable signal to the first data transfer control unit according to whether the N storage areas in the first data buffer are empty, and writes the data transferred from the first memory to the first bus into the first data buffer;
The first read control unit outputs an enable signal to the second data transfer control unit according to whether the N storage areas in the first data buffer are full, and transfers the data written in the first data buffer to the second bus;
The second write control unit outputs an enable signal to the second data transfer control unit according to whether the N storage areas in the second data buffer are empty, and writes the data transferred from the second memory to the second bus into the second data buffer; and
The second read control unit outputs an enable signal to the first data transfer control unit according to whether the N storage areas in the second data buffer are full, and transfers the data written in the second data buffer to the first bus.

The multiprocessor system of claim 1, wherein
The N storage areas are two storage areas.

The multiprocessor system according to claim 3, wherein
The two storage areas are realized by two storage circuits formed independently of each other.