JPH0496867A

JPH0496867A - Memory bank access system and vector arithmetic unit applied to the same system

Info

Publication number: JPH0496867A
Application number: JP21438690A
Authority: JP
Inventors: Koichi Kuroiwa; 功一黒岩
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1990-08-15
Filing date: 1990-08-15
Publication date: 1992-03-30

Abstract

PURPOSE:To reduce an area occupied by a decoder circuit, and to promote a miniaturization and a high integration by sharing the plural memory banks of the decoder circuit. CONSTITUTION:A decoder circuit 13 which decodes an address signal, latch circuit 11 which latches the output signal of this circuit 13, and memory cell array 12 connected with a word line WL1 to which the output from the circuit 11 is supplied, are provided at a first memory bank 1. And also, latch circuits 21 - 41 which latch the output signals of the latch circuits at the memory banks at the previous stage, and memory cell arrays 22 - 42 connected with word lines WL2 - WL4 to which the outputs of these circuits 21 - 41 are supplied, are provided at each of memory banks 2 - 4 after the second. Then, the output of the decoder circuit 13 provided at the first memory bank 1 is delayed in every one cycle by the latch circuit provided at each memory bank, and the memory cell array in each memory bank is simultaneously performed access of by the address which is different in every one cycle.

Description

【発明の詳細な説明】〔概　要〕ベクトル処理に適した高速のメモリバンクアクセス方式
および該方式を適用したベクトル演算装置に関し、デコーダ回路を複数のメモリバンクで共有化することに
よって、デコーダ回路の占有する面積を減少して小型化
および高集積化を推進することを目的とし、複数のメモリバンクを備え、該複数のメモリノ〈ンクに
格納されているベクトルデータをアクセスしてパイプラ
イン処理を行うベクトル演算装置であって、アドレス信
号をデコードするデコーダ回路と、該デコーダ回路の出
力信号をラッチするラッチ回路と、該ラッチ回路の出力
が供給されるワード線に接続されたメモリセルアレイと
を具備する１番目のメモリバンクと、前段のメモリノ＼
ンクにおけるラッチ回路の出力信号をラッチするラッチ
回路と、当該ラッチ回路の出力が供給されるワード線に
接続されたメモリセルアレイとを具備する２番目以鋒の
メモリバンクとを備え、前記１番目のメモリバンクに設
けられたデコーダ回路の出力を各メモリバンクに設けら
れたう・ノチ回路で１サイクルずつ遅延させ、咳各メモ
リバンク内のメモリセルアレイを順次１サイクルずつ異
なるアドレスで同時にアクセスするように構成する。[Detailed Description of the Invention] [Summary] Regarding a high-speed memory bank access method suitable for vector processing and a vector arithmetic device applying the method, the decoder circuit is shared by a plurality of memory banks. The purpose is to promote miniaturization and high integration by reducing the area occupied, and it is equipped with multiple memory banks and performs pipeline processing by accessing vector data stored in the multiple memory nodes. A vector arithmetic device, comprising a decoder circuit that decodes an address signal, a latch circuit that latches an output signal of the decoder circuit, and a memory cell array connected to a word line to which the output of the latch circuit is supplied. The first memory bank and the previous memory bank\
a second or subsequent memory bank comprising a latch circuit that latches an output signal of the latch circuit in the first bank; and a memory cell array connected to a word line to which the output of the latch circuit is supplied; The output of the decoder circuit provided in the memory bank is delayed one cycle at a time by the circuit provided in each memory bank, and the memory cell array in each memory bank is accessed simultaneously at different addresses one cycle at a time. Configure.

[Industrial application field]

本発明は、ベクトル処理におけるメモリバンクアクセス
技術に関し、特に、ベクトル処理に適した高速のメモリ
バンクアクセス方式および該方式を適用したベクトル演
算装置に関する。The present invention relates to memory bank access technology in vector processing, and in particular to a high-speed memory bank access method suitable for vector processing and a vector arithmetic device applying the method.

近年、計算機処理の高速化の要求は強く、ベクトル演算
処理も大型汎用コンピュータからワークステーションや
パーソナルコンピュータへと適用分野が広がっている。In recent years, there has been a strong demand for faster computer processing, and the field of application of vector arithmetic processing has expanded from large general-purpose computers to workstations and personal computers.

そこで、ベクトル演算装置の小型化および高集積化が要
望されている。Therefore, there is a demand for vector arithmetic devices to be smaller and more highly integrated.

〔従来の技術］第６図は従来のベクトル演算装置の一例を示すブロック
図である。同図に示されるように、従来のベクトル演算
装置は、複数のメモリバンク１０１〜１０４を備え、各
メモリバンク１０１〜１０４には、それぞれラッチ回路
１１１〜１４１．メモリセルアレイ１１２〜１４２およ
びデコーダ回路１１３〜１４３が設けられている。[Prior Art] FIG. 6 is a block diagram showing an example of a conventional vector calculation device. As shown in the figure, the conventional vector arithmetic device includes a plurality of memory banks 101-104, and each memory bank 101-104 has latch circuits 111-141. Memory cell arrays 112-142 and decoder circuits 113-143 are provided.

ところで、ベクトル処理を行う場合、全てのメモリバン
ク１０１〜１０４から１サイクル毎にデータをアクセス
する必要があるため、各メモリノ＼ンク１０１〜１０４
に対してそれぞれ１サイクルずつアドレスを入力する必
要がある。さらに、各メモリノ〈ンク１０１〜１０４に
は、それぞれデコーダ回路１１３〜１４３が設けられて
いて、１サイクルずつずれた共通のアドレスを各メモリ
バンク１０１〜１０４毎にデコードしてメモリセルアレ
４１１２〜１４２をアクセスするようになっている。By the way, when performing vector processing, it is necessary to access data from all memory banks 101 to 104 every cycle.
It is necessary to input an address for each cycle one cycle at a time. Further, each memory node 101-104 is provided with a decoder circuit 113-143, respectively, which decodes a common address shifted by one cycle for each memory cell array 101-104. is designed to be accessed.

（発明が解決しようとする課題〕上述したように、第６図に示す従来のベクトル演算装置
において、各メモリノ＼ンク１０１〜１０４に（よ、そ
れぞれ専用のデコーダ回路１１３〜１４３が設けられて
いて、これらのデコーダ回路１１３〜１４３に対してそ
れぞれ１サイクルずつアドレス信号を供給しなければな
らない。そして、メモリバンク１０１〜１０４に設けら
れたデコーダ回路１１３〜１４３が占有する面積はベク
トル演算装置の小型化および高集積化の妨げとなってい
る。(Problems to be Solved by the Invention) As described above, in the conventional vector arithmetic device shown in FIG. , it is necessary to supply one cycle of address signals to each of these decoder circuits 113 to 143.The area occupied by the decoder circuits 113 to 143 provided in the memory banks 101 to 104 is limited to the size of a small vector arithmetic device. This is hindering the development of high-density and high-density integration.

本発明は、上述した従来のベクトル演算装置が有する課
題に鑑み、デコーダ回路を複数のメモリバンクで共有化
することによって、デコーダ回路の占有する面積を減少
して小型化および高集積化を推進することを目的とする
。In view of the above-mentioned problems with conventional vector arithmetic devices, the present invention reduces the area occupied by the decoder circuit by sharing the decoder circuit with a plurality of memory banks, thereby promoting miniaturization and high integration. The purpose is to

〔課題を解決するための手段］本発明によれば、複数のメモリバンク１〜４を備え、該
複数のメモリバンクに格納されているベクトルデータを
アクセスしてパイプライン処理を行うベクトル演算装置
であって、アドレス信号をデコードするデコーダ回路１
３と、該デコーダ回路１３の出力信号をラッチするラッ
チ回路１１と、該ラッチ回路１１の出力が供給されるワ
ード線畦、に接続されたメモリセルアレイ１２とを具備
する１番目のメモリバンク１と、前段のメモリバンクに
おけるラッチ回路の出力信号をラッチするラッチ回路２
１〜４１と、当該ラッチ回路２１〜４１の出力が供給さ
れるワード線−Ｌ２〜−Ｌ４に接続されたメモリセルア
レイ２２〜４２とを具備する２番目以降のメモリバンク
２〜４とを備え、前記１番目のメモリバンク１に設けら
れたデコーダ回路１３の出力を各メモリバンクに設けら
れたラッチ回路で１サイクルずつ遅延させ、該各メモリ
バンク内のメモリセルアレイを順次１サイクルずつ異な
るアドレスで同時にアクセスするようにしたことを特徴
とするベクトル演算装置が捉供される。[Means for Solving the Problems] According to the present invention, a vector arithmetic device includes a plurality of memory banks 1 to 4 and performs pipeline processing by accessing vector data stored in the plurality of memory banks. There is a decoder circuit 1 that decodes the address signal.
3, a latch circuit 11 that latches the output signal of the decoder circuit 13, and a memory cell array 12 connected to the word line ridge to which the output of the latch circuit 11 is supplied. , a latch circuit 2 that latches the output signal of the latch circuit in the previous stage memory bank.
1 to 41 and memory cell arrays 22 to 42 connected to word lines -L2 to -L4 to which the outputs of the latch circuits 21 to 41 are supplied, The output of the decoder circuit 13 provided in the first memory bank 1 is delayed one cycle at a time by a latch circuit provided in each memory bank, and the memory cell arrays in each memory bank are sequentially moved one cycle at different addresses at the same time. A vector arithmetic device is provided which is characterized in that it is configured to access.

[For production]

本発明のベクトル演算装置によれば、１番目のメモリバ
ンク１には、アドレス信号をデコードするデコーダ回路
１３と、該デコーダ回路１３の出力信号をラッチするラ
ッチ回路１１と、該ラッチ回路１１の出力が供給される
ワード線すし、に接続されたメモリセルアレイ１２とが
設けられ、また、２番目以降のメモリバンク２〜４には
、それぞれ前段のメモリバンクにおけるラッチ回路の出
力信号をラッチするラッチ回路２１〜４１と、当該ラッ
チ回路２１〜４１の出力が供給されるワード線−Ｌ２〜
−Ｌ４に接続されたメモリセルアレイ２２〜４２とが設
けられている。そして、１番目のメモリバンク１に設け
られたデコーダ回路１３の出力が各メモリバンクに設け
られたラッチ回路で１サイクルずつ遅延され、該各メモ
リバンク内のメモリセルアレイが順次１サイクルずつ異
なるアドレスで同時にアクセスされるようになっている
。このように、本発明のベクトル演算装置は、デコーダ
回路を複数のメモリバンクで共有化することによって、
デコーダ回路の占有する面積を減少して小型化および高
集積化を推進することができる。According to the vector arithmetic device of the present invention, the first memory bank 1 includes a decoder circuit 13 that decodes an address signal, a latch circuit 11 that latches an output signal of the decoder circuit 13, and an output signal of the latch circuit 11. A memory cell array 12 connected to the word line Sushi is provided, and each of the second and subsequent memory banks 2 to 4 includes a latch circuit that latches the output signal of the latch circuit in the previous memory bank. 21 to 41, and word lines -L2 to which the outputs of the latch circuits 21 to 41 are supplied.
-Memory cell arrays 22 to 42 connected to L4 are provided. Then, the output of the decoder circuit 13 provided in the first memory bank 1 is delayed by one cycle by the latch circuit provided in each memory bank, and the memory cell array in each memory bank is sequentially read at a different address by one cycle. They are accessed simultaneously. In this way, the vector arithmetic device of the present invention shares the decoder circuit with a plurality of memory banks.
The area occupied by the decoder circuit can be reduced to promote miniaturization and high integration.

〔Example〕

以下、図面を参照して本発明に係るベクトル演算装置の
実施例を説明する。Hereinafter, embodiments of a vector calculation device according to the present invention will be described with reference to the drawings.

第１Ｍは本発明に係るベクトル演算装置の一実施例を示
すブロック図であり、４バンク構成のレイアウトイメー
ジを示すものである。1M is a block diagram showing an embodiment of the vector calculation device according to the present invention, and shows a layout image of a 4-bank configuration.

第１図に示されるように、本実施例のベクトル演算装置
は、複数のメモリバンク１〜４を備えている。１番目の
メモリバンク１には、ラッチ回路１１、メモリセルアレ
イ１２およびデコーダ回路１３が設けられているが、２
番目以降のメモリバンク２〜３には、ラッチ回路２１〜
４１およびメモリセルアレイ２２〜４２だけが設けられ
ていてデコーダ回路は設けられていない。そして、アド
レス信号は、１番目のメモリバンク１におけるデコーダ
回路１３にだけ供給され、２番目以降のメモリバンク２
〜３では、１番目のメモリバンクｌにおけるテ゛コーダ
回路１３の出力信号を使用するようになっている。As shown in FIG. 1, the vector arithmetic device of this embodiment includes a plurality of memory banks 1-4. The first memory bank 1 is provided with a latch circuit 11, a memory cell array 12, and a decoder circuit 13.
The latch circuits 21 to 3 are provided in memory banks 2 to 3 after the memory bank 2 to 3.
41 and memory cell arrays 22 to 42 are provided, and no decoder circuit is provided. Then, the address signal is supplied only to the decoder circuit 13 in the first memory bank 1, and the address signal is supplied only to the decoder circuit 13 in the first memory bank 1.
3, the output signal of the coder circuit 13 in the first memory bank l is used.

まず、デコーダ回路１３の出力信号は、１番目のメモリ
バンク１のラッチ回路１１を介して１番目のメモリバン
ク１のメモリセルアレイ１２に供給され、さらに、ラッ
チ回路１１の出力信号はメモリセルアレイ１２のワード
線−り、を経て２番目のメモリバンク２のラッチ回路２
１を介して２番目のメモリバンク２のメモリセルアレイ
２２に供給される。同様に、ラッチ回路２１の出力信号
はメモリセルアレイ２２のワード線−Ｌ２を経て３番目
のメモリバンク３のラッチ回路３Ｉを介して３番目のメ
モリバンク３のメモリセルアレイ３２に供給され、また
、ラッチ回路３１の出力信号はメモリセルアレイ３２の
ワードＭＷＩ、。First, the output signal of the decoder circuit 13 is supplied to the memory cell array 12 of the first memory bank 1 via the latch circuit 11 of the first memory bank 1; Latch circuit 2 of second memory bank 2 via word line
1 to the memory cell array 22 of the second memory bank 2. Similarly, the output signal of the latch circuit 21 is supplied to the memory cell array 32 of the third memory bank 3 via the word line -L2 of the memory cell array 22 and the latch circuit 3I of the third memory bank 3. The output signal of the circuit 31 is the word MWI of the memory cell array 32.

を経て４番目のメモリバンク４０ラッチ回路４１を介し
て４番目のメモリバンク４のメモリセルアレイ４２に供
給される。ここで、各メモリバンク１〜４におけるラッ
チ回路１１〜４１にはそれぞれクロック信号ＣＬＫが供
給されるようになっていて、該ラッチ回路１１〜４１に
おいてデコーダ回路１３の出力および前段のラッチ回路
１１〜３１の出力がそれぞれ１サイクル分だけ遅延され
て対応するメモリセルアレイ１２〜４２に供給されるよ
うになっている。The signal is then supplied to the memory cell array 42 of the fourth memory bank 4 via the latch circuit 41 of the fourth memory bank 40. Here, the latch circuits 11 to 41 in each memory bank 1 to 4 are supplied with a clock signal CLK, respectively, and in the latch circuits 11 to 41, the output of the decoder circuit 13 and the previous stage latch circuits 11 to 41 are supplied with a clock signal CLK. The outputs of 31 are each delayed by one cycle and supplied to the corresponding memory cell arrays 12-42.

すなわち、メモリバンク１のメモリセルアレイ１２は、
ラッチ回路１１により１サイクル分だけ遅延すしたデコ
ーダ回路１３の出力信号によりアクセスされ、また、メ
モリバンク２のメモリセルアレイ２２は、ラッチ回路１
１および２１により２サイクル分だけ遅延されたデコー
ダ回路１３の出力信号によりアクセスされる。同様に、
メモリバンク３のメモリセルアレイ３２は、ラッチ回路
ＩＩ、２１および３１により３サイクル分だけ遅延され
たデコーダ回路１３の出力信号によりアクセスされ、そ
して、メモリバンク４のメモリセルアレイ２２は、ラッ
チ回路１１゜２１．３１および４１により４サイクル分
だけ遅延されたデコーダ回路１３の出力信号によりアク
セスされる。従って、各メモリバンク１〜４内のメモリ
セルアレイ１１〜４１は、順次１サイクルずつ異なるア
ドレスで、換言すると、前段のメモリバンクをアクセス
しているアドレスが１サイクル遅れて次段のメモリバン
クに供給されて複数のメモリバンクが同時にアクセスさ
れ、複数のメモリバンクで並列的に続出／書込処理が実
行されることになる。That is, the memory cell array 12 of the memory bank 1 is
The memory cell array 22 of the memory bank 2 is accessed by the output signal of the decoder circuit 13 delayed by one cycle by the latch circuit 11.
1 and 21, the output signal of the decoder circuit 13 is delayed by two cycles. Similarly,
The memory cell array 32 of the memory bank 3 is accessed by the output signal of the decoder circuit 13 delayed by three cycles by the latch circuits II, 21 and 31, and the memory cell array 22 of the memory bank 4 is accessed by the output signal of the decoder circuit 13 delayed by three cycles by the latch circuits II, 21 and 31. It is accessed by the output signal of the decoder circuit 13 delayed by 4 cycles by .31 and 41. Therefore, the memory cell arrays 11 to 41 in each memory bank 1 to 4 have different addresses by one cycle sequentially. In other words, the address accessing the previous memory bank is supplied to the next memory bank with a one cycle delay. As a result, a plurality of memory banks are accessed simultaneously, and successive access/write processing is executed in parallel in a plurality of memory banks.

第２図は第１図のベクトル演算装置の要部を示す回路図
である。同図に示されるように、１番目のメモリバンク
１におけるデコーダ回路１３は、０本のアドレス入力（
アドレス信号）を受け、２″本の出力のうち１本をアク
ティブにする回路である。ここで、デコーダ回路１３は
、例えば、４つのインバータおよび４つのＮＯＲゲート
を有し、また、アドレス入力が２本で４本の出力線の内
の１本を選択するように構成されているが、様々な構成
のものを使用することができる。FIG. 2 is a circuit diagram showing essential parts of the vector calculation device shown in FIG. 1. As shown in the figure, the decoder circuit 13 in the first memory bank 1 has 0 address inputs (
The decoder circuit 13 is a circuit that receives an address signal (address signal) and activates one of the 2'' outputs.Here, the decoder circuit 13 has, for example, four inverters and four NOR gates, and the address input is Although the two output lines are configured to select one of the four output lines, various configurations can be used.

ラッチ回路１１．２Ｈ３１，４１）は、フリップフロッ
プで構成され、それぞれクロック信号ＣＬＫが供給され
てクロック信号ＣＬＫの変化点（立ち上がりエツジ）で
データを保持して各メモリセルアレイ１２．２２（３２
，４２）をアクセスするようになっている。そして、メ
モリセルアレイ１２．２２　（３２，４２）は、対応す
るラッチ回路ＬＬ２１（３１，４１）から出力された信
号（ワード線ＷＬ＋、ＷＬｚ（ＷＬｚＪＬａ））により
該メモリセルアレイ１２．２２（３２，４２）内の個々
のメモリセルをアクセスし、読み出し或いは書き込み処
理を行う（ビット線ＢＬ＋　（：ＢＩＴ、　ＢＩＴ”）
、ＢＬ２（ＢＬ３．ＢＬ４）　）ようになっている。The latch circuits 11.2H31, 41) are each configured with a flip-flop, are supplied with the clock signal CLK, hold data at the change point (rising edge) of the clock signal CLK, and hold the data at each memory cell array 12.22 (32H).
, 42). Then, the memory cell array 12.22 (32, 42) is activated by the signal (word line WL+, WLz (WLzJLa)) output from the corresponding latch circuit LL21 (31, 41). ) to perform read or write processing (bit line BL+ (:BIT, BIT")
, BL2 (BL3.BL4) ).

第３図は第１図のベクトル演算装置の動作を説明するた
めの図である。第１図および第３図に示されるように、
まず、クロック信号ＣＬＫの立ち上がりエツジＣ１にお
いて、アドレス信号ＡＤＯが１番目のメモリバンク１に
おけるデコーダ回路１３に取り込まれてデコードされる
。FIG. 3 is a diagram for explaining the operation of the vector arithmetic device shown in FIG. 1. As shown in Figures 1 and 3,
First, at the rising edge C1 of the clock signal CLK, the address signal ADO is taken into the decoder circuit 13 in the first memory bank 1 and decoded.

次に、クロック信号ＣＬＫの立ち上がりエツジＣ２にお
いて、アドレス信号ＭＤＩがデコーダ回路１３に取り込
まれてデコードされる。この時、デコーダ回路１３の出
力信号ＢＤＯ（アドレスＡＤＯをデコードした信号）が
メモリバンク１のラッチ回路１工にラッチされる。そし
て、ラッチ回路１１から信号ＢＤＯが１番目のメモリバ
ンク１におけるメモリセルアレイ１２に供給されてアド
レスＡＤＯに対応したメモリセルに対してデータの入出
力（書込／読出）処理が行われる。また、ラッチ回路１
１の出力信号（ＢＤＯ）はメモリセルアレイ１２のワー
ド線−Ｌｌ　を介して２番目のメモリバンク２における
ラッチ回路２１にも供給される。この出力信号ＢＤＯは
、クロック信号ＣＬＫのさらに次の立ち上がり工・ッジ
Ｃ３においてラッチ回路２１にラッチされる。Next, at the rising edge C2 of the clock signal CLK, the address signal MDI is taken into the decoder circuit 13 and decoded. At this time, the output signal BDO (signal obtained by decoding address ADO) of the decoder circuit 13 is latched by the latch circuit 1 of the memory bank 1. Then, the signal BDO is supplied from the latch circuit 11 to the memory cell array 12 in the first memory bank 1, and data input/output (write/read) processing is performed on the memory cell corresponding to the address ADO. In addition, latch circuit 1
1 output signal (BDO) is also supplied to the latch circuit 21 in the second memory bank 2 via the word line -Ll of the memory cell array 12. This output signal BDO is latched by the latch circuit 21 at the next rising edge C3 of the clock signal CLK.

さらに、クロック信号ＣＬＫの立ち上がりエツジＣ３に
おいて、アドレス信号ＡＤ２がデコーダ回路１３に取り
込まれてデコードされる。この時、デコーダ回路１３の
出力信号ＢＤＩがメモリバンク１のラッチ回路１１にラ
ッチされ、メモリセルアレイ１２のアドレスＡＤＩに対
応したメモリセルに対してデータの入出力処理が行われ
る。また、２番目のメモリバンク２では、ラッチ回路２
１からの出力信号ＢＤＯによって、２番目のメモリバン
ク２におけるメモリセルアレイ２２のアドレスＡＤＯに
対応したメモリセルに対してデータの入出力処理が行わ
れる。ここで、ラッチ回路２１の出力信号（ＢＤＯ）は
メモリセルアレイ２２のワード線孔２を介して３番目の
メモリバンク３におけるラッチ回路３１にも供給される
。Further, at the rising edge C3 of the clock signal CLK, the address signal AD2 is taken into the decoder circuit 13 and decoded. At this time, the output signal BDI of the decoder circuit 13 is latched by the latch circuit 11 of the memory bank 1, and data input/output processing is performed to the memory cell corresponding to the address ADI of the memory cell array 12. In addition, in the second memory bank 2, the latch circuit 2
Data input/output processing is performed on the memory cell corresponding to the address ADO of the memory cell array 22 in the second memory bank 2 by the output signal BDO from the second memory bank 2 . Here, the output signal (BDO) of the latch circuit 21 is also supplied to the latch circuit 31 in the third memory bank 3 via the word line hole 2 of the memory cell array 22.

この出力信号ＢＤＯは、クロック信号ＣＬＫのさらに次
の立ち上がりエツジＣ４においてラッチ回路３１にラッ
チされることになる。以下このような動作が命令終了ま
で行われる。メモリセルアレイの入出力（ビット線）か
らみるとパイプライン処理されることになる。This output signal BDO is latched by the latch circuit 31 at the next rising edge C4 of the clock signal CLK. Thereafter, such operations are performed until the instruction ends. From the perspective of the input/output (bit line) of the memory cell array, pipeline processing is performed.

このように、各メモリバンク１〜４内のメモリセルアレ
イ１１〜４１は、順次１サイクルずつ異なるアドレスに
よって、並列的にアクセスされることになる。第３図で
は、クロック信号ＣＬＫの立ち上がりエツジＣ５より後
において、４ノ＼ンク全てが並列的に動作することにな
る。In this way, the memory cell arrays 11 to 41 in each memory bank 1 to 4 are sequentially accessed in parallel by different addresses one cycle at a time. In FIG. 3, all four nodes operate in parallel after the rising edge C5 of the clock signal CLK.

第４図は本発明のベクトル演算装置の他の実施例を示す
ブロック図である。FIG. 4 is a block diagram showing another embodiment of the vector calculation device of the present invention.

第４図のベクトル演算装置は、前段のラッチ回路１１〜
３１の出力信号が専用の次段バンク接続線り。The vector arithmetic device in FIG.
The output signal of 31 is a dedicated next-stage bank connection line.

〜Ｌ３により次段のラッチ回路２１〜４１に供給される
ようになっている。すなわち、第１図のベクトル演算装
置においては、例えば、ラッチ回路１１の出力信号が１
番目のメモリバンク１におけるメモリセルアレイ１２の
ワード線−り、を介して２番目のメモリバンク２におけ
るラッチ回路２１に供給されているが、本実施例のベク
トル演算装置においては、例えば、ラッチ回路１１の出
力信号が１番目のメモリバンク１におけるメモリセルア
レイ１２を通ることなく専用の次段バンク接続線Ｌ＋を
介して２番目のメモリバンク２におけるラッチ回路２１
に供給されるようになっている。この第４図のベクトル
演算装置の他の構成および動作は、第１図のベクトル演
算装置と同様であり、説明を省略する。~L3, the signal is supplied to the latch circuits 21 to 41 at the next stage. That is, in the vector calculation device shown in FIG. 1, for example, the output signal of the latch circuit 11 is 1.
It is supplied to the latch circuit 21 in the second memory bank 2 via the word line of the memory cell array 12 in the second memory bank 1. The output signal is sent to the latch circuit 21 in the second memory bank 2 via the dedicated next-stage bank connection line L+ without passing through the memory cell array 12 in the first memory bank 1.
is being supplied to. The other configuration and operation of the vector arithmetic device shown in FIG. 4 are the same as those of the vector arithmetic device shown in FIG. 1, and their explanation will be omitted.

第５図はベクトル演算装置の全体的構成を概略的に示す
図であり、前述した第１図および第４図に示すベクトル
演算装置によりパイプライン処理が行われる様子を示す
ものである。FIG. 5 is a diagram schematically showing the overall configuration of the vector arithmetic device, and shows how pipeline processing is performed by the vector arithmetic device shown in FIGS. 1 and 4 described above.

第５図に示されるように、ベクトル演算装置は、メモリ
バンク１〜４から読み出された（Ｒ）データはバンクセ
レクタＢＳを介して、演算パイプラインＡ−Ｃに供給さ
れ、該演算パイプラインＡ−Ｃの出力がバンクセレクタ
ＢＳを介してメモリバンク１〜４へ書き込まれる（Ｗ）
ようになっている。As shown in FIG. 5, in the vector arithmetic device, (R) data read from memory banks 1 to 4 is supplied to arithmetic pipelines A to C via a bank selector BS, and the arithmetic pipelines The output of A-C is written to memory banks 1 to 4 via bank selector BS (W)
It looks like this.

すなわち、演算パイプラインＡ−Ｃは、例えば、加算１
乗算、減算等の演算処理を行うもので、各メモリバンク
１〜４から読み出されたデータがＢンクセレクタＢＳに
より順次切り換えられて演算パイプラインＡ〜Ｃへ供給
され、ベクトル処理が行われるようになっている。ここ
で、ベクトル処理とは、処理すべきデータが順序よく格
納されており、そのデータを順番に読み出してパイプラ
イン処理を行うことにより、データの処理速度を見かけ
上高速にみせる処理のことをいう。また、ベクトルデー
タは、１サイクルごとに読みだす必要があるが、その時
、他のバンクのアクセスも同時に行われるようになって
いる。そして、第１回および第４図に示すベクトル演算
装置では、ヘクトルレジスタ（メモリバンク１〜４）の
Ｒ／Ｗ４こ伴うアクセス方式において、アドレスデコー
ダ（デコーダ回路１３）が１つとされ、同一アドレスを
メモリバンク毎にずらして入力することにより、連続す
るベクトルデータのＲ／Ｗを可能とし、ノ々イフ。That is, the operation pipeline A-C is, for example, an addition 1
It performs arithmetic processing such as multiplication and subtraction, and the data read from each memory bank 1 to 4 is sequentially switched by the B link selector BS and supplied to the arithmetic pipelines A to C for vector processing. It has become. Here, vector processing refers to processing in which data to be processed is stored in an orderly manner, and the data is sequentially read out and pipeline processing is performed to make the data processing speed appear higher. Further, vector data needs to be read every cycle, but at that time, other banks are accessed at the same time. In the vector arithmetic device shown in Part 1 and FIG. 4, in the access method involving R/W4 of hector registers (memory banks 1 to 4), there is one address decoder (decoder circuit 13), and the same address is By shifting the input for each memory bank, it is possible to read/write continuous vector data, making it possible to read and write continuous vector data.

ライン処理を行えるようになっている。Line processing is now possible.

（発明の効果〕以上、詳述したように、本発明のベクトル演算装置は、
デコーダ回路を複数のノ＼ンクで共有化することによっ
て、デコーダ回路の占有する面積を減少して小型化およ
び高集積化を推進すること力くできる。(Effects of the Invention) As detailed above, the vector calculation device of the present invention has the following features:
By sharing the decoder circuit among a plurality of nodes, the area occupied by the decoder circuit can be reduced and miniaturization and high integration can be promoted.

[Brief explanation of the drawing]

第１図は本発明に係るベクトル演算装置の一実施例を示
すブロック図、第２図は第１図のベクトル演算装置の要部を示す回路図
、第３図は第１図のベクトル演算装置の動作を説明するた
めの図、第４図は本発明のベクトル演算装置の他の実施例を示す
ブロック図、第５図はベクトル演算装置の全体的構成を概略的に示す
図、第６図は従来のベクトル演算装置の一例を示すブロック
図である。（符号の説明）１〜４・・・バンク、１１〜４１・・・ラッチ回路、１２〜４２・・・メモリセルアレイ、１３・・・デコーダ回路、ＢＬ、−ＢＬ４・・・ワード線、ＷＬ２〜ＷＬ４・・・ワード線、Ｌ１〜Ｌ３・・・次段バンク接続線。本発明のベクトル演算装置の他の実施例を示すプロ・ツ
ク図第４図・、バンクセレクタＢＳベクトル演算装置の全体的構成を概略的に示す図第Ｓ図FIG. 1 is a block diagram showing an embodiment of the vector calculation device according to the present invention, FIG. 2 is a circuit diagram showing the main parts of the vector calculation device shown in FIG. 1, and FIG. 3 is the vector calculation device shown in FIG. 1. FIG. 4 is a block diagram showing another embodiment of the vector calculation device of the present invention; FIG. 5 is a diagram schematically showing the overall configuration of the vector calculation device; FIG. 6 1 is a block diagram showing an example of a conventional vector calculation device. (Explanation of symbols) 1-4...Bank, 11-41...Latch circuit, 12-42...Memory cell array, 13...Decoder circuit, BL, -BL4...Word line, WL2- WL4...word line, L1-L3...next stage bank connection line. FIG. 4 is a program diagram showing another embodiment of the vector arithmetic device of the present invention, and bank selector BS is a diagram S schematically showing the overall configuration of the vector arithmetic device.

Claims

[Claims] 1. A vector arithmetic device comprising a plurality of memory banks (1 to 4) and performing pipeline processing by accessing vector data stored in the plurality of memory banks, the device comprising: an address signal; a decoder circuit (13) for decoding the
A latch circuit (1) that latches the output signal of the decoder circuit.
1) and a word line (W) to which the output of the latch circuit is supplied.
a first memory bank (1) comprising a memory cell array (12) connected to the memory cell array (12), a latch circuit (21 to 41) that latches the output signal of the latch circuit in the previous memory bank, and the latch circuit. and memory cell arrays (22 to 42) connected to word lines (WL_2 to WL_4) to which the outputs of the second and subsequent memory banks (2 to 4) are connected to the first memory bank. The output of the provided decoder circuit is delayed one cycle at a time by a latch circuit provided in each memory bank,
A vector arithmetic device characterized in that memory cell arrays in each memory bank are sequentially accessed one cycle at a time using different addresses. 2. The output of the decoder circuit provided in the first memory bank is sequentially supplied to the latch circuit provided in the next stage memory bank via the latch circuit and the word line of the memory cell array. A vector calculation device according to claim 1. 3. The output of the decoder circuit provided in the first memory bank is sequentially connected to a latch circuit and a dedicated connection wiring (L_
2. The vector arithmetic device according to claim 1, wherein the vector arithmetic device is supplied to a latch circuit provided in a next-stage memory bank via L_3). 4. A memory bank access method that performs pipeline processing by accessing vector data stored in multiple memory banks, in which the output of a decoder circuit provided in the first memory bank is provided in each memory bank. A memory bank access method characterized in that the memory cell array in each memory bank is sequentially accessed one cycle at a time at different addresses by delaying one cycle at a time using a latch circuit. 5. The output of the decoder circuit provided in the first memory bank is sequentially supplied to the latch circuit provided in the next stage memory bank via the latch circuit and the word line of the memory cell array. The memory bank access method according to claim 4. 6. The output of the decoder circuit provided in the first memory bank is sequentially supplied to the latch circuit provided in the next stage memory bank via the latch circuit and dedicated connection wiring. The memory bank access method according to item 4.