JP6366103B2

JP6366103B2 - Semiconductor device and data output method

Info

Publication number: JP6366103B2
Application number: JP2015011809A
Authority: JP
Inventors: 良輔南; 次男高橋
Original assignee: NEC Platforms Ltd; NEC Corp
Current assignee: NEC Platforms Ltd; NEC Corp
Priority date: 2015-01-23
Filing date: 2015-01-23
Publication date: 2018-08-01
Anticipated expiration: 2035-01-23
Also published as: JP2016136366A

Description

本発明は、半導体装置及びデータ出力方法に関する。特に、データを記憶する半導体装置及びそのデータ出力方法に関する。 The present invention relates to a semiconductor device and a data output method. In particular, the present invention relates to a semiconductor device that stores data and a data output method thereof.

近年、プログラミング可能なＦＰＧＡ（Field-Programmable Gate Array）が様々な用途（アプリケーション）にて用いられている。大容量のＲＡＭ（Random Access Memory）ブロックが搭載された大規模なＦＰＧＡが市場に安定的に供給されており、このような最新のＦＰＧＡを内部に組み込んだ製品が数多く開発されている。 In recent years, programmable FPGAs (Field-Programmable Gate Arrays) have been used for various purposes (applications). Large-scale FPGAs equipped with a large-capacity RAM (Random Access Memory) block are stably supplied to the market, and many products incorporating such latest FPGAs have been developed.

また、通信技術の発展に伴い、ネットワークの通信容量の大容量化が著しい。このような環境の下、処理容量（処理能力）が１００Ｇｂｐｓに達する通信装置は既に実用化の段階にあり、より高速な装置（例えば、４００ｂｐｓの処理容量）の開発が始まっている。さらに、より一層の通信品質向上を目的として、１００Ｇｂｐｓの処理能力を確保しつつ、より複雑な処理を行うことで、ユーザの独自要望に添ったきめ細かいサービスを提供する装置の開発が始動しつつある。 In addition, with the development of communication technology, the increase in network communication capacity is remarkable. Under such an environment, a communication device whose processing capacity (processing capacity) reaches 100 Gbps has already been put into practical use, and development of a higher speed device (for example, processing capacity of 400 bps) has started. In addition, with the aim of further improving communication quality, development of devices that provide detailed services that meet the user's unique requirements by performing more complex processing while ensuring 100 Gbps processing capability is being started. .

このようなユーザ独自の機能を実現するため、通信装置等に要求される機能をＦＰＧＡにて実現することが多い。例えば、通信容量を１００Ｇｂｐｓと仮定する。この場合、１００Ｇｂｐｓの処理容量を実現する通信装置内部の信号バス帯域は、「信号バスビット数×システムクロック周波数≧１００Ｇｂｐｓ」という条件を満たす必要がある。例えば、信号バスビット数とシステムクロック周波数の関係は、以下のような組み合わせが用いられる場合が多い。
６４０ビット×１５６．２５ＭＨｚ（＝１００．０００Ｇｂｐｓ）
５１２ビット×３１２．５ＭＨｚ（＝１６０．０００Ｇｂｐｓ）
３２０ビット×３１２．５ＭＨｚ（＝１００．０００Ｇｂｐｓ） In order to realize such user-specific functions, functions required for communication devices and the like are often realized by FPGA. For example, assume that the communication capacity is 100 Gbps. In this case, the signal bus band inside the communication apparatus realizing the processing capacity of 100 Gbps needs to satisfy the condition “number of signal bus bits × system clock frequency ≧ 100 Gbps”. For example, the following combinations are often used for the relationship between the number of signal bus bits and the system clock frequency.
640 bits x 156.25 MHz (= 100.000 Gbps)
512 bits x 312.5 MHz (= 160.000 Gbps)
320 bits x 312.5 MHz (= 100.000 Gbps)

特に、イーサネット（登録商標；以下同じ）等の可変長パケットを扱う通信装置は、その内部のチップ間通信プロトコルとして、信号帯域に余裕があるインターラーケン（Ｉｎｔｅｒｌａｋｅｎ）インターフェース（５１２ビット×３１２．５ＭＨｚ＝１６０．０００Ｇｂｐｓ）を標準的に用いることが多い。 In particular, a communication apparatus that handles variable-length packets such as Ethernet (registered trademark; the same shall apply hereinafter) has an interlaken interface (512 bits × 312.5 MHz = 160) having a sufficient signal bandwidth as an internal chip communication protocol. .000 Gbps) is often used as standard.

特許文献１には、リードモディファイライト動作を高速に行なうことが可能な演算システムを提供する、と記載されている。リードモディファイライト動作とは、指定されたアドレスのデータを読み出し、当該読み出したデータの指定ビットを加工し、加工されたデータを元のアドレスに書き戻す動作である。特許文献１が開示する技術では、ＡＬＵ（Arithmetic Logic Unit）が、データを複数部分に分けて演算を行い、複数部分のデータに対応するバンク０、バンク１のメモリを用意する。その上で、バンク０がＡＬＵの演算結果を書き込んでいる際に、バンク１のデータ読み出しが行われる。 Patent Document 1 describes that an arithmetic system capable of performing a read-modify-write operation at high speed is provided. The read modify write operation is an operation of reading data at a specified address, processing a specified bit of the read data, and writing the processed data back to the original address. In the technology disclosed in Patent Document 1, an ALU (Arithmetic Logic Unit) performs calculation by dividing data into a plurality of parts, and prepares memories in banks 0 and 1 corresponding to the data of the plurality of parts. In addition, when the bank 0 is writing the calculation result of the ALU, the data reading of the bank 1 is performed.

特開２００６−２９３５３８号公報JP 2006-293538 A

なお、上記先行技術文献の開示を、本書に引用をもって繰り込むものとする。以下の分析は、本発明者らによってなされたものである。 The disclosure of the above prior art document is incorporated herein by reference. The following analysis was made by the present inventors.

通信装置内部のチップ間通信プロトコルに、インターラーケンインターフェース（５１２ビットパラレル）が採用され、通信装置内部の各モジュール（例えば、ＦＰＧＡを用いて実装された演算モジュール）が、１つのパケットを処理するのに許容される時間を考察する。イーサネット上の最小フレームサイズは、６４バイト（６４×８＝５１２ビット）と規定されている。従って、インターラーケンインターフェースを採用し、パケットの授受を行うＦＰＧＡ等は、６４バイトのパケット（１パケット）を１クロックにて処理する必要がある。上記の例では、１パケット（６４バイト）を処理するのに２クロック要したのでは、処理容量は８０Ｇｂｐｓとなり必要な処理容量を確保できないためである。 An interlaken interface (512-bit parallel) is adopted as a communication protocol between chips in the communication device, and each module (for example, an arithmetic module implemented using FPGA) in the communication device processes one packet. Consider the time allowed. The minimum frame size on Ethernet is defined as 64 bytes (64 × 8 = 512 bits). Therefore, an FPGA or the like that employs an interlaken interface and transmits / receives a packet needs to process a 64-byte packet (one packet) in one clock. In the above example, if two clocks are required to process one packet (64 bytes), the processing capacity is 80 Gbps, and the necessary processing capacity cannot be secured.

このように、通信装置等に実装されたＦＰＧＡは、大容量のデータを高速に処理することが望まれ、１つのパケットを処理するのに許容されるクロック数は１である。以下、１つのパケットを処理するのに許容されるクロック数が１という前提条件の下、ＦＰＧＡに生じる問題を具体的に説明する。その際、図２２に示すパケットモニタ回路を例に取り、問題点を説明する。パケットモニタ回路とは、通信装置が受信したパケット数を積算カウントする回路である。 As described above, an FPGA mounted on a communication device or the like is desired to process a large amount of data at high speed, and the number of clocks allowed to process one packet is one. Hereinafter, a problem that occurs in the FPGA will be specifically described under the premise that the number of clocks allowed to process one packet is one. At that time, the problem will be described by taking the packet monitor circuit shown in FIG. 22 as an example. The packet monitor circuit is a circuit that counts the number of packets received by the communication device.

パケットに複数のグループに振り分けられ、グループ単位にてパケットを積算カウントするための回路が図２２に示されるパケットモニタ回路である。また、各グループのパケットはランダムに入力され、同一のグループに属するパケットが連続して入力される場合も存在する。図２２において、パケットは、グループごとに用意された積算カウント回路８０−１〜８０−ｎ（ｎは正の整数、以下同じ）と、グループ選択回路８１と、に入力される。 A packet monitor circuit shown in FIG. 22 is a circuit that distributes packets into a plurality of groups and counts the packets in groups. In addition, packets of each group are randomly input, and there are cases where packets belonging to the same group are continuously input. In FIG. 22, a packet is input to an integration count circuit 80-1 to 80-n (n is a positive integer, the same applies hereinafter) and a group selection circuit 81 prepared for each group.

積算カウント回路８０−１〜８０−ｎのそれぞれは、演算部８２と、フリップフロップ（ＦＦ；Flip-Flop）８３と、を備える。各演算部８２は、対応するグループのパケットが入力されると、フリップフロップ８３から読み出した値（積算パケット数）に１を加算し、その結果をフリップフロップ８３に出力する。グループ選択回路８１は、パケットのヘッダに格納されたアドレスに応じて、いずれのグループに対応する積算カウント回路８０−１〜８０−ｎを有効にするか決定し、対応するイネーブル信号ＧＲＰ＿ＥＮを活性化する（アクティブとする）。自回路に対応し、且つ、活性化したイネーブル信号ＧＲＰ＿ＥＮを受け取った積算カウント回路８０のフリップフロップ８３は、演算部８２が出力するデータを、選択回路８４に出力する。また、グループ選択回路８１は、パケットに含まれるアドレスに応じて、選択回路８４にグループ選択信号ＧＲＰ＿ＳＥＬを出力する。選択回路８４は、グループ選択信号ＧＲＰ＿ＳＥＬに応じて、外部に出力するデータを選択する。 Each of the integration count circuits 80-1 to 80-n includes a calculation unit 82 and a flip-flop (FF) 83. Each arithmetic unit 82, when a corresponding group of packets is input, adds 1 to the value read from the flip-flop 83 (the total number of packets) and outputs the result to the flip-flop 83. The group selection circuit 81 determines, depending on the address stored in the header of the packet, which of the group count circuits 80-1 to 80-n is to be enabled, and activates the corresponding enable signal GRP_EN. Yes (set as active). The flip-flop 83 of the integration count circuit 80 that corresponds to its own circuit and receives the activated enable signal GRP_EN outputs the data output from the calculation unit 82 to the selection circuit 84. The group selection circuit 81 outputs a group selection signal GRP_SEL to the selection circuit 84 in accordance with the address included in the packet. The selection circuit 84 selects data to be output to the outside according to the group selection signal GRP_SEL.

例えば、積算カウント回路８０−１はグループ１（ＧＲＰ１）に割り当てられているとする。パケットモニタ回路にグループ１に割り当てられたアドレスを含むパケットが入力されると、積算カウント回路８０−１の演算部８２は、フリップフロップ８３が出力する積算カウント値（ＯＬＤ＿ＤＡＴＡ）に１を加算する。演算部８２は、加算結果をフリップフロップ８３に出力する。次に、パケットモニタ回路に入力されるシステムクロック（図示せず）が１クロック進むと、グループ選択回路８１は、積算カウント回路８０−１に対応するイネーブル信号ＧＲＰ＿ＥＮとグループ選択信号ＧＲＰ＿ＳＥＬをアクティブにする。イネーブル信号ＧＲＰ＿ＥＮがアクティブとなることで、積算カウント回路８０−１のフリップフロップ８３は、１クロック前に演算部８２が出力するデータをデータ出力端子から出力する。グループ選択信号ＧＲＰ＿ＳＥＬにより選択回路８４のポート１に供給されたデータ（積算カウント回路８０−１の出力）が選択され、パケットモニタ回路の出力となる。 For example, it is assumed that the integration count circuit 80-1 is assigned to the group 1 (GRP1). When a packet including an address assigned to group 1 is input to the packet monitor circuit, the operation unit 82 of the integration count circuit 80-1 adds 1 to the integration count value (OLD_DATA) output from the flip-flop 83. The calculation unit 82 outputs the addition result to the flip-flop 83. Next, when a system clock (not shown) input to the packet monitor circuit advances by one clock, the group selection circuit 81 activates the enable signal GRP_EN and the group selection signal GRP_SEL corresponding to the integration count circuit 80-1. . When the enable signal GRP_EN becomes active, the flip-flop 83 of the integration count circuit 80-1 outputs the data output from the arithmetic unit 82 one clock before from the data output terminal. Data supplied to the port 1 of the selection circuit 84 (output of the integration count circuit 80-1) is selected by the group selection signal GRP_SEL and becomes the output of the packet monitor circuit.

以上のように、図２２に示す構成により受信パケットの積算カウントが可能となる。しかし、図２２に示す回路構成には、受信パケットを振り分ける際のグループの数が多くなると、回路規模が増大するという問題がある。グループの数と同じだけ積算カウント回路を用意する必要があるためである。 As described above, the configuration shown in FIG. 22 enables the cumulative counting of received packets. However, the circuit configuration shown in FIG. 22 has a problem in that the circuit scale increases as the number of groups when receiving packets are increased. This is because it is necessary to prepare as many integration count circuits as the number of groups.

そのため、ハードウェアとしてＲＡＭが実装されているＦＰＧＡにおいて、多数のグループに対応したパケット積算機能を実現する際には、演算回路（加算回路）を１回路実装し、演算結果はＲＡＭに格納することで回路規模を抑制する対応が取られることが多い。そこで、以下の手順によりパケットの積算カウント回路を実現することが検討された。
（１）受信パケットのグループに相当するアドレスを指定して、積算結果を保持するＲＡＭから、前パケットまでの積算値が読み出される（リード処理）。
（２）加算器にて、読み出した積算値に１が加算される（演算処理）。
（３）グループに相当するアドレスが指定され、演算結果（積算値）がＲＡＭにストアされる（ライト処理）。 Therefore, in the FPGA in which RAM is mounted as hardware, when realizing a packet integration function corresponding to a large number of groups, one arithmetic circuit (adder circuit) is mounted, and the calculation result is stored in the RAM. In many cases, countermeasures to reduce the circuit scale are taken. Accordingly, it has been studied to realize a packet counting circuit by the following procedure.
(1) The address corresponding to the group of received packets is designated, and the accumulated value up to the previous packet is read from the RAM holding the accumulated result (read process).
(2) The adder adds 1 to the read integrated value (arithmetic processing).
(3) An address corresponding to the group is designated, and the operation result (integrated value) is stored in the RAM (write process).

上記（１）〜（３）の動作を１クロックにて実行可能であれば、ＦＰＧＡに実装されたＲＡＭを用いてパケットの積算カウントが可能となる。即ち、ＦＰＧＡにはＲＡＭがハードウェアとしてチップに組み込み済であるので、リソースの有効利用が行える。 If the operations (1) to (3) can be executed in one clock, the packet can be counted using the RAM mounted on the FPGA. That is, since the RAM is already built in the chip as hardware in the FPGA, resources can be used effectively.

高速動作を目的とした通常のクロック同期型ＲＡＭは、書き込み側（ライト側）において、各信号をフリップフロップによりリタイミングした後にＲＡＭコアに入力する構成を有する。また、上記のクロック同期型ＲＡＭは、書き込みアドレスと読み出しアドレスに同じ値を指定してアクセスすることを禁じていることが多い。そのため、クロック同期型ＲＡＭは、書き込んだデータを１クロック後に読み出すことができない。即ち、ＦＰＧＡに組み込まれたクロック同期型ＲＡＭにおいては、ＲＡＭコアに書き込まれたデータを、１クロック後に読み出すことはできない。 A normal clock synchronous RAM intended for high-speed operation has a configuration in which each signal is input to the RAM core after being retimed by a flip-flop on the write side (write side). Also, the clock synchronous RAM described above often prohibits access by designating the same value for the write address and the read address. Therefore, the clock synchronous RAM cannot read out the written data after one clock. That is, in the clock synchronous RAM incorporated in the FPGA, the data written in the RAM core cannot be read after one clock.

従って、近年の通信装置に要求されるような高速動作において、ＦＰＧＡに予め組み込まれたＲＡＭを有効活用することができず、結果的に図２２に示すような回路構成（演算回路とフリップフロップをグループの数だけ用意する構成）が用いられる。 Therefore, in a high-speed operation required for a recent communication apparatus, the RAM built in the FPGA cannot be effectively used. As a result, a circuit configuration (an arithmetic circuit and a flip-flop as shown in FIG. The number of groups prepared) is used.

以上のように、例えば、３１２．５ＭＨｚといったシステムクロックを用いる高速回路において、データの書き込み後の読み出しに１クロック処理が要求される場合、ＦＰＧＡに組み込まれたＲＡＭを有効活用できないという問題がある。 As described above, for example, in a high-speed circuit using a system clock of 312.5 MHz, when one clock processing is required for reading after data writing, there is a problem that the RAM incorporated in the FPGA cannot be effectively used.

上記の問題点は、特許文献１に開示された演算システムにおいても生じ得る。特許文献１が開示する演算システムでは、ライトアドレスとリードアドレスが一致する場合には、フリップフロップ（例えば、特許文献１の図５や図６に示される符号２４）が保持するライトデータが出力される。換言するならば、ライトデータを一時的に保持するフリップフロップが１つであるため、ライトアドレスやライトデータのリタイミング手段を挿入することができない。そのため、上記のような３１２．５ＭＨｚといった高速なシステムクロックを使用すると、システム全体の動作マージンが低下する等の問題が生じる可能性がある。 The above problem can also occur in the arithmetic system disclosed in Patent Document 1. In the arithmetic system disclosed in Patent Document 1, when the write address matches the read address, the write data held by the flip-flop (for example, reference numeral 24 shown in FIGS. 5 and 6 of Patent Document 1) is output. The In other words, since there is one flip-flop that temporarily holds the write data, it is not possible to insert a write address or write data retiming means. For this reason, when a high-speed system clock such as 312.5 MHz as described above is used, there is a possibility that problems such as a decrease in the operation margin of the entire system may occur.

本発明は、ライトアドレスとリードアドレスが競合するような場合であっても、書き込まれたライトデータをリードデータとして出力すると共に、安定した高速動作を実現することに寄与する半導体装置を提供することを目的とする。 The present invention provides a semiconductor device that contributes to the realization of stable high-speed operation while outputting written write data as read data even when the write address and the read address conflict. With the goal.

本発明の第１の視点によれば、ライトアドレス、ライトデータ及びリードアドレスを受け付け、データの書き込みとデータの読み出しの並列動作が可能な第１の記憶部と、直列接続された複数の記憶素子からなる記憶部であって、前記ライトデータを受け付けると共に、前記第１の記憶部と並列接続された第２の記憶部と、少なくとも前記ライトアドレス及び前記リードアドレスに応じて、前記第１の記憶部から読み出されたリードデータ及び前記第２の記憶部をなす前記複数の記憶素子のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定する決定部と、前記決定部により外部に出力すると決定されたデータを選択的に出力する選択部と、を備える半導体装置が提供される。 According to the first aspect of the present invention, a first storage unit that accepts a write address, write data, and a read address and can perform parallel operations of writing data and reading data, and a plurality of storage elements connected in series And a second storage unit that receives the write data and is connected in parallel to the first storage unit, and at least the first storage according to the write address and the read address. A determination unit that determines, as data to be output to the outside, read data read from a unit and data stored in any of the plurality of storage elements forming the second storage unit, and the determination unit There is provided a semiconductor device including a selection unit that selectively outputs data determined to be output to the outside.

本発明の第２の視点によれば、ライトアドレス、ライトデータ及びリードアドレスを受け付け、データの書き込みとデータの読み出しの並列動作が可能な第１の記憶部と、直列接続された複数の記憶素子からなる記憶部であって、前記ライトデータを受け付けると共に、前記第１の記憶部と並列接続された第２の記憶部と、を含む記憶装置からのデータ出力方法であって、少なくとも前記ライトアドレス及び前記リードアドレスに応じて、前記第１の記憶部から読み出されたリードデータ及び前記第２の記憶部をなす前記複数の記憶素子のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定するステップと、前記外部に出力すると決定されたデータを選択的に出力するステップと、を含む、データ出力方法が提供される。 According to a second aspect of the present invention, a first storage unit that accepts a write address, write data, and a read address and can perform parallel operations of writing data and reading data, and a plurality of storage elements connected in series A data output method from a storage device including a second storage unit that receives the write data and is connected in parallel to the first storage unit, wherein the write address is at least the write address According to the read address, either read data read from the first storage unit or data stored in any of the plurality of storage elements forming the second storage unit is output to the outside. A data output method comprising: determining as data to be output; and selectively outputting data determined to be output to the outside .

本発明の各視点によれば、ライトアドレスとリードアドレスが競合するような場合であっても、書き込まれたライトデータをリードデータとして出力すると共に、安定した高速動作を実現することに寄与する半導体装置及びデータ出力方法が、提供される。 According to each aspect of the present invention, even if the write address and the read address conflict, the written write data is output as read data and contributes to realizing a stable high-speed operation An apparatus and a data output method are provided.

一実施形態の概要を説明するための図である。It is a figure for demonstrating the outline | summary of one Embodiment. 第１の実施形態に係る通信装置の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the communication apparatus which concerns on 1st Embodiment. 第１の実施形態に係る演算モジュールの内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the arithmetic module which concerns on 1st Embodiment. 第１の実施形態に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on 1st Embodiment. 第１の実施形態に係る読み出し先決定回路の回路構成の一例を示す図である。It is a figure which shows an example of the circuit structure of the reading destination determination circuit which concerns on 1st Embodiment. 第１の実施形態に係る選択信号出力回路の動作を示す真理値表の一例を示す図である。It is a figure which shows an example of the truth table which shows operation | movement of the selection signal output circuit which concerns on 1st Embodiment. 第１の実施形態に係るＲＡＭコア単体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the RAM core single-piece | unit which concerns on 1st Embodiment. 第１の実施形態に係る記憶部全体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the whole memory | storage part which concerns on 1st Embodiment. 第１の実施形態に係るデータ出力方法の一例を示すフローチャートである。It is a flowchart which shows an example of the data output method which concerns on 1st Embodiment. 第１の変形例に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on a 1st modification. 第１の変形例に係るＲＡＭコア単体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input-output of the signal regarding the RAM core single-piece | unit based on a 1st modification. 第１の変形例に係る記憶部全体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the whole memory | storage part which concerns on a 1st modification. 第２の変形例に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on a 2nd modification. 第２の変形例に係るＲＡＭコア単体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the RAM core single-piece | unit which concerns on a 2nd modification. 第２の変形例に係る記憶部全体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the whole memory | storage part which concerns on a 2nd modification. 第３の変形例に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on a 3rd modification. 第３の変形例に係るＲＡＭコア単体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input-output of the signal regarding the RAM core single-piece | unit which concerns on a 3rd modification. 第３の変形例に係る記憶部全体に関する信号の入出力を示すタイムチャートの一例である。It is an example of the time chart which shows the input / output of the signal regarding the whole memory | storage part which concerns on a 3rd modification. 第４の変形例に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on a 4th modification. 第５の変形例に係る記憶部の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the memory | storage part which concerns on a 5th modification. 第５の変形例に係る読み出し先決定回路の内部構成の一例を示す図である。It is a figure which shows an example of the internal structure of the reading destination determination circuit which concerns on a 5th modification. パケットモニタ回路の内部構成の一例を示す図である。It is a figure which shows an example of an internal structure of a packet monitor circuit.

初めに、一実施形態の概要について説明する。なお、この概要に付記した図面参照符号は、理解を助けるための一例として各要素に便宜上付記したものであり、この概要の記載はなんらの限定を意図するものではない。 First, an outline of one embodiment will be described. Note that the reference numerals of the drawings attached to the outline are attached to the respective elements for convenience as an example for facilitating understanding, and the description of the outline is not intended to be any limitation.

上述のように、ライトアドレスとリードアドレスが競合するような場合であっても、書き込まれたライトデータをリードデータとして出力すると共に、安定した高速動作を実現する半導体装置が望まれる。 As described above, there is a demand for a semiconductor device that outputs written write data as read data and realizes stable high-speed operation even when the write address and the read address conflict.

そこで、一例として図１に示す半導体装置１００を提供する。半導体装置１００は、第１の記憶部１０１と、第２の記憶部１０２と、決定部１０３と、選択部１０４と、を備える。第１の記憶部１０１は、ライトアドレス、ライトデータ及びリードアドレスを受け付け、データの書き込みとデータの読み出しの並列動作が可能である。第２の記憶部１０２は、直列接続された複数の記憶素子からなる記憶部であって、ライトデータを受け付けると共に、第１の記憶部１０１と並列接続される。決定部１０３は、少なくともライトアドレス及びリードアドレスに応じて、第１の記憶部１０１から読み出されたリードデータ及び第２の記憶部１０２をなす複数の記憶素子のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定する。選択部１０４は、決定部１０３により外部に出力すると決定されたデータを選択的に出力する。 Therefore, as an example, the semiconductor device 100 illustrated in FIG. 1 is provided. The semiconductor device 100 includes a first storage unit 101, a second storage unit 102, a determination unit 103, and a selection unit 104. The first storage unit 101 receives a write address, write data, and a read address, and can perform a parallel operation of writing data and reading data. The second storage unit 102 is a storage unit including a plurality of storage elements connected in series, and receives write data and is connected in parallel to the first storage unit 101. The determination unit 103 determines the read data read from the first storage unit 101 and the data stored in any of the plurality of storage elements forming the second storage unit 102 according to at least the write address and the read address. Either one is determined as data to be output to the outside. The selection unit 104 selectively outputs data determined to be output to the outside by the determination unit 103.

半導体装置１００は、例えば、ＦＰＧＡに標準的に搭載されているデュアルポートメモリを第１の記憶部１０１として用いる。また、半導体装置１００は、システムクロックに同期してデータを保持するフリップフロップを複数連結することで、第２の記憶部１０２とする。半導体装置１００は、ライトアドレスとリードアドレスが競合し、第１の記憶部１０１からデータを読み出すことができない場合には、第２の記憶部１０２が記憶するデータを、半導体装置１００の出力データとして外部に出力する。即ち、半導体装置１００は、第１の記憶部１０１と第２の記憶部１０２を併用することで、ライトアドレスとリードアドレスの競合による問題点を解決する。 For example, the semiconductor device 100 uses, as the first storage unit 101, a dual port memory that is typically mounted on an FPGA. Further, the semiconductor device 100 serves as the second storage unit 102 by connecting a plurality of flip-flops that hold data in synchronization with the system clock. When the write address and the read address conflict and the data cannot be read from the first storage unit 101, the semiconductor device 100 uses the data stored in the second storage unit 102 as output data of the semiconductor device 100. Output to the outside. That is, the semiconductor device 100 solves the problem caused by the conflict between the write address and the read address by using the first storage unit 101 and the second storage unit 102 together.

また、半導体装置１００に使用されるシステムクロックが高速であるため、ライトデータやライトアドレスを伝達するバスにリタイミング手段を挿入することが望まれる場合がある。このような場合であっても、第２の記憶部１０２は、直列接続された複数の記憶素子から構成されるため、リタイミング手段を挿入することにより生じる必要なデータの消失を回避できる。例えば、図１において、ライトデータ及びライトアドレスに係るバスに１段のリタイミング手段が挿入されたとする。この場合、ライトデータ及びライトアドレスはリタイミング手段により、１クロック遅延して第１の記憶部１０１に供給される。そのため、第１の記憶部１０１において、ライトアドレスとリードアドレスが一致したとしても、実際に半導体装置１００に供給されているライトアドレスは１クロック前のものである。従って、第２の記憶部１０２に含まれる記憶素子が１つの場合には、１クロック前のライトデータを保持することができず、第１の記憶部１０１にてライトアドレスとリードアドレスが一致した場合に、必要なデータ（ライトデータ）が消滅することになる。対して、上記の例で言えば、第２の記憶部１０２に２個以上の記憶素子が含まれ１クロック前のライトデータは保持されるので、必要なデータが消失することはない。このように、第２の記憶部１０２に複数の記憶素子を含ませることで、高速なシステムクロックが使用されたとしても、必要に応じてリタイミング手段を挿入できる余地があるので、半導体装置１００の安定動作を実現できる。 In addition, since the system clock used in the semiconductor device 100 is high-speed, it may be desired to insert retiming means in a bus that transmits write data and write addresses. Even in such a case, since the second storage unit 102 includes a plurality of storage elements connected in series, it is possible to avoid the loss of necessary data caused by inserting retiming means. For example, in FIG. 1, it is assumed that one stage of retiming means is inserted in the bus related to write data and write address. In this case, the write data and the write address are supplied to the first storage unit 101 after being delayed by one clock by the retiming means. Therefore, even if the write address and the read address match in the first storage unit 101, the write address that is actually supplied to the semiconductor device 100 is one clock before. Therefore, in the case where the second storage unit 102 includes one storage element, it is not possible to hold the write data one clock before, and the write address and the read address match in the first storage unit 101. In this case, necessary data (write data) is lost. On the other hand, in the above example, the second storage unit 102 includes two or more storage elements and holds the write data one clock before, so that necessary data is not lost. In this manner, by including a plurality of storage elements in the second storage unit 102, there is room for inserting retiming means as needed even when a high-speed system clock is used. Stable operation can be realized.

以下に具体的な実施の形態について、図面を参照してさらに詳しく説明する。なお、各実施形態において同一構成要素には同一の符号を付し、その説明を省略する。 Hereinafter, specific embodiments will be described in more detail with reference to the drawings. In addition, in each embodiment, the same code | symbol is attached | subjected to the same component and the description is abbreviate | omitted.

［第１の実施形態］
第１の実施形態について、図面を用いてより詳細に説明する。 [First Embodiment]
The first embodiment will be described in more detail with reference to the drawings.

図２は、第１の実施形態に係る通信装置１の内部構成の一例を示す図である。図２を参照すると、通信装置１は、通信モジュール１０と、演算モジュール１１と、を含んで構成される。 FIG. 2 is a diagram illustrating an example of an internal configuration of the communication apparatus 1 according to the first embodiment. Referring to FIG. 2, the communication device 1 includes a communication module 10 and an arithmetic module 11.

通信モジュール１０は、ネットワークを介して他の装置とパケットの送受信を行うインターフェースである。 The communication module 10 is an interface that transmits and receives packets to and from other devices via a network.

演算モジュール１１は、通信モジュール１０からパケットを受信し、当該受信パケットに応じた演算処理を行う手段である。少なくとも演算モジュール１１は、ＦＰＧＡを用いて実現される。 The arithmetic module 11 is means for receiving a packet from the communication module 10 and performing arithmetic processing according to the received packet. At least the arithmetic module 11 is realized using an FPGA.

通信モジュール１０と演算モジュール１１は、例えば、インターラーケン等のチップ間通信プロトコルを用いてパケットの送受信を行う。 For example, the communication module 10 and the arithmetic module 11 transmit and receive packets using an inter-chip communication protocol such as interlaken.

図３は、演算モジュール１１の内部構成の一例を示す図である。演算モジュール１１は、演算部２１と、記憶部２２と、を含んで構成される。 FIG. 3 is a diagram illustrating an example of the internal configuration of the arithmetic module 11. The calculation module 11 includes a calculation unit 21 and a storage unit 22.

演算部２１は、通信モジュール１０からパケットを入力する。演算部２１は、入力したパケットに対して予め定めた演算処理を行い、演算結果を記憶部２２に格納する。例えば、演算部２１は、入力パケットのヘッダに格納されたアドレスに応じて、各パケットをグループに振り分け、グループごとに受信したパケットの数を積算カウントする。 The computing unit 21 inputs a packet from the communication module 10. The calculation unit 21 performs a predetermined calculation process on the input packet and stores the calculation result in the storage unit 22. For example, the computing unit 21 sorts each packet into a group according to the address stored in the header of the input packet, and counts the number of packets received for each group.

記憶部２２は、２つの入出力ポートを備え、各入出力ポートは独立してアクセス可能に構成される。 The storage unit 22 includes two input / output ports, and each input / output port is configured to be independently accessible.

演算部２１は、ライトデータＷ＿ＤＡＴＡ、ライトアドレスＷ＿ＡＤＤ、ライトイネーブルＷ＿ＥＮのそれぞれを記憶部２２に出力する。また、演算部２１は、リードアドレスＲ＿ＡＤＤ、リードイネーブルＲ＿ＥＮのそれぞれを記憶部２２に出力し、記憶部２２からリードデータＲ＿ＤＡＴＡを取得する。なお、ライトイネーブルＷ＿ＥＮ及びリードイネーブルＲ＿ＥＮのアクティブはＨレベルである。 The arithmetic unit 21 outputs the write data W_DATA, the write address W_ADD, and the write enable W_EN to the storage unit 22. In addition, the calculation unit 21 outputs each of the read address R_ADD and the read enable R_EN to the storage unit 22 and acquires the read data R_DATA from the storage unit 22. Note that the write enable W_EN and the read enable R_EN are active at the H level.

図４は、記憶部２２の内部構成の一例を示す図である。図４を参照すると、記憶部２２は、記憶回路２３と、メモリアレイ２４と、選択回路２５と、読み出し先決定回路２６と、を含んで構成される。 FIG. 4 is a diagram illustrating an example of the internal configuration of the storage unit 22. Referring to FIG. 4, the storage unit 22 includes a storage circuit 23, a memory array 24, a selection circuit 25, and a read destination determination circuit 26.

記憶回路２３は、演算部２１から供給されるライトデータＷ＿ＤＡＴＡを記憶する回路である。記憶回路２３は、２つの独立した入出力ポートを有するＲＡＭコア３１を含む。 The storage circuit 23 is a circuit that stores the write data W_DATA supplied from the calculation unit 21. The storage circuit 23 includes a RAM core 31 having two independent input / output ports.

ＲＡＭコア３１は、書き込みと読み出しが並列して行える所謂デュアルポートメモリである。なお、ＲＡＭコア３１は、ライトアドレスとリードアドレスに同じ値を設定することは禁止されている記憶装置である。ＲＡＭコア３１は、上述の第１の記憶部１０１に相当する。 The RAM core 31 is a so-called dual port memory that can perform writing and reading in parallel. The RAM core 31 is a storage device that is prohibited from setting the same value for the write address and the read address. The RAM core 31 corresponds to the first storage unit 101 described above.

ライトデータＷ＿ＤＡＴＡは、ＥＣＣ（Error Check and Correction）エンコーダ３２、フリップフロップ３３を介して、ＲＡＭコア３１のライトデータ入力端子Ｄ＿ＩＮ＿ＲＡＭに入力される。ライトアドレスＷ＿ＡＤＤは、フリップフロップ３４を介して、ＲＡＭコア３１のライトアドレス入力端子Ｗ＿ＡＤＤ＿ＲＡＭに入力される。ライトイネーブルＷ＿ＥＮは、フリップフロップ３５を介して、ＲＡＭコア３１のライトイネーブル入力端子Ｗ＿ＥＮ＿ＲＡＭに入力される。 The write data W_DATA is input to a write data input terminal D_IN_RAM of the RAM core 31 via an ECC (Error Check and Correction) encoder 32 and a flip-flop 33. The write address W_ADD is input to the write address input terminal W_ADD_RAM of the RAM core 31 via the flip-flop 34. The write enable W_EN is input to the write enable input terminal W_EN_RAM of the RAM core 31 via the flip-flop 35.

リードアドレスＲ＿ＡＤＤは、フリップフロップ３６を介して、ＲＡＭコア３１のリードアドレス入力端子Ｒ＿ＡＤＤ＿ＲＡＭに入力される。リードイネーブルＲ＿ＥＮは、フリップフロップ３７を介して、ＲＡＭコア３１のリードイネーブル入力端子Ｒ＿ＥＮ＿ＲＡＭに入力される。ＲＡＭコア３１は、ＥＣＣデコーダ３８を介して、リードデータ出力端子Ｄ＿ＯＵＴ＿ＲＡＭから、リードデータＲＡＭ＿ＯＵＴを選択回路２５のポート０に出力する。 The read address R_ADD is input to the read address input terminal R_ADD_RAM of the RAM core 31 via the flip-flop 36. The read enable R_EN is input to the read enable input terminal R_EN_RAM of the RAM core 31 via the flip-flop 37. The RAM core 31 outputs the read data RAM_OUT to the port 0 of the selection circuit 25 from the read data output terminal D_OUT_RAM via the ECC decoder 38.

なお、ＥＣＣエンコーダ３２は、ＲＡＭコア３１に書き込むデータに対応する誤り訂正符号を生成する手段（誤り訂正符号化回路）であり、ＥＣＣデコーダ３８は誤り訂正符号のデコードと誤りデータの訂正を行う手段（誤り訂正回路）である。即ち、記憶回路２３は、ＲＡＭコア３１に保持されるデータにソフトエラー（ハードウェアには故障は生じていないが誤ったデータが記憶されるエラー；ビットエラー）が発生した場合に、当該ソフトエラーを訂正する機能を有する。 The ECC encoder 32 is a means (error correction coding circuit) for generating an error correction code corresponding to data to be written to the RAM core 31, and the ECC decoder 38 is a means for decoding the error correction code and correcting the error data. (Error correction circuit). In other words, the storage circuit 23, when a soft error occurs in the data held in the RAM core 31 (error in which incorrect data is stored although no hardware failure has occurred; bit error), the soft error It has a function to correct.

また、記憶回路２３は、ＲＡＭコア３１の各種入力端子にフリップフロップを接続する構成を有し、各種データのリタイミングを実施することで、システムクロックが３１２．５ＭＨｚとなるような高速動作を実現する。具体的には、ＲＡＭコア３１に供給されるライトデータ、ライトアドレス、ライトイネーブル、リードアドレス、リードイネーブルのそれぞれを、システムクロックに同期して保持するフリップフロップ３３〜３７がリタイミング手段として機能する。 The memory circuit 23 has a configuration in which flip-flops are connected to various input terminals of the RAM core 31, and realizes high-speed operation such that the system clock is 312.5 MHz by performing retiming of various data. To do. Specifically, flip-flops 33 to 37 that hold the write data, write address, write enable, read address, and read enable supplied to the RAM core 31 in synchronization with the system clock function as retiming means. .

メモリアレイ２４は、複数のフリップフロップ（記憶素子）が直列接続された回路である。メモリアレイ２４は、内部のフリップフロップによりデータを記憶する回路である。記憶回路２３とメモリアレイ２４は、並列に接続されている。メモリアレイ２４は、上述の第２の記憶部１０２に相当する。 The memory array 24 is a circuit in which a plurality of flip-flops (memory elements) are connected in series. The memory array 24 is a circuit that stores data by an internal flip-flop. The storage circuit 23 and the memory array 24 are connected in parallel. The memory array 24 corresponds to the second storage unit 102 described above.

メモリアレイ２４は、演算部２１から供給されるライトデータＷ＿ＤＡＴＡを、システムクロックに同期して記憶する手段である。図４に示す構成では、メモリアレイ２４は、４つのフリップフロップ４１−１〜４１−４を有するので、３クロック前までのライトデータＷ＿ＤＡＴＡの記憶が可能である。 The memory array 24 is means for storing the write data W_DATA supplied from the arithmetic unit 21 in synchronization with the system clock. In the configuration shown in FIG. 4, the memory array 24 includes four flip-flops 41-1 to 41-4, and therefore can store write data W_DATA up to three clocks before.

フリップフロップ４１−１〜４１−４それぞれのデータ出力端子は、選択回路２５の入力端子と接続されている。具体的には、フリップフロップ４１−１のデータ出力端子は、選択回路２５のポート１に係る入力端子と接続されている。同様に、フリップフロップ４１−２のデータ出力端子は選択回路２５のポート２、フリップフロップ４１−３のデータ出力端子は選択回路２５のポート３、フリップフロップ４１−４のデータ出力端子は選択回路２５のポート４にそれぞれ接続されている。 The data output terminals of the flip-flops 41-1 to 41-4 are connected to the input terminal of the selection circuit 25. Specifically, the data output terminal of the flip-flop 41-1 is connected to the input terminal related to port 1 of the selection circuit 25. Similarly, the data output terminal of the flip-flop 41-2 is port 2 of the selection circuit 25, the data output terminal of the flip-flop 41-3 is port 3 of the selection circuit 25, and the data output terminal of the flip-flop 41-4 is the selection circuit 25. Are connected to ports 4 respectively.

読み出し先決定回路２６は、演算部２１から供給されるアドレス情報（ライトアドレスＷ＿ＡＤＤ、リードアドレスＲ＿ＡＤＤ）と、動作許可情報（ライトイネーブルＷ＿ＥＮ、リードイネーブルＲ＿ＥＮ）と、に応じて、記憶部２２全体の出力データを決定する手段である。より具体的には、読み出し先決定回路２６は、アドレス情報と動作許可情報に基づいて、記憶回路２３の出力データ及びメモリアレイ２４からの出力データのいずれを記憶部２２から外部に出力するデータ（リードデータＲ＿ＤＡＴＡ）とするかを決定する回路である。即ち、読み出し先決定回路２６は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤを監視し、これらのアドレス情報と予め定めた所定の規則（ルール）に基づいて、選択回路２５によるデータの読み出し先を決定する回路である。読み出し先決定回路２６は、上述の決定部１０３に相当する。 The read destination determination circuit 26 determines the entire storage unit 22 according to the address information (write address W_ADD, read address R_ADD) and operation permission information (write enable W_EN, read enable R_EN) supplied from the calculation unit 21. A means for determining output data. More specifically, the read destination determination circuit 26 outputs either the output data of the storage circuit 23 or the output data of the memory array 24 from the storage unit 22 to the outside based on the address information and the operation permission information ( This is a circuit for determining whether to use read data (R_DATA). That is, the read destination determination circuit 26 monitors the write address W_ADD and the read address R_ADD, and determines the data read destination by the selection circuit 25 based on these address information and a predetermined rule. It is. The read destination determination circuit 26 corresponds to the determination unit 103 described above.

読み出し先決定回路２６が選択回路２５に向けて出力する選択信号ＳＥＬは、フリップフロップ２７により１クロック遅延されて、選択信号ＳＥＬ＿１Ｔとして選択回路２５に供給される。なお、図４を含む図面において、各フリップフロップは、それぞれが記憶するデータのビット数に応じた数のフリップフロップが並列接続されて構成される。例えば、ライトデータＷ＿ＤＡＴＡが１６ビットのデータであれば、フリップフロップ３３やフリップフロップ４１−１〜４１−４のそれぞれは１６個のフリップフロップからなる。 The selection signal SEL output from the reading destination determination circuit 26 toward the selection circuit 25 is delayed by one clock by the flip-flop 27 and is supplied to the selection circuit 25 as the selection signal SEL_1T. In the drawings including FIG. 4, each flip-flop is configured by connecting in parallel a number of flip-flops corresponding to the number of bits of data stored therein. For example, if the write data W_DATA is 16-bit data, each of the flip-flop 33 and the flip-flops 41-1 to 41-4 is composed of 16 flip-flops.

図５は、読み出し先決定回路２６の回路構成の一例を示す図である。図５を参照すると、読み出し先決定回路２６は、アドレス比較回路５１−１〜５１−４と、論理積回路５２−１〜５２−４と、フリップフロップ５３−１〜５３−６と、選択信号出力回路５４と、を含んで構成される。 FIG. 5 is a diagram illustrating an example of the circuit configuration of the read destination determination circuit 26. Referring to FIG. 5, the read destination determination circuit 26 includes an address comparison circuit 51-1 to 51-4, an AND circuit 52-1 to 52-4, flip-flops 53-1 to 53-6, and a selection signal. And an output circuit 54.

フリップフロップ５３−１〜５３−３はそれぞれ、ライトアドレスＷ＿ＡＤＤを遅延させる。フリップフロップ５３−４〜５３−６はそれぞれ、ライトイネーブルＷ＿ＥＮを遅延させる。なお、以降の説明において、フリップフロップ５３−１〜５３−３のそれぞれが出力するデータを、ＷＡ＿１Ｔ、ＷＡ＿２Ｔ、ＷＡ＿３Ｔと表記する。また、フリップフロップ５３−４〜５３−６のそれぞれが出力するデータを、ＷＥ＿１Ｔ、ＷＥ＿２Ｔ、ＷＥ＿３Ｔと表記する。 Each of the flip-flops 53-1 to 53-3 delays the write address W_ADD. Each of the flip-flops 53-4 to 53-6 delays the write enable W_EN. In the following description, data output from each of the flip-flops 53-1 to 53-3 will be denoted as WA_1T, WA_2T, and WA_3T. The data output from each of the flip-flops 53-4 to 53-6 is denoted as WE_1T, WE_2T, and WE_3T.

アドレス比較回路５１−１は、現クロック（システムクロックの遅延数＝０）におけるライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤを比較し、両者が一致する場合にはＨレベルを出力する。アドレス比較回路５１−１は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが不一致の場合には、Ｌレベルを出力する。 The address comparison circuit 51-1 compares the write address W_ADD and the read address R_ADD in the current clock (system clock delay number = 0), and outputs an H level if they match. The address comparison circuit 51-1 outputs an L level when the write address W_ADD and the read address R_ADD do not match.

アドレス比較回路５１−２は、１クロック前のライトアドレスＷ＿ＡＤＤ（遅延数＝１クロック）と、現クロックのリードアドレスＲ＿ＡＤＤと、が一致するか否かを判定する回路である。同様に、アドレス比較回路５１−３は２クロック前、アドレス比較回路５１−４は３クロック前のライトアドレスＷ＿ＡＤＤと現クロックのリードアドレスＲ＿ＡＤＤの一致・不一致を判定する回路である。 The address comparison circuit 51-2 is a circuit that determines whether or not the write address W_ADD (the number of delays = 1 clock) one clock before and the read address R_ADD of the current clock match. Similarly, the address comparison circuit 51-3 is a circuit that determines whether the write address W_ADD that is two clocks before and the write address W_ADD that is three clocks old and the read address R_ADD of the current clock match.

論理積回路５２−１は、現クロックにおけるライトイネーブルＷ＿ＥＮとアドレス比較回路５１−１の出力信号の論理積演算を行い、演算結果を選択信号出力回路５４に出力する。論理積回路５２−２は、１クロック前のライトイネーブルＷ＿ＥＮとアドレス比較回路５１−２の出力信号の論理積演算を行い、その結果を選択信号出力回路５４に出力する。同様に、論理積回路５２−３は２クロック前、論理積回路５２−４は３クロック前のライトイネーブルＷ＿ＥＮとアドレス比較回路５１−３、５１−４の出力信号の論理積演算を行い、その結果を選択信号出力回路５４に出力する。 The logical product circuit 52-1 performs a logical product operation of the write enable W_EN in the current clock and the output signal of the address comparison circuit 51-1, and outputs the calculation result to the selection signal output circuit. The logical product circuit 52-2 performs a logical product operation of the write enable W_EN one clock before and the output signal of the address comparison circuit 51-2, and outputs the result to the selection signal output circuit. Similarly, the AND circuit 52-3 performs an AND operation between the write enable W_EN and the output signal of the address comparison circuits 51-3 and 51-4 two clocks before and the AND circuit 52-4, and The result is output to the selection signal output circuit 54.

選択信号出力回路５４は、論理積回路５２−１〜５２−４のそれぞれが出力する演算結果を、入力端子ＩＮ＿１〜ＩＮ＿４により取得する。選択信号出力回路５４は、入力端子ＩＮ＿１〜ＩＮ＿４に印加される論理レベル（Ｈレベル又はＬレベル）に応じて、選択信号ＳＥＬを決定する回路である。選択信号出力回路５４は、ライトイネーブルＷ＿ＥＮがＨレベルであるライトアドレスＷ＿ＡＤＤに関し、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤとの間の関係に基づいて、選択信号ＳＥＬを決定する。具体的には、選択信号出力回路５４は、以下の動作を行う。 The selection signal output circuit 54 acquires the operation results output from the AND circuits 52-1 to 52-4 from the input terminals IN_1 to IN_4. The selection signal output circuit 54 is a circuit that determines the selection signal SEL according to the logic level (H level or L level) applied to the input terminals IN_1 to IN_4. The selection signal output circuit 54 determines the selection signal SEL based on the relationship between the write address W_ADD and the read address R_ADD with respect to the write address W_ADD in which the write enable W_EN is at the H level. Specifically, the selection signal output circuit 54 performs the following operation.

（ａ）選択信号出力回路５４は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合（ＩＮ＿１＝Ｈレベル）、選択回路２５がポート１を選択するように選択信号ＳＥＬを出力する。つまり、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合、フリップフロップ４１−１の出力データが選択回路２５から出力される。
（ｂ）選択信号出力回路５４は、１クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合（ＩＮ＿２＝Ｈレベル）、選択回路２５がポート２を選択するように選択信号ＳＥＬを出力する。つまり、１クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合、フリップフロップ４１−２の出力データが選択回路２５から出力される。
（ｃ）選択信号出力回路５４は、２クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合（ＩＮ＿３＝Ｈレベル）、選択回路２５がポート３を選択するように選択信号ＳＥＬを出力する。つまり、２クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合、フリップフロップ４１−３の出力データが選択回路２５から出力される。
（ｄ）選択信号出力回路５４は、３クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合（ＩＮ＿４＝Ｈレベル）、選択回路２５がポート４を選択するように選択信号ＳＥＬを出力する。つまり、３クロック前のライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合、フリップフロップ４１−４の出力データが選択回路２５から出力される。
（ｅ）選択信号出力回路５４は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤの関係が上記（ａ）〜（ｄ）以外の場合には、選択回路２５がポート０のデータ（リードデータＲＡＭ＿ＯＵＴ）を選択するように選択信号ＳＥＬを出力する。 (A) When the write address W_ADD and the read address R_ADD match (IN_1 = H level), the selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 1. That is, when the write address W_ADD matches the read address R_ADD, the output data of the flip-flop 41-1 is output from the selection circuit 25.
(B) The selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 2 when the write address W_ADD one clock before and the read address R_ADD coincide (IN_2 = H level). That is, when the write address W_ADD one clock before and the read address R_ADD coincide, the output data of the flip-flop 41-2 is output from the selection circuit 25.
(C) When the write address W_ADD two clocks before and the read address R_ADD match (IN_3 = H level), the selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 3. That is, when the write address W_ADD two clocks before and the read address R_ADD coincide, the output data of the flip-flop 41-3 is output from the selection circuit 25.
(D) The selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 4 when the write address W_ADD three clocks before and the read address R_ADD coincide (IN_4 = H level). That is, when the write address W_ADD three clocks before and the read address R_ADD coincide, the output data of the flip-flop 41-4 is output from the selection circuit 25.
(E) When the relationship between the write address W_ADD and the read address R_ADD is other than the above (a) to (d), the selection signal output circuit 54 selects the data of the port 0 (read data RAM_OUT). The selection signal SEL is output as follows.

上記（ａ）〜（ｅ）に係る選択信号出力回路５４の動作を真理値表としてまとめた図が、図６である。なお、図６における「×」はドントケアを示す。 FIG. 6 is a diagram in which the operations of the selection signal output circuit 54 according to the above (a) to (e) are summarized as a truth table. Note that “x” in FIG. 6 indicates don't care.

以上のように、読み出し先決定回路２６は、リードアドレスＲ＿ＡＤＤとライトアドレスＷ＿ＡＤＤが一致する際のシステムクロックの遅延数に応じて、外部に出力するデータを決定する。より具体的には、読み出し先決定回路２６は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合には、メモリアレイ２４をなす複数のフリップフロップのうち、初段のフリップフロップ４１−１が記憶するデータを外部に出力するデータに決定する。また、読み出し先決定回路２６は、システムクロックの遅延数が、メモリアレイ２４を構成するフリップフロップの数（第１の実施形態では４）以上の場合には、ＲＡＭコア３１から読み出されるデータを外部に出力するデータに決定する。 As described above, the read destination determination circuit 26 determines data to be output to the outside according to the number of delays of the system clock when the read address R_ADD and the write address W_ADD match. More specifically, when the write address W_ADD and the read address R_ADD match, the read destination determination circuit 26 stores the data stored in the first flip-flop 41-1 among the plurality of flip-flops forming the memory array 24. Is determined as data to be output to the outside. In addition, the read destination determination circuit 26 externally reads data read from the RAM core 31 when the number of delays of the system clock is equal to or greater than the number of flip-flops (four in the first embodiment) constituting the memory array 24. Determine the data to be output to.

選択回路２５は、選択信号ＳＥＬが１クロック遅延された選択信号ＳＥＬ＿１Ｔにより、記憶部２２の外部に出力するデータを選択的に出力する回路である。選択回路２５は、上述の選択部１０４に相当する。 The selection circuit 25 is a circuit that selectively outputs data to be output to the outside of the storage unit 22 based on a selection signal SEL_1T obtained by delaying the selection signal SEL by one clock. The selection circuit 25 corresponds to the selection unit 104 described above.

次に、図７及び図８に示すタイムチャートを用いて記憶部２２の動作を説明する。 Next, the operation of the storage unit 22 will be described using the time charts shown in FIGS.

図７は、ＲＡＭコア３１単体に関する信号の入出力を示すタイムチャートの一例である。図８は、記憶部２２全体に関する信号の入出力を示すタイムチャートの一例である。図７及び図８では、記憶部２２からデータを読み出し、当該呼び出したデータに対して演算処理（例えば、パケットの積算カウント処理）を施し、次のクロックにて演算結果を書き込むといった演算部２１の動作を想定している。つまり、記憶部２２は、リード動作の１クロック後にライト動作することを想定している。 FIG. 7 is an example of a time chart showing signal input / output related to the RAM core 31 alone. FIG. 8 is an example of a time chart showing signal input / output related to the entire storage unit 22. In FIGS. 7 and 8, the calculation unit 21 reads data from the storage unit 22, performs calculation processing (for example, packet integration count processing) on the called data, and writes the calculation result at the next clock. Operation is assumed. That is, it is assumed that the storage unit 22 performs a write operation one clock after the read operation.

また、パケットに含まれるアドレスに応じて、各パケットはグループ分けされるが、図７及び図８では、同一のグループに関する処理が連続する場合を想定している。具体的には、時刻Ｔ１５〜Ｔ１９にて、ＲＡＭコア３１のアドレス値１に書き込まれるパケットが５回連続して入力されている。 Further, although each packet is grouped according to the address included in the packet, FIG. 7 and FIG. 8 assume a case where the processes related to the same group are continued. Specifically, at time T15 to T19, a packet to be written to the address value 1 of the RAM core 31 is continuously input five times.

図７を参照すると、時刻Ｔ０１〜Ｔ０６の期間は、ライトイネーブルＷ＿ＥＮがＨレベルであるためＲＡＭコア３１には、ライトデータ値０ａ〜５ａのそれぞれがＲＡＭコア３１のアドレス値０〜５に書き込まれる。なお、時刻Ｔ０１〜Ｔ０６の期間では、リードイネーブルＲ＿ＥＮはＬレベルであるため、ＲＡＭコア３１からデータが読み出されることはない。 Referring to FIG. 7, during the period from time T01 to T06, since write enable W_EN is at the H level, write data values 0a to 5a are written in RAM core 31 at address values 0 to 5 of RAM core 31, respectively. . In the period from time T01 to time T06, the read enable R_EN is at the L level, so that data is not read from the RAM core 31.

次に、時刻Ｔ０９にて、演算部２１によるライトデータＷ＿ＤＡＴＡ及びライトアドレスＷ＿ＡＤＤの供給に先立ち、ＲＡＭコア３１からデータの読み出しが行われる。具体的には、ライトアドレスＷ＿ＡＤＤにアドレス値０が設定される１クロック前に、リードイネーブルＲ＿ＥＮがＨレベル、リードアドレスＲ＿ＡＤＤにアドレス値０がそれぞれ設定される。当該リードイネーブルＲ＿ＥＮ及びリードアドレスＲ＿ＡＤＤは１クロック遅延されてＲＡＭコア３１に供給され、時刻Ｔ１０にて、ＲＡＭコア３１のアドレス値０からリードデータ値０ａがリードデータＲＡＭ＿ＯＵＴとして読み出される。 Next, at time T09, data is read from the RAM core 31 prior to the supply of the write data W_DATA and the write address W_ADD by the arithmetic unit 21. Specifically, the read enable R_EN is set to the H level and the address value 0 is set to the read address R_ADD one clock before the address value 0 is set to the write address W_ADD. The read enable R_EN and the read address R_ADD are delayed by one clock and supplied to the RAM core 31. At time T10, the read data value 0a is read as the read data RAM_OUT from the address value 0 of the RAM core 31.

また、時刻Ｔ１０にて、ライトイネーブルＷ＿ＥＮがＨレベルに設定され、アドレス値０、ライトデータ値０ｂに係る書き込み情報が、記憶部２２に供給される。当該書き込み情報（アドレス値０、ライトデータ値０ｂ）は、フリップフロップ３４、３５により１クロック遅延されてＲＡＭコア３１に供給され、時刻Ｔ１１のタイミングにてＲＡＭコア３１に書き込まれる。ＲＡＭコア３１は、このようなデータの読み出し動作と書き込みに係る動作を、時刻Ｔ１４まで繰り返す。 At time T10, the write enable W_EN is set to the H level, and write information relating to the address value 0 and the write data value 0b is supplied to the storage unit 22. The write information (address value 0, write data value 0b) is delayed by one clock by the flip-flops 34 and 35, supplied to the RAM core 31, and written to the RAM core 31 at time T11. The RAM core 31 repeats such data read and write operations until time T14.

時刻Ｔ１５において、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤに同じアドレス値１が、記憶部２２に供給される。従って、時刻Ｔ１５から１クロック遅れた時刻Ｔ１６にて、ＲＡＭコア３１に、同じアドレス値１を持つライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが供給される。この場合、ＲＡＭコア３１に同時アクセスが発生することとなり、ＲＡＭコア３１から出力される値（リードデータＲＡＭ＿ＯＵＴ）は不定となる。即ち、ＲＡＭコア３１の動作は、期待動作と異なる。 At time T15, the same address value 1 is supplied to the storage unit 22 as the write address W_ADD and the read address R_ADD. Accordingly, the write address W_ADD and the read address R_ADD having the same address value 1 are supplied to the RAM core 31 at time T16 which is delayed by one clock from time T15. In this case, simultaneous access occurs in the RAM core 31, and the value (read data RAM_OUT) output from the RAM core 31 is indefinite. That is, the operation of the RAM core 31 is different from the expected operation.

なお、図７以降の図面において、ＲＡＭコア３１によるリードデータＲＡＭ＿ＯＵＴが不定の場合の値を「×」を用いて表記する。また、ライトイネーブルＷ＿ＥＮやリードイネーブルＲ＿ＥＮが無効（Ｌレベル）の場合のデータ値及びアドレス値は無意味であるので、このような場合のデータ値等は「−」を用いて表記する。 In FIG. 7 and subsequent drawings, the value when the read data RAM_OUT by the RAM core 31 is indefinite is described using “×”. In addition, since the data value and the address value when the write enable W_EN and the read enable R_EN are invalid (L level) are meaningless, the data value in such a case is described using “-”.

図８を参照すると、時刻Ｔ１２にて、１クロック前の時刻Ｔ１１におけるライトアドレスＷ＿ＡＤＤ（ライトアドレス値＝１）と、リードアドレスＲ＿ＡＤＤ（リードアドレス値＝１）と、が一致する。そのため、読み出し先決定回路２６の論理積回路５２−２は、Ｈレベルを出力する。その結果、選択信号出力回路５４の入力端子ＩＮ＿２はＨレベルとなる。図６を参照すると、入力端子ＩＮ＿２がＨレベルであるので、選択信号出力回路５４は、選択回路２５がポート２を選択するように選択信号ＳＥＬを出力する。図８の時刻Ｔ１３にて、選択信号ＳＥＬが１クロック遅延された選択信号ＳＥＬ＿１Ｔが、選択回路２５に供給される。選択回路２５は、ポート２に供給されているデータ（メモリアレイ２４のフリップフロップ４１−２の出力データ）である「１ｂ」を、記憶部２２からのリードデータＲ＿ＤＡＴＡとして出力する。 Referring to FIG. 8, at time T12, the write address W_ADD (write address value = 1) and the read address R_ADD (read address value = 1) at time T11 one clock before match. Therefore, the logical product circuit 52-2 of the read destination determination circuit 26 outputs the H level. As a result, the input terminal IN_2 of the selection signal output circuit 54 becomes H level. Referring to FIG. 6, since the input terminal IN_2 is at the H level, the selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 2. At time T13 in FIG. 8, the selection signal SEL_1T obtained by delaying the selection signal SEL by one clock is supplied to the selection circuit 25. The selection circuit 25 outputs “1b”, which is data supplied to the port 2 (output data of the flip-flop 41-2 of the memory array 24), as read data R_DATA from the storage unit 22.

記憶部２２は、このような動作を時刻Ｔ１４まで行う。 The storage unit 22 performs such an operation until time T14.

次に、時刻Ｔ１５において、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤとして同じアドレス値１が、記憶部２２に供給される。この場合、ライトアドレス値とリードアドレス値が一致するため、読み出し先決定回路２６の論理積回路５２−１は、Ｈレベルを出力する。その結果、選択信号出力回路５４の入力端子ＩＮ＿１にＨレベルが供給される。図６を参照すると、入力端子ＩＮ＿１がＨレベルであるので、選択信号出力回路５４は、選択回路２５がポート１を選択するように選択信号ＳＥＬを出力する。時刻Ｔ１６にて、選択信号ＳＥＬが１クロック遅延された選択信号ＳＥＬ＿１Ｔが、選択回路２５に供給される。選択回路２５は、ポート１に供給されているデータ（メモリアレイ２４のフリップフロップ４１−１の出力データ）である「１ｄ」を、記憶部２２からのリードデータＲ＿ＤＡＴＡとして出力する。 Next, at time T15, the same address value 1 is supplied to the storage unit 22 as the write address W_ADD and the read address R_ADD. In this case, since the write address value matches the read address value, the AND circuit 52-1 of the read destination determination circuit 26 outputs the H level. As a result, the H level is supplied to the input terminal IN_1 of the selection signal output circuit 54. Referring to FIG. 6, since the input terminal IN_1 is at the H level, the selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 1. At time T16, the selection signal SEL_1T obtained by delaying the selection signal SEL by one clock is supplied to the selection circuit 25. The selection circuit 25 outputs “1d”, which is data supplied to the port 1 (output data of the flip-flop 41-1 of the memory array 24), as read data R_DATA from the storage unit 22.

このように、同時アクセスが発生するような場合には、読み出し先決定回路２６は、フリップフロップ４１−１が記憶するライトデータＷ＿ＤＡＴＡを、記憶部２２全体の出力データ（リードデータＲ＿ＤＡＴＡ）として外部に出力する。その結果、記憶部２２は、期待通りのデータ出力を行える。 In this way, when simultaneous access occurs, the read destination determination circuit 26 externally uses the write data W_DATA stored in the flip-flop 41-1 as output data (read data R_DATA) of the entire storage unit 22. Output. As a result, the storage unit 22 can output data as expected.

第１の実施形態に係るデータ出力方法をまとめると図９に示すフローチャートのとおりである。 The data output method according to the first embodiment is summarized as a flowchart shown in FIG.

ステップＳ０１において、少なくともライトアドレスＷ＿ＡＤＤ及びリードアドレスＲ＿ＡＤＤに応じて、ＲＡＭコア３１から読み出されたリードデータＲＡＭ＿ＯＵＴ及びメモリアレイ２４をなす複数のフリップフロップ４１のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定する。ステップＳ０２において、外部に出力すると決定されたデータを選択的に出力する。 In step S01, at least according to the write address W_ADD and the read address R_ADD, the read data RAM_OUT read from the RAM core 31 and any of the data stored in any of the plurality of flip-flops 41 forming the memory array 24 are obtained. Determined as data to be output to the outside. In step S02, the data determined to be output to the outside is selectively output.

以上のように、第１の実施形態に係る記憶部２２は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤとの間の関係に基づいて、ＲＡＭコア３１が出力するデータ及びメモリアレイ２４が出力するデータのいずれかを、リードデータＲ＿ＤＡＴＡとして出力するか選択する。その結果、記憶部２２は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致するような場合も含め、これらのアドレス値がどのような値であっても、リード動作の１クロック後にライト動作が可能となる。即ち、第１の実施形態に係る記憶部２２は、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合であっても、ライトされたデータを確実にリードすることが可能である。また、フリップフロップ３３〜３７は各種データをリタイミングする手段として機能する。そのため、例えば、パケットを１００Ｇｂｐｓといった速度にて処理する必要がある通信装置１の演算モジュール１１に記憶部２２を組み込むことで、パケットを１クロックで処理することが可能となる。 As described above, the storage unit 22 according to the first embodiment uses any one of the data output from the RAM core 31 and the data output from the memory array 24 based on the relationship between the write address W_ADD and the read address R_ADD. To output as read data R_DATA. As a result, the storage unit 22 can perform the write operation one clock after the read operation, regardless of the value of these address values, including the case where the write address W_ADD and the read address R_ADD coincide. . That is, the storage unit 22 according to the first embodiment can reliably read the written data even when the write address W_ADD matches the read address R_ADD. The flip-flops 33 to 37 function as means for retiming various data. Therefore, for example, by incorporating the storage unit 22 in the arithmetic module 11 of the communication apparatus 1 that needs to process a packet at a speed of 100 Gbps, the packet can be processed in one clock.

また、演算モジュール１１をＦＰＧＡにて実現する場合に、フリップフロップに係るリソースを記憶装置として使用する必要がなくなり、汎用的なＦＰＧＡのリソースを有効活用できる。即ち、安価な通信装置１を短期間で開発することが可能となる。 Further, when the arithmetic module 11 is realized by an FPGA, it is not necessary to use a resource related to the flip-flop as a storage device, and a general-purpose FPGA resource can be effectively used. That is, an inexpensive communication device 1 can be developed in a short period of time.

ここで、一部のＦＰＧＡには、同時アクセスが発生した場合に、ライトデータを直接リードできる機能を有するものがある。このようなＦＰＧＡを使用すれば、同時アクセスに伴う問題は生じない。しかし、このような機能は、ごく一部のＦＰＧＡメーカから供給されるＦＰＧＡに限り実装されているものあり一般的ではない。つまり、大半のＦＰＧＡにはこのような機能は実装されておらず、上記のような例外的且つ特殊な機能を有するＦＰＧＡを採用することはできない。 Here, some FPGAs have a function of directly reading write data when simultaneous access occurs. If such an FPGA is used, problems associated with simultaneous access do not occur. However, such a function is implemented only in an FPGA supplied from a few FPGA manufacturers, and is not general. That is, most of the FPGAs do not have such a function, and it is not possible to adopt an FPGA having the above exceptional and special functions.

また、１００Ｇｂｐｓといった処理容量が要求される通信装置１では、例えば、３１２．５ＭＨｚにて１クロック処理が必要となるため、ライトデータＷ＿ＤＡＴＡに対するＥＣＣが適用できないという問題がある。ＦＰＧＡにおいて、高速なＥＣＣ回路を実現するためには、予めＦＰＧＡにハードマクロとして組み込まれたＥＣＣ回路を用いるのが通常である。しかし、ハードマクロに係るＥＣＣ回路を組み込みつつ、同時アクセス時の出力を保証する記憶装置は存在しないのが現状である。つまり、上記の特殊機能（同時アクセスが発生した場合には、ライトデータを直接リードできる機能）が実装されたＦＰＧＡであっても、ＥＣＣ機能を有効にすることができないという問題がある。即ち、１クロック処理が要求されるような記憶装置では、ＥＣＣ回路の適用ができずソフトエラーによるＲＡＭのビット反転エラーを回避できない。その結果、信頼性の高い通信装置の実現が困難とある。 In addition, in the communication device 1 that requires a processing capacity of 100 Gbps, for example, one clock processing is required at 312.5 MHz, so that there is a problem that ECC for the write data W_DATA cannot be applied. In an FPGA, in order to realize a high-speed ECC circuit, it is usual to use an ECC circuit that is previously incorporated in the FPGA as a hard macro. However, there is currently no storage device that guarantees output during simultaneous access while incorporating an ECC circuit related to a hard macro. That is, there is a problem that the ECC function cannot be validated even with an FPGA in which the above-described special function (a function that can directly read the write data when simultaneous access occurs) is mounted. That is, in a storage device that requires one clock processing, the ECC circuit cannot be applied, and a RAM bit inversion error due to a soft error cannot be avoided. As a result, it is difficult to realize a highly reliable communication device.

一方、第１の実施形態に係る記憶部２２は、同時アクセス時にはメモリアレイ２４のフリップフロップ４１−１に格納されたライトデータＷ＿ＤＡＴＡをリードデータＲ＿ＤＡＴＡとして出力するためソフトエラーによるビット反転エラーは生じない。また、ＲＡＭコア３１に格納されるライトデータＷ＿ＤＡＴＡには、ＥＣＣ回路（ＥＣＣエンコーダ３２とＥＣＣデコーダ３８）による誤り訂正が適用されるため、ソフトエラーによるビット反転エラーは必要に応じて訂正される。その結果、記憶部２２を用いる通信装置１の信頼性（品質）を向上させることができる。 On the other hand, since the storage unit 22 according to the first embodiment outputs the write data W_DATA stored in the flip-flop 41-1 of the memory array 24 as read data R_DATA during simultaneous access, no bit inversion error due to a soft error occurs. . Further, since error correction by the ECC circuit (the ECC encoder 32 and the ECC decoder 38) is applied to the write data W_DATA stored in the RAM core 31, a bit inversion error due to a soft error is corrected as necessary. As a result, the reliability (quality) of the communication device 1 using the storage unit 22 can be improved.

なお、第１の実施形態にて説明した記憶部２２の構成及び動作は例示であって、種々の変形が可能である。以下、第１の実施形態の変形例について説明する。 The configuration and operation of the storage unit 22 described in the first embodiment are examples, and various modifications can be made. Hereinafter, modifications of the first embodiment will be described.

［変形例１］
図１０は、第１の変形例に係る記憶部２２ａの内部構成の一例を示す図である。図４に示す記憶部２２と図１０に示す記憶部２２ａの相違点は、記憶回路２３の内部にフリップフロップ６１〜６３が追加されている点である。即ち、図１０に示す記憶部２２ａでは、ＲＡＭコア３１における書き込み側のリタイミング段数が２段となっている。 [Modification 1]
FIG. 10 is a diagram illustrating an example of an internal configuration of the storage unit 22a according to the first modification. The difference between the storage unit 22 shown in FIG. 4 and the storage unit 22 a shown in FIG. 10 is that flip-flops 61 to 63 are added inside the storage circuit 23. That is, in the storage unit 22a shown in FIG. 10, the number of retiming stages on the write side in the RAM core 31 is two.

ＲＡＭコア３１の書き込み側のリタイミング段数が２段になっているため、ライトアドレスＷ＿ＡＤＤやライトデータＷ＿ＤＡＴＡは２クロック遅延してＲＡＭコア３１に供給される（例えば、図１１のＴ０１〜Ｔ０３参照）。また、例えば、図１２の時刻Ｔ１５、Ｔ１６を参照すると、同時アクセスが発生した場合であっても、期待通りのデータが記憶部２２ａから出力されているのが理解される。 Since the number of retiming stages on the writing side of the RAM core 31 is two, the write address W_ADD and the write data W_DATA are supplied to the RAM core 31 with a delay of two clocks (see, for example, T01 to T03 in FIG. 11). . For example, referring to times T15 and T16 in FIG. 12, it can be understood that expected data is output from the storage unit 22a even when simultaneous access occurs.

［変形例２］
図１３は、第２の変形例に係る記憶部２２ｂの内部構成の一例を示す図である。図１０に示す記憶部２２ａと図１３に示す記憶部２２ｂの相違点は、ＲＡＭコア３１のリードデータ出力端子Ｄ＿ＯＵＴ＿ＲＡＭとＥＣＣデコーダ３８の間に挿入されるフリップフロップ６４と、リードアドレスＲ＿ＡＤＤを１クロック遅延させるフリップフロップ６５と、が追加されている点である。 [Modification 2]
FIG. 13 is a diagram illustrating an example of an internal configuration of the storage unit 22b according to the second modification. The difference between the storage unit 22a shown in FIG. 10 and the storage unit 22b shown in FIG. 13 is that the flip-flop 64 inserted between the read data output terminal D_OUT_RAM of the RAM core 31 and the ECC decoder 38 and the read address R_ADD are set to one clock. A flip-flop 65 for delaying is added.

記憶部２２ｂは、フリップフロップ６４が追加されることで、読み出し側のデータ出力をリタイミングする。また、ＲＡＭコア３１から出力されるリードデータＲＡＭ＿ＯＵＴと、記憶部２２ｂの外部（演算部２１）から供給されるリードアドレスＲ＿ＡＤＤと、の間の整合を図るためフリップフロップ６５が追加される。なお、図１３以降の図面において、フリップフロップ６５により１クロック遅延されるリードアドレスＲ＿ＡＤＤをリードアドレスＲ＿ＡＤＤ＿１Ｔと表記する。 The storage unit 22b retimes the data output on the reading side by adding the flip-flop 64. In addition, a flip-flop 65 is added to achieve matching between the read data RAM_OUT output from the RAM core 31 and the read address R_ADD supplied from the outside of the storage unit 22b (arithmetic unit 21). In FIG. 13 and subsequent drawings, the read address R_ADD delayed by one clock by the flip-flop 65 is represented as a read address R_ADD_1T.

図１４を参照すると、例えば、時刻Ｔ０９にてＲＡＭコア３１から読み出されたデータは、フリップフロップ６４により１クロック遅延され、時刻Ｔ１０にてリードデータＲＡＭ＿ＯＵＴとして出力される。 Referring to FIG. 14, for example, data read from the RAM core 31 at time T09 is delayed by one clock by the flip-flop 64, and output as read data RAM_OUT at time T10.

また、図１５を参照すると、例えば、時刻Ｔ０８にて演算部２１から供給されたリードアドレスＲ＿ＡＤＤは、フリップフロップ６５により１クロック遅延され、時刻Ｔ０９にてリードアドレスＲ＿ＡＤＤ＿１Ｔとして読み出し先決定回路２６に供給される。 Referring to FIG. 15, for example, the read address R_ADD supplied from the computing unit 21 at time T08 is delayed by one clock by the flip-flop 65 and supplied to the read destination determination circuit 26 as the read address R_ADD_1T at time T09. Is done.

さらに、例えば、図１５の時刻Ｔ１５にて、ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致しているため、同時アクセスが発生している。この場合、時刻Ｔ１５のリードアドレスＲ＿ＡＤＤ＿１Ｔのアドレス値と、ライトアドレスＷ＿ＡＤＤのアドレス値が一致するので、読み出し先決定回路２６の論理積回路５２−１は、Ｈレベルを出力する。その結果、選択信号出力回路５４の入力端子ＩＮ＿１にＨレベルが供給される。入力端子ＩＮ＿１がＨレベルであるので、選択信号出力回路５４は、選択回路２５がポート１を選択するように選択信号ＳＥＬを出力する。時刻Ｔ１６にて、選択信号ＳＥＬが１クロック遅延された選択信号ＳＥＬ＿１Ｔが、選択回路２５に供給され、ポート１に供給されているデータ「１ｄ」が出力される。 Further, for example, at time T15 in FIG. 15, the write address W_ADD and the read address R_ADD coincide with each other, so that simultaneous access occurs. In this case, since the address value of the read address R_ADD_1T at time T15 matches the address value of the write address W_ADD, the AND circuit 52-1 of the read destination determination circuit 26 outputs the H level. As a result, the H level is supplied to the input terminal IN_1 of the selection signal output circuit 54. Since the input terminal IN_1 is at the H level, the selection signal output circuit 54 outputs the selection signal SEL so that the selection circuit 25 selects the port 1. At time T16, the selection signal SEL_1T obtained by delaying the selection signal SEL by one clock is supplied to the selection circuit 25, and the data “1d” supplied to the port 1 is output.

このように、第２の変形例に係る記憶部２２ｂであっても、同時アクセスが発生した場合には、期待通りのデータが出力される。 Thus, even in the storage unit 22b according to the second modified example, when simultaneous access occurs, data as expected is output.

［変形例３］
図１６は、第３の変形例に係る記憶部２２ｃの内部構成の一例を示す図である。図１３に示す記憶部２２ｂと図１６に示す記憶部２２ｃの相違点は、ＲＡＭコア３１の読み出し側におけるデータ出力に対するリタイミングを２段実施するためのフリップフロップ６６、６７が追加されている点である。なお、図１６以降の図面において、フリップフロップ６５及び６６により２クロック遅延されるリードアドレスＲ＿ＡＤＤをリードアドレスＲ＿ＡＤＤ＿２Ｔと表記する。 [Modification 3]
FIG. 16 is a diagram illustrating an example of an internal configuration of the storage unit 22c according to the third modification. The difference between the storage unit 22b shown in FIG. 13 and the storage unit 22c shown in FIG. 16 is that flip-flops 66 and 67 for performing two stages of retiming for data output on the read side of the RAM core 31 are added. It is. In FIG. 16 and subsequent drawings, the read address R_ADD delayed by two clocks by the flip-flops 65 and 66 is referred to as a read address R_ADD_2T.

図１７の時刻Ｔ０８〜Ｔ１０を参照すると、ＲＡＭコア３１から読み出されたデータは、フリップフロップ６４及び６７により２クロック遅延され、リードデータＲＡＭ＿ＯＵＴとして出力される。また、図１８の時刻Ｔ０７〜Ｔ０９を参照すると、リードアドレスＲ＿ＡＤＤは、フリップフロップ６５及び６６により２クロック遅延され、リードアドレスＲ＿ＡＤＤ＿２Ｔとして読み出し先決定回路２６に供給される。また、例えば、図１８の時刻Ｔ１５、Ｔ１６を参照すると、記憶部２２ｃに同時アクセスが発生した場合であっても、期待通りのデータが記憶部２２ｃから出力されているのが理解される。 Referring to times T08 to T10 in FIG. 17, the data read from the RAM core 31 is delayed by two clocks by the flip-flops 64 and 67 and output as read data RAM_OUT. Referring to the times T07 to T09 in FIG. 18, the read address R_ADD is delayed by two clocks by the flip-flops 65 and 66 and supplied to the read destination determination circuit 26 as the read address R_ADD_2T. Further, for example, referring to times T15 and T16 in FIG. 18, it is understood that expected data is output from the storage unit 22c even when simultaneous access to the storage unit 22c occurs.

以上、第１〜第３の変形例にて説明したように、メモリアレイ２４に含まれるフリップフロップの段数を４段とする場合には、ＲＡＭコア３１の書き込み側のリタイミングの段数を２段、読み出し側のリタイミングの段数を２段とすることができる。さらに、リタイミングの段数を増やす場合には、メモリアレイ２４に含まれるフリップフロップの段数を増やし、読み出し先決定回路２６の構成を増加したフリップフロップの段数に適応させればよい。 As described above in the first to third modifications, when the number of flip-flops included in the memory array 24 is four, the number of retiming stages on the write side of the RAM core 31 is two. The number of retiming stages on the reading side can be two. Further, when the number of retiming stages is increased, the number of flip-flop stages included in the memory array 24 may be increased to adapt the configuration of the read destination determination circuit 26 to the increased number of flip-flop stages.

［変形例４］
図１９は、第４の変形例に係る記憶部２２ｄの内部構成の一例を示す図である。図１６に示す記憶部２２ｃと図１９に示す記憶部２２ｄの相違点は、記憶回路２３の入力端子にフリップフロップ６８が接続されている点である。 [Modification 4]
FIG. 19 is a diagram illustrating an example of the internal configuration of the storage unit 22d according to the fourth modification. A difference between the storage unit 22 c illustrated in FIG. 16 and the storage unit 22 d illustrated in FIG. 19 is that a flip-flop 68 is connected to an input terminal of the storage circuit 23.

ＲＡＭコア３１は、ＦＰＧＡのハードマクロとして実装されているので、ＦＰＧＡチップ上での物理的位置が予め定まっている。また、演算部２１にて使用される加算器もハードマクロとして実装されているので、演算部２１と記憶部２２の間の距離が短くなるとは限らない。つまり、演算部２１（加算器）と記憶部２２（ＲＡＭコア３１）の間の距離が長くなる可能性があり、データの送受信に問題が生じることがある。 Since the RAM core 31 is mounted as an FPGA hard macro, a physical position on the FPGA chip is determined in advance. Further, since the adder used in the calculation unit 21 is also implemented as a hard macro, the distance between the calculation unit 21 and the storage unit 22 is not necessarily shortened. That is, there is a possibility that the distance between the calculation unit 21 (adder) and the storage unit 22 (RAM core 31) may be long, which may cause a problem in data transmission / reception.

そこで、第４の変形例に係る記憶部２２ｄのように、記憶回路２３の前段にフリップフロップ６８を追加することで、ライトデータＷ＿ＤＡＴＡの波形を整形する。その結果、例えば、システムクロックが３１２．５ＭＨｚのように高速、且つ、演算部２１と記憶部２２の間が離れている場合であっても、確実なデータの授受が実現できる。 Therefore, the waveform of the write data W_DATA is shaped by adding a flip-flop 68 in the previous stage of the storage circuit 23 as in the storage unit 22d according to the fourth modification. As a result, for example, even when the system clock is as high as 312.5 MHz and the operation unit 21 and the storage unit 22 are separated, reliable data transfer can be realized.

また、フリップフロップ６８の追加は、メモリアレイ２４には直列接続された５個のフリップフロップが含まれることと等価である。従って、選択回路２５のポート数及び読み出し先決定回路２６が出力する選択信号ＳＥＬを適宜変更し、記憶部２２ｄに同時アクセスが生じた場合には、フリップフロップ６８が記憶するデータが、リードデータＲ＿ＤＡＴＡとして出力されるように構成する。 The addition of the flip-flop 68 is equivalent to the memory array 24 including five flip-flops connected in series. Accordingly, when the number of ports of the selection circuit 25 and the selection signal SEL output from the read destination determination circuit 26 are appropriately changed and simultaneous access to the storage unit 22d occurs, the data stored in the flip-flop 68 is read data R_DATA. To be output as

［変形例５］
第１の実施形態及び第１〜第４の変形例に係る記憶装置では、メモリアレイ２４に含まれるフリップフロップの個数を４として説明した。しかし、フリップフロップの個数は４に限定されない。例えば、図２０に示すように、メモリアレイ２４に含まれるフリップフロップの数は１でもよい。 [Modification 5]
In the storage devices according to the first embodiment and the first to fourth modifications, the number of flip-flops included in the memory array 24 is described as four. However, the number of flip-flops is not limited to four. For example, as shown in FIG. 20, the number of flip-flops included in the memory array 24 may be one.

この場合には、読み出し先決定回路２６は、図２１（ａ）に示すように構成できる。図２１（ａ）を参照すると、読み出し先決定回路２６は、同時アクセスが発生した場合（ライトアドレスＷ＿ＡＤＤとリードアドレスＲ＿ＡＤＤが一致する場合）に、選択回路２５のポート１を選択するように選択信号ＳＥＬを出力する。選択信号出力回路５４の動作を真理値表として記載すると、図２１（ｂ）のとおりとなる。 In this case, the read destination determination circuit 26 can be configured as shown in FIG. Referring to FIG. 21A, the read destination determination circuit 26 selects the selection signal so as to select the port 1 of the selection circuit 25 when simultaneous access occurs (when the write address W_ADD and the read address R_ADD match). SEL is output. The operation of the selection signal output circuit 54 is described as a truth table as shown in FIG.

第５の変形例に係る記憶部２２ｅは、高速なシステムクロックによる動作が不要ではあるが、同時アクセスに対応したい場合に好適である。記憶部２２ｅでは、メモリアレイ２４に１個のフリップフロップ４１−１が含まれるだけなので、記憶回路２３の内部にリタイミング用のフリップフロップを追加することができない。しかし、演算モジュール１１を実現するＦＰＧＡを高速に動作させる必要が無い場合には、図２０に示す記憶部２２ｅの構成で足り、メモリアレイ２４と読み出し先決定回路２６の構成が簡略化できるという利点がある。 The storage unit 22e according to the fifth modified example is suitable for the case where it is desired to support simultaneous access, although the operation by the high-speed system clock is unnecessary. In the storage unit 22e, since the memory array 24 includes only one flip-flop 41-1, a retiming flip-flop cannot be added inside the storage circuit 23. However, when it is not necessary to operate the FPGA for realizing the arithmetic module 11 at high speed, the configuration of the storage unit 22e shown in FIG. 20 is sufficient, and the configuration of the memory array 24 and the read destination determination circuit 26 can be simplified. There is.

上記の実施形態の一部又は全部は、以下の付記のようにも記載され得るが、以下には限られない。 A part or all of the above embodiments can be described as in the following supplementary notes, but is not limited thereto.

［付記１］
ライトアドレス、ライトデータ及びリードアドレスを受け付け、データの書き込みとデータの読み出しの並列動作が可能な第１の記憶部と、
直列接続された複数の記憶素子からなる記憶部であって、前記ライトデータを受け付けると共に、前記第１の記憶部と並列接続された第２の記憶部と、
少なくとも前記ライトアドレス及び前記リードアドレスに応じて、前記第１の記憶部から読み出されたリードデータ及び前記第２の記憶部をなす前記複数の記憶素子のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定する決定部と、
前記決定部により外部に出力すると決定されたデータを選択的に出力する選択部と、
を備える半導体装置。
［付記２］
前記第１の記憶部は、ライトイネーブルとリードイネーブルを受け付け、
前記選択部は、前記ライトアドレス、前記リードアドレス、前記ライトイネーブル及び前記リードイネーブルに応じて、外部に出力するデータを選択する、付記１の半導体装置。
［付記３］
前記第１の記憶部に供給される前記ライトデータを保持する第１の保持部と、
前記第１の記憶部に供給される前記ライトアドレスを保持する第２の保持部と、
前記第１の記憶部に供給される前記ライトイネーブルを保持する第３の保持部と、
前記第１の記憶部に供給される前記リードアドレスを保持する第４の保持部と、
前記第１の記憶部に供給される前記リードイネーブルを保持する第５の保持部と、
をさらに備える付記２の半導体装置。
［付記４］
前記決定部は、前記外部に出力するデータを示す選択信号を、前記選択部に向けて出力し、
前記選択信号を保持する第６の保持部をさらに備える、付記３の半導体装置。
［付記５］
前記ライトデータから誤り訂正符号を生成する誤り訂正符号化回路と、
前記誤り訂正符号に応じて、前記第１の記憶部から読み出されたリードデータを訂正する誤り訂正回路と、
をさらに備える、付記１乃至４のいずれか一に記載の半導体装置。
［付記６］
前記決定部は、前記リードアドレスと前記ライトアドレスが一致する際のシステムクロックの遅延数に応じて、前記外部に出力するデータを決定する付記１乃至５のいずれか一に記載の半導体装置。
［付記７］
前記決定部は、
前記ライトアドレスと前記リードアドレスが一致する場合には、前記第２の記憶部をなす複数の記憶素子のうち、初段の記憶素子が記憶するデータを前記外部に出力するデータとして決定し、
前記システムクロックの遅延数が、前記第２の記憶部をなす複数の記憶素子の数以上の場合には、前記第１の記憶部から読み出されたリードデータを前記外部に出力するデータとして決定する、付記６の半導体装置。
［付記８］
前記第１の保持部の前段に配置され、前記ライトデータを保持する第７の保持部と、
前記第２の保持部の前段に配置され、前記ライトアドレスを保持する第８の保持部と、
前記第３の保持部の前段に配置され、前記ライトイネーブルを保持する第９の保持部と、
をさらに備える、付記４の半導体装置。
［付記９］
前記第１の記憶部から読み出されたリードデータを保持する第１０の保持部と、
前記決定部に供給される前記リードアドレスを保持する第１１の保持部と、
をさらに備える、付記８の半導体装置。
［付記１０］
前記第１０の保持部の後段に配置され、前記第１の記憶部から読み出されたリードデータを保持する第１２の保持部と、
前記第１１の保持部の前段に配置され、前記リードアドレスを保持する第１３の保持部と、
をさらに備える、付記９の半導体装置。
［付記１１］
前記第７の保持部の前段に配置され、前記ライトデータを保持する第１４の保持部をさらに備える、付記１０の半導体装置。
［付記１２］
ライトアドレス、ライトデータ及びリードアドレスを受け付け、データの書き込みとデータの読み出しの並列動作が可能な第１の記憶部と、直列接続された複数の記憶素子からなる記憶部であって、前記ライトデータを受け付けると共に、前記第１の記憶部と並列接続された第２の記憶部と、を含む記憶装置からのデータ出力方法であって、
少なくとも前記ライトアドレス及び前記リードアドレスに応じて、前記第１の記憶部から読み出されたリードデータ及び前記第２の記憶部をなす前記複数の記憶素子のいずれかに記憶されたデータのいずれかを外部に出力するデータとして決定するステップと、
前記外部に出力すると決定されたデータを選択的に出力するステップと、
を含む、データ出力方法。
なお、付記１２の形態は、付記１の形態と同様に、付記２の形態〜付記１１の形態に展開することが可能である。 [Appendix 1]
A first storage unit that accepts a write address, write data, and a read address, and is capable of parallel operation of writing data and reading data;
A storage unit composed of a plurality of storage elements connected in series, the second storage unit receiving the write data and connected in parallel to the first storage unit;
Any one of the read data read from the first storage unit and the data stored in any of the plurality of storage elements forming the second storage unit according to at least the write address and the read address A determination unit that determines the data to be output to the outside,
A selection unit that selectively outputs data determined to be output to the outside by the determination unit;
A semiconductor device comprising:
[Appendix 2]
The first storage unit accepts a write enable and a read enable,
The semiconductor device according to appendix 1, wherein the selection unit selects data to be output to the outside according to the write address, the read address, the write enable, and the read enable.
[Appendix 3]
A first holding unit for holding the write data supplied to the first storage unit;
A second holding unit for holding the write address supplied to the first storage unit;
A third holding unit for holding the write enable supplied to the first storage unit;
A fourth holding unit for holding the read address supplied to the first storage unit;
A fifth holding unit for holding the read enable supplied to the first storage unit;
The semiconductor device according to appendix 2, further comprising:
[Appendix 4]
The determination unit outputs a selection signal indicating data to be output to the outside toward the selection unit,
The semiconductor device according to appendix 3, further comprising a sixth holding unit that holds the selection signal.
[Appendix 5]
An error correction encoding circuit for generating an error correction code from the write data;
An error correction circuit for correcting read data read from the first storage unit according to the error correction code;
The semiconductor device according to any one of appendices 1 to 4, further comprising:
[Appendix 6]
The semiconductor device according to any one of appendices 1 to 5, wherein the determination unit determines data to be output to the outside according to a delay number of a system clock when the read address matches the write address.
[Appendix 7]
The determination unit
When the write address and the read address match, the data stored in the first storage element among the plurality of storage elements forming the second storage unit is determined as data to be output to the outside,
When the delay number of the system clock is equal to or greater than the number of the plurality of storage elements forming the second storage unit, the read data read from the first storage unit is determined as data to be output to the outside The semiconductor device according to appendix 6.
[Appendix 8]
A seventh holding unit that is arranged in front of the first holding unit and holds the write data;
An eighth holding unit that is arranged in front of the second holding unit and holds the write address;
A ninth holding unit that is arranged in front of the third holding unit and holds the write enable;
The semiconductor device according to appendix 4, further comprising:
[Appendix 9]
A tenth holding unit for holding read data read from the first storage unit;
An eleventh holding unit for holding the read address supplied to the determination unit;
The semiconductor device according to appendix 8, further comprising:
[Appendix 10]
A twelfth holding unit that is arranged downstream of the tenth holding unit and holds read data read from the first storage unit;
A thirteenth holding unit that is arranged in front of the eleventh holding unit and holds the read address;
The semiconductor device according to appendix 9, further comprising:
[Appendix 11]
The semiconductor device according to appendix 10, further comprising a fourteenth holding unit that is arranged in front of the seventh holding unit and holds the write data.
[Appendix 12]
A first storage unit that accepts a write address, write data, and a read address, and that can perform a parallel operation of writing data and reading data; and a storage unit including a plurality of storage elements connected in series, the write data And a data output method from a storage device including a second storage unit connected in parallel with the first storage unit,
Any one of the read data read from the first storage unit and the data stored in any of the plurality of storage elements forming the second storage unit according to at least the write address and the read address Determining as data to be output to the outside,
Selectively outputting data determined to be output to the outside;
Including data output method.
Note that the form of Supplementary Note 12 can be developed into the form of Supplementary Note 2 to the form of Supplementary Note 11 as with the form of Supplementary Note 1.

なお、引用した上記の特許文献の開示は、本書に引用をもって繰り込むものとする。本発明の全開示（請求の範囲を含む）の枠内において、さらにその基本的技術思想に基づいて、実施形態ないし実施例の変更・調整が可能である。また、本発明の全開示の枠内において種々の開示要素（各請求項の各要素、各実施形態ないし実施例の各要素、各図面の各要素等を含む）の多様な組み合わせ、ないし、選択が可能である。すなわち、本発明は、請求の範囲を含む全開示、技術的思想にしたがって当業者であればなし得るであろう各種変形、修正を含むことは勿論である。特に、本書に記載した数値範囲については、当該範囲内に含まれる任意の数値ないし小範囲が、別段の記載のない場合でも具体的に記載されているものと解釈されるべきである。 The disclosure of the cited patent document is incorporated herein by reference. Within the scope of the entire disclosure (including claims) of the present invention, the embodiments and examples can be changed and adjusted based on the basic technical concept. In addition, various combinations or selections of various disclosed elements (including each element in each claim, each element in each embodiment or example, each element in each drawing, etc.) within the scope of the entire disclosure of the present invention. Is possible. That is, the present invention of course includes various variations and modifications that could be made by those skilled in the art according to the entire disclosure including the claims and the technical idea. In particular, with respect to the numerical ranges described in this document, any numerical value or small range included in the range should be construed as being specifically described even if there is no specific description.

１通信装置
１０通信モジュール
１１演算モジュール
２１、８２演算部
２２、２２ａ〜２２ｅ記憶部
２３記憶回路
２４メモリアレイ
２５、８４選択回路
２６読み出し先決定回路
２７、３３〜３７、４１−１〜４１−４、５３−１〜５３−６、６１〜６８、８３フリップフロップ（ＦＦ;Flip-Flop）
３１ＲＡＭ（Random Access Memory）コア
３２ＥＣＣエンコーダ（ECC Encoder）
３８ＥＣＣデコーダ（ECC Decoder）
５１−１〜５１−４アドレス比較回路
５２−１〜５２−４論理積回路
５４選択信号出力回路
８０−１〜８０−ｎ積算カウント回路
８１グループ選択回路
１００半導体装置
１０１第１の記憶部
１０２第２の記憶部
１０３決定部
１０４選択部 DESCRIPTION OF SYMBOLS 1 Communication apparatus 10 Communication module 11 Calculation module 21, 82 Calculation part 22, 22a-22e Storage part 23 Storage circuit 24 Memory array 25, 84 Selection circuit 26 Reading destination determination circuit 27, 33-37, 41-1 to 41-4 53-1-53-6, 61-68, 83 Flip-flop (FF)
31 RAM (Random Access Memory) core 32 ECC encoder (ECC Encoder)
38 ECC Decoder
51-1 to 51-4 Address comparison circuit 52-1 to 52-4 AND circuit 54 selection signal output circuit 80-1 to 80-n integration count circuit 81 group selection circuit 100 semiconductor device 101 first storage unit 102 first Two storage units 103 Determination unit 104 Selection unit

Claims

A first storage unit that accepts a write address, write data, and a read address, and is capable of parallel operation of writing data and reading data;
A storage unit composed of a plurality of storage elements connected in series, the second storage unit receiving the write data and connected in parallel to the first storage unit;
Any one of the read data read from the first storage unit and the data stored in any of the plurality of storage elements forming the second storage unit according to at least the write address and the read address A determination unit that determines the data to be output to the outside,
A selection unit that selectively outputs data determined to be output to the outside by the determination unit;
A semiconductor device comprising:

The first storage unit accepts a write enable and a read enable,
The semiconductor device according to claim 1, wherein the selection unit selects data to be output to the outside according to the write address, the read address, the write enable, and the read enable.

A first holding unit for holding the write data supplied to the first storage unit;
A second holding unit for holding the write address supplied to the first storage unit;
A third holding unit for holding the write enable supplied to the first storage unit;
A fourth holding unit for holding the read address supplied to the first storage unit;
A fifth holding unit for holding the read enable supplied to the first storage unit;
The semiconductor device according to claim 2, further comprising:

The determination unit outputs a selection signal indicating data to be output to the outside toward the selection unit,
The semiconductor device according to claim 3, further comprising a sixth holding unit that holds the selection signal.

An error correction encoding circuit for generating an error correction code from the write data;
An error correction circuit for correcting read data read from the first storage unit according to the error correction code;
The semiconductor device according to claim 1, further comprising:

The semiconductor device according to claim 1, wherein the determination unit determines data to be output to the outside according to a delay number of a system clock when the read address and the write address match. .

The determination unit
When the write address and the read address match, the data stored in the first storage element among the plurality of storage elements forming the second storage unit is determined as data to be output to the outside,
When the delay number of the system clock is equal to or greater than the number of the plurality of storage elements forming the second storage unit, the read data read from the first storage unit is determined as data to be output to the outside The semiconductor device according to claim 6.

A seventh holding unit that is arranged in front of the first holding unit and holds the write data;
An eighth holding unit that is arranged in front of the second holding unit and holds the write address;
A ninth holding unit that is arranged in front of the third holding unit and holds the write enable;
The semiconductor device according to claim 4, further comprising:

A tenth holding unit for holding read data read from the first storage unit;
An eleventh holding unit for holding the read address supplied to the determination unit;
The semiconductor device according to claim 8, further comprising:

A first storage unit that accepts a write address, write data, and a read address, and that can perform a parallel operation of writing data and reading data; and a storage unit including a plurality of storage elements connected in series, wherein the write data And a data output method from a storage device including a second storage unit connected in parallel with the first storage unit,
Any one of the read data read from the first storage unit and the data stored in any of the plurality of storage elements forming the second storage unit according to at least the write address and the read address Determining as data to be output to the outside,
Selectively outputting data determined to be output to the outside;
Including data output method.