JP2001022638A - Information processing system

Info

Publication number: JP2001022638A
Application number: JP11190447A
Authority: JP
Inventors: Yuichi Abe (安部 雄一); Yasuhiro Nakatsuka (中塚 康弘); Shigeru Matsuo (松尾 茂); Tetsuya Shimomura (下村 哲也); Manabu Jo (城 学); Jun Sato (佐藤 潤)
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1999-07-05
Filing date: 1999-07-05
Publication date: 2001-01-26
Anticipated expiration: 2019-07-05
Also published as: JP3639464B2

Abstract

PROBLEM TO BE SOLVED: To provide an information processing system that enables memory access suited to each locality even when processing units with different memory-access localities coexist. SOLUTION: The system uses a unified memory composed of four simultaneously accessible modules 0 to 3, each readable and writable in 16-byte units, so that the unified memory as a whole is read and written in 64-byte units. When 64 bytes of image data consisting of four packs are stored in this memory in linear access mode, the four packs (0, 0), (0, 1), (0, 2) and (0, 3) are stored in modules 0, 1, 2 and 3, respectively. In tile access mode, the four packs (0, 0), (1, 0), (2, 0) and (3, 0) are stored in modules 0, 1, 2 and 3, respectively.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an information processing system in which a plurality of processing units access the same memory, and more particularly to speeding up memory access in a system that employs a unified memory architecture (UMA).

[0002]

[Prior Art] A processing unit in an information processing system exhibits various kinds of locality in its memory accesses, depending on the processing it performs. Locality here mainly means spatial locality: in a data structure composed of multiple data items, when one item is accessed, items placed near it are also likely to be accessed in the near future. Various techniques have been devised to make effective use of the different localities of different kinds of processing.

[0003] For example, Japanese Patent Application Laid-Open No. 8-297605 discloses a scheme in which the memory space is divided into small rectangular tiles, memory and cache addresses are managed so as to be linear within each tile, and data is transferred to the cache in tile units when the CPU accesses an image area. With this scheme, processing that has two-dimensional locality with respect to an image, such as texture mapping (that is, processing in which the next access may go in any two-dimensional direction of the image), achieves a higher hit rate because caching is done in units of two-dimensional tiles.

[0004] Meanwhile, in recent years, system LSIs have adopted a unified memory architecture (UMA) for their memory systems. A unified memory (hereinafter, UM) is a memory that stores, in an integrated manner, data that was conventionally stored in separate memories (for example, CPU instructions and data, display image data, and texture data).

[0005] When such a UMA is adopted, the UM is accessed by a variety of processing units. In other words, memory accesses from processing units with different localities may target the same UM.

[0006] For example, consider a system that stores a video input image in the UM and then uses that image as a texture for texture mapping or applies a filter to it. Each of these processes has its own locality with respect to memory access.

[0007] FIG. 20 is a diagram for explaining the locality of these processes.

[0008] As shown in the figure, in video input the pixel data arrives in order from the upper left to the lower right. That is, the video input unit has one-dimensional (linear) locality with respect to memory access.

[0009] In texture mapping, by contrast, the pixel data stored in the UM is accessed in every direction (vertical, horizontal, and diagonal) depending on, for example, the shape of the surface onto which the texture is applied, so the memory access has two-dimensional locality. Filtering applied to an image stored in the UM also generally computes a weighted average of several pixels surrounding the pixel of interest, so it too has two-dimensional locality with respect to memory access.

[0010] In this case, the UM is accessed both by a processing unit with one-dimensional (linear) locality and by a processing unit with two-dimensional locality.

[0011] For processing with linear locality, it is desirable to manage addresses linearly so that linear access and linear caching (buffering) can be performed. For processing with two-dimensional locality, it is desirable to manage addresses in a tiled manner so that tile access and tile caching can be performed.

[0012]

[Problems to be Solved by the Invention] In the technique described in the above publication, addresses are managed linearly for memory space that stores data with linear locality, such as CPU instructions. That is, as shown in FIG. 21, whether linear access (and linear caching) or tile access (and tile caching) is performed is determined by the address region being accessed, and it is not possible to perform both linear access and tile access on the same address space.

[0013] For example, a tiled address region can be accessed only by tile access, on the assumption that the next access is likely to go in any two-dimensional direction. In this case, processing with two-dimensional locality, such as texture mapping, can access memory efficiently and can be expected to achieve a better cache hit rate. However, video input processing, in which the next pixel accessed is almost always the one immediately to the right, must also use tile access for a tiled address region, so the access efficiency of a processing unit with linear locality drops.

[0014] An object of the present invention is to provide an information processing system that enables memory access suited to each locality even when processing units with different memory-access localities coexist.

[0015]

[Means for Solving the Problems] A first information processing system according to the present invention is characterized by comprising: a memory composed of a plurality of modules; a processing unit that accesses the memory; an address conversion unit that converts a memory address issued by the processing unit into an individual address for each module in accordance with an access mode; and a data aligner unit that rearranges the data read from or written to the memory in accordance with the access mode and the address.

[0016] A second information processing system according to the present invention comprises: a memory having N modules, each of which can be read and written in data units of a specific size; a processing unit that reads and writes data consisting of N such data units from and to the memory; and a memory interface unit that accesses the memory in response to access requests from the processing unit. The memory interface unit is characterized in that it determines, in accordance with an access mode, the module in which each data unit is stored and the storage location within each module, such that each of the N data units received from the processing unit is stored in a different module.

[0017] A third information processing system according to the present invention is characterized by comprising: a memory having N modules, each of which can be read and written in data units of a specific size; a processing unit that reads and writes data consisting of N data units from and to the memory; an address conversion unit that converts the address issued by the processing unit when it accesses the memory into an individual address for each module in accordance with an access mode; and a data aligner unit that, when data is exchanged between the processing unit and the memory, rearranges the data units making up that data in accordance with the access mode.

[0018] In this case, the address conversion unit may perform address conversion such that, in a two-dimensional array of N × N data units, all data units having the same X coordinate are stored in different modules and all data units having the same Y coordinate are stored in different modules, and the data aligner unit may rearrange the data units in accordance with that address conversion.
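The property required in this paragraph (no two data units in the same row or the same column of the N × N array share a module) is that of a Latin square. The sketch below is only an illustration of one assignment with that property, a diagonal skew `(x + y) mod N`; this passage does not give the formula the address conversion unit uses, so `module_for` and the choice N = 4 are assumptions.

```python
N = 4  # number of modules; assumed to match the four-module example


def module_for(x: int, y: int, n: int = N) -> int:
    """Hypothetical skewed assignment: data unit (x, y) goes to module (x + y) mod n."""
    return (x + y) % n


def modules_in_row(y: int, n: int = N) -> list:
    """Modules used by the n data units that share the Y coordinate y."""
    return [module_for(x, y, n) for x in range(n)]


def modules_in_column(x: int, n: int = N) -> list:
    """Modules used by the n data units that share the X coordinate x."""
    return [module_for(x, y, n) for y in range(n)]


if __name__ == "__main__":
    for y in range(N):
        print("row", y, modules_in_row(y))
    for x in range(N):
        print("col", x, modules_in_column(x))
```

Under this assumed assignment, a horizontal run of N units and a vertical run of N units each touch every module exactly once, so either run could be fetched in one parallel access.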

[0019] A fourth information processing system according to the present invention is characterized by comprising: processing units having mutually different localities; a unified memory accessed in common by the processing units; a cache unit that temporarily holds data used by each processing unit; a memory interface unit that performs memory access to the unified memory in response to access requests from the processing units; an address conversion unit that converts the address used to access the unified memory in accordance with an access mode notified by each processing unit; and a data aligner unit that rearranges the data exchanged with the unified memory in accordance with the access mode.

[0020] In this case, the unified memory may be composed of a plurality of modules, and the address conversion unit may be provided in each of the modules. The address conversion unit may also be provided in the memory interface unit.

[0021] A fifth information processing system according to the present invention is characterized by comprising: processing units having mutually different localities; a unified memory accessed in common by the processing units; a cache unit that temporarily holds data used by each processing unit; a memory interface unit that performs memory access to the unified memory in response to access requests from the processing units; an address conversion unit that converts the address used to access the unified memory in accordance with an access mode notified by each processing unit; and a data selection unit that is located between the processing unit and the cache unit and selects the data to be read by the processing unit in accordance with the access mode.

[0022] The information processing system according to the present invention is implemented, for example, as an ordinary computer system or as a single-chip system LSI.

[0023] The processing units include, for example, a CPU, a video input unit, a video output unit, a texture mapping unit, and a filtering unit.

[0024]

[Embodiments of the Invention] Embodiments of the present invention will be described in detail below with reference to the drawings.

[0025] FIG. 1 is a diagram showing the configuration of a system LSI to which the present invention is applied. This system LSI is implemented, for example, on a single chip.

[0026] As shown in the figure, the system LSI comprises a CPU 100, a video input unit 110, a texture mapping unit / filtering unit 120, connector units 101, 111, and 121, a memory interface unit 130, and a unified memory (hereinafter, UM) 140.

[0027] The CPU 100 is connected to the connector unit 101, the video input unit 110 is connected to the connector unit 111, and the texture mapping unit / filtering unit 120 is connected to the connector unit 121.

[0028] The connector units 101, 111, and 121 and the memory interface unit 130 are each connected to a memory bus 150. Here, the data width of the memory bus 150 is 512 bits. In addition, the access mode selection signals output from the connector units 101, 111, and 121 are input to the memory interface unit 130.

[0029] The memory interface unit 130 is also connected to the UM 140.

[0030] The CPU 100, the video input unit 110, and the texture mapping unit / filtering unit 120 are processing units that each perform different processing. Because the texture mapping unit and the filtering unit both have two-dimensional locality, they are shown together as a single representative processing unit.

[0031] The connector units 101, 111, and 121 are functional blocks that interface each processing unit with the memory bus 150. The connector unit 101 includes a cache 102, the connector unit 111 includes a write buffer (hereinafter, W buffer) 112, and the connector unit 121 includes a cache 122.

[0032] The cache 102 is a high-speed memory that holds data recently accessed by the CPU 100. For example, when the CPU 100 performs a memory read and the target data is in the cache 102, that data is passed to the CPU 100. If the target data is not in the cache 102, one cache line of data containing the target data (here, 512 bytes) is read from the UM 140 via the memory bus 150 and the memory interface unit 130; the target data is passed to the CPU 100, and the cache line that was read is held in the cache 102.
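The hit and miss behavior described for the cache 102 can be sketched as follows. This is a minimal model under stated assumptions: the class name `LineCache`, the use of a Python `bytes` object to stand in for the UM, and the absence of any eviction policy are illustrative choices that the description does not specify. Only the line-fill-on-miss behavior and the 512-byte line size come from the text.

```python
LINE_BYTES = 512  # cache line size stated for the cache 102


class LineCache:
    """Toy read cache: fills one whole line on a miss (no eviction; an assumption)."""

    def __init__(self, backing: bytes):
        self.backing = backing  # stands in for the UM
        self.lines = {}         # line number -> bytes of that line
        self.fills = 0          # number of line fills, i.e. UM reads

    def read(self, addr: int) -> int:
        line_no, offset = divmod(addr, LINE_BYTES)
        if line_no not in self.lines:
            # Miss: read the whole line that contains the target byte.
            start = line_no * LINE_BYTES
            self.lines[line_no] = self.backing[start:start + LINE_BYTES]
            self.fills += 1
        # Hit (or just-filled line): return the requested byte.
        return self.lines[line_no][offset]
```

For example, reading addresses 5 and 6 causes a single fill, and a later read of address 600 causes a second one because it falls in the next line.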

[0033] The W buffer 112 sequentially stores data input from the video input unit 110, for example pixel by pixel, and, when it becomes full, writes its contents to the UM 140 via the memory bus 150 and the memory interface unit 130. The W buffer 112 absorbs the difference between the width of the data bus between the video input unit 110 and the connector unit 111 and the width of the data bus of the memory bus 150, and thereby reduces the number of times the memory bus 150 is used. That is, if a memory access were performed for every pixel when storing video input data in the UM 140, the memory bus 150 would be used very frequently; multiple pixels are therefore accumulated in the W buffer and written to the UM 140 in a batch of a certain size (here, 512 bytes).
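The batching performed by the W buffer 112 can be illustrated with a short sketch. The class `WriteBuffer` and its `bursts` list (which records each batch that would go out on the memory bus) are hypothetical names for illustration; the 512-byte batch size is taken from the paragraph above, and the 4-byte pixel in the usage example is an assumption.

```python
class WriteBuffer:
    """Toy write buffer: collects small writes and emits one burst when full."""

    def __init__(self, capacity: int = 512):
        self.capacity = capacity   # batch size in bytes
        self.pending = bytearray() # data not yet written out
        self.bursts = []           # each entry models one memory-bus write

    def push(self, pixel: bytes) -> None:
        self.pending += pixel
        if len(self.pending) >= self.capacity:
            # Buffer is full: flush one batch in a single bus transaction.
            self.bursts.append(bytes(self.pending[:self.capacity]))
            del self.pending[:self.capacity]


if __name__ == "__main__":
    wb = WriteBuffer()
    for i in range(128):             # 128 pixels of 4 bytes each
        wb.push(bytes([i % 256] * 4))
    print(len(wb.bursts))            # number of bus writes issued
```

In this model, 128 four-byte pixel writes result in one bus transaction instead of 128.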

[0034] When the texture mapping unit / filtering unit 120 issues a data access request, for example for a single pixel, and the target data has already been read into the cache 122, the cache 122 passes that data to the texture mapping unit / filtering unit 120. If the target data is not in the cache 122, the UM 140 is accessed via the memory bus 150 and the memory interface unit 130, one cache line of data containing the target data (here, 512 bits) is read, the requested data is passed to the texture mapping unit / filtering unit 120, and the cache line that was read is held in the cache 122.

[0035] The memory interface unit 130 arbitrates the access requests from the processing units 100, 110, and 120 and determines which of the processing units issuing memory access requests may actually use the memory bus 150.

[0036] The processing unit granted access as a result of the arbitration sends an address and an access mode selection signal to the memory interface unit 130 via the memory bus 150 and exchanges data with it.

[0037] In accordance with the received address and other information, the memory interface unit 130 accesses the UM 140 at a predetermined timing and reads data from or writes data to the UM 140.

[0038] The memory interface unit 130 includes an address conversion unit 131 and a data aligner unit 132.

[0039] The address conversion unit 131 converts the address that the memory interface unit 130 receives from the memory bus 150 into a physical address of the UM 140 on the basis of the access mode selection signal. The memory interface unit 130 uses this physical address to exchange data with the UM 140.

[0040] When the memory interface unit 130 exchanges data with the UM 140, the data aligner unit 132 rearranges the data in predetermined data units as necessary, converting between the data arrangement on the memory bus 150 and the data arrangement in the UM 140.

[0041] Next, the configuration of the UM 140 will be described. Here, the case where the UM 140 is built from DRAM is described.

[0042] FIG. 2 is a diagram showing the configuration of the UM 140.

[0043] As shown in the figure, the UM 140 is composed of 2^LM independent modules 500. On output, for example, each module 500 outputs 2^LW bytes of data, and the outputs of the 2^LM modules 500 together make up the 2^(LW+LM) bytes of data output from the UM 140 as a whole.

[0044] Each module 500 includes a bank selector 510 and 2^LB independent banks 520. On the basis of an LB-bit bank address (B address), the bank selector 510 selects the output of one of the 2^LB banks as the output of the module 500.

[0045] Each bank 520 includes a row selector 521, a column selector 522, a sense amplifier 523, and 2^LR × 2^LC memory cells 524 (each memory cell holds 2^LW bytes).

[0046] On the basis of an LR-bit row address (R address), the row selector 521 selects one of the 2^LR rows of data (each 2^(LC+LW) bytes) and outputs it to the sense amplifier 523.

[0047] The sense amplifier 523 senses, amplifies, and holds the 2^(LC+LW) bytes of row data output from the row selector 521.

[0048] On the basis of an LC-bit column address (C address), the column selector 522 selects one of the 2^LC memory-cell data items held in the sense amplifier 523 and outputs 2^LW bytes of data as the output of the bank 520.

[0049] In the UM 140 shown in FIG. 2, the outputs of all the modules 500 are output in parallel to the outside of the UM 140. However, a selector that takes the outputs of the modules 500 as inputs may additionally be provided so that, on the basis of a separately supplied module address, only the outputs of some of the modules become the output of the UM 140. For example, if the UM 140 has four modules 0 to 3, the outputs of modules 0 and 1 may be output when a 1-bit module address (M address) is "0", and the outputs of modules 2 and 3 may be output when the 1-bit module address is "1".

[0050] Next, the operation of the UM 140 will be described. It operates in the same way as a typical multi-bank, multi-module synchronous DRAM.

[0051] The UM 140 receives from the memory interface unit 130 addresses, such as a bank address, a row address, and a column address, together with a command indicating a read or a write. For a write, the data to be written is also input.

[0052] First, the read operation will be described.

[0053] In each bank 520, when the bank is designated by the bank address, the 2^(LC+LW) bytes of row data corresponding to the row address are read out to the sense amplifier 523.

[0054] The row data read out to the sense amplifier 523 is input to the column selector 522. On the basis of the column address, the column selector 522 selects one 2^LW-byte data item from the row data held in the sense amplifier 523 and outputs it from the bank 520.

[0055] The 2^LW bytes of data output from each bank 520 are input to the bank selector 510. On the basis of the bank address, the bank selector 510 selects one of the 2^LB bank outputs and outputs it as the module output.

[0056] As described above, the 2^LM items of 2^LW-byte data output from the modules 500, 2^(LM+LW) bytes in total, are output from the UM 140. The data read from the UM 140 is passed to the memory interface unit 130.
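The read path described above (row selector to sense amplifier, column selector, bank selector, then parallel output from all modules) can be modeled structurally as follows. This is a simplified sketch: the class names are illustrative, and every module is given the same bank, row, and column address, whereas in the invention the address conversion unit may supply a different address to each module.

```python
class Bank:
    """One bank: 2^LR rows x 2^LC cells, each cell holding 2^LW bytes."""

    def __init__(self, rows: int, cols: int, word_bytes: int):
        self.cells = [[bytes(word_bytes) for _ in range(cols)] for _ in range(rows)]
        self.sense_amp = None  # row data currently held by the sense amplifier

    def read(self, row: int, col: int) -> bytes:
        self.sense_amp = self.cells[row]  # row selector -> sense amplifier
        return self.sense_amp[col]        # column selector picks one cell


class Module:
    """One module: 2^LB banks behind a bank selector."""

    def __init__(self, banks: int, rows: int, cols: int, word_bytes: int):
        self.banks = [Bank(rows, cols, word_bytes) for _ in range(banks)]

    def read(self, bank: int, row: int, col: int) -> bytes:
        return self.banks[bank].read(row, col)  # bank selector


class UnifiedMemory:
    """2^LM modules whose outputs are concatenated into one line."""

    def __init__(self, lm: int, lb: int, lr: int, lc: int, lw: int):
        self.modules = [Module(1 << lb, 1 << lr, 1 << lc, 1 << lw)
                        for _ in range(1 << lm)]

    def read(self, bank: int, row: int, col: int) -> bytes:
        # Simplification: the same address is applied to every module.
        return b"".join(m.read(bank, row, col) for m in self.modules)
```

With `UnifiedMemory(2, 4, 8, 4, 4)`, one read returns 4 × 16 = 64 bytes.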

[0057] Reading row data out to the sense amplifier 523 takes a certain number of cycles (for example, six cycles). When the data to be accessed has already been read out to the sense amplifier 523, however, the row data need not be read from the memory cells 524, so the access completes quickly (for example, in two cycles). It is therefore desirable that data with high locality be read out to the sense amplifier 523 together.
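The benefit of keeping high-locality data in the same row can be made concrete with a small cycle-count model. The sketch assumes that the example figures above are total access costs (6 cycles when the row must be read into the sense amplifier, 2 cycles when it is already there) and that each bank keeps its most recently read row open; both are simplifying assumptions, and `total_cycles` is a hypothetical helper.

```python
ROW_MISS_CYCLES = 6  # example cost when the row must be read to the sense amplifier
ROW_HIT_CYCLES = 2   # example cost when the row is already in the sense amplifier


def total_cycles(accesses) -> int:
    """Sum read cycles for a sequence of (bank, row) accesses."""
    open_rows = {}  # bank -> row currently held by that bank's sense amplifier
    cycles = 0
    for bank, row in accesses:
        if open_rows.get(bank) == row:
            cycles += ROW_HIT_CYCLES
        else:
            cycles += ROW_MISS_CYCLES
            open_rows[bank] = row
    return cycles


if __name__ == "__main__":
    same_row = [(0, 5)] * 4
    alternating = [(0, 5), (0, 6), (0, 5), (0, 6)]
    print(total_cycles(same_row), total_cycles(alternating))
```

In this model, four reads to one row cost 6 + 2 + 2 + 2 = 12 cycles, while four reads that alternate between two rows of the same bank cost 4 × 6 = 24 cycles.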

[0058] Next, the write operation will be described.

[0059] In each bank 520, when the bank is designated by the bank address, the 2^(LC+LW) bytes of row data corresponding to the row address are sent to the sense amplifier 523.

[0060] The write data input to the UM 140 is input to each module 500, and, of the row data held in the sense amplifier 523 of the bank designated by the bank address, the 2^LW bytes of data selected by the column address are overwritten with the write data.

[0061] As with reads, data that has already been read out to the sense amplifier 523 of a bank can be accessed quickly (for example, in one cycle), so it is desirable that data with high locality be read out to the sense amplifier together.

[0062] In the following, the UM 140 is assumed to have the configuration LM = 2, LB = 4, LR = 8, LC = 4, and LW = 4. That is, the UM 140 is composed of 4 (= 2²) independent modules 500. Each module 500 has 16 (= 2⁴) banks 520, and each bank 520 has 2⁸ × 2⁴ memory cells 524. Each memory cell 524 stores 2⁴ bytes of data. In this case, each module 500 outputs 16 (= 2⁴) bytes of data, so the output of the UM 140 is 4 × 16 bytes = 64 bytes (= 512 bits).
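The figures in this paragraph follow directly from the five exponents. The short calculation below reproduces them and also derives the row size and the total capacity implied by these parameters; the variable names are illustrative.

```python
# Exponents of the example configuration.
LM, LB, LR, LC, LW = 2, 4, 8, 4, 4

modules = 1 << LM                         # 4 modules
banks_per_module = 1 << LB                # 16 banks per module
cells_per_bank = (1 << LR) * (1 << LC)    # 256 rows x 16 columns = 4096 cells
bytes_per_cell = 1 << LW                  # 16 bytes per memory cell
row_bytes = 1 << (LC + LW)                # 256 bytes held by one sense amplifier
output_bytes = modules * bytes_per_cell   # 64 bytes per UM access
output_bits = output_bytes * 8            # 512 bits
total_bytes = 1 << (LM + LB + LR + LC + LW)  # capacity implied by the exponents

if __name__ == "__main__":
    print(modules, banks_per_module, cells_per_bank, bytes_per_cell)
    print(row_bytes, output_bytes, output_bits, total_bytes)
```

The exponents sum to 22, so the capacity implied by this configuration is 2^22 bytes (4 MiB).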

[0063] Next, the images handled in this embodiment will be described.

[0064] FIG. 3 is a diagram showing the hierarchical structure of a 512 × 512 pixel image handled in this embodiment.

[0065] Image data is stored in memory in a form that corresponds to this hierarchical division. In practice, the hierarchy corresponds to addresses in memory, and this correspondence is called address mapping.

[0066] As shown in the figure, in this embodiment one 512 × 512 pixel image is composed of 8 × 32 blocks, and each block is composed of 4 × 4 cells.

[0067] Each cell is composed of 16 × 4 pixels. Each pixel in turn consists of four 1-byte components: R (red), G (green), B (blue), and α (transparency). That is, one pixel consists of 4 bytes = 32 bits of data. One 512 × 512 pixel image therefore consists of 1 MB of data.
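The image hierarchy can be checked arithmetically. The sketch below multiplies out the block, cell, and pixel dimensions and adds a hypothetical helper, `locate`, that splits a pixel coordinate into its block, cell, and in-cell position. It assumes that each "a × b" size is given as horizontal × vertical, which is consistent with the 512 × 512 total.

```python
BLOCKS_X, BLOCKS_Y = 8, 32   # blocks per image (horizontal, vertical)
CELLS_X, CELLS_Y = 4, 4      # cells per block
CELL_W, CELL_H = 16, 4       # pixels per cell
BYTES_PER_PIXEL = 4          # R, G, B, alpha

width = BLOCKS_X * CELLS_X * CELL_W      # image width in pixels
height = BLOCKS_Y * CELLS_Y * CELL_H     # image height in pixels
cell_bytes = CELL_W * CELL_H * BYTES_PER_PIXEL
block_bytes = CELLS_X * CELLS_Y * cell_bytes
image_bytes = BLOCKS_X * BLOCKS_Y * block_bytes


def locate(px: int, py: int) -> dict:
    """Split a pixel coordinate into block, cell, and in-cell coordinates."""
    block_x, rem_x = divmod(px, CELLS_X * CELL_W)
    block_y, rem_y = divmod(py, CELLS_Y * CELL_H)
    cell_x, in_x = divmod(rem_x, CELL_W)
    cell_y, in_y = divmod(rem_y, CELL_H)
    return {"block": (block_x, block_y), "cell": (cell_x, cell_y), "pixel": (in_x, in_y)}


if __name__ == "__main__":
    print(width, height, cell_bytes, block_bytes, image_bytes)
    print(locate(100, 37))
```

One cell is 256 bytes, which equals four 64-byte lines, and one block is 4 KiB.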

[0068] Next, the address mapping used when storing such image data in the UM 140 will be described.

[0069] FIG. 4 is a diagram showing an example of the address mapping used when storing image data in the UM 140.

[0070] Here, a 4 MB memory region of the UM 140 (hereinafter, the image region) is assumed to be used for storing image data. In this case, the image region is accessed with a 22-bit address.

[0071] The example of FIG. 4 shows the address mapping between this 22-bit address and, in the UM 140, the 2-bit module address (M[1:0]), the 4-bit bank address (B[3:0]), the 8-bit row address (R[7:0]), the 4-bit column address (C[3:0]), and the 4-bit byte address (W[3:0]).

[0072] As described above, one 512 × 512 pixel image is 1 MB, so the leading 2 bits represent the start address of the image to be accessed within the image region. These 2 bits are used as B[3] and B[2]. Here, the notation B[2] denotes bit 2 of the bank address, where the least significant bit of B is B[0].

[0073] The next 8 bits give the start address of the block to be accessed within the image selected by the two most significant bits. The upper 5 bits are the vertical address Y within the image, and the lower 3 bits are the horizontal address X. These 8 bits are used as the row address R[7] to R[0].

[0074] Similarly, the next 4 bits give the start address of the cell to be accessed within the selected block. The upper 2 bits are the vertical address Y, and the lower 2 bits are the horizontal address X. These 4 bits are used as B[1], C[3], B[0], and C[2].

[0075] The last 8 bits are the address within the selected cell. The upper 2 of these bits are the start address of a line within the cell, where a line is the 64-byte unit of data output by the UM 140. The remaining 6 bits are the byte address within the line. Because the UM 140 is accessed in units of lines, this 6-bit in-line byte address does not need to be input to the UM 140.
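
The field layout described for FIG. 4 can be sketched as a bit-slicing routine. This is only an illustration under stated assumptions: the four cell-address bits are taken to map to B[1], C[3], B[0], C[2] in most-significant-first order, and the two line bits are taken to form C[1:0], which the text implies but does not state outright. The function and key names are hypothetical.

```python
def split_address(addr):
    """Split a 22-bit image-area address into the fields described above."""
    if not 0 <= addr < (1 << 22):
        raise ValueError("address must fit in 22 bits")
    return {
        "image": (addr >> 20) & 0x3,     # 2 bits: which 1-Mbyte image
        "block_y": (addr >> 15) & 0x1F,  # 5 bits: block row in the image
        "block_x": (addr >> 12) & 0x7,   # 3 bits: block column in the image
        "cell_y": (addr >> 10) & 0x3,    # 2 bits: cell row in the block
        "cell_x": (addr >> 8) & 0x3,     # 2 bits: cell column in the block
        "line": (addr >> 6) & 0x3,       # 2 bits: 64-byte line in the cell
        "byte": addr & 0x3F,             # 6 bits: byte within the line
    }


def to_um_fields(addr):
    """Assumed FIG. 4 mapping to bank, row, and column addresses."""
    f = split_address(addr)
    # B[3:2] = image, B[1] = cell_y[1], B[0] = cell_x[1]
    bank = (f["image"] << 2) | ((f["cell_y"] >> 1) << 1) | (f["cell_x"] >> 1)
    # R[7:3] = block Y, R[2:0] = block X
    row = (f["block_y"] << 3) | f["block_x"]
    # C[3] = cell_y[0], C[2] = cell_x[0], C[1:0] = line (assumption)
    column = ((f["cell_y"] & 1) << 3) | ((f["cell_x"] & 1) << 2) | f["line"]
    return {"bank": bank, "row": row, "column": column}
```

The 6-bit in-line byte address is deliberately left out of `to_um_fields`, since the text says it is not input to the UM 140.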

[0076] Next, the assignment of addresses to the pixels within a cell under the address mapping of FIG. 4 is described in detail.

[0077] FIG. 5 illustrates a storage scheme for storing the image data of one cell (16 × 4 pixels) in memory.

[0078] As shown in the figure, a two-dimensional address is assigned to each group of four horizontally adjacent (X-direction) pixels in the cell. The address is written in the form (Y, X), with Y as the first coordinate and X as the second. Hereinafter, each group of four pixels that has such a two-dimensional address is called a pack.

[0079] If packs (0,0) to (3,3) are stored in the UM 140 in the order (0,0) to (0,3), (1,0) to (1,3), (2,0) to (2,3), (3,0) to (3,3), then under the address assignment of FIG. 4 the four packs with the same X coordinate are stored in the same module (module address: X).

[0080] That is, packs (0,0), (1,0), (2,0), and (3,0) are stored in module 0; packs (0,1), (1,1), (2,1), and (3,1) in module 1; packs (0,2), (1,2), (2,2), and (3,2) in module 2; and packs (0,3), (1,3), (2,3), and (3,3) in module 3.

[0081] In this case, the four packs with the same Y coordinate (for example, packs (0,0), (0,1), (0,2), and (0,3)) are stored in separate modules 500, so 16 horizontally adjacent pixels can be accessed simultaneously. However, as noted above, the four packs with the same X coordinate (for example, packs (0,0), (1,0), (2,0), and (3,0)) are stored in the same module 500, so a 4 × 4 pixel region cannot be accessed simultaneously. This arrangement is therefore suited to linear access but not to tile access.
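
The module conflict described here can be made concrete with a small sketch. It assumes four modules and models the FIG. 5 scheme as "module number equals the pack's X coordinate"; the helper names are illustrative.

```python
def module_row_major(y, x):
    """FIG. 5 style placement: the module is chosen by X alone."""
    return x


def conflict_free(packs, module_of):
    """True if every pack in the group lives in a different module."""
    modules = [module_of(y, x) for (y, x) in packs]
    return len(set(modules)) == len(modules)


# Linear access: four packs with the same Y (16 pixels in a row).
linear_group = [(0, x) for x in range(4)]
# Tile access: four packs with the same X (a 4 x 4 pixel tile).
tile_group = [(y, 0) for y in range(4)]
```

With this placement `conflict_free(linear_group, module_row_major)` holds while the tile group collides in a single module, which is the limitation the text points out.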

[0082] On the other hand, if packs (0,0) to (3,3) are stored in the UM 140 in the order (0,0) to (3,0), (0,1) to (3,1), (0,2) to (3,2), (0,3) to (3,3), then under the address assignment of FIG. 4 the four packs with the same Y coordinate are stored in the same module (module address: Y). This arrangement is suited to tile access but not to linear access.

[0083] To suit both linear access and tile access, the following condition must hold: "within one cell, all packs with the same X coordinate are stored in different modules, and all packs with the same Y coordinate are stored in different modules."

[0084] FIG. 6 shows a storage scheme that satisfies this condition. In the figure, the four packs drawn in one vertical (Y-direction) column are stored in the same module. That is, packs (0,0), (1,3), (2,2), and (3,1) are stored in module 0; packs (0,1), (1,0), (2,3), and (3,2) in module 1; packs (0,2), (1,1), (2,0), and (3,3) in module 2; and packs (0,3), (1,2), (2,1), and (3,0) in module 3.

[0085] In FIG. 6, the packs in row 0 (Y = 0) are arranged with X coordinates 0, 1, 2, 3, whereas the packs in row 1 (Y = 1) are arranged with the sequence 0, 1, 2, 3 shifted by one position, that is, 3, 0, 1, 2. Rows 2 and 3 are likewise each shifted by one further position.

[0086] Storing the packs in this way satisfies the condition "within one cell, all packs with the same X coordinate are stored in different modules, and all packs with the same Y coordinate are stored in different modules," so linear access and tile access can both be supported.
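
The condition can be checked mechanically against the pack listing given for FIG. 6. The closed form `(x + y) % 4` used below is an inference from that listing (each row rotated by its Y coordinate), not a formula stated in the patent, and the names are illustrative.

```python
# Pack-to-module listing copied from the description of FIG. 6; packs are (Y, X).
FIG6_MODULES = {
    0: [(0, 0), (1, 3), (2, 2), (3, 1)],
    1: [(0, 1), (1, 0), (2, 3), (3, 2)],
    2: [(0, 2), (1, 1), (2, 0), (3, 3)],
    3: [(0, 3), (1, 2), (2, 1), (3, 0)],
}


def module_skewed(y, x):
    """Hypothetical closed form: rotate each row by its Y coordinate."""
    return (x + y) % 4


def listing_matches_formula():
    """Compare the closed form with every entry of the listing."""
    return all(
        module_skewed(y, x) == module
        for module, packs in FIG6_MODULES.items()
        for (y, x) in packs
    )


def rows_and_columns_conflict_free():
    """Check that each row and each column of packs uses four modules."""
    rows = [[(y, x) for x in range(4)] for y in range(4)]
    cols = [[(y, x) for y in range(4)] for x in range(4)]
    return all(
        len({module_skewed(y, x) for (y, x) in group}) == 4
        for group in rows + cols
    )
```

Both checks pass for the listing as given, so every row and every column of packs is spread over all four modules.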

[0087] FIG. 7 shows the address assignment used when packs are stored in this way.

[0088] As shown in the figure, the address assignment is almost the same as in FIG. 4. The difference is that the line selection address within the cell no longer serves directly as column address bits and is replaced by a new 2-bit line selection address L[1], L[0]. This is because a different column address must be specified for each module when a line is selected.

[0089] The address shown in FIG. 7 must therefore be converted before the memory cells are finally accessed. Because the conversion method differs between linear access mode and tile access mode, the conversion must take the access mode selection signal into account. The address conversion unit 131 performs this address conversion.

[0090] Furthermore, data stored in this shifted form is ordered differently from the order in which a processing unit accesses it, so the data read from the UM 140 must be rearranged before it is passed to the processing unit. The data aligner unit 132 performs this rearrangement.

[0091] Next, the methods of address conversion and data rearrangement are described.

[0092] FIG. 8 shows, for linear access mode, the address conversion result and the data alignment that correspond to each input address (line selection address).

[0093] In the table of FIG. 8, the first column gives the value of the line selection address and the second column gives the module number (module address). For each combination, the table gives the column address (third column), the coordinates of the pack stored in each module of the UM 140 (fourth column), the pack coordinates after rearrangement into the original pixel order of the image (fifth column), and the permutation that puts the packs in the correct order (sixth column).

[0094] The symbols S1 and S2 in the sixth column denote specific permutations. S1 is the cyclic permutation that transforms the sequence (0, 1, 2, 3) into (1, 2, 3, 0). S2 is the cyclic permutation that transforms (0, 1, 2, 3) into (2, 3, 0, 1), that is, S1*S1, the result of applying S1 twice. The symbol 1 denotes the identity permutation, which leaves the sequence unchanged.

[0095] Regarding the address conversion in FIG. 8, the column address [C1, C0] is equal to the value of the line selection address.

[0096] The permutations shown in the figure are those applied on reads, that is, when rearranging the data from the order in which it is stored in the modules 500 into the correct order (the original pixel order). On writes, the inverse of the permutation in the sixth column is applied. The inverse of 1 is 1, the inverse of S1 is S1*S2, the inverse of S2 is S2, and the inverse of S1*S2 is S1.
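
The permutation identities above are easy to confirm with a sketch that models S1 and S2 as rotations of a 4-element tuple. The function names are illustrative only.

```python
IDENTITY = (0, 1, 2, 3)


def s1(seq):
    """Cyclic permutation S1: (0, 1, 2, 3) -> (1, 2, 3, 0)."""
    return tuple(seq[1:]) + tuple(seq[:1])


def s2(seq):
    """Cyclic permutation S2 = S1*S1: (0, 1, 2, 3) -> (2, 3, 0, 1)."""
    return s1(s1(seq))


def s1s2(seq):
    """Composite S1*S2, a rotation by three positions."""
    return s2(s1(seq))


def is_inverse_pair(p, q):
    """True if applying p and then q restores the original order."""
    return q(p(IDENTITY)) == IDENTITY and p(q(IDENTITY)) == IDENTITY
```

Because S1 and S2 are rotations they commute, so the order in which the two are written in S1*S2 does not affect the result.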

[0097] FIG. 9 shows, for tile access mode, the address conversion result and the data alignment that correspond to each input address (line selection address).

[0098] The table of FIG. 9 has the same structure as the table of FIG. 8. The first column gives the value of the line selection address and the second column gives the module number. For each combination, the table gives the column address (third column), the coordinates of the pack stored in each module of the UM 140 (fourth column), the pack coordinates after rearrangement into the original pixel order of the image (fifth column), and the permutation that puts the packs in the correct order (sixth column).

[0099] Regarding the address conversion in FIG. 9, the column address [C1, C0] is the module number minus the value of the line selection address, computed in 2-bit arithmetic (that is, modulo 4).

[0100] Note that the sixth columns of FIGS. 8 and 9 contain exactly the same permutations. In this case, the data aligner unit 132 does not need the access mode selection signal.
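
The two conversion rules and the shared permutation can be exercised together in a small read simulation. This is a sketch under assumptions: packs are placed with the inferred rule module = (X + Y) mod 4, the low column bits [C1, C0] are taken to select the pack row Y inside a cell, and the aligner is modeled as a left rotation by the line selection address. None of the names come from the patent.

```python
LINEAR, TILE = "linear", "tile"


def module_of(y, x):
    """Assumed FIG. 6 placement of pack (Y, X)."""
    return (x + y) % 4


def column_low_bits(mode, line, module):
    """[C1, C0] seen by one module for the given mode and line address."""
    if mode == LINEAR:
        return line & 0x3            # same value for every module
    return (module - line) & 0x3     # module number minus line, 2-bit


def build_cell():
    """memory[module][column] holds the pack (Y, X) stored there."""
    memory = [[None] * 4 for _ in range(4)]
    for y in range(4):
        for x in range(4):
            memory[module_of(y, x)][y] = (y, x)
    return memory


def read_line(mode, line):
    """Read one 4-pack line and realign it into natural order."""
    memory = build_cell()
    raw = [memory[m][column_low_bits(mode, line, m)] for m in range(4)]
    return raw[line:] + raw[:line]   # rotation by `line`, mode independent
```

In this model `read_line(LINEAR, y)` returns row y of the cell and `read_line(TILE, x)` returns column x, with the same rotation applied in both modes.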

[0101] In general, however, when a storage scheme is adopted under the condition "within a cell, all packs with the same X coordinate are stored in different modules, and all packs with the same Y coordinate are stored in different modules," different permutations may be required depending on the mode. In that case, the data aligner unit 132 applies different permutations according to the access mode selection signal.

[0102] Next, the connections between the address conversion unit 131, which performs the address conversion described above, and the modules 500 in the UM 140 are described.

[0103] FIG. 10 shows the connections between the address conversion unit 131 in the memory interface unit 130 and the modules 500 in the UM 140.

[0104] As shown in the figure, the address conversion unit 131 supplies the upper two bits [C3, C2] of the column address to all modules 500 in common, and supplies the lower two bits [C1, C0] of the column address to each module 500 individually.

[0105] An address and an access mode selection signal are input to the memory interface unit 130 through the memory bus 150 from the processing unit that has been granted access. Of the address bits passed to the memory interface unit 130, the figure shows only the upper two bits [C3, C2] of the column address and the two bits [L1, L0] of the line selection address. The address bits not shown in the figure are broadcast to all modules 500 at predetermined timings as the bank address and the row address.

[0106] Of the input address, the memory interface unit 130 broadcasts the upper two bits of the column address to every module 500. Based on the 2-bit line selection address and the access mode selection signal, it generates the lower two bits of the column address as shown in FIGS. 8 and 9. Because these lower two bits differ from module to module, they are distributed to each module 500 individually. Each module 500 uses the resulting 4-bit column address to select the data to be output from the sense amplifier 123.

[0107] In the embodiment described above, the address conversion unit 131 is provided in the memory interface unit 130, but an address conversion unit 131 may instead be provided in each module 500.

[0108] FIG. 11 shows an example in which an address conversion unit 131 is placed in each module 500. As shown in the figure, each module 500 includes an address conversion unit 131, and each address conversion unit 131 includes a module address register (Mreg) 1400. The Mreg 1400 is a register that stores the module address (module number) of its module. For example, "0" is set in the Mreg 1400 of module 0, "1" in the Mreg 1400 of module 1, "2" in the Mreg 1400 of module 2, and "3" in the Mreg 1400 of module 3. The value of the Mreg 1400 may be fixed or variable.

[0109] In the case of FIG. 11, the memory interface unit 130 broadcasts the address received via the memory bus 150 to all modules 500 at predetermined timings.

[0110] The address conversion unit 131 of each module 500 generates the lower two bits of the column address from the module address stored in its Mreg 1400 and from the line selection address and access mode selection signal supplied by the memory interface unit 130.
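
A per-module converter of the kind described for FIG. 11 might look like the following sketch. The class name, method name, and boolean `tile_mode` flag are hypothetical; the conversion rules themselves are the ones stated for FIGS. 8 and 9.

```python
class ModuleAddressConverter:
    """Hypothetical model of the address conversion unit inside one module."""

    def __init__(self, mreg):
        # Mreg holds this module's own module number (0 to 3).
        self.mreg = mreg & 0x3

    def column_address(self, c_high, line, tile_mode):
        """Combine broadcast bits [C3, C2] with locally generated [C1, C0]."""
        if tile_mode:
            low = (self.mreg - line) & 0x3   # module number minus line, 2-bit
        else:
            low = line & 0x3                 # linear mode: line address as is
        return ((c_high & 0x3) << 2) | low


# The same broadcast inputs reach all four modules.
converters = [ModuleAddressConverter(m) for m in range(4)]
tile_columns = [c.column_address(0b10, 1, True) for c in converters]
linear_columns = [c.column_address(0b10, 1, False) for c in converters]
```

Only the stored module number differs between the four converters, which is why the memory interface can simply broadcast the full address in this arrangement.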

[0111] The Mreg 1400 is provided only to inform the address conversion unit 131 in each module 500 of that module's module address (module number). A signal indicating the module address of each module 500 may therefore simply be supplied to the address conversion unit 131 of that module instead.

[0112] Next, the configuration of the data aligner unit 132 is described.

[0113] FIG. 12 shows a configuration example of the data aligner unit 132.

[0114] For simplicity, only the data aligner unit 132 for the memory read direction is shown here. The data aligner unit for the memory write direction can be built in the same way as for the read direction, by cascading two stages of cyclic permutation.

[0115] As shown in FIG. 12(a), the data aligner unit 132 includes an S1 unit 1500 and an S2 unit 1510. The data aligner unit 132 operates as shown in FIGS. 8 and 9 according to the line selection signals L0 and L1. The S1 unit 1500 and the S2 unit 1510 perform the permutations S1 and S2, respectively, shown in the sixth column of FIGS. 8 and 9.

[0116] As shown in FIG. 12(b), the S1 unit 1500 includes selectors 1501 to 1504. Depending on whether the selection signal L0 (line selection address L[0]) is "0" or "1", each of the selectors 1501 to 1504 selects and outputs the selector input labeled with the subscript 0 or 1, respectively. That is, when L0 = "1", the S1 unit 1500 cyclically permutes the sequence 3, 0, 1, 2 into 0, 1, 2, 3.

[0117] Similarly, as shown in FIG. 12(c), the S2 unit 1510 includes selectors 1511 to 1514. Depending on whether the selection signal L1 (line selection address L[1]) is "0" or "1", each of the selectors 1511 to 1514 selects and outputs the selector input labeled with the subscript 0 or 1, respectively. That is, when L1 = "1", the S2 unit 1510 cyclically permutes the sequence 2, 3, 0, 1 into 0, 1, 2, 3.
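
The two-stage selector structure can be sketched in software as two conditional rotations, one controlled by each bit of the line selection address. This models the read direction only, and the function names are illustrative.

```python
def s1_stage(packs, l0):
    """S1 unit: rotate by one position when L0 is 1, else pass through."""
    return packs[1:] + packs[:1] if l0 else list(packs)


def s2_stage(packs, l1):
    """S2 unit: rotate by two positions when L1 is 1, else pass through."""
    return packs[2:] + packs[:2] if l1 else list(packs)


def align(packs, line):
    """Cascade both stages, driven by the 2-bit line selection address."""
    l0 = line & 1
    l1 = (line >> 1) & 1
    return s2_stage(s1_stage(list(packs), l0), l1)
```

Cascading a rotate-by-one stage and a rotate-by-two stage covers all four rotation amounts with two layers of 2-input selectors, in the same way a barrel shifter is built.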

[0118] One line of data, rearranged as needed by the data aligner unit 132 configured as described above, is stored in the caches 102, 122, and so on.

[0119] FIG. 13 shows the arrangement of the packs held in one cache line.

[0120] FIG. 13(a) shows the cache contents under linear caching when the line selection address is Y.

[0121] FIG. 13(b) shows the cache contents under tile caching when the line selection address is X.

[0122] Next, an image data storage scheme different from the one shown in FIG. 6 is described.

[0123] FIG. 14 shows another image storage scheme in an embodiment of the present invention. In addition to linear access, which simultaneously accesses four packs with the same Y coordinate (16 horizontally adjacent pixels), and tile access, which simultaneously accesses four packs with the same X coordinate (4 × 4 pixels), the storage scheme of FIG. 14 supports a mode that simultaneously accesses a 2 × 2 pack region, that is, an 8 × 2 pixel region. Hereinafter, this mode is called the 8 × 2 access mode.

[0124] In the 8 × 2 access mode, packs (0,0), (0,1), (1,0), and (1,1), for example, can be accessed simultaneously.

[0125] In the figure, the four packs drawn in one vertical (Y-direction) column are stored in the same module. That is, packs (0,0), (1,2), (2,1), and (3,3) are stored in module 0; packs (0,1), (1,3), (2,0), and (3,2) in module 1; packs (0,2), (1,0), (2,3), and (3,1) in module 2; and packs (0,3), (1,1), (2,2), and (3,0) in module 3.
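
The FIG. 14 listing can be checked for all three access patterns with a short sketch. The XOR rule below (module = X xor the 2-bit reversal of Y) is an observation that happens to reproduce the listing; the patent does not state it, and the names are illustrative. The 2 × 2 check covers only groups whose top-left pack has even coordinates.

```python
# Pack-to-module listing copied from the description of FIG. 14; packs are (Y, X).
FIG14_MODULES = {
    0: [(0, 0), (1, 2), (2, 1), (3, 3)],
    1: [(0, 1), (1, 3), (2, 0), (3, 2)],
    2: [(0, 2), (1, 0), (2, 3), (3, 1)],
    3: [(0, 3), (1, 1), (2, 2), (3, 0)],
}
MODULE_OF = {pack: m for m, packs in FIG14_MODULES.items() for pack in packs}

BITREV2 = (0, 2, 1, 3)  # 2-bit bit reversal of Y


def xor_rule(y, x):
    """Hypothetical closed form that reproduces the listing."""
    return x ^ BITREV2[y]


def conflict_free(packs):
    """True if all packs in the group are in different modules."""
    return len({MODULE_OF[p] for p in packs}) == len(packs)


rows = [[(y, x) for x in range(4)] for y in range(4)]            # linear
cols = [[(y, x) for y in range(4)] for x in range(4)]            # tile
quads = [[(y, x), (y, x + 1), (y + 1, x), (y + 1, x + 1)]        # 8 x 2
         for y in (0, 2) for x in (0, 2)]
```

Under this listing the unaligned group starting at pack (1,1) falls into only two modules, which is why the sketch limits the 8 × 2 check to aligned groups; the text does not say whether unaligned groups are meant to be supported.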

[0126] FIGS. 15 to 17 show the methods of address conversion and data rearrangement for this scheme.

[0127] The tables of FIGS. 15 to 17 have the same structure as the tables of FIGS. 8 and 9.

[0128] FIG. 15 shows the case in which the access mode is the linear access mode.

[0129] FIG. 16 shows the case in which the access mode is the tile access mode.

[0130] FIG. 17 shows the case in which the access mode is the 8 × 2 access mode.

[0131] The permutation columns of FIGS. 15 to 17 contain entries such as "0⇔2" and "2⇔3". These denote the permutations that exchange 0 and 2, and 2 and 3, respectively, within (0, 1, 2, 3); that is, the permutation from (0, 1, 2, 3) to (2, 1, 0, 3) and the permutation from (0, 1, 2, 3) to (0, 1, 3, 2).
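
The exchange notation can be illustrated with a one-function sketch; `swap` is a hypothetical helper name, not something defined in the patent.

```python
def swap(seq, i, j):
    """Return a copy of seq with positions i and j exchanged (i <-> j)."""
    out = list(seq)
    out[i], out[j] = out[j], out[i]
    return tuple(out)


BASE = (0, 1, 2, 3)
SWAP_0_2 = swap(BASE, 0, 2)   # the entry written as 0<->2
SWAP_2_3 = swap(BASE, 2, 3)   # the entry written as 2<->3
```

Unlike the rotations S1 and S2, each exchange is its own inverse, so applying the same exchange on the write path restores the stored order.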

[0132] Next, another embodiment of the present invention is described.

[0133] FIG. 18 shows the configuration of another system LSI to which the present invention is applied.

[0134] As shown in the figure, this system LSI differs from the system LSI shown in FIG. 1 in that the data aligner unit 132 is included in each of the connector units 101, 111, and 121.

[0135] When the data width between each processing unit and its connector unit is no greater than the data width of a pack, it suffices (for a read) to select the pack that contains the required data and pass it to the processing unit. The data aligner unit 132 then effectively becomes a selector, and no rearrangement of data is needed. In this case, the system can be built with less hardware than when the data aligner unit 132 is placed in the memory interface unit 130. Note that the packs are then stored in the caches 102, 122, and so on in the order shown, for example, in the fourth column of FIGS. 8, 9, and 15 to 17.

[0136] Furthermore, the address conversion unit 131 may also be included in each of the connector units 101, 111, and 121. In this case, part of the address that each processing unit sends to the memory interface unit 130 differs from module to module. That is, for part of the address, each processing unit passes a separate address for each module to the memory interface unit 130. Of the addresses passed from each processing unit, the memory interface unit 130 sends the module-specific addresses to each module individually and broadcasts the remaining addresses to all modules.

[0137] Finally, an example of how the memory area is used according to the present invention in a system that runs general application programs is described.

[0138] FIG. 19 shows a usage example of the memory area of the UM 140 to which the present invention is applied.

[0139] In this example, the UM 140 is divided into an area 1900 that applications running on the CPU 100 access directly and an image area 1910 that stores display images, textures, and the like. When a general application registers an image as a texture or performs video input, it always does so through a standard library (a collection of functions), and only the drivers of these libraries (the implementations of the library functions) are permitted to access the image area 1910. When accessing the image area 1910, a driver follows a storage scheme that allows linear access and tile access, such as those shown in FIGS. 6 and 14.

[0140] With this arrangement, supplying the library drivers together with a new system allows different access methods (for example, linear access and tile access) to coexist in the image area 1910 without changing application programs or compilers.

[0141] The same applies when audio or other data is handled in addition to images. In a system that runs general application programs, the area accessed by applications running on the CPU is kept separate from the area accessed by resources other than the CPU, such as image and audio units. Linear access (linear caching) and tile access (tile caching) can then both be supported in a specific area, within the area accessed by the non-CPU resources, without changing application programs or compilers.

[0142] [Effects of the Invention] As described in detail above, according to the present invention the same address space can be accessed by different access methods, such as linear access and tile access. As a result, even when processing units with different memory-access localities coexist, each can access memory in the way suited to its locality.

[0143] Consequently, a drop in memory access efficiency can be prevented even when processing units with different localities coexist. In addition, when each processing unit has a cache, a higher hit rate can be expected, which improves processing speed.

[Brief description of the drawings]

FIG. 1 is a block diagram of a system LSI according to the present invention.

FIG. 2 is a block diagram showing the configuration of the unified memory.

FIG. 3 is a diagram illustrating the hierarchical structure of an image.

FIG. 4 is a diagram showing an example of address mapping when image data is stored in memory.

FIG. 5 is a diagram showing an example of a storage scheme for storing an image in memory.

FIG. 6 is a diagram illustrating an image storage scheme according to the present invention.

FIG. 7 is a diagram showing the address mapping used when image data is stored with the image storage scheme according to the present invention.

FIG. 8 is a diagram showing the address conversion result and data alignment for each input address in linear access mode.

FIG. 9 is a diagram showing the address conversion result and data alignment for each input address in tile access mode.

FIG. 10 is a diagram showing the connections between the memory interface unit 130 and the modules 500.

FIG. 11 is a diagram showing an example in which an address conversion unit 131 is placed in each module 500.

FIG. 12 is a block diagram showing the configuration of the data aligner unit.

FIG. 13 is a diagram showing the arrangement of packs in the cache.

FIG. 14 is a diagram illustrating another image storage scheme according to the present invention.

FIG. 15 is a diagram showing the address conversion result and data alignment for each input address in linear access mode.

FIG. 16 is a diagram showing the address conversion result and data alignment for each input address in tile access mode.

FIG. 17 is a diagram showing the address conversion result and data alignment for each input address in 8 × 2 access mode.

FIG. 18 is a block diagram of another system LSI according to the present invention.

FIG. 19 is a diagram showing an example of how the UM is used in a system that runs general application programs.

FIG. 20 is a diagram illustrating the concept of locality.

FIG. 21 is a diagram outlining memory access in the conventional scheme.

[Explanation of symbols]

100 CPU
110 Video input unit
120 Texture mapping unit / filtering unit
101, 111, 121 Connector unit
130 Memory interface unit
131 Address conversion unit
132 Data aligner unit
140 Unified memory (UM)

Continuation of front page

(72) Inventor: Shigeru Matsuo, c/o Hitachi Research Laboratory, Hitachi, Ltd., 7-1-1 Omika-cho, Hitachi-shi, Ibaraki
(72) Inventor: Tetsuya Shimomura, c/o Hitachi Research Laboratory, Hitachi, Ltd., 7-1-1 Omika-cho, Hitachi-shi, Ibaraki
(72) Inventor: Manabu Jo, c/o Hitachi Research Laboratory, Hitachi, Ltd., 7-1-1 Omika-cho, Hitachi-shi, Ibaraki
(72) Inventor: Jun Sato, c/o Semiconductor Group, Hitachi, Ltd., 5-20-1 Josuihon-cho, Kodaira-shi, Tokyo

F-terms (reference): 5B047 AB04 EA09 EB01 EB06 EB13; 5B060 AA13 AC13 GA06 GA11 GA16 MM13

Claims

[Claims]

1. An information processing system comprising: a memory composed of a plurality of modules; a processing unit that accesses the memory; an address conversion unit that converts an address of the memory issued from the processing unit; and a data aligner that rearranges data read from and written to the memory according to an access mode and the address.

2. An information processing system comprising: a memory including N modules, each capable of reading and writing data in data units of a specific size; a processing unit that reads and writes data consisting of N data units to and from the memory; and a memory interface unit that accesses the memory in response to an access request from the processing unit, wherein the memory interface unit determines, according to the access mode, the module in which each data unit is stored and the storage position within that module, so that the N data units received from the processing unit are stored in mutually different modules.

3. An information processing system comprising: a memory having N modules, each capable of reading and writing data in data units of a specific size; a processing unit that reads and writes data consisting of N data units to and from the memory; an address conversion unit that converts an address issued when the processing unit accesses the memory into an individual address for each module according to the access mode; and a data aligner unit that rearranges, according to the access mode, the data units constituting the data exchanged between the processing unit and the memory.
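The interplay between per-module address conversion and data alignment recited above can be illustrated with a minimal Python sketch. It is only one possible reading, not the claimed implementation: the skew rule `module = (i + j) mod N`, the choice of the row index as the in-module address, and all names (`address_convert`, `align_for_write`, `align_for_read`, `Memory`) are assumptions introduced here for illustration. The rule was chosen because it agrees with the abstract, where packs (0, 0) to (0, 3) in linear mode and packs (0, 0) to (3, 0) in tile mode each land in modules 0 to 3.

```python
N = 4  # assumed number of modules, as in the four-module example of the abstract


def address_convert(mode, index):
    """Return the individual address used by each of the N modules.

    Assumption: pack (i, j) lives in module (i + j) % N at address i.
    'linear' reads row `index`; 'tile' reads column `index`.
    """
    if mode == "linear":
        return [index] * N                          # same address in every module
    if mode == "tile":
        return [(m - index) % N for m in range(N)]  # a different address per module
    raise ValueError(f"unknown access mode: {mode}")


def align_for_write(units, index):
    """Rotate the N data units so that lane m carries the unit bound for module m."""
    return [units[(m - index) % N] for m in range(N)]


def align_for_read(lanes, index):
    """Inverse rotation: restore the order the processing unit expects."""
    return [lanes[(k + index) % N] for k in range(N)]


class Memory:
    """N modules, each holding N data units (one N x N block of packs)."""

    def __init__(self):
        self.modules = [[None] * N for _ in range(N)]

    def write(self, mode, index, units):
        addrs = address_convert(mode, index)
        lanes = align_for_write(units, index)
        for m in range(N):
            self.modules[m][addrs[m]] = lanes[m]

    def read(self, mode, index):
        addrs = address_convert(mode, index)
        lanes = [self.modules[m][addrs[m]] for m in range(N)]
        return align_for_read(lanes, index)


mem = Memory()
for i in range(N):  # store the block row by row in linear mode
    mem.write("linear", i, [(i, j) for j in range(N)])
print(mem.read("tile", 2))  # column 2, fetched with one access per module
```

In this simplified scheme the rotation amount happens to depend only on the row or column index, while the access mode decides whether all modules share one address or each receives its own.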

4. The information processing system according to claim 3, wherein the address conversion unit performs address conversion so that, in a two-dimensional array of N × N data units, all data units having the same X coordinate are stored in different modules and all data units having the same Y coordinate are stored in different modules, and the data aligner unit rearranges the data units according to the address conversion performed by the address conversion unit.
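The placement condition for the N × N array can be checked mechanically. The sketch below assumes a skewed mapping `(x + y) mod N`, which is one mapping that satisfies the condition and is not necessarily the mapping used in the embodiments; the names `skewed`, `naive`, and `satisfies_condition` are hypothetical. A plain `x mod N` interleave is included for contrast, since it places every data unit with the same X coordinate in a single module.

```python
N = 4  # assumed number of modules


def skewed(x, y):
    """Assumed mapping: shift the module number by one for each step in Y."""
    return (x + y) % N


def naive(x, y):
    """Plain interleave on X only, shown for contrast."""
    return x % N


def satisfies_condition(module_of):
    """True if units sharing an X coordinate, and units sharing a Y
    coordinate, always fall in N different modules."""
    same_x = all(len({module_of(x, y) for y in range(N)}) == N for x in range(N))
    same_y = all(len({module_of(x, y) for x in range(N)}) == N for y in range(N))
    return same_x and same_y


print(satisfies_condition(skewed))  # True
print(satisfies_condition(naive))   # False
```

With the skewed mapping, a row of N units and a column of N units can each be fetched with one access per module, which is what makes both access modes conflict-free.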

5. An information processing system comprising: a plurality of processing units having different localities; a unified memory accessed in common by the processing units; a cache unit that temporarily stores data used by each processing unit; a memory interface unit that accesses the unified memory in response to an access request from each processing unit; an address conversion unit that converts an address for accessing the unified memory according to the access mode notified from each processing unit; and a data aligner unit that rearranges data exchanged with the unified memory according to the access mode.

6. The information processing system according to claim 5, wherein the unified memory includes a plurality of modules, and the address conversion unit is provided in each of the modules.

7. An information processing system comprising: a plurality of processing units having different localities; a unified memory accessed in common by the processing units; a cache unit that temporarily stores data used by each processing unit; a memory interface unit that accesses the unified memory in response to an access request from each processing unit; an address conversion unit that converts an address for accessing the unified memory according to the access mode notified from each processing unit; and a unit that is located between the processing unit and the cache unit and selects the data to be read by the processing unit according to the access mode.