JP4519151B2

JP4519151B2 - Cache control circuit

Info

Publication number: JP4519151B2
Application number: JP2007073099A
Authority: JP
Inventors: 勝洋依田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2007-03-20
Filing date: 2007-03-20
Publication date: 2010-08-04
Anticipated expiration: 2027-03-20
Also published as: JP2008234320A

Description

本発明は、一般にメモリ制御回路に関し、詳しくはキャッシュメモリを制御するキャッシュ制御回路に関する。 The present invention generally relates to a memory control circuit, and more particularly to a cache control circuit that controls a cache memory.

コンピュータシステムにおいては一般に、主記憶とは別に小容量で高速なキャッシュメモリが設けられる。主記憶に記憶される情報の一部をキャッシュにコピーしておき、この情報をアクセスする場合には主記憶からではなくキャッシュから読み出すことで、高速な情報の読み出しが可能となる。 In general, in a computer system, a small-capacity and high-speed cache memory is provided separately from the main memory. A part of the information stored in the main memory is copied to the cache, and when this information is accessed, the information is read from the cache instead of the main memory, thereby enabling high-speed information reading.

通信の分野等で使われるプロセッサにおいては、通信方式の要求を満足する高い処理性能と低消費電力という、互いにトレードオフとなる条件が要求される。プロセッサの処理性能とキャッシュのヒット率とには密接な関係があり、ヒット率が高いほどプロセッサは本来の性能を十分に発揮することができる。ヒットとは、キャッシュに格納してある外部メモリへのアクセス履歴に今回のアクセスが当てはまる場合、即ち、アクセス対象のデータがキャッシュに格納されている場合のことである。逆に今回のアクセスがアクセス履歴に当てはまらない場合を、キャッシュのミスヒットと呼ぶ。この場合には新たに履歴を格納するが、新たな履歴を格納する容量がない場合には、履歴を上書き更新する必要がある。 A processor used in the field of communication or the like is required to have a trade-off condition of high processing performance and low power consumption that satisfy the requirements of the communication method. There is a close relationship between the processing performance of the processor and the hit rate of the cache, and the higher the hit rate, the better the processor can exhibit its original performance. A hit is when the current access is applied to the access history to the external memory stored in the cache, that is, when the access target data is stored in the cache. Conversely, when the current access does not apply to the access history, it is called a cache miss. In this case, a new history is stored, but when there is no capacity for storing a new history, the history needs to be overwritten and updated.

一般に、アドレスの全ビットのうちで、所定数の下位ビットがキャッシュのインデックスとなり、それより上位に位置する残りのビットがキャッシュのタグ（履歴）となる。データをアクセスする場合には、アクセス先を示すアドレス中のインデックス部分を用いて、キャッシュ中の対応するインデックスのタグ（履歴）を読み出す。読み出したタグと、アドレス中のタグ部分のビットパターンが一致するか否かを判断する。一致しない場合にはキャッシュミスとなる。一致する場合には、キャッシュヒットとなり、当該インデックスに対応するキャッシュデータ（１キャッシュライン分の所定ビット数のデータ）がアクセスされる。 Generally, out of all bits of the address, a predetermined number of lower bits serve as a cache index, and the remaining bits positioned higher than that serve as a cache tag (history). When accessing data, a tag (history) of the corresponding index in the cache is read using the index portion in the address indicating the access destination. It is determined whether or not the read tag matches the bit pattern of the tag portion in the address. If they do not match, a cache miss occurs. If they match, a cache hit occurs, and the cache data corresponding to the index (data of a predetermined number of bits for one cache line) is accessed.

各キャッシュラインに対して１つだけタグを設けたキャッシュの構成を、ダイレクトマッピング方式と呼ぶ。各キャッシュラインに対してＮ個のタグを設けたキャッシュの構成をＮウェイセットアソシアティブと呼ぶ。Ｎウェイのキャッシュでは、１つのキャッシュラインに対して、Ｎ個のデータを格納することができる。 A cache configuration in which only one tag is provided for each cache line is called a direct mapping method. A cache configuration in which N tags are provided for each cache line is called an N-way set associative. In the N-way cache, N data can be stored in one cache line.

キャッシュサイズが大きければ大きいほどヒット率は増加する。しかしキャッシュはコアの近くで高速に動作するので、キャッシュサイズを大きくすると消費電力が大きくなってしまうという問題点がある。プロセッサ全体の消費電力に対するキャッシュの消費電力の比率は高いことが知られている。 The larger the cache size, the higher the hit rate. However, since the cache operates at high speed near the core, there is a problem that the power consumption increases when the cache size is increased. It is known that the ratio of the power consumption of the cache to the power consumption of the entire processor is high.

一般にアプリケーションの状態によって、必要なキャッシュの容量は異なる。そこで、キャッシュの起動停止をウェイ単位で制御できる機構を設け、駆動ウェイ数を変化させてキャッシュ容量及び消費電力を制御する技術がある（特許文献１）。この技術では、キャッシュのミスヒット率を常に測定しておき、ミスヒット率が高くなったらウェイを起動させ、低くなったらウェイを順次停止させる。これにより、動的にキャッシュサイズを必要最小限にすることで、性能の劣化を防ぎつつ低消費電力化をはかっている。 Generally, the required cache capacity varies depending on the state of the application. Therefore, there is a technique for controlling the cache capacity and power consumption by changing the number of drive ways by providing a mechanism that can control the start and stop of the cache in units of ways (Patent Document 1). In this technique, the cache miss hit rate is always measured, and when the miss hit rate becomes high, the way is started, and when the miss hit rate becomes low, the way is sequentially stopped. As a result, by dynamically minimizing the cache size to the minimum necessary, power consumption is reduced while preventing performance degradation.

しかし、測定項目がミスヒット率だけでは、以下に説明するように問題がある。例えば現在のキャッシュのサイズ（現在の駆動ウェイ数）が現在のアプリケーションの状態に対して最適でヒット率が高い場合、測定されるミスヒット率が低い状態が維持されるので、ウェイを停止するのが適切と判断されウェイが１つ停止されてしまう。しかし１つウェイが停止されると現在のキャッシュのサイズ（現在の駆動ウェイ数）が十分ではなくなり、測定されるミスヒット率が高い状態となる。その結果、ウェイを起動するのが適切と判断されウェイが１つ起動される。このようにウェイの停止・起動が繰り返し実行されることになり、性能が低下してしまう。この現象は、ミスヒット率のみで現在のウェイ数の是非を判断し、キャッシュがどの程度使われているかの情報（埋まり具合や使用率）を無視したためにおこると考えられる。
特開平９−５０４０１号公報 However, if the measurement item is only the miss-hit rate, there is a problem as described below. For example, if the current cache size (current number of drive ways) is optimal for the current application state and the hit ratio is high, the measured miss-hit ratio remains low, so the way is stopped. Is determined to be appropriate and one way is stopped. However, if one way is stopped, the current cache size (current number of drive ways) is not sufficient, and the measured miss-hit rate is high. As a result, it is determined that it is appropriate to activate the way, and one way is activated. In this way, the way is stopped and started repeatedly, and the performance deteriorates. This phenomenon is considered to be caused by determining whether the current number of ways is right or wrong based only on the miss hit rate, and ignoring information on how much the cache is used (filling level and usage rate).
Japanese Patent Laid-Open No. 9-50401

以上を鑑みて、本発明は、現在のアプリケーションの状態に対して適切なサイズにキャッシュ容量を制御可能なキャッシュ制御回路を提供することを目的とする。 In view of the above, an object of the present invention is to provide a cache control circuit capable of controlling the cache capacity to an appropriate size for the current application state.

複数のウェイを含むセットアソシアティブ方式のキャッシュメモリを制御するキャッシュ制御回路は、該キャッシュメモリの該複数のウェイのうち有効データが占めている割合に応じたデータ占有割合と、該キャッシュメモリがアクセスされた頻度に応じた使用率と、該キャッシュメモリへのアクセスのミスの割合に応じたミス率とを求める測定部と、該データ占有割合と、該使用率と、該ミス率とに応じて該キャッシュメモリの該複数のウェイ毎の起動及び停止を制御する制御部を含むことを特徴とする。 A cache control circuit for controlling a set associative cache memory including a plurality of ways has a data occupancy ratio according to a ratio of valid data in the plurality of ways of the cache memory, and the cache memory is accessed. A measurement unit that obtains a usage rate according to the frequency and a miss rate according to the miss rate of access to the cache memory, the data occupation rate, the usage rate, and the miss rate according to the miss rate. A control unit that controls activation and deactivation of the plurality of ways of the cache memory is included.

本発明の少なくとも１つの実施例によれば、ミス率だけではなく、複数のウェイのうち有効データが占めている割合に応じたデータ占有割合とキャッシュメモリがアクセスされた頻度に応じた使用率とを考慮することにより、現在のアプリケーションの状態に即したウェイ毎の起動制御及び停止制御を行うことが可能となる。これにより、現在のアプリケーションの状態に対して適切なサイズにキャッシュ容量を制御することができる。 According to at least one embodiment of the present invention, not only the miss rate, but also the data occupation rate according to the proportion of valid data in a plurality of ways and the usage rate according to the frequency at which the cache memory is accessed, Considering the above, it becomes possible to perform start control and stop control for each way in accordance with the current application state. As a result, the cache capacity can be controlled to an appropriate size for the current application state.

以下に、本発明の実施例を、添付の図面を用いて詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

図１は、本発明によるキャッシュ装置の構成の一例を示すブロック図である。 FIG. 1 is a block diagram showing an example of the configuration of a cache device according to the present invention.

図１のキャッシュ装置は、タグ格納部１１、データレジスタ部１２、判定ユニット１３、コントローラ１４、アドレス変換部１５、及びセレクタ１６を含む。例えば、論理アドレスが３２ビット長であるとすると、アドレスの最下位４ビットがアドレスオフセットとなり、それより上位に位置する所定数のビットがキャッシュのインデックスとなり、更にそれより上位に位置する残りのビットがキャッシュのタグとなる。 1 includes a tag storage unit 11, a data register unit 12, a determination unit 13, a controller 14, an address conversion unit 15, and a selector 16. For example, if the logical address is 32 bits long, the least significant 4 bits of the address become the address offset, a predetermined number of bits positioned higher than that become the cache index, and the remaining bits positioned higher than that Becomes the tag of the cache.

データをアクセスする場合には、アクセス先を示すアドレス中のインデックス部分（論理アドレスの第７ビット（６）から第１３ビット（１２）の７ビット）を用いて、タグ格納部１１の対応するインデックスのタグを読み出す。このとき、タグ格納部１１の当該インデックスの有効ビットＶが有効値１であるときに限り、タグを有効データとして読み出すようにしてよい。タグの読出しと並行して、アクセス先を示すアドレスの上位ビット（論理アドレス［３１：１３］：論理アドレスの第１４ビット（１３）から第３２ビット（３１））を、アドレス変換部１５により物理アドレス［３１：１３］に変換する。 When accessing data, the index part (7 bits from the 7th bit (6) to the 13th bit (12) of the logical address) in the address indicating the access destination is used to correspond to the corresponding index in the tag storage unit 11. Read the tag. At this time, the tag may be read as valid data only when the valid bit V of the index in the tag storage unit 11 is a valid value 1. In parallel with the reading of the tag, the upper bits (logical address [31:13]: the 14th bit (13) to the 32nd bit (31) of the logical address) of the address indicating the access destination are physically Convert to address [31:13].

このようにしてタグ格納部１１から読み出したタグとアドレス変換部１５から出力される物理アドレス［３１：１３］とを、判定ユニット１３の比較器１３ａにより比較し、ビットパターンが一致するか否かを判断する。比較動作は４ウェイの４つのタグに対して実行され、そのうちの１つについて比較結果が一致を示すと、対応する判定ユニット１３の出力がアサートされる。アクセスがヒットの場合には判定ユニット１３の何れかの出力がアサートされ、アクセスがミスの場合には判定ユニット１３の何れの出力もアサートされない。 The tag read out from the tag storage unit 11 in this manner and the physical address [31:13] output from the address conversion unit 15 are compared by the comparator 13a of the determination unit 13, and whether or not the bit patterns match. Judging. The comparison operation is executed for four tags of four ways, and when the comparison result indicates coincidence for one of them, the output of the corresponding determination unit 13 is asserted. When the access is hit, any output of the determination unit 13 is asserted. When the access is miss, any output of the determination unit 13 is not asserted.

アクセス先の論理アドレスに応じてデータレジスタ部１２から読み出された４つのウェイのデータのうち、判定ユニット１３のアサート出力に対応するウェイのデータがセレクタ１６に供給される。セレクタ１６は、アクセス先の論理アドレス中のオフセット部分が示すワードを選択し、キャッシュからの読み出しデータとする。 Of the four way data read from the data register unit 12 according to the logical address of the access destination, the way data corresponding to the assert output of the determination unit 13 is supplied to the selector 16. The selector 16 selects the word indicated by the offset portion in the logical address of the access destination and sets it as read data from the cache.

コントローラ１４は、キャッシュ管理に関わる種々の制御動作を実行する。例えば、有効ビットＶの設定をしたり、タグの設定をしたり、有効ビットをチェックすることで利用可能なキャッシュラインを検索したり、例えばＬＲＵ（least recently used）アルゴリズム等に基づいて置換対象となるキャッシュラインを選択したり、データレジスタ部１２へのデータ書き込み動作を制御したりする。またコントローラ１４は、本発明によるキャッシュの起動停止制御を実行する。図１に示されるコントローラ１４のブロック構成は、このキャッシュの起動停止制御に関する部分に対応する。 The controller 14 executes various control operations related to cache management. For example, a valid bit V can be set, a tag can be set, an available cache line can be searched by checking a valid bit, and a replacement target can be determined based on, for example, an LRU (least recently used) algorithm. Or a data write operation to the data register unit 12 is controlled. The controller 14 also executes cache start / stop control according to the present invention. The block configuration of the controller 14 shown in FIG. 1 corresponds to the part related to the cache start / stop control.

コントローラ１４は、パワー制御部２１、動的閾値制御部２２、ウェイ数制御部２３、比較部２４、ミス率測定部２５、使用率測定部２６、キャッシュ状態測定部２７、及び設定レジスタ２８を含む。キャッシュ状態測定部２７では以下の２つの状態を随時監視し、使用率表として保持する。 The controller 14 includes a power control unit 21, a dynamic threshold control unit 22, a way number control unit 23, a comparison unit 24, a miss rate measurement unit 25, a usage rate measurement unit 26, a cache state measurement unit 27, and a setting register 28. . The cache state measurement unit 27 monitors the following two states as needed and holds them as a usage rate table.

監視対象の状態の一つめは、キャッシュライン毎の埋まり具合である。例えばキャッシュのウェイが３つ起動しており、あるラインにおいて、これら３つのウェイ全てに有効なデータが入っている場合、このラインの埋まり具合は１であり、これら３つのウェイのうちで有効なデータが入っていないウェイが１つでもある場合、このラインの埋まり具合は０である。このように、ライン毎に全ての起動中のウェイで有効なデータがはいっているときそのラインの埋まり具合を１とし、１つでも有効なデータがないウェイがあるときそのラインの埋まり具合を０と定義する。図２に示すように、使用率表３１には起動中のウェイが示されており、このうち有効データの入っているウェイには１がフラグとして記録されている。起動中のウェイのフラグのＡＮＤをとることで、埋まり具合のデータ１／０が求められる。 The first state to be monitored is the degree of filling for each cache line. For example, if three cache ways are activated and valid data is stored in all three ways on a line, the line is filled with 1, and the valid of these three ways is valid. If there is even one way that does not contain data, the degree of filling of this line is zero. As described above, when valid data is inserted in all active ways for each line, the degree of filling of the line is 1, and when there is a way without any valid data, the degree of filling of the line is 0. It is defined as As shown in FIG. 2, the usage rate table 31 shows the active way, and 1 is recorded as a flag in the way containing valid data. By taking the AND of the flag of the way being activated, the data 1/0 of the filling state is obtained.

監視対象の状態の一つめは、キャッシュライン毎の使用率である。ここで使用率は以下のようにして求める。キャッシュアクセス毎に、アクセスされたキャッシュラインの番号（ラインナンバー）を、図２に示すＦＩＦＯ３２に格納する。ＦＩＦＯ３２に新規格納されたラインナンバーのラインの使用率を、使用率表３１において１増加させる。またＦＩＦＯ３２から出力されるラインナンバーのラインの使用率を、使用率表３１において１減少させる。ＦＩＦＯ３２の長さがＮ＋１であり、過去Ｎ＋１回のキャッシュアクセスのうちｍ回がラインｋに対するアクセスであったとすると、使用率表３１におけるラインｋの使用率はｍとなる。 The first status to be monitored is the usage rate for each cache line. Here, the usage rate is obtained as follows. For each cache access, the number (line number) of the accessed cache line is stored in the FIFO 32 shown in FIG. The usage rate of the line of the line number newly stored in the FIFO 32 is increased by 1 in the usage rate table 31. Further, the usage rate of the line of the line number output from the FIFO 32 is decreased by 1 in the usage rate table 31. If the length of the FIFO 32 is N + 1 and m of the past N + 1 cache accesses are accesses to the line k, the usage rate of the line k in the usage rate table 31 is m.

ミス率測定部２５では、キャッシュアクセス毎にヒットならば０でありミスならば１である数値を生成し、アクセスされたラインの埋まり具合が１ならば予め設定された１より大きい係数Ｃをこの数値に掛けて、その結果をＦＩＦＯ３２に格納する（図２参照）。このＦＩＦＯ３２の格納データの合計値を、ウェイ起動の判断基準のための測定値（ミス率）としてミス率測定部２５から出力する。この出力測定値であるミス率は、過去の所定回のアクセスのうちでミスが多いほど大きくなり、またミスの場合のアクセスされたラインに空きがないほど大きくなる。この出力測定値を、ウェイ起動の判断基準として利用する。係数Ｃとの積を求めることにより、起動検出が従来よりも係数設定値倍早くなる。 The miss rate measurement unit 25 generates a numerical value that is 0 if hit for each cache access and 1 if it is a miss. If the degree of filling of the accessed line is 1, a coefficient C greater than a preset value C is set to 1. Multiply the value and store the result in the FIFO 32 (see FIG. 2). The total value of the data stored in the FIFO 32 is output from the miss rate measuring unit 25 as a measured value (miss rate) for the way activation determination criterion. The miss rate, which is an output measurement value, increases as the number of misses in the past predetermined number of accesses increases, and increases as the accessed line in the case of a miss has no free space. This output measurement value is used as a criterion for way activation. By obtaining the product of the coefficient C, the activation detection is twice as fast as the coefficient set value than the conventional one.

使用率測定部２６は、キャッシュの埋まり具合と使用頻度とから、ウェイを停止するのが適切か否かを判定するための測定値を計算する。図３に示すようにキャッシュの埋まり具合が歯抜けの状態になっている場合、ウェイを停止して使用箇所をつめても、ヒット率にさほど影響を与えることなく電力削減効果を得ることができる。例えば埋まり具合が１（即ち空きがない状態）であるライン０については、使用率が比較的小さいので、ウェイを減らしてしまっても影響は少ないと考えられる。具体的には、ｉをラインナンバーとしてｉについての和である
Σ｛埋まり具合_ｉ×使用率_ｉ｝（式１）
の値が小さい場合には、ウェイを減らすことが妥当であると考えられる。使用率測定部２６は、上記のようにして求めた値を総合使用率として出力する
比較部２４は、ミス率測定部２５からのミス率と使用率測定部２６からの総合使用率とをそれぞれの閾値と比較する。これら閾値は、予めシミュレーションなどで決めた値であり、例えばレジスタ設定等により可変に設定できる値である。比較部２４は、ミス率が閾値を超えたら起動の指示をウェイ数制御部２３へ出す。ミス率が閾値を超えていない状態で総合使用率が閾値以下になった場合、比較部２４は停止の指示をウェイ数制御部２３へ出す。どちらの条件も満たしていない場合、比較部２４は起動及び停止の何れの指示も出さない。 The usage rate measuring unit 26 calculates a measurement value for determining whether or not it is appropriate to stop the way from the degree of cache filling and the usage frequency. As shown in FIG. 3, when the cache is in a missing state, it is possible to obtain a power reduction effect without significantly affecting the hit rate even if the way is stopped and the use location is closed. . For example, with respect to line 0 in which the degree of filling is 1 (that is, there is no vacancy), the usage rate is relatively small, so it is considered that there is little influence even if the ways are reduced. Specifically, i is the sum of i, where i is a line number. Σ {embedding _i × usage _i } (Expression 1)
If the value of is small, it is considered reasonable to reduce the way. The usage rate measurement unit 26 outputs the value obtained as described above as the total usage rate. The comparison unit 24 calculates the error rate from the error rate measurement unit 25 and the total usage rate from the usage rate measurement unit 26, respectively. Compare with the threshold value. These threshold values are values determined in advance by simulation or the like, and are values that can be variably set by register settings, for example. When the error rate exceeds the threshold value, the comparison unit 24 issues an activation instruction to the way number control unit 23. When the total usage rate becomes equal to or less than the threshold value when the error rate does not exceed the threshold value, the comparison unit 24 issues a stop instruction to the way number control unit 23. When neither condition is satisfied, the comparison unit 24 does not give any instruction of starting and stopping.

図４は、ウェイ数制御部２３の構成の一例を示す図である。 FIG. 4 is a diagram illustrating an example of the configuration of the way number control unit 23.

ウェイ数制御部２３は、比較部２４からの起動／停止指示と現在のウェイの起動状況とを基にしてウェイ数を制御する。内部にウェイ使用数測定部３５を持ち、所定のアクセス回数における各ウェイの使用状況を表として保持している。また、ウェイの起動状況をウェイ起動状態表３６として保持する機構をもつ。起動判定は、比較部２４から起動指示があり、かつ停止中のウェイがあり、かつタイマ待ち状態でない場合に制御判定部３９により行われる。制御判定部３９は、起動判定すると、起動状況を更新し、起動状況を起動指示と合わせてパワー制御部２１に送る。タイマ待ち状態とは、前回起動からウェイの電源が安定するまでの時間、及び、ウェイが起動した直後の一回目のアクセスがミスとなり履歴をキャッシュ内部に格納するために必要な時間（αレジスタ３７の格納値が示す時間）、における待ち状態のことである。この時間はシミュレーションなどで予め決めておき、レジスタ３７に可変に設定可能な構成とする。起動判定とともに起動し待ち時間を計時するタイマ３８を内部に持つ。 The way number control unit 23 controls the number of ways based on the activation / stop instruction from the comparison unit 24 and the activation state of the current way. A way usage count measurement unit 35 is provided inside, and the usage status of each way at a predetermined number of accesses is stored as a table. In addition, a way activation state is stored as a way activation state table 36. The activation determination is performed by the control determination unit 39 when there is an activation instruction from the comparison unit 24, there is a way that is stopped, and the timer is not waiting. When the activation determination is made, the control determination unit 39 updates the activation status and sends the activation status to the power control unit 21 together with the activation instruction. The timer wait state is the time from the last activation until the way power is stabilized, and the time required for storing the history in the cache when the first access immediately after the way activation is missed (α register 37 ) (The time indicated by the stored value). This time is determined in advance by simulation or the like, and the register 37 can be variably set. It has a timer 38 that is activated together with the activation determination and measures the waiting time.

ウェイ数制御部２３による停止判定は、起動指示がなく、タイマ待ちではなく、停止指示があり、且つウェイが２つ以上起動している状態の場合に制御判定部３９により行われる。制御判定部３９は、停止の判定が行われたら、測定していたウェイ毎の使用率を基に最も使用率の低いウェイナンバーを停止させ、ウェイ起動状況を更新し、起動状況を停止指示と合わせてパワー制御部２１に送る。 The stop determination by the way number control unit 23 is performed by the control determination unit 39 when there is no activation instruction, there is no timer waiting, there is a stop instruction, and two or more ways are activated. When the stop determination is made, the control determination unit 39 stops the way number having the lowest usage rate based on the measured usage rate for each way, updates the way activation status, and sets the activation status as a stop instruction. Together, it is sent to the power controller 21.

パワー制御部２１では、ウェイ数制御部２３の指示に従って各ウェイの電源のオン／オフを制御する。 The power control unit 21 controls the power on / off of each way according to an instruction from the way number control unit 23.

前記の閾値は、動的に可変制御してもよい。閾値の設定値は、予めシミュレーションなどにより測定されたある程度妥当な値であることを想定している。しかし適切なシミュレーションができない場合等には、適当な推量値に設定するしかなく、その結果、ウェイの起動停止動作が全く起きない設定になってしまう場合が考えられる。このような場合でも閾値をレジスタ設定により変更することができるが、人手によらずに自動で変更したい場合或いはより早く変えたい場合等の補助機能として、動的閾値制御部２２を設けてよい。 The threshold value may be dynamically variably controlled. It is assumed that the set value of the threshold value is a reasonable value measured in advance by simulation or the like. However, when an appropriate simulation cannot be performed, etc., there is no choice but to set an appropriate guess value. As a result, there is a case where the way start / stop operation does not occur at all. Even in such a case, the threshold value can be changed by register setting. However, the dynamic threshold value control unit 22 may be provided as an auxiliary function in the case where it is desired to change the threshold value automatically without relying on human hands or to change it faster.

図５は、動的閾値制御部２２の構成の一例を示す図である。動的制御ＯＮとともにタイマ４０は起動する。カウンタ４１は起動指示数をカウントする。起動閾値の制御の場合、セレクタ４２は、タイマ終了時にカウンタ４１のカウント値が閾値を超えていたら、以前の値（ＦＦの値）に"＋１"した値を選択する。カウンタ４１のカウント値が０の場合には、以前の値（ＦＦの値）に"−１"した値を選択する。カウンタ４１のカウント値が閾値以上でなく且つ０でない場合には、以前の値（ＦＦの値）を選択する。タイマ４０およびカウンタ４１はタイマ終了時にリセットされる。停止閾値の制御動作も同じである。ただし、起動指示数ではなく停止指示数を監視し、閾値の"±"が起動閾値の場合と反対となる。 FIG. 5 is a diagram illustrating an example of the configuration of the dynamic threshold value control unit 22. When the dynamic control is turned on, the timer 40 starts. The counter 41 counts the number of activation instructions. In the case of start threshold control, the selector 42 selects a value that is “+1” to the previous value (FF value) if the count value of the counter 41 exceeds the threshold at the end of the timer. When the count value of the counter 41 is 0, a value obtained by subtracting “−1” from the previous value (FF value) is selected. If the count value of the counter 41 is not equal to or greater than the threshold value and not 0, the previous value (FF value) is selected. The timer 40 and the counter 41 are reset when the timer ends. The control operation for the stop threshold is the same. However, the number of stop instructions is monitored instead of the number of activation instructions, and the opposite is the case when the threshold “±” is the activation threshold.

図６は、本発明によるキャッシュ起動停止制御の流れを示すフローチャートである。このフローチャートに示す処理はキャッシュアクセス毎に行われる。 FIG. 6 is a flowchart showing the flow of cache start / stop control according to the present invention. The processing shown in this flowchart is performed for each cache access.

測定値更新フェーズ（Ｓ１）では、各測定値の計算を行う。図１のキャッシュ状態測定部２７の動作に相当する。 In the measurement value update phase (S1), each measurement value is calculated. This corresponds to the operation of the cache state measuring unit 27 in FIG.

起動条件測定フェーズ（Ｓ２）では、測定値を基にミス率の計算を行い、起動の有無を判定する。図１のミス率測定部２５及び比較部２４の起動関連部分の動作に相当する。 In the activation condition measurement phase (S2), the miss rate is calculated based on the measured value to determine the presence or absence of activation. This corresponds to the operation of the activation-related portion of the error rate measurement unit 25 and the comparison unit 24 in FIG.

停止条件判定フェーズ（Ｓ３）では、測定値を基に総合使用率の計算を行い、その結果を閾値と比較して停止の判定をする。図1の使用率測定部２６及び比較部２４の停止関連部分の動作に相当する。 In the stop condition determination phase (S3), the total usage rate is calculated based on the measured value, and the result is compared with a threshold value to determine stop. This corresponds to the operation of the stop-related portion of the usage rate measurement unit 26 and the comparison unit 24 in FIG.

起動停止フェーズ（Ｓ４）では、ウェイの起動停止判定を行う。図１のウェイ数制御部２３の動作に相当する。 In the start / stop phase (S4), the start / stop determination of the way is performed. This corresponds to the operation of the way number control unit 23 in FIG.

動的閾値制御フェーズ（Ｓ５）では、各閾値を動的に制御する。図１の動的閾値制御部２２の動作に相当する。ここ処理だけは、キャッシュアクセスサイクルではなく、タイマのカウントするクロックサイクルに同期して動作する。なお動的閾値制御フェーズを図６に示す手順の先頭において実行し、制御後の閾値を利用して、図６示す各ステップの処理を実行してもよい。また動的閾値制御機能は、レジスタ設定などによりオン・オフ制御できるようにしてよい。 In the dynamic threshold control phase (S5), each threshold is dynamically controlled. This corresponds to the operation of the dynamic threshold control unit 22 in FIG. This process only operates in synchronization with the clock cycle counted by the timer, not the cache access cycle. Note that the dynamic threshold control phase may be executed at the head of the procedure shown in FIG. 6, and the processing of each step shown in FIG. 6 may be executed using the threshold after control. The dynamic threshold control function may be controlled to be turned on / off by register setting or the like.

パワー制御起動（Ｓ６）及びパワー制御停止（Ｓ７）では、それぞれウェイの起動及び最も使用頻度の低いウェイの停止の動作を行う。図１のパワー制御部２１の動作に相当する。その後、処理を終了し次のキャッシュアクセスを待つ。 In the power control start (S6) and the power control stop (S7), the way activation and the least frequently used way stop operation are performed, respectively. This corresponds to the operation of the power control unit 21 in FIG. Thereafter, the process is terminated and the next cache access is awaited.

図７は、図６のＳ１に対応する各測定値の計算の手順を示すフローチャートである。各ステップの順不同である。埋まり具合更新（Ｓ１）では、起動ウェイのライン毎に有効データがある場合には１、無い場合には０としてＡＮＤをとった結果を表として保持する。使用率更新（Ｓ２）では、保持していた最も古いアクセスラインナンバーに当たるラインの使用数を１減らし、今回アクセスしたラインナンバーの使用数を１増やす。ウェイ毎使用数更新（Ｓ３）では、最も古いアクセス情報が示すウェイの使用数を１減らし、今回アクセスしたウェイの使用数を"＋１"する。 FIG. 7 is a flowchart showing a procedure for calculating each measurement value corresponding to S1 of FIG. The order of each step is random. In the padding state update (S1), the result of ANDing is held as a table when the valid data exists for each line of the activation way, and when there is no valid data, it is 0. In the usage rate update (S2), the number of used lines corresponding to the oldest access line number held is reduced by 1, and the number of used line numbers accessed this time is increased by 1. In the usage number update for each way (S3), the way usage number indicated by the oldest access information is reduced by 1, and the usage number of the way accessed this time is incremented by "+1".

図８は、図６のＳ２に対応する起動条件判定の手順を示すフローチャートである。 FIG. 8 is a flowchart showing a procedure for determining the activation condition corresponding to S2 of FIG.

まず最も古いヒット又はミスの情報を削除する（Ｓ１）。今回のアクセスのヒット／ミスを判定し（Ｓ２）、ヒットなら測定値（ミス率）を±０する（Ｓ３）。Ｓ２でミスし、ミスしたラインの埋まり具合が１（Ｓ４でＹＥＳ）ならば、測定値を＋Ｃする（Ｓ５）。Ｓ２でミスし、ミスしたラインの埋まり具合が０（Ｓ４でＮＯ）ならば測定値を＋１する（Ｓ６）。測定値と閾値との比較を行い（Ｓ７）、閾値を超えたら、起動条件を満たしたと判定し起動停止フェーズへ移行する（Ｓ８）。閾値を超えなかった場合には、停止条件判定フェーズへ移行する（Ｓ９）。 First, the oldest hit or miss information is deleted (S1). The hit / miss of the current access is determined (S2), and if it is a hit, the measured value (miss rate) is set to ± 0 (S3). If the missed line is missed in S2 and the filling level of the missed line is 1 (YES in S4), the measured value is incremented by + C (S5). If the missed line is missed in S2 and the filling level of the missed line is 0 (NO in S4), the measured value is incremented by 1 (S6). The measured value is compared with the threshold value (S7), and if the threshold value is exceeded, it is determined that the activation condition is satisfied, and the process proceeds to the activation stop phase (S8). If the threshold is not exceeded, the process proceeds to the stop condition determination phase (S9).

図９は、図６のＳ３に対応する停止条件判定の手順を示すフローチャートである。 FIG. 9 is a flowchart showing a stop condition determination procedure corresponding to S3 of FIG.

総合使用率を所定の式に従い計算する（Ｓ１）。総合使用率が閾値以上であれば（Ｓ２でＹＥＳ）、起動停止フェーズへ移行する（Ｓ３）。総合使用率が閾値より小さければ（Ｓ２でＮＯ）、処理を終了し次のキャッシュアクセスを待つ（Ｓ４）。 The total usage rate is calculated according to a predetermined formula (S1). If the total usage rate is equal to or greater than the threshold (YES in S2), the process proceeds to the start / stop phase (S3). If the total usage rate is smaller than the threshold (NO in S2), the process is terminated and the next cache access is awaited (S4).

図１０は、図６のＳ４に対応する起動停止動作の手順を示すフローチャートである。 FIG. 10 is a flowchart showing the procedure of the start / stop operation corresponding to S4 of FIG.

起動指示があり（Ｓ１でＹＥＳ）、電源安定＋α待ちのタイマは終了しており（Ｓ２でＹＥＳ）、停止ウェイがある場合（Ｓ３でＹＥＳ）には、電源安定＋α待ちのタイマを起動し（Ｓ４）、ウェイ起動指示とともに動的閾値制御フェーズへ移行する（Ｓ５）。起動指示があっても電源安定＋α待ちのタイマ待ち（Ｓ２でＮＯ）、または停止ウェイが無い場合（Ｓ３でＮＯ）には、指示を出さずに動的閾値制御フェーズへ移行する（Ｓ６）。起動指示がなく（Ｓ１でＮＯ）、停止指示があり（Ｓ７でＹＥＳ）、稼動ウェイ数が２以上の場合（Ｓ８でＹＥＳ）には、停止指示とともに動的閾値制御フェーズへ移行する（Ｓ９）。停止指示がない（Ｓ７でＮＯ）または稼動ウェイ数が１つの場合（Ｓ８でＮＯ）には、指示を出さずに動的閾値制御フェーズへ移行する（Ｓ１０）。 If there is a start instruction (YES in S1), the timer for power stabilization + α waiting has ended (YES in S2), and if there is a stop way (YES in S3), a timer for power stabilization + α waiting is started ( S4), together with the way activation instruction, shifts to the dynamic threshold control phase (S5). Even if there is an activation instruction, if the timer waits for power stabilization + α (NO in S2) or if there is no stop way (NO in S3), the process proceeds to the dynamic threshold control phase without issuing an instruction (S6). If there is no start instruction (NO in S1), there is a stop instruction (YES in S7), and the number of operating ways is 2 or more (YES in S8), the process proceeds to the dynamic threshold control phase together with the stop instruction (S9). . If there is no stop instruction (NO in S7) or the number of operating ways is one (NO in S8), the process proceeds to the dynamic threshold control phase without issuing an instruction (S10).

図１１Ａは、図６のＳ５に対応する動的閾値制御動作の手順を示すフローチャートである。図１１Ａは、起動閾値の制御の手順を示す。 FIG. 11A is a flowchart showing a procedure of the dynamic threshold control operation corresponding to S5 of FIG. FIG. 11A shows a procedure for controlling the activation threshold.

動的制御設定がＯＮ（Ｓ１でＹＥＳ）されたら、変化指令値（何段階閾値を変化させるかを保持した値）をリセットし、タイマが起動する。タイマが動いている間、起動指令をカウントする（Ｓ２，Ｓ３）。タイマ終了時（Ｓ４でＹＥＳ）、カウンタが閾値以上（Ｓ５でＹＥＳ）ならば、閾値が低すぎると判断し変化指令値を"＋１"し、カウンタリセットする（Ｓ６）。カウンタが閾値以上でなく（Ｓ５でＮＯ）且つ０（Ｓ７でＹＥＳ）ならば、閾値が高すぎると判断し変化指令値を−１する（Ｓ８）。どちらでもない場合には変化指令値は現状のままとする。タイマをリセットして再起動するとともに、変化指令値と変化幅とを乗算し閾値に加算して、比較部で利用する（Ｓ９）。 When the dynamic control setting is turned on (YES in S1), the change command value (a value that holds how many threshold values are changed) is reset, and the timer is started. While the timer is running, the start command is counted (S2, S3). When the timer ends (YES in S4), if the counter is equal to or greater than the threshold (YES in S5), it is determined that the threshold is too low, the change command value is incremented by "+1", and the counter is reset (S6). If the counter is not equal to or greater than the threshold (NO in S5) and 0 (YES in S7), it is determined that the threshold is too high and the change command value is decremented by 1 (S8). If it is neither, the change command value is left as it is. The timer is reset and restarted, and the change command value and the change width are multiplied and added to the threshold value and used in the comparison unit (S9).

図１１Ｂは、図６のＳ５に対応する動的閾値制御動作の手順を示すフローチャートである。図１１Ｂは、停止閾値の制御の手順を示す。 FIG. 11B is a flowchart showing the procedure of the dynamic threshold control operation corresponding to S5 of FIG. FIG. 11B shows a procedure for controlling the stop threshold.

タイマが動いている間停止指令をカウントする（Ｓ１，Ｓ２）。タイマ終了時（Ｓ３でＹＥＳ）、カウンタが閾値以上（Ｓ４でＹＥＳ）ならば、閾値が高すぎると判断し変化指令値を−１し、カウンタリセットする（Ｓ５）。カウンタが閾値以上でなく（Ｓ４でＮＯ）且つ０（Ｓ６でＹＥＳ）ならば閾値が低すぎると判断し、変化指令値を＋１する（Ｓ７）。どちらでもない場合には変化指令値は現状のままとする。タイマをリセットして再起動するとともに、変化指令値と変化幅とを乗算し閾値に加算して、比較部で利用する（Ｓ８）。 While the timer is running, the stop command is counted (S1, S2). When the timer ends (YES in S3), if the counter is equal to or greater than the threshold (YES in S4), it is determined that the threshold is too high, the change command value is decremented by 1, and the counter is reset (S5). If the counter is not equal to or greater than the threshold (NO in S4) and 0 (YES in S6), it is determined that the threshold is too low, and the change command value is incremented by 1 (S7). If it is neither, the change command value is left as it is. The timer is reset and restarted, and the change command value and the change width are multiplied and added to the threshold value and used in the comparison unit (S8).

図１２は、キャッシュ状態測定部２７の構成の一例を示す図である。 FIG. 12 is a diagram illustrating an example of the configuration of the cache state measurement unit 27.

起動ウェイ情報を基に、停止ウェイの有効データフラグはマスクし且つ起動ウェイの有効データフラグはそのままにして、ライン毎に複数のウェイの有効データフラグのＡＮＤをとり、その結果を埋まり具合表５１に格納する。使用率表５２においては、キャッシュアクセス毎に選択したラインナンバーをＦＩＦＯ５３にためておき、入力ナンバーＡと出力ナンバーＢとを比較して、ライン毎に両方にあてはまらなければ使用率を±０、入力に当てはまれば＋１、出力に当てはまれば−１、両方にあてはまれば±０として更新した値を保持する。ウェイの使用率は、ウェイ毎にＦＩＦＯ５４をもち、キャッシュアクセス毎に自分のウェイへのアクセスなら１、さもなければ０を入力し、ＦＩＦＯ５４の総和を使用数として保持する。 Based on the activation way information, the valid data flag of the stop way is masked and the valid data flag of the activation way is left as it is, and the AND of the valid data flags of a plurality of ways is taken for each line, and the result is filled up. To store. In the usage rate table 52, the line number selected for each cache access is stored in the FIFO 53, the input number A and the output number B are compared, and if the situation does not apply to both, the usage rate is ± 0. The updated value is held as +1 if applicable to -1, -1 if applicable to output, or ± 0 if applicable to both. The way usage rate includes a FIFO 54 for each way, and 1 is input for access to its own way for each cache access, and 0 is stored otherwise, and the total of the FIFOs 54 is held as the number of uses.

図１３は、ミス率測定部２５の構成の一例を示す図である。キャッシュアクセスがヒットだった場合は０、ミスかつミスしたラインの埋まり具合が１の場合には係数Ｃ、ミスかつミスしたラインの埋まり具合が０の場合には１を、ＦＩＦＯ５５に挿入する。ＦＩＦＯ５５の各レジスタの値の総和を測定値（ミス率）とする。 FIG. 13 is a diagram illustrating an example of the configuration of the miss rate measurement unit 25. When the cache access is hit, 0 is inserted into the FIFO 55, and when the missed and missed line is filled with a coefficient C, the coefficient C is inserted. When the missed and missed line is filled with 0, 1 is inserted into the FIFO 55. The total sum of the values of each register of the FIFO 55 is taken as a measurement value (miss rate).

なおミス率測定の係数Ｃを1に設定すること、或いはＣ又は１の選択機構自体を削除することにより、埋まり具合を考慮しない単純なヒット／ミスの割合を判定基準（ミス率）として使用するようにしてもよい。この場合でも、本願の停止制御方式を用いることにより、従来とは異なるキャッシュ制御が可能となる。性能（ヒット率）をある値以下に落としたくない場合に、ミス率測定の閾値を変更して使用するのが効果的である。 In addition, by setting the coefficient C of the miss rate measurement to 1 or deleting the selection mechanism itself of C or 1, a simple hit / miss rate that does not consider the degree of filling is used as a criterion (miss rate). You may do it. Even in this case, the cache control different from the conventional one can be performed by using the stop control method of the present application. When it is not desired to drop the performance (hit rate) below a certain value, it is effective to change the threshold value for measuring the miss rate.

図１４は、使用率測定部２６の構成の一例を示す図である。複数の掛け算器５６と加算器５７により、使用率表からライン毎の埋まり具合と使用率との積和を求め測定値（総合使用率）として出力する。 FIG. 14 is a diagram illustrating an example of the configuration of the usage rate measurement unit 26. A plurality of multipliers 56 and adders 57 obtain a product sum of the degree of filling for each line and the usage rate from the usage rate table and output it as a measured value (total usage rate).

図１５は、比較部２４の構成の一例を示す図である。比較部２４は、加算器６１及び６２と、比較回路６３及び６４を含む。これら加算器及び比較回路により、起動及び停止のそれぞれについて測定値と閾値との比較を行う。閾値を超えたら起動又は停止指示をだす。閾値は動的閾値制御部２２からの閾値の変更値をレジスタ設定値に加算した値とする。動的閾値制御部２２が設けられていない場合には、レジスタ設定値をそのまま閾値としてよい。 FIG. 15 is a diagram illustrating an example of the configuration of the comparison unit 24. The comparison unit 24 includes adders 61 and 62 and comparison circuits 63 and 64. The adder and the comparison circuit compare the measured value and the threshold value for each of starting and stopping. When the threshold is exceeded, a start or stop instruction is issued. The threshold value is a value obtained by adding the change value of the threshold value from the dynamic threshold value control unit 22 to the register setting value. When the dynamic threshold value control unit 22 is not provided, the register setting value may be used as the threshold value as it is.

図１６は、パワー制御部２１の構成の一例を示す図である。パワー制御部２１は、指示検出回路７１、稼働状態レジスタ７２、複数のアンド回路７３を含む。指示検出回路７１は、ウェイ数制御部２３から供給される起動／停止指示信号を検出して、取り込みトリガ信号を出力する。この取り込みトリガ信号に応答して、稼働状態レジスタ７２は、ウェイ数制御部２３から供給される更新後のウェイ稼動状態情報を取り込む。これにより内部のウェイ稼動状態情報を更新する。この更新結果によりＡＮＤ回路７３を制御して、各電源ラインをマスクすることにより、各ウェイの電源をオン／オフ制御する。 FIG. 16 is a diagram illustrating an example of the configuration of the power control unit 21. The power control unit 21 includes an instruction detection circuit 71, an operation state register 72, and a plurality of AND circuits 73. The instruction detection circuit 71 detects a start / stop instruction signal supplied from the way number control unit 23 and outputs a capture trigger signal. In response to this capture trigger signal, the operation state register 72 captures the updated way operation state information supplied from the way number control unit 23. As a result, the internal way operation state information is updated. By controlling the AND circuit 73 according to the update result and masking each power supply line, the power supply of each way is controlled on / off.

図１７は、本発明によるキャッシュ装置の変形例を示すブロック図である。図１７において、図１と同一の構成要素は同一の番号で参照し、その説明は省略する。 FIG. 17 is a block diagram showing a modification of the cache device according to the present invention. In FIG. 17, the same components as those in FIG. 1 are referred to by the same numerals, and a description thereof will be omitted.

図１７のキャッシュ装置は、図１のキャッシュ装置と比較して、コントローラ１４の代わりにコントローラ１４Ａが設けられていることが異なる。コントローラ１４Ａは、パワー制御部２１、ウェイ数制御部２３Ａ、キャッシュ状態測定部２７Ａ、及び設定レジスタ２８Ａを含む。 The cache device of FIG. 17 differs from the cache device of FIG. 1 in that a controller 14A is provided instead of the controller 14. The controller 14A includes a power control unit 21, a way number control unit 23A, a cache state measurement unit 27A, and a setting register 28A.

この構成では、埋まり具合をライン毎ではなく、起動中のウェイ全体でみる。即ち、起動中のウェイ全体でのデータ有効フラグの総数を埋まり具合とする。またキャッシュアクセス単位ではなく、コアのクロック単位で動くＦＩＦＯを設け、キャッシュアクセス有りの場合には１を、無しの場合には０をＦＩＦＯに入力するようにする。このＦＩＦＯの格納データの合計（１の数）を使用率とする。このような構成とすることにより、保持用のレジスタ、計算用の回路の構造が単純かつ小さくなる。 In this configuration, the degree of filling is seen not for each line but for the entire active way. That is, the total number of data valid flags in the entire active way is filled. Also, a FIFO that operates in units of core clocks, not cache access units, is provided so that 1 is input to the FIFO when there is cache access and 0 is input when there is no cache access. The total (number of 1) of the data stored in the FIFO is used. With such a configuration, the structure of the holding register and the calculation circuit is simple and small.

図１８は、キャッシュ状態測定部２７Ａの構成の一例を示す図である。図１８において、図１２と同一の構成要素は同一の番号で参照し、その説明は省略する。 FIG. 18 is a diagram illustrating an example of the configuration of the cache state measurement unit 27A. In FIG. 18, the same elements as those of FIG. 12 are referred to by the same numerals, and a description thereof will be omitted.

図１８の構成の場合、起動ウェイ情報を基に、停止ウェイの有効データフラグはマスクし且つ起動ウェイの有効データフラグはそのままにして、全ての有効データフラグの和を求め、その結果を使用率表８１に埋まり具合として格納する。この場合、ライン単位の埋まり具合がわからないので、ミス率に係数Ｃをかけることはできなくなる。ミス率はキャッシュのミス率をそのまま流用する。ＦＩＦＯ８２によりキャッシュの使用率を求め、その結果を使用率表８１に格納する。 In the case of the configuration of FIG. 18, based on the activation way information, the effective data flag of the stop way is masked and the effective data flag of the activation way is left as it is, and the sum of all the effective data flags is obtained, and the result is used. It is stored in Table 81 as the degree of filling. In this case, since the degree of filling in line units is not known, the coefficient C cannot be applied to the miss rate. Miss rate uses cache miss rate as it is. The usage rate of the cache is obtained by the FIFO 82, and the result is stored in the usage rate table 81.

図１９は、埋まり具合、使用率、ミス率の値と制御動作との対応関係を示す表である。図１９の表に基づいて、動作の列の動作（ウェイ起動、そのまま、停止）を実行すればよい。具体的には、ミス率が低く、埋まり具合及び使用率の少なくとも一方が低い場合には、ウェイ停止と判断をする。また、ミス率が高く、埋まり具合及び使用率の少なくとも一方が高い場合には、ウェイ起動と判断する。それ以外の場合には、ウェイの起動停止を行わない。 FIG. 19 is a table showing a correspondence relationship between the value of the degree of filling, the usage rate, the error rate, and the control operation. Based on the table of FIG. 19, the operations in the column of operations (way activation, as it is, as it is stopped) may be executed. Specifically, when the error rate is low and at least one of the degree of filling and the usage rate is low, it is determined that the way is stopped. Further, when the error rate is high and at least one of the degree of filling and the usage rate is high, it is determined that the way is activated. In other cases, the way is not started or stopped.

図２０は、ウェイ数制御部２３Ａの構成の一例を示す図である。ウェイ数制御部２３Ａは、比較器８５乃至８７、判定部８８、及びウェイ起動状態表８９を含む。比較器８５乃至８７により、それぞれ埋まり具合、使用率、及びミス率とレジスタ設定の閾値との比較を行う。この比較結果と、ウェイ起動状態表８９が示す現在のウェイ起動状態とに応じて、判定部８８が判定を行い、起動指示、停止指示、及びウェイ稼働状況を出力する。判定条件は、図２０において判定部８８に記載の条件であり、図１９の表に対応する。 FIG. 20 is a diagram illustrating an example of the configuration of the way number control unit 23A. The way number control unit 23A includes comparators 85 to 87, a determination unit 88, and a way activation state table 89. The comparators 85 to 87 respectively compare the filling level, the usage rate, and the miss rate with the register setting threshold values. The determination unit 88 makes a determination according to the comparison result and the current way activation state indicated by the way activation state table 89, and outputs an activation instruction, a stop instruction, and a way operation status. The determination conditions are the conditions described in the determination unit 88 in FIG. 20, and correspond to the table in FIG.

図２１は、本発明によるキャッシュ装置の更なる変形例を示すブロック図である。図２１において、図１及び図１７と同一の構成要素は同一の番号で参照し、その説明は省略する。 FIG. 21 is a block diagram showing a further modification of the cache device according to the present invention. In FIG. 21, the same components as those in FIGS. 1 and 17 are referred to by the same numerals, and a description thereof will be omitted.

図２１のキャッシュ装置は、図１７のキャッシュ装置と比較して、コントローラ１４の代わりにコントローラ１４Ｂが設けられていることが異なる。コントローラ１４Ｂは、パワー制御部２１、動的閾値制御部２２、ウェイ数制御部２３Ｂ、キャッシュ状態測定部２７Ａ、及び設定レジスタ２８Ａを含む。動的閾値制御部２２を設けることにより、図１の構成の場合と同様な動的閾値制御を可能としている。 The cache device of FIG. 21 differs from the cache device of FIG. 17 in that a controller 14B is provided instead of the controller 14. The controller 14B includes a power control unit 21, a dynamic threshold control unit 22, a way number control unit 23B, a cache state measurement unit 27A, and a setting register 28A. By providing the dynamic threshold control unit 22, the same dynamic threshold control as in the configuration of FIG. 1 can be performed.

図２２は、ウェイ数制御部２３Ｂの構成の一例を示す図である。図２２において、図２０と同一の構成要素は同一の番号で参照し、その説明は省略する。 FIG. 22 is a diagram illustrating an example of the configuration of the way number control unit 23B. In FIG. 22, the same components as those of FIG. 20 are referred to by the same numerals, and a description thereof will be omitted.

ウェイ数制御部２３Ｂは、図２０の構成に加え、更に加算器９１乃至９３を含む。加算器９１乃至９３は、動的閾値制御部２２からの閾値上昇値をレジスタ設定の閾値に加算して、動的に変化する閾値を生成する。この動的に変化する閾値が、比較器８５乃至８７における比較動作に使用される。 The way number control unit 23B further includes adders 91 to 93 in addition to the configuration of FIG. The adders 91 to 93 add the threshold increase value from the dynamic threshold control unit 22 to the register setting threshold, and generate a dynamically changing threshold. This dynamically changing threshold is used for the comparison operation in the comparators 85-87.

図２３は、図１７に示す構成におけるキャッシュ起動停止制御の流れを示すフローチャートである。このフローチャートに示す処理はキャッシュアクセス毎に行われる。 FIG. 23 is a flowchart showing the flow of cache start / stop control in the configuration shown in FIG. The processing shown in this flowchart is performed for each cache access.

測定値更新フェーズ（Ｓ１）では、各測定値の計算を行う。図１７のキャッシュ状態測定部２７Ａの動作に相当する。 In the measurement value update phase (S1), each measurement value is calculated. This corresponds to the operation of the cache state measuring unit 27A of FIG.

条件測定フェーズ（Ｓ２）では、起動及び停止の条件の有無を判定する。起動の判定及び停止の判定は、条件判定フェーズに集約して簡略化できる。起動停止フェーズ（Ｓ３）では、ウェイの起動停止判定を行う。図１７のウェイ数制御部２３Ａの動作に相当する。 In the condition measurement phase (S2), it is determined whether or not there are start and stop conditions. The determination of activation and the determination of stop can be simplified by integrating them into the condition determination phase. In the start / stop phase (S3), the start / stop determination of the way is performed. This corresponds to the operation of the way number control unit 23A in FIG.

パワー制御起動（Ｓ４）及びパワー制御停止（Ｓ５）では、それぞれウェイの起動及び最も使用頻度の低いウェイの停止の動作を行う。図１７のパワー制御部２１の動作に相当する。その後、処理を終了し次のキャッシュアクセスを待つ。 In the power control start (S4) and the power control stop (S5), the way activation and the least frequently used way stop operation are performed, respectively. This corresponds to the operation of the power control unit 21 in FIG. Thereafter, the process is terminated and the next cache access is awaited.

図２４は、図２３のＳ１に対応する各測定値の計算の手順を示すフローチャートである。埋まり具合は、キャッシュの有効ウェイの有効データフラグの総和を求めることで計算する（Ｓ１）。使用率は、最も古いキャッシュ使用情報を削除し、キャッシュを使用したら＋１、使用しなかったら＋０とすることにより更新して求める（Ｓ２）。ウェイ使用数更新は、図７の場合と同様である。なおこれらのステップの処理は順不同でよい。 FIG. 24 is a flowchart showing the procedure for calculating each measured value corresponding to S1 in FIG. The degree of filling is calculated by obtaining the sum of the valid data flags of the cache valid ways (S1). The usage rate is obtained by deleting the oldest cache usage information and updating it by setting it to +1 if the cache is used and +0 if it is not used (S2). The way usage number update is the same as in FIG. Note that these steps may be performed in any order.

図２５は、図２３のＳ２に対応する条件判定処理の手順を示すフローチャートである。まずミス率の閾値判定を行う（Ｓ１）。ミス率が閾値以上の場合（Ｓ１でＹＥＳ）、さらに使用率の閾値判定（Ｓ２）、埋まり具合の閾値判定（Ｓ３）を行い、何れかの判定で閾値以上と判明した場合には起動と判定する（Ｓ４，Ｓ５）。ミス率が閾値以上でなく（Ｓ１でＮＯ）且つ閾値以下である場合（Ｓ６でＹＥＳ）、使用率の閾値判定（Ｓ７）、埋まり具合の閾値判定（Ｓ８）を行い、何れかの判定で閾値以下と判明した場合には停止と判定する（Ｓ９，Ｓ１０）。それ以外の場合には起動停止は行わない。なお使用率、埋まり具合の判断の順番は入れ替え可能である。 FIG. 25 is a flowchart showing the procedure of the condition determination process corresponding to S2 in FIG. First, a threshold determination of a miss rate is performed (S1). If the error rate is greater than or equal to the threshold (YES in S1), the threshold determination for usage rate (S2) and the threshold determination for the degree of filling (S3) are performed. (S4, S5). When the miss rate is not equal to or greater than the threshold (NO in S1) and is equal to or less than the threshold (YES in S6), the threshold determination for the usage rate (S7) and the threshold determination for the degree of filling (S8) are performed. If it is determined that the following is true, it is determined to be stopped (S9, S10). In other cases, start and stop are not performed. Note that the order of determination of the usage rate and the degree of filling can be changed.

図２６は、図２１に示す構成におけるキャッシュ起動停止制御の流れを示すフローチャートである。このフローチャートに示す処理はキャッシュアクセス毎に行われる。図２６においては、図２３のフローチャートに対して動的閾値制御フェーズ（ＳＡ）が追加されたことのみが異なる。他の処理は、図２３と同様である。また動的閾値制御フェーズ（ＳＡ）の処理は、図１１Ａ及び図１１Ｂに示す手順でよい。 FIG. 26 is a flowchart showing the flow of cache start / stop control in the configuration shown in FIG. The processing shown in this flowchart is performed for each cache access. 26 differs from the flowchart of FIG. 23 only in that a dynamic threshold control phase (SA) is added. Other processes are the same as those in FIG. The dynamic threshold control phase (SA) may be processed according to the procedure shown in FIGS. 11A and 11B.

なお上記変形例の説明において、図１９の表に示されるキャッシュの状態と動作の関係を、ユーザーが設定変更できるようにしてもよい。即ち、図１９の表の動作の列の内容を例えばレジスタ設定により可変な構成としてよい。このようなレジスタ設定を用いた構成により、例えば図１９の状態１に対して動作が「停止」と定義されているのを、「起動」或いは「そのまま」と変更することが可能である。図２０又は図２２に示す判定部８８で、判定結果をレジスタ設定に応じて実行すればよい。 In the description of the above modification, the user may be able to change the setting of the relationship between the cache state and the operation shown in the table of FIG. That is, the contents of the operation column in the table of FIG. 19 may be variable by, for example, register setting. With the configuration using such register setting, for example, the operation defined as “stop” for the state 1 in FIG. 19 can be changed to “start” or “as it is”. The determination unit 88 shown in FIG. 20 or FIG. 22 may execute the determination result according to the register setting.

図２７は、動作内容をレジスタ設定可能とした場合の条件判定フェーズのフローチャートを示す。埋まり具合、使用率、及びミス率に関して閾値判定を行い各閾値との比較を行う（Ｓ１）。比較結果に応じてレジスタの設定内容を読み出す（Ｓ２）。高低は閾値以上及び以下を示し、中は閾値以上でも以下でもない場合を示す。レジスタから読み出した結果に応じて「起動」、「停止」、又は「そのまま」に分岐する（Ｓ３）。 FIG. 27 shows a flowchart of the condition determination phase when the operation contents can be set in the register. Threshold determination is performed with respect to the degree of filling, usage rate, and miss rate, and comparison with each threshold value is performed (S1). The setting contents of the register are read according to the comparison result (S2). High and low indicate values above and below the threshold value, and medium indicates a value that is not above and below the threshold value. The process branches to “start”, “stop”, or “as is” according to the result read from the register (S3).

以下に、本発明の少なくとも１つの実施例の効果について説明する。 The effects of at least one embodiment of the present invention will be described below.

（式1）で示した値は、キャッシュの利用効率を随時測定している式である。図３の説明で示したとおり、この式の値が低い場合、埋まっているラインの使用率が低く、ラインの空きが大きい、つまりつめられる上体をあらわしている。 The value shown in (Expression 1) is an expression that measures the cache usage efficiency as needed. As shown in the explanation of FIG. 3, when the value of this equation is low, the use rate of the buried line is low, and the vacancy of the line is large, that is, the upper body to be pinched is shown.

例えばＪＰＥＧなどの画像処理の場合、図２８のように特定の処理を繰り返し行い、終了したらアイドル状態となる動作を繰り返す。このとき、命令コード領域をみると、エンコード処理中は特定の処理を繰り返すためメモリ空間上の特定の領域が繰り返し使われる。そこでキャッシュは使用率が高く、埋まり具合が高く、ヒット率の高い状態となる。アイドル処理中は、通常自分への分岐といった極簡単な処理が繰り返しおこなわれるため特定のメモリ領域だけ使用される。そのためキャッシュの使用率は高く、埋まり具合は低く、ヒット率が高い状態となる。図２９にこの場合の本発明と従来技術（特許文献１）との効果の差を示す。 For example, in the case of image processing such as JPEG, specific processing is repeatedly performed as shown in FIG. At this time, when looking at the instruction code area, a specific area in the memory space is repeatedly used because the specific process is repeated during the encoding process. Therefore, the cache has a high usage rate, a high degree of filling, and a high hit rate. During idle processing, usually simple processing such as branching to itself is repeated, so that only a specific memory area is used. Therefore, the usage rate of the cache is high, the degree of filling is low, and the hit rate is high. FIG. 29 shows the difference in effect between the present invention and the prior art (Patent Document 1) in this case.

従来技術の場合、ヒット率のみに依存するためヒット率が高い状態がつづくとウェイを停止させる。そのためキャッシュ容量が不足しミス率が高くなる。そのため今度は起動の判定条件を満たし起動する。すると再びヒット率の高い状態に戻る。これを繰り返すため処理時間が遅くなる。アイドル状態に入るとウェイと停止させる。キャッシュの制御なしの場合と比べて消費電力は下がるがスループットも落ちてしまう。 In the case of the prior art, the way is stopped when a state with a high hit rate continues because it depends only on the hit rate. Therefore, the cache capacity is insufficient and the miss rate is increased. For this reason, this time the system satisfies the start determination condition and starts up. Then, it returns to a state with a high hit rate again. Since this is repeated, the processing time is delayed. When entering the idle state, the way is stopped. Compared to the case without cache control, power consumption is reduced, but throughput is also reduced.

それに対し本発明の場合は、ライン毎の埋まり具合も判断基準に加えるためヒット率の高い状態が続くこの場合でもウェイの停止判断をしない。そのため性能はキャッシュの制御の無い場合と同様に維持される。アイドル状態では使用率が下がるためウェイを停止させ、従来技術と同様の省電力効果が期待できる。つまりキャッシュの制御無しの場合と比べてスループットを落とさず省電力化を行うことができる。 On the other hand, in the case of the present invention, since the degree of burying for each line is also added to the judgment criterion, the way stop judgment is not made even in this case where the state of high hit rate continues. Therefore, the performance is maintained in the same manner as when there is no cache control. Since the usage rate is lowered in the idle state, the way is stopped, and the same power saving effect as in the prior art can be expected. That is, power can be saved without reducing the throughput as compared with the case without cache control.

図３０でミス率と起動判定の時間変化を本発明と従来技術で比べた。本発明では係数Ｃを掛けているため測定したミス率の上昇が早く、そのため起動判定を早く行うことができる。 In FIG. 30, the error rate and the time change of activation determination are compared between the present invention and the prior art. In the present invention, since the coefficient C is multiplied, the measured error rate increases rapidly, so that the activation determination can be performed quickly.

さらに図３１で定量的に比較結果をまとめた。起動が早くなる結果、平均性能は本発明の方が高くなる。また、図中の仮定より、従来技術に比べ平均Ｃ倍、検出が早くなる。 Furthermore, the comparison results are summarized quantitatively in FIG. As a result of faster start-up, the average performance is higher with the present invention. In addition, the detection is faster by an average C times than the conventional technique.

キャシュは複数のキャッシュラインを含み、主記憶からキャシュへの情報のコピーはキャシュライン単位で実行される。主記憶のメモリ空間はキャッシュライン単位で分割され、分割されたメモリ領域を順番にキャッシュラインに割当てておく。キャッシュの容量は主記憶の容量よりも小さいので、主記憶のメモリ領域を繰り返して同一のキャッシュラインに割当てることになる。 The cache includes a plurality of cache lines, and copying of information from the main memory to the cache is executed in units of cache lines. The memory space of the main memory is divided in units of cache lines, and the divided memory areas are sequentially assigned to the cache lines. Since the capacity of the cache is smaller than the capacity of the main memory, the memory area of the main memory is repeatedly assigned to the same cache line.

キャッシュはアクセス履歴を格納するためアドレスをデコードする。その際、通常アドレスのすべてのビットをデコードはしない。キャッシュ内部をライン単位でわけてアドレスの一部はそのままラインナンバーに割り当てる。このためライン毎に使えるアドレスは周期的に決まっている。そこで、同じラインを使用する場所の利用頻度が高ければウェイを起動した場合ライン数が増えヒット率が上昇（性能向上）する。図２で埋まり具合が１のときに係数Ｃをかけるとは、同じラインをよく使ってミスした場合を強調するためのものである。したがって起動検出効率の向上が期待できる。 The cache decodes the address to store the access history. At that time, all bits of the normal address are not decoded. The cache is divided into lines, and a part of the address is assigned to the line number as it is. For this reason, the addresses that can be used for each line are periodically determined. Therefore, if the use frequency of the place where the same line is used is high, the number of lines increases and the hit rate increases (performance improvement) when the way is activated. Applying the coefficient C when the filling level is 1 in FIG. 2 is for emphasizing a case where a mistake is made by frequently using the same line. Therefore, improvement in startup detection efficiency can be expected.

以上、本発明を実施例に基づいて説明したが、本発明は上記実施例に限定されるものではなく、特許請求の範囲に記載の範囲内で様々な変形が可能である。 As mentioned above, although this invention was demonstrated based on the Example, this invention is not limited to the said Example, A various deformation | transformation is possible within the range as described in a claim.

本発明によるキャッシュ装置の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of the cache apparatus by this invention. 使用率、埋まり具合、ミス率を説明するための図である。It is a figure for demonstrating a utilization rate, a filling condition, and a mistake rate. 使用率表に基づくウェイ停止について説明するための図である。It is a figure for demonstrating the way stop based on a utilization rate table. ウェイ数制御部の構成の一例を示す図である。It is a figure which shows an example of a structure of a way number control part. 動的閾値制御部の構成の一例を示す図である。It is a figure which shows an example of a structure of a dynamic threshold value control part. 本発明によるキャッシュ起動停止制御の流れを示すフローチャートである。It is a flowchart which shows the flow of the cache starting stop control by this invention. 各測定値の計算の手順を示すフローチャートである。It is a flowchart which shows the procedure of calculation of each measured value. 起動条件判定の手順を示すフローチャートである。It is a flowchart which shows the procedure of starting condition determination. 停止条件判定の手順を示すフローチャートである。It is a flowchart which shows the procedure of stop condition determination. 起動停止動作の手順を示すフローチャートである。It is a flowchart which shows the procedure of start-stop operation | movement. 動的閾値制御動作の手順を示すフローチャートである。It is a flowchart which shows the procedure of dynamic threshold value control operation | movement. 動的閾値制御動作の手順を示すフローチャートである。It is a flowchart which shows the procedure of dynamic threshold value control operation | movement. キャッシュ状態測定部の構成の一例を示す図である。It is a figure which shows an example of a structure of a cache state measurement part. 停止条件判定の手順を示すフローチャートである。It is a flowchart which shows the procedure of stop condition determination. 起動停止動作の手順を示すフローチャートである。It is a flowchart which shows the procedure of start-stop operation | movement. 比較部の構成の一例を示す図である。It is a figure which shows an example of a structure of a comparison part. パワー制御部の構成の一例を示す図である。It is a figure which shows an example of a structure of a power control part. 本発明によるキャッシュ装置の変形例を示すブロック図である。It is a block diagram which shows the modification of the cache apparatus by this invention. キャッシュ状態測定部の構成の一例を示す図である。It is a figure which shows an example of a structure of a cache state measurement part. 埋まり具合、使用率、ミス率の値と制御動作との対応関係を示す表である。It is a table | surface which shows the correspondence of the value of a filling condition, a usage rate, and a mistake rate, and control action. ウェイ数制御部の構成の一例を示す図である。It is a figure which shows an example of a structure of a way number control part. 本発明によるキャッシュ装置の更なる変形例を示すブロック図である。It is a block diagram which shows the further modification of the cache apparatus by this invention. ウェイ数制御部の構成の一例を示す図である。It is a figure which shows an example of a structure of a way number control part. 図１７に示す構成におけるキャッシュ起動停止制御の流れを示すフローチャートである。It is a flowchart which shows the flow of the cache starting stop control in the structure shown in FIG. 各測定値の計算の手順を示すフローチャートである。It is a flowchart which shows the procedure of calculation of each measured value. 条件判定処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of a condition determination process. 図２１に示す構成におけるキャッシュ起動停止制御の流れを示すフローチャートである。It is a flowchart which shows the flow of the cache starting stop control in the structure shown in FIG. 動作内容をレジスタ設定可能とした場合の条件判定フェーズのフローチャートを示す。The flowchart of the condition determination phase at the time of enabling operation setting of a register is shown. 画像処理の場合のキャッシュの使用状態の例を示す図である。It is a figure which shows the example of the use condition of the cache in the case of image processing. 本発明と従来技術との効果の差を説明するための図である。It is a figure for demonstrating the difference of the effect of this invention and a prior art. ミス率と起動判定の時間変化とを本発明と従来技術とで比較した図である。It is the figure which compared the error rate and the time change of starting determination by this invention and the prior art. 本発明の効果を定量的に示す図である。It is a figure which shows the effect of the present invention quantitatively.

Explanation of symbols

１１タグ格納部
１２データレジスタ部
１３判定ユニット
１４コントローラ
１５アドレス変換部
１６セレクタ
２１パワー制御部
２２動的閾値制御部
２３ウェイ数制御部
２４比較部
２５ミス率測定部
２６使用率測定部
２７キャッシュ状態測定部
２８設定レジスタ 11 Tag storage unit 12 Data register unit 13 Determination unit 14 Controller 15 Address conversion unit 16 Selector 21 Power control unit 22 Dynamic threshold control unit 23 Way number control unit 24 Comparison unit 25 Miss rate measurement unit 26 Usage rate measurement unit 27 Cache state Measurement unit 28 setting register

Claims

A cache control circuit for controlling a cache memory including a plurality of ways,
The data occupying ratio occupied by valid data among the plurality of ways of the cache memory, the usage ratio according to the frequency of accessing the cache memory, and the miss according to the ratio of miss of accessing the cache memory A measuring unit for determining the rate,
A cache control circuit comprising: a control unit that controls activation and deactivation of each of the plurality of ways of the cache memory according to the data occupying ratio, the usage rate, and the miss rate.

The control unit controls the way to start when the miss rate is equal to or higher than a predetermined threshold and any of the data occupation ratio and the usage rate is equal to or higher than the predetermined threshold. 2. The cache control circuit according to claim 1, wherein the control is performed so that the way is stopped when either of the data occupation ratio and the usage ratio is equal to or less than a predetermined threshold.

The measurement unit is
The data occupation ratio corresponding to the ratio occupied by valid data among the plurality of ways is obtained for each cache line, and the usage ratio according to the frequency at which the target cache line is accessed in the past predetermined cache accesses For each cache line,
Among the predetermined number of cache accesses in the past, the weight of the frequency of missed cache access to the cache line whose data occupancy ratio indicates that there is free space, and the cache access to the cache line whose data occupancy ratio indicates that there is no space A miss rate measurement unit that calculates the miss rate as a miss frequency by differentiating the weight of the miss frequency,
A usage rate measuring unit for obtaining a total usage rate obtained by summing up the products of the data occupying ratio and the usage rate for each cache line for all the cache lines;
The control unit controls activation and stop of the cache memory for each of the plurality of ways according to a comparison result between the miss rate and a predetermined threshold and a comparison result between the total usage rate and a predetermined threshold. The cache control circuit according to claim 1.

Weight of frequency of cache access of the data occupancy ratio to the cache line indicating the free-free misses is characterized in that the data occupancy ratio is greater than the weight of the frequency of missed cache access to the cache line indicating the free Yes The cache control circuit according to claim 3.

The cache control circuit according to claim 2, further comprising a dynamic threshold value control unit that dynamically controls the predetermined threshold value.