WO1991003785A1 - Apparatus and method for maintaining cache/main memory consistency - Google Patents

Apparatus and method for maintaining cache/main memory consistency Download PDF

Info

Publication number
WO1991003785A1
WO1991003785A1 PCT/US1990/001641 US9001641W WO9103785A1 WO 1991003785 A1 WO1991003785 A1 WO 1991003785A1 US 9001641 W US9001641 W US 9001641W WO 9103785 A1 WO9103785 A1 WO 9103785A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
write
unit
word
memory
Prior art date
Application number
PCT/US1990/001641
Other languages
French (fr)
Inventor
Bhikoo J. Patel
Original Assignee
Wang Laboratories, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wang Laboratories, Inc. filed Critical Wang Laboratories, Inc.
Priority to DE69031658T priority Critical patent/DE69031658T2/en
Priority to EP90907677A priority patent/EP0491697B1/en
Publication of WO1991003785A1 publication Critical patent/WO1991003785A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806Multiuser, multiprocessor or multiprocessing cache systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0804Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with main memory updating

Definitions

  • This invention relates generally to data processing apparatus and method and, in particular, to a Central Processor Unit (CPU) having a write-through cache and a local buffer memory paralleling the cache for buffering write data.
  • the CPU includes circuitry for detecting an occurrence of an externally generated write to a main memory and circuitry for modifying an associated memory command from a word-type of write access to less than a word type of write access to prevent data in the main memory from being overwritten with possibly non-current data from the buffer memory.
  • the invention described herein is particularly useful in a data processing system of the type wherein a memory controller controls a plurality of memory devices organized for storing multi-byte memory words.
  • a memory error detection and correction syndrome bits are generated typically over a number of bytes of the memory word such as, for example, four bytes of a 32-bit memory word.
  • CPU central processor unit
  • the error syndrome bits are generated for the entire word and stored within associated memory devices.
  • the memory controller when the CPU writes less than a full word of data, such as a byte (eight bits) or a half word (16 bits) of data, the memory controller operates to first read the full word of data, merge the byte or bytes, generate the error syndrome bits over the full merged word and write the merged word with the associated syndrome bits back to memory.
  • a full word of data such as a byte (eight bits) or a half word (16 bits) of data
  • the memory controller operates to first read the full word of data, merge the byte or bytes, generate the error syndrome bits over the full merged word and write the merged word with the associated syndrome bits back to memory.
  • this read/modify/write type of access may be a time consuming process.
  • a plurality of CPUs are coupled to a common system bus and through the bus to one or more memory units.
  • Each CPU may have a local cache memory wherein a copy of a portion of a main system memory is maintained.
  • main memory it is a desirable goal that the main memory be updated to accurately reflect changes made to data within the cache memories.
  • such a system may employ a write-through type of cache memory wherein data written to the cache is also written "through" the cache to the main memory.
  • a first-in/first-out (FIFO) memory can be employed in parallel with the cache, the FIFO accepting write data from the CPU and temporarily buffering the data before providing the data over a system bus to the main memory.
  • the FIFO is normally a word width or greater.
  • the data stored within the FIFO reflects the result of a read/modify/write type of access wherein a byte or half word is already merged by the CPU with a cache word.
  • a more efficient cache memory write-through technique writes a full, already merged word from the FIFO to the memory.
  • a dual port memory such as a FIFO buffer
  • a write-through cache wherein for write operations of less than a word in length the write data stored within a FIFO memory device associated with a first bus agent reflects the result of a read/modify/write type of access wherein a byte or half word is merged by a local processor with a cache word.
  • Memory control lines driven to the system bus indicate to a memory controller that a write operation is to be accomplished as a word write, thereby eliminating the additional time required to achieve a read/modify/write memory controller cycle.
  • circuitry for detecting an external write made by another system bus agent to the system memory there is provided circuitry for detecting that the FIFO has data stored within and circuitry is provided for changing the memory command lines to indicate, instead of a word write, less than a word write. This causes the memory controller to operate only upon the portion of data word that was modified by the local processor and to perform a conventional read/modify/write type of cycle to.merge only that portion of the word with a word from main memory.. For example, a byte or a half-word is identified to the memory controller by the least significant bits of an address that is also buffered by the FIFO.
  • Fig. 1 is is a block diagram of a data processing system constructed and operated in accordance with the invention.
  • Fig. 2 is a simplified schematic diagram illustrating circuitry for implementing the invention.
  • DETAILED DESCRIPTION OF THE INVENTION Fig. 1 is an illustrative block diagram of a portion of an illustrative data processing system 1 constructed and operated in accordance with the invention.
  • System 1 includes at least one central processor unit (CPU) 10 shown as CPU1 through CPUn.
  • CPU 10 is of identical construction to others of the CPUs.
  • CPU 10 includes a processor 12, such as a microprocessor device, that is coupled to a local cache memory 14 via a plurality of data lines 12a, address lines 12b and control lines 12c.
  • the cache memory 14 is coupled via a bus 14a to a bus interface unit 16, the bus interface unit 16 providing bidirectional data communication with a system bus (SB) 20. Also associated with the cache 14 but not shown are a plurality of parity lines and various other signals of a type known to those having skill in the art.
  • the bus 14a includes the data, address and control lines 12a-12c. Coupled in parallel with the cache 14 is a dual port memory device such as a FIFO buffer 18.
  • FIFO 18 functions to briefly buffer write data intended for updating a main memory 24 that is coupled to the system bus 20 through a memory controller 22.
  • the FIFO 14 operates to receive and store write data from the processor 12 before the data is provided to SB 20.
  • the FIFO 18 permits the processor 12 to write the data and continue operation without having to synchronize its operation with the typically slower SB 20.
  • the FIFO 18 is provided with sufficient memory capacity to store at least one and typically up to four words or double words of data although more or less than this typical number may be readily provided.
  • a plurality of memory units may be provided, depending on the memory density of an individual one of the units 24 and the desired total memory capacity of the system 1.
  • the memory controller 22 includes Error Correction Circuitry (ECC) 22a.
  • ECC 22a operates to generate and test syndrome bits upon a word of memory data at a time.
  • a word of memory data is considered to be four bytes, or 32 bits, in width.
  • the memory controller 22, memory units 24, SB 20, FIFO 18 and cache 14, for example, may be operable for simultaneously conveying, reading and/or writing multiple words of data, such as a double word (64 bits) or a quad word (128 bits) .
  • the data stored within the FIFO 18 reflects the result of cache write hit wherein a read/modify/write type of access is performed by processor 12 to merge a byte or half word with a cache word.
  • certain of the CONT 12c lines indicate to the memory controller 22 that the write is to be accomplished as a word write, thereby eliminating the additional time required to achieve the read/modify/write memory controller cycle.
  • the aforementioned problem occurs when, for example, the CPUn writes to the memory 24 during an interval of time that the word of data is temporarily buffered within the FIFO 18. In this case the word in main memory that is the target of the FIFO 18 write may have just been changed by CPUn.
  • this problem is circumvented by detecting within the CPU1 the write made to memory 24 by CPUn, or any other bus agent, and changing the CONT 12c memory command lines to indicate, instead of a word write, a byte write or a half word write operation.
  • This causes the memory controller 22 to operate only upon the byte or half word of data that is being written and to perform a conventional read/modify/write type of cycle to merge the byte or half word with a word from main memory.
  • the byte or half word is identified by the least significant bits of the address that is also buffered by the FIFO 18.
  • each of the CPUs 10 is provided circuitry to detect the occurrence of such an external write to memory 24 and circuitry to prevent the FIFO 18 from writing a full word of data to the memory 24 after such an external write occurs.
  • the FIFO 18 is typically comprised of a plurality of individual FIFO devices for buffering the 32 data lines 12a, an associated 32 bit address line 12b and associated control lines 12c.
  • the control lines identify the type of memory access associated with the data and address. That is, the control lines indicate whether the processor 12 performed a byte write, a half-word write or a word write upon the associated word of data stored within the FIFO 18.
  • the bus 18a includes buffered data (BDATA) , buffered address (BADDR) and buffered control lines (BCONT) from the output of the FIFO 18.
  • a plurality of system bus 20 drivers are associated with the output 18a of the FIFO 18. In Fig. 2 only those drivers 26 associated with a portion of the BCONT lines are shown. Buffers 26c-26f provide a four bit memory control signal (MC0*-MC3*) to the SB 20, the asterisk following the signal name indicating in a conventional manner that the signal is asserted when a logic zero or low. The MCn* control bits are coded to identify to the memory controller 22 the type of memory access in the following manner.
  • NAND buffers 26c through 26f are enabled to drive their associated memory control bits onto the SB 20 when a controlling system access enable (SAEN) signal is asserted by circuitry (not shown) on the CPU 10.
  • SAEN system access enable
  • buffers 26c and 26d Associated with buffers 26c and 26d are two additional NAND buffers 26a and 26b having their outputs wire-ored to the output of buffers 26c and 26d, respectively.
  • the buffers 26 are open collector type devices wherein such wire-ored connections are readily accomplished.
  • buffers 26a and 26b The purpose of buffers 26a and 26b is to normally override the byte write or half-word write control signal indications, MC2* and MC3* high or MC2* high and MC3* low, respectively, in order to make the control signal indication a word write indication with MC2* and MC3* both low.
  • Buffers 26a and 26b each have as an input signal an output of a gate 28 which has as an input signal a "force word write" (FWW) signal.
  • FWW is one of the buffered control lines that is output from the FIFO 18, FWW being asserted high as a result of a CACHE WRITE HIT signal that is asserted when a processor 12 write hits the cache.
  • the state of the CACHE WRITE HIT signal if stored within the FIFO 18 along with the associated word of data that is written to the cache 14 and to the FIFO 18 by the processor 12.
  • the FWW signal when asserted forces, in conjunction with a normally deasserted UNFWW* signal (to be described) , a word write command to the memory controller 22. This causes memory controller 22 to write a word of data, thereby eliminating the normally required read/modify/write type of access that is employed when writing a byte or a half word of data to the memory 24.
  • the forced word write is undesirable in that the FIFO 18 data may overwrite newly written data within the memory 24.
  • the CPU 10 further includes additional circuitry to detect such an external write and to convert the word write type of access back to a byte write or a half-word write type of access.
  • CPU 10 includes circuitry, such as a comparator 29, for detecting when another agent coupled to the SB 20 performs a write to the memory 24.
  • This other bus agent could be, by. example, another CPU in a multi-CPU system or could be I/O interface circuitry operable for writing data to the memory 24 from an I/O device such as a disk or communication port.
  • the occurrence of the write access by another agent also results in an interrogation of the cache 14 and, if necessary, an invalidation of cache 14. Circuitry (not shown) to perform these functions is well known in the art.
  • the occurrence of such a write access over SB 20 furthermore is detected by comparator 29 and generates a "system external write pending" (SXWP) signal.
  • SXWP system external write pending
  • This signal is input to a JK flip/flop (F/F) 30 where the assertion of this signal causes F/F 30 to set on the edge of a next clock (CLK) input.
  • the Q output of F/F 30 is a signal designated "external write pending" (XWP) that, when asserted, indicates that an externally generated write has occurred on the SB 20.
  • XWP external write pending
  • CLRXWP clear external write pending
  • CLRXWP is generated to indicate that the cache 14 interrogation and, if required, the cache 14 invalidation has occurred in response to the externally generated write access of memory 24.
  • the XWP signal output from F/F 30 is provided to AND gate 34 in conjunction with an EMPTY* flag from the FIFO 18.
  • the EMPTY* signal is low forcing a low from AND gate 34.
  • the EMPTY* signal is deasserted or high.
  • EMPTY* being high in conjunction with XWP being high, XWP indicating an occurrence of an external system bus write to memory 24, causes AND gate 34 to have a high output.
  • the high output from AND gate 34 is applied to the J input of a JK F/F 36 and, in conjunction with an edge of CLK, causes the Q* output to go low.
  • the Q* output of F/F 36 is a signal designated "undo forced word write” (UNFWW*) .
  • the UNFWW* signal is asserted low only when an external system bus write is detected and is pending cache interrogation (XWP is high) and the FIFO 18 contains at least one buffered write (EMPTY* is high) .
  • UNFWW* being asserted, that is being low, causes the output of AND gate 28 to be low, further causing the outputs of NAND gates 26a and 26b to be high. This in turn overrides the forced word write due to the assertion of FWW and causes the states of buffered memory control signals (BMC2 and BMC3) to be reflected in the outputs of the memory control line drivers 26d and 26c, respectively.
  • BMC2 and BMC3 buffered memory control signals
  • the processor 12 had accomplished a byte write or a half-word write in generating the word of data within the FIFO 18 the MC0*-MC3* lines indicate same to the memory controller 22.
  • the occurrence of the byte write or half-word write memory control signal is interpreted by the memory controller 22, in conjunction with the address bus, as a read/modify/write cycle to the addressed word of memory 24.
  • the byte or half-word identified by the least significant bits of the address bus is merged with the data word that is read from the main memory, the ECC syndrome bits are generated, and the word is written back to the memory. In this manner any other data within the word that may have been recently changed by another bus agent is not overwritten by the full word of data within the FIFO 18.
  • the SXWP signal is generated for any write access that is detected on SB 20. It can be appreciated however that additional logic can be employed to decode the system address bus in conjunction with the occurrence of the write to generate the SXWP signal only on the occurrence of a write to, for example, a same page of memory 24 as that contained within the cache 14. Likewise, with further decoding of the SB 20 address bus the SXWP signal can be generated only when the detected system bus write occurs to an address that is the same as an address that is presently within the FIFO 18. The degree of granularity of the decoding process is a function of the particular application and the amount of logic that can be expended for this function.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Memory System Of A Hierarchy Structure (AREA)

Abstract

A write-through cache (14) wherein for write operations of less than a word in length the write data stored within a FIFO memory device (18) associated with a first bus agent reflects the result of a read/modify/write type of access wherein a byte or half word has been merged by a local processor (12) with a cache word. Memory control lines driven to a system bus (20) indicate to a memory controller (22) that a write operation is to be accomplished as a word write, thereby eliminating the additional time required to achieve a read/modify/write memory controller cycle. To prevent the occurrence of a problem wherein another bus agent, such as another CPU or an I/O device, writes to a system memory (24) during an interval of time that the word of data is temporarily buffered within the FIFO there is provided circuitry for detecting an external write made to the system memory. Circuitry is also provided for detecting that the FIFO has data stored within and for changing the memory command lines to indicate, instead of a word write, a byte write of a half-word write operation.

Description

APPARATUS AND METHOD FOR MAINTAINING CACHE/MAIN MEMORY
CONSISTENCY
FIELD OF THE INVENTION:
This invention relates generally to data processing apparatus and method and, in particular, to a Central Processor Unit (CPU) having a write-through cache and a local buffer memory paralleling the cache for buffering write data. The CPU includes circuitry for detecting an occurrence of an externally generated write to a main memory and circuitry for modifying an associated memory command from a word-type of write access to less than a word type of write access to prevent data in the main memory from being overwritten with possibly non-current data from the buffer memory.
BACKGROUND OF THE INVENTION:
The invention described herein is particularly useful in a data processing system of the type wherein a memory controller controls a plurality of memory devices organized for storing multi-byte memory words. In such a memory error detection and correction syndrome bits are generated typically over a number of bytes of the memory word such as, for example, four bytes of a 32-bit memory word. When a central processor unit (CPU) writes a word of data to such a memory the error syndrome bits are generated for the entire word and stored within associated memory devices. However, when the CPU writes less than a full word of data, such as a byte (eight bits) or a half word (16 bits) of data, the memory controller operates to first read the full word of data, merge the byte or bytes, generate the error syndrome bits over the full merged word and write the merged word with the associated syndrome bits back to memory. As can be appreciated, this read/modify/write type of access may be a time consuming process.
In some systems a plurality of CPUs are coupled to a common system bus and through the bus to one or more memory units. Each CPU may have a local cache memory wherein a copy of a portion of a main system memory is maintained. In such a system it is a desirable goal that the main memory be updated to accurately reflect changes made to data within the cache memories. For example, such a system may employ a write-through type of cache memory wherein data written to the cache is also written "through" the cache to the main memory. For this purpose a first-in/first-out (FIFO) memory can be employed in parallel with the cache, the FIFO accepting write data from the CPU and temporarily buffering the data before providing the data over a system bus to the main memory. The FIFO is normally a word width or greater.
For those write operations of less than a word in width the data stored within the FIFO reflects the result of a read/modify/write type of access wherein a byte or half word is already merged by the CPU with a cache word. Thus, instead of writing a byte or half word to the memory and incurring the read/modify/write cycle time delay in generating the error syndrome bits, a more efficient cache memory write-through technique writes a full, already merged word from the FIFO to the memory.
However a problem is created when another system bus agent, such as another CPU in a multi-processor system, writes to the main memory during the interval of time that the word of data is temporarily buffered within the FIFO. In this case the word in main memory that is the target of the FIFO may have just been updated by the other bus agent. Permitting the full word to be written from the FIFO would result in the newer data being over-written by the older data contained within the FIFO and the destruction in the main memory of the newer data.
It is thus one object of the invention to provide a method and apparatus for updating data within a main memory as a result of a write operation to a local cache memory.
It is another object of the invention to provide a method and apparatus for use with a write-through cache that updates data within a main memory by providing a dual port memory, such as a FIFO buffer, in parallel with the cache memory and to further provide circuitry for detecting when a write occurs to the main memory to invalidate a full word write of data from the FIFO buffer to the main memory.
It is a further object of the invention to provide a method and apparatus for use with a write-through cache that updates data within a main memory by providing a FIFO buffer in parallel with the cache memory and to further provide circuitry for detecting when a write occurs to the main memory and circuitry to modify an associated memory command from a word write access to less than a word type of write access to prevent data in the main memory from being overwritten with possibly non-current data.
SUMMARY OF THE INVENTION The foregoing problems are overcome and the objects of the invention are realized by method and apparatus for storing data units within a system memory over a system bus. Specifically a write-through cache is taught wherein for write operations of less than a word in length the write data stored within a FIFO memory device associated with a first bus agent reflects the result of a read/modify/write type of access wherein a byte or half word is merged by a local processor with a cache word. Memory control lines driven to the system bus indicate to a memory controller that a write operation is to be accomplished as a word write, thereby eliminating the additional time required to achieve a read/modify/write memory controller cycle. To prevent the occurrence of a problem wherein another bus agent, such as another CPU or an I/O device, writes to the system memory during an interval of time that the word of data is temporarily buffered within the FIFO and the newly written data is overwritten by the FIFO there is provided circuitry for detecting an external write made by another system bus agent to the system memory. Circuitry is also provided for detecting that the FIFO has data stored within and circuitry is provided for changing the memory command lines to indicate, instead of a word write, less than a word write. This causes the memory controller to operate only upon the portion of data word that was modified by the local processor and to perform a conventional read/modify/write type of cycle to.merge only that portion of the word with a word from main memory.. For example, a byte or a half-word is identified to the memory controller by the least significant bits of an address that is also buffered by the FIFO.
BRIEF DESCRIPTION OF THE DRAWING
The above set forth and other features of the invention are made more apparent in the ensuing Detailed Description of the Invention when read in conjunction with the attached Drawing, wherein:
Fig. 1 is is a block diagram of a data processing system constructed and operated in accordance with the invention; and
Fig. 2 is a simplified schematic diagram illustrating circuitry for implementing the invention. DETAILED DESCRIPTION OF THE INVENTION Fig. 1 is an illustrative block diagram of a portion of an illustrative data processing system 1 constructed and operated in accordance with the invention. System 1 includes at least one central processor unit (CPU) 10 shown as CPU1 through CPUn. Typically each CPU 10 is of identical construction to others of the CPUs. CPU 10 includes a processor 12, such as a microprocessor device, that is coupled to a local cache memory 14 via a plurality of data lines 12a, address lines 12b and control lines 12c. The cache memory 14 is coupled via a bus 14a to a bus interface unit 16, the bus interface unit 16 providing bidirectional data communication with a system bus (SB) 20. Also associated with the cache 14 but not shown are a plurality of parity lines and various other signals of a type known to those having skill in the art. The bus 14a includes the data, address and control lines 12a-12c. Coupled in parallel with the cache 14 is a dual port memory device such as a FIFO buffer 18. FIFO 18 functions to briefly buffer write data intended for updating a main memory 24 that is coupled to the system bus 20 through a memory controller 22. The FIFO 14 operates to receive and store write data from the processor 12 before the data is provided to SB 20. In that the processor 12 typically operates at a faster clock rate than the SB 20 the FIFO 18 permits the processor 12 to write the data and continue operation without having to synchronize its operation with the typically slower SB 20. The FIFO 18 is provided with sufficient memory capacity to store at least one and typically up to four words or double words of data although more or less than this typical number may be readily provided.
A plurality of memory units (MEMl-MEMn) may be provided, depending on the memory density of an individual one of the units 24 and the desired total memory capacity of the system 1. The memory controller 22 includes Error Correction Circuitry (ECC) 22a. ECC 22a operates to generate and test syndrome bits upon a word of memory data at a time. In this illustrative embodiment a word of memory data is considered to be four bytes, or 32 bits, in width. The memory controller 22, memory units 24, SB 20, FIFO 18 and cache 14, for example, may be operable for simultaneously conveying, reading and/or writing multiple words of data, such as a double word (64 bits) or a quad word (128 bits) .
For write operations of less than a word in length the data stored within the FIFO 18 reflects the result of cache write hit wherein a read/modify/write type of access is performed by processor 12 to merge a byte or half word with a cache word. Also, certain of the CONT 12c lines indicate to the memory controller 22 that the write is to be accomplished as a word write, thereby eliminating the additional time required to achieve the read/modify/write memory controller cycle. However, the aforementioned problem occurs when, for example, the CPUn writes to the memory 24 during an interval of time that the word of data is temporarily buffered within the FIFO 18. In this case the word in main memory that is the target of the FIFO 18 write may have just been changed by CPUn. If the FIFO 18 write were allowed to proceed as a word write the newer data would be over-written by the older data from the cache 14. As will now be described this problem is circumvented by detecting within the CPU1 the write made to memory 24 by CPUn, or any other bus agent, and changing the CONT 12c memory command lines to indicate, instead of a word write, a byte write or a half word write operation. This causes the memory controller 22 to operate only upon the byte or half word of data that is being written and to perform a conventional read/modify/write type of cycle to merge the byte or half word with a word from main memory. The byte or half word is identified by the least significant bits of the address that is also buffered by the FIFO 18.
In accordance with the invention, each of the CPUs 10 is provided circuitry to detect the occurrence of such an external write to memory 24 and circuitry to prevent the FIFO 18 from writing a full word of data to the memory 24 after such an external write occurs.
Referring to Fig. 2 there is shown in greater detail the FIFO 18. Although illustrated as a single device it should be realized that the FIFO 18 is typically comprised of a plurality of individual FIFO devices for buffering the 32 data lines 12a, an associated 32 bit address line 12b and associated control lines 12c. By example, the control lines identify the type of memory access associated with the data and address. That is, the control lines indicate whether the processor 12 performed a byte write, a half-word write or a word write upon the associated word of data stored within the FIFO 18. The bus 18a includes buffered data (BDATA) , buffered address (BADDR) and buffered control lines (BCONT) from the output of the FIFO 18. A plurality of system bus 20 drivers are associated with the output 18a of the FIFO 18. In Fig. 2 only those drivers 26 associated with a portion of the BCONT lines are shown. Buffers 26c-26f provide a four bit memory control signal (MC0*-MC3*) to the SB 20, the asterisk following the signal name indicating in a conventional manner that the signal is asserted when a logic zero or low. The MCn* control bits are coded to identify to the memory controller 22 the type of memory access in the following manner.
Figure imgf000011_0001
NAND buffers 26c through 26f are enabled to drive their associated memory control bits onto the SB 20 when a controlling system access enable (SAEN) signal is asserted by circuitry (not shown) on the CPU 10. The assertion of SAEN enables the driving of data from the CPU 10 to the SB 20.
Associated with buffers 26c and 26d are two additional NAND buffers 26a and 26b having their outputs wire-ored to the output of buffers 26c and 26d, respectively. Preferably the buffers 26 are open collector type devices wherein such wire-ored connections are readily accomplished.
The purpose of buffers 26a and 26b is to normally override the byte write or half-word write control signal indications, MC2* and MC3* high or MC2* high and MC3* low, respectively, in order to make the control signal indication a word write indication with MC2* and MC3* both low. Buffers 26a and 26b each have as an input signal an output of a gate 28 which has as an input signal a "force word write" (FWW) signal. FWW is one of the buffered control lines that is output from the FIFO 18, FWW being asserted high as a result of a CACHE WRITE HIT signal that is asserted when a processor 12 write hits the cache. The state of the CACHE WRITE HIT signal if stored within the FIFO 18 along with the associated word of data that is written to the cache 14 and to the FIFO 18 by the processor 12. The FWW signal when asserted forces, in conjunction with a normally deasserted UNFWW* signal (to be described) , a word write command to the memory controller 22. This causes memory controller 22 to write a word of data, thereby eliminating the normally required read/modify/write type of access that is employed when writing a byte or a half word of data to the memory 24.
However, for those situations where an external write of the memory occurs while the data is temporarily buffered in the 11
FIFO 18 the forced word write is undesirable in that the FIFO 18 data may overwrite newly written data within the memory 24. To prevent the occurrence of such a situation the CPU 10 further includes additional circuitry to detect such an external write and to convert the word write type of access back to a byte write or a half-word write type of access.
Further in accordance with the invention CPU 10 includes circuitry, such as a comparator 29, for detecting when another agent coupled to the SB 20 performs a write to the memory 24. This other bus agent could be, by. example, another CPU in a multi-CPU system or could be I/O interface circuitry operable for writing data to the memory 24 from an I/O device such as a disk or communication port. The occurrence of the write access by another agent also results in an interrogation of the cache 14 and, if necessary, an invalidation of cache 14. Circuitry (not shown) to perform these functions is well known in the art. The occurrence of such a write access over SB 20 furthermore is detected by comparator 29 and generates a "system external write pending" (SXWP) signal. This signal is input to a JK flip/flop (F/F) 30 where the assertion of this signal causes F/F 30 to set on the edge of a next clock (CLK) input. The Q output of F/F 30 is a signal designated "external write pending" (XWP) that, when asserted, indicates that an externally generated write has occurred on the SB 20. Subsequent to the generation of the SXWP signal a "clear external write pending" (CLRXWP) signal is generated by the CPU 10 to reset the F/F 30 on another edge of CLK. CLRXWP is generated to indicate that the cache 14 interrogation and, if required, the cache 14 invalidation has occurred in response to the externally generated write access of memory 24.
The XWP signal output from F/F 30 is provided to AND gate 34 in conjunction with an EMPTY* flag from the FIFO 18. When the FIFO 18 is empty, that is when the FIFO 18 contains no buffered writes, the EMPTY* signal is low forcing a low from AND gate 34. When FIFO 18 contains one or more buffered writes the EMPTY* signal is deasserted or high. EMPTY* being high in conjunction with XWP being high, XWP indicating an occurrence of an external system bus write to memory 24, causes AND gate 34 to have a high output. The high output from AND gate 34 is applied to the J input of a JK F/F 36 and, in conjunction with an edge of CLK, causes the Q* output to go low. The Q* output of F/F 36 is a signal designated "undo forced word write" (UNFWW*) . The UNFWW* signal is asserted low only when an external system bus write is detected and is pending cache interrogation (XWP is high) and the FIFO 18 contains at least one buffered write (EMPTY* is high) . UNFWW* being asserted, that is being low, causes the output of AND gate 28 to be low, further causing the outputs of NAND gates 26a and 26b to be high. This in turn overrides the forced word write due to the assertion of FWW and causes the states of buffered memory control signals (BMC2 and BMC3) to be reflected in the outputs of the memory control line drivers 26d and 26c, respectively. In other words, if the processor 12 had accomplished a byte write or a half-word write in generating the word of data within the FIFO 18 the MC0*-MC3* lines indicate same to the memory controller 22. The occurrence of the byte write or half-word write memory control signal is interpreted by the memory controller 22, in conjunction with the address bus, as a read/modify/write cycle to the addressed word of memory 24. The byte or half-word identified by the least significant bits of the address bus is merged with the data word that is read from the main memory, the ECC syndrome bits are generated, and the word is written back to the memory. In this manner any other data within the word that may have been recently changed by another bus agent is not overwritten by the full word of data within the FIFO 18.
If UNFWW* is not asserted then the output of AND gate 28 is high so long as FWW is asserted. This high, in conjunction with SAEN being asserted, causes the outputs of open collector buffers 26a and 26b to both be low. The wire-ored node coupled to each of these outputs is thus forced to a logic low and results in the memory control signal appearing as a word write (MC0*-MC3* all low) to memory controller 22.
If the processor 12 had performed a word write that resulted in the cache write hit an assertion of UNFWW* does not change the state of the MC2* and MC3* lines in that BMC2 and BMC3 were both made high by the action of processor 12.
In a presently preferred embodiment of the invention the SXWP signal is generated for any write access that is detected on SB 20. It can be appreciated however that additional logic can be employed to decode the system address bus in conjunction with the occurrence of the write to generate the SXWP signal only on the occurrence of a write to, for example, a same page of memory 24 as that contained within the cache 14. Likewise, with further decoding of the SB 20 address bus the SXWP signal can be generated only when the detected system bus write occurs to an address that is the same as an address that is presently within the FIFO 18. The degree of granularity of the decoding process is a function of the particular application and the amount of logic that can be expended for this function.
The invention has been described in the context of a data processing system having a unit of data expressed as a 32 bit word, a 32 bit address bus and other features as described above. It should be realized however that the teaching of the invention is applicable to a wide variety of data processing systems having characteristics that vary from those disclosed above.
Thus, while the invention has been particularly shown and described with respect to a presently preferred embodiment thereof, it will be understood by those skilled in the art that changes in form and details may be made therein without departing from the scope and spirit of the invention.

Claims

15 CLAIMSWhat is claimed is:
1. Apparatus associated with a first agent coupled to a system bus for temporarily storing data before the data is written to a system memory means, comprising:
means for storing at least one unit of data having a predetermined number of bits;
means for indicating to a system memory means that the entire unit of data is to be written to the system memory means or for indicating that only a portion of the unit of data is to be written to the system memory means; and
means coupled to the system bus for detecting an occurrence of a write access by a second agent to the system memory means, the detecting means further being coupled to and responsive to the storing means having at least one unit of data stored therein during an occurrence of the write access by the second agent for causing the indicating means to change from an indication that the entire unit of data is to be written to the system memory means to an indication that only a portion of the unit of data is to be written to the system memory means. 16 '
2. Apparatus as set forth in Claim 1 wherein the storing means comprises a FIFO buffer means having an input coupled to a source of data on the first agent and an output coupled to the system bus.
3. Apparatus as set forth in Claim 2 wherein the FIFO buffer means has a number of bits for storing at least one word of data, an address associated with the at least one word of data and a plurality of memory control bits for specifying a type of memory means write cycle.
4. Apparatus as set forth in Claim 3 wherein the word of data includes four bytes of data and wherein the memory control bits indicate at least a word write cycle or a byte write cycle.
5. Apparatus as set forth in Claim 2 wherein the first agent further includes a cache memory means coupled between the source of data and the system bus, and wherein both the FIFO buffer means and the cache memory means both simultaneously store data from the source of data.
6. A method of providing data from a first agent to a system bus for storage within a system memory means, comprising the steps of:
receiving from a data processing means associated with the first agent at least one unit of data and storing the 17 at least one unit of data in a storage means; and
providing a stored unit of data to the system bus for storage within the system memory means, wherein the step of providing includes the steps of,
determining if the storage means has at least one unit of data stored therein;
detecting from the system bus an occurrence of a write access by a second agent to the system memory means; and
if the occurrence of a write access is not detected,
providing the stored unit of data to the system memory means over the system bus while indicating to the system memory means that the entire unit of data is to be written to the system memory means,
or, if the occurrence of a write access is detected,
providing the stored unit of data to the system memory means over the system bus while indicating to the system memory means that only a portion of the unit of data is to be written to the system memory means.
7. A method as set forth in Claim 6 wherein the step of storing is accomplished by storing the unit of data within a 18 FIFO memory means.
8. A method as set forth in Claim 6 wherein the step of receiving includes the initial steps of:
reading a unit of data from a cache memory means associated with the first agent, the unit of data being comprised of a plurality of bytes of data;
merging at least one byte of data with the unit of data read from the cache memory means; and
storing the unit of data having the at least one merged byte into the cache memory means at a predetermined address, the step of storing the unit of data into the cache memory means also including the steps of storing the unit of data having the at least one merged byte into the storage means, storing an address associated with the unit of data into the storage means, and storing a memory command into the storage means, the memory command indicating that the unit of data stored within the storage means has at least one byte merged therein.
9. A method as set forth in Claim 8 wherein the second step of providing the stored unit of data to the system memory means includes the steps of reading out the stored memory command from the storage means and providing the stored memory command to the system memory means. 19
10. A method as set forth in Claim 6 wherein the step of indicating that an entire unit of data is to be written to system memory is accomplished by providing a word write memory command over the system bus and wherein the step of indicating that a portion of a unit of data is to be written to system memory is accomplished by providing a byte write memory command or a half-word write memory command over the system bus.
PCT/US1990/001641 1989-09-11 1990-03-28 Apparatus and method for maintaining cache/main memory consistency WO1991003785A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
DE69031658T DE69031658T2 (en) 1989-09-11 1990-03-28 DEVICE AND METHOD FOR MAINTENANCE OF CACHE / CENTRAL STORAGE CONSISTENCY
EP90907677A EP0491697B1 (en) 1989-09-11 1990-03-28 Apparatus and method for maintaining cache/main memory consistency

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US40580089A 1989-09-11 1989-09-11
US405,800 1989-09-11

Publications (1)

Publication Number Publication Date
WO1991003785A1 true WO1991003785A1 (en) 1991-03-21

Family

ID=23605294

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1990/001641 WO1991003785A1 (en) 1989-09-11 1990-03-28 Apparatus and method for maintaining cache/main memory consistency

Country Status (7)

Country Link
US (1) US5276849A (en)
EP (1) EP0491697B1 (en)
JP (1) JP3637054B2 (en)
AU (1) AU645494B2 (en)
CA (1) CA2066454C (en)
DE (1) DE69031658T2 (en)
WO (1) WO1991003785A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2283839A (en) * 1993-11-09 1995-05-17 Hewlett Packard Co Data merging method and apparatus for shared memory multiprocessing systems
US5553265A (en) * 1994-10-21 1996-09-03 International Business Machines Corporation Methods and system for merging data during cache checking and write-back cycles for memory reads and writes
US5724549A (en) * 1992-04-06 1998-03-03 Cyrix Corporation Cache coherency without bus master arbitration signals
GB2368157A (en) * 2000-06-15 2002-04-24 Hewlett Packard Co Byte-swapping for efficient use of memory
US8108593B2 (en) 2008-03-01 2012-01-31 Kabushiki Kaisha Toshiba Memory system for flushing and relocating data

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5452463A (en) * 1990-05-25 1995-09-19 Dell Usa, L.P. Processor and cache controller interface lock jumper
US5379396A (en) * 1991-10-11 1995-01-03 Intel Corporation Write ordering for microprocessor depending on cache hit and write buffer content
US5590310A (en) * 1993-01-14 1996-12-31 Integrated Device Technology, Inc. Method and structure for data integrity in a multiple level cache system
US5557733A (en) * 1993-04-02 1996-09-17 Vlsi Technology, Inc. Caching FIFO and method therefor
JPH09505679A (en) * 1993-07-07 1997-06-03 トーシバ・アメリカ・エレクトロニック・コンポーネンツ・インコーポレーテッド Memory buffer with selective flash function
JPH07129456A (en) * 1993-10-28 1995-05-19 Toshiba Corp Computer system
US5592684A (en) * 1994-07-22 1997-01-07 Dell Usa, L.P. Store queue including a byte order tracking mechanism for maintaining data coherency
US5649158A (en) * 1995-02-23 1997-07-15 International Business Machines Corporation Method for incrementally archiving primary storage to archive storage by utilizing both a partition archive status array and a partition map
US5809228A (en) * 1995-12-27 1998-09-15 Intel Corporaiton Method and apparatus for combining multiple writes to a memory resource utilizing a write buffer
FR2759178B1 (en) * 1997-02-05 1999-04-09 Sgs Thomson Microelectronics MEMORY MANAGEMENT CIRCUIT IN A MULTI-USER ENVIRONMENT WITH REQUEST AND PRIORITY OF ACCESS
US6437789B1 (en) * 1999-02-19 2002-08-20 Evans & Sutherland Computer Corporation Multi-level cache controller
US6564306B2 (en) * 2000-04-25 2003-05-13 Hewlett-Packard Development Company, L.P. Apparatus and method for performing speculative cache directory tag updates
US20050270870A1 (en) * 2004-06-02 2005-12-08 Sangho Shin Time slot interchange switch with cache
US7944876B2 (en) 2004-06-02 2011-05-17 Integrated Device Technology, Inc Time slot interchange switch with bit error rate testing
CN101617354A (en) 2006-12-12 2009-12-30 埃文斯和萨瑟兰计算机公司 Be used for calibrating the system and method for the rgb light of single modulator projector
US20080168331A1 (en) * 2007-01-05 2008-07-10 Thomas Vogelsang Memory including error correction code circuit
JP5162763B2 (en) * 2007-08-07 2013-03-13 株式会社メガチップス Memory access system
US8099734B2 (en) * 2007-09-06 2012-01-17 Kabushiki Kaisha Toshiba Portable system and method for soft reset of computer devices
US8358317B2 (en) 2008-05-23 2013-01-22 Evans & Sutherland Computer Corporation System and method for displaying a planar image on a curved surface
US8702248B1 (en) 2008-06-11 2014-04-22 Evans & Sutherland Computer Corporation Projection method for reducing interpixel gaps on a viewing surface
US8077378B1 (en) 2008-11-12 2011-12-13 Evans & Sutherland Computer Corporation Calibration system and method for light modulation device
US9641826B1 (en) 2011-10-06 2017-05-02 Evans & Sutherland Computer Corporation System and method for displaying distant 3-D stereo on a dome surface
GB2526849B (en) * 2014-06-05 2021-04-14 Advanced Risc Mach Ltd Dynamic cache allocation policy adaptation in a data processing apparatus
US10372602B2 (en) * 2015-01-30 2019-08-06 Hewlett Packard Enterprise Development Lp Ordering updates for nonvolatile memory accesses
US10324850B2 (en) 2016-11-11 2019-06-18 Microsoft Technology Licensing, Llc Serial lookup of tag ways
US10565122B2 (en) 2017-05-30 2020-02-18 Microsoft Technology Licensing, Llc Serial tag lookup with way-prediction

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3984818A (en) * 1974-02-09 1976-10-05 U.S. Philips Corporation Paging in hierarchical memory systems
US4157586A (en) * 1977-05-05 1979-06-05 International Business Machines Corporation Technique for performing partial stores in store-thru memory configuration
EP0149392A2 (en) * 1983-12-29 1985-07-24 Fujitsu Limited Digital computer system
EP0168121A1 (en) * 1984-02-10 1986-01-15 Prime Computer, Inc. Memory access method and apparatus in multiple processor systems

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3883854A (en) * 1973-11-30 1975-05-13 Ibm Interleaved memory control signal and data handling apparatus using pipelining techniques
US4195340A (en) * 1977-12-22 1980-03-25 Honeywell Information Systems Inc. First in first out activity queue for a cache store
US4494190A (en) * 1982-05-12 1985-01-15 Honeywell Information Systems Inc. FIFO buffer to cache memory
US4685082A (en) * 1985-02-22 1987-08-04 Wang Laboratories, Inc. Simplified cache with automatic update
US4933835A (en) * 1985-02-22 1990-06-12 Intergraph Corporation Apparatus for maintaining consistency of a cache memory with a primary memory
US4716545A (en) * 1985-03-19 1987-12-29 Wang Laboratories, Inc. Memory means with multiple word read and single word write
US4805098A (en) * 1986-05-05 1989-02-14 Mips Computer Systems, Inc. Write buffer
US4768148A (en) * 1986-06-27 1988-08-30 Honeywell Bull Inc. Read in process memory apparatus
US4992930A (en) * 1988-05-09 1991-02-12 Bull Hn Information Systems Inc. Synchronous cache memory system incorporating tie-breaker apparatus for maintaining cache coherency using a duplicate directory

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3984818A (en) * 1974-02-09 1976-10-05 U.S. Philips Corporation Paging in hierarchical memory systems
US4157586A (en) * 1977-05-05 1979-06-05 International Business Machines Corporation Technique for performing partial stores in store-thru memory configuration
EP0149392A2 (en) * 1983-12-29 1985-07-24 Fujitsu Limited Digital computer system
EP0168121A1 (en) * 1984-02-10 1986-01-15 Prime Computer, Inc. Memory access method and apparatus in multiple processor systems

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5724549A (en) * 1992-04-06 1998-03-03 Cyrix Corporation Cache coherency without bus master arbitration signals
GB2283839A (en) * 1993-11-09 1995-05-17 Hewlett Packard Co Data merging method and apparatus for shared memory multiprocessing systems
US5710881A (en) * 1993-11-09 1998-01-20 Hewlett Packard Company Data merging method and apparatus for shared memory multiprocessing computer systems
GB2283839B (en) * 1993-11-09 1998-06-10 Hewlett Packard Co Data merging method and apparatus for shared memory multi-processing computer systems
US5553265A (en) * 1994-10-21 1996-09-03 International Business Machines Corporation Methods and system for merging data during cache checking and write-back cycles for memory reads and writes
US5627993A (en) * 1994-10-21 1997-05-06 International Business Machines Corporation Methods and systems for merging data during cache checking and write-back cycles for memory reads and writes
GB2368157A (en) * 2000-06-15 2002-04-24 Hewlett Packard Co Byte-swapping for efficient use of memory
US6629168B1 (en) 2000-06-15 2003-09-30 Hewlett-Packard Development Company, Lp. Byte-swapping for efficient use of memory
GB2368157B (en) * 2000-06-15 2005-01-26 Hewlett Packard Co Byte-swapping for efficient use of memory
US8108593B2 (en) 2008-03-01 2012-01-31 Kabushiki Kaisha Toshiba Memory system for flushing and relocating data

Also Published As

Publication number Publication date
CA2066454C (en) 1998-08-25
EP0491697B1 (en) 1997-10-29
US5276849A (en) 1994-01-04
DE69031658T2 (en) 1998-05-20
AU645494B2 (en) 1994-01-20
JP3637054B2 (en) 2005-04-06
JPH05502123A (en) 1993-04-15
DE69031658D1 (en) 1997-12-04
CA2066454A1 (en) 1991-03-12
EP0491697A1 (en) 1992-07-01
AU5656990A (en) 1991-04-08

Similar Documents

Publication Publication Date Title
AU645494B2 (en) Apparatus and method for maintaining cache/main memory consistency
US5809280A (en) Adaptive ahead FIFO with LRU replacement
US3938097A (en) Memory and buffer arrangement for digital computers
US4995041A (en) Write back buffer with error correcting capabilities
US5493666A (en) Memory architecture using page mode writes and single level write buffering
US5388247A (en) History buffer control to reduce unnecessary allocations in a memory stream buffer
US4768148A (en) Read in process memory apparatus
US4527233A (en) Addressable buffer circuit with address incrementer independently clocked by host computer and external storage device controller
US5742831A (en) Methods and apparatus for maintaining cache coherency during copendency of load and store operations
EP0090575A2 (en) Memory system
JPS624745B2 (en)
JPS5821353B2 (en) Channel-to-memory writing device
JPH11502656A (en) Method and apparatus for combining writes to memory
US5588128A (en) Dynamic direction look ahead read buffer
JPH08185355A (en) Data memory and its operating method
US5854943A (en) Speed efficient cache output selector circuitry based on tag compare and data organization
US5455925A (en) Data processing device for maintaining coherency of data stored in main memory, external cache memory and internal cache memory
US6134632A (en) Controller that supports data merging utilizing a slice addressable memory array
US5295253A (en) Cache memory utilizing a two-phase synchronization signal for controlling saturation conditions of the cache
US20070033306A1 (en) FIFO-type one-way interfacing device between a master unit and a slave unit, and corresponding master unit and slave unit
US5960456A (en) Method and apparatus for providing a readable and writable cache tag memory
EP0943998B1 (en) Cache memory apparatus
JP2700147B2 (en) Instruction cache flash on REI control
GB2037466A (en) Computer with cache memory
US5832307A (en) Satellite communication system overwriting not validated message stored in circular buffer with new message in accordance with address stored in last valid write address register

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB IT LU NL SE

WWE Wipo information: entry into national phase

Ref document number: 1990907677

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2066454

Country of ref document: CA

WWP Wipo information: published in national office

Ref document number: 1990907677

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1990907677

Country of ref document: EP