APPARATUS AND METHOD FOR MAINTAINING CACHE/MAIN MEMORY
CONSISTENCY
FIELD OF THE INVENTION:
This invention relates generally to data processing apparatus and method and, in particular, to a Central Processor Unit (CPU) having a write-through cache and a local buffer memory paralleling the cache for buffering write data. The CPU includes circuitry for detecting an occurrence of an externally generated write to a main memory and circuitry for modifying an associated memory command from a word-type of write access to less than a word type of write access to prevent data in the main memory from being overwritten with possibly non-current data from the buffer memory.
BACKGROUND OF THE INVENTION:
The invention described herein is particularly useful in a data processing system of the type wherein a memory controller controls a plurality of memory devices organized for storing multi-byte memory words. In such a memory error detection and correction syndrome bits are generated typically over a number of bytes of the memory word such as, for example, four bytes of a 32-bit memory word. When a central processor unit (CPU)
writes a word of data to such a memory the error syndrome bits are generated for the entire word and stored within associated memory devices. However, when the CPU writes less than a full word of data, such as a byte (eight bits) or a half word (16 bits) of data, the memory controller operates to first read the full word of data, merge the byte or bytes, generate the error syndrome bits over the full merged word and write the merged word with the associated syndrome bits back to memory. As can be appreciated, this read/modify/write type of access may be a time consuming process.
In some systems a plurality of CPUs are coupled to a common system bus and through the bus to one or more memory units. Each CPU may have a local cache memory wherein a copy of a portion of a main system memory is maintained. In such a system it is a desirable goal that the main memory be updated to accurately reflect changes made to data within the cache memories. For example, such a system may employ a write-through type of cache memory wherein data written to the cache is also written "through" the cache to the main memory. For this purpose a first-in/first-out (FIFO) memory can be employed in parallel with the cache, the FIFO accepting write data from the CPU and temporarily buffering the data before providing the data over a system bus to the main memory. The FIFO is normally a word width or greater.
For those write operations of less than a word in width the data stored within the FIFO reflects the result of a
read/modify/write type of access wherein a byte or half word is already merged by the CPU with a cache word. Thus, instead of writing a byte or half word to the memory and incurring the read/modify/write cycle time delay in generating the error syndrome bits, a more efficient cache memory write-through technique writes a full, already merged word from the FIFO to the memory.
However a problem is created when another system bus agent, such as another CPU in a multi-processor system, writes to the main memory during the interval of time that the word of data is temporarily buffered within the FIFO. In this case the word in main memory that is the target of the FIFO may have just been updated by the other bus agent. Permitting the full word to be written from the FIFO would result in the newer data being over-written by the older data contained within the FIFO and the destruction in the main memory of the newer data.
It is thus one object of the invention to provide a method and apparatus for updating data within a main memory as a result of a write operation to a local cache memory.
It is another object of the invention to provide a method and apparatus for use with a write-through cache that updates data within a main memory by providing a dual port memory, such as a FIFO buffer, in parallel with the cache memory and to further provide circuitry for detecting when a write occurs to the main memory to invalidate a full word write of data
from the FIFO buffer to the main memory.
It is a further object of the invention to provide a method and apparatus for use with a write-through cache that updates data within a main memory by providing a FIFO buffer in parallel with the cache memory and to further provide circuitry for detecting when a write occurs to the main memory and circuitry to modify an associated memory command from a word write access to less than a word type of write access to prevent data in the main memory from being overwritten with possibly non-current data.
SUMMARY OF THE INVENTION The foregoing problems are overcome and the objects of the invention are realized by method and apparatus for storing data units within a system memory over a system bus. Specifically a write-through cache is taught wherein for write operations of less than a word in length the write data stored within a FIFO memory device associated with a first bus agent reflects the result of a read/modify/write type of access wherein a byte or half word is merged by a local processor with a cache word. Memory control lines driven to the system bus indicate to a memory controller that a write operation is to be accomplished as a word write, thereby eliminating the additional time required to achieve a read/modify/write memory controller cycle. To prevent the occurrence of a problem wherein another bus agent, such as another CPU or an I/O device, writes to the system memory during an interval of time
that the word of data is temporarily buffered within the FIFO and the newly written data is overwritten by the FIFO there is provided circuitry for detecting an external write made by another system bus agent to the system memory. Circuitry is also provided for detecting that the FIFO has data stored within and circuitry is provided for changing the memory command lines to indicate, instead of a word write, less than a word write. This causes the memory controller to operate only upon the portion of data word that was modified by the local processor and to perform a conventional read/modify/write type of cycle to.merge only that portion of the word with a word from main memory.. For example, a byte or a half-word is identified to the memory controller by the least significant bits of an address that is also buffered by the FIFO.
BRIEF DESCRIPTION OF THE DRAWING
The above set forth and other features of the invention are made more apparent in the ensuing Detailed Description of the Invention when read in conjunction with the attached Drawing, wherein:
Fig. 1 is is a block diagram of a data processing system constructed and operated in accordance with the invention; and
Fig. 2 is a simplified schematic diagram illustrating circuitry for implementing the invention.
DETAILED DESCRIPTION OF THE INVENTION Fig. 1 is an illustrative block diagram of a portion of an illustrative data processing system 1 constructed and operated in accordance with the invention. System 1 includes at least one central processor unit (CPU) 10 shown as CPU1 through CPUn. Typically each CPU 10 is of identical construction to others of the CPUs. CPU 10 includes a processor 12, such as a microprocessor device, that is coupled to a local cache memory 14 via a plurality of data lines 12a, address lines 12b and control lines 12c. The cache memory 14 is coupled via a bus 14a to a bus interface unit 16, the bus interface unit 16 providing bidirectional data communication with a system bus (SB) 20. Also associated with the cache 14 but not shown are a plurality of parity lines and various other signals of a type known to those having skill in the art. The bus 14a includes the data, address and control lines 12a-12c. Coupled in parallel with the cache 14 is a dual port memory device such as a FIFO buffer 18. FIFO 18 functions to briefly buffer write data intended for updating a main memory 24 that is coupled to the system bus 20 through a memory controller 22. The FIFO 14 operates to receive and store write data from the processor 12 before the data is provided to SB 20. In that the processor 12 typically operates at a faster clock rate than the SB 20 the FIFO 18 permits the processor 12 to write the data and continue operation without having to synchronize its operation with the typically slower SB 20. The FIFO 18 is provided with sufficient memory capacity to store at least one
and typically up to four words or double words of data although more or less than this typical number may be readily provided.
A plurality of memory units (MEMl-MEMn) may be provided, depending on the memory density of an individual one of the units 24 and the desired total memory capacity of the system 1. The memory controller 22 includes Error Correction Circuitry (ECC) 22a. ECC 22a operates to generate and test syndrome bits upon a word of memory data at a time. In this illustrative embodiment a word of memory data is considered to be four bytes, or 32 bits, in width. The memory controller 22, memory units 24, SB 20, FIFO 18 and cache 14, for example, may be operable for simultaneously conveying, reading and/or writing multiple words of data, such as a double word (64 bits) or a quad word (128 bits) .
For write operations of less than a word in length the data stored within the FIFO 18 reflects the result of cache write hit wherein a read/modify/write type of access is performed by processor 12 to merge a byte or half word with a cache word. Also, certain of the CONT 12c lines indicate to the memory controller 22 that the write is to be accomplished as a word write, thereby eliminating the additional time required to achieve the read/modify/write memory controller cycle. However, the aforementioned problem occurs when, for example, the CPUn writes to the memory 24 during an interval of time that the word of data is temporarily buffered within the FIFO
18. In this case the word in main memory that is the target of the FIFO 18 write may have just been changed by CPUn. If the FIFO 18 write were allowed to proceed as a word write the newer data would be over-written by the older data from the cache 14. As will now be described this problem is circumvented by detecting within the CPU1 the write made to memory 24 by CPUn, or any other bus agent, and changing the CONT 12c memory command lines to indicate, instead of a word write, a byte write or a half word write operation. This causes the memory controller 22 to operate only upon the byte or half word of data that is being written and to perform a conventional read/modify/write type of cycle to merge the byte or half word with a word from main memory. The byte or half word is identified by the least significant bits of the address that is also buffered by the FIFO 18.
In accordance with the invention, each of the CPUs 10 is provided circuitry to detect the occurrence of such an external write to memory 24 and circuitry to prevent the FIFO 18 from writing a full word of data to the memory 24 after such an external write occurs.
Referring to Fig. 2 there is shown in greater detail the FIFO 18. Although illustrated as a single device it should be realized that the FIFO 18 is typically comprised of a plurality of individual FIFO devices for buffering the 32 data lines 12a, an associated 32 bit address line 12b and associated control lines 12c. By example, the control lines
identify the type of memory access associated with the data and address. That is, the control lines indicate whether the processor 12 performed a byte write, a half-word write or a word write upon the associated word of data stored within the FIFO 18. The bus 18a includes buffered data (BDATA) , buffered address (BADDR) and buffered control lines (BCONT) from the output of the FIFO 18. A plurality of system bus 20 drivers are associated with the output 18a of the FIFO 18. In Fig. 2 only those drivers 26 associated with a portion of the BCONT lines are shown. Buffers 26c-26f provide a four bit memory control signal (MC0*-MC3*) to the SB 20, the asterisk following the signal name indicating in a conventional manner that the signal is asserted when a logic zero or low. The MCn* control bits are coded to identify to the memory controller 22 the type of memory access in the following manner.
NAND buffers 26c through 26f are enabled to drive their associated memory control bits onto the SB 20 when a controlling system access enable (SAEN) signal is asserted by circuitry (not shown) on the CPU 10. The assertion of SAEN enables the driving of data from the CPU 10 to the SB 20.
Associated with buffers 26c and 26d are two additional NAND
buffers 26a and 26b having their outputs wire-ored to the output of buffers 26c and 26d, respectively. Preferably the buffers 26 are open collector type devices wherein such wire-ored connections are readily accomplished.
The purpose of buffers 26a and 26b is to normally override the byte write or half-word write control signal indications, MC2* and MC3* high or MC2* high and MC3* low, respectively, in order to make the control signal indication a word write indication with MC2* and MC3* both low. Buffers 26a and 26b each have as an input signal an output of a gate 28 which has as an input signal a "force word write" (FWW) signal. FWW is one of the buffered control lines that is output from the FIFO 18, FWW being asserted high as a result of a CACHE WRITE HIT signal that is asserted when a processor 12 write hits the cache. The state of the CACHE WRITE HIT signal if stored within the FIFO 18 along with the associated word of data that is written to the cache 14 and to the FIFO 18 by the processor 12. The FWW signal when asserted forces, in conjunction with a normally deasserted UNFWW* signal (to be described) , a word write command to the memory controller 22. This causes memory controller 22 to write a word of data, thereby eliminating the normally required read/modify/write type of access that is employed when writing a byte or a half word of data to the memory 24.
However, for those situations where an external write of the memory occurs while the data is temporarily buffered in the
11
FIFO 18 the forced word write is undesirable in that the FIFO 18 data may overwrite newly written data within the memory 24. To prevent the occurrence of such a situation the CPU 10 further includes additional circuitry to detect such an external write and to convert the word write type of access back to a byte write or a half-word write type of access.
Further in accordance with the invention CPU 10 includes circuitry, such as a comparator 29, for detecting when another agent coupled to the SB 20 performs a write to the memory 24. This other bus agent could be, by. example, another CPU in a multi-CPU system or could be I/O interface circuitry operable for writing data to the memory 24 from an I/O device such as a disk or communication port. The occurrence of the write access by another agent also results in an interrogation of the cache 14 and, if necessary, an invalidation of cache 14. Circuitry (not shown) to perform these functions is well known in the art. The occurrence of such a write access over SB 20 furthermore is detected by comparator 29 and generates a "system external write pending" (SXWP) signal. This signal is input to a JK flip/flop (F/F) 30 where the assertion of this signal causes F/F 30 to set on the edge of a next clock (CLK) input. The Q output of F/F 30 is a signal designated "external write pending" (XWP) that, when asserted, indicates that an externally generated write has occurred on the SB 20. Subsequent to the generation of the SXWP signal a "clear external write pending" (CLRXWP) •signal is generated by the CPU 10 to reset the F/F 30 on another edge of CLK. CLRXWP is
generated to indicate that the cache 14 interrogation and, if required, the cache 14 invalidation has occurred in response to the externally generated write access of memory 24.
The XWP signal output from F/F 30 is provided to AND gate 34 in conjunction with an EMPTY* flag from the FIFO 18. When the FIFO 18 is empty, that is when the FIFO 18 contains no buffered writes, the EMPTY* signal is low forcing a low from AND gate 34. When FIFO 18 contains one or more buffered writes the EMPTY* signal is deasserted or high. EMPTY* being high in conjunction with XWP being high, XWP indicating an occurrence of an external system bus write to memory 24, causes AND gate 34 to have a high output. The high output from AND gate 34 is applied to the J input of a JK F/F 36 and, in conjunction with an edge of CLK, causes the Q* output to go low. The Q* output of F/F 36 is a signal designated "undo forced word write" (UNFWW*) . The UNFWW* signal is asserted low only when an external system bus write is detected and is pending cache interrogation (XWP is high) and the FIFO 18 contains at least one buffered write (EMPTY* is high) . UNFWW* being asserted, that is being low, causes the output of AND gate 28 to be low, further causing the outputs of NAND gates 26a and 26b to be high. This in turn overrides the forced word write due to the assertion of FWW and causes the states of buffered memory control signals (BMC2 and BMC3) to be reflected in the outputs of the memory control line drivers 26d and 26c, respectively. In other words, if the processor 12 had accomplished a byte write or a half-word write in
generating the word of data within the FIFO 18 the MC0*-MC3* lines indicate same to the memory controller 22. The occurrence of the byte write or half-word write memory control signal is interpreted by the memory controller 22, in conjunction with the address bus, as a read/modify/write cycle to the addressed word of memory 24. The byte or half-word identified by the least significant bits of the address bus is merged with the data word that is read from the main memory, the ECC syndrome bits are generated, and the word is written back to the memory. In this manner any other data within the word that may have been recently changed by another bus agent is not overwritten by the full word of data within the FIFO 18.
If UNFWW* is not asserted then the output of AND gate 28 is high so long as FWW is asserted. This high, in conjunction with SAEN being asserted, causes the outputs of open collector buffers 26a and 26b to both be low. The wire-ored node coupled to each of these outputs is thus forced to a logic low and results in the memory control signal appearing as a word write (MC0*-MC3* all low) to memory controller 22.
If the processor 12 had performed a word write that resulted in the cache write hit an assertion of UNFWW* does not change the state of the MC2* and MC3* lines in that BMC2 and BMC3 were both made high by the action of processor 12.
In a presently preferred embodiment of the invention the SXWP
signal is generated for any write access that is detected on SB 20. It can be appreciated however that additional logic can be employed to decode the system address bus in conjunction with the occurrence of the write to generate the SXWP signal only on the occurrence of a write to, for example, a same page of memory 24 as that contained within the cache 14. Likewise, with further decoding of the SB 20 address bus the SXWP signal can be generated only when the detected system bus write occurs to an address that is the same as an address that is presently within the FIFO 18. The degree of granularity of the decoding process is a function of the particular application and the amount of logic that can be expended for this function.
The invention has been described in the context of a data processing system having a unit of data expressed as a 32 bit word, a 32 bit address bus and other features as described above. It should be realized however that the teaching of the invention is applicable to a wide variety of data processing systems having characteristics that vary from those disclosed above.
Thus, while the invention has been particularly shown and described with respect to a presently preferred embodiment thereof, it will be understood by those skilled in the art that changes in form and details may be made therein without departing from the scope and spirit of the invention.