GB2035632A - Multiprocessor system - Google Patents

Publication number
GB2035632A
Authority
GB
United Kingdom
Prior art keywords
memory
data
cache memory
unit
cache
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB7846847A
Other versions
GB2035632B (en)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
General Automation Inc
Original Assignee
General Automation Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by General Automation Inc
Priority to GB7846847A
Publication of GB2035632A
Application granted
Publication of GB2035632B
Expired

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00 Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02 Addressing or allocation; Relocation
    • G06F12/08 Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802 Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806 Multiuser, multiprocessor or multiprocessing cache systems

Abstract

A multiprocessor system is described in which a plurality of central processor units 10, 12, 14 share the same main memory 22 over a common asynchronous bus 24. Each processor unit directs all memory requests to its own high speed cache memory 16, 18, 20. If the memory request is to read data from memory, a cache memory controller determines if the addressed data is present in the cache memory. If so, the data is transferred to the processor unit without accessing main memory over the bus. If the data is not present in the cache memory, the cache memory controller gains access to the bus by a priority circuit and reads out the data from main memory, storing the data in the cache memory at the same time that it transfers the data to the processor unit. If the memory request by the processor unit is to write into memory, the cache memory controller gains access to the bus and initiates a data storage operation in main memory. At the same time, the cache memory controller determines if the existing data being overwritten at the addressed location in main memory is present in the cache memory. If so, it updates the data in the cache memory at the same time as it writes the data into main memory.

Description

SPECIFICATION
Multiprocessor system with cache memory
This invention relates to a digital computer system, and more particularly to multiprocessor apparatus in which the processors communicate through cache memories over a common bus with a main memory.
Multiprocessing systems in which a plurality of central processing units share a common memory over a time-shared asynchronous memory bus are well known. However, as the speed of processors has increased, the band width limitations of the bus have become a limiting factor in the number of processors that can effectively share the memory over a common bus.
The present invention is directed to a multiprocessing system which enables a plurality of high speed central processors to share a common memory over an asynchronous bus. The present invention permits a greater number of separate processors to be incorporated into the system without overloading the maximum bit rate capacity (band width) of the bus over which the processors communicate with a common memory.
In brief, embodiments of the present invention provide improved bus band width for a multiprocessing system by providing an arrangement in which each processor in the multiprocessing system communicates with the common bus to main memory through a high speed cache memory unit. The cache memory includes a high speed random access memory storing a plurality of addressable words, each word including a data portion and a tag portion.
When a memory READ operation is initiated by a processor, the processor executes a LOAD command and generates the address in main memory where the data to be read is stored. A group of low order bits in the memory address generated by the processor is used as the address to read out a word of the cache memory. The tag portion of the word read out of the cache memory is then compared with a group of high order bits of the same memory address from the processor and, if these two sets of bits are identical, the data portion of the same word read out of the cache memory is transferred directly to the processor to complete the memory access. Thus no access to the main memory over the bus is required.
If the tag bits of the addressed word from the cache memory do not match the group of high order bits of the memory address, the cache memory unit initiates a memory Read operation at the corresponding address location in the main memory, using the full memory address from the processor, and transfers this data over the bus from main memory to the cache memory unit and to the processor. The cache memory unit stores the data from main memory in the memory of the cache memory unit as the data portion of the word stored at the address location corresponding to the group of lower order bits of the same memory address from the processor. The group of higher order bits of the same memory address is simultaneously stored as the tag portion of the word stored in the same address location of the cache memory.
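The read path just described can be illustrated with a minimal software sketch. It is not the patented circuit: the class name CacheUnit, the dictionary standing in for main memory, and the bus_reads counter are assumptions made for illustration, and the ten-bit index is borrowed from the embodiment described further on.

```python
# Minimal sketch of the read path of a cache whose words hold a data
# portion and a tag portion. Names and the dict-based main memory are
# illustrative assumptions, not part of the specification.
INDEX_BITS = 10                      # low-order bits that select a cache word
INDEX_MASK = (1 << INDEX_BITS) - 1


class CacheUnit:
    def __init__(self, main_memory):
        self.main_memory = main_memory             # shared store: address -> data
        self.tags = [None] * (1 << INDEX_BITS)     # tag portion of each word
        self.data = [0] * (1 << INDEX_BITS)        # data portion of each word
        self.bus_reads = 0                         # main-memory reads over the bus

    def read(self, address):
        index = address & INDEX_MASK               # low-order bits address the cache
        tag = address >> INDEX_BITS                # high-order bits are compared
        if self.tags[index] == tag:                # tags identical: no bus access
            return self.data[index]
        # Tags differ: read main memory with the full address, then store the
        # data and the high-order bits in the same cache word.
        self.bus_reads += 1
        value = self.main_memory.get(address, 0)
        self.data[index] = value
        self.tags[index] = tag
        return value


memory = {0x12345: 77}
cache = CacheUnit(memory)
first = cache.read(0x12345)     # not yet cached, served from main memory
second = cache.read(0x12345)    # served from the cache, no bus access
```

In this sketch the second read of the same address does not increment bus_reads, which is the effect the text relies on to relieve the common bus.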
In the case of a memory Write operation initiated by a STORE command in the processor, the cache memory unit associated with the processor causes the address received from the processor to be transferred by the bus to main memory to initiate a memory Write in the main memory. It then transfers the data from the processor to the bus for writing the data into the main memory. At the same time it uses the lower order bits of the address from the processor to address and read out a word in the cache memory. The tag portion of the word read out of the cache memory is compared with the higher order bits of the address from the processor. If they are identical, the data from the processor is used to update the data portion of the word read out of the cache memory and the word is again written in the same location in the cache memory.
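The write behaviour can be sketched in the same spirit: every write goes to main memory, and the cache word is refreshed only when its tag already matches. The function name cache_write, the list-based tag and data arrays, and the ten-bit index are assumptions for illustration only.

```python
# Sketch of the write path: always write main memory, update the cache
# word only when the stored tag matches the high-order address bits.
INDEX_BITS = 10
INDEX_MASK = (1 << INDEX_BITS) - 1


def cache_write(tags, data, main_memory, address, value):
    """Write through to main memory; return True if the cache was updated."""
    main_memory[address] = value          # every write goes over the bus
    index = address & INDEX_MASK
    tag = address >> INDEX_BITS
    hit = tags[index] == tag
    if hit:
        data[index] = value               # keep the cached copy in step
    return hit


tags = [None] * (1 << INDEX_BITS)
data = [0] * (1 << INDEX_BITS)
memory = {}

# Assume address 0x00405 was cached by an earlier read (index 0x005, tag 1).
tags[0x005] = 0x00405 >> INDEX_BITS
data[0x005] = 1

hit_result = cache_write(tags, data, memory, 0x00405, 99)    # tag matches
miss_result = cache_write(tags, data, memory, 0x00805, 55)   # same index, other tag
```

Note that in this sketch a write whose tag does not match leaves the cache word untouched, so the previously cached address remains valid in that location.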
It has been found that for all classes of programs using the present invention, the amount of time each processor utilises the common bus to main memory is substantially reduced. On the average, in 93% of the memory accesses by the processor, the data is already present in the high speed data memory of the cache memory. This enables the number of separate processors communicating with main memory over the bus to be greatly expanded without exceeding the band width limitations of the bus.
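As a rough worked calculation of what the 93% figure could mean for bus load, the sketch below estimates the fraction of processor memory requests that reach the common bus. Treating 93% as the hit rate for reads, and the 20% write fraction, are assumptions made here for illustration; the specification gives neither a read/write mix nor a formula.

```python
def bus_accesses_per_request(hit_rate, write_fraction):
    """Fraction of processor memory requests that reach the common bus.

    Writes always go to main memory; reads go there only when the
    addressed data is not in the cache.
    """
    reads = 1.0 - write_fraction
    return write_fraction + reads * (1.0 - hit_rate)


# Read-only workload at the 93% figure quoted in the text.
read_only = bus_accesses_per_request(hit_rate=0.93, write_fraction=0.0)

# Hypothetical mix in which one request in five is a write.
mixed = bus_accesses_per_request(hit_rate=0.93, write_fraction=0.2)
```

Under these assumptions a read-only workload uses the bus for about 7% of its requests, and the assumed mixed workload for about 26%, since writes are never absorbed by the cache.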
The present invention is multiprocessing apparatus comprising a plurality of central processing units, each unit including means for initiating a memory access request at a designated address; a plurality of high speed cache memory units; means interconnecting each processing unit with one cache memory unit; a random access main memory; a common bus connecting each of the cache memory units to the main memory, means in each cache memory unit responsive to a memory request and address from the associated processing unit for initiating a memory cycle in the cache memory unit at the initiated address; and means in each cache memory unit for initiating a memory cycle in the main memory over the common bus in response to a memory access request from the processor.
An embodiment of the present invention will now be described, by way of example, with reference to the accompanying drawings, wherein Figure 1 is a schematic block diagram of the multiprocessing system; Figure 2 is a functional block diagram of the cache memory unit; and Figure 3 is a functional block diagram of the main memory control.
Referring to Figure 1, there is shown a multiprocessing system comprising a plurality of central processing units, three of which are indicated at 10, 12 and 14. However, the number of processors can be expanded to more than three. The processing units may be of any well known type in which digitally coded data words are processed in parallel and which provides controls for interfacing with a random access memory in response to LOAD or STORE commands in the program executed by the processor. A suitable processor is sold, for example, by General Automation, Inc., designated the GA-16/440 CPU. Associated with each central processing unit is a high speed cache memory unit, three of which are indicated respectively at 16, 18, and 20.
Each cache memory unit interfaces with the associated central processing unit by a group of address lines by which address information is transferred to the cache memory unit, a group of data lines by which data is transferred between the cache memory unit and the central processing unit, and a group of control lines for controlling the transfer of data between the cache memory unit and the central processing unit. The processor uses these lines to transfer and write data into the memories in response to a STORE command or to read data out of the memories in response to a LOAD command.
As hereinafter described in detail, when a central processing unit initiates a memory access, this is serviced by the cache memory unit. However, if the cache memory unit is not capable of directly servicing the request because the addressed location is not present in the cache memory, the cache memory unit initiates a memory access to a main memory 22 over a common bus 24 and memory controller 25.
The main memory 22 is a conventional random access memory which can be accessed from any of the cache memories 16, 18, or 20 by means of a group of control lines in the bus 24 which provide asynchronous transfer of address and data information over a group of address/data lines in the bus 24.
Access to main memory 22 by any one of the cache memory units 16, 18, or 20 over the common bus 24 is under the control of priority circuits in each of the cache memory units which are linked together by priority control lines forming part of the bus 24.
Referring to Figure 2, each cache memory unit includes a high speed random access memory 30 which, for example, stores 1024 individually addressable words. Each addressable location in memory includes a data portion and a tag portion.
When a memory access is initiated by the associated processor, an address word is coupled to the memory address lines 32. The address may, for example, consist of twenty parallel bits. At the same time the processor signals either a memory Read or a memory Write request, respectively, on either of two input control lines 34 or 36, which are connected to the cache control logic, indicated generally at 38. If the cache memory unit is busy, the cache control logic generates a Busy signal on a control line 39 to signal the processor that the cache memory is in a busy state. Otherwise, the cache control logic 38 causes the address word on the lines 32 to be stored in a group of address latches 40. The cache control logic 38 also, in response to a memory Write signal, causes the data word on a group of bidirectional memory data lines 42 from the processor to be stored in a group of data latches 44.
Considering first the operation of the cache memory in response to a memory Read request, the ten lowest order bits of the memory address on the memory lines 32 are used to address one of the words in the memory 30 and a memory Read is initiated. The data portion of the word read out of the memory 30 is set in a group of CPU data latches 46.
At the same time the group of tag bits stored in the tag portion of the word read out of the memory 30 is applied to a Compare circuit 48 where they are compared with the ten highest order bits of the address stored in the address latches 40. If the tag bits are identical to the highest order address bits, the Compare circuit signals a "hit". The cache control logic 38, in response to the "hit" signal on output line 50 of the Compare circuit 48 signals the processor over the Memory Ready control line 52 that the data is present. The processor then initiates transfer of the data word over the memory data lines 42 from the CPU data latches 46, completing the Read operation.
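The division of the twenty-bit address into ten index bits and ten tag bits can be checked with a short worked example. The function name split_address and the two sample addresses are chosen here for illustration and do not come from the specification.

```python
ADDRESS_BITS = 20                          # width of the processor address
INDEX_BITS = 10                            # ten lowest order bits select one of 1024 words
TAG_BITS = ADDRESS_BITS - INDEX_BITS       # ten highest order bits form the tag


def split_address(address):
    """Return (tag, index) for a 20-bit address."""
    if not 0 <= address < (1 << ADDRESS_BITS):
        raise ValueError("address does not fit in 20 bits")
    index = address & ((1 << INDEX_BITS) - 1)
    tag = address >> INDEX_BITS
    return tag, index


# Two sample addresses that share the same ten lowest order bits.
tag_a, index_a = split_address(0x12345)    # tag 0x048, index 0x345
tag_b, index_b = split_address(0xABF45)    # tag 0x2AF, index 0x345
```

Both sample addresses select cache word 0x345 but carry different tags, so only one of them can occupy that word at a time; the Compare circuit tells the two apart by their tags.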
In the event the tag bits read out of the tag array of the memory 30 are not identical to the higher order bits of the address received from the processor, the Compare circuit 48 signals a "miss". This signals the cache control logic 38 that the data being addressed is not present in the cache memory. The cache control logic 38 then initiates an access to the main memory 22 over the common bus 24.
To this end, the cache control logic 38 first signals a memory request on the priority control lines by means of a priority circuit 54, such as described in detail in the above-identified copending application and incorporated herein by reference. When access to the bus 24 is granted by the priority circuit to the cache memory unit, the control logic 38 signals a main memory request on the control line MRQ going to the main memory controller 25. At the same time, the full address stored in the latches 40 is coupled to the bus 24. The cache control logic 38 also signals over the R/W control line whether a Read or Write operation is required.
The main memory controller is shown in more detail in Figure 3. The control lines to the bus 24 are connected to the memory control logic circuit 60.
Assuming a memory Read is required, the memory control logic 60 stores the address bits on the bus 24 in a group of address latches 62 and returns an Address Accepted signal over the control line ACC to the cache control logic 38, which responds by clearing the address bits from the bus 24. The memory control logic 60 then initiates a readout of data from the random access main memory 22 using the full 20-bit address in the address latches 62. The memory control logic 60 then activates the data drivers to the bus 24 and issues a data-to-source signal on the line DTS signalling that the data from memory is present on the bus 24. When the DTS signal is received by the cache control logic 38, it causes the data on the bus 24 to be stored in the data latches 44 and provides a Memory Ready signal to the processor, allowing the processor to read the data off the memory data lines 42. At the same time, the cache control logic 38 initiates a memory Write on the high-speed memory 30 causing the data word from the bus 24 to be stored in the data array at the location identified by the lower order bits of the address stored in the address latches 40. The ten highest order bits of the address in the address latches 40 are written in the tag portion of the same word. Thus the cache memory is automatically updated by the memory access initiated by the processor whenever the cache control logic signals a "miss".
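The order of events in a main-memory read following a "miss" can be laid out as a simple trace. This is a sequential sketch of the handshake, not a timing-accurate model: the function name read_miss_sequence, the dictionary used as main memory, and the wording of the trace entries are assumptions, while the signal names MRQ, R/W, ACC and DTS are those used in the text.

```python
def read_miss_sequence(main_memory, address):
    """Return (data, trace) for one main-memory read as seen on the bus."""
    trace = []
    trace.append("cache: bus granted by priority circuit")
    trace.append("cache: assert MRQ, R/W = read, drive address")
    address_latch = address                    # stands in for address latches 62
    trace.append("controller: latch address, assert ACC")
    trace.append("cache: remove address from bus")
    data = main_memory[address_latch]          # readout using the full address
    trace.append("controller: drive data, assert DTS")
    trace.append("cache: latch data, signal Memory Ready, fill cache word")
    return data, trace


data, trace = read_miss_sequence({0x12345: 77}, 0x12345)
for step in trace:
    print(step)
```

The address is acknowledged and removed before the data is driven, which is what allows the same bus lines to carry both.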
If the processor initiates a data Write operation, providing a control signal on the line 36, the cache control logic 38 causes the data word on the memory data lines 42 from the processor to be stored in the data latches 44 and requests access to the bus 24 to the main memory 22. It also initiates a Read operation from the high-speed random access memory 30 in response to the ten lower order bits of the address stored in the latches 40 causing the tag bits to be applied to the Compare circuit 48. If the Compare circuit signals a "hit", the cache control logic causes the data in the data array portion of the high-speed memory 30 to be updated by a memory Write operation using the data word stored in the data latches 44. If the Compare circuit signals a "miss" the cache control logic 38 takes no action beyond signalling a memory Write operation to the main memory controller 25.
As in the memory Read operation, when the cache memory is granted access to the bus 24 by the priority circuit, the cache control logic 38 issues a memory Request and a memory Write signal on the control lines MRQ and R/W to the memory control logic 60. At the same time, the address in the address latches 40 is applied to the bus 24. The address is then stored in the address latches 62 by the controller and an Address Accepted signal is applied to the ACC line from the memory control logic 60 to the cache control logic 38. The cache control logic 38 then couples the data latches 44 to the bus 24 and signals the memory control logic that the data is available by means of a DTM control line.
A memory Write operation is then initiated on main memory 22 using the address in the address latches 62 and the data from the bus 24 stored in a set of data latches 66. When the memory Write operation is complete, the memory control logic signals that the memory has accepted the data by means of the control line MAD.
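The write handshake, in which one set of bus lines carries first the address and then the data, can be sketched as follows. The class MainMemoryController and the function write_over_bus are hypothetical names for illustration; only the signal names MRQ, R/W, ACC, DTM and MAD and the two groups of latches come from the text.

```python
class MainMemoryController:
    """Toy stand-in for the memory controller and main memory."""

    def __init__(self):
        self.cells = {}                 # main memory contents: address -> data
        self.address_latch = None       # stands in for address latches 62
        self.data_latch = None          # stands in for data latches 66

    def accept_address(self, bus_lines):
        self.address_latch = bus_lines
        return "ACC"                    # Address Accepted

    def accept_data(self, bus_lines):
        self.data_latch = bus_lines
        self.cells[self.address_latch] = self.data_latch
        return "MAD"                    # memory has accepted the data


def write_over_bus(controller, address, data):
    """Run one write handshake and return the control signals in order."""
    signals = ["MRQ", "R/W=write"]
    bus_lines = address                 # shared lines carry the address first
    signals.append(controller.accept_address(bus_lines))
    bus_lines = data                    # then the same lines carry the data
    signals.append("DTM")
    signals.append(controller.accept_data(bus_lines))
    return signals


controller = MainMemoryController()
signals = write_over_bus(controller, 0x00405, 99)
```

The single variable bus_lines is reused for address and data to mirror the time sharing of the bus between the two transfers.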
From the above description it will be seen that the transfer of data between a plurality of cache memory units and the main memory over the common bus 24 utilizes a conventional "hand shake" asynchronous control in which address and data transfers share the same bus lines. The operation of the processor is the same as though the processor were connected directly to the main memory 22, yet because of the associated cache memory units, common bus, and memory controller, actual access to main memory by the processors is greatly reduced. The cache memory units operate as the "master" units and the controller operates as the "slave" unit. The interface between each processor and associated cache memory unit is identical to the interface between the memory controller and the main memory, with separate sets of data lines and address lines.

Claims (12)

1. Multiprocessing apparatus comprising a plurality of central processing units, each unit including means for initiating a memory access request at a designated address; a plurality of high speed cache memory units; means interconnecting each processing unit with one cache memory unit; a random access main memory; a common bus connecting each of the cache memory units to the main memory, means in each cache memory unit responsive to a memory request and address from the associated processing unit for initiating a memory cycle in the cache memory unit at the initiated address; and means in each cache memory unit for initiating a memory cycle in the main memory over the common bus in response to a memory access request from the processor.
2. Apparatus as claimed in claim 1, wherein each processor further includes means initiating a memory read signal for acquiring data from memory, the cache memory units each including means responsive to a memory read signal from the associated processor for sensing during the cache memory cycle if the addressed data is present in the cache memory, and means activating said main memory cycle initiating means over the common bus if said last-named means senses that the addressed data is not present in the cache memory.
3. Apparatus as claimed in claim 2, further including means in each cache memory unit for receiving data over said common bus in response to a main memory cycle and transferring the data to the associated processor unit.
4. Apparatus as claimed in claim 3, further including means in each cache memory for storing said data received over the common bus from the main memory in the cache memory unit.
5. Apparatus as claimed in claim 1, wherein each processor unit further includes means for initiating a memory Write signal and data signals for storing in memory, each cache memory unit including means responsive to a memory write signal for indicating a memory write cycle in main memory over said common bus and means for connecting said data signals to the common bus for writing the data in main memory.
6. Apparatus as claimed in claim 5, wherein each cache memory unit further includes means responsive to a memory write signal from the associated processor for sensing during the cache memory cycle if the data is already present in the cache memory at the specified address, and means responsive to said last-named means if the data is already present for replacing said data with the data from the processor unit.
7. Multiprocessing apparatus comprising a plurality of separate processor units, each processor unit having associated therewith a cache memory unit; an addressable main memory; a common bus connecting the cache memory units to the main memory for transferring addresses to the main memory and transferring data between the cache memory unit and the main memory; each processor unit including means for directing a group of address signals and memory read or write signals to the associated cache memory unit; the cache memory unit including a high speed addressable memory for storing data and tag information, means in each cache memory unit responsive to a first portion of the address signals and the memory read signal from the associated processor unit for reading out a selected word containing data and tag information from the location in the high speed addressable memory identified by said first portion of the address signals, means in each cache memory unit comparing the tag information of the word read out of the high speed memory with a second portion of the address signals from the associated processor unit, means in each cache memory unit responsive to the comparing means when the compared tag information and the second portion of the address signals are identical for transferring the data information of the word read out of the high speed memory to the associated processor unit, means in each cache memory unit responsive to a memory signal from the processor unit and the comparing means when the compared tag information is not identical to said second portion of the address signals from the processor unit for applying both of said portions of said group of address signals from the associated processor unit to the common bus and initiating a memory read cycle in the main memory, and means in each cache memory unit for transferring data signals received on the common bus from the main memory in response to the main memory cycle to the associated processor unit.
8. Apparatus as claimed in claim 7, wherein each cache memory unit further includes means for storing the data signals received on the common bus from the main memory in the high speed memory as the data information of the word stored at the address corresponding to said first portion of the address signals from the associated processor unit, and means storing said second portion of the address signals from the processor unit as the tag information of the word stored at the same location in the high speed memory.
9. Apparatus as claimed in claim 7, further including priority control means associated with the cache memory units for controlling access to the common bus among cache memory units seeking simultaneous access to the main memory.
10. Apparatus as claimed in claim 7, wherein each cache memory unit includes means for time sharing the common bus during each main memory access between address and data transfers over the same bus.
11. Apparatus as claimed in claim 7, wherein each cache memory unit includes means responsive to a memory write signal from the associated processor unit for transferring the data signals and address signals from the processor unit over the common bus to the main memory and initiating a memory write cycle in main memory, means in the cache memory unit responsive to a memory write signal from the associated processor unit, means in each cache memory unit responsive to a first portion of the address signals and the memory write signal from the associated processor unit for reading out a selected word containing data and tag information from the location in the high speed addressable memory identified by said first portion of the address signals, means in each cache memory unit comparing the tag information of the word read out of the high speed memory with a second portion of the address signals from the associated processor unit, means in each cache memory unit responsive to the comparing means when the compared tag information and the second portion of the address signals are identical for storing signals from the processor unit as the data information of the addressed word in the high speed memory and storing the second portion of the address signals from the processor unit as the tag information of the addressed word.
12. Multiprocessing apparatus substantially as hereinbefore described with reference to, and as shown in, the accompanying drawings.
GB7846847A 1978-12-01 1978-12-01 Multiprocessor system Expired GB2035632B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB7846847A GB2035632B (en) 1978-12-01 1978-12-01 Multiprocessor system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB7846847A GB2035632B (en) 1978-12-01 1978-12-01 Multiprocessor system

Publications (2)

Publication Number Publication Date
GB2035632A (en) 1980-06-18
GB2035632B (en) 1983-01-19

Family

ID=10501470

Family Applications (1)

Application Number Title Priority Date Filing Date
GB7846847A Expired GB2035632B (en) 1978-12-01 1978-12-01 Multiprocessor system

Country Status (1)

Country Link
GB (1) GB2035632B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0255186A2 (en) * 1986-07-30 1988-02-03 Egendorf, Andrew System and method for parallel processing with mostly functional languages
EP0255186A3 (en) * 1986-07-30 1990-04-18 Egendorf, Andrew System and method for parallel processing with mostly functional languages
US5537640A (en) * 1988-12-30 1996-07-16 Intel Corporation Asynchronous modular bus architecture with cache consistency
GB2257275A (en) * 1991-07-02 1993-01-06 Intel Corp Asynchronous modular bus architecture with cache consistency.
GB2257275B (en) * 1991-07-02 1995-07-19 Intel Corp Asynchronous modular bus architecture with cache consistency
WO2003003228A1 (en) * 2001-06-29 2003-01-09 Intel Corporation Memory control for multiple read requests
US7076627B2 (en) 2001-06-29 2006-07-11 Intel Corporation Memory control for multiple read requests

Also Published As

Publication number Publication date
GB2035632B (en) 1983-01-19

Similar Documents

Publication Publication Date Title
US4141067A (en) Multiprocessor system with cache memory
EP0179811B1 (en) Interleaved set-associative memory
US4471429A (en) Apparatus for cache clearing
US4264953A (en) Virtual cache
US4400774A (en) Cache addressing arrangement in a computer system
KR100491435B1 (en) System and method for maintaining memory coherency in a computer system having multiple system buses
US5390308A (en) Method and apparatus for address mapping of dynamic random access memory
US6151663A (en) Cluster controller for memory and data cache in a multiple cluster processing system
US5251310A (en) Method and apparatus for exchanging blocks of information between a cache memory and a main memory
JPS60147857A (en) Reconstructive memory system
US4513369A (en) Information processing system
US5918069A (en) System for simultaneously writing back cached data via first bus and transferring cached data to second bus when read request is cached and dirty
JP2000242559A (en) Information processor and semiconductor device
EP0509994B1 (en) Centralized reference and change table for a multiprocessor virtual memory system
US5293622A (en) Computer system with input/output cache
GB2065941A (en) Cache store system
CA2026741C (en) Main storage memory cards having single bit set and reset functions
EP0535701A1 (en) Architecture and method for combining static cache memory and dynamic main memory on the same chip (CDRAM)
GB2035632A (en) Multiprocessor system
US6021466A (en) Transferring data between caches in a multiple processor environment
CA1113574A (en) Multiprocessor system with cache memory
US4602329A (en) Data processing system having an address translation unit shared by a CPU and a channel unit
US5546560A (en) Device and method for reducing bus activity in a computer system having multiple bus-masters
US5829043A (en) Coupler circuit and its use in a card and process
JP2503059B2 (en) Connection device between different types of buses

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee