AU765490B2 - FIFO overflow management - Google Patents

FIFO overflow management Download PDF

Info

Publication number
AU765490B2
AU765490B2 AU16405/01A AU1640501A AU765490B2 AU 765490 B2 AU765490 B2 AU 765490B2 AU 16405/01 A AU16405/01 A AU 16405/01A AU 1640501 A AU1640501 A AU 1640501A AU 765490 B2 AU765490 B2 AU 765490B2
Authority
AU
Australia
Prior art keywords
module
fifo
commands
downstream
external memory
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU16405/01A
Other versions
AU1640501A (en
Inventor
Kok Tjoan Lie
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AUPQ5557A external-priority patent/AUPQ555700A0/en
Application filed by Canon Inc filed Critical Canon Inc
Priority to AU16405/01A priority Critical patent/AU765490B2/en
Publication of AU1640501A publication Critical patent/AU1640501A/en
Application granted granted Critical
Publication of AU765490B2 publication Critical patent/AU765490B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Landscapes

  • Information Transfer Systems (AREA)

Description

S&FRef: 535232
AUSTRALIA
PATENTS ACT 1990 COMPLETE SPECIFICATION FOR A STANDARD PATENT
ORIGINAL
Name and Address of Applicant: Canon Kabushiki Kaisha 30-2, Shimomaruko 3-chome, Ohta-ku Tokyo 146 Japan Actual Inventor(s): Address for Service: Kok Tjoan Lie Spruson Ferguson St Martins Tower,Level 31 Market Street Sydney NSW 2000 FIFO Overflow Management Invention Title: ASSOCIATED PROVISIONAL APPLICATION DETAILS [33] Country [31] Applic. No(s) AU PQ5557 [32] Application Date 11 Feb 2000 The following statement is a full description of this invention, including the best method of performing it known to me/us:iP Austra a Documents received on: .2 3 JAU 2j Batch No: 5815c FIFO OVERFLOW MANAGEMENT Field of Invention The present invention relates to FIFO apparatus and, in particular, to the use of such apparatus in a pipeline processor arrangement.
Background The performance of individual submodules in a pipeline processor arrangement depends on the incoming command and/or data rate, and the complexity of operation on those commands and/or data that the submodule is required to perform. The time taken by a submodule to execute a command varies according to the complexity of the command and also to the stall direction and frequency of the downstream submodule. In those cases where the rate of execution of commands varies for two neighbouring pipeline submodules, a first-in-first-out register apparatus (FIFO) of a predetermined length is usually inserted between the submodules to absorb some latencies associated with the first (upstream) submodule while the second (downstream) submodule is stalled or busy.
The size of the FIFO is usually a compromise between performance and cost, unfortunately there may never be an optimum size as the stall pattern may greatly vary for *the two submodules involved.
Summary of the Invention It is an object of the present invention to substantially overcome, or at least ameliorate one or more deficiencies with existing arrangements.
In accordance with one aspect of the present invention there is disclosed a °method of improving the performance of a pipeline system in which a FIFO is incorporated in said pipeline between an upstream processing module and a downstream processing module, each of said modules having access to a common external memory, 25 said method being characterised by: detecting when said FIFO is substantially full and transferring commands from said upstream module to said external memory; and interpreting commands from each of said FIFO and said external memory to said downstream module to determine a source of following ones of said commands.
In accordance with another aspect of the present invention there is disclosed a pipelined processor system comprising: an upstream processor module; a downstream processor module; a FIFO arrangement coupling an output of said upstream module to an input of said downstream module to thus form a processor pipeline; 535232.doc a memory module accessible by each of said processor modules; and an overload arrangement by which a filling of said FIFO arrangement is detected and said output of said upstream module is directed for intermediate storage in said memory module and by which said downstream module can interpret commands received from each of said FIFO arrangement and said memory module to determine a source of subsequent commands.
Other aspects of the invention are also disclosed.
Brief Description of the Drawings The prior art and an embodiment of the invention will now be described with reference to the drawings in which: Fig. 1 is a schematic block diagram showing conventional approach of pipelined modules with an interposed FIFO; Fig. 2 is a schematic block diagram representation showing a pipelined system with interposed FIFO according to preferred embodiment of the present invention; and Fig. 3 shows example of the contents of the external memory and FIFO of Fig. 2 including the two special commands.
Detailed Description of Embodiments of the Invention Fig. 1 shows a conventional pipelined system 10 having an upstream submodule 12, a downstream submodule 16, and a FIFO 14 placed between two g 20 submodules 12 and 16, and interconnected by connections 17, 19. Each of the submodules 12 and 16 have variable latencies to execute or generate commands. The FIFO 14 has a number of internal registers or memories 18, and as a consequence of a traditional operation thereof, the upstream submodule 12 is free to generate commands at least until the registers 18 are full, whereupon the submodule 12 will be stalled. With variable latency, it is possible that the downstream submodule 16 is then able to process the commands with minimal latency and consequently drain the contents of the FIFO 14 at a faster rate than the upstream module 12 can generate or supply those contents. In such circumstances, the overall performance of the arrangement is only slightly improved compared to a configuration in which the submodules 12 and 16 are directly coupled, as 30 the downstream submodule 16 will have to wait for the upstream submodule 12 to S.generate more commands. Further, the performance of the system 10 is very much o dependent on the size of the FIFO 14, being the number of registers 18, which is usually chosen as a compromise between performance and cost.
Fig. 2 shows a system 20 according to the preferred embodiment, having upstream and downstream modules 22 and 26 respectively, and a local FIFO 24 535232.doc positioned therebetween. Both submodules 22 and 26 also have access to a common external local memory 32, this being a feature of many submodule configurations, particularly where the submodules 22 and 26 are each formed within the same integrated circuit package. The external local memory 32 typically provides for random access s localised storage for operations performed individually or collectively by the processor submodules 22 and 26. In this description, reference is made to "commands" being passed between the submodules 22 and 26 and such references are to be interpreted without limitation as including instructions, data, signals or any information that may be passed between the submodules 22and 26 as required or determined by their respective functions.
While the FIFO 24 is not full, the upstream submodule 22 passes commands into the FIFO 24 and the downstream submodule 26 fetches commands from the FIFO 24 as per conventional approach discussed above. When the FIFO 24 becomes full or substantially full, the upstream submodule 22 is not stalled as in the conventional approach of Fig. 1, but rather is able to continue on generating commands for the downstream submodule 26. However, instead of passing the generated commands into the FIFO 24, the upstream module 22 transfers the commands to the local memory 32.
For optimal performance, transfers to the external memory 32 are performed in "burst mode fashion", known in the art of memory utilization, preferably in groups of 8 or 16 commands for a single memory transaction. This operation is repeated for that period during which FIFO 24 remains or is substantially full. To facilitate burst mode transfer, the upstream submodule 22 outputs commands via a connection 50 to a holding buffer 30 which has a capacity of the predetermined burst size. Such an approach reduces any latency associated with access to the local memory 32, which otherwise can be quite 25 severe if only a small number of commands were to handled for each memory access.
The state of the FIFO 24 is communicated to the upstream module 22 by two signals 38 and 46. When the FIFO 24 becomes substantially full, for example with only one or two, or some other predetermined number of locations remain empty, the signal 46 is asserted which causes the upstream submodule 22 to place a first special command into the FIFO 24 via the connection 48. At such time, the submodule 22 immediately commences sending commands to the external memory via the holding buffer 30. The first special command 60 is an instruction "fetch_from_RAM", and is seen in Fig. 3 loaded into the FIFO 24. When received by the downstream submodule 26 via the FIFO 24, the first special command 60 directs the downstream submodule 26 to fetch following commands from the external memory 32, instead of the FIFO 24. The address 535232.doc I- 11 1- 1- .1 -1 -1 I 1-1, 1 el'- to fetch the commands in the external memory 32 is specified as one of the parameters in the first special command The upstream submodule 22 continues to store commands into the external memory 32 via the holding buffer 30, until such time as the FIFO 24 is made available.
The FIFO 24 also generates the signal 38 identifying to the upstream submodule 22 that there are at least a certain number of free or available locations 28 in the FIFO 24. For example, this may occur when the FIFO 24 is, say, about three-quarters full.
Upon detecting such an "available" condition of the FIFO 24, the upstream submodule 22 writes to the external memory 32 a second special command 62 lo "fetch_from_FIFO", also seen in Fig. 3, immediately after a "last" command is stored into the external memory 32. The second special command 62 acts as an instruction for the downstream submodule 26 to fetch following commands from the FIFO 24 and continue fetching from the FIFO 24 until another "fetchfromRAM" command 60 is encountered.
l. 15 In this fashion, where the FIFO 24 has, say, 24 locations, and the holding buffer 30 has 8 locations, the holding buffer 30 may be loaded with 6 commands followed S.by one of the second special commands 62 (fetch_from_FIFO) thereby enabling a single burst -mode memory transaction to occur with the memory 32 sufficient to free space •within the FIFO 24 for storing further commands.
With such arrangements, memory space in the FIFO 24 or external memory 32 is not wasted with special commands 60 and 62 interleaving with actual commands.
ooeoo Operation of the downstream submodule 26 is similar to and complements that of the upstream submodule 22. The downstream submodule 26 has two possible sources of receiving commands, one from the FIFO 24, and the other from the external memory 32. Again, for optimal performance, a further holding buffer 34 is provided to store commands being fetched via a connection 44 in burst mode fashion from the external memory 32 by the downstream submodule 26.
With reference to Fig. 3, the submodule 26 fetches (ordinary) commands 64 from the FIFO 24 until a "fetch from RAM" command 60 is encountered that indicates that following commands 66 are located in the external memory 32 starting from a given address. The downstream submodule 26 must fetch the commands 66 from the memory 32 and place those command in the holding buffer 34, and at the same time, switch the command source from the FIFO 24 to the holding buffer 34.
Such operation is achieved by a multiplexer 36 positioned between the FIFO 24 and holding buffer 34, and the downstream submodule 26. The multiplexer 36 is 535232.doc controlled by a signal 58 generated by the downstream submodule 26 on receipt of the special command 60. The holding buffer 34 can be maintained full through pre-fetching, which can further reduce any latency associated with access to the external memory 32.
The downstream submodule 26 is then able to continue fetching from the external memory 32 via the holding buffer 34 until the special command 62 "fetch_fromFIFO" is found, on receipt of which the submodule 26 switches the multiplexer 36 via the signal 58 so that commands are then sourced from the FIFO 24.
With the system 20, the size of the FIFO 24, compared with the prior art FIFO 14, can be reduced to compensate against provision of the holding buffers lo and 34 that are needed at the output 50 of upstream submodule 22 and input 54 to the downstream submodule 26. Such a reduction in size of the FIFO 24 is considered by the present inventor to have little effect on the overall performance in typical applications as the net effect of the system 20 is a FIFO having a dynamic capacity but which operates without any substantial latency, excepting that imposed by the transfer and handling of the special commands described above.
The FIFO system 20 finds application in pipelined processing arrangements S:•i which are provided with local memory that is available for use by members of the pipeline. Typically, such memory has a capacity many times larger than memory which would be configured or used by a traditional FIFO. Examples of such arrangements •o e include graphic object rendering hardware in which certain rendering processes are pipelined and operate according to instructions passed along the pipeline or according to data stored in memory, such data being for example generated, modified or used by the pipelined processes. The preferred embodiment comprises an implementation within a synchronous graphic pipelined processor having two or more submodules, each 00• 25 submodule having a different task to perform.
The forgoing describes only one embodiment of the present invention and modifications may be made thereto without departing from the scope of the present invention.
In the context of this specification, the word "comprising" means "including principally but not necessarily solely" or "having" or "including" and not "consisting only of'. Variations of the word comprising, such as "comprise" and "comprises" have corresponding meanings.
535232.doc II -r-IX---li-rl L I.l. *X C~Y~a-I~III~ r~ Illl.l~ri )ilill .l rllill--~l-C i~ij ~lli Li I i YIII~-T.C;J-. I lll.n ~YI-X- iY lr~ l~ilY~II~-C jl;l I~ill~i-l lli l( 11IIII riin=oi

Claims (6)

1. A method of improving the performance of a pipeline system in which a FIFO is incorporated in said pipeline between an upstream processing module and a downstream processing module, each of said modules having access to a common external memory, said method being characterised by: detecting when said FIFO is substantially full and transferring commands from said upstream module to said external memory; and interpreting commands from each of said FIFO and said external memory to said downstream module to determine a source of following ones of said commands.
2. A method according to claim 1, wherein said interpreting is performed by said downstream module.
3. A method according to claim 1 wherein, on detecting said FIFO as being substantially full, said upstream module outputs a first special command to said FIFO indicating that following commands are to be sourced from said external memory, wherein upon receipt of said first special command from said FIFO, said downstream o module sources said following commands from said external memory.
4. A method according to claim 1, further comprising detecting when said FIFO has *cJ a predetermined number of vacant locations and, when so, instructing said upstream module to cease transferring commands to said external memory, a terminal one of said transferred commands being a second special command which, when received by said 25 downstream module from said external memory, causes said downstream module to source said following commands from said FIFO. A method according to claim 1, wherein transfer of commands to and from said external memory occurs in burst mode comprising a predetermined data transfer size, said method comprising the further steps of buffering commands output from said upstream module to said external memory, and from said external memory to said downstream module, to facilitate burst mode transfers.
6. A pipelined processor system comprising: an upstream processor module;
535232.doc a downstream processor module; a FIFO arrangement coupling an output of said upstream module to an input of said downstream module to thus form a processor pipeline; a memory module accessible by each of said processor modules; and an overload arrangement by which a filling of said FIFO arrangement is detected and said output of said upstream module is directed for intermediate storage in said memory module and by which said downstream module can interpret commands received from each of said FIFO arrangement and said memory module to determine a source of subsequent commands. I0 7. A system according to claim 6, wherein said overload arrangement comprises a switching device for selectively coupling an output of one of said FIFO arrangement and said memory module to an input of said downstream module. *fee 46 15 8. A system according to claim 7, wherein said switching device is controlled by said downstream module as a consequence of interpreting a special command received from either one of said FIFO arrangement or said external memory. A system according to claim 8, wherein said special command is generated by an upstream module in response to a state of said FIFO arrangement, said special command being output either to said FIFO arrangement when said FIFO arrangement is 9 9.. substantially full, or to said memory module when said FIFO arrangement has a predetermined number of available locations. 99 99 25 10. A system according to claim 6, further comprising a holding buffer interconnecting each of said processor modules to said memory module, each of said holding buffers facilitating burst mode memory transfers with said memory module. 11. A system according to claim 6, wherein said system is formed within a single integrated circuit. 12. A pipeline processor system substantially as described herein with reference to Figs. 2 and 3 of the drawings. 535232.doc 13. A pipeline processing method substantially as described herein with reference to Figs. 2 and 3 of the drawings. Dated this SEVENTEENTH day of JANUARY 2001 CANON KABUSHIKI KAISHA Patent Attorneys for the Applicant SPRUSON&FERGUSON posse 09#.0 0 0 aS.. 53532do
AU16405/01A 2000-02-11 2001-01-23 FIFO overflow management Ceased AU765490B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU16405/01A AU765490B2 (en) 2000-02-11 2001-01-23 FIFO overflow management

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
AUPQ5557A AUPQ555700A0 (en) 2000-02-11 2000-02-11 Fifo overflow management
AUPQ5557 2000-02-11
AU16405/01A AU765490B2 (en) 2000-02-11 2001-01-23 FIFO overflow management

Publications (2)

Publication Number Publication Date
AU1640501A AU1640501A (en) 2001-08-16
AU765490B2 true AU765490B2 (en) 2003-09-18

Family

ID=25616370

Family Applications (1)

Application Number Title Priority Date Filing Date
AU16405/01A Ceased AU765490B2 (en) 2000-02-11 2001-01-23 FIFO overflow management

Country Status (1)

Country Link
AU (1) AU765490B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115586974B (en) * 2022-12-12 2023-10-20 北京象帝先计算技术有限公司 Memory controller, system, device and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696990A (en) * 1995-05-15 1997-12-09 Nvidia Corporation Method and apparatus for providing improved flow control for input/output operations in a computer system having a FIFO circuit and an overflow storage area
US5841722A (en) * 1996-02-14 1998-11-24 Galileo Technologies Ltd. First-in, first-out (FIFO) buffer
US5892979A (en) * 1994-07-20 1999-04-06 Fujitsu Limited Queue control apparatus including memory to save data received when capacity of queue is less than a predetermined threshold

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5892979A (en) * 1994-07-20 1999-04-06 Fujitsu Limited Queue control apparatus including memory to save data received when capacity of queue is less than a predetermined threshold
US5696990A (en) * 1995-05-15 1997-12-09 Nvidia Corporation Method and apparatus for providing improved flow control for input/output operations in a computer system having a FIFO circuit and an overflow storage area
US5841722A (en) * 1996-02-14 1998-11-24 Galileo Technologies Ltd. First-in, first-out (FIFO) buffer

Also Published As

Publication number Publication date
AU1640501A (en) 2001-08-16

Similar Documents

Publication Publication Date Title
US6212597B1 (en) Apparatus for and method of architecturally enhancing the performance of a multi-port internally cached (AMPIC) DRAM array and like
US6049882A (en) Apparatus and method for reducing power consumption in a self-timed system
US7797467B2 (en) Systems for implementing SDRAM controllers, and buses adapted to include advanced high performance bus features
JP3406744B2 (en) Data processor with controlled burst memory access and method thereof
KR19980069869A (en) Load and storage unit for vector processor
US6725299B2 (en) FIFO overflow management
US7899940B2 (en) Servicing commands
GB2415067A (en) Managing conflicting read and write operations on separate read and write buses
US6507899B1 (en) Interface for a memory unit
US20060047754A1 (en) Mailbox interface between processors
US11721373B2 (en) Shared multi-port memory from single port
WO2004099995A2 (en) Hierarchical memory access via pipelining
US20070162644A1 (en) Data packing in A 32-bit DMA architecture
JP2005536798A (en) Processor prefetching that matches the memory bus protocol characteristics
AU765490B2 (en) FIFO overflow management
GB2377138A (en) Ring Bus Structure For System On Chip Integrated Circuits
US20120159093A1 (en) Method and apparatus for data transfer
US6775756B1 (en) Method and apparatus for out of order memory processing within an in order processor
US5185879A (en) Cache system and control method therefor
US6715021B1 (en) Out-of-band look-ahead arbitration method and/or architecture
US7240170B2 (en) High/low priority memory
US20070198754A1 (en) Data transfer buffer control for performance
US6055607A (en) Interface queue with bypassing capability for main storage unit
TWI402674B (en) Apparatus and method for providing information to a cache module using fetch bursts
US7594103B1 (en) Microprocessor and method of processing instructions for responding to interrupt condition

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)