GB2466569A - Rendering A 3 Dimensional Computer Graphics Image Using A Multi-Level Structure - Google Patents

Rendering A 3 Dimensional Computer Graphics Image Using A Multi-Level Structure Download PDF

Info

Publication number
GB2466569A
GB2466569A GB0922374A GB0922374A GB2466569A GB 2466569 A GB2466569 A GB 2466569A GB 0922374 A GB0922374 A GB 0922374A GB 0922374 A GB0922374 A GB 0922374A GB 2466569 A GB2466569 A GB 2466569A
Authority
GB
United Kingdom
Prior art keywords
primitive
tile
block
tiles
mask
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB0922374A
Other versions
GB2466569B (en
GB0922374D0 (en
Inventor
Xile Yang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Imagination Technologies Ltd
Original Assignee
Imagination Technologies Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Imagination Technologies Ltd filed Critical Imagination Technologies Ltd
Publication of GB0922374D0 publication Critical patent/GB0922374D0/en
Publication of GB2466569A publication Critical patent/GB2466569A/en
Application granted granted Critical
Publication of GB2466569B publication Critical patent/GB2466569B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/005General purpose rendering architectures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/40Filling a planar surface by adding surface attributes, e.g. colour or texture
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering

Abstract

A method and apparatus are provided for rendering a 3 dimensional computer graphics image. The image is divided into plurality of rectangular tiles which are arranged in a multi level structure comprising a plurality of levels of progressively larger groupings of tiles. Image data is divided into a plurality of primitive blocks and these are assigned to groupings of tiles within the multi level structure in dependence on the groupings each one intersects. Control stream data is derived for rendering the image and this comprises references to primitive blocks for each grouping of tiles within each level of the multi level structure, the references corresponding to the primitive blocks assigned to each grouping and control stream data is used to render the primitive data into tiles within the groupings of tiles for display. This is done such that for primitive blocks which intersect a plurality of tiles within a grouping, control stream data is written for the grouping of tiles rather than for each tile within the grouping.

Description

Multi Level Display Control List in Tile Based 3D Computer Graphics System
FIELD OF THE INVENTION
This invention relates to 3-dimensional computer graphics systems, and in particular to systems which use tile based rendering.
BACKGROUND To THE INVENTION
As processing power has increased, the complexity of 3-dimensional computer generated images has also increased. Computer models for very complicated 3D objects, like human movements using vertices and triangle meshes have become easier to generate. This kind of 3D model can be sent to a 3D computer graphics system and animated 3D images generated on a computer screen. Computer generated 3D animated images are widely used in 3D computer games, navigation tools and computer aided engineering design tools.
3D computer graphics systems have to cope with demands for more complex graphics and faster speed display. As the details in the display model increase, more and more primitives and vertices are used. Also, as the texture and shading techniques evolve, more and more information is attached to primitive and vertex data. In modern games there may be over a million primitives in a render. Therefore, memory bandwidth is a very big factor affecting the performance of computer graphics systems.
Tile based rendering systems are well-known. These subdivide an image into a plurality of rectangular blocks or tiles. As in British Patent No. GB2343603 and International Patent Application No. WO 2004/086309 these systems divide a render surface into sub surfaces in n x m pixel tiles, a primitive like triangle, line or point is only to be processed in the tiles which overlap with the primitive.
The main steps performed for tiling in a tile based 3D computer graphics system are shown in Figure 1. These are as follows: 1. An input data stream of primitives and vertices is received at 101 in Figure 1.
Primitives in similar locations may arrive sequentially in time, such as triangles strips and fans.
S
2. A Macro Tiling Engine (MTE) transforms the vertices into screen space, removes primitives which may be back facing or clipped by a clipping plane using well-known methods. The primitives are then grouped into primitive blocks with a fixed maximum number of vertices and primitives, and are written into memory at 102 in Figure 1.
The number of vertices and primitives together with the memory addresses of the primitive blocks (primitive block pointers) are sent to a Tiling Engine at 103 to be added to a control stream for a display list for the tiles which are covered by the primitives.
To minimize the impact of memory bandwidth to fetch primitive and vertex data, primitives are grouped into primitive blocks in macro tiles according to a bounding box for the primitive block. In Fig 2 a macro tile 201 is a rectangular area of the screen 200 with a fixed number of tiles 202. A macro tile for example can be a quarter or 16th of the screen size. This structure is used to localize primitive blocks and reduce memory bandwidth. When primitives in a primitive block cross macro tiles, they are written to a special macro tile list called a global macro tile list. In this way the parameter data for a primitive block are only written once. Primitives from a macro tile can only be accessed by the tiles inside the macro tile, while primitives from the global macro tile list can be accessed by all the tiles.
3. Each primitive from a primitive block which is written by Macro Tiling Engine 102, is checked against each tile inside the bounding box of the primitive block. The primitive block is added to the display list for any tile which is covered by any primitives in the primitive block. The control data written in the control stream associated with the display list of the tile includes a primitive block header for the number of vertices and primitives in the primitive block, a primitive block pointer for the memory address of the primitive block written to, and a primitive mask for the primitives which are visible in the tile.
Separate memory spaces are allocated to each tile for the control stream data in the display list. A memory address pointer called a tail pointer is used for the next free address in the control stream data of each tile.
To improve memory access for the control streams in tiles, a small cache Tail Pointer Cache can be added to the Tiling Engine 103. The memory location of the end of the control stream in a tile is stored and read from the Tail Pointer Cache, which reduces main memory accesses from the Tiling Engine.
4. 3D image processing in a tile based 3D computer graphics system is performed at 104 for each tile of the screen from a region array 300 of Figure 3. It traverses through the control stream of each tile 301 in Figure 3, reads the vertex and primitive data from memory addresses pointed to by the primitive block pointer in the control data 302. Image processing operations like hidden surface removing, texturing and shading are performed on all the primitives valid in the tile from the primitive block.
An example of tile based render is shown in Figure 6. A macro tile MTO 600 which is part of the screen has 16 tiles 601 inside. Two triangle strips 602, 603 and a big triangle 604 are processed by Macro Tile Engine MTE 605 and projected into part of the screen in MTO. The vertex and primitive data associated with the three primitive blocks are written to memory 607 with their own memory addresses. The memory address pointers of the three primitive blocks are then passed to Tiling Engine TE 606 for tiling processing. All the tiles are traversed by Tiling Engine to decide if any primitives are inside the tile and control stream data associated to the primitive block which is visible in the tile are written to memory for the tile display list. In the example display control stream in T5 will include control data for the address pointers of the three primitive blocks and triangle visible mask of the triangles within the three primitive blocks. For example the first three triangles from left in primitive block 602 and the first triangle from right of primitive block 603 are visible in T5 together with triangle 604. For tile 19 the control stream data will only consists of the information from primitive block 603 and 604. Meanwhile the control data such as address pointer of the vertex data associated to primitive block 602 are inserted into two control streams for tile T5 and T6, while control data associated with primitive block 604 are inserted into control streams for every tile in the macro tile MTO.
In 3D render processing 608 all the tiles are processed one by one in the order of region array as 300 in Figure 3. For each tile in 3D processing control data from the tile control stream are read from memory first, as 301 in Figure 3. Vertex and primitive data associated to the primitive blocks in the tile control data are read from memory as 302 in Figure 3 therefore all the triangles which are visible to the tile are processed and rendered to screen.
For tile based computer graphics system a render is performed on a tile by tile basis.
The big advantage of tile based rendering is that it significantly reduces the requirement for large graphics system internal storage and memory bandwidth.
The disadvantage of tile based rendering is that increased control data is needed for the display list in each tile. Display control data needs to be written for all the tiles which a triangle covers. For big triangles which cover many tiles the total amount of control data written is significant.
For example a render with screen size 1920x1080 can be divided into 8160 tiles of 16x16 pixels. A big triangle which covers the whole screen has to be added in the display lists of all the tiles. If there are two 32 bit words needed for the control data, then the total control stream data is nearly 64 KB for the single full screen triangle.
The amount of control data needed in this case has a significant impact on the memory space requirement and memory bandwidth, therefore it affects the performance of the render in a tile based 3D computer graphics system.
In UK Patent Application No. 0717781.9, a system is described which processes two tiles at the same time in order to improve the performance of a Tiling Engine. The control stream data from the two adjacent tiles are combined into a single display list, which reduces the total control stream data to around 32 KB in the above example.
The size of the control stream data is about half the size in this case, but it is still a significant amount of control data for a single triangle.
SUMMARY OF THE INVENTION
Preferred embodiments of the invention provide a method and apparatus for enabling multi level display control lists in a tile based 3D computer graphics system. This reduces the amount of tile based control stream data written, especially for the scene with large triangles, thereby reducing the memory bandwidth and enhancing the performance of tile based 3D graphics systems.
In a tile based rendering system triangles covering multiple tiles consume memory and bandwidth for every tile into which a reference must be placed in the control stream data. The proposed invention alleviates this problem by introducing a multi-level list structure in which subsequent levels represent progressively larger groups of tiles such that a triangle overlapping many tiles will only need to write a small number of references to the control stream. In the described embodiment the multi-level list structure is implemented at three levels, tile level (1 tile), tile block level (N x M tiles) and macro tile level (X x Y tiles). The proposed invention can be easily expanded such that multiple sub level representation can be used to make the control stream for a display list even more efficient.
In the preferred embodiment there is provided method for rendering a 3 dimensional computer graphics image comprising the steps of: subdividing the image into a plurality of rectangular tiles; arranging the tiles in a multi level structure comprising a plurality of levels of progressively larger groupings of tiles; dividing image data to be rendered into a plurality of primitive blocks containing image data required to render the image; assigning each primitive block to a grouping of tiles within the multi level structure in dependence on the grouping of tiles it intersects, such that each primitive block is assigned to a grouping in one of the levels in the multi level structure; deriving control stream data for rendering the image, the control stream data comprising references to primitive blocks for each grouping of tiles within each level of the multi level structure corresponding to the primitive blocks assigned to each grouping; and using the control stream data to render primitive data into tiles within the groupings of tiles for display.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows an overview of the main steps performed when implementing tiling in a tile based 3D computer graphics system, as described above; Figure 2 shows macro tiles and tiles on screen as used in a tile based 3D computer graphics system, as described above; Figure 3 shows an overview of the data structure used for tiling, as discussed above;
S
Figure 4 shows an overview of the data structure used for multi level display control list in a macro tile in an embodiment of the invention; and,.
Figure 5 shows an overview of the data structure used for control stream data in a multi level display control list in an embodiment of the invention.
Figure 6 shows an example of tile based render.
DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT
In a tile based 3D computer graphics system as described above, the Tiling Engine takes primitive blocks from the Macro Tiling Engine and computes the minimum number of tiles needed to render the primitives. The minimal list of tiles is then processed and a primitive block pointer for the address of the primitive block data along with a primitive header word describing the primitives present in the tile are written to dynamically allocated memory as control stream data at 301 in Figure 3.
Each tile has its own fixed size memory space used for the control stream data. The memory blocks in this space are allocated on demand by the tile based 3D computer graphics system. When new data to be added to the control stream for a particular tile exceeds the current memory blocks allocated, a new block allocation is requested and the old control stream is linked to the new block allocation using a stream link. If the Macro Tiling Engine indicates the end of a scene via a terminate signal, all the tiles are processed, and control streams for tiles in the valid region are ended with a terminate word.
In a tile based 3D computer graphics system the vertex and index data for a primitive block may only be written once in a macro tile to which the primitive block belong to, or in the global macro tile list when the bounding box of the primitive block crosses macro tiles. When the primitive blocks contain large primitives the control stream data may need to be written multiple times to all the tiles which the primitives overlap. In the extreme case of a full screen triangle, the control stream data has to be written for every tile on screen. The multiple repetitions of control stream data written may cause a significant increase in memory bandwidth.
S
In embodiments of the invention, a multi level display control list is used instead of a single tile based display control list. Control stream data in the multi level display control list can be based on groupings of tiles such as macro tiles and tile blocks rather than individual tiles, although the number of sub-levels selected is a design choice for the skilled person. The control list comprises control stream data for each grouping of tiles, the control stream data comprises reference to primitive blocks to be rendered into each respective grouping of tiles, for use in rendering the image for display. An example structure for the control stream data is described below. Greater and lower degrees of complexity may be implemented.
Multi Level Display Control List In an embodiment of the invention, macro tiles divide an image into 16 Tile Blocks (TB), the number of tiles in a tile block is fixed for a render according to the macro tile size. Assume the macro tile size is 16 tiles by 16 tiles (256 tiles), the tile blocks 401 in a macro tile 400 will each be the same size of 4 tiles by4 tiles (16 tiles) as shown in Fig 4. Control stream lists are formed for each Tile Block in a macro tile. For 4 macro tiles there are 16x4 = 64 control stream lists while for 16 macro tiles there are 16x16 = 256 control stream lists.
A render is still performed on a tile by tile basis, but the tiles in a tile block will share the same control stream list.
The main benefit of using a tile block based multi level display control list is to reduce the amount of control data needed and reduce the memory bandwidth, especially for large primitives which cover many tiles. Other benefits include: There are no tile based control stream lists, therefore there is no need for the storage of the next free address of control stream data to write in each tile. A tile used flag of 1 bit may be used for each tile, in order to identify if a tile is an empty tile or not when the scene is terminated.
Also there is no need to write out a control stream base address word for each tile in a region array, due to the number of macro tile based control stream lists is limited.
S
Control Stream data Structure The data structure of a multi level display control stream list system consists of three main parts: Tile Block Control List 403, Control Pointers 404 and Primitive Vertex Masks 405 as shown in Fig 4.
Tile block control lists 403 are provided per tile block based (TBO, TBI.... TB1 5).
There are 16 tile block control lists in a macro tile structure such as 403 in Fig 4.
Each tile block control list has its own memory space allocated in a macro tile's allocated memory space.
Control pointers 404 end primitive mask data 405 in Fig 4 are provided per macro tile.
These have separate memory space in a macro tile's allocated memory.
Tile Block Control List In a multi level display control list system the control stream list is provided per tile block (TBO etc) based. There is a tile block control list for each of the tile blocks in a macro tile as 500 in Fig 5. The tile block control list includes references to each primitive block referred by that tile block.
Because the tile block control list is shared between all the tiles in the tile block, a primitive block referenced in the control list may only be valid for some of the tiles in the tile block. The data structure used in this embodiment of the invention is designed to use short cuts to skip the tiles in which a primitive block does not need to be processed as soon as possible when rendering an image. For the tiles for which the primitive block does need to be processed, embodiment of the invention obtains all the information for processing the primitive block in a minimum amount of data.
Tile Block Control Word Because the address pointers for vertex data in primitive blocks are written in a per macro tile list 404, they are shared by all tile blocks in the macro tile. In a specific tile block not all of the primitive blocks in the macro tile may be referenced in the tile block. It is efficient to be able to skip these primitive blocks which are not needed in the tile block.
Q
Tile Block Control Word 503 in Fig 5 is the top level control word of the multi level display control list stream. It has two bit encoding as shown in Table 1, and is used primarily to determine whether a primitive block is to be processed for that tile block.
1tu IlJiljfs], 0 Skip one primitive block I I Valid primitive block 2 Skip with skip count 3 rerminate Tabie 1 -Tile Block Control Word Therefore as shown in Table 1, there are 4 possible operations defined by the two bit Tile Block Control Word: Skip One Primitive Block When a primitive block does not have any primitives in it to cover any tiles in the tile block, the primitive block is skipped without being processed.
Valid Primitive Block When any primitives in a primitive block cover any tiles in the tile block, the primitive block cannot be skipped. Further information is needed in the multi level display control list to process the primitive block for the tiles in the tile block.
Skip with Skip Count This is a short cut which is used when the tile block can skip a number of primitive blocks consecutively, i.e. these primitive blocks do not need to be processed for the tile.
In this embodiment there is a skip count offset of 5. If the skip count is less than 5 skip one primitive block encoding is used repeatedly. When the skip count is greater than or equal to 5 skip with skip count is used. The maximum skip count is 255 + 5 = 260 for 8 bits skip count. When the skip count is great than 260 more than one skip encoding is needed.
Terminate The terminate word indicates the multi level display control list is terminated for the tile block.
Tile Block Format Word Tile Block Format Word 504 in Fig 5 in the tile block control list is used after each valid primitive block encoding (value 1 in Table 1) of the Tile Block Control Word, It is used to identify whether the primitive block is valid for the tile which is being processed. The 2 bit encoding of the Tile Block Format Word is: !Piui I1III1l[s]l 0 tile masks present 1 ll tiles valid, tile masks not present 2 Only one tile valid ll tiles not valid, tile masks not 3 present Table 2 -Tile Block Format Word As shown in Table 2, there are 4 possible operations defined by the two bits in Tile Block Format Word: Tile Masks Present When a primitive block does not have primitives to cover every tile in the tile block i.e. some tiles are not overlapped by any primitives in the primitive block, a 1 bit per tile valid tile mask is used to skip the tiles in which the primitive block does not need to be processed. The total number of bits for the tile mask is equivalent to the number of tiles in tile block.
All Tiles Valid When a primitive block has primitives to cover every tile in the macro tile or tile block, then the primitive block is valid for all tiles in the tile block. In this case no tile masks are needed.
Only One Tile Valid When a primitive block covers only one tile in a macro tile, or one tile in a tile block, the tile index X and Y for the valid tile in the tile block are used to reduce the amount of data needed. In this case no tile masks are needed.
For a tile block with 8x8 tiles a 64 bit mask is needed to indicate if each tile is valid. If 4 bits are used for the tile index X and Y of the only valid tile in the tile block, then only 8 bits are needed.
All Tiles Not Valid When a primitive block has no primitives to cover any tile in the macro tile or tile block, then the primitive block is not valid for any tiles in the tile block. In this case no tile masks are needed. This case can only happen in the tile block format word for sub level tile blocks.
After the Tile Block Format Word has been processed, the current primitive block can be identified as valid or not for the tile which has been processed at 505 in Fig 5.
Full Primitive Mask Flag Present The Full Primitive Mask Flag Present 506 in Fig 5 is I bit flag for the primitive block which is valid for the current tile to indicate if the full primitive mask flags are present or not in the primitive block.
When all primitives in the primitive block are visible in a tile it is called a full primitive mask. Only a I bit flag is needed for a full primitive mask instead of individual bit masks for each primitive and vertices in the primitive block.
Full Primitive Mask Flags If the 1 bit flag of Full Primitive Mask Flag Present 506 in Fig 5 is set, then this causes I bit per tile full primitive mask flags 507 in Fig 5 to be set for each tile in the tile block to indicate whether each respective tile has a full primitive mask or not. The total number of bits of the Full Primitive Mask Flags is equal to the number of tiles in the tile block.
If there is not a full primitive mask, the vertex mask is concatenated by the primitive mask and made byte aligned. For a primitive block with a maximum 16 vertices and 32 primitives, the vertex mask is maximum 2 bytes, and the primitive mask is 4 bytes.
The size of the primitive and vertex masks are kept the same for a tile block. The primitive and vertex mask data are written in Primitive Mask Data stream 502 per macro tile as in Fig5.
For any tile block which has tiles with a full primitive mask flag, the use of a Full Primitive Mask Flag will save the time and memory space to get the information for valid primitives and vertices in the primitive block. If the primitive block is invalid for the tile been processed, these flags can be skipped.
Primitive Mask Format Word A two bit Primitive Mask Format Word 508 in Fig 5 in the tile block control list is present when the 1 bit flag of Full Primitive Mask Flag Present 506 in Fig 5 is not set, that is to say there are no Full Primitive Mask Flags.
piYIuI RIiIII1i]l lI tiles in the tile block with primitive 0 mask present ll tiles in the tile block with full 1 primitive mask 2 Same primitive mask in the macro tile 3 Same primitive mask in the tile block Table 3 -Primitive Mask Format Word As in Table 3, the Primitive Mask Format Word contains 4 special cases for primitive mask format in the tile block: All Tiles in Tile Block with Primitive Mask Present This indicates all the tiles in tile a block which is covered by the primitive block have primitive masks present. That is no tiles have a full primitive mask, and the primitive masks are not the same within the tile block. In this case the primitive and vertex masks are read from the Primitive Mask Data stream 502 in Fig5.
All Tiles in Tile Block with Full Primitive Mask This indicates that all the tiles in tile block covered by the primitive block have full primitive masks. There is no need for the 1 bit per tile Full Primitive Mask Flags 507 inFig5.
Same Primitive Mask in Macro Tile This indicates that all the tiles in macro tile covered by the primitive block have the same primitive mask, and the total bits of the common primitive mask and vertex mask are less than or equal to 24 bits.
The primitive and vertex mask data are written in a data stream 502 in Fig 5 for each macro tile in a separate 4 KB memory page. The start address of the primitive and vertex mask data stream is stored in Control Pointers 501 in Fig 5 as Primitive Mask Pointer Word. There are 3 bytes free bits in the address pointer word Primitive Mask Pointer Word due to the 4 KB aligned address.
In this case the common primitive mask and vertex mask for the macro tile are no more than 24 bits, they can be written to the LSB 3 bytes of Primitive Mask Pointer Word in the current Control Pointers.
Same Primitive Mask in Tile Block This indicates that all the tiles in a tile block which is covered by the primitive block have the same primitive mask, or have that same primitive mask in a macro tile, but the total bits of the common primitive mask and vertex mask are more than 24 bits.
In this case the common vertex mask is concatenated by the primitive mask, and both are written in the Tile Block Control List with a leading 3 bits for the number of bytes of the primitive and vertex masks.
The type of primitive mask in a tile being processed can be defined after the process as Full Primitive Mask Flags 507 or Primitive Mask Format Word 508 in Fig 5.
A tile which has been processed has primitive and vertex masks present when either Full Primitive Mask Flags 507 are processed but the tile is not in full primitive mask, or Primitive Mask Format Word 508 is set to 0 (all tiles in the tile block with primitive block mask present). In these two cases the primitive and vertex masks for the tile need to be read from the Primitive Mask Data stream 502 in Fig 5.
In all other cases the primitive and vertex masks are known because of either a full mask or the same mask in a macro tile/tile block.
Tile Block Primitive Mask Offset Present Primitive and vertex masks for each tile in the macro tile are written to the Primitive Mask Data stream 502 in Fig 5. The start address of the primitive and vertex mask data for the macro tile is defined by Control Pointers 501 in Fig 5 as Primitive Mask Pointer Word.
The primitive and vertex data are byte aligned for each tile, and 32 bits aligned for each tile block. For the tile blocks in which the primitive and vertex mask data are not written at the start of the Primitive Mask Data stream, a primitive mask offset is needed to get to the start of the primitive and vertex mask data for the tile block from the data stream. Because the primitive and vertex mask data are the same size for each tile in a tile block, a tile index can be used in conjunction with the tile block primitive mask offset to read back the mask data for a tile.
A 1 bit flag is used to indicate that a tile block primitive mask offset is present when the primitive mask data in the tile block is not at the start of the macro tile primitive mask data stream. The 1 bit flag is only needed in the two cases where the primitive and vertex masks for the tile need to be read from the Primitive Mask Data stream.
Tile Block Primitive Mask Offset Tile Block Primitive Mask Offset 509 in Fig 5 is a 13 bit word offset for the start address of the primitive mask data in the tile block from the start address of the primitive mask data of the primitive block in the macro tile. The field is only present when a Tile Block Primitive Mask Offset Present bit is set.
At the end of process, from the Tile Block Control List 500 in Fig 5 for the primitive block, it is known whether the primitive block needs to be processed in the current tile. If the primitive block does need to be processed, the primitive and vertex masks are known for the current tile as well.
Macro Tile Control Pointers There are two 32 bits control pointer words in Macro Tile Control Pointers 501 in Fig for each primitive block which is valid in the macro tile. If a primitive block is determined to be valid for a tile at 505 then the Macro Tile Control Pointers 505 are used to access the primitive data from a block.
Primitive Block Pointer Word The primitive Block Pointer Word is mainly used to store the start address of the vertex data stream for a primitive block. The structure of the 32 bits Pnmitive Block Pointer Word can be seen in Table 4: I1 HT_PRIM_COUNT 31: 27 Number of primitives -I in the primitive block, maximum 32 0 primitives.
HT_VERT_PTR ?5: 0 32 bits aligned address pointer or the primitive block vertex 1 data, 256 MB range.
Table 4 -Primitive block pointer Word Primitive Mask Pointer Word Primitive Mask Pointer Word is mainly used to store a start address of the primitive and vertex mask data stream in a macro tile for the primitive block. The structure of the 32 bits Primitive Mask Pointer Word can be seen in Table 5: Tr* fTiiI;1I PIIli1l[]i HI_VERTEX_COUNT 31: 28 Number of vertices -1 in the primitive block, maximum 16 0 ertices.
HT_PRIM_MASK_PTR 25: 12 32 bits aligned address pointer I or the start of primitive and ertex mask data in the macro ile for the primitive block, 256 MB range.
HI_SAME_PRIM_MASK_IN 11: 0 Ehe common primitive and 2 MT ertex masks for the macro tile.
Table 5 -Primitive Mask Pointer Word The field 2 HI_SAME_PRIM_MASK_IN_MT is only valid when Primitive Mask Format Word is set to 2 as in Table 3, which means there is a common primitive mask for the macro tile.
Because the control pointers are fixed 2 32 bits per primitive block, the control pointers can be skipped for the primitive blocks which are not valid in a tile block.
Macro Tile Primitive Mask Data When a primitive mask in a tile is neither a full mask nor the same mask in a macro tile or tile block, the primitive and vertex mask data are written to a Macro Tile Primitive Mask Data stream 502 in Fig 5.
In Primitive Mask Data stream a vertex mask is concatenated with a primitive mask for a tile. The number of bits is the number of vertices plus the number of primitives in the primitive block then rounded up to bytes. The size of the primitive and vertex mask data are kept the same within a tile block.
The Macro Tile Primitive Mask Data stream is macro tile based, primitive and vertex mask data from each tile block written continuously to the memory page.
The start address of primitive and vertex mask data for a primitive block in a tile block is 32 bits aligned. The start address is stored by Control Pointers 501 in Fig 5 for the primitive block as Primitive Mask Pointer Word.
The 32 bits offset of the start address for the tile block from the start address of the primitive block in macro tile is stored in Tile Block Control List 500 as 13 bit Tile Block Primitive Mask Offset 509 in Fig 5 when Tile Block Primitive Mask Offset Present bit is set. If Tile Block Primitive Mask Offset Present bit is not set that means the offset of the primitive mask data in the tile block is 0. Because the primitive and vertex mask data are the same size for each tile in a tile block, a tile index can be used in conjunction with the tile block primitive mask offset to read back the primitive and vertex mask data for a tile.
Tile Block Sub Level Mask in Tile Block Control List As a macro tile is divided into 16 tile blocks. Primitive blocks which are not visible in a tile block can be skipped. Within a tile block, valid tile masks are used to skip tiles which the pnmitive block does not need to process. The amount of control data read in 3D processing will be reduced when further sub level blocks are introduced in each tile block.
The tile block control list described can be easily expanded into a multi level display control list structure.
For any valid primitive blocks in the tile block control list, the tile block format word 504 in Fig 5 can be expanded into 4 sub level tile block format words instead of one tile block format word. Each sub level tile block will have a quarter of the tiles from the tile block. The possibility is increased for the sub level tile blocks to encounter the special cases of all tiles valid, all tiles not valid and only one tile valid as 1 to 3 in Tile Block Format Word 504. Furthermore the chances of the sub level tile blocks having all full primitive masks and same primitive mask in Primitive Mask Format Word 508 in Fig 5 are increased as well. Therefore the amount of control data needed will be reduced compared with one tile block format word.
Multi sub levels can be used to reduce the number of tiles contained in a tile block format word further. This is especially true for the large render size with more tiles in a macro tile.
In a render of 640x480 pixels with multi sample anti-aliasing (MSAA) enabled there are 6x4 = 24 tiles in a tile block. By adding one sub level block in a tile block the sub level block size is 3x2 = 6 tiles. In this case the invalid tiles may be skipped early, and the chance of the tiles getting into the special cases for primitive mask data may be increased in the 6 tiles sub level rather than the 24 tiles tile block.
I
4Ox48O no 16 0x30 12x8 x2 3x2 I I 640x480 es 16 80x60 4x16 x4 3x2 1024x768 no 16 64x48 16x12 x3 x2 1024x768 es 16 128x96 12x24 x6 x3 x3 O48x2O4 no 128x12 1x8 x4 2x2 8 16 1 2x32 O48x2O4 es 56x25 16x16 1x8 *x4 ax2 8 16 6 64x64 O96x4O9 no 56x25 16x16 1x8 *x4 2x2 6 16 6 4x64 O96x4O9 es 512x51 128x12 2x32 16x16 8x8 x4 x2 6 16 8 Table 9 -Multi Level Display Control List Tile Block Size Table 9 shows the number of tiles in a tile block for some common used render sizes.
It also shows the number of tiles in sub level blocks within a tile block.
In the case where the number of tiles in a tile block is not an even number on X or/and Y directions, some of the sub level blocks may never have the full valid tile mask. As render size 1 024x768 non MSAA case, the tile block size is 4x3 tiles and sub level size is 2x2 tiles. Inside a tile block the lower two sub level blocks have 2x1 tiles while the top two sub level blocks have 2x2 tiles.
For a small render size sub level blocks may not be needed, like render size 640x480 non MSAA case. For a very large render size with MSAA enabled, multi sub level blocks may be needed, as shown in Table 9.
A Multi Level Display Control List in the embodiment the proposed invention has reduced the amount of control stream data significantly, in some case by 90%. This is achieved by allocating primitive blocks to control stream data for different groupings of tiles within a multi-level structure of progressively larger groupings of tiles such that the number of tiles primitive blocks are written to the control stream data is reduced for blocks that cover a plurality of tiles.

Claims (10)

  1. CLAIM1. A method for rendering a 3 dimensional computer graphics image comprising the steps of: subdividing the image into a plurality of rectangular tiles; arranging the tiles in a multi level structure comprising a plurality of levels of progressively larger groupings of tiles; dividing image data to be rendered into a plurality of primitive blocks containing image data required to render the image; assigning each primitive block to a grouping of tiles within the multi level structure in dependence on the grouping of tiles it intersects, such that each primitive block is assigned to a grouping in one of the levels in the multi level structure; deriving control stream data for rendering the image, the control stream data comprising references to primitive blocks for each grouping of tiles within each level of the multi level structure corresponding to the primitive blocks assigned to each grouping; and using the control stream data to render primitive data into tiles within the groupings of tiles for display.
  2. 2. A method according to claim I in which the control stream data includes a Tile Block Control Word and including the step of determining for the status of the Tile Block Control Word whether a primitive block is to be processed in rendering respective tile block.
  3. 3. A method according to claim 2 in which the control stream data includes a Tile Block Control Format Word, which is used after each primitive block determined to be processed for a tile block, to determine whether the primitive block is to be processed for a tile currently being processed.
  4. 4. A method according to claim 3 in which the control stream data includes a Full Primitive Mask Flag Present bit to determine whether any full primitive masks are present in the control stream data, and the full primitive masks are used to get a bit for each tile in a tile block which determines whether all primitives in a primitive block are visible in each respective tile.
  5. 5. A method according to claim 4 in which the control stream data comprises a Primitive Mask Format Word which is used to determine which tiles in a tile block have full primitives masks when the Full Primitive Mask Flag Present bit is not reset.
  6. 6. Apparatus for rendering 3 dimensional computer graphics image comprising: means for subdividing the image into a plurality of rectangular tiles; means for arranging the tiles in a multi level structure comprising a plurality of levels of progressively larger groupings of tiles; means for dividing image data to be rendered into a plurality of primitive blocks containing image data required to render the image; means for assigning each primitive block to a grouping of tiles within the multi level structure independence on the grouping of tiles in intersects, such that each primitive block is assigned to a grouping in one of the levels in the multi level structure; means for deriving control stream data for rendering the image, the control stream data comprising references to primitive blocks for grouping of tiles within each level of the multi level structure corresponding to the primitive blocks assigned to each grouping; and means for using the control stream data to render primitive data into tiles within the groupings of tiles for display.
  7. 7. An apparatus according to claim 6 in which the control stream data includes a Tile Block Control Word and the system includes the means for determining from the status of the Tile Block Control Word whether a primitive block is to be processed in rendering a respective tile block.
  8. 8. A system according to claim 7 which the control stream data includes a Tile Block Control Format Word and means for using the Tile Block Control Format Word to determine for each primitive determined to be required to be processed to a tile block to determine whether the primitive block is to be processed for a tile currently being processed within the tile block.
  9. 9. A system according to claim 8 in which the control stream data includes a Full Primitive Mask Flag Present bit and including the step of determining whether any Full Primitive Mask bits are present in the control stream data from the Full Primitive Mask Flag Present bit and means for setting Full Primitive Masks bits for each tile in a tile block to determine whether full primitives in a primitive block are visible in each respective tile.
  10. 10. A system according to claim 9 in which the control stream data comprises a Primitive Mask Format Word and including means to determine from the Primitive Mask Format Word which tiles in a tile block have full primitive masks when the Full Primitive Mask Flag Present bit is not set.
GB0922374.4A 2008-12-19 2009-12-21 Multi level display control list in tile based 3D computer graphics system Active GB2466569B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GBGB0823254.8A GB0823254D0 (en) 2008-12-19 2008-12-19 Multi level display control list in tile based 3D computer graphics system

Publications (3)

Publication Number Publication Date
GB0922374D0 GB0922374D0 (en) 2010-02-03
GB2466569A true GB2466569A (en) 2010-06-30
GB2466569B GB2466569B (en) 2012-04-25

Family

ID=40343947

Family Applications (2)

Application Number Title Priority Date Filing Date
GBGB0823254.8A Ceased GB0823254D0 (en) 2008-12-19 2008-12-19 Multi level display control list in tile based 3D computer graphics system
GB0922374.4A Active GB2466569B (en) 2008-12-19 2009-12-21 Multi level display control list in tile based 3D computer graphics system

Family Applications Before (1)

Application Number Title Priority Date Filing Date
GBGB0823254.8A Ceased GB0823254D0 (en) 2008-12-19 2008-12-19 Multi level display control list in tile based 3D computer graphics system

Country Status (6)

Country Link
US (1) US9336623B2 (en)
EP (2) EP2380138B1 (en)
JP (1) JP5579193B2 (en)
CN (2) CN105913470A (en)
GB (2) GB0823254D0 (en)
WO (1) WO2010070302A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2586297A (en) * 2020-02-10 2021-02-17 Imagination Tech Ltd Data structures, methods and tiling engines for storing tiling information in a graphics processing system

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2458488C (en) * 2008-03-19 2018-09-12 Imagination Tech Ltd Untransformed display lists in a tile based rendering system
US8505001B2 (en) * 2009-12-17 2013-08-06 Broadcom Corporation Method and system for utilizing data flow graphs to compile shaders
GB2545589A (en) * 2012-11-02 2017-06-21 Imagination Tech Ltd On demand geometry and acceleration structure creation
US9336561B2 (en) * 2012-12-19 2016-05-10 Intel Corporation Color buffer caching
GB2517185B (en) * 2013-08-14 2020-03-04 Advanced Risc Mach Ltd Graphics tile compositing control
GB2526598B (en) 2014-05-29 2018-11-28 Imagination Tech Ltd Allocation of primitives to primitive blocks
US9569811B2 (en) 2014-06-26 2017-02-14 Qualcomm Incorporated Rendering graphics to overlapping bins
JP6335335B2 (en) * 2014-06-30 2018-05-30 インテル・コーポレーション Adaptive partition mechanism with arbitrary tile shapes for tile-based rendering GPU architecture
GR20140100528A (en) * 2014-10-20 2016-06-01 Arm Limited A title based graphics processor and a method of performing graphics processing in a tile based graphics processor
US10535114B2 (en) * 2015-08-18 2020-01-14 Nvidia Corporation Controlling multi-pass rendering sequences in a cache tiling architecture
KR102454893B1 (en) 2015-09-30 2022-10-14 삼성전자주식회사 Graphic Processing Apparatus and Operating Method thereof
GB2546810B (en) 2016-02-01 2019-10-16 Imagination Tech Ltd Sparse rendering
US10430990B2 (en) * 2017-09-20 2019-10-01 Intel Corporation Pixel compression mechanism
CN107918665B (en) * 2017-11-23 2021-12-14 上海模袋网络科技有限公司 System and method for rapid rendering of three-dimensional model and CAD information query
CN111429556B (en) 2018-12-21 2023-03-24 畅想科技有限公司 Primitive block based rasterization in a graphics processing system
EP3690798A1 (en) 2019-01-29 2020-08-05 Imagination Technologies Limited Memory for storing untransformed primitive blocks
CN114902274A (en) * 2020-01-20 2022-08-12 超威半导体公司 Mixing sub-box
GB2591803B (en) 2020-02-07 2022-02-23 Imagination Tech Ltd Graphics processing method and system for rendering items of geometry based on their size
GB2591802B (en) * 2020-02-07 2022-03-23 Imagination Tech Ltd Graphics processing method and system for rendering items of geometry based on their size
CN113256479A (en) 2020-02-13 2021-08-13 畅想科技有限公司 Method and tiling engine for tiling primitives in a graphics processing system
CN111859589A (en) * 2020-07-20 2020-10-30 山西潞安环保能源开发股份有限公司 Method for automatically laying out primitives in transformer substation suitable for setting calculation software
CN112214809B (en) * 2020-10-16 2023-11-17 安徽工程大学 Random primitive arrangement method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2433014A (en) * 2005-12-05 2007-06-06 Arm Norway As Rendering tile-based graphics using primitive lists
GB2444863A (en) * 2005-12-05 2008-06-18 Arm Norway As Rendering tile-based graphics using primitive lists

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB717781A (en) 1952-03-26 1954-11-03 Henry Edward Billington Improvements in vehicle couplings
CA2026527A1 (en) 1989-10-11 1991-04-12 Douglas A. Fischer Parallel polygon/pixel rendering engine
US5295235A (en) 1992-02-14 1994-03-15 Steve Newman Polygon engine for updating computer graphic display employing compressed bit map data
GB2336982B (en) 1995-01-31 2000-01-19 Videologic Ltd Improvements relating to computer 3D rendering systems
US5867166A (en) 1995-08-04 1999-02-02 Microsoft Corporation Method and system for generating images using Gsprites
US5767856A (en) 1995-08-22 1998-06-16 Rendition, Inc. Pixel engine pipeline for a 3D graphics accelerator
US6104417A (en) 1996-09-13 2000-08-15 Silicon Graphics, Inc. Unified memory computer architecture with dynamic graphics memory allocation
US6697063B1 (en) 1997-01-03 2004-02-24 Nvidia U.S. Investment Company Rendering pipeline
US6084591A (en) 1997-04-29 2000-07-04 Ati Technologies, Inc. Method and apparatus for deferred video rendering
US7102646B1 (en) 1997-11-25 2006-09-05 Nvidia U.S. Investment Company Demand-based memory system for graphics applications
US6856320B1 (en) 1997-11-25 2005-02-15 Nvidia U.S. Investment Company Demand-based memory system for graphics applications
US6362818B1 (en) 1998-01-07 2002-03-26 Evans & Sutherland Computer Corporation System and method for reducing the rendering load for high depth complexity scenes on a computer graphics display
US6611272B1 (en) * 1998-07-02 2003-08-26 Microsoft Corporation Method and apparatus for rasterizing in a hierarchical tile order
GB2343603B (en) * 1998-11-06 2003-04-02 Videologic Ltd Shading 3-dimensional computer generated images
US6380935B1 (en) 1999-03-17 2002-04-30 Nvidia Corporation circuit and method for processing render commands in a tile-based graphics system
US7627736B2 (en) * 1999-04-09 2009-12-01 Clearspeed Technology Plc Thread manager to control an array of processing elements
US6801203B1 (en) * 1999-12-22 2004-10-05 Microsoft Corporation Efficient graphics pipeline with a pixel cache and data pre-fetching
US20020039100A1 (en) 2000-06-08 2002-04-04 Stephen Morphet Memory management for systems for generating 3-dimensional computer images
US7405734B2 (en) 2000-07-18 2008-07-29 Silicon Graphics, Inc. Method and system for presenting three-dimensional computer graphics images using multiple graphics processing units
US6738069B2 (en) * 2001-12-31 2004-05-18 Intel Corporation Efficient graphics state management for zone rendering
US20030123089A1 (en) * 2001-12-31 2003-07-03 Doyle Peter L. Graphics state bandwidth reduction for zone rendering
GB2416100B (en) 2002-03-26 2006-04-12 Imagination Tech Ltd 3D computer graphics rendering system
JP4082194B2 (en) 2002-12-03 2008-04-30 セイコーエプソン株式会社 Image processing method, image processing apparatus, and image processing program
GB0307095D0 (en) 2003-03-27 2003-04-30 Imagination Tech Ltd Improvements to a tiling system for 3d rendered graphics
GB2406184B (en) 2003-09-17 2006-03-15 Advanced Risc Mach Ltd Data processing system
US20050122338A1 (en) * 2003-12-05 2005-06-09 Michael Hong Apparatus and method for rendering graphics primitives using a multi-pass rendering approach
US7652678B2 (en) 2004-06-25 2010-01-26 Apple Inc. Partial display updates in a windowing system using a programmable graphics processing unit
GB2449399B (en) * 2006-09-29 2009-05-06 Imagination Tech Ltd Improvements in memory management for systems for generating 3-dimensional computer images
KR101349171B1 (en) 2007-01-17 2014-01-09 삼성전자주식회사 3-dimensional graphics accelerator and method of distributing pixel thereof
GB2452731B (en) 2007-09-12 2010-01-13 Imagination Tech Ltd Methods and systems for generating 3-dimensional computer images
CN101127125A (en) 2007-09-26 2008-02-20 腾讯科技(深圳)有限公司 Image rendering method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2433014A (en) * 2005-12-05 2007-06-06 Arm Norway As Rendering tile-based graphics using primitive lists
GB2444863A (en) * 2005-12-05 2008-06-18 Arm Norway As Rendering tile-based graphics using primitive lists

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2586297A (en) * 2020-02-10 2021-02-17 Imagination Tech Ltd Data structures, methods and tiling engines for storing tiling information in a graphics processing system
GB2586297B (en) * 2020-02-10 2022-03-02 Imagination Tech Ltd Data structures, methods and tiling engines for storing tiling information in a graphics processing system

Also Published As

Publication number Publication date
US9336623B2 (en) 2016-05-10
CN102292748A (en) 2011-12-21
JP2012513053A (en) 2012-06-07
GB0823254D0 (en) 2009-01-28
EP2380138B1 (en) 2017-06-21
EP3236420A1 (en) 2017-10-25
US20110292032A1 (en) 2011-12-01
WO2010070302A3 (en) 2011-07-07
JP5579193B2 (en) 2014-08-27
EP2380138A2 (en) 2011-10-26
WO2010070302A2 (en) 2010-06-24
GB2466569B (en) 2012-04-25
GB0922374D0 (en) 2010-02-03
CN105913470A (en) 2016-08-31

Similar Documents

Publication Publication Date Title
US9336623B2 (en) Multilevel display control list in tile based 3D computer graphics system
US9384584B2 (en) Display list control stream grouping in tile based 3D computer graphics system
KR100478767B1 (en) Graphic processing with deferred shading
US7023437B1 (en) System and method for accelerating graphics processing using a post-geometry data stream during multiple-pass rendering
US6229553B1 (en) Deferred shading graphics pipeline processor
US5767858A (en) Computer graphics system with texture mapping
US6480205B1 (en) Method and apparatus for occlusion culling in graphics systems
KR100887012B1 (en) Increased scalability in the fragment shading pipeline
US8669999B2 (en) Alpha-to-coverage value determination using virtual samples
US20020171658A1 (en) Rasterization using two-dimensional tiles and alternating bins for improved rendering utilization
WO1996003715A1 (en) A graphical display system
US20030025701A1 (en) Graphics pixel packing for improved fill rate performance
US20140118370A1 (en) Managing per-tile event count reports in a tile-based architecture
US8928679B2 (en) Work distribution for higher primitive rates
EP1255227A1 (en) Vertices index processor
US7027047B2 (en) 3D graphics rendering engine for processing an invisible fragment and a method therefor
KR20180023856A (en) Graphics processing systems and graphics processors
US20030058244A1 (en) Over-evaluating samples during rasterization for improved datapath utilization
CN116261740A (en) Compressing texture data on a per channel basis
Weyrich et al. A hardware architecture for surface splatting
US11954038B2 (en) Efficient evict for cache block memory
US20210398241A1 (en) Techniques for performing accelerated point sampling in a texture processing pipeline