CN112559139A - SystemC-based multi-GPU transaction-level model device and operation method - Google Patents
SystemC-based multi-GPU transaction-level model device and operation method Download PDFInfo
- Publication number
- CN112559139A CN112559139A CN202011393054.0A CN202011393054A CN112559139A CN 112559139 A CN112559139 A CN 112559139A CN 202011393054 A CN202011393054 A CN 202011393054A CN 112559139 A CN112559139 A CN 112559139A
- Authority
- CN
- China
- Prior art keywords
- model
- gpu
- opengl
- command
- transaction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/466—Transaction processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/14—Handling requests for interconnection or transfer
- G06F13/20—Handling requests for interconnection or transfer for access to input/output bus
- G06F13/32—Handling requests for interconnection or transfer for access to input/output bus using combination of interrupt and burst mode transfer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
- G06F15/177—Initialisation or configuration control
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/54—Interprogram communication
- G06F9/542—Event management; Broadcasting; Multicasting; Notifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2213/00—Indexing scheme relating to interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F2213/0026—PCI express
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computer Hardware Design (AREA)
- Multimedia (AREA)
- Controls And Circuits For Display Device (AREA)
Abstract
The invention relates to a multi-GPU transaction-level model device based on SystemC and an operation method. The device comprises an OpenGL driving model, a host model, a multi-GPU TLM model, a video memory model, a display model and transaction interfaces for connecting all model components, wherein the OpenGL driving model, the host model, the multi-GPU TLM model, the video memory model and the display model are sequentially connected; the operation method comprises the following steps: 1) the OpenGL driving model receives an OpenGL command issued by a graphic application; 2) the host model implements a transaction-level interface; 3) the video memory model provides writing and reading of graphic drawing data for each GPU model; 4) the display model converts the image data in the video memory model into a displayable picture format. The invention provides a system C-based multi-GPU transaction-level model device and an operation method, which solve the requirement of early verification of a chip system architecture and related functional units, provide virtual prototypes for multi-GPU graphic processing application development and provide reference design for RTL design.
Description
Technical Field
The invention relates to the technical field of graphic processor modeling, relates to a multi-GPU transaction-level model device, and particularly relates to a multi-GPU transaction-level model device based on SystemC and an operation method.
Background
With the continuous development of graphics applications, a graphics rendering solution based on a single GPU has been difficult to meet the demand for fast graphics processing, and a graphics rendering solution with multiple GPUs in parallel has come into play. In addition to the vigorous development of artificial intelligence, GPUs have been applied to AI parallel computing applications in a comprehensive manner, and since the computing power of a single GPU is effective, a plurality of GPUs are required to process data cooperatively in order to deal with the processing of massive training data in the AI field. The ability of multiple GPUs to co-process data is therefore fundamental to the development of AI applications.
From 1999, the first GPU product released by Nvidia to date, the development of GPU technology mainly goes through the fixed function pipeline stage, the separation stainer architecture stage, and the unified stainer architecture stage, the graphics processing capability of the GPU technology is continuously improved, and the application field is gradually expanded from the initial graphics drawing to the general computing field. The GPU pipeline has high speed, parallel characteristics and flexible programmability, and provides a good running platform for graphic processing and general parallel computing.
At present, the GPU development capability in China is weak, and a large number of commercial GPU chips imported from abroad are adopted in display control systems in various fields. Especially in the military field, the foreign imported commercial GPU chip has hidden dangers in the aspects of safety, reliability, guarantee and the like, and cannot meet the requirements of the military environment; moreover, for political, military, economic reasons and the like, technology blocking and product monopoly are carried out in China abroad, and bottom technical data of the GPU chip, such as register data, detailed internal micro-architecture, core software source codes and the like, are difficult to obtain, so that the functions and the performances of the GPU cannot be fully exerted, and the portability is poor; the problems seriously restrict the independent development and autonomous development of the display system in China, break through the key technology of the graphics processor and develop the graphics processor chip at will.
The development system architecture and the graphic algorithm of the GPU chip supporting the multi-GPU cooperative work are complex, the logic scale of hardware is huge, and the design needs to be described on a higher abstraction level so as to be capable of carrying out higher-speed simulation, software/hardware cooperative simulation and system architecture exploration. When the design is expressed as a system-level model, the design can be easily tried for many times by selecting different algorithms, and the test can be quickly completed by using different structures; if a design is expressed using a register transfer level or gate level model, the scale is usually quite large, and it is time consuming and laborious if different design structures are tried and some changes are made, if not too difficult. The key factor of SystemC as a language for promoting the development and standardization is that system level design can be performed, and the hardware architecture and software algorithm can be described, so as to support verification and IP communication. Using SystemC as a partitioning tradeoff of software and hardware at the system level is much easier than other languages and the simulation is much faster than using multiple languages. Therefore, a multi-GPU transaction level model is designed by adopting a SystemC language, and modeling is directly carried out on a high abstraction level.
Disclosure of Invention
Based on the problems in the background art, the invention provides a multi-GPU transaction-level model device based on SystemC and an operation method thereof. The technical solution of the invention is as follows:
a multi-GPU transaction level model device based on SystemC is characterized in that:
the system comprises an OpenGL driving model, a host model, a multi-GPU TLM model, a video memory model, a display model and a plurality of transaction-level interfaces for connecting each model component, wherein the OpenGL driving model, the host model, the multi-GPU TLM model, the video memory model and the display model are sequentially connected;
the OpenGL driving model is used for receiving an OpenGL command issued by a graphic application, carrying out software preprocessing, packaging the OpenGL command into an OpenGL command packet according to a preset format, and then sending an OpenGL command code to the host model;
the host model comprises a command packet broadcast transaction level interface, an architecture register read-write transaction level interface and a transaction level interface for acquiring and clearing a host interrupt state. After receiving the OpenGL command, the host model broadcasts each GPU model OpenGL command of the multi-GPU TLM model to a PCIe unit of each GPU and acquires and clears a host interrupt state of each GPU through a PCIe configuration interface of each GPU model;
the multi-GPU TLM model is formed by a plurality of GPU models in an interconnected mode, and each GPU model comprises a command processing unit, a PCIe unit, a geometric engine unit, a graphic processing subset and a fragment processing unit;
the video memory model provides a writing and reading interface of graphic drawing data for each GPU TLM model;
the display model is used for converting the image data in the RGBA format in the video memory model into a displayable BMP picture format.
The multi-GPU transaction-level model device based on SystemC is characterized in that:
the GPU model is constructed on the basis of an object-oriented modeling language SystemC;
the GPU TLM model is connected with the host model through an interface provided by a PCIe unit and provides an interrupt state read-write interface method and an architecture register access interface method for the GPU TLM model;
the PCIe unit preprocesses an OpenGL command packet issued by a host model broadcast and then sends the OpenGL command packet to the command processing unit;
and the GPU model command processing unit determines whether to continuously issue the OpenGL command packet or not by judging the OpenGL command function code and the GPU ID.
The multi-GPU transaction-level model device based on SystemC is characterized in that:
the multi-GPU TLM model divides OpenGL commands into broadcast full execution and broadcast partial execution;
and the multi-GPU model command processing unit determines whether to continuously issue the OpenGL command packet or not by judging the OpenGL command function codes and the GPU ID. If the OpenGL command function code type is broadcast full execution, the command processing units of all GPU models send the command packet to subsequent processing units; if the OpenGL command function code type is broadcast partial execution and the GPU ID is 0, the command processing unit of the GPU model with the GPU ID of 0 sends the command packet to a subsequent processing unit; if the OpenGL command function code type is broadcast partial execution and the GPU ID is not 0, the command processing unit of the GPU model whose GPU ID is not 0 directly discards the command packet without sending it to the subsequent unit.
The multi-GPU transaction-level model device based on SystemC is characterized in that: and the multiple GPU models perform corresponding graphic drawing and rendering according to the drawing areas indicated by the GPU IDs and write the graphics into the corresponding video memory models.
The operating method of the multi-GPU transaction-level model device based on SystemC is characterized in that: the method comprises the following operation steps:
1) the OpenGL driving model receives an OpenGL command issued by a graphic application;
2) the host model implements a transaction-level interface;
3) the video memory model provides a writing-in and reading-out interface of graphic drawing data for each GPU model;
4) the display model converts the image data in the video memory model into a displayable picture format.
The operating steps of the SystemC-based multi-GPU transaction-level model device are characterized in that: the step 2) comprises the following steps:
1) assemble it into one of two custom command packages-CM 0 and CM2 command packages, according to OpenGL graphics command features;
2) calling a command packet broadcast transaction-level interface in the host model to broadcast and send the command packet to each GPU model;
3) calling a read-write transaction level interface of an architecture register in a host model to configure a cooperation mode of the multi-GPU TLM model and the coordinates, the width and the height of the starting point of a region which is complexly drawn by the multi-GPU TLM model;
4) and calling an acquisition and host interrupt state clearing transaction level interface in the host model to acquire the frame drawing completion interrupt of each multi-GPU TLM model so as to synchronize drawing actions among the multi-GPU TLM models.
The operating method of the multi-GPU transaction-level model device based on SystemC is characterized in that: the step 3) comprises the following steps:
1) command packet broadcast
The method comprises the steps that DMA channel configuration methods provided by PCIe configuration interfaces of all GPU models are called, and command packets are broadcasted to PCIe units of all GPU models for processing;
2) acquiring and clearing host interrupt status
Acquiring and clearing the host interrupt state of the GPU model by calling an interrupt state read-write method provided by a PCIe configuration interface of each GPU model;
3) architectural register read and write
And configuring the cooperation mode of the GPU model and the coordinates, the width and the height of the starting point of the area which is drawn by the GPU model in a complex way by calling an architecture register access method provided by a PCIe configuration interface of each GPU model.
The invention has the technical effects that:
1. the invention provides a multi-GPU transaction-level model device based on SystemC, wherein connection and communication between models of the device are realized through calling of transaction-level interfaces.
2. The SystemC-based multi-GPU transaction-level model device solves the problem that a multi-GPU system architecture and a related graphic algorithm are difficult to verify at the early stage of a project, provides a virtual prototype of multi-GPU cooperative work for software development, provides a verified architecture, algorithm and reference design for RTL development, improves the speed of software and hardware cooperative verification, and accelerates the progress of project development.
Drawings
FIG. 1 is a block diagram of a multi-GPU transaction level model device architecture based on SystemC.
Detailed Description
The technical solution of the present invention will be clearly and completely described below with reference to the accompanying drawings and the specific embodiments. It is obvious that the described embodiments are only a part of the embodiments of the present invention, rather than the whole embodiments, and that all other embodiments, which can be derived by a person skilled in the art without inventive step based on the embodiments of the present invention, belong to the scope of protection of the present invention.
Referring to fig. 1, a SystemC-based multi-GPU transaction-level model device includes an OpenGL driving model, a host model, a multi-GPU TLM model, a video memory model, a display model, and a plurality of transaction-level interfaces for connecting each model component, which are connected in sequence;
the OpenGL driving model is used for receiving an OpenGL command issued by a graphic application, carrying out software preprocessing, packaging the OpenGL command into an OpenGL command packet according to a preset format, and then sending an OpenGL command code to the host model;
the host model comprises a command packet broadcast transaction level interface, an architecture register read-write transaction level interface and a transaction level interface for acquiring and clearing the host interrupt state. After receiving the OpenGL command, the host model broadcasts each GPU model OpenGL command of the multi-GPU TLM model to a PCIe unit of each GPU and acquires and clears a host interrupt state of each GPU through a PCIe configuration interface of each GPU model;
the multi-GPU TLM model is formed by a plurality of GPU models which are interconnected, wherein each GPU model comprises a command processing unit, a PCIe unit, a geometric engine unit, a graphic processing subset and a fragment processing unit;
the video memory model provides a writing and reading interface of graphic drawing data for each GPU TLM model;
the display model is used for converting the image data in the RGBA format in the video memory model into the displayable BMP picture format.
Preferably, the method comprises the following steps: the multi-GPU model of the multi-GPU transaction-level model device based on SystemC is constructed based on an object-oriented modeling language SystemC;
the GPU TLM model is connected with the host model through an interface provided by a PCIe unit, and an interrupt state read-write interface method and an architecture register access interface method are provided for the GPU TLM model;
the PCIe unit preprocesses an OpenGL command packet issued by a host model broadcast and then sends the OpenGL command packet to the command processing unit;
the GPU model command processing unit determines whether to continue issuing the OpenGL command packet by determining the OpenGL command function code and the GPU ID.
The multi-GPU TLM model of the multi-GPU transaction-level model device based on SystemC divides OpenGL commands into broadcast full execution and broadcast partial execution;
the multi-GPU model command processing unit determines whether to continuously issue the OpenGL command packet or not by judging the OpenGL command function codes and the GPU ID. If the OpenGL command function code type is broadcast full execution, the command processing units of all GPU models send the command packet to subsequent processing units; if the OpenGL command function code type is broadcast partial execution and the GPU ID is 0, the command processing unit of the GPU model with the GPU ID of 0 sends the command packet to a subsequent processing unit; if the OpenGL command function code type is broadcast partial execution and the GPU ID is not 0, the command processing unit of the GPU model whose GPU ID is not 0 directly discards the command packet without sending it to the subsequent unit.
The multi-GPU model of the multi-GPU transaction-level model device based on SystemC performs corresponding graph drawing and rendering according to the drawing area indicated by the GPU ID and writes the drawing area into the corresponding video memory model.
The operation method of the multi-GPU transaction-level model device based on SystemC comprises the following operation steps:
1) the OpenGL driving model receives an OpenGL command issued by a graphic application;
2) the host model implements a transaction-level interface;
3) the video memory model provides a writing-in and reading-out interface of graphic drawing data for each GPU model;
4) the display model converts the image data in the video memory model into a displayable picture format.
The step 2) of the above system c-based multi-GPU transaction-level model device includes the following steps:
1) assemble it into one of two custom command packages-CM 0 and CM2 command packages, according to OpenGL graphics command features;
2) calling a command packet broadcast transaction-level interface in the host model to broadcast and send the command packet to each GPU model;
3) calling a read-write transaction level interface of an architecture register in a host model to configure a cooperation mode of the multi-GPU TLM model and the coordinates, the width and the height of the starting point of a region which is complexly drawn by the multi-GPU TLM model;
4) and calling an acquisition and host interrupt state clearing transaction level interface in the host model to acquire the frame drawing completion interrupt of each multi-GPU TLM model so as to synchronize drawing actions among the multi-GPU TLM models.
Step 3) of the above multi-GPU transaction level model apparatus based on SystemC comprises the following steps:
1) command packet broadcast
The method comprises the steps that DMA channel configuration methods provided by PCIe configuration interfaces of all GPU models are called, and command packets are broadcasted to PCIe units of all GPU models for processing;
2) acquiring and clearing host interrupt status
Acquiring and clearing the host interrupt state of the GPU model by calling an interrupt state read-write method provided by a PCIe configuration interface of each GPU model;
3) architectural register read and write
And configuring the cooperation mode of the GPU model and the coordinates, the width and the height of the starting point of the area which is drawn by the GPU model in a complex way by calling an architecture register access method provided by a PCIe configuration interface of each GPU model.
Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those skilled in the art; the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (7)
1. A multi-GPU transaction level model device based on SystemC is characterized in that:
the system comprises an OpenGL driving model, a host model, a multi-GPU TLM model, a video memory model, a display model and a plurality of transaction-level interfaces for connecting each model component, wherein the OpenGL driving model, the host model, the multi-GPU TLM model, the video memory model and the display model are sequentially connected;
the OpenGL driving model is used for receiving an OpenGL command issued by a graphic application, carrying out software preprocessing, packaging the OpenGL command into an OpenGL command packet according to a preset format, and then sending an OpenGL command code to the host model;
the host model comprises a command packet broadcast transaction level interface, an architecture register read-write transaction level interface and a transaction level interface for acquiring and clearing a host interrupt state. After receiving the OpenGL command, the host model broadcasts each GPU model OpenGL command of the multi-GPU TLM model to a PCIe unit of each GPU and acquires and clears a host interrupt state of each GPU through a PCIe configuration interface of each GPU model;
the multi-GPU TLM model is formed by a plurality of GPU models in an interconnected mode, and each GPU model comprises a command processing unit, a PCIe unit, a geometric engine unit, a graphic processing subset and a fragment processing unit;
the video memory model provides a writing and reading interface of graphic drawing data for each GPU TLM model;
the display model is used for converting the image data in the RGBA format in the video memory model into a displayable BMP picture format.
2. The SystemC-based multi-GPU transaction level model device of claim 1, wherein:
the GPU model is constructed on the basis of an object-oriented modeling language SystemC;
the GPU TLM model is connected with the host model through an interface provided by a PCIe unit and provides an interrupt state read-write interface method and an architecture register access interface method for the GPU TLM model;
the PCIe unit preprocesses an OpenGL command packet issued by a host model broadcast and then sends the OpenGL command packet to the command processing unit;
and the GPU model command processing unit determines whether to continuously issue the OpenGL command packet or not by judging the OpenGL command function code and the GPU ID.
3. The SystemC-based multi-GPU transaction level model device of claim 2, wherein:
the multi-GPU TLM model divides OpenGL commands into broadcast full execution and broadcast partial execution;
and the multi-GPU model command processing unit determines whether to continuously issue the OpenGL command packet or not by judging the OpenGL command function codes and the GPU ID. If the OpenGL command function code type is broadcast full execution, the command processing units of all GPU models send the command packet to subsequent processing units; if the OpenGL command function code type is broadcast partial execution and the GPU ID is 0, the command processing unit of the GPU model with the GPU ID of 0 sends the command packet to a subsequent processing unit; if the OpenGL command function code type is broadcast partial execution and the GPU ID is not 0, the command processing unit of the GPU model whose GPU ID is not 0 directly discards the command packet without sending it to the subsequent unit.
4. The SystemC-based multi-GPU transaction level model device of claim 3, wherein: and the multiple GPU models perform corresponding graphic drawing and rendering according to the drawing areas indicated by the GPU IDs and write the graphics into the corresponding video memory models.
5. The method of operating a SystemC-based multi-GPU transaction-level model device according to claim 1, characterized in that: the method comprises the following operation steps:
1) the OpenGL driving model receives an OpenGL command issued by a graphic application;
2) the host model implements a transaction-level interface;
3) the video memory model provides a writing-in and reading-out interface of graphic drawing data for each GPU model;
4) the display model converts the image data in the video memory model into a displayable picture format.
6. The operational steps of a SystemC-based multi-GPU transaction-level model device according to claim 5, characterized in that: the step 2) comprises the following steps:
6.1 assemble OpenGL graphics command features into one of two custom command packages-CM 0 and CM 2;
6.2 calling a command packet broadcast transaction-level interface in the host model to broadcast and send the command packet to each GPU model;
6.3 calling a system structure register read-write transaction level interface in the host model to configure a cooperation mode of the multi-GPU TLM model and the coordinates, the width and the height of the starting point of a region which is complexly drawn by the multi-GPU TLM model;
6.4 calling the transaction level interface for obtaining and clearing the host interrupt state in the host model to obtain the frame drawing completion interrupt of each multi-GPU TLM model so as to synchronize the drawing action among the multi-GPU TLM models.
7. The method of operation of a SystemC-based multi-GPU transaction-level model device according to claim 6, wherein: the step 3) comprises the following steps:
7.1 Command packet broadcast
The method comprises the steps that DMA channel configuration methods provided by PCIe configuration interfaces of all GPU models are called, and command packets are broadcasted to PCIe units of all GPU models for processing;
7.2 getting and clearing host interrupt State
Acquiring and clearing the host interrupt state of the GPU model by calling an interrupt state read-write method provided by a PCIe configuration interface of each GPU model;
7.3 architecture register read and write
And configuring the cooperation mode of the GPU model and the coordinates, the width and the height of the starting point of the area which is drawn by the GPU model in a complex way by calling an architecture register access method provided by a PCIe configuration interface of each GPU model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011393054.0A CN112559139B (en) | 2020-12-05 | 2020-12-05 | SystemC-based multi-GPU transaction-level model device and operation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011393054.0A CN112559139B (en) | 2020-12-05 | 2020-12-05 | SystemC-based multi-GPU transaction-level model device and operation method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112559139A true CN112559139A (en) | 2021-03-26 |
CN112559139B CN112559139B (en) | 2022-12-13 |
Family
ID=75047181
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011393054.0A Active CN112559139B (en) | 2020-12-05 | 2020-12-05 | SystemC-based multi-GPU transaction-level model device and operation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112559139B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102648483A (en) * | 2009-12-07 | 2012-08-22 | 智能技术无限责任公司 | Method and computing device for capturing screen images and for identifying screen image changes using a GPU |
US20140368516A1 (en) * | 2013-06-07 | 2014-12-18 | Apple Inc. | Multi-Processor Graphics Rendering |
CN109840878A (en) * | 2018-12-12 | 2019-06-04 | 中国航空工业集团公司西安航空计算技术研究所 | It is a kind of based on SystemC towards GPU parameter management method |
CN111047504A (en) * | 2019-11-21 | 2020-04-21 | 中国航空工业集团公司西安航空计算技术研究所 | TLM microstructure for GPU sub-image processing based on SystemC |
-
2020
- 2020-12-05 CN CN202011393054.0A patent/CN112559139B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102648483A (en) * | 2009-12-07 | 2012-08-22 | 智能技术无限责任公司 | Method and computing device for capturing screen images and for identifying screen image changes using a GPU |
US20140368516A1 (en) * | 2013-06-07 | 2014-12-18 | Apple Inc. | Multi-Processor Graphics Rendering |
CN109840878A (en) * | 2018-12-12 | 2019-06-04 | 中国航空工业集团公司西安航空计算技术研究所 | It is a kind of based on SystemC towards GPU parameter management method |
CN111047504A (en) * | 2019-11-21 | 2020-04-21 | 中国航空工业集团公司西安航空计算技术研究所 | TLM microstructure for GPU sub-image processing based on SystemC |
Non-Patent Citations (3)
Title |
---|
ROHIT SINHA等: ""Parallel simulation of mixed-abstraction SystemC models on GPUs and multicore CPUs"", 《17TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE》 * |
姜丽云等: "基于UML&SystemC的GPU片段处理TLM建模", 《计算机技术与发展》 * |
康国栋: ""基于SystemC TLM模型的GPU图像处理单元验证"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Also Published As
Publication number | Publication date |
---|---|
CN112559139B (en) | 2022-12-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20240020134A1 (en) | Overflow detection and correction in state machine engines | |
CN117237513A (en) | Ray tracing system and method, and method for processing data in ray tracing system | |
US20090125854A1 (en) | Automated generation of theoretical performance analysis based upon workload and design configuration | |
CN109669832A (en) | One kind is towards GPU graphics chip pipeline unit performance verification method and platform | |
CN109840878B (en) | GPU parameter management method based on SystemC | |
CN108563808A (en) | The design method of heterogeneous reconfigurable figure computation accelerator system based on FPGA | |
CN109711003A (en) | One kind is towards GPU graphics chip pipeline unit functional simulation method and platform | |
CN111080761B (en) | Scheduling method and device for rendering tasks and computer storage medium | |
CN109710229B (en) | Architecture verification method and platform for graphics pipeline unit of GPU (graphics processing Unit) chip | |
CN110765715B (en) | GPU chip rendering output unit performance simulation method and platform | |
CN111062173A (en) | GPU chip-oriented rendering output unit function simulation method and platform | |
CN112559139B (en) | SystemC-based multi-GPU transaction-level model device and operation method | |
CN108010113B (en) | Deep learning model execution method based on pixel shader | |
CN110928610B (en) | Method, device and computer storage medium for verifying shader function | |
CN111028128B (en) | GPU-oriented vertex output control method and unit based on SystemC | |
CN109657360A (en) | A kind of modeling method and view system towards GPU chip hardware framework | |
US20200371843A1 (en) | Framework for application driven exploration and optimization of hardware engines | |
CN111383314A (en) | Method and device for verifying shader function and computer storage medium | |
CN111028132A (en) | SystemC-based GPU command processor unit hardware TLM microstructure | |
CN109346028B (en) | TLM-based triangular raster scanning structure | |
CN114092813B (en) | Industrial park image extraction method and system, electronic equipment and storage medium | |
CN112581576B (en) | GPU shielding query TLM device based on SystemC and operation method | |
US20090326888A1 (en) | Vectorized parallel collision detection pipeline | |
CN111242832B (en) | System C-based GPU texture mapping period accurate joint simulation device and method | |
CN104679963A (en) | Simulation and verification device and simulation and verification method based on TCL |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |