CN114691385A - Electric power heterogeneous computing system - Google Patents

Electric power heterogeneous computing system Download PDF

Info

Publication number
CN114691385A
CN114691385A CN202111505083.6A CN202111505083A CN114691385A CN 114691385 A CN114691385 A CN 114691385A CN 202111505083 A CN202111505083 A CN 202111505083A CN 114691385 A CN114691385 A CN 114691385A
Authority
CN
China
Prior art keywords
accelerator
gpu
main processor
computing system
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111505083.6A
Other languages
Chinese (zh)
Inventor
高昆仑
林龙
史存存
万能
章海斌
汪晓
黄杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Super High Voltage Branch Of State Grid Anhui Electric Power Co ltd
State Grid Corp of China SGCC
Global Energy Interconnection Research Institute
Original Assignee
Super High Voltage Branch Of State Grid Anhui Electric Power Co ltd
State Grid Corp of China SGCC
Global Energy Interconnection Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Super High Voltage Branch Of State Grid Anhui Electric Power Co ltd, State Grid Corp of China SGCC, Global Energy Interconnection Research Institute filed Critical Super High Voltage Branch Of State Grid Anhui Electric Power Co ltd
Priority to CN202111505083.6A priority Critical patent/CN114691385A/en
Publication of CN114691385A publication Critical patent/CN114691385A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/54Interprogram communication
    • G06F9/544Buffers; Shared memory; Pipes

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Power Sources (AREA)

Abstract

The invention discloses an electric power heterogeneous computing system, which comprises: the circuit board card comprises a main processor, a GPU, an accelerator and a unified memory, wherein the main processor, the GPU and the accelerator are arranged on the circuit board card in an integrated mode, the main processor, the GPU and the accelerator are all connected with the unified memory, and the main processor, the GPU and the accelerator access the unified memory in a unified mode according to a preset access mechanism. The invention utilizes the unified memory to uniformly store the corresponding data of the main processor, the GPU and the accelerator, can solve the problem of time increase caused by copying data from a single memory back and forth, further can reduce the processing time of the system and the power consumption of the system, and obviously improves the data calculation efficiency.

Description

Electric power heterogeneous computing system
Technical Field
The invention relates to the field of electric power data calculation, in particular to an electric power heterogeneous calculation system.
Background
In recent years, the field of Artificial Intelligence (AI for short) has been developed rapidly, and the demand for computing power has been continuously increased, so that an AI server specially designed for AI computing has been created. The AI server mostly adopts a heterogeneous computing system, a Central Processing Unit (CPU) is responsible for the operation and management of an operating system, and a dedicated computing accelerator card is responsible for executing AI computation. At present, in power system computation-intensive tasks, such as large-scale power grid simulation and AI computation, a heterogeneous computation mode, i.e., a computation mode in which a CPU and a dedicated computation accelerator are integrated in a heterogeneous computation architecture and a centralized and distributed manner, is increasingly adopted to meet the requirement of flexibility of power system planning operation on high-performance computing power deployment. In the related art, a memory is usually configured for each CPU individually, and because the memory is configured individually, in the process of accelerating power calculation of each CPU, data needs to be copied from a main memory to another memory of the video memory device, and after the calculation is completed, the calculation result is copied from the memory of the video memory device back to the main memory.
Disclosure of Invention
Therefore, the technical problem to be solved by the present invention is to overcome the problem that in the prior art, a memory is separately configured, in the process of accelerating power calculation of each CPU, data needs to be copied from a main memory to another memory of a video memory device, and after the calculation is completed, the calculation result is copied from the memory of the video memory device back to the main memory.
According to a first aspect, an embodiment of the present invention provides a power heterogeneous computing system, including: the integrated circuit comprises a main processor, a GPU, an accelerator and a unified memory, wherein the main processor, the GPU and the accelerator are arranged on a circuit board card in an integrated mode, the main processor, the GPU and the accelerator are all connected with the unified memory, and the main processor, the GPU and the accelerator uniformly access the unified memory according to a preset access mechanism.
In one embodiment, the host processor, the GPU, and the accelerator are respectively provided with a data cache chip corresponding to each other.
In one embodiment, the host processor comprises a RISC-V processor.
In one embodiment, the main processor further comprises an X86 processor.
In one embodiment, the main processor further comprises an ARM processor.
In one embodiment, mapping information from a virtual address to a physical address is set in the unified memory.
The technical scheme of the invention has the following advantages:
the invention provides a power heterogeneous computing system, which comprises: the circuit board card comprises a main processor, a GPU, an accelerator and a unified memory, wherein the main processor, the GPU and the accelerator are arranged on the circuit board card in an integrated mode, the main processor, the GPU and the accelerator are all connected with the unified memory, and the main processor, the GPU and the accelerator access the unified memory in a unified mode according to a preset access mechanism. The invention utilizes the unified memory to uniformly store the corresponding data of the main processor, the GPU and the accelerator, can solve the problem of time increase caused by copying data from a single memory back and forth, further can reduce the processing time of the system and the power consumption of the system, and obviously improves the computing efficiency of the data.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a schematic structural diagram of an electric power heterogeneous computing system according to an embodiment of the present invention.
Reference numerals:
11-a circuit board card; 110-a main processor; 111-GPU; 112-an accelerator;
113-unified memory; 1101-a first cache chip;
1111-a second cache chip; 1121-third cache chip.
Detailed Description
The technical solutions of the present invention will be described clearly and completely with reference to the accompanying drawings, and it should be understood that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
In the description of the present invention, it should be noted that, unless otherwise explicitly specified or limited, the terms "mounted," "connected," and "connected" are to be construed broadly, e.g., as meaning either a fixed connection, a removable connection, or an integral connection; can be mechanically or electrically connected; the two elements may be directly connected or indirectly connected through an intermediate medium, or may be communicated with each other inside the two elements, or may be wirelessly connected or wired connected. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
In addition, the technical features involved in the different embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
In the field of power data calculation, a memory is usually configured for each CPU individually, and because the memory is configured individually, in the process of accelerating power calculation of each CPU, data needs to be copied from a main memory to another memory of a video memory device, and after calculation is completed, a calculation result is copied from the memory of the video memory device back to the main memory.
In view of this, an embodiment of the present invention provides an electric heterogeneous computing system, as shown in fig. 1, including: the host processor 110, the GPU 111, the accelerator 112, and the unified memory 113 are integrally arranged on the circuit board 11, wherein the host processor 110, the GPU 111, and the accelerator 112 are all connected to the unified memory 113, and the host processor 110, the GPU 111, and the accelerator 112 access the unified memory 113 in a unified manner according to a preset access mechanism.
The circuit board card can be a PCB circuit board, the main processor preferably adopts a RISC-V processor, and because the RISC-V processor is an open source Instruction Set Architecture (ISA) based on the principle of Reduced Instruction Set (RISC), the processing speed of electric heterogeneous computation can be improved, and the system power consumption can be reduced. The main processor is used for processing operating software of factory standard configuration of the computer. And the GPU and the accelerator are used as auxiliary processors of the main processor, and when the main processor relates to data with larger calculation scale, the GPU and the accelerator can be used for assisting to complete calculation instructions. For example: when the computer utilizes the power simulation software to simulate data, the main processor processes and loads some related data which can be normally operated by the power simulation software, and large-scale data related to a system to be simulated can be distributed to the GPU and the accelerator corresponding to the main processor according to a preset distribution rule.
The unified memory is used for uniformly storing corresponding data of the main processor, the GPU and the accelerator. Then, the main processor, the GPU and the accelerator can access the unified memory uniformly according to a preset access mechanism without copying the respective cached data to a specific memory, then waiting for the data to be calculated, and then calling the calculation result from the memory.
In an implementation manner, in fig. 1, in the power heterogeneous computing system in the embodiment of the present invention, the host processor, the GPU, and the accelerator are respectively provided with corresponding data cache chips, which are respectively a first cache chip 1101, a second cache chip 1111, and a third cache chip 1121. The first cache chip is used for caching and updating data of the main processor and updating the data in the unified memory in time according to a preset access mechanism. The second cache chip is used for caching and updating the data of the GPU and updating the data in the unified memory device in time according to a preset access mechanism. The third cache chip is used for caching the data of the updating accelerator and updating the data in the unified memory in time according to a preset access mechanism.
In another implementation manner, in the power heterogeneous computing system in the embodiment of the present invention, the main processor further includes an X86 processor or an ARM processor. The X86 processor is a generic name of a microprocessor architecture first developed and manufactured by Intel, the X86 processor and ARM can also complete the corresponding data calculation of the computer, but the processing speed of the RISC-V processor is better than that of the X86 processor and the ARM processor. Therefore, the RISC-V processor is preferred as the master process by embodiments of the present invention.
In the embodiment of the invention, the GPU is used for providing required high-speed matrix operation for the computer system, including calculation of a deep learning model and the like.
In an implementation manner, in the power heterogeneous computing system in the embodiment of the present invention, mapping information from a virtual address to a physical address is set in the unified memory.
It should be understood that the above examples are only for clarity of illustration and are not intended to limit the embodiments. Other variations and modifications will be apparent to persons skilled in the art in light of the above description. And are neither required nor exhaustive of all embodiments. And obvious variations or modifications therefrom are within the scope of the invention.

Claims (6)

1. A power heterogeneous computing system, comprising: the integrated circuit comprises a main processor, a GPU, an accelerator and a unified memory, wherein the main processor, the GPU and the accelerator are arranged on a circuit board card in an integrated mode, the main processor, the GPU and the accelerator are all connected with the unified memory, and the main processor, the GPU and the accelerator uniformly access the unified memory according to a preset access mechanism.
2. The power heterogeneous computing system according to claim 1, wherein corresponding data cache chips are respectively disposed on the host processor, the GPU and the accelerator.
3. The power heterogeneous computing system of claim 1, wherein the main processor comprises a RISC-V processor.
4. The power heterogeneous computing system of claim 1, wherein the main processor further comprises an X86 processor.
5. The power heterogeneous computing system of claim 1, wherein the main processor further comprises an ARM processor.
6. The power heterogeneous computing system according to claim 1, wherein mapping information of virtual addresses to physical addresses is set in the unified memory.
CN202111505083.6A 2021-12-10 2021-12-10 Electric power heterogeneous computing system Pending CN114691385A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111505083.6A CN114691385A (en) 2021-12-10 2021-12-10 Electric power heterogeneous computing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111505083.6A CN114691385A (en) 2021-12-10 2021-12-10 Electric power heterogeneous computing system

Publications (1)

Publication Number Publication Date
CN114691385A true CN114691385A (en) 2022-07-01

Family

ID=82136214

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111505083.6A Pending CN114691385A (en) 2021-12-10 2021-12-10 Electric power heterogeneous computing system

Country Status (1)

Country Link
CN (1) CN114691385A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567378A (en) * 2010-12-28 2012-07-11 上海杉达学院 Information retrieval system based on heterogeneous data
CN104471540A (en) * 2012-08-17 2015-03-25 英特尔公司 Memory sharing via a unified memory architecture
CN107122162A (en) * 2016-02-25 2017-09-01 深圳市知穹科技有限公司 The core high flux processing system of isomery thousand and its amending method based on CPU and GPU
CN107992436A (en) * 2016-10-26 2018-05-04 杭州华为数字技术有限公司 A kind of NVMe data read-write methods and NVMe equipment
CN112463714A (en) * 2020-11-30 2021-03-09 海光信息技术股份有限公司 Remote direct memory access method, heterogeneous computing system and electronic equipment
US20210081353A1 (en) * 2019-09-17 2021-03-18 Micron Technology, Inc. Accelerator chip connecting a system on a chip and a memory chip

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102567378A (en) * 2010-12-28 2012-07-11 上海杉达学院 Information retrieval system based on heterogeneous data
CN104471540A (en) * 2012-08-17 2015-03-25 英特尔公司 Memory sharing via a unified memory architecture
CN106294214A (en) * 2012-08-17 2017-01-04 英特尔公司 By the Memory Sharing of Unified Memory Architecture
CN107122162A (en) * 2016-02-25 2017-09-01 深圳市知穹科技有限公司 The core high flux processing system of isomery thousand and its amending method based on CPU and GPU
CN107992436A (en) * 2016-10-26 2018-05-04 杭州华为数字技术有限公司 A kind of NVMe data read-write methods and NVMe equipment
US20210081353A1 (en) * 2019-09-17 2021-03-18 Micron Technology, Inc. Accelerator chip connecting a system on a chip and a memory chip
CN112463714A (en) * 2020-11-30 2021-03-09 海光信息技术股份有限公司 Remote direct memory access method, heterogeneous computing system and electronic equipment

Similar Documents

Publication Publication Date Title
CN104021109B (en) Technology for transmitting interruption in computer systems
CN113872796B (en) Server and node equipment information acquisition method, device, equipment and medium thereof
US7353338B2 (en) Credit mechanism for multiple banks of shared cache
CN106547592A (en) A kind of method for designing for realizing Novel cloud service device power supply software online updating
US20160179156A1 (en) Hybrid power management approach
WO2009070912A1 (en) Modifying system routing information in link based systems
CN114691385A (en) Electric power heterogeneous computing system
US20060069900A1 (en) Method and system for supporting virtual mappings for shared firmware
CN111079909B (en) Operation method, system and related product
CN105988945A (en) Heterogeneous multiprocessor system and driving control method thereof
CN111078291B (en) Operation method, system and related product
CN111078125B (en) Operation method, device and related product
CN111079915B (en) Operation method, device and related product
CN111079913B (en) Operation method, device and related product
CN111079907B (en) Operation method, device and related product
CN111079914B (en) Operation method, system and related product
CN111078283B (en) Operation method, device and related product
CN111079911B (en) Operation method, system and related product
CN111079910B (en) Operation method, device and related product
CN111078280B (en) Operation method, device and related product
CN111078281B (en) Operation method, system and related product
CN111079912B (en) Operation method, system and related product
CN112394990A (en) Floating point to half precision floating point instruction processing device and method and related products
CN112394902A (en) Device and method for processing half-precision floating point to floating point instruction and related products
CN112394986A (en) Device and method for processing half-precision floating point to floating point instruction and related products

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 102209 18 Riverside Avenue, Changping District science and Technology City, Beijing

Applicant after: State Grid Smart Grid Research Institute Co.,Ltd.

Applicant after: Super high voltage branch of State Grid Anhui Electric Power Co.,Ltd.

Applicant after: STATE GRID CORPORATION OF CHINA

Address before: 102209 18 Riverside Avenue, Changping District science and Technology City, Beijing

Applicant before: GLOBAL ENERGY INTERCONNECTION RESEARCH INSTITUTE Co.,Ltd.

Applicant before: Super high voltage branch of State Grid Anhui Electric Power Co.,Ltd.

Applicant before: STATE GRID CORPORATION OF CHINA