CN113220080A - Modular multi-computing-node GPU server structure - Google Patents

Modular multi-computing-node GPU server structure Download PDF

Info

Publication number
CN113220080A
CN113220080A CN202110453659.2A CN202110453659A CN113220080A CN 113220080 A CN113220080 A CN 113220080A CN 202110453659 A CN202110453659 A CN 202110453659A CN 113220080 A CN113220080 A CN 113220080A
Authority
CN
China
Prior art keywords
module
power supply
interface
gpu
computing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110453659.2A
Other languages
Chinese (zh)
Other versions
CN113220080B (en
Inventor
赵玺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Longwei System Technology Co ltd
Original Assignee
Chengdu Longwei System Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Longwei System Technology Co ltd filed Critical Chengdu Longwei System Technology Co ltd
Priority to CN202110453659.2A priority Critical patent/CN113220080B/en
Publication of CN113220080A publication Critical patent/CN113220080A/en
Application granted granted Critical
Publication of CN113220080B publication Critical patent/CN113220080B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/18Packaging or power distribution
    • G06F1/183Internal mounting support structures, e.g. for printed circuit boards, internal connecting means
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/18Packaging or power distribution
    • G06F1/183Internal mounting support structures, e.g. for printed circuit boards, internal connecting means
    • G06F1/185Mounting of expansion boards
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/20Cooling means
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computer Hardware Design (AREA)
  • Power Engineering (AREA)
  • Cooling Or The Like Of Electrical Apparatus (AREA)

Abstract

The invention discloses a modular multi-computing-node GPU server structure which comprises a bottom plate, wherein a plurality of computing module slots, a plurality of GPU card slots, a management system module interface, a network processing module interface, a heat dissipation power supply interface and a power supply input interface are arranged on the bottom plate. The invention has the advantages that: the expansion slots are arranged in a centralized mode, so that high-density slot arrangement is facilitated, various resources such as calculation, storage, management, networks and display cards are effectively integrated, various characteristics such as flexible arrangement and effective expansion of various functional modules are achieved, hot plug installation is achieved, replacement and maintenance are facilitated, when any one module fails, the modules do not affect each other, and normal operation of a server is not affected.

Description

Modular multi-computing-node GPU server structure
Technical Field
The invention relates to the technical field of servers, in particular to a multi-node GPU server structure which is a server bottom plate structure with high density, high expansion and based on modularization technical characteristics. .
Background
Currently, the computational performance of simple GPU products has not been able to meet compute-intensive workloads, such as complex visual computations, large-scale data rendering, etc. in GPU computing application scenarios. If more GPUs are needed, only a plurality of GPU servers are used for stacking, which is not only unfavorable for installation and deployment, but also inevitably causes cost increase and repeated investment.
With the development of the technology, the blade server has appeared, and multiple complete independent GPU systems can be deployed in the same chassis, but due to the influence of the layout structure of the blade server and the structure of the blade itself, high-density expansion cannot be achieved, and the computing module 16 and the GPU module can only be integrated on the same blade. Meanwhile, the blade server is used as high-performance computing equipment, so that the requirements on operation, maintenance, management and exchange performance are particularly high, the standards are not uniform, the expansion performance is poor, and the cost is high.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a modular multi-node GPU server structure.
In order to realize the purpose, the technical scheme adopted by the invention is as follows:
a modular multi-node GPU server architecture, comprising: the system comprises a bottom plate 1, a management system module 2, a network processing module 3, a cooling fan module 4, a PSU power supply module 5, a power input back plate 6, a cooling fan module back plate 7 and a case 8;
the bottom board 1 is provided with a computing module slot 9, a GPU card slot 10, a management system module interface 11, a network processing module interface 12, a heat dissipation power interface 13 and a power input interface 14.
The number of the computing module slots 9 is multiple, each computing module slot 9 is inserted with one computing module 16, so that data communication and electric connection between the computing module 16 and the bottom plate 1 are realized, a hot plug function is realized, and the computing module 16 is a server.
The GPU card slots 10 are multiple, each GPU card slot 10 is correspondingly provided with a GPU card additional power supply access interface, the GPU card slots are used for inserting the display cards 15, and one computing module 16 corresponds to one display card 15;
the management system module interface 11 is used for connecting the management system module 3, the management system module 3 is provided with two electric port network connection ports, and the management system module 3 is used for controlling and monitoring the running state of each module on the bottom plate 1, including starting/disconnecting the expansion template, and controlling the rotating speed and starting and stopping of the fan.
The number of the network processing module interfaces 12 is two, the network processing module interfaces 12 are used for accessing the network processing module 3, and the network processing module 3 is used for realizing data communication between the bottom plate 1 and the outside.
The heat dissipation power supply interface 13 is used for connecting the heat dissipation fan module back plate 7, a plurality of heat dissipation fan module interfaces are arranged on the heat dissipation fan module back plate 7, each heat dissipation fan module interface is connected with one heat dissipation fan module 4, and the heat dissipation fan module 4 is used for dissipating heat for the case 8.
Two power input interfaces 14 are respectively connected with the two power input back plates 6; the power input back plate 6 is provided with two PSU power supply module interfaces, and each PSU power supply module interface is provided with one PSU power supply module 5. The power supply module 5 works in a load balancing redundancy mode, and the power supply safety and reliability of the server system are effectively improved on the basis of considering energy conservation.
The chassis 8 is the shell of the modular multi-node GPU server architecture.
Preferably, the network processing module 3, the cooling fan module 4, the computing module 16, the display card 15, the PSU power supply module 5 and the management system module 2 can be hot-plugged;
preferably, the number of the radiator fan modules 4 is eight.
Preferably, the network processing module 3 is provided with three optical port network connection ports.
Preferably, the GPU card slots 10 are ten in number.
Preferably, the number of computing module slots 9 is ten.
Preferably, the plurality of computing module slots 9 are arranged laterally side by side, and the GPU card slot 10 is located corresponding to the computing module slot 9.
Compared with the prior art, the invention has the advantages that:
1. support the simultaneous centralized placement of multiple compute modules 16 and GPU cards on the same chassis floor.
2. The server backplane structure has the advantages of high availability, high expansion, high density and modularization technology based.
3. The bottom plate is used as a substrate of the modularized layout structure, and all modules are uniformly fused and configured, so that various characteristics such as flexible deployment and effective expansion are realized.
4. The hot plugging installation of multiple functional modules can be realized on the same bottom plate, and the installation, maintenance and operation are very convenient.
5. The functional modules on the same bottom plate are configured with different models of computing modules 16 according to different application requirements.
6. The computing module 16 and the display card on the same bottom plate are separated, and when any one module fails, the modules do not affect each other, and the normal operation of the server is not affected.
7. The monitoring and monitoring of the running state of each functional module system are realized through the management control module, and the management modes include but are not limited to Web end background, PC end application, mobile end APP and the like.
Drawings
FIG. 1 is a schematic structural diagram of a base plate according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of the connection of a backplane to modules according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of the display card and computing module installation of an embodiment of the present invention;
fig. 4 is an installation diagram of a management system module, a cooling fan module and a PSU power supply module according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail below with reference to the accompanying drawings by way of examples.
As shown in fig. 1 to 4, a modular multi-node GPU server architecture comprises: the system comprises a bottom plate 1, a management system module 2, a network processing module 3, a cooling fan module 4, a PSU power supply module 5, a power input back plate 6, a cooling fan module back plate 7 and a case 8;
the bottom board 1 is provided with a computing module slot 9, a GPU card slot 10, a management system module interface 11, a network processing module interface 12, a heat dissipation power interface 13 and a power input interface 14.
The number of the computing module slots 9 is ten, and each computing module slot 9 is inserted with one computing module 16, so that the data communication and the electric connection between the computing module 16 and the bottom plate 1 are realized, and the hot plug function is realized.
The number of the GPU card slots 10 is ten, each GPU card slot 10 is correspondingly provided with a GPU card additional power supply access interface, and the GPU card slots are used for inserting the display cards 15;
the management system module interface 11 is used for connecting the management system module 3, the management system module 3 is provided with two electric port network connection ports, and the management system module 3 is used for controlling and monitoring the running state of each module on the bottom plate 1, including starting/disconnecting the expansion template, and controlling the rotating speed and starting and stopping of the fan.
The number of the network processing module interfaces 12 is two, the network processing module interfaces 12 are used for accessing the network processing module 3, three optical port network connection ports are respectively arranged on the network processing module 3, and different users can select the number of the optical port network connection ports according to requirements to realize data communication between the bottom plate 1 and the outside.
The heat dissipation power supply interface 13 is used for connecting the heat dissipation fan module back plate 7, eight heat dissipation fan module interfaces are arranged on the heat dissipation fan module back plate 7, each heat dissipation fan module interface is connected with one heat dissipation fan module 4, and the heat dissipation fan module 4 is used for dissipating heat for the case 8.
Two power input interfaces 14 are respectively connected with the two power input back plates 6; the power input back plate 6 is provided with two PSU power supply module interfaces, and each PSU power supply module interface is provided with one PSU power supply module 5. The power supply module 5 works in a load balancing redundancy mode, and the power supply safety and reliability of the server system are effectively improved on the basis of considering energy conservation.
The invention can realize hot plug installation of each functional module on the same server bottom plate, and users can configure different numbers of the computing modules 16 and the display cards 15, wherein the computing modules 16 have various optional configurations; meanwhile, the management control module, the cooling fan module, the network processing module and the PSU power supply module which are configured on the bottom plate can be selected according to different requirements.
The bottom plate is used as a substrate of a modular layout structure, and all modules are uniformly configured. The management control module is responsible for monitoring and controlling the running state of each module configured on the bottom plate, so that a user can conveniently operate and maintain each module through the management control module, and the operation and maintenance are very simple.
It will be appreciated by those of ordinary skill in the art that the examples described herein are intended to assist the reader in understanding the manner in which the invention is practiced, and it is to be understood that the scope of the invention is not limited to such specifically recited statements and examples. Those skilled in the art can make various other specific changes and combinations based on the teachings of the present invention without departing from the spirit of the invention, and these changes and combinations are within the scope of the invention.

Claims (7)

1. A modular, multi-node GPU server architecture, comprising: the system comprises a bottom plate (1), a management system module (2), a network processing module (3), a cooling fan module (4), a PSU power supply module (5), a power input back plate (6), a cooling fan module back plate (7) and a case (8);
a computing module slot (9), a GPU card slot (10), a management system module interface (11), a network processing module interface (12), a heat dissipation power interface (13) and a power input interface (14) are arranged on the bottom plate (1);
a plurality of computing module slots (9) are provided, each computing module slot (9) is inserted with one computing module (16) to realize the data communication and the electric connection between the computing module (16) and the bottom plate (1) and realize the hot plug function, and the computing module (16) is a server;
the GPU card slots (10) are provided with a plurality of GPU card slots, each GPU card slot (10) is correspondingly provided with a GPU card additional power supply access interface, the GPU card slots are used for inserting the display cards (15), and one computing module (16) corresponds to one display card (15);
the management system module interface (11) is used for connecting the management system module (3), the management system module (3) is provided with two electric port network connection ports, and the management system module (3) is used for controlling and monitoring the running state of each module on the bottom plate (1), and comprises an opening/closing expansion template for controlling the rotating speed and the starting and the stopping of a fan;
the number of the network processing module interfaces (12) is two, the network processing module interfaces (12) are used for being connected to the network processing module (3), and the network processing module (3) is used for realizing data communication between the bottom plate (1) and the outside;
the heat dissipation power supply interface (13) is used for connecting a heat dissipation fan module backboard (7), a plurality of heat dissipation fan module interfaces are arranged on the heat dissipation fan module backboard (7), each heat dissipation fan module interface is connected with one heat dissipation fan module (4), and each heat dissipation fan module (4) is used for dissipating heat for the case (8);
two power input interfaces (14) are respectively connected with the two power input back plates (6); two PSU power supply module interfaces are arranged on the power input back plate (6), and each PSU power supply module interface is provided with one PSU power supply module (5); the power supply module (5) works in a load balancing redundancy mode, and the power supply safety and reliability of the server system are effectively improved on the basis of considering energy conservation;
the chassis (8) is a shell of a modular multi-node GPU server structure.
2. The modular multi-node GPU server architecture of claim 1, wherein: the network processing module (3), the cooling fan module (4), the computing module (16), the display card (15), the PSU power supply module (5) and the management system module (2) can be connected in a hot-plugging mode.
3. The modular multi-node GPU server architecture of claim 1, wherein: the number of the radiating fan modules (4) is eight.
4. The modular multi-node GPU server architecture of claim 1, wherein: the network processing module (3) is respectively provided with three optical port network connection ports.
5. The modular multi-node GPU server architecture of claim 1, wherein: the number of GPU card slots (10) is ten.
6. The modular multi-node GPU server architecture of claim 5, wherein: the number of the computing module slots (9) is ten.
7. The modular multi-node GPU server architecture of claim 6, wherein: the plurality of computing module slots (9) are arranged side by side and transversely, and the GPU card slot (10) corresponds to the computing module slots (9).
CN202110453659.2A 2021-04-26 2021-04-26 Modularized multi-computing-node GPU server structure Active CN113220080B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110453659.2A CN113220080B (en) 2021-04-26 2021-04-26 Modularized multi-computing-node GPU server structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110453659.2A CN113220080B (en) 2021-04-26 2021-04-26 Modularized multi-computing-node GPU server structure

Publications (2)

Publication Number Publication Date
CN113220080A true CN113220080A (en) 2021-08-06
CN113220080B CN113220080B (en) 2024-05-24

Family

ID=77089244

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110453659.2A Active CN113220080B (en) 2021-04-26 2021-04-26 Modularized multi-computing-node GPU server structure

Country Status (1)

Country Link
CN (1) CN113220080B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117806438A (en) * 2024-02-28 2024-04-02 苏州元脑智能科技有限公司 Control method and device of server heat dissipation device, storage medium and electronic device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN206039401U (en) * 2016-07-30 2017-03-22 济宁市天启联合信息技术有限公司 Modular computer machine case
CN107463224A (en) * 2017-08-28 2017-12-12 北京嘉楠捷思信息技术有限公司 Display card expansion board and host and computing equipment applying same
CN108762430A (en) * 2018-08-02 2018-11-06 成都珑微系统科技有限公司 A kind of cabinet module layout structure
CN109062346A (en) * 2018-08-02 2018-12-21 成都珑微系统科技有限公司 A kind of cabinet bearing structure
CN109918199A (en) * 2019-02-28 2019-06-21 中国科学技术大学苏州研究院 Distributed figure processing system based on GPU
CN110427081A (en) * 2019-08-27 2019-11-08 成都珑微系统科技有限公司 A kind of modularization Edge Server structure
CN214896436U (en) * 2021-04-26 2021-11-26 成都珑微系统科技有限公司 Modular multi-computing-node GPU server structure

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN206039401U (en) * 2016-07-30 2017-03-22 济宁市天启联合信息技术有限公司 Modular computer machine case
CN107463224A (en) * 2017-08-28 2017-12-12 北京嘉楠捷思信息技术有限公司 Display card expansion board and host and computing equipment applying same
CN108762430A (en) * 2018-08-02 2018-11-06 成都珑微系统科技有限公司 A kind of cabinet module layout structure
CN109062346A (en) * 2018-08-02 2018-12-21 成都珑微系统科技有限公司 A kind of cabinet bearing structure
CN109918199A (en) * 2019-02-28 2019-06-21 中国科学技术大学苏州研究院 Distributed figure processing system based on GPU
CN110427081A (en) * 2019-08-27 2019-11-08 成都珑微系统科技有限公司 A kind of modularization Edge Server structure
CN214896436U (en) * 2021-04-26 2021-11-26 成都珑微系统科技有限公司 Modular multi-computing-node GPU server structure

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117806438A (en) * 2024-02-28 2024-04-02 苏州元脑智能科技有限公司 Control method and device of server heat dissipation device, storage medium and electronic device
CN117806438B (en) * 2024-02-28 2024-05-14 苏州元脑智能科技有限公司 Control method and device of server heat dissipation device, storage medium and electronic device

Also Published As

Publication number Publication date
CN113220080B (en) 2024-05-24

Similar Documents

Publication Publication Date Title
CN107656588B (en) Server system with optimized heat dissipation and installation method
CN113220085A (en) Server
US11314666B2 (en) Systems and methods for optimizing clock distribution in NVMe storage enclosures
CN104503556A (en) Air cooling and liquid cooling combination-based redundant backup server radiation system
CN206235977U (en) A kind of VHD server architecture
CN210428286U (en) Modular edge server structure
CN214896436U (en) Modular multi-computing-node GPU server structure
CN203786606U (en) Cabinet type server device
CN113220080B (en) Modularized multi-computing-node GPU server structure
CN202443354U (en) A multi-node cable-free modular computer
CN103375420A (en) Equipment cabinet system and fan control system and control method thereof
CN106919533B (en) 4U high-density storage type server
CN110908863A (en) ARM engine cluster server
CN115481068B (en) Server and data center
CN214896435U (en) Modularization display card extension case structure
CN206696775U (en) Multistage JBOD dual controls storage server is connected based on existing cabinet outside
CN207008492U (en) A kind of modular server and bottom plate
CN203422706U (en) Duel-node high-temperature energy-saving integrated server
WO2016065741A1 (en) Server baseplate
CN214011980U (en) Server with RAS (remote server system) characteristic
CN212623908U (en) Novel server framework
CN115639880A (en) Server
CN114077290B (en) A frame and calculation type server for calculation type server
CN213024394U (en) Cluster equipment for ARM architecture processor
CN115268581A (en) AI edge server system architecture with high performance computing power

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant