CN118541674A - 发布用于图形处理单元工作负载的物理拓扑网络局部性信息 - Google Patents

发布用于图形处理单元工作负载的物理拓扑网络局部性信息 Download PDF

Info

Publication number
CN118541674A
CN118541674A CN202280088684.8A CN202280088684A CN118541674A CN 118541674 A CN118541674 A CN 118541674A CN 202280088684 A CN202280088684 A CN 202280088684A CN 118541674 A CN118541674 A CN 118541674A
Authority
CN
China
Prior art keywords
vcn
host
host machine
network
subnet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280088684.8A
Other languages
English (en)
Chinese (zh)
Inventor
J·布拉尔
D·贝克尔
H·D·科克玛德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oracle International Corp
Original Assignee
Oracle International Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oracle International Corp filed Critical Oracle International Corp
Priority claimed from PCT/US2022/082073 external-priority patent/WO2023136964A1/en
Publication of CN118541674A publication Critical patent/CN118541674A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45545Guest-host, i.e. hypervisor is an application program itself, e.g. VirtualBox
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5072Grid computing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • G06F9/5077Logical partitioning of resources; Management or configuration of virtualized resources
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/5041Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the time relationship between creation and deployment of a service
    • H04L41/5051Service on demand, e.g. definition and deployment of services in real time
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45562Creating, deleting, cloning virtual machine instances
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/4557Distribution of virtual machine instances; Migration and load balancing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/455Emulation; Interpretation; Software simulation, e.g. virtualisation or emulation of application or operating system execution engines
    • G06F9/45533Hypervisors; Virtual machine monitors
    • G06F9/45558Hypervisor-specific management and integration aspects
    • G06F2009/45595Network integration; Enabling network access in virtual machine instances
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/502Proximity
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/5022Workload threshold
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies
    • H04L41/122Discovery or management of network topologies of virtualised topologies, e.g. software-defined networks [SDN] or network function virtualisation [NFV]

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
CN202280088684.8A 2022-01-12 2022-12-20 发布用于图形处理单元工作负载的物理拓扑网络局部性信息 Pending CN118541674A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202263298685P 2022-01-12 2022-01-12
US63/298,685 2022-01-12
US18/050,392 2022-10-27
US18/050,392 US20230222007A1 (en) 2022-01-12 2022-10-27 Publishing physical topology network locality information for graphical processing unit workloads
PCT/US2022/082073 WO2023136964A1 (en) 2022-01-12 2022-12-20 Publishing physical topology network locality information for graphical processing unit workloads

Publications (1)

Publication Number Publication Date
CN118541674A true CN118541674A (zh) 2024-08-23

Family

ID=87069622

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280088684.8A Pending CN118541674A (zh) 2022-01-12 2022-12-20 发布用于图形处理单元工作负载的物理拓扑网络局部性信息

Country Status (5)

Country Link
US (1) US20230222007A1 (https=)
EP (1) EP4463768A1 (https=)
JP (1) JP2025504416A (https=)
KR (1) KR20240132079A (https=)
CN (1) CN118541674A (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12316601B2 (en) * 2022-07-14 2025-05-27 VMware LLC Two tier DNS
WO2025080831A1 (en) * 2023-10-13 2025-04-17 Oracle International Corporation Global virtual planes

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120185867A1 (en) * 2011-01-17 2012-07-19 International Business Machines Corporation Optimizing The Deployment Of A Workload On A Distributed Processing System

Also Published As

Publication number Publication date
JP2025504416A (ja) 2025-02-12
KR20240132079A (ko) 2024-09-02
EP4463768A1 (en) 2024-11-20
US20230222007A1 (en) 2023-07-13

Similar Documents

Publication Publication Date Title
JP7850184B2 (ja) 画像処理装置のルーティングポリシー
CN116982295A (zh) 基于高速缓存和非高速缓存配置信息的云基础设施中的分组流
JP2025522279A (ja) コンテナ環境における通信の実装
US12531785B2 (en) Publishing physical topology network locality for general workloads
CN120153360A (zh) 图形处理单元的超级集群网络
CN119547382A (zh) 基于几何的流编程
WO2025080831A1 (en) Global virtual planes
CN118541674A (zh) 发布用于图形处理单元工作负载的物理拓扑网络局部性信息
US20240054004A1 (en) Dual top-of-rack switch implementation for dedicated region cloud at customer
JP2025531668A (ja) 顧客専用リージョンクラウド向けデュアルトップオブラックスイッチの実装
KR20240154533A (ko) 일반적인 워크로드들에 대한 물리적 토폴로지 네트워크 지역성 퍼블리싱
WO2023136964A1 (en) Publishing physical topology network locality information for graphical processing unit workloads
JP7832965B2 (ja) グラフィック処理ユニットのルーティングポリシー
CN118541675A (zh) 发布用于一般工作负载的物理拓扑网络局部性
JP2025531666A (ja) 顧客専用リージョンクラウド向けネットワークアーキテクチャ
JP2025531667A (ja) 顧客専用リージョンクラウドにおける耐障害サービスの提供
CN117597894A (zh) 用于图形处理单元的路由策略
WO2024039519A1 (en) Multiple top-of-rack (tor) switches connected to a network virtualization device
CN122003849A (zh) 带有流信息的源节点的动态编程
CN122003848A (zh) 带有流信息的源节点的动态编程
CN122003850A (zh) 带有流信息的源节点的动态编程
CN122003854A (zh) 具有降低的时延的端点连接

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination