WO2023043366A3 - 服务运行方法、装置和电子设备 - Google Patents

服务运行方法、装置和电子设备 Download PDF

Info

Publication number
WO2023043366A3
WO2023043366A3 PCT/SG2022/050601 SG2022050601W WO2023043366A3 WO 2023043366 A3 WO2023043366 A3 WO 2023043366A3 SG 2022050601 W SG2022050601 W SG 2022050601W WO 2023043366 A3 WO2023043366 A3 WO 2023043366A3
Authority
WO
WIPO (PCT)
Prior art keywords
gpu
service
sub
deployment mode
electronic device
Prior art date
Application number
PCT/SG2022/050601
Other languages
English (en)
French (fr)
Other versions
WO2023043366A2 (zh
Inventor
李志超
齐思凯
刘哲瑞
朱亦博
郭传雄
谭丞
张健
王剑
Original Assignee
脸萌有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 脸萌有限公司 filed Critical 脸萌有限公司
Publication of WO2023043366A2 publication Critical patent/WO2023043366A2/zh
Publication of WO2023043366A3 publication Critical patent/WO2023043366A3/zh
Priority to US18/532,819 priority Critical patent/US20240104687A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • G06T1/20Processor architectures; Processor configuration, e.g. pipelining
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Hardware Redundancy (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Stored Programmes (AREA)

Abstract

说明书摘要本公开的实施例公开了服务运行方法、装置和电子设备。该方法的一具体实施方式包括:根据服务集合中每个服务的性能数据,确定 GPU 的目标部署方式,其中,部署方式包括从 GPU 划分具有相应大小的子 GPU 和确定每个子 GPU 用于运行的服务;对于服务集合中的服务,将该服务从运行于当前部署方式指示的子 GPU,切换至运行于目标部署方式指示的子 GPU。该实施方式在通过 GPU 运行多个服务时,可以降低 GPU 的浪费。
PCT/SG2022/050601 2021-09-16 2022-08-23 服务运行方法、装置和电子设备 WO2023043366A2 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US18/532,819 US20240104687A1 (en) 2021-09-16 2023-12-07 Method and apparatus for runing a service, and electronic device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111088174.4 2021-09-16
CN202111088174.4A CN113791908B (zh) 2021-09-16 2021-09-16 服务运行方法、装置和电子设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/532,819 Continuation US20240104687A1 (en) 2021-09-16 2023-12-07 Method and apparatus for runing a service, and electronic device

Publications (2)

Publication Number Publication Date
WO2023043366A2 WO2023043366A2 (zh) 2023-03-23
WO2023043366A3 true WO2023043366A3 (zh) 2023-05-11

Family

ID=78878696

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SG2022/050601 WO2023043366A2 (zh) 2021-09-16 2022-08-23 服务运行方法、装置和电子设备

Country Status (3)

Country Link
US (1) US20240104687A1 (zh)
CN (1) CN113791908B (zh)
WO (1) WO2023043366A2 (zh)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279332A (zh) * 2013-06-09 2013-09-04 浪潮电子信息产业股份有限公司 一种基于gpu-cuda平台以及遗传算法的数据流并行处理方法
US20200043123A1 (en) * 2018-08-02 2020-02-06 Nvidia Corporation Simultaneous compute and graphics scheduling
CN111552550A (zh) * 2020-04-26 2020-08-18 星环信息科技(上海)有限公司 一种基于图形处理器gpu资源的任务调度方法、设备及介质

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10896064B2 (en) * 2017-03-27 2021-01-19 International Business Machines Corporation Coordinated, topology-aware CPU-GPU-memory scheduling for containerized workloads
US20200007556A1 (en) * 2017-06-05 2020-01-02 Umajin Inc. Server kit configured to marshal resource calls and methods therefor
US20190156246A1 (en) * 2017-11-21 2019-05-23 Amazon Technologies, Inc. Generating and deploying packages for machine learning at edge devices
CN110227259B (zh) * 2018-03-06 2022-04-29 华为技术有限公司 一种数据处理的方法、装置、服务器和系统
CN111489279B (zh) * 2019-01-25 2023-10-31 深圳富联富桂精密工业有限公司 Gpu加速优化方法、装置及计算机存储介质
CN111061569B (zh) * 2019-12-18 2023-05-09 北京工业大学 一种基于遗传算法的异构多核处理器任务分配与调度策略
CN111580974B (zh) * 2020-05-08 2023-06-27 抖音视界有限公司 Gpu实例分配方法、装置、电子设备和计算机可读介质

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103279332A (zh) * 2013-06-09 2013-09-04 浪潮电子信息产业股份有限公司 一种基于gpu-cuda平台以及遗传算法的数据流并行处理方法
US20200043123A1 (en) * 2018-08-02 2020-02-06 Nvidia Corporation Simultaneous compute and graphics scheduling
CN111552550A (zh) * 2020-04-26 2020-08-18 星环信息科技(上海)有限公司 一种基于图形处理器gpu资源的任务调度方法、设备及介质

Also Published As

Publication number Publication date
US20240104687A1 (en) 2024-03-28
CN113791908A (zh) 2021-12-14
CN113791908B (zh) 2024-03-29
WO2023043366A2 (zh) 2023-03-23

Similar Documents

Publication Publication Date Title
TW200723906A (en) Method and apparatus for accelerated super 3G cell search
TW200735570A (en) A method and apparatus for bootstraping information in a communication system
ATE409904T1 (de) Betriebssysteme
EA200700837A1 (ru) Способ и устройство для энергосбережения в беспроводных системах
WO2009110753A3 (en) Method and apparatus for image intra prediction
HK1118407A1 (en) System and method for deactivating ip sessions of lower priority
WO2007117816A3 (en) Regrouping wireless devices
EP2683194A3 (en) Method, device and user equipment for transmitting multi-cell scheduling information
EP2618609A4 (en) MULTI-RECEIVING METHOD AND DEVICE THEREFOR
WO2008119934A3 (en) Handover technique for wireless communications enabled devices
WO2008030478A3 (en) System and method for radio frequency resource allocation
TW200713032A (en) Methods and apparatus for dynamically switching processor mode
AU2003240676A1 (en) An apparatus and method for resource allocation in a communication system
WO2009028877A3 (en) Scheduling method and apparatus for high speed video stream service in communication system
WO2003041200A3 (en) Fuel cell system and method for operating same
WO2023043366A3 (zh) 服务运行方法、装置和电子设备
WO2005011144A3 (fr) Procede et dispositif pour ameliorer l’estimation d’un canal de propagation d’un signal multiporteuse
JPWO2008149403A1 (ja) 無線基地局装置、および無線リソース接続切替方法
CN106068667A (zh) 一种lte集群系统同频组网资源调度方法及装置
EP4254701A4 (en) MULTI-PORT HYBRID DIRECT CURRENT CIRCUIT BREAKER, ASSOCIATED APPARATUS, SYSTEM AND CONTROL METHOD
WO2021092843A9 (zh) 寻呼方法及装置
WO2006118412A3 (en) An apparaus and method for receiving signals in multi-carrier multiple access systems
AU2003267343A1 (en) A method for providing telecommunications services, related system and information technology product
GB2466626B (en) Method and apparatus for dynamically determining the scope of services for an infrastructure device operating in logic mode
WO2011055925A3 (en) Method and apparatus for intelligence-oriented service using context information estimation in mobile terminal

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE