WO2023043366A3 - Service running method, apparatus, and electronic device - Google Patents
Service running method, apparatus, and electronic device
- Publication number
- WO2023043366A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- gpu
- service
- sub
- deployment mode
- electronic device
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Hardware Redundancy (AREA)
- Mobile Radio Communication Systems (AREA)
- Stored Programmes (AREA)
Abstract
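Embodiments of the present disclosure disclose a service running method, an apparatus, and an electronic device. One specific implementation of the method includes: determining a target deployment mode for a GPU according to performance data of each service in a service set, where a deployment mode comprises partitioning the GPU into sub-GPUs of corresponding sizes and determining the service that each sub-GPU is used to run; and, for each service in the service set, switching the service from running on the sub-GPU indicated by the current deployment mode to running on the sub-GPU indicated by the target deployment mode. When multiple services run on one GPU, this implementation can reduce wasted GPU capacity.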
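The two steps in the abstract (choose a target deployment mode from per-service performance data, then switch services to the sub-GPUs it indicates) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the seven-slice GPU, the set of allowed sub-GPU sizes, the `load_qps` and `qps_per_slice` metrics, and the simple "smallest size that fits" rule. The patent's actual selection procedure is not described on this page.

```python
import math
from dataclasses import dataclass

TOTAL_SLICES = 7                 # assumed GPU capacity, in equal slices (hypothetical)
ALLOWED_SIZES = (1, 2, 3, 4, 7)  # assumed permissible sub-GPU sizes (hypothetical)


@dataclass(frozen=True)
class ServicePerf:
    """Performance data for one service (both metrics are hypothetical)."""
    name: str
    load_qps: float        # observed request rate
    qps_per_slice: float   # measured throughput on a one-slice sub-GPU


def target_deployment(perfs):
    """Return {service name: sub-GPU size in slices} for the target mode."""
    plan = {}
    for p in perfs:
        # Slices needed to carry the observed load, at least one.
        needed = max(1, math.ceil(p.load_qps / p.qps_per_slice))
        fitting = [s for s in ALLOWED_SIZES if s >= needed]
        if not fitting:
            raise ValueError(f"{p.name} needs more than one whole GPU")
        plan[p.name] = fitting[0]  # smallest allowed size that is large enough
    if sum(plan.values()) > TOTAL_SLICES:
        raise ValueError("the service set does not fit on one GPU")
    return plan


def switch_steps(current, target):
    """List (service, old size, new size) for services whose sub-GPU changes."""
    return [
        (name, current.get(name), size)
        for name, size in target.items()
        if current.get(name) != size
    ]


perfs = [
    ServicePerf("detect", load_qps=90, qps_per_slice=50),
    ServicePerf("ocr", load_qps=30, qps_per_slice=40),
    ServicePerf("asr", load_qps=120, qps_per_slice=45),
]
current = {"detect": 2, "ocr": 3, "asr": 1}
target = target_deployment(perfs)
steps = switch_steps(current, target)
```

In this example the current mode gives `ocr` three slices it does not need while `asr` is undersized; the target mode shrinks one and grows the other, and `detect` is left where it is because its sub-GPU size does not change.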
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/532,819 US20240104687A1 (en) | 2021-09-16 | 2023-12-07 | Method and apparatus for runing a service, and electronic device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111088174.4 | 2021-09-16 | | |
CN202111088174.4A CN113791908B (zh) | 2021-09-16 | 2021-09-16 | Service running method, apparatus, and electronic device |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/532,819 Continuation US20240104687A1 (en) | 2021-09-16 | 2023-12-07 | Method and apparatus for runing a service, and electronic device |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2023043366A2 (zh) | 2023-03-23 |
WO2023043366A3 (zh) | 2023-05-11 |
Family
ID=78878696
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/SG2022/050601 WO2023043366A2 (zh) | 2021-09-16 | 2022-08-23 | Service running method, apparatus, and electronic device |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240104687A1 (zh) |
CN (1) | CN113791908B (zh) |
WO (1) | WO2023043366A2 (zh) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279332A (zh) * | 2013-06-09 | 2013-09-04 | 浪潮电子信息产业股份有限公司 | Data stream parallel processing method based on a GPU-CUDA platform and a genetic algorithm |
US20200043123A1 (en) * | 2018-08-02 | 2020-02-06 | Nvidia Corporation | Simultaneous compute and graphics scheduling |
CN111552550A (zh) * | 2020-04-26 | 2020-08-18 | 星环信息科技(上海)有限公司 | Task scheduling method, device, and medium based on graphics processing unit (GPU) resources |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10896064B2 (en) * | 2017-03-27 | 2021-01-19 | International Business Machines Corporation | Coordinated, topology-aware CPU-GPU-memory scheduling for containerized workloads |
US20200007556A1 (en) * | 2017-06-05 | 2020-01-02 | Umajin Inc. | Server kit configured to marshal resource calls and methods therefor |
US20190156246A1 (en) * | 2017-11-21 | 2019-05-23 | Amazon Technologies, Inc. | Generating and deploying packages for machine learning at edge devices |
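CN110227259B (zh) * | 2018-03-06 | 2022-04-29 | 华为技术有限公司 | Data processing method, apparatus, server, and system |
CN111489279B (zh) * | 2019-01-25 | 2023-10-31 | 深圳富联富桂精密工业有限公司 | GPU acceleration optimization method, apparatus, and computer storage medium |
CN111061569B (zh) * | 2019-12-18 | 2023-05-09 | 北京工业大学 | Genetic-algorithm-based task allocation and scheduling strategy for heterogeneous multi-core processors |
CN111580974B (zh) * | 2020-05-08 | 2023-06-27 | 抖音视界有限公司 | GPU instance allocation method, apparatus, electronic device, and computer-readable medium |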
- 2021-09-16: CN application CN202111088174.4A (patent CN113791908B, zh), status: Active
- 2022-08-23: WO application PCT/SG2022/050601 (WO2023043366A2, zh), status: unknown
- 2023-12-07: US application US18/532,819 (US20240104687A1, en), status: Pending
Also Published As
Publication number | Publication date |
---|---|
US20240104687A1 (en) | 2024-03-28 |
CN113791908A (zh) | 2021-12-14 |
CN113791908B (zh) | 2024-03-29 |
WO2023043366A2 (zh) | 2023-03-23 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
TW200723906A (en) | Method and apparatus for accelerated super 3G cell search | |
TW200735570A (en) | A method and apparatus for bootstraping information in a communication system | |
ATE409904T1 (de) | Operating systems | |
EA200700837A1 (ru) | Method and apparatus for power saving in wireless systems | |
WO2009110753A3 (en) | Method and apparatus for image intra prediction | |
HK1118407A1 (en) | System and method for deactivating ip sessions of lower priority | |
WO2007117816A3 (en) | Regrouping wireless devices | |
EP2683194A3 (en) | Method, device and user equipment for transmitting multi-cell scheduling information | |
EP2618609A4 (en) | MULTI-RECEIVING METHOD AND DEVICE THEREFOR | |
WO2008119934A3 (en) | Handover technique for wireless communications enabled devices | |
WO2008030478A3 (en) | System and method for radio frequency resource allocation | |
TW200713032A (en) | Methods and apparatus for dynamically switching processor mode | |
AU2003240676A1 (en) | An apparatus and method for resource allocation in a communication system | |
WO2009028877A3 (en) | Scheduling method and apparatus for high speed video stream service in communication system | |
WO2003041200A3 (en) | Fuel cell system and method for operating same | |
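WO2023043366A3 (zh) | Service running method, apparatus, and electronic device | |
WO2005011144A3 (fr) | Method and device for improving the estimation of a propagation channel of a multicarrier signal | |
JPWO2008149403A1 (ja) | Radio base station apparatus and radio resource connection switching method | |
CN106068667A (zh) | Resource scheduling method and apparatus for co-frequency networking in an LTE trunking system | |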
EP4254701A4 (en) | MULTI-PORT HYBRID DIRECT CURRENT CIRCUIT BREAKER, ASSOCIATED APPARATUS, SYSTEM AND CONTROL METHOD | |
WO2021092843A9 (zh) | Paging method and apparatus | |
WO2006118412A3 (en) | An apparaus and method for receiving signals in multi-carrier multiple access systems | |
AU2003267343A1 (en) | A method for providing telecommunications services, related system and information technology product | |
GB2466626B (en) | Method and apparatus for dynamically determining the scope of services for an infrastructure device operating in logic mode | |
WO2011055925A3 (en) | Method and apparatus for intelligence-oriented service using context information estimation in mobile terminal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| NENP | Non-entry into the national phase | Ref country code: DE |