GB2617712A - Predictive auto-scaler for a hierarchical computing infrastructure - Google Patents
Predictive auto-scaler for a hierarchical computing infrastructure Download PDFInfo
- Publication number
- GB2617712A GB2617712A GB2308635.8A GB202308635A GB2617712A GB 2617712 A GB2617712 A GB 2617712A GB 202308635 A GB202308635 A GB 202308635A GB 2617712 A GB2617712 A GB 2617712A
- Authority
- GB
- United Kingdom
- Prior art keywords
- scaling
- level
- resource
- computing platform
- plan
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
- G06F2209/5019—Workload prediction
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Debugging And Monitoring (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/094,856 US11762709B2 (en) | 2020-11-11 | 2020-11-11 | Predictive auto-scaler for a hierarchical computing infrastructure |
| PCT/CN2021/126859 WO2022100438A1 (en) | 2020-11-11 | 2021-10-28 | Predictive auto-scaler for a hierarchical computing infrastructure |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| GB202308635D0 GB202308635D0 (en) | 2023-07-26 |
| GB2617712A true GB2617712A (en) | 2023-10-18 |
Family
ID=81454479
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| GB2308635.8A Pending GB2617712A (en) | 2020-11-11 | 2021-10-28 | Predictive auto-scaler for a hierarchical computing infrastructure |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US11762709B2 (https=) |
| JP (1) | JP7798450B2 (https=) |
| CN (1) | CN116438519A (https=) |
| DE (1) | DE112021005219T5 (https=) |
| GB (1) | GB2617712A (https=) |
| WO (1) | WO2022100438A1 (https=) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11762709B2 (en) | 2020-11-11 | 2023-09-19 | International Business Machines Corporation | Predictive auto-scaler for a hierarchical computing infrastructure |
| WO2022105473A1 (en) * | 2020-11-17 | 2022-05-27 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for data storage and computing |
| US12106155B2 (en) * | 2021-12-20 | 2024-10-01 | Jpmorgan Chase Bank N.A. | System and method for performing preemptive scaling of micro service instances in cloud network |
| US12561165B2 (en) * | 2022-08-01 | 2026-02-24 | Visa International Service Association | System and method for performing zonal scaling operations |
| DE102023205395A1 (de) * | 2023-06-09 | 2024-12-12 | Robert Bosch Gesellschaft mit beschränkter Haftung | Verfahren und Vorrichtung zum Verarbeiten von mit einem technischen System assoziierten Daten |
| US20250166060A1 (en) * | 2023-11-20 | 2025-05-22 | Salesforce, Inc. | Generative artificial intelligence (ai) contextual credit metering |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104040485A (zh) * | 2012-01-09 | 2014-09-10 | 微软公司 | Paas分层调度和自动缩放 |
| CN106201718A (zh) * | 2016-07-05 | 2016-12-07 | 北京邮电大学 | 一种基于负载预测的云计算资源动态伸缩方法 |
| EP3410301A1 (en) * | 2017-05-30 | 2018-12-05 | Hewlett-Packard Enterprise Development LP | Virtual network function resource allocation |
| CN111491006A (zh) * | 2020-03-03 | 2020-08-04 | 天津大学 | 负载感知的云计算资源弹性分配系统及方法 |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2009259005A (ja) | 2008-04-17 | 2009-11-05 | Hitachi Ltd | リソース監視方法および装置 |
| US9251033B2 (en) * | 2011-07-07 | 2016-02-02 | Vce Company, Llc | Automatic monitoring and just-in-time resource provisioning system |
| US20130019015A1 (en) | 2011-07-12 | 2013-01-17 | International Business Machines Corporation | Application Resource Manager over a Cloud |
| EP2764436A4 (en) | 2011-10-04 | 2015-12-09 | Tier 3 Inc | PREDICTIVE TWO-DIMENSIONAL AUTOSCALING |
| US10552745B2 (en) | 2013-10-18 | 2020-02-04 | Netflix, Inc. | Predictive auto scaling engine |
| US9300552B2 (en) * | 2013-12-16 | 2016-03-29 | International Business Machines Corporation | Scaling a cloud infrastructure |
| US10452992B2 (en) * | 2014-06-30 | 2019-10-22 | Amazon Technologies, Inc. | Interactive interfaces for machine learning model evaluations |
| US9547534B2 (en) | 2014-10-10 | 2017-01-17 | International Business Machines Corporation | Autoscaling applications in shared cloud resources |
| EP3560146B1 (en) | 2016-12-26 | 2022-03-02 | Morgan Stanley Services Group Inc. | Predictive asset optimization for computer resources |
| JP6681377B2 (ja) | 2017-10-30 | 2020-04-15 | 株式会社日立製作所 | リソースの割り当てを最適化するシステム及び方法 |
| KR20220044717A (ko) * | 2019-08-07 | 2022-04-11 | 인텔 코포레이션 | 작업 스케줄링 효율을 향상시키기 위한 방법, 시스템, 제품 및 장치 |
| US11762709B2 (en) | 2020-11-11 | 2023-09-19 | International Business Machines Corporation | Predictive auto-scaler for a hierarchical computing infrastructure |
-
2020
- 2020-11-11 US US17/094,856 patent/US11762709B2/en active Active
-
2021
- 2021-10-28 CN CN202180075777.2A patent/CN116438519A/zh active Pending
- 2021-10-28 JP JP2023526686A patent/JP7798450B2/ja active Active
- 2021-10-28 DE DE112021005219.5T patent/DE112021005219T5/de active Pending
- 2021-10-28 GB GB2308635.8A patent/GB2617712A/en active Pending
- 2021-10-28 WO PCT/CN2021/126859 patent/WO2022100438A1/en not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104040485A (zh) * | 2012-01-09 | 2014-09-10 | 微软公司 | Paas分层调度和自动缩放 |
| CN106201718A (zh) * | 2016-07-05 | 2016-12-07 | 北京邮电大学 | 一种基于负载预测的云计算资源动态伸缩方法 |
| EP3410301A1 (en) * | 2017-05-30 | 2018-12-05 | Hewlett-Packard Enterprise Development LP | Virtual network function resource allocation |
| CN111491006A (zh) * | 2020-03-03 | 2020-08-04 | 天津大学 | 负载感知的云计算资源弹性分配系统及方法 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP7798450B2 (ja) | 2026-01-14 |
| CN116438519A (zh) | 2023-07-14 |
| US11762709B2 (en) | 2023-09-19 |
| WO2022100438A1 (en) | 2022-05-19 |
| GB202308635D0 (en) | 2023-07-26 |
| JP2023548517A (ja) | 2023-11-17 |
| DE112021005219T5 (de) | 2023-08-10 |
| US20220147401A1 (en) | 2022-05-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| GB2617712A (en) | Predictive auto-scaler for a hierarchical computing infrastructure | |
| CN120144260B (zh) | 一种多业务系统的智能算力及存储调度方法和系统 | |
| KR102245341B1 (ko) | 클라우드 엣지 내 워크로드 분산을 위한 예측 모델 적용 방법 | |
| KR102681047B1 (ko) | 클라우드 엣지 플랫폼 적용에 따른 마이그레이션 우선 대상 선정 방법 | |
| CN109936473B (zh) | 基于深度学习预测的分布计算系统及其运行方法 | |
| CN108009016A (zh) | 一种资源负载均衡控制方法及集群调度器 | |
| CN109947532B (zh) | 一种教育云平台中的大数据任务调度方法 | |
| Kumar et al. | QoS‐aware resource scheduling using whale optimization algorithm for microservice applications | |
| US12079654B2 (en) | Virtual machine deployment method, virtual machine management method having the same and virtual machine management system implementing the same | |
| US11310125B2 (en) | AI-enabled adaptive TCA thresholding for SLA assurance | |
| CN118606016A (zh) | 一种基于Manage和Worker的任务分发方法及装置 | |
| Cardellini et al. | Self-adaptive container deployment in the fog: A survey | |
| CN119690603A (zh) | 多云部署调度方法、装置、设备、介质及程序产品 | |
| Faraji-Mehmandar et al. | A self-learning approach for proactive resource and service provisioning in fog environment: M. Faraji-Mehmandar et al | |
| US12273241B2 (en) | Method and system for simultaneous optimization of resources in a distributed compute network | |
| CN120892161B (zh) | 民航行业级数据服务平台的任务调度方法、装置、设备及介质 | |
| Vorozhtsov et al. | Resource control system stability of mobile data centers | |
| CN113596146B (zh) | 一种基于大数据的资源调度的方法及装置 | |
| US12608255B2 (en) | Methods and apparatuses for selecting fault management models | |
| KR20240053405A (ko) | 서버리스 엣지 컴퓨팅 환경에서의 동적 분할 컴퓨팅 방법 | |
| Feng et al. | Tango: Harmonious management and scheduling for mixed services co-located among distributed edge-clouds | |
| CN114637586A (zh) | 一种数据驱动的在线预测和实现k8s资源超售的方法 | |
| US20250077257A1 (en) | Method and system for latency optimization in a distributed compute network | |
| Naganandhini et al. | Temporal fusion transformer-based strategy for efficient multi-cloud content replication | |
| Luo et al. | ADARM: an application-driven adaptive resource management framework for data centers |