GB2617712A - Predictive auto-scaler for a hierarchical computing infrastructure

Publication number
GB2617712A
Authority
GB
United Kingdom
Prior art keywords
scaling
level
resource
computing platform
plan
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
GB2308635.8A
Other languages
English (en)
Other versions
GB202308635D0 (en)
Inventor
Paul Wigglesworth Joseph
Rouf Yar
Litoiu Marin
Bogdan Mateescu Radu
Mukherjee Joydeep
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-11-11
Filing date
2021-10-28
Publication date
2023-10-18
Application filed by International Business Machines Corp
Publication of GB202308635D0
Publication of GB2617712A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083 Techniques for rebalancing the load in a distributed system
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005 Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00 Indexing scheme relating to G06F9/00
    • G06F2209/50 Indexing scheme relating to G06F9/50
    • G06F2209/5019 Workload prediction

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
GB2308635.8A 2020-11-11 2021-10-28 Predictive auto-scaler for a hierarchical computing infrastructure Pending GB2617712A (en)

Applications Claiming Priority (2)

Application Number Publication Number Priority Date Filing Date Title
US17/094,856 US11762709B2 (en) 2020-11-11 2020-11-11 Predictive auto-scaler for a hierarchical computing infrastructure
PCT/CN2021/126859 WO2022100438A1 (en) 2020-11-11 2021-10-28 Predictive auto-scaler for a hierarchical computing infrastructure

Publications (2)

Publication Number Publication Date
GB202308635D0 (en) 2023-07-26
GB2617712A (en) 2023-10-18

Family

ID=81454479

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
GB2308635.8A Pending GB2617712A (en) 2020-11-11 2021-10-28 Predictive auto-scaler for a hierarchical computing infrastructure

Country Status (6)

Country Link
US (1) US11762709B2
JP (1) JP7798450B2
CN (1) CN116438519A
DE (1) DE112021005219T5
GB (1) GB2617712A
WO (1) WO2022100438A1

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11762709B2 (en) 2020-11-11 2023-09-19 International Business Machines Corporation Predictive auto-scaler for a hierarchical computing infrastructure
WO2022105473A1 (en) * 2020-11-17 2022-05-27 Zhejiang Dahua Technology Co., Ltd. Systems and methods for data storage and computing
US12106155B2 (en) * 2021-12-20 2024-10-01 Jpmorgan Chase Bank N.A. System and method for performing preemptive scaling of micro service instances in cloud network
US12561165B2 (en) * 2022-08-01 2026-02-24 Visa International Service Association System and method for performing zonal scaling operations
DE102023205395A1 (de) * 2023-06-09 2024-12-12 Robert Bosch Gesellschaft mit beschränkter Haftung Method and device for processing data associated with a technical system
US20250166060A1 (en) * 2023-11-20 2025-05-22 Salesforce, Inc. Generative artificial intelligence (ai) contextual credit metering

Citations (4)

Publication number Priority date Publication date Assignee Title
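CN104040485A (zh) * 2012-01-09 2014-09-10 Microsoft Corporation PaaS hierarchical scheduling and auto-scaling
CN106201718A (zh) * 2016-07-05 2016-12-07 Beijing University of Posts and Telecommunications Dynamic scaling method for cloud computing resources based on load prediction
EP3410301A1 (en) * 2017-05-30 2018-12-05 Hewlett-Packard Enterprise Development LP Virtual network function resource allocation
CN111491006A (zh) * 2020-03-03 2020-08-04 Tianjin University Load-aware elastic allocation system and method for cloud computing resources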

Family Cites Families (12)

Publication number Priority date Publication date Assignee Title
JP2009259005A (ja) 2008-04-17 2009-11-05 Hitachi Ltd Resource monitoring method and apparatus
US9251033B2 (en) * 2011-07-07 2016-02-02 Vce Company, Llc Automatic monitoring and just-in-time resource provisioning system
US20130019015A1 (en) 2011-07-12 2013-01-17 International Business Machines Corporation Application Resource Manager over a Cloud
EP2764436A4 (en) 2011-10-04 2015-12-09 Tier 3 Inc PREDICTIVE TWO-DIMENSIONAL AUTOSCALING
US10552745B2 (en) 2013-10-18 2020-02-04 Netflix, Inc. Predictive auto scaling engine
US9300552B2 (en) * 2013-12-16 2016-03-29 International Business Machines Corporation Scaling a cloud infrastructure
US10452992B2 (en) * 2014-06-30 2019-10-22 Amazon Technologies, Inc. Interactive interfaces for machine learning model evaluations
US9547534B2 (en) 2014-10-10 2017-01-17 International Business Machines Corporation Autoscaling applications in shared cloud resources
EP3560146B1 (en) 2016-12-26 2022-03-02 Morgan Stanley Services Group Inc. Predictive asset optimization for computer resources
JP6681377B2 (ja) 2017-10-30 2020-04-15 Hitachi, Ltd. System and method for optimizing resource allocation
KR20220044717A (ko) * 2019-08-07 2022-04-11 Intel Corporation Methods, systems, articles of manufacture and apparatus to improve job scheduling efficiency
US11762709B2 (en) 2020-11-11 2023-09-19 International Business Machines Corporation Predictive auto-scaler for a hierarchical computing infrastructure

Also Published As

Publication number Publication date
JP7798450B2 (ja) 2026-01-14
CN116438519A (zh) 2023-07-14
US11762709B2 (en) 2023-09-19
WO2022100438A1 (en) 2022-05-19
GB202308635D0 (en) 2023-07-26
JP2023548517A (ja) 2023-11-17
DE112021005219T5 (de) 2023-08-10
US20220147401A1 (en) 2022-05-12

Similar Documents

Publication Publication Date Title
GB2617712A (en) Predictive auto-scaler for a hierarchical computing infrastructure
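CN120144260B (zh) Intelligent computing power and storage scheduling method and system for a multi-service system
KR102245341B1 (ko) Method for applying a prediction model for workload distribution within a cloud edge
KR102681047B1 (ko) Method for selecting priority migration targets when applying a cloud edge platform
CN109936473B (zh) Distributed computing system based on deep learning prediction and operating method thereof
CN108009016A (zh) Resource load balancing control method and cluster scheduler
CN109947532B (zh) Big data task scheduling method in an education cloud platform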
Kumar et al. QoS‐aware resource scheduling using whale optimization algorithm for microservice applications
US12079654B2 (en) Virtual machine deployment method, virtual machine management method having the same and virtual machine management system implementing the same
US11310125B2 (en) AI-enabled adaptive TCA thresholding for SLA assurance
CN118606016A (zh) Task distribution method and apparatus based on Manage and Worker
Cardellini et al. Self-adaptive container deployment in the fog: A survey
CN119690603A (zh) Multi-cloud deployment scheduling method, apparatus, device, medium, and program product
Faraji-Mehmandar et al. A self-learning approach for proactive resource and service provisioning in fog environment
US12273241B2 (en) Method and system for simultaneous optimization of resources in a distributed compute network
CN120892161B (zh) Task scheduling method, apparatus, device, and medium for a civil aviation industry-level data service platform
Vorozhtsov et al. Resource control system stability of mobile data centers
CN113596146B (zh) Method and apparatus for big-data-based resource scheduling
US12608255B2 (en) Methods and apparatuses for selecting fault management models
KR20240053405A (ko) Dynamic split computing method in a serverless edge computing environment
Feng et al. Tango: Harmonious management and scheduling for mixed services co-located among distributed edge-clouds
CN114637586A (zh) Data-driven method for online prediction and implementing k8s resource overselling
US20250077257A1 (en) Method and system for latency optimization in a distributed compute network
Naganandhini et al. Temporal fusion transformer-based strategy for efficient multi-cloud content replication
Luo et al. ADARM: an application-driven adaptive resource management framework for data centers