Cai et al., 2023 - Google Patents
AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices
Cai et al., 2023
- Document ID
- 628706847608909474
- Author
- Cai B
- Wang B
- Yang M
- Guo Q
- Publication year
- 2023
- Publication venue
- Future Generation Computer Systems
Snippet
Modern user-facing services are progressively evolving from large monolithic applications to complex graphs of loosely-coupled microservices. While microservice architecture greatly improves the efficiency of development and operation, it also complicates resource …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3466—Performance evaluation by tracing or monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3442—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for planning or managing the needed capacity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3668—Software testing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Error detection; Error correction; Monitoring responding to the occurrence of a fault, e.g. fault tolerance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance or administration or management of packet switching networks
- H04L41/14—Arrangements for maintenance or administration or management of packet switching networks involving network analysis or design, e.g. simulation, network model or planning
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance or administration or management of packet switching networks
- H04L41/50—Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Xu et al. | CoScal: Multifaceted scaling of microservices with reinforcement learning | |
Zhu et al. | A performance interference model for managing consolidated workloads in QoS-aware clouds | |
US8140682B2 (en) | System, method, and apparatus for server-storage-network optimization for application service level agreements | |
Cai et al. | AutoMan: Resource-efficient provisioning with tail latency guarantees for microservices | |
US20070250630A1 (en) | Method and a system of generating and evaluating potential resource allocations for an application | |
DE112020002987T5 (en) | DELIVER MICROSERVICES ACROSS A SERVICE INFRASTRUCTURE | |
CN103383655A (en) | Performance interference model for managing consolidated workloads in QoS-aware clouds | |
CA2779993A1 (en) | Configurable resource policies | |
US20120221373A1 (en) | Estimating Business Service Responsiveness | |
US11106509B2 (en) | Cluster tuner | |
Zhang et al. | Autrascale: An automated and transfer learning solution for streaming system auto-scaling | |
WO2020206699A1 (en) | Predicting virtual machine allocation failures on server node clusters | |
Cappiello et al. | CO2-aware adaptation strategies for cloud applications | |
US11429441B2 (en) | Workflow simulator | |
US20200057685A1 (en) | Automatically deploying hardware accelerators based on requests from users | |
Matos et al. | Bottleneck detection in cloud computing performance and dependability: Sensitivity rankings for hierarchical models | |
Ouyang et al. | Mitigating stragglers to avoid QoS violation for time-critical applications through dynamic server blacklisting | |
Zhou et al. | QoS-based Selection of Multi-Granularity Web Services for the Composition. | |
Wang et al. | InSTechAH: An autoscaling scheme for Hadoop in the private cloud | |
Tamanampudi | Development of Real-Time Evaluation Frameworks for Large Language Models (LLMs): Simulating Production Environments to Assess Performance Stability Under Variable System Loads and Usage Scenarios | |
Kounev et al. | Model-based techniques for performance engineering of business information systems | |
Hu et al. | MSARS: A Meta-Learning and Reinforcement Learning Framework for SLO Resource Allocation and Adaptive Scaling for Microservices | |
Araujo et al. | Software aging issues in streaming video player. | |
Parayil et al. | Towards Workload-aware Cloud Efficiency: A Large-scale Empirical Study of Cloud Workload Characteristics | |
Vashistha et al. | [Retracted] Sustainable Technical Debt‐Aware Computing Model for Virtual Machine Migration (TD4VM) in IaaS Cloud |