Mu: An efficient, fair and responsive serverless framework for resource-constrained edge clouds
Mittal et al., 2021
- Document ID
- 14121924111326507814
- Author
- Mittal V
- Qi S
- Bhattacharya R
- Lyu X
- Li J
- Kulkarni S
- Li D
- Hwang J
- Ramakrishnan K
- Wood T
- Publication year
- 2021
- Publication venue
- Proceedings of the ACM symposium on cloud computing
Snippet
Serverless computing platforms simplify development, deployment, and automated management of modular software functions. However, existing serverless platforms typically assume an over-provisioned cloud, making them a poor fit for Edge Computing …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5072—Grid computing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
- G06F9/5088—Techniques for rebalancing the load in a distributed system involving task migration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3442—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for planning or managing the needed capacity
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/10—Network-specific arrangements or communication protocols supporting networked applications in which an application is distributed across nodes in the network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance or administration or management of packet switching networks
- H04L41/50—Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
- H04L41/5003—Managing service level agreement [SLA] or interaction between SLA and quality of service [QoS]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L47/00—Traffic regulation in packet switching networks
- H04L47/10—Flow control or congestion control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network-specific arrangements or communication protocols supporting networked applications
- H04L67/32—Network-specific arrangements or communication protocols supporting networked applications for scheduling or organising the servicing of application requests, e.g. requests for application data transmissions involving the analysis and optimisation of the required network resources
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Mittal et al. | Mu: An efficient, fair and responsive serverless framework for resource-constrained edge clouds | |
| Bhasi et al. | Kraken: Adaptive container provisioning for deploying dynamic DAGs in serverless platforms | |
| Tian et al. | Owl: Performance-aware scheduling for resource-efficient function-as-a-service cloud | |
| US20200137151A1 (en) | Load balancing engine, client, distributed computing system, and load balancing method | |
| US10979491B2 (en) | Determining load state of remote systems using delay and packet loss rate | |
| KR101603928B1 (en) | Maintaining application performances upon transfer between cloud services | |
| Antonescu et al. | Simulation of SLA-based VM-scaling algorithms for cloud-distributed applications | |
| Bi et al. | SLA-based optimisation of virtualised resource for multi-tier web applications in cloud data centres | |
| KR101448413B1 (en) | Method and apparatus for scheduling communication traffic in atca-based equipment | |
| Kalim et al. | Caladrius: A performance modelling service for distributed stream processing systems | |
| US8024737B2 (en) | Method and a system that enables the calculation of resource requirements for a composite application | |
| Lakew et al. | Performance-based service differentiation in clouds | |
| Ogden et al. | Layercake: Efficient inference serving with cloud and mobile resources | |
| Cusack et al. | Escra: Event-driven, sub-second container resource allocation | |
| Razavi et al. | Sponge: Inference serving with dynamic slos using in-place vertical scaling | |
| Björkqvist et al. | Dynamic replication in service-oriented systems | |
| Sajal et al. | Kerveros: Efficient and scalable cloud admission control | |
| Jacquet et al. | Slackvm: Packing virtual machines in oversubscribed cloud infrastructures | |
| Tsenos et al. | Amesos: a scalable and elastic framework for latency sensitive streaming pipelines | |
| Hegazy et al. | Using application benefit for proactive resource allocation in asynchronous real-time distributed systems | |
| Antonescu et al. | Improving management of distributed services using correlations and predictions in SLA-driven cloud computing systems | |
| Chen et al. | Kairos: Low-latency multi-agent serving with shared LLMs and excessive loads in the public cloud | |
| Balla et al. | An auto-scaling framework for predictable open source FaaS function chains | |
| Kalyanasundaram et al. | Load Balancer Filter-Based Approach To Enable Distributed API Rate Limiting | |
| Gu et al. | PathFence: Reducing Cross-Path Dependencies in Microservices |