[go: up one dir, main page]

Imai et al., 2018 - Google Patents

Uncertainty-aware elastic virtual machine scheduling for stream processing systems

Imai et al., 2018

View PDF
Document ID
2342309550672717326
Author
Imai S
Patterson S
Varela C
Publication year
Publication venue
2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)

External Links

Snippet

Stream processing systems deployed on the cloud need to be elastic to effectively accommodate workload variations over time. Performance models can predict maximum sustainable throughput (MST) as a function of the number of VMs allocated. We present a …
Continue reading at wcl.cs.rpi.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/48Programme initiating; Programme switching, e.g. by interrupt
    • G06F9/4806Task transfer initiation or dispatching
    • G06F9/4843Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
    • G06F9/4881Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
    • G06Q10/063Operations research or analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance or administration or management of packet switching networks
    • H04L41/50Network service management, i.e. ensuring proper service fulfillment according to an agreement or contract between two parties, e.g. between an IT-provider and a customer
    • H04L41/5003Managing service level agreement [SLA] or interaction between SLA and quality of service [QoS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance or administration or management of packet switching networks
    • H04L41/14Arrangements for maintenance or administration or management of packet switching networks involving network analysis or design, e.g. simulation, network model or planning
    • H04L41/147Arrangements for maintenance or administration or management of packet switching networks involving network analysis or design, e.g. simulation, network model or planning for prediction of network behaviour

Similar Documents

Publication Publication Date Title
Toka et al. Machine learning-based scaling management for kubernetes edge clusters
Imai et al. Uncertainty-aware elastic virtual machine scheduling for stream processing systems
Radhika et al. A review on prediction based autoscaling techniques for heterogeneous applications in cloud environment
Daraghmeh et al. Time series forecasting using facebook prophet for cloud resource management
Bi et al. Dynamic provisioning modeling for virtualized multi-tier applications in cloud data center
Horovitz et al. Efficient cloud auto-scaling with SLA objective using Q-learning
US9386086B2 (en) Dynamic scaling for multi-tiered distributed systems using payoff optimization of application classes
Trihinas et al. Low-cost adaptive monitoring techniques for the internet of things
Adnan et al. Energy efficient geographical load balancing via dynamic deferral of workload
US9923785B1 (en) Resource scaling in computing infrastructure
Arkian et al. Model-based stream processing auto-scaling in geo-distributed environments
Balaji et al. Predictive Cloud resource management framework for enterprise workloads
Salah et al. Estimating service response time for elastic cloud applications
Shahin Using multiple seasonal holt-winters exponential smoothing to predict cloud resource provisioning
Liu et al. Prorenata: Proactive and reactive tuning to scale a distributed storage system
Monshizadeh Naeen et al. Adaptive Markov‐based approach for dynamic virtual machine consolidation in cloud data centers with quality‐of‐service constraints
CN107566535A (en) Adaptive load balancing strategy based on user concurrent access timing planning in a kind of web map service
Chen et al. Derm: Sla-aware resource management for highly dynamic microservices
US12498982B2 (en) Automated predictive infrastructure scaling
Adegboyega Time-series models for cloud workload prediction: A comparison
Ship et al. Optimizing simultaneous autoscaling for serverless cloud computing
Al-Ghamdi et al. Predictive and dynamic resource allocation for enterprise applications
Ghetas et al. A survey of quality of service in multi-tier web applications
Shenoy et al. Probabilistic modeling of computing demand for service level agreement
Kumar et al. A qos-based reactive auto scaler for cloud environment