CN107797863B - Fine-grained resource matching method in cloud computing platform - Google Patents
Fine-grained resource matching method in cloud computing platform Download PDFInfo
- Publication number
- CN107797863B CN107797863B CN201710909672.8A CN201710909672A CN107797863B CN 107797863 B CN107797863 B CN 107797863B CN 201710909672 A CN201710909672 A CN 201710909672A CN 107797863 B CN107797863 B CN 107797863B
- Authority
- CN
- China
- Prior art keywords
- resource
- task
- resources
- server
- computing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000007726 management method Methods 0.000 claims description 62
- 230000006835 compression Effects 0.000 claims description 37
- 238000007906 compression Methods 0.000 claims description 37
- 238000005070 sampling Methods 0.000 claims description 16
- 238000013468 resource allocation Methods 0.000 claims description 13
- 238000004364 calculation method Methods 0.000 claims description 8
- 238000013467 fragmentation Methods 0.000 abstract description 8
- 238000006062 fragmentation reaction Methods 0.000 abstract description 8
- 230000004044 response Effects 0.000 abstract description 3
- 230000000694 effects Effects 0.000 description 2
- WMUIIGVAWPWQAW-DEOSSOPVSA-N (2s)-2-ethoxy-3-{4-[2-(10h-phenoxazin-10-yl)ethoxy]phenyl}propanoic acid Chemical compound C1=CC(C[C@H](OCC)C(O)=O)=CC=C1OCCN1C2=CC=CC=C2OC2=CC=CC=C21 WMUIIGVAWPWQAW-DEOSSOPVSA-N 0.000 description 1
- KRQUFUKTQHISJB-YYADALCUSA-N 2-[(E)-N-[2-(4-chlorophenoxy)propoxy]-C-propylcarbonimidoyl]-3-hydroxy-5-(thian-3-yl)cyclohex-2-en-1-one Chemical compound CCC\C(=N/OCC(C)OC1=CC=C(Cl)C=C1)C1=C(O)CC(CC1=O)C1CCCSC1 KRQUFUKTQHISJB-YYADALCUSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- VRLIPUYDFBXWCH-UHFFFAOYSA-N hydridocarbon(.) Chemical compound [CH] VRLIPUYDFBXWCH-UHFFFAOYSA-N 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/5038—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
- G06F9/5016—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals the resource being the memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/5033—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering data affinity
Landscapes
- Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
技术领域technical field
本发明属于计算机领域,涉及云计算平台中的资源管理和作业调度方法,具体涉及一种云计算平台中细粒度资源匹配方法。The invention belongs to the computer field, relates to a resource management and job scheduling method in a cloud computing platform, and in particular relates to a fine-grained resource matching method in a cloud computing platform.
背景技术Background technique
云计算是一种基于互联网的计算方式,通过这种方式,用户可以按需获取计算资源、计算能力。云计算平台的基础设施一般由许多计算机节点通过高性能网络互联而成,其将众多节点组织成高性能、高可用、可扩展的单一映像,提供给用户使用。Cloud computing is an Internet-based computing method, through which users can obtain computing resources and computing power on demand. The infrastructure of the cloud computing platform is generally composed of many computer nodes interconnected through a high-performance network, which organizes many nodes into a single image with high performance, high availability, and scalability, and provides it to users.
云计算平台的资源管理中通常基于一种资源(通常是内存)或者将两种固定量的计算资源捆绑定义为slot,并以slot作为资源分配的单位。由于云计算中负载对资源的需求具有多样性,基于固定单位的资源分配方式很容易产生资源碎片而造成资源浪费,或因为过度分配引发不良的资源共享。有些资源(如CPU)的不良共享会导致任务之间竞争资源,执行时间大幅增加。在数据中心中,超过53%的落后任务是由不良共享引发的高资源利用率造成的,并且4%-6%的非正常任务影响着37%-49%的作业,造成作业完成时间的大幅延长。而有些资源(比如内存)的过度分配会直接导致任务失败或者服务器崩溃。The resource management of the cloud computing platform is usually based on one resource (usually memory) or a bundle of two fixed amounts of computing resources is defined as a slot, and the slot is used as the unit of resource allocation. Due to the diverse demands on resources in cloud computing, the resource allocation method based on fixed units is prone to resource fragmentation, resulting in resource waste, or poor resource sharing due to excessive allocation. Poor sharing of some resources (such as CPUs) can lead to competition for resources between tasks, and execution time increases dramatically. In a data center, over 53% of lagging tasks are caused by high resource utilization caused by poor sharing, and 4%-6% of out-of-order tasks affect 37%-49% of jobs, resulting in significant job completion times extend. And some resources (such as memory) over-allocation will directly lead to task failure or server crash.
现有的Yarn、Fuxi、Borg等云计算资源管理平台和Apollo、Omega、Tetris、DRF、Carbyne等云计算调度算法基于作业的资源申请分配固定量资源,以避免资源碎片、过度分配等问题。由于作业的资源申请量通常由人为指定,资源申请与实际使用之间存在很大差异。另外,任务的资源使用量波动很大,不会一直保持在峰值,作业使用任务最大资源使用量作为申请量时,资源申请量与实际使用量之间依然存在差异。因此资源管理平台或者调度器按照资源申请量进行分配时,资源碎片依然存在。基于资源申请的分配方式很难取得高资源利用率,该方式一定程度上限制了集群的资源利用率。Existing cloud computing resource management platforms such as Yarn, Fuxi, Borg, and cloud computing scheduling algorithms such as Apollo, Omega, Tetris, DRF, and Carbyne allocate a fixed amount of resources based on job resource application to avoid resource fragmentation and over-allocation. Since the amount of resource requests for a job is usually designated by humans, there is a big difference between resource requests and actual usage. In addition, the resource usage of a task fluctuates greatly and does not always maintain a peak value. When a job uses the maximum resource usage of a task as the request amount, there is still a difference between the resource request amount and the actual usage amount. Therefore, when the resource management platform or scheduler allocates resources according to the amount of resource requests, resource fragments still exist. The allocation method based on resource application is difficult to achieve high resource utilization rate, which limits the resource utilization rate of the cluster to a certain extent.
因此,如何在云计算资源管理和作业调度中避免资源碎片及过度分配,成为了云计算平台研究中的重要问题。Therefore, how to avoid resource fragmentation and over-allocation in cloud computing resource management and job scheduling has become an important issue in cloud computing platform research.
发明内容SUMMARY OF THE INVENTION
本发明的目的在于提供一种云计算平台中细粒度资源匹配方法,该方法在分配时间、资源量两方面提高匹配粒度的同时,具有较低的调度平均响应时间,能够有效提高平台中计算资源的利用效率,提升云计算平台的整体吞吐率。The purpose of the present invention is to provide a fine-grained resource matching method in a cloud computing platform, which improves the matching granularity in terms of allocation time and resource amount, and has a lower average response time for scheduling, which can effectively improve the computing resources in the platform. improve the utilization efficiency of cloud computing platform and improve the overall throughput rate of cloud computing platform.
本发明是通过以下技术方案来实现:The present invention is achieved through the following technical solutions:
本发明公开了一种云计算平台中细粒度资源匹配方法,包括以下步骤:The invention discloses a fine-grained resource matching method in a cloud computing platform, comprising the following steps:
步骤1:将云计算平台中服务器的角色分为计算服务器和管理服务器两种,计算服务器负责具体负载的执行,并定期向管理服务器汇报资源状态;管理服务器负责整个云计算平台的管理工作,包括向计算服务器分配计算任务;Step 1: The roles of the servers in the cloud computing platform are divided into two types: computing servers and management servers. The computing server is responsible for the execution of specific loads and regularly reports the resource status to the management server; the management server is responsible for the management of the entire cloud computing platform, including Allocate computing tasks to computing servers;
步骤2:管理服务器接收各计算服务器定期汇报的信息,并依据相似任务和资源压缩率推测某个任务的各种资源需求和持续时间;Step 2: The management server receives the information regularly reported by each computing server, and infers various resource requirements and durations of a certain task based on similar tasks and resource compression ratios;
其中,相似任务指负载中与所述某个任务具有相同执行逻辑且输入数据量相同的任务;Wherein, similar tasks refer to tasks in the load that have the same execution logic and the same amount of input data as the certain task;
步骤3:管理服务器分析推测得出的任务各进度的CPU、内存空间资源需求,将任务划分为多个执行阶段;Step 3: The management server analyzes the inferred CPU and memory space resource requirements of each progress of the task, and divides the task into multiple execution stages;
步骤4:管理服务器从待调度集合中挑选任务,分阶段匹配任务资源需求和服务器可用计算资源,并根据需要压缩匹配资源需求;Step 4: The management server selects tasks from the to-be-scheduled set, matches task resource requirements and server available computing resources in stages, and compresses and matches resource requirements as needed;
步骤5:若资源需求匹配成功,则管理服务器假设资源已经分配,检查该计算服务器上所有任务是否会受到影响而不能满足约束条件,若所有任务约束条件均满足,则分配计算资源。Step 5: If the resource requirements are matched successfully, the management server assumes that the resources have been allocated, and checks whether all tasks on the computing server will be affected and cannot satisfy the constraints, and if all the task constraints are satisfied, the computing resources are allocated.
优选地,步骤5后还包括以下操作:管理服务器检查该计算服务器上是否剩余足够资源进入下一轮匹配,如果剩余资源满足条件,则进入下一轮匹配,即重复操作步骤4和步骤5。Preferably, after step 5, the following operations are further included: the management server checks whether there are enough resources left on the computing server to enter the next round of matching, and if the remaining resources meet the conditions, enter the next round of matching, that is, repeat steps 4 and 5.
优选地,步骤1中,计算服务器负责具体负载的执行,并定期向管理服务器汇报资源状态,具体是指:计算服务器定期采集本服务器运行中任务资源使用情况,计算本服务器可用资源信息并汇报给管理服务器;Preferably, in
其中,计算服务器上某种资源的可用资源量r按式(1)计算:Among them, the available resource r of a certain resource on the computing server is calculated according to formula (1):
式中,ri为第i次采样的资源量,ti为第i次采样的持续时间,T为采样的计算总时间,n为采样次数。In the formula, ri is the resource amount of the ith sampling, t i is the duration of the ith sampling, T is the total calculation time of the sampling, and n is the number of samplings.
优选地,步骤2中,据相似任务和资源压缩率推测某个任务的各种资源需求和持续时间,资源需求类型包括CPU、内存空间、磁盘空间、磁盘带宽及网络带宽;Preferably, in step 2, various resource requirements and durations of a certain task are estimated according to similar tasks and resource compression ratios, and the types of resource requirements include CPU, memory space, disk space, disk bandwidth and network bandwidth;
其中,将任务的磁盘和网络带宽资源需求量按照任务与数据的相对位置分为三类进行推测,第一类:任务与数据在同服务器;第二类:任务与数据在同机架;第三类:其他;Among them, the disk and network bandwidth resource requirements of the task are divided into three categories according to the relative position of the task and the data. The first category: the task and the data are on the same server; Three categories: other;
具体操作为:The specific operations are:
任务在某进度所需的资源需求和持续时间按式(2)计算:The resource requirements and duration required by a task in a certain progress are calculated according to formula (2):
式中,αn为第n次推测结果,βn为第n次的相似任务资源信息,rc为任务信息的资源压缩率,Thr为最大压缩率限制系数,e为自然底数。In the formula, α n is the nth estimation result, β n is the nth similar task resource information, rc is the resource compression rate of the task information, Thr is the maximum compression rate limit coefficient, and e is the natural base.
优选地,步骤3具体操作为:Preferably, the specific operations of step 3 are:
管理服务器遍历任务在各进度的CPU、内存空间资源需求量,并分别记录这两种资源量的最大值和最小值;The management server traverses the CPU and memory space resource requirements of the task in each progress, and records the maximum and minimum values of these two resources respectively;
当CPU或者内存空间资源需求量的最大值和最小值之差大于划分阈值,并且遍历的进度达到划分长度,则将遍历的进度划分为任务的一个执行阶段;When the difference between the maximum and minimum CPU or memory space resource requirements is greater than the division threshold, and the progress of the traversal reaches the division length, the progress of the traversal is divided into an execution stage of the task;
任务执行阶段CPU、内存空间及磁盘空间的资源需求值为执行阶段内各进度该资源需求的最大值,磁盘带宽、网络带宽资源的需求值为执行阶段内各进度该资源需求的平均值。The resource requirements of CPU, memory space and disk space in the task execution phase are the maximum values of the resource requirements of each progress in the execution phase, and the requirements of disk bandwidth and network bandwidth resources are the average values of the resource requirements of each progress in the execution phase.
进一步优选地,参见图1,任务执行阶段的划分方法,具体包括以下步骤:Further preferably, referring to Fig. 1, the method for dividing the task execution stage specifically includes the following steps:
1)P、Ps、Pe分别代表任务进度、阶段开始进度和阶段结束进度,Cmax、Cmin、Mmax和Mmin分别表示阶段内CPU、内存空间资源需求的最大值和最小值;其中,P、Ps、Pe初始化为0;Cmin、Mmin为100,Cmax、Mmax初始化为0;1) P, P s and P e represent task progress, stage start progress and stage end progress respectively, C max , C min , M max and M min respectively represent the maximum and minimum values of CPU and memory space resource requirements in the stage; Among them, P, P s , and Pe are initialized to 0; C min and M min are 100, and C max and M max are initialized to 0;
2)任务进度P和阶段结束进度Pe增加1,如果任务进度P达到100%,则将Ps到100%划分为新的阶段之后结束,否则继续步骤3);2) The task progress P and the stage end progress P e are increased by 1, if the task progress P reaches 100%, then divide P s to 100% as a new stage and end, otherwise continue to step 3);
3)如果当前进度CPU需求Cp大于Cmax,则将Cmax的值更新为Cp,进入步骤5);3) If the current progress CPU demand C p is greater than C max , update the value of C max to C p , and go to step 5);
4)如果当前进度CPU需求Cp小于Cmin,则将Cmin的值更新为Cp;4) If the current progress CPU requirement C p is less than C min , update the value of C min to C p ;
5)如果当前进度内存需求Mp大于Mmax,则将Mmax的值更新为Mp,进入步骤7);5) If the current progress memory requirement M p is greater than M max , update the value of M max to M p , and enter step 7);
6)如果当前进度内存需求Mp小于Mmin,则将Mmin的值更新为Mp;6) If the current progress memory requirement M p is less than M min , update the value of M min to M p ;
7)如果Cmax与Cmin的差值大于CPU资源总量C与阈值Thc的乘积或者Mmax与Mmin的差值大于内存总量M与阈值Thm的乘积,则进入步骤8,否则进入步骤2);7) If the difference between C max and C min is greater than the product of the total amount of CPU resources C and the threshold Th c or the difference between M max and M min is greater than the product of the total memory M and the threshold Th m , then go to step 8, otherwise Go to step 2);
8)如果Pe与Ps的差值大于阈值Thp,则将Ps到Pe划分为新阶段,将Ps更新为Pe,重新初始化Cmin、Mmin为100,Cmax、Mmax为0;否则,回到步骤2)。8) If the difference between Pe and P s is greater than the threshold Th p , divide P s to Pe into new stages, update P s to Pe , re-initialize C min , M min to 100, C max , M max is 0; otherwise, go back to step 2).
优选地,步骤4具体操作为:Preferably, the specific operations of step 4 are:
首先,管理服务器检查计算服务器上可用计算资源是否达到匹配要求;First, the management server checks whether the available computing resources on the computing server meet the matching requirements;
其次,管理服务器按资源分配公平性和数据本地性策略排序待调度集合;Second, the management server sorts the to-be-scheduled sets according to resource allocation fairness and data locality policies;
最后,资源管理服务器取出任务,从推测结果中获取任务的推测信息;Finally, the resource management server takes out the task, and obtains the guess information of the task from the guess result;
其中,若推测信息获取失败,则按照任务资源申请量匹配计算资源;若推测信息获取成功,则分阶段依次匹配任务的资源需求。Wherein, if the acquisition of the presumed information fails, the computing resources are matched according to the application amount of the task resources; if the acquisition of the presumed information is successful, the resource requirements of the tasks are matched in sequence in stages.
优选地,管理服务器根据资源特性将计算资源抽象为可压缩资源和不可压缩资源;其中,计算资源包括CPU资源、内存资源、磁盘资源及网络资源;Preferably, the management server abstracts the computing resources into compressible resources and incompressible resources according to resource characteristics; wherein the computing resources include CPU resources, memory resources, disk resources and network resources;
在任务执行过程中,若任务分配到的某种资源少于任务对该资源需求量,任务能够通过延长执行时间正常完成,则该资源为可压缩资源,否则为不可压缩资源;In the process of task execution, if a certain resource allocated by the task is less than the task's demand for the resource, and the task can be completed normally by extending the execution time, the resource is a compressible resource, otherwise it is an incompressible resource;
计算服务器上的资源压缩率rc按式(3)计算:The resource compression ratio rc on the computing server is calculated according to formula ( 3 ):
rc为资源压缩率,Rr为资源需求量,Ru为资源分配量; rc is the resource compression rate, R r is the resource demand, and R u is the resource allocation;
若资源为不可压缩资源,则其资源压缩率rc始终为0。If the resource is an incompressible resource, its resource compression ratio rc is always 0.
优选地,在匹配过程中,如果需求量大于可用资源量,则匹配失败;Preferably, in the matching process, if the demand is greater than the available resources, the matching fails;
对于不可压缩资源,计算服务器对该资源的可用资源量不做处理;For an incompressible resource, the computing server does not process the available resource amount of the resource;
对于可压缩资源,管理服务器依据计算服务器资源及负载情况计算该服务器各阶段各种资源的最大压缩率,计算服务器在各阶段内某种可压缩资源的最大压缩率rmax按式(4)计算:For compressible resources, the management server calculates the maximum compression ratio of various resources in each stage of the server according to the computing server resources and load conditions, and calculates the maximum compression ratio rmax of a certain compressible resource in each stage of the computing server according to formula (4). :
式中,服务器上已匹配n个任务,当前匹配任务为第n+1个,n+1个任务的资源总需求大于资源总量,第i个任务在阶段内完成的工作量为wi,Δp为资源压缩引起的性能变化;In the formula, n tasks have been matched on the server, the current matching task is the n+1th task, the total resource demand of the n+1 task is greater than the total amount of resources, and the workload completed by the ith task in the stage is w i , Δp is the performance change caused by resource compression;
可压缩资源匹配中,计算服务器该资源的实际可用资源量按式(5)计算,In the compressible resource matching, the actual available resource amount of the resource of the computing server is calculated according to formula (5),
ai=ni+ri×Ni (5);a i =n i +r i ×N i (5);
式中,ri为该服务器上该种资源的当前匹配阶段的最大可压缩率,Ni为该服务器上该种资源的总量,ni为采集得到的该种资源的可用资源量;In the formula, ri is the maximum compressibility ratio of the current matching stage of the resource on the server, Ni is the total amount of the resource on the server , and ni is the available resource amount of the resource obtained by collection;
若任务的所有执行阶段所有资源的需求均未匹配失败,则匹配成功。If the requirements of all resources in all execution stages of the task are not matched and failed, the matching is successful.
与现有技术相比,本发明具有以下有益的技术效果:Compared with the prior art, the present invention has the following beneficial technical effects:
本发明公开的云计算平台中细粒度资源匹配方法,根据资源特性将计算资源抽象为可压缩和不可压缩两种类型;根据云计算负载中存在的相似任务和资源压缩率推测任务的资源需求及持续时间;根据任务资源需求将任务划分为多个执行阶段,并分阶段分资源类型分别匹配资源需求和可用资源;在匹配过程中,通过资源压缩的方式,以轻微延长任务完成时间为代价换取整体资源利用率和负载性能的提升。该方法在分配时间、资源量两方面提高匹配粒度的同时,具有较低的调度平均响应时间,因而可以应用在各云计算平台资源管理、作业调度等方面,避免资源碎片和过度分配等问题,提高平台中计算资源的利用效率,最终提升云计算平台的整体吞吐率。The fine-grained resource matching method in the cloud computing platform disclosed by the invention abstracts the computing resources into two types: compressible and incompressible according to resource characteristics; infers the resource requirements and Duration: Divide the task into multiple execution stages according to the task resource requirements, and match the resource requirements and available resources respectively by stages and resource types; in the matching process, the resource compression method is used to slightly extend the task completion time in exchange for Overall resource utilization and load performance improvement. This method improves the matching granularity in terms of allocation time and resource amount, and has a lower average scheduling response time, so it can be applied to resource management and job scheduling of various cloud computing platforms to avoid problems such as resource fragmentation and over-allocation. Improve the utilization efficiency of computing resources in the platform, and ultimately improve the overall throughput rate of the cloud computing platform.
附图说明Description of drawings
图1为任务执行阶段划分算法的流程框图。Fig. 1 is a flow chart of a task execution phase partitioning algorithm.
图2为在Yarn平台中实现细粒度资源匹配方法的架构图。Figure 2 is an architecture diagram for implementing a fine-grained resource matching method in the Yarn platform.
具体实施方式Detailed ways
下面结合具体的实施例对本发明做进一步的详细说明,所述是对本发明的解释而不是限定。The present invention will be further described in detail below in conjunction with specific embodiments, which are to explain rather than limit the present invention.
本发明公开的云计算平台中细粒度资源匹配方法,依据相似任务推测任务资源需求及持续时间;根据资源特性将资源抽象为可压缩和不可压缩两种类型;将任务划分为多个执行阶段并依次匹配各执行阶段各种类型资源的需求。该方法能够在云计算平台资源管理和作业调度匹配资源时通过提高资源分配的时间粒度和资源量粒度,避免资源管理中的资源碎片和过度分配等问题,从而实现提高资源利用率和性能。The fine-grained resource matching method in a cloud computing platform disclosed by the invention predicts task resource requirements and duration according to similar tasks; abstracts resources into two types: compressible and incompressible according to resource characteristics; divides tasks into multiple execution stages and Match the requirements of various types of resources in each execution stage in turn. The method can improve resource utilization and performance by improving resource allocation time granularity and resource amount granularity when matching resources in cloud computing platform resource management and job scheduling, avoiding resource fragmentation and over-allocation in resource management.
本发明的公开的云计算平台中细粒度资源匹配方法,包括以下步骤:The fine-grained resource matching method in the disclosed cloud computing platform of the present invention includes the following steps:
步骤1:云计算平台中服务器的角色分为计算服务器和管理服务器两种:计算服务器负责具体负载的执行,并定期向管理服务器汇报资源状态;管理服务器负责整个云平台的各种管理工作,包括向计算服务器分配计算任务。Step 1: The roles of servers in the cloud computing platform are divided into two types: computing servers and management servers: the computing server is responsible for the execution of specific loads and regularly reports the resource status to the management server; the management server is responsible for various management tasks of the entire cloud platform, including Allocate computing tasks to computing servers.
计算服务器定期采集本服务器运行中任务资源使用情况,计算本服务器可用资源信息并汇报给管理服务器。The computing server periodically collects the resource usage of tasks in the running of the server, calculates the available resource information of the server, and reports it to the management server.
计算服务器上某种资源的可用资源量按以下公式计算:The amount of resources available for a resource on a computing server is calculated by the following formula:
其中,ri为第i次采样的资源量,ti为第i次采样的持续时间,T为采样的计算总时间,n为采样次数。Among them, ri is the resource amount of the ith sampling, t i is the duration of the ith sampling, T is the total calculation time of the sampling, and n is the number of sampling times.
步骤2:管理服务器接收各计算服务器定期汇报的信息,并依据相似任务和资源压缩率推测任务的各种资源需求和持续时间。此处,相似任务指负载中与该任务具有相同执行逻辑且输入数据量相同的任务。任务的磁盘、网络带宽资源需求量按照任务与数据的相对位置分为任务与数据在同服务器、在同机架和其他3类分别推测。Step 2: The management server receives the information periodically reported by each computing server, and infers various resource requirements and durations of tasks according to similar tasks and resource compression ratios. Here, similar tasks refer to tasks in the workload that have the same execution logic and the same amount of input data as the task. The disk and network bandwidth resource requirements of tasks are divided into three categories according to the relative positions of tasks and data: tasks and data are in the same server, in the same rack, and other three categories.
任务在某进度所需的一种计算资源需求量及时间按以下公式计算:A computing resource requirement and time required by a task in a certain progress are calculated according to the following formula:
其中,αn为第n次推测结果,βn为第n次的相似任务资源信息,rc为任务信息的资源压缩率,Thr为最大压缩率限制系数,e为自然底数。Among them, α n is the nth estimation result, β n is the nth similar task resource information, rc is the resource compression rate of the task information, Thr is the maximum compression rate limit coefficient, and e is the natural base.
步骤3:管理服务器分析任务各进度的CPU、内存空间资源需求,将任务划分为多个执行阶段。Step 3: The management server analyzes the CPU and memory space resource requirements of each progress of the task, and divides the task into multiple execution stages.
管理服务器遍历任务在各进度的CPU、内存空间资源需求量,并分别记录这两种资源量的最大值和最小值。当CPU或者内存空间资源需求量的最大值和最小值之差大于一定阈值,并且遍历的进度达到一定长度,则将遍历的进度划分为任务的一个执行阶段。任务执行阶段CPU、内存空间、磁盘空间等资源需求值为阶段内各进度该资源需求的最大值,内存带宽、磁盘带宽、网络带宽等资源的需求值为阶段内各进度该资源需求的平均值。The management server traverses the CPU and memory space resource requirements of the task in each progress, and records the maximum and minimum values of these two resources respectively. When the difference between the maximum value and the minimum value of CPU or memory space resource requirements is greater than a certain threshold, and the progress of the traversal reaches a certain length, the progress of the traversal is divided into one execution stage of the task. In the task execution phase, the resource requirements such as CPU, memory space, and disk space are the maximum values of the resource requirements of each progress in the phase. The resource requirements such as memory bandwidth, disk bandwidth, and network bandwidth are the average values of the resource requirements of each progress in the phase. .
步骤4:当计算服务器上可用计算资源达到一定量时,管理服务器从待调度集合中挑选任务,分阶段匹配任务资源需求和服务器可用计算资源。Step 4: When the available computing resources on the computing server reach a certain amount, the management server selects tasks from the to-be-scheduled set, and matches the task resource requirements and the available computing resources of the server in stages.
管理服务器首先检查计算服务器上可用资源是否达到匹配要求。其次,管理服务器按照资源分配公平性、数据本地性等策略排序待调度集合。再次,资源管理服务器取出任务,并尝试从推测结果中获取任务的推测信息。如果推测信息获取失败,则按照任务资源申请量匹配计算资源。如果推测信息获取成功,则分阶段依次匹配任务的资源需求。The management server first checks whether the available resources on the computing server meet the matching requirements. Second, the management server sorts the to-be-scheduled sets according to policies such as resource allocation fairness and data locality. Again, the resource management server takes out the task and tries to obtain the task's speculation information from the speculation result. If the presumed information acquisition fails, the computing resources are matched according to the requested amount of task resources. If the speculative information is obtained successfully, the resource requirements of the tasks are matched in sequence in stages.
管理服务器根据资源特性将CPU、内存、磁盘、网络等计算资源抽象为可压缩和不可压缩两类。在任务执行过程中,如果任务分配到的某种资源少于任务对该资源的需求量,任务可以通过延长执行时间正常完成,则该资源为可压缩资源,否则为不可压缩资源。The management server abstracts computing resources such as CPU, memory, disk, and network into two categories: compressible and incompressible, according to resource characteristics. During task execution, if a certain resource allocated by the task is less than the task's demand for the resource, and the task can be completed normally by extending the execution time, the resource is a compressible resource, otherwise it is an incompressible resource.
计算服务器上资源压缩率按以下公式计算:The resource compression ratio on the calculation server is calculated according to the following formula:
其中,rc为资源压缩率,Rr为资源的需求量,Ru为资源分配量。如果资源为不可压缩资源,则其压缩率rc始终为0。Among them, rc is the resource compression ratio, R r is the resource demand, and R u is the resource allocation. If the resource is incompressible, its compression ratio rc is always 0.
在匹配过程中,如果需求量大于可用资源量,则匹配失败。对于不可压缩资源,计算服务器该资源的可用资源量不做处理;对于可压缩资源,管理服务器依据计算服务器资源及负载情况计算该服务器各阶段各种资源的最大压缩率。During the matching process, if the demand is greater than the available resources, the matching fails. For incompressible resources, the calculation server does not process the available resources of the resource; for compressible resources, the management server calculates the maximum compression ratio of various resources of the server at each stage according to the calculation server resources and load conditions.
计算服务器在阶段内某种可压缩资源的最大压缩率按以下公式计算:The maximum compression ratio of a compressible resource of a computing server in a stage is calculated by the following formula:
其中,服务器上已匹配n个任务,当前匹配任务为第n+1个,n+1个任务的资源总需求大于资源总量,第i个任务在阶段内完成的工作量为wi,△p为资源压缩引起的性能变化。Among them, n tasks have been matched on the server, the current matching task is the n+1th task, the total resource demand of the n+1 task is greater than the total amount of resources, and the workload completed by the ith task in the stage is w i , △ p is the performance change caused by resource compression.
可压缩资源匹配中,计算服务器该资源的实际可用资源量按公式ai=ni+ri×Ni计算;其中,ri为该服务器上该种资源的当前匹配阶段的最大可压缩率,Ni为该服务器上该种资源的总量,ni为采集得到的该种资源的可用资源量。如果任务的所有执行阶段所有资源的需求均未匹配失败,则匹配成功。In the compressible resource matching, the actual available resource amount of the resource on the computing server is calculated according to the formula a i =n i +r i ×N i ; where ri is the maximum compressibility ratio of the current matching stage of the resource on the server , Ni is the total amount of this kind of resource on the server, and ni is the available resource amount of this kind of resource obtained by collection. If the requirements of all resources in all execution phases of the task are not matched and failed, the matching is successful.
具体地,匹配方法按以下步骤进行资源匹配:Specifically, the matching method performs resource matching according to the following steps:
step1),检查服务器可用资源量,如果可用资源量小于调度阈值,则结束调度;step1), check the amount of available resources of the server, if the amount of available resources is less than the scheduling threshold, end the scheduling;
step2),按照资源分配公平性、数据本地性等策略排序待调度任务集合。step 2), sort the set of tasks to be scheduled according to strategies such as resource allocation fairness and data locality.
step3),从待调度任务集合选取一个任务,获取任务各执行阶段的资源需求及持续时间;如果获取成功,则进入step5),否则按申请量匹配,通过后进入step7);未通过则进入step5);step3), select a task from the set of tasks to be scheduled, and obtain the resource requirements and duration of each execution stage of the task; if the acquisition is successful, go to step5), otherwise match according to the application amount, and go to step7) after passing; if not, go to step5) );
step4),以任务资源申请量为需求量匹配资源,匹配成功则进入step7);step4), take the task resource application amount as the demand to match resources, and enter step7) if the match is successful;
step5),s为任务执行阶段,n为服务器相应可用资源阶段,依次比较第i种资源在该执行阶段的资源需求si与可用资源阶段的资源可用量ai,如果第i中资源为不可压缩资源,则ai值为可用资源阶段内相应资源值ni;如果资源为可压缩资源,则ai值按照公式ai=ni+ri×Ni计算,其中ri为该服务器上该种资源相应可用资源阶段内的最大可压缩率,Ni为该服务器上该种资源的总量。如果资源需求大于可用资源,则匹配失败,进入step3),否则进入step6);step5), s is the task execution stage, n is the corresponding available resource stage of the server, compare the resource demand si of the i-th resource in this execution stage with the resource availability a i of the available resource stage in turn, if the resource in the i -th is unavailable If the resource is compressed, the value of a i is the corresponding resource value ni in the available resource phase; if the resource is a compressible resource, the value of a i is calculated according to the formula a i =n i +r i ×N i , where ri is the server The maximum compressibility ratio in the corresponding available resource phase of the resource on the server, and Ni is the total amount of the resource on the server. If the resource requirement is greater than the available resource, the match fails, and go to step3), otherwise go to step6);
step6),如果s为任务最后执行阶段,则匹配成功,进入step 7),否则重复执行step5),匹配下一阶段。step6), if s is the last execution stage of the task, the match is successful and go to step 7), otherwise step5) is repeated to match the next stage.
step7),检查该匹配是否影响服务器上其他任务执行,若检查通过则匹配完成,否则回到step3)。step7), check whether the matching affects the execution of other tasks on the server, if the check is passed, the matching is completed, otherwise go back to step3).
步骤5:如果资源匹配成功,则管理服务器假设资源已经分配,检查该计算服务器上所有任务会受到影响而不能满足约束条件。如果所有任务约束条件均可以满足,则分配计算资源。Step 5: If the resource matching is successful, the management server assumes that the resource has been allocated, and checks that all tasks on the computing server will be affected and cannot satisfy the constraints. If all task constraints can be satisfied, compute resources are allocated.
步骤6:管理服务器检查该计算服务器上是否剩余足够资源进入下一轮匹配,如果剩余资源满足条件,则进入下一轮匹配。Step 6: The management server checks whether there are enough resources left on the computing server to enter the next round of matching, and if the remaining resources meet the conditions, enter the next round of matching.
按上述方法即可实现对云计算平台计算资源的细粒度匹配。According to the above method, the fine-grained matching of the computing resources of the cloud computing platform can be realized.
下面给出采用本发明方法的一个具体应用实例:A specific application example of adopting the inventive method is provided below:
参见图2,该实例结合在开源云计算平台Yarn上,需要说明的是,本发明的方法并不仅仅用于开源云计算平台Yarn,其他满足条件的应用平台也适用。Referring to FIG. 2 , this example is combined with the open source cloud computing platform Yarn. It should be noted that the method of the present invention is not only applicable to the open source cloud computing platform Yarn, but also applicable to other application platforms that meet the conditions.
步骤1:Application Master向Resource Manager注册时,提供应用程序代码及参数的MD5值和输入数据量信息。Application Master向Resource Manager申请任务所需计算资源时,使用Application Master和任务类型信息标识任务。Step 1: When the Application Master registers with the Resource Manager, it provides the MD5 value of the application code and parameters and the input data volume information. When the Application Master applies to the Resource Manager for computing resources required by a task, the Application Master and the task type information are used to identify the task.
步骤2:Node Manager通过分析Linux Proc文件夹下信息获取本节点上运行中任务信息及剩余资源信息。Step 2: Node Manager obtains the running task information and remaining resource information on the node by analyzing the information in the Linux Proc folder.
Node Manager某种资源的可用资源量按以下公式计算:The amount of available resources of a certain resource of Node Manager is calculated according to the following formula:
其中,ri为第i次采样的资源量,ti为第i次采样的持续时间,T为采样的总时间,n为采样次数。Among them, ri is the resource amount of the ith sampling, t i is the duration of the ith sampling, T is the total time of sampling, and n is the number of sampling times.
Node Manager将采集的资源信息及分析计算得出可用资源量等相关信息通过心跳的方式回报给Resource Manager。The Node Manager reports the collected resource information and the available resources through analysis and calculation to the Resource Manager by heartbeat.
步骤3:Resource Manager接收Application Master和Node Manager的请求及汇报,并将Application Master的应用注册和资源申请信息交由Scheduler处理,将NodeManager汇报的资源信息交由Estimator处理。Step 3: The Resource Manager receives the request and report of the Application Master and the Node Manager, and sends the application registration and resource application information of the Application Master to the Scheduler for processing, and the resource information reported by the NodeManager to the Estimator for processing.
步骤4:Estimator根据Application Master的应用注册信息、资源请求信息识别相似任务。相似任务指负载中与该任务具有相同执行逻辑且输入数据量相同的任务。Step 4: The Estimator identifies similar tasks according to the application registration information and resource request information of the Application Master. Similar tasks refer to tasks in the workload that have the same execution logic and the same amount of input data as the task.
Estimator处理Node Manager汇报信息,推测任务的资源需求及持续时间。任务的磁盘、网络带宽资源需求量按照任务与数据的相对位置分为任务与数据在同服务器、在同机架和其他3类分别推测。Estimator processes Node Manager report information and infers the resource requirements and duration of tasks. The disk and network bandwidth resource requirements of tasks are divided into three categories according to the relative positions of tasks and data: tasks and data are in the same server, in the same rack, and other three categories.
任务在某进度的所需的一种计算资源需求量及时间按以下公式计算:A computing resource requirement and time required by a task in a certain progress are calculated according to the following formula:
其中,αn为第n次推测结果,βn为第n次的相似任务资源信息,rc为相似任务信息的资源压缩率,Thr为最大压缩率限制系数,e为自然底数。Among them, α n is the nth estimation result, β n is the nth similar task resource information, rc is the resource compression rate of the similar task information, Thr is the maximum compression rate limit coefficient, and e is the natural base.
Estimator分析任务CPU、内存空间需求信息,将任务划分为多个执行阶段。Estimator遍历任务在各进度的CPU、内存空间资源需求量,并分别记录这两种资源量的最大值和最小值。当CPU或者内存空间资源需求量的最大值和最小值之差大于划分阈值,并且遍历的进度达到划分长度,则将遍历的进度划分为任务的一个执行阶段。此处划分阈值默认取资源总量的20%,划分长度默认为任务进度总长的5%。划分阈值及划分长度应根据具体平台及负载调整。任务执行阶段CPU、内存空间、磁盘空间等资源需求值为阶段内该资源需求的最大值,内存带宽、磁盘带宽、网络带宽等资源的需求值为阶段内该资源需求的平均值。Estimator analyzes the task CPU and memory space requirement information, and divides the task into multiple execution stages. Estimator traverses the CPU and memory space resource requirements of tasks in each progress, and records the maximum and minimum values of these two resources respectively. When the difference between the maximum and minimum CPU or memory space resource requirements is greater than the division threshold, and the traversal progress reaches the division length, the traversal progress is divided into one execution stage of the task. Here, the division threshold is 20% of the total resources by default, and the division length is 5% of the total length of the task progress by default. The division threshold and division length should be adjusted according to the specific platform and load. In the task execution phase, the resource requirements such as CPU, memory space, and disk space are the maximum values of the resource requirements in the phase, and the resource requirements such as memory bandwidth, disk bandwidth, and network bandwidth are the average values of the resource requirements in the phase.
步骤5:Scheduler匹配Application Master的资源申请和Node Manager上的可用计算资源。Step 5: The Scheduler matches the resource request of the Application Master with the available computing resources on the Node Manager.
Scheduler将Application Master的资源申请加入待调度集合中。The Scheduler adds the resource application of the Application Master to the to-be-scheduled set.
Scheduler检查Node Manager上的可用计算资源是否达到匹配要求。The Scheduler checks whether the available computing resources on the Node Manager meet the matching requirements.
Scheduler根据资源分配公平性、数据本地性等策略排序待调度集合Scheduler sorts the to-be-scheduled set according to policies such as resource allocation fairness and data locality
Scheduler从排序后的待调度集合中取出待调度任务,并从Estimator获取推测得出的任务资源需求及持续时间等信息。如果信息获取失败,则按照资源申请量进行匹配。如果信息获取成功,则依次匹配任务的所有执行阶段的所有资源需求。The Scheduler takes out the tasks to be scheduled from the sorted set to be scheduled, and obtains the estimated task resource requirements and duration information from the Estimator. If the information acquisition fails, it will be matched according to the resource application amount. If the information is obtained successfully, all resource requirements of all execution phases of the task are matched in turn.
匹配中,根据资源特性将CPU、内存、磁盘、网络等计算资源抽象为可压缩和不可压缩两类。在任务执行过程中,如果任务分配到的某种资源少于任务对该资源的需求量,任务可以通过延长执行时间正常完成,则该资源为可压缩资源,否则为不可压缩资源。In matching, computing resources such as CPU, memory, disk, and network are abstracted into two categories, compressible and incompressible, according to resource characteristics. During task execution, if a certain resource allocated by the task is less than the task's demand for the resource, and the task can be completed normally by extending the execution time, the resource is a compressible resource, otherwise it is an incompressible resource.
Node Manager资源压缩的程度按以下公式计算:The degree of Node Manager resource compression is calculated by the following formula:
其中,rc为资源压缩率,Rr为资源的需求量,Ru为资源分配量。如果资源为不可压缩资源,则其压缩率rc始终为0。Among them, rc is the resource compression ratio, R r is the resource demand, and R u is the resource allocation. If the resource is incompressible, its compression ratio rc is always 0.
Node Manager在某阶段内某种可压缩资源的最大压缩率按以下公式计算:The maximum compression ratio of a compressible resource of Node Manager in a certain stage is calculated according to the following formula:
其中,Node Manager上有n个已匹配任务,当前匹配任务为第n+1个,n+1个任务的资源总需求大于资源总量,第i个任务在阶段内完成的工作量为wi,△p为资源压缩引起的性能变化。Among them, there are n matched tasks on the Node Manager, the current matching task is the n+1th task, the total resource demand of the n+1 task is greater than the total amount of resources, and the workload of the i-th task completed in the stage is w i , Δp is the performance change caused by resource compression.
匹配过程中,如果任务某种资源需求量大于Node Manager上相应可用资源量,则匹配失败。Node Manager上不可压缩资源的可用资源量为采集量;可压缩资源的可用资源量按公式ai=ni+ri×Ni计算,其中ri为该服务器上该种资源该匹配阶段的最大可压缩率,Ni为该服务器上该种资源的总量,ni为采集得到的该种资源的可用资源量。如果任务的所有执行阶段所有资源的需求均未匹配失败,则匹配成功。During the matching process, if the demand for a certain resource of the task is greater than the corresponding available resource on the Node Manager, the matching fails. The amount of available resources of incompressible resources on Node Manager is the collection amount; the amount of available resources of compressible resources is calculated according to the formula a i =n i +r i ×N i , where ri i is the amount of resources on the server in this matching stage The maximum compressible ratio, Ni is the total amount of this kind of resource on the server, and ni is the available resource amount of this kind of resource obtained by collection. If the requirements of all resources in all execution stages of the task are not matched and failed, the matching is successful.
步骤6:匹配成功之后,Scheduler在会检查若该匹配决策生效Node Manager上所有执行中任务的约束条件是否能得到满足。如果检查通过,则Scheduler向发出资源请求的Application Master分配计算资源。Step 6: After the matching is successful, the Scheduler will check whether the constraints of all executing tasks on the Node Manager can be satisfied if the matching decision takes effect. If the check passes, the Scheduler allocates computing resources to the Application Master that made the resource request.
分配资源之后,Scheduler会检查Node Manager所在服务器的剩余资源,决定是否进入下一轮调度。After allocating resources, the Scheduler will check the remaining resources of the server where the Node Manager is located and decide whether to enter the next round of scheduling.
步骤7:获得资源分配的Application Master与计算资源所在Node Manager通信,启动相应任务。Step 7: The Application Master that obtains the resource allocation communicates with the Node Manager where the computing resource is located, and starts the corresponding task.
实际测试结果表明,该方法得出的资源匹配结果可以避免资源碎片、过度分配,提高云计算平台的资源利用效率,最终提升云计算平台的整体吞吐率。The actual test results show that the resource matching results obtained by this method can avoid resource fragmentation and over-allocation, improve the resource utilization efficiency of the cloud computing platform, and ultimately improve the overall throughput rate of the cloud computing platform.
通过上述实施例可以看,本发明可以用于云计算平台资源管理与作业调度中。本发明根据相似任务推测任务资源需求和持续时间,基于资源需求将任务划分为多个执行阶段,分阶段分资源特性分别匹配任务资源需求与服务器计算资源,必要时在可接受范围内延长单个任务的完成时间换取更高的资源利用率和任务并行数量,最终提高整体性能。It can be seen from the above embodiments that the present invention can be used in cloud computing platform resource management and job scheduling. The invention infers task resource requirements and duration based on similar tasks, divides tasks into multiple execution stages based on resource requirements, matches task resource requirements and server computing resources respectively by stage and resource characteristics, and extends a single task within an acceptable range if necessary In exchange for higher resource utilization and the number of tasks in parallel, the overall performance is ultimately improved.
本发明不仅可用于云计算平台中的资源管理和作业调度中,基于集群的资源管理平台都可借鉴改进。The present invention can not only be used for resource management and job scheduling in cloud computing platforms, but also can be used for reference and improvement in cluster-based resource management platforms.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710909672.8A CN107797863B (en) | 2017-09-29 | 2017-09-29 | Fine-grained resource matching method in cloud computing platform |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710909672.8A CN107797863B (en) | 2017-09-29 | 2017-09-29 | Fine-grained resource matching method in cloud computing platform |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107797863A CN107797863A (en) | 2018-03-13 |
CN107797863B true CN107797863B (en) | 2020-07-28 |
Family
ID=61532969
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710909672.8A Expired - Fee Related CN107797863B (en) | 2017-09-29 | 2017-09-29 | Fine-grained resource matching method in cloud computing platform |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107797863B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108572875B (en) * | 2018-04-28 | 2020-09-04 | 辽宁工程技术大学 | Resource allocation method, device and system |
CN110659126B (en) * | 2018-06-29 | 2023-04-14 | 中兴通讯股份有限公司 | Resource management method, device and computer readable storage medium |
CN110166282B (en) * | 2019-04-16 | 2020-12-01 | 苏宁云计算有限公司 | Resource allocation method, device, computer equipment and storage medium |
CN110247979B (en) * | 2019-06-21 | 2021-08-17 | 北京邮电大学 | A scheduling scheme determination method, device and electronic device |
CN110362409B (en) * | 2019-07-19 | 2025-01-28 | 深圳前海微众银行股份有限公司 | Resource allocation method, device, equipment and storage medium based on multiple types |
CN110780977B (en) * | 2019-10-25 | 2022-06-03 | 杭州安恒信息技术股份有限公司 | Cloud computing-based task delivery method, device, system and readable storage medium |
CN114327880A (en) * | 2021-12-23 | 2022-04-12 | 广州亿电邦科智能网络科技有限公司 | Computing method of light code heterogeneous distributed system |
CN114417577A (en) * | 2021-12-30 | 2022-04-29 | 浙江省科技信息研究院 | Cross-platform resource scheduling and optimization control method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102460393A (en) * | 2009-05-01 | 2012-05-16 | 思杰系统有限公司 | Systems and methods for establishing a cloud bridge between virtual storage resources |
CN103533086A (en) * | 2013-10-31 | 2014-01-22 | 中国科学院计算机网络信息中心 | Uniform resource scheduling method in cloud computing system |
CN105718364A (en) * | 2016-01-15 | 2016-06-29 | 西安交通大学 | Dynamic assessment method for ability of computation resource in cloud computing platform |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9319464B2 (en) * | 2013-01-22 | 2016-04-19 | International Business Machines Corporation | Storage managment in a multi-tiered storage architecture |
US10007556B2 (en) * | 2015-12-07 | 2018-06-26 | International Business Machines Corporation | Reducing utilization speed of disk storage based on rate of resource provisioning |
-
2017
- 2017-09-29 CN CN201710909672.8A patent/CN107797863B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102460393A (en) * | 2009-05-01 | 2012-05-16 | 思杰系统有限公司 | Systems and methods for establishing a cloud bridge between virtual storage resources |
CN103533086A (en) * | 2013-10-31 | 2014-01-22 | 中国科学院计算机网络信息中心 | Uniform resource scheduling method in cloud computing system |
CN105718364A (en) * | 2016-01-15 | 2016-06-29 | 西安交通大学 | Dynamic assessment method for ability of computation resource in cloud computing platform |
Non-Patent Citations (1)
Title |
---|
采用资源划分的云环境下Hadoop资源许可调度方法;周墨颂等;《西安交通大学学报》;20150831;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN107797863A (en) | 2018-03-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107797863B (en) | Fine-grained resource matching method in cloud computing platform | |
Yeung et al. | Horus: Interference-aware and prediction-based scheduling in deep learning systems | |
Jalaparti et al. | Network-aware scheduling for data-parallel jobs: Plan when you can | |
Grandl et al. | Altruistic scheduling in {Multi-Resource} clusters | |
CN104915407B (en) | A kind of resource regulating method based under Hadoop multi-job environment | |
US10534542B2 (en) | Dynamic core allocation for consistent performance in a non-preemptive scheduling environment | |
US9201690B2 (en) | Resource aware scheduling in a distributed computing environment | |
US20140019987A1 (en) | Scheduling map and reduce tasks for jobs execution according to performance goals | |
US20130290972A1 (en) | Workload manager for mapreduce environments | |
Chatzistergiou et al. | Fast heuristics for near-optimal task allocation in data stream processing over clusters | |
Chard et al. | Cost-aware cloud provisioning | |
CN107291550B (en) | A Spark platform resource dynamic allocation method and system for iterative applications | |
Vijayakumar et al. | Dynamic resource provisioning for data streaming applications in a cloud environment | |
Yao et al. | LsPS: A job size-based scheduler for efficient task assignments in Hadoop | |
US20230004440A1 (en) | Allocating of computing resources for applications | |
Xu et al. | Improving utilization and parallelism of hadoop cluster by elastic containers | |
Marandi et al. | Filo: Consolidated consensus as a cloud service | |
CN113934525A (en) | Hadoop cluster task scheduling method based on positive and negative feedback load scheduling algorithm | |
Chen et al. | Improving cluster resource efficiency with oversubscription | |
Zhou et al. | Fine-grained scheduling in multi-resource clusters | |
Wang et al. | On optimal budget-driven scheduling algorithms for MapReduce jobs in the hetereogeneous cloud | |
Pace et al. | A data-driven approach to dynamically adjust resource allocation for compute clusters | |
Thamsen et al. | Learning efficient co-locations for scheduling distributed dataflows in shared clusters | |
Xie et al. | A novel independent job rescheduling strategy for cloud resilience in the cloud environment | |
Bakni et al. | Survey on improving the performance of MapReduce in Hadoop |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200728 |