CN116414567A

CN116414567A - Resource scheduling method, device and equipment for smart car operating system

Info

Publication number: CN116414567A
Application number: CN202310542184.3A
Authority: CN
Inventors: 朱林法; 刘洪振; 许智
Original assignee: Zebred Network Technology Co Ltd
Current assignee: Zebred Network Technology Co Ltd
Priority date: 2023-05-12
Filing date: 2023-05-12
Publication date: 2023-07-11

Abstract

The present application provides a resource scheduling method, device and equipment for a smart car operating system, including: performing at least one performance test on the artificial intelligence model in the smart car operating system, obtaining at least one static performance index, and identifying at least one static performance index Priority static index, set the static strategy corresponding to the priority static index as a static allocation strategy; according to the static allocation strategy, allocate graphics processor resources for each artificial intelligence model in the smart car operating system; run the smart car operation At least one artificial intelligence model in the system, and monitor the dynamic performance index of the running artificial intelligence model; generate a dynamic allocation strategy according to the dynamic performance index, and adjust the graphics processor resources allocated to each artificial intelligence model according to the dynamic allocation strategy. The application avoids the situation that multiple requests compete for the working thread in the graphics processor at the same time, and ensures the stability of the resource call of the graphics processor.

Description

Resource scheduling method, device and equipment for smart car operating system

技术领域technical field

本申请涉及智能汽车技术领域，尤其涉及一种智能汽车操作系统的资源调度方法、装置及设备。The present application relates to the technical field of smart cars, in particular to a resource scheduling method, device and equipment for a smart car operating system.

背景技术Background technique

随着智能驾驶技术的发展，智能汽车的自动驾驶场景也应运而生，自动驾驶场景是基于图形处理器通过多个人工智能模型生成的推理结论所构建的；其中，人工智能模型是用于解决问题或分析一组数据的一系列计算和规则,它就像一个流程图，其中包含提出问题的分步说明，只不过是以数学和编程代码形式进行编写，因此，人工智能模型通常用于实现雷达感知、视觉感知、路径规划等指定任务；推理结论是基于多个人工智能模型向图形处理器发送推理请求，使图形处理器调用人工智能模型并根据载入人工智能模型中的采集数据所生成的推理结果。With the development of intelligent driving technology, the automatic driving scene of smart cars has also emerged as the times require. The automatic driving scene is constructed based on the reasoning conclusions generated by the graphics processor through multiple artificial intelligence models; among them, the artificial intelligence model is used to solve A problem or a series of calculations and rules for analyzing a set of data, it is like a flowchart containing step-by-step instructions for posing a problem, but written in the form of mathematical and programming code, so artificial intelligence models are often used to implement Radar perception, visual perception, path planning and other specified tasks; the reasoning conclusion is based on multiple artificial intelligence models to send reasoning requests to the graphics processor, so that the graphics processor calls the artificial intelligence model and generates it based on the collected data loaded into the artificial intelligence model inference results.

然而，发明人发现，当前的自动驾驶场景中的多个人工智能模型向图形处理器发送推理请求时，很容易出现多个请求同时争抢图形处理器中工作线程的情况，导致图形处理器因工作线程频繁被出现冲突的推理请求调用，造成图形处理器资源调用的不稳定的情况发生。However, the inventors found that when multiple artificial intelligence models in the current autonomous driving scene send inference requests to the graphics processor, it is easy for multiple requests to compete for the working thread in the graphics processor at the same time, causing the graphics processor to fail. Worker threads are frequently called by conflicting inference requests, resulting in unstable graphics processor resource calls.

发明内容Contents of the invention

本申请提供一种智能汽车操作系统的资源调度方法、装置及设备，用以解决当前的自动驾驶场景中的多个人工智能模型向图形处理器发送推理请求时，很容易出现多个请求同时争抢图形处理器中工作线程的情况，导致图形处理器因工作线程频繁被出现冲突的推理请求调用，造成图形处理器资源调用的不稳定的的问题。This application provides a resource scheduling method, device, and equipment for an intelligent vehicle operating system, which are used to solve the problem that when multiple artificial intelligence models in the current automatic driving scene send reasoning requests to the graphics processor, it is easy to cause multiple requests to compete at the same time. The situation of grabbing the working thread in the graphics processor causes the graphics processor to be frequently invoked by conflicting inference requests due to the working thread, resulting in unstable resource calls of the graphics processor.

第一方面，本申请提供一种智能汽车操作系统的图形处理器的资源调度方法，包括：In a first aspect, the present application provides a resource scheduling method for a graphics processor of an intelligent vehicle operating system, including:

对智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标；并识别所述至少一个静态性能指标中的优先静态指标，将所述优先静态指标对应的静态策略设为静态分配策略，其中，智能汽车操作系统中具有至少一个人工智能模型，所述人工智能模型用于实现指定任务，所述指定任务用于实现汽车的自动驾驶，所述静态性能指标反映了人工智能模型在性能测试中的性能表现，所述优先静态指标为满足预置的优先规则的静态性能指标，所述静态策略是用于对人工智能模型进行分组，及对分组后的人工智能模型分配工作线程的计算机策略，所述工作线程是用于调度所述图形处理器中的流处理器和/或计算单元的序列图形处理器；Perform at least one performance test on the artificial intelligence model in the smart car operating system to obtain at least one static performance index; and identify the priority static index in the at least one static performance index, and set the static strategy corresponding to the priority static index to Static allocation strategy, wherein, there is at least one artificial intelligence model in the smart car operating system, the artificial intelligence model is used to realize the specified task, the specified task is used to realize the automatic driving of the car, and the static performance index reflects the artificial intelligence model. The performance of the model in the performance test, the priority static index is a static performance index that meets the preset priority rules, and the static strategy is used to group the artificial intelligence models and assign work to the grouped artificial intelligence models a computer policy for threads, said worker thread being a sequential GPU for scheduling stream processors and/or compute units in said graphics processor;

根据所述静态分配策略，为所述智能汽车操作系统中的每一人工智能模型分配图形处理器资源；According to the static allocation strategy, allocate graphics processor resources for each artificial intelligence model in the intelligent vehicle operating system;

运行所述智能汽车操作系统中的至少一个人工智能模型，并监控运行的人工智能模型的动态性能指标，其中，所述动态性能指标反映了人工智能模型在调用分配到的图形处理器资源进行运算时的性能表现；Run at least one artificial intelligence model in the smart car operating system, and monitor the dynamic performance indicators of the running artificial intelligence model, wherein the dynamic performance indicators reflect that the artificial intelligence model is calling the allocated graphics processor resources to perform calculations performance when

根据所述动态性能指标生成动态分配策略，根据所述动态分配策略调整每一人工智能模型分配到的图形处理器资源，其中，所述动态分配策略用于根据所述人工智能模型在运行时的性能表现，对运行的人工智能模型分配图形处理器资源。A dynamic allocation strategy is generated according to the dynamic performance index, and the graphics processor resources allocated to each artificial intelligence model are adjusted according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used for the artificial intelligence model at runtime according to the dynamic allocation strategy. Performance, allocating GPU resources to running AI models.

上述方案中，智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标，包括：In the above scheme, the artificial intelligence model in the smart car operating system conducts at least one performance test to obtain at least one static performance index, including:

获取至少一个静态策略，根据每一所述静态策略，分别对所述智能汽车操作系统中的人工智能模型进行分组并向每组人工智能模型分配工作线程，得到至少一个静态样本，其中，所述静态策略是用于对人工智能模型进行分组，及对分组后的人工智能模型分配工作线程的计算机策略；Obtaining at least one static strategy, grouping the artificial intelligence models in the smart car operating system according to each static strategy and assigning work threads to each group of artificial intelligence models to obtain at least one static sample, wherein the The static strategy is a computer strategy for grouping artificial intelligence models and assigning working threads to the grouped artificial intelligence models;

将预置的测试实例录入每一所述静态样本中的人工智能模型中，并通过每一所述静态样本中的人工智能模型分配到的工作线程，运行所述人工智能模型中的测试实例，以对每一所述静态样本进行性能测试，并得到分别与至少一个所述静态策略对应的至少一个静态性能指标，其中，所述测试实例是用于对人工智能模型进行性能测试的测试用例。Entering the preset test instance into the artificial intelligence model in each of the static samples, and running the test instance in the artificial intelligence model through the worker threads assigned to the artificial intelligence model in each of the static samples, Performance testing is performed on each of the static samples, and at least one static performance indicator corresponding to at least one static strategy is obtained, wherein the test instance is a test case for performance testing of an artificial intelligence model.

上述方案中，根据每一所述静态策略，分别对所述智能汽车操作系统中的人工智能模型进行分组并向每组人工智能模型分配工作线程，得到至少一个静态样本，包括：In the above scheme, according to each of the static strategies, the artificial intelligence models in the smart car operating system are grouped respectively and work threads are assigned to each group of artificial intelligence models to obtain at least one static sample, including:

根据所述静态策略中的划分规则对智能汽车操作系统中每一人工智能模型进行分组，得到至少一个测试组，其中，所述测试组中至少具有一个人工智能模型；Group each artificial intelligence model in the smart car operating system according to the division rules in the static strategy to obtain at least one test group, wherein there is at least one artificial intelligence model in the test group;

获取所述图形处理器中的至少一个工作线程，根据所述静态策略中的资源分配规则向每一所述测试组分配一个工作线程；Acquiring at least one worker thread in the graphics processor, and assigning a worker thread to each of the test groups according to the resource allocation rules in the static policy;

汇总至少一个所述测试组及与每一所述测试组对应的工作线程，形成与所述静态策略对应的静态样本。Summarizing at least one test group and working threads corresponding to each test group to form a static sample corresponding to the static policy.

上述方案中，根据所述静态策略中的划分规则对智能汽车操作系统中每一人工智能模型进行分组，得到至少一个测试组，包括：In the above scheme, each artificial intelligence model in the smart car operating system is grouped according to the division rules in the static strategy to obtain at least one test group, including:

若确定所述智能汽车操作系统中具有至少一个有向无环图，则将属于同一有向无环图中的人工智能模型划分为一个测试组，其中，所述有向无环图反映了智能汽车操作系统中的两个或两个以上的人工智能模型之间的逻辑关系；If it is determined that there is at least one directed acyclic graph in the smart car operating system, the artificial intelligence models belonging to the same directed acyclic graph are divided into a test group, wherein the directed acyclic graph reflects the intelligence The logical relationship between two or more artificial intelligence models in the automotive operating system;

若确定所述智能汽车操作系统中具有不属于所述有向无环图的其他人工智能模型，则根据每一所述其他人工智能模型的模型属性数据对所述其他人工智能模型进行分组，得到至少一个测试组，其中，所述模型属性数据描述了人工智能模型为实现指定任务所消耗的算力；If it is determined that the smart car operating system has other artificial intelligence models that do not belong to the directed acyclic graph, then group the other artificial intelligence models according to the model attribute data of each of the other artificial intelligence models to obtain At least one test group, wherein the model attribute data describes the computing power consumed by the artificial intelligence model to achieve a specified task;

若确定所述智能汽车操作系统中不具有有向无环图，则根据每一人工智能模型的模型属性数据对所述智能汽车操作系统中的人工智能模型进行分组，得到至少一个测试组。If it is determined that there is no directed acyclic graph in the smart car operating system, the artificial intelligence models in the smart car operating system are grouped according to the model attribute data of each artificial intelligence model to obtain at least one test group.

上述方案中，识别所述至少一个静态性能指标中的优先静态指标，包括：In the above solution, identifying the priority static index in the at least one static performance index includes:

提取每一静态性能指标中的第一指标元素，对所述第一指标元素进行排序得到目标序列，其中，所述静态性能指标中具有至少一个静态指标元素，所述静态指标元素反映了人工智能模型在性能测试中的一个性能维度上的性能表现，所述第一指标元素是所述静态性能指标中的一个静态指标元素；Extracting the first index element in each static performance index, and sorting the first index elements to obtain a target sequence, wherein there is at least one static index element in the static performance index, and the static index element reflects the artificial intelligence The performance of the model on a performance dimension in the performance test, the first index element is a static index element in the static performance index;

根据每一第一指标元素在所述目标序列中的位次，确定每一第一指标元素的性能值，其中，所述性能值反映了所述第一指标元素的性能优劣程度；Determine the performance value of each first index element according to the position of each first index element in the target sequence, wherein the performance value reflects the degree of performance of the first index element;

根据每一静态性能指标中各静态指标元素的性能值，得到每一所述静态性能指标的综合性能值；Obtaining the comprehensive performance value of each static performance index according to the performance value of each static index element in each static performance index;

将综合性能值最高的静态性能指标设为优先静态指标。The static performance index with the highest comprehensive performance value is set as the priority static index.

上述方案中，根据所述静态分配策略，为所述智能汽车操作系统中的每一人工智能模型分配图形处理器资源，包括：In the above scheme, according to the static allocation strategy, the allocation of graphics processor resources for each artificial intelligence model in the intelligent vehicle operating system includes:

根据所述静态分配策略中的划分规则，对所述智能汽车操作系统中的人工智能模型进行分组，得到至少一个运行组；According to the division rules in the static allocation strategy, group the artificial intelligence models in the smart car operating system to obtain at least one operation group;

获取所述图形处理器中的至少一个工作线程，根据所述静态分配策略中的资源分配规则向每一运行组分配分配一个工作线程，以向每一运行组中的每一人工智能模型资源分配图形处理器资源。Acquiring at least one worker thread in the graphics processor, assigning a worker thread to each operation group according to the resource allocation rules in the static allocation strategy, so as to allocate resources to each artificial intelligence model in each operation group Graphics processor resources.

上述方案中，根据所述动态性能指标生成动态分配策略，包括：In the above solution, generating a dynamic allocation strategy according to the dynamic performance index includes:

提取所述动态性能指标中的第二指标元素，并获取与所述第二指标元素对应的指标规则，其中，所述动态性能指标中具有至少一个指标元素，所述第二指标元素是所述动态性能指标中的一个指标元素，所述指标规则是用于定义指标元素正常和异常的计算机规则；Extracting a second index element in the dynamic performance index, and obtaining an index rule corresponding to the second index element, wherein the dynamic performance index has at least one index element, and the second index element is the An index element in the dynamic performance index, the index rule is a computer rule used to define normal and abnormal index elements;

若确定所述第二指标元素符合所述指标规则，则将所述第二指标元素设为正常指标元素；If it is determined that the second index element conforms to the index rule, setting the second index element as a normal index element;

若确定所述第二指标元素不符合所述指标规则，则将所述第二指标元素设为异常指标元素；If it is determined that the second index element does not comply with the index rule, setting the second index element as an abnormal index element;

若确定所述动态性能指标中的正常指标元素的数量未达到预置的正常阈值，或异常指标元素的数量达到预置的异常阈值，则确定所述动态性能指标为异常性能指标；If it is determined that the number of normal index elements in the dynamic performance index does not reach a preset normal threshold, or the number of abnormal index elements reaches a preset abnormal threshold, then determine that the dynamic performance index is an abnormal performance index;

若确定所述动态性能指标中的正常指标元素的数量达到预置的正常阈值，或异常指标元素的数量未达到预置的异常阈值，则确定所述动态性能指标为正常性能指标；If it is determined that the number of normal index elements in the dynamic performance index reaches a preset normal threshold, or the number of abnormal index elements does not reach a preset abnormal threshold, then determine that the dynamic performance index is a normal performance index;

根据所述正常性能指标和所述异常性能指标生成动态分配策略。A dynamic allocation policy is generated according to the normal performance index and the abnormal performance index.

上述方案中，根据所述正常性能指标和所述异常性能指标生成动态分配策略，包括：In the above solution, a dynamic allocation strategy is generated according to the normal performance index and the abnormal performance index, including:

将所述正常性能指标对应的人工智能模型设为正常模型，将所述异常性能指标对应的人工智能模型设为异常模型，将所述正常模型所在的运行组和所述智能汽车操作系统中未运行的人工智能模型所在的运行组设为正常组，将所述异常模型所在的运行组为异常组；The artificial intelligence model corresponding to the normal performance index is set as a normal model, the artificial intelligence model corresponding to the abnormal performance index is set as an abnormal model, and the operating group where the normal model is located and the operating group of the smart car operating system are not The operating group where the artificial intelligence model is located is set as a normal group, and the operating group where the abnormal model is located is an abnormal group;

若确定所述异常模型与所述异常组中其他的人工智能模型之间具有逻辑关系；则调整所述静态分配策略或所述动态分配策略中的划分规则，使调整后的划分规则用于将所述异常组中的独立模型调整到一个正常组；和/或调整所述静态分配策略或所述动态分配策略中的资源分配规则，使调整后的资源分配规则用于将所述异常组对应的工作线程，调整为一个正常组对应的工作线程；其中，所述独立模型是异常组中与其他的人工智能模型之间不具有逻辑关系的人工智能模型；If it is determined that there is a logical relationship between the abnormal model and other artificial intelligence models in the abnormal group; then adjust the division rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted division rules are used to divide The independent model in the abnormal group is adjusted to a normal group; and/or the resource allocation rule in the static allocation strategy or the dynamic allocation strategy is adjusted so that the adjusted resource allocation rule is used to map the abnormal group to The working thread is adjusted to a corresponding working thread of a normal group; wherein, the independent model is an artificial intelligence model that has no logical relationship with other artificial intelligence models in the abnormal group;

若确定所述异常模型与所述异常组中其他的人工智能模型之间不具有逻辑关系；则调整所述静态分配策略或所述动态分配策略中的划分规则，使调整后的划分规则用于将所述异常模型调整到一个正常组；和/或调整所述静态分配策略或所述动态分配策略中的资源分配规则，使调整后的资源分配规则用于将所述异常组对应的工作线程，调整为一个正常组对应的工作线程；If it is determined that there is no logical relationship between the abnormal model and other artificial intelligence models in the abnormal group; then adjust the division rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted division rules are used for Adjusting the exception model to a normal group; and/or adjusting the resource allocation rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted resource allocation rules are used to allocate the worker threads corresponding to the exception group , adjusted to a worker thread corresponding to a normal group;

根据调整后的划分规则和/或调整后的资源分配规则，生成动态分配策略。Generate a dynamic allocation policy according to the adjusted division rule and/or the adjusted resource allocation rule.

第二方面，本申请提供一种智能汽车操作系统的图形处理器的资源调度装置，包括：In a second aspect, the present application provides a resource scheduling device for a graphics processor of a smart car operating system, including:

静态测试模块，用于对智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标；并识别所述至少一个静态性能指标中的优先静态指标，将所述优先静态指标对应的静态策略设为静态分配策略，其中，智能汽车操作系统中具有至少一个人工智能模型，所述人工智能模型用于实现指定任务，所述指定任务用于实现汽车的自动驾驶，所述静态性能指标反映了人工智能模型在性能测试中的性能表现，所述优先静态指标为满足预置的优先规则的静态性能指标，所述静态策略是用于对人工智能模型进行分组，及对分组后的人工智能模型分配工作线程的计算机策略，所述工作线程是用于调度所述图形处理器中的流处理器和/或计算单元的序列图形处理器；The static test module is used to perform at least one performance test on the artificial intelligence model in the smart car operating system to obtain at least one static performance index; and identify the priority static index in the at least one static performance index, and use the priority static index The corresponding static strategy is set as a static allocation strategy, wherein, there is at least one artificial intelligence model in the smart car operating system, and the artificial intelligence model is used to realize a specified task, and the specified task is used to realize the automatic driving of the car. The performance index reflects the performance of the artificial intelligence model in the performance test. The priority static index is a static performance index that meets the preset priority rules. The static strategy is used to group the artificial intelligence models and group the A computer strategy for assigning worker threads of an artificial intelligence model that is a sequential graphics processor for scheduling stream processors and/or compute units in the graphics processor;

静态分配模块，用于根据所述静态分配策略，为所述智能汽车操作系统中的每一人工智能模型分配图形处理器资源；A static allocation module, configured to allocate graphics processor resources to each artificial intelligence model in the smart car operating system according to the static allocation strategy;

动态监测模块，用于运行所述智能汽车操作系统中的至少一个人工智能模型，并监控运行的人工智能模型的动态性能指标，其中，所述动态性能指标反映了人工智能模型在调用分配到的图形处理器资源进行运算时的性能表现；A dynamic monitoring module, configured to run at least one artificial intelligence model in the smart car operating system, and monitor the dynamic performance indicators of the running artificial intelligence model, wherein the dynamic performance indicators reflect that the artificial intelligence model is invoking the assigned The performance of graphics processor resources when performing calculations;

动态分配模块，用于根据所述动态性能指标生成动态分配策略，根据所述动态分配策略调整每一人工智能模型分配到的图形处理器资源，其中，所述动态分配策略用于根据所述人工智能模型在运行时的性能表现，对运行的人工智能模型分配图形处理器资源。A dynamic allocation module, configured to generate a dynamic allocation strategy according to the dynamic performance index, and adjust graphics processor resources allocated to each artificial intelligence model according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used to The performance of the intelligent model at runtime, and allocate graphics processor resources to the running artificial intelligence model.

第三方面，本申请提供一种计算机设备，包括：处理器以及与所述处理器通信连接的存储器；In a third aspect, the present application provides a computer device, including: a processor and a memory communicatively connected to the processor;

所述存储器存储计算机执行指令；the memory stores computer-executable instructions;

所述处理器执行所述存储器存储的计算机执行指令，以实现如权利要求上述的图形处理器的资源调度方法。The processor executes the computer-executed instructions stored in the memory, so as to implement the resource scheduling method for a graphics processor as claimed in the claims.

第四方面，本申请提供一种计算机可读存储介质，所述计算机可读存储介质中存储有计算机执行指令，所述计算机执行指令被处理器执行时用于实现上述的图形处理器的资源调度方法。In a fourth aspect, the present application provides a computer-readable storage medium, where computer-executable instructions are stored in the computer-readable storage medium, and when the computer-executable instructions are executed by a processor, they are used to implement the resource scheduling of the above-mentioned graphics processor method.

第五方面，本申请提供一种计算机程序产品，包括计算机程序，所述计算机程序被处理器执行时实现上述的图形处理器的资源调度方法。In a fifth aspect, the present application provides a computer program product, including a computer program, and when the computer program is executed by a processor, the above resource scheduling method for a graphics processor is implemented.

本申请提供的一种智能汽车操作系统的资源调度方法、装置及设备，通过对智能汽车操作系统中的人工智能模型进行至少一次性能测试，以在运行智能汽车操作系统之前，对其中的人工智能模型进行静态的性能测试，得到至少一个静态性能指标，以确定智能汽车操作系统中每一人工智能模型在不同分组，以及对其分配不同的工作线程时，各人工智能模型的性能表现。A resource scheduling method, device, and equipment for a smart car operating system provided by the present application, by performing at least one performance test on the artificial intelligence model in the smart car operating system, before running the smart car operating system, the artificial intelligence in it Static performance testing is performed on the model to obtain at least one static performance index, so as to determine the performance of each artificial intelligence model in different groups and assign different working threads to it in the smart car operating system.

通过识别所述至少一个静态性能指标中的优先静态指标，以识别出性能表现最优的分组方式和工作线程匹配方式，进而在运行智能汽车系统之前最大限度的优化图形处理器资源的分配合理性，并将最优的分组方式和工作线程匹配方式设为静态分配策略。By identifying the priority static index in the at least one static performance index, to identify the best performance grouping method and working thread matching method, and then optimize the rationality of the allocation of graphics processor resources to the greatest extent before running the smart car system , and set the optimal grouping method and worker thread matching method as a static allocation strategy.

通过优先静态指标对应的静态分配策略，向每一人工智能模型分配图形处理器资源，以实现在智能汽车操作系统在运行之前，已为各人工智能模型分配了性能最优配置的图形处理器资源。Allocate graphics processor resources to each artificial intelligence model through the static allocation strategy corresponding to the priority static index, so as to realize that the graphics processor resources with optimal performance have been allocated to each artificial intelligence model before the operating system of the smart car is running .

通过性能采集模块监控运行的人工智能模型的动态性能指标，动态性能指标中的指标元素包括：CPU使用率、内存占用率、磁盘IO、系统平均负载、延迟、帧率；通过根据所述动态性能指标生成动态分配策略，以实现基于当前在运行时的人工智能模型的性能表现对人工智能模型分配图形处理器资源，确保各运行的人工智能模型均有足够的图形处理器资源用于调用，解决了人工智能模型生成的推理请求在图形处理器的工作线程的调用上产生冲突，避免了多个请求同时争抢图形处理器中工作线程的情况，保证了自动驾驶场景中的多个人工智能模型的性能表现，进而确保了图形处理器资源调用的稳定。The dynamic performance indicators of the running artificial intelligence model are monitored through the performance acquisition module. The indicator elements in the dynamic performance indicators include: CPU usage, memory usage, disk IO, system average load, delay, and frame rate; according to the dynamic performance The index generates a dynamic allocation strategy to realize the allocation of graphics processor resources to the artificial intelligence model based on the performance of the current artificial intelligence model at runtime, so as to ensure that each running artificial intelligence model has enough graphics processor resources for calling, solving It prevents the inference requests generated by the artificial intelligence model from conflicting on the invocation of the working thread of the graphics processor, avoids the situation where multiple requests compete for the working thread of the graphics processor at the same time, and ensures the multiple artificial intelligence models in the automatic driving scene performance, thereby ensuring the stability of graphics processor resource calls.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分，示出了符合本申请的实施例，并与说明书一起用于解释本申请的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description serve to explain the principles of the application.

图1为本申请实施例提供的一种应用场景示意图；FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application;

图2为本申请实施例提供的一种图形处理器的资源调度方法的实施例1的流程图；FIG. 2 is a flow chart of Embodiment 1 of a resource scheduling method for a graphics processor provided in an embodiment of the present application;

图3为本发明提供的一种图形处理器的资源调度装置的程序模块示意图；3 is a schematic diagram of program modules of a resource scheduling device for a graphics processor provided by the present invention;

图4为本发明计算机设备中计算机设备的硬件结构示意图。Fig. 4 is a schematic diagram of the hardware structure of the computer device in the computer device of the present invention.

通过上述附图，已示出本申请明确的实施例，后文中将有更详细的描述。这些附图和文字描述并不是为了通过任何方式限制本申请构思的范围，而是通过参考特定实施例为本领域技术人员说明本申请的概念。By means of the above drawings, specific embodiments of the present application have been shown, which will be described in more detail hereinafter. These drawings and text descriptions are not intended to limit the scope of the concept of the application in any way, but to illustrate the concept of the application for those skilled in the art by referring to specific embodiments.

具体实施方式Detailed ways

这里将详细地对示例性实施例进行说明，其示例表示在附图中。下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反，它们仅是与如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

请参阅图1，本申请具体的应用场景为：Please refer to Figure 1, the specific application scenario of this application is:

运行有智能汽车操作系统的图形处理器的资源调度方法的控制单元11，安装在智能汽车操作系统1中，控制单元11与智能汽车操作系统1中的人工智能模型12和图形处理器13连接。The control unit 11 running the resource scheduling method of the graphics processor of the smart car operating system is installed in the smart car operating system 1 , and the control unit 11 is connected with the artificial intelligence model 12 and the graphics processor 13 in the smart car operating system 1 .

控制单元11对智能汽车操作系统1中的人工智能模型12进行至少一次性能测试，得到至少一个静态性能指标，识别至少一个静态性能指标中的优先静态指标，将优先静态指标对应的静态策略设为静态分配策略The control unit 11 performs at least one performance test on the artificial intelligence model 12 in the smart car operating system 1, obtains at least one static performance index, identifies the priority static index in the at least one static performance index, and sets the static strategy corresponding to the priority static index to static allocation strategy

控制单元11根据静态分配策略向智能汽车操作系统1中的每一人工智能模型12分配图形处理器13资源。The control unit 11 allocates graphics processor 13 resources to each artificial intelligence model 12 in the intelligent vehicle operating system 1 according to a static allocation strategy.

控制单元11运行智能汽车操作系统1中的至少一个人工智能模型12，并监控运行的人工智能模型12的动态性能指标。The control unit 11 runs at least one artificial intelligence model 12 in the smart car operating system 1, and monitors the dynamic performance index of the running artificial intelligence model 12.

控制单元11根据动态性能指标生成动态分配策略，根据动态分配策略调整每一人工智能模型12分配到的图形处理器资源，其中，动态分配策略用于根据人工智能模型12在运行时的性能表现，对运行的人工智能模型12分配图形处理器资源。The control unit 11 generates a dynamic allocation strategy according to the dynamic performance index, and adjusts the graphics processor resources allocated to each artificial intelligence model 12 according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used to perform according to the performance of the artificial intelligence model 12 at runtime, Allocate graphics processor resources to the running artificial intelligence model 12 .

控制单元11安装在具有智能汽车操作系统1的电路或芯片中的计算机模块，图形处理器13是安装在该电路或芯片中的图形加速器，以作为智能汽车操作系统1的一部分。The control unit 11 is a computer module installed in a circuit or chip with the smart car operating system 1 , and the graphics processor 13 is a graphics accelerator installed in the circuit or chip as a part of the smart car operating system 1 .

下面以具体地实施例对本申请的技术方案以及本申请的技术方案如何解决现有技术问题进行详细说明。下面这几个具体的实施例可以相互结合，对于相同或相似的概念或过程可能在某些实施例中不再赘述。下面将结合附图，对本申请的实施例进行描述。The technical solution of the present application and how the technical solution of the present application solves the problems in the prior art will be described in detail below with specific embodiments. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. Embodiments of the present application will be described below in conjunction with the accompanying drawings.

实施例1：Example 1:

请参阅图2，本申请提供一种智能汽车操作系统的图形处理器的资源调度方法，包括：Please refer to Fig. 2, the present application provides a kind of resource scheduling method of the graphic processor of intelligent car operating system, including:

S201：对智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标；并识别至少一个静态性能指标中的优先静态指标，将优先静态指标对应的静态策略设为静态分配策略，其中，智能汽车操作系统中具有至少一个人工智能模型，人工智能模型用于实现指定任务，指定任务用于实现汽车的自动驾驶，静态性能指标反映了人工智能模型在性能测试中的性能表现，优先静态指标为满足预置的优先规则的静态性能指标，静态策略是用于对人工智能模型进行分组，及对分组后的人工智能模型分配工作线程的计算机策略，工作线程是用于调度图形处理器中的流处理器和/或计算单元的序列图形处理器。S201: Perform at least one performance test on the artificial intelligence model in the smart car operating system to obtain at least one static performance index; and identify the priority static index in at least one static performance index, and set the static policy corresponding to the priority static index as static allocation Strategy, wherein the smart car operating system has at least one artificial intelligence model, the artificial intelligence model is used to realize the specified task, the specified task is used to realize the automatic driving of the car, and the static performance index reflects the performance of the artificial intelligence model in the performance test , the priority static index is a static performance index that satisfies the preset priority rules. The static strategy is a computer strategy for grouping artificial intelligence models and assigning work threads to the grouped artificial intelligence models. Work threads are used to schedule graphs. A stream processor in a processor and/or a sequential graphics processor in a compute unit.

本步骤中，智能汽车操作系统是智能汽车的底层操作系统(Operating System，OS)，用来控制和管理整个智能汽车的硬件和软件资源，给用户和其他软件提供接口和环境。当前的智能汽车操作系统包括：自动驾驶OS和智能座舱OS。自动驾驶OS对安全性、实时性、稳定性要求非常高，智能座舱OS更加重视开放性、兼容性。In this step, the smart car operating system is the underlying operating system (Operating System, OS) of the smart car, which is used to control and manage the hardware and software resources of the entire smart car, and provide interfaces and environments for users and other software. Current smart car operating systems include: autopilot OS and smart cockpit OS. Autonomous driving OS has very high requirements on safety, real-time performance and stability, while smart cockpit OS pays more attention to openness and compatibility.

自动驾驶OS主要用于车辆底盘与动力控制，以实现油门、转向、换挡、刹车等基本行驶功能。自动驾驶OS是当前重点研发的L3及以上级别自动驾驶功能的核心，其包含高性能复杂嵌入式系统、人工智能芯片及算法、高速网络、海量数据处理、云机协同等多种行业的融合技术。The autonomous driving OS is mainly used for vehicle chassis and power control to realize basic driving functions such as accelerator, steering, gear shifting, and braking. Autopilot OS is the core of L3 and above-level autopilot functions that are currently under development. It includes high-performance complex embedded systems, artificial intelligence chips and algorithms, high-speed networks, massive data processing, and cloud-machine collaboration. .

智能座舱OS是由不同的座舱电子组合成完整的体系。智能座舱主要分为5大部分：车载信息娱乐系统、流媒体中央后视镜、抬头显示系统HUD、全液晶仪表、车联网模块。智能座舱是通过多屏融合实现人机交互，以液晶仪表、HUD、中控屏及中控车载信息终端、后座HMI娱乐屏、车内外后视镜等为载体，实现语音控制、手势操作等更智能化的交互方式。未来有可能将人工智能、AR、ADAS、VR等技术融入其中。The smart cockpit OS is a complete system composed of different cockpit electronics. The smart cockpit is mainly divided into 5 parts: vehicle infotainment system, streaming media central rearview mirror, head-up display system HUD, full LCD instrument, car networking module. The smart cockpit realizes human-computer interaction through multi-screen integration. It uses LCD instrumentation, HUD, central control panel, central control vehicle information terminal, rear seat HMI entertainment screen, and interior and exterior rearview mirrors as carriers to realize voice control and gesture operations. A smarter way of interacting. In the future, it is possible to integrate artificial intelligence, AR, ADAS, VR and other technologies into it.

工作线程用于调度图形处理器中的流处理器(SP，streaming processor，其为图形处理器中的最基本的处理单元，也称为CUDA core)，图形处理器中的具体的指令和任务都是在SP上处理的。The worker thread is used to schedule the stream processor (SP, streaming processor, which is the most basic processing unit in the graphics processor, also known as CUDA core) in the graphics processor. The specific instructions and tasks in the graphics processor are all is processed on the SP.

工作线程还用于调度图形处理器中的计算单元(SM，streaming multiprocessor，又称图形处理器大核)。SM是由多个SP加上其他的一些资源组成一个图形处理器大核，其中，其他资源如：warp scheduler，register，shared memory等，SM可以看做图形处理器的心脏(对比CPU核心)，register和shared memory是SM的稀缺资源。The worker thread is also used to schedule computing units in the graphics processor (SM, streaming multiprocessor, also known as the large core of the graphics processor). SM is composed of multiple SPs plus some other resources to form a large graphics processor core. Among them, other resources such as: warp scheduler, register, shared memory, etc., SM can be regarded as the heart of the graphics processor (compared to the CPU core), Register and shared memory are scarce resources of SM.

本步骤通过对智能汽车操作系统中的人工智能模型进行至少一次性能测试，以在运行智能汽车操作系统之前，对其中的人工智能模型进行静态的性能测试，得到至少一个静态性能指标，以确定智能汽车操作系统中每一人工智能模型在不同分组，以及对其分配不同的工作线程时，各人工智能模型的性能表现。In this step, at least one performance test is performed on the artificial intelligence model in the smart car operating system, so as to perform a static performance test on the artificial intelligence model in the smart car operating system before running the smart car operating system to obtain at least one static performance index to determine the intelligence The performance of each artificial intelligence model in the automotive operating system when it is divided into different groups and assigned different working threads.

通过识别至少一个静态性能指标中的优先静态指标，以识别出性能表现最优的分组方式和工作线程匹配方式，进而在运行智能汽车系统之前最大限度的优化图形处理器资源的分配合理性，并将最优的分组方式和工作线程匹配方式设为静态分配策略。By identifying the priority static index in at least one static performance index, to identify the grouping method and the worker thread matching method with the best performance, and then optimize the rationality of the allocation of graphics processor resources to the greatest extent before running the smart car system, and Set the optimal grouping method and worker thread matching method as a static allocation strategy.

在一个优选的实施例中，对智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标，包括：In a preferred embodiment, at least one performance test is carried out to the artificial intelligence model in the smart car operating system to obtain at least one static performance index, including:

获取至少一个静态策略，获取至少一个静态策略，根据每一静态策略，分别对智能汽车操作系统中的人工智能模型进行分组并向每组人工智能模型分配工作线程，得到至少一个静态样本，其中，静态策略中包括划分规则和资源分配规则，划分规则用于对智能汽车操作系统中的人工智能模型进行分组，资源分配规则用于向每组人工智能模型分配工作线程。Obtaining at least one static strategy, obtaining at least one static strategy, grouping the artificial intelligence models in the smart car operating system according to each static strategy and assigning work threads to each group of artificial intelligence models to obtain at least one static sample, wherein, The static strategy includes division rules and resource allocation rules. The division rules are used to group the artificial intelligence models in the smart car operating system, and the resource allocation rules are used to allocate working threads to each group of artificial intelligence models.

具体地，根据每一静态策略，分别对智能汽车操作系统中的人工智能模型进行分组并向每组人工智能模型分配工作线程，得到至少一个静态样本，包括：Specifically, according to each static strategy, the artificial intelligence models in the smart car operating system are grouped respectively and work threads are assigned to each group of artificial intelligence models to obtain at least one static sample, including:

根据静态策略中的划分规则对智能汽车操作系统中每一人工智能模型进行分组，得到至少一个测试组，其中，测试组中至少具有一个人工智能模型；Group each artificial intelligence model in the smart car operating system according to the division rules in the static strategy to obtain at least one test group, wherein there is at least one artificial intelligence model in the test group;

获取图形处理器中的至少一个工作线程，根据静态策略中的资源分配规则向每一测试组分配一个工作线程；Obtain at least one worker thread in the graphics processor, and assign a worker thread to each test group according to the resource allocation rules in the static policy;

汇总至少一个测试组及与每一测试组对应的工作线程，形成与静态策略对应的静态样本。Summarize at least one test group and the worker threads corresponding to each test group to form a static sample corresponding to the static strategy.

示例性地，例如：将智能汽车操作系统中的人工智能模型包括：模型1、模型2、模型3、模型4、模型5。Exemplarily, for example: the artificial intelligence models in the smart car operating system include: Model 1, Model 2, Model 3, Model 4, and Model 5.

如果静态策略1是将模型两两分组，如果有剩余则单独分组，那么将得到：测试组1：模型1、模型2；测试组2：模型3、模型4；测试组3：模型5。If the static strategy 1 is to group the models in pairs, if there are any leftovers, group them separately, then you will get: test group 1: model 1, model 2; test group 2: model 3, model 4; test group 3: model 5.

如果静态策略2是将某一指定的模型单独列为一组(例如：模型3)，其他模型两两一组，那么将得到：测试组1：模型3；测试组2：模型1、模型2；测试组3：模型4、模型5。If static strategy 2 is to list a specified model as a group (for example: model 3) and other models in pairs, then you will get: test group 1: model 3; test group 2: model 1, model 2 ;Test Group 3: Model 4, Model 5.

图形处理器中具有至少一个工作线程，每一工作线程分别调用与其对应的流处理器或计算单元，因此，各工作线程能够调用的图形处理器资源均是不同的。假设图形处理器中的工作线程包括：线程1和线程2；那么，静态策略1将测试组1和测试组2分配给线程1，将测试组3分配给线程2；静态策略2将测试组1分配给线程1，将测试组2和测试组3分配给线程2，以此类推。There is at least one worker thread in the graphics processor, and each worker thread invokes its corresponding stream processor or computing unit. Therefore, the graphics processor resources that can be invoked by each worker thread are different. Assume that the working threads in the graphics processor include: thread 1 and thread 2; then, static strategy 1 assigns test group 1 and test group 2 to thread 1, and assigns test group 3 to thread 2; static strategy 2 assigns test group 1 Assign to thread 1, assign test group 2 and test group 3 to thread 2, and so on.

因此，基于上述举例，可以得到与静态策略1对应的静态样本：模型1、模型2、模型3、模型4分配线程1，模型5分配线程2，及与静态策略2对应的静态样本：模型3分配线程1，模型1、模型2、模型4和模型5分配线程2的静态样本，以此类推，在此不做赘述。Therefore, based on the above example, static samples corresponding to static strategy 1 can be obtained: model 1, model 2, model 3, model 4 assigning thread 1, model 5 assigning thread 2, and static samples corresponding to static strategy 2: model 3 Thread 1 is allocated, model 1, model 2, model 4, and model 5 allocate static samples of thread 2, and so on, which will not be described here.

进一步地，根据静态策略中的划分规则对智能汽车操作系统中每一人工智能模型进行分组，得到至少一个测试组，包括：Further, according to the division rules in the static strategy, each artificial intelligence model in the smart car operating system is grouped to obtain at least one test group, including:

若确定智能汽车操作系统中具有至少一个有向无环图，则将属于同一有向无环图中的人工智能模型划分为一个测试组，其中，有向无环图反映了智能汽车操作系统中的两个或两个以上的人工智能模型之间的逻辑关系；If it is determined that there is at least one directed acyclic graph in the smart car operating system, the artificial intelligence models belonging to the same directed acyclic graph are divided into a test group, wherein the directed acyclic graph reflects the The logical relationship between two or more artificial intelligence models;

若确定智能汽车操作系统中具有不属于有向无环图的其他人工智能模型，则根据每一其他人工智能模型的模型属性数据对其他人工智能模型进行分组，得到至少一个测试组，其中，模型属性数据描述了人工智能模型为完成指定的任务所消耗的算力；If it is determined that there are other artificial intelligence models that do not belong to the directed acyclic graph in the smart car operating system, then group other artificial intelligence models according to the model attribute data of each other artificial intelligence model to obtain at least one test group, wherein the model Attribute data describes the computing power consumed by the artificial intelligence model to complete the specified task;

若确定智能汽车操作系统中不具有有向无环图，则根据每一人工智能模型的模型属性数据对智能汽车操作系统中的人工智能模型进行分组，得到至少一个测试组。If it is determined that there is no directed acyclic graph in the smart car operating system, the artificial intelligence models in the smart car operating system are grouped according to the model attribute data of each artificial intelligence model to obtain at least one test group.

示例性地，如果确定智能汽车操作系统中的有向无环图包括：第一图和第二图，第一图表征模型1指向模型2，第二图表征模型3指向模型4，那么，将模型1和模型2划分为一个测试组，将模型3和模型4划分为一个测试组。Exemplarily, if it is determined that the directed acyclic graph in the smart car operating system includes: a first graph and a second graph, the first graph represents model 1 pointing to model 2, and the second graph represents model 3 pointing to model 4, then, Model 1 and Model 2 are divided into a test group, and Model 3 and Model 4 are divided into a test group.

如果确定智能汽车操作系统中具有不属于有向无环图的其他人工智能模型，如模型5和模型6，模型5和模型6既不属于第一图，也不属于第二图，那么根据模型5和模型6的属性数据对模型5进行分组。If it is determined that there are other artificial intelligence models in the smart car operating system that do not belong to the directed acyclic graph, such as model 5 and model 6, and model 5 and model 6 neither belong to the first graph nor to the second graph, then according to the model 5 and the attribute data of model 6 to group model 5.

模型属性数据描述了人工智能模型为实现指定任务所消耗的算力；例如：模型5用于实现的指定任务是雷达感知，其所消耗的算力是M1,；模型6是用于实现的指定任务是视觉感知，其所消耗的算力是M2。The model attribute data describes the computing power consumed by the artificial intelligence model to achieve the specified task; for example: the specified task used by model 5 is radar perception, and the computing power consumed by it is M1; model 6 is the specified task used to realize The task is visual perception, and the computing power it consumes is M2.

需要说明的是，有向无环图是由有限个顶点和“有向边”组成，从任意顶点出发，经过若干条有向边，都无法回到该顶点，这种图就是有向无环图，于本实施例中，人工智能模型为有向无环图中的顶点，有向无环图中的有向边用于描述两个人工智能模型之间的关联关系，例如：依赖关系、关联关系、聚合关系、组合关系等；属于同一有向无环图的人工智能模型相互依赖，以有向无环图中位于上一位人工智能模型实现的指定任务，为位于下一位的人工智能模型的输入，使得各人工智能模型依次执行，并最终实现组合任务，该组合任务是基于有向无环图中各人工智能模型实现的指定任务所完成的总任务。It should be noted that a directed acyclic graph is composed of a finite number of vertices and "directed edges". Starting from any vertex, passing through several directed edges, it is impossible to return to the vertex. This kind of graph is a directed acyclic graph. Graph, in this embodiment, the artificial intelligence model is a vertex in a directed acyclic graph, and the directed edge in the directed acyclic graph is used to describe the relationship between two artificial intelligence models, such as: dependency, Association relationship, aggregation relationship, combination relationship, etc.; the artificial intelligence models belonging to the same directed acyclic graph are interdependent, and the specified tasks realized by the artificial intelligence model in the previous directed acyclic graph are the artificial intelligence models in the next one. The input of the intelligent model makes each artificial intelligence model execute in sequence, and finally realizes the combined task, which is the total task completed based on the specified tasks realized by each artificial intelligence model in the directed acyclic graph.

可选的，根据模型属性信息模型属性数据对人工智能模型进行分组，包括：Optionally, artificial intelligence models are grouped according to model attribute information and model attribute data, including:

若确定两个或两个以上的人工智能模型的模型属性数据之和，未超过预置的算力阈值，则将两个或两个以上的人工智能模型划分为一个测试组；If it is determined that the sum of the model attribute data of two or more artificial intelligence models does not exceed the preset computing power threshold, then divide the two or more artificial intelligence models into a test group;

若确定第一人工智能模型的模型属性数据与智能汽车操作系统中其他的人工智能模型的模型属性数据之和超过算力预置，则将第一人工智能模型划为一个测试组，其中个，第一人工智能模型是智能汽车操作系统中的一个人工智能模型。If it is determined that the sum of the model attribute data of the first artificial intelligence model and the model attribute data of other artificial intelligence models in the smart car operating system exceeds the preset computing power, the first artificial intelligence model is divided into a test group, wherein, The first artificial intelligence model is an artificial intelligence model in the smart car operating system.

如果模型5和模型6消耗的算力之和超过了预置的算力阈值M3，则将模型5设为一个测试组，模型6设为一个测试组；If the sum of the computing power consumed by Model 5 and Model 6 exceeds the preset computing power threshold M3, set Model 5 as a test group and Model 6 as a test group;

如果模型5和模型6消耗的算力之和未超过算力预置M3，则将模型5和模型6共同设为一个测试组。If the sum of the computing power consumed by model 5 and model 6 does not exceed the computing power preset M3, then set model 5 and model 6 together as a test group.

如果确定智能汽车操作系统中不具有有向无环图，那么，根据模型属性信息模型属性数据对人工智能模型进行分组，实现对模型1、模型2、模型3、模型4、模型5和模型6进行分组，得到至少一个测试组。If it is determined that there is no directed acyclic graph in the smart car operating system, then the artificial intelligence models are grouped according to the model attribute information model attribute data, and the model 1, model 2, model 3, model 4, model 5 and model 6 are realized Perform grouping to obtain at least one test group.

将预置的测试实例录入每一静态样本中的人工智能模型中，并通过每一静态样本中的人工智能模型分配到的工作线程，运行人工智能模型中的测试实例，以对每一静态样本进行性能测试，并得到分别与至少一个静态策略对应的至少一个静态性能指标，其中，测试实例是用于对人工智能模型进行性能测试的测试用例。Enter the preset test instance into the artificial intelligence model in each static sample, and run the test instance in the artificial intelligence model through the worker thread assigned to the artificial intelligence model in each static sample to test each static sample Performance testing is performed, and at least one static performance index corresponding to at least one static strategy is obtained, wherein the test instance is a test case for performance testing of the artificial intelligence model.

在一个优选的实施例中，识别至少一个静态性能指标中的优先静态指标，包括：In a preferred embodiment, identifying priority static indicators among at least one static performance indicator comprises:

提取每一静态性能指标中的第一指标元素，对第一指标元素进行排序得到目标序列，其中，静态性能指标中具有至少一个静态指标元素，静态指标元素反映了人工智能模型在性能测试中的一个性能维度上的性能表现，第一指标元素是静态性能指标中的一个静态指标元素。Extract the first index element in each static performance index, sort the first index elements to obtain the target sequence, wherein, there is at least one static index element in the static performance index, and the static index element reflects the performance of the artificial intelligence model in the performance test. For performance in a performance dimension, the first index element is a static index element in the static performance index.

示例性地，静态性能指标中的指标元素包括：运行时间、内存占用、SM利用率、功耗。Exemplarily, the index elements in the static performance index include: running time, memory occupation, SM utilization rate, and power consumption.

假设基于三个静态策略生成了三个静态性能指标：静态性能指标1，静态性能指标2，静态性能指标3，第一指标元素的运行时间，那么将得到目标序列：Assuming that three static performance indicators are generated based on three static strategies: static performance indicator 1, static performance indicator 2, static performance indicator 3, and the running time of the first indicator element, then the target sequence will be obtained:

运行时间的目标序列target sequence at run time 运行时间operation hours 静态性能指标1Static performance index 1 0.1s0.1s 静态性能指标2Static performance index 2 0.2s0.2s 静态性能指标3Static performance index 3 0.3s0.3s

根据每一第一指标元素在目标序列中的位次，确定每一第一指标元素的性能值，其中，性能值反映了第一指标元素的性能优劣程度。According to the position of each first index element in the target sequence, the performance value of each first index element is determined, wherein the performance value reflects the degree of performance of the first index element.

于本实施例中，位次越高表征第一指标元素性能越好，位次越低表征第一指标元素的性能越差，因此，运行时间的目标序列为升序排列，其表征运行时间越短，指标元素性能越好；内存占用的目标序列为降序排列，其表征内存占用越小，指标元素性能越好；SM利用率的目标序列为降序排列，其表征SM占用率越高，指标元素性能越好；功耗的目标序列为升序排列，其表征功耗越小，指标元素性能越好。In this embodiment, the higher the rank, the better the performance of the first index element, and the lower the rank, the worse the performance of the first index element. Therefore, the target sequence of the running time is arranged in ascending order, which indicates that the running time is shorter , the better the performance of the index element; the target sequence of memory occupation is in descending order, which indicates that the smaller the memory occupation, the better the performance of the index element; the target sequence of SM utilization is in descending order, which indicates that the higher the SM occupancy rate, the better the performance of the index element. The better; the target sequence of power consumption is in ascending order, and the smaller the power consumption, the better the performance of the index element.

根据每一静态性能指标中各静态指标元素的性能值，得到每一静态性能指标的综合性能值。According to the performance value of each static index element in each static performance index, the comprehensive performance value of each static performance index is obtained.

示例性地，基于上述举例：假设第一位次性能值为3，第二位次性能值为2，第三位次性能值为1，将各性能指标中各静态指标元素的性能值带入预置的加权函数中，计算该加权函数得到综合性能值。Exemplarily, based on the above example: assuming that the performance value of the first rank is 3, the performance value of the second rank is 2, and the performance value of the third rank is 1, the performance value of each static index element in each performance index is brought into In the preset weighting function, calculate the weighting function to obtain the comprehensive performance value.

加权函数为：S＝a*x+b*y+c*z+d*mThe weighting function is: S=a*x+b*y+c*z+d*m

其中，x是运行时间的性能值，a是运行时间的权重；y是内存占用的性能值，b是内存占用的权重；z是SM占用率的性能值，c是SM占用率的权重；m是功耗的性能值，d是功耗的权重；S是综合性能值。Among them, x is the performance value of running time, a is the weight of running time; y is the performance value of memory usage, b is the weight of memory usage; z is the performance value of SM occupancy rate, c is the weight of SM occupancy rate; m is the performance value of power consumption, d is the weight of power consumption; S is the comprehensive performance value.

S202：根据静态分配策略，为智能汽车操作系统中的每一人工智能模型分配图形处理器资源。S202: Allocate graphics processor resources for each artificial intelligence model in the smart car operating system according to the static allocation policy.

本步骤中，通过优先静态指标对应的静态分配策略，向每一人工智能模型分配图形处理器资源，以实现在智能汽车操作系统在运行之前，已为各人工智能模型分配了性能最优配置的图形处理器资源。In this step, the graphics processor resources are allocated to each artificial intelligence model through the static allocation strategy corresponding to the priority static index, so as to realize that each artificial intelligence model has been allocated with the optimal performance configuration before the operation of the intelligent vehicle operating system. Graphics processor resources.

在一个优选的实施例中，根据静态分配策略，为智能汽车操作系统中的每一人工智能模型分配图形处理器资源，包括：In a preferred embodiment, according to the static allocation strategy, allocate graphics processor resources for each artificial intelligence model in the intelligent vehicle operating system, including:

根据静态分配策略中的划分规则，对智能汽车操作系统中的人工智能模型进行分组，得到至少一个运行组；According to the division rules in the static allocation strategy, the artificial intelligence models in the smart car operating system are grouped to obtain at least one operation group;

获取图形处理器中的至少一个工作线程，根据静态分配策略中的资源分配规则，向每一运行组分配分配一个工作线程，以向每一运行组中的每一人工智能模型资源分配图形处理器资源，其中，工作线程是用于调用图形处理器中的图形处理器资源运行人工智能模型的序列。Obtain at least one worker thread in the graphics processor, and assign a worker thread to each operation group according to the resource allocation rules in the static allocation strategy, so as to allocate a graphics processor to each artificial intelligence model resource in each operation group resources, where the worker thread is a sequence for invoking graphics processor resources in the graphics processor to run the artificial intelligence model.

S203：运行智能汽车操作系统中的至少一个人工智能模型，并监控运行的人工智能模型的动态性能指标，其中，动态性能指标反映了人工智能模型在调用分配到的图形处理器资源进行运算时的性能表现。S203: Run at least one artificial intelligence model in the smart car operating system, and monitor the dynamic performance index of the running artificial intelligence model, wherein the dynamic performance index reflects the performance of the artificial intelligence model when calling the allocated graphics processor resources for calculation performance.

本步骤中，通过性能采集模块监控运行的人工智能模型的动态性能指标，动态性能指标中的指标元素包括：CPU使用率、内存占用率、磁盘IO、系统平均负载、延迟、帧率。In this step, monitor the dynamic performance indicators of the running artificial intelligence model through the performance acquisition module. The indicator elements in the dynamic performance indicators include: CPU usage, memory usage, disk IO, system average load, delay, and frame rate.

采用kylinTOP测试与监控平台、或LoadRunner、或kylinPET、或Apache JMeter、或NeoLoad、或WebLOAD、或Loadster、或Loadstorm、或Load impact、或OpenSTA、或Telegraf作为性能采集模块。Use kylinTOP testing and monitoring platform, or LoadRunner, or kylinPET, or Apache JMeter, or NeoLoad, or WebLOAD, or Loadster, or Loadstorm, or Load impact, or OpenSTA, or Telegraf as the performance acquisition module.

S204：根据动态性能指标生成动态分配策略，根据动态分配策略调整每一人工智能模型分配到的图形处理器资源，其中，动态分配策略用于根据人工智能模型在运行时的性能表现，对运行的人工智能模型分配图形处理器资源。S204: Generate a dynamic allocation strategy according to the dynamic performance index, and adjust the graphics processor resources allocated to each artificial intelligence model according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used to optimize the running performance of the artificial intelligence model according to the performance of the artificial intelligence model at runtime. AI models allocate GPU resources.

本步骤中，根据动态性能指标生成动态分配策略，以实现基于当前在运行时的人工智能模型的性能表现对人工智能模型分配图形处理器资源，确保各运行的人工智能模型均有足够的图形处理器资源用于调用，解决了人工智能模型生成的推理请求在图形处理器的工作线程的调用上产生冲突，避免了多个请求同时争抢图形处理器中工作线程的情况，保证了自动驾驶场景中的多个人工智能模型的性能表现。In this step, a dynamic allocation strategy is generated according to the dynamic performance index, so as to allocate graphics processor resources to the AI model based on the performance of the AI model currently running, and ensure that each running AI model has enough graphics processing Server resources are used to call, which solves the conflict between the reasoning requests generated by the artificial intelligence model and the calling of the working thread of the graphics processor, avoids the situation where multiple requests compete for the working thread of the graphics processor at the same time, and ensures the automatic driving scene The performance of multiple artificial intelligence models in .

在一个优选的实施例中，根据动态性能指标生成动态分配策略，包括：In a preferred embodiment, generating a dynamic allocation strategy according to a dynamic performance index includes:

提取动态性能指标中的第二指标元素，并获取与第二指标元素对应的指标规则，其中，动态性能指标中具有至少一个指标元素，第二指标元素是动态性能指标中的一个指标元素，指标规则是用于定义指标元素正常和异常的计算机规则。Extract the second index element in the dynamic performance index, and obtain the index rule corresponding to the second index element, wherein the dynamic performance index has at least one index element, the second index element is an index element in the dynamic performance index, and the index Rules are computer rules that define what is normal and abnormal for an indicator element.

若确定第二指标元素符合指标规则，则将第二指标元素设为正常指标元素。If it is determined that the second index element conforms to the index rule, the second index element is set as a normal index element.

若确定第二指标元素不符合指标规则，则将第二指标元素设为异常指标元素。If it is determined that the second index element does not conform to the index rule, the second index element is set as an abnormal index element.

若确定动态性能指标中的正常指标元素的数量未达到预置的正常阈值，或异常指标元素的数量达到预置的异常阈值，则确定动态性能指标为异常性能指标。If it is determined that the number of normal index elements in the dynamic performance index does not reach the preset normal threshold, or the number of abnormal index elements reaches the preset abnormal threshold, then it is determined that the dynamic performance index is an abnormal performance index.

若确定动态性能指标中的正常指标元素的数量达到预置的正常阈值，或异常指标元素的数量未达到预置的异常阈值，则确定动态性能指标为正常性能指标。If it is determined that the number of normal index elements in the dynamic performance index reaches a preset normal threshold, or the number of abnormal index elements does not reach a preset abnormal threshold, then it is determined that the dynamic performance index is a normal performance index.

根据正常性能指标和异常性能指标生成动态分配策略。Generate dynamic allocation policies based on normal performance metrics and abnormal performance metrics.

示例性地，获取到的动态性能指标包括：CPU使用率、内存占用率、磁盘IO、系统平均负载、延迟、帧率，假设第二指标元素是系统平均负载，其中，系统平均负载用于描述图形处理器的使用率，当系统平均负载等于1.0时，表示图形处理器使用率最高；当系统平均负载小于1.0时，表示图形处理器使用率处于空闲状态；当系统平均负载大于1.0时，表示图形处理器使用率已经超过负荷。Exemplarily, the obtained dynamic performance indicators include: CPU usage, memory usage, disk IO, system average load, delay, and frame rate, assuming that the second indicator element is the system average load, wherein the system average load is used to describe Graphics processor usage. When the system average load is equal to 1.0, it means that the graphics processor usage is the highest; when the system average load is less than 1.0, it means that the graphics processor usage is idle; when the system average load is greater than 1.0, it means Graphics processor usage has been exceeded.

假设负载的指标规则是：若系统平均负载属于【0,1】，则判定系统平均负载为正常指标元素；若系统平均负载不属于【0,1】，则判定系统平均负载为异常指标元素。The index rule of hypothetical load is: if the average system load belongs to [0,1], it is determined that the average system load is a normal index element; if the average system load does not belong to [0,1], it is determined that the average system load is an abnormal index element.

假设动态性能指标中具有6个，正常阈值为4，异常阈值为3。Assume that there are 6 dynamic performance indicators, the normal threshold is 4, and the abnormal threshold is 3.

如果动态性能指标具有5个正常指标元素，1个异常指标元素，则判定动态性能指标为正常性能指标。If the dynamic performance index has 5 normal index elements and 1 abnormal index element, it is determined that the dynamic performance index is a normal performance index.

如果动态性能指标具有2个正常指标元素，4个异常指标元素，则判定动态性能指标为异常性能指标。If the dynamic performance index has 2 normal index elements and 4 abnormal index elements, it is determined that the dynamic performance index is an abnormal performance index.

具体地，根据正常性能指标和异常性能指标生成动态分配策略，包括：Specifically, a dynamic allocation strategy is generated according to normal performance indicators and abnormal performance indicators, including:

将正常性能指标对应的人工智能模型设为正常模型，将异常性能指标对应的人工智能模型设为异常模型，将正常模型所在的运行组和智能汽车操作系统中未运行的人工智能模型所在的运行组设为正常组，将异常模型所在的运行组为异常组；Set the artificial intelligence model corresponding to the normal performance index as the normal model, set the artificial intelligence model corresponding to the abnormal performance index as the abnormal model, and set the operating group where the normal model is located and the operating group where the artificial intelligence model that is not running in the smart car operating system is located. The group is set as the normal group, and the operation group where the abnormal model is located is the abnormal group;

若确定异常模型与异常组中其他的人工智能模型之间具有逻辑关系；则调整静态分配策略或动态分配策略中的划分规则，使调整后的划分规则用于将异常组中的独立模型调整到一个正常组；和/或调整静态分配策略或动态分配策略中的资源分配规则，使调整后的资源分配规则用于将异常组对应的工作线程，调整为一个正常组对应的工作线程；其中，独立模型是异常组中与其他的人工智能模型之间不具有逻辑关系的人工智能模型；If it is determined that there is a logical relationship between the abnormal model and other artificial intelligence models in the abnormal group; then adjust the division rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted division rules are used to adjust the independent models in the abnormal group to A normal group; and/or adjust the resource allocation rules in the static allocation policy or the dynamic allocation policy, so that the adjusted resource allocation rules are used to adjust the worker threads corresponding to the abnormal group to the worker threads corresponding to a normal group; wherein, An independent model is an artificial intelligence model that has no logical relationship with other artificial intelligence models in the abnormal group;

若确定异常模型与异常组中其他的人工智能模型之间不具有逻辑关系；则调整静态分配策略或动态分配策略中的划分规则，使调整后的划分规则用于将异常模型调整到一个正常组；和/或调整静态分配策略或动态分配策略中的资源分配规则，使调整后的资源分配规则用于将异常组对应的工作线程，调整为一个正常组对应的工作线程；If it is determined that there is no logical relationship between the abnormal model and other artificial intelligence models in the abnormal group; adjust the division rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted division rules are used to adjust the abnormal model to a normal group ; and/or adjust the resource allocation rules in the static allocation strategy or the dynamic allocation strategy, so that the adjusted resource allocation rules are used to adjust the worker threads corresponding to the abnormal group to the worker threads corresponding to a normal group;

示例性地，异常组中运行的异常模型为模型1，异常组中包括：模型1、模型2和模型3，假设模型1与模型2之间具有逻辑关系，例如：依赖关系、关联关系、聚合关系、组合关系等，因此，此时的模型3为具有独立关系的人工智能模型。Exemplarily, the exception model running in the exception group is model 1, and the exception group includes: model 1, model 2, and model 3, assuming that there is a logical relationship between model 1 and model 2, such as: dependency relationship, association relationship, aggregation relationship, combination relationship, etc. Therefore, Model 3 at this time is an artificial intelligence model with independent relationships.

假设异常组对应的工作线程为线程1；正常组1包括：模型4和模型5，其对应的工作线程为线程2；正常组2包括：模型6，其对应的工作线程为线程3，其中，模型6是未运行的人工智能模型。可以将模型3调整到正常组1或正常组2，也可以将线程2或线程3分配给异常组。Assume that the working thread corresponding to the abnormal group is thread 1; the normal group 1 includes: model 4 and model 5, and the corresponding working thread is thread 2; the normal group 2 includes: model 6, and the corresponding working thread is thread 3, among which, Model 6 is a non-running AI model. Model 3 can be tuned to Normal Group 1 or Normal Group 2, and Thread 2 or Thread 3 can be assigned to the Abnormal Group.

异常组中包括：模型1、模型2和模型3，并相互之间不具有逻辑关系，假设异常模型为模型1，基于上述举例，可以将模型2和/或模型3调整到正常组1或正常组2，也可以将线程2或线程3分配给异常组。The abnormal group includes: model 1, model 2 and model 3, and there is no logical relationship among them. Assume that the abnormal model is model 1. Based on the above example, model 2 and/or model 3 can be adjusted to normal group 1 or normal Group 2, Thread 2 or Thread 3 can also be assigned to exception groups.

实施例2：Example 2:

请参阅图3，本申请提供一种智能汽车操作系统的图形处理器的资源调度装置3，包括：Referring to Fig. 3, the present application provides a resource scheduling device 3 of a graphic processor of an intelligent vehicle operating system, including:

静态测试模块31，用于对智能汽车操作系统中的人工智能模型进行至少一次性能测试，得到至少一个静态性能指标；并识别至少一个静态性能指标中的优先静态指标，将优先静态指标对应的静态策略设为静态分配策略，其中，智能汽车操作系统中具有至少一个人工智能模型，人工智能模型用于实现指定任务，指定任务用于实现汽车的自动驾驶，静态性能指标反映了人工智能模型在性能测试中的性能表现，优先静态指标为满足预置的优先规则的静态性能指标，静态策略是用于对人工智能模型进行分组，及对分组后的人工智能模型分配工作线程的计算机策略，工作线程是用于调度图形处理器中的流处理器和/或计算单元的序列图形处理器；The static test module 31 is used to perform at least one performance test on the artificial intelligence model in the smart car operating system to obtain at least one static performance index; The strategy is set as a static allocation strategy, wherein the smart car operating system has at least one artificial intelligence model, the artificial intelligence model is used to achieve the specified task, the specified task is used to realize the automatic driving of the car, and the static performance index reflects the performance of the artificial intelligence model. The performance in the test, the priority static index is the static performance index that meets the preset priority rules, and the static strategy is a computer strategy for grouping artificial intelligence models and assigning work threads to the grouped artificial intelligence models. is a sequential GPU for scheduling stream processors and/or compute units in the GPU;

静态分配模块32，用于根据静态分配策略，为智能汽车操作系统中的每一人工智能模型分配图形处理器资源；The static allocation module 32 is used to allocate graphics processor resources for each artificial intelligence model in the smart car operating system according to the static allocation strategy;

动态监测模块33，用于运行智能汽车操作系统中的至少一个人工智能模型，并监控运行的人工智能模型的动态性能指标，其中，动态性能指标反映了人工智能模型在调用分配到的图形处理器资源进行运算时的性能表现；The dynamic monitoring module 33 is used to run at least one artificial intelligence model in the smart car operating system, and monitor the dynamic performance index of the artificial intelligence model running, wherein the dynamic performance index reflects the graphics processor allocated to the artificial intelligence model when calling The performance of resources when computing;

动态分配模块34，用于根据动态性能指标生成动态分配策略，根据动态分配策略调整每一人工智能模型分配到的图形处理器资源，其中，动态分配策略用于根据人工智能模型在运行时的性能表现，对运行的人工智能模型分配图形处理器资源。The dynamic allocation module 34 is used to generate a dynamic allocation strategy according to the dynamic performance index, and adjust the graphics processor resources allocated to each artificial intelligence model according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used to determine the performance of the artificial intelligence model at runtime Performance, allocating GPU resources to running AI models.

实施例3：Example 3:

为实现上述目的，本申请还提供一种计算机设备4，包括：处理器42以及与处理器42通信连接的存储器41；存储器存储计算机执行指令；To achieve the above purpose, the present application also provides a computer device 4, including: a processor 42 and a memory 41 communicatively connected to the processor 42; the memory stores computer-executed instructions;

处理器执行存储器41存储的计算机执行指令，以实现上述的图形处理器的资源调度方法，其中，图形处理器的资源调度装置的组成部分可分散于不同的计算机设备中，计算机设备4可以是执行程序的智能手机、平板电脑、笔记本电脑、台式计算机、机架式服务器、刀片式服务器、塔式服务器或机柜式服务器(包括独立的服务器，或者多个应用服务器所组成的服务器集群)等。本实施例的计算机设备至少包括但不限于：可通过系统总线相互通信连接的存储器41、处理器42，如图4所示。需要指出的是，图4仅示出了具有组件-的计算机设备，但是应理解的是，并不要求实施所有示出的组件，可以替代的实施更多或者更少的组件。本实施例中，存储器41(即可读存储介质)包括闪存、硬盘、多媒体卡、卡型存储器(例如，SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等。在一些实施例中，存储器41可以是计算机设备的内部存储单元，例如该计算机设备的硬盘或内存。在另一些实施例中，存储器41也可以是计算机设备的外部存储设备，例如该计算机设备上配备的插接式硬盘，智能存储卡(Smart Media Card,SMC)，安全数字(SecureDigital,SD)卡，闪存卡(Flash Card)等。当然，存储器41还可以既包括计算机设备的内部存储单元也包括其外部存储设备。本实施例中，存储器41通常用于存储安装于计算机设备的操作系统和各类应用软件，例如实施例三的图形处理器的资源调度装置的程序代码等。此外，存储器41还可以用于暂时地存储已经输出或者将要输出的各类数据。处理器42在一些实施例中可以是中央处理器(Central Processing Unit，CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器42通常用于控制计算机设备的总体操作。本实施例中，处理器42用于运行存储器41中存储的程序代码或者处理数据，例如运行图形处理器的资源调度装置，以实现上述实施例的图形处理器的资源调度方法。The processor executes the computer-executed instructions stored in the memory 41 to implement the resource scheduling method of the above-mentioned graphics processor, wherein the components of the resource scheduling device of the graphics processor can be dispersed in different computer devices, and the computer device 4 can be executed Smartphones, tablet computers, laptops, desktop computers, rack servers, blade servers, tower servers, or cabinet servers (including independent servers, or server clusters composed of multiple application servers), etc. The computer device in this embodiment at least includes but is not limited to: a memory 41 and a processor 42 that can be communicatively connected to each other through a system bus, as shown in FIG. 4 . It should be noted that FIG. 4 only shows a computer device with components - but it should be understood that it is not required to implement all the components shown, and more or fewer components may be implemented instead. In this embodiment, the memory 41 (that is, a readable storage medium) includes a flash memory, a hard disk, a multimedia card, a card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Programmable Read Only Memory (PROM), Magnetic Memory, Magnetic Disk, Optical Disk, etc. In some embodiments, the memory 41 may be an internal storage unit of a computer device, such as a hard disk or internal memory of the computer device. In some other embodiments, the memory 41 can also be an external storage device of the computer equipment, such as a plug-in hard disk equipped on the computer equipment, a smart memory card (Smart Media Card, SMC), a secure digital (SecureDigital, SD) card , Flash Card (Flash Card) and so on. Of course, the memory 41 may also include both the internal storage unit of the computer device and its external storage device. In this embodiment, the memory 41 is generally used to store the operating system and various application software installed in the computer equipment, such as the program code of the resource scheduling device of the graphics processor in the third embodiment. In addition, the memory 41 can also be used to temporarily store various types of data that have been output or will be output. The processor 42 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments. The processor 42 is generally used to control the overall operation of the computer device. In this embodiment, the processor 42 is used to run program codes stored in the memory 41 or process data, for example, to run a resource scheduling device for a graphics processor, so as to implement the resource scheduling method for a graphics processor in the above embodiment.

上述以软件功能模块的形式实现的集成的模块，可以存储在一个计算机可读取存储介质中。上述软件功能模块存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)或处理器执行本申请各个实施例方法的部分步骤。应理解，上述处理器可以是中央处理单元(Central Processing Unit，简称CPU)，还可以是其它通用处理器、数字信号处理器(Digital Signal Processor，简称DSP)、专用集成电路(Application Specific Integrated Circuit，简称ASIC)等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合申请所公开的方法的步骤可以直接体现为硬件处理器执行完成，或者用处理器中的硬件及软件模块组合执行完成。存储器可能包含高速RAM存储器，也可能还包括非易失性存储NVM，例如至少一个磁盘存储器，还可以为U盘、移动硬盘、只读存储器、磁盘或光盘等。The above-mentioned integrated modules implemented in the form of software function modules can be stored in a computer-readable storage medium. The above-mentioned software function modules are stored in a storage medium, and include several instructions to make a computer device (which may be a personal computer, server, or network device, etc.) or a processor execute some steps of the methods in various embodiments of the present application. It should be understood that the above-mentioned processor may be a central processing unit (Central Processing Unit, referred to as CPU), and may also be other general-purpose processors, a digital signal processor (Digital Signal Processor, referred to as DSP), an application specific integrated circuit (Application Specific Integrated Circuit, referred to as ASIC) and so on. A general-purpose processor may be a microprocessor, or the processor may be any conventional processor, or the like. The steps of the method disclosed in conjunction with the application can be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor. The storage may include a high-speed RAM memory, and may also include a non-volatile storage NVM, such as at least one disk storage, and may also be a U disk, a mobile hard disk, a read-only memory, a magnetic disk, or an optical disk.

为实现上述目的，本申请还提供一种计算机可读存储介质，如闪存、硬盘、多媒体卡、卡型存储器(例如，SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘、服务器、App应用商城等等，其上存储有计算机执行指令，程序被处理器42执行时实现相应功能。本实施例的计算机可读存储介质用于存储实现图形处理器的资源调度方法的计算机执行指令，被处理器42执行时实现上述实施例的图形处理器的资源调度方法。To achieve the above object, the present application also provides a computer-readable storage medium, such as flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory ( SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, server, App application store, etc., on which storage The computer executes instructions, and the program realizes corresponding functions when executed by the processor 42 . The computer-readable storage medium in this embodiment is used to store computer-executable instructions for implementing the resource scheduling method for a graphics processor, and when executed by the processor 42, implements the resource scheduling method for a graphics processor in the above-mentioned embodiment.

上述存储介质可以是由任何类型的易失性或非易失性存储设备或者它们的组合实现，如静态随机存取存储器(SRAM)，电可擦除可编程只读存储器(EEPROM)，可擦除可编程只读存储器(EPROM)，可编程只读存储器(PROM)，只读存储器(ROM)，磁存储器，快闪存储器，磁盘或光盘。存储介质可以是通用或专用计算机能够存取的任何可用介质。The above-mentioned storage medium can be realized by any type of volatile or non-volatile storage device or their combination, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable In addition to programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.

一种示例性的存储介质耦合至处理器，从而使处理器能够从该存储介质读取信息，且可向该存储介质写入信息。当然，存储介质也可以是处理器的组成部分。处理器和存储介质可以位于专用集成电路(Application Specific Integrated Circuits，简称ASIC)中。当然，处理器和存储介质也可以作为分立组件存在于电子设备或主控设备中。An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be a component of the processor. The processor and the storage medium may be located in application specific integrated circuits (Application Specific Integrated Circuits, ASIC for short). Of course, the processor and the storage medium can also exist in the electronic device or the main control device as discrete components.

本申请提供一种计算机程序产品，包括计算机程序，计算机程序被处理器执行时实现上述的图形处理器的资源调度方法。The present application provides a computer program product, including a computer program, and when the computer program is executed by a processor, the above resource scheduling method for a graphics processor is implemented.

需要说明的是，在本文中，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element.

本领域技术人员在考虑说明书及实践这里公开的发明后，将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化，这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的，本申请的真正范围和精神由下面的权利要求书指出。Other embodiments of the present application will be readily apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any modification, use or adaptation of the application, these modifications, uses or adaptations follow the general principles of the application and include common knowledge or conventional technical means in the technical field not disclosed in the application . The specification and examples are to be considered exemplary only, with a true scope and spirit of the application indicated by the following claims.

应当理解的是，本申请并不局限于上面已经描述并在附图中示出的精确结构，并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求书来限制。It should be understood that the present application is not limited to the precise constructions which have been described above and shown in the accompanying drawings, and various modifications and changes may be made without departing from the scope thereof. The scope of the application is limited only by the appended claims.

Claims

1. A method for scheduling resources of a graphics processor of an intelligent automobile operating system, comprising:

performing at least one performance test on an artificial intelligent model in an intelligent automobile operating system to obtain at least one static performance index; the method comprises the steps of identifying a priority static index in at least one static performance index, setting a static strategy corresponding to the priority static index as a static allocation strategy, wherein an intelligent automobile operating system is provided with at least one artificial intelligent model, the artificial intelligent model is used for realizing a specified task, the specified task is used for realizing automatic driving of an automobile, the static performance index reflects performance of the artificial intelligent model in a performance test, the priority static index is a static performance index meeting a preset priority rule, the static strategy is a computer strategy used for grouping the artificial intelligent model, and allocating working threads to the grouped artificial intelligent model, and the working threads are sequences used for scheduling a flow processor and/or a computing unit in a graphic processor;

Distributing graphics processor resources for each artificial intelligent model in the intelligent automobile operating system according to the static distribution strategy;

running at least one artificial intelligent model in the intelligent automobile operating system, and monitoring dynamic performance indexes of the running artificial intelligent model, wherein the dynamic performance indexes reflect performance of the artificial intelligent model when the distributed graphic processor resources are called for operation;

generating a dynamic allocation strategy according to the dynamic performance index, and adjusting the graphics processor resources allocated to each artificial intelligent model according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used for allocating the graphics processor resources to the running artificial intelligent model according to the performance of the artificial intelligent model during running.

2. The resource scheduling method of a graphic processor according to claim 1, wherein performing at least one performance test on an artificial intelligent model in an intelligent automobile operating system to obtain at least one static performance index comprises:

acquiring at least one static strategy, respectively grouping artificial intelligent models in the intelligent automobile operating system according to each static strategy, and distributing working threads to each group of artificial intelligent models to obtain at least one static sample;

Recording preset test cases into artificial intelligent models in each static sample, and running the test cases in the artificial intelligent models through working threads distributed by the artificial intelligent models in each static sample so as to perform performance test on each static sample and obtain at least one static performance index corresponding to at least one static strategy, wherein the test cases are test cases for performing performance test on the artificial intelligent models.

3. The resource scheduling method of a graphic processor according to claim 2, wherein grouping the artificial intelligent models in the intelligent automobile operating system and assigning a work thread to each group of artificial intelligent models according to each static policy, respectively, to obtain at least one static sample, comprises:

grouping each artificial intelligent model in the intelligent automobile operating system according to the partitioning rule in the static strategy to obtain at least one test group, wherein the test group is provided with at least one artificial intelligent model;

acquiring at least one working thread in the graphic processor, and distributing one working thread to each test group according to a resource distribution rule in the static strategy;

And summarizing at least one test group and the working threads corresponding to each test group to form a static sample corresponding to the static strategy.

4. A method for scheduling resources of a graphics processor according to claim 3, wherein grouping each artificial intelligence model in an intelligent automobile operating system according to a partitioning rule in the static policy, to obtain at least one test group, comprises:

if at least one directed acyclic graph is determined to exist in the intelligent automobile operating system, dividing the artificial intelligent models belonging to the same directed acyclic graph into a test group, wherein the directed acyclic graph reflects the logic relationship between two or more artificial intelligent models in the intelligent automobile operating system;

if it is determined that other artificial intelligence models which do not belong to the directed acyclic graph exist in the intelligent automobile operating system, grouping the other artificial intelligence models according to model attribute data of each other artificial intelligence model to obtain at least one test group, wherein the model attribute data describe calculation power consumed by the artificial intelligence model for realizing a specified task;

And if the intelligent automobile operation system is determined to not have the directed acyclic graph, grouping the artificial intelligent models in the intelligent automobile operation system according to the model attribute data of each artificial intelligent model to obtain at least one test group.

5. The resource scheduling method of a graphics processor of claim 1, wherein identifying a priority static indicator of the at least one static performance indicator comprises:

extracting a first index element in each static performance index, and sequencing the first index element to obtain a target sequence, wherein the static performance index is provided with at least one static index element, the static index element reflects the performance of an artificial intelligent model in one performance dimension in a performance test, and the first index element is one static index element in the static performance index;

determining a performance value of each first index element according to the rank of each first index element in the target sequence, wherein the performance value reflects the performance quality degree of the first index element;

obtaining the comprehensive performance value of each static performance index according to the performance value of each static index element in each static performance index;

And setting the static performance index with the highest comprehensive performance value as a priority static index.

6. The method of resource scheduling for a graphics processor of claim 1, wherein allocating graphics processor resources for each artificial intelligence model in the intelligent vehicle operating system according to the static allocation policy comprises:

grouping the artificial intelligent models in the intelligent automobile operating system according to the partitioning rules in the static allocation strategy to obtain at least one running group;

and acquiring at least one working thread in the graphic processor, and distributing one working thread to each running group according to the resource distribution rule in the static distribution strategy so as to distribute the graphic processor resource to each artificial intelligent model resource in each running group.

7. The resource scheduling method of a graphics processor according to claim 6, wherein generating a dynamic allocation policy according to the dynamic performance index comprises:

extracting a second index element in the dynamic performance index, and acquiring an index rule corresponding to the second index element, wherein the dynamic performance index is provided with at least one index element, the second index element is one index element in the dynamic performance index, and the index rule is a computer rule for defining the normal and abnormal index elements;

If the second index element is determined to accord with the index rule, setting the second index element as a normal index element;

if the second index element is determined not to accord with the index rule, setting the second index element as an abnormal index element;

if the number of the normal index elements in the dynamic performance index is not up to the preset normal threshold value or the number of the abnormal index elements is up to the preset abnormal threshold value, determining the dynamic performance index as an abnormal performance index;

if the number of the normal index elements in the dynamic performance index is determined to reach a preset normal threshold value or the number of the abnormal index elements is determined to not reach a preset abnormal threshold value, determining the dynamic performance index as a normal performance index;

and generating a dynamic allocation strategy according to the normal performance index and the abnormal performance index.

8. The resource scheduling method of a graphic processor according to claim 7, wherein generating a dynamic allocation policy according to the normal performance index and the abnormal performance index comprises:

setting an artificial intelligent model corresponding to the normal performance index as a normal model, setting an artificial intelligent model corresponding to the abnormal performance index as an abnormal model, setting an operation group in which the normal model is located and an operation group in which an artificial intelligent model which does not operate in the intelligent automobile operating system is located as a normal group, and setting the operation group in which the abnormal model is located as an abnormal group;

If the abnormal model and other artificial intelligent models in the abnormal group are determined to have a logic relationship; adjusting the partitioning rule in the static partitioning strategy or the dynamic partitioning strategy to enable the adjusted partitioning rule to be used for adjusting the independent model in the abnormal group to a normal group; and/or adjusting the resource allocation rule in the static allocation policy or the dynamic allocation policy, so that the adjusted resource allocation rule is used for adjusting the working thread corresponding to the abnormal group into a working thread corresponding to the normal group; wherein the independent model is an artificial intelligent model which has no logical relationship with other artificial intelligent models in the anomaly group;

if it is determined that the abnormal model and other artificial intelligence models in the abnormal group do not have a logic relationship; adjusting the partitioning rule in the static allocation strategy or the dynamic allocation strategy to enable the adjusted partitioning rule to be used for adjusting the abnormal model to a normal group; and/or adjusting the resource allocation rule in the static allocation policy or the dynamic allocation policy, so that the adjusted resource allocation rule is used for adjusting the working thread corresponding to the abnormal group into a working thread corresponding to the normal group;

And generating a dynamic allocation strategy according to the adjusted partitioning rule and/or the adjusted resource allocation rule.

9. A resource scheduling device for a graphics processor of an intelligent automobile operating system, comprising:

the static test module is used for performing at least one performance test on the artificial intelligent model in the intelligent automobile operating system to obtain at least one static performance index; the method comprises the steps of identifying a priority static index in at least one static performance index, setting a static strategy corresponding to the priority static index as a static allocation strategy, wherein an intelligent automobile operating system is provided with at least one artificial intelligent model, the artificial intelligent model is used for realizing a specified task, the specified task is used for realizing automatic driving of an automobile, the static performance index reflects performance of the artificial intelligent model in a performance test, the priority static index is a static performance index meeting a preset priority rule, the static strategy is a computer strategy used for grouping the artificial intelligent model, and allocating working threads to the grouped artificial intelligent model, and the working threads are sequence graphic processors used for scheduling a flow processor and/or a computing unit in a graphic processor;

The static allocation module is used for allocating graphics processor resources for each artificial intelligent model in the intelligent automobile operating system according to the static allocation strategy;

the dynamic monitoring module is used for running at least one artificial intelligent model in the intelligent automobile operating system and monitoring dynamic performance indexes of the running artificial intelligent model, wherein the dynamic performance indexes reflect performance of the artificial intelligent model when the distributed graphic processor resource is called for operation;

and the dynamic allocation module is used for generating a dynamic allocation strategy according to the dynamic performance index, and adjusting the graphics processor resources allocated to each artificial intelligent model according to the dynamic allocation strategy, wherein the dynamic allocation strategy is used for allocating the graphics processor resources to the running artificial intelligent model according to the performance of the artificial intelligent model during running.

10. A computer device, comprising: a processor and a memory communicatively coupled to the processor;

the memory stores computer-executable instructions;

the processor executes computer-executable instructions stored in the memory to implement a resource scheduling method for a graphics processor as claimed in any one of claims 1 to 8.