CN117585015A

CN117585015A - Automatic driving optimization methods, control methods, devices, electronic equipment and media

Info

Publication number: CN117585015A
Application number: CN202311571239.XA
Authority: CN
Inventors: 周熙钦; 崔迎杰; 梁振宝; 陈勇; 符茂磊; 万雷鸣; 李海波
Original assignee: Zhejiang Geely Holding Group Co Ltd; Geely Automobile Research Institute Ningbo Co Ltd
Current assignee: Zhejiang Geely Holding Group Co Ltd; Geely Automobile Research Institute Ningbo Co Ltd
Priority date: 2023-11-22
Filing date: 2023-11-22
Publication date: 2024-02-23

Abstract

The application discloses an automatic driving optimization method, a control method, an automatic driving optimization device, electronic equipment and a medium, wherein the automatic driving optimization method is applied to a cognitive decision model, the cognitive decision model comprises a cognitive layer and a decision layer, and the automatic driving optimization method comprises the following steps: acquiring in-and-out monitoring data of a training sample vehicle and a training sample scene label, and acquiring target human driving experience scene cognitive characteristics corresponding to the training sample scene label; extracting features of a cognitive layer of a training sample from the in-vehicle monitoring data and the out-vehicle monitoring data of the training sample through the cognitive layer; and carrying out iterative optimization on the cognitive layer according to the training sample cognitive layer characteristics and the target human driving experience scene cognitive characteristics. The technical problem that the safety of automatic driving is lower in the related art is solved.

Description

Automatic driving optimization methods, control methods, devices, electronic equipment and media

技术领域Technical field

本申请涉及车辆技术领域，尤其涉及一种自动驾驶优化方法、控制方法、装置、电子设备及介质。The present application relates to the field of vehicle technology, and in particular to an automatic driving optimization method, control method, device, electronic equipment and medium.

背景技术Background technique

随着科技的不断发展，算力的持续突破，自动驾驶迅猛发展，自动驾驶可以为人们提供更安全、便捷和高效的出行方式，同时也对整个交通系统和城市规划带来了深远的影响。With the continuous development of technology and continuous breakthroughs in computing power, autonomous driving has developed rapidly. Autonomous driving can provide people with a safer, more convenient and efficient way to travel, and it has also had a profound impact on the entire transportation system and urban planning.

相关技术中自动驾驶架构的关注点在地图、外部交通参与者行为等表层行为信息上，对于存在多层逻辑的复杂场景的理解能力和泛化性不足，例如，本车所处位置位于其他交通参与者的视觉盲区时，由于其他交通参与者可能因无法发现本车而不会进行避让甚至加速，因此本车需要减速以避免发生碰撞，又例如，在高速公路上行驶时，右侧道路前方为货车，货车行驶速度较慢，会对其后方车辆进行压速，货车可能会变道到本车前方，因此要提前减速。而仅关注表层行为信息，通常难以灵活应对这些涉及多个交通参与者以及多层逻辑推理的复杂场景，从而会导致自动驾驶的安全性较低。The focus of the autonomous driving architecture in related technologies is on surface behavioral information such as maps and external traffic participant behaviors. The understanding and generalization of complex scenes with multi-layer logic are insufficient. For example, the location of the vehicle is located in other traffic. In the visual blind spot of the participant, since other traffic participants may not be able to spot the car and will not avoid or even accelerate, the car needs to slow down to avoid a collision. For example, when driving on the highway, the road ahead on the right It is a truck. The truck travels slowly and will decelerate the vehicle behind it. The truck may change lanes to the front of the vehicle, so it is necessary to slow down in advance. However, only focusing on surface behavioral information often makes it difficult to flexibly deal with complex scenarios involving multiple traffic participants and multi-layer logical reasoning, which will lead to lower safety of autonomous driving.

发明内容Contents of the invention

本申请的主要目的在于提供一种自动驾驶优化方法、控制方法、装置、电子设备及介质，旨在解决相关技术中自动驾驶的安全性较低的技术问题。The main purpose of this application is to provide an automatic driving optimization method, control method, device, electronic equipment and medium, aiming to solve the technical problem of low safety of automatic driving in related technologies.

为实现上述目的，本申请提供一种自动驾驶优化方法，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，包括以下步骤：In order to achieve the above purpose, this application provides an automatic driving optimization method. The automatic driving optimization method is applied to a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer, including the following steps:

获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征；Obtain training sample vehicle interior and exterior monitoring data and training sample scene labels, and obtain target human driving experience scene cognitive characteristics corresponding to the training sample scene labels;

通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征；Extract training sample cognitive layer features from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化。The cognitive layer is iteratively optimized according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene.

本申请还提供一种自动驾驶优化方法，所述自动驾驶优化方法采用认知决策模型，所述认知决策模型包括认知层和决策层，所述认知层采用如上所述的自动驾驶优化方法进行预训练，所述自动驾驶优化方法包括以下步骤：This application also provides an automatic driving optimization method. The automatic driving optimization method adopts a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The cognitive layer adopts the automatic driving optimization as described above. The method performs pre-training, and the automatic driving optimization method includes the following steps:

获取待预测车内外监测数据；Obtain monitoring data inside and outside the vehicle to be predicted;

通过所述认知层从所述车内外监测数据中提取出待预测认知层特征，其中，所述待预测认知层特征包括待预测场景认知特征；The cognitive layer features to be predicted are extracted from the vehicle interior and exterior monitoring data through the cognitive layer, where the cognitive layer features to be predicted include the scene cognitive features to be predicted;

将所述待预测认知层特征输入所述决策层，确定自动驾驶控制参数。The cognitive layer features to be predicted are input into the decision-making layer to determine the automatic driving control parameters.

本申请还提供一种自动驾驶优化装置，所述自动驾驶优化装置上部署有认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化装置包括：This application also provides an automatic driving optimization device. A cognitive decision-making model is deployed on the automatic driving optimization device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization device includes:

第一获取模块，用于获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征；The first acquisition module is used to acquire training sample vehicle interior and exterior monitoring data and training sample scene labels, and obtain target human driving experience scene cognitive features corresponding to the training sample scene labels;

第一认知模块，用于通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征；A first cognitive module, configured to extract training sample cognitive layer features from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

优化模块，用于根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化。An optimization module, configured to iteratively optimize the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene.

本申请还提供一种自动驾驶控制装置，所述自动驾驶控制装置上部署有认知决策模型，所述认知决策模型包括认知层和决策层，所述认知层采用如上所述的自动驾驶优化方法进行预训练，所述自动驾驶控制装置包括：The present application also provides an automatic driving control device. A cognitive decision-making model is deployed on the automatic driving control device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The cognitive layer adopts the automatic driving method as described above. The driving optimization method is pre-trained, and the automatic driving control device includes:

第二获取模块，用于获取待预测车内外监测数据；The second acquisition module is used to acquire the vehicle interior and exterior monitoring data to be predicted;

第二认知模块，用于通过所述认知层从所述车内外监测数据中提取出待预测认知层特征，其中，所述待预测认知层特征包括待预测场景认知特征；A second cognitive module, configured to extract the cognitive layer features to be predicted from the vehicle interior and exterior monitoring data through the cognitive layer, where the cognitive layer features to be predicted include the scene cognitive features to be predicted;

决策模块，用于将所述待预测认知层特征输入所述决策层，确定自动驾驶控制参数。A decision-making module, used to input the cognitive layer characteristics to be predicted into the decision-making layer and determine automatic driving control parameters.

本申请还提供一种电子设备，所述电子设备为实体设备，所述电子设备包括：存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的所述的自动驾驶优化方法的程序，所述的自动驾驶优化方法的程序被处理器执行时可实现如上述的自动驾驶优化方法的步骤。This application also provides an electronic device. The electronic device is a physical device. The electronic device includes: a memory, a processor, and the automatic driving optimization stored on the memory and capable of running on the processor. The program of the method. When the program of the automatic driving optimization method is executed by the processor, the steps of the automatic driving optimization method as described above can be implemented.

本申请还提供一种介质，所述介质为计算机可读存储介质，所述计算机可读存储介质上存储有实现自动驾驶优化方法的程序，所述的自动驾驶优化方法的程序被处理器执行时实现如上述的自动驾驶优化方法的步骤。This application also provides a medium. The medium is a computer-readable storage medium. The computer-readable storage medium stores a program for implementing an automatic driving optimization method. When the program for the automatic driving optimization method is executed by a processor, Steps to implement the above-mentioned autonomous driving optimization method.

本申请提供了一种自动驾驶优化方法、控制方法、装置、电子设备及介质，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化方法用于对所述认知决策模型的认知层进行迭代优化。首先，通过获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征，实现了训练样本与人类驾驶经验场景认知特征的匹配，从而可以获取到与训练样本的训练样本场景标签相匹配的目标人类驾驶经验场景认知特征，人类驾驶经验场景认知特征可以反映出人类对当前场景的认知；进而通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征，实现了训练样本认知层特征的获得，训练样本认知层特征可以反映出自动驾驶车辆对当前场景的认知；进而，通过根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化，使得认知层学习人类对于场景的认知，对于涉及多个交通参与者以及多层逻辑推理的复杂场景，相比于自动驾驶车辆仅关注表层相位信息的方式，人类通常可以更全面地结合大量有效信息，进行多层逻辑推理，最终作出更精准地驾驶决策，因此，通过使得认知层学习人类对于场景的认知，可以使得认知层对复杂场景的认知更加全面且具有更强大的推理能力，从而更准确地理解并关注到场景中的风险点，从而引导决策层更准确地避开风险，作出更安全的驾驶决策，提高自动驾驶的安全性。This application provides an automatic driving optimization method, a control method, a device, an electronic device and a medium. The automatic driving optimization method is applied to a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization method is used to iteratively optimize the cognitive layer of the cognitive decision-making model. First, by obtaining the training sample vehicle interior and exterior monitoring data and the training sample scene labels, and obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene labels, the matching of the training samples and the human driving experience scene cognitive characteristics is achieved. Thus, the target human driving experience scene cognitive features that match the training sample scene label of the training sample can be obtained, and the human driving experience scene cognitive features can reflect human beings' cognition of the current scene; and then through the cognitive layer, the target human driving experience scene cognitive features can be obtained. The cognitive layer features of the training sample are extracted from the inside and outside monitoring data of the training sample, thereby achieving the acquisition of the cognitive layer features of the training sample. The cognitive layer features of the training sample can reflect the self-driving vehicle's cognition of the current scene; further, By iteratively optimizing the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene, the cognitive layer learns human cognition of the scene, and for traffic involving multiple traffic participants and complex scenarios with multi-layer logical reasoning. Compared with autonomous vehicles that only focus on surface phase information, humans can usually more comprehensively combine a large amount of effective information, perform multi-layer logical reasoning, and ultimately make more accurate driving decisions. Therefore, , by making the cognitive layer learn human cognition of the scene, the cognitive layer can have a more comprehensive understanding of complex scenes and have stronger reasoning capabilities, so as to more accurately understand and pay attention to the risk points in the scene, thereby Guide decision-makers to avoid risks more accurately, make safer driving decisions, and improve the safety of autonomous driving.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分，示出了符合本申请的实施例，并与说明书一起用于解释本申请的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description, serve to explain the principles of the application.

为了更清楚地说明本申请实施例或相关技术中的技术方案，下面将对实施例或相关技术描述中所需要使用的附图作简单地介绍，显而易见地，对于本领域普通技术人员而言，在不付出创造性劳动性的前提下，还可以根据这些附图得到其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application or related technologies, the following will briefly introduce the drawings needed to describe the embodiments or related technologies. Obviously, for those of ordinary skill in the art, Other drawings can also be obtained based on these drawings without exerting any creative effort.

图1为本申请自动驾驶优化方法的第一实施例的流程示意图；Figure 1 is a schematic flow chart of the first embodiment of the automatic driving optimization method of the present application;

图2为本申请实施例中认知决策模型的一种可实施方式的结构示意图；Figure 2 is a schematic structural diagram of an implementation manner of the cognitive decision-making model in the embodiment of the present application;

图3为本申请自动驾驶优化方法的第二实施例的流程示意图Figure 3 is a schematic flow chart of the second embodiment of the automatic driving optimization method of the present application.

图4为本申请自动驾驶优化方法的第三实施例的流程示意图；Figure 4 is a schematic flow chart of the third embodiment of the automatic driving optimization method of the present application;

图5为本申请实施例中自动驾驶优化装置的结构示意图；Figure 5 is a schematic structural diagram of the automatic driving optimization device in the embodiment of the present application;

图6为本申请实施例中自动驾驶优化方法涉及的硬件运行环境的设备结构示意图。Figure 6 is a schematic diagram of the equipment structure of the hardware operating environment involved in the automatic driving optimization method in the embodiment of the present application.

本申请目的实现、功能特点及优点将结合实施例，参照附图做进一步说明。The realization of the purpose, functional features and advantages of the present application will be further described with reference to the embodiments and the accompanying drawings.

具体实施方式Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述。显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有作出创造性劳动的前提下所得到的所有其它实施例，均属于本发明保护的范围。In order to make the above objects, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without any creative work shall fall within the scope of protection of the present invention.

自动驾驶系统主流的技术方案主要分为两种：基于规则分模块架构和基于数据驱动的端到端认知决策架构。虽然基于规则分模块的架构凭借算力需求低、部署简单、过程可解释等优势已经实现量产落地，但是难以实现全场景域覆盖，分模块求解的结果并不是全局最优解。端到端方案以原始传感器数据为输入，并生成规划和/或低级控制动作作为输出，可以进行联合的、全局的优化，且随着深度学习大模型，特别是基于Transformer结构的人工智能模型迅速发展，此类模型借助注意力机制，具有很强的多场景多目标融合能力，可以理解丰富语义信息和处理更复杂任务，端到端方案的热度日益升高。The mainstream technical solutions for autonomous driving systems are mainly divided into two types: rule-based modular architecture and data-driven end-to-end cognitive decision-making architecture. Although the rule-based sub-module architecture has achieved mass production due to its advantages such as low computing power requirements, simple deployment, and explainable processes, it is difficult to achieve full scene domain coverage, and the result of sub-module solution is not the global optimal solution. The end-to-end solution takes raw sensor data as input and generates planning and/or low-level control actions as output. It can perform joint and global optimization, and with the rapid development of deep learning large models, especially artificial intelligence models based on the Transformer structure, With the help of attention mechanism, this type of model has strong multi-scene and multi-objective fusion capabilities, and can understand rich semantic information and handle more complex tasks. End-to-end solutions are becoming increasingly popular.

相关技术中端到端的自动驾驶架构的关注点在地图、外部交通参与者行为等表层行为信息上，容易忽视一些对预测车辆行为的准确性较低的因素，然而，这些因素有时候是造成复杂场景安全事故的关键因素，例如，本车所处位置位于其他交通参与者的视觉盲区时，视觉盲区对于预测车辆行为的作用较小，但由于其他交通参与者可能因无法发现本车而不会进行避让甚至加速，因此本车需要减速以避免发生碰撞，又例如，在高速公路上行驶时，右侧道路前方为货车，货车行驶速度较慢，在货车没有作出变道行为之前，其对于预测车辆行为的作用较小，但由于行驶速度较慢的货车会对其后方车辆进行压速，货车可能会变道到本车前方，因此可以提前减速避免与变道的货车发生碰撞。由此可知，仅关注表层行为信息，对于存在多层逻辑的复杂场景的理解能力和泛化性不足，通常难以灵活应对这些涉及多个交通参与者以及多层逻辑推理的复杂场景，从而会导致自动驾驶的安全性较低。The end-to-end autonomous driving architecture in related technologies focuses on surface behavioral information such as maps and external traffic participant behaviors. It is easy to overlook some factors that are less accurate in predicting vehicle behavior. However, these factors sometimes cause complications. Key factors in scene safety accidents. For example, when the vehicle is located in the visual blind spot of other traffic participants, the visual blind area is less effective in predicting vehicle behavior. However, other traffic participants may not be able to detect the vehicle because they are unable to detect it. To avoid or even accelerate, the vehicle needs to slow down to avoid a collision. For example, when driving on a highway, there is a truck in front of the road on the right, and the truck is traveling slowly. Before the truck makes a lane change, its prediction Vehicle behavior plays a smaller role, but since a slower truck will slow down the vehicle behind it, the truck may change lanes in front of the vehicle, so it can slow down in advance to avoid a collision with the truck changing lanes. It can be seen from this that only focusing on surface behavioral information has insufficient understanding and generalization of complex scenes with multi-layer logic. It is usually difficult to flexibly deal with these complex scenes involving multiple traffic participants and multi-layer logical reasoning, which will lead to Autonomous driving is less safe.

本申请提出了一种基于人类驾驶经验对认知层进行预训练的自动驾驶优化方法，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化方法用于对所述认知决策模型的认知层进行迭代优化。首先，通过获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征，实现了训练样本与人类驾驶经验场景认知特征的匹配，从而可以获取到与训练样本的训练样本场景标签相匹配的目标人类驾驶经验场景认知特征，人类驾驶经验场景认知特征可以反映出人类对当前场景的认知；进而通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征，实现了训练样本认知层特征的获得，训练样本认知层特征可以反映出自动驾驶车辆对当前场景的认知；进而，通过根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化，使得认知层学习人类对于场景的认知，对于涉及多个交通参与者以及多层逻辑推理的复杂场景，相比于自动驾驶车辆仅关注表层相位信息的方式，人类通常可以更全面地结合大量有效信息，进行多层逻辑推理，最终作出更精准地驾驶决策，因此，通过使得认知层学习人类对于场景的认知，可以使得认知层对复杂场景的认知更加全面且具有更强大的推理能力，从而更准确地理解并关注到场景中的风险点，从而引导决策层作出更准确地驾驶决策，提高自动驾驶的安全性。This application proposes an automatic driving optimization method that pre-trains the cognitive layer based on human driving experience. The automatic driving optimization method is applied to a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization method is used to iteratively optimize the cognitive layer of the cognitive decision-making model. First, by obtaining the training sample vehicle interior and exterior monitoring data and the training sample scene labels, and obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene labels, the matching of the training samples and the human driving experience scene cognitive characteristics is achieved. Thus, the target human driving experience scene cognitive features that match the training sample scene label of the training sample can be obtained, and the human driving experience scene cognitive features can reflect human beings' cognition of the current scene; and then through the cognitive layer, the target human driving experience scene cognitive features can be obtained. The cognitive layer features of the training sample are extracted from the inside and outside monitoring data of the training sample, thereby achieving the acquisition of the cognitive layer features of the training sample. The cognitive layer features of the training sample can reflect the self-driving vehicle's cognition of the current scene; further, By iteratively optimizing the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene, the cognitive layer learns human cognition of the scene, and for traffic involving multiple traffic participants and complex scenarios with multi-layer logical reasoning. Compared with autonomous vehicles that only focus on surface phase information, humans can usually more comprehensively combine a large amount of effective information, perform multi-layer logical reasoning, and ultimately make more accurate driving decisions. Therefore, , by making the cognitive layer learn human cognition of the scene, the cognitive layer can have a more comprehensive understanding of complex scenes and have stronger reasoning capabilities, so as to more accurately understand and pay attention to the risk points in the scene, thereby Guide decision-makers to make more accurate driving decisions and improve the safety of autonomous driving.

实施例一Embodiment 1

本申请实施例提供一种自动驾驶优化方法，参照图1，在本申请自动驾驶优化方法的第一实施例中，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，包括以下步骤：An embodiment of the present application provides an automatic driving optimization method. Referring to Figure 1, in the first embodiment of the automatic driving optimization method of the present application, the automatic driving optimization method is applied to a cognitive decision-making model, and the cognitive decision-making model includes The cognitive layer and decision-making layer include the following steps:

步骤S10，获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征；Step S10, obtain the training sample vehicle interior and exterior monitoring data and the training sample scene labels, and obtain the target human driving experience scene cognitive characteristics corresponding to the training sample scene labels;

本实施例方法的执行主体可以是一种自动驾驶优化装置，也可以是一种自动驾驶优化终端设备或服务器，本实施例以自动驾驶优化装置进行举例，该自动驾驶优化装置可以集成在具有数据处理功能的车辆、车载终端、智能手机、计算机等终端设备上。The execution subject of the method in this embodiment can be an automatic driving optimization device, or an automatic driving optimization terminal device or server. This embodiment takes an automatic driving optimization device as an example. The automatic driving optimization device can be integrated in a computer with data Processing functions on vehicles, vehicle-mounted terminals, smartphones, computers and other terminal devices.

在本实施例中，需要说明的是，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化方法用于对所述认知决策模型的认知层进行迭代优化，在对认知决策模型进行全面训练之前，预先对认知层进行预训练，使得认知层学习人类在复杂场景下对场景的认知。训练好的认知决策模型用于实现对自动驾驶车辆进行端到端的驾驶行为控制，所述认知决策模型使用注意力机制，可以为基于Transformer结构的深度学习模型，也可以为其他使用注意力机制的深度学习神经网络模型，具体可以根据实际情况进行设计，本实施例对此不加以限制。In this embodiment, it should be noted that the automatic driving optimization method is applied to a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization method is used to analyze the cognitive decision-making model. The cognitive layer of the cognitive decision-making model is iteratively optimized. Before comprehensive training of the cognitive decision-making model, the cognitive layer is pre-trained in advance, so that the cognitive layer can learn human cognition of scenes in complex scenarios. The trained cognitive decision-making model is used to realize end-to-end driving behavior control of autonomous vehicles. The cognitive decision-making model uses the attention mechanism and can be a deep learning model based on the Transformer structure, or it can also use attention for other purposes. The deep learning neural network model of the mechanism can be specifically designed according to the actual situation, and this embodiment is not limited to this.

所述认知决策模型至少包括认知层和决策层，其中，认知层用于对获取到的车内外监测数据进行特征提取、融合、编码和推理等得到决策层所需的认知层特征；所述决策层用于基于认知层传递的认知层特征规划自动驾驶车辆在下一时间步的驾驶行为，以使得自动驾驶车辆可以规避风险安全行驶到目的地。The cognitive decision-making model at least includes a cognitive layer and a decision-making layer, where the cognitive layer is used to perform feature extraction, fusion, encoding and reasoning on the acquired vehicle interior and exterior monitoring data to obtain the cognitive layer features required by the decision-making layer. ; The decision-making layer is used to plan the driving behavior of the autonomous vehicle in the next time step based on the cognitive layer features transmitted by the cognitive layer, so that the autonomous vehicle can avoid risks and drive safely to the destination.

所述训练样本车内外监测数据是指训练样本的车内外监测数据。所述车内外监测数据是指能够获取到的自动驾驶控制所需的数据，包括车外监测数据和座舱监测数据，所述车内外监测数据可以为图像数据、文本数据、声音数据、信号数据等中的一种或多种模态。示例性地，所述车外监测数据可以为通过摄像头采集的车外图像数据、通过激光雷达和毫米波雷达采集的车外点云数据、通过车辆定位系统采集的车辆位置信息数据、通过麦克风采集的车外声音数据、通过通信模块获取到的信号灯读秒数据等。示例性地，所述座舱监测数据可以为从整车控制器或控制器局域网总线上获取的车速、加速度等车辆行驶数据，也可以为通过摄像头采集的座舱图像数据、通过麦克风采集的座舱声音数据、将采集到的语音转换得到的文本数据、从采集到的图像中提取的文本数据、通过人机交互界面采集的用户需求信号数据等。这些车内外监测数据可以实时采集，因此可以携带有其他交通参与者的状态、驾驶员的状态、路况状态等可能发生变化的信息在当前时刻的实时信息。The training sample vehicle interior and exterior monitoring data refers to the training sample vehicle interior and exterior monitoring data. The vehicle interior and exterior monitoring data refers to the data required for automatic driving control that can be obtained, including exterior vehicle monitoring data and cockpit monitoring data. The vehicle interior and exterior monitoring data can be image data, text data, sound data, signal data, etc. one or more modalities. Exemplarily, the off-vehicle monitoring data may be off-vehicle image data collected through cameras, off-vehicle point cloud data collected through laser radar and millimeter wave radar, vehicle position information data collected through vehicle positioning systems, and vehicle location information data collected through microphones. Sound data outside the car, signal light countdown data obtained through the communication module, etc. For example, the cabin monitoring data may be vehicle driving data such as vehicle speed and acceleration obtained from the vehicle controller or the controller LAN bus, or may be cabin image data collected through a camera, and cabin sound data collected through a microphone. , text data obtained by converting the collected speech, text data extracted from the collected images, user demand signal data collected through the human-computer interaction interface, etc. These monitoring data inside and outside the vehicle can be collected in real time, so they can carry real-time information about the status of other traffic participants, the status of the driver, the status of road conditions and other information that may change at the current moment.

所述目标人类驾驶经验场景认知特征是指与所述训练样本场景标签相对应的人类驾驶经验场景认知特征。所述人类驾驶经验场景认知特征是从人类驾驶经验监测数据中提取出的用于表征人类对于场景的认知的特征。所述人类驾驶经验场景认知特征可以为图像、文本、向量、矩阵等形式。在一种可实施的方式中，可以选取熟练驾驶人员的监测数据进行人类驾驶经验场景认知特征的提取，使得认知层学习到更加有效的场景认知经验。The target human driving experience scene cognitive features refer to the human driving experience scene cognitive features corresponding to the training sample scene labels. The human driving experience scene cognition features are features extracted from human driving experience monitoring data and used to characterize human cognition of the scene. The human driving experience scene cognitive features may be in the form of images, texts, vectors, matrices, etc. In an implementable manner, monitoring data of skilled drivers can be selected to extract scene cognitive features of human driving experience, so that the cognitive layer can learn more effective scene cognitive experience.

在一种可实施的方式中，所述人类驾驶经验场景认知特征的提取方式可以为：在人类驾驶车辆经历第一场景的过程中，采集人类驾驶经验监测数据，例如拍摄座舱内和车外环境的图像；进而从所述人类驾驶经验监测数据中提取人类的注意力分布特征和驾驶行为语义理解特征作为所述第一场景对应的人类驾驶经验场景认知特征。In an implementable manner, the extraction method of the cognitive features of the human driving experience scene may be: collecting human driving experience monitoring data during the process of the human driving the vehicle experiencing the first scene, such as taking pictures inside the cockpit and outside the vehicle. images of the environment; and then extract human attention distribution features and driving behavior semantic understanding features from the human driving experience monitoring data as human driving experience scene cognitive features corresponding to the first scene.

在一种可实施的方式中，所述人类驾驶经验场景认知特征的提取方式也可以为：在人类驾驶车辆的过程中，采集多个人类驾驶经验监测数据，例如拍摄座舱内和车外环境的图像；进而从每个所述人类驾驶经验监测数据中提取人类的注意力分布特征和驾驶行为语义理解特征，作为人类驾驶经验场景认知特征，进而对各所述人类驾驶经验场景认知特征进行分类，确定各所述人类驾驶经验场景认知特征各自对应的场景标签。In an implementable manner, the method of extracting the cognitive features of the human driving experience scene may also be: collecting multiple human driving experience monitoring data during the process of human driving the vehicle, such as photographing the environment inside the cockpit and outside the vehicle. images; and then extract human attention distribution features and driving behavior semantic understanding features from each of the human driving experience monitoring data as human driving experience scene cognitive features, and then extract the human driving experience scene cognitive features Classify and determine the scene labels corresponding to the cognitive features of each human driving experience scene.

所述训练样本场景标签是指训练样本的场景标签，例如鬼探头场景、行人横穿场景、交叉路口场景等，可以通过人工或标注模型对训练样本进行标注确定。The training sample scene label refers to the scene label of the training sample, such as a ghost probe scene, a pedestrian crossing scene, an intersection scene, etc. The training samples can be annotated and determined manually or by an annotation model.

作为一种示例，所述步骤S10包括：获取一批次的训练样本车内外监测数据以及每个训练样本各自对应的训练样本场景标签，根据预设的场景标签与人类驾驶经验场景认知特征之间的映射关系，获取每个所述训练样本场景标签对应的目标人类驾驶经验场景认知特征。一批次训练样本可以为一个或多个，由于一批次训练样本中每个训练样本的处理方法相同，为了便于说明，后续以单个训练样本为例进行说明。As an example, the step S10 includes: obtaining a batch of training sample vehicle interior and exterior monitoring data and the training sample scene labels corresponding to each training sample, and based on the preset scene labels and the scene cognitive characteristics of human driving experience. The mapping relationship between the target human driving experience scene cognitive features corresponding to each training sample scene label is obtained. A batch of training samples can be one or more. Since the processing method of each training sample in a batch of training samples is the same, for the convenience of explanation, a single training sample will be used as an example for the following explanation.

可选地，所述获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征的步骤之前，还包括：Optionally, before the step of obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene label, the step further includes:

步骤A10，获取多个人类驾驶经验场景监测数据，分别从各所述人类驾驶经验场景监测数据中提取人类驾驶经验场景认知特征；Step A10, obtain multiple human driving experience scene monitoring data, and extract human driving experience scene cognitive features from each of the human driving experience scene monitoring data;

步骤A20，对各所述人类驾驶经验场景认知特征进行聚类分析，确定各所述人类驾驶经验场景认知特征各自对应的场景标签。Step A20: Perform cluster analysis on the cognitive features of each human driving experience scene to determine the scene labels corresponding to each of the cognitive features of the human driving experience scene.

在本实施例中，需要说明的是，由于事物是实时变化的，要使得人类驾驶车辆和自动驾驶车辆经历完全相同的场景几乎是不可能的，且很多复杂场景本身就是比较少出现的，因此要找到与每个训练样本车内外监测数据完全一致的人类驾驶经验车内外监测数据是几乎不可能的，且自动驾驶若作出与人类驾驶完全相同的驾驶行为，反而会失去自动驾驶的灵活性，难以灵活应对多变的实际情况。实际情况虽然不完全相同，但存在一定共性，例如，虽然每次经过十字路口，十字路口处的车流量人流量不同，但都需要预测各个路口的车辆的行驶轨迹或路人的步行轨迹是否会与本车的行驶轨迹相交、判断是否存在视觉盲区等等，因此，本实施例按照场景进行分类，从人类在各个场景下驾驶车辆产生的人类驾驶经验场景监测数据中提取人类驾驶经验场景认知特征，作为自动驾驶车辆在各个场景中的学习对象，使得认知层学习人类在各个场景下关注和推理的风险点。In this embodiment, it should be noted that since things change in real time, it is almost impossible for human-driven vehicles and autonomous vehicles to experience exactly the same scenes, and many complex scenes themselves rarely occur, so It is almost impossible to find human driving experience vehicle interior and exterior monitoring data that is completely consistent with the vehicle interior and exterior monitoring data of each training sample. Moreover, if autonomous driving performs exactly the same driving behavior as human driving, it will lose the flexibility of autonomous driving. It is difficult to flexibly respond to changing actual situations. Although the actual situation is not exactly the same, there are certain commonalities. For example, although the traffic flow and pedestrian flow at the intersection are different every time you pass an intersection, it is necessary to predict whether the driving trajectories of vehicles or pedestrians' walking trajectories at each intersection will be consistent with the traffic flow. The driving trajectories of the vehicle intersect, determine whether there is a visual blind spot, etc. Therefore, this embodiment classifies according to the scene, and extracts the human driving experience scene cognitive features from the human driving experience scene monitoring data generated by humans driving the vehicle in various scenes. , as the learning object of autonomous vehicles in various scenarios, allowing the cognitive layer to learn the risk points that humans pay attention to and reason about in various scenarios.

示例性地，所述步骤A10-A20包括：可以在人类驾驶车辆的过程中，采集大量的人类驾驶经验场景监测数据，从各所述人类驾驶经验场景监测数据中提取人类驾驶经验场景认知特征；进而对各所述人类驾驶经验场景认知特征进行聚类分析，进而标注每一分类的场景标签。其中，所述从各所述人类驾驶经验场景监测数据中提取人类驾驶经验场景认知特征的方式可以为通过拍摄驾驶员面部图像，通过识别驾驶员视线指向的方位确定驾驶员的注意力分布，将注意力分布确定为人类驾驶经验场景认知特征；还可以为人工基于采集到的人类驾驶经验场景监测数据标注场景语义理解信息，将人工标注的场景语义理解信息确定为人类驾驶经验场景认知特征。Exemplarily, the steps A10-A20 include: collecting a large amount of human driving experience scene monitoring data during the process of human driving the vehicle, and extracting human driving experience scene cognitive features from each of the human driving experience scene monitoring data. ; Then perform cluster analysis on the cognitive characteristics of each of the human driving experience scenes, and then mark the scene labels of each category. Wherein, the method of extracting cognitive features of human driving experience scenes from each of the human driving experience scene monitoring data may be by photographing the driver's facial image and determining the driver's attention distribution by identifying the direction in which the driver's line of sight is pointing, Attention distribution is determined as a feature of human driving experience scene cognition; it can also be used to manually annotate scene semantic understanding information based on the collected human driving experience scene monitoring data, and the manually annotated scene semantic understanding information is determined as human driving experience scene cognition. feature.

步骤S20，通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征；Step S20, extract the cognitive layer features of the training sample from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

在本实施例中，需要说明的是，所述训练样本认知层特征训练样本的认知层特征。所述认知层特征是指自动驾驶控制所需的特征，自动驾驶控制是为了使得自动驾驶车辆安全到达目的地，也即，在规避风险的情况下控制车辆行驶到目的地，因此，自动驾驶控制的方式可以为预测自动驾驶车辆在下一时间步的高风险行驶区域和/或低风险行驶区域，从而规划出自动驾驶车辆在下一时间步避开高风险行驶区域或经过低风险行驶区域的驾驶行为，所述驾驶行为可以为加速、减速、转弯等车辆可直接执行的操作，也可以为到目的地或下一时间步的行驶轨迹，使得自动驾驶车辆按照接收到的行驶轨迹行驶。In this embodiment, it should be noted that the training sample cognitive layer feature is the cognitive layer feature of the training sample. The cognitive layer features refer to the features required for automatic driving control. Automatic driving control is to enable the automatic driving vehicle to reach the destination safely, that is, to control the vehicle to drive to the destination while avoiding risks. Therefore, automatic driving The control method can be to predict the high-risk driving area and/or low-risk driving area of the autonomous vehicle in the next time step, thereby planning the driving of the autonomous vehicle to avoid the high-risk driving area or pass through the low-risk driving area in the next time step. Behavior, the driving behavior can be operations that can be directly performed by the vehicle such as acceleration, deceleration, turning, etc., or it can be a driving trajectory to the destination or the next time step, so that the autonomous vehicle drives according to the received driving trajectory.

所述认知层特征中至少包括场景认知特征，还可以包括其他特征。在一种可实施的方式中，所述认知层特征可以包括环境认知特征和用户需求特征。所述环境认知特征用于表征自动驾驶车辆对当前所处环境的认知，例如对于交通设施的认知、对于地面障碍物的认知、对于本车行驶状态的认知、对于其他交通参与者的认知、对于所处场景的风险的认知、对于所处场景的风险项的认知等。所述环境认知特征至少包括场景认知特征。自动驾驶车辆行为控制的主要目的是使得自动驾驶车辆安全到达目的地，也即，在规避风险的情况下控制车辆行驶到目的地，在一种可实施的方式中，自动驾驶控制的方式可以为预测自动驾驶车辆在下一时间步的高风险行驶区域和/或低风险行驶区域，从而规划出自动驾驶车辆在下一时间步避开高风险行驶区域或经过低风险行驶区域的驾驶行为。而所述环境认知特征是自动驾驶车辆当前所处环境的客观表示，因此提取到的所述环境认知特征越准确，对于自动驾驶车辆当前的风险判断越准确，自动驾驶控制的安全性则越高。所述环境认知特征可以包括场景认知特征、地图特征、风险目标检测跟踪特征、风险目标运动特征、低风险行驶区域特征等，还可以包括其他特征。所述用户需求特征用于表征自动驾驶车辆上乘坐的用户的需求，例如，用户对于目的地的需求、用户对于行驶到目的地的路径的需求、用户赶时间的需求、用户希望平缓行驶减缓晕车的需求等。所述用户需求特征可以通过检测用户操作、采集用户图像等方式捕捉用户需求信息，进而从中提取出用户需求特征，例如，可以通过摄像头采集用户图像，在通过图像识别技术识别出用户图像中用户当前处于睡眠状态的情况下，用户在睡眠状态通常隐含了用户对车辆平稳行驶的需求，例如，还可以通过麦克风采集用户语音信息，在识别出用户说“开快一点”的情况下，可以知道用户有加速的需求。The cognitive layer features at least include scene cognitive features, and may also include other features. In an implementable manner, the cognitive layer characteristics may include environment cognitive characteristics and user demand characteristics. The environmental cognition features are used to characterize the autonomous vehicle's cognition of the current environment, such as cognition of transportation facilities, cognition of ground obstacles, cognition of the vehicle's driving status, and cognition of other traffic participants. The person's cognition, the cognition of the risks in the scene, the cognition of the risk items in the scene, etc. The environment cognitive characteristics at least include scene cognitive characteristics. The main purpose of autonomous vehicle behavior control is to enable the autonomous vehicle to reach its destination safely, that is, to control the vehicle to drive to the destination while avoiding risks. In an implementable manner, the autonomous driving control method can be Predict the high-risk driving area and/or low-risk driving area of the autonomous vehicle in the next time step, thereby planning the driving behavior of the autonomous vehicle to avoid the high-risk driving area or pass through the low-risk driving area in the next time step. The environmental cognitive features are an objective representation of the current environment of the autonomous driving vehicle. Therefore, the more accurate the extracted environmental cognitive features are, the more accurate the current risk judgment of the autonomous driving vehicle will be, and the safety of the autonomous driving control will be better. The higher. The environment recognition features may include scene recognition features, map features, risk target detection and tracking features, risk target movement features, low-risk driving area features, etc., and may also include other features. The user demand characteristics are used to characterize the needs of users riding in self-driving vehicles, such as the user's demand for the destination, the user's demand for the path to the destination, the user's need to be in a hurry, and the user's desire to drive smoothly to reduce motion sickness. needs, etc. The user demand characteristics can capture user demand information by detecting user operations, collecting user images, etc., and then extract user demand characteristics. For example, user images can be collected through a camera, and the user's current status in the user image can be identified through image recognition technology. When the user is in the sleep state, the user's need for the vehicle to drive smoothly is usually implicit in the sleep state. For example, the user's voice information can also be collected through the microphone. When the user is recognized to say "drive faster", it can be known Users have acceleration needs.

所述认知层可以为多层结构，也即，其中任意一层认知层的输入都可以为其他认知层的输出，任意一层认知层的输出也都可以作为其他认知层的输入，从而可以实现多特征的融合和推理，捕捉认知层提取的各个特征之间的关联性和依赖性，实现对当前驾驶实际情况进行更全面的理解和更深层的逻辑推理。因此，可以通过比较训练样本场景认知特征和对应的目标人类驾驶经验场景认知特征确定场景认知损失，基于场景认知损失对整个认知层进行迭代优化。在一种可实施的方式中，还可以对训练样本的训练样本其他特征进行标注，得到训练样本其他特征标注数据，通过比较训练样本其他特征标注数据和训练样本其他特征，确定其他特征的其他特征提取损失，与场景认知损失共同用于对整个认知层进行迭代优化。The cognitive layer can be a multi-layer structure, that is, the input of any cognitive layer can be the output of other cognitive layers, and the output of any cognitive layer can also be used as the output of other cognitive layers. Input, it can achieve multi-feature fusion and reasoning, capture the correlation and dependence between various features extracted by the cognitive layer, and achieve a more comprehensive understanding of the current driving actual situation and deeper logical reasoning. Therefore, the scene cognitive loss can be determined by comparing the scene cognitive characteristics of the training sample with the corresponding target human driving experience scene cognitive characteristics, and the entire cognitive layer can be iteratively optimized based on the scene cognitive loss. In an implementable manner, other features of the training sample can also be annotated to obtain other feature annotation data of the training sample, and other features of the other features can be determined by comparing the other feature annotation data of the training sample with other features of the training sample. The extraction loss, together with the scene cognitive loss, is used to iteratively optimize the entire cognitive layer.

所述认知层可以采用注意力机制，在一种可实施的方式中，所述认知层可以采用transformer结构。采用注意力机制可以很好地对各种模态各种类型的车辆监测数据进行特征提取、融合、编码和推理，提取出可能发生变化的信息在当前时刻的实时信息，与当前时刻之前一段历史时间的历史信息组成信息序列，并捕捉不同类型的信息之间的关联性和依赖性，进行特征推理，提取出具有深层逻辑信息的认知层特征，以供决策层对自动驾驶车辆进行自动驾驶控制，这样，可以在预测自动驾驶车辆行车风险的过程中充分考虑到可能发生变化的信息以及多个风险项之间的相互影响，提高自动驾驶车辆行车风险的预测准确性，从而使得自动驾驶车辆可以规避更多的风险，提高自动驾驶控制的安全性。其中，不同类型的信息之间的关联性和依赖性可能会对自动驾驶车辆的自动驾驶控制有较大影响，示例性地，交通参与者与交通参与者之间可能产生交互、交通参与者与交通设施之间可能产生交互、交通参与者与场景之间也可能产生交互。例如，十字路口处，本车为东西方向行驶，附近还有南北方向行驶的第二车辆和东西方向行走的行人，假设在不考虑交通参与者与交通参与者之间产生交互的情况下，本车和第二车辆以当前车速行驶不会发生碰撞，但若第二车辆因为避让行人而减速或停车，则可能导致本车与第二车辆发生碰撞，因此，在考虑交通参与者与交通参与者之间产生交互的情况下，本车仍需要减速或者规划其他行驶路线；而若假设第二车辆位于转弯车道，在考虑交通参与者与交通设施之间产生交互的情况下，若第二车辆转弯的行驶轨迹与本车的行驶轨迹不相交，本车则无需减速；另外，在考虑交通参与者与场景之间产生交互的情况下，在当前处于经过十字路口的场景下，由于十字路口场景的复杂性，可以提高十字路口处人和车的风险系数，因此可能需要控制车辆减速慢行。The cognitive layer may adopt an attention mechanism. In an implementable manner, the cognitive layer may adopt a transformer structure. The attention mechanism can be used to perform feature extraction, fusion, encoding and reasoning on various modes and types of vehicle monitoring data, and extract real-time information that may change at the current moment, and a period of history before the current moment. The historical information of time composes the information sequence, and captures the correlation and dependence between different types of information, performs feature reasoning, and extracts cognitive layer features with deep logical information for the decision-making layer to automatically drive autonomous vehicles. Control, in this way, the information that may change and the interaction between multiple risk items can be fully taken into account in the process of predicting the driving risks of autonomous vehicles, and the accuracy of prediction of driving risks of autonomous vehicles can be improved, so that autonomous vehicles can More risks can be avoided and the safety of automatic driving control can be improved. Among them, the correlation and dependence between different types of information may have a greater impact on the automatic driving control of autonomous vehicles. For example, there may be interactions between traffic participants, traffic participants and There may be interactions between transportation facilities, and there may also be interactions between transportation participants and scenes. For example, at an intersection, the vehicle is traveling in the east-west direction, and there is a second vehicle traveling in the north-south direction and pedestrians walking in the east-west direction nearby. Assume that the vehicle is traveling in the east-west direction without considering the interaction between traffic participants. There will be no collision between the car and the second vehicle driving at the current speed, but if the second vehicle slows down or stops to avoid pedestrians, it may cause a collision between the own car and the second vehicle. Therefore, when considering the traffic participants and the traffic participants In the case of interaction between traffic participants, the vehicle still needs to slow down or plan other driving routes; and if it is assumed that the second vehicle is in the turning lane, taking into account the interaction between traffic participants and traffic facilities, if the second vehicle turns The driving trajectory of the vehicle does not intersect with the driving trajectory of the vehicle, and the vehicle does not need to slow down; in addition, taking into account the interaction between traffic participants and the scene, in the current scene of passing through the intersection, due to the intersection scene Complexity can increase the risk factor for people and vehicles at intersections, so vehicles may need to be controlled to slow down.

作为一种示例，所述步骤S20包括：将所述训练样本车内外监测数据输入所述认知层，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，得到训练样本认知层特征。其中，所述认知层的结构以及具体的特征处理方式可以根据决策层的实际需要进行确定，本实施例对此不加以限制。As an example, the step S20 includes: inputting the training sample vehicle interior and exterior monitoring data into the cognitive layer, and performing multi-level progressive feature processing such as feature extraction, feature fusion, feature encoding, and feature inference to obtain Cognitive layer characteristics of training samples. The structure of the cognitive layer and the specific feature processing method can be determined according to the actual needs of the decision-making layer, which is not limited in this embodiment.

可选地，所述认知层包括场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型；所述训练样本认知层特征包括训练样本场景认知特征、训练样本地图特征、训练样本风险目标检测跟踪特征和训练样本风险目标运动特征；Optionally, the cognitive layer includes a scene cognitive model, a map construction model, a risk target detection and tracking model, and a motion prediction model; the training sample cognitive layer features include training sample scene cognitive features, training sample map features, Risk target detection and tracking features of training samples and risk target movement features of training samples;

所述通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征的步骤包括：The step of extracting the cognitive layer features of the training sample from the training sample vehicle interior and exterior monitoring data through the cognitive layer includes:

步骤S21，通过所述场景认知模型从所述训练样本车内外监测数据中提取出训练样本场景认知特征，通过所述地图构建模型从所述训练样本车内外监测数据中提取出训练样本地图特征，通过所述风险目标检测跟踪模型从所述训练样本车内外监测数据中提取出训练样本风险目标检测跟踪特征；Step S21: Extract the scene cognitive features of the training sample from the training sample vehicle interior and exterior monitoring data through the scene recognition model, and extract the training sample map from the training sample vehicle interior and exterior monitoring data through the map construction model. Features: extract the training sample risk target detection and tracking features from the training sample vehicle interior and exterior monitoring data through the risk target detection and tracking model;

步骤S22，通过所述运动预测模型基于所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征，进行风险目标运动预测，得到训练样本风险目标运动特征，其中，所述运动预测模型采用注意力机制。Step S22, use the motion prediction model to perform risk target motion prediction based on the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features to obtain training sample risk target motion features, Wherein, the motion prediction model adopts an attention mechanism.

在本实施例中，需要说明的是，对于不同模态的车内外监测数据，可以采用不同的模型进行特征提取，例如，图像数据可以采用卷积神经网络进行特征提取，时序数据可以采用长短期记忆网络进行特征提取等，也可以采用不同的模型进行特征处理从而提取出决策层所需的不同的认知层特征，因此，认知层中可以部署多个模型进行特征处理，具体的模型结构和模型类型可以根据实际情况进行确定，本实施例对此不加以限制。所述认知层包括场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型，其中，风险目标是指对自动驾驶车辆行驶可能造成风险的物体，可以为交通参与者、交通基础设施等中的一种或多种。所述场景认知模型、所述地图构建模型和所述风险目标检测跟踪模型输出的训练样本场景认知特征、训练样本地图特征、训练样本风险目标检测跟踪特征，均对预测风险目标的在下一时间步的运动有一定影响，且场景认知特征、地图特征和风险目标检测跟踪特征之间还存在一定的关联性和依赖性，因此均可以作为运动预测模型的输入，通过运动预测模型采用注意力机制，捕捉这些特征之间的关联性和依赖性，进行进一步的特征融合、编码和推理，确定至少一个风险目标的训练样本风险目标运动特征。In this embodiment, it should be noted that for different modes of vehicle interior and exterior monitoring data, different models can be used for feature extraction. For example, image data can use convolutional neural networks for feature extraction, and time series data can use long-term and short-term features. Memory networks can be used for feature extraction, etc. Different models can also be used for feature processing to extract different cognitive layer features required by the decision-making layer. Therefore, multiple models can be deployed in the cognitive layer for feature processing. The specific model structure and model types can be determined according to the actual situation, and this embodiment does not limit this. The cognitive layer includes a scene cognition model, a map construction model, a risk target detection and tracking model, and a motion prediction model. Risk targets refer to objects that may cause risks to the driving of autonomous vehicles, and can provide information for traffic participants and traffic infrastructure. One or more of the facilities, etc. The scene recognition model, the map construction model, and the training sample scene recognition features, training sample map features, and training sample risk target detection and tracking features output by the risk target detection and tracking model are all useful for predicting the risk target in the next step. The motion of the time step has a certain impact, and there is a certain correlation and dependence between the scene cognitive features, map features and risk target detection and tracking features. Therefore, they can all be used as inputs to the motion prediction model. The attention is adopted through the motion prediction model. The force mechanism captures the correlation and dependence between these features, performs further feature fusion, encoding and reasoning, and determines the risk target motion characteristics of the training sample of at least one risk target.

所述训练样本认知层特征包括训练样本场景认知特征、训练样本地图特征、训练样本风险目标检测跟踪特征和训练样本风险目标运动特征。场景认知特征用于表征对车辆当前所处场景的风险认知，例如注意力分布、对场景中存在的风险项的语义理解等等，地图特征用于表征车辆所处场景的地面设施、道路元素等地图信息，风险目标检测跟踪特征用于表征风险目标的识别以及当前时刻之前的一段时间范围内风险目标的位置信息，风险目标运动特征用于表征风险目标在当前时刻之后的一段时间范围的运动信息，运动信息可以包括位置信息、轨迹信息、运动意图、运动趋势等中的至少一种。The training sample cognitive layer features include training sample scene cognitive features, training sample map features, training sample risk target detection and tracking features, and training sample risk target motion features. Scene cognitive features are used to characterize the risk perception of the scene the vehicle is currently in, such as attention distribution, semantic understanding of risk items present in the scene, etc. Map features are used to characterize the ground facilities and roads of the scene the vehicle is in. Map information such as elements, risk target detection and tracking features are used to characterize the identification of risk targets and the location information of risk targets within a period of time before the current moment, and risk target motion features are used to characterize the risk targets within a period of time after the current moment. Movement information may include at least one of position information, trajectory information, movement intention, movement trend, etc.

作为一种示例，所述步骤S21-S22包括：将所述训练样本车内外监测数据输入所述场景认知模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出训练样本场景认知特征；同步地，将所述训练样本车内外监测数据输入所述地图构建模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出训练样本地图特征；同步地，将所述训练样本车内外监测数据输入所述风险目标检测跟踪模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出训练风险目标检测跟踪特征。进而，将所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征输入所述运动预测模型，通过运动预测模型，采用注意力机制，捕捉所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征之间的关联性和依赖性，基于这些关联性和依赖性，可以更准确地进行风险目标运行预测，确定至少一个风险目标的训练样本风险目标运动特征。As an example, the steps S21-S22 include: inputting the training sample vehicle interior and exterior monitoring data into the scene cognitive model, and performing multi-level progressive feature extraction, feature fusion, feature encoding, feature reasoning, etc. Process and extract the cognitive features of the training sample scene; simultaneously, input the training sample vehicle interior and exterior monitoring data into the map to build a model, and perform multi-level progressive features such as feature extraction, feature fusion, feature encoding, and feature reasoning. Process and extract the training sample map features; synchronously, input the training sample vehicle interior and exterior monitoring data into the risk target detection and tracking model to perform multi-level progressive features such as feature extraction, feature fusion, feature encoding, and feature inference. Process and extract training risk target detection and tracking features. Furthermore, the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features are input into the motion prediction model, and the attention mechanism is used to capture the training samples through the motion prediction model. The correlation and dependence between the scene cognitive characteristics, the training sample map characteristics and the training sample risk target detection and tracking characteristics. Based on these correlations and dependencies, risk target operation predictions can be made more accurately, and at least A risk target’s training sample risk target movement characteristics.

可选地，所述训练样本风险目标检测跟踪特征包括至少一个训练样本风险目标检测跟踪子特征；所述运动预测模型包括注意力层、多层感知机层和预测层；Optionally, the training sample risk target detection and tracking features include at least one training sample risk target detection and tracking sub-feature; the motion prediction model includes an attention layer, a multi-layer perceptron layer and a prediction layer;

所述通过所述运动预测模型基于所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征，进行风险目标运动预测，得到训练样本风险目标运动特征的步骤包括：The step of using the motion prediction model to perform risk target motion prediction based on the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features to obtain the training sample risk target motion characteristics. include:

步骤S221，将所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征输入所述注意力层，采用注意力机制，确定各所述风险目标检测跟踪子特征之间的训练样本风险目标交互特征、各所述风险目标检测跟踪子特征与所述训练样本场景认知特征之间的训练样本场景交互特征以及各所述风险目标检测跟踪子特征与所述训练样本地图特征之间的训练样本地图交互特征；Step S221: Input the training sample scene cognitive features, the training sample map features and the training sample risk target detection and tracking features into the attention layer, and use the attention mechanism to determine each of the risk target detection and tracking subdivisions. Training sample risk target interaction features between features, training sample scene interaction features between each of the risk target detection and tracking sub-features and the training sample scene cognitive features, and each of the risk target detection and tracking sub-features and the Training sample map interaction features between training sample map features;

步骤S222，将所述训练样本风险目标交互特征、所述训练样本场景交互特征以及所述训练样本地图交互特征输入多层感知机层，得到训练样本场景认知层特征和训练样本运动查询特征；Step S222, input the training sample risk target interaction characteristics, the training sample scene interaction characteristics, and the training sample map interaction characteristics into the multi-layer perceptron layer to obtain the training sample scene cognitive layer characteristics and the training sample motion query characteristics;

步骤S223，将所述训练样本场景认知层特征和所述训练样本运动查询特征输入所述预测层，进行风险目标运动预测，得到训练样本风险目标运动特征。Step S223: Input the training sample scene cognitive layer features and the training sample motion query features into the prediction layer, perform risk target motion prediction, and obtain the training sample risk target motion features.

在本实施例中，需要说明的是，对于复杂场景，可能涉及多个风险目标，在识别到多个风险目标的情况下，风险目标检测跟踪模型可以识别出多个风险目标并对多个风险目标同步进行跟踪，这样获得的风险目标检测跟踪特征中就会包含有多个风险目标的相关信息，也即，所述训练样本风险目标检测跟踪特征包括至少一个训练样本风险目标检测跟踪子特征。在此情况下，可以对每个风险目标分别进行运动预测。通过采用注意力机制，可以捕捉每个风险目标与其他风险目标之间的关联性和依赖性、每个风险目标与地图元素之间的关联性和依赖性、每个风险目标与场景认知之间的关联性和依赖性，基于这些关联性和依赖性，可以更准确地对每个风险目标进行风险目标运行预测，确定每个风险目标的训练样本风险目标运动特征。In this embodiment, it should be noted that for complex scenarios, multiple risk targets may be involved. In the case where multiple risk targets are identified, the risk target detection and tracking model can identify multiple risk targets and evaluate multiple risk targets. The targets are tracked synchronously, so that the risk target detection and tracking features obtained will contain relevant information of multiple risk targets. That is, the training sample risk target detection and tracking features include at least one training sample risk target detection and tracking sub-feature. In this case, motion prediction can be performed separately for each risk target. By using the attention mechanism, it is possible to capture the correlation and dependence between each risk target and other risk targets, the correlation and dependence between each risk target and map elements, and the correlation and dependence between each risk target and scene cognition. Based on these correlations and dependencies, risk target operation predictions can be made more accurately for each risk target, and the movement characteristics of the training sample risk target of each risk target can be determined.

场景认知层特征和运动查询特征表征的是当前场景已经存在的风险，基于场景认知层特征和运动查询特征可以进一步预测风险目标在当前时刻之后的运动信息。Scene cognitive layer features and motion query features represent risks that already exist in the current scene. Based on scene cognitive layer features and motion query features, the motion information of risk targets after the current moment can be further predicted.

作为一种示例，所述步骤S221-S223包括：将所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征输入所述注意力层，采用注意力机制，捕捉各所述训练样本风险目标检测跟踪子特征之间的依赖关系，以确定各个风险目标与其他风险目标之间的相互影响，生成训练样本风险目标交互特征，捕捉每个所述训练样本风险目标检测跟踪子特征与所述训练样本场景认知特征之间的依赖关系，以确定每个风险目标与场景认知之间的相互影响，生成训练样本场景交互特征，捕捉每个所述训练样本风险目标检测跟踪子特征与所述训练样本地图特征之间的依赖关系，以确定每个风险目标与地图元素之间的相互影响，生成训练样本场景交互特征。进而，将所述训练样本风险目标交互特征、所述训练样本场景交互特征以及所述训练样本地图交互特征拼接后，输入多层感知机层，得到训练样本场景认知层特征和训练样本运动查询特征。进而，将所述训练样本场景认知层特征和所述训练样本运动查询特征输入所述预测层，对每个风险目标进行风险目标运动预测，得到每个风险目标的训练样本风险目标运动特征。As an example, the steps S221-S223 include: inputting the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features into the attention layer, using an attention mechanism. , capture the dependencies between the risk target detection and tracking sub-features of each training sample, to determine the interaction between each risk target and other risk targets, generate the risk target interaction features of the training sample, and capture the risk of each training sample Target detection tracks the dependency between the sub-features and the training sample scene cognitive features to determine the interaction between each risk target and the scene cognitive features, generates the training sample scene interaction features, and captures each of the training samples Risk target detection tracks the dependence between sub-features and the training sample map features to determine the interaction between each risk target and map elements, and generates training sample scene interaction features. Furthermore, after splicing the training sample risk target interaction characteristics, the training sample scene interaction characteristics and the training sample map interaction characteristics, input the multi-layer perceptron layer to obtain the training sample scene cognitive layer characteristics and the training sample motion query feature. Furthermore, the training sample scene cognitive layer characteristics and the training sample motion query characteristics are input into the prediction layer, and risk target motion prediction is performed for each risk target to obtain the training sample risk target motion characteristics of each risk target.

在一种可实施的方式中，多层感知机层还可以解码训练样本场景认知层特征和训练样本运动查询特征，将解码得到的场景风险预测信息和风险目标的运动信息作为中间结果输出，以供用户查看。In an implementable manner, the multi-layer perceptron layer can also decode the training sample scene cognitive layer features and the training sample motion query features, and output the decoded scene risk prediction information and risk target motion information as intermediate results, for users to view.

步骤S30，根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化。Step S30: Iteratively optimize the cognitive layer according to the training sample cognitive layer characteristics and the target human driving experience scene cognitive characteristics.

作为一种示例，所述步骤S30包括：根据所述训练样本认知层特征中的训练样本场景认知特征和所述目标人类驾驶经验场景认知特征之间的差异，确定场景认知损失，可以将所述场景认知损失确定为认知层损失，也可以将所述场景认知损失和其他认知层损失进行聚合，得到认知层损失，判断所述认知层损失是否收敛，若所述认知层损失未收敛，则基于所述认知层损失，可以采用梯度下降法对所述认知层进行一轮更新，并返回执行所述获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征的步骤，进行下一轮训练；若所述认知层损失收敛，则可以获得迭代优化完成的认知层。As an example, the step S30 includes: determining the scene cognitive loss based on the difference between the training sample scene cognitive characteristics in the training sample cognitive layer characteristics and the target human driving experience scene cognitive characteristics, The scene cognitive loss can be determined as the cognitive layer loss, or the scene cognitive loss and other cognitive layer losses can be aggregated to obtain the cognitive layer loss, and it can be judged whether the cognitive layer loss has converged. If If the cognitive layer loss has not converged, then based on the cognitive layer loss, the gradient descent method can be used to perform a round of updates on the cognitive layer, and return to the acquisition of training sample vehicle interior and exterior monitoring data and training sample scenarios. label, and obtain the cognitive characteristics of the target human driving experience scene corresponding to the training sample scene label, and perform the next round of training; if the cognitive layer loss converges, the cognitive layer completed by iterative optimization can be obtained.

由于本实施例中提出的所述自动驾驶优化方法仅对认知层进行迭代优化，因此，可以在对认知决策模型进行正式训练之前的预训练，通过学习人类驾驶经验，使得认知层能够更全面且多层次地对复杂场景进行理解，提高认知层对复杂场景的理解能力和风险识别能力，从而引导决策层作出更准确地驾驶决策，提高自动驾驶的安全性。Since the automatic driving optimization method proposed in this embodiment only iteratively optimizes the cognitive layer, it can be pre-trained before formal training of the cognitive decision-making model, so that the cognitive layer can learn from human driving experience. Understand complex scenes more comprehensively and at multiple levels, improve the cognitive layer's ability to understand complex scenes and identify risks, thereby guiding decision-makers to make more accurate driving decisions and improve the safety of autonomous driving.

在一种可实施的方式中，参照图2，所述认知层包括地图构建transformer、风险目标检测跟踪transformer、场景认知transformer和运动预测transformer，通过地图构建transformer提取出地图特征，通过风险目标检测跟踪transformer提取风险目标检测跟踪特征，通过场景认知transformer提取场景认知特征，进一步提取地图特征、风险目标检测跟踪特征和场景认知特征输入多层感知机层之间的交互特征，风险目标之间的交互查询Q_a表达为：In an implementable manner, referring to Figure 2, the cognitive layer includes a map construction transformer, a risk target detection and tracking transformer, a scene recognition transformer, and a motion prediction transformer. The map features are extracted through the map construction transformer, and the risk target is extracted through the map construction transformer. The detection and tracking transformer extracts risk target detection and tracking features, and the scene recognition transformer extracts scene recognition features, and further extracts map features, risk target detection and tracking features, and scene recognition features to input interactive features between multi-layer perceptron layers, risk targets The interactive query Q _a is expressed as:

Q_a＝MCHA(MHSA(Q),Q_A)Q _a =MCHA(MHSA(Q),Q _A )

其中，MHCA表示多头交叉注意力机制、MHSA表示多头自注意力机制，Q表示风险目标检测跟踪查询，Q_A表示风险目标之外的其他交通参与者检测跟踪查询；Among them, MHCA represents the multi-head cross attention mechanism, MHSA represents the multi-head self-attention mechanism, Q represents the risk target detection and tracking query, and Q _A represents the detection and tracking query of other traffic participants other than the risk target;

风险目标与地图之间的交互查询Q_m表达为：The interactive query _Qm between the risk target and the map is expressed as:

Q_m＝MCHA(MHSA(Q),Q_M)Q _m =MCHA(MHSA(Q),Q _M )

其中，Q_M表示地图查询；Among them, Q _M represents map query;

风险目标与场景认知之间的交互查询Q_s表达为：The interactive query Q _s between risk targets and scene cognition is expressed as:

Q_s＝MCHA(MHSA(Q),Q_s)Q _s =MCHA(MHSA(Q),Q _s )

其中，Q_S表示认知场景查询。Among them, Q _S represents cognitive scene query.

进而将Q_a、Q_m和Q_s传递给多层感知机层，将得到的场景认知层特征和运动查询特征传递给运动预测transformer，得到风险目标运动特征，规划transformer进一步基于风险目标运动特征规划自动驾驶车辆的下一步自动驾驶行为。其中，场景认知transformer采用人类场景注意力机制进行预训练。可以预先采集熟练驾驶员的人类驾驶经验数据，存储于场景经验池中，还可以针对自动驾驶车辆使用认知决策模型的过程中遇到的先前没有的交通冲突场景进行采样并提取关键特征，存储与场景经验池中；针对自动驾驶车辆使用认知决策模型的过程中，与用户进行人机交互过程中，用户对于一些关键场景进行提醒与当下采集到的车内外监测数据中提取的场景认知特征进行语义匹配，也可以输入场景记忆池中进行存储，便于场景认知transformer调用训练。Then Q _a , Q _m and Q _s are passed to the multi-layer perceptron layer, and the obtained scene cognitive layer features and motion query features are passed to the motion prediction transformer to obtain the risk target motion features. The planning transformer is further based on the risk target motion features. Plan the next autonomous driving behavior of the autonomous vehicle. Among them, the scene recognition transformer uses the human scene attention mechanism for pre-training. Human driving experience data of skilled drivers can be collected in advance and stored in the scene experience pool. It can also sample and extract key features for previously unavailable traffic conflict scenarios encountered by autonomous vehicles using cognitive decision-making models, and store them. and scene experience pool; in the process of using the cognitive decision-making model for autonomous vehicles, during the human-computer interaction process with the user, the user is reminded of some key scenes and the scene cognition extracted from the currently collected monitoring data inside and outside the vehicle Features are semantically matched and can also be input into the scene memory pool for storage, which facilitates scene cognitive transformer call training.

在本实施例中，所述自动驾驶优化方法应用于认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化方法用于对所述认知决策模型的认知层进行迭代优化。首先，通过获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征，实现了训练样本与人类驾驶经验场景认知特征的匹配，从而可以获取到与训练样本的训练样本场景标签相匹配的目标人类驾驶经验场景认知特征，人类驾驶经验场景认知特征可以反映出人类对当前场景的认知；进而通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征，实现了训练样本认知层特征的获得，训练样本认知层特征可以反映出自动驾驶车辆对当前场景的认知；进而，通过根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化，使得认知层学习人类对于场景的认知，对于涉及多个交通参与者以及多层逻辑推理的复杂场景，相比于自动驾驶车辆仅关注表层相位信息的方式，人类通常可以更全面地结合大量有效信息，进行多层逻辑推理，最终作出更精准地驾驶决策，因此，通过使得认知层学习人类对于场景的认知，可以使得认知层对复杂场景的认知更加全面且具有更强大的推理能力，从而更准确地理解并关注到场景中的风险点，从而引导决策层作出更准确地驾驶决策，提高自动驾驶的安全性。In this embodiment, the automatic driving optimization method is applied to a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization method is used to recognize the cognitive decision-making model. The knowledge layer performs iterative optimization. First, by obtaining the training sample vehicle interior and exterior monitoring data and the training sample scene labels, and obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene labels, the matching of the training samples and the human driving experience scene cognitive characteristics is achieved. Thus, the target human driving experience scene cognitive features that match the training sample scene label of the training sample can be obtained, and the human driving experience scene cognitive features can reflect human beings' cognition of the current scene; and then through the cognitive layer, the target human driving experience scene cognitive features can be obtained. The cognitive layer features of the training sample are extracted from the inside and outside monitoring data of the training sample, thereby achieving the acquisition of the cognitive layer features of the training sample. The cognitive layer features of the training sample can reflect the self-driving vehicle's cognition of the current scene; further, By iteratively optimizing the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene, the cognitive layer learns human cognition of the scene, and for traffic involving multiple traffic participants and complex scenarios with multi-layer logical reasoning. Compared with autonomous vehicles that only focus on surface phase information, humans can usually more comprehensively combine a large amount of effective information, perform multi-layer logical reasoning, and ultimately make more accurate driving decisions. Therefore, , by making the cognitive layer learn human cognition of the scene, the cognitive layer can have a more comprehensive understanding of complex scenes and have stronger reasoning capabilities, so as to more accurately understand and pay attention to the risk points in the scene, thereby Guide decision-makers to make more accurate driving decisions and improve the safety of autonomous driving.

实施例二Embodiment 2

进一步地，在本申请的第二实施例中，与上述实施例相同或相似的内容，可以参考上文介绍，后续不再赘述。在此基础上，参照图3，所述根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化的步骤包括：Furthermore, in the second embodiment of the present application, content that is the same or similar to the above embodiment can be referred to the above introduction, and will not be described again. On this basis, referring to Figure 3, the step of iteratively optimizing the cognitive layer based on the cognitive layer characteristics of the training sample and the cognitive characteristics of the target human driving experience scene includes:

步骤S31，获取训练样本地图标注数据、训练样本风险目标检测跟踪标注数据和训练样本运动标注数据；Step S31, obtain training sample map annotation data, training sample risk target detection and tracking annotation data, and training sample motion annotation data;

在本实施例中，需要说明的是，在认知层采用注意力机制的情况下，可以捕捉认知层中提取到的各个特征之间的关联性和依赖关系，因此，认知层中各个模型之间会相互影响，且认知层中每个模型各自都会产生一定的损失，仅基于其中任意一个损失对其他一个或多个模型进行优化，仍然可能导致整个认知层对当前场景的风险认知不准确，例如，某个风险目标能否被识别到，可能会影响到对与之产生交互的其他交通参与者的行为的理解，从而影响整个场景的风险认知。In this embodiment, it should be noted that when the cognitive layer adopts the attention mechanism, the correlation and dependence between various features extracted in the cognitive layer can be captured. Therefore, each feature in the cognitive layer Models will interact with each other, and each model in the cognitive layer will cause a certain loss. Optimizing one or more other models based on any one of the losses may still cause the entire cognitive layer to pose risks to the current scenario. Cognitive inaccuracies, for example, whether a certain risk target can be identified, may affect the understanding of the behavior of other traffic participants with whom they interact, thereby affecting the risk perception of the entire scenario.

在所述认知层包括场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型的情况下，可以分别计算场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型的损失，并进行聚合，将聚合得到的总损失确定为整个认知层的认知层损失，基于总损失对认知层中的各个模型同步进行迭代优化，可以提升整个认知层对场景风险认知的准确性。When the cognitive layer includes a scene cognitive model, a map construction model, a risk target detection and tracking model, and a motion prediction model, the scene cognitive model, the map construction model, the risk target detection and tracking model, and the motion prediction model can be calculated respectively. The losses are aggregated, and the total loss obtained by aggregation is determined as the cognitive layer loss of the entire cognitive layer. Based on the total loss, iterative optimization of each model in the cognitive layer can be simultaneously performed to improve the risk of the entire cognitive layer. Cognitive accuracy.

作为一种示例，所述步骤S31包括：获取训练样本对应的训练样本地图标注数据、训练样本风险目标检测跟踪标注数据和训练样本运动标注数据，其中，所述训练样本地图标注数据、所述训练样本风险目标检测跟踪标注数据和所述训练样本运动标注数据可以预先通过人工或模型基于训练样本车内外监测数据进行标注得到。As an example, the step S31 includes: obtaining the training sample map annotation data, the training sample risk target detection and tracking annotation data and the training sample motion annotation data corresponding to the training sample, wherein the training sample map annotation data, the training sample The sample risk target detection and tracking annotation data and the training sample motion annotation data can be obtained in advance through manual or model annotation based on the training sample vehicle interior and exterior monitoring data.

步骤S32，根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征之间的差异确定场景认知损失，根据所述训练样本地图特征和所述训练样本地图标注数据之间的差异确定地图构建损失，根据所述训练样本风险目标检测跟踪特征和所述训练样本风险目标检测跟踪标注数据之间的差异确定风险目标检测跟踪损失，并根据所述训练样本风险目标运动特征和所述训练样本运动标注数据之间的差异确定运动预测损失；Step S32: Determine the scene cognitive loss based on the difference between the training sample cognitive layer characteristics and the target human driving experience scene cognitive characteristics, and determine the scene cognitive loss based on the difference between the training sample map characteristics and the training sample map annotation data. The map construction loss is determined based on the difference between the risk target detection and tracking characteristics of the training sample and the risk target detection and tracking annotation data of the training sample, and the risk target detection and tracking loss is determined based on the risk target movement characteristics of the training sample and The difference between the motion annotation data of the training samples determines the motion prediction loss;

在本实施例中，需要说明的是，计算所述场景认知损失、所述地图构建损失、所述风险目标检测跟踪损失和所述运动预测损失的方式可以根据实际需要从现有技术中选择，例如可以采用交叉熵损失函数、均方误差损失函数等进行计算，也可以自行设计对应的损失函数，本实施例对此不加以限制。In this embodiment, it should be noted that the method of calculating the scene recognition loss, the map construction loss, the risk target detection and tracking loss, and the motion prediction loss can be selected from existing technologies according to actual needs. , for example, the cross entropy loss function, the mean square error loss function, etc. can be used for calculation, or the corresponding loss function can be designed by oneself, which is not limited in this embodiment.

作为一种示例，所述步骤S32包括：根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征之间的差异确定场景认知损失，根据所述训练样本地图特征和所述训练样本地图标注数据之间的差异确定地图构建损失，根据所述训练样本风险目标检测跟踪特征和所述训练样本风险目标检测跟踪标注数据之间的差异确定风险目标检测跟踪损失，并根据所述训练样本风险目标运动特征和所述训练样本运动标注数据之间的差异确定运动预测损失。As an example, the step S32 includes: determining the scene cognitive loss according to the difference between the cognitive layer characteristics of the training sample and the target human driving experience scene cognitive characteristics, and determining the scene cognitive loss according to the training sample map characteristics and the target human driving experience scene cognitive characteristics. The map construction loss is determined based on the difference between the training sample map annotation data, the risk target detection and tracking loss is determined based on the difference between the training sample risk target detection and tracking characteristics and the training sample risk target detection and tracking annotation data, and the risk target detection and tracking loss is determined based on the difference between the training sample risk target detection and tracking annotation data. The difference between the risk target motion characteristics of the training sample and the motion annotation data of the training sample determines the motion prediction loss.

可选地，所述训练样本场景认知特征包括多个训练样本场景认知子特征，所述目标人类驾驶经验场景认知特征包括多个人类驾驶经验场景认知子特征；Optionally, the training sample scene cognitive features include a plurality of training sample scene cognitive sub-features, and the target human driving experience scene cognitive features include a plurality of human driving experience scene cognitive sub-features;

所述根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征之间的差异确定场景认知损失的步骤包括：The step of determining the scene cognitive loss based on the difference between the cognitive layer characteristics of the training sample and the target human driving experience scene cognitive characteristics includes:

步骤B10，根据预设时序对应关系，确定各所述训练样本场景认知子特征各自对应的目标人类驾驶经验场景认知子特征；Step B10, determine the target human driving experience scene cognitive sub-features corresponding to each of the training sample scene cognitive sub-features according to the preset time series correspondence;

步骤B20，依次计算各所述训练样本场景认知子特征与各自对应的人类驾驶经验场景认知子特征之间的单体差异值，将各单体差异值的平均差异值，确定为场景认知损失。Step B20: Calculate the individual difference values between each of the training sample scene cognitive sub-features and the corresponding human driving experience scene cognitive sub-features in sequence, and determine the average difference value of each individual difference value as the scene recognition sub-feature. Know the loss.

在本实施例中，需要说明的是，场景认知特征中包含有很多会发生变化的信息，例如交通参与者的行为、位置等会发生变化，基础设施也可能发生改变，对于这些会发生变化的信息，一个时刻的数据往往很难体现出其意图，因此难以预测其变化趋势，因此使用一个时刻的信息进行场景风险认知的准确性较低，从而会导致自动驾驶控制的风险较高，安全性较低。因此，可以提取一段时间范围内产生的数据，提取一系列场景认知子特征组成场景认知特征序列，挖掘场景在一段时间范围内的变化信息，有利于提高风险目标运动预测的准确性，从而降低自动驾驶控制的风险，提高自动驾驶车辆的行驶安全性。In this embodiment, it should be noted that the scene cognitive features contain a lot of information that will change. For example, the behavior and location of traffic participants will change, and the infrastructure may also change. For these changes, Information, data at one moment is often difficult to reflect its intention, so it is difficult to predict its changing trend. Therefore, using information at one moment to perform scene risk recognition is less accurate, which will lead to higher risks in autonomous driving control. Less secure. Therefore, the data generated within a period of time can be extracted, a series of scene cognitive sub-features can be extracted to form a scene cognitive feature sequence, and the change information of the scene within a period of time can be mined, which will help improve the accuracy of risk target movement prediction, thereby Reduce the risk of autonomous driving control and improve the driving safety of autonomous vehicles.

本实施例基于这种时间序列特征，提出了一种损失确定方式，先按照时序匹配各所述训练样本场景认知子特征和目标人类驾驶经验场景认知子特征，分别计算每一对子特征之间的差异，进而基于子特征之间的差异的平均值确定当前场景下的场景认知损失。Based on this time series feature, this embodiment proposes a loss determination method. First, match each of the training sample scene cognitive sub-features and the target human driving experience scene cognitive sub-feature according to the time sequence, and calculate each pair of sub-features respectively. The difference between sub-features is then determined based on the average of the differences between sub-features to determine the scene cognitive loss in the current scene.

作为一种示例，所述步骤B10-B20包括：预先按照相同的时序排列方式，将各所述训练样本场景认知子特征排列成训练样本场景认知特征序列，并将各所述目标人类驾驶经验场景认知子特征排列成目标人类驾驶经验场景认知特征序列。在计算损失时，则可以将相同时序对应的训练样本场景认知子特征和目标人类驾驶经验场景认知子特征组成子特征对。分别计算各所述子特征对中的训练样本场景认知子特征和目标人类驾驶经验场景认知子特征之间的单体差异值，通过对各所述单体差异值求平均值或求加权平均值，得到平均差异值，将所述平均差异值确定为当前的场景认知损失。As an example, the steps B10-B20 include: arranging each of the training sample scene cognitive sub-features into a training sample scene cognitive feature sequence in advance according to the same time sequence arrangement, and assigning each of the target human driving The experience scene cognitive sub-features are arranged into a target human driving experience scene cognitive feature sequence. When calculating the loss, the training sample scene cognitive sub-features and the target human driving experience scene cognitive sub-features corresponding to the same time sequence can be combined into a sub-feature pair. Calculate the individual difference values between the training sample scene cognitive sub-features and the target human driving experience scene cognitive sub-features in each of the sub-feature pairs, respectively, by averaging or weighting the individual difference values The average value is obtained, and the average difference value is determined as the current scene cognitive loss.

可选地，所述依次计算各所述训练样本场景认知子特征与各自对应的人类驾驶经验场景认知子特征之间的单体差异值，将各单体差异值的平均差异值，确定为场景认知损失的步骤包括：Optionally, the single difference value between each of the training sample scene cognitive sub-features and the corresponding human driving experience scene cognitive sub-feature is calculated in sequence, and the average difference value of each single difference value is determined. Steps to cognitive loss for a scenario include:

将各所述训练样本场景认知子特征和各所述目标人类驾驶经验场景认知子特征输入预设的场景认知损失函数，计算场景认知损失，其中，所述场景认知损失函数为：Input each of the training sample scene cognitive sub-features and each of the target human driving experience scene cognitive sub-features into a preset scene cognitive loss function to calculate the scene cognitive loss, where the scene cognitive loss function is :

其中，为场景认知损失，/>为t时刻的人类驾驶经验场景认知子特征，/>为t时刻的训练样本场景认知子特征，T_f为预设的时间戳，t∈T_f。in, For scene cognitive loss,/> is the scene cognitive sub-feature of human driving experience at time t,/> is the scene cognitive sub-feature of the training sample at time t, T _f is the preset timestamp, t∈T _f .

在本实施例中，需要说明的是，所述训练样本场景认知特征序列至少包括t₀至T_f这段时间的训练样本场景认知子特征，所述目标人类驾驶经验场景认知特征序列至少包括t₀至T_f这段时间的人类驾驶经验场景认知子特征，训练样本车内外监测数据的时间信息和人类驾驶经验车内外监测数据的时间信息可能不同，可以通过重新编码的方式进行对齐，将二者的起始时间统一为t₀。In this embodiment, it should be noted that the training sample scene cognitive feature sequence at least includes the training sample scene cognitive sub-features from t ₀ to T _f , and the target human driving experience scene cognitive feature sequence At least include the human driving experience scene cognitive sub-features from t ₀ to T _f . The time information of the training sample vehicle interior and exterior monitoring data may be different from the time information of the human driving experience vehicle interior and exterior monitoring data, and can be recoded. Align and unify the starting time of both to t ₀ .

可选地，所述场景认知模型包括注意力分布认知模型和语义理解模型；所述训练样本场景认知特征包括训练样本注意力分布特征和训练样本语义理解特征；所述目标人类驾驶经验场景认知特征包括目标人类驾驶经验注意力分布特征和目标人类驾驶经验语义理解特征；Optionally, the scene cognitive model includes an attention distribution cognitive model and a semantic understanding model; the training sample scene cognitive features include training sample attention distribution features and training sample semantic understanding features; the target human driving experience Scene cognitive features include target human driving experience attention distribution features and target human driving experience semantic understanding features;

所述通过所述场景认知模型从所述训练样本车内外监测数据中提取出训练样本场景认知特征的步骤包括：The step of extracting the scene cognitive features of the training sample from the training sample vehicle interior and exterior monitoring data through the scene cognitive model includes:

步骤C10，通过所述注意力分布认知模型从所述训练样本车内外监测数据中提取出训练样本注意力分布特征，并通过所述语义理解模型从所述训练样本车内外监测数据中提取出训练样本语义理解特征；Step C10: Extract the training sample attention distribution characteristics from the training sample vehicle interior and exterior monitoring data through the attention distribution cognitive model, and extract the training sample attention distribution characteristics from the training sample vehicle interior and exterior monitoring data through the semantic understanding model. Semantic understanding features of training samples;

在本实施例中，需要说明的是，基于图像数据进行模型训练，训练得到的模型缺乏对图像中的语义理解，从而难以理解场景中深层次的需要进行多层推理才能获得的语义信息，导致模型对场景的理解较差，训练后的模型无法满足实际的安全性需求。通过引入注意力分布认知模型，并集合人类驾驶经验对注意力分布认知模型进行训练，可以使得认知层学习人类驾驶车辆遇到复杂场景时的注意力分布，但对于越复杂的场景，人类的注意力分布的内在原因越是多层次且融合了多种因素的，模型仅仅是模仿人类的注意力分布，往往难以灵活应对多变的实际情况，因此可以引入语义理解模型，提取语义理解特征，使得认知层不仅学习人类的注意力分布，还进一步学习人类注意力分布的内在原因和推理过程，学习人类对于场景进行多层次理解的方式，从而有利于应对复杂场景。In this embodiment, it should be noted that the model is trained based on image data, and the trained model lacks semantic understanding of the image, making it difficult to understand the deep semantic information in the scene that requires multi-layer reasoning to obtain, resulting in The model has poor understanding of the scenario, and the trained model cannot meet actual security requirements. By introducing the attention distribution cognitive model and integrating human driving experience to train the attention distribution cognitive model, the cognitive layer can learn the attention distribution of human driving vehicles when encountering complex scenes. However, for more complex scenes, The more intrinsic reasons for human attention distribution are multi-layered and integrated with multiple factors, the model only imitates human attention distribution and is often difficult to flexibly respond to changing actual situations. Therefore, a semantic understanding model can be introduced to extract semantic understanding. Features enable the cognitive layer to not only learn human attention distribution, but also further learn the internal reasons and reasoning processes of human attention distribution, and learn the way humans understand scenes at multiple levels, thus helping to deal with complex scenes.

作为一种示例，所述步骤C10包括：将所述训练样本车内外监测数据输入注意力分布认知模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，得到训练样本注意力分布特征；并将所述训练样本车内外监测数据输入语义理解模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，得到训练样本语义理解特征。As an example, the step C10 includes: inputting the training sample vehicle interior and exterior monitoring data into the attention distribution cognitive model, and performing multi-level progressive feature processing such as feature extraction, feature fusion, feature encoding, and feature reasoning, etc. Obtain the attention distribution characteristics of the training sample; input the training sample vehicle interior and exterior monitoring data into the semantic understanding model, perform feature extraction, feature fusion, feature encoding, feature reasoning and other multi-level progressive feature processing to obtain the semantic understanding of the training sample feature.

步骤C20，根据所述训练样本注意力分布特征和所述目标人类驾驶经验注意力分布特征之间的差异确定注意力分布认知损失，并根据所述训练样本语义理解特征和所述目标人类驾驶经验语义理解特征之间的差异确定语义理解损失；Step C20, determine the attention distribution cognitive loss based on the difference between the attention distribution characteristics of the training sample and the target human driving experience attention distribution characteristics, and determine the cognitive loss of attention distribution based on the semantic understanding characteristics of the training sample and the target human driving experience. Differences between empirical semantic understanding features determine semantic understanding loss;

步骤C30，聚合所述注意力分布认知损失和所述语义理解损失，得到场景认知损失。Step C30: Aggregate the attention distribution cognitive loss and the semantic understanding loss to obtain the scene cognitive loss.

作为一种示例，所述步骤C20-C30包括：根据所述训练样本注意力分布特征和所述目标人类驾驶经验注意力分布特征之间的差异确定注意力分布认知损失，并根据所述训练样本语义理解特征和所述目标人类驾驶经验语义理解特征之间的差异确定语义理解损失，进而可以通过求和、求平均、加权求和、加权平均等方式将所述注意力分布认知损失和所述语义理解损失进行聚合，得到场景认知损失。As an example, the steps C20-C30 include: determining the attention distribution cognitive loss based on the difference between the attention distribution characteristics of the training sample and the target human driving experience attention distribution characteristics, and determining the attention distribution cognitive loss based on the training sample attention distribution characteristics. The difference between the semantic understanding features of the sample and the target human driving experience semantic understanding features determines the semantic understanding loss, and then the attention distribution cognitive loss and The semantic understanding loss is aggregated to obtain the scene recognition loss.

步骤S33，将所述场景认知损失、所述地图构建损失、所述风险目标检测跟踪损失和所述运动预测损失聚合成认知层损失，基于所述认知层损失对所述认知层进行迭代优化。Step S33: Aggregate the scene recognition loss, the map construction loss, the risk target detection and tracking loss, and the motion prediction loss into a cognitive layer loss, and perform the cognitive layer loss based on the cognitive layer loss. Perform iterative optimization.

作为一种示例，所述步骤S33包括：将所述场景认知损失、所述地图构建损失、所述风险目标检测跟踪损失和所述运动预测损失进行聚合，得到认知层损失，其中，聚合认知层损失的方式可以为加和、加权求和、加权平均等。进而，判断所述认知层损失是否收敛，若所述认知层损失未收敛，则基于所述认知层损失，可以采用梯度下降法对所述认知层进行一轮更新，并返回执行所述获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征的步骤，进行下一轮训练；若所述认知层损失收敛，则可以获得迭代优化完成的认知层。As an example, step S33 includes: aggregating the scene recognition loss, the map construction loss, the risk target detection and tracking loss, and the motion prediction loss to obtain a cognitive layer loss, where aggregation The method of cognitive layer loss can be summation, weighted sum, weighted average, etc. Furthermore, it is determined whether the cognitive layer loss has converged. If the cognitive layer loss has not converged, then based on the cognitive layer loss, the gradient descent method can be used to perform a round of updates on the cognitive layer and return to execution. The steps of obtaining the training sample vehicle interior and exterior monitoring data and the training sample scene labels, and obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene labels, perform the next round of training; if the cognitive layer loss converges , then the cognitive layer completed by iterative optimization can be obtained.

在本实施例中，通过分别计算场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型的损失，并进行聚合，将聚合得到的总损失确定为整个认知层的认知层损失，基于总损失对认知层中的各个模型同步进行迭代优化，可以提升整个认知层对场景风险认知的准确性。In this embodiment, by separately calculating the losses of the scene cognitive model, map construction model, risk target detection and tracking model, and motion prediction model, and aggregating them, the total loss obtained by aggregation is determined as the cognitive layer of the entire cognitive layer. Loss, iterative optimization of each model in the cognitive layer based on the total loss can improve the accuracy of the entire cognitive layer's perception of scene risks.

实施例三Embodiment 3

进一步地，本申请实施例还提供一种自动驾驶控制方法，所述自动驾驶控制方法采用认知决策模型，所述认知决策模型包括认知层和决策层，所述认知层采用如上所述的自动驾驶优化方法进行预训练，与上述实施例相同或相似的内容，可以参考上文介绍，后续不再赘述。Further, embodiments of the present application also provide an automatic driving control method. The automatic driving control method adopts a cognitive decision-making model. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The cognitive layer adopts the above-mentioned method. The above-mentioned automatic driving optimization method is used for pre-training. Contents that are the same or similar to the above-mentioned embodiments can be referred to the above introduction and will not be repeated again.

参照图4，所述自动驾驶控制方法包括以下步骤：Referring to Figure 4, the automatic driving control method includes the following steps:

步骤D10，获取待预测车内外监测数据；Step D10, obtain the monitoring data inside and outside the vehicle to be predicted;

步骤D20，通过所述认知层从所述车内外监测数据中提取出待预测认知层特征，其中，所述待预测认知层特征包括待预测场景认知特征；Step D20: Extract the cognitive layer features to be predicted from the vehicle interior and exterior monitoring data through the cognitive layer, where the cognitive layer features to be predicted include the scene cognitive features to be predicted;

步骤D30，将所述待预测认知层特征输入所述决策层，确定自动驾驶控制参数。Step D30: Input the cognitive layer features to be predicted into the decision-making layer to determine automatic driving control parameters.

在本实施例中，需要说明的是，所述认知决策模型预先经过模型训练，在模型训练过程中，先采用如上所述的自动驾驶优化方法对认知层进行预训练，将预训练完成的认知层与决策层一同进行正式训练，所述正式训练可以采用现有技术相同或相近的方法进行，在此不过多赘述。正式训练后得到的认知决策模型可以用于所述自动驾驶控制方法，对自动驾驶车辆进行自动驾驶控制。其中，所述认知层的预训练可以在第一设备上进行，所述认知决策模型的正式训练可以在第二设备上进行，所述自动驾驶控制方法可以用于第三设备，第一设备、第二设备和第三设备中任意两个可以相同，也可以不同，具体可以根据实际情况进行确定，本实施例对此不加以限制。In this embodiment, it should be noted that the cognitive decision-making model has been model trained in advance. During the model training process, the cognitive layer is pre-trained using the automatic driving optimization method as described above, and the pre-training is completed. The cognitive layer and the decision-making layer conduct formal training together. The formal training can be carried out using the same or similar methods of the existing technology, so I will not go into details here. The cognitive decision-making model obtained after formal training can be used in the automatic driving control method to perform automatic driving control on the automatic driving vehicle. Wherein, the pre-training of the cognitive layer can be performed on the first device, the formal training of the cognitive decision-making model can be performed on the second device, and the automatic driving control method can be used on the third device. Any two of the device, the second device, and the third device may be the same or different, and the details may be determined according to actual conditions, which is not limited in this embodiment.

所述自动驾驶控制参数可以包括行驶轨迹、车辆控制参数等，其中，行驶轨迹是指自动驾驶车辆在当前时刻之后一段时间的轨迹，可以控制自动驾驶车辆基于所述行驶轨迹行驶，所述车辆控制参数包括速度、油门踏板开度、方向盘角度等。The autonomous driving control parameters may include driving trajectories, vehicle control parameters, etc., where the driving trajectory refers to the trajectory of the autonomous vehicle for a period of time after the current moment, and the autonomous driving vehicle can be controlled to drive based on the driving trajectory, and the vehicle control Parameters include speed, accelerator pedal opening, steering wheel angle, etc.

作为一种示例，所述步骤D10-D30包括：通过设置于自动驾驶车辆上的一个或多个传感器，采集当前时刻的传感器数据作为待预测车内外监测数据，还可以通过自动驾驶车辆上的通讯模块，与外部设备进行通信，获取外部设备提供的外部数据作为待预测车内外监测数据，所述外部数据例如红绿灯信号、交通事故信息、路侧监测设备采集的路侧监测数据等。进而，将所述待预测车内外监测数据输入所述认知层，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，得到待预测认知层特征，其中，所述待预测认知层特征至少包括待预测场景认知特征。进而，将所述待预测认知层特征输入所述决策层，确定自动驾驶车辆在当前时刻或当前时刻之后一段时间内的自动驾驶控制参数。As an example, the steps D10-D30 include: collecting sensor data at the current moment as the vehicle interior and exterior monitoring data to be predicted through one or more sensors installed on the self-driving vehicle, or through communication on the self-driving vehicle. The module communicates with the external device and obtains the external data provided by the external device as the monitoring data inside and outside the vehicle to be predicted, such as traffic light signals, traffic accident information, roadside monitoring data collected by the roadside monitoring equipment, etc. Furthermore, the vehicle interior and exterior monitoring data to be predicted is input into the cognitive layer, and multi-level progressive feature processing such as feature extraction, feature fusion, feature encoding, and feature reasoning is performed to obtain the cognitive layer features to be predicted, where, The cognitive layer features to be predicted at least include cognitive features of the scene to be predicted. Furthermore, the cognitive layer features to be predicted are input into the decision-making layer to determine the automatic driving control parameters of the automatic driving vehicle at the current moment or within a period of time after the current moment.

可选地，所述认知层包括场景认知模型、地图构建模型、风险目标检测跟踪模型和运动预测模型；所述待预测认知层特征还包括待预测地图特征、待预测风险目标检测跟踪特征和待预测风险目标运动特征；Optionally, the cognitive layer includes a scene cognitive model, a map construction model, a risk target detection and tracking model, and a motion prediction model; the cognitive layer features to be predicted also include map features to be predicted, risk target detection and tracking to be predicted Characteristics and movement characteristics of risk targets to be predicted;

所述通过所述认知层从所述车内外监测数据中提取出待预测认知层特征的步骤包括：The step of extracting cognitive layer features to be predicted from the vehicle interior and exterior monitoring data through the cognitive layer includes:

步骤D21，通过所述场景认知模型从所述待预测车内外监测数据中提取出待预测场景认知特征，通过所述地图构建模型从所述待预测车内外监测数据中提取出待预测地图特征，通过所述风险目标检测跟踪模型从所述待预测车内外监测数据中提取出待预测风险目标检测跟踪特征；Step D21: Use the scene cognitive model to extract the scene cognitive features to be predicted from the vehicle interior and exterior monitoring data to be predicted, and extract the map to be predicted from the vehicle interior and exterior monitoring data to be predicted using the map construction model. Features: extract the risk target detection and tracking features to be predicted from the vehicle interior and exterior monitoring data to be predicted through the risk target detection and tracking model;

步骤D22，通过所述运动预测模型基于所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征，进行风险目标运动预测，得到待预测风险目标运动特征，其中，所述运动预测模型采用注意力机制。Step D22, use the motion prediction model to perform risk target motion prediction based on the to-be-predicted scene cognitive features, the to-be-predicted map features, and the to-be-predicted risk target detection and tracking features to obtain the to-be-predicted risk target motion features, Wherein, the motion prediction model adopts an attention mechanism.

作为一种示例，所述步骤D21-D22包括：将所述待预测车内外监测数据输入所述场景认知模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出待预测场景认知特征；同步地，将所述待预测车内外监测数据输入所述地图构建模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出待预测地图特征；同步地，将所述待预测车内外监测数据输入所述风险目标检测跟踪模型，进行特征提取、特征融合、特征编码、特征推理等多层次递进式的特征处理，提取出待预测风险目标检测跟踪特征。进而，将所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征输入所述运动预测模型，通过运动预测模型，采用注意力机制，捕捉所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征之间的关联性和依赖性，基于这些关联性和依赖性，可以更准确地进行风险目标运行预测，确定至少一个风险目标的待预测风险目标运动特征。As an example, the steps D21-D22 include: inputting the vehicle interior and exterior monitoring data to be predicted into the scene cognitive model, and performing multi-level progressive features such as feature extraction, feature fusion, feature encoding, and feature reasoning. Processing to extract the cognitive features of the scene to be predicted; simultaneously, input the vehicle interior and exterior monitoring data to be predicted into the map construction model to perform multi-level progressive features such as feature extraction, feature fusion, feature encoding, and feature reasoning. Process to extract the map features to be predicted; simultaneously, input the vehicle interior and exterior monitoring data to be predicted into the risk target detection and tracking model to perform multi-level progressive features such as feature extraction, feature fusion, feature encoding, and feature reasoning. Process and extract the detection and tracking features of the risk target to be predicted. Furthermore, the cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted are input into the motion prediction model, and the attention mechanism is used through the motion prediction model to capture the features to be predicted. The correlation and dependence between the scene cognitive features, the map features to be predicted and the risk target detection and tracking features to be predicted. Based on these correlations and dependencies, risk target operation predictions can be made more accurately, and at least The motion characteristics of a risk target to be predicted.

可选地，所述待预测风险目标检测跟踪特征包括至少一个待预测风险目标检测跟踪子特征；所述运动预测模型包括注意力层、多层感知机层和预测层；Optionally, the risk target detection and tracking features to be predicted include at least one risk target detection and tracking sub-feature to be predicted; the motion prediction model includes an attention layer, a multi-layer perceptron layer and a prediction layer;

所述通过所述运动预测模型基于所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征，进行风险目标运动预测，得到待预测风险目标运动特征的步骤包括：The step of predicting the movement of risky targets through the motion prediction model based on the cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted, and obtaining the motion characteristics of the risky targets to be predicted include:

步骤D221，将所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征输入所述注意力层，采用注意力机制，确定各所述待预测风险目标检测跟踪子特征之间的待预测风险目标交互特征、各所述待预测风险目标检测跟踪子特征与所述待预测场景认知特征之间的待预测场景交互特征以及各所述待预测风险目标检测跟踪子特征与所述待预测地图特征之间的待预测地图交互特征；Step D221: Input the cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted into the attention layer, and use the attention mechanism to determine the detection of each risk target to be predicted. To-be-predicted risk target interaction features between tracking sub-features, to-be-predicted scene interaction features between each of the to-be-predicted risk target detection tracking sub-features and the to-be-predicted scene cognitive features, and to each of the to-be-predicted risk target detection Track the to-be-predicted map interaction features between the sub-features and the to-be-predicted map features;

步骤D222，将所述待预测风险目标交互特征、所述待预测场景交互特征以及所述待预测地图交互特征输入多层感知机层，得到待预测场景认知层特征和待预测运动查询特征；Step D222, input the risk target interaction features to be predicted, the scene interaction features to be predicted, and the map interaction features to be predicted into the multi-layer perceptron layer to obtain the cognitive layer features of the scene to be predicted and the motion query features to be predicted;

步骤D223，将所述待预测场景认知层特征和所述待预测运动查询特征输入所述预测层，进行风险目标运动预测，得到待预测风险目标运动特征。Step D223: Input the cognitive layer features of the scene to be predicted and the motion query features to be predicted into the prediction layer, perform risk target motion prediction, and obtain the risk target motion features to be predicted.

作为一种示例，所述步骤D221-D223包括：将所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征输入所述注意力层，采用注意力机制，捕捉各所述待预测风险目标检测跟踪子特征之间的依赖关系，以确定各个风险目标与其他风险目标之间的相互影响，生成待预测风险目标交互特征，捕捉每个所述待预测风险目标检测跟踪子特征与所述待预测场景认知特征之间的依赖关系，以确定每个风险目标与场景认知之间的相互影响，生成待预测场景交互特征，捕捉每个所述待预测风险目标检测跟踪子特征与所述待预测地图特征之间的依赖关系，以确定每个风险目标与地图元素之间的相互影响，生成待预测场景交互特征。进而，将所述待预测风险目标交互特征、所述待预测场景交互特征以及所述待预测地图交互特征拼接后，输入多层感知机层，得到待预测场景认知层特征和待预测运动查询特征。进而，将所述待预测场景认知层特征和所述待预测运动查询特征输入所述预测层，对每个风险目标进行风险目标运动预测，得到每个风险目标的待预测风险目标运动特征。As an example, the steps D221-D223 include: inputting the cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted into the attention layer, using an attention mechanism. , capture the dependencies between the detection and tracking sub-features of each of the risk targets to be predicted, to determine the interaction between each risk target and other risk targets, generate interaction features of the risk targets to be predicted, and capture each of the risks to be predicted Target detection tracks the dependence between sub-features and the cognitive features of the scene to be predicted to determine the interaction between each risk target and the scene cognition, generate interaction features of the scene to be predicted, and capture each of the scene cognitions to be predicted. Risk target detection tracks the dependence between sub-features and the map features to be predicted to determine the interaction between each risk target and map elements, and generate interaction features of the scene to be predicted. Furthermore, after splicing the risk target interaction features to be predicted, the scene interaction features to be predicted and the map interaction features to be predicted, the multi-layer perceptron layer is input to obtain the cognitive layer features of the scene to be predicted and the motion query to be predicted. feature. Furthermore, the cognitive layer features of the scene to be predicted and the motion query features to be predicted are input into the prediction layer, and risk target motion prediction is performed for each risk target to obtain the risk target motion features to be predicted for each risk target.

在一种可实施的方式中，多层感知机层还可以解码待预测场景认知层特征和待预测运动查询特征，将解码得到的场景风险预测信息和风险目标的运动信息作为中间结果输出，以供用户查看。In an implementable manner, the multi-layer perceptron layer can also decode the cognitive layer features of the scene to be predicted and the motion query features to be predicted, and output the decoded scene risk prediction information and the motion information of the risk target as intermediate results, for users to view.

在本实施例中，基于人类驾驶经验场景认知特征进行训练后得到的认知层，学习了人类驾驶经验，因此对复杂场景的认知更加全面且具有更强大的推理能力，从而可以更准确地理解并关注到场景中的风险点，将其应用于自动驾驶控制，可以引导决策层更准确地避开风险，作出更安全的驾驶决策，提高自动驾驶的安全性。In this embodiment, the cognitive layer obtained after training based on the cognitive characteristics of human driving experience scenes has learned human driving experience, so it has a more comprehensive cognition of complex scenes and has stronger reasoning capabilities, so that it can be more accurate Comprehensively understand and pay attention to the risk points in the scene, and apply it to autonomous driving control, which can guide decision-makers to avoid risks more accurately, make safer driving decisions, and improve the safety of autonomous driving.

实施例四Embodiment 4

进一步地，本申请实施例还提供一种自动驾驶优化装置，参照图5，所述自动驾驶优化装置上部署有认知决策模型，所述认知决策模型包括认知层和决策层，所述自动驾驶优化装置包括：Further, embodiments of the present application also provide an automatic driving optimization device. Referring to Figure 5, a cognitive decision-making model is deployed on the automatic driving optimization device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. Autonomous driving optimization devices include:

第一获取模块10，用于获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征；The first acquisition module 10 is used to acquire training sample vehicle interior and exterior monitoring data and training sample scene labels, and obtain target human driving experience scene cognitive features corresponding to the training sample scene labels;

第一认知模块20，用于通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征；The first cognitive module 20 is configured to extract the cognitive layer features of the training sample from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

优化模块30，用于根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化。The optimization module 30 is configured to iteratively optimize the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene.

所述第一认知模块20，还用于：The first cognitive module 20 is also used to:

通过所述场景认知模型从所述训练样本车内外监测数据中提取出训练样本场景认知特征，通过所述地图构建模型从所述训练样本车内外监测数据中提取出训练样本地图特征，通过所述风险目标检测跟踪模型从所述训练样本车内外监测数据中提取出训练样本风险目标检测跟踪特征；The scene cognitive model extracts the training sample scene cognitive features from the training sample vehicle interior and exterior monitoring data, and the map construction model extracts the training sample map features from the training sample vehicle interior and exterior monitoring data. The risk target detection and tracking model extracts the training sample risk target detection and tracking characteristics from the training sample vehicle interior and exterior monitoring data;

通过所述运动预测模型基于所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征，进行风险目标运动预测，得到训练样本风险目标运动特征，其中，所述运动预测模型采用注意力机制。The motion prediction model performs risk target motion prediction based on the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features to obtain training sample risk target motion features, where The motion prediction model described above uses an attention mechanism.

将所述训练样本场景认知特征、所述训练样本地图特征和所述训练样本风险目标检测跟踪特征输入所述注意力层，采用注意力机制，确定各所述风险目标检测跟踪子特征之间的训练样本风险目标交互特征、各所述风险目标检测跟踪子特征与所述训练样本场景认知特征之间的训练样本场景交互特征以及各所述风险目标检测跟踪子特征与所述训练样本地图特征之间的训练样本地图交互特征；The training sample scene cognitive features, the training sample map features and the training sample risk target detection and tracking features are input into the attention layer, and an attention mechanism is used to determine the relationship between each of the risk target detection and tracking sub-features. The training sample risk target interaction features, the training sample scene interaction features between each of the risk target detection and tracking sub-features and the training sample scene cognitive features, and each of the risk target detection and tracking sub-features and the training sample map Training sample map interaction features between features;

将所述训练样本风险目标交互特征、所述训练样本场景交互特征以及所述训练样本地图交互特征输入多层感知机层，得到训练样本场景认知层特征和训练样本运动查询特征；Input the training sample risk target interaction characteristics, the training sample scene interaction characteristics and the training sample map interaction characteristics into the multi-layer perceptron layer to obtain the training sample scene cognitive layer characteristics and the training sample motion query characteristics;

将所述训练样本场景认知层特征和所述训练样本运动查询特征输入所述预测层，进行风险目标运动预测，得到训练样本风险目标运动特征。The training sample scene cognitive layer features and the training sample motion query features are input into the prediction layer, and risk target motion prediction is performed to obtain training sample risk target motion features.

可选地，所述优化模块30，还用于：Optionally, the optimization module 30 is also used to:

获取训练样本地图标注数据、训练样本风险目标检测跟踪标注数据和训练样本运动标注数据；Obtain training sample map annotation data, training sample risk target detection and tracking annotation data, and training sample motion annotation data;

根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征之间的差异确定场景认知损失，根据所述训练样本地图特征和所述训练样本地图标注数据之间的差异确定地图构建损失，根据所述训练样本风险目标检测跟踪特征和所述训练样本风险目标检测跟踪标注数据之间的差异确定风险目标检测跟踪损失，并根据所述训练样本风险目标运动特征和所述训练样本运动标注数据之间的差异确定运动预测损失；The scene cognitive loss is determined based on the difference between the cognitive layer characteristics of the training sample and the cognitive characteristics of the target human driving experience scene, and the scene cognitive loss is determined based on the difference between the training sample map characteristics and the training sample map annotation data. The map construction loss is determined based on the difference between the risk target detection and tracking characteristics of the training sample and the risk target detection and tracking annotation data of the training sample, and the risk target detection and tracking loss is determined based on the risk target motion characteristics of the training sample and the training sample. The difference between the sample motion annotation data determines the motion prediction loss;

将所述场景认知损失、所述地图构建损失、所述风险目标检测跟踪损失和所述运动预测损失聚合成认知层损失，基于所述认知层损失对所述认知层进行迭代优化。The scene recognition loss, the map construction loss, the risk target detection and tracking loss, and the motion prediction loss are aggregated into a cognitive layer loss, and the cognitive layer is iteratively optimized based on the cognitive layer loss. .

所述优化模块30，还用于：The optimization module 30 is also used to:

根据预设时序对应关系，确定各所述训练样本场景认知子特征各自对应的目标人类驾驶经验场景认知子特征；According to the preset time sequence correspondence, determine the target human driving experience scene cognitive sub-feature corresponding to each of the training sample scene cognitive sub-features;

依次计算各所述训练样本场景认知子特征与各自对应的人类驾驶经验场景认知子特征之间的单体差异值，将各单体差异值的平均差异值，确定为场景认知损失。The individual difference values between each of the training sample scene cognitive sub-features and the corresponding human driving experience scene cognitive sub-features are calculated in sequence, and the average difference value of each individual difference value is determined as the scene cognitive loss.

所述第一认知模块，还用于：The first cognitive module is also used to:

通过所述注意力分布认知模型从所述训练样本车内外监测数据中提取出训练样本注意力分布特征，并通过所述语义理解模型从所述训练样本车内外监测数据中提取出训练样本语义理解特征；The attention distribution characteristics of the training sample are extracted from the training sample vehicle interior and exterior monitoring data through the attention distribution cognitive model, and the training sample semantics are extracted from the training sample vehicle interior and exterior monitoring data through the semantic understanding model. understand characteristics;

根据所述训练样本注意力分布特征和所述目标人类驾驶经验注意力分布特征之间的差异确定注意力分布认知损失，并根据所述训练样本语义理解特征和所述目标人类驾驶经验语义理解特征之间的差异确定语义理解损失；Determine the attention distribution cognitive loss based on the difference between the attention distribution characteristics of the training sample and the target human driving experience attention distribution characteristics, and understand the semantic understanding based on the semantic understanding characteristics of the training sample and the target human driving experience Differences between features determine semantic understanding loss;

聚合所述注意力分布认知损失和所述语义理解损失，得到场景认知损失。The attention distribution cognitive loss and the semantic understanding loss are aggregated to obtain the scene cognitive loss.

可选地，在所述获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征的操作之前，所述自动驾驶优化装置还包括场景标签确定模块，所述场景标签确定模块用于：Optionally, before the operation of obtaining the target human driving experience scene cognitive characteristics corresponding to the training sample scene label, the automatic driving optimization device further includes a scene label determination module, and the scene label determination module is used to:

获取多个人类驾驶经验场景监测数据，分别从各所述人类驾驶经验场景监测数据中提取人类驾驶经验场景认知特征；Acquire multiple human driving experience scene monitoring data, and extract human driving experience scene cognitive features from each of the human driving experience scene monitoring data;

对各所述人类驾驶经验场景认知特征进行聚类分析，确定各所述人类驾驶经验场景认知特征各自对应的场景标签。Perform cluster analysis on the cognitive features of each of the human driving experience scenes to determine the scene labels corresponding to the cognitive features of each of the human driving experience scenes.

本发明提供的自动驾驶优化装置，采用上述实施例中的自动驾驶优化方法，解决了相关技术中自动驾驶的安全性较低的技术问题。与相关技术相比，本发明实施例提供的自动驾驶优化装置的有益与上述实施例提供的自动驾驶优化方法的有益相同，且该自动驾驶优化装置中的其他技术特征与上述实施例方法公开的特征相同，在此不做赘述。The automatic driving optimization device provided by the present invention adopts the automatic driving optimization method in the above embodiment to solve the technical problem of low safety of automatic driving in related technologies. Compared with related technologies, the benefits of the automatic driving optimization device provided by the embodiments of the present invention are the same as those of the automatic driving optimization method provided by the above-mentioned embodiments, and other technical features in the automatic driving optimization device are the same as those disclosed in the above-mentioned embodiments. The characteristics are the same and will not be repeated here.

实施例五Embodiment 5

进一步地，本申请实施例还提供一种自动驾驶控制装置，所述自动驾驶控制装置上部署有认知决策模型，所述认知决策模型包括认知层和决策层，所述认知层采用如上所述的自动驾驶优化方法进行预训练，所述自动驾驶控制装置包括：Further, embodiments of the present application also provide an automatic driving control device. A cognitive decision-making model is deployed on the automatic driving control device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The cognitive layer adopts The automatic driving optimization method as mentioned above is pre-trained, and the automatic driving control device includes:

所述第二认知模型，还用于：The second cognitive model is also used for:

通过所述场景认知模型从所述待预测车内外监测数据中提取出待预测场景认知特征，通过所述地图构建模型从所述待预测车内外监测数据中提取出待预测地图特征，通过所述风险目标检测跟踪模型从所述待预测车内外监测数据中提取出待预测风险目标检测跟踪特征；The scene cognitive features to be predicted are extracted from the vehicle interior and exterior monitoring data to be predicted through the scene cognitive model, and the map features to be predicted are extracted from the vehicle interior and exterior monitoring data to be predicted through the map construction model. The risk target detection and tracking model extracts the risk target detection and tracking features to be predicted from the vehicle interior and exterior monitoring data to be predicted;

通过所述运动预测模型基于所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征，进行风险目标运动预测，得到待预测风险目标运动特征，其中，所述运动预测模型采用注意力机制。The motion prediction model performs motion prediction of risk targets based on the cognitive features of the scene to be predicted, the map features to be predicted, and the risk target detection and tracking features to be predicted, and obtains the motion characteristics of the risk targets to be predicted, wherein, The motion prediction model described above uses an attention mechanism.

将所述待预测场景认知特征、所述待预测地图特征和所述待预测风险目标检测跟踪特征输入所述注意力层，采用注意力机制，确定各所述待预测风险目标检测跟踪子特征之间的待预测风险目标交互特征、各所述待预测风险目标检测跟踪子特征与所述待预测场景认知特征之间的待预测场景交互特征以及各所述待预测风险目标检测跟踪子特征与所述待预测地图特征之间的待预测地图交互特征；The cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted are input into the attention layer, and an attention mechanism is used to determine each of the risk target detection and tracking sub-features to be predicted. The interaction features of risk targets to be predicted, the scene interaction features to be predicted between each of the risk target detection and tracking sub-features to be predicted and the cognitive features of the scene to be predicted, and the detection and tracking sub-features of each risk target to be predicted To-be-predicted map interaction features with the to-be-predicted map features;

将所述待预测风险目标交互特征、所述待预测场景交互特征以及所述待预测地图交互特征输入多层感知机层，得到待预测场景认知层特征和待预测运动查询特征；Input the risk target interaction features to be predicted, the scene interaction features to be predicted, and the map interaction features to be predicted into the multi-layer perceptron layer to obtain the cognitive layer features of the scene to be predicted and the motion query features to be predicted;

将所述待预测场景认知层特征和所述待预测运动查询特征输入所述预测层，进行风险目标运动预测，得到待预测风险目标运动特征。The cognitive layer features of the scene to be predicted and the motion query features to be predicted are input into the prediction layer, and risk target motion prediction is performed to obtain the risk target motion features to be predicted.

实施例六Embodiment 6

进一步地，本发明实施例提供一种电子设备，电子设备包括：至少一个处理器；以及，与至少一个处理器通信连接的存储器；其中，存储器存储有可被至少一个处理器执行的指令，指令被至少一个处理器执行，以使至少一个处理器能够执行上述实施例中的自动驾驶优化方法。Further, embodiments of the present invention provide an electronic device. The electronic device includes: at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions that can be executed by the at least one processor, and the instructions Executed by at least one processor, so that at least one processor can execute the automatic driving optimization method in the above embodiment.

下面参考图6，其示出了适于用来实现本公开实施例的电子设备的结构示意图。本公开实施例中的电子设备可以包括但不限于诸如蓝牙耳机、移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图6示出的电子设备仅仅是一个示例，不应对本公开实施例的功能和使用范围带来任何限制。Referring now to FIG. 6 , a schematic structural diagram of an electronic device suitable for implementing embodiments of the present disclosure is shown. Electronic devices in embodiments of the present disclosure may include, but are not limited to, Bluetooth headsets, mobile phones, laptops, digital broadcast receivers, PDAs (Personal Digital Assistants), PADs (Tablets), PMPs (Portable Multimedia Players), vehicle Mobile terminals such as car navigation terminals and fixed terminals such as digital TVs, desktop computers and the like. The electronic device shown in FIG. 6 is only an example and should not impose any limitations on the functions and scope of use of the embodiments of the present disclosure.

如图6所示，电子设备可以包括处理装置(例如中央处理器、图形处理器等)，其可以根据存储在只读存储器(ROM)中的程序或者从存储装置加载到随机访问存储器(RAM)中的程序而执行各种适当的动作和处理。在RAM中，还存储有电子设备操作所需的各种程序和数组。处理装置、ROM以及RAM通过总线彼此相连。输入/输出(I/O)接口也连接至总线。As shown in FIG. 6 , the electronic device may include a processing device (such as a central processing unit, a graphics processor, etc.) that may be loaded into a random access memory (RAM) according to a program stored in a read-only memory (ROM) or from a storage device. perform various appropriate actions and processing according to the program in it. In RAM, various programs and arrays required for the operation of electronic equipment are also stored. The processing device, ROM and RAM are connected to each other via a bus. Input/output (I/O) interfaces are also connected to the bus.

通常，以下系统可以连接至I/O接口：包括例如触摸屏、触摸板、键盘、鼠标、图像传感器、麦克风、加速度计、陀螺仪等的输入装置；包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置；包括例如磁带、硬盘等的存储装置；以及通信装置。通信装置可以允许电子设备与其他设备进行无线或有线通信以交换数组。虽然图中示出了具有各种系统的电子设备，但是应理解的是，并不要求实施或具备所有示出的系统。可以替代地实施或具备更多或更少的系统。Typically, the following systems can be connected to the I/O interface: input devices including, for example, touch screens, touch pads, keyboards, mice, image sensors, microphones, accelerometers, gyroscopes, etc.; including, for example, liquid crystal displays (LCDs), speakers, vibrators Output devices, etc.; storage devices including tapes, hard disks, etc.; and communication devices. The communication device may allow the electronic device to communicate wirelessly or wiredly with other devices to exchange arrays. Although the figures illustrate electronic devices having various systems, it is to be understood that implementation or availability of all illustrated systems is not required. More or fewer systems may alternatively be implemented or provided.

特别地，根据本公开的实施例，上文参考流程图描述的过程可以被实现为计算机软件程序。例如，本公开的实施例包括一种计算机程序产品，其包括承载在计算机可读介质上的计算机程序，该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中，该计算机程序可以通过通信装置从网络上被下载和安装，或者从存储装置被安装，或者从ROM被安装。在该计算机程序被处理装置执行时，执行本公开实施例的方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product including a computer program carried on a computer-readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such embodiments, the computer program may be downloaded and installed from the network through the communication device, or from a storage device, or from a ROM. When the computer program is executed by the processing device, the above-mentioned functions defined in the method of the embodiment of the present disclosure are performed.

本发明提供的电子设备，采用上述实施例中的自动驾驶优化方法，解决了相关技术中自动驾驶的安全性较低的技术问题。与相关技术相比，本发明实施例提供的电子设备的有益与上述实施例提供的自动驾驶优化方法的有益相同，且该电子设备中的其他技术特征与上述实施例方法公开的特征相同，在此不做赘述。The electronic device provided by the present invention adopts the automatic driving optimization method in the above embodiment to solve the technical problem of low safety of automatic driving in related technologies. Compared with related technologies, the benefits of the electronic device provided by the embodiments of the present invention are the same as those of the autonomous driving optimization method provided by the above-mentioned embodiments, and other technical features in the electronic device are the same as those disclosed by the above-mentioned embodiments. This will not be described in detail.

应当理解，本公开的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式的描述中，具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。It should be understood that various parts of the present disclosure may be implemented in hardware, software, firmware, or combinations thereof. In the above description of the embodiments, specific features, structures, materials or characteristics may be combined in any suitable manner in any one or more embodiments or examples.

以上所述，仅为本发明的具体实施方式，但本发明的保护范围并不局限于此，任何熟悉本技术领域的技术人员在本发明揭露的技术范围内，可轻易想到变化或替换，都应涵盖在本发明的保护范围之内。因此，本发明的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any person familiar with the technical field can easily think of changes or substitutions within the technical scope disclosed by the present invention. should be covered by the protection scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

实施例七Embodiment 7

进一步地，本实施例提供一种计算机可读存储介质，具有存储在其上的计算机可读程序指令，计算机可读程序指令用于执行上述实施例中的自动驾驶优化方法。Further, this embodiment provides a computer-readable storage medium having computer-readable program instructions stored thereon. The computer-readable program instructions are used to execute the automatic driving optimization method in the above embodiment.

本发明实施例提供的计算机可读存储介质例如可以是U盘，但不限于电、磁、光、电磁、红外线、或半导体的系统、系统或器件，或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于：具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本实施例中，计算机可读存储介质可以是任何包含或存储程序的有形介质，该程序可以被指令执行系统、系统或者器件使用或者与其结合使用。计算机可读存储介质上包含的程序代码可以用任何适当的介质传输，包括但不限于：电线、光缆、RF(射频)等等，或者上述的任意合适的组合。The computer-readable storage medium provided by the embodiment of the present invention may be, for example, a USB flash drive, but is not limited to electrical, magnetic, optical, electromagnetic, infrared, or semiconductor systems, systems or devices, or any combination thereof. More specific examples of computer readable storage media may include, but are not limited to: an electrical connection having one or more wires, a portable computer disk, a hard drive, random access memory (RAM), read only memory (ROM), removable Programmed read-only memory (EPROM or flash memory), fiber optics, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In this embodiment, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, system, or device. Program code contained on a computer-readable storage medium may be transmitted using any suitable medium, including but not limited to: wire, optical cable, RF (radio frequency), etc., or any suitable combination of the above.

上述计算机可读存储介质可以是电子设备中所包含的；也可以是单独存在，而未装配入电子设备中。The above-mentioned computer-readable storage medium may be included in the electronic device; it may also exist independently without being assembled into the electronic device.

上述计算机可读存储介质承载有一个或者多个程序，当上述一个或者多个程序被电子设备执行时，使得电子设备：获取训练样本车内外监测数据和训练样本场景标签，并获取所述训练样本场景标签对应的目标人类驾驶经验场景认知特征；通过所述认知层从所述训练样本车内外监测数据中提取出训练样本认知层特征；根据所述训练样本认知层特征和所述目标人类驾驶经验场景认知特征，对所述认知层进行迭代优化。The above computer-readable storage medium carries one or more programs. When the above one or more programs are executed by an electronic device, the electronic device: obtains the training sample vehicle interior and exterior monitoring data and the training sample scene tags, and obtains the training sample. Cognitive features of the target human driving experience scene corresponding to the scene label; extract the cognitive layer features of the training sample from the inside and outside monitoring data of the training sample through the cognitive layer; according to the cognitive layer features of the training sample and the Based on the cognitive characteristics of the target human driving experience scene, the cognitive layer is iteratively optimized.

或者，上述计算机可读存储介质承载有一个或者多个程序，当上述一个或者多个程序被电子设备执行时，使得电子设备：获取待预测车内外监测数据；通过所述认知层从所述车内外监测数据中提取出待预测认知层特征，其中，所述待预测认知层特征包括待预测场景认知特征；将所述待预测认知层特征输入所述决策层，确定自动驾驶控制参数。Alternatively, the computer-readable storage medium carries one or more programs. When the one or more programs are executed by the electronic device, the electronic device: obtains the vehicle interior and exterior monitoring data to be predicted; and obtains the vehicle interior and exterior monitoring data from the vehicle through the cognitive layer. The cognitive layer features to be predicted are extracted from the monitoring data inside and outside the vehicle, where the cognitive layer features to be predicted include the scene cognitive features to be predicted; the cognitive layer features to be predicted are input into the decision-making layer to determine the automatic driving Control parameters.

可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码，上述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++，还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中，远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机，或者，可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing the operations of the present disclosure may be written in one or more programming languages, including object-oriented programming languages such as Java, Smalltalk, C++, and conventional Procedural programming language—such as "C" or a similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In situations involving remote computers, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider through Internet connection).

附图中的流程图和框图，图示了按照本发明各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上，流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分，该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意，在有些作为替换的实现中，方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如，两个接连地表示的方框实际上可以基本并行地执行，它们有时也可以按相反的顺序执行，这依所涉及的功能而定。也要注意的是，框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合，可以用执行规定的功能或操作的专用的基于硬件的系统来实现，或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagram may represent a module, segment, or portion of code that contains one or more logic functions that implement the specified executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown one after another may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or operations. , or can be implemented using a combination of specialized hardware and computer instructions.

描述于本公开实施例中所涉及到的模块可以通过软件的方式实现，也可以通过硬件的方式来实现。其中，模块的名称在某种情况下并不构成对该单元本身的限定。The modules involved in the embodiments of the present disclosure can be implemented in software or hardware. Among them, the name of the module does not constitute a limitation on the unit itself under certain circumstances.

本发明提供的计算机可读存储介质，存储有用于执行上述自动驾驶优化方法的计算机可读程序指令，解决了相关技术中自动驾驶的安全性较低的技术问题。与相关技术相比，本发明实施例提供的计算机可读存储介质的有益与上述实施例提供的自动驾驶优化方法的有益相同，在此不做赘述。The computer-readable storage medium provided by the present invention stores computer-readable program instructions for executing the above-mentioned automatic driving optimization method, and solves the technical problem of low safety of automatic driving in related technologies. Compared with related technologies, the benefits of the computer-readable storage medium provided by the embodiments of the present invention are the same as the benefits of the automatic driving optimization method provided by the above-mentioned embodiments, and will not be described again here.

以上仅为本申请的优选实施例，并非因此限制本申请的专利范围，凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换，或直接或间接运用在其他相关的技术领域，均同理包括在本申请的专利处理范围内。The above are only preferred embodiments of the present application, and are not intended to limit the patent scope of the present application. Any equivalent structure or equivalent process transformation made using the contents of the description and drawings of the present application may be directly or indirectly used in other related technical fields. , are all similarly included in the patent processing scope of this application.

Claims

1. An automatic driving optimization method, characterized in that the automatic driving optimization method is applied to a cognitive decision-making model, and the cognitive decision-making model includes a cognitive layer and a decision-making layer, including the following steps:

Obtain training sample vehicle interior and exterior monitoring data and training sample scene labels, and obtain target human driving experience scene cognitive characteristics corresponding to the training sample scene labels;

Extract training sample cognitive layer features from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

The cognitive layer is iteratively optimized according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene.

2. The automatic driving optimization method according to claim 1, characterized in that the cognitive layer includes a scene cognitive model, a map construction model, a risk target detection and tracking model and a motion prediction model; the training sample cognitive layer Features include training sample scene cognitive features, training sample map features, training sample risk target detection and tracking features, and training sample risk target motion features;

The step of extracting the cognitive layer features of the training sample from the training sample vehicle interior and exterior monitoring data through the cognitive layer includes:

The scene cognitive model extracts the training sample scene cognitive features from the training sample vehicle interior and exterior monitoring data, and the map construction model extracts the training sample map features from the training sample vehicle interior and exterior monitoring data. The risk target detection and tracking model extracts the training sample risk target detection and tracking characteristics from the training sample vehicle interior and exterior monitoring data;

The motion prediction model performs risk target motion prediction based on the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features to obtain training sample risk target motion features, where The motion prediction model described above uses an attention mechanism.

3. The automatic driving optimization method according to claim 2, wherein the training sample risk target detection and tracking features include at least one training sample risk target detection and tracking sub-feature; the motion prediction model includes an attention layer, a multi-level layer perceptron layer and prediction layer;

The step of using the motion prediction model to perform risk target motion prediction based on the training sample scene cognitive features, the training sample map features, and the training sample risk target detection and tracking features to obtain the training sample risk target motion characteristics. include:

The training sample scene cognitive features, the training sample map features and the training sample risk target detection and tracking features are input into the attention layer, and an attention mechanism is used to determine the relationship between each of the risk target detection and tracking sub-features. The training sample risk target interaction features, the training sample scene interaction features between each of the risk target detection and tracking sub-features and the training sample scene cognitive features, and each of the risk target detection and tracking sub-features and the training sample map Training sample map interaction features between features;

Input the training sample risk target interaction characteristics, the training sample scene interaction characteristics and the training sample map interaction characteristics into the multi-layer perceptron layer to obtain the training sample scene cognitive layer characteristics and the training sample motion query characteristics;

The training sample scene cognitive layer features and the training sample motion query features are input into the prediction layer, and risk target motion prediction is performed to obtain training sample risk target motion features.

4. The automatic driving optimization method according to claim 2, wherein the cognitive layer is iteratively optimized according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene. The steps include:

Obtain training sample map annotation data, training sample risk target detection and tracking annotation data, and training sample motion annotation data;

The scene cognitive loss is determined based on the difference between the cognitive layer characteristics of the training sample and the cognitive characteristics of the target human driving experience scene, and the scene cognitive loss is determined based on the difference between the training sample map characteristics and the training sample map annotation data. The map construction loss is determined based on the difference between the risk target detection and tracking characteristics of the training sample and the risk target detection and tracking annotation data of the training sample, and the risk target detection and tracking loss is determined based on the risk target motion characteristics of the training sample and the training sample. The difference between the sample motion annotation data determines the motion prediction loss;

The scene recognition loss, the map construction loss, the risk target detection and tracking loss, and the motion prediction loss are aggregated into a cognitive layer loss, and the cognitive layer is iteratively optimized based on the cognitive layer loss. .

5. The automatic driving optimization method according to claim 4, wherein the training sample scene cognitive characteristics include a plurality of training sample scene cognitive sub-features, and the target human driving experience scene cognitive characteristics include a plurality of human driving experience sub-features. Driving experience-like scene cognition sub-features;

The step of determining the scene cognitive loss based on the difference between the cognitive layer characteristics of the training sample and the target human driving experience scene cognitive characteristics includes:

According to the preset time sequence correspondence, determine the target human driving experience scene cognitive sub-feature corresponding to each of the training sample scene cognitive sub-features;

The individual difference values between each of the training sample scene cognitive sub-features and the corresponding human driving experience scene cognitive sub-features are calculated in sequence, and the average difference value of each individual difference value is determined as the scene cognitive loss.

6. The automatic driving optimization method according to claim 5, characterized in that said sequentially calculates the single difference between each of the training sample scene cognitive sub-features and the respective corresponding human driving experience scene cognitive sub-features. The steps to determine the average difference value of each individual unit difference value as scene cognitive loss include:

Input each of the training sample scene cognitive sub-features and each of the target human driving experience scene cognitive sub-features into a preset scene cognitive loss function to calculate the scene cognitive loss, where the scene cognitive loss function is :

in, For scene cognitive loss,/> is the scene cognitive sub-feature of human driving experience at time t,/> is the scene cognitive sub-feature of the training sample at time t, T _f is the preset timestamp, t∈T _f .

7. The automatic driving optimization method according to claim 4, wherein the scene cognitive model includes an attention distribution cognitive model and a semantic understanding model; the training sample scene cognitive characteristics include a training sample attention distribution. Features and training sample semantic understanding features; the target human driving experience scene cognitive features include target human driving experience attention distribution features and target human driving experience semantic understanding features;

The step of extracting the scene cognitive features of the training sample from the training sample vehicle interior and exterior monitoring data through the scene cognitive model includes:

The attention distribution characteristics of the training sample are extracted from the training sample vehicle interior and exterior monitoring data through the attention distribution cognitive model, and the training sample semantics are extracted from the training sample vehicle interior and exterior monitoring data through the semantic understanding model. understand characteristics;

Determine the attention distribution cognitive loss based on the difference between the attention distribution characteristics of the training sample and the target human driving experience attention distribution characteristics, and understand the semantic understanding based on the semantic understanding characteristics of the training sample and the target human driving experience Differences between features determine semantic understanding loss;

The attention distribution cognitive loss and the semantic understanding loss are aggregated to obtain the scene cognitive loss.

8. The automatic driving optimization method according to claim 1, characterized in that before the step of obtaining the cognitive characteristics of the target human driving experience scene corresponding to the training sample scene label, it also includes:

Acquire multiple human driving experience scene monitoring data, and extract human driving experience scene cognitive features from each of the human driving experience scene monitoring data;

Perform cluster analysis on the cognitive features of each of the human driving experience scenes to determine the scene labels corresponding to the cognitive features of each of the human driving experience scenes.

9. An automatic driving control method, characterized in that the automatic driving control method adopts a cognitive decision-making model, and the cognitive decision-making model includes a cognitive layer and a decision-making layer, and the cognitive layer adopts the method as claimed in claim 1- The automatic driving optimization method described in any one of 8 is pre-trained, and the automatic driving control method includes the following steps:

Obtain monitoring data inside and outside the vehicle to be predicted;

The cognitive layer features to be predicted are extracted from the vehicle interior and exterior monitoring data through the cognitive layer, where the cognitive layer features to be predicted include the scene cognitive features to be predicted;

The cognitive layer features to be predicted are input into the decision-making layer to determine the automatic driving control parameters.

10. The automatic driving control method according to claim 9, wherein the cognitive layer includes a scene cognitive model, a map construction model, a risk target detection and tracking model and a motion prediction model; the cognitive layer to be predicted Features also include map features to be predicted, risk target detection and tracking features to be predicted, and risk target motion features to be predicted;

The step of extracting cognitive layer features to be predicted from the vehicle interior and exterior monitoring data through the cognitive layer includes:

The scene cognitive features to be predicted are extracted from the vehicle interior and exterior monitoring data to be predicted through the scene cognitive model, and the map features to be predicted are extracted from the vehicle interior and exterior monitoring data to be predicted through the map construction model. The risk target detection and tracking model extracts the risk target detection and tracking features to be predicted from the vehicle interior and exterior monitoring data to be predicted;

The motion prediction model performs motion prediction of risk targets based on the cognitive features of the scene to be predicted, the map features to be predicted, and the risk target detection and tracking features to be predicted, and obtains the motion characteristics of the risk targets to be predicted, wherein, The motion prediction model described above uses an attention mechanism.

11. The automatic driving control method according to claim 10, wherein the risk target detection and tracking features to be predicted include at least one risk target detection and tracking sub-feature to be predicted; the motion prediction model includes an attention layer, a multi-level layer perceptron layer and prediction layer;

The step of predicting the movement of risky targets through the motion prediction model based on the cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted, and obtaining the motion characteristics of the risky targets to be predicted include:

The cognitive features of the scene to be predicted, the map features to be predicted and the risk target detection and tracking features to be predicted are input into the attention layer, and an attention mechanism is used to determine each of the risk target detection and tracking sub-features to be predicted. The interaction features of risk targets to be predicted, the scene interaction features to be predicted between each of the risk target detection and tracking sub-features to be predicted and the cognitive features of the scene to be predicted, and the detection and tracking sub-features of each risk target to be predicted To-be-predicted map interaction features with the to-be-predicted map features;

Input the risk target interaction features to be predicted, the scene interaction features to be predicted, and the map interaction features to be predicted into the multi-layer perceptron layer to obtain the cognitive layer features of the scene to be predicted and the motion query features to be predicted;

The cognitive layer features of the scene to be predicted and the motion query features to be predicted are input into the prediction layer, and risk target motion prediction is performed to obtain the risk target motion features to be predicted.

12. An automatic driving optimization device, characterized in that a cognitive decision-making model is deployed on the automatic driving optimization device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The automatic driving optimization device includes:

The first acquisition module is used to acquire training sample vehicle interior and exterior monitoring data and training sample scene labels, and obtain target human driving experience scene cognitive features corresponding to the training sample scene labels;

A first cognitive module, configured to extract training sample cognitive layer features from the training sample vehicle interior and exterior monitoring data through the cognitive layer;

An optimization module, configured to iteratively optimize the cognitive layer according to the cognitive layer characteristics of the training samples and the cognitive characteristics of the target human driving experience scene.

13. An automatic driving control device, characterized in that a cognitive decision-making model is deployed on the automatic driving control device. The cognitive decision-making model includes a cognitive layer and a decision-making layer. The cognitive layer adopts the method described above. The automatic driving optimization method is pre-trained. The automatic driving control device includes:

The second acquisition module is used to acquire the vehicle interior and exterior monitoring data to be predicted;

A second cognitive module, configured to extract the cognitive layer features to be predicted from the vehicle interior and exterior monitoring data through the cognitive layer, where the cognitive layer features to be predicted include the scene cognitive features to be predicted;

A decision-making module, used to input the cognitive layer characteristics to be predicted into the decision-making layer and determine automatic driving control parameters.

14. An electronic device, characterized in that the electronic device includes:

at least one processor; and,

a memory communicatively connected to at least one of the processors; wherein,

The memory stores instructions executable by at least one of the processors, and the instructions are executed by at least one of the processors to enable at least one of the processors to perform as claimed in any one of claims 1 to 8 The steps of the automatic driving optimization method described above, or the steps of the automatic driving control method according to any one of claims 9 to 11.

15. A medium, characterized in that the medium is a computer-readable storage medium, and a program for implementing an automatic driving optimization method is stored on the computer-readable storage medium. The program for realizing an automatic driving optimization method is processed by a processor. Executed to implement the steps of the automatic driving optimization method according to any one of claims 1 to 8, or the steps of the automatic driving control method according to any one of claims 9 to 11.