CN118409532A

CN118409532A - End-to-end automatic driving method, device, equipment and storage medium

Info

Publication number: CN118409532A
Application number: CN202410417712.7A
Authority: CN
Inventors: 周航宁; 朱润文; 张点堃; 赵建博; 陈习武; 宫佳豪; 周琪斌; 王宁梓; 徐子尧; 张弛
Original assignee: Beijing Maichi Zhixing Technology Co ltd
Current assignee: Beijing Maichi Zhixing Technology Co ltd
Priority date: 2024-04-08
Filing date: 2024-04-08
Publication date: 2024-07-30
Also published as: WO2025214357A1

Abstract

The application provides an end-to-end automatic driving method, device, equipment and storage medium, wherein the method comprises the following steps: acquiring sensor data of a self-vehicle at the current moment; processing the sensor data at the current moment to obtain sparse sensor feature vectors; performing interactive processing on the reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector; performing interactive processing on the sparse query quantity of the reference map and the sparse sensor feature vector to obtain a sparse query vector of the map; performing obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and performing map construction according to the map sparse query vector to obtain an online map; and automatically driving the vehicle based on the obstacle detection result and the online map. Therefore, all information is characterized as sparse query vectors, all subtasks of automatic driving are connected in series based on the sparse query vectors, and efficient and high-accuracy end-to-end automatic driving is achieved.

Description

End-to-end autonomous driving method, device, equipment and storage medium

技术领域Technical Field

本申请涉及图像处理技术领域，特别涉及一种端到端的自动驾驶方法、装置、设备及存储介质。The present application relates to the field of image processing technology, and in particular to an end-to-end autonomous driving method, apparatus, device and storage medium.

背景技术Background technique

自动驾驶系统集多种任务于一身，如检测、跟踪、在线建图、运动预测和自车规划/控制。主流自动驾驶系统将每项任务解耦为一个单独的模型，并独立优化。每个模型之间会加上手工设置的后处理步骤用于处理冗余信息或疑难案例，这使得整个流程变得非常复杂。并且，原始传感器数据无法用于下游任务，误差在系统信息传递中逐渐累积，从而导致潜在的安全问题。Autonomous driving systems integrate multiple tasks such as detection, tracking, online mapping, motion prediction, and self-vehicle planning/control. Mainstream autonomous driving systems decouple each task into a separate model and optimize it independently. Manual post-processing steps are added between each model to handle redundant information or difficult cases, which makes the entire process very complicated. In addition, the raw sensor data cannot be used for downstream tasks, and errors gradually accumulate in the system information transmission, leading to potential safety issues.

为了解决上述问题，端到端自动驾驶系统将原始传感器数据作为输入，并以更简洁的方式返回规划结果。在现有的端到端自动驾驶方法中，整个驾驶场景通过密集的鸟瞰图特征来表示，将多传感器数据和时间信息作为端到端自动驾驶系统的原始输入，并在每个子任务中发挥作用。To solve the above problems, the end-to-end autonomous driving system takes raw sensor data as input and returns planning results in a more concise way. In existing end-to-end autonomous driving methods, the entire driving scene is represented by dense bird's-eye view features, and multi-sensor data and time information are used as raw input to the end-to-end autonomous driving system and play a role in each subtask.

尽管此类方法能够简洁得到规划结果，且能模型端到端优化，解决累计误差和优化目标不一致的问题，但在自动驾驶的每个子任务上的性能远落后于相应的单一任务方法。并且，时序信息和传感器数据的融合通过密集的鸟瞰图特征来实现，显著增加计算成本、内存占用，难以实现模型部署。Although such methods can obtain planning results concisely and optimize the model end-to-end, solving the problems of cumulative error and inconsistent optimization targets, their performance in each subtask of autonomous driving lags far behind the corresponding single-task methods. In addition, the fusion of time series information and sensor data is achieved through dense bird's-eye view features, which significantly increases the computational cost and memory usage, making it difficult to deploy the model.

发明内容Summary of the invention

鉴于上述问题，本申请实施例提供了一种端到端的自动驾驶方法、装置、设备及存储介质，以便克服上述问题或者至少部分地解决上述问题。In view of the above problems, the embodiments of the present application provide an end-to-end autonomous driving method, apparatus, device and storage medium to overcome the above problems or at least partially solve the above problems.

本申请实施例的第一方面，公开了一种端到端的自动驾驶方法，所述方法包括：According to a first aspect of an embodiment of the present application, an end-to-end autonomous driving method is disclosed, the method comprising:

获取自车的当前时刻传感器数据；Get the current sensor data of the vehicle;

对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量；Processing the current moment sensor data to obtain a sparse sensor feature vector;

对参考检测稀疏查询向量与所述稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，所述参考检测稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史检测稀疏查询向量；Interactively processing a reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle;

对参考地图稀疏查询量与所述稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，所述参考地图稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史地图稀疏查询向量；Interactively processing the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle;

根据所述检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据所述地图稀疏查询向量进行地图构建，得到在线地图；Performing obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and performing map construction according to the map sparse query vector to obtain an online map;

基于所述障碍物检测结果和所述在线地图对自车进行自动驾驶。The vehicle is automatically driven based on the obstacle detection result and the online map.

可选地，所述方法还包括：Optionally, the method further comprises:

根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量；According to the historical position information of the obstacle, a sparse query vector for obstacle reference trajectory prediction is determined;

根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量；According to the historical position information of the ego vehicle, a sparse query vector for predicting the ego vehicle reference trajectory is determined;

将所述障碍物参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到障碍物轨迹预测稀疏查询向量；Interactively processing the obstacle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an obstacle trajectory prediction sparse query vector;

将所述自车参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到自车轨迹预测稀疏查询向量；Interactively processing the ego-vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an ego-vehicle trajectory prediction sparse query vector;

根据所述障碍物轨迹预测稀疏查询向量和所述自车轨迹预测稀疏查询向量进行轨迹预测，得到障碍物预测轨迹和自车预测轨迹；Performing trajectory prediction according to the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain an obstacle prediction trajectory and an ego vehicle prediction trajectory;

基于所述障碍物检测结果和所述在线地图对自车进行自动驾驶，包括：The autonomous vehicle is driven automatically based on the obstacle detection result and the online map, including:

基于所述障碍物检测结果、所述在线地图、所述障碍物预测轨迹和所述自车预测轨迹对自车进行自动驾驶。The self-vehicle is automatically driven based on the obstacle detection result, the online map, the obstacle prediction trajectory and the self-vehicle prediction trajectory.

可选地，所述方法还包括：Optionally, the method further comprises:

对所述自车轨迹预测稀疏查询向量和自车导航命令信息进行交互处理，得到含导航信息的自车轨迹预测稀疏查询向量；Interactively processing the ego-vehicle trajectory prediction sparse query vector and ego-vehicle navigation command information to obtain an ego-vehicle trajectory prediction sparse query vector containing navigation information;

利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果；Utilizing the ego vehicle trajectory prediction sparse query vector containing navigation information, optimizing the ego vehicle prediction trajectory under multiple constraints to obtain an ego vehicle path planning result;

基于所述自车路径规划结果、所述障碍物检测结果、所述在线地图、所述障碍物预测轨迹和所述自车预测轨迹对自车进行自动驾驶。The self-vehicle is automatically driven based on the self-vehicle path planning result, the obstacle detection result, the online map, the obstacle prediction trajectory and the self-vehicle prediction trajectory.

可选地，在所述多种约束条件包括动力学约束的情况下，利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果，包括：Optionally, when the multiple constraints include dynamic constraints, the ego vehicle trajectory prediction sparse query vector containing navigation information is used to optimize the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result, including:

利用所述含导航信息的自车轨迹预测稀疏查询向量，回归自车的运动信息，所述运动信息包括自车的速度和加速度；Using the ego vehicle trajectory containing navigation information to predict a sparse query vector, regressing the motion information of the ego vehicle, the motion information including the speed and acceleration of the ego vehicle;

利用所述运动信息对所述自车预测轨迹进行优化处理，得到自车路径规划结果。The motion information is used to optimize the predicted trajectory of the ego vehicle to obtain a ego vehicle path planning result.

可选地，在所述多种约束条件包括安全距离约束的情况下，利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果，包括：Optionally, when the multiple constraints include a safety distance constraint, the ego vehicle trajectory prediction sparse query vector containing navigation information is used to optimize the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result, including:

根据所述含导航信息的自车轨迹预测稀疏查询向量，和所述障碍物轨迹预测稀疏查询向量，预测障碍物与自车的相对位置关系；Predicting a relative position relationship between the obstacle and the ego vehicle based on the ego vehicle trajectory prediction sparse query vector containing navigation information and the obstacle trajectory prediction sparse query vector;

利用所述相对位置关系对所述自车预测轨迹进行距离约束优化处理，得到自车路径规划结果。The relative position relationship is used to perform distance constraint optimization processing on the predicted trajectory of the ego vehicle to obtain a ego vehicle path planning result.

可选地，根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量，包括：Optionally, determining the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information includes:

将所述障碍物检测结果中检测框置信度分数超过分数阈值的检测框，确定为跟踪目标检测框；Determine the detection frame whose confidence score in the obstacle detection result exceeds the score threshold as the tracking target detection frame;

将所述跟踪目标检测框对应的稀疏查询向量，确定为跟踪稀疏查询向量；Determine the sparse query vector corresponding to the tracking target detection box as the tracking sparse query vector;

将所述跟踪稀疏查询向量对应的障碍物历史位置信息，编码为历史位置稀疏查询向量；Encoding the obstacle historical position information corresponding to the tracking sparse query vector into a historical position sparse query vector;

将所述历史位置稀疏查询向量和所述跟踪稀疏查询向量进行交互处理，得到障碍物参考轨迹预测稀疏查询向量。The historical position sparse query vector and the tracking sparse query vector are interactively processed to obtain an obstacle reference trajectory prediction sparse query vector.

可选地，所述参考检测稀疏查询向量通过以下步骤得到：Optionally, the reference detection sparse query vector is obtained by the following steps:

获取历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量，所述历史检测稀疏查询向量包括：当前时刻之前的N个时刻的历史检测稀疏查询向量，N为大于1的整数；Obtaining a historical detection sparse query vector and a tracking sparse query vector at a previous moment, wherein the historical detection sparse query vector includes: historical detection sparse query vectors at N moments before the current moment, where N is an integer greater than 1;

根据所述历史检测稀疏查询向量和所述上一时刻的跟踪稀疏查询向量，得到参考检测稀疏查询向量。A reference detection sparse query vector is obtained according to the historical detection sparse query vector and the tracking sparse query vector at the last moment.

可选地，对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量，包括：Optionally, the current moment sensor data is processed to obtain a sparse sensor feature vector, including:

对所述当前时刻传感器数据进行特征提取，得到稀疏的初始传感器特征向量；Performing feature extraction on the current moment sensor data to obtain a sparse initial sensor feature vector;

根据所述当前时刻传感器数据，计算所述稀疏的初始传感器特征向量的空间位置信息；Calculating the spatial position information of the sparse initial sensor feature vector according to the current sensor data;

将所述空间位置信息和所述稀疏的初始传感器特征向量进行融合，得到稀疏的传感器特征向量。The spatial position information and the sparse initial sensor feature vector are fused to obtain a sparse sensor feature vector.

可选地，端到端的自动驾驶是通过多任务处理网络实现的；Optionally, end-to-end autonomous driving is achieved through a multi-tasking network;

在所述端到端的自动驾驶包括障碍物检测和地图构建的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息和真实地图信息；In the case where the end-to-end autonomous driving includes obstacle detection and map construction, the labels of the training data of the multi-task processing network include: real position information of obstacles and real map information;

在所述端到端的自动驾驶包括障碍物检测、地图构建和轨迹预测的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息、真实地图信息、障碍物真实轨迹和自车真实轨迹；In the case where the end-to-end autonomous driving includes obstacle detection, map construction and trajectory prediction, the labels of the training data of the multi-task processing network include: real position information of obstacles, real map information, real trajectory of obstacles and real trajectory of the vehicle;

在所述端到端的自动驾驶包括障碍物检测、地图构建、轨迹预测和路径规划的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息、真实地图信息、障碍物真实轨迹、自车真实轨迹、自车的真实运动信息、障碍物与自车的真实相对位置关系。In the case where the end-to-end autonomous driving includes obstacle detection, map construction, trajectory prediction and path planning, the labels of the training data of the multi-task processing network include: the real position information of the obstacle, the real map information, the real trajectory of the obstacle, the real trajectory of the vehicle, the real motion information of the vehicle, and the real relative position relationship between the obstacle and the vehicle.

本申请实施例的第二方面，公开了一种端到端的自动驾驶装置，所述装置包括：According to a second aspect of the embodiments of the present application, an end-to-end autonomous driving device is disclosed, the device comprising:

数据获取模块，用于获取自车的当前时刻传感器数据；A data acquisition module is used to obtain the current sensor data of the vehicle;

数据处理模块，用于对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量；A data processing module, used for processing the sensor data at the current moment to obtain a sparse sensor feature vector;

第一交互模块，用于对参考检测稀疏查询向量与所述稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，所述参考检测稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史检测稀疏查询向量；A first interaction module is used to interactively process a reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle;

第二交互模块，用于对参考地图稀疏查询量与所述稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，所述参考地图稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史地图稀疏查询向量；A second interaction module is used to interactively process the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle;

检测构建模块，用于根据所述检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据所述地图稀疏查询向量进行地图构建，得到在线地图；A detection construction module, configured to perform obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and to perform map construction according to the map sparse query vector to obtain an online map;

自动驾驶模块，用于基于所述障碍物检测结果和所述在线地图对自车进行自动驾驶。The automatic driving module is used to automatically drive the vehicle based on the obstacle detection result and the online map.

本申请实施例的第三方面，公开了一种电子设备，包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序，所述处理器执行所述计算机程序时实现本申请实施例第一方面所述的端到端的自动驾驶方法的步骤。According to a third aspect of an embodiment of the present application, an electronic device is disclosed, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor. When the processor executes the computer program, the steps of the end-to-end autonomous driving method described in the first aspect of the embodiment of the present application are implemented.

本申请实施例的第四方面，公开了一种计算机可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行时实现本申请实施例第一方面所述的端到端的自动驾驶方法的步骤。The fourth aspect of an embodiment of the present application discloses a computer-readable storage medium having a computer program stored thereon. When the computer program is executed by a processor, the steps of the end-to-end autonomous driving method described in the first aspect of the embodiment of the present application are implemented.

本申请实施例包括以下优点：The embodiments of the present application include the following advantages:

在本申请实施例中，将整个自动驾驶场景中空间维度和时间维度上的所有信息都由稀疏查询向量表示，而不使用任何密集的鸟瞰图特征。获取自车的当前时刻传感器数据，对于自车当前时刻传感器数据，将其处理为稀疏的传感器特征向量；并对参考检测稀疏查询向量与稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量；对参考地图稀疏查询量与稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量；从而根据检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据地图稀疏查询向量进行地图构建，得到在线地图，从而基于障碍物检测结果和所述在线地图对自车进行自动驾驶。In an embodiment of the present application, all information in the spatial and temporal dimensions of the entire autonomous driving scene is represented by a sparse query vector without using any dense bird's-eye view features. The current moment sensor data of the vehicle is obtained, and the current moment sensor data of the vehicle is processed into a sparse sensor feature vector; and the reference detection sparse query vector and the sparse sensor feature vector are interactively processed to obtain a detection sparse query vector; the reference map sparse query vector and the sparse sensor feature vector are interactively processed to obtain a map sparse query vector; obstacle detection is performed according to the detection sparse query vector to obtain an obstacle detection result, and map construction is performed according to the map sparse query vector to obtain an online map, so that the vehicle is autonomously driven based on the obstacle detection result and the online map.

由于参考检测稀疏查询向量包括基于自车的历史传感器数据确定出的历史检测稀疏查询向量，参考地图稀疏查询向量包括基于自车的历史传感器数据确定出的历史地图稀疏查询向量，因此在基于参考检测稀疏查询和稀疏的传感器特征向量得到的检测稀疏查询向量引入了历史时序信息和当前时刻传感器数据，在基于参考地图稀疏查询向量和稀疏的传感器特征向量得到的地图检测稀疏查询向量中引入了历史时序信息和当前时刻传感器数据，得益于历史时序信息和当前时刻传感器数据的引入，使端到端的自动驾驶中的每个任务(如障碍物检测和地图构建)的具有更好性能。并且，通过稀疏查询向量能够更有效地利用长时序的历史信息、扩展到更多的模态和任务，降低计算成本和内存占用。如此，将不同模态、不同任务、空间和时间上的所有信息全部表征为稀疏的查询向量，并基于此串联起自动驾驶的各个子任务，实现高效且高准确性的端到端的自动驾驶。Since the reference detection sparse query vector includes a historical detection sparse query vector determined based on the historical sensor data of the vehicle, and the reference map sparse query vector includes a historical map sparse query vector determined based on the historical sensor data of the vehicle, the detection sparse query vector obtained based on the reference detection sparse query and the sparse sensor feature vector introduces historical time series information and current moment sensor data, and the map detection sparse query vector obtained based on the reference map sparse query vector and the sparse sensor feature vector introduces historical time series information and current moment sensor data. Thanks to the introduction of historical time series information and current moment sensor data, each task in end-to-end autonomous driving (such as obstacle detection and map construction) has better performance. In addition, through the sparse query vector, it is possible to more effectively utilize long-time historical information, expand to more modes and tasks, and reduce computing costs and memory usage. In this way, all information in different modes, different tasks, space and time is represented as a sparse query vector, and based on this, each subtask of autonomous driving is connected in series to achieve efficient and high-accuracy end-to-end autonomous driving.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本申请实施例的技术方案，下面将对本申请实施例的描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings required for use in the description of the embodiments of the present application will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present application. For ordinary technicians in this field, other drawings can be obtained based on these drawings without paying any creative labor.

图1是本申请实施例提供的一种端到端的自动驾驶方法的步骤流程图；FIG1 is a flowchart of the steps of an end-to-end autonomous driving method provided by an embodiment of the present application;

图2是本申请实施例提供的一种稀疏感知网络的结构示意图；FIG2 is a schematic diagram of the structure of a sparse sensing network provided in an embodiment of the present application;

图3是本申请实施例提供的一种运动规划网络的结构示意图；FIG3 is a schematic diagram of the structure of a motion planning network provided in an embodiment of the present application;

图4是本申请实施例提供的另一种端到端的自动驾驶方法的步骤流程图；FIG4 is a flowchart of another end-to-end autonomous driving method provided by an embodiment of the present application;

图5是本申请实施例提供的一种多任务处理网络的结构示意图；FIG5 is a schematic diagram of the structure of a multi-task processing network provided in an embodiment of the present application;

图6是本申请实施例提供的一种端到端的自动驾驶装置的结构示意图；FIG6 is a schematic diagram of the structure of an end-to-end autonomous driving device provided in an embodiment of the present application;

图7是本申请实施例提供的一种电子设备的结构示意图。FIG. 7 is a schematic diagram of the structure of an electronic device provided in an embodiment of the present application.

具体实施方式Detailed ways

为使本申请的上述目的、特征和优点能够更加明显易懂，下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本申请保护的范围。In order to make the above-mentioned purposes, features and advantages of the present application more obvious and understandable, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by ordinary technicians in this field without creative work are within the scope of protection of this application.

在相关技术中，通过密集的鸟瞰图特征来表示整个驾驶场景，时序信息和传感器数据融合通过密集的鸟瞰图特征来实现，显著增加计算成本、内存占用，难以实现模型部署，并且自动驾驶中的每个子任务上的性能远落后于相应的单一任务处理方法。为了克服上述问题的局限性，申请人提出以下技术构思：提出使用纯稀疏方式，不依赖任何密集的鸟瞰图特征进行地图构建和障碍检测。将不同模态、不同任务、空间和时间上的所有信息全部表征为稀疏的查询向量，并基于此串联起自动驾驶的各个子任务，实现高效且高准确性的端到端的自动驾驶。In the related art, the entire driving scene is represented by dense bird's-eye view features, and the fusion of time series information and sensor data is achieved through dense bird's-eye view features, which significantly increases the computing cost and memory usage, making it difficult to implement model deployment, and the performance of each subtask in autonomous driving lags far behind the corresponding single task processing method. In order to overcome the limitations of the above problems, the applicant proposes the following technical concept: It is proposed to use a purely sparse method without relying on any dense bird's-eye view features for map construction and obstacle detection. All information in different modalities, different tasks, space and time are represented as sparse query vectors, and based on this, the various subtasks of autonomous driving are connected in series to achieve efficient and high-accuracy end-to-end autonomous driving.

基于上述技术构思，本申请实施例提供了一种端到端的自动驾驶方法，参照图1所示，图1是本申请实施例提供的一种端到端的自动驾驶方法的步骤流程图。如图1所示，该方法可以包括步骤S101至步骤S104：Based on the above technical concept, the embodiment of the present application provides an end-to-end autonomous driving method, as shown in Figure 1, which is a step flow chart of an end-to-end autonomous driving method provided by the embodiment of the present application. As shown in Figure 1, the method may include steps S101 to S104:

步骤S101：获取自车的当前时刻传感器数据。Step S101: Acquire the current sensor data of the vehicle.

本申请实施例中，当前时刻传感器数据是当前时刻采集的反映自车整个自动驾驶场景的环境数据，当前时刻传感器数据包括不同类型传感器数据(即多模态的数据)。在一种实施方式中，当前时刻传感器数据包括雷达采集的点云数据和车载相机采集的多视角图像。其中，多视角图像的视角和数量根据车载相机排布的不同会有所不同，多视角图像可以包括自车的正前方视角图像、左前方视角图像、右前方视角图像、正后方视角图像、左后方视角图像、以及右后方视角图像等。并且，多视角图像可以是从视频中抽取的当前时刻的视频帧图像，也可以是车载相机直接采集的当前时刻的图像。In an embodiment of the present application, the sensor data at the current moment is the environmental data collected at the current moment that reflects the entire autonomous driving scene of the vehicle, and the sensor data at the current moment includes different types of sensor data (i.e., multimodal data). In one embodiment, the sensor data at the current moment includes point cloud data collected by the radar and multi-view images collected by the on-board camera. Among them, the viewing angle and number of multi-view images will vary depending on the arrangement of the on-board cameras. The multi-view images may include the front view image, the left front view image, the right front view image, the rear view image, the left rear view image, and the right rear view image of the vehicle. In addition, the multi-view image can be a video frame image at the current moment extracted from the video, or it can be an image at the current moment directly collected by the on-board camera.

步骤S102：对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量。Step S102: Process the sensor data at the current moment to obtain a sparse sensor feature vector.

本申请实施例中，稀疏的传感器特征向量与鸟瞰图特征相比是一个更离散化、更稀疏的三维特征表示，稀疏的传感器特征向量表征驾驶场景中的目标对象，不包含驾驶场景中的冗余信息。例如，稀疏的传感器特征向量表征驾驶场景中的障碍物，或表征驾驶场景中的地图元素。由于稀疏的传感器特征向量表征驾驶场景中的目标对象，所以稀疏的传感器特征向量包含了目标物体的语义信息和位置信息。在具体实施时，对于不同类型传感器数据(例如，图像数据、点云数据)使用不同的特征提取网络进行处理，得到稀疏的传感器特征向量；可以通过增加编码器数量来扩展对多模态数据的支持。并且，对当前时刻传感器数据进行特征编码时，还对数据在三维空间中的位置信息进行编码，进而将特征(语义信息)和位置统一到稀疏的传感器特征向量中。In an embodiment of the present application, the sparse sensor feature vector is a more discrete and sparse three-dimensional feature representation compared to the bird's-eye view feature. The sparse sensor feature vector represents the target object in the driving scene and does not contain redundant information in the driving scene. For example, the sparse sensor feature vector represents obstacles in the driving scene, or represents map elements in the driving scene. Since the sparse sensor feature vector represents the target object in the driving scene, the sparse sensor feature vector contains the semantic information and position information of the target object. In the specific implementation, different types of sensor data (for example, image data, point cloud data) are processed using different feature extraction networks to obtain sparse sensor feature vectors; support for multimodal data can be expanded by increasing the number of encoders. In addition, when the sensor data at the current moment is feature encoded, the position information of the data in three-dimensional space is also encoded, and then the features (semantic information) and positions are unified into the sparse sensor feature vector.

在一种可选的实施例中，对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量，包括：对所述当前时刻传感器数据进行特征提取，得到稀疏的初始传感器特征向量；根据所述当前时刻传感器数据，计算所述稀疏的初始传感器特征向量的空间位置信息；将所述空间位置信息和所述稀疏的初始传感器特征向量进行融合，得到稀疏的传感器特征向量。In an optional embodiment, the sensor data at the current moment is processed to obtain a sparse sensor feature vector, including: extracting features from the sensor data at the current moment to obtain a sparse initial sensor feature vector; calculating spatial position information of the sparse initial sensor feature vector based on the sensor data at the current moment; and fusing the spatial position information with the sparse initial sensor feature vector to obtain a sparse sensor feature vector.

本申请实施例中，稀疏的初始传感器特征向量是指包含目标物体语义信息的特征向量，为了准确表征驾驶场景中的目标对象，在得到稀疏的初始传感器特征向量之后，对每个稀疏的初始传感器特征向量进行位置编码，得到包含语义信息和位置信息的稀疏的传感器特征向量。In an embodiment of the present application, a sparse initial sensor feature vector refers to a feature vector containing semantic information of a target object. In order to accurately characterize the target object in a driving scene, after obtaining the sparse initial sensor feature vector, each sparse initial sensor feature vector is position-encoded to obtain a sparse sensor feature vector containing semantic information and position information.

当前时刻传感器数据包括多种不同类型传感器数据，对于不同类型传感器数据，采样不用的特征提取网络提取出不同类型传感器数据的特征，并将不同类型传感器数据的特征展平拼接在一起，得到稀疏的初始传感器特征向量；对于每个稀疏的初始传感器特征向量，计算出对应的空间位置信息，并将空间位置信息和稀疏的初始传感器特征向量进行融合，得到稀疏的传感器特征向量。The sensor data at the current moment includes multiple different types of sensor data. For different types of sensor data, different feature extraction networks are sampled to extract features of different types of sensor data, and the features of different types of sensor data are flattened and spliced together to obtain sparse initial sensor feature vectors; for each sparse initial sensor feature vector, the corresponding spatial position information is calculated, and the spatial position information and the sparse initial sensor feature vector are fused to obtain a sparse sensor feature vector.

例如，不同类型传感器数据包括图像数据和云数据，将图像数据表示为I∈R^N ^×H×W×3，其中N是输入图像视角的数量；点云数据表示为其中N_p是点云数据中点的个数，C_p是每个点云数据对应的维度。使用特征提取网络分别对图像数据和点云数据进行特征提取，得到图像数据特征和点云数据特征，并将图像数据特征和点云数据特征展平拼接在一起得到稀疏的初始传感器特征向量其中N_f是稀疏的初始传感器特征向的个数，C是稀疏的初始传感器特征向的维度。之后，针对每个稀疏的初始传感器特征向，计算对应的空间位置信息，并将空间位置信息与稀疏的初始传感器特征向量F_t融合，得到稀疏的传感器特征向量。 For example, different types of sensor data include image data and cloud data. Image data is represented as I∈R ^N ^×H×W×3 , where N is the number of input image viewpoints; point cloud data is represented as Where _Np is the number of points in the point cloud data, and _Cp is the dimension corresponding to each point cloud data. Use the feature extraction network to extract features from the image data and point cloud data respectively, obtain image data features and point cloud data features, and flatten and splice the image data features and point cloud data features together to obtain a sparse initial sensor feature vector Where _Nf is the number of sparse initial sensor feature vectors, and C is the dimension of the sparse initial sensor feature vectors. Afterwards, for each sparse initial sensor feature vector, the corresponding spatial position information is calculated, and the spatial position information is fused with the sparse initial sensor feature vector _Ft to obtain a sparse sensor feature vector.

步骤S103：对参考检测稀疏查询向量与所述稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，所述参考检测稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史检测稀疏查询向量。Step S103: interactively processing the reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle.

本申请实施例中，检测稀疏查询向量是指当前时刻用于检测跟踪的稀疏查询向量，每个检测稀疏查询向量表征一个障碍物体，例如，每个检测稀疏查询向量对应驾驶场景中的一个车辆。历史检测稀疏查询向量是指当前时刻之前的N个时刻的历史检测稀疏查询向量，随着自动驾驶的进行，历史检测稀疏查询向量也会随之更新。In the embodiment of the present application, the detection sparse query vector refers to the sparse query vector used for detection and tracking at the current moment, and each detection sparse query vector represents an obstacle object, for example, each detection sparse query vector corresponds to a vehicle in the driving scene. The historical detection sparse query vector refers to the historical detection sparse query vector of N moments before the current moment, and the historical detection sparse query vector will be updated as the autonomous driving progresses.

具体地，对参考检测稀疏查询向量与稀疏的传感器特征向量进行交互处理是指：通过自注意力机制和交叉注意力机制，对参考检测稀疏查询向量和稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量。由于参考检测稀疏查询向量包括历史检测稀疏查询向量，因此通过交互处理在检测稀疏查询向量中引入了历史时序信息和当前时刻传感器数据，基于检测稀疏查询向量可以很好的实现障碍物检测。并且，在得到检测稀疏查询向量之后，将该检测稀疏查询向量进行保存，以用于下一时刻的自动驾驶。Specifically, interactively processing the reference detection sparse query vector and the sparse sensor feature vector means: interactively processing the reference detection sparse query vector and the sparse sensor feature vector through a self-attention mechanism and a cross-attention mechanism to obtain a detection sparse query vector. Since the reference detection sparse query vector includes a historical detection sparse query vector, historical timing information and current moment sensor data are introduced into the detection sparse query vector through interactive processing, and obstacle detection can be well implemented based on the detection sparse query vector. In addition, after obtaining the detection sparse query vector, the detection sparse query vector is saved for autonomous driving at the next moment.

在一种可选的实施例中，所述参考检测稀疏查询向量通过以下步骤得到：获取历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量，所述历史检测稀疏查询向量包括：当前时刻之前的N个时刻的历史检测稀疏查询向量，N为大于1的整数；根据所述历史检测稀疏查询向量和所述上一时刻的跟踪稀疏查询向量，得到参考检测稀疏查询向量。In an optional embodiment, the reference detection sparse query vector is obtained by the following steps: obtaining a historical detection sparse query vector and a tracking sparse query vector at a previous moment, the historical detection sparse query vector comprising: historical detection sparse query vectors of N moments before the current moment, where N is an integer greater than 1; obtaining a reference detection sparse query vector based on the historical detection sparse query vector and the tracking sparse query vector at the previous moment.

本申请实施例中，上一时刻的跟踪稀疏查询向量是根据上一时刻的障碍物检测结果确定的，用于对可信度高的障碍物进行跟踪。在具体实施时，历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量存储在一个端到端多任务记忆库中，因而从端到端多任务记忆库中获取历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量。In the embodiment of the present application, the tracking sparse query vector at the last moment is determined according to the obstacle detection result at the last moment, and is used to track obstacles with high credibility. In the specific implementation, the historical detection sparse query vector and the tracking sparse query vector at the last moment are stored in an end-to-end multi-task memory library, and thus the historical detection sparse query vector and the tracking sparse query vector at the last moment are obtained from the end-to-end multi-task memory library.

具体地，根据所述历史检测稀疏查询向量和所述上一时刻的跟踪稀疏查询向量，包括：将初始化检测稀疏查询向量、历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量进行拼接，得到参考检测稀疏查询向量。其中，初始化检测稀疏查询向量是一个可学习的随机初始化向量。Specifically, according to the historical detection sparse query vector and the tracking sparse query vector at the last moment, it includes: concatenating the initialization detection sparse query vector, the historical detection sparse query vector and the tracking sparse query vector at the last moment to obtain a reference detection sparse query vector. The initialization detection sparse query vector is a learnable random initialization vector.

步骤S104：对参考地图稀疏查询量与所述稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，所述参考地图稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史地图稀疏查询向量。Step S104: interactively processing the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle.

本申请实施例中，地图稀疏查询向量是指当前时刻用于在线建图的稀疏查询向量，每个地图稀疏查询向量表征一个地图元素，例如，每个地图稀疏查询向量对应地图中的一个线段。历史地图稀疏查询向量是指当前时刻之前的N个时刻的历史地图稀疏查询向量，随着自动驾驶的进行，历史地图稀疏查询向量也会随之更新。In the embodiment of the present application, the map sparse query vector refers to the sparse query vector used for online mapping at the current moment, and each map sparse query vector represents a map element, for example, each map sparse query vector corresponds to a line segment in the map. The historical map sparse query vector refers to the historical map sparse query vector of N moments before the current moment, and the historical map sparse query vector will be updated as the autonomous driving progresses.

具体地，对参考地图稀疏查询量与稀疏的传感器特征向量进行交互处理是指：通过自注意力机制和交叉注意力机制，对参考地图稀疏查询量和稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向。由于参考地图稀疏查询量包括历史地图稀疏查询向量，因此通过交互处理在地图稀疏查询向量中引入了历史时序信息和当前时刻传感器数据，基于地图稀疏查询向量可以很好的实现地图构建。并且，在得到地图稀疏查询向量之后，将该地图稀疏查询向量进行保存，以用于下一时刻的自动驾驶。Specifically, interactively processing the reference map sparse query volume and the sparse sensor feature vector means: interactively processing the reference map sparse query volume and the sparse sensor feature vector through the self-attention mechanism and the cross-attention mechanism to obtain the map sparse query vector. Since the reference map sparse query volume includes the historical map sparse query vector, the historical time series information and the current moment sensor data are introduced into the map sparse query vector through interactive processing, and the map construction can be well realized based on the map sparse query vector. In addition, after obtaining the map sparse query vector, the map sparse query vector is saved for autonomous driving at the next moment.

在一种可选的实施例中，所述参考地图稀疏查询向量通过以下步骤得到：获取历史地图稀疏查询向量，所述历史地图稀疏查询向量包括：当前时刻之前的N个时刻的历史地图稀疏查询向量，N为大于1的整数；将所述历史地图稀疏查询向量与初始化地图稀疏查询向量进行拼接，得到历史地图稀疏查询向量，其中，初始化地图稀疏查询向量是一个可学习的随机初始化向量。In an optional embodiment, the reference map sparse query vector is obtained by the following steps: obtaining a historical map sparse query vector, the historical map sparse query vector comprising: historical map sparse query vectors of N moments before the current moment, where N is an integer greater than 1; concatenating the historical map sparse query vector with an initialization map sparse query vector to obtain a historical map sparse query vector, wherein the initialization map sparse query vector is a learnable random initialization vector.

步骤S105：根据所述检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据所述地图稀疏查询向量进行地图构建，得到在线地图。Step S105: performing obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and performing map construction according to the map sparse query vector to obtain an online map.

本申请实施例中，每个检测稀疏查询向量表征一个障碍物体，利用检测网络对检测稀疏查询向量进行处理，得到障碍物检测结果，障碍物可以是驾驶场景中的车辆，也可以是驾驶场景中的行人。其中，障碍物检测结果是通过检测框和检测框置信度分数来表征，检测框置信度分数越高，说明对应障碍物检测结果越可信。每个地图稀疏查询向量表征一个地图元素，利用地图构建网络对地图稀疏查询向量进行处理，得到在线地图。在线地图包括车道线信息和道路标识等信息。In an embodiment of the present application, each detection sparse query vector represents an obstacle object, and the detection sparse query vector is processed by the detection network to obtain an obstacle detection result. The obstacle can be a vehicle in the driving scene or a pedestrian in the driving scene. Among them, the obstacle detection result is represented by a detection box and a detection box confidence score. The higher the detection box confidence score, the more reliable the corresponding obstacle detection result. Each map sparse query vector represents a map element, and the map sparse query vector is processed by the map construction network to obtain an online map. The online map includes information such as lane line information and road signs.

步骤S106：基于所述障碍物检测结果和所述在线地图对自车进行自动驾驶。Step S106: Automatically drive the vehicle based on the obstacle detection result and the online map.

本申请实施例中，障碍物检测结果和在线地图是通过当前时刻传感器数据处理得到的，基于当前时刻传感器数据实现自动驾驶的多个子任务。如此，将不同模态、不同任务、空间和时间上的所有信息全部表征为稀疏的查询向量，并基于此串联起自动驾驶的各个子任务，实现高效且高准确性的端到端的自动驾驶。In the embodiment of the present application, the obstacle detection results and online maps are obtained by processing the sensor data at the current moment, and multiple subtasks of autonomous driving are realized based on the sensor data at the current moment. In this way, all information in different modes, different tasks, space and time are represented as sparse query vectors, and based on this, the various subtasks of autonomous driving are connected in series to achieve efficient and high-accuracy end-to-end autonomous driving.

本申请实施例中，端到端的自动驾驶是通过多任务处理网络实现的；在所述端到端的自动驾驶包括障碍物检测和地图构建的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息和真实地图信息。通过在训练数据中携带障碍物的真实位置信息和真实地图信息的标签，实现对障碍物检测任务的端到端优化，以及实现对地图构建任务的端到端优化。如此，将障碍物检测任务和地图构建任务串联起来，实现高效且高准确性的端到端的自动驾驶。In the embodiment of the present application, end-to-end autonomous driving is achieved through a multi-task processing network; when the end-to-end autonomous driving includes obstacle detection and map construction, the labels of the training data of the multi-task processing network include: the real position information of the obstacle and the real map information. By carrying the labels of the real position information and the real map information of the obstacle in the training data, end-to-end optimization of the obstacle detection task and end-to-end optimization of the map construction task are achieved. In this way, the obstacle detection task and the map construction task are connected in series to achieve efficient and highly accurate end-to-end autonomous driving.

具体地，多任务处理网络包括特征提取网络和稀疏感知网络。其中，特征提取网络用于对当前时刻传感器数据进行处理，得到稀疏的传感器特征向量。稀疏感知网络用于执行障碍物检测任务和地图构建任务。示例地，图2是本申请实施例提供的一种稀疏感知网络的结构示意图，稀疏感知网络包括两个时序编码器、障碍物件检测模块(即障碍物输出头)和地图构建模块(即地图构建输出头)和数据更新模块。Specifically, the multi-task processing network includes a feature extraction network and a sparse perception network. Among them, the feature extraction network is used to process the sensor data at the current moment to obtain a sparse sensor feature vector. The sparse perception network is used to perform obstacle detection tasks and map construction tasks. By way of example, Figure 2 is a structural schematic diagram of a sparse perception network provided in an embodiment of the present application, and the sparse perception network includes two time series encoders, an obstacle detection module (i.e., an obstacle output head) and a map construction module (i.e., a map construction output head) and a data update module.

执行障碍物检测任务和地图构建任务的过程为：从端到端多任务记忆库中获取历史检测稀疏查询向量、历史地图稀疏查询向量、以及上一时刻的跟踪稀疏查询向量；并根据历史检测稀疏查询向量、上一时刻的跟踪稀疏查询向量和初始化检测稀疏查询向量，得到参考检测稀疏查询向量；根据历史地图稀疏查询向量和初始化地图稀疏查询向量，得到参考地图稀疏查询向量；利用一个时序编码器对参考检测稀疏查询向量与稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，并利用障碍物件检测模块对检测稀疏查询向量进行障碍物检测，得到障碍物检测结果；利用另一个时序编码器对参考地图稀疏查询量与稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，并利用地图构建模块对地图稀疏查询向量进行地图构建，得到在线地图。最后，数据更新模块对端到端多任务记忆库中的历史地图稀疏查询向量、历史检测稀疏查询向量和跟踪稀疏查询向量进行更新，以用于下一时刻的端到端的自动驾驶。The process of executing obstacle detection tasks and map construction tasks is as follows: obtaining historical detection sparse query vectors, historical map sparse query vectors, and tracking sparse query vectors at the previous moment from an end-to-end multi-task memory library; and obtaining a reference detection sparse query vector based on the historical detection sparse query vector, the tracking sparse query vector at the previous moment, and the initialization detection sparse query vector; obtaining a reference map sparse query vector based on the historical map sparse query vector and the initialization map sparse query vector; using a temporal encoder to interactively process the reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, and using an obstacle detection module to perform obstacle detection on the detection sparse query vector to obtain an obstacle detection result; using another temporal encoder to interactively process the reference map sparse query vector and the sparse sensor feature vector to obtain a map sparse query vector, and using a map construction module to construct a map on the map sparse query vector to obtain an online map. Finally, the data update module updates the historical map sparse query vector, historical detection sparse query vector, and tracking sparse query vector in the end-to-end multi-task memory library for end-to-end autonomous driving at the next moment.

在一种可选的实施例中，为了得到障碍物预测轨迹和自车预测轨迹，以在障碍物检测结果和在线地图的基础上，还基于障碍物预测轨迹和自车预测轨迹对自车进行自动驾驶。在步骤S105之后，还包括以下步骤S107至步骤S1012：In an optional embodiment, in order to obtain the obstacle prediction trajectory and the vehicle prediction trajectory, the vehicle is automatically driven based on the obstacle prediction trajectory and the vehicle prediction trajectory on the basis of the obstacle detection result and the online map. After step S105, the following steps S107 to S1012 are also included:

步骤S107：根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量。Step S107: Determine the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information.

本申请实施例中，障碍物历史位置信息是指用于跟踪的障碍物历史位置信息，障碍物历史位置信息通过地图中的坐标位置来表示；障碍物参考轨迹预测稀疏查询向量是指用于障碍物轨迹预测的初始化向量。In the embodiment of the present application, the obstacle historical position information refers to the obstacle historical position information used for tracking, and the obstacle historical position information is represented by the coordinate position in the map; the obstacle reference trajectory prediction sparse query vector refers to the initialization vector used for obstacle trajectory prediction.

具体地，根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量，包括：将所述障碍物检测结果中检测框置信度分数超过分数阈值的检测框，确定为跟踪目标检测框；将所述跟踪目标检测框对应的稀疏查询向量，确定为跟踪稀疏查询向量；将所述跟踪稀疏查询向量对应的障碍物历史位置信息，编码为历史位置稀疏查询向量；将所述历史位置稀疏查询向量和所述跟踪稀疏查询向量进行交互处理，得到障碍物参考轨迹预测稀疏查询向量。Specifically, according to the historical position information of the obstacle, a sparse query vector for predicting the obstacle reference trajectory is determined, including: determining a detection frame in the obstacle detection result whose detection frame confidence score exceeds a score threshold as a tracking target detection frame; determining a sparse query vector corresponding to the tracking target detection frame as a tracking sparse query vector; encoding the obstacle historical position information corresponding to the tracking sparse query vector into a historical position sparse query vector; and interactively processing the historical position sparse query vector and the tracking sparse query vector to obtain a sparse query vector for predicting the obstacle reference trajectory.

本申请实施例中，分数阈值是根据实际情况而灵活设置的，障碍物检测结果是通过检测框和检测框置信度分数来表征，检测框置信度分数越高，说明对应障碍物检测结果越可信，因此将障碍物检测结果中检测框置信度分数超过分数阈值的检测框，确定为跟踪目标检测框，从而将跟踪目标检测框对应的稀疏查询向量确定为跟踪稀疏查询向量。实现只对置信度高的障碍物检测结果对应的障碍物轨迹进行预测。In the embodiment of the present application, the score threshold is flexibly set according to the actual situation. The obstacle detection result is characterized by the detection frame and the detection frame confidence score. The higher the detection frame confidence score, the more reliable the corresponding obstacle detection result. Therefore, the detection frame whose detection frame confidence score exceeds the score threshold in the obstacle detection result is determined as the tracking target detection frame, and the sparse query vector corresponding to the tracking target detection frame is determined as the tracking sparse query vector. Only the obstacle trajectory corresponding to the obstacle detection result with high confidence is predicted.

历史位置稀疏查询向量与障碍物历史位置信息相比，是一个高维度的历史位置特征。通过将障碍物历史位置信息编码为历史位置稀疏查询向量，以便能够基于多头注意力机制，将障碍物历史位置信息与跟踪稀疏查询向量进行交互处理。在具体实施时，通过引入一个可学习的嵌入向量来确定障碍物参考轨迹预测稀疏查询向量，将障碍物历史位置信息与跟踪稀疏查询向量的交互处理结果与该可学习的嵌入向量进行拼接，得到障碍物参考轨迹预测稀疏查询向量。Compared with the obstacle historical position information, the historical position sparse query vector is a high-dimensional historical position feature. By encoding the obstacle historical position information into a historical position sparse query vector, the obstacle historical position information and the tracking sparse query vector can be interactively processed based on the multi-head attention mechanism. In the specific implementation, a learnable embedding vector is introduced to determine the obstacle reference trajectory prediction sparse query vector, and the interactive processing results of the obstacle historical position information and the tracking sparse query vector are spliced with the learnable embedding vector to obtain the obstacle reference trajectory prediction sparse query vector.

步骤S108：根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量。Step S108: Determine a sparse query vector for predicting a reference trajectory of the vehicle according to the historical position information of the vehicle.

本申请实施例中，自车历史位置信息是指自车在驾驶场景中的位置，自车历史位置信息可以通过自车的定位设备来确定。自车参考轨迹预测稀疏查询向量是指用于自车轨迹预测的初始化向量。同样地，通过引入一个可学习的嵌入向量来确定自车参考轨迹预测稀疏查询向量；根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量，包括：对自车历史位置信息进行编码，得到编码后的自车历史位置信息；将编码后的自车历史位置信息与自车可学习嵌入向量进行拼接，得到自车参考轨迹预测稀疏查询向量。In the embodiment of the present application, the historical position information of the self-vehicle refers to the position of the self-vehicle in the driving scene, and the historical position information of the self-vehicle can be determined by the positioning device of the self-vehicle. The self-vehicle reference trajectory prediction sparse query vector refers to the initialization vector used for self-vehicle trajectory prediction. Similarly, the self-vehicle reference trajectory prediction sparse query vector is determined by introducing a learnable embedding vector; the self-vehicle reference trajectory prediction sparse query vector is determined according to the self-vehicle historical position information, including: encoding the self-vehicle historical position information to obtain the encoded self-vehicle historical position information; splicing the encoded self-vehicle historical position information with the self-vehicle learnable embedding vector to obtain the self-vehicle reference trajectory prediction sparse query vector.

步骤S109：将所述障碍物参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到障碍物轨迹预测稀疏查询向量。Step S109: interactively processing the obstacle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an obstacle trajectory prediction sparse query vector.

本申请实施例中，障碍物轨迹预测稀疏查询向量包含障碍物位置信息和地图信息，以用于预测障碍物轨迹；不同障碍物对应不同的障碍物轨迹预测稀疏查询向量。检测稀疏查询向量是指上述步骤S103中得到的检测稀疏查询向量，地图稀疏查询向量是指上述步骤S104中得到的地图稀疏查询向量。障碍物参考轨迹预测稀疏查询向量是指用于障碍物轨迹预测的初始化向量，通过多头注意力机制，将障碍物参考轨迹预测稀疏查询向量，与检测稀疏查询向量以及地图稀疏查询向量进行交互处理，以对障碍物参考轨迹预测稀疏查询向量进行交互更新，得到障碍物轨迹预测稀疏查询向量。In an embodiment of the present application, the obstacle trajectory prediction sparse query vector includes obstacle position information and map information for predicting the obstacle trajectory; different obstacles correspond to different obstacle trajectory prediction sparse query vectors. The detection sparse query vector refers to the detection sparse query vector obtained in the above step S103, and the map sparse query vector refers to the map sparse query vector obtained in the above step S104. The obstacle reference trajectory prediction sparse query vector refers to the initialization vector used for obstacle trajectory prediction. Through the multi-head attention mechanism, the obstacle reference trajectory prediction sparse query vector is interactively processed with the detection sparse query vector and the map sparse query vector to interactively update the obstacle reference trajectory prediction sparse query vector to obtain the obstacle trajectory prediction sparse query vector.

步骤S1010：将所述自车参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到自车轨迹预测稀疏查询向量。Step S1010: interactively processing the ego-vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an ego-vehicle trajectory prediction sparse query vector.

本申请实施例中，自车轨迹预测稀疏查询向量包含自车位置信息、周围障碍物位置信息和地图信息，以用于预测自车轨迹。检测稀疏查询向量是指上述步骤S103中得到的检测稀疏查询向量，地图稀疏查询向量是指上述步骤S104中得到的地图稀疏查询向量。通过多头注意力机制，将自车参考轨迹预测稀疏查询向量，与检测稀疏查询向量以及地图稀疏查询向量进行交互更新，得到自车轨迹预测稀疏查询向量。In the embodiment of the present application, the sparse query vector for predicting the trajectory of the vehicle includes the position information of the vehicle, the position information of the surrounding obstacles and the map information, so as to predict the trajectory of the vehicle. The detection sparse query vector refers to the detection sparse query vector obtained in the above step S103, and the map sparse query vector refers to the map sparse query vector obtained in the above step S104. Through the multi-head attention mechanism, the sparse query vector for predicting the reference trajectory of the vehicle is interactively updated with the detection sparse query vector and the map sparse query vector to obtain the sparse query vector for predicting the trajectory of the vehicle.

步骤S1011：根据所述障碍物轨迹预测稀疏查询向量和所述自车轨迹预测稀疏查询向量进行轨迹预测，得到障碍物预测轨迹和自车预测轨迹。Step S1011: performing trajectory prediction according to the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain an obstacle prediction trajectory and an ego vehicle prediction trajectory.

本申请实施例中，障碍物预测轨迹是指障碍物未来的预测行驶轨迹，自车预测轨迹是指自车未来的预测行驶轨迹。通过预测出障碍物预测轨迹和自车预测轨迹，能够实现对自车自动驾驶的路径规划。具体地，利用轨迹预测网络分别对障碍物轨迹预测稀疏查询向量和自车轨迹预测稀疏查询向量进行处理，得到障碍物预测轨迹和自车预测轨迹。In the embodiment of the present application, the obstacle prediction trajectory refers to the predicted driving trajectory of the obstacle in the future, and the vehicle prediction trajectory refers to the predicted driving trajectory of the vehicle in the future. By predicting the obstacle prediction trajectory and the vehicle prediction trajectory, path planning for the autonomous driving of the vehicle can be achieved. Specifically, the obstacle trajectory prediction sparse query vector and the vehicle trajectory prediction sparse query vector are processed by the trajectory prediction network to obtain the obstacle prediction trajectory and the vehicle prediction trajectory.

步骤S1012：基于所述障碍物检测结果、所述在线地图、所述障碍物预测轨迹和所述自车预测轨迹对自车进行自动驾驶。Step S1012: Automatically driving the vehicle based on the obstacle detection result, the online map, the obstacle prediction trajectory and the vehicle prediction trajectory.

本申请实施例中，障碍物检测结果、在线地图、障碍物预测轨迹和自车预测轨迹是通过当前时刻传感器数据处理得到的，从而基于当前时刻传感器数据实现自动驾驶的多个子任务。如此，将不同模态、不同任务、空间和时间上的所有信息全部表征为稀疏的查询向量，并基于此串联起自动驾驶的各个子任务，实现高效且高准确性的端到端的自动驾驶。In the embodiment of the present application, the obstacle detection results, online maps, obstacle prediction trajectories and vehicle prediction trajectories are obtained by processing the sensor data at the current moment, thereby realizing multiple subtasks of autonomous driving based on the sensor data at the current moment. In this way, all information in different modes, tasks, space and time is represented as sparse query vectors, and based on this, the various subtasks of autonomous driving are connected in series to achieve efficient and high-accuracy end-to-end autonomous driving.

本申请实施例中，端到端的自动驾驶是通过多任务处理网络实现的；在所述端到端的自动驾驶包括障碍物检测、地图构建和轨迹预测的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息、真实地图信息、障碍物真实轨迹和自车真实轨迹。通过在训练数据中携带障碍物的真实位置信息、真实地图信息、障碍物真实轨迹和自车真实轨迹的标签，实现对障碍物检测任务的端到端优化、实现对地图构建任务的端到端优化、以及轨迹预测任务的端到端优化。如此，将障碍物检测任务、地图构建任务和轨迹预测任务串联起来，实现高效且高准确性的端到端的自动驾驶。In an embodiment of the present application, end-to-end autonomous driving is achieved through a multi-task processing network; in the case where the end-to-end autonomous driving includes obstacle detection, map construction, and trajectory prediction, the labels of the training data of the multi-task processing network include: the real position information of the obstacle, the real map information, the real trajectory of the obstacle, and the real trajectory of the vehicle. By carrying the labels of the real position information, real map information, real trajectory of the obstacle, and the real trajectory of the vehicle in the training data, end-to-end optimization of the obstacle detection task, end-to-end optimization of the map construction task, and end-to-end optimization of the trajectory prediction task are achieved. In this way, the obstacle detection task, the map construction task, and the trajectory prediction task are connected in series to achieve efficient and highly accurate end-to-end autonomous driving.

在一种可选的实施例中，为了实现自车的轨迹规划，以在障碍物检测结果、在线地图、障碍物预测轨迹和自车预测轨迹的基础上，还基于自车路径规划结果对自车进行自动驾驶。在步骤S1011之后，还包括步骤S1013至步骤S1015：In an optional embodiment, in order to realize the trajectory planning of the self-vehicle, the self-vehicle is automatically driven based on the self-vehicle path planning result on the basis of the obstacle detection result, the online map, the obstacle prediction trajectory and the self-vehicle prediction trajectory. After step S1011, steps S1013 to S1015 are also included:

步骤S1013：对所述自车轨迹预测稀疏查询向量和自车导航命令信息进行交互处理，得到含导航信息的自车轨迹预测稀疏查询向量。Step S1013: interactively process the vehicle trajectory prediction sparse query vector and the vehicle navigation command information to obtain the vehicle trajectory prediction sparse query vector containing navigation information.

步骤S1014：利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果。Step S1014: using the ego vehicle trajectory prediction sparse query vector containing navigation information, optimizing the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result.

步骤S1015：基于所述自车路径规划结果、所述障碍物检测结果、所述在线地图、所述障碍物预测轨迹和所述自车预测轨迹对自车进行自动驾驶。Step S1015: Automatically drive the ego vehicle based on the ego vehicle path planning result, the obstacle detection result, the online map, the obstacle prediction trajectory and the ego vehicle prediction trajectory.

本申请实施例中，自车导航命令信息是指控制车辆的导航命令，例如，向左转、向右转、加速和减速等导航命令。多种约束条件下包括动力学约束和安全距离约束，其中，动力学约束包括速度约束和加速度约束；安全距离约束是指自车与预测障碍物之间的相对位置关系要满足安全距离。为了实现自车的轨迹规划，引入自车导航命令信息对自车预测轨迹进行优化处理，从而得到自车路径规划结果。In the embodiment of the present application, the self-vehicle navigation command information refers to the navigation command for controlling the vehicle, such as navigation commands such as turning left, turning right, accelerating and decelerating. The various constraints include dynamic constraints and safe distance constraints, where the dynamic constraints include speed constraints and acceleration constraints; the safe distance constraint means that the relative position relationship between the self-vehicle and the predicted obstacle must meet the safe distance. In order to realize the trajectory planning of the self-vehicle, the self-vehicle navigation command information is introduced to optimize the predicted trajectory of the self-vehicle, thereby obtaining the self-vehicle path planning result.

由于障碍物检测结果、在线地图、障碍物预测轨迹、自车预测轨迹和自车路径规划结果均是通过当前时刻传感器数据处理得到的，从而基于当前时刻传感器数据实现自动驾驶的多个子任务。如此，将不同模态、不同任务、空间和时间上的所有信息全部表征为稀疏的查询向量，并基于此串联起自动驾驶的各个子任务，实现高效且高准确性的端到端的自动驾驶。Since the obstacle detection results, online maps, obstacle prediction trajectories, vehicle prediction trajectories, and vehicle path planning results are all obtained through processing the current sensor data, multiple subtasks of autonomous driving can be realized based on the current sensor data. In this way, all information in different modes, tasks, space, and time is represented as sparse query vectors, and based on this, the various subtasks of autonomous driving are connected in series to achieve efficient and high-accuracy end-to-end autonomous driving.

下面分别对动力学约束和安全距离约束情况下的自车预测轨迹进行优化处理进行说明。The following describes the optimization processing of the predicted trajectory of the ego-vehicle under dynamic constraints and safety distance constraints respectively.

(1)在所述多种约束条件包括动力学约束的情况下，利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果，包括：利用所述含导航信息的自车轨迹预测稀疏查询向量，回归自车的运动信息，所述运动信息包括自车的速度和加速度；利用所述运动信息对所述自车预测轨迹进行优化处理，得到自车路径规划结果。(1) When the multiple constraints include dynamic constraints, the ego-vehicle trajectory containing navigation information is used to predict a sparse query vector, and the predicted trajectory of the ego-vehicle is optimized under multiple constraints to obtain an ego-vehicle path planning result, including: using the ego-vehicle trajectory containing navigation information to predict the sparse query vector, regressing the motion information of the ego-vehicle, the motion information including the speed and acceleration of the ego-vehicle; and using the motion information to optimize the predicted trajectory of the ego-vehicle to obtain an ego-vehicle path planning result.

其中，回归自车的运动信息相当于预测在自车预测轨迹下自车的运动信息(速度和加速度)，从而利于运动信息对自车预测轨迹进行优化处理。例如，若自车速度超过安全行驶速度，则对自车预测轨迹进行调整，以使调整后的轨迹满足安全行驶速度约束。再例如，若自车加速度超过安全行驶加速度(即加速过快)，则对自车预测轨迹进行调整，以使调整后的轨迹满足安全行驶加速度约束。Among them, regressing the motion information of the ego vehicle is equivalent to predicting the motion information (speed and acceleration) of the ego vehicle under the ego vehicle predicted trajectory, so as to optimize the ego vehicle predicted trajectory with the motion information. For example, if the ego vehicle speed exceeds the safe driving speed, the ego vehicle predicted trajectory is adjusted so that the adjusted trajectory meets the safe driving speed constraint. For another example, if the ego vehicle acceleration exceeds the safe driving acceleration (i.e., the acceleration is too fast), the ego vehicle predicted trajectory is adjusted so that the adjusted trajectory meets the safe driving acceleration constraint.

(2)在所述多种约束条件包括安全距离约束的情况下，利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果，包括：根据所述含导航信息的自车轨迹预测稀疏查询向量，和所述障碍物轨迹预测稀疏查询向量，预测障碍物与自车的相对位置关系；利用所述相对位置关系对所述自车预测轨迹进行距离约束优化处理，得到自车路径规划结果。(2) When the multiple constraints include a safety distance constraint, the ego vehicle trajectory prediction sparse query vector containing navigation information is used to optimize the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result, including: predicting a relative position relationship between the obstacle and the ego vehicle based on the ego vehicle trajectory prediction sparse query vector containing navigation information and the obstacle trajectory prediction sparse query vector; and performing distance constraint optimization processing on the ego vehicle predicted trajectory using the relative position relationship to obtain an ego vehicle path planning result.

具体地，自车和障碍物之间需要满足一定的安全距离，利用相对位置关系对自车预测轨迹进行距离约束优化处理是指通过对自车预测轨迹进行调整，以使调整后的轨迹满足安全距离约束。Specifically, a certain safety distance needs to be met between the ego vehicle and the obstacle. Using the relative position relationship to perform distance constraint optimization processing on the ego vehicle predicted trajectory means adjusting the ego vehicle predicted trajectory so that the adjusted trajectory meets the safety distance constraint.

本申请实施例中，端到端的自动驾驶是通过多任务处理网络实现的；在所述端到端的自动驾驶包括障碍物检测、地图构建、轨迹预测和路径规划的情况下，所述多任务处理网络的训练数据的标签包括：障碍物的真实位置信息、真实地图信息、障碍物真实轨迹、自车真实轨迹、自车的真实运动信息、障碍物与自车的真实相对位置关系。In an embodiment of the present application, end-to-end autonomous driving is achieved through a multi-task processing network; when the end-to-end autonomous driving includes obstacle detection, map construction, trajectory prediction and path planning, the labels of the training data of the multi-task processing network include: the real position information of the obstacle, the real map information, the real trajectory of the obstacle, the real trajectory of the vehicle, the real motion information of the vehicle, and the real relative position relationship between the obstacle and the vehicle.

通过在训练数据中携带障碍物的真实位置信息、真实地图信息、障碍物真实轨迹、自车真实轨迹、自车的真实运动信息、障碍物与自车的真实相对位置关系的标签，实现对障碍物检测任务的端到端优化、实现对地图构建任务的端到端优化、轨迹预测任务的端到端优化、以及路径规划的端到端优化。如此，将障碍物检测任务、地图构建任务、轨迹预测任务和路径规划任务串联起来，实现高效且高准确性的端到端的自动驾驶。By carrying the real position information of obstacles, real map information, real trajectory of obstacles, real trajectory of the vehicle, real motion information of the vehicle, and labels of the real relative position relationship between obstacles and the vehicle in the training data, end-to-end optimization of obstacle detection tasks, end-to-end optimization of map construction tasks, end-to-end optimization of trajectory prediction tasks, and end-to-end optimization of path planning are achieved. In this way, the obstacle detection tasks, map construction tasks, trajectory prediction tasks, and path planning tasks are connected in series to achieve efficient and highly accurate end-to-end autonomous driving.

其中，多任务处理网络包括特征提取网络、稀疏感知网络和运动规划网络。运动规划网络用于执行轨迹预测任务和路径规划任务。示例地，图3是本申请实施例提供的一种运动规划网络的结构示意图，运动规划网络包括轨迹预测模块和路径规划模块。Among them, the multi-task processing network includes a feature extraction network, a sparse perception network and a motion planning network. The motion planning network is used to perform trajectory prediction tasks and path planning tasks. For example, Figure 3 is a structural diagram of a motion planning network provided in an embodiment of the present application, and the motion planning network includes a trajectory prediction module and a path planning module.

具体地，执行轨迹预测任务和路径规划任务的过程为：根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量；并根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量；之后，从端到端多任务记忆库中获取历史障碍物参考轨迹预测稀疏查询向量、历史自车参考轨迹预测稀疏查询向量、检测稀疏查询向量和地图稀疏查询向量，并与将障碍物参考轨迹预测稀疏查询向量和自车参考轨迹预测稀疏查询向量输入到交互预测模块进行交互处理和轨迹预测，得到自车轨迹预测稀疏查询向量。其中，交互处理包括：障碍物参考轨迹预测稀疏查询向量与对应的历史障碍物参考轨迹预测稀疏查询向量进行交互、障碍物参考轨迹预测稀疏查询向量与检测稀疏查询向量以及地图稀疏查询向量进行交互；自车参考轨迹预测稀疏查询向量与对应的历史自车参考轨迹预测稀疏查询向量进行交互，自车参考轨迹预测稀疏查询向量与检测稀疏查询向量以及地图稀疏查询向量进行交互。Specifically, the process of executing the trajectory prediction task and the path planning task is as follows: according to the historical position information of the obstacle, determine the obstacle reference trajectory prediction sparse query vector; and according to the historical position information of the vehicle, determine the ego vehicle reference trajectory prediction sparse query vector; then, obtain the historical obstacle reference trajectory prediction sparse query vector, the historical ego vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector from the end-to-end multi-task memory library, and input the obstacle reference trajectory prediction sparse query vector and the ego vehicle reference trajectory prediction sparse query vector into the interactive prediction module for interactive processing and trajectory prediction, and obtain the ego vehicle trajectory prediction sparse query vector. Among them, the interactive processing includes: the obstacle reference trajectory prediction sparse query vector interacts with the corresponding historical obstacle reference trajectory prediction sparse query vector, the obstacle reference trajectory prediction sparse query vector interacts with the detection sparse query vector and the map sparse query vector; the ego vehicle reference trajectory prediction sparse query vector interacts with the corresponding historical ego vehicle reference trajectory prediction sparse query vector, and the ego vehicle reference trajectory prediction sparse query vector interacts with the detection sparse query vector and the map sparse query vector.

接着，路径规划模块对自车轨迹预测稀疏查询向量和自车导航命令信息进行交互处理，得到含导航信息的自车轨迹预测稀疏查询向量；并利用含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果。Next, the path planning module interactively processes the ego-vehicle trajectory prediction sparse query vector and the ego-vehicle navigation command information to obtain the ego-vehicle trajectory prediction sparse query vector containing navigation information; and uses the ego-vehicle trajectory prediction sparse query vector containing navigation information to optimize the ego-vehicle prediction trajectory under multiple constraints to obtain the ego-vehicle path planning result.

下面结合一种具体的实施方式来说明本申请。The present application is described below in conjunction with a specific implementation method.

图4是本申请实施例提供的另一种端到端的自动驾驶方法的步骤流程图，如图4所示，包括步骤S401至步骤S4013：FIG. 4 is a flowchart of another end-to-end autonomous driving method provided in an embodiment of the present application. As shown in FIG. 4 , the method includes steps S401 to S4013:

步骤S401：获取自车的当前时刻传感器数据。Step S401: Acquire the current sensor data of the vehicle.

步骤S402：对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量。Step S402: Process the sensor data at the current moment to obtain a sparse sensor feature vector.

步骤S403：对参考检测稀疏查询向量与所述稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，所述参考检测稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史检测稀疏查询向量。Step S403: interactively process the reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle.

步骤S404：对参考地图稀疏查询量与所述稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，所述参考地图稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史地图稀疏查询向量。Step S404: interactively processing the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle.

步骤S405：根据所述检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据所述地图稀疏查询向量进行地图构建，得到在线地图。Step S405: performing obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and performing map construction according to the map sparse query vector to obtain an online map.

步骤S406：根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量。Step S406: Determine the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information.

步骤S407：根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量。Step S407: Determine a sparse query vector for predicting a reference trajectory of the vehicle according to the historical position information of the vehicle.

步骤S408：将所述障碍物参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到障碍物轨迹预测稀疏查询向量。Step S408: interactively process the obstacle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an obstacle trajectory prediction sparse query vector.

步骤S409：将所述自车参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到自车轨迹预测稀疏查询向量。Step S409: interactively processing the ego-vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an ego-vehicle trajectory prediction sparse query vector.

步骤S4010：根据所述障碍物轨迹预测稀疏查询向量和所述自车轨迹预测稀疏查询向量进行轨迹预测，得到障碍物预测轨迹和自车预测轨迹。Step S4010: performing trajectory prediction according to the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain an obstacle prediction trajectory and an ego vehicle prediction trajectory.

步骤S4011：对所述自车轨迹预测稀疏查询向量和自车导航命令信息进行交互处理，得到含导航信息的自车轨迹预测稀疏查询向量。Step S4011: interactively process the vehicle trajectory prediction sparse query vector and the vehicle navigation command information to obtain the vehicle trajectory prediction sparse query vector containing navigation information.

步骤S4012：利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果。Step S4012: using the ego vehicle trajectory prediction sparse query vector containing navigation information, optimizing the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result.

步骤S4013：基于所述自车路径规划结果、所述障碍物检测结果、所述在线地图、所述障碍物预测轨迹和所述自车预测轨迹对自车进行自动驾驶。Step S4013: Automatically drive the ego vehicle based on the ego vehicle path planning result, the obstacle detection result, the online map, the obstacle prediction trajectory and the ego vehicle prediction trajectory.

本申请实施例中，将障碍物检测任务、地图构建任务、轨迹预测任务和路径规划任务串联起来，基于自车的当前时刻传感器数据可以实现高效且高准确性的端到端的自动驾驶。具体地，通过执行步骤S401至步骤S405实现障碍物检测和地图构建；通过执行步骤S406至步骤S4010实现轨迹预测，得到障碍物预测轨迹和自车预测轨迹；为了实现路径规划，执行步骤S4011至步骤S4012，得到满足多种约束条件(如动力学约束、安全距离约束)的自车路径规划结果，从而执行步骤S4013对自车进行自动驾驶。In the embodiment of the present application, the obstacle detection task, the map construction task, the trajectory prediction task and the path planning task are connected in series, and efficient and highly accurate end-to-end autonomous driving can be achieved based on the current sensor data of the self-vehicle. Specifically, obstacle detection and map construction are achieved by executing steps S401 to S405; trajectory prediction is achieved by executing steps S406 to S4010 to obtain the obstacle prediction trajectory and the self-vehicle prediction trajectory; in order to achieve path planning, steps S4011 to S4012 are executed to obtain the self-vehicle path planning result that meets multiple constraints (such as dynamic constraints and safety distance constraints), and then step S4013 is executed to perform autonomous driving on the self-vehicle.

综上所述，本申请实施例将整个自动驾驶场景中空间维度和时间维度上的所有信息都由稀疏查询向量表示，而不使用任何密集的鸟瞰图特征，有效地利用长时序的历史信息、扩展到更多的模态和任务，降低计算成本和内存占用。得益于历史时序信息和当前时刻传感器数据的引入，使端到端的自动驾驶中的每个任务的具有更好性能。如此，解决端到端自动驾驶模型单任务性能落后、计算成本高、内存占用大、模型部署困难等问题。In summary, the embodiment of the present application represents all the information in the spatial and temporal dimensions of the entire autonomous driving scene by a sparse query vector, without using any dense bird's-eye view features, effectively utilizing long-time historical information, expanding to more modes and tasks, and reducing computing costs and memory usage. Thanks to the introduction of historical time series information and current sensor data, each task in end-to-end autonomous driving has better performance. In this way, the problems of lagging single-task performance, high computing costs, large memory usage, and difficult model deployment of the end-to-end autonomous driving model are solved.

在本申请实施例中，端到端的自动驾驶是通过多任务处理网络实现的。示例地，图5是本申请实施例提供的一种多任务处理网络的结构示意图，多任务处理网络包括：特征提取网络、稀疏感知网络和运动规划网络。In an embodiment of the present application, end-to-end autonomous driving is achieved through a multi-task processing network. For example, FIG5 is a schematic diagram of a structure of a multi-task processing network provided in an embodiment of the present application, and the multi-task processing network includes: a feature extraction network, a sparse perception network, and a motion planning network.

具体地，将自车的当前时刻传感器数据(包括图像数据和点云数据)输入到特征提取网络，特征提取网络对当前时刻传感器数据进行特征提取和位置编码后，输入稀疏的传感器特征向量。Specifically, the current sensor data of the vehicle (including image data and point cloud data) is input into the feature extraction network. After the feature extraction network extracts features and positions the sensor data at the current moment, it inputs a sparse sensor feature vector.

稀疏感知网络从端到端多任务记忆库中获取历史检测稀疏查询向量、历史地图稀疏查询向量、以及上一时刻的跟踪稀疏查询向量，并根据历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量得到参考检测稀疏查询向量，根据历史地图稀疏查询向量得到参考地图稀疏查询量；之后对参考检测稀疏查询向量与稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量；对参考地图稀疏查询量与稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量；并根据检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，根据地图稀疏查询向量进行地图构建，得到在线地图。The sparse perception network obtains the historical detection sparse query vector, the historical map sparse query vector, and the tracking sparse query vector at the previous moment from the end-to-end multi-task memory library, and obtains the reference detection sparse query vector based on the historical detection sparse query vector and the tracking sparse query vector at the previous moment, and obtains the reference map sparse query volume based on the historical map sparse query vector; then, the reference detection sparse query vector and the sparse sensor feature vector are interactively processed to obtain the detection sparse query vector; the reference map sparse query volume and the sparse sensor feature vector are interactively processed to obtain the map sparse query vector; and obstacle detection is performed based on the detection sparse query vector to obtain the obstacle detection result, and the map is constructed based on the map sparse query vector to obtain the online map.

运动规划网络将检测稀疏查询向量、地图稀疏查询向量和跟踪稀疏查询向量作为输入，根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量；根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量。再将障碍物参考轨迹预测稀疏查询向量，与检测稀疏查询向量以及地图稀疏查询向量进行交互处理，得到障碍物轨迹预测稀疏查询向量；将自车参考轨迹预测稀疏查询向量，与检测稀疏查询向量以及地图稀疏查询向量进行交互处理，得到自车轨迹预测稀疏查询向量；之后，根据障碍物轨迹预测稀疏查询向量和自车轨迹预测稀疏查询向量进行轨迹预测，得到障碍物预测轨迹和自车预测轨迹。最后基于自车导航命令信息，对自车预测轨迹进行优化处理，得到满足多种约束的自车路径规划结果。The motion planning network takes the detection sparse query vector, the map sparse query vector and the tracking sparse query vector as input, and determines the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information; and determines the ego vehicle reference trajectory prediction sparse query vector according to the ego vehicle historical position information. Then the obstacle reference trajectory prediction sparse query vector is interactively processed with the detection sparse query vector and the map sparse query vector to obtain the obstacle trajectory prediction sparse query vector; the ego vehicle reference trajectory prediction sparse query vector is interactively processed with the detection sparse query vector and the map sparse query vector to obtain the ego vehicle trajectory prediction sparse query vector; then, trajectory prediction is performed based on the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain the obstacle prediction trajectory and the ego vehicle prediction trajectory. Finally, based on the ego vehicle navigation command information, the ego vehicle prediction trajectory is optimized to obtain the ego vehicle path planning result that meets multiple constraints.

本申请实施例还提供了一种端到端的自动驾驶装置，参照图6所示，图6是本申请实施例提供的一种端到端的自动驾驶装置的结构示意图，所述装置包括：The embodiment of the present application further provides an end-to-end autonomous driving device, as shown in FIG6 , which is a schematic diagram of the structure of an end-to-end autonomous driving device provided by the embodiment of the present application, the device comprising:

数据获取模块610，用于获取自车的当前时刻传感器数据；The data acquisition module 610 is used to acquire the current sensor data of the vehicle;

数据处理模块620，用于对所述当前时刻传感器数据进行处理，得到稀疏的传感器特征向量；A data processing module 620 is used to process the sensor data at the current moment to obtain a sparse sensor feature vector;

第一交互模块630，用于对参考检测稀疏查询向量与所述稀疏的传感器特征向量进行交互处理，得到检测稀疏查询向量，所述参考检测稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史检测稀疏查询向量；A first interaction module 630 is configured to interactively process a reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle;

第二交互模块640，用于对参考地图稀疏查询量与所述稀疏的传感器特征向量进行交互处理，得到地图稀疏查询向量，所述参考地图稀疏查询向量至少包括：基于自车的历史传感器数据确定出的历史地图稀疏查询向量；The second interaction module 640 is used to interactively process the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle;

检测构建模块650，用于根据所述检测稀疏查询向量进行障碍物检测，得到障碍物检测结果，以及，根据所述地图稀疏查询向量进行地图构建，得到在线地图；A detection construction module 650, configured to perform obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and to perform map construction according to the map sparse query vector to obtain an online map;

自动驾驶模块660，用于基于所述障碍物检测结果和所述在线地图对自车进行自动驾驶。The autonomous driving module 660 is used to perform autonomous driving of the vehicle based on the obstacle detection result and the online map.

在一种可选的实施例中，所述装置还包括：In an optional embodiment, the device further includes:

第一确定模块，用于根据障碍物历史位置信息，确定障碍物参考轨迹预测稀疏查询向量；A first determination module is used to determine the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information;

第二确定模块，用于根据自车历史位置信息，确定自车参考轨迹预测稀疏查询向量；The second determination module is used to determine the sparse query vector for predicting the reference trajectory of the ego vehicle according to the historical position information of the ego vehicle;

第三交互模块，用于将所述障碍物参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到障碍物轨迹预测稀疏查询向量；A third interaction module is used to interactively process the obstacle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an obstacle trajectory prediction sparse query vector;

第四交互模块，用于将所述自车参考轨迹预测稀疏查询向量，与所述检测稀疏查询向量以及所述地图稀疏查询向量进行交互处理，得到自车轨迹预测稀疏查询向量；A fourth interaction module, configured to interactively process the ego-vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an ego-vehicle trajectory prediction sparse query vector;

轨迹预测模块，用于根据所述障碍物轨迹预测稀疏查询向量和所述自车轨迹预测稀疏查询向量进行轨迹预测，得到障碍物预测轨迹和自车预测轨迹。The trajectory prediction module is used to perform trajectory prediction according to the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain an obstacle prediction trajectory and an ego vehicle prediction trajectory.

第五交互模块，用于对所述自车轨迹预测稀疏查询向量和自车导航命令信息进行交互处理，得到含导航信息的自车轨迹预测稀疏查询向量；A fifth interactive module, configured to interactively process the ego-vehicle trajectory prediction sparse query vector and the ego-vehicle navigation command information to obtain the ego-vehicle trajectory prediction sparse query vector containing navigation information;

轨迹优化模块，用于利用所述含导航信息的自车轨迹预测稀疏查询向量，在多种约束条件下对所述自车预测轨迹进行优化处理，得到自车路径规划结果。The trajectory optimization module is used to use the ego vehicle trajectory containing navigation information to predict the sparse query vector, optimize the ego vehicle predicted trajectory under multiple constraints, and obtain the ego vehicle path planning result.

在一种可选的实施例中，在所述多种约束条件包括动力学约束的情况下，所述轨迹优化模块包括：In an optional embodiment, when the multiple constraints include dynamic constraints, the trajectory optimization module includes:

回归模块，用于利用所述含导航信息的自车轨迹预测稀疏查询向量，回归自车的运动信息，所述运动信息包括自车的速度和加速度；A regression module, configured to use the ego vehicle trajectory containing navigation information to predict a sparse query vector and regress the ego vehicle's motion information, wherein the motion information includes the ego vehicle's speed and acceleration;

第一优化子模块，用于利用所述运动信息对所述自车预测轨迹进行优化处理，得到自车路径规划结果。The first optimization submodule is used to optimize the predicted trajectory of the ego vehicle using the motion information to obtain a path planning result of the ego vehicle.

在一种可选的实施例中，在所述多种约束条件包括安全距离约束的情况下，所述轨迹优化模块包括：In an optional embodiment, when the multiple constraints include a safety distance constraint, the trajectory optimization module includes:

位置预测模块，用于根据所述含导航信息的自车轨迹预测稀疏查询向量，和所述障碍物轨迹预测稀疏查询向量，预测障碍物与自车的相对位置关系；A position prediction module, used for predicting a relative position relationship between the obstacle and the ego vehicle based on the ego vehicle trajectory prediction sparse query vector containing navigation information and the obstacle trajectory prediction sparse query vector;

第二优化子模块，用于利用所述相对位置关系对所述自车预测轨迹进行距离约束优化处理，得到自车路径规划结果。The second optimization submodule is used to perform distance constraint optimization processing on the predicted trajectory of the ego vehicle by utilizing the relative position relationship to obtain a path planning result of the ego vehicle.

在一种可选的实施例中，所述第一确定模块包括：In an optional embodiment, the first determining module includes:

第一确定子模块，用于将所述障碍物检测结果中检测框置信度分数超过分数阈值的检测框，确定为跟踪目标检测框；A first determination submodule is used to determine the detection frame whose confidence score of the detection frame in the obstacle detection result exceeds the score threshold as the tracking target detection frame;

第二确定子模块，用于将所述跟踪目标检测框对应的稀疏查询向量，确定为跟踪稀疏查询向量；A second determination submodule is used to determine the sparse query vector corresponding to the tracking target detection box as a tracking sparse query vector;

位置信息编码模块，用于将所述跟踪稀疏查询向量对应的障碍物历史位置信息，编码为历史位置稀疏查询向量；A position information encoding module, used for encoding the obstacle historical position information corresponding to the tracking sparse query vector into a historical position sparse query vector;

第六交互模块，用于将所述历史位置稀疏查询向量和所述跟踪稀疏查询向量进行交互处理，得到障碍物参考轨迹预测稀疏查询向量。The sixth interactive module is used to interactively process the historical position sparse query vector and the tracking sparse query vector to obtain an obstacle reference trajectory prediction sparse query vector.

第一获取模块，用于获取历史检测稀疏查询向量和上一时刻的跟踪稀疏查询向量，所述历史检测稀疏查询向量包括：当前时刻之前的N个时刻的历史检测稀疏查询向量，N为大于1的整数；A first acquisition module is used to acquire a historical detection sparse query vector and a tracking sparse query vector at a previous moment, wherein the historical detection sparse query vector includes: a historical detection sparse query vector at N moments before the current moment, where N is an integer greater than 1;

参考检测向量模块，用于根据所述历史检测稀疏查询向量和所述上一时刻的跟踪稀疏查询向量，得到参考检测稀疏查询向量。The reference detection vector module is used to obtain a reference detection sparse query vector according to the historical detection sparse query vector and the tracking sparse query vector at the previous moment.

在一种可选的实施例中，所述数据处理模块包括：In an optional embodiment, the data processing module includes:

提取模块，用于对所述当前时刻传感器数据进行特征提取，得到稀疏的初始传感器特征向量；An extraction module, used for performing feature extraction on the sensor data at the current moment to obtain a sparse initial sensor feature vector;

计算模块，用于根据所述当前时刻传感器数据，计算所述稀疏的初始传感器特征向量的空间位置信息；A calculation module, used for calculating the spatial position information of the sparse initial sensor feature vector according to the sensor data at the current moment;

融合模块，用于将所述空间位置信息和所述稀疏的初始传感器特征向量进行融合，得到稀疏的传感器特征向量。The fusion module is used to fuse the spatial position information with the sparse initial sensor feature vector to obtain a sparse sensor feature vector.

在一种可选的实施例中，端到端的自动驾驶是通过多任务处理网络实现的；In an optional embodiment, end-to-end autonomous driving is achieved through a multi-tasking network;

本申请实施例还提供了一种电子设备，参照图7，图7是本申请实施例提供的一种电子设备的结构示意图。如图7所示，电子设备700包括：存储器710和处理器720，存储器710与处理器720之间通过总线通信连接，存储器710中存储有计算机程序，该计算机程序可在处理器720上运行，进而实现本申请实施例所述的端到端的自动驾驶方法的步骤。The embodiment of the present application also provides an electronic device, with reference to FIG7, which is a schematic diagram of the structure of an electronic device provided by the embodiment of the present application. As shown in FIG7, the electronic device 700 includes: a memory 710 and a processor 720, the memory 710 and the processor 720 are connected via a bus communication, the memory 710 stores a computer program, and the computer program can be run on the processor 720, thereby implementing the steps of the end-to-end autonomous driving method described in the embodiment of the present application.

本申请实施例还提供了一种计算机可读存储介质，其上存储有计算机程序，该计算机程序被处理器执行时实现本申请实施例所述的端到端的自动驾驶方法的步骤。An embodiment of the present application also provides a computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the steps of the end-to-end autonomous driving method described in the embodiment of the present application are implemented.

本说明书中的各个实施例均采用递进的方式描述，每个实施例重点说明的都是与其他实施例的不同之处，各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments. The same or similar parts between the various embodiments can be referenced to each other.

本申请实施例是参照根据本申请实施例的方法、装置的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器，使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The embodiments of the present application are described with reference to the flowcharts and/or block diagrams of the methods and devices according to the embodiments of the present application. It should be understood that each process and/or box in the flowchart and/or block diagram, as well as the combination of the processes and/or boxes in the flowchart and/or block diagram, can be implemented by computer program instructions. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or other programmable data processing terminal device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing terminal device produce a device for implementing the functions specified in one process or multiple processes in the flowchart and/or one box or multiple boxes in the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中，使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品，该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal device to operate in a specific manner, so that the instructions stored in the computer-readable memory produce a manufactured product including an instruction device that implements the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上，使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理，从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing terminal device so that a series of operating steps are executed on the computer or other programmable terminal device to produce computer-implemented processing, so that the instructions executed on the computer or other programmable terminal device provide steps for implementing the functions specified in one or more processes in the flowchart and/or one or more boxes in the block diagram.

尽管已描述了本申请实施例的优选实施例，但本领域内的技术人员一旦得知了基本创造性概念，则可对这些实施例做出另外的变更和修改。所以，所附权利要求意欲解释为包括优选实施例以及落入本申请实施例范围的所有变更和修改。Although the preferred embodiments of the present application have been described, those skilled in the art may make additional changes and modifications to these embodiments once they have learned the basic creative concept. Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and all changes and modifications that fall within the scope of the embodiments of the present application.

最后，还需要说明的是，在本文中，诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来，而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should be noted that, in this article, relational terms such as first and second, etc. are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article or terminal device including a series of elements includes not only those elements, but also other elements not explicitly listed, or also includes elements inherent to such process, method, article or terminal device. In the absence of further restrictions, the elements defined by the sentence "comprise a ..." do not exclude the existence of other identical elements in the process, method, article or terminal device including the elements.

以上对本申请所提供的一种端到端的自动驾驶方法、装置、设备及存储介质进行了详细介绍，本文中应用了具体个例对本申请的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本申请的方法及其核心思想；同时，对于本领域的一般技术人员，依据本申请的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本申请的限制。The above is a detailed introduction to an end-to-end autonomous driving method, device, equipment and storage medium provided by the present application. Specific examples are used in this article to illustrate the principles and implementation methods of the present application. The description of the above embodiments is only used to help understand the method of the present application and its core idea; at the same time, for general technical personnel in this field, according to the idea of the present application, there will be changes in the specific implementation method and application scope. In summary, the content of this specification should not be understood as a limitation on the present application.

Claims

1. An end-to-end autonomous driving method, characterized in that the method comprises:

Get the current sensor data of the vehicle;

Processing the current moment sensor data to obtain a sparse sensor feature vector;

Interactively processing a reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle;

Interactively processing the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle;

Performing obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and performing map construction according to the map sparse query vector to obtain an online map;

The vehicle is automatically driven based on the obstacle detection result and the online map.

2. The method according to claim 1, characterized in that the method further comprises:

According to the historical position information of the obstacle, a sparse query vector for obstacle reference trajectory prediction is determined;

According to the historical position information of the ego vehicle, a sparse query vector for predicting the ego vehicle reference trajectory is determined;

Interactively processing the obstacle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an obstacle trajectory prediction sparse query vector;

Interactively processing the ego-vehicle reference trajectory prediction sparse query vector, the detection sparse query vector and the map sparse query vector to obtain an ego-vehicle trajectory prediction sparse query vector;

Performing trajectory prediction according to the obstacle trajectory prediction sparse query vector and the ego vehicle trajectory prediction sparse query vector to obtain an obstacle prediction trajectory and an ego vehicle prediction trajectory;

The autonomous vehicle is driven automatically based on the obstacle detection result and the online map, including:

The self-vehicle is automatically driven based on the obstacle detection result, the online map, the obstacle prediction trajectory and the self-vehicle prediction trajectory.

3. The method according to claim 2, characterized in that the method further comprises:

Interactively processing the ego-vehicle trajectory prediction sparse query vector and ego-vehicle navigation command information to obtain an ego-vehicle trajectory prediction sparse query vector containing navigation information;

Utilizing the ego vehicle trajectory prediction sparse query vector containing navigation information, optimizing the ego vehicle prediction trajectory under multiple constraints to obtain an ego vehicle path planning result;

The self-vehicle is automatically driven based on the self-vehicle path planning result, the obstacle detection result, the online map, the obstacle prediction trajectory and the self-vehicle prediction trajectory.

4. The method according to claim 3, characterized in that, when the multiple constraints include dynamic constraints, the ego vehicle trajectory prediction sparse query vector containing navigation information is used to optimize the ego vehicle predicted trajectory under multiple constraints to obtain the ego vehicle path planning result, including:

Using the ego vehicle trajectory containing navigation information to predict a sparse query vector, regressing the motion information of the ego vehicle, the motion information including the speed and acceleration of the ego vehicle;

The motion information is used to optimize the predicted trajectory of the ego vehicle to obtain a ego vehicle path planning result.

5. The method according to claim 3 or 4, characterized in that, when the multiple constraints include a safety distance constraint, the ego vehicle trajectory prediction sparse query vector containing navigation information is used to optimize the ego vehicle predicted trajectory under multiple constraints to obtain an ego vehicle path planning result, including:

Predicting a relative position relationship between the obstacle and the ego vehicle based on the ego vehicle trajectory prediction sparse query vector containing navigation information and the obstacle trajectory prediction sparse query vector;

The relative position relationship is used to perform distance constraint optimization processing on the predicted trajectory of the ego vehicle to obtain a ego vehicle path planning result.

6. The method according to claim 2, characterized in that determining the obstacle reference trajectory prediction sparse query vector according to the obstacle historical position information comprises:

Determine the detection frame whose confidence score in the obstacle detection result exceeds the score threshold as the tracking target detection frame;

Determine the sparse query vector corresponding to the tracking target detection box as the tracking sparse query vector;

Encoding the obstacle historical position information corresponding to the tracking sparse query vector into a historical position sparse query vector;

The historical position sparse query vector and the tracking sparse query vector are interactively processed to obtain an obstacle reference trajectory prediction sparse query vector.

7. The method according to claim 1, wherein the reference detection sparse query vector is obtained by the following steps:

Obtaining a historical detection sparse query vector and a tracking sparse query vector at a previous moment, wherein the historical detection sparse query vector includes: historical detection sparse query vectors at N moments before the current moment, where N is an integer greater than 1;

A reference detection sparse query vector is obtained according to the historical detection sparse query vector and the tracking sparse query vector at the last moment.

8. The method according to claim 1, characterized in that the processing of the current moment sensor data to obtain a sparse sensor feature vector comprises:

Performing feature extraction on the current moment sensor data to obtain a sparse initial sensor feature vector;

Calculating the spatial position information of the sparse initial sensor feature vector according to the current sensor data;

The spatial position information and the sparse initial sensor feature vector are fused to obtain a sparse sensor feature vector.

9. The method according to any one of claims 1 to 8, characterized in that end-to-end autonomous driving is achieved through a multi-tasking processing network;

In the case where the end-to-end autonomous driving includes obstacle detection and map construction, the labels of the training data of the multi-task processing network include: real position information of obstacles and real map information;

In the case where the end-to-end autonomous driving includes obstacle detection, map construction and trajectory prediction, the labels of the training data of the multi-task processing network include: real position information of obstacles, real map information, real trajectory of obstacles and real trajectory of the vehicle;

In the case where the end-to-end autonomous driving includes obstacle detection, map construction, trajectory prediction and path planning, the labels of the training data of the multi-task processing network include: the real position information of the obstacle, the real map information, the real trajectory of the obstacle, the real trajectory of the vehicle, the real motion information of the vehicle, and the real relative position relationship between the obstacle and the vehicle.

10. An end-to-end autonomous driving device, characterized in that the device comprises:

A data acquisition module is used to obtain the current sensor data of the vehicle;

A data processing module, used for processing the sensor data at the current moment to obtain a sparse sensor feature vector;

A first interaction module is used to interactively process a reference detection sparse query vector and the sparse sensor feature vector to obtain a detection sparse query vector, wherein the reference detection sparse query vector at least includes: a historical detection sparse query vector determined based on historical sensor data of the vehicle;

A second interaction module is used to interactively process the reference map sparse query quantity and the sparse sensor feature vector to obtain a map sparse query vector, wherein the reference map sparse query vector at least includes: a historical map sparse query vector determined based on historical sensor data of the vehicle;

A detection construction module, configured to perform obstacle detection according to the detection sparse query vector to obtain an obstacle detection result, and to perform map construction according to the map sparse query vector to obtain an online map;

The automatic driving module is used to automatically drive the vehicle based on the obstacle detection result and the online map.

11. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the computer program, the steps of the end-to-end autonomous driving method according to any one of claims 1 to 9 are implemented.

12. A computer-readable storage medium having a computer program stored thereon, characterized in that when the computer program is executed by a processor, the steps of the end-to-end autonomous driving method described in any one of claims 1-9 are implemented.