CN117218629A

CN117218629A - 3D target detection method, system, device and medium based on point cloud semantic information

Info

Publication number: CN117218629A
Application number: CN202311266914.8A
Authority: CN
Inventors: 杜卿; 贺方舟; 谭明奎; 庄壮伟; 陈思涛; 胡灏
Original assignee: Guangdong Provincial Laboratory Of Artificial Intelligence And Digital Economy Guangzhou; South China University of Technology SCUT
Current assignee: Guangdong Provincial Laboratory Of Artificial Intelligence And Digital Economy Guangzhou; South China University of Technology SCUT
Priority date: 2023-09-27
Filing date: 2023-09-27
Publication date: 2023-12-12

Abstract

The invention discloses a 3D target detection method, system, device and storage medium based on point cloud semantic information. The method includes the following steps: obtaining point cloud data, inputting the point cloud data into a semantic segmentation model for processing, and obtaining a classification with classification Semantic segmentation results of the information; project the point cloud data after semantic segmentation to the bird's-eye view perspective; cluster the point clouds in the bird's-eye view perspective based on the semantic segmentation results to obtain the clustering results; perform central symmetry on each cluster Processing; generate target detection frames based on the symmetric clustering results and prior knowledge of object height. The present invention uses spatial prior knowledge and semantic segmentation algorithm to avoid problems such as large calculation amount and complex model of 3D target detection in autonomous driving scenes. It introduces point cloud semantic segmentation to provide more fine-grained scene perception for autonomous driving scenes, which can be widely used. Applied to the field of autonomous driving technology.

Description

3D target detection method, system, device and medium based on point cloud semantic information

技术领域Technical field

本发明涉及智能识别技术领域，尤其涉及一种基于点云语义信息的3D目标检测方法、系统、装置及存储介质。The present invention relates to the field of intelligent recognition technology, and in particular to a 3D target detection method, system, device and storage medium based on point cloud semantic information.

背景技术Background technique

自动驾驶汽车是一种无需人工操控，能自动感知环境、定位、决策和行驶的车辆。技术分为四个关键组成部分：环境感知和建模、定位和地图构建、路径规划和决策，以及行驶控制。其中的环境感知对后续步骤具有至关重要的影响，也是决定自动驾驶汽车安全最重要的步骤，而目标检测是在环境感知中应用比较广泛的技术。2D目标检测通常使用RGB图像，预测目标在图像中的2D边界框、类别和置信度。近年来，自动驾驶的2D目标检测方法的性能得到极大的提高，在KITTI目标检测数据集实现了高于90％的平均精度，但其缺少了目标的方向和3D位置信息。与2D目标检测不同，3D目标检测通常使用RGB图像、深度图、激光雷达点云数据，预测目标在3D空间的3D边界框、类别等信息，弥补了2D目标检测的缺陷，但带来了计算量增加、模型复杂等问题。由于3D场景的理解对自动驾驶汽车至关重要，因此需要更多的研究来弥补其缺陷。A self-driving car is a vehicle that can automatically perceive the environment, position, make decisions and drive without manual control. The technology is divided into four key components: environment perception and modeling, localization and map construction, path planning and decision-making, and driving control. Among them, environmental perception has a crucial impact on subsequent steps and is also the most important step in determining the safety of autonomous vehicles. Target detection is a widely used technology in environmental perception. 2D object detection usually uses RGB images to predict the 2D bounding box, category and confidence of the object in the image. In recent years, the performance of 2D target detection methods for autonomous driving has been greatly improved, achieving an average accuracy higher than 90% in the KITTI target detection data set, but it lacks the direction and 3D position information of the target. Different from 2D target detection, 3D target detection usually uses RGB images, depth maps, lidar point cloud data to predict the 3D bounding box, category and other information of the target in 3D space, which makes up for the shortcomings of 2D target detection, but brings calculation Problems such as increased volume and complex models. Since 3D scene understanding is crucial for self-driving cars, more research is needed to remedy its shortcomings.

发明内容Contents of the invention

为至少一定程度上解决现有技术中存在的技术问题之一，本发明的目的在于提供一种基于空间先验知识与点云语义信息的3D目标检测方法、系统、装置及存储介质。In order to solve one of the technical problems existing in the prior art to at least a certain extent, the purpose of the present invention is to provide a 3D target detection method, system, device and storage medium based on spatial prior knowledge and point cloud semantic information.

本发明所采用的技术方案是：The technical solution adopted by the present invention is:

一种基于点云语义信息的3D目标检测方法，包括以下步骤：A 3D target detection method based on point cloud semantic information, including the following steps:

获取点云数据，将点云数据输入语义分割模型进行处理，获得带有分类信息的语义分割结果；Obtain point cloud data, input the point cloud data into the semantic segmentation model for processing, and obtain semantic segmentation results with classification information;

将语义分割后的点云数据投影到鸟瞰图视角；Project the semantically segmented point cloud data to a bird's-eye view;

根据语义分割结果对鸟瞰图视角中的点云进行聚类，得到聚类结果；Cluster the point cloud in the bird's-eye view according to the semantic segmentation results to obtain the clustering results;

对每个聚类簇进行中心对称处理；Perform center symmetry processing on each cluster;

基于对称后的聚类结果和物体高度的先验知识生成目标检测框。Generate target detection frames based on the symmetric clustering results and prior knowledge of object height.

进一步地，所述根据语义分割结果对鸟瞰图视角中的点云进行聚类，得到聚类结果，包括：Further, the point cloud in the bird's-eye view perspective is clustered according to the semantic segmentation results to obtain clustering results, including:

根据语义分割结果，按照需要检测的目标类别过滤鸟瞰图视角中的点云；Based on the semantic segmentation results, filter the point cloud in the bird's-eye view according to the target category that needs to be detected;

使用密度聚类算法对过滤后的点云数据进行聚类，得到聚类结果。Use the density clustering algorithm to cluster the filtered point cloud data to obtain the clustering results.

进一步地，所述使用密度聚类算法对过滤后的点云数据进行聚类，得到聚类结果，包括：Further, the density clustering algorithm is used to cluster the filtered point cloud data to obtain clustering results, including:

A1、将点云数据的数据集中所有的对象标记为未处理状态；A1. Mark all objects in the point cloud data dataset as unprocessed;

A2、依次根据预设的邻域半径Eps和邻域对象数量MinPts对对象进行判断处理，并将对象判定为核心点、边界点或者噪音点；A2. The objects are judged and processed based on the preset neighborhood radius Eps and the number of neighborhood objects MinPts in turn, and the objects are judged as core points, boundary points or noise points;

A3、根据核心点p建立新簇C，并将该核心点p的邻域半径Eps内所有的对象加入该新簇C中；A3. Create a new cluster C based on the core point p, and add all objects within the neighborhood radius Eps of the core point p to the new cluster C;

A4、获取该核心点p的邻域半径Eps内尚未被处理的对象q，若对象q的邻域半径Eps内的对象的数量大于或等于邻域对象数量MinPts，将对象q的邻域半径Eps中未归入任何一个簇的对象加入新簇C；A4. Obtain the unprocessed object q within the neighborhood radius Eps of the core point p. If the number of objects within the neighborhood radius Eps of the object q is greater than or equal to the number of neighborhood objects MinPts, add the neighborhood radius Eps of the object q Objects that are not classified into any cluster are added to new cluster C;

A5、重复上述步骤A2-A4，直至所有对象遍历完成，得到聚类结果。A5. Repeat the above steps A2-A4 until all objects are traversed and the clustering results are obtained.

进一步地，所述A2，包括：Further, the A2 includes:

如果对象在其邻域半径Eps内含有超过邻域对象数量MinPts的点，则判定该对象为核心点；If an object contains points within its neighborhood radius Eps that exceed the number of neighborhood objects MinPts, the object is determined to be a core point;

如果对象在其邻域半径Eps内含有点的数量小于邻域对象数量MinPts，且该对象落在核心点的邻域半径Eps内，则判定该对象为边界点；If the number of points the object contains within its neighborhood radius Eps is less than the number of neighborhood objects MinPts, and the object falls within the neighborhood radius Eps of the core point, the object is determined to be a boundary point;

如果一个对象既不是核心点也不是边界点，则判定该对象为噪音点。If an object is neither a core point nor a boundary point, the object is determined to be a noise point.

进一步地，所述对每个聚类簇进行中心对称处理，包括：Further, performing central symmetry processing on each cluster includes:

对于每个聚类簇，获取距离聚类簇中点云原点的最左点和最右点，连接成线段；For each cluster, obtain the leftmost point and the rightmost point from the origin of the point cloud in the cluster, and connect them into line segments;

获取该线段的中点；Get the midpoint of the line segment;

将该聚类簇所有点基于线段的中点进行中心对称处理。All points of the cluster are processed centrally symmetrically based on the midpoint of the line segment.

进一步地，所述基于对称后的聚类结果和物体高度的先验知识生成目标检测框，包括：Further, the target detection frame is generated based on the symmetric clustering results and prior knowledge of the object height, including:

生成各个聚类的外包多边形；Generate wrapping polygons for each cluster;

基于外包多边形绘制最小的外接矩形；Draw the smallest enclosing rectangle based on the enclosing polygon;

结合目标高度的先验知识生成目标检测框。Combined with the prior knowledge of the target height, the target detection frame is generated.

本发明所采用的另一技术方案是：Another technical solution adopted by the present invention is:

一种基于点云语义信息的3D目标检测系统，包括：A 3D target detection system based on point cloud semantic information, including:

语义分割模块，用于获取点云数据，将点云数据输入语义分割模型进行处理，获得带有分类信息的语义分割结果；The semantic segmentation module is used to obtain point cloud data, input the point cloud data into the semantic segmentation model for processing, and obtain semantic segmentation results with classification information;

点云投影模块，用于将语义分割后的点云数据投影到鸟瞰图视角；The point cloud projection module is used to project the point cloud data after semantic segmentation to a bird's-eye view;

点云聚类模块，用于根据语义分割结果对鸟瞰图视角中的点云进行聚类，得到聚类结果；The point cloud clustering module is used to cluster point clouds in the bird's-eye view based on semantic segmentation results to obtain clustering results;

对称处理模块，用于对每个聚类簇进行中心对称处理；The symmetry processing module is used to perform central symmetry processing on each cluster;

检测框生成模块，用于基于对称后的聚类结果和物体高度的先验知识生成目标检测框。The detection frame generation module is used to generate target detection frames based on the symmetric clustering results and prior knowledge of object height.

对于每个聚类簇，找到距离点云原点的最左和最右点，连接成线段；For each cluster, find the leftmost and rightmost points from the origin of the point cloud and connect them into line segments;

找到该线段的中点；Find the midpoint of the line segment;

一种基于点云语义信息的3D目标检测装置，包括：A 3D target detection device based on point cloud semantic information, including:

至少一个处理器；at least one processor;

至少一个存储器，用于存储至少一个程序；At least one memory for storing at least one program;

当所述至少一个程序被所述至少一个处理器执行，使得所述至少一个处理器实现如上所述方法。When the at least one program is executed by the at least one processor, the at least one processor implements the method as described above.

一种计算机可读存储介质，其中存储有处理器可执行的程序，所述处理器可执行的程序在由处理器执行时用于执行如上所述方法。A computer-readable storage medium in which a processor-executable program is stored, and the processor-executable program is used to perform the above method when executed by the processor.

本发明的有益效果是：本发明将点云语义分割算法应用到目标检测任务上，通过融合先验知识和语义分割算法降低自动驾驶场景中3D目标检测的计算量大、模型复杂等问题。The beneficial effects of the present invention are: the present invention applies the point cloud semantic segmentation algorithm to the target detection task, and reduces the problems of large calculation amount and complex model of 3D target detection in autonomous driving scenarios by fusing prior knowledge and semantic segmentation algorithm.

附图说明Description of drawings

为了更清楚地说明本发明实施例或者现有技术中的技术方案，下面对本发明实施例或者现有技术中的相关技术方案附图作以下介绍，应当理解的是，下面介绍中的附图仅仅为了方便清晰表述本发明的技术方案中的部分实施例，对于本领域的技术人员而言，在无需付出创造性劳动的前提下，还可以根据这些附图获取到其他附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following is an introduction to the accompanying drawings of the embodiments of the present invention or the relevant technical solutions in the prior art. It should be understood that the drawings in the following introduction are only In order to facilitate and clearly describe some embodiments of the technical solutions of the present invention, those skilled in the art can also obtain other drawings based on these drawings without exerting creative efforts.

图1是本发明实施例中一种基于空间先验知识与点云语义信息的3D目标检测方法的流程示意图；Figure 1 is a schematic flow chart of a 3D target detection method based on spatial prior knowledge and point cloud semantic information in an embodiment of the present invention;

图2是本发明实施例中鸟瞰图投影方式示意图；Figure 2 is a schematic diagram of a bird's-eye view projection method in an embodiment of the present invention;

图3是本发明实施例中中心对称每个聚类簇示意图；Figure 3 is a schematic diagram of each centrally symmetric cluster cluster in the embodiment of the present invention;

图4是本发明实施例中一种基于空间先验知识与点云语义信息的3D目标检测方法的步骤流程图。Figure 4 is a flow chart of a 3D target detection method based on spatial prior knowledge and point cloud semantic information in an embodiment of the present invention.

具体实施方式Detailed ways

下面详细描述本发明的实施例，所述实施例的示例在附图中示出，其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的，仅用于解释本发明，而不能理解为对本发明的限制。对于以下实施例中的步骤编号，其仅为了便于阐述说明而设置，对步骤之间的顺序不做任何限定，实施例中的各步骤的执行顺序均可根据本领域技术人员的理解来进行适应性调整。Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals throughout represent the same or similar elements or elements with the same or similar functions. The embodiments described below with reference to the drawings are exemplary and are only used to explain the present invention and cannot be understood as limiting the present invention. The step numbers in the following embodiments are only set for the convenience of explanation. The order between the steps is not limited in any way. The execution order of each step in the embodiments can be adapted according to the understanding of those skilled in the art. sexual adjustment.

在本发明的描述中，需要理解的是，涉及到方位描述，例如上、下、前、后、左、右等指示的方位或位置关系为基于附图所示的方位或位置关系，仅是为了便于描述本发明和简化描述，而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作，因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that orientation descriptions, such as up, down, front, back, left, right, etc., are based on the orientation or position relationships shown in the drawings and are only In order to facilitate the description of the present invention and simplify the description, it is not intended to indicate or imply that the device or element referred to must have a specific orientation, be constructed and operate in a specific orientation, and therefore should not be construed as a limitation of the present invention.

在本发明的描述中，若干的含义是一个或者多个，多个的含义是两个以上，大于、小于、超过等理解为不包括本数，以上、以下、以内等理解为包括本数。如果有描述到第一、第二只是用于区分技术特征为目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量或者隐含指明所指示的技术特征的先后关系。In the description of the present invention, several means one or more, plural means two or more, greater than, less than, more than, etc. are understood to exclude the original number, and above, below, within, etc. are understood to include the original number. If there is a description of first and second, it is only for the purpose of distinguishing technical features, and cannot be understood as indicating or implying the relative importance or implicitly indicating the number of indicated technical features or implicitly indicating the order of indicated technical features. relation.

此外，在本发明的描述中，除非另有说明，“多个”是指两个或两个以上。“和/或”，描述关联对象的关联关系，表示可以存在三种关系，例如，A和/或B，可以表示：单独存在A，同时存在A和B，单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。Furthermore, in the description of the present invention, "plurality" means two or more unless otherwise stated. "And/or" describes the relationship between associated objects, indicating that there can be three relationships. For example, A and/or B can mean: A exists alone, A and B exist simultaneously, and B exists alone. The character "/" generally indicates that the related objects are in an "or" relationship.

本发明的描述中，除非另有明确的限定，设置、安装、连接等词语应做广义理解，所属技术领域技术人员可以结合技术方案的具体内容合理确定上述词语在本发明中的具体含义。In the description of the present invention, unless otherwise explicitly limited, words such as setting, installation, and connection should be understood in a broad sense. Those skilled in the art can reasonably determine the specific meaning of the above words in the present invention in combination with the specific content of the technical solution.

基于点云语义分割，可根据鸟瞰图视角类别的点云输出路面区域，进一步根据目标框的位置获得车辆的可行使区域，从而提供自动驾驶/辅助驾驶需要的场景信息。在避免3D目标检测方法来带的计算量增加、模型复杂等问题的同时，还能为自动驾驶场景提供更加细粒度的场景感知。基于此，本申请提供一种基于空间先验知识与点云语义信息的3D目标检测方法、系统、装置及介质，以实现通过融合先验知识和语义分割算法降低自动驾驶场景中3D目标检测的计算量大、模型复杂等问题。Based on point cloud semantic segmentation, the road area can be output based on the point cloud of the bird's-eye view category, and the vehicle's feasible area can be obtained based on the position of the target frame, thereby providing scene information required for autonomous driving/assisted driving. While avoiding problems such as increased computational load and complex models caused by 3D target detection methods, it can also provide more fine-grained scene perception for autonomous driving scenarios. Based on this, this application provides a 3D target detection method, system, device and medium based on spatial prior knowledge and point cloud semantic information, so as to reduce the cost of 3D target detection in autonomous driving scenarios by integrating prior knowledge and semantic segmentation algorithms. Problems such as large amount of calculation and complex model.

如图1和图4所示，本实施例提供一种基于空间先验知识与点云语义信息的3D目标检测方法，包括以下步骤：As shown in Figures 1 and 4, this embodiment provides a 3D target detection method based on spatial prior knowledge and point cloud semantic information, including the following steps:

S1、获取点云数据，将点云数据输入语义分割模型进行处理，获得带有分类信息的语义分割结果。实现对点云数据的语义分割。S1. Obtain point cloud data, input the point cloud data into the semantic segmentation model for processing, and obtain semantic segmentation results with classification information. Realize semantic segmentation of point cloud data.

S2、将语义分割后的点云数据投影到鸟瞰图视角。如图2所示，图2为投影过程的示意图。S2. Project the point cloud data after semantic segmentation to a bird's-eye view perspective. As shown in Figure 2, Figure 2 is a schematic diagram of the projection process.

S3、根据语义分割结果对鸟瞰图视角中的点云进行聚类，得到聚类结果。S3. Cluster the point cloud in the bird's-eye view according to the semantic segmentation results to obtain the clustering results.

在本实施例中，步骤S3具体包括步骤S31-S32：In this embodiment, step S3 specifically includes steps S31-S32:

S31、根据语义分割结果，按照需要检测的目标类别过滤鸟瞰图视角中的点云；比如需检测汽车目标，保留鸟瞰图中语义分割结果为汽车的点云。S31. Based on the semantic segmentation results, filter the point clouds in the bird's-eye view according to the target categories that need to be detected; for example, if you need to detect car targets, keep the point clouds in the bird's-eye view whose semantic segmentation results are cars.

S32、使用密度聚类算法对过滤后的点云数据进行聚类，得到聚类结果。以下以DBSCAN聚类算法为例进行说明：S32. Use density clustering algorithm to cluster the filtered point cloud data to obtain clustering results. The following uses the DBSCAN clustering algorithm as an example to illustrate:

一、定义参数：1. Define parameters:

1)Eps：定义密度时的邻域半径。1) Eps: Neighborhood radius when defining density.

1)MinPts：定义核心点时的阈值，形成簇所需的最小核心点数量。1)MinPts: The threshold when defining core points, the minimum number of core points required to form a cluster.

二、DBSCAN算法中将数据点分为以下3类2. In the DBSCAN algorithm, data points are divided into the following three categories:

1)核心点：稠密区域内部的点1) Core points: points inside dense areas

如果一个对象在其半径Eps内含有超过MmPts数目的点，则该对象为核心点。If an object contains more than MmPts points within its radius Eps, the object is a core point.

2)边界点：稠密区域边缘的点2) Boundary points: points on the edge of dense areas

如果一个对象在其半径Eps内含有点的数量小于MinPts，但是该对象落在核心点的邻域内，则该对象为边界点。If an object contains a number of points less than MinPts within its radius Eps, but the object falls within the neighborhood of the core point, the object is a boundary point.

3)噪音点：稀疏区域中的点3) Noise points: points in sparse areas

如果一个对象既不是核心点也不是边界点，则该对象为噪音点。If an object is neither a core point nor a boundary point, the object is a noise point.

三、DBSCAN算法中数据点之间的关系3. The relationship between data points in the DBSCAN algorithm

密度直达：如果P为核心点，Q在P的R邻域内，那么称P到Q密度直达。任何核心点到其自身密度直达，密度直达不具有对称性，如果P到Q密度直达，那么Q到P不一定密度直达。Density direct: If P is the core point and Q is in the R neighborhood of P, then it is called density direct from P to Q. Any core point is directly connected to its own density, and the density is direct without symmetry. If P is directly connected to Q, then Q is not necessarily directly connected to P.

密度可达：如果存在核心点P2，P3，……，Pn，且P1到P2密度直达，P2到P3密度直达，……，P(n-1)到Pn密度直达，Pn到Q密度直达，则P1到Q密度可达。密度可达也不具有对称性。Density reachability: If there are core points P2, P3,..., Pn, and the density from P1 to P2 is direct, and the density from P2 to P3 is direct,..., the density from P(n-1) to Pn is direct, and the density from Pn to Q is direct, Then the density from P1 to Q is reachable. Density attainable also has no symmetry.

密度相连：如果存在核心点S，使得S到P和Q都密度可达，则P和Q密度相连。密度相连具有对称性，如果P和Q密度相连，那么Q和P也一定密度相连。密度相连的两个点属于同一个聚类簇。Density connected: If there is a core point S such that S to P and Q are both density reachable, then P and Q are density connected. Density connection has symmetry. If P and Q are density connected, then Q and P must also be density connected. Two points that are densely connected belong to the same cluster.

非密度相连：如果两个点不属于密度相连关系，则两个点非密度相连。非密度相连的两个点属于不同的聚类簇，或者其中存在噪声点Non-density connected: If two points do not belong to the density-connected relationship, then the two points are not density-connected. Two points that are not densely connected belong to different clusters, or there are noise points in them.

四、DBSCAN算法步骤4. DBSCAN algorithm steps

1)根据给定的邻域参数Eps和MinPts确定所有的核心对象。1) Determine all core objects according to the given neighborhood parameters Eps and MinPts.

2)对每一个核心对象：选择一个未处理过的核心对象，找到由其密度可达的的样本生成聚类“簇”。2) For each core object: select an unprocessed core object and find samples with reachable density to generate clustering "clusters".

3)重复以上过程。3) Repeat the above process.

作为一种可选的实施方式，DBSCAN算法的伪代码如下：As an optional implementation, the pseudo code of the DBSCAN algorithm is as follows:

S4、对每个聚类簇进行中心对称处理。S4. Perform center symmetry processing on each cluster.

参见图3，对于每个聚类簇(即经过聚类算法得出的在空间中临近的一团点)，计算出该聚类簇距离点云原点(即激光雷达所处的坐标)的最左点和最右点(以原点为原点，用直线顺时针画圈，最先接触的点就是最左点，最后离开的点为最右点)，将最右点和最左点进行连线，找到该线段点中点，将将该聚类簇所有点基于线段中点进行中心对称处理。Referring to Figure 3, for each cluster (i.e., a group of points adjacent in space obtained through the clustering algorithm), the maximum distance between the cluster cluster and the point cloud origin (i.e., the coordinates of the lidar) is calculated. The left point and the rightmost point (take the origin as the origin, draw a circle clockwise with a straight line, the first point of contact is the leftmost point, and the last point left is the rightmost point), connect the rightmost point and the leftmost point , find the midpoint of the line segment point, and all points of the cluster will be centrally symmetrical based on the line segment midpoint.

具体而言，对于每个聚类簇，找到簇内距离点云原点的最左和最右点。将最左点和最右点连接成线段。找到该线段的中点，再将该聚类簇内的所有点基于线段中点进行中心对称。该步骤利用到了大物体(比如车辆)对称的信息，使得后续的目标检测框位置更加准确。Specifically, for each cluster, find the leftmost and rightmost points within the cluster from the origin of the point cloud. Connect the leftmost point and the rightmost point into a line segment. Find the midpoint of the line segment, and then center-symmetry all points in the cluster based on the midpoint of the line segment. This step utilizes the symmetry information of large objects (such as vehicles) to make subsequent target detection frame positions more accurate.

S5、基于对称后的聚类结果和物体高度的先验知识生成目标检测框。S5. Generate a target detection frame based on the symmetric clustering results and prior knowledge of the object height.

在本实施例中，步骤S5具体包括步骤S51-S53：In this embodiment, step S5 specifically includes steps S51-S53:

S51、生成各个聚类的外包多边形；S51. Generate outsourcing polygons for each cluster;

S52、基于外包多边形绘制最小的外接矩形；S52. Draw the smallest enclosing rectangle based on the enclosing polygon;

S53、结合目标高度的先验知识生成目标检测框。S53. Generate a target detection frame based on the prior knowledge of the target height.

由上可知，应用本发明实施例所提供的方法，获得待识别的点云数据；将点云数据输入语义分割模型进行处理，获得带有分类信息的预测结果；将语义分割后的点云数据投影到鸟瞰图视角；按需检测的目标类别，过滤鸟瞰图的点云。使用密度聚类算法聚类鸟瞰图上的点云，得到多个聚类簇；对每个聚类簇，将其内部的点基于簇中心点进行中心对称处理；计算每个聚类簇的外包矩形，再基于类别的高度先验知识，形成目标检测框，获得预测结果。该方法利用空间先验知识和语义分割算法，避免了自动驾驶场景中3D目标检测的计算量大、模型复杂等问题。同时，可根据鸟瞰图视角类别的点云输出路面区域，根据目标框的位置获得车辆的可行使区域，为自动驾驶场景提供更加细粒度的场景感知。It can be seen from the above that the method provided by the embodiment of the present invention is applied to obtain point cloud data to be identified; the point cloud data is input into the semantic segmentation model for processing, and a prediction result with classification information is obtained; the point cloud data after semantic segmentation is Project to a bird's-eye view perspective; filter the bird's-eye view point cloud based on the target category detected on demand. Use the density clustering algorithm to cluster the point cloud on the bird's-eye view to obtain multiple clusters; for each cluster, perform central symmetry processing on the internal points based on the cluster center point; calculate the outsourcing of each cluster Rectangle, and then based on the high degree of prior knowledge of the category, a target detection frame is formed to obtain the prediction result. This method uses spatial prior knowledge and semantic segmentation algorithms to avoid the problems of large calculation amount and complex model of 3D target detection in autonomous driving scenarios. At the same time, the road area can be output based on the point cloud of the bird's-eye view category, and the vehicle's feasible area can be obtained based on the position of the target frame, providing more fine-grained scene perception for autonomous driving scenarios.

本实施例还提供一种基于点云语义信息的3D目标检测系统，包括：This embodiment also provides a 3D target detection system based on point cloud semantic information, including:

进一步作为可选的实施方式，所述对每个聚类簇进行中心对称处理，包括：As a further optional implementation, performing central symmetry processing on each cluster includes:

找到该线段的中点；Find the midpoint of the line segment;

进一步作为可选的实施方式，所述基于对称后的聚类结果和物体高度的先验知识生成目标检测框，包括：As a further optional implementation, generating a target detection frame based on the symmetric clustering results and prior knowledge of the object height includes:

本实施例的一种基于点云语义信息的3D目标检测系统，可执行本发明方法实施例所提供的一种基于点云语义信息的3D目标检测方法，可执行方法实施例的任意组合实施步骤，具备该方法相应的功能和有益效果。The 3D target detection system based on point cloud semantic information in this embodiment can execute the 3D target detection method based on point cloud semantic information provided by the method embodiment of the present invention, and can execute any combination of implementation steps of the method embodiment. , possessing the corresponding functions and beneficial effects of this method.

本实施例还提供一种计算机可读存储介质，其中存储有处理器可执行的程序，所述处理器可执行的程序在由处理器执行时用于执行如图4所示的方法。This embodiment also provides a computer-readable storage medium in which a processor-executable program is stored, and the processor-executable program is used to perform the method shown in Figure 4 when executed by the processor.

本实施例的基于用户行为的车辆目的地预测系统，可执行本发明方法实施例所提供的基于用户行为的车辆目的地预测方法，可执行方法实施例的任意组合实施步骤，具备该方法相应的功能和有益效果。The vehicle destination prediction system based on user behavior of this embodiment can execute the vehicle destination prediction method based on user behavior provided by the method embodiment of the present invention, can execute any combination of implementation steps of the method embodiment, and has the corresponding requirements of the method. functions and beneficial effects.

本申请实施例还公开了一种计算机程序产品或计算机程序，该计算机程序产品或计算机程序包括计算机指令，该计算机指令存储在计算机可读存介质中。计算机设备的处理器可以从计算机可读存储介质读取该计算机指令，处理器执行该计算机指令，使得该计算机设备执行图4所示的方法。The embodiment of the present application also discloses a computer program product or computer program. The computer program product or computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device can read the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the computer device performs the method shown in FIG. 4 .

本实施例还提供了一种存储介质，存储有可执行本发明方法实施例所提供的一种基于点云语义信息的3D目标检测方法的指令或程序，当运行该指令或程序时，可执行方法实施例的任意组合实施步骤，具备该方法相应的功能和有益效果。This embodiment also provides a storage medium that stores instructions or programs that can execute a 3D target detection method based on point cloud semantic information provided by the method embodiment of the present invention. When the instructions or programs are run, Any combination of implementation steps of the method embodiments has the corresponding functions and beneficial effects of the method.

在一些可选择的实施例中，在方框图中提到的功能/操作可以不按照操作示图提到的顺序发生。例如，取决于所涉及的功能/操作，连续示出的两个方框实际上可以被大体上同时地执行或所述方框有时能以相反顺序被执行。此外，在本发明的流程图中所呈现和描述的实施例以示例的方式被提供，目的在于提供对技术更全面的理解。所公开的方法不限于本文所呈现的操作和逻辑流程。可选择的实施例是可预期的，其中各种操作的顺序被改变以及其中被描述为较大操作的一部分的子操作被独立地执行。In some alternative embodiments, the functions/operations noted in the block diagrams may occur out of the order noted in the operational illustrations. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending on the functionality/operations involved. Furthermore, the embodiments presented and described in the flow diagrams of the present invention are provided by way of example for the purpose of providing a more comprehensive understanding of the technology. The disclosed methods are not limited to the operations and logical flows presented herein. Alternative embodiments are contemplated in which the order of various operations is changed and in which sub-operations described as part of a larger operation are performed independently.

此外，虽然在功能性模块的背景下描述了本发明，但应当理解的是，除非另有相反说明，所述的功能和/或特征中的一个或多个可以被集成在单个物理装置和/或软件模块中，或者一个或多个功能和/或特征可以在单独的物理装置或软件模块中被实现。还可以理解的是，有关每个模块的实际实现的详细讨论对于理解本发明是不必要的。更确切地说，考虑到在本文中公开的装置中各种功能模块的属性、功能和内部关系的情况下，在工程师的常规技术内将会了解该模块的实际实现。因此，本领域技术人员运用普通技术就能够在无需过度试验的情况下实现在权利要求书中所阐明的本发明。还可以理解的是，所公开的特定概念仅仅是说明性的，并不意在限制本发明的范围，本发明的范围由所附权利要求书及其等同方案的全部范围来决定。Furthermore, although the present invention has been described in the context of functional modules, it should be understood that, unless stated to the contrary, one or more of the described functions and/or features may be integrated in a single physical device and/or or software modules, or one or more functions and/or features may be implemented in separate physical devices or software modules. It will also be understood that a detailed discussion regarding the actual implementation of each module is not necessary to understand the invention. Rather, the actual implementation of the various functional modules in the apparatus disclosed herein will be within the ordinary skill of an engineer, taking into account the properties, functions and internal relationships of the modules. Therefore, a person skilled in the art using ordinary skills can implement the invention set forth in the claims without undue experimentation. It will also be understood that the specific concepts disclosed are illustrative only and are not intended to limit the scope of the invention, which is to be determined by the full scope of the appended claims and their equivalents.

所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-Only Memory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention essentially or the part that contributes to the existing technology or the part of the technical solution can be embodied in the form of a software product. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of the present invention. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code. .

在流程图中表示或在此以其他方式描述的逻辑和/或步骤，例如，可以被认为是用于实现逻辑功能的可执行指令的定序列表，可以具体实现在任何计算机可读介质中，以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用，或结合这些指令执行系统、装置或设备而使用。就本说明书而言，“计算机可读介质”可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。The logic and/or steps represented in the flowcharts or otherwise described herein, for example, may be considered a sequenced list of executable instructions for implementing the logical functions, and may be embodied in any computer-readable medium, For use by, or in combination with, instruction execution systems, devices or devices (such as computer-based systems, systems including processors or other systems that can fetch instructions from and execute instructions from the instruction execution system, device or device) or equipment. For the purposes of this specification, a "computer-readable medium" may be any device that can contain, store, communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.

计算机可读介质的更具体的示例(非穷尽性列表)包括以下：具有一个或多个布线的电连接部(电子装置)，便携式计算机盘盒(磁装置)，随机存取存储器(RAM)，只读存储器(ROM)，可擦除可编辑只读存储器(EPROM或闪速存储器)，光纤装置，以及便携式光盘只读存储器(CDROM)。另外，计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质，因为可以例如通过对纸或其他介质进行光学扫描，接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序，然后将其存储在计算机存储器中。More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections with one or more wires (electronic device), portable computer disk cartridges (magnetic device), random access memory (RAM), Read-only memory (ROM), erasable and programmable read-only memory (EPROM or flash memory), fiber optic devices, and portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium may even be paper or other suitable medium on which the program may be printed, as the paper or other medium may be optically scanned, for example, and subsequently edited, interpreted, or otherwise suitable as necessary. process to obtain the program electronically and then store it in computer memory.

应当理解，本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中，多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件来实现。例如，如果用硬件来实现，和在另一实施方式中一样，可用本领域公知的下列技术中的任一项或他们的组合来实现：具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路，具有合适的组合逻辑门电路的专用集成电路，可编程门阵列(PGA)，现场可编程门阵列(FPGA)等。It should be understood that various parts of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, various steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system. For example, if it is implemented in hardware, as in another embodiment, it can be implemented by any one or a combination of the following technologies known in the art: a logic gate circuit with a logic gate circuit for implementing a logic function on a data signal. Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, programmable gate arrays (PGA), field programmable gate arrays (FPGA), etc.

在本说明书的上述描述中，参考术语“一个实施方式/实施例”、“另一实施方式/实施例”或“某些实施方式/实施例”等的描述意指结合实施方式或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施方式或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施方式或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施方式或示例中以合适的方式结合。In the above description of this specification, reference to the description of the terms "one embodiment/example", "another embodiment/example" or "certain embodiments/examples" etc. is meant to be described in connection with the embodiment or example Specific features, structures, materials, or characteristics are included in at least one embodiment or example of the invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施方式，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施方式进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, substitutions and variations can be made to these embodiments without departing from the principles and purposes of the invention. The scope of the invention is defined by the claims and their equivalents.

以上是对本发明的较佳实施进行了具体说明，但本发明并不限于上述实施例，熟悉本领域的技术人员在不违背本发明精神的前提下还可做作出种种的等同变形或替换，这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a detailed description of the preferred implementation of the present invention, but the present invention is not limited to the above embodiments. Those skilled in the art can also make various equivalent modifications or substitutions without violating the spirit of the present invention. Equivalent modifications or substitutions are included within the scope defined by the claims of this application.

Claims

1. The 3D target detection method based on the point cloud semantic information is characterized by comprising the following steps of:

acquiring point cloud data, inputting the point cloud data into a semantic segmentation model for processing, and acquiring a semantic segmentation result with classification information;

projecting the semantically segmented point cloud data to a bird's eye view perspective;

clustering point clouds in the aerial view angle according to the semantic segmentation result to obtain a clustering result;

performing central symmetry processing on each cluster;

and generating a target detection frame based on the symmetrical clustering result and the priori knowledge of the object height.

2. The 3D object detection method based on the semantic information of the point cloud according to claim 1, wherein the clustering the point cloud in the bird's eye view perspective according to the semantic segmentation result to obtain a clustering result includes:

according to semantic segmentation results, filtering point clouds in the aerial view angle according to target categories to be detected;

and clustering the filtered point cloud data by using a density clustering algorithm to obtain a clustering result.

3. The 3D object detection method based on the point cloud semantic information according to claim 2, wherein the clustering the filtered point cloud data using the density clustering algorithm to obtain a clustering result includes:

a1, marking all objects in a data set of point cloud data as unprocessed states;

a2, judging the objects according to a preset neighborhood radius Eps and a neighborhood object number MinPts in sequence, and judging the objects as core points, boundary points or noise points;

a3, a new cluster C is established according to the core point p, and all objects in the neighborhood radius Eps of the core point p are added into the new cluster C;

a4, acquiring an object q which is not processed in the neighborhood radius Eps of the core point p, and adding an object which is not classified into any cluster in the neighborhood radius Eps of the object q into a new cluster C if the number of the objects in the neighborhood radius Eps of the object q is greater than or equal to the neighborhood object number MinPts;

a5, repeating the steps A2-A4 until all the objects are traversed, and obtaining a clustering result.

4. A 3D object detection method based on point cloud semantic information according to claim 3, wherein the A2 comprises:

if the object contains points exceeding the number MinPts of the neighborhood objects in the neighborhood radius Eps, judging that the object is a core point;

if the number of the points contained in the neighborhood radius Eps of the object is smaller than the number MinPts of the neighborhood objects and the object falls in the neighborhood radius Eps of the core points, judging the object as a boundary point;

if an object is neither a core point nor a boundary point, the object is determined to be a noise point.

5. The 3D object detection method based on the point cloud semantic information according to claim 1, wherein the performing central symmetry processing on each cluster includes:

for each cluster, obtaining the leftmost point and the rightmost point which are far from the point cloud origin of the cluster, and connecting the leftmost point and the rightmost point into a line segment;

acquiring the midpoint of the line segment;

and carrying out center symmetry processing on all points of the cluster based on the midpoints of the line segments.

6. The 3D object detection method based on point cloud semantic information according to claim 1, wherein the generating the object detection frame based on the symmetric clustering result and the priori knowledge of the object height comprises:

generating an outsourcing polygon of each cluster;

drawing a minimum circumscribed rectangle based on the outsourcing polygon;

and generating a target detection frame by combining the prior knowledge of the target height.

7. A 3D object detection system based on point cloud semantic information, comprising:

the semantic segmentation module is used for acquiring point cloud data, inputting the point cloud data into the semantic segmentation model for processing, and acquiring semantic segmentation results with classification information;

the point cloud projection module is used for projecting the point cloud data subjected to semantic segmentation to a bird's eye view perspective;

the point cloud clustering module is used for clustering point clouds in the aerial view according to the semantic segmentation result to obtain a clustering result;

the symmetrical processing module is used for carrying out central symmetrical processing on each cluster;

and the detection frame generation module is used for generating a target detection frame based on the symmetrical clustering result and the priori knowledge of the object height.

8. The 3D object detection system based on point cloud semantic information according to claim 7, wherein the performing a central symmetry process on each cluster includes:

for each cluster, finding the leftmost point and the rightmost point which are far away from the origin of the point cloud, and connecting the leftmost point and the rightmost point into a line segment;

finding the midpoint of the line segment;

9. 3D target detection device based on point cloud semantic information, characterized by comprising:

at least one processor;

at least one memory for storing at least one program;

the at least one program, when executed by the at least one processor, causes the at least one processor to implement the method of any one of claims 1-6.

10. A computer readable storage medium, in which a processor executable program is stored, characterized in that the processor executable program is for performing the method according to any of claims 1-6 when being executed by a processor.