CN117308950A

CN117308950A - Method for realizing autonomous navigation of robot by BIM

Info

Publication number: CN117308950A
Application number: CN202311221771.9A
Authority: CN
Inventors: 陈健新; 胡安泰; 王书鑫; 罗恩妮; 葛明
Original assignee: Hong Kong Industrial Artificial Intelligence And Robotics Research And Development Center Ltd
Current assignee: Hong Kong Industrial Artificial Intelligence And Robotics Research And Development Center Ltd
Priority date: 2023-09-21
Filing date: 2023-09-21
Publication date: 2023-12-29

Abstract

The invention relates to a method of using BIM to realize autonomous navigation of robots. First, the BIM model is used to generate a navigation map, and then the scene is optically scanned through three-dimensional scanning to obtain a point cloud image of the scene and semantic segmentation is performed to obtain a semantic map. The robot uses the semantic map to realize autonomy. navigation. The method of using BIM model to map conversion and semantic understanding to realize autonomous navigation of robots can overcome the obstacles of existing robot platforms in realizing automation and other related fields, and increase the autonomy of robots.

Description

Methods for realizing autonomous navigation of robots using BIM

技术领域Technical field

本发明涉及外机器人自主导航的技术，具体是涉及一种利用BIM模型到地图转换、语义理解实现机器人自主导航的方法。The present invention relates to the technology of autonomous navigation of external robots, and specifically to a method for realizing autonomous navigation of robots by utilizing BIM model to map conversion and semantic understanding.

背景技术Background technique

近年来，工业自动化领域取得了长足的进步，但人们仍然需要更易于使用、功能多样、对用户友好的机器人解决方案，以满足各行各业的不同需求。传统的工业机器人通常需要复杂的编程和集成，这给小规模操作和临时自动化方案带来了挑战。此外，现有机器人的导航、初始映射和报告功能可能无法满足不同行业和任务的需求。The field of industrial automation has made great progress in recent years, but there is still a need for robot solutions that are easier to use, versatile, and user-friendly to meet the different needs of various industries. Traditional industrial robots often require complex programming and integration, which creates challenges for small-scale operations and ad hoc automation scenarios. Additionally, the navigation, initial mapping, and reporting capabilities of existing robots may not meet the needs of different industries and tasks.

本发明旨在通过引入一种新型人工智能多用途移动机器人平台来应对这些挑战。该平台旨在克服机器人自动化的关键障碍，以增强机器人的自主性，即：The present invention aims to address these challenges by introducing a new artificial intelligence multi-purpose mobile robot platform. The platform aims to overcome key barriers to robotic automation to enhance robot autonomy, namely:

1.导航限制：现有的机器人解决方案通常涉及详细的制图过程，需要在复杂的环境中导航，这可能会影响操作效率。这些方案不适合建立临时自动化方案情况，而且会影响机器人自主导航和适应不同环境的能力。1. Navigation limitations: Existing robotic solutions often involve detailed mapping processes that require navigation in complex environments, which may affect operational efficiency. These solutions are not suitable for establishing temporary automation solutions and will affect the robot's ability to navigate autonomously and adapt to different environments.

2.建立初始导航地图的挑战：许多机器人的部署过程需要对现场进行手动扫描和并确定机器人初始位置设置，这样既耗时又容易出错。目前的机器人不能利用地图和激光雷达数据对环境进行精确的初始测绘，从而使部署过程更加复杂。2. The challenge of establishing an initial navigation map: The deployment process of many robots requires manual scanning of the site and determining the initial position setting of the robot, which is time-consuming and error-prone. Current robots cannot use maps and lidar data to perform accurate initial mapping of the environment, further complicating the deployment process.

3.报告不足：许多机器人的解决方案对提供报告等功能有限，使用户无法获得有关机器人运行和性能的相关情况和反馈。也由于缺乏比较全面的报告功能，这样会对后续持续对机器人的改善造成阻碍，使用户无法充分发挥机器人自动化操作的潜力。3. Insufficient reporting: Many bot solutions are limited in functionality such as providing reports, preventing users from getting relevant information and feedback about the bot’s operation and performance. Also due to the lack of a relatively comprehensive reporting function, this will hinder the subsequent continuous improvement of the robot and prevent users from fully utilizing the potential of robot automation operations.

发明内容Contents of the invention

本发明旨在提供利用BIM模型到地图转换、语义理解实现机器人自主导航的方法，以克服现有机器人平台在实现自动化等相关领域的障碍，增加机器人的自主性。The present invention aims to provide a method for realizing robot autonomous navigation using BIM model to map conversion and semantic understanding, so as to overcome the obstacles of existing robot platforms in realizing automation and other related fields, and increase the autonomy of robots.

本发明的技术方案为一种利用BIM实现机器人自主导航的方法，包括以下步骤：The technical solution of the present invention is a method for realizing robot autonomous navigation using BIM, which includes the following steps:

S1. 将BIM模型生成导航地图，包括利用BIM模型数据生成二维导航图及模型点云图并根据BIM模型中的对象边界形成模型点云图中与对象相对应的边界数据；并按照对象在BIM模型中的信息获取对象的语义信息；S1. Generate a navigation map from the BIM model, including using the BIM model data to generate a two-dimensional navigation map and a model point cloud map, and forming the boundary data corresponding to the object in the model point cloud map according to the object boundaries in the BIM model; and according to the objects in the BIM model The information in obtains the semantic information of the object;

S2. 通过三维扫描对BIM模型相对应的现场进行光学扫描并获得现场点云图，利用上述边界数据以及模型点云图对现场点云图进行对象分组并使用语义信息对现场点云图进行语义分割形成现场语义地图；S2. Optically scan the site corresponding to the BIM model through three-dimensional scanning and obtain the site point cloud map. Use the above boundary data and the model point cloud map to group the site point cloud maps into object groups and use semantic information to perform semantic segmentation on the site point cloud map to form site semantics. map;

S3. 将二维导航图及语义地图输出至机器人，机器人接收到导航指令后依据二维导航图及语义地图进行自主导航。S3. Output the two-dimensional navigation map and semantic map to the robot. After receiving the navigation instructions, the robot conducts autonomous navigation based on the two-dimensional navigation map and semantic map.

优选地，所述步骤S1中利用BIM模型生成二维导航图的步骤为：将BIM模型数据导出为二维地图，并对导出的二维地图进行修改；将修改后的二维地图导出为图形文件再转换为可执行的ROS地图。Preferably, the step of using the BIM model to generate a two-dimensional navigation map in step S1 is: exporting the BIM model data as a two-dimensional map, and modifying the exported two-dimensional map; exporting the modified two-dimensional map as a graphic. The file is then converted into an executable ROS map.

优选地，所述二维地图为AutoCad格式文档，对AutoCad格式的二维地图进行编辑后生成图形文件，最后产生ROS地图的PGM及YAML文件。Preferably, the two-dimensional map is an AutoCad format document. The two-dimensional map in AutoCad format is edited to generate a graphic file, and finally the PGM and YAML files of the ROS map are generated.

优选地，所述步骤S2中对现场点云图进行语义分割获得语义地图的步骤为：Preferably, the step of performing semantic segmentation on the on-site point cloud image to obtain a semantic map in step S2 is:

S2-1.三维扫描获得的现场点云图进行对象分组，分组方法为通过基于邻近度举例的算法判断现场点云图中的每个点的所构成的几何及空间信息进行分析，以判断其属于对应的对象，实现对象分组；S2-1. The on-site point cloud image obtained by three-dimensional scanning is grouped into objects. The grouping method is to judge the geometric and spatial information composed of each point in the on-site point cloud image through an algorithm based on proximity examples and analyze it to determine whether it belongs to the corresponding Objects to implement object grouping;

S2-2.将从BIM模型中所获取的对象的语义信息与现场点云图中对应的对象形成影射关系；S2-2. Form a mapping relationship between the semantic information of the object obtained from the BIM model and the corresponding object in the on-site point cloud map;

S2-3.重复步骤S2-1及S2-2对现场点云图的对象与BIM模型的语义新型形成影射关系后获得现场语义地图。S2-3. Repeat steps S2-1 and S2-2 to form a mapping relationship between the objects in the on-site point cloud map and the semantic models of the BIM model to obtain the on-site semantic map.

优选地，对现场点云图进行对象分组的方法还包括将对象分组中的点云数据中的几何信息进行编码并输入至神经网络进行处理以识别有用的几何特征；神经网络对几何信息进行特征编码以过滤现场点云图中的噪音信息；然后对特征编码进行解码后完成语义分割，并与对应对象的语义信息形成影射关系。Preferably, the method of grouping objects in the on-site point cloud map also includes encoding the geometric information in the point cloud data in the object grouping and inputting it into a neural network for processing to identify useful geometric features; the neural network performs feature encoding on the geometric information. To filter the noise information in the on-site point cloud image; then decode the feature encoding to complete the semantic segmentation, and form a mapping relationship with the semantic information of the corresponding object.

优选地，还包括以下步骤：Preferably, the following steps are also included:

S4.更新语义地图，利用现场点云图对对象的几何及空间特征进行识别及分类，并进行语义分割后更新语义地图及BIM模型。S4. Update the semantic map, use the on-site point cloud map to identify and classify the geometric and spatial features of the object, perform semantic segmentation and then update the semantic map and BIM model.

S5. 基于现场信息的汇报，将三维扫描获得的现场点云图与BIM模型生成的点云图进行比对，依据比对结果进行汇报。S5. Based on the report of on-site information, compare the on-site point cloud image obtained by 3D scanning with the point cloud image generated by the BIM model, and report based on the comparison results.

与现有技术相比，本发明的有益效果如下：发明具有广泛的商业应用，尤其是在建筑工地和工厂环境中。包括：Compared with the existing technology, the beneficial effects of the present invention are as follows: the invention has a wide range of commercial applications, especially in construction sites and factory environments. include:

建筑工地：通过进行语义分割和BIM模型的功能可显著提高施工现场的运营效率。通过实时提供准确的竣工信息，项目经理和现场工程师可以在资源分配、时间安排和进度跟踪方面做出更明智的决策。此外，语义分割可以帮助自主导航系统在复杂的施工现场准确识别和避开障碍物，进一步提高现场作业的效率。生成的BIM模型可用于实时报告施工状态，从而及早发现潜在问题并及时处理，减少代价高昂的延误和返工的可能性。Construction sites: The operational efficiency of construction sites can be significantly improved through the capabilities of semantic segmentation and BIM models. By providing accurate as-built information in real time, project managers and field engineers can make more informed decisions about resource allocation, scheduling and progress tracking. In addition, semantic segmentation can help autonomous navigation systems accurately identify and avoid obstacles in complex construction sites, further improving the efficiency of on-site operations. The resulting BIM model can be used to report construction status in real time, allowing potential problems to be identified early and dealt with promptly, reducing the possibility of costly delays and rework.

工厂环境：本发明还能显著提高工厂环境的效率和生产力。通过利用语义分割技术识别和定位工厂车间的不同物体和机械，机器人系统可以更安全、更高效地在环境中导航。此外，集成BIM到地图的自动创建功能可让工厂管理人员创建精确的设施3D模型，为设备的布局和定位提供有价值的见解。它还可以快速准确地模拟工厂，创建数字模型，进一步优化自动化实施。这些模型可用于优化机器和材料的摆放，缩短生产时间，降低成本。Factory environment: The invention can also significantly improve the efficiency and productivity of factory environments. By leveraging semantic segmentation technology to identify and locate different objects and machinery on the factory floor, robotic systems can navigate their environment more safely and efficiently. Additionally, integrated BIM-to-map automated creation allows factory managers to create accurate 3D models of facilities, providing valuable insights into the layout and positioning of equipment. It can also quickly and accurately simulate factories, creating digital models to further optimize automation implementation. These models can be used to optimize the placement of machines and materials, shortening production times and reducing costs.

建设环境：除了缺陷检查或巡逻，本发明还能增强机器人在建筑环境中的导航能力，执行特定任务，并模仿人类的方式报告相关问题及发现。Construction environment: In addition to defect inspection or patrolling, the invention can enhance the robot's ability to navigate in the construction environment, perform specific tasks and report related issues and findings in a human-like manner.

因此，本发明在提高各行各业的效率和质量方面的潜力，凸显了该技术的广泛适用性及其作为多功能强大工具的价值。The invention's potential to improve efficiency and quality across a variety of industries therefore highlights the technology's broad applicability and its value as a versatile and powerful tool.

附图说明Description of drawings

图1是本发明的工作流程图；Figure 1 is a work flow chart of the present invention;

图2是本发明中由BIM模型生成导航图的工作流程图；Figure 2 is a workflow diagram for generating navigation diagrams from BIM models in the present invention;

图3是本发明中将三维扫描点云转化为语义地图的工作流程图；Figure 3 is a workflow diagram for converting three-dimensional scanned point clouds into semantic maps in the present invention;

图4是本发明中制动机构的俯视图；Figure 4 is a top view of the braking mechanism of the present invention;

实施方式Implementation

以下将结合实施例和附图对本发明的构思、具体结构及产生的技术效果进行清楚、完整的描述，以充分地理解本发明的目的、方案和效果。需要说明的是，在不冲突的情况下，本申请中的实施例及实施例中的特征可以相互组合。The following will give a clear and complete description of the concept, specific structure and technical effects of the present invention in conjunction with the embodiments and drawings, so as to fully understand the purpose, solutions and effects of the present invention. It should be noted that, as long as there is no conflict, the embodiments and features in the embodiments of this application can be combined with each other.

本系统的主要目的是通过克服导航、初始测绘和报告不足等现有难题，推出一种新型的人工智能多用途移动机器人平台。这个系统的整体工作流程图如图1所示。BIM文件首先通过边界生成和BIM到模型点云图的转换来提取语义信息。一旦在模型点云图中定位了对象，就会进行语义标注，为每个对象分配相对应的类别，以生成语义地图。然后，将语义图与语义分割输出的分割点云进行匹配。然后，就可以找到语义图的位置。此外，这些位置将输入至自主导航系统，包括建立初始导航地图、导航和基于语义位置的报告。The main purpose of this system is to launch a new artificial intelligence multi-purpose mobile robot platform by overcoming existing challenges such as navigation, initial mapping and reporting deficiencies. The overall workflow diagram of this system is shown in Figure 1. BIM files first extract semantic information through boundary generation and conversion of BIM to model point cloud images. Once objects are located in the model point cloud, semantic annotation is performed and each object is assigned a corresponding category to generate a semantic map. Then, the semantic map is matched with the segmented point cloud output by semantic segmentation. Then, the location of the semantic graph can be found. Additionally, these positions will be input into the autonomous navigation system, including building initial navigation maps, navigation and semantic position-based reporting.

在建立初始导航地图中，语义地图可以为机器人建立初始导航地图提供额外的信息，由于在地图中有许多相近的不同区域，如果仅依据区域的形状信息很难识别到所处的初始位置。也正由于自主导航系统被放置在不同的位置，每次初始化的导航地图也可能不同。建立初始导航地图有助于获得正确的地图位置并获得自主导航系统的当前位置。In establishing the initial navigation map, the semantic map can provide additional information for the robot to establish the initial navigation map. Since there are many similar different areas in the map, it is difficult to identify the initial position based only on the shape information of the area. Precisely because the autonomous navigation system is placed in different locations, the navigation map initialized each time may also be different. Building an initial navigation map helps obtain the correct map position and obtain the current position of the autonomous navigation system.

在导航过程中，语义地图可以帮助自主导航系统更好地了解周围环境和环境中的物体，从而帮助自主导航系统在环境中导航。语义地图可以帮助自主导航系统识别环境中的物体并区分它们。这可以帮助自主导航系统绕过障碍物并避免碰撞。此外，还可用于规划和执行环境中的任务。例如，如果自主导航系统的任务是从一个指定位置拾取一个物体，语义地图可以提供有关物体位置和到达路径的信息。因此，语义地图可以帮助自主导航系统导航，使其更好地了解环境和环境中的物体。这可以提高导航的效率和准确性，并提高安全性和可靠性。During the navigation process, semantic maps can help the autonomous navigation system better understand the surrounding environment and objects in the environment, thereby helping the autonomous navigation system navigate in the environment. Semantic maps can help autonomous navigation systems identify objects in the environment and differentiate between them. This can help autonomous navigation systems maneuver around obstacles and avoid collisions. Additionally, it can be used to plan and execute tasks in the environment. For example, if an autonomous navigation system is tasked with picking up an object from a specified location, a semantic map can provide information about the object's location and path to reach it. Therefore, semantic maps can help autonomous navigation systems navigate, giving them a better understanding of the environment and objects in it. This improves the efficiency and accuracy of navigation and increases safety and reliability.

在基于语义定位的报告中，语义地图通过提供更准确、更相关的事件或事故（如缺陷检查）位置信息来帮助报告。通过使用语义定位，机器人可以向用户提供有关事件发生地点的更详细信息，从而帮助用户更好地理解事件的背景和意义。此外，语义本地化还能帮助机器人识别在特定地点发生的事件的模式和趋势，这对于开发更深入、更有洞察力的报道非常有用。因此，语义本地化可以提高报道的质量和准确性。In semantic location-based reporting, semantic maps aid reporting by providing more accurate and relevant information about the location of events or incidents (such as defect inspections). By using semantic localization, bots can provide users with more detailed information about where an event occurred, helping users better understand the context and significance of the event. In addition, semantic localization can help bots identify patterns and trends in events occurring in specific locations, which is useful for developing deeper and more insightful coverage. Therefore, semantic localization can improve the quality and accuracy of coverage.

下面针对本发明的各个工作流程做出详细的说明：The following is a detailed description of each work process of the present invention:

BIM即建筑信息模型，是对建筑物或结构的物理和功能特性实现数字化的模型。它包括使用计算机辅助设计（CAD）软件创建建筑物的虚拟模型，其中包含有关其组件、系统和材料的数据。BIM, Building Information Model, is a model that digitizes the physical and functional characteristics of a building or structure. It involves using computer-aided design (CAD) software to create a virtual model of a building that contains data about its components, systems and materials.

BIM生成地图包括两个部分，BIM转三维模型世界地图和BIM转二维地图。BIM生成地图主要用于Gazebo和RVIZ仿真，可帮助用户在虚拟世界中进行仿真并获得可视化结果。开发人员可以使用这一解决方案，在没有真实环境的情况下开始编程和建模。近年来，随着越来越多的公司认识到使用BIM技术的好处，BIM在建筑行业越来越受欢迎。未来，BIM制图方法将广泛应用于建筑业和制造业。BIM-generated maps include two parts, BIM-to-3D model world map and BIM-to-2D map. BIM-generated maps are mainly used for Gazebo and RVIZ simulations, which can help users simulate in the virtual world and obtain visualization results. Developers can use this solution to start programming and modeling without a real-world environment. In recent years, BIM has become increasingly popular in the construction industry as more and more companies realize the benefits of using BIM technology. In the future, BIM mapping methods will be widely used in the construction and manufacturing industries.

BIM生成地图的工作流程如图2所示。首先，在Revit中选择二维地图并生成其.dwg文件。然后将该文件导入AutoCAD，不同的Revit模型会有不同格式的CAD图纸，用户需要删除所有标记、尺寸和图纸中其他无用的部分。例如，自主导航系统或人可以通过的部分应该有相应通道，而自主导航系统或人不能通过的墙壁或边界应该是通过不间断的线连接等。当CAD文件修改完成后，从AutoCAD导出PNG或JPG文件。ROS地图的格式为PGM，因此还需要进行转换。最后，当YAML包含PGM地图的文件被生成时，就将成为可执行的ROS地图。YAML文件应包含一些必要的参数，如ROS地图的路径、分辨率、原始二维姿态、占用阈值、空闲阈值和negate参数等。在RVIZ中启动导航后，用户需要初始化位置和姿态。用户还可以发送ROS消息来设置初始位置和目的地。The workflow of BIM map generation is shown in Figure 2. First, select the 2D map in Revit and generate its .dwg file. Then import the file into AutoCAD. Different Revit models will have CAD drawings in different formats. The user needs to delete all marks, dimensions and other useless parts of the drawing. For example, parts that can be passed by autonomous navigation systems or people should have corresponding passages, while walls or boundaries that cannot be passed by autonomous navigation systems or people should be connected by uninterrupted lines, etc. When the CAD file modification is completed, export the PNG or JPG file from AutoCAD. The format of the ROS map is PGM, so it also needs to be converted. Finally, when the YAML file containing the PGM map is generated, it will become an executable ROS map. The YAML file should contain some necessary parameters, such as the path of the ROS map, resolution, original 2D pose, occupancy threshold, idle threshold and negate parameters, etc. After starting navigation in RVIZ, the user needs to initialize the position and attitude. Users can also send ROS messages to set initial location and destination.

在使用ROS模拟的传统过程中，用户需要在环境中进行SLAM（定位与地图构建（Simultaneous Localization and Mapping））并绘制可用于RVIZ的地图。因此，使用环境和自主导航系统都必须准备就绪。然而，有时我们需要在建筑物建成之前进行模拟并得出可行性结果。如果我们转而使用BIM到地图的方法，我们就不需要设置环境和绘制地图，而是可以直接在BIM模型创建的虚拟环境中进行模拟。In the traditional process of using ROS simulation, users need to perform SLAM (Simultaneous Localization and Mapping) in the environment and draw a map that can be used for RVIZ. Therefore, both the usage environment and the autonomous navigation system must be ready. However, sometimes we need to simulate and derive feasibility results before the building is built. If we switch to a BIM-to-map approach, we do not need to set up the environment and draw the map, but can directly perform simulations in the virtual environment created by the BIM model.

使用BIM制图的好处包括提高效率、提高准确性、减少错误和返工，以及利益相关者之间更好的沟通。BIM还可以通过更有效的时间安排和协调，帮助降低成本和缩短项目时间。BIM地图可以与计算机视觉合作完成某些任务。具体来说，从BIM文件生成的地图可与BIM提供的信息自动链接，这些信息是可用于报告目的的重要数据。The benefits of using BIM mapping include increased efficiency, improved accuracy, reduced errors and rework, and better communication among stakeholders. BIM can also help reduce costs and shorten project times through more efficient scheduling and coordination. BIM maps can work with computer vision to accomplish certain tasks. Specifically, maps generated from BIM files can be automatically linked with information provided by BIM, which is important data that can be used for reporting purposes.

通过使用激光雷达对现场进行扫描并相应获得三维数据，本系统可以自动分割重要的建筑结构，如墙、柱、梁、管道和暖通空调系统。这一过程包括根据几何和空间特征识别点云数据中的各个组件并对其进行分类。先进的深度学习算法（如ResPointNet++）可用于提高语义分割的效率和准确性。然后将分割后的组件转化为相应的BIM元素，从而生成详细、准确的竣工BIM模型。这一自动化流程可简化BIM模型的创建，从而更好地管理建筑信息、加强维护规划和提高设施管理效率。By using lidar to scan the site and obtain 3D data accordingly, the system can automatically segment important building structures such as walls, columns, beams, pipes and HVAC systems. This process involves identifying and classifying individual components in point cloud data based on geometric and spatial characteristics. Advanced deep learning algorithms such as ResPointNet++ can be used to improve the efficiency and accuracy of semantic segmentation. The segmented components are then converted into corresponding BIM elements to generate detailed and accurate as-built BIM models. This automated process simplifies the creation of BIM models to better manage building information, enhance maintenance planning and improve facility management efficiency.

具体的工作流程如图2所示并详细描述如下：The specific workflow is shown in Figure 2 and described in detail as follows:

步骤1输入数据：首先通过激光雷达获取现场点云图，将现场点云图达数据输入神经网络。每个现场点云图的点是一个三维的空间点，其包含XYZ坐标、强度值和反射率数据等信息。Step 1: Input data: First, obtain the on-site point cloud image through lidar, and input the on-site point cloud image data into the neural network. Each point in the on-site point cloud map is a three-dimensional space point, which contains information such as XYZ coordinates, intensity values, and reflectivity data.

步骤2分组：由于现场点云图可以包含多个对象，因此在现场点云图分割之前最好先对接近的点进行分组，以将不同的对象分开。对于分组，它主要基于来自现场点云图的几何和空间信息。根据经验设定的阈值，如这些检测到这些点的点距离相关区域太远，则应当将它们会被分类到其他对象组中。Step 2 Grouping: Since the on-site point cloud map can contain multiple objects, it is best to group close points before segmenting the on-site point cloud map to separate different objects. For grouping, it is mainly based on geometric and spatial information from on-site point cloud images. Based on an empirically set threshold, if these detected points are too far away from the relevant area, they should be classified into other object groups.

步骤3特征编码：此步骤涉及对现场点云图的几何信息进行编码，以便能够通过神经网络进行处理，向神经网络输入带有空间信息的现场点云图数据，并利用多个神经网络层来捕获现场点云图中局部几何特征。此外，通过设定大小的采样来对这些点进行归一化处理。基于特征编码过程，利用编码特征可以压缩现场点云图中局部几何信息，并将有用信息传递给解码器，从而可以进一步减少由捕获设备和周围环境引起的噪声。Step 3 Feature Encoding: This step involves encoding the geometric information of the scene point cloud map so that it can be processed through the neural network, inputting the scene point cloud map data with spatial information to the neural network, and utilizing multiple neural network layers to capture the scene Local geometric features in point cloud images. Additionally, the points are normalized by taking samples of a set size. Based on the feature encoding process, the encoding features can be used to compress the local geometric information in the on-site point cloud image and pass the useful information to the decoder, which can further reduce the noise caused by the capture device and the surrounding environment.

步骤4特征解码：对特征进行编码后，需要对其进行解码，以生成点云的语义分割。具体是通过应用特征解码过程来实现的，该过程使用解码网络生成一组对应于场景中不同物体类别的每点标签。解码网络可利用RGB图像或其他传感器数据等附加信息来提高分割的准确性。Step 4 Feature Decoding: After encoding the features, they need to be decoded to generate semantic segmentation of the point cloud. This is achieved by applying a feature decoding process, which uses a decoding network to generate a set of per-point labels corresponding to different object categories in the scene. The decoding network can exploit additional information such as RGB images or other sensor data to improve segmentation accuracy.

步骤5输出结果：输出结果是对激光雷达所获得的的现场点云图进行语义分割。语义分割是通过彩色编码方式对现场点云图进行编码的地图，其中每个点都被赋予了与场景中特定对象类别（如建筑物、道路或树木）相对应的标签。这种语义分割可作为后续应用的输入，如物体检测、自主导航和其他相关应用。Step 5 Output result: The output result is semantic segmentation of the on-site point cloud image obtained by lidar. Semantic segmentation is a color-coded map of a scene point cloud, where each point is assigned a label that corresponds to a specific object category in the scene, such as buildings, roads, or trees. This semantic segmentation can be used as input for subsequent applications such as object detection, autonomous navigation and other related applications.

通过应用特征编码和解码过程，所提出的方法可以处理和分割大型、复杂和形状不规则且密度各异的现场点云图。由此产生的语义分割可以提高自动驾驶汽车、机器人和城市规划等广泛应用的准确性和效率。By applying feature encoding and decoding processes, the proposed method can process and segment large, complex, and irregularly shaped live point clouds with varying densities. The resulting semantic segmentation can improve accuracy and efficiency in a wide range of applications, including self-driving cars, robotics, and urban planning.

a)使用BIM进行语义标注a) Use BIM for semantic annotation

基于BIM模型生成导航地图的技术，BIM模型中的模型点云图被标记并分组为对象。在分组和标记时，测量每个点与所有对象之间的距离，并将该点划分至与其属距离最小的对象。被标记的点用于形成语义图，这对于下一阶段的语义定位很有用。Based on the technology of generating navigation maps from BIM models, the model point cloud images in the BIM model are marked and grouped into objects. When grouping and labeling, measure the distance between each point and all objects, and divide the point to the object with the smallest distance to it. The labeled points are used to form a semantic map, which is useful for the next stage of semantic localization.

b）使用激光雷达语义图进行全域语义定位b) Use lidar semantic map for global semantic positioning

语义定位的工作流程如图3所示。在进行语义定位之前，必须在BIM模型转换为导航地图步骤中获得语义信息。然后，使用三维激光雷达进行三维扫描，并在扫描中形成现场点云图。然后，进行语义分割以对现场点云进行分组和标记。接着，分别根据距离和语义信息进行点匹配，将语义信息与扫描的现场点云进行匹配。尽管可以使用邻距离算法找到点与对象的对应关系，如进一步利用语义信息，可以过滤掉错误匹配并将进一步确保点与对象之间对应关系的正确性。然后，通过在优化变换步骤中最小化空间和语义距离方式找到点与对象对应关系。最后，在执行优化转换后，可以将语义信息与三维激光雷达扫描的现场点云图进行对齐，以便提供语义定位。The workflow of semantic positioning is shown in Figure 3. Before semantic positioning, semantic information must be obtained in the step of converting the BIM model into a navigation map. Then, 3D lidar is used to perform 3D scanning, and the on-site point cloud map is formed in the scan. Then, semantic segmentation is performed to group and label the field point cloud. Then, point matching is performed based on distance and semantic information respectively, and the semantic information is matched with the scanned on-site point cloud. Although the neighbor distance algorithm can be used to find the correspondence between points and objects, further utilizing semantic information can filter out false matches and further ensure the correctness of the correspondence between points and objects. Then, point-object correspondences are found by minimizing spatial and semantic distances in the optimization transformation step. Finally, after performing the optimization transformation, the semantic information can be aligned with the on-site point cloud map of the 3D lidar scan to provide semantic localization.

c)使用语义定位进行导航c) Use semantic positioning for navigation

使用语义定位的导航是指使用语义信息来辅助导航和进行路径规划。这可能涉及识别地标、兴趣点或其他语义信息，以帮助引导用户到达目的地。语义定位可用于一系列导航应用，包括GPS导航系统、室内导航系统和增强现实导航。在不同况下，其目的都是为用户提供一条清晰易懂的到达目的地的路线，并使用语义信息引导实现导航。Navigation using semantic positioning refers to using semantic information to assist navigation and path planning. This may involve identifying landmarks, points of interest, or other semantic information to help guide users to their destination. Semantic positioning can be used in a range of navigation applications, including GPS navigation systems, indoor navigation systems and augmented reality navigation. In each case, the aim is to provide users with a clear and easy-to-understand route to their destination, and to use semantic information to guide navigation.

在室内导航系统中，语义定位可用于识别室内环境的关键特征，例如房间、电梯或走廊。然后，考虑到室内环境的语义上下文关系，这些语义信息可用于为用户提供前往目的地的清晰指示。In indoor navigation systems, semantic localization can be used to identify key features of indoor environments, such as rooms, elevators, or corridors. This semantic information can then be used to provide users with clear directions to their destination, taking into account the semantic context of the indoor environment.

d)基于定位信息进行报告d) Report based on positioning information

使用语义定位信息进行报告是指以考虑信息的特定三维位置、上下文和语义的方式进行呈现或总结信息的过程。根据上述过程生成报告，根据语义标签或属性突出显示特定三维环境中的关键特征、对象或变化，或者使用激光雷达所获得现场点云图相关语义数据提供有关特定主题或事件的定位信息。Reporting using semantically positioned information is the process of presenting or summarizing information in a way that takes into account its specific three-dimensional location, context, and semantics. Generate reports based on the above process that highlight key features, objects, or changes in a specific three-dimensional environment based on semantic tags or attributes, or use semantic data related to on-site point clouds obtained by lidar to provide positioning information about specific topics or events.

在机器人技术的背景下，语义定位信息可用于生成有关仓库或制造设施中物体或障碍物的位置和语义的报告。这可能涉及根据语义标签或属性识别关键对象，并以对机器人控制系统相关且有意义的方式呈现此信息。In the context of robotics, semantic localization information can be used to generate reports about the location and semantics of objects or obstacles in a warehouse or manufacturing facility. This may involve identifying key objects based on semantic tags or attributes and presenting this information in a way that is relevant and meaningful to the robot control system.

以上所述，只是本发明的较佳实施例而已，本发明并不局限于上述实施方式，只要其以相同的手段达到本发明的技术效果，凡在本公开的精神和原则之内，所做的任何修改、等同替换、改进等，均应包含在本公开保护的范围之内。都应属于本发明的保护范围。在本发明的保护范围内其技术方案和/或实施方式可以有各种不同的修改和变化。The above are only preferred embodiments of the present invention. The present invention is not limited to the above-mentioned embodiments. As long as the technical effects of the present invention are achieved by the same means, any modification can be made within the spirit and principles of the present disclosure. Any modifications, equivalent substitutions, improvements, etc. shall be included in the scope of protection of this disclosure. All belong to the protection scope of the present invention. Various modifications and changes may be made to the technical solutions and/or implementations within the scope of the present invention.

Claims

1. The method for realizing autonomous navigation of the robot by using BIM is characterized by comprising the following steps: the method comprises the following steps:

s1, generating a navigation map by using a BIM model, wherein the navigation map comprises a two-dimensional navigation map and a model point cloud map generated by using BIM model data, and boundary data corresponding to an object in the model point cloud map is formed according to the boundary of the object in the BIM model; acquiring semantic information of the object according to the information of the object in the BIM;

s2, carrying out optical scanning on a field corresponding to the BIM model through three-dimensional scanning, obtaining a field point cloud picture, carrying out object grouping on the field point cloud picture by utilizing the boundary data and the model point cloud picture, and carrying out semantic segmentation on the field point cloud picture by using semantic information to form a field semantic map;

and S3, outputting the two-dimensional navigation map and the semantic map to the robot, and performing autonomous navigation according to the two-dimensional navigation map and the semantic map after the robot receives the navigation instruction.

2. The method for realizing autonomous navigation of a robot by using BIM according to claim 1, wherein: the step of generating the two-dimensional navigation map by using the BIM model in the step S1 is as follows: the BIM model data are exported as a two-dimensional map, and the exported two-dimensional map is modified; the modified two-dimensional map is exported as a graphic file and converted to an executable ROS map.

3. The method for realizing autonomous navigation of a robot by using BIM according to claim 2, wherein the two-dimensional map is an AutoCad format document, the two-dimensional map in the AutoCad format is edited to generate a graphic file, and finally PGM and YAML files of the ROS map are generated.

4. The method for realizing autonomous navigation of a robot by using BIM according to claim 1, wherein: in the step S2, the step of performing semantic segmentation on the field point cloud image to obtain a semantic map includes:

s2-1, performing object grouping on the field point cloud image obtained by three-dimensional scanning, wherein the grouping method is to judge geometrical and spatial information formed by each point in the field point cloud image through an algorithm based on a proximity example for analysis so as to judge that the point cloud image belongs to a corresponding object and realize object grouping;

s2-2, forming a mapping relation between semantic information of the object obtained from the BIM model and the corresponding object in the field point cloud picture;

s2-3, repeating the steps S2-1 and S2-2 to form a mapping relation between the object of the field point cloud picture and semantic information of the BIM model, and then obtaining the field semantic map.

5. The method for achieving autonomous navigation of a robot using BIM according to claim 4, wherein: the method for grouping the on-site point cloud images further comprises the steps of encoding geometric information in point cloud data in the object groups and inputting the geometric information into a neural network for processing to identify useful geometric features; the neural network performs feature coding on the geometric information to filter noise information in the field point cloud picture; and then, decoding the feature codes to complete semantic segmentation, and forming a mapping relation with semantic information of the corresponding object.

6. The method for achieving autonomous robot navigation using BIM according to claim 1, wherein: the method also comprises the following steps:

s4, updating the semantic map, identifying and classifying geometric and spatial features of the object by using the on-site point cloud picture, and updating the semantic map after semantic segmentation.

7. The method for realizing autonomous navigation of a robot by using BIM according to claim 1, wherein: the method also comprises the following steps:

s5, comparing the field point cloud image obtained by three-dimensional scanning with the point cloud image generated by the BIM based on the report of the field information, and reporting according to the comparison result.