CN114708422B

CN114708422B - A method and device for calculating cabin door coordinates based on binocular images

Info

Publication number: CN114708422B
Application number: CN202210134640.6A
Authority: CN
Inventors: 周杰; 李健; 邓磊; 陈宝华
Original assignee: Tsinghua University
Current assignee: Tsinghua University
Priority date: 2022-02-14
Filing date: 2022-02-14
Publication date: 2024-06-28
Anticipated expiration: 2042-02-14
Also published as: CN114708422A

Abstract

The present invention discloses a method and device for calculating the coordinates of a cabin door based on a binocular image, wherein the method comprises: collecting a binocular image; inputting the binocular image into a first neural network model for first feature extraction, and obtaining a cabin door area sub-map by boundary regression; inputting the cabin door area sub-map into a second neural network model for second feature extraction, and performing a logistic regression on each pixel point of the cabin door area sub-map to obtain a cabin door seam edge map; extracting the coordinates of the edge points based on the cabin door seam edge map, and calculating two two-dimensional coordinate points of the cabin door; and obtaining two three-dimensional coordinate points of the cabin door by triangulation calculation based on the two two-dimensional coordinate points. The present invention can accurately calculate the coordinates of the aircraft cabin door relative to the corridor bridge, and guide the corridor bridge to automatically dock with the aircraft cabin door, greatly improving the degree of automation of the airport, and has strong theoretical significance and practical value.

Description

A method and device for calculating cabin door coordinates based on binocular images

技术领域Technical Field

本发明涉及坐标计算技术领域，尤其涉及一种基于双目图像的舱门坐标计算方法和装置。The present invention relates to the technical field of coordinate calculation, and in particular to a method and device for calculating hatch coordinates based on binocular images.

背景技术Background technique

随着现代社会的发展，航空出行成为人民日常生活中不可或缺的一部分。在机场服务乘客乘机的过程中，登机廊桥引导对位是必不可少的一步。登机廊桥指机场里用以连接候机厅与机舱之间的可移动升降的通道，在乘客上下飞机之前，机场需要引导登机廊桥的一端，使其对接到飞机的舱门上，以供乘客通过廊桥进入到机舱内。图1展示了登机廊桥的对接场景。With the development of modern society, air travel has become an indispensable part of people's daily life. In the process of serving passengers at the airport, boarding bridge guidance is an indispensable step. The boarding bridge refers to the movable lifting passage connecting the terminal and the cabin in the airport. Before passengers get on and off the plane, the airport needs to guide one end of the boarding bridge to dock with the aircraft's cabin door so that passengers can enter the cabin through the bridge. Figure 1 shows the docking scene of the boarding bridge.

目前国内机场基本全部采用人工驾驶廊桥的方式。驾驶员用肉眼判断廊桥与飞机之间的相对位置关系，操纵廊桥并将其对接到飞机舱门上。人工方法无法准确获得廊桥与飞机舱门之间的相对坐标，因此对接较低，同时培养、招聘经验丰富的驾驶员会给机场带来较高的人力成本。另外，受驾驶员判断能力的限制，人工对接的方式也容易引发事故。At present, almost all domestic airports use the method of manually driving the jet bridge. The driver uses the naked eye to judge the relative position relationship between the jet bridge and the aircraft, manipulates the jet bridge and docks it to the aircraft door. The manual method cannot accurately obtain the relative coordinates between the jet bridge and the aircraft door, so the docking rate is low. At the same time, training and recruiting experienced drivers will bring high labor costs to the airport. In addition, due to the limitations of the driver's judgment ability, the manual docking method is also prone to accidents.

对于目前国内机场基本全部采用人工驾驶廊桥的方式完成对接，没有已经商用的飞机舱门坐标计算方法。工业界和学术界对于自动廊桥引导对接系统的尝试分为激光测距法和图像对比法两种，其中激光测距法指在廊桥四周安装激光测距装置，测量飞机与廊桥之间的距离，从而辅助驾驶员对接廊桥；图像对比法指将飞机舱门图像与数据库中的图像做匹配，从而找到图像中的飞机舱门区域，引导廊桥对接到飞机上。At present, most domestic airports use manual driving to complete docking, and there is no commercially available method to calculate the coordinates of the aircraft door. The attempts of the industry and academia to develop an automatic docking system for the bridge guidance are divided into two methods: laser ranging method and image comparison method. The laser ranging method refers to installing laser ranging devices around the bridge to measure the distance between the aircraft and the bridge, thereby assisting the pilot to dock with the bridge; the image comparison method refers to matching the aircraft door image with the image in the database to find the aircraft door area in the image and guide the bridge to dock with the aircraft.

人工驾驶廊桥的方法无法准确获得廊桥与飞机舱门之间的相对坐标，因此对接较低，同时培养、招聘经验丰富的驾驶员会给机场带来较高的人力成本。另外，受驾驶员判断能力的限制，人工对接的方式也容易引发事故。The manual operation of the bridge cannot accurately obtain the relative coordinates between the bridge and the aircraft door, so the docking is low. At the same time, training and recruiting experienced pilots will bring high labor costs to the airport. In addition, due to the limitations of the driver's judgment ability, the manual docking method is also prone to accidents.

激光测距法的缺点包括两方面。一方面，舱门与廊桥间的相对位置坐标由水平方向、竖直方向和前进方向的三个数值组成，分别对应于廊桥上下旋转、左右旋转以及前进后退。而激光测距法只能获得前进方向的距离，因此只能控制廊桥的前进和后退，仍然需要驾驶员观察才能控制廊桥上下左右旋转。另一方面，激光传感器固定在廊桥上，激光发射的角度随廊桥的运动而变化，照射到飞机上的区域也在变化，其获得的距离值并不是飞机上的某个固定点与廊桥之间的距离，因此只能当作引导廊桥对接的参考值。The disadvantages of the laser ranging method include two aspects. On the one hand, the relative position coordinates between the cabin door and the corridor bridge are composed of three values in the horizontal direction, vertical direction and forward direction, which correspond to the up and down rotation, left and right rotation, and forward and backward rotation of the corridor bridge respectively. The laser ranging method can only obtain the distance in the forward direction, so it can only control the forward and backward movement of the corridor bridge, and the driver still needs to observe to control the up and down left and right rotation of the corridor bridge. On the other hand, the laser sensor is fixed on the corridor bridge, and the angle of laser emission changes with the movement of the corridor bridge, and the area irradiated on the aircraft is also changing. The distance value obtained is not the distance between a fixed point on the aircraft and the corridor bridge, so it can only be used as a reference value to guide the docking of the corridor bridge.

图像对比法的缺点包括两方面。一方面，图像对比法必须构建飞机舱门图片的数据库，飞机的种类非常多且每年都会有新增的飞机种类，维护这样的数据库需要较多精力。另一方面，图像对比法只能获得飞机舱门与廊桥间大致的相对位置关系，而不能获得准确的坐标数值，其对接的精度比较低。The disadvantages of the image comparison method include two aspects. On the one hand, the image comparison method must build a database of aircraft cabin door images. There are many types of aircraft and new types of aircraft are added every year. Maintaining such a database requires a lot of effort. On the other hand, the image comparison method can only obtain the approximate relative position relationship between the aircraft cabin door and the jet bridge, but cannot obtain the accurate coordinate value, and its docking accuracy is relatively low.

发明内容Summary of the invention

本发明旨在至少在一定程度上解决相关技术中的技术问题之一。The present invention aims to solve one of the technical problems in the related art at least to a certain extent.

为此，本发明的目的在于提出一种基于双目图像的飞机舱门坐标计算方法，能够准确计算飞机舱门相对于廊桥的坐标，并引导廊桥自动对接到飞机舱门上，极大地提升机场的自动化程度，具有较强的理论意义和实用价值。To this end, the purpose of the present invention is to propose a method for calculating the coordinates of an aircraft door based on binocular images, which can accurately calculate the coordinates of the aircraft door relative to the jet bridge and guide the jet bridge to automatically dock with the aircraft door, greatly improving the degree of automation at the airport, and has strong theoretical significance and practical value.

本发明的另一个目的在于提出一种基于双目图像的飞机舱门坐标计算装置。Another object of the present invention is to provide a device for calculating the coordinates of an aircraft door based on binocular images.

为达上述目的，本发明一方面提出了一种基于双目图像的飞机舱门坐标计算方法，包括：To achieve the above object, the present invention proposes a method for calculating the coordinates of an aircraft door based on a binocular image, comprising:

采集双目图像；将双目图像输入第一神经网络模型进行第一特征提取，通过边界回归得到舱门区域子图；将舱门区域子图输入第二神经网络模型进行第二特征提取，并对舱门区域子图的每个像素点做逻辑回归得到舱门门缝边缘图；基于舱门门缝边缘图提取边缘点的坐标，计算得到舱门的两个二维坐标点；基于两个二维坐标点，通过三角化计算得到舱门的两个三维坐标点。Acquire a binocular image; input the binocular image into a first neural network model for first feature extraction, and obtain a hatch area sub-image through boundary regression; input the hatch area sub-image into a second neural network model for second feature extraction, and perform logistic regression on each pixel point of the hatch area sub-image to obtain a hatch seam edge map; extract the coordinates of the edge points based on the hatch seam edge map, and calculate two two-dimensional coordinate points of the hatch; and obtain two three-dimensional coordinate points of the hatch through triangulation calculation based on the two two-dimensional coordinate points.

本发明实施例的基于双目图像的飞机舱门坐标计算方法，能够精确计算多个维度的舱门坐标，为登机廊桥引导系统提供更加充分准确的舱门位置信息，从根本上提高了廊桥引导系统的对接精度，有效解决了传统廊桥对接方法人力成本高、事故频发的问题，可以在机场的智能化和自动化方面发挥作用。本发明具有较大的理论和实践价值。The aircraft door coordinate calculation method based on binocular images in the embodiment of the present invention can accurately calculate the door coordinates in multiple dimensions, provide more sufficient and accurate door position information for the boarding bridge guidance system, fundamentally improve the docking accuracy of the bridge guidance system, and effectively solve the problems of high labor cost and frequent accidents in the traditional bridge docking method, which can play a role in the intelligence and automation of airports. The present invention has great theoretical and practical value.

另外，根据本发明上述实施例的基于双目图像的飞机舱门坐标计算方法还可以具有以下附加的技术特征：In addition, the aircraft door coordinate calculation method based on binocular images according to the above embodiment of the present invention may also have the following additional technical features:

进一步地，在本发明的一个实施例中，构建训练图像库，训练得到所述第一神经网络模型；构建训练数据库，训练得到所述第二神经网络模型。Furthermore, in one embodiment of the present invention, a training image library is constructed to obtain the first neural network model through training; and a training database is constructed to obtain the second neural network model through training.

进一步地，在本发明的一个实施例中，在得到所述舱门门缝边缘图之后，还包括：通过边缘点采样，得到所述舱门门缝边缘图的舱门门缝边缘点；通过边缘点分类，将所述边缘点分为上、下、左、右四类，通过带权重的最小二乘模块计算得到四个曲线方程；使用所述四个曲线方程构建全局曲线拟合误差。Furthermore, in one embodiment of the present invention, after obtaining the door seam edge map, it also includes: obtaining the door seam edge points of the door seam edge map by edge point sampling; dividing the edge points into four categories of top, bottom, left and right by edge point classification, and obtaining four curve equations by calculating a weighted least squares module; and using the four curve equations to construct a global curve fitting error.

进一步地，在本发明的一个实施例中，所述基于所述舱门门缝边缘图提取边缘点的坐标，计算得到舱门的两个二维坐标点，包括：基于所述舱门门缝边缘图，使用所述边缘点分类分别提取左边缘点集、右边缘点集和下边缘点集；使用ransac将所述左边缘点集和右边缘点集拟合为两条二次曲线，将所述下边缘点集拟合为直线；分别计算所述两条二次曲线与所述直线的交点，得到所述舱门的两个二维坐标点。Furthermore, in one embodiment of the present invention, the coordinates of edge points are extracted based on the hatch door seam edge map to calculate two two-dimensional coordinate points of the hatch door, including: based on the hatch door seam edge map, using the edge point classification to respectively extract a left edge point set, a right edge point set and a lower edge point set; using ransac to fit the left edge point set and the right edge point set into two quadratic curves, and to fit the lower edge point set into a straight line; and respectively calculating the intersection points of the two quadratic curves with the straight line to obtain the two two-dimensional coordinate points of the hatch door.

进一步地，在本发明的一个实施例中，所述双目图像包括第一双目图像和第二双目图像，所述基于所述两个二维坐标点，通过三角化得到舱门的两个三维坐标点，包括：预设计算得到的所述第一双目图像的两个二维坐标点分别为所述第二双目图像的两个二维坐标点分别为预设双目相机的焦距为f，基线为B，分别计算所述第一双目图像和所述第二双目图像的两个坐标点的深度值为：Further, in one embodiment of the present invention, the binocular image includes a first binocular image and a second binocular image, and the two three-dimensional coordinate points of the hatch are obtained by triangulation based on the two two-dimensional coordinate points, including: the two two-dimensional coordinate points of the first binocular image obtained by preset calculation are respectively The two two-dimensional coordinate points of the second binocular image are The focal length of the stereo camera is preset to be f, the baseline is B, and the depth values of the two coordinate points of the first stereo image and the second stereo image are calculated respectively as follows:

预设所述双目相机的内参矩阵为K，分别计算所述第一双目图像和所述第二双目图像的两个点的三维坐标为：The internal parameter matrix of the binocular camera is preset to be K, and the three-dimensional coordinates of two points in the first binocular image and the second binocular image are calculated as follows:

进一步地，在本发明的一个实施例中，所述方法还包括：将采集的所述双目图像进行缩放和裁剪，并输入所述第一神经网络模型。Furthermore, in one embodiment of the present invention, the method further includes: scaling and cropping the acquired binocular image, and inputting the image into the first neural network model.

进一步地，在本发明的一个实施例中，使用舱门的左边缘和右边缘的延长线与舱门下边缘的延长线相交形成的两个交点作为所述舱门的两个二维坐标点。Furthermore, in one embodiment of the present invention, two intersection points formed by the extension lines of the left and right edges of the hatch and the extension line of the lower edge of the hatch are used as two two-dimensional coordinate points of the hatch.

为达到上述目的，本发明另一方面提出了一种基于双目图像的舱门坐标计算装置，包括：To achieve the above object, the present invention proposes, on the other hand, a door coordinate calculation device based on binocular images, comprising:

采集模块，用于采集双目图像；第一提取模块，用于将所述双目图像输入第一神经网络模型进行第一特征提取，通过边界回归得到舱门区域子图；第二提取模块，用于将所述舱门区域子图输入第二神经网络模型进行第二特征提取，并对所述舱门区域子图的每个像素点做逻辑回归得到舱门门缝边缘图；第一计算模块，用于基于所述舱门门缝边缘图提取边缘点的坐标，计算得到舱门的两个二维坐标点；第二计算模块，用于基于所述两个二维坐标点，通过三角化计算得到舱门的两个三维坐标点。An acquisition module is used to acquire binocular images; a first extraction module is used to input the binocular images into a first neural network model for first feature extraction, and obtain a hatch area sub-image through boundary regression; a second extraction module is used to input the hatch area sub-image into a second neural network model for second feature extraction, and perform logistic regression on each pixel point of the hatch area sub-image to obtain a hatch door seam edge map; a first calculation module is used to extract the coordinates of edge points based on the hatch door seam edge map, and calculate two two-dimensional coordinate points of the hatch; a second calculation module is used to obtain two three-dimensional coordinate points of the hatch by triangulation calculation based on the two two-dimensional coordinate points.

本发明实施例的基于双目图像的舱门坐标计算装置，能够精确计算多个维度的舱门坐标，为登机廊桥引导系统提供更加充分准确的舱门位置信息，从根本上提高了廊桥引导系统的对接精度，有效解决了传统廊桥对接方法人力成本高、事故频发的问题，可以在机场的智能化和自动化方面发挥作用。本发明具有较大的理论和实践价值。The door coordinate calculation device based on binocular images in the embodiment of the present invention can accurately calculate the door coordinates in multiple dimensions, provide more sufficient and accurate door position information for the boarding bridge guidance system, fundamentally improve the docking accuracy of the bridge guidance system, effectively solve the problems of high labor cost and frequent accidents in the traditional bridge docking method, and can play a role in the intelligence and automation of the airport. The present invention has great theoretical and practical value.

本发明附加的方面和优点将在下面的描述中部分给出，部分将从下面的描述中变得明显，或通过本发明的实践了解到。Additional aspects and advantages of the present invention will be given in part in the following description and in part will be obvious from the following description, or will be learned through practice of the present invention.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

本发明上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解，其中：The above and/or additional aspects and advantages of the present invention will become apparent and easily understood from the following description of the embodiments in conjunction with the accompanying drawings, in which:

图1为现实场景中登机廊桥的对接场景示意图；FIG1 is a schematic diagram of a docking scenario of a boarding bridge in a real scenario;

图2为根据本发明实施例的基于双目图像的舱门坐标计算方法流程图；2 is a flow chart of a method for calculating door coordinates based on binocular images according to an embodiment of the present invention;

图3为根据本发明实施例的相机采集到的图片示意图；FIG3 is a schematic diagram of a picture captured by a camera according to an embodiment of the present invention;

图4为根据本发明实施例的提取出的舱门区域子图；FIG4 is a door area sub-image extracted according to an embodiment of the present invention;

图5为根据本发明实施例的舱门门缝边缘示意图；FIG5 is a schematic diagram of a hatch door seam edge according to an embodiment of the present invention;

图6为根据本发明实施例的舱门坐标点检测缘示意图；FIG6 is a schematic diagram of a hatch door coordinate point detection edge according to an embodiment of the present invention;

图7为根据本发明实施例的基于双目图像的舱门坐标计算装置结构示意图。FIG. 7 is a schematic diagram of the structure of a device for calculating hatch coordinates based on binocular images according to an embodiment of the present invention.

具体实施方式Detailed ways

需要说明的是，在不冲突的情况下，本申请中的实施例及实施例中的特征可以相互组合。下面将参考附图并结合实施例来详细说明本发明。It should be noted that, in the absence of conflict, the embodiments and features in the embodiments of the present application can be combined with each other. The present invention will be described in detail below with reference to the accompanying drawings and in combination with the embodiments.

为了使本技术领域的人员更好地理解本发明方案，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例仅仅是本发明一部分的实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都应当属于本发明保护的范围。In order to enable those skilled in the art to better understand the scheme of the present invention, the technical scheme in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by ordinary technicians in this field without creative work should fall within the scope of protection of the present invention.

下面参照附图描述根据本发明实施例提出的基于双目图像的舱门坐标计算方法及装置。The following describes a method and device for calculating door coordinates based on binocular images according to an embodiment of the present invention with reference to the accompanying drawings.

图2是本发明一个实施例的基于双目图像的舱门坐标计算方法的流程图。FIG. 2 is a flow chart of a method for calculating door coordinates based on binocular images according to an embodiment of the present invention.

如图2所示，该基于双目图像的舱门坐标计算方法包括：As shown in FIG2 , the door coordinate calculation method based on binocular images includes:

S1，采集双目图像。S1, collect binocular images.

作为一种示例，本发明在廊桥前端安装双目相机采集图像。具体地，本发明提出的飞机舱门坐标计算方法依赖于相机采集到的飞机舱门图像，而由于图像是二维的，为了获得三维的舱门坐标，需要使用双目相机(即两个摄像头的相机)采集图像。相机采集到的图像如图3所示。As an example, the present invention installs a binocular camera at the front end of the corridor to collect images. Specifically, the aircraft door coordinate calculation method proposed in the present invention relies on the aircraft door image collected by the camera, and since the image is two-dimensional, in order to obtain the three-dimensional door coordinates, it is necessary to use a binocular camera (i.e., a camera with two cameras) to collect images. The image collected by the camera is shown in Figure 3.

本步骤的意义在于采集双目图像从而可以进一步计算舱门坐标。The significance of this step is to collect binocular images so that the hatch coordinates can be further calculated.

S2，将双目图像输入第一神经网络模型进行第一特征提取，通过边界回归得到舱门区域子图。S2, inputting the binocular image into the first neural network model to extract the first feature, and obtaining the hatch area sub-image through boundary regression.

可以理解的是，本步骤是基于深度特征的舱门区域子图提取。由于固定在廊桥上的相机在不断运动，其拍摄到的图片中所包含的内容也是千变万化的。当廊桥离飞机较近时，图片中几乎全是飞机舱门，当廊桥离飞机较远时，图片中会包含飞机机体、机场环境等其他内容。为了尽量减少图片中其他内容对舱门坐标计算的影响，需要首先提取图片中的舱门区域。It can be understood that this step is to extract the door area sub-image based on the depth feature. Since the camera fixed on the bridge is constantly moving, the content contained in the pictures it takes is also ever-changing. When the bridge is closer to the aircraft, the picture is almost entirely the aircraft door. When the bridge is farther away from the aircraft, the picture will contain other content such as the aircraft body and the airport environment. In order to minimize the impact of other content in the picture on the calculation of the door coordinates, it is necessary to first extract the door area in the picture.

本发明首先构建训练图像库，离线地训练神经网络模型。在实际运行时，将相机采集到的图片进行缩放、裁剪后，送入神经网络模型中，先进行特征提取，最后通过边界回归得到精确的舱门区域。提取出的舱门区域子图如图4所示。The present invention first constructs a training image library and trains the neural network model offline. During actual operation, the images captured by the camera are scaled and cropped, and then sent to the neural network model for feature extraction. Finally, the accurate hatch area is obtained through boundary regression. The extracted hatch area sub-image is shown in Figure 4.

本步骤的作用在于提取舱门区域的子图，从而减少相机采集的图片中的其他内容对舱门坐标计算的影响。The purpose of this step is to extract the sub-image of the hatch area, thereby reducing the influence of other contents in the image captured by the camera on the calculation of the hatch coordinates.

S3，将舱门区域子图输入第二神经网络模型进行第二特征提取，并对舱门区域子图的每个像素点做逻辑回归得到舱门门缝边缘图。S3, input the hatch area sub-image into the second neural network model to extract the second feature, and perform logistic regression on each pixel point of the hatch area sub-image to obtain the hatch door seam edge image.

可以理解的是，本步骤是基于全局曲线约束的舱门门缝边缘检测，本发明提出的舱门坐标由舱门门缝边缘计算得来，因此需要检测图像中的舱门门缝边缘。It can be understood that this step is based on the detection of the hatch door seam edge under global curve constraints. The hatch door coordinates proposed in the present invention are calculated from the hatch door seam edge, so it is necessary to detect the hatch door seam edge in the image.

具体地，本发明首先构建训练数据库，离线地训练神经网络模型。传统方法通常直接使用真实边缘图像作为监督信号来训练神经网络模型，然而比起其他种类的边缘，舱门门缝边缘是一个扭曲后的矩形，其上、下边缘的形状为直线，左、右边缘的形状为二次曲线，因此可以利用舱门门缝边缘的这种全局曲线特征来提升边缘检测的效果。Specifically, the present invention first constructs a training database and trains the neural network model offline. Traditional methods usually directly use real edge images as supervisory signals to train the neural network model. However, compared with other types of edges, the edge of the hatch door seam is a distorted rectangle, the upper and lower edges are straight lines, and the left and right edges are quadratic curves. Therefore, the global curve feature of the hatch door seam edge can be used to improve the effect of edge detection.

本发明提出在使用边缘图做监督的同时，增加额外的全局曲线约束模块来进一步优化神经网络模型。在获得边缘图后，首先通过一个边缘点采样模块得到均匀分布的舱门门缝边缘点，然后通过边缘点分类模块将采集到的边缘点划分为上、下、左、右四种，接着通过带权重的最小二乘模块计算得到上下左右四条曲线的方程，最后使用四个曲线方程构建全局曲线拟合误差，通过反向传播进一步优化边缘提取网络的性能。The present invention proposes to add an additional global curve constraint module to further optimize the neural network model while using the edge map for supervision. After obtaining the edge map, firstly, an edge point sampling module is used to obtain uniformly distributed hatch door seam edge points, and then the edge point classification module is used to divide the collected edge points into four types: upper, lower, left, and right. Then, the equations of the four curves of upper, lower, left, and right are calculated by the weighted least squares module. Finally, the four curve equations are used to construct the global curve fitting error, and the performance of the edge extraction network is further optimized by back propagation.

在实际运行时，将上述提取出的舱门区域子图送入神经网络模型中，提取特征，并对整张图像的每个像素点做逻辑回归，得到精确的舱门门缝边缘，如图5所示。During actual operation, the extracted hatch area sub-image is fed into the neural network model to extract features, and a logistic regression is performed on each pixel of the entire image to obtain the precise hatch door gap edge, as shown in FIG5 .

本步骤的作用在于检测图像中的舱门门缝边缘，用于后续的坐标点计算。同时提出了基于全局曲线约束的优化方法，提升了边缘检测的准确性。The purpose of this step is to detect the edge of the hatch door gap in the image for subsequent coordinate point calculation. At the same time, an optimization method based on global curve constraints is proposed to improve the accuracy of edge detection.

S4，基于舱门门缝边缘图提取边缘点的坐标，计算得到舱门的两个二维坐标点。S4, extracting the coordinates of the edge points based on the hatch door seam edge map, and calculating two two-dimensional coordinate points of the hatch door.

可以理解的是，本步骤是基于多曲线拟合的二维坐标点计算。具体地，登机廊桥对接时需要将廊桥的地板对接到舱门踏板附近，因此本发明使用舱门的左、右边缘的延长线与舱门下边缘的延长线相交形成的两个交点作为舱门的坐标点。It is understandable that this step is a two-dimensional coordinate point calculation based on multi-curve fitting. Specifically, when docking the boarding bridge, the floor of the bridge needs to be docked near the door pedal. Therefore, the present invention uses the two intersection points formed by the extension lines of the left and right edges of the door and the extension line of the lower edge of the door as the coordinate points of the door.

在获得舱门门缝边缘后，可以进一步提取出每个边缘点的坐标，使用上述的边缘点分类模块分别提取出左边缘点集、右边缘点集和下边缘点集，然后使用ransac方法将左、右边缘点集拟合为两条二次曲线，将下边缘点集拟合为直线，分别计算两条二次曲线与直线的交点，得到舱门的两个坐标点，如图6所示。After obtaining the edge of the hatch door seam, the coordinates of each edge point can be further extracted. The left edge point set, right edge point set and lower edge point set are extracted respectively using the above-mentioned edge point classification module. Then, the left and right edge point sets are fitted into two quadratic curves using the RANSAC method, and the lower edge point set is fitted into a straight line. The intersection points of the two quadratic curves and the straight line are calculated respectively to obtain the two coordinate points of the hatch door, as shown in Figure 6.

则本步骤的作用在于从图像中计算舱门坐标点的二维坐标，为后续计算三维坐标做准备。The purpose of this step is to calculate the two-dimensional coordinates of the hatch door coordinate point from the image, in preparation for the subsequent calculation of the three-dimensional coordinates.

S5，基于两个二维坐标点，通过三角化计算得到舱门的两个三维坐标点。S5, based on the two two-dimensional coordinate points, obtain two three-dimensional coordinate points of the hatch through triangulation calculation.

具体地，通过上述步骤，可以分别计算两张双目图像对应的两个坐标点，此时通过三角化方法可以得到舱门的三维坐标点。假设图5中左图计算得到的两个坐标点分别为右图的两个坐标点分别为假设双目相机焦距为f，基线为B，则可以分别计算两个坐标点的深度值为：Specifically, through the above steps, the two coordinate points corresponding to the two binocular images can be calculated respectively, and the three-dimensional coordinate points of the hatch can be obtained by triangulation. Assume that the two coordinate points calculated in the left figure of Figure 5 are The two coordinate points in the right figure are Assuming the focal length of the binocular camera is f and the baseline is B, the depth values of the two coordinate points can be calculated as follows:

假设相机的内参矩阵为K，则可以进一步计算两个点的三维坐标为：Assuming that the intrinsic parameter matrix of the camera is K, the three-dimensional coordinates of the two points can be further calculated as:

由此，本步骤的作用在于将图像中计算得到的舱门坐标点二维坐标转换为相对于廊桥的三维坐标，从而可以引导廊桥对接到飞机舱门上。Therefore, the purpose of this step is to convert the two-dimensional coordinates of the door coordinate point calculated in the image into three-dimensional coordinates relative to the corridor bridge, so as to guide the corridor bridge to dock with the aircraft door.

根据本发明实施例的基于双目图像的舱门坐标计算方法，可以实时准确地计算飞机舱门相对于登机廊桥的三维坐标，从而精确地将登机廊桥引导对接到飞机舱门上。本发明使用双目相机采集图像，训练神经网络提取舱门区域子图，利用舱门门缝边缘可以近似为四条曲线的特性，实现了一种基于全局曲线约束的门缝边缘检测方法，并进一步将边缘点拟合为多条曲线，计算曲线的交点并使用三角化方法计算得到了舱门的三维点坐标。与其他方法相比，本发明能够精确计算6个维度的舱门坐标，为登机廊桥引导系统提供更加充分准确的舱门位置信息，从根本上提高了廊桥引导系统的对接精度，有效解决了传统廊桥对接方法人力成本高、事故频发的问题，可以在机场的智能化和自动化方面发挥作用。由此本发明具有较大的理论和实践价值。According to the door coordinate calculation method based on binocular images of the embodiment of the present invention, the three-dimensional coordinates of the aircraft door relative to the boarding bridge can be calculated in real time and accurately, so as to accurately guide the boarding bridge to dock with the aircraft door. The present invention uses a binocular camera to collect images, trains a neural network to extract the door area sub-graph, and uses the characteristic that the edge of the door seam can be approximated as four curves to realize a door seam edge detection method based on global curve constraints, and further fits the edge points into multiple curves, calculates the intersection of the curves, and uses the triangulation method to calculate the three-dimensional point coordinates of the door. Compared with other methods, the present invention can accurately calculate the door coordinates in 6 dimensions, provide more sufficient and accurate door position information for the boarding bridge guidance system, fundamentally improve the docking accuracy of the bridge guidance system, effectively solve the problems of high labor cost and frequent accidents of the traditional bridge docking method, and can play a role in the intelligence and automation of the airport. Therefore, the present invention has great theoretical and practical value.

为了实现上述实施例，如图7所示，本实施例中还提供了基于双目图像的舱门坐标计算装置10，该装置10包括：采集模块100、第一提取模块200、第二提取模块300、第一计算模块400和第二计算模块500。In order to implement the above embodiment, as shown in Figure 7, a cabin door coordinate calculation device 10 based on binocular images is also provided in this embodiment. The device 10 includes: a collection module 100, a first extraction module 200, a second extraction module 300, a first calculation module 400 and a second calculation module 500.

采集模块100，用于采集双目图像；The acquisition module 100 is used to acquire binocular images;

第一提取模块200，用于将双目图像输入第一神经网络模型进行第一特征提取，通过边界回归得到舱门区域子图；A first extraction module 200, configured to input the binocular image into a first neural network model to perform first feature extraction, and obtain a hatch area sub-image by boundary regression;

第二提取模块300，用于将舱门区域子图输入第二神经网络模型进行第二特征提取，并对舱门区域子图的每个像素点做逻辑回归得到舱门门缝边缘图；The second extraction module 300 is used to input the hatch area sub-image into the second neural network model to perform second feature extraction, and perform logistic regression on each pixel point of the hatch area sub-image to obtain a hatch door seam edge image;

第一计算模块400，用于基于舱门门缝边缘图提取边缘点的坐标，计算得到舱门的两个二维坐标点；A first calculation module 400 is used to extract the coordinates of edge points based on the hatch door seam edge map, and calculate two two-dimensional coordinate points of the hatch door;

第二计算模块500，用于基于两个二维坐标点，通过三角化计算得到舱门的两个三维坐标点。The second calculation module 500 is used to obtain two three-dimensional coordinate points of the hatch through triangulation calculation based on the two two-dimensional coordinate points.

进一步地，还包括：第一训练模块，用于构建训练图像库，训练得到第一神经网络模型；第二训练模块，用于构建训练数据库，训练得到第二神经网络模型。Furthermore, it also includes: a first training module, used to build a training image library, and train to obtain a first neural network model; a second training module, used to build a training database, and train to obtain a second neural network model.

进一步地，上述第二提取模块300，还包括：Furthermore, the second extraction module 300 further includes:

采样模块，用于通过边缘点采样，得到舱门门缝边缘图的舱门门缝边缘点；A sampling module, used for obtaining hatch door seam edge points of a hatch door seam edge map by edge point sampling;

分类模块，用于通过边缘点分类，将边缘点分为上、下、左、右四类，通过带权重的最小二乘模块计算得到四个曲线方程；The classification module is used to classify edge points into four categories: up, down, left, and right, and calculate four curve equations through the weighted least squares module;

拟合模块，用于使用四个曲线方程构建全局曲线拟合误差。A fitting module is used to construct a global curve fitting error using four curve equations.

根据本发明实施例提出的基于双目图像的舱门坐标计算装置，可以实时准确地计算飞机舱门相对于登机廊桥的三维坐标，从而精确地将登机廊桥引导对接到飞机舱门上。本发明使用双目相机采集图像，训练神经网络提取舱门区域子图，利用舱门门缝边缘可以近似为四条曲线的特性，并进一步将边缘点拟合为多条曲线，计算曲线的交点并使用三角化方法计算得到了舱门的三维点坐标。本发明能够精确计算6个维度的舱门坐标，为登机廊桥引导系统提供更加充分准确的舱门位置信息，从根本上提高了廊桥引导系统的对接精度，有效解决了传统廊桥对接方法人力成本高、事故频发的问题，可以在机场的智能化和自动化方面发挥作用。由此本发明具有较大的理论和实践价值。According to the binocular image-based door coordinate calculation device proposed in the embodiment of the present invention, the three-dimensional coordinates of the aircraft door relative to the boarding bridge can be calculated in real time and accurately, so as to accurately guide the boarding bridge to dock with the aircraft door. The present invention uses a binocular camera to collect images, trains a neural network to extract the door area sub-graph, utilizes the characteristic that the edge of the door seam can be approximated as four curves, and further fits the edge points into multiple curves, calculates the intersection of the curves and uses the triangulation method to calculate the three-dimensional point coordinates of the door. The present invention can accurately calculate the door coordinates in 6 dimensions, provide more sufficient and accurate door position information for the boarding bridge guidance system, fundamentally improve the docking accuracy of the bridge guidance system, effectively solve the problems of high labor cost and frequent accidents of the traditional bridge docking method, and can play a role in the intelligence and automation of the airport. Therefore, the present invention has great theoretical and practical value.

需要说明的是，前述对基于双目图像的舱门坐标计算方法实施例的解释说明也适用于该实施例的基于双目图像的舱门坐标计算装置，此处不再赘述。It should be noted that the above explanation of the embodiment of the method for calculating the cabin door coordinates based on binocular images is also applicable to the device for calculating the cabin door coordinates based on binocular images of this embodiment, and will not be repeated here.

此外，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本发明的描述中，“多个”的含义是至少两个，例如两个，三个等，除非另有明确具体的限定。In addition, the terms "first" and "second" are used for descriptive purposes only and should not be understood as indicating or implying relative importance or implicitly indicating the number of the indicated technical features. Therefore, the features defined as "first" and "second" may explicitly or implicitly include at least one of the features. In the description of the present invention, the meaning of "plurality" is at least two, such as two, three, etc., unless otherwise clearly and specifically defined.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外，在不相互矛盾的情况下，本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of this specification, the description with reference to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" etc. means that the specific features, structures, materials or characteristics described in conjunction with the embodiment or example are included in at least one embodiment or example of the present invention. In this specification, the schematic representations of the above terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials or characteristics described may be combined in any one or more embodiments or examples in a suitable manner. In addition, those skilled in the art may combine and combine the different embodiments or examples described in this specification and the features of the different embodiments or examples, without contradiction.

尽管上面已经示出和描述了本发明的实施例，可以理解的是，上述实施例是示例性的，不能理解为对本发明的限制，本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present invention have been shown and described above, it is to be understood that the above embodiments are exemplary and are not to be construed as limitations of the present invention. A person skilled in the art may change, modify, replace and vary the above embodiments within the scope of the present invention.

Claims

1. A method for calculating the door coordinates based on binocular images, comprising the following steps:

Collect binocular images;

Inputting the binocular image into a first neural network model to extract a first feature, and obtaining a hatch area sub-image by boundary regression;

Inputting the hatch area sub-image into a second neural network model to perform second feature extraction, and performing logistic regression on each pixel point of the hatch area sub-image to obtain a hatch door seam edge image;

Extracting the coordinates of edge points based on the hatch door gap edge map, and calculating two two-dimensional coordinate points of the hatch door;

Based on the two two-dimensional coordinate points, two three-dimensional coordinate points of the hatch are obtained by triangulation calculation;

The binocular image includes a first binocular image and a second binocular image, and the two three-dimensional coordinate points of the hatch are obtained by triangulation based on the two two-dimensional coordinate points, including:

The two two-dimensional coordinate points of the first binocular image obtained by preset calculation are , , the two two-dimensional coordinate points of the second binocular image are , , the preset focal length of the binocular camera is , the baseline is , respectively calculate the depth values of the two coordinate points of the first binocular image and the second binocular image:

The internal parameter matrix of the binocular camera is preset as , respectively calculate the three-dimensional coordinates of two points in the first binocular image and the second binocular image as:

.

2. The method according to claim 1 is characterized in that a training image library is constructed to obtain the first neural network model through training; and a training database is constructed to obtain the second neural network model through training.

3. The method according to claim 1, characterized in that after obtaining the hatch door gap edge map, it also includes:

Obtaining hatch door seam edge points of the hatch door seam edge map by edge point sampling;

By classifying edge points, the edge points are divided into four categories: up, down, left, and right, and four curve equations are obtained by calculating the least squares module with weights;

A global curve fitting error is constructed using the four curve equations.

4. The method according to claim 3, characterized in that the step of extracting the coordinates of the edge points based on the hatch door seam edge map and calculating the two two-dimensional coordinate points of the hatch door comprises:

Based on the hatch door seam edge map, using the edge point classification to respectively extract a left edge point set, a right edge point set and a lower edge point set;

Use ransac to fit the left edge point set and the right edge point set into two quadratic curves, and fit the lower edge point set into a straight line;

The intersection points of the two quadratic curves and the straight line are calculated respectively to obtain two two-dimensional coordinate points of the hatch.

5. The method according to claim 1 is characterized in that the method also includes: scaling and cropping the acquired binocular image and inputting it into the first neural network model.

6. The method according to claim 1 is characterized in that two intersection points formed by the extension lines of the left edge and the right edge of the hatch and the extension line of the lower edge of the hatch are used as the two two-dimensional coordinate points of the hatch.

7. A door coordinate calculation device based on binocular images, comprising:

An acquisition module, used for acquiring binocular images;

A first extraction module, used for inputting the binocular image into a first neural network model to perform first feature extraction, and obtaining a hatch area sub-image by boundary regression;

A second extraction module is used to input the hatch area sub-image into a second neural network model to perform second feature extraction, and perform logistic regression on each pixel point of the hatch area sub-image to obtain a hatch door seam edge image;

A first calculation module, used for extracting the coordinates of edge points based on the hatch door seam edge map, and calculating two two-dimensional coordinate points of the hatch door;

A second calculation module, used for obtaining two three-dimensional coordinate points of the hatch through triangulation calculation based on the two two-dimensional coordinate points;

The second calculation module is also used to preset the two two-dimensional coordinate points of the first binocular image obtained by calculation as follows: , , the two 2D coordinate points of the second binocular image are , , the preset focal length of the binocular camera is , the baseline is , respectively calculate the depth values of the two coordinate points of the first binocular image and the second binocular image:

.

8. The device according to claim 7 is characterized in that it also includes: a first training module for constructing a training image library and training to obtain the first neural network model; a second training module for constructing a training database and training to obtain the second neural network model.

9. The device according to claim 7, wherein the second extraction module further comprises:

A sampling module, used for obtaining the hatch door seam edge points of the hatch door seam edge map by edge point sampling;

A classification module is used to classify edge points into four categories: upper, lower, left, and right, and obtain four curve equations by calculating with a weighted least squares module;

A fitting module is used to construct a global curve fitting error using the four curve equations.