CN110598652A

CN110598652A - Fundus data prediction method and device

Info

Publication number: CN110598652A
Application number: CN201910880048.9A
Authority: CN
Inventors: 王欣; 姚轩; 黄烨霖; 赵昕; 和超; 张大磊
Original assignee: Shanghai Eaglevision Medical Technology Co Ltd
Current assignee: Shanghai Eaglevision Medical Technology Co Ltd
Priority date: 2019-09-18
Filing date: 2019-09-18
Publication date: 2019-12-20
Anticipated expiration: 2039-09-18
Also published as: CN110598652B

Abstract

The present invention provides a fundus data prediction method and equipment, the method comprising: acquiring a fundus image; using a machine learning model to predict the center position of the macula, the center position of the optic disc, and the size of the optic disc in the fundus image, and the machine learning model is in A heat map is generated during the prediction process, and the position of the center of the macula and the center of the optic disc are determined according to the pixel values of the heat map.

Description

Fundus data prediction method and device

技术领域technical field

本发明涉及眼科图像检测领域，具体涉及一种眼底数据预测方法和设备。The invention relates to the field of ophthalmology image detection, in particular to a fundus data prediction method and equipment.

背景技术Background technique

在医疗领域中，黄斑在眼底视神经盘的颞侧0.35cm处并稍下方，处于人眼的光学中心区，是视力轴线的投影点。黄斑位于视网膜的中心，该部位集中了大量的视觉功能细胞。黄斑区的异常经常直接导致视觉能力的下降，黄斑区的病变如果没有被及时的发现和治疗，失明的几率将大大提高。In the medical field, the macula is located at 0.35cm on the temporal side of the optic disc of the fundus and slightly below it, in the optical center of the human eye, and is the projection point of the vision axis. The macula is located in the center of the retina, where a large number of visual function cells are concentrated. Abnormalities in the macular area often directly lead to the decline of visual ability. If the lesions in the macular area are not detected and treated in time, the chance of blindness will be greatly increased.

目前，机器学习在医学领域得到了广泛的应用，尤其以深度学习为代表的机器学习技术在医疗影像领域被广泛关注。在眼底图像检测方面，深度学习技术已经被用于青光眼、糖尿病视网膜病变等病种的检测，并取得了良好效果。At present, machine learning has been widely used in the medical field, especially machine learning technology represented by deep learning has been widely concerned in the field of medical imaging. In terms of fundus image detection, deep learning technology has been used in the detection of glaucoma, diabetic retinopathy and other diseases, and achieved good results.

但是，由于眼底图像中黄斑和视盘的形态特征往往因病变程度的不同而差异巨大，导致机器学习技术难以准确分割出黄斑和视盘等影像的边界，因此现有技术的识别结果是一个大致的检测框，虽然可以在一定程度上确保检测框中包含视盘或者黄斑的全部内容，但是这种检测结果仍不够精准，从而会影响后续的异常检测结论。However, because the morphological features of the macula and optic disc in fundus images often vary greatly due to the degree of lesion, it is difficult for machine learning technology to accurately segment the boundaries of images such as the macula and optic disc. Therefore, the recognition result of the existing technology is a rough detection Although it can ensure that the detection frame contains all the content of the optic disc or macula to a certain extent, the detection result is still not accurate enough, which will affect the subsequent abnormal detection conclusion.

发明内容Contents of the invention

有鉴于此，本发明提供一种眼底数据预测方法，包括：In view of this, the present invention provides a fundus data prediction method, including:

获取眼底图像；Obtain fundus images;

利用机器学习模型预测所述眼底图像中的预测黄斑中心位置、视盘中心位置和视盘尺寸，所述机器学习模型在预测过程中生成热力图，所述黄斑中心位置和所述视盘中心位置是根据所述热力图的像素值确定的。A machine learning model is used to predict the predicted macular center position, optic disc center position and optic disc size in the fundus image. The machine learning model generates a heat map during the prediction process. The macular center position and the optic disc center position are based on the determined The pixel value of the above heat map is determined.

可选地，所述热力图是所述机器学习模型中用于提取眼底图像特征的神经网络的最后一层输出的特征图。Optionally, the heat map is a feature map output by the last layer of the neural network used to extract fundus image features in the machine learning model.

可选地，利用机器学习模型预测所述眼底图像中的黄斑中心位置、视盘中心位置包括：Optionally, using a machine learning model to predict the central position of the macula and the central position of the optic disc in the fundus image includes:

获取所述热力图中的两个峰值；Obtain the two peaks in the heat map;

确定所述两个峰值对应所述眼底图像中的两个像素点；determining that the two peaks correspond to two pixels in the fundus image;

根据所述两个像素点的像素值确定黄斑中心位置和视盘中心位置。The central position of the macula and the central position of the optic disc are determined according to the pixel values of the two pixel points.

本发明还提供一种眼底数据预测模型训练方法，包括：The present invention also provides a fundus data prediction model training method, comprising:

获取训练数据，所述训练数据包括标记了黄斑中心位置、视盘中心位置和视盘区域的眼底图像，其中所述视盘区域的标记内容用于确定视盘尺寸；Acquiring training data, the training data includes marking the fundus image of the central position of the macula, the central position of the optic disc and the optic disc area, wherein the marking content of the optic disc area is used to determine the size of the optic disc;

利用所述训练数据对机器学习模型进行训练，以使其根据输入的眼底图像预测黄斑中心位置、视盘中心位置和视盘尺寸。The machine learning model is trained by using the training data, so that it can predict the position of the center of the macula, the position of the center of the optic disc and the size of the optic disc according to the input fundus image.

可选地，训练过程采用如下损失函数：Optionally, the training process uses the following loss function:

Loss＝Lp+λLwh，Loss=Lp+λLwh,

其中Lp表示预测的黄斑中心位置、视盘中心位置与训练数据中的黄斑中心位置、视盘中心位置的差异，Lwh表示预测的视盘尺寸与训练数据中的视盘尺寸的差异，λ为权重，0＜λ＜1。Among them, Lp represents the difference between the predicted macular center position and optic disc center position and the macular center position and optic disc center position in the training data, Lwh represents the difference between the predicted optic disc size and the optic disc size in the training data, λ is the weight, 0<λ <1.

可选地，Optionally,

其中Fmap(x,y)为预测过程中所提取的特征图中的像素点的值，Heatmap(x,y)是热力图中的像素点的值，N为所述特征图中的像素点数量，x、y是像素点的坐标，Ω是特征图和热力图中像素点位置的集合。Among them, Fmap(x, y) is the value of the pixels in the feature map extracted during the prediction process, Heatmap(x, y) is the value of the pixels in the heat map, and N is the number of pixels in the feature map , x, y are the coordinates of the pixel, Ω is the set of pixel positions in the feature map and the heat map.

可选地，Optionally,

其中(x_m0,y_m0)为训练数据中的黄斑中心位置，(x_d0,y_d0)为训练数据中的视盘中心位置，σ为训练数据中的视盘尺寸信息，(x,y)是热力图中像素点的坐标。Where (x _m0 , y _m0 ) is the center position of the macula in the training data, (x _d0 , y _d0 ) is the center position of the optic disc in the training data, σ is the size information of the optic disc in the training data, (x, y) is the thermal force The coordinates of the pixels in the image.

可选地，Optionally,

其中为预测的视盘尺寸，S_d为训练数据中的视盘尺寸，所述视盘尺寸包括视盘标注框的长度信息和/或宽度信息。in is the predicted disc size, S _d is the disc size in the training data, and the disc size includes the length information and/or width information of the disc label frame.

相应地，本发明还提供一种眼底数据预测设备，包括：至少一个处理器；以及与所述至少一个处理器通信连接的存储器；其中，所述存储器存储有可被所述一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器执行上述眼底数据预测方法。Correspondingly, the present invention also provides a fundus data prediction device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be executed by the one processor. Instructions, the instructions are executed by the at least one processor, so that the at least one processor executes the above fundus data prediction method.

相应地，本发明还提供一种眼底数据预测模型训练设备，包括：至少一个处理器；以及与所述至少一个处理器通信连接的存储器；其中，所述存储器存储有可被所述一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器执行上述眼底数据预测模型训练方法。Correspondingly, the present invention also provides a fundus data prediction model training device, including: at least one processor; and a memory connected to the at least one processor; wherein, the memory stores information that can be used by the one processor Executable instructions, the instructions are executed by the at least one processor, so that the at least one processor executes the above fundus data prediction model training method.

根据本发明提供的眼底数据预测方法，结合神经网络算法和回归两方面的优势可以准确地定位到眼底图像中的视盘和黄斑中心位置。预测集合了视盘和黄斑位置信息的热力图，将视盘跟黄斑的距离和位置的先验信息隐含地融合在模型中，提高检测的准确率。本发明提供的方案在预测黄斑和视盘中心的同时还得到视盘的尺寸，可以为医生确定治疗方案提供重要依据。According to the method for predicting fundus data provided by the present invention, combining the advantages of neural network algorithm and regression can accurately locate the center of the optic disc and macula in the fundus image. Predict the heat map that integrates the position information of the optic disc and macula, and implicitly fuse the prior information of the distance and position between the optic disc and the macula into the model to improve the accuracy of detection. The solution provided by the present invention can obtain the size of the optic disc while predicting the center of the macula and the optic disc, which can provide an important basis for doctors to determine the treatment plan.

根据本发明提供的方案得到的热力图可应用于进行对眼底图像进行异常检测，利用注意力选择机制，通过将其与原图对应区域相叠加，给视盘、黄斑相对中心的区域相对更大的关注度，从而更好地模拟临床医生实际诊断方式，即在越靠近视盘和黄斑中心的位置出现的病变越危急，需要给予更多的关注，从而提高异常检测操作的准确性。The thermal map obtained according to the scheme provided by the present invention can be applied to abnormal detection of fundus images, and the attention selection mechanism is used to superimpose it with the corresponding area of the original image to give relatively larger areas of the optic disc and macula relative to the center. Attention, so as to better simulate the actual diagnosis method of clinicians, that is, the closer to the center of the optic disc and macula, the more critical the lesion appears, and more attention needs to be given, thereby improving the accuracy of abnormal detection operations.

附图说明Description of drawings

为了更清楚地说明本发明具体实施方式或现有技术中的技术方案，下面将对具体实施方式或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图是本发明的一些实施方式，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the specific embodiments of the present invention or the technical solutions in the prior art, the following will briefly introduce the accompanying drawings that need to be used in the description of the specific embodiments or prior art. Obviously, the accompanying drawings in the following description The drawings show some implementations of the present invention, and those skilled in the art can also obtain other drawings based on these drawings without creative work.

图1是一幅眼底图像；Figure 1 is a fundus image;

图2是本发明实施例中根据眼底图像得到的热力图及其与原图的结合示意图；Fig. 2 is a schematic diagram of the heat map obtained from the fundus image and its combination with the original image in the embodiment of the present invention;

图3为本发明实施例中的训练数据示意图。Fig. 3 is a schematic diagram of training data in an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合附图对本发明的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

此外，下面所描述的本发明不同实施方式中所涉及的技术特征只要彼此之间未构成冲突就可以相互结合。In addition, the technical features involved in the different embodiments of the present invention described below may be combined with each other as long as there is no conflict with each other.

本发明实施例提供一种眼底数据预测方法，该方法可以由计算机和服务器等电子设备执行。在本方法中使用了机器学习模型识别图像，所述机器学习模型可以是多种类型和结构的神经网络。该方法包括如下步骤：An embodiment of the present invention provides a fundus data prediction method, which can be executed by electronic devices such as computers and servers. In this method, a machine learning model is used to recognize images, and the machine learning model may be neural networks of various types and structures. The method comprises the steps of:

S1A，获取眼底图像。为了提高识别效率，在本实施例中对获取的眼底照片进行了预处理，得到了如图1示的眼底图像。预处理包括但不限于图像增强、去除边界等操作。在其它实施例中也可以不进行这些预处理，可直接使用眼底拍照设备采集的眼底照片。S1A, acquiring a fundus image. In order to improve the recognition efficiency, preprocessing is performed on the obtained fundus photos in this embodiment, and a fundus image as shown in FIG. 1 is obtained. Preprocessing includes but not limited to image enhancement, boundary removal and other operations. In other embodiments, these preprocessing may not be performed, and the fundus photos collected by the fundus photographing equipment may be used directly.

S2A，利用机器学习模型预测眼底图像中的黄斑中心位置、视盘中心位置和视盘尺寸。本实施例中的机器学习模型用于进行回归，该模型是利用大量标注了黄斑中心位置、视盘中心位置和视盘尺寸的样本眼底图像进行训练得到的，具体将在下文中介绍模型的训练方案。S2A, Prediction of macular center location, optic disc center location, and optic disc size in fundus images using a machine learning model. The machine learning model in this embodiment is used for regression. The model is trained by using a large number of sample fundus images marked with the central position of the macula, the central position of the optic disc and the size of the optic disc. The training scheme of the model will be introduced in detail below.

本实施例中的机器学习模型首先提取眼底图像的特征信息，然后根据特征信息得到上述三个信息，这些特征信息属于回归过程的中间结果而非最终结果。模型的特征提取部分可采用堆叠的沙漏结构(stacked hour-glass modules)，也可以选用YoloV3、DSOD的特征提取部分，或类似U-Net的结构。在此步骤中所获取的特征信息是特征提取网络的最后一层输出的二维的特征图(Feature map),最后一层可以是卷积层、池化层或者等等。The machine learning model in this embodiment first extracts the feature information of the fundus image, and then obtains the above three pieces of information according to the feature information. These feature information belong to the intermediate results of the regression process rather than the final results. The feature extraction part of the model can use stacked hour-glass modules, or the feature extraction part of YoloV3, DSOD, or a structure similar to U-Net. The feature information obtained in this step is the two-dimensional feature map (Feature map) output by the last layer of the feature extraction network. The last layer can be a convolutional layer, a pooling layer, or the like.

本实施例中的机器学习模型在预测过程中会生成热力图，这是一种对特征信息的可视化处理。以图2为例，获取机器学习模型预测过程中所提取的特征信息，可生成如图2所示左侧的热力图(也称热图，HeatMap)，在可选的实施例中，可以基于特征图生成热力图，或者特征图就是热力图。例如在预测的过程中，特征提取网络输出为特征图进行归一化处理，使其中每个点的像素值范围在0～255，即可得到热力图。在热力图中，越靠近视盘和黄斑中心位置的像素点值越大，越远离中心的像素点的值越小。The machine learning model in this embodiment generates a heat map during the prediction process, which is a visual processing of feature information. Taking Figure 2 as an example, the feature information extracted during the prediction process of the machine learning model can be obtained to generate a heat map (also known as a heat map, HeatMap) on the left side as shown in Figure 2. In an optional embodiment, it can be based on The feature map generates a heat map, or the feature map is a heat map. For example, in the prediction process, the output of the feature extraction network is normalized as a feature map, so that the pixel value of each point ranges from 0 to 255, and the heat map can be obtained. In the heat map, the value of the pixel closer to the center of the optic disc and macula is larger, and the value of the pixel farther away from the center is smaller.

如图2所示，在对预测结果影响比较大的位置生成的热度(数值)相对于其它位置更高。将热力图与眼底图像结合(右侧)可以更清楚地看出，越接近视盘和黄斑中心的位置热力图的值越高，表示模型的关注度越高，因此模型可以根据热力图的像素值来确定黄斑中心位置和视盘中心位置。As shown in Figure 2, the heat (value) generated at locations that have a greater impact on the prediction results is higher than that at other locations. Combining the heat map with the fundus image (on the right) it can be seen more clearly that the closer to the center of the optic disc and macula, the higher the value of the heat map, the higher the attention of the model, so the model can be based on the pixel value of the heat map To determine the center of the macula and the center of the optic disc.

预测结果中的黄斑中心位置和视盘中心位置可以通过像素点的坐标来表示。视盘尺寸的表达方式有多种，例如可以将视盘视区域为一个圆形区域，在此给出圆形区域的半径，或者将视盘视为一个正方形或长方形区域，在此给出方形区域的长度和/或宽度等等。The central position of the macula and the central position of the optic disc in the prediction result can be represented by coordinates of pixels. There are many ways to express the size of the optic disc. For example, the visual area of the optic disc can be regarded as a circular area, and the radius of the circular area can be given here, or the optic disc can be regarded as a square or rectangular area, and the length of the square area can be given here. and/or width etc.

图2只是为了进行说明而示出的可视化结果，实际应用时可以向用户展示热力图及其与原图的结合结果，提示用户重点观察此区域。也可以不做可视化处理，热力图将作为一种辅助数据用于后续对眼底图像进行分类或分割感兴趣区域。Figure 2 is only a visualization result shown for illustration. In actual application, the heat map and its combination with the original image can be displayed to the user, prompting the user to focus on this area. It is also possible not to perform visualization processing, and the heat map will be used as a kind of auxiliary data for subsequent classification of fundus images or segmentation of regions of interest.

根据本发明实施例提供的眼底数据预测方法，结合神经网络算法和回归两方面的优势可以准确地定位到眼底图像中的视盘和黄斑中心位置。预测集合了视盘和黄斑位置信息的热力图，将视盘跟黄斑的距离和位置的先验信息隐含地融合在模型中，提高检测的准确率。According to the fundus data prediction method provided by the embodiment of the present invention, combining the advantages of neural network algorithm and regression can accurately locate the center of the optic disc and macula in the fundus image. Predict the heat map that integrates the position information of the optic disc and macula, and implicitly fuse the prior information of the distance and position between the optic disc and the macula into the model to improve the accuracy of detection.

对于视盘区域的疾病，病变区域一般集中在视盘的一个视盘直径的范围内。对于黄斑区域的疾病，一般以病变区域距离黄斑中心凹的距离来衡量疾病的严重程度，如糖尿病性黄斑水肿，其中最重要的分级指标是水肿病变与黄斑中心凹的距离。而为了更好的确定相对距离，一般以黄斑中心凹与病变的距离与视盘直径的比值来作为衡量指标。对于一些整体性眼底疾病而言，是否波及黄斑区域和视盘区域，是衡量视力受损程度的重要依据。本发明提供的方案在预测黄斑和视盘中心的同时还得到视盘的尺寸，可以为医生确定治疗方案提供重要依据。For diseases in the optic disc area, the lesion area is generally concentrated in the range of one optic disc diameter of the optic disc. For diseases in the macular area, the severity of the disease is generally measured by the distance between the lesion area and the center of the macula, such as diabetic macular edema, where the most important grading index is the distance between the edema lesion and the center of the macula. In order to better determine the relative distance, the ratio of the distance between the fovea and the lesion to the diameter of the optic disc is generally used as a measure. For some overall fundus diseases, whether it affects the macular area and optic disc area is an important basis for measuring the degree of visual impairment. The solution provided by the present invention can obtain the size of the optic disc while predicting the center of the macula and the optic disc, which can provide an important basis for doctors to determine the treatment plan.

此外，得到的热力图可应用于进行对眼底图像进行异常检测，利用注意力选择机制，通过将其与原图对应区域相叠加，给视盘、黄斑相对中心的区域相对更大的关注度，从而更好地模拟临床医生实际诊断方式，即在越靠近视盘和黄斑中心的位置出现的病变越危急，需要给予更多的关注，从而提高异常检测操作的准确性。In addition, the obtained heat map can be applied to abnormal detection of fundus images. Using the attention selection mechanism, by superimposing it with the corresponding area of the original image, the relative center area of the optic disc and macula is given relatively greater attention, thus To better simulate the actual diagnosis method of clinicians, that is, the closer to the center of the optic disc and macula, the more critical the lesions appear, and more attention needs to be given, thereby improving the accuracy of abnormal detection operations.

在一个优选的实施例中，机器学习模型采用如下方式得到黄斑中心位置和视盘中心位置：获取热力图中的两个峰值；确定两个峰值对应眼底图像中的两个像素点；根据两个像素点的像素值确定黄斑中心位置和视盘中心位置。In a preferred embodiment, the machine learning model obtains the central position of the macula and the central position of the optic disc in the following manner: obtain two peaks in the heat map; determine that the two peaks correspond to two pixels in the fundus image; The pixel value of the point determines the location of the center of the macula and the center of the optic disc.

热力图的峰值位置最有可能是黄斑和视盘的中心位置。特征提取网络输出的特征图中有两个峰值位置，分别是黄斑中心位置和视盘中心位置。在原始眼底图像中，黄斑中心暗于视盘中心，即其像素值低于视盘中心像素值，因此根据这两个峰值位置对应到原始眼底图像中的像素值，即可区分黄斑和视盘中心。The peak location of the heatmap is most likely the center of the macula and optic disc. There are two peak positions in the feature map output by the feature extraction network, which are the center of the macula and the center of the optic disc. In the original fundus image, the center of the macula is darker than the center of the optic disc, that is, its pixel value is lower than that of the center of the optic disc. Therefore, according to the pixel values in the original fundus image corresponding to the two peak positions, the macula and the center of the optic disc can be distinguished.

本发明还提供一种眼底数据预测设备，包括：至少一个处理器；以及与所述至少一个处理器通信连接的存储器；其中，所述存储器存储有可被所述一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器执行上述眼底数据预测方法。The present invention also provides a fundus data prediction device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores instructions executable by the one processor, so The instructions are executed by the at least one processor, so that the at least one processor executes the above fundus data prediction method.

本发明实施例提供一种眼底数据预测模型训练方法，可用于训练上述实施例中所使用的机器学习模型，本方法可以由计算机和服务器等电子设备执行。本方法包括如下步骤：An embodiment of the present invention provides a method for training a fundus data prediction model, which can be used to train the machine learning model used in the above embodiments, and the method can be executed by electronic devices such as computers and servers. This method comprises the steps:

S1B，获取训练数据，训练数据包括如图3所示的标记了黄斑中心31位置、视盘中心32位置和视盘区域33的眼底图像，其中视盘区域33的标记内容用于确定视盘尺寸。在本实施中，视盘区域33的标注为正方形，其边长为视盘尺寸，可以被视为视盘的直径。在其它实施例中也可以采用圆形或其他形状对此区域进行标注。S1B, acquire training data, the training data includes the fundus image marked with the macular center 31 position, the optic disc center 32 position and the optic disc area 33 as shown in Figure 3, wherein the marked content of the optic disc area 33 is used to determine the size of the optic disc. In this implementation, the optic disc region 33 is marked as a square, and its side length is the size of the optic disc, which can be regarded as the diameter of the optic disc. In other embodiments, circles or other shapes may also be used to mark this area.

实际应用时应当获取大量的上述训练数据，如果实际的眼底图像数量不够多，可以添加一个数据增强模块基于实际的眼底图像生成训练数据，数据增强模块可采用随机翻转、镜像、旋转、平移、随机加噪声、模糊化、提高对比度、调整颜色空间等手段，基于原始的眼底图像来进行数据扩增。其原则是尽可能地模拟眼底图自然拍摄中可能出现的形态，使得生成的眼底图与实际拍摄的眼底图一致。In practical applications, a large amount of the above training data should be obtained. If the actual number of fundus images is not large enough, a data enhancement module can be added to generate training data based on the actual fundus images. The data enhancement module can use random flip, mirror image, rotation, translation, random Add noise, blur, improve contrast, adjust color space, etc., and perform data amplification based on the original fundus image. The principle is to simulate as much as possible the shapes that may appear in the natural photographing of the fundus map, so that the generated fundus map is consistent with the actual photographed fundus map.

根据上述扩充方式，基于一幅实际采集的眼底图像进行处理，可以得到多幅变换图像作为训练数据，由此可以显著提高训练数据的数量，从而优化机器学习模型的性能。According to the above expansion method, multiple transformed images can be obtained as training data based on one actually collected fundus image, which can significantly increase the amount of training data, thereby optimizing the performance of the machine learning model.

S2B，利用上述训练数据对机器学习模型进行训练，以使其根据输入的眼底图像预测黄斑中心位置、视盘中心位置和视盘尺寸。训练时应当设置一定的收敛条件，利用大量如图3所示的样本图像使机器学习模型预测的结果与实际标注的内容达到一定的一致性。S2B, using the above training data to train the machine learning model, so that it can predict the center position of the macula, the center position of the optic disc and the size of the optic disc according to the input fundus image. Certain convergence conditions should be set during training, and a large number of sample images as shown in Figure 3 should be used to make the predicted results of the machine learning model consistent with the actual marked content.

具体地，应当设定合适的损失函数来衡量预测结果与标注内容(实际数据)间的差异，并根据差异给与相应的惩罚从而使模型优化自身的参数，直到模型的预测结果与标注内容的差距足够小并保持稳定。损失函数可包括两部分，一部分用来衡量预测的视盘和黄斑的中心点与实际数据的差异，另一部分用来衡量预测的视盘尺寸与实际尺寸的差异，这两部分可以有所偏重，具体根据实际应用场景进行设置。Specifically, an appropriate loss function should be set to measure the difference between the predicted result and the labeled content (actual data), and corresponding penalties should be given according to the difference so that the model can optimize its own parameters until the predicted result of the model is consistent with the labeled content. The gap is small enough and remains stable. The loss function can include two parts, one part is used to measure the difference between the predicted center point of the optic disc and macula and the actual data, and the other part is used to measure the difference between the predicted optic disc size and the actual size. These two parts can be biased, according to Set up the actual application scenarios.

根据本发明实施例提供的眼底数据预测模型训练方法，结合神经网络算法和回归两方面的优势，利用标注了视盘和黄斑中心位置以及视盘尺寸的训练数据对机器学习模型进行训练，使其能够预测这些信息。According to the fundus data prediction model training method provided by the embodiment of the present invention, combining the advantages of neural network algorithm and regression, the machine learning model is trained by using the training data marked with the center position of the optic disc and macula and the size of the optic disc, so that it can predict these messages.

作为一个优选的实施例，训练过程采用如下损失函数：As a preferred embodiment, the training process uses the following loss function:

Loss＝Lp+λLwh，Loss=Lp+λLwh,

其中Lp表示预测的黄斑中心位置、视盘中心位置与训练数据中的黄斑中心位置、视盘中心位置的差异，Lwh表示预测的视盘尺寸与训练数据中的视盘尺寸的差异，λ为权重，0＜λ＜1。在一个具体的实施例中λ取值为0.1。Among them, Lp represents the difference between the predicted macular center position and optic disc center position and the macular center position and optic disc center position in the training data, Lwh represents the difference between the predicted optic disc size and the optic disc size in the training data, λ is the weight, 0<λ <1. In a specific embodiment, the value of λ is 0.1.

进一步地，上式中Further, in the above formula

其中Fmap(x,y)为预测过程中所提取的特征图中的像素点的值，Heatmap(x,y)是热力图中的像素点的值，N为所述特征图中的像素点数量，x、y是像素点的坐标，Ω是像素位置的集合。在模型训练过程中，将heatmap(热力图)作为卷积网络生成的特征图的目标，即模型训练时通过反向传播，使卷积网络生成的特征图Fmap(特征图)尽可能接近heatmap。Among them, Fmap(x, y) is the value of the pixels in the feature map extracted during the prediction process, Heatmap(x, y) is the value of the pixels in the heat map, and N is the number of pixels in the feature map , x, y are the coordinates of the pixel point, and Ω is the set of pixel positions. In the process of model training, the heatmap (heat map) is used as the target of the feature map generated by the convolutional network, that is, the feature map Fmap (feature map) generated by the convolutional network is as close as possible to the heatmap through backpropagation during model training.

进一步地，上式中Further, in the above formula

更进一步地，本实施例采用如下方式获得热力图：Furthermore, in this embodiment, the heat map is obtained in the following manner:

其中(x_m0,y_m0)为训练数据中的黄斑中心位置，(x_d0,y_d0)为训练数据中的视盘中心位置，σ为训练数据中的视盘尺寸信息，(x,y)是heatmap对应像素点的坐标位置。Where (x _m0 , y _m0 ) is the center position of the macula in the training data, (x _d0 , y _d0 ) is the center position of the optic disc in the training data, σ is the size information of the optic disc in the training data, (x, y) is the heatmap Corresponding to the coordinate position of the pixel point.

σ²＝wh/4，w和h分别为训练数据中的视盘区域的宽和高，σ相当于视盘半径。σ ² =wh/4, w and h are the width and height of the optic disc area in the training data respectively, and σ is equivalent to the radius of the optic disc.

本发明还提供一种眼底数据预测模型训练设备，包括：至少一个处理器；以及与所述至少一个处理器通信连接的存储器；其中，所述存储器存储有可被所述一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器执行上述眼底数据预测模型训练方法。The present invention also provides a fundus data prediction model training device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores instructions that can be executed by the one processor , the instructions are executed by the at least one processor, so that the at least one processor executes the above fundus data prediction model training method.

本领域内的技术人员应明白，本发明的实施例可提供为方法、系统、或计算机程序产品。因此，本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且，本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present invention may be provided as methods, systems, or computer program products. Accordingly, the present invention can take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器，使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中，使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品，该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to operate in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture comprising instruction means, the instructions The device realizes the function specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上，使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理，从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded onto a computer or other programmable data processing device, causing a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process, thereby The instructions provide steps for implementing the functions specified in the flow chart or blocks of the flowchart and/or the block or blocks of the block diagrams.

显然，上述实施例仅仅是为清楚地说明所作的举例，而并非对实施方式的限定。对于所属领域的普通技术人员来说，在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。而由此所引伸出的显而易见的变化或变动仍处于本发明创造的保护范围之中。Apparently, the above-mentioned embodiments are only examples for clear description, rather than limiting the implementation. For those of ordinary skill in the art, other changes or changes in different forms can be made on the basis of the above description. It is not necessary and impossible to exhaustively list all the implementation manners here. And the obvious changes or changes derived therefrom are still within the scope of protection of the present invention.

Claims

1. A method for predicting fundus data, comprising:

acquiring a fundus image;

predicting a predicted macular center position, a disc center position, and a disc size in the fundus image using a machine learning model that generates a thermodynamic diagram in a prediction process, the macular center position and the disc center position being determined from pixel values of the thermodynamic diagram.

2. The method of claim 1, wherein the thermodynamic diagram is a feature diagram of a last layer output of a neural network used to extract fundus image features in the machine learning model.

3. The method according to claim 1 or 2, wherein predicting the central position of the macula lutea and the central position of the optic disc in the fundus image using a machine learning model comprises:

acquiring two peaks in the thermodynamic diagram;

determining that the two peak values correspond to two pixel points in the fundus image;

and determining the central position of the macula lutea and the central position of the optic disc according to the pixel values of the two pixel points.

4. A method for training a model for predicting eye fundus data, comprising:

acquiring training data including a fundus image in which a macular center position, a disc center position, and a disc region are marked, wherein a marking content of the disc region is used to determine a disc size;

the machine learning model is trained using the training data so as to predict a central position of the macula lutea, a central position of the optic disc, and a size of the optic disc from the input fundus image.

5. The method of claim 4, wherein the training process uses the following loss function:

Loss＝Lp+λLwh，

where Lp represents the difference between the predicted macular center position, the disk center position, and the macular center position, the disk center position in the training data, Lwh represents the difference between the predicted disk size and the disk size in the training data, λ is a weight, and 0 < λ < 1.

6. The method of claim 5, wherein:

the method comprises the steps of obtaining a feature diagram, extracting the feature diagram from a database, extracting the feature diagram from the database, and extracting the feature diagram from the database, wherein Fmap (x, y) is the value of a pixel point in the feature diagram extracted in the prediction process, Heatmap (x, y) is the value of the pixel point in the thermodynamic diagram, N is the number of the pixel points in the feature diagram, x and y are coordinates of the pixel point, and omega is the set of the pixel point positions in the feature diagram and the thermodynamic diagram.

7. The method of claim 6, wherein:

wherein (x)_m0,y_m0) To train the macular center position in the data, (x)_d0,y_d0) σ is the disc size information in the training data, and (x, y) is the coordinates of the pixel points in the thermodynamic diagram.

8. The method of claim 5, wherein:

whereinFor predicted disc size, S_dThe size of the optic disc in the training data comprises the length information and/or the width information of the optic disc labeling frame.

9. An apparatus for predicting eye fundus data, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the one processor to cause the at least one processor to perform the fundus data prediction method of any of claims 1-3.

10. An eye fundus data prediction model training device, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the one processor to cause the at least one processor to perform the fundus data prediction model training method of any of claims 4-8.