CN113112475B

CN113112475B - Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning

Info

Publication number: CN113112475B
Application number: CN202110396463.4A
Authority: CN
Inventors: 梁惠珠; 冯跃; 林卓胜; 朱嘉健; 李胜可; 刘慧琳
Original assignee: Wuyi University Fujian
Current assignee: Wuyi University Fujian
Priority date: 2021-04-13
Filing date: 2021-04-13
Publication date: 2023-04-18
Anticipated expiration: 2041-04-13
Also published as: CN113112475A

Abstract

The invention discloses a traditional Chinese medicine ear five-zang-organ region segmentation method and a device based on machine learning, wherein the method comprises the following steps: collecting ear images and establishing an image data set; labeling the ear image to obtain a label image; respectively expanding the ear images and the tag images; carrying out Canny operator edge detection on the expansion tag image; respectively preprocessing the extended ear image, the extended tag image and the tag edge image; generating edge images of each single region of the heart, the liver, the spleen, the lung and the kidney by utilizing the gray level images and the binary images; inputting the RGB three-channel image into a learning network to obtain a predicted image; obtaining a predicted image of each single region of the heart, the liver, the spleen, the lung and the kidney from the predicted image; respectively carrying out expansion and corrosion treatment on the predicted image to obtain a single-region rough edge prediction image; calculating a total loss function; and calculating the minimum value of the total loss function to make the learning network converge, and segmenting the ear image by using the learning network to obtain the ear five-organ region segmentation image.

Description

A method and device for segmenting ear and five internal organs regions of traditional Chinese medicine based on machine learning

技术领域technical field

本发明涉及图像处理领域，特别涉及一种基于机器学习的中医耳部五脏区域分割方法和装置。The invention relates to the field of image processing, in particular to a machine learning-based method and device for segmenting ear five viscera regions in traditional Chinese medicine.

背景技术Background technique

近年来，机器学习广泛包含图像各个领域并获得了优越的性能，例如图像识别，目标检测，图像分割等等。值得一提的是，属于机器学习范畴的深度学习应用于医学影像的研究对辅助医师临床诊断具有重要意义，中医学望诊中的舌诊图像处理已经取得不少成果。但是耳诊的数字化辅助诊断仍罕见，尤其是耳部望诊的数字化没见以任何形式出现，其中，图像分割是临床应用的必要步骤，尽管出现基于深度学习的耳部生物信息识别，但基于深度学习的耳部区域分割较之更少，且因其各个分割区域的分界是基于中医理论，区域分界不是物体固有的分界轮廓，具有一定的模糊性，带来了分割难度。In recent years, machine learning has widely included various fields of images and achieved superior performance, such as image recognition, object detection, image segmentation and so on. It is worth mentioning that the application of deep learning, which belongs to the category of machine learning, to medical imaging research is of great significance for assisting doctors in clinical diagnosis. Many results have been achieved in tongue diagnosis image processing in traditional Chinese medicine inspection. However, the digital assisted diagnosis of ear diagnosis is still rare, especially the digitization of ear inspection has not appeared in any form, among which image segmentation is a necessary step in clinical application. The ear area segmentation of deep learning is less, and because the boundaries of each segmented area are based on the theory of traditional Chinese medicine, the area boundary is not the inherent boundary outline of the object, which has a certain degree of ambiguity, which brings difficulty in segmentation.

发明内容Contents of the invention

本发明旨在至少解决现有技术中存在的技术问题之一。为此，本发明提出一种一种基于机器学习的中医耳部五脏区域分割方法和装置，所述一种基于机器学习的中医耳部五脏区域分割方法和装置，使得中医耳部五脏对应区域能自动分割。The present invention aims to solve at least one of the technical problems existing in the prior art. For this reason, the present invention proposes a method and device for segmenting the ear five internal organs area based on machine learning in traditional Chinese medicine. Automatic segmentation.

第一方面，本发明实施例提出一种具有上述功能的中医耳部五脏区域分割方法。包括：In the first aspect, the embodiment of the present invention proposes a method for segmenting ear and five viscera regions in traditional Chinese medicine with the above-mentioned functions. include:

采集耳部图像；所述采集耳部图像包括：Gathering ear images; said gathering ear images includes:

充分利用现有的公开耳部数据集，筛选出符合中医耳诊要求的耳部图像作为数据集的原始数据；Make full use of the existing public ear data sets, and select ear images that meet the requirements of TCM ear diagnosis as the original data of the data set;

另一部分是按照中医耳诊要求，自行采集原始数据，所述自行采集原始数据为，采集设备为中医舌象采集仪，采集对象左(右)侧耳部靠近采集仪后，头部向左(右)旋转至约30°合适的角度，使得耳部五脏区域在采集过程中不被遮挡，原始采集图像中包含耳部、脸部以及采集仪器边缘，原始采集图像通过训练好的Opencv耳部检测器模型检测出耳部位置，检测方框存在没有完全包含耳部区域的情况，因此以检测方框为中心，以检测方框宽和高的1.6倍进行剪裁，剪裁后的图像中，完整的耳部区域占整张图像面积约一半，得到所述耳部图像。The other part is to collect raw data by itself according to the requirements of TCM ear diagnosis. The self-collected raw data is that the collection device is a TCM tongue image collection instrument. ) to a suitable angle of about 30°, so that the five internal organs of the ear are not blocked during the collection process. The original collected image includes the ear, face and the edge of the collection instrument. The original collected image passes through the trained Opencv ear detector The model detects the position of the ear, and the detection box does not completely cover the ear area. Therefore, the detection box is centered and cut at 1.6 times the width and height of the detection box. In the cropped image, the complete ear The ear region accounts for about half of the entire image area, and the ear image is obtained.

对所述耳部图像进行标注，得到标签图像；Annotating the ear image to obtain a label image;

分别扩充所述耳部图像和所述标签图像，得到扩充耳部图像和扩充标签图像；Expanding the ear image and the label image respectively to obtain the expanded ear image and the expanded label image;

对所述扩充标签图像进行Canny算子边缘检测，得到标签边缘图像；Carry out Canny operator edge detection to described extended label image, obtain label edge image;

分别对所述扩充耳部图像、所述扩充标签图像和所述标签边缘图像进行预处理，分别得到RGB三通道图像、灰度图像和二值图像；Preprocessing the expanded ear image, the expanded label image and the label edge image respectively to obtain an RGB three-channel image, a grayscale image and a binary image;

利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像；Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney;

将所述RGB三通道图像输入学习网络，得到预测图像；Input the RGB three-channel image into the learning network to obtain a predicted image;

从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像；obtaining prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images;

分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理，得到其对应的区域的单区域粗略边缘预测图；Carrying out dilation and erosion processing on the prediction images of the single regions of the heart, liver, spleen, lung and kidney respectively, to obtain a single region rough edge prediction map of the corresponding regions;

计算总损失函数，所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数；calculating a total loss function, the total loss function including a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map;

计算所述总损失函数的最小值，使所述学习网络收敛，利用所述学习网络对所述耳部图像进行分割，得到耳部五脏区域分割图像。The minimum value of the total loss function is calculated to make the learning network converge, and the learning network is used to segment the ear image to obtain a segmented image of the five viscera regions of the ear.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，至少具有如下技术效果:使得中医耳部五脏对应区域能自动分割。According to the machine learning-based method for segmenting the five internal organs of the ears of traditional Chinese medicine according to the embodiments of the present invention, at least the following technical effect is achieved: the corresponding regions of the five internal organs of the ears of traditional Chinese medicine can be automatically segmented.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，所述扩充所述耳部图像和所述标签图像，得到扩充耳部图像和扩充标签图像，包括：According to the machine learning-based method for segmenting five viscera regions of the ear in traditional Chinese medicine according to an embodiment of the present invention, the expansion of the ear image and the label image to obtain the expanded ear image and the expanded label image includes:

对所述耳部图像进行旋转、改变所述耳部图像的尺寸、对所述耳部图像进行水平翻转和对所述耳部图像进行伽马变换，得到所述扩充耳部图像；Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and performing gamma transformation on the ear image to obtain the expanded ear image;

对所述标签图像进行与所述耳部图像相同的旋转、改变所述标签图像的尺寸、对所述标签图像进行水平翻转，并使其仍只拥有五种像素值对应心、肝、脾、肺、肾五种区域，且灰度像素值分别为50、100、150、200、250，除上述五种区域外为背景区域，像素值为0，得到所述扩充标签图像。Perform the same rotation on the label image as the ear image, change the size of the label image, flip the label image horizontally, and make it still only have five pixel values corresponding to heart, liver, spleen, The five areas of lung and kidney, and the gray pixel values are 50, 100, 150, 200, 250 respectively, except the above five areas are background areas, the pixel value is 0, and the extended label image is obtained.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，对所述耳部图像进行标注，得到标签图像，包括：According to the machine learning-based method for segmenting the ear five internal organs region of traditional Chinese medicine according to an embodiment of the present invention, the ear image is marked to obtain a label image, including:

按照中国标准耳穴定位示意图对对所述耳部图像使用LabelMe工具箱进行标注；According to the Chinese standard auricular point positioning schematic diagram, the ear image is marked using the LabelMe toolbox;

对所述耳部图像的标注信息进行转换，得到每一个所述耳部图像对应的标签图像，所述标签图像中对应中国标准耳穴定位示意图中心、肝、脾、肺、肾以及其余区域为背景六个区域的分别以不同的像素值区分。Convert the labeling information of the ear images to obtain a label image corresponding to each of the ear images, in which the center, liver, spleen, lung, kidney and other areas of the label image correspond to the Chinese standard auricular point positioning schematic diagram as the background Each of the six regions is distinguished by different pixel values.

所述耳部图像以及对应的经过专业中医医师核查的所述标签图像组成的数据集按60％，20％，20％的比例划分成训练集、验证集以及测试集，以深度神经网络U-Net作为训练模型，交并比(Intersection over Union)IoU作为技术指标。一方面，在训练过程中，验证集中五个标注区域(心、肝、脾、肺、肾)的交并比响应顺序为肺、肾、肝、脾、心，另一方面，训练完成后在测试集上五个标注区域的平均交并比在0.38以上。达到上述两个方面的指标的数据集才确定为最后的数据集。The ear image and the corresponding data set composed of the label image checked by a professional TCM physician are divided into a training set, a verification set and a test set according to a ratio of 60%, 20%, and 20%, and the deep neural network U- Net is used as a training model, and Intersection over Union (Intersection over Union) IoU is used as a technical indicator. On the one hand, during the training process, the response sequence of the intersection ratio of the five labeled regions (heart, liver, spleen, lung, and kidney) in the verification set is lung, kidney, liver, spleen, and heart; The average intersection ratio of the five labeled regions on the test set is above 0.38. The data set that meets the above two indicators is determined as the final data set.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，分别对所述扩充耳部图像和所述标签边缘图像进行预处理，包括：According to the machine learning-based method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention, the preprocessing of the expanded ear image and the label edge image is performed respectively, including:

对所述扩充耳部图像进行位置剪切，归一化操作，得到尺寸为W*H的所述RGB三通道图像。Carry out position cut and normalization operation on the expanded ear image to obtain the RGB three-channel image with a size of W*H.

对所述扩充标签图像进行与所述扩充耳部图像对应的位置剪切，得到尺寸为W*H的所述灰度图像，所述灰度图像分为背景、心、肝、脾、肺、肾六个区域，所述六个区域的灰度像素值分别为0、1、2、3、4、5。Cutting the extended label image corresponding to the extended ear image to obtain the grayscale image with a size of W*H, the grayscale image is divided into background, heart, liver, spleen, lung, There are six areas of the kidney, and the gray pixel values of the six areas are 0, 1, 2, 3, 4, and 5, respectively.

对所述标签边缘图像进行与所述扩充耳部图像对应的位置剪切，得到尺寸为W*H的所述二值图像。Cutting the label edge image corresponding to the expanded ear image to obtain the binary image with a size of W*H.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像，包括：According to the machine learning-based method for segmenting the five viscera regions of the ear in traditional Chinese medicine, the grayscale image and the binary image are used to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney, including:

令f_label(x,y)表示在所述灰度图像任意坐标(x,y)处的像素值，令f_edge(x,y)表示在所述二值图像任意坐标(x,y)处的像素值，g_i(x,y)表示这些坐标处相应的变换的像素值，有Let f _label (x, y) represent the pixel value at any coordinate (x, y) of the grayscale image, let f _edge (x, y) represent the pixel value at any coordinate (x, y) of the binary image The pixel value of , g _i (x, y) represents the corresponding transformed pixel value at these coordinates, there is

式中，i∈{1,2,…,5}，x∈{1,2,…,W},y∈{1,2,…,H}，W和H分别是所述灰度图像和所述二值图像的宽和高，g₁、g₂、g₃、g₄、g₅分别是心、肝、脾、肺、肾的所述单个区域边缘图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the grayscale image and The width and height of the binary image, g ₁ , g ₂ , g ₃ , g ₄ , and g ₅ are the edge images of the single regions of the heart, liver, spleen, lung, and kidney, respectively.

根据权利要求1所述的基于机器学习的中医耳部五脏区域分割方法，其特征在于，将所述RGB三通道图像输入学习网络，得到预测图像，包括：According to claim 1, the traditional Chinese medical science ear five internal organs region segmentation method based on machine learning, is characterized in that, said RGB three-channel image is input into learning network, obtains predicted image, comprises:

将所述RGB三通道图像经过两个卷积核都为3*3的卷积层，得到通道为64，尺寸为H×W的特征F_origin；Pass the RGB three-channel image through two convolution layers with 3*3 convolution kernels to obtain a feature F _origin with 64 channels and a size of H×W;

将所述特征F_origin经过PAM模块得到特征F₁。Pass the feature F _origin through the PAM module to obtain the feature F ₁ .

将所述特征F₁经过PAM模块得到特征F₂。The feature F ₁ is passed through the PAM module to obtain the feature F ₂ .

将所述特征F₁与所述特征F₂分别与系数0.75、0.25相乘后的得到的两个结果进行通道上的拼接，然后通过卷积核为3*3的卷积层，得到通道为6，尺寸为H×W的所述预测图像。The two results obtained by multiplying the feature F ₁ and the feature F ₂ by the coefficients 0.75 and 0.25 respectively are spliced on the channel, and then passed through the convolution layer with a convolution kernel of 3*3 to obtain the channel as 6. The prediction image whose size is H×W.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像，包括：According to the machine learning-based method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention, the predicted images of individual regions of the heart, liver, spleen, lung, and kidney are obtained from the predicted images, including:

对所述预测图像在通道的维度上取最大值，得到尺寸为H×W,像素值只有0、1、2、3、4、5六种像素值的维度灰度图像，其中，六种像素值在所述维度灰度图像中所对应的区域分别为背景、心、肝、脾、肺、肾的预测分割区域；Take the maximum value of the predicted image in the dimension of the channel to obtain a dimension grayscale image with a size of H×W and only six pixel values of 0, 1, 2, 3, 4, and 5, wherein the six types of pixels The areas corresponding to the values in the dimension grayscale image are respectively the predicted segmentation areas of the background, heart, liver, spleen, lung, and kidney;

令P(x,y)表示在预测灰度图像任意坐标(x,y)的像素值，有Let P(x,y) denote the pixel value at any coordinate (x,y) of the predicted grayscale image, we have

式中，i∈{1,2,…,5}，x∈{1,2,…,W},y∈{1,2,…,H}，W和H分别是所述维度灰度图像的宽和高，P(x,y)＝{0,1,…,5}，p₁、p₂、p₃、p₄、p₅分别是心、肝、脾、肺、肾的所述单个区域预测图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are grayscale images of the dimensions respectively width and height, P(x,y)={0,1,...,5}, p ₁ , p ₂ , p ₃ , p ₄ , p ₅ are the heart, liver, spleen, lung and kidney respectively A single region predicts an image.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，所述分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理，得到其对应的区域的单区域粗略边缘预测图，包括：According to the machine learning-based segmentation method of TCM ears and five viscera regions, the prediction images of the individual regions of the heart, liver, spleen, lung, and kidney are respectively expanded and eroded to obtain the corresponding regions. Single region rough edge prediction map, including:

定义结构元B，所述结构元B的大小为3*3，原点位于中心，元素值都为1；Define the structure element B, the size of the structure element B is 3*3, the origin is at the center, and the element values are all 1;

使用所述结构元B膨胀和腐蚀心、肝、脾、肺、肾的各所述单区域预测图像,得到膨胀后的所述单区域预测图像和腐蚀后的所述单区域预测图像，所述单区域预测图像和腐蚀后的所述单区域预测图像相减获得其对应的所述区域粗略边缘预测图；Using the structural element B to expand and corrode each of the single-region predicted images of the heart, liver, spleen, lung, and kidney to obtain the expanded single-region predicted image and the corroded single-region predicted image, the Subtracting the single-region prediction image from the corroded single-region prediction image to obtain its corresponding rough edge prediction map of the region;

令pe_i(x,y)表示在所述区域粗略边缘预测图像任意坐标的像素值，有Let pe _i (x, y) denote the pixel value at any coordinate of the predicted image at the rough edge of the region, we have

式中，符号

为膨胀运算，符号

为腐蚀运算，i∈{1,2,…,5}，x∈{1,2,…,W},y∈{1,2,…,H}，W和H分别是图像的宽和高，pe_i(x,y)∈{0,1}，，pe₁、pe₂、pe₃、pe₄、pe₅分别是心、肝、脾、肺、肾的所述单区域粗略边缘预测图像。In the formula, the symbol

is the dilation operation, the symbol

For the corrosion operation, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the image respectively , pe _i (x, y)∈{0,1},, pe ₁ , pe ₂ , pe ₃ , pe ₄ , pe ₅ are the single-region rough edge prediction images of the heart, liver, spleen, lung, and kidney, respectively .

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，计算总损失函数，所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数：According to the machine learning-based method for segmenting the five viscera regions of TCM ears, the total loss function is calculated, and the total loss function includes the first loss function of the grayscale image and the predicted image and the edge of the single region The second loss function for the image and the single-region rough edge prediction map:

所述第一损失函数为The first loss function is

式中，x∈{1,2,…,W},y∈{1,2,…,H}，W和H分别是所述灰度图像和所述预测图像的宽和高，c∈{1,…,C}，C是分割类别数，

对于类别c经过one-hot编码的所述二值图像，

q^c(x,y)是坐标(x,y)像素属于类别c的预测概率；In the formula, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the grayscale image and the predicted image respectively, c∈{ 1,...,C}, C is the number of split categories,

For the binary image of category c after one-hot encoding,

q ^c (x, y) is the predicted probability that the pixel at coordinates (x, y) belongs to category c;

所述第二损失函数为The second loss function is

式中，i∈{1,2,…,5}，x∈{1,2,…,W},y∈{1,2,…,H}，W和H分别是所述单个区域边缘图像与所述单区域粗略边缘预测图的宽和高，g_i(x,y)为第i个区域边缘图像，g_i(x,y)∈{0,1}，AND为交运算，pe_i(x,y)为第i个区域粗略边缘预测图像，pe_i(x,y)∈{0,1}。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the edge images of the single area respectively and the width and height of the single region rough edge prediction map, g _i (x, y) is the i-th region edge image, g _i (x, y)∈{0,1}, AND is the intersection operation, pe _i (x,y) is the rough edge prediction image of the i-th region, pe _i (x,y)∈{0,1}.

所述总损失函数为The total loss function is

式中，参数N为l_i不为0的个数。In the formula, the parameter N is the number of l _i not 0.

第二方面，本发明实施例还提供了一种基于机器学习的中医耳部五脏区域分割装置，包括存储器、处理器及储存在存储器上并能够在处理器上运行的计算机程序，所述处理器执行所述程序时实现本发明第一方面所述的一种基于机器学习的中医耳部五脏区域分割方法。In the second aspect, the embodiment of the present invention also provides a machine learning-based device for segmenting ear five viscera regions in traditional Chinese medicine, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, the processor When the program is executed, the machine learning-based method for segmenting ear five viscera regions in traditional Chinese medicine described in the first aspect of the present invention is realized.

本发明的附加方面和优点将在下面的描述中部分给出，部分将从下面的描述中变得明显，或通过本发明的实践了解到。Additional aspects and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

附图说明Description of drawings

本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解，其中：The above and/or additional aspects and advantages of the present invention will become apparent and comprehensible from the description of the embodiments in conjunction with the following drawings, wherein:

图1为本发明实施例的中医耳部五脏区域分割方法的流程图；Fig. 1 is the flow chart of the method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention;

图2为本发明实施例的中医耳部五脏区域分割处理流程图；Fig. 2 is the flow chart of segmentation processing of five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention;

图3为本发明实施例的中国标准耳穴定位示意图；Fig. 3 is the Chinese standard auricular point positioning schematic diagram of the embodiment of the present invention;

具体实施方式Detailed ways

下面详细描述本发明的实施例，所述实施例的示例在附图中示出，其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的，仅用于解释本发明，而不能理解为对本发明的限制。Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

在本发明的描述中，需要理解的是，涉及到方位描述，例如上、下、前、后、左、右等指示的方位或位置关系为基于附图所示的方位或位置关系，仅是为了便于描述本发明和简化描述，而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作，因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that the orientation descriptions, such as up, down, front, back, left, right, etc. indicated orientations or positional relationships are based on the orientations or positional relationships shown in the drawings, and are only In order to facilitate the description of the present invention and simplify the description, it does not indicate or imply that the device or element referred to must have a specific orientation, be constructed and operated in a specific orientation, and thus should not be construed as limiting the present invention.

在本发明的描述中，若干的含义是一个或者多个，多个的含义是两个以上，大于、小于、超过等理解为不包括本数，以上、以下、以内等理解为包括本数。如果有描述到第一、第二只是用于区分技术特征为目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量或者隐含指明所指示的技术特征的先后关系。In the description of the present invention, several means one or more, and multiple means more than two. Greater than, less than, exceeding, etc. are understood as not including the original number, and above, below, within, etc. are understood as including the original number. If the description of the first and second is only for the purpose of distinguishing the technical features, it cannot be understood as indicating or implying the relative importance or implicitly indicating the number of the indicated technical features or implicitly indicating the order of the indicated technical features relation.

本发明的描述中，除非另有明确的限定，设置、安装、连接等词语应做广义理解，所属技术领域技术人员可以结合技术方案的具体内容合理确定上述词语在本发明中的具体含义。In the description of the present invention, unless otherwise clearly defined, words such as setting, installation, and connection should be understood in a broad sense, and those skilled in the art can reasonably determine the specific meanings of the above words in the present invention in combination with the specific content of the technical solution.

参照图1、图2和图3，描述根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法。包括：Referring to FIG. 1 , FIG. 2 and FIG. 3 , a method for segmenting ear five internal organs regions based on machine learning in traditional Chinese medicine according to a first embodiment of the present invention will be described. include:

S101：采集耳部图像；所述采集耳部图像包括：S101: Collecting ear images; the collecting ear images includes:

S102：对所述耳部图像进行标注，得到标签图像；S102: Label the ear image to obtain a label image;

S103：分别扩充所述耳部图像和所述标签图像，得到扩充耳部图像和扩充标签图像；S103: Expand the ear image and the label image respectively to obtain the expanded ear image and the expanded label image;

S104：对所述扩充标签图像进行Canny算子边缘检测，得到标签边缘图像；S104: Perform Canny operator edge detection on the extended label image to obtain a label edge image;

S105：分别对所述扩充耳部图像、所述扩充标签图像和所述标签边缘图像进行预处理，分别得到RGB三通道图像、灰度图像和二值图像；S105: Perform preprocessing on the extended ear image, the extended label image, and the label edge image, respectively, to obtain an RGB three-channel image, a grayscale image, and a binary image;

S106：利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像；S106: Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney;

S107：将所述RGB三通道图像输入学习网络，得到预测图像；S107: Input the RGB three-channel image into the learning network to obtain a predicted image;

S108：从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像；S108: Obtain prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images;

S109：分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理，得到其对应的区域的单区域粗略边缘预测图；S109: Perform dilation and erosion processing on the single-region prediction images of the heart, liver, spleen, lung, and kidney respectively, to obtain a single-region rough edge prediction image of the corresponding region;

S110：算总损失函数，所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数；S110: Calculate a total loss function, where the total loss function includes a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map;

S111：计算所述总损失函数的最小值，使所述学习网络收敛，利用所述学习网络对所述耳部图像进行分割，得到耳部五脏区域分割图像。S111: Calculate the minimum value of the total loss function, make the learning network converge, use the learning network to segment the ear image, and obtain a segmented image of the five viscera regions of the ear.

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，对所述耳部图像进行标注，得到标签图像，包括：按照中国标准耳穴定位示意图对对所述耳部图像使用LabelMe工具箱进行标注；对所述耳部图像的标注信息进行转换，得到每一个所述耳部图像对应的标签图像，所述标签图像中对应中国标准耳穴定位示意图中心、肝、脾、肺、肾以及其余区域为背景六个区域的分别以不同的像素值区分。Referring to Fig. 1 and Fig. 2, according to the method for segmenting five viscera regions of ears based on machine learning in traditional Chinese medicine according to the first embodiment of the present invention, the ear images are marked to obtain label images, including: pairing according to the Chinese standard auricular point positioning schematic diagram The ear image is marked using the LabelMe toolbox; the label information of the ear image is converted to obtain a label image corresponding to each of the ear images, and the label image corresponds to the center of the Chinese standard auricular point positioning schematic diagram, Liver, spleen, lung, kidney and other regions are distinguished by different pixel values of the six background regions.

所述耳部图像以及对应的经过专业中医医师核查的所述标签图像组成的数据集按60％，20％，20％的比例划分成训练集、验证集以及测试集，以深度神经网络U-Net作为训练模型，交并比(Intersection over Union)IoU作为技术指标。一方面，在训练过程中，验证集中五个标注区域(心、肝、脾、肺、肾)的交并比响应顺序为肺、肾、肝、脾、心。另一方面，训练完成后在测试集上五个标注区域的平均交并比在0.38以上。达到上述两个方面的指标的数据集才确定为最后的数据集。The ear image and the corresponding data set composed of the label image checked by a professional TCM physician are divided into a training set, a verification set and a test set according to a ratio of 60%, 20%, and 20%, and the deep neural network U- Net is used as a training model, and Intersection over Union (Intersection over Union) IoU is used as a technical indicator. On the one hand, during the training process, the intersection and ratio response order of the five labeled regions (heart, liver, spleen, lung, and kidney) in the validation set is lung, kidney, liver, spleen, and heart. On the other hand, after the training is completed, the average intersection ratio of the five labeled regions on the test set is above 0.38. The data set that meets the above two indicators is determined as the final data set.

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，所述扩充所述耳部图像和所述标签图像，得到扩充耳部图像和扩充标签图像，包括：Referring to Fig. 1 and Fig. 2, according to the TCM ear five internal organs region segmentation method based on machine learning according to the first embodiment of the present invention, the expansion of the ear image and the label image obtains the expanded ear image and the expanded label image ,include:

对所述耳部图像地进行旋转、改变所述耳部图像的尺寸、对所述耳部图像进行水平翻转和对所述耳部图像进行伽马变换，得到所述扩充耳部图像；Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and gamma transforming the ear image to obtain the expanded ear image;

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，分别对所述扩充耳部图像和所述标签边缘图像进行预处理，包括：Referring to Fig. 1 and Fig. 2, according to the TCM ear five internal organs region segmentation method based on machine learning according to the first embodiment of the present invention, preprocessing is performed on the expanded ear image and the label edge image respectively, including:

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像，包括：Referring to Fig. 1 and Fig. 2, according to the machine learning-based method for segmenting the five viscera regions of the ear in traditional Chinese medicine according to the first embodiment of the present invention, the grayscale image and the binary image are used to generate the heart, liver, spleen, lung, and kidney. Individual region edge images, including:

参照图1和图2，根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法，利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像，包括：Referring to Fig. 1 and Fig. 2, according to the machine learning-based TCM ear five internal organs region segmentation method according to the embodiment of the present invention, each single region of the heart, liver, spleen, lung and kidney is generated by using the grayscale image and the binary image Edge images, including:

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，其特征在于，将所述RGB三通道图像输入学习网络，得到预测图像，包括：Referring to Fig. 1 and Fig. 2, according to the method for segmenting the ear five viscera region of traditional Chinese medicine based on machine learning according to the first embodiment of the present invention, it is characterized in that the RGB three-channel image is input into the learning network to obtain a predicted image, including:

将所述特征F_origin经过PAM模块得到特征F₁；Pass the feature F _origin through the PAM module to obtain the feature F ₁ ;

将所述特征F₁经过PAM模块得到特征F₂；Pass the feature _F1 through the PAM module to obtain the feature _F2 ;

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像，包括：Referring to Fig. 1 and Fig. 2, according to the machine learning-based method for segmenting five viscera regions of the ear in traditional Chinese medicine according to the first embodiment of the present invention, the prediction images of individual regions of the heart, liver, spleen, lung, and kidney are obtained from the prediction images, include:

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，所述分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理，得到其对应的区域的单区域粗略边缘预测图，包括：Referring to Fig. 1 and Fig. 2, according to the machine learning-based TCM ear five internal organs region segmentation method according to the first embodiment of the present invention, the prediction images of the single regions of the heart, liver, spleen, lung and kidney are respectively expanded, Corrosion processing to obtain a single-region rough edge prediction map of its corresponding region, including:

式中，符号

为膨胀运算，符号

is the dilation operation, the symbol

参照图1和图2，根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法，其特征在于，包括：计算总损失函数，所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数：Referring to Fig. 1 and Fig. 2, according to the first embodiment of the present invention, the method for segmenting the five internal organs of ears based on machine learning in traditional Chinese medicine is characterized in that it includes: calculating a total loss function, the total loss function including the grayscale image and The first loss function of the predicted image and the second loss function of the single region edge image and the single region rough edge prediction map:

所述第一损失函数为The first loss function is

对于类别c经过one-hot编码的所述二值图像，

For the binary image of category c after one-hot encoding,

所述第二损失函数为The second loss function is

所述总损失函数为The total loss function is

一种基于机器学习的中医耳部五脏区域分割装置，其特征在于，包括至少一个处理器，以及与至少一个处理器连接的存储器；其中，所述存储器存储有可被所述至少一个处理器执行的指令，所述指令被所述至少一个处理器执行，以使所述至少一个处理器能够执行中医耳部五脏区域分割任一项所述的方法。A device for segmenting ear five viscera regions based on machine learning, characterized in that it includes at least one processor, and a memory connected to at least one processor; wherein, the memory stores information that can be executed by the at least one processor. Instructions, the instructions are executed by the at least one processor, so that the at least one processor can perform any one of the methods for segmenting ear and five viscera regions in traditional Chinese medicine.

应当认识到，本发明实施例中的方法步骤可以由计算机硬件、硬件和软件的组合、或者通过存储在非暂时性计算机可读存储器中的计算机指令来实现或实施。所述方法可以使用标准编程技术。每个程序可以以高级过程或面向对象的编程语言来实现以与计算机系统通信。然而，若需要，该程序可以以汇编或机器语言实现。在任何情况下，该语言可以是编译或解释的语言。此外，为此目的该程序能够在编程的专用集成电路上运行。It should be recognized that the method steps in the embodiments of the present invention may be implemented or implemented by computer hardware, a combination of hardware and software, or by computer instructions stored in a non-transitory computer-readable memory. The methods can use standard programming techniques. Each program can be implemented in a high-level procedural or object-oriented programming language to communicate with the computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language. Furthermore, the program can be run on an application specific integrated circuit programmed for this purpose.

此外，可按任何合适的顺序来执行本文描述的过程的操作，除非本文另外指示或以其他方式明显地与上下文矛盾。本文描述的过程(或变型和/或其组合)可在配置有可执行指令的一个或多个计算机系统的控制下执行，并且可作为共同地在一个或多个处理器上执行的代码(例如，可执行指令、一个或多个计算机程序或一个或多个应用)、由硬件或其组合来实现。所述计算机程序包括可由一个或多个处理器执行的多个指令。In addition, operations of processes described herein may be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The processes described herein (or variations and/or combinations thereof) can be performed under the control of one or more computer systems configured with executable instructions, and as code that collectively executes on one or more processors (e.g. , executable instructions, one or more computer programs or one or more applications), hardware or a combination thereof. The computer program comprises a plurality of instructions executable by one or more processors.

进一步，所述方法可以在可操作地连接至合适的任何类型的计算平台中实现，包括但不限于个人电脑、迷你计算机、主框架、工作站、网络或分布式计算环境、单独的或集成的计算机平台、或者与带电粒子工具或其它成像装置通信等等。本发明的各方面可以以存储在非暂时性存储介质或设备上的机器可读代码来实现，无论是可移动的还是集成至计算平台，如硬盘、光学读取和/或写入存储介质、RAM、ROM等，使得其可由可编程计算机读取，当存储介质或设备由计算机读取时可用于配置和操作计算机以执行在此所描述的过程。此外，机器可读代码，或其部分可以通过有线或无线网络传输。当此类媒体包括结合微处理器或其他数据处理器实现上文所述步骤的指令或程序时，本文所述的发明包括这些和其他不同类型的非暂时性计算机可读存储介质。当根据本发明所述的方法和技术编程时，本发明还包括计算机本身。Further, the method can be implemented in any type of computing platform operably connected to a suitable one, including but not limited to personal computer, minicomputer, main frame, workstation, network or distributed computing environment, stand-alone or integrated computer platform, or communicate with charged particle tools or other imaging devices, etc. Aspects of the invention can be implemented as machine-readable code stored on a non-transitory storage medium or device, whether removable or integrated into a computing platform, such as a hard disk, optically read and/or written storage medium, RAM, ROM, etc., such that they are readable by a programmable computer, when the storage medium or device is read by the computer, can be used to configure and operate the computer to perform the processes described herein. Additionally, the machine-readable code, or portions thereof, may be transmitted over a wired or wireless network. The invention described herein includes these and other various types of non-transitory computer-readable storage media when such media include instructions or programs that implement the steps described above in conjunction with a microprocessor or other data processor. The invention also includes the computer itself when programmed according to the methods and techniques described herein.

计算机程序能够应用于输入数据以执行本文所述的功能，从而转换输入数据以生成存储至非易失性存储器的输出数据。输出信息还可以应用于一个或多个输出设备如显示器。在本发明优选的实施例中，转换的数据表示物理和有形的对象，包括显示器上产生的物理和有形对象的特定视觉描绘。Computer programs can be applied to input data to perform the functions described herein, thereby transforming the input data to generate output data stored to non-volatile memory. Output information may also be applied to one or more output devices such as a display. In a preferred embodiment of the invention, the transformed data represents physical and tangible objects, including specific visual depictions of physical and tangible objects produced on a display.

上面结合附图对本发明实施例作了详细说明，但是本发明不限于上述实施例，在所述技术领域普通技术人员所具备的知识范围内，还可以在不脱离本发明宗旨的前提下作出各种变化。The embodiments of the present invention have been described in detail above in conjunction with the accompanying drawings, but the present invention is not limited to the above embodiments. Within the scope of knowledge possessed by those of ordinary skill in the technical field, various modifications can be made without departing from the gist of the present invention. kind of change.

在本说明书的描述中，参考术语“一个实施例”、“一些实施例”、“示意性实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不一定指的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, references to the terms "one embodiment," "some embodiments," "exemplary embodiments," "example," "specific examples," or "some examples" are intended to mean that the implementation A specific feature, structure, material, or characteristic described by an embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施例，本领域的普通技术人员可以理解：在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型，本发明的范围由权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those skilled in the art can understand that various changes, modifications, substitutions and modifications can be made to these embodiments without departing from the principle and spirit of the present invention. The scope of the invention is defined by the claims and their equivalents.

Claims

1. A method for segmenting the five internal organs of the ears of traditional Chinese medicine based on machine learning, characterized in that, comprising:

Collect ear images and build image datasets;

Annotating the ear image to obtain a label image;

Expanding the ear image and the label image respectively to obtain the expanded ear image and the expanded label image;

Carry out Canny operator edge detection to described extended label image, obtain label edge image;

Preprocessing the expanded ear image, the expanded label image, and the label edge image respectively to obtain an RGB three-channel image, a grayscale image, and a binary image, wherein the RGB three-channel image, the Both the size of the grayscale image and the binary image are W×H;

Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney;

Input the RGB three-channel image into the learning network to obtain a predicted image;

obtaining prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images;

Carrying out dilation and erosion processing on the single area prediction images of the heart, liver, spleen, lung, and kidney respectively, to obtain a single area rough edge prediction map of the corresponding area;

calculating a total loss function, the total loss function including a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map, wherein,

The first loss function is

In the formula, x∈{1, 2,..., W}, y∈{1, 2,..., H}, W are the widths of the grayscale image and the predicted image respectively, and H is The height of the grayscale image and the predicted image, c ∈ {1, 2, ..., X}, C is the number of segmentation categories,

For the binary image of category c after one-hot encoding

q ^c (x, y) is the predicted probability that the pixel at coordinate (x, y) belongs to class c;

The second loss function is

In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W are respectively the The width of the single region edge image and the single region rough edge prediction map, H is the height of the single region edge image and the single region rough edge prediction map, g _i (x, y)∈{0, 1} is the edge image of the i-th area, g _i (x, y) ∈ {0, 1}, AND is an intersection operation, pe _i (x, y) is the rough edge prediction image of the i-th area, pe _i (x, y ) ∈ {0, 1},

The total loss function is

In the formula, the parameter N is the number that l _i is not 0;

The minimum value of the total loss function is calculated to make the learning network converge, and the learning network is used to segment the ear image to obtain a segmented image of the five viscera regions of the ear.

2. a kind of traditional Chinese medicine ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, described ear image is marked, obtains label image, comprises:

According to the Chinese standard auricular point positioning schematic diagram, the ear image is marked using the LabelMe toolbox;

Convert the labeling information of the ear images to obtain a label image corresponding to each of the ear images, in which the center, liver, spleen, lung, kidney and other areas of the label image correspond to the Chinese standard auricular point positioning schematic diagram as the background Each of the six regions is distinguished by different pixel values.

3. a kind of traditional Chinese medical science ear five viscera region segmentation method based on machine learning according to claim 1, is characterized in that, described expanding described ear image and described label image, obtain expanded ear image and expanded label image ,include:

Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and performing gamma transformation on the ear image to obtain the expanded ear image;

Perform the same rotation on the label image as the ear image, change the size of the label image, flip the label image horizontally, and make it still only have five pixel values corresponding to heart, liver, spleen, The five areas of lung and kidney, and the gray pixel values are 50, 100, 150, 200, 250 respectively, except the above five areas are background areas, the pixel value is 0, and the extended label image is obtained.

4. a kind of TCM ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, preprocessing is carried out to described expansion ear image and described label edge image respectively, comprises:

Carrying out position cutting and normalization operation on the expanded ear image to obtain the RGB three-channel image whose size is W×H;

Cutting the extended label image corresponding to the extended ear image to obtain the grayscale image with a size of W×H, the grayscale image is divided into background, heart, liver, spleen, lung, Six areas of the kidney, the gray pixel values of the six areas are 0, 1, 2, 3, 4, 5 respectively;

Perform position clipping corresponding to the extended ear image on the label edge image to obtain the binary image with a size of W×H.

5. A machine learning-based method for segmenting the five viscera regions of the ears of traditional Chinese medicine according to claim 1, characterized in that, using the grayscale image and the binary image to generate the heart, liver, spleen, lung, and kidney Individual region edge images, including:

Let f _label (x, y) represent the pixel value at any coordinate (x, y) of the grayscale image, let f _edge (x, y) represent the pixel value at any coordinate (x, y) of the binary image The pixel value of , g _i (x, y) represents the corresponding transformed pixel value at these coordinates, there is

In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W are respectively the The width of the grayscale image and the binary image, H is the height of the grayscale image and the binary image respectively, g ₁ , g ₂ , g ₃ , g ₄ , g ₅ are the heart, liver, spleen , the single region edge images of lung and kidney.

6. a kind of traditional Chinese medicine ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, said RGB three-channel image input learning network, obtains predicted image, comprises:

Pass the RGB three-channel image through two convolution layers with 3*3 convolution kernels to obtain a feature F _origin with 64 channels and a size of H×W;

Pass the feature F _origin through the PAM module to obtain the feature F ₁ ;

Pass the feature _F1 through the PAM module to obtain the feature _F2 ;

The two results obtained by multiplying the feature F ₁ and the feature F ₂ by the coefficients 0.75 and 0.25 respectively are spliced on the channel, and then passed through the convolution layer with a convolution kernel of 3*3 to obtain the channel as 6. The prediction image whose size is H×W.

7. A kind of method for segmenting five internal organs regions of the ears of traditional Chinese medical science based on machine learning according to claim 1, is characterized in that, obtains each single region predictive image of heart, liver, spleen, lung, kidney from described predictive image, include:

Take the maximum value of the predicted image in the dimension of the channel to obtain a dimensional grayscale image with a size of H×W and only six pixel values of 0, 1, 2, 3, 4, and 5, wherein the six types of pixels The areas corresponding to the values in the dimension grayscale image are respectively the predicted segmentation areas of the background, heart, liver, spleen, lung, and kidney;

Let P(x, y) denote the pixel value at any coordinate (x, y) of the predicted grayscale image, we have

In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W and H are respectively The width and height of the dimensional grayscale image, P(x, y)={0, 1, ..., 5}, p ₁ , p ₂ , p ₃ , p ₄ , p ₅ are heart, liver, The single region prediction images of spleen, lung and kidney.

8. A kind of machine learning-based TCM ear five internal organs region segmentation method according to claim 7, characterized in that, said single region prediction images of the heart, liver, spleen, lung and kidney are respectively expanded, Corrosion processing to obtain a single-region rough edge prediction map of its corresponding region, including:

Define the structure element B, the size of the structure element B is 3*3, the origin is at the center, and the element values are all 1;

Use the structural element B to dilate and corrode each of the single-region predicted images of the heart, liver, spleen, lung, and kidney to obtain the expanded predicted single-region image and the corroded single-region predicted image, and after dilation Subtracting the predicted image of the single region and the predicted image of the single region after erosion to obtain the corresponding rough edge prediction map of the region;

Let pe _i (x, y) denote the pixel value at any coordinate of the predicted image at the rough edge of the region, we have

In the formula, the symbol

is the dilation operation, the symbol

For corrosion operation, i∈{1, 2,...,5}, x∈{1, 2,...,W}, y∈{1, 2,...,H}, W and H respectively is the width and height of the single region rough edge prediction image, pe _i (x, y) ∈ {0, 1}, pe ₁ , pe ₂ , pe ₃ , pe ₄ , pe ₅ are heart, liver, spleen, The single region rough edge prediction image of the lung and kidney, p _i (x, y) represents the pixel value of any coordinate (x, y) of the corresponding single region prediction image in the heart, liver, spleen, lung and kidney.

9. A traditional Chinese medical science ear five internal organs region segmentation device based on machine learning, characterized in that it comprises at least one processor, and a memory connected to said at least one processor; wherein said memory stores information that can be used by said at least Instructions executed by a processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the method according to any one of claims 1-8.