[go: up one dir, main page]

CN113112475B - Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning - Google Patents

Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning Download PDF

Info

Publication number
CN113112475B
CN113112475B CN202110396463.4A CN202110396463A CN113112475B CN 113112475 B CN113112475 B CN 113112475B CN 202110396463 A CN202110396463 A CN 202110396463A CN 113112475 B CN113112475 B CN 113112475B
Authority
CN
China
Prior art keywords
image
ear
region
label
spleen
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110396463.4A
Other languages
Chinese (zh)
Other versions
CN113112475A (en
Inventor
梁惠珠
冯跃
林卓胜
朱嘉健
李胜可
刘慧琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuyi University Fujian
Original Assignee
Wuyi University Fujian
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuyi University Fujian filed Critical Wuyi University Fujian
Priority to CN202110396463.4A priority Critical patent/CN113112475B/en
Publication of CN113112475A publication Critical patent/CN113112475A/en
Application granted granted Critical
Publication of CN113112475B publication Critical patent/CN113112475B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/13Edge detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/60Analysis of geometric attributes
    • G06T7/62Analysis of geometric attributes of area, perimeter, diameter or volume
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30196Human being; Person
    • G06T2207/30201Face

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Medical Informatics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Quality & Reliability (AREA)
  • Geometry (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a traditional Chinese medicine ear five-zang-organ region segmentation method and a device based on machine learning, wherein the method comprises the following steps: collecting ear images and establishing an image data set; labeling the ear image to obtain a label image; respectively expanding the ear images and the tag images; carrying out Canny operator edge detection on the expansion tag image; respectively preprocessing the extended ear image, the extended tag image and the tag edge image; generating edge images of each single region of the heart, the liver, the spleen, the lung and the kidney by utilizing the gray level images and the binary images; inputting the RGB three-channel image into a learning network to obtain a predicted image; obtaining a predicted image of each single region of the heart, the liver, the spleen, the lung and the kidney from the predicted image; respectively carrying out expansion and corrosion treatment on the predicted image to obtain a single-region rough edge prediction image; calculating a total loss function; and calculating the minimum value of the total loss function to make the learning network converge, and segmenting the ear image by using the learning network to obtain the ear five-organ region segmentation image.

Description

一种基于机器学习的中医耳部五脏区域分割方法和装置A method and device for segmenting ear and five internal organs regions of traditional Chinese medicine based on machine learning

技术领域technical field

本发明涉及图像处理领域,特别涉及一种基于机器学习的中医耳部五脏区域分割方法和装置。The invention relates to the field of image processing, in particular to a machine learning-based method and device for segmenting ear five viscera regions in traditional Chinese medicine.

背景技术Background technique

近年来,机器学习广泛包含图像各个领域并获得了优越的性能,例如图像识别,目标检测,图像分割等等。值得一提的是,属于机器学习范畴的深度学习应用于医学影像的研究对辅助医师临床诊断具有重要意义,中医学望诊中的舌诊图像处理已经取得不少成果。但是耳诊的数字化辅助诊断仍罕见,尤其是耳部望诊的数字化没见以任何形式出现,其中,图像分割是临床应用的必要步骤,尽管出现基于深度学习的耳部生物信息识别,但基于深度学习的耳部区域分割较之更少,且因其各个分割区域的分界是基于中医理论,区域分界不是物体固有的分界轮廓,具有一定的模糊性,带来了分割难度。In recent years, machine learning has widely included various fields of images and achieved superior performance, such as image recognition, object detection, image segmentation and so on. It is worth mentioning that the application of deep learning, which belongs to the category of machine learning, to medical imaging research is of great significance for assisting doctors in clinical diagnosis. Many results have been achieved in tongue diagnosis image processing in traditional Chinese medicine inspection. However, the digital assisted diagnosis of ear diagnosis is still rare, especially the digitization of ear inspection has not appeared in any form, among which image segmentation is a necessary step in clinical application. The ear area segmentation of deep learning is less, and because the boundaries of each segmented area are based on the theory of traditional Chinese medicine, the area boundary is not the inherent boundary outline of the object, which has a certain degree of ambiguity, which brings difficulty in segmentation.

发明内容Contents of the invention

本发明旨在至少解决现有技术中存在的技术问题之一。为此,本发明提出一种一种基于机器学习的中医耳部五脏区域分割方法和装置,所述一种基于机器学习的中医耳部五脏区域分割方法和装置,使得中医耳部五脏对应区域能自动分割。The present invention aims to solve at least one of the technical problems existing in the prior art. For this reason, the present invention proposes a method and device for segmenting the ear five internal organs area based on machine learning in traditional Chinese medicine. Automatic segmentation.

第一方面,本发明实施例提出一种具有上述功能的中医耳部五脏区域分割方法。包括:In the first aspect, the embodiment of the present invention proposes a method for segmenting ear and five viscera regions in traditional Chinese medicine with the above-mentioned functions. include:

采集耳部图像;所述采集耳部图像包括:Gathering ear images; said gathering ear images includes:

充分利用现有的公开耳部数据集,筛选出符合中医耳诊要求的耳部图像作为数据集的原始数据;Make full use of the existing public ear data sets, and select ear images that meet the requirements of TCM ear diagnosis as the original data of the data set;

另一部分是按照中医耳诊要求,自行采集原始数据,所述自行采集原始数据为,采集设备为中医舌象采集仪,采集对象左(右)侧耳部靠近采集仪后,头部向左(右)旋转至约30°合适的角度,使得耳部五脏区域在采集过程中不被遮挡,原始采集图像中包含耳部、脸部以及采集仪器边缘,原始采集图像通过训练好的Opencv耳部检测器模型检测出耳部位置,检测方框存在没有完全包含耳部区域的情况,因此以检测方框为中心,以检测方框宽和高的1.6倍进行剪裁,剪裁后的图像中,完整的耳部区域占整张图像面积约一半,得到所述耳部图像。The other part is to collect raw data by itself according to the requirements of TCM ear diagnosis. The self-collected raw data is that the collection device is a TCM tongue image collection instrument. ) to a suitable angle of about 30°, so that the five internal organs of the ear are not blocked during the collection process. The original collected image includes the ear, face and the edge of the collection instrument. The original collected image passes through the trained Opencv ear detector The model detects the position of the ear, and the detection box does not completely cover the ear area. Therefore, the detection box is centered and cut at 1.6 times the width and height of the detection box. In the cropped image, the complete ear The ear region accounts for about half of the entire image area, and the ear image is obtained.

对所述耳部图像进行标注,得到标签图像;Annotating the ear image to obtain a label image;

分别扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像;Expanding the ear image and the label image respectively to obtain the expanded ear image and the expanded label image;

对所述扩充标签图像进行Canny算子边缘检测,得到标签边缘图像;Carry out Canny operator edge detection to described extended label image, obtain label edge image;

分别对所述扩充耳部图像、所述扩充标签图像和所述标签边缘图像进行预处理,分别得到RGB三通道图像、灰度图像和二值图像;Preprocessing the expanded ear image, the expanded label image and the label edge image respectively to obtain an RGB three-channel image, a grayscale image and a binary image;

利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像;Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney;

将所述RGB三通道图像输入学习网络,得到预测图像;Input the RGB three-channel image into the learning network to obtain a predicted image;

从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像;obtaining prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images;

分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图;Carrying out dilation and erosion processing on the prediction images of the single regions of the heart, liver, spleen, lung and kidney respectively, to obtain a single region rough edge prediction map of the corresponding regions;

计算总损失函数,所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数;calculating a total loss function, the total loss function including a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map;

计算所述总损失函数的最小值,使所述学习网络收敛,利用所述学习网络对所述耳部图像进行分割,得到耳部五脏区域分割图像。The minimum value of the total loss function is calculated to make the learning network converge, and the learning network is used to segment the ear image to obtain a segmented image of the five viscera regions of the ear.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,至少具有如下技术效果:使得中医耳部五脏对应区域能自动分割。According to the machine learning-based method for segmenting the five internal organs of the ears of traditional Chinese medicine according to the embodiments of the present invention, at least the following technical effect is achieved: the corresponding regions of the five internal organs of the ears of traditional Chinese medicine can be automatically segmented.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,所述扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像,包括:According to the machine learning-based method for segmenting five viscera regions of the ear in traditional Chinese medicine according to an embodiment of the present invention, the expansion of the ear image and the label image to obtain the expanded ear image and the expanded label image includes:

对所述耳部图像进行旋转、改变所述耳部图像的尺寸、对所述耳部图像进行水平翻转和对所述耳部图像进行伽马变换,得到所述扩充耳部图像;Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and performing gamma transformation on the ear image to obtain the expanded ear image;

对所述标签图像进行与所述耳部图像相同的旋转、改变所述标签图像的尺寸、对所述标签图像进行水平翻转,并使其仍只拥有五种像素值对应心、肝、脾、肺、肾五种区域,且灰度像素值分别为50、100、150、200、250,除上述五种区域外为背景区域,像素值为0,得到所述扩充标签图像。Perform the same rotation on the label image as the ear image, change the size of the label image, flip the label image horizontally, and make it still only have five pixel values corresponding to heart, liver, spleen, The five areas of lung and kidney, and the gray pixel values are 50, 100, 150, 200, 250 respectively, except the above five areas are background areas, the pixel value is 0, and the extended label image is obtained.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,对所述耳部图像进行标注,得到标签图像,包括:According to the machine learning-based method for segmenting the ear five internal organs region of traditional Chinese medicine according to an embodiment of the present invention, the ear image is marked to obtain a label image, including:

按照中国标准耳穴定位示意图对对所述耳部图像使用LabelMe工具箱进行标注;According to the Chinese standard auricular point positioning schematic diagram, the ear image is marked using the LabelMe toolbox;

对所述耳部图像的标注信息进行转换,得到每一个所述耳部图像对应的标签图像,所述标签图像中对应中国标准耳穴定位示意图中心、肝、脾、肺、肾以及其余区域为背景六个区域的分别以不同的像素值区分。Convert the labeling information of the ear images to obtain a label image corresponding to each of the ear images, in which the center, liver, spleen, lung, kidney and other areas of the label image correspond to the Chinese standard auricular point positioning schematic diagram as the background Each of the six regions is distinguished by different pixel values.

所述耳部图像以及对应的经过专业中医医师核查的所述标签图像组成的数据集按60%,20%,20%的比例划分成训练集、验证集以及测试集,以深度神经网络U-Net作为训练模型,交并比(Intersection over Union)IoU作为技术指标。一方面,在训练过程中,验证集中五个标注区域(心、肝、脾、肺、肾)的交并比响应顺序为肺、肾、肝、脾、心,另一方面,训练完成后在测试集上五个标注区域的平均交并比在0.38以上。达到上述两个方面的指标的数据集才确定为最后的数据集。The ear image and the corresponding data set composed of the label image checked by a professional TCM physician are divided into a training set, a verification set and a test set according to a ratio of 60%, 20%, and 20%, and the deep neural network U- Net is used as a training model, and Intersection over Union (Intersection over Union) IoU is used as a technical indicator. On the one hand, during the training process, the response sequence of the intersection ratio of the five labeled regions (heart, liver, spleen, lung, and kidney) in the verification set is lung, kidney, liver, spleen, and heart; The average intersection ratio of the five labeled regions on the test set is above 0.38. The data set that meets the above two indicators is determined as the final data set.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,分别对所述扩充耳部图像和所述标签边缘图像进行预处理,包括:According to the machine learning-based method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention, the preprocessing of the expanded ear image and the label edge image is performed respectively, including:

对所述扩充耳部图像进行位置剪切,归一化操作,得到尺寸为W*H的所述RGB三通道图像。Carry out position cut and normalization operation on the expanded ear image to obtain the RGB three-channel image with a size of W*H.

对所述扩充标签图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W*H的所述灰度图像,所述灰度图像分为背景、心、肝、脾、肺、肾六个区域,所述六个区域的灰度像素值分别为0、1、2、3、4、5。Cutting the extended label image corresponding to the extended ear image to obtain the grayscale image with a size of W*H, the grayscale image is divided into background, heart, liver, spleen, lung, There are six areas of the kidney, and the gray pixel values of the six areas are 0, 1, 2, 3, 4, and 5, respectively.

对所述标签边缘图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W*H的所述二值图像。Cutting the label edge image corresponding to the expanded ear image to obtain the binary image with a size of W*H.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像,包括:According to the machine learning-based method for segmenting the five viscera regions of the ear in traditional Chinese medicine, the grayscale image and the binary image are used to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney, including:

令flabel(x,y)表示在所述灰度图像任意坐标(x,y)处的像素值,令fedge(x,y)表示在所述二值图像任意坐标(x,y)处的像素值,gi(x,y)表示这些坐标处相应的变换的像素值,有Let f label (x, y) represent the pixel value at any coordinate (x, y) of the grayscale image, let f edge (x, y) represent the pixel value at any coordinate (x, y) of the binary image The pixel value of , g i (x, y) represents the corresponding transformed pixel value at these coordinates, there is

Figure BDA0003018755690000041
Figure BDA0003018755690000041

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述灰度图像和所述二值图像的宽和高,g1、g2、g3、g4、g5分别是心、肝、脾、肺、肾的所述单个区域边缘图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the grayscale image and The width and height of the binary image, g 1 , g 2 , g 3 , g 4 , and g 5 are the edge images of the single regions of the heart, liver, spleen, lung, and kidney, respectively.

根据权利要求1所述的基于机器学习的中医耳部五脏区域分割方法,其特征在于,将所述RGB三通道图像输入学习网络,得到预测图像,包括:According to claim 1, the traditional Chinese medical science ear five internal organs region segmentation method based on machine learning, is characterized in that, said RGB three-channel image is input into learning network, obtains predicted image, comprises:

将所述RGB三通道图像经过两个卷积核都为3*3的卷积层,得到通道为64,尺寸为H×W的特征ForiginPass the RGB three-channel image through two convolution layers with 3*3 convolution kernels to obtain a feature F origin with 64 channels and a size of H×W;

将所述特征Forigin经过PAM模块得到特征F1Pass the feature F origin through the PAM module to obtain the feature F 1 .

将所述特征F1经过PAM模块得到特征F2The feature F 1 is passed through the PAM module to obtain the feature F 2 .

将所述特征F1与所述特征F2分别与系数0.75、0.25相乘后的得到的两个结果进行通道上的拼接,然后通过卷积核为3*3的卷积层,得到通道为6,尺寸为H×W的所述预测图像。The two results obtained by multiplying the feature F 1 and the feature F 2 by the coefficients 0.75 and 0.25 respectively are spliced on the channel, and then passed through the convolution layer with a convolution kernel of 3*3 to obtain the channel as 6. The prediction image whose size is H×W.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像,包括:According to the machine learning-based method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention, the predicted images of individual regions of the heart, liver, spleen, lung, and kidney are obtained from the predicted images, including:

对所述预测图像在通道的维度上取最大值,得到尺寸为H×W,像素值只有0、1、2、3、4、5六种像素值的维度灰度图像,其中,六种像素值在所述维度灰度图像中所对应的区域分别为背景、心、肝、脾、肺、肾的预测分割区域;Take the maximum value of the predicted image in the dimension of the channel to obtain a dimension grayscale image with a size of H×W and only six pixel values of 0, 1, 2, 3, 4, and 5, wherein the six types of pixels The areas corresponding to the values in the dimension grayscale image are respectively the predicted segmentation areas of the background, heart, liver, spleen, lung, and kidney;

令P(x,y)表示在预测灰度图像任意坐标(x,y)的像素值,有Let P(x,y) denote the pixel value at any coordinate (x,y) of the predicted grayscale image, we have

Figure BDA0003018755690000051
Figure BDA0003018755690000051

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述维度灰度图像的宽和高,P(x,y)={0,1,…,5},p1、p2、p3、p4、p5分别是心、肝、脾、肺、肾的所述单个区域预测图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are grayscale images of the dimensions respectively width and height, P(x,y)={0,1,...,5}, p 1 , p 2 , p 3 , p 4 , p 5 are the heart, liver, spleen, lung and kidney respectively A single region predicts an image.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,所述分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图,包括:According to the machine learning-based segmentation method of TCM ears and five viscera regions, the prediction images of the individual regions of the heart, liver, spleen, lung, and kidney are respectively expanded and eroded to obtain the corresponding regions. Single region rough edge prediction map, including:

定义结构元B,所述结构元B的大小为3*3,原点位于中心,元素值都为1;Define the structure element B, the size of the structure element B is 3*3, the origin is at the center, and the element values are all 1;

使用所述结构元B膨胀和腐蚀心、肝、脾、肺、肾的各所述单区域预测图像,得到膨胀后的所述单区域预测图像和腐蚀后的所述单区域预测图像,所述单区域预测图像和腐蚀后的所述单区域预测图像相减获得其对应的所述区域粗略边缘预测图;Using the structural element B to expand and corrode each of the single-region predicted images of the heart, liver, spleen, lung, and kidney to obtain the expanded single-region predicted image and the corroded single-region predicted image, the Subtracting the single-region prediction image from the corroded single-region prediction image to obtain its corresponding rough edge prediction map of the region;

令pei(x,y)表示在所述区域粗略边缘预测图像任意坐标的像素值,有Let pe i (x, y) denote the pixel value at any coordinate of the predicted image at the rough edge of the region, we have

Figure BDA0003018755690000052
Figure BDA0003018755690000052

式中,符号

Figure BDA0003018755690000053
为膨胀运算,符号
Figure BDA0003018755690000054
为腐蚀运算,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是图像的宽和高,pei(x,y)∈{0,1},,pe1、pe2、pe3、pe4、pe5分别是心、肝、脾、肺、肾的所述单区域粗略边缘预测图像。In the formula, the symbol
Figure BDA0003018755690000053
is the dilation operation, the symbol
Figure BDA0003018755690000054
For the corrosion operation, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the image respectively , pe i (x, y)∈{0,1},, pe 1 , pe 2 , pe 3 , pe 4 , pe 5 are the single-region rough edge prediction images of the heart, liver, spleen, lung, and kidney, respectively .

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,计算总损失函数,所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数:According to the machine learning-based method for segmenting the five viscera regions of TCM ears, the total loss function is calculated, and the total loss function includes the first loss function of the grayscale image and the predicted image and the edge of the single region The second loss function for the image and the single-region rough edge prediction map:

所述第一损失函数为The first loss function is

Figure BDA0003018755690000061
Figure BDA0003018755690000061

式中,x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述灰度图像和所述预测图像的宽和高,c∈{1,…,C},C是分割类别数,

Figure BDA0003018755690000062
对于类别c经过one-hot编码的所述二值图像,
Figure BDA0003018755690000065
qc(x,y)是坐标(x,y)像素属于类别c的预测概率;In the formula, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the grayscale image and the predicted image respectively, c∈{ 1,...,C}, C is the number of split categories,
Figure BDA0003018755690000062
For the binary image of category c after one-hot encoding,
Figure BDA0003018755690000065
q c (x, y) is the predicted probability that the pixel at coordinates (x, y) belongs to category c;

所述第二损失函数为The second loss function is

Figure BDA0003018755690000063
Figure BDA0003018755690000063

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述单个区域边缘图像与所述单区域粗略边缘预测图的宽和高,gi(x,y)为第i个区域边缘图像,gi(x,y)∈{0,1},AND为交运算,pei(x,y)为第i个区域粗略边缘预测图像,pei(x,y)∈{0,1}。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the edge images of the single area respectively and the width and height of the single region rough edge prediction map, g i (x, y) is the i-th region edge image, g i (x, y)∈{0,1}, AND is the intersection operation, pe i (x,y) is the rough edge prediction image of the i-th region, pe i (x,y)∈{0,1}.

所述总损失函数为The total loss function is

Figure BDA0003018755690000064
Figure BDA0003018755690000064

式中,参数N为li不为0的个数。In the formula, the parameter N is the number of l i not 0.

第二方面,本发明实施例还提供了一种基于机器学习的中医耳部五脏区域分割装置,包括存储器、处理器及储存在存储器上并能够在处理器上运行的计算机程序,所述处理器执行所述程序时实现本发明第一方面所述的一种基于机器学习的中医耳部五脏区域分割方法。In the second aspect, the embodiment of the present invention also provides a machine learning-based device for segmenting ear five viscera regions in traditional Chinese medicine, including a memory, a processor, and a computer program stored in the memory and capable of running on the processor, the processor When the program is executed, the machine learning-based method for segmenting ear five viscera regions in traditional Chinese medicine described in the first aspect of the present invention is realized.

本发明的附加方面和优点将在下面的描述中部分给出,部分将从下面的描述中变得明显,或通过本发明的实践了解到。Additional aspects and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

附图说明Description of drawings

本发明的上述和/或附加的方面和优点从结合下面附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become apparent and comprehensible from the description of the embodiments in conjunction with the following drawings, wherein:

图1为本发明实施例的中医耳部五脏区域分割方法的流程图;Fig. 1 is the flow chart of the method for segmenting five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention;

图2为本发明实施例的中医耳部五脏区域分割处理流程图;Fig. 2 is the flow chart of segmentation processing of five viscera regions of ears in traditional Chinese medicine according to an embodiment of the present invention;

图3为本发明实施例的中国标准耳穴定位示意图;Fig. 3 is the Chinese standard auricular point positioning schematic diagram of the embodiment of the present invention;

具体实施方式Detailed ways

下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能理解为对本发明的限制。Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

在本发明的描述中,需要理解的是,涉及到方位描述,例如上、下、前、后、左、右等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。In the description of the present invention, it should be understood that the orientation descriptions, such as up, down, front, back, left, right, etc. indicated orientations or positional relationships are based on the orientations or positional relationships shown in the drawings, and are only In order to facilitate the description of the present invention and simplify the description, it does not indicate or imply that the device or element referred to must have a specific orientation, be constructed and operated in a specific orientation, and thus should not be construed as limiting the present invention.

在本发明的描述中,若干的含义是一个或者多个,多个的含义是两个以上,大于、小于、超过等理解为不包括本数,以上、以下、以内等理解为包括本数。如果有描述到第一、第二只是用于区分技术特征为目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量或者隐含指明所指示的技术特征的先后关系。In the description of the present invention, several means one or more, and multiple means more than two. Greater than, less than, exceeding, etc. are understood as not including the original number, and above, below, within, etc. are understood as including the original number. If the description of the first and second is only for the purpose of distinguishing the technical features, it cannot be understood as indicating or implying the relative importance or implicitly indicating the number of the indicated technical features or implicitly indicating the order of the indicated technical features relation.

本发明的描述中,除非另有明确的限定,设置、安装、连接等词语应做广义理解,所属技术领域技术人员可以结合技术方案的具体内容合理确定上述词语在本发明中的具体含义。In the description of the present invention, unless otherwise clearly defined, words such as setting, installation, and connection should be understood in a broad sense, and those skilled in the art can reasonably determine the specific meanings of the above words in the present invention in combination with the specific content of the technical solution.

参照图1、图2和图3,描述根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法。包括:Referring to FIG. 1 , FIG. 2 and FIG. 3 , a method for segmenting ear five internal organs regions based on machine learning in traditional Chinese medicine according to a first embodiment of the present invention will be described. include:

S101:采集耳部图像;所述采集耳部图像包括:S101: Collecting ear images; the collecting ear images includes:

充分利用现有的公开耳部数据集,筛选出符合中医耳诊要求的耳部图像作为数据集的原始数据;Make full use of the existing public ear data sets, and select ear images that meet the requirements of TCM ear diagnosis as the original data of the data set;

另一部分是按照中医耳诊要求,自行采集原始数据,所述自行采集原始数据为,采集设备为中医舌象采集仪,采集对象左(右)侧耳部靠近采集仪后,头部向左(右)旋转至约30°合适的角度,使得耳部五脏区域在采集过程中不被遮挡,原始采集图像中包含耳部、脸部以及采集仪器边缘,原始采集图像通过训练好的Opencv耳部检测器模型检测出耳部位置,检测方框存在没有完全包含耳部区域的情况,因此以检测方框为中心,以检测方框宽和高的1.6倍进行剪裁,剪裁后的图像中,完整的耳部区域占整张图像面积约一半,得到所述耳部图像。The other part is to collect raw data by itself according to the requirements of TCM ear diagnosis. The self-collected raw data is that the collection device is a TCM tongue image collection instrument. ) to a suitable angle of about 30°, so that the five internal organs of the ear are not blocked during the collection process. The original collected image includes the ear, face and the edge of the collection instrument. The original collected image passes through the trained Opencv ear detector The model detects the position of the ear, and the detection box does not completely cover the ear area. Therefore, the detection box is centered and cut at 1.6 times the width and height of the detection box. In the cropped image, the complete ear The ear region accounts for about half of the entire image area, and the ear image is obtained.

S102:对所述耳部图像进行标注,得到标签图像;S102: Label the ear image to obtain a label image;

S103:分别扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像;S103: Expand the ear image and the label image respectively to obtain the expanded ear image and the expanded label image;

S104:对所述扩充标签图像进行Canny算子边缘检测,得到标签边缘图像;S104: Perform Canny operator edge detection on the extended label image to obtain a label edge image;

S105:分别对所述扩充耳部图像、所述扩充标签图像和所述标签边缘图像进行预处理,分别得到RGB三通道图像、灰度图像和二值图像;S105: Perform preprocessing on the extended ear image, the extended label image, and the label edge image, respectively, to obtain an RGB three-channel image, a grayscale image, and a binary image;

S106:利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像;S106: Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney;

S107:将所述RGB三通道图像输入学习网络,得到预测图像;S107: Input the RGB three-channel image into the learning network to obtain a predicted image;

S108:从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像;S108: Obtain prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images;

S109:分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图;S109: Perform dilation and erosion processing on the single-region prediction images of the heart, liver, spleen, lung, and kidney respectively, to obtain a single-region rough edge prediction image of the corresponding region;

S110:算总损失函数,所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数;S110: Calculate a total loss function, where the total loss function includes a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map;

S111:计算所述总损失函数的最小值,使所述学习网络收敛,利用所述学习网络对所述耳部图像进行分割,得到耳部五脏区域分割图像。S111: Calculate the minimum value of the total loss function, make the learning network converge, use the learning network to segment the ear image, and obtain a segmented image of the five viscera regions of the ear.

根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,至少具有如下技术效果:使得中医耳部五脏对应区域能自动分割。According to the machine learning-based method for segmenting the five internal organs of the ears of traditional Chinese medicine according to the embodiments of the present invention, at least the following technical effect is achieved: the corresponding regions of the five internal organs of the ears of traditional Chinese medicine can be automatically segmented.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,对所述耳部图像进行标注,得到标签图像,包括:按照中国标准耳穴定位示意图对对所述耳部图像使用LabelMe工具箱进行标注;对所述耳部图像的标注信息进行转换,得到每一个所述耳部图像对应的标签图像,所述标签图像中对应中国标准耳穴定位示意图中心、肝、脾、肺、肾以及其余区域为背景六个区域的分别以不同的像素值区分。Referring to Fig. 1 and Fig. 2, according to the method for segmenting five viscera regions of ears based on machine learning in traditional Chinese medicine according to the first embodiment of the present invention, the ear images are marked to obtain label images, including: pairing according to the Chinese standard auricular point positioning schematic diagram The ear image is marked using the LabelMe toolbox; the label information of the ear image is converted to obtain a label image corresponding to each of the ear images, and the label image corresponds to the center of the Chinese standard auricular point positioning schematic diagram, Liver, spleen, lung, kidney and other regions are distinguished by different pixel values of the six background regions.

所述耳部图像以及对应的经过专业中医医师核查的所述标签图像组成的数据集按60%,20%,20%的比例划分成训练集、验证集以及测试集,以深度神经网络U-Net作为训练模型,交并比(Intersection over Union)IoU作为技术指标。一方面,在训练过程中,验证集中五个标注区域(心、肝、脾、肺、肾)的交并比响应顺序为肺、肾、肝、脾、心。另一方面,训练完成后在测试集上五个标注区域的平均交并比在0.38以上。达到上述两个方面的指标的数据集才确定为最后的数据集。The ear image and the corresponding data set composed of the label image checked by a professional TCM physician are divided into a training set, a verification set and a test set according to a ratio of 60%, 20%, and 20%, and the deep neural network U- Net is used as a training model, and Intersection over Union (Intersection over Union) IoU is used as a technical indicator. On the one hand, during the training process, the intersection and ratio response order of the five labeled regions (heart, liver, spleen, lung, and kidney) in the validation set is lung, kidney, liver, spleen, and heart. On the other hand, after the training is completed, the average intersection ratio of the five labeled regions on the test set is above 0.38. The data set that meets the above two indicators is determined as the final data set.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,所述扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像,包括:Referring to Fig. 1 and Fig. 2, according to the TCM ear five internal organs region segmentation method based on machine learning according to the first embodiment of the present invention, the expansion of the ear image and the label image obtains the expanded ear image and the expanded label image ,include:

对所述耳部图像地进行旋转、改变所述耳部图像的尺寸、对所述耳部图像进行水平翻转和对所述耳部图像进行伽马变换,得到所述扩充耳部图像;Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and gamma transforming the ear image to obtain the expanded ear image;

对所述标签图像进行与所述耳部图像相同的旋转、改变所述标签图像的尺寸、对所述标签图像进行水平翻转,并使其仍只拥有五种像素值对应心、肝、脾、肺、肾五种区域,且灰度像素值分别为50、100、150、200、250,除上述五种区域外为背景区域,像素值为0,得到所述扩充标签图像。Perform the same rotation on the label image as the ear image, change the size of the label image, flip the label image horizontally, and make it still only have five pixel values corresponding to heart, liver, spleen, The five areas of lung and kidney, and the gray pixel values are 50, 100, 150, 200, 250 respectively, except the above five areas are background areas, the pixel value is 0, and the extended label image is obtained.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,分别对所述扩充耳部图像和所述标签边缘图像进行预处理,包括:Referring to Fig. 1 and Fig. 2, according to the TCM ear five internal organs region segmentation method based on machine learning according to the first embodiment of the present invention, preprocessing is performed on the expanded ear image and the label edge image respectively, including:

对所述扩充耳部图像进行位置剪切,归一化操作,得到尺寸为W*H的所述RGB三通道图像。Carry out position cut and normalization operation on the expanded ear image to obtain the RGB three-channel image with a size of W*H.

对所述扩充标签图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W*H的所述灰度图像,所述灰度图像分为背景、心、肝、脾、肺、肾六个区域,所述六个区域的灰度像素值分别为0、1、2、3、4、5。Cutting the extended label image corresponding to the extended ear image to obtain the grayscale image with a size of W*H, the grayscale image is divided into background, heart, liver, spleen, lung, There are six areas of the kidney, and the gray pixel values of the six areas are 0, 1, 2, 3, 4, and 5, respectively.

对所述标签边缘图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W*H的所述二值图像。Cutting the label edge image corresponding to the expanded ear image to obtain the binary image with a size of W*H.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像,包括:Referring to Fig. 1 and Fig. 2, according to the machine learning-based method for segmenting the five viscera regions of the ear in traditional Chinese medicine according to the first embodiment of the present invention, the grayscale image and the binary image are used to generate the heart, liver, spleen, lung, and kidney. Individual region edge images, including:

令flabel(x,y)表示在所述灰度图像任意坐标(x,y)处的像素值,令fedge(x,y)表示在所述二值图像任意坐标(x,y)处的像素值,gi(x,y)表示这些坐标处相应的变换的像素值,有Let f label (x, y) represent the pixel value at any coordinate (x, y) of the grayscale image, let f edge (x, y) represent the pixel value at any coordinate (x, y) of the binary image The pixel value of , g i (x, y) represents the corresponding transformed pixel value at these coordinates, there is

Figure BDA0003018755690000101
Figure BDA0003018755690000101

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述灰度图像和所述二值图像的宽和高,g1、g2、g3、g4、g5分别是心、肝、脾、肺、肾的所述单个区域边缘图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the grayscale image and The width and height of the binary image, g 1 , g 2 , g 3 , g 4 , and g 5 are the edge images of the single regions of the heart, liver, spleen, lung, and kidney, respectively.

参照图1和图2,根据本发明实施例的基于机器学习的中医耳部五脏区域分割方法,利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像,包括:Referring to Fig. 1 and Fig. 2, according to the machine learning-based TCM ear five internal organs region segmentation method according to the embodiment of the present invention, each single region of the heart, liver, spleen, lung and kidney is generated by using the grayscale image and the binary image Edge images, including:

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,其特征在于,将所述RGB三通道图像输入学习网络,得到预测图像,包括:Referring to Fig. 1 and Fig. 2, according to the method for segmenting the ear five viscera region of traditional Chinese medicine based on machine learning according to the first embodiment of the present invention, it is characterized in that the RGB three-channel image is input into the learning network to obtain a predicted image, including:

将所述RGB三通道图像经过两个卷积核都为3*3的卷积层,得到通道为64,尺寸为H×W的特征ForiginPass the RGB three-channel image through two convolution layers with 3*3 convolution kernels to obtain a feature F origin with 64 channels and a size of H×W;

将所述特征Forigin经过PAM模块得到特征F1Pass the feature F origin through the PAM module to obtain the feature F 1 ;

将所述特征F1经过PAM模块得到特征F2Pass the feature F1 through the PAM module to obtain the feature F2 ;

将所述特征F1与所述特征F2分别与系数0.75、0.25相乘后的得到的两个结果进行通道上的拼接,然后通过卷积核为3*3的卷积层,得到通道为6,尺寸为H×W的所述预测图像。The two results obtained by multiplying the feature F 1 and the feature F 2 by the coefficients 0.75 and 0.25 respectively are spliced on the channel, and then passed through the convolution layer with a convolution kernel of 3*3 to obtain the channel as 6. The prediction image whose size is H×W.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像,包括:Referring to Fig. 1 and Fig. 2, according to the machine learning-based method for segmenting five viscera regions of the ear in traditional Chinese medicine according to the first embodiment of the present invention, the prediction images of individual regions of the heart, liver, spleen, lung, and kidney are obtained from the prediction images, include:

对所述预测图像在通道的维度上取最大值,得到尺寸为H×W,像素值只有0、1、2、3、4、5六种像素值的维度灰度图像,其中,六种像素值在所述维度灰度图像中所对应的区域分别为背景、心、肝、脾、肺、肾的预测分割区域;Take the maximum value of the predicted image in the dimension of the channel to obtain a dimension grayscale image with a size of H×W and only six pixel values of 0, 1, 2, 3, 4, and 5, wherein the six types of pixels The areas corresponding to the values in the dimension grayscale image are respectively the predicted segmentation areas of the background, heart, liver, spleen, lung, and kidney;

令P(x,y)表示在预测灰度图像任意坐标(x,y)的像素值,有Let P(x,y) denote the pixel value at any coordinate (x,y) of the predicted grayscale image, we have

Figure BDA0003018755690000111
Figure BDA0003018755690000111

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述维度灰度图像的宽和高,P(x,y)={0,1,…,5},p1、p2、p3、p4、p5分别是心、肝、脾、肺、肾的所述单个区域预测图像。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are grayscale images of the dimensions respectively width and height, P(x,y)={0,1,...,5}, p 1 , p 2 , p 3 , p 4 , p 5 are the heart, liver, spleen, lung and kidney respectively A single region predicts an image.

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,所述分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图,包括:Referring to Fig. 1 and Fig. 2, according to the machine learning-based TCM ear five internal organs region segmentation method according to the first embodiment of the present invention, the prediction images of the single regions of the heart, liver, spleen, lung and kidney are respectively expanded, Corrosion processing to obtain a single-region rough edge prediction map of its corresponding region, including:

定义结构元B,所述结构元B的大小为3*3,原点位于中心,元素值都为1;Define the structure element B, the size of the structure element B is 3*3, the origin is at the center, and the element values are all 1;

使用所述结构元B膨胀和腐蚀心、肝、脾、肺、肾的各所述单区域预测图像,得到膨胀后的所述单区域预测图像和腐蚀后的所述单区域预测图像,所述单区域预测图像和腐蚀后的所述单区域预测图像相减获得其对应的所述区域粗略边缘预测图;Using the structural element B to expand and corrode each of the single-region predicted images of the heart, liver, spleen, lung, and kidney to obtain the expanded single-region predicted image and the corroded single-region predicted image, the Subtracting the single-region prediction image from the corroded single-region prediction image to obtain its corresponding rough edge prediction map of the region;

令pei(x,y)表示在所述区域粗略边缘预测图像任意坐标的像素值,有Let pe i (x, y) denote the pixel value at any coordinate of the predicted image at the rough edge of the region, we have

Figure BDA0003018755690000121
Figure BDA0003018755690000121

式中,符号

Figure BDA0003018755690000122
为膨胀运算,符号
Figure BDA0003018755690000123
为腐蚀运算,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是图像的宽和高,pei(x,y)∈{0,1},,pe1、pe2、pe3、pe4、pe5分别是心、肝、脾、肺、肾的所述单区域粗略边缘预测图像。In the formula, the symbol
Figure BDA0003018755690000122
is the dilation operation, the symbol
Figure BDA0003018755690000123
For the corrosion operation, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the image respectively , pe i (x, y)∈{0,1},, pe 1 , pe 2 , pe 3 , pe 4 , pe 5 are the single-region rough edge prediction images of the heart, liver, spleen, lung, and kidney, respectively .

参照图1和图2,根据本发明第一实施例的基于机器学习的中医耳部五脏区域分割方法,其特征在于,包括:计算总损失函数,所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数:Referring to Fig. 1 and Fig. 2, according to the first embodiment of the present invention, the method for segmenting the five internal organs of ears based on machine learning in traditional Chinese medicine is characterized in that it includes: calculating a total loss function, the total loss function including the grayscale image and The first loss function of the predicted image and the second loss function of the single region edge image and the single region rough edge prediction map:

所述第一损失函数为The first loss function is

Figure BDA0003018755690000124
Figure BDA0003018755690000124

式中,x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述灰度图像和所述预测图像的宽和高,c∈{1,…,C},C是分割类别数,

Figure BDA0003018755690000125
对于类别c经过one-hot编码的所述二值图像,
Figure BDA0003018755690000126
qc(x,y)是坐标(x,y)像素属于类别c的预测概率;In the formula, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the width and height of the grayscale image and the predicted image respectively, c∈{ 1,...,C}, C is the number of split categories,
Figure BDA0003018755690000125
For the binary image of category c after one-hot encoding,
Figure BDA0003018755690000126
q c (x, y) is the predicted probability that the pixel at coordinates (x, y) belongs to category c;

所述第二损失函数为The second loss function is

Figure BDA0003018755690000127
Figure BDA0003018755690000127

式中,i∈{1,2,…,5},x∈{1,2,…,W},y∈{1,2,…,H},W和H分别是所述单个区域边缘图像与所述单区域粗略边缘预测图的宽和高,gi(x,y)为第i个区域边缘图像,gi(x,y)∈{0,1},AND为交运算,pei(x,y)为第i个区域粗略边缘预测图像,pei(x,y)∈{0,1}。In the formula, i∈{1,2,…,5}, x∈{1,2,…,W}, y∈{1,2,…,H}, W and H are the edge images of the single area respectively and the width and height of the single region rough edge prediction map, g i (x, y) is the i-th region edge image, g i (x, y)∈{0,1}, AND is the intersection operation, pe i (x,y) is the rough edge prediction image of the i-th region, pe i (x,y)∈{0,1}.

所述总损失函数为The total loss function is

Figure BDA0003018755690000131
Figure BDA0003018755690000131

式中,参数N为li不为0的个数。In the formula, the parameter N is the number of l i not 0.

一种基于机器学习的中医耳部五脏区域分割装置,其特征在于,包括至少一个处理器,以及与至少一个处理器连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行中医耳部五脏区域分割任一项所述的方法。A device for segmenting ear five viscera regions based on machine learning, characterized in that it includes at least one processor, and a memory connected to at least one processor; wherein, the memory stores information that can be executed by the at least one processor. Instructions, the instructions are executed by the at least one processor, so that the at least one processor can perform any one of the methods for segmenting ear and five viscera regions in traditional Chinese medicine.

应当认识到,本发明实施例中的方法步骤可以由计算机硬件、硬件和软件的组合、或者通过存储在非暂时性计算机可读存储器中的计算机指令来实现或实施。所述方法可以使用标准编程技术。每个程序可以以高级过程或面向对象的编程语言来实现以与计算机系统通信。然而,若需要,该程序可以以汇编或机器语言实现。在任何情况下,该语言可以是编译或解释的语言。此外,为此目的该程序能够在编程的专用集成电路上运行。It should be recognized that the method steps in the embodiments of the present invention may be implemented or implemented by computer hardware, a combination of hardware and software, or by computer instructions stored in a non-transitory computer-readable memory. The methods can use standard programming techniques. Each program can be implemented in a high-level procedural or object-oriented programming language to communicate with the computer system. However, the programs can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language. Furthermore, the program can be run on an application specific integrated circuit programmed for this purpose.

此外,可按任何合适的顺序来执行本文描述的过程的操作,除非本文另外指示或以其他方式明显地与上下文矛盾。本文描述的过程(或变型和/或其组合)可在配置有可执行指令的一个或多个计算机系统的控制下执行,并且可作为共同地在一个或多个处理器上执行的代码(例如,可执行指令、一个或多个计算机程序或一个或多个应用)、由硬件或其组合来实现。所述计算机程序包括可由一个或多个处理器执行的多个指令。In addition, operations of processes described herein may be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The processes described herein (or variations and/or combinations thereof) can be performed under the control of one or more computer systems configured with executable instructions, and as code that collectively executes on one or more processors (e.g. , executable instructions, one or more computer programs or one or more applications), hardware or a combination thereof. The computer program comprises a plurality of instructions executable by one or more processors.

进一步,所述方法可以在可操作地连接至合适的任何类型的计算平台中实现,包括但不限于个人电脑、迷你计算机、主框架、工作站、网络或分布式计算环境、单独的或集成的计算机平台、或者与带电粒子工具或其它成像装置通信等等。本发明的各方面可以以存储在非暂时性存储介质或设备上的机器可读代码来实现,无论是可移动的还是集成至计算平台,如硬盘、光学读取和/或写入存储介质、RAM、ROM等,使得其可由可编程计算机读取,当存储介质或设备由计算机读取时可用于配置和操作计算机以执行在此所描述的过程。此外,机器可读代码,或其部分可以通过有线或无线网络传输。当此类媒体包括结合微处理器或其他数据处理器实现上文所述步骤的指令或程序时,本文所述的发明包括这些和其他不同类型的非暂时性计算机可读存储介质。当根据本发明所述的方法和技术编程时,本发明还包括计算机本身。Further, the method can be implemented in any type of computing platform operably connected to a suitable one, including but not limited to personal computer, minicomputer, main frame, workstation, network or distributed computing environment, stand-alone or integrated computer platform, or communicate with charged particle tools or other imaging devices, etc. Aspects of the invention can be implemented as machine-readable code stored on a non-transitory storage medium or device, whether removable or integrated into a computing platform, such as a hard disk, optically read and/or written storage medium, RAM, ROM, etc., such that they are readable by a programmable computer, when the storage medium or device is read by the computer, can be used to configure and operate the computer to perform the processes described herein. Additionally, the machine-readable code, or portions thereof, may be transmitted over a wired or wireless network. The invention described herein includes these and other various types of non-transitory computer-readable storage media when such media include instructions or programs that implement the steps described above in conjunction with a microprocessor or other data processor. The invention also includes the computer itself when programmed according to the methods and techniques described herein.

计算机程序能够应用于输入数据以执行本文所述的功能,从而转换输入数据以生成存储至非易失性存储器的输出数据。输出信息还可以应用于一个或多个输出设备如显示器。在本发明优选的实施例中,转换的数据表示物理和有形的对象,包括显示器上产生的物理和有形对象的特定视觉描绘。Computer programs can be applied to input data to perform the functions described herein, thereby transforming the input data to generate output data stored to non-volatile memory. Output information may also be applied to one or more output devices such as a display. In a preferred embodiment of the invention, the transformed data represents physical and tangible objects, including specific visual depictions of physical and tangible objects produced on a display.

上面结合附图对本发明实施例作了详细说明,但是本发明不限于上述实施例,在所述技术领域普通技术人员所具备的知识范围内,还可以在不脱离本发明宗旨的前提下作出各种变化。The embodiments of the present invention have been described in detail above in conjunction with the accompanying drawings, but the present invention is not limited to the above embodiments. Within the scope of knowledge possessed by those of ordinary skill in the technical field, various modifications can be made without departing from the gist of the present invention. kind of change.

在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示意性实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不一定指的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任何的一个或多个实施例或示例中以合适的方式结合。In the description of this specification, references to the terms "one embodiment," "some embodiments," "exemplary embodiments," "example," "specific examples," or "some examples" are intended to mean that the implementation A specific feature, structure, material, or characteristic described by an embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic representations of the above terms do not necessarily refer to the same embodiment or example. Furthermore, the specific features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

尽管已经示出和描述了本发明的实施例,本领域的普通技术人员可以理解:在不脱离本发明的原理和宗旨的情况下可以对这些实施例进行多种变化、修改、替换和变型,本发明的范围由权利要求及其等同物限定。Although the embodiments of the present invention have been shown and described, those skilled in the art can understand that various changes, modifications, substitutions and modifications can be made to these embodiments without departing from the principle and spirit of the present invention. The scope of the invention is defined by the claims and their equivalents.

Claims (9)

1.一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,包括:1. A method for segmenting the five internal organs of the ears of traditional Chinese medicine based on machine learning, characterized in that, comprising: 采集耳部图像,建立图像数据集;Collect ear images and build image datasets; 对所述耳部图像进行标注,得到标签图像;Annotating the ear image to obtain a label image; 分别扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像;Expanding the ear image and the label image respectively to obtain the expanded ear image and the expanded label image; 对所述扩充标签图像进行Canny算子边缘检测,得到标签边缘图像;Carry out Canny operator edge detection to described extended label image, obtain label edge image; 分别对所述扩充耳部图像、所述扩充标签图像和所述标签边缘图像进行预处理,分别得到RGB三通道图像、灰度图像和二值图像,其中,所述RGB三通道图像、所述灰度图像和所述二值图像的尺寸均为W×H;Preprocessing the expanded ear image, the expanded label image, and the label edge image respectively to obtain an RGB three-channel image, a grayscale image, and a binary image, wherein the RGB three-channel image, the Both the size of the grayscale image and the binary image are W×H; 利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像;Using the grayscale image and the binary image to generate edge images of individual regions of the heart, liver, spleen, lung, and kidney; 将所述RGB三通道图像输入学习网络,得到预测图像;Input the RGB three-channel image into the learning network to obtain a predicted image; 从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像;obtaining prediction images of individual regions of the heart, liver, spleen, lung, and kidney from the prediction images; 分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图;Carrying out dilation and erosion processing on the single area prediction images of the heart, liver, spleen, lung, and kidney respectively, to obtain a single area rough edge prediction map of the corresponding area; 计算总损失函数,所述总损失函数包括所述灰度图像与所述预测图像的第一损失函数和所述单个区域边缘图像与所述单区域粗略边缘预测图的第二损失函数,其中,calculating a total loss function, the total loss function including a first loss function of the grayscale image and the predicted image and a second loss function of the single region edge image and the single region rough edge prediction map, wherein, 所述第一损失函数为The first loss function is
Figure FDA0004065900810000011
Figure FDA0004065900810000011
式中,x∈{1,2,...,W},y∈{1,2,...,H},W分别是所述灰度图像和所述预测图像的宽,H分别是所述灰度图像和所述预测图像的高,c∈{1,2,...,X},C是分割类别数,
Figure FDA0004065900810000012
对于类别c经过one-hot编码的所述二值图像
Figure FDA0004065900810000013
qc(x,y)是坐标(x,y)像素属于类别c的预测概率;
In the formula, x∈{1, 2,..., W}, y∈{1, 2,..., H}, W are the widths of the grayscale image and the predicted image respectively, and H is The height of the grayscale image and the predicted image, c ∈ {1, 2, ..., X}, C is the number of segmentation categories,
Figure FDA0004065900810000012
For the binary image of category c after one-hot encoding
Figure FDA0004065900810000013
q c (x, y) is the predicted probability that the pixel at coordinate (x, y) belongs to class c;
所述第二损失函数为The second loss function is
Figure FDA0004065900810000021
Figure FDA0004065900810000021
式中,i∈{1,2,...,5},x∈{1,2,...,W},y∈{1,2,...,H},W分别是所述单个区域边缘图像与所述单区域粗略边缘预测图的宽,H分别是所述单个区域边缘图像与所述单区域粗略边缘预测图的高,gi(x,y)∈{0,1}为第i个区域边缘图像,gi(x,y)∈{0,1},AND为交运算,pei(x,y)为第i个区域粗略边缘预测图像,pei(x,y)∈{0,1},In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W are respectively the The width of the single region edge image and the single region rough edge prediction map, H is the height of the single region edge image and the single region rough edge prediction map, g i (x, y)∈{0, 1} is the edge image of the i-th area, g i (x, y) ∈ {0, 1}, AND is an intersection operation, pe i (x, y) is the rough edge prediction image of the i-th area, pe i (x, y ) ∈ {0, 1}, 所述总损失函数为The total loss function is
Figure FDA0004065900810000022
Figure FDA0004065900810000022
式中,参数N为li不为0的个数;In the formula, the parameter N is the number that l i is not 0; 计算所述总损失函数的最小值,使所述学习网络收敛,利用所述学习网络对所述耳部图像进行分割,得到耳部五脏区域分割图像。The minimum value of the total loss function is calculated to make the learning network converge, and the learning network is used to segment the ear image to obtain a segmented image of the five viscera regions of the ear.
2.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,对所述耳部图像进行标注,得到标签图像,包括:2. a kind of traditional Chinese medicine ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, described ear image is marked, obtains label image, comprises: 按照中国标准耳穴定位示意图对所述耳部图像使用LabelMe工具箱进行标注;According to the Chinese standard auricular point positioning schematic diagram, the ear image is marked using the LabelMe toolbox; 对所述耳部图像的标注信息进行转换,得到每一个所述耳部图像对应的标签图像,所述标签图像中对应中国标准耳穴定位示意图中心、肝、脾、肺、肾以及其余区域为背景六个区域的分别以不同的像素值区分。Convert the labeling information of the ear images to obtain a label image corresponding to each of the ear images, in which the center, liver, spleen, lung, kidney and other areas of the label image correspond to the Chinese standard auricular point positioning schematic diagram as the background Each of the six regions is distinguished by different pixel values. 3.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,所述扩充所述耳部图像和所述标签图像,得到扩充耳部图像和扩充标签图像,包括:3. a kind of traditional Chinese medical science ear five viscera region segmentation method based on machine learning according to claim 1, is characterized in that, described expanding described ear image and described label image, obtain expanded ear image and expanded label image ,include: 对所述耳部图像进行旋转、改变所述耳部图像的尺寸、对所述耳部图像进行水平翻转和对所述耳部图像进行伽马变换,得到所述扩充耳部图像;Rotating the ear image, changing the size of the ear image, horizontally flipping the ear image, and performing gamma transformation on the ear image to obtain the expanded ear image; 对所述标签图像进行与所述耳部图像相同的旋转、改变所述标签图像的尺寸、对所述标签图像进行水平翻转,并使其仍只拥有五种像素值对应心、肝、脾、肺、肾五种区域,且灰度像素值分别为50、100、150、200、250,除上述五种区域外为背景区域,像素值为0,得到所述扩充标签图像。Perform the same rotation on the label image as the ear image, change the size of the label image, flip the label image horizontally, and make it still only have five pixel values corresponding to heart, liver, spleen, The five areas of lung and kidney, and the gray pixel values are 50, 100, 150, 200, 250 respectively, except the above five areas are background areas, the pixel value is 0, and the extended label image is obtained. 4.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,分别对所述扩充耳部图像和所述标签边缘图像进行预处理,包括:4. a kind of TCM ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, preprocessing is carried out to described expansion ear image and described label edge image respectively, comprises: 对所述扩充耳部图像进行位置剪切,归一化操作,得到尺寸为W×H的所述RGB三通道图像;Carrying out position cutting and normalization operation on the expanded ear image to obtain the RGB three-channel image whose size is W×H; 对所述扩充标签图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W×H的所述灰度图像,所述灰度图像分为背景、心、肝、脾、肺、肾六个区域,所述六个区域的灰度像素值分别为0、1、2、3、4、5;Cutting the extended label image corresponding to the extended ear image to obtain the grayscale image with a size of W×H, the grayscale image is divided into background, heart, liver, spleen, lung, Six areas of the kidney, the gray pixel values of the six areas are 0, 1, 2, 3, 4, 5 respectively; 对所述标签边缘图像进行与所述扩充耳部图像对应的位置剪切,得到尺寸为W×H的所述二值图像。Perform position clipping corresponding to the extended ear image on the label edge image to obtain the binary image with a size of W×H. 5.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,利用所述灰度图像和所述二值图像生成心、肝、脾、肺、肾各单个区域边缘图像,包括:5. A machine learning-based method for segmenting the five viscera regions of the ears of traditional Chinese medicine according to claim 1, characterized in that, using the grayscale image and the binary image to generate the heart, liver, spleen, lung, and kidney Individual region edge images, including: 令flabel(x,y)表示在所述灰度图像任意坐标(x,y)处的像素值,令fedge(x,y)表示在所述二值图像任意坐标(x,y)处的像素值,gi(x,y)表示这些坐标处相应的变换的像素值,有Let f label (x, y) represent the pixel value at any coordinate (x, y) of the grayscale image, let f edge (x, y) represent the pixel value at any coordinate (x, y) of the binary image The pixel value of , g i (x, y) represents the corresponding transformed pixel value at these coordinates, there is
Figure FDA0004065900810000031
Figure FDA0004065900810000031
式中,i∈{1,2,...,5},x∈{1,2,...,W},y∈{1,2,...,H},W分别是所述灰度图像和所述二值图像的宽,H分别是所述灰度图像和所述二值图像的高,g1、g2、g3、g4、g5分别是心、肝、脾、肺、肾的所述单个区域边缘图像。In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W are respectively the The width of the grayscale image and the binary image, H is the height of the grayscale image and the binary image respectively, g 1 , g 2 , g 3 , g 4 , g 5 are the heart, liver, spleen , the single region edge images of lung and kidney.
6.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,将所述RGB三通道图像输入学习网络,得到预测图像,包括:6. a kind of traditional Chinese medicine ear five internal organs region segmentation method based on machine learning according to claim 1, is characterized in that, said RGB three-channel image input learning network, obtains predicted image, comprises: 将所述RGB三通道图像经过两个卷积核都为3*3的卷积层,得到通道为64,尺寸为H×W的特征ForiginPass the RGB three-channel image through two convolution layers with 3*3 convolution kernels to obtain a feature F origin with 64 channels and a size of H×W; 将所述特征Forigin经过PAM模块得到特征F1Pass the feature F origin through the PAM module to obtain the feature F 1 ; 将所述特征F1经过PAM模块得到特征F2Pass the feature F1 through the PAM module to obtain the feature F2 ; 将所述特征F1与所述特征F2分别与系数0.75、0.25相乘后的得到的两个结果进行通道上的拼接,然后通过卷积核为3*3的卷积层,得到通道为6,尺寸为H×W的所述预测图像。The two results obtained by multiplying the feature F 1 and the feature F 2 by the coefficients 0.75 and 0.25 respectively are spliced on the channel, and then passed through the convolution layer with a convolution kernel of 3*3 to obtain the channel as 6. The prediction image whose size is H×W. 7.根据权利要求1所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,从所述预测图像中获得心、肝、脾、肺、肾的各单个区域预测图像,包括:7. A kind of method for segmenting five internal organs regions of the ears of traditional Chinese medical science based on machine learning according to claim 1, is characterized in that, obtains each single region predictive image of heart, liver, spleen, lung, kidney from described predictive image, include: 对所述预测图像在通道的维度上取最大值,得到尺寸为H×W,像素值只有0、1、2、3、4、5六种像素值的维度灰度图像,其中,六种像素值在所述维度灰度图像中所对应的区域分别为背景、心、肝、脾、肺、肾的预测分割区域;Take the maximum value of the predicted image in the dimension of the channel to obtain a dimensional grayscale image with a size of H×W and only six pixel values of 0, 1, 2, 3, 4, and 5, wherein the six types of pixels The areas corresponding to the values in the dimension grayscale image are respectively the predicted segmentation areas of the background, heart, liver, spleen, lung, and kidney; 令P(x,y)表示在预测灰度图像任意坐标(x,y)的像素值,有Let P(x, y) denote the pixel value at any coordinate (x, y) of the predicted grayscale image, we have
Figure FDA0004065900810000041
Figure FDA0004065900810000041
式中,i∈{1,2,...,5},x∈{1,2,...,W},y∈{1,2,...,H},W和H分别是所述维度灰度图像的宽和高,P(x,y)={0,1,...,5},p1、p2、p3、p4、p5分别是心、肝、脾、肺、肾的所述单个区域预测图像。In the formula, i∈{1,2,...,5}, x∈{1,2,...,W}, y∈{1,2,...,H}, W and H are respectively The width and height of the dimensional grayscale image, P(x, y)={0, 1, ..., 5}, p 1 , p 2 , p 3 , p 4 , p 5 are heart, liver, The single region prediction images of spleen, lung and kidney.
8.根据权利要求7所述的一种基于机器学习的中医耳部五脏区域分割方法,其特征在于,所述分别对心、肝、脾、肺、肾的所述单个区域预测图像进行膨胀、腐蚀处理,得到其对应的区域的单区域粗略边缘预测图,包括:8. A kind of machine learning-based TCM ear five internal organs region segmentation method according to claim 7, characterized in that, said single region prediction images of the heart, liver, spleen, lung and kidney are respectively expanded, Corrosion processing to obtain a single-region rough edge prediction map of its corresponding region, including: 定义结构元B,所述结构元B的大小为3*3,原点位于中心,元素值都为1;Define the structure element B, the size of the structure element B is 3*3, the origin is at the center, and the element values are all 1; 使用所述结构元B膨胀和腐蚀心、肝、脾、肺、肾的各所述单个区域预测图像,得到膨胀后的所述单个区域预测图像和腐蚀后的所述单个区域预测图像,膨胀后的所述单个区域预测图像和腐蚀后的所述单个区域预测图像相减获得其对应的所述区域粗略边缘预测图;Use the structural element B to dilate and corrode each of the single-region predicted images of the heart, liver, spleen, lung, and kidney to obtain the expanded predicted single-region image and the corroded single-region predicted image, and after dilation Subtracting the predicted image of the single region and the predicted image of the single region after erosion to obtain the corresponding rough edge prediction map of the region; 令pei(x,y)表示在所述区域粗略边缘预测图像任意坐标的像素值,有Let pe i (x, y) denote the pixel value at any coordinate of the predicted image at the rough edge of the region, we have
Figure FDA0004065900810000051
Figure FDA0004065900810000051
式中,符号
Figure FDA0004065900810000052
为膨胀运算,符号
Figure FDA0004065900810000053
为腐蚀运算,i∈{1,2,...,5},x∈{1,2,...,W},y∈{1,2,...,H},W和H分别是所述单区域粗略边缘预测图像的宽和高,pei(x,y)∈{0,1},pe1、pe2、pe3、pe4、pe5分别是心、肝、脾、肺、肾的所述单区域粗略边缘预测图像,pi(x,y)表示心、肝、脾、肺、肾中对应的所述单个区域预测图像任意坐标(x,y)的像素值。
In the formula, the symbol
Figure FDA0004065900810000052
is the dilation operation, the symbol
Figure FDA0004065900810000053
For corrosion operation, i∈{1, 2,...,5}, x∈{1, 2,...,W}, y∈{1, 2,...,H}, W and H respectively is the width and height of the single region rough edge prediction image, pe i (x, y) ∈ {0, 1}, pe 1 , pe 2 , pe 3 , pe 4 , pe 5 are heart, liver, spleen, The single region rough edge prediction image of the lung and kidney, p i (x, y) represents the pixel value of any coordinate (x, y) of the corresponding single region prediction image in the heart, liver, spleen, lung and kidney.
9.一种基于机器学习的中医耳部五脏区域分割装置,其特征在于,包括至少一个处理器,以及与所述至少一个处理器连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如权利要求1-8任一项所述的方法。9. A traditional Chinese medical science ear five internal organs region segmentation device based on machine learning, characterized in that it comprises at least one processor, and a memory connected to said at least one processor; wherein said memory stores information that can be used by said at least Instructions executed by a processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the method according to any one of claims 1-8.
CN202110396463.4A 2021-04-13 2021-04-13 Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning Active CN113112475B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110396463.4A CN113112475B (en) 2021-04-13 2021-04-13 Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110396463.4A CN113112475B (en) 2021-04-13 2021-04-13 Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning

Publications (2)

Publication Number Publication Date
CN113112475A CN113112475A (en) 2021-07-13
CN113112475B true CN113112475B (en) 2023-04-18

Family

ID=76716660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110396463.4A Active CN113112475B (en) 2021-04-13 2021-04-13 Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning

Country Status (1)

Country Link
CN (1) CN113112475B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114155240A (en) * 2021-12-13 2022-03-08 韩松洁 Ear acupoint detection method and device and electronic equipment
CN119762483B (en) * 2025-03-05 2025-05-09 江汉大学 Ear acupoint reflection area identification method and device, acupuncture robot and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020119679A1 (en) * 2018-12-14 2020-06-18 深圳先进技术研究院 Three-dimensional left atrium segmentation method and apparatus, terminal device, and storage medium
AU2020103905A4 (en) * 2020-12-04 2021-02-11 Chongqing Normal University Unsupervised cross-domain self-adaptive medical image segmentation method based on deep adversarial learning

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10699412B2 (en) * 2017-03-23 2020-06-30 Petuum Inc. Structure correcting adversarial network for chest X-rays organ segmentation
CN108596193B (en) * 2018-04-27 2021-11-02 东南大学 A method and system for building a deep learning network structure for human ear recognition
CN111402268B (en) * 2020-03-16 2023-05-23 苏州科技大学 A Method for Segmenting the Liver and Its Lesions in Medical Images
CN111583287A (en) * 2020-04-23 2020-08-25 浙江大学 A deep learning model training method for fine portrait image segmentation
CN112070752B (en) * 2020-09-10 2024-08-13 杭州晟视科技有限公司 Auricle segmentation method and device for medical image and storage medium

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020119679A1 (en) * 2018-12-14 2020-06-18 深圳先进技术研究院 Three-dimensional left atrium segmentation method and apparatus, terminal device, and storage medium
AU2020103905A4 (en) * 2020-12-04 2021-02-11 Chongqing Normal University Unsupervised cross-domain self-adaptive medical image segmentation method based on deep adversarial learning

Also Published As

Publication number Publication date
CN113112475A (en) 2021-07-13

Similar Documents

Publication Publication Date Title
CN111652845B (en) Automatic labeling method and device for abnormal cells, electronic equipment and storage medium
CN111882560B (en) Lung parenchyma CT image segmentation method based on weighted full convolution neural network
CN110059697B (en) An automatic segmentation method for pulmonary nodules based on deep learning
US7724936B2 (en) Image generation apparatus and image generation method for detecting abnormalities
CN110705565A (en) Lymph node tumor region identification method and device
CN111144486A (en) Heart nuclear magnetic resonance image key point detection method based on convolutional neural network
CN109003269A (en) A kind of mark extracting method for the medical image lesion that can improve doctor's efficiency
CN113724243B (en) Image processing method, image processing device, electronic equipment and storage medium
CN113112475B (en) Traditional Chinese medicine ear five-organ region segmentation method and device based on machine learning
CN112488996B (en) Non-homogeneous three-dimensional esophageal cancer energy spectrum CT weak supervision automatic labeling method and system
CN110246579B (en) Pathological diagnosis method and device
CN116188474B (en) Three-level lymphatic structure identification method and system based on image semantic segmentation
CN113822846A (en) Method, apparatus, device and medium for determining region of interest in medical image
CN116433704A (en) A cell nucleus segmentation method and related equipment based on center points
Li et al. SAP‐cGAN: Adversarial learning for breast mass segmentation in digital mammogram based on superpixel average pooling
CN106600587A (en) Lung CT image auxiliary detection processing device
Mortaheb et al. Metal artifact reduction and segmentation of dental computerized tomography images using least square support vector machine and mean shift algorithm
CN112634266A (en) Semi-automatic marking method, medium, equipment and device for laryngoscope image
CN113408595B (en) Pathological image processing method and device, electronic equipment and readable storage medium
Somasundaram et al. Straightening of highly curved human chromosome for cytogenetic analysis
CN118691908A (en) A method for identifying crypt abscesses
CN118982664A (en) Enhanced CT image anatomical atlas segmentation method and system based on deep learning
Atim et al. Explainable deep learning framework for ground glass opacity (ggo) segmentation from chest ct scans
CN103366183A (en) Nonparametric automatic detection method of focal niduses
Wang et al. Weakly supervised tooth instance segmentation on 3d dental models with multi-label learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant