[go: up one dir, main page]

CN109635618A - Visible images vein developing method based on convolutional neural networks - Google Patents

Visible images vein developing method based on convolutional neural networks Download PDF

Info

Publication number
CN109635618A
CN109635618A CN201810890329.8A CN201810890329A CN109635618A CN 109635618 A CN109635618 A CN 109635618A CN 201810890329 A CN201810890329 A CN 201810890329A CN 109635618 A CN109635618 A CN 109635618A
Authority
CN
China
Prior art keywords
network
layer
convolutional neural
output
visible light
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810890329.8A
Other languages
Chinese (zh)
Other versions
CN109635618B (en
Inventor
唐超颖
押莹
王彪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Aeronautics and Astronautics
Original Assignee
Nanjing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Aeronautics and Astronautics filed Critical Nanjing University of Aeronautics and Astronautics
Priority to CN201810890329.8A priority Critical patent/CN109635618B/en
Publication of CN109635618A publication Critical patent/CN109635618A/en
Application granted granted Critical
Publication of CN109635618B publication Critical patent/CN109635618B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/15Biometric patterns based on physiological signals, e.g. heartbeat, blood flow
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

本发明公开了一种基于卷积神经网络的可见光图像静脉显像方法,属于一种信息感知与识别技术,本发明通过设计了设计端到端的全卷积神经网络静脉显像模型CY‑Net,首先对近红外手臂图像进行预处理,然后以成对的可见光手臂图像和预处理后的NIR图像作为训练样本,端到端地训练一个五层的卷积神经网络,并将网络最后的全连接层换为卷积层,形成全卷积神经网络,使网络自动学习可见光‑近红外图像之间的映射关系,得到目标的映射图;最后用Gabor滤波等方法对网络输出的静脉显像图进行静脉提取。本发明的方法比基于光学和基于三层前向神经网络的可见光显像方法效果更好,体现出较好的鲁棒性,并且验证了本发明方法的有效性。

The invention discloses a visible light image vein imaging method based on a convolutional neural network, which belongs to an information perception and recognition technology. The invention designs an end-to-end full convolutional neural network vein imaging model CY-Net, First, the near-infrared arm images are preprocessed, and then a five-layer convolutional neural network is trained end-to-end with paired visible light arm images and preprocessed NIR images as training samples, and the final network is fully connected. The layer is replaced by a convolutional layer to form a fully convolutional neural network, so that the network can automatically learn the mapping relationship between visible light and near-infrared images, and obtain the target map; Intravenous extraction. Compared with the visible light imaging method based on optics and the three-layer forward neural network, the method of the present invention has better effect, shows better robustness, and verifies the effectiveness of the method of the present invention.

Description

Visible images vein developing method based on convolutional neural networks
Technical field
The invention belongs to a kind of information Perceptions and identification technology, in particular to a kind of visible light based on convolutional neural networks Image vein developing method.
Background technique
In medical domain, intravenous injection is a kind of common treatment means, but the skin of some patients is difficult to find Vein, such as subcutaneous fat is thicker or chaeta is more, vein is relatively thin, the colour of skin is deeper, this makes the success rate of venipuncture not high. Although existing research person develops venous locations detector, traditional hand vein recognition instrument must rely near infrared imaging and set It is standby, it is at high cost, difficult in maintenance.Visible images vein developing method proposed in recent years based on optics and skin physiology can To overcome this limitation (Chaoying Tang, Adams Wai Kin Kong, N.C.Uncovering vein patterns from color skin images for forensic analysis[C]//IEEE International Conference on Computer Vision and Pattern Recognition, 2011:665-672), but the party There is noise, algorithm complexity, to the poor robustness of different skin condition in method.Patent of invention: visible light skin image Vein developing method (application number: 201410734150.5) proposes a kind of visible light imaging based on three layer feedforward neural networks Method, but its structure is too simple, learning ability is weak, therefore imaging results are also limited to.
Deep learning is a kind of method of automatic learning characteristic, wherein convolutional neural networks (Convolutional Neural Networks, CNN) pixel can be retained when handling image spatial relation, its structure is to figure There is height invariance therefore to be very suitable to do the feature of two-dimensional image data for the deformation such as rotation, translation, the scaling of picture It extracts.
The process that vein is extracted from visible light arm image includes that object detection and lines extract in terms of two.Object inspection It surveys, i.e., detects arm from input picture;Lines extract, i.e., the visible light and near-infrared image learnt using network model Mapping relations realized from the arm detected vein image, finally carry out lines extraction again.Traditional object detecting method It is low-level feature using the feature of design, it is insufficient to the ability to express of target.Deep learning method can be from a large amount of sample High-level characteristic is arrived in automatic study, and compared to the feature of engineer, the feature learnt is more abundant, ability to express is also stronger.
With the continuous development of deep learning, researcher's discovery carries out target detection using CNN, and accuracy rate can be mentioned substantially It rises, compared to Conventional visual feature, convolutional neural networks visual signature is more suitable for image detection task.In such as venous blood Pipe, lane, course line lines extract aspect, traditional algorithm is strong to the identification of feature.And the expression way of feature excessively according to Rely artificial selection, researcher not can guarantee the accurate selection and expression to feature.The deep layer group of deep learning main analog human brain Structure is knitted, low-level feature is combined.More abstract effective high-rise expression is formed, therefore mark sheet can be carried out automatically It reaches.
Superfine finger hand vein recognition algorithm (the finger hand vein recognition research based on deep learning proposed based on deep learning of Wu [J] computer technology and development, 2018 (2): 200-204), it proposes to improve on the basis of Alex Net deep neural network Scheme, carrying out classification to finger vein image has better result;Qin Zhi light etc. proposes a kind of based on multi-modal CNN model ((cerebrovascular Study on Extraction Method [J] Journal of UEST of China based on multi-modal convolutional neural networks, 2016,45 (4): 573-581)), brain CT angiographic image is split, and has been experimentally confirmed this model and has been extracted in the cerebrovascular On validity;The it is proposeds such as Wang Yiding obtained using wavelet decomposition and the method for down-sampling it is multiple dimensioned under hand back vein figure Picture, then the extraction of characteristics of image is carried out with centrosymmetric local binary patterns, it is successively trained using limitation Boltzmann machine And discrimination (the hand based on deep learning and multiple dimensioned coded combination of vein is improved by the method for multiple dimensioned coded combination Dorsal vein identifies [J] North China University of Tech journal, 2015,27 (3): 6-13);Zhang Guiying etc. uses two kinds of nets based on CNN Network structure automatically learns and extracts the feature of optic disk in retinal images, realizes the automatic detection of optic disk, substantially increases view Disk detection accuracy rate (optic disk of Zhang Guiying, the Zhang Xianjie based on deep learning detects the Guizhou [J] college of education journal automatically, 2017,33(3):27-32)。
Convolutional neural networks can extract high-level characteristic, and feature extraction, feature selecting and tagsort are merged same Training, non local on the whole to optimize in model, is enhanced the separability of feature.CNN is mentioned applied to vein Other methods can effectively be overcome the shortcomings of by taking.But there are no the skills that CNN is used for the imaging of visible light vein in the prior art Art.
Summary of the invention
The present invention is directed to traditional vein and images the dependence near infrared gear and obtained in the prior art using optical means To image there are obvious noise, imaging results are limited, and the defect of poor robustness proposes a kind of based on the visible of deep learning CNN is used for the technology of visible light vein imaging, solved by light image vein developing method by devising a kind of CNN model Problems of the prior art.
The present invention is implemented as follows:
The invention discloses a kind of visible images vein developing method based on convolutional neural networks, specific steps are such as Under:
Step 1: near-infrared arm vein image preprocessing;
Step 2: designing end as training sample based on the pretreated image of step 1 and visible light arm image Full convolutional neural networks vein to end images MODEL C Y-Net;The network is using visible light arm image as input, using five Layer convolutional layer is filtered operation to image;
Step 3: to the parameter optimization strategy in CY-Net network model: using gradient optimization algorithm and cost Function optimization algorithm;
Step 4: finally carrying out vein extraction to the vein imaging figure of network output.
Further, the adaptive histogram equalization method near-infrared arm of contrast-limited is utilized in the step one Image is pre-processed.
Further, the step two specifically:
2.1, the convolutional neural networks that the network overall structure of design is one five layers are followed by one behind each convolutional layer Layer PReLU activation primitive layer, common activation primitive Sigmoid and Tanh is saturation nonlinearity function.In recent years, most nerves Network all uses unsaturated ReLU (Rectified Linear Unit, non-liner revision unit) or its deformation PReLU The function of (Parametric Rectified Linear Unit) as active coating;Its mathematical expression difference is as follows:
F (x)=max (0, x) (1)
Wherein, x is function input, and f (x) is function output, and a is slope;
One layer of PReLU activation primitive layer is followed by behind each convolutional layer of present networks, increase network non-linear factor and The learning ability of model, it, which increases a small amount of calculation amount than ReLU activation primitive, realizes higher accuracy rate, believes negative axis Breath will not all be lost, and avoid " dying ReLU ".
2.2, Dropout layers are arranged after convolutional layer four, the characteristic pattern finally obtained with 3 × 3 convolution kernel to convolutional layer 5 It carries out convolution and obtains network output image, calculate the cost function of the near-infrared image of output image and input, passed using reversed The gradient descent method optimization that iterates is broadcast to update network parameter and minimize cost function, make e-learning to visible images with Mapping relations between near-infrared image;
Dropout layers be a kind of generalization ability for avoiding occurring in network training over-fitting and improving network plan Slightly, network complexity is reduced by giving up the output of hidden layer neuron at random.The neuron being rejected both be not involved in front of to Communication process is also not involved in back-propagation process, can reduce the coupling between neuron in this way, make network have more generalization ability, It is more robust.When training data is less, it is easy to happen over-fitting, Dropout strategy can be used.
2.3, using pretreated image as supervision message, one five layers of full convolutional Neural net is trained end-to-endly Network, direct prediction label figure;The target of end-to-end study is directly learnt by depth network to (defeated from the primitive form of data Enter) to the mapping relations between the label (output) of data, reduce artificial pretreatment and post-processing and complicated based on artificial The rule of design or the feature of selection increase the whole compatible degree of network.
It is specially input with visible images, corresponding near-infrared image is that target exports, trained one five layers complete Convolutional neural networks;Network output is subjected to backpropagation as error at a distance from target output, is optimized and revised in network The parameters such as weight, bias term, reduce error constantly, until network output is exported close to target.
Further, first and third layer of convolution kernel size is designed as 5, remaining is 3.Convolution operation usually has and makes Useful information enhancing, the effect for reducing noise are practice have shown that lesser convolution kernel preferably extracts the feature in image.
Further, cost function optimization algorithm is specially the model for using L2 regularization to carry out constrained parameters in the step three Number, i.e., with the difference of predicted value and actual value square half to being averaged after the summation of m sample come measure algorithm operating condition:
In formula, W is network weight, and b is network bias term, and m is number of samples,It is network to the mesh of i-th sample Mark output, y(i)It is network to the reality output of i-th of sample,
Therefore the final goal of Solution To The Network is that the minimum value of cost function, solution procedure are asked for weight W and bias term b Gaussian initialization is carried out to each parameter;
L2 regularization is substantially linear attenuation to be carried out to the weight of network, therefore L2 regularization is also referred to as weight decaying, It is the cost function formula (3) in network below along with a regularization term, it may be assumed that
In formula, the subsequent Section 2 of plus sige is regularization term;
L2 regularization term be to all parameter W square summation after be averaged multiplied by λ/2, wherein λ is regularization coefficient, For weighing specific gravity shared by original cost function J (W, b) and regular terms plus derivation again after regular terms, the more new formula of W Become:
In formula, Section 2 is the partial derivative for solving parameter, and α indicates learning rate;
So the coefficient of W becomes the value smaller than 1 from 1 in newerIts effect is to reduce W to update weight Influence, i.e., weight decay.
Further, gradient optimization algorithm carries out parameter update to weight and bias term in the step three, updates Mode is as follows:
In formula,Indicate the weight parameter connecting between j-th of node of l layer and l+1 i-th of node of layer, Indicate the bias term of l+1 i-th of node of layer, α indicates learning rate, can be seen that by two parameter more new formulas and solves parameter Partial derivative is the key that gradient descent method;Cost function is to weightAnd bias termLocal derviation is asked to obtain:
Residual error is to calculate the output node of every layer network the difference of output valve and true value, it is assumed that the output layer of neural network It is n-thlLayer, then the residual error of output layer is usedIt indicates, l=nl-1,nlThe residual error of i-th of node of -2, K, 2 layer is usedTable Show, calculation formula is as follows:
In formula,It is got by the weighted average calculating of l+1 node layer residual error, after having two above residual error, then Cost function J (W, b) can be further calculated to the local derviation of weight and bias term, it may be assumed that
Detailed description of the invention
Fig. 1 is near-infrared arm vein image preprocessing result figure of the present invention;
Fig. 2 is the pretreatment schematic diagram of arm near-infrared image of the present invention;
Fig. 3 is the schematic diagram of CY-Net network structure of the present invention;
Fig. 4 is the image of arm vein post processing of image process of the present invention;
Fig. 5 is to show in the embodiment of the present invention using visible images, based on what optical means and method of the invention obtained As comparative result figure;
Fig. 6 is the local binarization image after vein of the present invention extracts;
Fig. 7 is network in the embodiment of the present invention to the effect picture of data difference fitting degree;
Fig. 8 is the preferable three groups of images of subjective assessment in the embodiment of the present invention and its result figure after binaryzation;
Fig. 9 is CY-Net model robustness verification result figure under shooting conditions different in the embodiment of the present invention.
Specific embodiment
It is clear to keep the purpose of the present invention, technical solution and effect clearer, referring to the drawings and illustrative example pair The present invention is further described.It should be understood that specific implementation described herein is not used to limit only to explain the present invention The fixed present invention.
1. near-infrared image pre-processes
Due to the specific location of near-infrared arm image medium sized vein and unintelligible, utilize the adaptive of contrast-limited Histogram equalization method (Contrast Limited Adaptive Histogram Equalization, CLAHE) is answered to mention The contrast of near-infrared image is risen, as shown in Figure 1.1 (a) is arm near-infrared image, and 1 (b) is the adaptive of contrast-limited The result of histogram equalization method.
The data set of network is made of the synchronous arm image of visible light-near-infrared of 24 people, since sample total is less, Therefore final to obtain 96 pairs of visible lights-near-infrared arm image using horizontal, vertical mirror method EDS extended data set, at random Wherein 80% training set of the data as network is chosen, remaining 20% data is as test set.
2. full convolutional neural networks vein images MODEL C Y-Net end to end
The CY-Net network structure that the present invention designs is as shown in Figure 3: the network is adopted using visible light arm image as input Operation, convolution kernel size and feature map number are filtered to image with five layers of convolutional layer as shown in figure 3, after each convolutional layer It is followed by one layer of PReLU activation primitive layer, Dropout layers of setting inactivates neuron at random excessively quasi- to avoid network after convolutional layer 4 It closes, convolution finally is carried out to the characteristic pattern that convolutional layer 5 obtains with 3 × 3 convolution kernel and obtains network output image, calculates output figure As the cost function with the near-infrared image of input, iterating to optimize using back-propagation gradient descent method updates network parameter Cost function is minimized, makes the e-learning to the mapping relations between visible images and near-infrared image.
First and third layer of convolution kernel size is designed as 5, remaining is 3.Convolutional layer is the core layer in convolutional neural networks, Convolutional filtering operation is carried out to each channel of input picture, extracts different features.The each convolutional layer of network of the invention It is followed by one layer of PReLU activation primitive layer below, increases the non-linear factor of network and the learning ability of model, it swashs than ReLU Function living, which increases a small amount of calculation amount, realizes higher accuracy rate, lose negative axis information will not all, avoids " dying ReLU".It deforms the function of PReLU (Parametric Rectified Linear Unit) as active coating, Mathematical expression difference is as follows:
F (x)=max (0, x) (1)
In formula, x is function input, and f (x) is function output, and a is slope.
Dropout layers: the coupling between neuron is reduced by giving up hidden layer neuron at random, to reduce network Complexity makes network have more generalization ability, more robust.In order to avoid generation over-fitting, the generalization ability of network is improved, Dropout strategy is used when network design.
Finally the full articulamentum with output connection realizes classification task for extracting high-level characteristic to traditional CNN network.This Invention is based on the theory of FCN (Fully Convolutional Network), by full connection last in traditional CNN network Layer has changed convolutional layer into, constitutes full convolutional network, obtains the mapping graph of target finally to realize the imaging of vein image.
The CNN network C Y-Net that the present invention designs is with CLAHE treated near-infrared arm image (Ground truth) As supervision message, full convolutional neural networks of one five layers of training end-to-endly, direct prediction label figure (Label map), As shown in Fig. 2, being input with visible images, corresponding near-infrared image is target output, the full convolution of one five layers of training Neural network.By network output with as error progress backpropagation, optimized and revised at a distance from target output weight in network, The parameters such as bias term, reduce error constantly, until network output is exported close to target.
3. Network Optimization Strategy
(1) cost function optimizes
The present invention is using L2 regularization come the norm of constrained parameters.Vein visualization role substantially belongs to recurrence task, because The cost function of this network is defined as in recurrence task common Euclidean distance loss function, that is, uses the difference of predicted value and actual value Square half to being averaged after the summation of a sample and carry out measure algorithm operating condition:
Therefore the final goal of Solution To The Network is that the minimum value of cost function is sought for weight W and bias term b.Solution procedure Gaussian initialization is carried out to each parameter.
L2 regularization is substantially linear attenuation to be carried out to the weight of network, therefore L2 regularization is also referred to as weight decaying (weight decay).It is the cost function formula (3) in network below along with a regularization term, it may be assumed that
L2 regularization term be to all parameter W square summation after be averaged multiplied by λ/2, wherein λ is regularization coefficient, For weighing specific gravity shared by original cost function J (W, b) and regular terms.In addition derivation again after regular terms, the more new formula of W Become:
So the coefficient of W becomes the value smaller than 1 from 1 in newerIts effect is to reduce W to update weight Influence, i.e., weight decay.In a sense, smaller weight indicates that the complexity of network is lower, therefore, pass through constraint The norm of parameter, regularization can mitigate over-fitting to a certain extent.
(2) gradient descent method
The present invention mainly uses Adam (Adaptive Moment Estimation) optimization algorithm.It is stochastic gradient The extension form of descent algorithm (Stochastic Gradient Descent, SGD), algorithm fusion AdaGrad and The advantages of RMSProp algorithm, is based on first moment mean value computation adaptability parameter learning rate, and makes full use of the second moment of gradient equal Value.According to practical experience, the exponential damping parameter beta1 of single order moments estimation takes 0.9, the exponential damping parameter of second order moments estimation Beta2 takes 0.999, epsilon parameter to take 10E-8.
Gradient optimization algorithm carries out parameter update to weight and bias term in step 3, and update mode is as follows:
In formula,Indicate the weight parameter connecting between j-th of node of l layer and l+1 i-th of node of layer, Indicate the bias term of l+1 i-th of node of layer, α indicates learning rate, can be seen that by two parameter more new formulas and solves parameter Partial derivative is the key that gradient descent method;Cost function is to weightAnd bias termLocal derviation is asked to obtain:
Residual error is to calculate the output node of every layer network the difference of output valve and true value, it is assumed that the output layer of neural network It is n-thlLayer, then the residual error of output layer is usedIt indicates, l=nl-1,nlThe residual error of i-th of node of -2, K, 2 layer is usedTable Show, calculation formula is as follows:
In formula,It is got by the weighted average calculating of l+1 node layer residual error, after having two above residual error, then Cost function J (W, b) can be further calculated to the local derviation of weight and bias term, it may be assumed that
Ubuntu16.04, the Caffe deep learning frame of 64 bit manipulation systems, with NVIDIA are based in the present embodiment TITAN X (Pascal), the GPU of 8G memory are experimental situation.Vein image is shot using JAI-AD080CL camera, the camera The spectrum for obtaining visible light and near-infrared can be synchronized.By convolutional neural networks MODEL C Y-Net complete end to end proposed by the present invention What is proposed in the prior art is compared based on optical developing method, to verify the validity of network.Two methods are obtained To output image and former near-infrared arm image post-processed.
Firstly, carrying out Gabor filtering to obtained vein image, Gabor filter can automatically extract the quiet of output image Then based on this arteries and veins information increases venous information using the energy diagram exported after arm shape exposure mask and Gabor filtering By force, binaryzation is finally carried out to enhanced image using the method for adaptive threshold.To the process of arm NIR post processing of image As shown in Figure 4.4 (a) be NIR image;4 (b) be Gabor filtering and enhanced result;4 (c) be the result after binaryzation.
It subjective assessment based on the method for the present invention and is analyzed as follows:
As shown in figure 5, Fig. 5 is that the result that the visible images of arm, optical means obtain and CY-Net are obtained respectively As a result.The imaging of visible light vein image can be achieved in optical means and CY-Net as seen from Figure 5, wherein optical means needle Each pixel is handled, there are obvious noises, and the method based on deep learning is end-to-end study, and noise is than optics side Method wants small.Fig. 6 is the local binarization figure of the 9th group of result, further demonstrates the vein extracting method based on deep learning and makes an uproar Sound is smaller.From the 5th, 7, the 8 group of result of Fig. 5 also it will be evident that the result obtained based on optical method is in dark Region all pixels gray value is all very low, can not observe venous locations, and the method based on deep learning overcomes this and asks Topic, embodies preferable robustness.
It objectively evaluating based on the method for the present invention and is analyzed as follows:
In order to further verify the validity of invention, it is objective to be carried out with four common counters in identification field to experimental result Evaluation: accuracy rate (Accuracy), precision (Precision), also known as precision ratio, recall ratio (Recall) and FM estimate.It is fixed Justice is as follows respectively:
By definition it is found that accuracy rate and precision ratio measured is accuracy that algorithm extracts vein, recall ratio is then paid close attention to Be extract result comprehensive, FM, which estimates, carries out overall merit to algorithm performance in conjunction with precision ratio and recall ratio, to three kinds of sides The bianry image that method post-processes, aforementioned four index value is bigger to illustrate that vein extracts result closer to true vein, effect Better.Vein extraction optically is carried out with CY-Net respectively on 20 groups of visible light arm images, carries out post-processing it Afterwards, four evaluation indexes for calculating every group of result, using near-infrared image corresponding with visible images as Calculation Estimation index True value (Ground truth).It is the accuracy rate and precision ratio evaluation index value of two methods in table 1, is two kinds of sides in table 2 The recall ratio and FM measuring and evaluation index value of method.
Table 1 is Accuracy the and Precision evaluation index value of two methods
Table 2 is Recall the and FM evaluation index value of two methods
Data comparison in table it can be found that in 20 groups of test images, CY-Net have 70% Accuracy value and Precision value is higher than optical means, and the average value on Accuracy, Precision and FM estimate is all than being based on optics Method it is high, illustrate that CY-Net can reach effect more better than optical means based on Pixel-level.Network proposed by the present invention is tested Having demonstrate,proved convolutional neural networks has the ability to the substantive characteristics of image that learns from low volume data.
Fitting degree based on the method for the present invention is analyzed as follows:
Poor fitting refers to that model does not capture the feature of data well, can not good fitting data;Over-fitting refers to Model is fitted too thorough data, so that the feature of data noise is also learnt to arrive, will lead to can not in test Data are identified well, and the generalization ability of model is poor.As shown in fig. 7, being from left to right trained poor fitting (a) respectively, preferably quasi- The case where closing (b) and over-fitting (c).
Wherein Fig. 7 (b) is the visitor by the preferable three groups of results of subjective assessment in measurement same model, different the number of iterations See what index obtained.Fig. 8 is the preferable three groups of images of subjective assessment, calculates four kinds of evaluation indexes, knot to Fig. 8 (a), (b), (c) Fruit is shown in Table 3, each evaluation index of Comprehensive Correlation.It is considered that (c) fitting degree in Fig. 8 is preferable, as used in inventive network Model.
Table 3 is four kinds of evaluation index values of tri- groups of results of Fig. 8
Training starts, and model is often poor fitting, also just there is the space of optimization just because of this, can be by constantly The hyper parameters such as regularized learning algorithm rate and increase the number of iterations to make the feature of model more fully learning data, makes the expression of model Ability is stronger.The reason of over-fitting occurs also has very much, usually will appear over-fitting in the case where data set is less, can To expand data set, therefore the data set of script 4 times are expanded;Meanwhile the number of iterations is not The more the better, sometimes Big the number of iterations model learning has arrived the details of overabundance of data, effect be not so good as instead the number of iterations it is relatively fewer when model, Therefore, suitable the number of iterations is selected to can also be improved the expression quality of model;It, can also in order to which over-fitting is alleviated or avoided The cost function and Dropout strategy that network is defined using regularization method, by changing network structure, reduction neuron Coupling improve the generalization ability of model.
Robustness verifying based on the method for the present invention is as follows:
Since visible light-near-infrared arm image that network training is concentrated into pair is shot by camera of the same race, illumination condition, The variation of the factors such as the receptance function of camera is little.Therefore the experiment of this part is mainly verified in different illumination conditions, is clapped It takes the photograph under environment, and to the robustness of non-arm position network model.As shown in figure 9, CY-Net mould under Fig. 9 difference shooting condition The verifying of type robustness.Shown in Fig. 10 is different physical feeling CY-Net model robustness verifyings;The left side of every a pair of image is Visible images, the right are CY-Net vein imaging figure.
Have been verified that model proposed by the present invention has preferable Shandong to illumination variation in subjective assessment and analysis part Stick.From Fig. 9 and Figure 10 can be seen that the model that the method based on deep learning trains learn automatically to visible light with it is close red Complicated mapping relations between outer image, it is effective not only for the arm image of training sample, change in external condition or All there is preferable robustness and generalization ability when image content change itself.
The above is only a preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, without departing from the principle of the present invention, several improvement can also be made, these improvement also should be regarded as of the invention Protection scope.

Claims (6)

1.一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,步骤如下:1. a visible light image vein imaging method based on convolutional neural network, is characterized in that, step is as follows: 步骤一、近红外手臂静脉图像预处理;Step 1. Preprocessing of near-infrared arm vein images; 步骤二、基于步骤一预处理后的图像以及可见光手臂图像作为训练样本,设计端到端的全卷积神经网络静脉显像模型CY-Net;Step 2: Design an end-to-end fully convolutional neural network vein imaging model CY-Net based on the preprocessed images and visible light arm images in step 1 as training samples; 步骤三、对CY-Net网络模型中的参数优化策略:采用梯度下降优化算法以及代价函数优化算法;Step 3. Parameter optimization strategy in the CY-Net network model: use gradient descent optimization algorithm and cost function optimization algorithm; 步骤四、最后对网络输出的静脉显像图进行静脉提取。Step 4: Finally, perform vein extraction on the vein imaging image output by the network. 2.根据权利要求1所述的一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,所述的步骤一中利用对比度受限的自适应直方图均衡化方法近红外手臂图像进行预处理。2. a kind of visible light image vein imaging method based on convolutional neural network according to claim 1, is characterized in that, utilizes contrast-limited adaptive histogram equalization method near infrared arm image in described step 1 preprocessing. 3.根据权利要求1所述的一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,所述的步骤二具体为:3. a kind of visible light image vein imaging method based on convolutional neural network according to claim 1, is characterized in that, described step 2 is specifically: 2.1,设计的网络整体结构为一个五层的卷积神经网络,每个卷积层后面均跟有一层PReLU激活函数层;其数学表达分别如下:2.1. The overall structure of the designed network is a five-layer convolutional neural network, and each convolutional layer is followed by a layer of PReLU activation function; its mathematical expressions are as follows: f(x)=max(0,x) (1)f(x)=max(0,x) (1) 其中,x为函数输入,f(x)为函数输出,a为斜率;Among them, x is the function input, f(x) is the function output, and a is the slope; 2.2,在卷积层四后设置Dropout层,最后用3×3的卷积核对卷积层5得到的特征图进行卷积得到网络输出图像,计算输出图像与输入的近红外图像的代价函数,使用反向传播梯度下降法反复迭代优化更新网络参数来最小化代价函数,使网络学习到可见光图像和近红外图像之间的映射关系;2.2, set the Dropout layer after the convolutional layer 4, and finally use the 3×3 convolution kernel to convolve the feature map obtained by the convolutional layer 5 to obtain the network output image, and calculate the cost function of the output image and the input near-infrared image, Use the back-propagation gradient descent method to iteratively optimize and update the network parameters to minimize the cost function, so that the network can learn the mapping relationship between the visible light image and the near-infrared image; 2.3,以可见光图像为输入,对应的近红外图像为目标输出,训练一个五层的全卷积神经网络;将网络输出与目标输出的距离作为误差进行反向传播,优化调整网络中的的参数,使误差不断降低,直至网络输出接近目标输出。2.3, take the visible light image as the input and the corresponding near-infrared image as the target output, train a five-layer full convolutional neural network; use the distance between the network output and the target output as the error for back-propagation, and optimize and adjust the parameters in the network , so that the error continues to decrease until the network output is close to the target output. 4.根据权利要求3所述的一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,所述的第一、三层卷积核大小设计为5,其余均为3。4. a kind of visible light image vein imaging method based on convolutional neural network according to claim 3, is characterized in that, described first and third layer convolution kernel size is designed to be 5, and the rest are all 3. 5.根据权利要求1所述的一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,所述的步骤三中代价函数优化算法具体为采用L2正则化来约束参数的范数,即用预测值和实际值之差的平方的一半对m个样本求和后取平均来衡量算法运行情况:5. a kind of visible light image vein imaging method based on convolutional neural network according to claim 1, is characterized in that, in described step 3, cost function optimization algorithm is specifically adopting L2 regularization to constrain the norm of parameter , that is, use the half of the square of the difference between the predicted value and the actual value to sum the m samples and take the average to measure the operation of the algorithm: 式中,W为网络权重,b为网络偏置项,m为样本个数,为网络对第i个样本的目标输出,y(i)为网络对第i个样本的实际输出,In the formula, W is the network weight, b is the network bias term, m is the number of samples, is the target output of the network for the ith sample, y (i) is the actual output of the network for the ith sample, 因此网络求解的最终目标是针对权重W和偏置项b求代价函数的最小值,求解过程对每一个参数进行Gaussian初始化;Therefore, the ultimate goal of the network solution is to find the minimum value of the cost function for the weight W and the bias term b, and the solution process performs Gaussian initialization on each parameter; L2正则化本质上是对网络的权值进行线性衰减,因此L2正则化也称作权值衰减,它是在网络的代价函数式(3)后面再加上一个正则化项,即:L2 regularization is essentially a linear attenuation of the weight of the network, so L2 regularization is also called weight attenuation, which is to add a regularization term to the cost function of the network (3), that is: 式中,加号后面的第二项为正则化项;In the formula, the second term after the plus sign is the regularization term; L2正则化项是对所有参数W的平方求和后取平均再乘上λ/2,其中λ是正则项系数,用来权衡原始的代价函数J(W,b)和正则项所占的比重,加上正则项后再求导,W的更新公式变为:The L2 regularization term is the sum of the squares of all parameters W and then averaged and then multiplied by λ/2, where λ is the regularization term coefficient, which is used to weigh the original cost function J(W, b) and the proportion of the regularity term , add the regular term and then take the derivative, the update formula of W becomes: 式中,第二项为求解参数的偏导数,α表示学习速率;In the formula, the second term is the partial derivative of the solution parameter, and α represents the learning rate; 所以,更新式中W的系数由1变为比1小的值其效果是减小W对权重更新的影响,即权重衰减。Therefore, the coefficient of W in the update formula changes from 1 to a value smaller than 1 The effect is to reduce the effect of W on weight updates, ie weight decay. 6.根据权利要求1所述的一种基于卷积神经网络的可见光图像静脉显像方法,其特征在于,所述的步骤三中梯度下降优化算法对权重和偏置项进行参数更新,更新方式如下:6. A kind of visible light image vein imaging method based on convolutional neural network according to claim 1, it is characterized in that, in described step 3, gradient descent optimization algorithm carries out parameter update to weight and bias term, update method as follows: 式中,表示第l层第j个节点与第l+1层第i个节点之间的连接的权重参数,表示第l+1层第i个节点的偏置项,α表示学习速率,由两个参数更新公式可看出求解参数的偏导数是梯度下降法的关键;代价函数对权重和偏置项求偏导得:In the formula, represents the weight parameter of the connection between the jth node in the lth layer and the ith node in the l+1th layer, Represents the bias term of the i-th node in the l+1th layer, and α represents the learning rate. From the two parameter update formulas, it can be seen that the partial derivative of the solution parameter is the key to the gradient descent method; and bias term Find the partial derivative: 残差是对每层网络的输出节点计算输出值与真实值之差,假设神经网络的输出层是第nl层,则输出层的残差用表示,第l=nl-1,nl-2,K,2层的第i个节点的残差用表示,其计算公式如下:The residual is the difference between the output value and the real value calculated for the output node of each layer of the network. Assuming that the output layer of the neural network is the n lth layer, the residual of the output layer is Represents that the residual of the i-th node of the l=n l -1, n l -2, K, 2 layers is used said, its calculation formula is as follows: 式中,通过第l+1层节点残差的加权平均值计算得来,有了以上两个残差后,则可进一步计算代价函数J(W,b)对权重和偏置项的偏导,即:In the formula, It is calculated by the weighted average of the residuals of the nodes in the l+1 layer. After the above two residuals are obtained, the partial derivatives of the cost function J(W,b) to the weights and bias terms can be further calculated, namely:
CN201810890329.8A 2018-08-07 2018-08-07 Visible light image vein imaging method based on convolutional neural network Active CN109635618B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810890329.8A CN109635618B (en) 2018-08-07 2018-08-07 Visible light image vein imaging method based on convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810890329.8A CN109635618B (en) 2018-08-07 2018-08-07 Visible light image vein imaging method based on convolutional neural network

Publications (2)

Publication Number Publication Date
CN109635618A true CN109635618A (en) 2019-04-16
CN109635618B CN109635618B (en) 2023-03-31

Family

ID=66066247

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810890329.8A Active CN109635618B (en) 2018-08-07 2018-08-07 Visible light image vein imaging method based on convolutional neural network

Country Status (1)

Country Link
CN (1) CN109635618B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110084867A (en) * 2019-04-24 2019-08-02 河北科技大学 Arteriovenous image rebuilding method based on CNN and multi-electrode electromagnetic measurement
CN110472667A (en) * 2019-07-19 2019-11-19 广东工业大学 Small object classification method based on deconvolution neural network
CN110633633A (en) * 2019-08-08 2019-12-31 北京工业大学 A Road Extraction Method Based on Adaptive Threshold from Remote Sensing Image
CN110956596A (en) * 2019-12-09 2020-04-03 深圳元华医疗设备技术有限公司 Image processing method and terminal based on near-infrared imaging
CN111008550A (en) * 2019-09-06 2020-04-14 上海芯灵科技有限公司 Identification method for finger vein authentication identity based on Multiple loss function
CN111178229A (en) * 2019-12-26 2020-05-19 南京航空航天大学 Vein imaging method and device based on deep learning
CN112138249A (en) * 2020-08-24 2020-12-29 同济大学 A method for controlling the needle insertion angle of an intravenous injection robot based on ultrasound evaluation
CN112546463A (en) * 2021-02-25 2021-03-26 四川大学 Radiotherapy dose automatic prediction method based on deep neural network
CN113112446A (en) * 2020-03-05 2021-07-13 成都理工大学 Tunnel surrounding rock level intelligent judgment method based on residual convolutional neural network
CN113112447A (en) * 2020-03-05 2021-07-13 成都理工大学 Tunnel surrounding rock grade intelligent determination method based on VGG convolutional neural network
CN114595740A (en) * 2022-01-13 2022-06-07 南京航空航天大学 A photodetector-based ultra-high-speed ray image recognition method
CN114926680A (en) * 2022-05-13 2022-08-19 山东省计算中心(国家超级计算济南中心) Malicious software classification method and system based on AlexNet network model

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6178340B1 (en) * 1998-08-24 2001-01-23 Eduardo Svetliza Three-dimensional infrared imager for subcutaneous puncture and study of vascular network
CN103337071A (en) * 2013-06-19 2013-10-02 北京理工大学 Device and method for structure-reconstruction-based subcutaneous vein three-dimensional visualization
CN104688184A (en) * 2014-12-05 2015-06-10 南京航空航天大学 Vein imaging method for visible-light skin images

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6178340B1 (en) * 1998-08-24 2001-01-23 Eduardo Svetliza Three-dimensional infrared imager for subcutaneous puncture and study of vascular network
CN103337071A (en) * 2013-06-19 2013-10-02 北京理工大学 Device and method for structure-reconstruction-based subcutaneous vein three-dimensional visualization
CN104688184A (en) * 2014-12-05 2015-06-10 南京航空航天大学 Vein imaging method for visible-light skin images

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110084867B (en) * 2019-04-24 2023-06-20 河北科技大学 Arteriovenous image reconstruction method based on CNN and multi-electrode electromagnetic measurement
CN110084867A (en) * 2019-04-24 2019-08-02 河北科技大学 Arteriovenous image rebuilding method based on CNN and multi-electrode electromagnetic measurement
CN110472667A (en) * 2019-07-19 2019-11-19 广东工业大学 Small object classification method based on deconvolution neural network
CN110472667B (en) * 2019-07-19 2024-01-09 广东工业大学 Small target classification method based on deconvolution neural network
CN110633633A (en) * 2019-08-08 2019-12-31 北京工业大学 A Road Extraction Method Based on Adaptive Threshold from Remote Sensing Image
CN111008550A (en) * 2019-09-06 2020-04-14 上海芯灵科技有限公司 Identification method for finger vein authentication identity based on Multiple loss function
CN110956596A (en) * 2019-12-09 2020-04-03 深圳元华医疗设备技术有限公司 Image processing method and terminal based on near-infrared imaging
CN110956596B (en) * 2019-12-09 2023-08-29 深圳元华医疗设备技术有限公司 Near infrared imaging-based image processing method and terminal
CN111178229A (en) * 2019-12-26 2020-05-19 南京航空航天大学 Vein imaging method and device based on deep learning
CN113112446A (en) * 2020-03-05 2021-07-13 成都理工大学 Tunnel surrounding rock level intelligent judgment method based on residual convolutional neural network
CN113112447A (en) * 2020-03-05 2021-07-13 成都理工大学 Tunnel surrounding rock grade intelligent determination method based on VGG convolutional neural network
CN112138249B (en) * 2020-08-24 2022-02-18 同济大学 Intravenous injection robot control method based on ultrasonic evaluation
CN112138249A (en) * 2020-08-24 2020-12-29 同济大学 A method for controlling the needle insertion angle of an intravenous injection robot based on ultrasound evaluation
CN112546463A (en) * 2021-02-25 2021-03-26 四川大学 Radiotherapy dose automatic prediction method based on deep neural network
CN114595740A (en) * 2022-01-13 2022-06-07 南京航空航天大学 A photodetector-based ultra-high-speed ray image recognition method
CN114926680A (en) * 2022-05-13 2022-08-19 山东省计算中心(国家超级计算济南中心) Malicious software classification method and system based on AlexNet network model

Also Published As

Publication number Publication date
CN109635618B (en) 2023-03-31

Similar Documents

Publication Publication Date Title
CN109635618A (en) Visible images vein developing method based on convolutional neural networks
CN110084794B (en) A skin cancer image recognition method based on attention convolutional neural network
US9317761B2 (en) Method and an apparatus for determining vein patterns from a colour image
CN110507335A (en) Method and system for assessing the mental health status of prisoners based on multi-modal information
CN106682389B (en) A kind of Eye disease for monitoring hypertension initiation is health management system arranged
CN108960257A (en) A kind of diabetic retinopathy grade stage division based on deep learning
CN113627401A (en) EMG gesture recognition method with feature pyramid network fused with dual attention mechanism
CN110569916A (en) Adversarial example defense system and method for artificial intelligence classification
CN113296087A (en) Frequency modulation continuous wave radar human body action identification method based on data enhancement
CN108710950A (en) A kind of image quantization analysis method
CN109344878A (en) A small target recognition method based on ResNet imitating eagle brain feature integration
CN111815562A (en) A Retinal Vessel Segmentation Method Combining U-Net and Adaptive PCNN
CN115760586A (en) A medical image enhancement method based on multi-scale attention generative adversarial network
CN111724370A (en) A multi-task no-reference image quality assessment method and system based on uncertainty and probability
Sunanthini et al. Comparison of CNN algorithms for feature extraction on fundus images to detect glaucoma
CN113989217A (en) Human eye diopter detection method based on deep learning
KR102028797B1 (en) System and method of diagnosing linguistic ability for early detection of neurodegenerative diseases, and computer readable medium for performing the method
CN102867296B (en) The Method of Medical Image Fusion of cortex model is sent based on pulse
Zahedinasab et al. Using deep convolutional neural networks with adaptive activation functions for medical ct brain image classification
CN107679487A (en) Missing Persons' discrimination method and system
Zou et al. WGAN-based image denoising algorithm
CN109214286A (en) Face identification method based on the fusion of deep neural network multilayer feature
Lyu et al. A local characteristic image restoration based on convolutional neural network
Kashihara Deep convolutional neural networks improve vein image quality
Antony et al. T2FRF Filter: an Effective Algorithm for the Restoration of Fingerprint Images

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant