Abstract
This paper proposes a new two-dimensional manifold learning algorithm called two-dimensional joint local/nonlocal discriminant analysis (2DJLNDA) for 2D image feature extraction, which directly extracts projective vectors from 2D image matrices rather than image vectors. Different from other typical 2D methods, e.g., two-dimensional principal component analysis (2DPCA), two-dimensional linear discriminative analysis (2DLDA), two-dimensional locality-preserving projection (2DLPP), 2DJLNDA preserves not only local/nonlocal intrinsic structure but also local/nonlocal penalization structure of the image data in the high-dimensional space, which can be powerful in extracting intrinsic information of the image data in the low-dimensional space. The experimental results on the ORL, Yale, AR and UMIST face datasets indicate that 2DJLNDA is capable of extracting effective image features and outperforms 2DPCA, 2DLDA and 2DLPP. The 2D image features extracted by 2DJLNDA further improve the performance of deep neural networks (DNNs), e.g., stacked denoising autoencoder, and convolutional neural network (CNN) significantly. These studying results illustrate that the feature face images will provide more discriminant features than the original face images for DNNs. Therefore, 2DJLNDA-based 2D feature image extraction can be used as an effective preprocessing of DNNs (e.g., CNN) for face recognition.




















Similar content being viewed by others
References
Du S, Ward RK (2010) Adaptive region-based image enhancement method for robust face recognition under variable illumination conditions. IEEE Trans Circuits Syst Video Technol 20(9):1165–1175
Deepa GM, Keerthi R, Meghana N, Manikantan K (2012) Face recognition using spectrum-based feature extraction. Appl Soft Comput 12(9):2913–2923
Luh GC, Lin CY (2011) PCA based immune networks for human face recognition. Appl Soft Comput 11(2):1743–1752
Arashloo SR, Kittler J (2014) Class-specific kernel fusion of multiple descriptors for face verification using multiscale binarised statistical image features. IEEE Trans Inf Forensics Secur 9(12):2100–2109
He X, Yan S, Hu Y, Niyogi P, Zhang HJ (2005) Face recognition using Laplacianfaces. IEEE Trans Pattern Anal Mach Intell 27(3):328–340
Nanni L, Lumini A, Brahnam S (2017) Ensemble of texture descriptors for face recognition obtained by varying feature transforms and preprocessing approaches. Appl Soft Comput 61:8–16
Sirovich L, Kirby M (1987) Low-dimensional procedure for the characterization of human faces. J Opt Soc Am A 4(3):519–524
Turk M, Pentland AP (1991) Face recognition using eigenfaces. In: IEEE conference on computer vision and pattern Recognition, pp 586–591
Wold S, Esbensen K, Geladi P (1987) Principal component analysis. Chemom Intell Lab Syst 2(1–3):37–52
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Mika S, Ratsch G, Weston J, Scholkopf B, Mullers KR (1999) Fisher discriminant analysis with kernels. In: Proceedings of the 1999 IEEE signal processing society workshop, pp 41–48
Chang Y, Hu C, Turk M (2003) Manifold of facial expression. In: IEEE international workshop on analysis and modeling of faces and gestures, pp 28–35
Lee KC, Ho J, Yang MH, Kriegman D (2003) Video-based face recognition using probabilistic appearance manifolds. Computer vision and pattern recognition. In: IEEE computer society conference on computer vision and pattern recognition, pp 313–320
Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Belkin M, Niyogi P (2001) Laplacian Eigenmaps and spectral techniques for Embedding and clustering. In: Proceedings conference of advances in neural information processing system, vol 14, pp 585–591
He X, Niyogi P (2003) Locality preserving projections. In: Proceedings conference of advances in neural information processing system, pp 1–6
Yu W, Teng X, Liu C (2006) Face recognition using discriminant locality preserving projections. Image Vis Comput 24(3):239–248
Kokiopoulou E, Saad Y (2005) Orthogonal neighborhood preserving projections. In: Fifth IEEE international conference on data mining, pp 1–8
Gui J, Jia W, Zhu L, Wang SL, Huang DS (2010) Locality preserving discriminant projections for face and palmprint recognition. Neurocomputing 73(13):2696–2707
Krzanowski WJ, Jonathan P, McCarthy WV, Thomas MR (1995) Discriminant analysis with singular covariance matrices: methods and applications to spectroscopic data. Appl Stat 44(1):101–115
Xu Y, Song F, Feng G, Zhao Y (2010) A novel local preserving projection scheme for use with face recognition. Expert Syst Appl 37(9):6718–6721
Ding J, Wen C, Li G, Chua C (2016) Locality sensitive batch feature extraction for high-dimensional data. Neurocomputing 171:664–672
Yu J (2012) Semiconductor manufacturing process monitoring using gaussian mixture model and Bayesian method with local and nonlocal information. IEEE Trans Semicond Manuf 25(3):480–493
Yu J, Lu X (2016) Wafer map defect detection and recognition using joint local and nonlocal linear discriminant analysis. IEEE Trans Semicond Manuf 29(1):33–43
Bian W, Tao D (2011) Max–min distance analysis by using sequential SDP relaxation for dimension reduction. IEEE Trans Pattern Anal Mach Intell 33(5):1037–1050
Xu C, Tao D, Ru Y (2014) Large-margin weakly supervised dimensionality reduction. In: Proceedings of the 31st international conference on machine learning (ICML-14), pp 865–873
Zhang Y, Zhou ZH (2010) Multilabel dimensionality reduction via dependence maximization. ACM Trans Knowl Discov Data (TKDD) 4(3):14–22
Xu C, Tao D, Xu C (2015) Multi-view intact space learning. IEEE Trans Pattern Anal Mach Intell 37(12):2531–2544
Luo Y, Tao D, Ramamohanarao K, Xu C, Wen Y (2015) Tensor canonical correlation analysis for multi-view dimension reduction. IEEE Trans Knowl Data Eng 27(11):3111–3124
Qiao L, Chen S, Tan X (2010) Sparsity preserving projections with applications to face recognition. Pattern Recogn 43(1):331–341
Hu D, Feng G, Zhou Z (2007) Two-dimensional locality preserving projections (2DLPP) with its application to palmprint recognition. Pattern Recogn 40(1):339–342
Yang J, Zhang D, Frangi AF, Yang J (2004) Two-dimensional PCA: a new approach to appearance-based face representation and recognition. IEEE Trans Pattern Anal Mach Intell 26(1):131–137
Zhang H, Ho JK, Wu QJ, Ye Y (2013) Multidimensional latent semantic analysis using term spatial information. IEEE Trans Cybern 43(6):1625–1640
Li M, Yuan B (2005) 2D-LDA: a statistical linear discriminant analysis for image matrix. Pattern Recogn Lett 26(5):527–532
Chen S, Zhao H, Kong M, Luo B (2007) 2D-LPP: a two-dimensional extension of locality preserving projections. Neurocomputing 70(4):912–921
Gao Q, Hao X, Zhao Q, ShenW Ma J (2013) Feature extraction using two-dimensional neighborhood margin and variation embedding. Comput Vis Image Underst 117(5):525–531
Pan X, Ruan QQ (2008) Palmprint recognition with improved two-dimensional locality preserving projections. Image Vis Comput 26(9):1261–1268
Shikkenawis G, Mitra SK (2016) 2D orthogonal locality preserving projection for image denoising. IEEE Trans Image Process 25(1):262–273
Zhang H, Wu QM, Chow WS, Zhao M (2012) A two-dimensional Neighborhood Preserving Projection for appearance-based face recognition. Pattern Recogn 45(5):1866–1876
Jing X, Sui Z, Yao Y, Sun J (2012) Two-dimensional sparse preserving projection for face recognition. In: International conference on automatic control and artificial intelligence (ACAI 2012) pp 2223–2226
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst 25:1097–1105
Wang Y, Wang X, Liu W (2016) Unsupervised local deep feature for image recognition. Inf Sci 351:67–75
Zhan S, Tao QQ, Li XH (2016) Face detection using representation learning. Neurocomputing 187:19–26
Zhang H, Cao X, Ho JK, Chow TW (2017) Object-level video advertising: an optimization framework. IEEE Trans Ind Inf 13(2):520–531
Siniscalchi SM, Yu D, Deng L, Lee CH (2013) Exploiting deep neural networks for detection-based speech recognition. Neurocomputing 106:148–157
TaigmanY, Yang M, Ranzato MA, Wolf L (2014) Deepface: closing the gap to human-level performance in face verification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1701–1708
Zhu Z, Luo P, Wang X, Tang X (2014) Recover canonical-view faces in the wild with deep neural networks, pp 1–10. arXiv preprint arXiv:1404.3543
SunY, LiangD, WangX, TangX (2015) Deepid3: face recognition with very deep neural networks. arXiv preprint arXiv:1502.00873
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 815–823
Al WSE, Banerjee S (2016) Report on the BTAS 2016 video person recognition evaluation. In: 2016 IEEE 8th international conference on biometrics theory, applications and systems (BTAS), IEEE, pp 1–8
Ding C, Choi J, Tao D, Davis L (2016) Multi-directional multi-level dual-cross patterns for robust face recognition. IEEE Trans Pattern Anal Mach Intell 38(3):518–531
Lu C, Tang X (2015) Surpassing human-level face verification performance on LFW with Gaussian Face. In: Twenty-ninth AAAI conference on artificial intelligence, pp 3811–3819
Ding C, Xu C, Tao D (2015) Multi-task pose-invariant face recognition. IEEE Trans Image Process 24(3):980–993
Samaria F, Harter AC (1994) Parameterisation of a stochastic model for human face identification. In: IEEE workshop on applications of computer vision, pp 138–142. Sarasota, USA
Graham DB, Allinson NM (1998) Characterizing virtual eigensignatures for general purpose face recognition. Face Recognit Theory Appl 163:446–456
Martinez AM, Benavente R (1998) The AR face database. CVC Technical Report, vol 24. Hammond, IN
Juliana VA, Andrés ÁM, Genaro DS, Germán DC (2009) Automatic choice of the number of nearest neighbors in locally linear embedding. In: Proceedings 14th CIARP, vol 5856, pp 77–84
Kouropteva O, Okun O, Pietikainen M (2002) Selection of the optimal parameter value for the locally linear embedding algorithm. In: Proceedings 1st International conference fuzzy system knowledge discovery, pp 359–363
Acknowledgements
This research was financially supported by National Natural Science Foundation of China (No. 71777173) and Fundamental Research Funds for the Central Universities.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yu, J., Liu, H. & Zheng, X. Two-dimensional joint local and nonlocal discriminant analysis-based 2D image feature extraction for deep learning. Neural Comput & Applic 32, 6009–6024 (2020). https://doi.org/10.1007/s00521-019-04085-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-019-04085-0