Overview
- Focuses on visualization and imputation methods for missing values and practical applications in R
- Describes the advantages, disadvantages, and pitfalls of each imputation method
- Presents modern robust and deep learning-based imputation methods and solutions for complex data
Part of the book series: Statistics and Computing (SCO)
Access this book
Tax calculation will be finalised at checkout
Other ways to access
About this book
This book explores visualization and imputation techniques for missing values and presents practical applications using the statistical software R. It explains the concepts of common imputation methods with a focus on visualization, description of data problems and practical solutions using R, including modern methods of robust imputation, imputation based on deep learning and imputation for complex data. By describing the advantages, disadvantages and pitfalls of each method, the book presents a clear picture of which imputation methods are applicable given a specific data set at hand.
The material covered includes the pre-analysis of data, visualization of missing values in incomplete data, single and multiple imputation, deductive imputation and outlier replacement, model-based methods including methods based on robust estimates, non-linear methods such as tree-based and deep learning methods, imputation of compositional data, imputation quality evaluation from visual diagnostics to precision measures, coverage rates and prediction performance and a description of different model- and design-based simulation designs for the evaluation. The book also features a topic-focused introduction to R and R code is provided in each chapter to explain the practical application of the described methodology.
Addressed to researchers, practitioners and students who work with incomplete data, the book offers an introduction to the subject as well as a discussion of recent developments in the field. It is suitable for beginners to the topic and advanced readers alike.
Keywords
- missing data
- imputation of missing data
- visualization of missing values
- R package
- incomplete data
- simulation
- multiple imputation
- robust imputation methods
- imputation methods for compositional data
- deep learning based imputation methods
- imputation quality
- simulation designs for imputation quality evaluation
- pre-analysis of data
Table of contents (12 chapters)
Authors and Affiliations
About the author
Matthias Templ is a Professor at the Institute for Competitiveness and Communication at the University of Applied Sciences and Arts Northwestern Switzerland in Olten, and a lecturer at ETH Zurich and the Vienna University of Technology, where he was awarded the venia docendi (habilitation) in statistics. His main research interests include computational statistics, compositional data analysis, robust statistics, imputation of missing values and anonymization of data. He is the Editor-in-Chief of the Austrian Journal of Statistics and (co-)author of four books including Statistical Disclosure Control for Microdata and Applied Compositional Data Analysis. He is also an author and the maintainer of several R packages, such as the R package sdcMicro for statistical disclosure control, the package robCompositions for robust analysis of compositional data, the simPop package for simulation of synthetic data, and the VIM package for visualization and imputation of missing values.
Bibliographic Information
Book Title: Visualization and Imputation of Missing Values
Book Subtitle: With Applications in R
Authors: Matthias Templ
Series Title: Statistics and Computing
DOI: https://doi.org/10.1007/978-3-031-30073-8
Publisher: Springer Cham
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive licence to Springer Nature Switzerland AG 2023
Hardcover ISBN: 978-3-031-30072-1Published: 30 November 2023
Softcover ISBN: 978-3-031-30075-2Due: 13 December 2024
eBook ISBN: 978-3-031-30073-8Published: 29 November 2023
Series ISSN: 1431-8784
Series E-ISSN: 2197-1706
Edition Number: 1
Number of Pages: XXII, 462
Number of Illustrations: 24 b/w illustrations, 119 illustrations in colour
Topics: Visualization, Statistical Theory and Methods, Data Structures and Information Theory, Artificial Intelligence, Machine Learning, Applied Statistics