[go: up one dir, main page]

Open In App

tidyr Package in R Programming

Last Updated : 06 Aug, 2020
Summarize
Comments
Improve
Suggest changes
Like Article
Like
Save
Share
Report
News Follow

Packages in the R language are a collection of R functions, compiled code, and sample data. They are stored under a directory called “library” in the R environment. By default, R installs a set of packages during installation.  One of the most important packages in R is the tidyr package. The sole purpose of the tidyr package is to simplify the process of creating tidy data. Tidy data describes a standard way of storing data that is used wherever possible throughout the tidyverse. If you once make sure that your data is tidy, you’ll spend less time punching with the tools and more time working on your analysis.

Installation 

To use a package in R programming one must have to install the package first. This task can be done using the command install.packages(“packagename”). To install the whole tidyverse package type this:

install.packages("tidyverse")

installing tidyverse

Alternatively, to install just tidyr package type this:

install.packages("tidyr")

To install the development version from GitHub type this:

# install.packages("devtools")
devtools::install_github("tidyverse/tidyr")

Important Verb Functions in tidyr Package

The Dataset:

Before going to the important verb function let’s prepare the data set first. Define a dataset tidy_dataframe that contains data about the frequency of people in a particular group.

R




# load the tidyr package
library(tidyr)
  
n = 10
# creating a data frame
tidy_dataframe = data.frame(
                      S.No = c(1:n), 
                    Group.1 = c(23, 345, 76, 212, 88, 
                                199, 72, 35, 90, 265),
                    Group.2 = c(117, 89, 66, 334, 90, 
                               101, 178, 233, 45, 200),
                    Group.3 = c(29, 101, 239, 289, 176,
                                320, 89, 109, 199, 56))
  
# print the elements of the data frame
tidy_dataframe


Output:

    S.No Group.1 Group.2 Group.3
1     1      23     117      29
2     2     345      89     101
3     3      76      66     239
4     4     212     334     289
5     5      88      90     176
6     6     199     101     320
7     7      72     178      89
8     8      35     233     109
9     9      90      45     199
10   10     265     200      56

tidyr package provides various important functions that can be used for Data Cleaning. Those are:

  • gather() function: It takes multiple columns and gathers them into key-value pairs. Basically it makes “wide” data longer. The gather() function will take multiple columns and collapse them into key-value pairs, duplicating all other columns as needed.

Syntax:        

gather(data, key = “key”, value = “value”, …, na.rm = FALSE, convert = FALSE, factor_key = FALSE) 

Parameter

Description

data the data frame.
key, value

the names of new key and value columns, 

as strings or as symbols.

…….

the selection of columns. If left empty, all variables are selected. 

You can supply bare variable names, select all variables between

 x and z with x:z, exclude y with -y.

na.rm if set TRUE, it will remove rows from output where the value column is NA.
convert

is set TRUE, it will automatically run type.convert() on the key column. 

This is useful if the column types are actually numeric,

 integer, or logical.

factor_key

if FALSE, the default, the key values will be stored as a character vector.

 If TRUE, will be stored as a factor, which preserves

the original ordering of the columns.

Example: 

Now for a better understanding, we will make our data long with gather() function.

R




# using gather() function on tidy_dataframe
long <- tidy_dataframe %>% 
            gather(Group, Frequency,
                   Group.1:Group.3)
  
# print the data frame in a long format
long


 Output:

    S.No  Group   Frequency
1     1 Group.1        23
2     2 Group.1       345
3     3 Group.1        76
4     4 Group.1       212
5     5 Group.1        88
6     6 Group.1       199
7     7 Group.1        72
8     8 Group.1        35
9     9 Group.1        90
10   10 Group.1       265
11    1 Group.2       117
12    2 Group.2        89
13    3 Group.2        66
14    4 Group.2       334
15    5 Group.2        90
16    6 Group.2       101
17    7 Group.2       178
18    8 Group.2       233
19    9 Group.2        45
20   10 Group.2       200
21    1 Group.3        29
22    2 Group.3       101
23    3 Group.3       239
24    4 Group.3       289
25    5 Group.3       176
26    6 Group.3       320
27    7 Group.3        89
28    8 Group.3       109
29    9 Group.3       199
30   10 Group.3        56
  • separate() function: It converts longer data to a wider format. The separate() function turns a single character column into multiple columns.

   Syntax:

   separate(data, col, into, sep = ” “, remove = TRUE, convert = FALSE)

Parameter

Description

data A data frame.
col Column name or position.
into

Names of new variables to create as character vector. 

Use NA to omit the variable in the output.

sep The separator between the columns.
remove If set TRUE, it will remove input column from the output data frame.
convert If TRUE, will run type.convert() with as.is = TRUE on new columns.

Example: 

We can say that the long datasets created using gather() is appropriate for use, but we can break down Group variable even further using separate()

R




# import tidyr package
library(tidyr)
long <- tidy_dataframe %>%
            gather(Group, Frequency,
                   Group.1:Group.3)
  
# use separate() function to make data wider
separate_data <- long %>% 
            separate(Group, c("Allotment"
                              "Number"))
  
# print the wider format
separate_data


Output:

   S.No Allotment Number Frequency
1     1     Group      1        23
2     2     Group      1       345
3     3     Group      1        76
4     4     Group      1       212
5     5     Group      1        88
6     6     Group      1       199
7     7     Group      1        72
8     8     Group      1        35
9     9     Group      1        90
10   10     Group      1       265
11    1     Group      2       117
12    2     Group      2        89
13    3     Group      2        66
14    4     Group      2       334
15    5     Group      2        90
16    6     Group      2       101
17    7     Group      2       178
18    8     Group      2       233
19    9     Group      2        45
20   10     Group      2       200
21    1     Group      3        29
22    2     Group      3       101
23    3     Group      3       239
24    4     Group      3       289
25    5     Group      3       176
26    6     Group      3       320
27    7     Group      3        89
28    8     Group      3       109
29    9     Group      3       199
30   10     Group      3        56
  • unite() function: It merges two columns into one column. The unite() function is a convenience function to paste together multiple variable values into one. In essence, it combines two variables of a single observation into one variable.

Syntax: 

unite(data, col, …, sep = “_”, remove = TRUE)

Parameter

Description

data A data frame.
col The name of the new column.
…. A selection of desired columns. If empty, all variables are selected. 
sep A separator to use between values.
remove If TRUE, remove input columns from output data frame.

Example:

Unite is the compliment of separate. To undo separate(), we can use unite(), which merges two variables into one. Here we will merge two columns Group and Number with a separator “.”

R




# import tidyr package 
library(tidyr)
  
long <- tidy_dataframe %>%
            gather(Group, Frequency, 
                   Group.1:Group.3)
  
# use separate() function to make data wider
separate_data <- long %>% 
            separate(Group, c("Allotment",
                              "Number"))
  
# use unite() function to glue 
# Allotment and Number columns
unite_data <- separate_data %>%
            unite(Group, Allotment, 
                  Number, sep = ".")
  
# print the new data frame
unite_data


Output:

   S.No   Group Frequency
1     1 Group.1        23
2     2 Group.1       345
3     3 Group.1        76
4     4 Group.1       212
5     5 Group.1        88
6     6 Group.1       199
7     7 Group.1        72
8     8 Group.1        35
9     9 Group.1        90
10   10 Group.1       265
11    1 Group.2       117
12    2 Group.2        89
13    3 Group.2        66
14    4 Group.2       334
15    5 Group.2        90
16    6 Group.2       101
17    7 Group.2       178
18    8 Group.2       233
19    9 Group.2        45
20   10 Group.2       200
21    1 Group.3        29
22    2 Group.3       101
23    3 Group.3       239
24    4 Group.3       289
25    5 Group.3       176
26    6 Group.3       320
27    7 Group.3        89
28    8 Group.3       109
29    9 Group.3       199
30   10 Group.3        56
  • spread() function: It helps in reshaping a longer format to a wider format. The spread() function spreads a key-value pair across multiple columns.

Syntax:

spread(data, key, value, fill = NA, convert = FALSE)

Parameter

Description

data A data frame.
key Column names or positions.
value Column names or positions.
fill If set, missing values will be replaced with this value. 
convert If TRUE, type.convert() with asis = TRUE will be run on each of the new columns.

Example:

We can transform the data from long back to wide with the spread() function. 

R




# import tidyr package 
library(tidyr)
  
long <- tidy_dataframe %>% 
            gather(Group, Frequency,
                   Group.1:Group.3)
  
# use separate() function to make data wider
separate_data <- long %>% 
            separate(Group, c("Allotment"
                              "Number"))
  
# use unite() function to glue
# Allotment and Number columns
unite_data <- separate_data %>% 
            unite(Group, Allotment,
                  Number, sep = ".")
  
# use unite() function to make data wider
back_to_wide <- unite_data %>% 
            spread(Group, Frequency)
  
# print the new data frame
back_to_wide


Output:

   S.No Group.1 Group.2 Group.3
1     1      23     117      29
2     2     345      89     101
3     3      76      66     239
4     4     212     334     289
5     5      88      90     176
6     6     199     101     320
7     7      72     178      89
8     8      35     233     109
9     9      90      45     199
10   10     265     200      56
  • nest() function: It creates a list of data frames containing all the nested variables. Nesting is implicitly a summarizing operation. This is useful in conjunction with other summaries that work with whole datasets, most notably models.

Syntax: nest(data, …, .key = “data”)

Parameter

Description

data A data frame.
…. A selection of columns. If empty, all variables are selected.
.key The name of the new column, as a string or symbol.

Example: Let’s try to nest Group.2 column from the tidy_dataframe we created in the data set.

R




# import tidyr package
library(tidyr)
  
df <- tidy_dataframe
  
# nest column Group.1 in 
# tidy_dataframe using nest()
df %>% nest(data = c(Group.1))


Output:

# A tibble: 10 x 4
    S.No Group.1 Group.3 data            
   <int>   <dbl>   <dbl> <list>          
 1     1      23      29 <tibble [1 x 1]>
 2     2     345     101 <tibble [1 x 1]>
 3     3      76     239 <tibble [1 x 1]>
 4     4     212     289 <tibble [1 x 1]>
 5     5      88     176 <tibble [1 x 1]>
 6     6     199     320 <tibble [1 x 1]>
 7     7      72      89 <tibble [1 x 1]>
 8     8      35     109 <tibble [1 x 1]>
 9     9      90     199 <tibble [1 x 1]>
10    10     265      56 <tibble [1 x 1]>
  • unnest() function: It basically reverses the nest operation. It makes each element of the list its own row. It can handle list columns that contain atomic vectors, lists, or data frames (but not a mixture of the different types).

Syntax:

unnest(data, …, .drop = NA, .id = NULL, .sep = NULL, .preserve = NULL)

Parameter

Description

data A data frame
…. Specification of columns to unnest.  If omitted, defaults to all list-columns.
.drop

Should additional list columns be dropped? By default,

it will drop them if unnesting 

the specified columns requires the rows to be duplicated.

.id Data frame identifier.
.sep

If non-NULL, the names of unnested data frame columns 

will combine the name of the original list-col with

the names from nested data frame, separated by .sep.

.preserve

List-columns to preserve in the output. These will be

 duplicated in the same way as atomic vectors.

Example:

We will try to nest and unnest Species column in the iris dataframe in the tidyr package.

R




# import the tidyr package
library(tidyr)
  
df <- iris
names(iris)
  
# nesting the species column in 
# the df data frame using nest()
head(df %>% nest(data = c(Species)))  # Output (i)
  
# unnesting the species column 
# in the df data frame using unnest()
head(df %>% unnest(Species,.drop = NA,
                   .preserve = NULL)) # Output (ii)


Output (i): 

# A tibble: 6 x 5
  Sepal.Length Sepal.Width Petal.Length Petal.Width data            
         <dbl>       <dbl>        <dbl>       <dbl> <list>          
1          5.1         3.5          1.4         0.2 <tibble [1 x 1]>
2          4.9         3            1.4         0.2 <tibble [1 x 1]>
3          4.7         3.2          1.3         0.2 <tibble [1 x 1]>
4          4.6         3.1          1.5         0.2 <tibble [1 x 1]>
5          5           3.6          1.4         0.2 <tibble [1 x 1]>
6          5.4         3.9          1.7         0.4 <tibble [1 x 1]>

Output (ii):

# A tibble: 6 x 5
  Sepal.Length Sepal.Width Petal.Length Petal.Width Species
         <dbl>       <dbl>        <dbl>       <dbl> <fct>  
1          5.1         3.5          1.4         0.2 setosa 
2          4.9         3            1.4         0.2 setosa 
3          4.7         3.2          1.3         0.2 setosa 
4          4.6         3.1          1.5         0.2 setosa 
5          5           3.6          1.4         0.2 setosa 
6          5.4         3.9          1.7         0.4 setosa 
  • fill() function: Used to fill in the missing values in selected columns using the previous entry. This is useful in the common output format where values are not repeated, they’re recorded each time they change. Missing values are replaced in atomic vectors; NULL is replaced in the list.

Syntax: 

fill(data, …, .direction = c(“down”, “up”))

Parameter

Description

data A data frame.
…. A selection of columns. If empty, nothing happens.
direction Direction in which to fill missing values. Currently, either “down” (the default) or “up”

Example:

R




# import the tidyr package
df <- data.frame(Month = 1:6, 
                 Year = c(2000, rep(NA, 5)))
  
# print the df data frame
df                   # Output (i)
  
# use fill() to fill missing values in 
# Year column in df data frame
df %>% fill(Year)    # Output (ii)


Output (i):

  Month Year
1     1 2000
2     2   NA
3     3   NA
4     4   NA
5     5   NA
6     6   NA

Output (ii):

  Month Year
1     1 2000
2     2 2000
3     3 2000
4     4 2000
5     5 2000
6     6 2000
  • full_seq() function: It basically fills the missing values in a vector which should have been observed but weren’t. The vector should be numeric.

Syntax: full_seq(x, period, tol = 1e-06)

Parameter

Description

x A numeric vector.
period Gap between each observation.
tol Numerical tolerance for checking periodicity.

Example: 

R




# import the tidyr package
library(tidyr)
  
# creating a numeric vector
num_vec <- c(1, 7, 9, 14, 19, 20)
  
# use full_seq() to fill missing
# values in num_vec
full_seq(num_vector, 1)


Output:

[1]  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20
  • drop_na() function: This function drops rows containing missing values.

Syntax: drop_na(data, …)

Parameter

Description

 data  A data frame.
 ….  A selection of columns. If empty, all variables are selected.

Example:

R




# import tidyr package
library(tidyr)
  
# create a tibble df with missing values
df <- tibble(S.No = c(1:10),
             Name = c('John', 'Smith', 'Peter',
                      'Luke', 'King', rep(NA, 5)))
  
# print df tibble
df                    # Output (i)
  
# use drop_na() to drop columns 
# in df with missing values 
df %>% drop_na(Name)  # Output (ii)


Output (i):

# A tibble: 10 x 2
    S.No Name 
   <int> <chr>
 1     1 John 
 2     2 Smith
 3     3 Peter
 4     4 Luke 
 5     5 King 
 6     6 <NA> 
 7     7 <NA> 
 8     8 <NA> 
 9     9 <NA> 
10    10 <NA> 

Output (ii):

# A tibble: 5 x 2
   S.No Name 
  <int> <chr>
1     1 John 
2     2 Smith
3     3 Peter
4     4 Luke 
5     5 King 
  • replace_na() function: It replaces missing values.

Syntax: replace_na(data, replace, …)

Parameter

Description

data A data frame.
replace

If data is a data frame, returns a data frame. If data is a vector, 

returns a vector of class determined by the union of data and replace.

Example:

R




# import tidyr package
library(tidyr)
  
df <- data.frame(S.No = c(1:10),
                 Name = c('John', 'Smith'
                          'Peter', 'Luke',
                          'King', rep(NA, 5)))
  
df                                      # Output (i)
  
# use replace_na() to replace missing values or na
df %>% replace_na(list(Name = 'Henry')) # Output (ii)


Output (i):

# A tibble: 10 x 2
     S.No Name 
   <int> <chr>
 1     1 John 
 2     2 Smith
 3     3 Peter
 4     4 Luke 
 5     5 King 
 6     6 <NA> 
 7     7 <NA> 
 8     8 <NA> 
 9     9 <NA> 
10    10 <NA> 

Output (ii):

    S.No  Name
1     1  John
2     2 Smith
3     3 Peter
4     4  Luke
5     5  King
6     6 Henry
7     7 Henry
8     8 Henry
9     9 Henry
10   10 Henry


Previous Article
Next Article

Similar Reads

How to Install tidyr in Anaconda
The tidyr package is a crucial tool in the R programming language for data cleaning and tidying. If you're using Anaconda, a popular open-source distribution for Python and R, you can easily manage and install packages, including tidyr. This guide will walk you through the steps to install tidyr in Anaconda. What is Anaconda?Anaconda is a distribut
2 min read
Esquisse Package in R Programming
Packages in the R programming are a collection of R functions, compiled code, and sample data. They are stored under a directory called “library” in the R environment. By default, R installs a set of packages during installation. One of the most important packages in R is the Esquisse package. Esquisse package helps to explore and visualize your da
6 min read
Forcats Package in R Programming
'forcats' is a package in R Programming that provides tools for working with categorical (factor) variables. Categorical variables are variables that take on a limited, discrete set of values or levels, and are used to represent qualitative or nominal data. Some examples of categorical variables include gender (male/female), race (white/black/asian
9 min read
Plyr Package in R Programming
Data analysis is an essential part of any research or business process, and R is a popular programming language used for data analysis and statistical computing. One of the significant advantages of R Programming is the availability of packages that can be used to perform complex data manipulation tasks. Plyr is one such package that can be used to
8 min read
Shiny Package in R Programming
The package is an appropriate way to organize the work and share it with others. Packages in the R Programming language are a collection of R functions, compiled code, and sample data. They are stored under a directory called “library” in the R environment. By default, R installs a set of packages during installation. One of the most important pack
12 min read
dplyr Package in R Programming
In this article, we will discuss Aggregating and analyzing data with dplyr package in the R Programming Language. dplyr Package in R The dplyr package in R Programming Language is a structure of data manipulation that provides a uniform set of verbs, helping to resolve the most frequent data manipulation hurdles. By limiting the choices the focus c
6 min read
What is the Best Package for Manipulating Dates in the R Programming Language?
The best package for manipulating dates in R is lubridate. lubridate simplifies working with dates and times, making it easier to parse, manipulate, and perform calculations on date-time objects. This package is widely used in the R community due to its intuitive functions that address common challenges when handling dates, such as converting betwe
2 min read
Deploying a basic Streamlit app using Shiny package in R
In this article, we will be discussing the steps to deploy a basic Streamlit app using the Shiny package in R. Streamlit is a Python library that allows you to build interactive web applications for machine learning and data science. Shiny is an R package that allows you to build interactive web applications for data science. By combining these two
4 min read
How to Plot in 3D clusters using plotly package
R-Language is widely used in Data Science, Data Visualization Data Analysis many more, etc. Plotly package is highly rich in plotting various graphs, These graphs/charts are highly interactive and user-friendly. The 3D cluster visualizes the similarity between variables as 3-D spatial relationships. Each point in the graph represents an individual
2 min read
psych Package in R
The "psych" package is an R package that provides various functions for psychological research and data analysis. It contains tools for data visualization, factor analysis, reliability analysis, correlation analysis, and more. In this article, we will discuss the basics of using the "psych" package in R Programming Language. Introduction to Psych P
6 min read
Density Plots Using Lattice Package in R
The density plots are mainly used to visualize the distribution of continuous numeric variables. The densityplot() uses kernel density probability estimate to calculate the density probability of numeric variables. In R Programming, the density plot can be plotted using the densityplot() function which is present in the lattice package. Syntax: den
2 min read
Car package in R
In R, there are several packages available that provide functions and tools to perform various tasks related to data analysis and visualization. The "car" package is one of the packages that provide functions and tools for regression analysis. In this article, we will provide a comprehensive and in-depth guide to the "car" package in R, including h
9 min read
Desctools Package in R
The DescTools package in R programming is a collection of functions that are used in various scenarios where data description, summary, and exploration are needed. It is a widely used package that was designed to help data scientists, researchers, and data analyst to understand their data and identify their findings. The DescTools package comes wit
6 min read
What is the Glmnet package in R?
R is a popular programming language used for statistical computing, machine learning, and data analysis. The Glmnet package in R is a tool used for fitting linear and logistic regression models with L1 and L2 regularization. Regularization is a method for preventing overfitting in models by introducing a penalty term to the objective function. The
5 min read
Rank variable by group using Dplyr package in R
In this article, we are going to see how to rank the variable by group using dplyr in R Programming Language. The dplyr package in R is used to perform mutations and data manipulations in R. It is particularly useful for working with data frames and data tables. The package can be downloaded and installed into the working directory using the follow
2 min read
Data Visualization using GoogleVis Package
GoogleVis is a package in R that is used to act as an interface between R and the Google API to produce interactive charts which can be easily embedded into web pages. This package helps the user to plot the data without uploading them into google. In this article let's parse through some charts that can be plotted using googleVis in R Programming
5 min read
Introduction to emojifont package in R
In this article, we will see how to use emojifont package in R. This package is designed to bring emoji font to R Programming Language. We will have a look at how to search and print emoji using emojifont package. Installation: To install this package type the below command in the terminal. install.packages("emojifont") Emoji characters in R Now we
3 min read
Create a Choropleth Map by using Plotly Package in R
There are plenty of packages in R that can be used to make maps, like leaflet, mapview, ggplot, spplot, plotly, etc. Each of the packages has its own advantages and disadvantages. But all of them have the common goal of making it easy to create maps and visualize geospatial data. In this article, we will learn how to create Choropleth maps using pl
4 min read
Variable importance plot using random forest package in R
As the name indicates Variable Importance Plot is a which used random forest package to plot the graph based on their accuracy and Gini Coefficient. If the accuracy of the variable is high then it's going to classify data accurately and Gini Coefficient is measured in terms of the homogeneity of nodes in a random forest. And the Mean Decrease Accur
2 min read
Data Visualization using ggvis Package in R
The ggvis is an interactive visualization package in R language that is based on the popular ggplot2 package. It allows you to create interactive plots and graphics that can be explored and manipulated by the user. ggvis supports a wide range of plot types including scatter plots, line charts, bar charts, histograms, and more. One of the key featur
15+ min read
How to Install a Package in R ?
R programming language doesn't come with all packages installed, and they need to be installed explicitly. In this article, we will discuss How to Install a Package in the R language. Installing packages in Rstudio. Method 1: Using application options 1. Open R studio. 2. Select tools 3. After selecting the tools you need to press install packages.
1 min read
How to disable Messages when Loading a Package in R ?
R console displays many messages and warning upon load of some packages and libraries. These messages display the associated packages' info, the warnings, the masked objects, which may be sometimes redundant and confusing for the user. Therefore, there are methods in R programming language to silence these notifications and ensure a smooth executio
2 min read
Cross validation in R without caret package
Cross-validation is a technique for evaluating the performance of a machine learning model by training it on a subset of the data and evaluating it on the remaining data. It is a useful method for estimating the performance of a model when you don't have a separate test set, or when you want to get a better estimate of the model's performance than
4 min read
Draw Multiple Overlaid Histograms with ggplot2 Package in R
In this article, we are going to see how to draw multiple overlaid histograms with the ggplot2 package in the R programming language. To draw multiple overlaid histograms with the ggplot2 package in R, you can use the geom_histogram() layer multiple times, each with different data and mapping specifications. Here is an example to create multiple hi
6 min read
How to create a scatter plot using lattice package in R?
In this article, we will discuss how to create the scatter plots using lattice package in R programming language. In R programming, the Lattice package is a data visualization library that consists of various functions to plot different kinds of plots. Using the lattice library we can able to plot various kinds of plots like scatter plot, Box plots
2 min read
How to create a boxplots using lattice package in R?
In this article, we will discuss how to create the boxplots using lattice package in R Programming language. In R programming, Lattice package is a data visualization library and consists of various functions to plot different kind of plots. Using lattice library we can able to plot different plots like scatter plot, Box plots, Histograms, 3D scatt
2 min read
Working With Different Versions of an R Package
R programming language offers different functionalities through different packages available in online repositories. The official repository CRAN keeps updating these packages and frequently publishes newer versions of packages. R's package system is set by default to overwrite the existing versions of packages so that the user always has the lates
8 min read
Union() & union_all() functions in Dplyr package in R
In this article, we will discuss union() and union_all() functions using Dplyr package in the R programming language. Dataframes in use: Example: R program to create data frames with college student data and display them C/C++ Code # create dataframe1 with college # 1 data data1=data.frame(id=c(1,2,3,4,5), name=c('sravan','ojaswi','bobby', 'gnanesh
2 min read
How to Remove a Column using Dplyr package in R
In this article, we are going to remove a column(s) in the R programming language using dplyr library. Dataset in use: Remove column using column nameHere we will use select() method to select and remove column by its name. Syntax: select(dataframe,-column_name) Here, dataframe is the input dataframe and column_name is the column in the dataframe t
3 min read
How to Remove a Column by name and index using Dplyr Package in R
In this article, we are going to remove columns by name and index in the R programming language using dplyr package. Dataset in use: Remove a column by using column name We can remove a column with select() method by its column name. Syntax: select(dataframe,-column_name) Where, dataframe is the input dataframe and column_name is the name of the co
2 min read
three90RightbarBannerImg