0% found this document useful (0 votes)

21 views13 pages

R

R is a programming language often used for statistical analysis and visualization. Key features include: plotting functions like plot() to create graphs; vectors to store lists of data; matrices to organize data into rows and columns; and data frames to store different data types in a table-like structure. Common tasks involve importing and exploring data, performing calculations and statistical tests, and visualizing results using graphs.

Uploaded by

Nermine Limeme

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views13 pages

R

Uploaded by

Nermine Limeme

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 13

*R is a programming language.

*R is often used for statistical computing and graphical presentation to

analyze and visualize data.
*Creating a plot from 1 to 10 on both x and y: plot(1:10) .
*In R we comment with #.
*This how we assign a variable in R: carName <- “Volvo”.
*This is how we assign a value for a variable in R: maxSpeed <- 120.
*Combining the text "Hello" with the txt variable, to output "Hello
World!" :
txt <- "World!"
paste("Hello", txt)
*Output integers from 1 to 10 : for (x in 1:10) { print(x) } .
*Basic data types in R can be divided into the following types:
1)Numeric - (10.5, 55, 787).
2)Integer - (1L, 55L, 100L, where the letter "L" declares this as an
integer) .
3)Complex - (9 + 3i, where "i" is the imaginary part) .
4)Character - (string) - ("k", "R is exciting", "FALSE", "11.5") .
5)Logical - (boolean) - (TRUE or FALSE) .
*The class() function prints the type of the variables assigned.

*We can convert from one type to another with the following functions:
 as.numeric()
 as.integer()
 as.complex()
*min() and max() are built-in mathematical functions in R.
*sqrt() function : is the square root function.
*abs() is the absolute value function.
*The ceiling() function rounds a number upwards to its nearest integer.
*The floor() function rounds a number downwards to its nearest integer,
and returns the result.
*cat() function is used to line break a paragraph.
*The nchar() function used to provide the length of the character.
*The grepl() function is used to check if a character is present inside
another string.
*Operators: + addition
- substraction
* multiplication
/ division
^ exponent
%% modulus (remainder from division)
%/% integer division
*R comparison Operators:
== Equal
!= Not equal
> Greater than
< Less than
>= Greater than or equal to
<= Less than or equal to
* & Element-wise Logical AND operator. It returns TRUE if both
elements are TRUE.
&& Logical AND operator - Returns TRUE if both statements are
TRUE.
| Elementwise- Logical OR operator. It returns TRUE if one of the
statement is TRUE
|| Logical OR operator. It returns TRUE if one of the statement is
TRUE.
! Logical NOT - returns FALSE if statement is TRUE.
*R miscellaneous operators:
: Creates a series of numbers in a sequence x <- 1:10
%in% Find out if an element belongs to a vector x %in% y
%*% Matrix Multiplication x <- Matrix1 %*% Matrix2
*The & symbol (and) is a logical operator, and is used to combine
conditional statements.
*The | symbol (or) is a logical operator, and is used to combine
conditional statements.
*R has two loop commands: while loops / for loops.
*To create a function, use the function() keyword.
*To create a function, use the function() keyword.
*There are two ways to create a nested function:
 Call a function within another function.
 Write a function within a function.
*R also accepts function recursion, which means a defined function can
call itself.
*Global variables can be used by everyone, both inside of functions and
outside.
*A vector is simply a list of items that are of the same type. To combine
the list of items to a vector, use the c() function and separate the items
by a comma.
*To create a vector with numerical values in a sequence, use
the : operator.
*To find out how many items a vector has, use the length() function.
*You can access the vector items by referring to its index number inside
brackets []. The first item has index 1, the second item has index 2, and
so on.
* To repeat vectors, use the rep() function :
repeat_each <- rep(c(1,2,3), each = 3)

repeat_each
repeat_indepent <- rep(c(1,2,3), times = c(5,2,1))

repeat_indepent
* To make bigger or smaller steps in a sequence, use the seq() function :
numbers <- seq(from = 0, to = 100, by = 20)

numbers
* To create a list, use the list() function.
* To find out if a specified item is present in a list, use the %in
% operator.
* To add an item to the end of the list, use the append() function.
*Creation of a matrix using the matrix function :
thismatrix <- matrix(c(1,2,3,4,5,6), nrow = 3, ncol = 2)
*You can access the items by using [ ] brackets. The first number "1" in
the bracket specifies the row-position, while the second number "2"
specifies the column-position.
*The whole row can be accessed if you specify a comma after the
number in the bracket : thismatrix[2,].
*The whole column can be accessed if you specify a comma before the
number in the bracket : thismatrix[,2].
*More than one row can be accessed if you use the c() function :
thismatrix[c(1,2),].
*More than one column can be accessed if you use the c() function :
thismatrix[, c(1,2)].
*Use the cbind() function to add additional columns in a Matrix.
*Use the rbind() function to add additional rows in a Matrix.
*Use the dim() function to find the number of rows and columns in a
Matrix.
*Use the array() function to create an array.
*Data Frames are data displayed in a format as a table.
*Data Frames can have different types of data inside it. While the first
column can be character, the second and third can be numeric or logical.
However, each column should have the same type of data.
*Use the data.frame() function to create a data frame.
*Exp: Data_Frame <- data.frame (
Training = c("Strength", "Stamina", "Other"),
Pulse = c(100, 150, 120),
Duration = c(60, 30, 45)
)
*Use the summary() function to summarize the data from a Data Frame.
For the previous example we’ll get the following output :
Training Pulse Duration
Other :1 Min. :100.0 Min. :30.0
Stamina :1 1st Qu.:110.0 1st Qu.:37.5
Strength:1 Median :120.0 Median :45.0
Mean :123.3 Mean :45.0
3rd Qu.:135.0 3rd Qu.:52.5
Max. :150.0 Max. :60.0
*We can use single brackets [ ], double brackets [[ ]] or $ to access
columns from a data frame:
Data_Frame[1]

Data_Frame[["Training"]]

Data_Frame$Training
*Factors are used to categorize data. Examples of factors are:
 Demography: Male/Female
 Music: Rock, Pop, Classic, Jazz
 Training: Strength, Stamina
*To create a factor, use the factor() function and add a vector as
argument.
*The plot() function is used to draw points (markers) in a diagram.
The function takes parameters for specifying points in the diagram.
Parameter 1 specifies points on the x-axis.
Parameter 2 specifies points on the y-axis.
*To draw more points we use vectors :

 plot(c(1, 8), c(3, 10))

 plot(c(1, 2, 3, 4, 5), c(3, 7, 8, 9, 12))
 x <- c(1, 2, 3, 4, 5)
y <- c(3, 7, 8, 9, 12)

plot(x, y)
*If you want to draw dots in a sequence, on both the x-axis and the y-
axis, use the : operator :
plot(1:10)
*Use col="color" to add a color to the points :
plot(1:10, col="red")
*Use pch with a value from 0 to 25 to change the point shape format :
plot(1:10, pch=25)

*To create a line, use the plot() function and add the type parameter
with a value of "l" :

 plot(1:10, type="l")
*To change the width of the line, use the lwd parameter (1 is default,
while 0.5 means 50% smaller, and 2 means 100% larger) :

 plot(1:10, type="l", lwd=2)

*The line is solid by default. Use the lty parameter with a value from 0
to 6 to specify the line format.
For example, lty=3 will display a dotted line instead of a solid line:

 plot(1:10, type="l", lwd=5, lty=3)

Available parameter values for lty:
 0 removes the line.
 1 displays a solid line.
 2 displays a dashed line.
 3 displays a dotted line.
 4 displays a "dot dashed" line.
 5 displays a "long dashed" line.
 6 displays a "two dashed" line.
*Use the barplot() function to draw a vertical bar chart.
*Example:
# x-axis values
x <- c("A", "B", "C", "D")

# y-axis values
y <- c(2, 4, 6, 8)

barplot(y, names.arg = x)
 The x variable represents values in the x-axis (A,B,C,D)
 The y variable represents values in the y-axis (2,4,6,8)
 Then we use the barplot() function to create a bar chart of the
values
 names.arg defines the names of each observation in the x-axis
*If you want the bars to be displayed horizontally instead of vertically,
use horiz=TRUE
barplot(y, names.arg = x, horiz = TRUE)
*To sort the values, use the sort() function.
*we can use the summary() function to get a statistical summary of the
data.
*The summary() function returns six statistical numbers for each
variable:
 Min
 First quantile (percentile)
 Median
 Mean
 Third quantile (percentile)
 Max
*The min() and max() functions can be used to find the lowest or
highest value in a set.
*We can use the which.max() and which.min() functions to find the
index position of the max and min value in the table.
*The mean() function in R is used to find the mean in a dataset.
*the median() function is used to find the middle value in a dataset.
*In R we don’t have a function to find the mode but we can use the
following code :
Data_x <- y

names(sort(-table(Data_x$y)))[1]
*R is not sensitive to space.
*R is case sensitive.
*T is True and F is False.
*\ This is a back slash and it’s made to avoid a special character.
*We can create a factor with the function as.factor() .
*We create a vector composed from a factors as below:
Exp : fact1 <- as.factor(c(“male”,”female”))
*A list in R can contain anything, it can contain a list of lists or
characters, a list o vectors or whatever.
*In a data frame we can put vectors with different number of rows.
*Indexes in R starts with 1.
Facto Mine R:
install.packages("FactoMineR")
install.packages("factoextra")
library(FactoMineR)
library(factoextra)
# Load a built-in dataset (iris dataset in this example)
data(iris)
# Perform PCA
res_pca <- PCA (iris[, 1:4], graph = FALSE)
# Print summary
summary(res_pca)
K-means clustering :
# Generate some example data
set.seed(123) data <- data.frame( x = rnorm(100), y = rnorm(100) )
# Perform K-means clustering with K = 3
k <- 3 kmeans_result <- kmeans(data, centers = k)
# View the clustering
results print(kmeans_result)
# Plot the data with cluster assignments
plot(data, col = kmeans_result$cluster, main = "K-means Clustering")
points (kmeans_result$centers, col = 1:k, pch = 8, cex = 2)

Hierarchical Clustering:
# Generate some example data
set.seed(123) data <- data.frame( x = rnorm(100), y = rnorm(100) ) #
Compute hierarchical clustering using Euclidean distance and complete
linkage
hc_result <- hclust(dist(data), method = "complete")
# Plot the dendrogram
plot(hc_result, main = "Hierarchical Clustering Dendrogram")
# Cut the dendrogram to create clusters
k <- 3 cluster_cut <- cutree(hc_result, k)
# Plot the data with cluster assignments
plot(data, col = cluster_cut, main = "Hierarchical Clustering")
DBSCAN:
install.packages("dbscan")
library(dbscan)
# Generate some example data
set.seed(123)
data <- data.frame( x = c(rnorm(50, mean = 5), rnorm(50, mean = 10)),
y = c(rnorm(50), rnorm(50, mean = 5)) )
# Perform DBSCAN clustering
dbscan_result <- dbscan(data, eps = 2, MinPts = 5)
# Plot the data with cluster assignments
plot(data, col = dbscan_result$cluster + 1, pch = 16, main = "DBSCAN
Clustering")
spectral clustering:
install.packages("kernlab")library(kernlab)
# Generate some example data
set.seed(123) data <- matrix(rnorm(200), ncol = 2)
# Perform spectral clustering
num_clusters <- 3
spectral_result <- specc(data, centers = num_clusters)
# Plot the data with cluster assignments
plot(data, col = spectral_result, pch = 16, main = "Spectral Clustering")
#########################################
X=iris[,1:4]
D=dist(X,method = "euclidean")
h1=hclust(d = D,method = "complete")
plot(h1)
p1=cutree(h1,3)
plot(X[,1:2],col=p1,pch=p1)

############## with Manhattan distance ############

D=dist(X,method= "manhattan")
h2=hclust(d = D,method = "complete")
p2=cutree(h2,3)
plot(X[,1:2],col=p2,pch=p2)

######## with single link criteria ##################

D=dist(X,method = "euclidean")
h3=hclust(d = D,method = "single")
p3=cutree(h3,3)
plot(X[,1:2],col=p3,pch=p3)

########### NbClust ########################

NbClust(iris[,1:4],method="kmeans")

BDA Section 3
No ratings yet
BDA Section 3
33 pages
Introduction To R
No ratings yet
Introduction To R
91 pages
cours
No ratings yet
cours
33 pages
Module 3 R Data Science
No ratings yet
Module 3 R Data Science
158 pages
First Course On R
No ratings yet
First Course On R
26 pages
R_Vectors
No ratings yet
R_Vectors
22 pages
R Comandos
No ratings yet
R Comandos
13 pages
R Practicals
No ratings yet
R Practicals
53 pages
Introdution to R - Network Analysis_ Practical 1 - Sacha Epskamp - University of Amsterdam, 2013
No ratings yet
Introdution to R - Network Analysis_ Practical 1 - Sacha Epskamp - University of Amsterdam, 2013
34 pages
R-pres
No ratings yet
R-pres
53 pages
Network Analysis and Visualization With R and Igraph
No ratings yet
Network Analysis and Visualization With R and Igraph
62 pages
DA_Lab_Week-2
No ratings yet
DA_Lab_Week-2
22 pages
Practical 1_Data Frame Manipulation_072502
No ratings yet
Practical 1_Data Frame Manipulation_072502
16 pages
Da Session 4
No ratings yet
Da Session 4
75 pages
STTN 225 R Summary
No ratings yet
STTN 225 R Summary
18 pages
Data_analysis_with_R _24
No ratings yet
Data_analysis_with_R _24
47 pages
R PPT
No ratings yet
R PPT
63 pages
R WorkSamples
No ratings yet
R WorkSamples
44 pages
Introduction To Data Science With R Programming
No ratings yet
Introduction To Data Science With R Programming
91 pages
Unit 2 Notes - Data Analysis Using r
No ratings yet
Unit 2 Notes - Data Analysis Using r
19 pages
r file code
No ratings yet
r file code
16 pages
R study material I
No ratings yet
R study material I
8 pages
Unit 4
No ratings yet
Unit 4
27 pages
Bdo Co1 Session 4
No ratings yet
Bdo Co1 Session 4
43 pages
R Programming
No ratings yet
R Programming
50 pages
R Programming Notes
No ratings yet
R Programming Notes
23 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Chapter 1 Introduction To R
No ratings yet
Chapter 1 Introduction To R
33 pages
Untitled
No ratings yet
Untitled
59 pages
Rtips. Revival 2012!: Paul E. Johnson June 8, 2012
No ratings yet
Rtips. Revival 2012!: Paul E. Johnson June 8, 2012
72 pages
R Session A
No ratings yet
R Session A
107 pages
An Introduction To R: Biostatistics 615/815
No ratings yet
An Introduction To R: Biostatistics 615/815
59 pages
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
No ratings yet
Introduction To R: Nihan Acar-Denizli, Pau Fonseca
50 pages
basics of R
No ratings yet
basics of R
12 pages
data anlytics using r notes
No ratings yet
data anlytics using r notes
14 pages
R Software - Notes
No ratings yet
R Software - Notes
18 pages
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
No ratings yet
Muthayammal College of Arts and Science Rasipuram: Assignment No - 1
10 pages
Data Analysis Using R and Vectors
No ratings yet
Data Analysis Using R and Vectors
35 pages
STATS LAB Basics of R PDF
No ratings yet
STATS LAB Basics of R PDF
77 pages
Programming With R: Lecture #4
No ratings yet
Programming With R: Lecture #4
34 pages
R Reference Card
No ratings yet
R Reference Card
6 pages
R Programming
No ratings yet
R Programming
22 pages
NN
No ratings yet
NN
1 page
Part I: Introductory Materials: Introduction To R
No ratings yet
Part I: Introductory Materials: Introduction To R
25 pages
Importing The Files
No ratings yet
Importing The Files
14 pages
Session Set Working Directory Choose Directlry
No ratings yet
Session Set Working Directory Choose Directlry
17 pages
Rintro
No ratings yet
Rintro
14 pages
Introduction To R PDF
No ratings yet
Introduction To R PDF
56 pages
Introduction To R: 1 Getting Started
No ratings yet
Introduction To R: 1 Getting Started
14 pages
R Studio
No ratings yet
R Studio
8 pages
The Vertical Plane Webster Ken download
No ratings yet
The Vertical Plane Webster Ken download
32 pages
R Reference Card
No ratings yet
R Reference Card
6 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
R Short Tutorial
No ratings yet
R Short Tutorial
5 pages
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
4/5 (2)
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
An R Tutorial Starting Out
No ratings yet
An R Tutorial Starting Out
9 pages
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
R-Programming Notes
100% (1)
R-Programming Notes
33 pages
Workplace Safety & Health Guidelines for the Private Security Industry
No ratings yet
Workplace Safety & Health Guidelines for the Private Security Industry
36 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
class12PRACTICAL FILE WriteUp
No ratings yet
class12PRACTICAL FILE WriteUp
35 pages
Debremarkos University Burie Campus College of Agriculture and Natural Resources Department of Animal Science Under Graduate Program
100% (2)
Debremarkos University Burie Campus College of Agriculture and Natural Resources Department of Animal Science Under Graduate Program
38 pages
Nook Jack Catalog
No ratings yet
Nook Jack Catalog
212 pages
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
The Tamil Nadu Agricultural Lands Record of Tenancy Rights Act
No ratings yet
The Tamil Nadu Agricultural Lands Record of Tenancy Rights Act
14 pages
Density Based
No ratings yet
Density Based
52 pages
Network (CODASYL) Data Model
100% (3)
Network (CODASYL) Data Model
47 pages
Chapter 6 - Managing Social Responsibility & Ethics
No ratings yet
Chapter 6 - Managing Social Responsibility & Ethics
26 pages
Chapter 4 Markov Chain
No ratings yet
Chapter 4 Markov Chain
39 pages
Reporting Skills Booklet 1
No ratings yet
Reporting Skills Booklet 1
30 pages
Manual Pioneer Dehp645r
No ratings yet
Manual Pioneer Dehp645r
98 pages
Group 2 - MAM
No ratings yet
Group 2 - MAM
19 pages
SSRN Id3558469
No ratings yet
SSRN Id3558469
11 pages
SLG 16.3 Probability, Part II - Total Probability
No ratings yet
SLG 16.3 Probability, Part II - Total Probability
3 pages
Question Bank Fuel
No ratings yet
Question Bank Fuel
3 pages
Investment Strategy
No ratings yet
Investment Strategy
13 pages
Summary For Moubarkis Online Session About K Means
No ratings yet
Summary For Moubarkis Online Session About K Means
12 pages
B.E. Marine 2017 Syllabus-92-95
No ratings yet
B.E. Marine 2017 Syllabus-92-95
4 pages
0000 - Eb Approach Inherent and Residual Risk File
No ratings yet
0000 - Eb Approach Inherent and Residual Risk File
10 pages
What Are Stock Market Indices?
100% (1)
What Are Stock Market Indices?
11 pages
British Colonies
No ratings yet
British Colonies
2 pages
Teamwork in Construction Activities
No ratings yet
Teamwork in Construction Activities
14 pages
Principles of Retentive Pins Placement in Amalgam Restoration in Dentistry
No ratings yet
Principles of Retentive Pins Placement in Amalgam Restoration in Dentistry
8 pages
Tutorial 2 Solutions
No ratings yet
Tutorial 2 Solutions
7 pages
Coord Milestones
No ratings yet
Coord Milestones
11 pages
Partial Replacement of Lateritic Soil With Crushed Rock Sand Stone Dust in Compressed Earth Brick Production
No ratings yet
Partial Replacement of Lateritic Soil With Crushed Rock Sand Stone Dust in Compressed Earth Brick Production
4 pages
DriveSafe DSD Waiver
100% (1)
DriveSafe DSD Waiver
1 page
Question and Answer PCA
No ratings yet
Question and Answer PCA
4 pages
Chapter 6
No ratings yet
Chapter 6
5 pages
Project Management
No ratings yet
Project Management
3 pages
Negotiation Roleplay V2
No ratings yet
Negotiation Roleplay V2
3 pages
Copy GGG
No ratings yet
Copy GGG
3 pages
DODGE Type C Pillow Blocks, Flange Bearings, Hanger Bearings and Units
No ratings yet
DODGE Type C Pillow Blocks, Flange Bearings, Hanger Bearings and Units
4 pages
Econometrics Summary
No ratings yet
Econometrics Summary
3 pages
Probability Sampling - Formulas Sheet
No ratings yet
Probability Sampling - Formulas Sheet
3 pages
Tuto 1
No ratings yet
Tuto 1
2 pages
Macabacus Quick Start Guide
No ratings yet
Macabacus Quick Start Guide
2 pages
Employee Evaluation: Responsibilities
No ratings yet
Employee Evaluation: Responsibilities
2 pages
COURSE UNIT - CU5 Nurses Role in Disaster Part 2 - Copy-2
No ratings yet
COURSE UNIT - CU5 Nurses Role in Disaster Part 2 - Copy-2
4 pages
Revision
No ratings yet
Revision
2 pages
Data Sheet: HLMP-HD61, HLMP-HM61 and HLMP-HB61
No ratings yet
Data Sheet: HLMP-HD61, HLMP-HM61 and HLMP-HB61
12 pages
Group F: Marketing Objective
No ratings yet
Group F: Marketing Objective
4 pages
Subject Area Coordinator Duties and Responsibilities
89% (35)
Subject Area Coordinator Duties and Responsibilities
2 pages
Pneu New E
No ratings yet
Pneu New E
4 pages
CCG - Sanger Sequencing Mlpa Request Form
No ratings yet
CCG - Sanger Sequencing Mlpa Request Form
1 page
TVM Additional Q
0% (1)
TVM Additional Q
1 page

R

Uploaded by

R

Uploaded by

*R is a programming language.

*R is often used for statistical computing and graphical presentation to

 plot(c(1, 8), c(3, 10))

 plot(1:10, type="l", lwd=2)

 plot(1:10, type="l", lwd=5, lty=3)

############## with Manhattan distance ############

######## with single link criteria ##################

########### NbClust ########################

You might also like