Lecture 2: More Data Structures: Outline

This document provides an overview of various data structures in R, including arrays, matrices, lists, and data frames. It discusses how to create, access, and manipulate these structures. Arrays allow the storage of data in multiple dimensions. Matrices are a specialized type of two-dimensional array. Lists allow the grouping of different data types together. Various functions like apply(), rowMeans(), and colMeans() are demonstrated for performing uniform operations on arrays and matrices.

Uploaded by

Bakari Hamisi

Lecture 2: More Data Structures

Statistical Computing, 36-350

Wednesday September 2, 2015

Outline
• Arrays
• Matrices
• Lists
• Data frames
• Structures of structures

Vector structures, starting with arrays

Many data structures in R are made by adding bells and whistles to vectors, i.e., they are “vector structures”
Most useful: arrays

x = c(7, 8, 10, 45)

x.arr = array(x, dim=c(2,2))
x.arr

## [,1] [,2]
## [1,] 7 10
## [2,] 8 45

dim says how many rows and columns; filled by columns

Can have 3-, 4-, . . . dimensional arrays; dim is a vector of arbitrary length
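A small sketch of an array with more than two dimensions. The name x.3d and the values 1:24 are chosen only for illustration; they are not from the lecture.

```r
# A 2 x 3 x 4 array, filled from the vector 1:24 (first index varies fastest)
x.3d = array(1:24, dim=c(2,3,4))
dim(x.3d)        # the dim attribute now has three entries
x.3d[2,3,4]      # one element, picked out by three indices (here the last one filled)
x.3d[,,1]        # omitting two indices gives the whole first 2 x 3 "slice"
```

The same indexing rules as for 2d arrays apply, just with one index per dimension.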

Some properties of our array:

dim(x.arr)

## [1] 2 2

is.vector(x.arr)

## [1] FALSE

is.array(x.arr)

## [1] TRUE

typeof(x.arr)

## [1] "double"

str(x.arr)

## num [1:2, 1:2] 7 8 10 45

attributes(x.arr)

## $dim
## [1] 2 2

typeof() returns the type of the array elements

str() gives the structure: here, a numeric array, with two dimensions, both indexed 1–2, and then the actual
numbers
Exercise: try all these with x

Accessing and indexing arrays

Can access a 2d array either by pairs of indices or by the underlying vector:

x.arr[1,2]

## [1] 10

x.arr[3]

## [1] 10

Omitting an index means “all of it”:

x.arr[c(1:2),2]

## [1] 10 45

x.arr[,2]

## [1] 10 45
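The two ways of indexing are linked by the column-by-column fill order. A minimal sketch of the correspondence, using base R's arrayInd() to convert a position in the underlying vector into a (row, column) pair (the variable k is introduced here only for illustration):

```r
x.arr = array(c(7, 8, 10, 45), dim=c(2,2))
# Position k in the underlying vector sits at
# row ((k-1) %% nrow) + 1 and column ((k-1) %/% nrow) + 1
k = 3
c((k-1) %% nrow(x.arr) + 1, (k-1) %/% nrow(x.arr) + 1)   # row 1, column 2
arrayInd(k, dim(x.arr))     # the same conversion, done by base R
x.arr[1,2] == x.arr[k]      # both expressions pick out the same element
```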

Functions on arrays
Many functions applied to a vector-structure like an array will just boil things down to the underlying vector:

which(x.arr > 9)

## [1] 3 4

This happens unless the function is set up to handle arrays specifically
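which() is one case where the array-aware behavior can be switched on: its arr.ind argument asks for array indices rather than vector positions. A short sketch (the name hits is illustrative):

```r
x.arr = array(c(7, 8, 10, 45), dim=c(2,2))
which(x.arr > 9)                        # positions in the underlying vector
hits = which(x.arr > 9, arr.ind=TRUE)   # one (row, col) pair per matching element
hits
```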

Many functions do preserve array structure:

y = -x
y.arr = array(y,dim=c(2,2))
y.arr + x.arr

## [,1] [,2]
## [1,] 0 0
## [2,] 0 0

Others specifically act on each row or column of the array separately:

rowSums(x.arr)

## [1] 17 53

(We will see a lot more of this idea soon)

Example: house prices in Pennsylvania

Census data for California and Pennsylvania on housing prices, by Census “tract”

calif_penn = read.csv("http://www.stat.cmu.edu/~cshalizi/uADA/13/hw/01/calif_penn_2011.csv")
penn = calif_penn[calif_penn[,"STATEFP"]==42,]
coefficients(lm(Median_house_value ~ Median_household_income, data=penn))

## (Intercept) Median_household_income
## -26206.564325 3.651256

Fit a simple linear model, predicting median house price from median household income

It turns out census tracts 24–425 are Allegheny county

Tract 24 has a median income of $14,719; actual median house value is $34,100; is that above or below what’s
predicted?

34100 < -26206.564 + 3.651*14719

## [1] FALSE

Tract 25 has income $48,102 and house price $155,900

155900 < -26206.564 + 3.651*48102

## [1] FALSE

What about tract 26?

We could just keep plugging in numbers like this, but that’s

• boring and repetitive

• error-prone (what if I forget to change the median income, or drop a minus sign from the intercept?)
• obscure if we come back to our work later (what are these numbers, again?)

Use variables and names

penn.coefs = coefficients(lm(Median_house_value ~ Median_household_income, data=penn))

penn.coefs

## (Intercept) Median_household_income
## -26206.564325 3.651256

allegheny.rows = 24:425
allegheny.medinc = penn[allegheny.rows,"Median_household_income"]
allegheny.values = penn[allegheny.rows,"Median_house_value"]
allegheny.fitted = penn.coefs["(Intercept)"] +
penn.coefs["Median_household_income"]*allegheny.medinc

plot(x=allegheny.fitted, y=allegheny.values,
xlab="Model-predicted median house values",
ylab="True median house values",
xlim=c(0,5e5), ylim=c(0,5e5))
abline(a=0, b=1, col="red")

[Figure: scatterplot of true median house values (vertical axis) against model-predicted median house values (horizontal axis), both axes running from 0 to 5e+05, with the red line y = x]
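The same idea can be tried without downloading the data. The sketch below is only an illustration: it types in the coefficients as printed above and the two tracts quoted earlier, and the names coefs.printed, tract.medinc, tract.values and tract.fitted are assumptions made for this example.

```r
# Coefficients copied from the printed output above
coefs.printed = c("(Intercept)"=-26206.564325, "Median_household_income"=3.651256)
# The two tracts quoted in the text, labelled by tract number
tract.medinc = c("24"=14719, "25"=48102)
tract.values = c("24"=34100, "25"=155900)
# One vectorized expression replaces the separate hand-typed comparisons
tract.fitted = coefs.printed["(Intercept)"] +
  coefs.printed["Median_household_income"]*tract.medinc
tract.values < tract.fitted    # is each actual value below its prediction?
```

Both comparisons come out FALSE, matching the two checks done by hand, and adding a third tract only means extending the two input vectors.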

Running example: resource allocation

Factory makes cars and trucks, using labor and steel

• a car takes 40 hours of labor and 1 ton of steel

• a truck takes 60 hours and 3 tons of steel
• resources: 1600 hours of labor and 70 tons of steel each week

Matrices
In R, a matrix is a specialization of a 2d array

factory = matrix(c(40,1,60,3), nrow=2)

factory

## [,1] [,2]
## [1,] 40 60
## [2,] 1 3

is.array(factory)

## [1] TRUE

is.matrix(factory)

## [1] TRUE

Could also specify ncol; to fill by rows, use byrow=TRUE
Elementwise operations with the usual arithmetic and comparison operators (e.g., factory/3)
Compare whole matrices with identical() or all.equal()
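A brief sketch of the difference between elementwise comparison, identical() and all.equal(). The names by.rows and rescaled are introduced here only for illustration.

```r
factory = matrix(c(40,1,60,3), nrow=2)
by.rows = matrix(c(40,60,1,3), nrow=2, byrow=TRUE)   # same matrix, entered row by row
factory == by.rows              # elementwise: a matrix of TRUE/FALSE values
identical(factory, by.rows)     # a single TRUE/FALSE for the whole comparison
rescaled = (factory / 3) * 3    # may differ from factory by floating-point rounding
all.equal(factory, rescaled)    # compares up to a small numerical tolerance
```

all.equal() is usually the safer choice after arithmetic, since identical() demands exact equality.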

Matrix multiplication
Has its own special operator, written %*%:

six.sevens = matrix(rep(7,6), ncol=3)

six.sevens

## [,1] [,2] [,3]

## [1,] 7 7 7
## [2,] 7 7 7

factory %*% six.sevens # [2x2] * [2x3]

## [,1] [,2] [,3]

## [1,] 700 700 700
## [2,] 28 28 28

(What happens if you try six.sevens %*% factory?)

Multiplying matrices and vectors

Numeric vectors can act like proper vectors:

output = c(10,20)
factory %*% output

## [,1]
## [1,] 1600
## [2,] 70

output %*% factory

## [,1] [,2]
## [1,] 420 660

(R silently casts the vector as either a 1-column or 1-row matrix, as appropriate)
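The casting can be seen by checking dimensions before and after multiplying. A minimal sketch:

```r
factory = matrix(c(40,1,60,3), nrow=2)
output = c(10,20)
dim(output)                 # NULL: a plain vector has no dim attribute
dim(factory %*% output)     # output was treated as a 2 x 1 column
dim(output %*% factory)     # output was treated as a 1 x 2 row
output %*% output           # two vectors: the inner product, returned as a 1 x 1 matrix
```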

Matrix operators
Transpose:

t(factory)

## [,1] [,2]
## [1,] 40 1
## [2,] 60 3

Determinant:

det(factory)

## [1] 60

The matrix diagonal

The diag() function can be used to extract the diagonal entries of a matrix:

diag(factory)

## [1] 40 3

It can also be used to change the diagonal:

diag(factory) = c(35,4)
factory

## [,1] [,2]
## [1,] 35 60
## [2,] 1 4

Re-set it for later:

diag(factory) = c(40,3)

Creating a diagonal or identity matrix

diag(c(3,4))

## [,1] [,2]
## [1,] 3 0
## [2,] 0 4

diag(2)

## [,1] [,2]
## [1,] 1 0
## [2,] 0 1

(How do you get a 1 x 1 matrix containing a single entry 2?)

Inverting a matrix

solve(factory)

## [,1] [,2]
## [1,] 0.05000000 -1.0000000
## [2,] -0.01666667 0.6666667

factory %*% solve(factory)

## [,1] [,2]
## [1,] 1 0
## [2,] 0 1

Why is it called “solve” anyway?

Solving the linear system Ax = b for x:

available = c(1600,70)
solve(factory,available)

## [1] 10 20

factory %*% solve(factory,available)

## [,1]
## [1,] 1600
## [2,] 70
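One way to see the connection between the two uses of solve(): inverting a matrix is the same as solving the system with the identity matrix on the right-hand side. A short sketch (the names direct and via.inverse are illustrative):

```r
factory = matrix(c(40,1,60,3), nrow=2)
available = c(1600,70)
direct = solve(factory, available)            # solves the system without forming the inverse
via.inverse = solve(factory) %*% available    # forms the inverse first, then multiplies
all.equal(direct, as.vector(via.inverse))     # the two routes agree up to rounding
all.equal(solve(factory), solve(factory, diag(2)))   # inverse = solution with b = identity
```

When only the solution x is needed, the two-argument form avoids computing the full inverse.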

Names in matrices
We can name either rows or columns or both, with rownames() and colnames()
These are just character vectors, and we use the same function to get and to set their values
Names help us understand what we’re working with
Names can be used to coordinate different objects

rownames(factory) = c("labor","steel")
colnames(factory) = c("cars","trucks")
factory

## cars trucks
## labor 40 60
## steel 1 3

available = c(1600,70)
names(available) = c("labor","steel")

output = c(20,10)
names(output) = c("trucks","cars")
factory %*% output # But we've got cars and trucks mixed up!

## [,1]
## labor 1400
## steel 50

factory %*% output[colnames(factory)]

## [,1]
## labor 1600
## steel 70

all(factory %*% output[colnames(factory)] <= available[rownames(factory)])

## [1] TRUE

Note that the last lines don’t have to change if we add motorcycles as an output, or rubber and glass as inputs
(abstraction again)
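A sketch of that claim, with motorcycles added as a third output. The motorcycle requirements and the production plan below are HYPOTHETICAL numbers made up for this illustration, as are the names factory.3, output.3 and needed.

```r
factory = matrix(c(40,1,60,3), nrow=2,
                 dimnames=list(c("labor","steel"), c("cars","trucks")))
available = c(labor=1600, steel=70)
# HYPOTHETICAL: a motorcycle takes 20 hours of labor and 0.5 tons of steel
factory.3 = cbind(factory, motorcycles=c(20, 0.5))
# HYPOTHETICAL plan, deliberately listed in a different order from the columns
output.3 = c(motorcycles=5, trucks=15, cars=10)
# Same two expressions as before, only the object names differ
needed = factory.3 %*% output.3[colnames(factory.3)]
all(needed <= available[rownames(factory.3)])
```

Because everything is lined up by name, neither the order of output.3 nor the number of products matters to the feasibility check.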

Doing the same thing to each row or column

Take the mean: rowMeans(), colMeans(); input is a matrix, output is a vector. Also rowSums(), colSums()
summary(): vector-style summary of each column

colMeans(factory)

## cars trucks
## 20.5 31.5

summary(factory)

## cars trucks
## Min. : 1.00 Min. : 3.00
## 1st Qu.:10.75 1st Qu.:17.25
## Median :20.50 Median :31.50
## Mean :20.50 Mean :31.50
## 3rd Qu.:30.25 3rd Qu.:45.75
## Max. :40.00 Max. :60.00

apply(), takes 3 arguments:

• the array or matrix,

• then 1 for rows and 2 for columns,
• then a name of the function to apply to each

rowMeans(factory)

## labor steel
## 50 2

apply(factory, 1, mean)

## labor steel
## 50 2
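A few more calls in the same pattern, as a sketch of how the second and third arguments change the result:

```r
factory = matrix(c(40,1,60,3), nrow=2,
                 dimnames=list(c("labor","steel"), c("cars","trucks")))
apply(factory, 2, mean)     # over columns: same numbers as colMeans(factory)
apply(factory, 2, max)      # largest entry in each column
apply(factory, 1, range)    # a function returning two values gives a matrix,
                            # with one column per row of factory
```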

(What would apply(factory, 1, sd) do?)

Lists
Sequence of values, not necessarily all of the same type

my.distribution = list("exponential", 7, FALSE)

my.distribution

## [[1]]
## [1] "exponential"
##
## [[2]]
## [1] 7
##
## [[3]]
## [1] FALSE

Most of what you can do with vectors you can also do with lists

Accessing pieces of lists

Can use [ ] as with vectors
Or use [[ ]], but only with a single index
[[ ]] drops names and structures, [ ] does not

my.distribution[2]

## [[1]]
## [1] 7

my.distribution[[2]]

## [1] 7

my.distribution[[2]]^2

## [1] 49

(What happens if you try my.distribution[2]^2?) (What happens if you try [[ ]] on a vector?)

Expanding and contracting lists

Add to lists with c() (also works with vectors):

my.distribution = c(my.distribution,7)
my.distribution

## [[1]]
## [1] "exponential"
##
## [[2]]
## [1] 7
##
## [[3]]
## [1] FALSE
##
## [[4]]
## [1] 7

Chop off the end of a list by setting the length to something smaller (also works with vectors):

length(my.distribution)

## [1] 4

length(my.distribution) = 3
my.distribution

## [[1]]
## [1] "exponential"
##
## [[2]]
## [1] 7
##
## [[3]]
## [1] FALSE

Pluck out all but one piece of a list (also works with vectors):

my.distribution[-2]

## [[1]]
## [1] "exponential"
##
## [[2]]
## [1] FALSE

(What happens if you try my.distribution[[-2]]?)

Naming list elements

We can name some or all of the elements of a list:

names(my.distribution) = c("family","mean","is.symmetric")
my.distribution

## $family
## [1] "exponential"
##
## $mean
## [1] 7
##
## $is.symmetric
## [1] FALSE

my.distribution[["family"]]

## [1] "exponential"

my.distribution["family"]

## $family
## [1] "exponential"

Lists have a special shortcut way of using names, with $:

my.distribution[["family"]]

## [1] "exponential"

my.distribution$family

## [1] "exponential"

Names in lists (continued)

Creating a list with names:

another.distribution = list(family="gaussian",
mean=7, sd=1, is.symmetric=TRUE)

Adding named elements:

my.distribution$was.estimated = FALSE
my.distribution[["last.updated"]] = "2015-09-01"

Removing a named list element, by assigning it the value NULL:

my.distribution$was.estimated = NULL

Key-value pairs
Lists give us a natural way to store and look up data by name, rather than by position
A really useful programming concept with many names: key-value pairs, dictionaries, associative arrays
If all our distributions have components named family, we can look that up by name, without caring where
it is (in what position it lies) in the list
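A minimal sketch of lookup by key. The two lists below (dist.a, dist.b) are illustrative and store their components in different orders on purpose.

```r
dist.a = list(family="exponential", mean=7, is.symmetric=FALSE)
dist.b = list(is.symmetric=TRUE, sd=1, family="gaussian", mean=7)
dist.a$family          # first position in dist.a
dist.b$family          # third position in dist.b, found by the same key
key = "mean"           # the key can also be held in a variable
dist.b[[key]]
```

Note that $ takes the name literally, so a key stored in a variable needs the [[ ]] form.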

Data frames
The classic data table, n rows for cases, p columns for variables
Lots of the really-statistical parts of R presume data frames
Not just a matrix because columns can have different types
Many matrix functions also work for data frames (e.g., rowSums(), summary(), apply())
(But no matrix multiplication with data frames, even if all columns are numeric!)
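If matrix multiplication is needed, one option is to convert explicitly with as.matrix(). A sketch, using a small all-numeric data frame (num.df and attempt are names made up for this example):

```r
num.df = data.frame(v1=c(35,8), v2=c(10,4))     # illustrative all-numeric data frame
attempt = try(num.df %*% c(1,1), silent=TRUE)   # multiply the data frame directly
inherits(attempt, "try-error")                  # TRUE: R refuses the multiplication
as.matrix(num.df) %*% c(1,1)                    # works after converting to a matrix
```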

a.matrix = matrix(c(35,8,10,4), nrow=2)

colnames(a.matrix) = c("v1","v2")
a.matrix

## v1 v2
## [1,] 35 10
## [2,] 8 4

a.matrix[,"v1"] # Try a.matrix$v1 and see what happens

## [1] 35 8

a.data.frame = data.frame(a.matrix,logicals=c(TRUE,FALSE))
a.data.frame

## v1 v2 logicals
## 1 35 10 TRUE
## 2 8 4 FALSE

a.data.frame$v1

## [1] 35 8

a.data.frame[,"v1"]

## [1] 35 8

a.data.frame[1,]

## v1 v2 logicals
## 1 35 10 TRUE

colMeans(a.data.frame)

## v1 v2 logicals
## 21.5 7.0 0.5

Adding rows and columns

We can add rows or columns to an array or data frame with rbind() and cbind(), but be careful about
forced type conversions

rbind(a.data.frame,list(v1=-3,v2=-5,logicals=TRUE))

## v1 v2 logicals
## 1 35 10 TRUE
## 2 8 4 FALSE
## 3 -3 -5 TRUE

rbind(a.data.frame,c(3,4,6))

## v1 v2 logicals
## 1 35 10 1
## 2 8 4 0
## 3 3 4 6
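The conversion can be checked directly by looking at the column's class before and after binding. A sketch that rebuilds the same data frame so it runs on its own (with.list and with.vector are illustrative names):

```r
a.data.frame = data.frame(v1=c(35,8), v2=c(10,4), logicals=c(TRUE,FALSE))
class(a.data.frame$logicals)       # "logical"
with.list = rbind(a.data.frame, list(v1=-3, v2=-5, logicals=TRUE))
class(with.list$logicals)          # still "logical": the new entry was a logical
with.vector = rbind(a.data.frame, c(3,4,6))
class(with.vector$logicals)        # "numeric": the 6 forced the whole column to change
```

Passing the new row as a list keeps each entry's own type, which is why the first rbind() is the safer pattern.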

Structures of structures
So far, every list element has been a single data value
List elements can be other data structures, e.g., vectors and matrices:

plan = list(factory=factory, available=available, output=output)

plan$output

## trucks cars
## 20 10

Internally, a data frame is basically a list of vectors (all of the same length)
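A quick way to check the list-of-vectors view, sketched with the same small data frame rebuilt so the example is self-contained:

```r
a.data.frame = data.frame(v1=c(35,8), v2=c(10,4), logicals=c(TRUE,FALSE))
is.list(a.data.frame)          # TRUE: a data frame is stored as a list
length(a.data.frame)           # number of list elements, i.e., columns
a.data.frame[["v2"]]           # list-style access pulls out one column vector
sapply(a.data.frame, class)    # each column keeps its own type
```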

List elements can even be other lists

which may contain other data structures
including other lists
which may contain other data structures . . .
This recursion lets us build arbitrarily complicated data structures from the basic ones
Most complicated objects are (usually) lists of data structures
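A sketch of a list nested three levels deep, built from the factory example. The names nested.plan, resources, notes and period are illustrative only.

```r
factory = matrix(c(40,1,60,3), nrow=2,
                 dimnames=list(c("labor","steel"), c("cars","trucks")))
nested.plan = list(factory=factory,
                   resources=list(available=c(labor=1600, steel=70),
                                  notes=list(period="week")))
nested.plan$resources$available["steel"]             # list -> list -> named vector
nested.plan[["resources"]][["notes"]][["period"]]    # list -> list -> list
str(nested.plan)                                     # compact view of the whole structure
```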

Example: eigen-decomposition
eigen() finds eigenvalues and eigenvectors of a matrix
Returns a list of a vector (the eigenvalues) and a matrix (the eigenvectors)

eigen(factory)

## $values
## [1] 41.556171 1.443829
##
## $vectors
## [,1] [,2]
## [1,] 0.99966383 -0.8412758
## [2,] 0.02592747 0.5406062

class(eigen(factory))

## [1] "list"

With complicated objects, you can access parts of parts (of parts . . . )

factory %*% eigen(factory)$vectors[,2]

## [,1]
## labor -1.2146583
## steel 0.7805429

eigen(factory)$values[2] * eigen(factory)$vectors[,2]

## [1] -1.2146583 0.7805429

eigen(factory)$values[2]

## [1] 1.443829

eigen(factory)[[1]][[2]] # NOT [[1,2]]

## [1] 1.443829
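Calling eigen() repeatedly recomputes the decomposition each time. A sketch that stores the result once and checks both eigenvectors together (factory.eigen, V and lambda are names introduced for this example):

```r
factory = matrix(c(40,1,60,3), nrow=2)
factory.eigen = eigen(factory)     # compute once, then reuse the pieces
V = factory.eigen$vectors          # eigenvectors, one per column
lambda = factory.eigen$values      # matching eigenvalues
# factory %*% V should equal V with each column scaled by its eigenvalue
all.equal(factory %*% V, V %*% diag(lambda))
```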

Summary
• Arrays add multi-dimensional structure to vectors
• Matrices act like you’d hope they would
• Lists let us combine different types of data
• Data frames are hybrids of matrices and lists, allowing each column to have a different basic type
• Recursion lets us build complicated data structures out of simpler ones
