[go: up one dir, main page]

0% found this document useful (0 votes)
204 views114 pages

DS&RP (All Units Notes)

Uploaded by

miryalalavanya11
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
204 views114 pages

DS&RP (All Units Notes)

Uploaded by

miryalalavanya11
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 114

DEPARTMENT

OF
COMPUTER SCIENCE & ENGINEERING

COURSE MATERIAL

Subject: DATA SCIENCE & R PROGRAMMING


Subject Code: 20CS7PC25

Regulation: R20
Year/Sem: B.Tech IV/I

Prepared by:
P. Ratna Tejaswi
Asst Professor
TEEGALA KRISHNA REDDY ENGINEERING COLLEGE
IV Year B.Tech. CSE-I Sem L T P C
3 0 0 3
(20CS7PC25) DATA SCIENCE AND R PROGRAMMING
Course Objectives:
1. Understanding and being able to use basic programming concepts
2. Automate data analysis
3. Working collaboratively and openly on code
4. Knowing how to generate dynamic documents
5. Being able to use a continuous test-driven development approach
Course Outcomes: The student will be able to
1. Make use of the concepts of R programming language.
2. To use R to solve statistical problems.
3. To apply and able to implement functions to list and to access data frames.
4. Able to minimize and maximize functions using R.
5. To demonstrate advanced skills in data acquisition and management.
UNIT – I
Introduction: Overview of R, R data types and objects, reading and writing data, sub setting R Objects,
Essentials of the R Language, Installing R, Running R, Packages in R, Calculations, Complex numbers in
R, Rounding, Arithmetic, Modulo and integer quotients, Variable names and assignment, Operators,
Integers, Factors, Logical operations.

UNIT – II
Control structures, functions, scoping rules, dates and times, Introduction to Functions, preview of
Some Important R Data Structures, Vectors, Character Strings, Matrices, Lists, Data Frames, Classes
Vectors: Generating sequences, Vectors and subscripts, Extracting elements of a vector using subscripts,
Working with logical subscripts, Scalars, Vectors, Arrays, and Matrices, Adding and Deleting Vector
Elements, Obtaining the Length of a Vector, Matrices and Arrays as Vectors Vector Arithmetic and Logical
Operations, Vector Indexing, Common Vector Operations.

UNIT – III
Lists: Creating Lists, General List Operations, List Indexing Adding and Deleting List Elements, Getting
the Size of a List, Extended Example: Text Concordance Accessing List Components and Values Applying
Functions to Lists, DATA FRAMES, Creating Data Frames, Accessing Data Frames, Other Matrix-Like
Operations.

UNIT - IV
Factors And Tables, Factors and Levels, Common Functions Used with Factors, Working with Tables,
Matrix/Array-Like Operations on Tables, extracting a Sub table, Finding the Largest Cells in a Table, Math
Functions, Calculating a Probability, Cumulative Sums and Products, Minima and Maxima, Calculus,
Functions for Statistical Distributions.

UNIT - V
Data Management: Design Data Architecture and manage the data for analysis, understand various sources
of Data like Sensors/Signals/GPS etc. Data Management, Data Quality (noise, outliers, missing values,
duplicate data) and Data Processing & Processing.
TEXT BOOKS:
1. R Programming for Data Science by Roger D. Peng
2. The Art of R Programming by Prashanth singh, Vivek Mourya, Cengage Learning India.
3. Data Mining Concepts and Techniques, Han, Kamber, 3rd Edition, Morgan Kaufmann
Publishers.

You might also like