SYLLABUS
DATA ANALYTICS WITH PYTHON AND SQL
MODULE 1: INTRODUCTION
Course Overview
Introduction to Data Analysis
Data Analyst Job Description
Why Use Python
SETTING UP THE ENVIRONMENT
Jupyter Introduction
Installing Anaconda
Working with Notebook Files
Using Shortcuts in Jupyter
Handling Error Messages
Restarting the Kernel
PYTHON BASICS
Python Variables
Data Types:
a. Numbers
b. Boolean Values
c. Strings
Reassigning Values
Adding Comments
Indexing Elements
Indentation
Operators
Comparison Operators
Logical Operators
Arithmetic Operators
Bitwise Operators
Assignment Operators
Identity Operators
Membership Operators
LOCTECH IT TRAINING
www.loctechng.com
Logical and Identity Operators
Conditional Statements
IF, ELSE, and ELIF Statements
Boolean Values
MODULE 2: SEQUENCES
SEQUENCES
Lists
Using List Methods
List Slicing
Tuples
Dictionaries
ITERATION
For Loops
While Loops and Incrementing
Creating Lists with range()
Using Conditional Statements and Loops Together
Nested For-Loops
List Comprehensions
FUNDAMENTALS FOR CODING IN PYTHON
Object-Oriented Programming (OOP)
Modules, Packages, and the Python Standard Library
Importing Modules
Introduction to Using NumPy and pandas
Software Documentation
Python Documentation
MODULE 3: INTRODUCTION TO NUMPY AND PANDAS
INTRODUCTION TO NUMPY
Ndarrays
Indexing in NumPy
Assigning Values in NumPy
Supported Data Types in NumPy
Characteristics of NumPy Functions
Arrays vs Lists
INTRODUCTION TO PANDAS
Introduction to pandas Library
Pandas Series
Pandas DataFrames
Creating DataFrames from Scratch
Indexing in Pandas (Label-based vs Position-based)
WORKING WITH TEXT FILES
Importing *.csv Files with Pandas
Using index_col Parameter
Importing *.json Files
Working with Excel Data (*.xlsx)
Importing Data with pandas’ squeeze Parameter
MODULE 4: WORKING WITH TEXT DATA
Text Data and Argument Specifiers
Manipulating Python Strings
Using Various Python String Methods
String Accessors
.format() Method
MODULE 5: DATA CLEANING AND TRANSFORMATION
Combining Multiple Datasets
Performing Joins in Pandas
Grouping Data
Group by Categorical Columns and Date Index
Summary Statistics with groupby
Joining Data with Pandas
Inner Join, Left Join, Outer Join
Merging on Indexes
Merging Multiple DataFrames
EXPLORATORY DATA ANALYSIS (EDA)
Univariate and Bivariate Analysis
Pair Plots
Heatmaps
Count Plot
MODULE 6: MEASURES OF CENTRAL TENDENCY AND VARIABILITY
MEASURES OF CENTRAL TENDENCY
Mean
Mode
Median
Measuring Skewness
MEASURES OF VARIABILITY AND SPREAD
Range
Quartiles
Variance
Standard Deviation
MODULE 7: PROBABILITY AND DISTRIBUTIONS
PROBABILITY
What is Probability
Probability Formula
Calculating Probability
Conditional Probability
DISTRIBUTION
Frequency Distribution for Quantitative and Qualitative Data
Relative Frequency Distributions
Cumulative Frequency Distributions
CORRELATION
Correlation Relationships
Scatter Plots
Correlation Coefficient (r)
Computational Formula for r
MODULE 8: DATA VISUALIZATION WITH MATPLOTLIB
Introduction to Matplotlib
Customizing Plots
Small Multiples
Plotting Various Charts in Matplotlib
MODULE 9: DATA VISUALIZATION WITH SEABORN
Introduction to Seaborn
Scatter Plot and Count Plot with hue
Relational Plot and Subplots
Distribution Plots
MODULE 10: REGRESSION ANALYSIS
What is Regression?
Regression Line
Least Squares Regression Line
Standard Error of Estimate
MODULE 11: PROJECT I
Python Project
MODULE 12: SQL FOR DATA SCIENCE
Introduction to Databases
Installing the PostgreSQL Database
SQL Query Basics
Common SQL Commands (SELECT, FROM, WHERE)
Logical Operators in SQL
SQL Joins
Inner and Outer Joins
Right and Left Joins
Filtering and Sorting
Advanced Filtering using IN, OR, NOT
Grouping and Sorting with GROUP BY and ORDER BY
SQL Aggregations
Common Aggregations (COUNT, SUM, MIN, MAX)
CASE and DATE Functions, Handling NULL Values
Subqueries and Temp Tables
Using Subqueries to Run Multiple Queries
Using Temp Tables for Data Manipulation
SQL Data Cleaning and Transformation
Handling Missing and Duplicate Data
Data Standardization
Date and Time Manipulation
Conditional Data Transformation and Filtering
Data Aggregation, Joining, and Reshaping
MODULE 13: FINAL PROJECT II
Capstone Project