01-Numpy & Pandas

The document provides an overview of NumPy and Pandas, focusing on their fundamental concepts and functionalities. It covers topics such as creating and manipulating arrays in NumPy, as well as data structures, data cleaning, and analysis techniques in Pandas. Additionally, it discusses operations like merging, concatenating, and filtering data in Pandas.

Uploaded by

Maha Mohy

Data Analysis & Visualization

NumPy Basics
NumPy Agenda
• NumPy Intro
• Creating Arrays
• NumPy Array Indexing
• NumPy Array Slicing
• NumPy Data Types
• NumPy Copy vs View
• NumPy Array Shape
• NumPy Array Reshape
• NumPy Array Join
• NumPy Array Sort
• NumPy Array Filter
Creating NumPy Arrays

array([[[1, 2, 3],
        [1, 2, 3],
        [1, 2, 3]],

       [[1, 2, 3],
        [1, 2, 3],
        [1, 2, 3]],

       [[1, 2, 3],
        [1, 2, 3],
        [1, 2, 3]]])
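One way an array like the one shown above could be built is sketched here. It assumes NumPy is imported as np; the names row and arr are illustrative and not from the slides.

```python
import numpy as np

# Build a 3x3x3 array by nesting the row [1, 2, 3] three levels deep.
row = [1, 2, 3]
arr = np.array([[row, row, row],
                [row, row, row],
                [row, row, row]])

print(arr.ndim)   # number of dimensions
print(arr.shape)  # size along each dimension
```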
NumPy Arrays Shape
NumPy Arrays Transpose
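A minimal sketch of shape and transpose on a small 2-D array; the values and the names m and t are made up for illustration.

```python
import numpy as np

m = np.array([[1, 2, 3],
              [4, 5, 6]])

print(m.shape)   # rows first, then columns

t = m.T          # transpose swaps the two axes
print(t.shape)
```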
NumPy Array Indexing
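The slide content for indexing did not survive extraction, so the following is only a sketch of common indexing and slicing forms on an invented 3x4 array.

```python
import numpy as np

a = np.array([[1, 2, 3, 4],
              [5, 6, 7, 8],
              [9, 10, 11, 12]])

first_row = a[0]        # whole first row
element = a[1, 2]       # row 1, column 2
last = a[-1, -1]        # negative indices count from the end
block = a[0:2, 1:3]     # rows 0-1, columns 1-2 (end index excluded)
evens = a[a % 2 == 0]   # boolean mask keeps the matching elements
```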
Basic array operations
Broadcasting
Basic array operations
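A short sketch of element-wise operations and broadcasting, assuming small invented arrays. Broadcasting lets a scalar or a lower-dimensional array be combined with a larger one without an explicit loop.

```python
import numpy as np

data = np.array([1.0, 2.0])
ones = np.ones(2)

total = data + ones      # element-wise addition
product = data * data    # element-wise multiplication

# Broadcasting: the scalar is applied to every element.
scaled = data * 1.6

# Broadcasting a 1-D row across each row of a 2-D array.
grid = np.array([[1, 2, 3],
                 [4, 5, 6]])
shifted = grid + np.array([10, 20, 30])
```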
More useful array operations
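Which operations the slides showed here is not recoverable; a plausible sketch is the set of reduction methods (sum, min, max, mean) and their axis argument, applied to an invented array.

```python
import numpy as np

a = np.array([[1, 2],
              [5, 3],
              [4, 6]])

total = a.sum()              # sum of all elements
largest = a.max()            # maximum over the whole array
col_min = a.min(axis=0)      # minimum of each column
row_max = a.max(axis=1)      # maximum of each row
average = a.mean()           # arithmetic mean of all elements
```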
Creating NumPy Array
NumPy Copy vs View
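The difference between a copy and a view can be sketched as follows: a view shares the original array's data, while a copy owns its own. Variable names are illustrative.

```python
import numpy as np

original = np.array([1, 2, 3, 4, 5])

view = original.view()   # shares the same data buffer
copy = original.copy()   # owns a separate data buffer

original[0] = 42

print(view[0])                 # the change is visible through the view
print(copy[0])                 # the copy is unaffected
print(view.base is original)   # a view records the array it is based on
print(copy.base is None)       # a copy has no base
```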
NumPy Array Reshape
flattening multidimensional arrays
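A minimal sketch of reshaping and flattening, assuming a small array built with np.arange. It also shows how flatten (always a copy) differs from ravel (a view when possible).

```python
import numpy as np

a = np.arange(6)          # six elements: 0..5
b = a.reshape(3, 2)       # 3 rows, 2 columns
c = a.reshape(2, -1)      # -1 lets NumPy infer the remaining dimension

flat_copy = b.flatten()   # always returns a copy
flat_view = b.ravel()     # returns a view here, since b is contiguous

flat_view[0] = 99         # writes through to b (and to a)
```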
Working with mathematical formulas
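The formula used on these slides is not recoverable from the text. As an assumed example, the mean squared error, MSE = (1/n) * sum((prediction_i - label_i)^2), can be written almost verbatim with NumPy; the data below is invented.

```python
import numpy as np

predictions = np.array([1.0, 1.0, 1.0])
labels = np.array([1.0, 2.0, 3.0])

# MSE = (1/n) * sum((predictions - labels)^2)
n = predictions.size
mse = (1 / n) * np.sum(np.square(predictions - labels))
print(mse)
```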
Pandas Basics
Pandas
Pandas Agenda
• Pandas Intro

• Pandas Series

• Pandas DataFrames

• Pandas Read CSV

• Pandas Analyzing Data

• Cleaning Data

• Cleaning Empty Cells

• Cleaning Wrong Format

• Cleaning Wrong Data

• Removing Duplicates

• Pandas Correlations

• Pandas Plotting

• Merging, joining, and concatenating

• Operations

• Apply function

• Data input and output

Pandas Data Structure
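The agenda lists Series as one of the basic Pandas data structures. A minimal sketch of a labeled one-dimensional Series, with invented values and labels:

```python
import pandas as pd

s = pd.Series([10, 20, 30], index=["a", "b", "c"])

by_label = s["b"]         # access by index label
by_position = s.iloc[0]   # access by integer position
doubled = s * 2           # arithmetic is element-wise
```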
Reading files
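A sketch of reading CSV data with pd.read_csv. To stay self-contained it reads from an in-memory string; the contents are hypothetical, and with a real file the path would be passed instead.

```python
import io
import pandas as pd

# Stand-in for a CSV file on disk (hypothetical contents).
csv_text = "Day,Category,Debit\nMon,Food,12.5\nTue,Fuel,40.0\n"

df = pd.read_csv(io.StringIO(csv_text))
# With a file on disk: df = pd.read_csv("path/to/file.csv")

print(df.shape)
```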
Indexing
Pandas Data Structure

         index   Column-1   Column-2   …     Column-n
Row-1    0       ...        ...        ...   ...
Row-2    1       ...        ...        ...   ...
…        ...     ...        ...        ...   ...
Row-L    L       ...        ...        ...   ...

DataFrame
Creating a DataFrame
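One common way to create a DataFrame is from a dictionary of columns, sketched below with invented data. The second call shows custom row labels supplied through index.

```python
import pandas as pd

data = {
    "name": ["Ann", "Bob", "Cara"],
    "age": [28, 35, 41],
}

df = pd.DataFrame(data)                                   # default index 0, 1, 2
df_labeled = pd.DataFrame(data, index=["r1", "r2", "r3"]) # custom row labels
```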
Indexing and slicing
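A sketch of basic DataFrame indexing and slicing with square brackets, using an invented table.

```python
import pandas as pd

df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cara"],
    "age": [28, 35, 41],
    "city": ["Cairo", "Giza", "Luxor"],
})

ages = df["age"]                 # single column -> Series
subset = df[["name", "city"]]    # list of columns -> DataFrame
first_two = df[0:2]              # row slice by position
over_30 = df[df["age"] > 30]     # boolean filter on rows
```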
loc Vs. iloc
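The distinction can be sketched as follows: loc selects by label, iloc selects by integer position. Note that label slices with loc include the end label, while position slices with iloc exclude the end position. The data is invented.

```python
import pandas as pd

df = pd.DataFrame(
    {"age": [28, 35, 41], "city": ["Cairo", "Giza", "Luxor"]},
    index=["ann", "bob", "cara"],
)

by_label = df.loc["bob", "age"]     # row label, column label
by_position = df.iloc[1, 0]         # row position, column position

label_slice = df.loc["ann":"bob"]   # end label is included
position_slice = df.iloc[0:1]       # end position is excluded
```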
Analyzing DataFrames
• head()
• tail()
• info()
• describe()
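The four methods listed above can be sketched on a small invented DataFrame as follows.

```python
import pandas as pd

df = pd.DataFrame({"x": range(10), "y": [v * 1.5 for v in range(10)]})

top = df.head(3)       # first 3 rows (default is 5)
bottom = df.tail(2)    # last 2 rows (default is 5)
df.info()              # prints column dtypes and non-null counts
stats = df.describe()  # count, mean, std, min, quartiles, max per numeric column
```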
Cleaning Empty Cells
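Two common ways to handle empty cells are to drop the affected rows or to fill them with a replacement value. A sketch with invented column names and data:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "duration": [60, 45, np.nan, 30],
    "calories": [400.0, np.nan, 300.0, 250.0],
})

missing = df.isna().sum()   # number of empty cells per column
dropped = df.dropna()       # keep only rows with no empty cells

# Fill each column with its own replacement value.
filled = df.fillna({"duration": df["duration"].mean(), "calories": 0})
```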
Removing Duplicates
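A minimal sketch of detecting and removing duplicate rows, using invented data.

```python
import pandas as pd

df = pd.DataFrame({"id": [1, 2, 2, 3], "value": ["a", "b", "b", "c"]})

mask = df.duplicated()          # True for rows that repeat an earlier row
deduped = df.drop_duplicates()  # keeps the first occurrence of each row
```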
Data Format
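The slide body is missing, so this is an assumption about its intent: fixing cells stored in the wrong format, as listed in the agenda. One sketch is to convert text columns to dates or numbers and turn unparseable values into missing values. Column names and data are invented.

```python
import pandas as pd

df = pd.DataFrame({
    "date": ["2020-12-01", "2020-12-02", "not a date"],
    "pulse": ["110", "117", "n/a"],
})

# errors="coerce" replaces values that cannot be parsed with NaT / NaN.
df["date"] = pd.to_datetime(df["date"], errors="coerce")
df["pulse"] = pd.to_numeric(df["pulse"], errors="coerce")
```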
Set DataFrame Index
Reset DataFrame Index
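Setting and resetting the index can be sketched as below; the id and score columns are invented.

```python
import pandas as pd

df = pd.DataFrame({"id": [101, 102, 103], "score": [88, 92, 79]})

indexed = df.set_index("id")           # "id" becomes the row labels
score_102 = indexed.loc[102, "score"]  # look up a row by its new label

restored = indexed.reset_index()       # "id" returns as an ordinary column
```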
Apply Function
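A sketch of apply on a Series (element by element) and on a DataFrame (row by row with axis=1, column by column by default). The data and the lambda functions are illustrative.

```python
import pandas as pd

df = pd.DataFrame({"a": [1, 2, 3], "b": [10, 20, 30]})

# Series.apply: call the function on each value.
df["a_squared"] = df["a"].apply(lambda v: v ** 2)

# DataFrame.apply with axis=1: call the function on each row.
row_sum = df[["a", "b"]].apply(lambda row: row["a"] + row["b"], axis=1)

# DataFrame.apply with the default axis=0: call the function on each column.
col_totals = df[["a", "b"]].apply(lambda col: col.sum())
```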
Drop Function
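A minimal sketch of drop for removing columns or rows by label. It returns a new DataFrame and leaves the original unchanged. The data is invented.

```python
import pandas as pd

df = pd.DataFrame({"a": [1, 2, 3], "b": [4, 5, 6], "c": [7, 8, 9]})

without_c = df.drop(columns=["c"])    # remove a column by name
without_first = df.drop(index=[0])    # remove a row by index label
```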
filter
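It is unclear whether the slides meant DataFrame.filter, which selects rows or columns by label, or filtering rows by value with a boolean mask. This sketch shows both on invented data.

```python
import pandas as pd

df = pd.DataFrame({"temp_min": [1, 2], "temp_max": [5, 6], "rain": [0, 3]})

# DataFrame.filter works on labels, not on values.
temps = df.filter(like="temp")      # columns whose name contains "temp"
picked = df.filter(items=["rain"])  # columns named exactly

# Filtering rows by value uses a boolean mask instead.
rainy = df[df["rain"] > 0]
```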
Group by
Aggregation Methods
Aggregation Method   Description
.count()             The number of non-null records
.sum()               The sum of the values
.mean()              The arithmetic mean of the values
.median()            The median of the values
.min()               The minimum value of the group
.max()               The maximum value of the group
.mode()              The most frequent value in the group (not a direct GroupBy method in current pandas releases; usually applied with .agg(pd.Series.mode))
.std()               The standard deviation of the group
.var()               The variance of the group

Group by Example
daily_spend_count = df.groupby('Day')['Debit'].count()
daily_spend_sum = df.groupby('Day')['Debit'].sum()

df.groupby(['Category','Month'])['Debit'].sum()
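The group-by calls above can be run end to end on a small table. The column names Day, Month, Category and Debit come from the example; the rows are invented for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "Day": ["Mon", "Mon", "Tue"],
    "Month": ["Jan", "Jan", "Feb"],
    "Category": ["Food", "Fuel", "Food"],
    "Debit": [10.0, 40.0, 5.0],
})

daily_spend_count = df.groupby("Day")["Debit"].count()
daily_spend_sum = df.groupby("Day")["Debit"].sum()

# Grouping by two columns produces a two-level index.
by_category_month = df.groupby(["Category", "Month"])["Debit"].sum()

# Several aggregations at once.
summary = df.groupby("Day")["Debit"].agg(["count", "sum", "mean"])
```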
Sort
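A sketch of sorting by column values and by index, using invented data.

```python
import pandas as pd

df = pd.DataFrame({"name": ["b", "c", "a"], "score": [2, 3, 1]})

by_score = df.sort_values("score", ascending=False)  # highest score first
by_name = df.sort_values("name")                     # alphabetical
original_order = by_score.sort_index()               # back to index order
```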
Correlations

df.corr()
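A sketch of corr on invented data. In pandas 2.0 and later, corr no longer skips non-numeric columns by default, so numeric_only=True is passed here to ignore the text column.

```python
import pandas as pd

df = pd.DataFrame({
    "x": [1, 2, 3, 4],
    "y": [2, 4, 6, 8],      # rises with x
    "z": [4, 3, 2, 1],      # falls as x rises
    "label": list("abcd"),  # non-numeric, ignored below
})

corr = df.corr(numeric_only=True)  # pairwise Pearson correlation
print(corr)
```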
Concatenate
append
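A minimal sketch of pd.concat along rows and along columns, with invented frames. DataFrame.append was removed in pandas 2.0, so row-wise concat is used in its place here.

```python
import pandas as pd

a = pd.DataFrame({"k": [1, 2]})
b = pd.DataFrame({"k": [3]})
extra = pd.DataFrame({"v": [9, 8]})

# axis=0 (default): stack rows; ignore_index renumbers them from 0.
rows = pd.concat([a, b], ignore_index=True)

# axis=1: place frames side by side, aligned on the index.
cols = pd.concat([a, extra], axis=1)
```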
Merge Function

[Figure: Data Integration. Tables with columns Column-1 … Column-n are combined; the labels shown are Year, Temperature, and Rainfall.]
Pandas Merge

df.merge(right=other_df, on='common_column', how='how_to_join')

[Figure: df + other_df = merged result]
Pandas Merge
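A sketch of merge with different how values. The column names Year, Temperature and Rainfall are taken from the slide labels; the frame names and all values are invented.

```python
import pandas as pd

temps = pd.DataFrame({"Year": [2020, 2021, 2022],
                      "Temperature": [15.1, 15.4, 15.2]})
rain = pd.DataFrame({"Year": [2021, 2022, 2023],
                     "Rainfall": [800, 750, 820]})

inner = temps.merge(right=rain, on="Year", how="inner")  # years in both
outer = temps.merge(right=rain, on="Year", how="outer")  # years in either
left = temps.merge(right=rain, on="Year", how="left")    # all years in temps
```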
Pandas concat vs. append vs. join vs. merge
• concat stacks DataFrames along an axis: axis=0 adds rows, axis=1 adds columns.
• append is the special case of concat with axis=0 and join='outer'. DataFrame.append was deprecated in pandas 1.4 and removed in pandas 2.0, so use pd.concat instead.
• merge combines two DataFrames on one or more columns, named with the 'on', 'left_on', and 'right_on' parameters.
• join combines DataFrames on their indexes (set with set_index); its 'how' parameter accepts 'left', 'right', 'inner', or 'outer'.
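The difference between merge (column-based) and join (index-based) can be sketched on two invented frames whose key columns have different names.

```python
import pandas as pd

left = pd.DataFrame({"key": ["a", "b"], "x": [1, 2]})
right = pd.DataFrame({"id": ["b", "c"], "y": [3, 4]})

# merge: match rows on columns, here with different names on each side.
merged = left.merge(right, left_on="key", right_on="id", how="inner")

# join: match rows on the index, so the key columns become the index first.
joined = left.set_index("key").join(right.set_index("id"), how="outer")
```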
THANK YOU
