Pandas

Uploaded by

db7646461


📘 Class Notes + Colab Code: Pandas DataFrame Basics

1. Introduction to Pandas
Pandas is a Python library for data analysis.

Provides two main data structures:
- Series → one-dimensional (like a single column).
- DataFrame → two-dimensional (like an Excel spreadsheet).
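
To make the two structures concrete, here is a minimal sketch that builds each one by hand. The values and the names `life_exp` and `table` are made up for illustration and are not taken from the Gapminder data.

```python
import pandas as pd

# A Series: one-dimensional, a single labelled column of values.
# (The numbers are illustrative placeholders.)
life_exp = pd.Series([70.5, 81.2, 64.0], name="lifeExp")

# A DataFrame: two-dimensional, several columns sharing one row index.
table = pd.DataFrame(
    {
        "country": ["A", "B", "C"],
        "lifeExp": [70.5, 81.2, 64.0],
    }
)

print(life_exp.ndim)   # number of dimensions of the Series
print(table.ndim)      # number of dimensions of the DataFrame
print(table.shape)     # (rows, columns)

# Each column of a DataFrame is itself a Series.
print(type(table["lifeExp"]))
```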

Why use Pandas instead of spreadsheets?
- Automation: repeat tasks easily.
- Reproducibility: every step is written in code.
- Flexibility: works across OS, integrates with many data sources.

2. Load Your First Dataset

# Import pandas
import pandas as pd

# Load the Gapminder dataset (tab-separated file)
df = pd.read_csv(
    "https://raw.githubusercontent.com/jennybc/gapminder/master/data/gapminder.tsv",
    sep="\t",
)

# Print first few rows
print(df.head())

👉 Teaching Point:
- .read_csv() loads delimited text files; pass sep="\t" for tab-separated (TSV) data.
- Always check .head() to preview the first rows (five by default).
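
If the URL is unreachable, the same loading step can be tried offline. The sketch below assumes a tiny made-up table held in a string (`tsv_text` and `sample` are illustrative names) and reads it exactly as a .tsv file would be read.

```python
import io

import pandas as pd

# A tiny tab-separated table held in memory (made-up values),
# standing in for a .tsv file on disk or at a URL.
tsv_text = "country\tyear\tlifeExp\nA\t2000\t70.5\nB\t2000\t81.2\nA\t2005\t71.0\n"

# sep="\t" tells read_csv that fields are separated by tabs, not commas.
sample = pd.read_csv(io.StringIO(tsv_text), sep="\t")

print(sample.head())    # first rows (up to five by default)
print(sample.head(2))   # only the first two rows
```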

3. Inspect DataFrame Structure

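# Type of object
print(type(df))

# Shape: rows and columns
print("Shape:", df.shape)

# Column names
print("Columns:", df.columns)

# Data types
print(df.dtypes)

# More detailed info (info() prints its report itself and returns None,
# so it is called directly instead of being wrapped in print())
df.info()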

👉 Teaching Point:
- .shape is an attribute, not a method → no parentheses.
- Common column dtypes include object (typically text), int64, and float64.
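
The inspection steps can be tried on a small table where the answers are easy to verify by eye. This is only a sketch; the table `sample` and its values are made up.

```python
import pandas as pd

# Small made-up table with one text, one integer, and one float column.
sample = pd.DataFrame(
    {
        "country": ["A", "B", "C"],
        "year": [2000, 2000, 2005],
        "lifeExp": [70.5, 81.2, 71.0],
    }
)

n_rows, n_cols = sample.shape      # attribute: no parentheses
print(n_rows, n_cols)

print(list(sample.columns))        # column labels as a plain list
print(sample.dtypes)               # one dtype per column

# info() writes its report itself and returns None,
# so call it directly rather than wrapping it in print().
result = sample.info()
print(result is None)
```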

4. Select Columns

# Single column → Series
country_series = df['country']
print(type(country_series))

# Single column → DataFrame
country_df = df[['country']]
print(type(country_df))

# Multiple columns
subset = df[['country', 'year', 'lifeExp']]
print(subset.head())

# Dot notation (shortcut)
print(df.country.head())

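👉 Teaching Point:
- df['col'] → Series
- df[['col']] → DataFrame
- Dot notation (df.col) works only when the column name is a valid Python identifier and does not clash with an existing DataFrame attribute or method.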
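
The difference between single and double brackets, and one way the dot shortcut can go wrong, can be seen in a short sketch. The table below is made up, and its column names "count" and "life exp" were chosen deliberately to be awkward.

```python
import pandas as pd

# Made-up table; "count" clashes with the DataFrame.count method and
# "life exp" contains a space, so neither works well with dot notation.
sample = pd.DataFrame(
    {
        "country": ["A", "B"],
        "count": [10, 20],
        "life exp": [70.5, 81.2],
    }
)

one_col_series = sample["country"]     # single brackets -> Series
one_col_frame = sample[["country"]]    # double brackets -> DataFrame
print(type(one_col_series).__name__, type(one_col_frame).__name__)

# Dot notation returns the method here, not the column.
print(callable(sample.count))

# Bracket notation always reaches the column.
print(sample["count"].tolist())
print(sample["life exp"].tolist())
```

Bracket notation is the safer default in scripts; the dot shortcut is mainly a convenience for quick interactive work.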

5. Select Rows

# By label with .loc[]
print(df.loc[0])              # Row labelled 0 (the first row with the default index)
print(df.loc[[0, 99]])        # Multiple rows by label

# By integer position with .iloc[]
print(df.iloc[0])             # First row
print(df.iloc[-1])            # Last row
print(df.iloc[[0, 99, 999]])  # Multiple rows by position

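👉 Teaching Point:
- .loc[] → uses labels (row index names).
- .iloc[] → uses positions (row numbers).
- With the default integer index (0, 1, 2, and so on), labels and positions coincide, which is why df.loc[0] and df.iloc[0] return the same row here. A negative value such as -1 is a position, so it works with .iloc[] but is not a valid label for .loc[] on this index.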
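
The distinction only becomes visible when the row labels are not 0, 1, 2. The sketch below assumes a made-up table whose index is 10, 20, 30 to show where .loc[] and .iloc[] diverge.

```python
import pandas as pd

# Made-up table whose row labels are NOT 0, 1, 2.
sample = pd.DataFrame(
    {"country": ["A", "B", "C"], "lifeExp": [70.5, 81.2, 64.0]},
    index=[10, 20, 30],
)

print(sample.loc[10])     # row whose label is 10
print(sample.iloc[0])     # row at position 0 (the same row here)
print(sample.iloc[-1])    # last row by position (label 30)

# Label 0 does not exist in this index, so .loc raises KeyError.
try:
    sample.loc[0]
except KeyError:
    print("no row labelled 0")
```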

6. Subset Rows and Columns

# Select rows 0, 99, 999 and columns country, lifeExp, gdpPercap
print(df.loc[[0, 99, 999], ['country', 'lifeExp', 'gdpPercap']])

# Same with iloc (by position)
print(df.iloc[[0, 99, 999], [0, 3, 5]])
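
Hard-coded column positions such as [0, 3, 5] break silently if the column order changes. One possible safeguard, sketched here on a made-up table, is to look the positions up from the column names with `Index.get_indexer` and confirm that both selections agree.

```python
import pandas as pd

# Made-up table used only to compare the two selection styles.
sample = pd.DataFrame(
    {
        "country": ["A", "B", "C", "D"],
        "year": [2000, 2000, 2005, 2005],
        "lifeExp": [70.5, 81.2, 64.0, 75.3],
        "gdpPercap": [1200.0, 30500.0, 800.0, 9100.0],
    }
)

wanted_rows = [0, 2]
wanted_cols = ["country", "gdpPercap"]

# Label-based selection: row labels and column names.
by_label = sample.loc[wanted_rows, wanted_cols]

# Position-based selection: look the column positions up instead of
# hard-coding them, so the code survives a change in column order.
col_positions = sample.columns.get_indexer(wanted_cols)
by_position = sample.iloc[wanted_rows, col_positions]

print(by_label)
print(by_label.equals(by_position))
```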

7. Grouped and Aggregated Statistics

# Average life expectancy by year
print(df.groupby('year')['lifeExp'].mean())

# Average lifeExp and gdpPercap by year + continent
grouped = df.groupby(['year', 'continent'])[['lifeExp', 'gdpPercap']].mean()
print(grouped.head())

# Flatten the grouped result
print(grouped.reset_index().head())

# Number of countries per continent
print(df.groupby('continent')['country'].nunique())

👉 Teaching Point:
- .groupby() = split → apply → combine.
- Use .mean(), .sum(), .count(), etc.
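
Split → apply → combine is easier to follow on a table small enough to check by hand. The sketch below uses made-up values and also shows `.agg()`, which applies several aggregations in one call.

```python
import pandas as pd

# Made-up table: two continents, two years.
sample = pd.DataFrame(
    {
        "continent": ["X", "X", "Y", "Y"],
        "year": [2000, 2005, 2000, 2005],
        "lifeExp": [60.0, 70.0, 80.0, 82.0],
    }
)

# Split by continent, apply mean to lifeExp, combine into one Series.
mean_by_continent = sample.groupby("continent")["lifeExp"].mean()
print(mean_by_continent)

# Several aggregations at once with .agg().
summary = sample.groupby("continent")["lifeExp"].agg(["mean", "min", "max", "count"])
print(summary)

# reset_index() turns the group labels back into an ordinary column.
flat = summary.reset_index()
print(list(flat.columns))
```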

8. Basic Plotting

import matplotlib.pyplot as plt

# Global yearly life expectancy trend
global_yearly_life = df.groupby('year')['lifeExp'].mean()

# Plot
global_yearly_life.plot(title="Average Life Expectancy Over Time")
plt.xlabel("Year")
plt.ylabel("Life Expectancy")
plt.show()

👉 Teaching Point:
- Pandas integrates with Matplotlib.
- .plot() quickly visualizes trends.
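
Because .plot() returns a Matplotlib Axes object, labels can also be set on that object directly. The sketch below assumes made-up yearly averages and a non-interactive backend, so it runs without a display and writes the image to an in-memory buffer instead of calling plt.show().

```python
import io

import matplotlib

matplotlib.use("Agg")  # non-interactive backend; an assumption for headless runs
import matplotlib.pyplot as plt
import pandas as pd

# Made-up yearly averages standing in for the grouped result.
yearly = pd.Series([60.0, 65.0, 70.0], index=[1990, 2000, 2010], name="lifeExp")

# Series.plot() returns the Matplotlib Axes it drew on.
ax = yearly.plot(title="Average Life Expectancy Over Time", marker="o")
ax.set_xlabel("Year")
ax.set_ylabel("Life Expectancy")

# Render the figure as PNG bytes in memory.
buffer = io.BytesIO()
ax.figure.savefig(buffer, format="png")
print(buffer.getbuffer().nbytes > 0)

plt.close(ax.figure)  # release the figure once it has been saved
```

In Colab the backend line and the buffer can be dropped; the figure appears inline after the plotting cell runs.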
