Cheat Sheet: Learn Python for Data Science Interactively at www.DataCamp.com

1. The document covers data manipulation and analysis techniques using the Pandas library in Python, including selecting, filtering, querying, reshaping, merging, and indexing data.
2. It provides examples of how to pivot/reshape data, select columns based on conditions, query a DataFrame, merge/join datasets, and set/reset the index.
3. These techniques support exploring, cleaning, and preparing data for further analysis using Pandas functions such as loc, query, merge, pivot, and reindex.


PYTHON FOR DATA SCIENCE: PANDAS

Selecting                                Also see NumPy Arrays

>>> df3.loc[:, (df3 > 1).any()]      # Select cols with any vals > 1
>>> df3.loc[:, (df3 > 1).all()]      # Select cols with all vals > 1
>>> df3.loc[:, df3.isnull().any()]   # Select cols with NaN
>>> df3.loc[:, df3.notnull().all()]  # Select cols without NaN

Indexing With isin
>>> df[df.Country.isin(df2.Type)]    # Find same elements
>>> df3.filter(items=['a', 'b'])     # Filter on values
>>> df.select(lambda x: not x % 5)   # Select specific elements
                                     # (select() is deprecated; use df.loc)

Where
>>> s.where(s > 0)                   # Subset the data
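The column-selection idioms above can be sketched on a small stand-in frame (the `df3` values here are illustrative, not the sheet's actual data). Note that a NaN comparison evaluates to False, so a column with NaN never passes the `.all()` test:

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the sheet's df3
df3 = pd.DataFrame({"a": [1.5, 2.0, 1.5],
                    "b": [0.1, 0.2, 0.3],
                    "c": [1.0, np.nan, 3.0]})

any_gt1 = df3.loc[:, (df3 > 1).any()]      # cols with at least one value > 1
all_gt1 = df3.loc[:, (df3 > 1).all()]      # cols where every value > 1
has_nan = df3.loc[:, df3.isnull().any()]   # cols containing NaN
no_nan  = df3.loc[:, df3.notnull().all()]  # cols with no NaN

print(list(any_gt1.columns))  # ['a', 'c']
print(list(all_gt1.columns))  # ['a']  (c is excluded: NaN > 1 is False)
```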
Query

>>> df6.query('second > first')      # Query DataFrame

Setting/Resetting Index

>>> df.set_index('Country')          # Set the index
>>> df4 = df.reset_index()           # Reset the index
>>> df = df.rename(index=str,        # Rename DataFrame
...                columns={"Country": "cntry",
...                         "Capital": "cptl",
...                         "Population": "ppltn"})

Reindexing

>>> s2 = s.reindex(['a', 'c', 'd', 'e', 'b'])

Forward Filling                          Backward Filling
>>> df.reindex(range(4),                 >>> s3 = s.reindex(range(5),
...            method='ffill')           ...                method='bfill')

   Country   Capital     Population         0  3
0  Belgium   Brussels    11190846           1  3
1  India     New Delhi   1303171035         2  3
2  Brazil    Brasília    207847528          3  3
3  Brazil    Brasília    207847528          4  3

Reshaping Data

Pivot
>>> df3 = df2.pivot(index='Date',    # Spread rows into columns
...                 columns='Type',
...                 values='Value')

   Date        Type  Value           Date        a       b       c
0  2016-03-01  a     11.432          2016-03-01  11.432  NaN     20.784
1  2016-03-02  b     13.031          2016-03-02  1.303   13.031  NaN
2  2016-03-01  c     20.784          2016-03-03  99.906  NaN     20.784
3  2016-03-03  a     99.906
4  2016-03-02  a     1.303
5  2016-03-03  c     20.784

Pivot Table
>>> df4 = pd.pivot_table(df2,        # Spread rows into columns
...                      values='Value',
...                      index='Date',
...                      columns='Type')

Combining Data

data1                    data2
   X1  X2                   X1  X3
   a   11.432               a   20.784
   b   1.303                b   NaN
   c   99.906               d   20.784

>>> pd.merge(data1, data2,              X1  X2      X3
...          how='left', on='X1')       a   11.432  20.784
                                        b   1.303   NaN
                                        c   99.906  NaN

>>> pd.merge(data1, data2,              X1  X2      X3
...          how='right', on='X1')      a   11.432  20.784
                                        b   1.303   NaN
                                        d   NaN     20.784

>>> pd.merge(data1, data2,              X1  X2      X3
...          how='inner', on='X1')      a   11.432  20.784
                                        b   1.303   NaN

>>> pd.merge(data1, data2,              X1  X2      X3
...          how='outer', on='X1')      a   11.432  20.784
                                        b   1.303   NaN
                                        c   99.906  NaN
                                        d   NaN     20.784

Join
>>> data1.join(data2, how='right')
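The four merge variants can be reproduced end to end using the sheet's own data1/data2 tables (a minimal sketch; only the values shown above are assumed):

```python
import numpy as np
import pandas as pd

# The sheet's sample tables
data1 = pd.DataFrame({"X1": ["a", "b", "c"],
                      "X2": [11.432, 1.303, 99.906]})
data2 = pd.DataFrame({"X1": ["a", "b", "d"],
                      "X3": [20.784, np.nan, 20.784]})

left  = pd.merge(data1, data2, how="left",  on="X1")  # keep all data1 keys
right = pd.merge(data1, data2, how="right", on="X1")  # keep all data2 keys
inner = pd.merge(data1, data2, how="inner", on="X1")  # keys present in both
outer = pd.merge(data1, data2, how="outer", on="X1")  # keys present in either

print(len(left), len(right), len(inner), len(outer))  # 3 3 2 4
```

The row counts match the result tables above: `inner` keeps only the shared keys a and b, while `outer` adds a NaN-padded row for each key missing on one side.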
Stack / Unstack

>>> stacked = df5.stack()            # Pivot a level of column labels
>>> stacked.unstack()                # Pivot a level of index labels

Unstacked                                Stacked
        0         1                      1  5  0  0.233482
1  5  0.233482  0.390959                       1  0.390959
2  4  0.184713  0.237102                 2  4  0  0.184713
3  3  0.433522  0.429401                       1  0.237102
                                         3  3  0  0.433522
                                               1  0.429401

MultiIndexing

>>> arrays = [np.array([1, 2, 3]),
...           np.array([5, 4, 3])]
>>> df5 = pd.DataFrame(np.random.rand(3, 2), index=arrays)
>>> tuples = list(zip(*arrays))
>>> index = pd.MultiIndex.from_tuples(tuples,
...                                   names=['first', 'second'])
>>> df6 = pd.DataFrame(np.random.rand(3, 2), index=index)
>>> df2.set_index(["Date", "Type"])

Vertical
>>> s.append(s2)                     # removed in pandas 2.0;
                                     # use pd.concat([s, s2])

Horizontal/Vertical
>>> pd.concat([s, s2], axis=1, keys=['One', 'Two'])
>>> pd.concat([data1, data2], axis=1, join='inner')
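The MultiIndex setup above can be run as one piece; since `np.random.rand` makes the values differ per run, the checks below look only at shapes and at the `query` on the named index levels:

```python
import numpy as np
import pandas as pd

# Reproducing the sheet's MultiIndex construction
arrays = [np.array([1, 2, 3]), np.array([5, 4, 3])]
tuples = list(zip(*arrays))
index = pd.MultiIndex.from_tuples(tuples, names=["first", "second"])
df6 = pd.DataFrame(np.random.rand(3, 2), index=index)

stacked = df6.stack()          # columns fold into an inner index level
roundtrip = stacked.unstack()  # and unfold back to the original shape

hits = df6.query("second > first")  # matches rows (1, 5) and (2, 4)
```

`query` can reference the index levels by their names (`first`, `second`), which is why naming the levels in `from_tuples` pays off.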
Dates

>>> df2['Date'] = pd.to_datetime(df2['Date'])
>>> df2['Date'] = pd.date_range('2000-1-1',
...                             periods=6,
...                             freq='M')
>>> dates = [datetime(2012,5,1), datetime(2012,5,2)]
>>> index = pd.DatetimeIndex(dates)
>>> index = pd.date_range(datetime(2012,2,1), end, freq='BM')

Duplicate Data

>>> s3.unique()                              # Return unique values
>>> df2.duplicated('Type')                   # Check duplicates
>>> df2.drop_duplicates('Type', keep='last') # Drop duplicates
>>> df.index.duplicated()                    # Check index duplicates

Melt

>>> pd.melt(df2,                     # Gather columns into rows
...         id_vars=["Date"],
...         value_vars=["Type", "Value"],
...         value_name="Observations")

   Date        Type  Value            Date        Variable  Observations
0  2016-03-01  a     11.432       0   2016-03-01  Type      a
1  2016-03-02  b     13.031       1   2016-03-02  Type      b
2  2016-03-01  c     20.784       2   2016-03-01  Type      c
3  2016-03-03  a     99.906       3   2016-03-03  Type      a
4  2016-03-02  a     1.303        4   2016-03-02  Type      a
5  2016-03-03  c     20.784       5   2016-03-03  Type      c
                                  6   2016-03-01  Value     11.432
                                  7   2016-03-02  Value     13.031
                                  8   2016-03-01  Value     20.784
                                  9   2016-03-03  Value     99.906
                                  10  2016-03-02  Value     1.303
                                  11  2016-03-03  Value     20.784

Grouping Data

Aggregation
>>> df2.groupby(by=['Date', 'Type']).mean()
>>> df4.groupby(level=0).sum()
>>> df4.groupby(level=0).agg({'a': lambda x: sum(x)/len(x),
...                           'b': np.sum})

Transformation
>>> customSum = lambda x: (x + x % 2)
>>> df4.groupby(level=0).transform(customSum)

Visualization

>>> import matplotlib.pyplot as plt
>>> s.plot()
>>> plt.show()
>>> df2.plot()
>>> plt.show()
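The difference between aggregation and transformation is worth seeing side by side: aggregation collapses each group to one row, while transformation returns a result aligned to the original rows. A minimal sketch on the sheet's df2:

```python
import pandas as pd

# The sheet's df2
df2 = pd.DataFrame({
    "Date": ["2016-03-01", "2016-03-02", "2016-03-01",
             "2016-03-03", "2016-03-02", "2016-03-03"],
    "Type": ["a", "b", "c", "a", "a", "c"],
    "Value": [11.432, 13.031, 20.784, 99.906, 1.303, 20.784],
})

# Aggregation: one mean per (Date, Type) pair
agg = df2.groupby(by=["Date", "Type"]).mean()

# Transformation: each row gets its group's total, same length as df2
totals = df2.groupby("Type")["Value"].transform("sum")
```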
Missing Data

>>> df.dropna()                  # Drop NaN values
>>> df3.fillna(df3.mean())       # Fill NaN values with a predetermined value
>>> df2.replace("a", "f")        # Replace values with others

Iteration

>>> df.items()                   # (Column-index, Series) pairs
                                 # (iteritems() was removed in pandas 2.0)
>>> df.iterrows()                # (Row-index, Series) pairs
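A short sketch tying the missing-data and iteration idioms together (the `df3` frame here is a hypothetical example, not the sheet's data):

```python
import numpy as np
import pandas as pd

df3 = pd.DataFrame({"a": [1.0, np.nan, 3.0],
                    "b": [4.0, 5.0, 6.0]})

dropped = df3.dropna()           # rows containing any NaN are removed
filled = df3.fillna(df3.mean())  # NaN replaced by each column's mean

# items() yields (column name, Series) pairs
ncols = sum(1 for _name, _col in df3.items())

print(filled.loc[1, "a"])  # 2.0, the mean of 1.0 and 3.0
```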
