[go: up one dir, main page]

0% found this document useful (0 votes)
32 views9 pages

01 Data Handlinng Using Pandas-I-1-9

The document provides an overview of the Pandas library, highlighting its advantages for data analysis and manipulation, including handling missing data and efficient data slicing. It details the primary data structures in Pandas, specifically Series and DataFrame, and explains the characteristics and creation of Series, including examples. Additionally, it covers mathematical operations, as well as the use of head() and tail() functions to access specific rows of a Series.

Uploaded by

princenandi123x
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views9 pages

01 Data Handlinng Using Pandas-I-1-9

The document provides an overview of the Pandas library, highlighting its advantages for data analysis and manipulation, including handling missing data and efficient data slicing. It details the primary data structures in Pandas, specifically Series and DataFrame, and explains the characteristics and creation of Series, including examples. Additionally, it covers mathematical operations, as well as the use of head() and tail() functions to access specific rows of a Series.

Uploaded by

princenandi123x
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

CHAPTER-1 Data Handling using Pandas –I

Pandas:
 It is a package useful for data analysis and manipulation.
 Pandas provide an easy way to create, manipulate and wrangle the
data.
 Pandas provide powerful and easy-to-use data structures, as well
as the means to quickly perform operations on these structures.

Data scientists use Pandas for its following advantages:

 Easily handles missing data.


 It uses Series for one-dimensional data structure and DataFrame
for multi-dimensional data structure.
 It provides an efficient way to slice the data.
 It provides a flexible way to merge, concatenate or reshape the
data.

DATA STRUCTURE IN PANDAS


A data structure is a way to arrange the data in such a way that so it
can be accessed quickly and we can perform various operation on this
data like- retrieval, deletion, modification etc.

Pandas deals with 3 data structure-

1. Series
2. Data Frame
3. Panel

We are having only series and data frame in our syllabus.


Series
Series-Series is a one-dimensional array like structure with
homogeneous data, which can be used to handle and manipulate data.
What makes it special is its index attribute, which has incredible
functionality and is heavily mutable.

It has two parts-


1. Data part (An array of actual data)
2. Associated index with data (associated array of indexes or data labels)

e.g.-

Index Data

0 10

1 15

2 18

3 22

 We can say that Series is a labeled one-dimensional array


which can hold any type of data.
 Data of Series is always mutable, means it can be changed.
 But the size of Data of Series is always immutable, means it
cannot be changed.
 Series may be considered as a Data Structure with two
arrays out which one array works as Index (Labels) and the
second array works as original Data.
 Row Labels in Series are called Index.
Syntax to create a Series:

<Series Object>=pandas.Series (data, index=idx (optional))

 Where data may be python sequence (Lists), ndarray,


scalar value or a python dictionary.

How to create Series with nd array

Program-

import pandas as pd
Output-
import numpy as np Default Index
0 10
arr=np.array([10,15,18,22])
1 15
s = pd.Series(arr) 2 18

print(s) 3 22

Data
Here we create an
array of 4 values.
How to create Series with Mutable index

Program-

import pandas as pd Output-


import numpy as np first a
arr=np.array(['a','b','c','d']) second b
third c
s=pd.Series(arr,
fourth d
index=['first','second','third','fourth'])

print(s)
Creating a series from Scalar value

To create a series from scalar value, an index must be provided. The


scalar value will be repeated as per the length of index.

Creating a series from a Dictionary


Mathematical Operations in Series

Print all the values of the Series by multiplying them by 2.

Print Square of all the values of the series.

Print all the values of the Series that are greater than 2.
Example-2

While adding two series, if Non-Matching Index is found in either of the


Series, Then NaN will be printed corresponds to Non-Matching Index.

If Non-Matching Index is found in either of the series, then this Non-


Matching Index corresponding value of that series will be filled as 0.
Head and Tail Functions in Series

head (): It is used to access the first 5 rows of a series.


Note :To access first 3 rows we can call series_name.head(3)

Result of s.head()

Result of s.head(3)
tail(): It is used to access the last 5 rows of a series.
Note :To access last 4 rows we can call series_name.tail (4)

You might also like