0% found this document useful (0 votes)

18 views10 pages

Assignment1 Param

PRACTICAL PACKET CAPTURING USING Whireshark TEMPLATE

Uploaded by

paramdholakia3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views10 pages

Assignment1 Param

PRACTICAL PACKET CAPTURING USING Whireshark TEMPLATE

Uploaded by

paramdholakia3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

AIML 202046702

ASSIGNMENT-1
importing required libraries
In [1]: import pandas as pd
import matplotlib.pyplot as plt

reading the dataset

In [2]: df=pd.read_csv('amazon.csv')

1. Display Top 5 Rows of The Dataset.

In [3]: df.head(5)

Out[3]: year state month number date

0 1998 Acre Janeiro 0.0 1998-01-01

1 1999 Acre Janeiro 0.0 1999-01-01

2 2000 Acre Janeiro 0.0 2000-01-01

3 2001 Acre Janeiro 0.0 2001-01-01

4 2002 Acre Janeiro 0.0 2002-01-01

2. Check Last 5 Rows.

In [4]: df.tail(5)

Out[4]: year state month number date

6449 2012 Tocantins Dezembro 128.0 2012-01-01

6450 2013 Tocantins Dezembro 85.0 2013-01-01

6451 2014 Tocantins Dezembro 223.0 2014-01-01

6452 2015 Tocantins Dezembro 373.0 2015-01-01

6453 2016 Tocantins Dezembro 119.0 2016-01-01

3. Find Shape of Our Dataset (Number of Rows

and Number of Columns).
In [5]: print('No. of rows: ',df.shape[0])
print('No. of columns: ',df.shape[1])

12202040501049 PARAM H DHOLAKIA

AIML 202046702

No. of rows: 6454

No. of columns: 5

4. Getting Information About Our Dataset Like

Total Number Rows, Total Number of Columns,
Datatypes of Each Column and Memory
Requirement.
In [6]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 6454 entries, 0 to 6453
Data columns (total 5 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 year 6454 non-null int64
1 state 6454 non-null object
2 month 6454 non-null object
3 number 6454 non-null float64
4 date 6454 non-null object
dtypes: float64(1), int64(1), object(3)
memory usage: 252.2+ KB

5. Check For Duplicate Data and Drop Them.

In [7]: df.columns

Out[7]: Index(['year', 'state', 'month', 'number', 'date'], dtype='object')

In [8]: duplicate=df[df.duplicated()]
duplicate

12202040501049 PARAM H DHOLAKIA

AIML 202046702

Out[8]: year state month number date

259 2017 Alagoas Janeiro 38.0 2017-01-01

2630 1998 Mato Grosso Janeiro 0.0 1998-01-01

2650 1998 Mato Grosso Fevereiro 0.0 1998-01-01

2670 1998 Mato Grosso Março 0.0 1998-01-01

2690 1998 Mato Grosso Abril 0.0 1998-01-01

2710 1998 Mato Grosso Maio 0.0 1998-01-01

3586 1998 Paraiba Janeiro 0.0 1998-01-01

3606 1998 Paraiba Fevereiro 0.0 1998-01-01

3621 2013 Paraiba Fevereiro 9.0 2013-01-01

3626 1998 Paraiba Março 0.0 1998-01-01

3646 1998 Paraiba Abril 0.0 1998-01-01

3666 1998 Paraiba Maio 0.0 1998-01-01

4542 1998 Rio Janeiro 0.0 1998-01-01

4562 1998 Rio Fevereiro 0.0 1998-01-01

4582 1998 Rio Março 0.0 1998-01-01

4585 2001 Rio Março 0.0 2001-01-01

4590 2006 Rio Março 8.0 2006-01-01

4602 1998 Rio Abril 0.0 1998-01-01

4608 2004 Rio Abril 3.0 2004-01-01

4613 2009 Rio Abril 1.0 2009-01-01

4622 1998 Rio Maio 0.0 1998-01-01

4631 2007 Rio Maio 2.0 2007-01-01

4632 2008 Rio Maio 0.0 2008-01-01

4645 2001 Rio Junho 13.0 2001-01-01

4781 1998 Rio Janeiro 0.0 1998-01-01

4800 2017 Rio Janeiro 28.0 2017-01-01

4801 1998 Rio Fevereiro 0.0 1998-01-01

4821 1998 Rio Março 0.0 1998-01-01

4841 1998 Rio Abril 0.0 1998-01-01

4861 1998 Rio Maio 0.0 1998-01-01

4864 2001 Rio Maio 4.0 2001-01-01

4910 2007 Rio Julho 7.0 2007-01-01

12202040501049 PARAM H DHOLAKIA

AIML 202046702

In [9]: df=df.drop_duplicates()

In [10]: df

Out[10]: year state month number date

0 1998 Acre Janeiro 0.0 1998-01-01

1 1999 Acre Janeiro 0.0 1999-01-01

2 2000 Acre Janeiro 0.0 2000-01-01

3 2001 Acre Janeiro 0.0 2001-01-01

4 2002 Acre Janeiro 0.0 2002-01-01

... ... ... ... ... ...

6449 2012 Tocantins Dezembro 128.0 2012-01-01

6450 2013 Tocantins Dezembro 85.0 2013-01-01

6451 2014 Tocantins Dezembro 223.0 2014-01-01

6452 2015 Tocantins Dezembro 373.0 2015-01-01

6453 2016 Tocantins Dezembro 119.0 2016-01-01

6422 rows × 5 columns

6. Check Null Values in The Dataset.

In [11]: #checks for total no.of null values for each column
df.isna().sum()

Out[11]: year 0
state 0
month 0
number 0
date 0
dtype: int64

7. Get Overall Statistics About the Dataframe.

In [12]: df.describe()

12202040501049 PARAM H DHOLAKIA

AIML 202046702

Out[12]: year number

count 6422.000000 6422.000000

mean 2007.490969 108.815178

std 5.731806 191.142482

min 1998.000000 0.000000

25% 2003.000000 3.000000

50% 2007.000000 24.497000

75% 2012.000000 114.000000

max 2017.000000 998.000000

8. Rename Month Names to English.

In [13]: df['month'].unique()

Out[13]: array(['Janeiro', 'Fevereiro', 'Março', 'Abril', 'Maio', 'Junho', 'Julho',

'Agosto', 'Setembro', 'Outubro', 'Novembro', 'Dezembro'],
dtype=object)

In [14]: month_map={'Janeiro':'January','Fevereiro':'February','Março':'March','Abril':'A
'Agosto':'August', 'Setembro':'September', 'Outubro':'October', 'Novembro

In [15]: df['month']=df['month'].map(month_map)
df['month'].unique()

Out[15]: array(['January', 'February', 'March', 'April', 'May', 'June', 'July',

'August', 'September', 'October', 'November', 'December'],
dtype=object)

9. Total Number of Fires Registered.

In [16]: print('Total fires registered: ',df.shape[0])

Total fires registered: 6422

10.In Which Month Maximum Number of Forest

Fires Were Reported?
In [17]: df.columns

Out[17]: Index(['year', 'state', 'month', 'number', 'date'], dtype='object')

In [18]: no_of_cases=df.groupby('month')['number'].sum().sort_values(ascending=False).ind
print(no_of_cases[0],' is the month with highest no. of cases')

July is the month with highest no. of cases

12202040501049 PARAM H DHOLAKIA

AIML 202046702

11.In Which Year Maximum Number of Forest Fires

Was Reported?
In [19]: no_of_cases=df.groupby('year')['number'].sum().sort_values(ascending=False).inde
print(no_of_cases[0],' is the year with highest no. of cases')

2003 is the year with highest no. of cases

12.In Which State Maximum Number of Forest

Fires Was Reported?
In [20]: no_of_cases=df.groupby('state')['number'].sum().sort_values(ascending=False).ind
print(no_of_cases[0],' is the state with highest no. of cases')

Mato Grosso is the state with highest no. of cases

13.Find Total Number of Fires Were Reported in

Amazonas.
In [21]: df.columns

Out[21]: Index(['year', 'state', 'month', 'number', 'date'], dtype='object')

In [22]: #extraxt rows with state Amazonas

df2=df[df['state']=='Amazonas']

In [23]: print("Total number of forest fires in Amazonas:",df2['number'].sum()) #Get tota

Total number of forest fires in Amazonas: 30650.129

14.Display Number of Fires Were Reported in

Amazonas (Year-Wise).
In [24]: df.columns

Out[24]: Index(['year', 'state', 'month', 'number', 'date'], dtype='object')

In [25]: df3=df[df['state']=='Amazonas'].groupby('year')['number'].sum()
df3

12202040501049 PARAM H DHOLAKIA

AIML 202046702

Out[25]: year
1998 946.000
1999 1061.000
2000 853.000
2001 1297.000
2002 2852.000
2003 1524.268
2004 2298.207
2005 1657.128
2006 997.640
2007 589.601
2008 2717.000
2009 1320.601
2010 2324.508
2011 1652.538
2012 1110.641
2013 905.217
2014 2385.909
2015 1189.994
2016 2060.972
2017 906.905
Name: number, dtype: float64

15.Display Number of Fires Were Reported in

Amazonas (Day-Wise).
In [26]: #extract rows with state amazonas
df2=df[df['state']=='Amazonas']

In [27]: #convert date column to date-time format

df2['date'] = pd.to_datetime(df2['date'])
df3=df2.groupby(df2['date'].dt.dayofweek)['number'].sum()

C:\Users\PARAM\AppData\Local\Temp\ipykernel_8680\3119725923.py:2: SettingWithCopy
Warning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stabl

e/user_guide/indexing.html#returning-a-view-versus-a-copy
df2['date'] = pd.to_datetime(df2['date'])

In [28]: dict = {0: 'Sunday',1: 'Monday',2: 'Tuesday',3: 'Wednesday',4: 'Thursday',5: 'Fr

In [29]: #map numeric day to names of day

df3.index = df3.index.map(dict)

In [30]: df3

12202040501049 PARAM H DHOLAKIA

AIML 202046702

Out[30]: date
Sunday 1886.601
Monday 6474.217
Tuesday 3910.177
Wednesday 5754.802
Thursday 5446.480
Friday 4162.666
Saturday 3015.186
Name: number, dtype: float64

16.Find Total Number of Fires Were Reported In

2015 And Visualize Data Based on Each ‘Month’.
In [31]: #total fire reports in each month for 2015
df2=df[df['year']==2015].groupby('month')['number'].sum().reset_index()

In [32]: df2

Out[32]: month number

0 April 2573.000

1 August 4363.125

2 December 4088.522

3 February 2309.000

4 January 4635.000

5 July 4364.392

6 June 3260.552

7 March 2202.000

8 May 2384.000

9 November 4034.518

10 October 4499.525

11 September 2494.658

In [33]: plt.figure(figsize=(20, 5)) #to ensure image readability

plt.bar(df2['month'],df2['number'])
plt.show()

12202040501049 PARAM H DHOLAKIA

AIML 202046702

17.Find Average Number of Fires Were Reported

from Highest to Lowest (State-Wise).
In [34]: #Group the data by state and find average reports state-wise
df2=df.groupby('state')['number'].mean().reset_index()

In [35]: #sort values from highest to lowest average

df2.sort_values('number',ascending=False)

Out[35]: state number

20 Sao Paulo 213.896226

10 Mato Grosso 203.479975

4 Bahia 187.222703

15 Piau 158.174674

8 Goias 157.721841

11 Minas Gerais 156.800243

22 Tocantins 141.037176

3 Amazonas 128.243218

5 Ceara 127.314071

12 Paraiba 111.073979

9 Maranhao 105.142808

13 Pará 102.561272

14 Pernambuco 102.502092

18 Roraima 102.029598

19 Santa Catarina 101.924067

2 Amapa 91.345506

17 Rondonia 84.876272

0 Acre 77.255356

16 Rio 64.698515

7 Espirito Santo 27.389121

1 Alagoas 19.271967

6 Distrito Federal 14.899582

21 Sergipe 13.543933

18.To Find the State Names Where Fires Were

Reported In 'dec' Month.

12202040501049 PARAM H DHOLAKIA

AIML 202046702

In [36]: states=df[df['month']=='December']['state'].unique()

In [37]: print("List of states:")

for i in states:
print(i)

List of states:
Acre
Alagoas
Amapa
Amazonas
Bahia
Ceara
Distrito Federal
Espirito Santo
Goias
Maranhao
Mato Grosso
Minas Gerais
Pará
Paraiba
Pernambuco
Piau
Rio
Rondonia
Roraima
Santa Catarina
Sao Paulo
Sergipe
Tocantins

12202040501049 PARAM H DHOLAKIA

Untitled 5
No ratings yet
Untitled 5
10 pages
Forest Fires Analysis
No ratings yet
Forest Fires Analysis
11 pages
Exemplar - Dataframes With Pandas
No ratings yet
Exemplar - Dataframes With Pandas
11 pages
Pandas Library
No ratings yet
Pandas Library
5 pages
Pandas Methods
No ratings yet
Pandas Methods
6 pages
Pandas 1705297450
No ratings yet
Pandas 1705297450
21 pages
Pandas Cheat Sheet for Data Manipulation
No ratings yet
Pandas Cheat Sheet for Data Manipulation
1 page
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
Week 10 Intro Time Series
No ratings yet
Week 10 Intro Time Series
34 pages
Chapter 2 - Python Pandas II
No ratings yet
Chapter 2 - Python Pandas II
71 pages
100 Pandas Puzzles
No ratings yet
100 Pandas Puzzles
20 pages
Data Analysis
No ratings yet
Data Analysis
4 pages
DMV - 4 - Jupyter Notebook
No ratings yet
DMV - 4 - Jupyter Notebook
8 pages
Pandas & Vis 2
No ratings yet
Pandas & Vis 2
11 pages
Pandas
No ratings yet
Pandas
5 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
12 Pandas
100% (1)
12 Pandas
21 pages
Python Pandas for Data Science
No ratings yet
Python Pandas for Data Science
59 pages
Loc Iloc at Dataframe
No ratings yet
Loc Iloc at Dataframe
9 pages
Python Data Cleaning
100% (1)
Python Data Cleaning
20 pages
Unit 1 Python Pandas
No ratings yet
Unit 1 Python Pandas
20 pages
PJT Explanation of Code Line by Line
No ratings yet
PJT Explanation of Code Line by Line
2 pages
2-Introduction To Data Cleaning P02
No ratings yet
2-Introduction To Data Cleaning P02
7 pages
Data Cheat Sheet
No ratings yet
Data Cheat Sheet
2 pages
Project Intern - Jupyter Notebook
No ratings yet
Project Intern - Jupyter Notebook
16 pages
Pandas DataFrame Cheat Sheet
No ratings yet
Pandas DataFrame Cheat Sheet
4 pages
Pandas DataFrame Cheat Sheet
100% (1)
Pandas DataFrame Cheat Sheet
10 pages
Pandas Data Structures: Sections
No ratings yet
Pandas Data Structures: Sections
13 pages
Series 1
No ratings yet
Series 1
408 pages
Pandas 1
No ratings yet
Pandas 1
49 pages
Pandas Cheat Sheet Free Resources At: Dataquest - Io/guide
No ratings yet
Pandas Cheat Sheet Free Resources At: Dataquest - Io/guide
7 pages
Cs Sem V Dav Upc 32347507 Sl. No. Qp. 4432 Dec '23
No ratings yet
Cs Sem V Dav Upc 32347507 Sl. No. Qp. 4432 Dec '23
16 pages
Lecture 3 - Pandas
No ratings yet
Lecture 3 - Pandas
37 pages
PYQ Data Analysis and Visualisation Using Python GE May 2024
No ratings yet
PYQ Data Analysis and Visualisation Using Python GE May 2024
6 pages
Python2 Master
No ratings yet
Python2 Master
12 pages
Numpy - Pandas - Lab - Jupyter Notebook
No ratings yet
Numpy - Pandas - Lab - Jupyter Notebook
29 pages
Data Analyst Interview Q&A Guide
No ratings yet
Data Analyst Interview Q&A Guide
20 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
5 pages
Pandas Operations Guide
No ratings yet
Pandas Operations Guide
6 pages
Cheat Sheet Pandas
No ratings yet
Cheat Sheet Pandas
4 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
10 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
Pandas
No ratings yet
Pandas
36 pages
Numpy
No ratings yet
Numpy
9 pages
Python Pandas Demo PDF
100% (2)
Python Pandas Demo PDF
23 pages
Exp3 Python
No ratings yet
Exp3 Python
15 pages
Pandas for Data Science Beginners
No ratings yet
Pandas for Data Science Beginners
21 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
Python - DataScience Question - Paper
No ratings yet
Python - DataScience Question - Paper
5 pages
Data Manipulation With Pandas - Yulei's Sandbox
No ratings yet
Data Manipulation With Pandas - Yulei's Sandbox
18 pages
Pandas Cheat Sheet for Data Science
No ratings yet
Pandas Cheat Sheet for Data Science
1 page
Analystics Data Cleaning Questions Interview
No ratings yet
Analystics Data Cleaning Questions Interview
8 pages
Pandas Python For Data Science
100% (1)
Pandas Python For Data Science
1 page
Tuning PID Controller Parameters Using Tabu Search Algorithm
No ratings yet
Tuning PID Controller Parameters Using Tabu Search Algorithm
3 pages
Capgemini Previous Papers
No ratings yet
Capgemini Previous Papers
48 pages
Alcatel 4020 Premium Reflex User Guide
No ratings yet
Alcatel 4020 Premium Reflex User Guide
19 pages
XMeye NVR Security Camera Systems Manuals
No ratings yet
XMeye NVR Security Camera Systems Manuals
14 pages
Delete Doctor Manual
No ratings yet
Delete Doctor Manual
4 pages
Design and Implementation of E-Commerce Site For Online Shopping
0% (1)
Design and Implementation of E-Commerce Site For Online Shopping
23 pages
Aiep1 S3 SC
No ratings yet
Aiep1 S3 SC
3 pages
Computer Care and Lab Mangement
No ratings yet
Computer Care and Lab Mangement
41 pages
Sed 24 II Malayalam GGGG
No ratings yet
Sed 24 II Malayalam GGGG
20 pages
Biometric KYC Case Study: ASLI RI Success
No ratings yet
Biometric KYC Case Study: ASLI RI Success
4 pages
Internet of Things (Iot)
No ratings yet
Internet of Things (Iot)
35 pages
Multiplexer & Demultiplexer by Dr. Arvind Nautiyal
No ratings yet
Multiplexer & Demultiplexer by Dr. Arvind Nautiyal
23 pages
EXP-5 Rimendra RA2011033010064
No ratings yet
EXP-5 Rimendra RA2011033010064
7 pages
Seminar PPT On HAR Depth
No ratings yet
Seminar PPT On HAR Depth
37 pages
Materiels XR
No ratings yet
Materiels XR
5 pages
BlueTooth Earbuds
No ratings yet
BlueTooth Earbuds
30 pages
Rust Design Patterns
No ratings yet
Rust Design Patterns
91 pages
ISTQB - CT TAE - Sample Exam A Answers - v1.2
No ratings yet
ISTQB - CT TAE - Sample Exam A Answers - v1.2
17 pages
A Machine Learning Based Crop Yield Prediction
No ratings yet
A Machine Learning Based Crop Yield Prediction
25 pages
OOAD Principles
No ratings yet
OOAD Principles
37 pages
Log
No ratings yet
Log
3 pages
S-35390A H Series: For Automotive 105°C Operation 2-Wire Real-Time Clock
No ratings yet
S-35390A H Series: For Automotive 105°C Operation 2-Wire Real-Time Clock
51 pages
Soundcore by Anker R50i True Wireless Earbuds 10m
No ratings yet
Soundcore by Anker R50i True Wireless Earbuds 10m
1 page
Coursework Title: Control System Design and Simulation For A Chemical Process
No ratings yet
Coursework Title: Control System Design and Simulation For A Chemical Process
6 pages
Beginner's Data Science Workshop
No ratings yet
Beginner's Data Science Workshop
5 pages
IMagic Service Manual
100% (1)
IMagic Service Manual
125 pages
Ic 8855
No ratings yet
Ic 8855
36 pages
056-035 Replacing Obsolete Modules
No ratings yet
056-035 Replacing Obsolete Modules
6 pages
MLL Study Materials Maths Standard Class X 2020 21
No ratings yet
MLL Study Materials Maths Standard Class X 2020 21
81 pages
9000C User Manual
No ratings yet
9000C User Manual
5 pages