Data Science Practical 01

The document provides a series of Python programs using the Pandas library to manipulate and analyze data in DataFrames. It includes tasks such as appending data, filtering rows based on conditions, joining DataFrames, handling duplicates, and reading from a CSV file. Additionally, it demonstrates how to visualize data using line plots.

Uploaded by

jktechf


Q[1] Write a Pandas program to append a list of dictionaries or series to an existing DataFrame and display the combined data.

import pandas as pd
import numpy as np

exam_dic1 = {'name': ['jaynandan', 'shivnandan', 'dilkhush', 'jitendra', 'Abhishek', 'raushan', 'rajesh',
                      'Kartik', 'Kavita', 'Pooja'],
             'perc': [79.5, 29, 90.5, np.nan, 32, 65, 56, np.nan, 29, 89],
             'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
exam_data1 = pd.DataFrame(exam_dic1)

exam_dic2 = {'name': ['Rohan', 'amar', 'santosh', 'badal', 'ravish'],
             'perc': [89.5, 92, 90.5, 91.5, 90],
             'qualify': ['yes', 'yes', 'yes', 'yes', 'yes']}
exam_data2 = pd.DataFrame(exam_dic2)

print("Original DataFrames:")
print(exam_data1)
print("-------------------------------------")
print(exam_data2)

print("\nJoin the said two dataframes along rows:")
# axis=0 stacks the rows; axis=1 would place the frames side by side instead
result_data = pd.concat([exam_data1, exam_data2], axis=0)
print(result_data)
Output:-
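The question asks for appending a list of dictionaries, while the program above joins two DataFrames. A minimal sketch of the list-of-dictionaries case follows, using a shortened copy of the data and illustrative new rows. It relies on pd.concat because DataFrame.append was removed in pandas 2.0.

```python
import pandas as pd

# Existing DataFrame (a shortened version of exam_data1).
exam_data = pd.DataFrame({
    'name': ['jaynandan', 'shivnandan', 'dilkhush'],
    'perc': [79.5, 29.0, 90.5],
    'qualify': ['yes', 'no', 'yes'],
})

# New records as a list of dictionaries (illustrative values).
new_rows = [
    {'name': 'Rohan', 'perc': 89.5, 'qualify': 'yes'},
    {'name': 'amar', 'perc': 92.0, 'qualify': 'yes'},
]

# Convert the list to a DataFrame, then stack it under the existing rows.
# ignore_index=True renumbers the combined rows from 0.
combined = pd.concat([exam_data, pd.DataFrame(new_rows)], ignore_index=True)
print(combined)
```

Without ignore_index=True the appended rows would keep their own labels 0 and 1, so the combined index would contain repeated values.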
Q[2] Create a DataFrame from a dictionary with the column headings Name, Age, Percentage and Qualify.
import pandas as pd
import numpy as np
Data = {'name': ['jaynandan', 'shivnandan', 'dilkhush', 'jitendra', 'Abhishek', 'raushan',
'rajesh', 'Kartik', 'Kavita', 'Pooja'],
'Age':[20,30,45,35,60,80,70,60,14,32],
'perc': [79.5, 29, 90.5, np.nan, 32, 65, 56, np.nan, 29, 89],
'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
df = pd.DataFrame(Data)
print(df)
Output:

a. Write a Pandas program to select the rows from the DataFrame in the previous program where the percentage is greater than 70.
import pandas as pd
import numpy as np
Data = {'name': ['jaynandan', 'shivnandan', 'dilkhush', 'jitendra', 'Abhishek', 'raushan', 'rajesh', 'Kartik',
                 'Kavita', 'Pooja'],
        'Age': [20, 30, 45, 35, 60, 80, 70, 60, 14, 32],
        'perc': [79.5, 29, 90.5, np.nan, 32, 65, 56, np.nan, 29, 89],
        'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
df = pd.DataFrame(Data)
# Rows whose perc is NaN fail the comparison and are left out
print(df[df['perc'] > 70])

Output:-

b. Write a Pandas program to select the rows where the percentage is between 70 and 90.
import pandas as pd
import numpy as np
Data = {'name': ['jaynandan', 'shivnandan', 'dilkhush', 'jitendra', 'Abhishek', 'raushan', 'rajesh', 'Kartik',
                 'Kavita', 'Pooja'],
        'Age': [20, 30, 45, 35, 60, 80, 70, 60, 14, 32],
        'perc': [79.5, 85, 90.5, np.nan, 75, 65, 56, np.nan, 29, 89],
        'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
df = pd.DataFrame(Data)
# between() includes both endpoints (70 and 90) by default
print(df[df['perc'].between(70, 90)])

OUTPUT:-
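Whether "between 70 and 90" should include the endpoints is not stated in the exercise. The sketch below, on an illustrative Series, compares the default inclusive behaviour of between() with an explicit comparison and with a strict variant. The string form inclusive="neither" assumes pandas 1.3 or later.

```python
import numpy as np
import pandas as pd

perc = pd.Series([79.5, 85, 90.5, np.nan, 75, 65, 70, 90])

# Default: both endpoints are included, so 70 and 90 match.
inclusive_mask = perc.between(70, 90)

# Equivalent explicit comparison; NaN evaluates to False in both forms.
manual_mask = (perc >= 70) & (perc <= 90)

# Strictly between 70 and 90.
strict_mask = perc.between(70, 90, inclusive="neither")

print(perc[inclusive_mask].tolist())
print(perc[strict_mask].tolist())
```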

Q[3] Write a Pandas program to join the two given DataFrames using the column headers, i.e., Name, Age, Department, and Percentage.
a. Along rows and assign all data.
import pandas as pd
import numpy as np
exam_dic1 = {'name': ['jaynandan', 'shivnandan', 'jitendra', 'dilkhush', 'roshan', 'raju',
'Mohan', 'Kartik', 'Kavita', 'Pooja'],
'perc': [79.5, 29, 90.5, np.nan, 32, 65, 56, np.nan, 29, 89],
'Age':[20,30,45,35,60,80,70,60,14,32],
'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
exam_data1 = pd.DataFrame(exam_dic1)
exam_dic2 = {'name': ['Parveen', 'yaduvanshi', 'Ashaz', 'yadavjee', 'Ahir'],
'Age':[20,30,45,35,60],
'perc': [89.5, 92, 90.5, 91.5, 90],
'qualify': ['yes', 'yes', 'yes', 'yes', 'yes']}
exam_data2 = pd.DataFrame(exam_dic2)
print("Original DataFrames:")
print(exam_data1)
print("-------------------------------------")
print(exam_data2)
print("\nJoin the said two dataframes along rows:")
result_data = pd.concat([exam_data1, exam_data2])
print(result_data)
OUTPUT:-

b. Along columns and assign all data.

import pandas as pd
import numpy as np
exam_dic1 = {'name': ['Aman', 'Kamal', 'Amjad', 'Rohan', 'Amit', 'Sumit', 'Matthew', 'Kartik',
'Kavita', 'Pooja'],
'Age':[20,30,45,35,60,80,70,60,14,32],
'perc': [79.5, 29, 90.5, np.nan, 32, 65, 56, np.nan, 29, 89],
'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes']}
exam_data1 = pd.DataFrame(exam_dic1)
exam_dic2 = {'name': ['Parveen', 'Ahil', 'Ashaz', 'Shifin', 'Hanash'],
'Age':[20,30,45,35,60],
'perc': [89.5, 92, 90.5, 91.5, 90],
'qualify': ['yes', 'yes', 'yes', 'yes', 'yes']}
exam_data2 = pd.DataFrame(exam_dic2)
print("Original DataFrames:")
print(exam_data1)
print("-------------------------------------")
print(exam_data2)
print("\nJoin the said two dataframes along columns:")
result_data = pd.concat([exam_data1, exam_data2],axis=1)
print(result_data)
OUTPUTS:-
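When the two frames have different lengths, as they do here, joining along columns aligns rows by index label and fills the gaps with NaN. A small sketch with shortened, illustrative data shows this, plus the optional keys= argument for telling the repeated column names apart.

```python
import pandas as pd

left = pd.DataFrame({'name': ['Aman', 'Kamal', 'Amjad'], 'Age': [20, 30, 45]})
right = pd.DataFrame({'name': ['Parveen', 'Ahil'], 'Age': [20, 30]})

# axis=1 places the frames side by side, matching rows by index label.
side_by_side = pd.concat([left, right], axis=1)
print(side_by_side)

# Row 2 exists only in `left`, so the columns from `right` are NaN there.
missing_in_row_2 = side_by_side.iloc[2].isna().tolist()
print(missing_in_row_2)

# keys= adds an outer column level so the repeated names stay distinguishable.
labelled = pd.concat([left, right], axis=1, keys=['first', 'second'])
print(labelled.columns.tolist())
```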

c. Sorting the concatenated dataset by age
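No program for this part appears in the source. A minimal sketch, assuming the frames are first joined along rows and then ordered with sort_values on the Age column (the data is shortened and partly illustrative):

```python
import pandas as pd

exam_data1 = pd.DataFrame({'name': ['jaynandan', 'shivnandan', 'jitendra'],
                           'Age': [20, 30, 45]})
exam_data2 = pd.DataFrame({'name': ['Parveen', 'yaduvanshi'],
                           'Age': [25, 18]})

# Stack the two frames along rows, then order the result by Age.
result_data = pd.concat([exam_data1, exam_data2], ignore_index=True)
sorted_data = result_data.sort_values(by='Age')
print(sorted_data)

# ascending=False reverses the order (oldest first).
oldest_first = result_data.sort_values(by='Age', ascending=False)
print(oldest_first)
```

sort_values returns a new DataFrame and leaves result_data unchanged.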

d. Filter out rows based on different criteria such as duplicate rows.
import pandas as pd
data={'Name':['Aman','Rohit','Deepika','Aman','Deepika','Sohit','Geeta'],
'Sales':[8500,4500,9200,8500,9200,9600,8400]}
sales=pd.DataFrame(data)
duplicated = sales[sales.duplicated(keep=False)]
print("duplicate Row:\n",duplicated)
OUTPUT :-
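The program above lists the duplicated rows. To filter them out instead, drop_duplicates can be applied to the same data, as in this short sketch:

```python
import pandas as pd

data = {'Name': ['Aman', 'Rohit', 'Deepika', 'Aman', 'Deepika', 'Sohit', 'Geeta'],
        'Sales': [8500, 4500, 9200, 8500, 9200, 9600, 8400]}
sales = pd.DataFrame(data)

# Keep the first occurrence of each repeated row and drop the later copies.
unique_sales = sales.drop_duplicates(keep='first')
print(unique_sales)

# keep=False drops every row that has a duplicate anywhere in the frame.
never_repeated = sales.drop_duplicates(keep=False)
print(never_repeated)
```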

Q[4] Write a program to create a DataFrame from the ‘Student_result.csv’ file using Pandas and perform the following operations.
import pandas as pd
import csv
df = pd.read_csv("student_result.csv")
print(df)

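A. To display the row labels, column labels, data types of each column and the dimensions.
import pandas as pd
# Reading the data
df = pd.read_csv("student_result.csv")
# Display the row labels
print(df.index)
# Display the column labels
print(df.columns)
# Display the data type of each column
print(df.dtypes)
# Display the number of dimensions
print(df.ndim)
# Summary of columns, non-null counts and types (info() prints directly)
df.info()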

B. To display the shape (number of rows and columns) of the CSV file.
import pandas as pd
import csv
#Reading the Data
df = pd.read_csv("student_result.csv")
# Display no of rows and column
print(df.shape)
OUTPUT :-
C. To display Admission_No, Gender and Percentage from ‘student_result.csv’ file.

import pandas as pd
import csv
#To display Adm_No, Gender and Percentage from ‘student_result.csv’ file.
df = pd.read_csv("student_result.csv",usecols = ["id",'gender', 'sciences.grade'])

print("To display Adm_No, Gender and Percentage from ‘student_result.csv’ file.")

print(df)
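The names passed to usecols must match the header row of the file exactly. Since the file itself is not shown, the sketch below uses an in-memory stand-in whose headers (ADM_NO, NAME, GENDER, PERCENTAGE) are assumptions made for illustration only.

```python
import io
import pandas as pd

# In-memory stand-in for student_result.csv; these headers are assumptions.
csv_text = """ADM_NO,NAME,GENDER,PERCENTAGE
101,Asha,F,82.5
102,Ravi,M,35.0
103,Meena,F,67.0
"""

# usecols= loads only the listed columns.
subset = pd.read_csv(io.StringIO(csv_text), usecols=['ADM_NO', 'GENDER', 'PERCENTAGE'])
print(subset)

# Alternative: read everything, then select the columns.
full = pd.read_csv(io.StringIO(csv_text))
selected = full[['ADM_NO', 'GENDER', 'PERCENTAGE']]
print(selected)
```

Both approaches give the same three columns; usecols avoids loading the unused ones in the first place.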

D. To display the first 5 and last 5 records from ‘student_result.csv’ file

import pandas as pd
import csv
#To display first 5 and last 5 records from ‘student_result.csv’ file.
df1 = pd.read_csv("student_result.csv")
print(df1.head())
print(df1.tail())
E. To replace the Percentage of students scoring below 40 with NaN in the DataFrame.
import pandas as pd
import numpy as np
df2 = pd.read_csv("student_result.csv")
print(df2)
# Replace the Percentage of students below 40 with NaN.
print("To modify the Percentage of student below 40 with NaN value.")
df2.loc[df2['PERCENTAGE'] < 40, 'PERCENTAGE'] = np.nan
print(df2)
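The same replacement can also be written with Series.mask, which sets a value to NaN wherever a condition is True. This is a sketch on a small stand-in DataFrame; the ADM_NO column and the sample values are assumptions.

```python
import numpy as np
import pandas as pd

# Stand-in for student_result.csv; the column names here are assumptions.
df = pd.DataFrame({'ADM_NO': [101, 102, 103, 104],
                   'PERCENTAGE': [82.0, 35.5, 40.0, 12.0]})

# mask() replaces values with NaN wherever the condition is True.
df['PERCENTAGE'] = df['PERCENTAGE'].mask(df['PERCENTAGE'] < 40)
print(df)
```

A score of exactly 40 is kept, because the condition is a strict "less than".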

Q[5] Write a program to find the sum of each column, and find the column with the lowest mean. (Consider a student result file with marks in different subjects.)
import pandas as pd
Pass_Perc ={'Phy': {'2017':95.4,'2018':96.4,'2019':99.2,'2020':97.4},
'Che': {'2017':96.5,'2018':97.4,'2019':100,'2020':99.2},
'Maths': {'2017':90.2,'2018':92.6,'2019':97.4,'2020':98.0},
'Eng': {'2017':99.2,'2018':100,'2019':100,'2020':100},
'IP': {'2017':95.6,'2018':100,'2019':100,'2020':100}}
df=pd.DataFrame(Pass_Perc)
print(df)
print()
print('Column wise sum in dataframe is:')
print(df.sum(axis=0))
# Print mean value of each column
print()
print('Column wise mean value are:')
print(df.mean(axis=0).round(1))
# Returns column with minimum mean value
print()
print('Column with minimum mean value is:')
print(df.mean(axis=0).idxmin())

OUTPUTS :-

Q[6] Read Total marks of all students and show a line plot. The generated line plot must include the following style properties:
import pandas as pd
import matplotlib.pyplot as plt
df = pd.read_csv("D:\\Python\\Articles\\matplotlib\\sales_data.csv")
profitList = df['total_profit'].tolist()
monthList = df['month_number'].tolist()
plt.plot(monthList, profitList, label='Month-wise Profit data of last year')
plt.xlabel('Month number')
plt.ylabel('Profit in dollar')
plt.xticks(monthList)
plt.title('Company profit per month')
plt.yticks([100000, 200000, 300000, 400000, 500000])
# legend() is needed for the label= text to appear on the plot
plt.legend()
plt.show()
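The program above plots sales data, and the list of required style properties is missing from the source. The sketch below therefore uses made-up total marks and an assumed set of style properties (dashed red line, circular markers, line width 2) purely to show where such properties go in a plot call.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Illustrative data; the real file and its column names are not given in the source.
df = pd.DataFrame({'name': ['Asha', 'Ravi', 'Meena', 'Kabir'],
                   'total_marks': [412, 365, 448, 390]})

fig, ax = plt.subplots()

# Assumed style properties: dashed red line, circular markers, width 2.
(line,) = ax.plot(df['name'], df['total_marks'], color='red', linestyle='--',
                  marker='o', linewidth=2, label='Total marks')

ax.set_xlabel('Student')
ax.set_ylabel('Total marks')
ax.set_title('Total marks of all students')
ax.legend()  # needed for the label= text to appear
```

Calling plt.show() afterwards displays the figure in an interactive session.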
