A Report Submitted in Partial Fulfillment of The Requirement of The Award of Degree of
A Report Submitted in Partial Fulfillment of The Requirement of The Award of Degree of
By
KEERTHI R M
Reg. No.:23083076511012007
DEPARTMENT OF MATHEMATICS
Submitted by
KEERTHI R M
OCTOBER- 2024
ABSTRACT
Science.
with numpy and Panda, data science in real time applications, data
and continuous learning within the latest updates of Python with data science.
This report details the project undertaken, skills developed and lessons
INTRODUCTION 1
1 PYTHON 3
2 PYTHON OPERATORS 8
5 DATA VISUALIZATION
21
CONCLUSION
INTRODUCTION
the field of data science. With its simple syntax and vast array of libraries
and tools, python has made it easier for data science to manipulate and
analyze data, build predictive models and make data driven decisions. In
this report, we will explore how python is used in data science, as well as
some of the key libraries and tools that data scientists use to perform
their work.
and easy to read, making it ideal for data science who may not have
libraries and tools that are specifically designed for data analysis and
1
These libraries allow data science to easily manipulate and visualize data,
these arrays. Pandas are powerful data manipulation library that offers
data structure like data frames and series, which allow data science to
2
CHAPTER 1
PHYTHON
Guido van Rossum, and released in 1991. It is used for web development
and analysis. It introduces data structures like Data Frames that simplify
NumPy: This library offers support for arrays and matrices, along
In Python we have list, that serve the purpose of arrays, but they
50x faster than traditional Python lists. The array object in NumPy is
3
called ndarray, it provides a lot of supporting functions that make
working with nd array very easy. Arrays are very frequently used in data
efficient tools for data mining and data analysis. It supports various
reduction.
Workflow in Python
including databases, APIs and web scraping. Libraries like requests and
4
Data Cleaning and Preparation: Data often needs to be cleaned
data. This involves selecting algorithms, training the model, and tuning
hyper parameters.
PYTHON BASICS
5
CONTROL STRUCTURES
decisions based on conditions. Loops like ‘for’ and ‘while’ can be used
FUNCTIONS
task. Functions can take arguments as input and return values as output.
Python modules and files contain python code. Modules are used
FILE I/O
6
OBJECT-ORIENTED PROGRAMMING
7
CHAPTER 2
PYTHON OPERATORS
8
Python Arithmetic Operators
mathematical operations.
Addition x+y
+
x-y
Subtraction
-
* Multiplication x*y
/ Division x/y
% Modulus x*y
** Exponentiation x ** y
Floor division x // y
\\
9
Python Assignment Operators
x=5
= x=5
+= x += 3 x=x+3
-= x -= 3 x=x-3
*= x *= 3 x=x*3
/= x /= 3 x=x/3
%= x %= 3 x=x%3
10
Python Comparison Operators
== Equal x == y
!= Not equal x != y
11
Python Logical Operators
result is true
12
Python Identity Operators
Identity operators are used to compare the objects, not if they are
equal, but if they are actually the same object, with the same memory
location.
13
Python Membership Operators
object
present in object
14
Python Bitwise Operators
is 1
bits is 1
Shift left by
pushing zeros in
from the right and x << 2
<< Zero fill left shift
let the leftmost bits
off
15
Examples
16
CHAPTER 3
mathematical functions.
Pandas offer data structures like data frames and series that make
manipulation.
17
NumPy and pandas are indispensable tools for best practice
18
CHAPTER 4
maintenance).
In Real time, data sources are used for streaming data (twitter,
sensor, data), API calls (weather, stock prices) and web scraping. Data
19
cleaning and preprocessing, Feature extraction and selection and Data
Example:
Import panda as pd
import numpy as np
from sklearn.Linear_modelimport
20
CHAPTER 5
DATA VISUALIZATION
are used for web based visualizations and Pandas are used for data
BASIC PLOTS
Histograms (plt.hidt( ))
21
In real world Data Visualization is applied in business
BASIC PLOT
interactive plots.
22
INPUT
OUTPUT
23
CHAPTER 6
VARIABLES
a variable named ‘a’ with a value of 10. Variables can store different
SETS
Arrays in python are data structures that can store multiple values
of the same type. Python does not have built-in support for arrays, but the
Examples:
DATA FRAMES
NumPy and pandas which allow developers to work with tabular data in
25
PACKAGES
26
ASSIGNMENTS
27
28
29
CONCLUSION
in tackling diverse data science can easily process large data set, derive
30
31