Mini Project Report On: Computer Science and Engineering
BACHELOR OF TECHNOLOGY
in
COMPUTER SCIENCE AND ENGINEERING
Submitted by
(MIP-C30)
Student Name : 197Y1A05G6 & 197Y1A05E1
Under the Guidance
of
Professor/Associate/Assistant Professor
Data Warehousing
Data Mining
2. REQUIREMENTS
Hardware / Software    Element       Specification / Version
Hardware               Processor     i3
                       RAM           2 GB
                       Hard Disk     250 GB
Software               OS            Windows, Linux
                       Python IDE    Jupyter Notebook
                                     Microsoft Azure
                                     Python 3
Explanation:
1) In this module we first gather the data (dataset) for our
prediction model. Data comes in all forms, most of it messy and
unstructured, and it is rarely ready to use. Datasets, large and
small, come with a variety of issues: invalid fields, missing and
additional values, and values in forms different from the one we
require. To bring the data into a workable, structured form, we
need to “clean” it. Common cleaning steps include parsing,
converting categorical values to one-hot encoding, and removing
unnecessary data.
In our case, the data has some days on which some factors were
not recorded, and the rainfall in cm was marked as T if there was
trace precipitation. Our algorithm requires numbers, so letters
cannot appear in the data. We therefore clean the data before
applying our model to it.
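The cleaning step described above can be sketched as follows. This is a minimal illustration using only the Python standard library, not the project's actual pre-processing code: the column names, the sample rows, and the use of "-" or an empty cell as the marker for an unrecorded value are all assumptions made for the example.

```python
import csv
import io

# Hypothetical sample data: column names and the "-" marker are assumptions.
RAW = """date,temp,humidity,rainfall
day1,60,75,0.46
day2,48,68,T
day3,45,-,0
day4,46,52,
"""

MISSING = {"", "-"}  # assumed markers for values that were not recorded


def clean_rows(text, numeric_cols, trace_value=0.0):
    """Convert the chosen columns to floats and drop rows with missing values."""
    cleaned = []
    for row in csv.DictReader(io.StringIO(text)):
        values = {}
        for col in numeric_cols:
            cell = row[col].strip()
            if cell == "T":
                # Trace precipitation: replace the letter with a number.
                values[col] = trace_value
            elif cell in MISSING:
                # A factor was not recorded on this day: skip the whole row.
                values = None
                break
            else:
                values[col] = float(cell)
        if values is not None:
            cleaned.append(values)
    return cleaned


rows = clean_rows(RAW, ["temp", "humidity", "rainfall"])
for r in rows:
    print(r)
```

Treating a trace reading as 0.0 and dropping incomplete rows is one simple policy. Filling missing values with a column average would be another reasonable choice.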
MINI PROJECT REPORT (2019-2023 Batch) Dept. of CSE, MLRITM 11
2) Once the data is cleaned, it can be used in this module as
input to our linear regression model. Linear regression is a
linear approach to modelling the relationship between a dependent
variable and one or more independent explanatory variables. This
is done by finding the line that best fits the scatter plot, i.e.,
the line with the least error. Substituting the independent values
into the line equation then gives a value prediction, i.e., how
much.
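As an illustration of fitting the best line and substituting a value into it, here is a minimal sketch for a single explanatory variable. The function names fit_line and predict and the sample points are hypothetical. The project may well use a library model with several input variables instead.

```python
def fit_line(xs, ys):
    """Return the least-squares slope m and intercept c for y = m*x + c."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Co-variation of x and y, and variation of x, around their means.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be equal")
    m = sxy / sxx
    c = mean_y - m * mean_x
    return m, c


def predict(m, c, x):
    """Substitute an independent value into the fitted line equation."""
    return m * x + c


# Hypothetical points scattered around the line y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.1, 4.9, 7.2, 8.8]
m, c = fit_line(xs, ys)        # m is about 1.94, c is about 1.15
print(predict(m, c, 5.0))      # prediction for a new input value
```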
Module Outcomes:
1) By the end of the first module, fully cleaned, usable data is
available for applying the prediction algorithm.
2) By the end of the second module, the actual prediction takes
place. The outcome is the amount of rainfall in inches, based on
the user's input.
3.1 Algorithms:
Linear Regression:
y = mx + c
Here m is the slope of the line and c is its intercept.
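For reference, the "least errors" criterion behind the line y = mx + c can be written out as below. The notation is an assumption introduced here and is not taken from the report: n observations (x_i, y_i), means x̄ and ȳ, and k explanatory variables in the multi-variable form.

```latex
% Sum of squared errors minimized by the best-fit line
E(m, c) = \sum_{i=1}^{n} \bigl( y_i - (m x_i + c) \bigr)^2

% Values of m and c that minimize E for one explanatory variable
m = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2},
\qquad
c = \bar{y} - m \bar{x}

% The same model with k explanatory variables
y = m_1 x_1 + m_2 x_2 + \dots + m_k x_k + c
```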
Module-1: Data gathering and pre-processing.
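import requests
from datetime import datetime
from tkinter import Tk, StringVar, Entry, Button, Text, END

KELVIN = 273.15  # offset used to convert Kelvin to degrees Celsius

# Initialize Window
root = Tk()
city_value = StringVar()

def time_format_for_location(utc_with_tz):
    # utc_with_tz is a Unix timestamp already shifted by the city's UTC offset
    local_time = datetime.utcfromtimestamp(utc_with_tz)
    return local_time.time()

def showWeather():
    api_key = "5be7fdb24b0e829e6a5ec295bc3cd52a"  # sample API key
    # Get city name from user from the input field (created below)
    city_name = city_value.get()
    # Assumption: the request URL is missing from the original listing; the
    # response fields read below match OpenWeatherMap's current-weather endpoint.
    weather_url = "https://api.openweathermap.org/data/2.5/weather"
    response = requests.get(weather_url, params={"q": city_name, "appid": api_key})
    weather_info = response.json()
    tfield.delete("1.0", END)  # clear any previous output
    if weather_info['cod'] == 200:
        feels_like_temp = int(weather_info['main']['feels_like'] - KELVIN)
        pressure = weather_info['main']['pressure']
        humidity = weather_info['main']['humidity']
        sunrise = weather_info['sys']['sunrise']
        sunset = weather_info['sys']['sunset']
        timezone = weather_info['timezone']
        cloudy = weather_info['clouds']['all']
        description = weather_info['weather'][0]['description']
        sunrise_time = time_format_for_location(sunrise + timezone)
        sunset_time = time_format_for_location(sunset + timezone)
        weather = (f"Weather of {city_name}: {description}\n"
                   f"Feels like (Celsius): {feels_like_temp}\n"
                   f"Pressure: {pressure} hPa\nHumidity: {humidity}%\n"
                   f"Cloud cover: {cloudy}%\n"
                   f"Sunrise: {sunrise_time}  Sunset: {sunset_time}")
    else:
        # The original else-branch is missing; this message is a placeholder.
        weather = f"Weather for '{city_name}' not found."
    tfield.insert(END, weather)

# Input field and button (assumed; the original widgets are not shown)
Entry(root, textvariable=city_value).pack()
Button(root, command=showWeather, text="Check Weather").pack()

# to show output
tfield = Text(root, width=46, height=10)
tfield.pack()
root.mainloop()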
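The two conversions used in the listing, shifting a UTC timestamp by the city's offset and turning Kelvin into Celsius, can be checked without a network call. The sketch below is illustrative only: the helper names and the sample values (a timestamp for 2021-01-01 01:00:00 UTC and a +05:30 offset) are hypothetical and do not come from a real API response.

```python
from datetime import datetime, time, timezone


def local_time_of_day(unix_utc, offset_seconds):
    """Shift a UTC Unix timestamp by a UTC offset and return the time of day."""
    shifted = datetime.fromtimestamp(unix_utc + offset_seconds, tz=timezone.utc)
    return shifted.time()


def kelvin_to_celsius(kelvin):
    """Convert a temperature from Kelvin to degrees Celsius."""
    return kelvin - 273.15


# Hypothetical values: 2021-01-01 01:00:00 UTC and an offset of +05:30 (19800 s).
sunrise_utc = 1609462800
offset = 19800
print(local_time_of_day(sunrise_utc, offset))   # 06:30:00
print(round(kelvin_to_celsius(300.15), 2))      # 27.0
```

Adding the offset before formatting keeps the helper independent of the time zone of the machine running the program.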
outputs