Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 3
Name: Aditya Parade Roll No: 281047 PRN: 22311577 Batch: A-2 Assignment 3
Problem Statement:
Visualize the data using R/Python by plotting the graphs for assignment no. 1 and 2. Consider suitable
data set. Use Scatter plot, Bar plot, Box plot, Pie chart, Line Chart.
Introduction:
Data visualization plays a crucial role in data analysis by transforming raw data into a visual format,
making it easier to interpret patterns, trends, and relationships within the data. Using Python, libraries
such as Seaborn and Matplotlib allow us to create a wide variety of plots to visualize data
effectively. This documentation will explore how to visualize data using Scatter plots, Bar plots, Box
plots, Pie charts, and Line charts.
Objectives:
1. To introduce and explore basic visualization techniques in Python using Seaborn and
Matplotlib.
2. To demonstrate how to visualize data using different plot types, including Scatter plot, Bar
plot, Box plot, Pie chart, and Line chart.
3. To analyze a suitable dataset using the various plot types for better insights.
o More attractive default plots, including categorical plots and regression plots.
● Matplotlib: Matplotlib is one of the most widely used Python libraries for creating static,
animated, and interactive visualizations. It provides a low-level interface for creating basic
plots and offers extensive customization options.
Key features:
o Extensive support for various plot types such as line, scatter, bar, and pie charts.
Methodology:
For this assignment, we'll visualize data using the following plot types:
1. Scatter Plot: A scatter plot is used to display the relationship between two continuous
variables. Each point represents a data point on the x and y axes.
Example: Plotting the relationship between height and weight.
2. Bar Plot: Bar plots are used to represent categorical data with rectangular bars, where the
length of each bar is proportional to the value it represents.
Example: Comparing sales across different regions.
3. Box Plot: Box plots display the distribution of a dataset, highlighting the median, quartiles,
and any potential outliers. They are useful for understanding the spread of data.
4. Pie Chart: Pie charts are used to represent proportions of a whole. Each segment of the pie
represents a category’s contribution to the total.
Example: Displaying the percentage distribution of a survey response.
5. Line Chart: Line charts show trends over time or continuous data points. It is useful for
visualizing time series data.
Example: Visualizing stock prices over time.
Conclusion:
Data visualization is a key aspect of data analysis and communication. Using libraries like Seaborn
and Matplotlib in Python, we can easily create a variety of visualizations to make sense of complex
datasets. The Scatter plot, Bar plot, Box plot, Pie chart, and Line chart each serve a specific purpose
and provide unique insights into the data. Proper visualization not only makes it easier to interpret the
data but also aids in decision-making and sharing results with others effectively.