[go: up one dir, main page]

0% found this document useful (0 votes)
28 views44 pages

Data Visualization Introduction-Presentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
28 views44 pages

Data Visualization Introduction-Presentation

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 44

Data Visualization

Lab
Syllabus
Course Outcomes:
• CO : 1 Identify the different data types, visualization types to bring out
the insight.
• CO : 2 Relate the visualization towards the problem based on the
dataset to analyze and bring out valuable insight on a large dataset.
• CO : 3 Demonstrate the analysis of a large dataset using various
visualization techniques and tools.
• CO : 4 Identify the different attributes and showcasing them in plots.
Identify and create various visualizations for geospatial and table
data.
• CO : 5 Ability to create and interpret plots using R/Python.
The raw data undergoes
different stages within a
pipeline, which are:
1.Fetching/Obtaining the
Data
2.Scrubbing/Cleaning the
Data
3.Data Visualization
4.Modeling the Data
5.Interpreting the Data
6.Revision
What is Data Visualization?
• Data visualization is the graphical representation of information and
data. By using visual elements like charts, graphs, and maps, data
visualization tools provide an accessible way to see and understand
trends, outliers, and patterns in data.
• Data visualization tools and technologies are essential to analyzing
massive amounts of information and making data-driven decisions.
Categories of Data Visualization
Why is Data Visualization
Important?
1. Data Visualization Discovers the Trends in Data
2. Data Visualization Provides a Perspective on the
Data
3. Data Visualization Puts the Data into the Correct
Context
4. Data Visualization Saves Time
5. Data Visualization Tells a Data Story
Top Data Visualization Tools
The following are the 10 best Data Visualization Tools
1.Tableau
2.Looker
3.Zoho Analytics
4.Sisense
5.IBM Cognos Analytics
6.Qlik Sense
7.Domo
8.Microsoft Power BI
9.Klipfolio
10.SAP Analytics Cloud
Is Excel not a Data Visualization
Tool??
Top Data Visualization Libraries Available in
Python, R, and Javascript
Python:
• Matplotlib
• Plotly
• ggplot R:
• Seaborn • ggplot2
• Altair • Plotly
• Leaflet
• Geoplotlib
• Esquisse
• Bokeh
• Lattice
Javascript:
• D3.js
• Chart.js
• Plotly
What is a Dataset?
• A data set (or dataset) is a collection of data.
• It can be a collection of numbers or values that relate to
a particular subject/problem
• In the case of tabular data, a data set corresponds to
one or more database tables, where every column of a
table represents a particular variable, and each row
corresponds to a given record of the data set in
question.
Why do we use dataset?
• Datasets for training : To train the Models.
Eg:Machine Learning Models
• Datasets for validation: To make the model more
accurate.
Eg:To solve Overfitting problems
• Datasets for testing: Used for testing the final output of
the model to confirm its accuracy.
Dataset File Formats
• Comma-separated values(.csv)
• Excel(.xls)
• Office Open XML file format(.xlsx)
• Tab-separated values(.tsv)
• Portable Document Format(.pdf)
• ZIP archive file format(.zip)
• Plain text(.txt)
• JSON(JavaScript Object Notation.)(.json)
Power BI - Introduction
• Power Business Intelligence (BI) is a tool that allows you to connect to various data
sources, visualize the data in reports and dashboards, and then share them with
anyone you want.
• Power BI desktop app is used to create reports, while Power BI Services (Software
as a Service - SaaS) is used to publish the reports, and Power BI mobile app is used
to view the reports and dashboards.
• Power BI Desktop is available in both 32-bit and 64-bit versions.

Power BI is made up of 3 main elements:

1.Power BI Desktop - a free desktop application for building and designing reports.
2.Power BI Service - the online publishing service for viewing and sharing reports
and dashboards.
3.Power BI mobile apps - for viewing reports and dashboards on the go.
What is Power BI Used For?
Power BI is a tool in the category of Business
Intelligence (BI). The purpose of BI is to track
Key Performance Indicators (KPIs) and
uncover insights in business data so as to
better inform decision-making across the
organization.

Power BI is used in different ways depending


on the role of the individual, from
developers, analysts, managers, and
directors, to everyone in between.
To download the latest version, you can use the following link −
https://powerbi.microsoft.com/en-us/downloads/
Supported Operating Systems

• Windows 10, Windows 7, Windows 8, Windows 8.1, Windows Server


2008 R2, Windows Server 2012, Windows Server 2012 R2
• Microsoft Power BI Desktop requires Internet Explorer 9 or higher
• Microsoft Power BI Desktop is available for 32-bit (x86) and 64-bit
(x64) platforms.
• Users can select a language in which they want to install Power BI and
following files are available for download.
When you run the installation file, following screen is displayed.
When you run the installation file, following screen is displayed.

Accept the license agreement and follow the instructions on the


screen to finish the installation.
When Power BI is installed, it launches a welcome screen. This
screen is used to launch different options related to get data, enrich
the existing data models, create reports as well as publish and share
reports.
Click the Finish button to initialize the process.
Please, Wait for a few seconds to start Power BI Desktop.
Here you can see the installed Power BI Desktop Page. Let me Close
the start page.
•That’s it. Power BI Desktop is now ready for analytics or report
development! From there, you can begin creating data models or
interactive reports.

•When Power BI is installed, it generates a welcome screen.

•This screen is used to launch different options related to getting data,


enriching the existing data models, creating reports as well as
publishing and sharing reports.
When we launch the application, Power BI will start with a blank report. Let us
have a look at Components of Power BI Desktop.

•Ribbon - the top ribbon contains most of the


controls and options needed for building the
report.

•Views - this is made up of the report view, the


data view, and the model view.

•Canvas - this is the main design area where


visualizations and other elements are added.

•Page selector - for navigation to other pages


• Filters - fields can be added here to filter the data.

• Visualizations - this contains the list of available


visualizations.

• Fields - this section contains the tables and fields that


are available in the data model.
Importing and Transforming Data in
Power
Data BI Desktop
Sources and Connections
Power BI offers a plethora of supported data sources and
connections. This makes it incredibly easy to connect to the
data source of our choosing
Importing Data
We can download the sample data and import it by selecting the
Excel
A data window
preview source. will pop up where you can select the table or sheet you want
to import from the Excel file.
Tables and sheets are designated by their
respective icons.
It is generally better to import tables as they are neatly defined in Excel with
strict headers and row boundaries.
Power Query Editor

The Power Query Editor can be broken up into 4 main parts:

•Ribbon - the top ribbon contains almost all of the data transformation options
you need to shape your data. We will explore a few common transformations
below.

•Queries - this lists all the queries you have set up for this report. For complex
reports, you can organize queries into groups for better navigation and
management.

•Data view - this is the main table containing the data for the selected query as
well as a formula bar. A preview of the data is shown with only the first 1000
rows.

•Transformation steps - the right-hand pane contains each of the


transformation steps that have been applied to the selected query. This allows
you to keep track of each individual change that has been made to the data.
You can insert, delete, and move steps around as needed.
The most common transformation steps:

•Removing rows and/or columns - some Excel data can have a lot of blank
rows and/or columns inserted for readability and aesthetic purposes, but these
are not useful in Power BI and should be removed.

•Changing data types - data types such as number, date, or text should be
specified for each column. Power BI will try to automatically detect the data
type, yet it can sometimes be wrong or there can be errors so it is a good idea
to always double-check the data types.

•Combining data with merge and append - similar to join and concatenate
in SQL, these transformations allow you to combine queries from multiple
sources.

•Pivot and unpivot - these options allow you to transform your data from a
wide to a long format and vice versa. The unpivot option is particularly useful
when dealing with Excel files that have information (such as dates) running
across the columns of a table rather than as rows.

•Adding a conditional column - this is a useful transformation that allows you


to add a column based on if/then/else logic.
What is Univariate, Bivariate &
Multivariate Analysis in Data
Visualisation?
Univariate Analysis : Univariate Analysis is a type of data visualization
where we visualize only a single variable at a time.
Univariate Analysis helps us to analyse the distribution of the variable
present in the data so that we can perform further analysis.
Like Pie Chart, Donut chart ,Histogram, Bar chart
Bivariate analysis : Bivariate analysis is the simultaneous analysis of
two variables.
It explores the concept of the relationship between two variable
whether there exists an association and the strength of this association
or whether there are differences between two variables and the
significance of these differences.
The main three types we will see here are:
1. Categorical v/s Numerical
2. Numerical V/s Numerical
3. Categorical V/s Categorical data
Multivariate Analysis : It is an extension of bivariate analysis which
means it involves multiple variables at the same time to find correlation
between them.
Multivariate Analysis is a set of statistical model that examine patterns
in multidimensional data by considering at once, several data variable.
Like Scatterplot, Heatmap.

You might also like