[go: up one dir, main page]

0% found this document useful (0 votes)
13 views4 pages

Descriptive Analysis in Tableau

Uploaded by

mahesh95176
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views4 pages

Descriptive Analysis in Tableau

Uploaded by

mahesh95176
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Descriptive Analysis:

Definition: Descriptive analysis is the initial step in statistical analysis that involves
summarizing and describing the main characteristics of a dataset. It encompasses various
statistical techniques and measures to organize, present, and interpret data in a meaningful
and understandable manner. Unlike inferential statistics, which aim to draw conclusions
about a population based on a sample, descriptive analysis focuses solely on describing the
data at hand.
Descriptive analysis involves the systematic application of these statistical measures
to gain a comprehensive understanding of the dataset. For instance, in a study examining the
heights of individuals, the mean height provides an average representation, the mode
identifies the most common height range, the median offers insight into the typical height,
and the standard deviation indicates the degree of height variation within the sample.
Moreover, descriptive analysis encompasses a range of statistical techniques beyond
central tendency and variability measures, including measures of distribution shape
(skewness, kurtosis), percentiles, and correlation coefficients. These techniques offer
additional insights into the characteristics and structure of the data.
Descriptive analysis serves as a fundamental tool for researchers and analysts to
explore, summarize, and interpret data effectively. By employing various statistical measures
and techniques, descriptive analysis facilitates data-driven decision-making and enhances
understanding across diverse fields and disciplines.
Applications:
 Summarizing Data: Descriptive analysis allows researchers to condense large
and complex datasets into concise summaries. This facilitates easier interpretation
and communication of findings to stakeholders.
 Exploratory Data Analysis (EDA): It serves as a precursor to more advanced
analyses by providing insights into the underlying structure, patterns, and
relationships within the data. EDA techniques, such as histograms, box plots, and
scatterplots, are commonly employed in descriptive analysis.
 Data Visualization: Descriptive analysis often involves the creation of visual
representations, such as charts, graphs, and tables, to visually convey the
characteristics of the data. Visualizations aid in identifying trends, outliers, and
anomalies.
 Data Cleaning and Preprocessing: Before performing inferential analyses,
researchers use descriptive techniques to identify and address data quality issues,
such as missing values, outliers, and data inconsistencies. This ensures the
reliability and validity of subsequent analyses.
Key Statistical Measures:
1. Mean:
The mean, or arithmetic average, is calculated by summing up all the values in a
dataset and dividing the sum by the total number of values. It represents the central tendency
of the data. To calculate the mean, you sum up all the values and then divide by the total
number of values.
Mean = (75 + 82 + 68 + 90 + 82 + 78 + 88 + 92 + 70 + 85) / 10
= 800 / 10
= 80
Application: The mean is widely used in various fields, including economics, finance, and
science, to characterize typical or average values. However, it can be sensitive to extreme
values, known as outliers, which may skew its accuracy.
2. Mode:
The mode is the value that occurs most frequently in a dataset.
 The mode is the value that appears most frequently in the dataset.
 In this case, the mode is 82, as it appears twice, which is more than any other value.
 So, the mode exam score is 82.
Application: The mode is particularly useful for categorical or nominal data, where values
are discrete and non-numeric. It provides insight into the most common or prevalent category
within the dataset.
3. Median:
The median is the middle value in a sorted list of numbers. If the dataset has an odd
number of observations, the median is the middle value. If it has an even number, the median
is the average of the two middle values.
 To find the median, you first need to sort the values in ascending order and then
determine the middle value.
Sorted Exam Scores: 68, 70, 75, 78, 82, 82, 85, 88, 90, 92
Since there are 10 values, the median is the average of the two middle values, which are the
5th and 6th values.
Median = (82 + 82) / 2
= 164 / 2
= 82
So, the median exam score is 82.
Application: The median is a robust measure of central tendency that is less affected by
outliers compared to the mean. It is often preferred when the data distribution is skewed or
contains extreme values.
4. Standard Deviation:
The standard deviation measures the dispersion or spread of values around the mean.
It quantifies the variability within a dataset.
To calculate the standard deviation, you first find the mean, then calculate the squared
difference between each value and the mean, sum up these squared differences, divide by the
total number of values, and finally take the square root.
Mean = 80
Squared differences:
(75-80)^2 = 25
(82-80)^2 = 4
(68-80)^2 = 144
(90-80)^2 = 100
(82-80)^2 = 4
(78-80)^2 = 4
(88-80)^2 = 64
(92-80)^2 = 144
(70-80)^2 = 100
(85-80)^2 = 25
Sum of squared differences = 25 + 4 + 144 + 100 + 4 + 4 + 64 + 144 + 100 + 25 = 614
Variance = Sum of squared differences / Total number of values
= 614 / 10
= 61.4
Standard Deviation = √Variance
= √61.4
≈ 7.83
So, the standard deviation of the exam scores is approximately 7.83.
Application: Standard deviation provides insights into the consistency or variability of data
points. A low standard deviation indicates that the values are clustered closely around the
mean, while a high standard deviation suggests greater variability or dispersion.

You might also like