Chapter 2 Notes
Explore data from a dataset
Frequency distribution
Crosstabs
Visual displays
Histograms
Pie Charts
Bar Charts
Complete the following problems from Chapter 2: 10,18,28,43
Create a frequency table; create visual displays: bar chart, pie chart, histogram, scatterplot, side-by-side chart, stacked bar chart; create a crosstab
the frequency,
relative frequency,
percent frequency
Calculate for a given dataset:
cumulative frequency
cumulative relative frequency
cumulative percent frequency.
Summarize categorical data using: a bar chart and a pie chart.
Summarize quantitative data using: a histogram.
Create a crosstab for two variables.
Summarize two variables using a scatter diagram and a side-by-side chart.
diagrama de esparcimiento
side-by-side chart.
We visualize data using graphs or tables to help us make sense of data.
In this chapter, we learn how to summarize and visualize data for both categorical and quantitative data
Categorical data – use labels or names to identify categories
Quantitative data – numerical values that indicate how much or how many
• Frequency distribution – summary of data showing the number (frequency) of observations in each category/class.
• Frequency – the number/count of observations in a given category/class
• Relative frequency – it equals the proportion of observations that belong to a given class
• Percent frequency – the relative frequency X 100
• Cumulative frequency – the sum of the frequencies
• Cumulative relative frequency – the sum of the relative frequencies
• Cumulative percent frequency – the sum of the percent frequencies
Create a frequency
distribution for the soft
drink dataset .
100
Drink Frequency R Freq. Per. Freq Cum, Fre Cum. Rel Fr Cum % Fre Drink Frequency R Freq. Per. Freq Cum, Fre Cum. Rel Fr Cum % Fre
Coca Cola 19 0.38 38 19 0.38 38 Coca Cola 19 0.38 38 19 0.38 38
Diet Coke 8 0.16 16 27 0.54 54 Pepsi 13 0.26 26 32 0.64 64
Dr. Pepper 5 0.1 10 32 0.64 64 Diet Coke 8 0.16 16 40 0.8 80
Pepsi 13 0.26 26 45 0.9 90 Dr. Pepper 5 0.1 10 45 0.9 90
Sprite 5 0.1 10 50 1 100 Sprite 5 0.1 10 50 1 100
50 1 100 50 1 100
Most prefered
Bar chart – graphical display for showing categorical data summarized in a frequency, relative frequency, or percent frequency distribution
Pie chart – provides another graphical display for categorical data for displaying relative frequency and percent frequency distributions.
Soft Drinks Soft Drinks Order
20
18
16 19
14
12
10 13
8
6 8
4 5 5
2
0
Coca Cola Diet Coke Dr. Pepper Pepsi Sprite Coca Cola Pepsi Diet Coke Dr. Pepper Sprite
Sofft Drinks Soft Drinks
5 5
19
10%
10%
38%
8
16%
13
26%
Coca Cola Pepsi Diet Coke Dr. Pepper Sprite Coca Cola Pepsi Diet Coke Dr. Pepper Sprite
Similar to when we had categorical data, we need to create categories, or classes, which we can use to assign our data observations.
Classes – nonoverlapping categories used to create frequency distributions
Three steps to creating classes: Determine the # of classes
Determine the width of each class
Audit Data Clases Determine the class limits
1 Three steps to creating classes:
2 Determine the # of classes (5-20) 5
3 Determine the width of each class = (Largest value – Smallest value) / Number of classes = (33 – 12) / 5 = 4.2 ~ 5
Determine the class limits (nonoverlapping)
'=(33-12)/5 4.2 = 5