Analysis of Complaints Data Across Regions
MATH 1281-01 Statistical Inference - AY2025-T4
Math Assignment Unit 5
Instructor: Ankita Devdhara
May 15, 2025
a. Descriptive Statistics Explanation:
After importing the data into JASP, descriptive statistics for the variable "Complaint" split across
four regions were generated. The summary shows the mean, standard deviation, minimum, and
maximum values for each region.
1. Which of the regions has the highest average number of complaints? Indicate the
average.
Region 4 has the highest average number of complaints, with a mean of 4.200.
2. Which of the regions has the largest standard deviation of complaints? Indicate the
standard deviation.
Region 1 has the largest standard deviation, at 1.225, indicating that complaints in this region
vary more widely around the mean than in the other regions.
3. Based on the descriptive output from JASP, can we expect the average number of
complaints across the four regions to be different? Why?
Yes, the descriptive statistics suggest that the average number of complaints differs across the
four regions. The sample means range from 1.000 in Region 1 to 4.200 in Region 4, a spread that
is large relative to the standard deviations (the largest is 1.225). Descriptive statistics alone
cannot establish statistical significance, however, so an ANOVA test is needed to confirm that
these differences are larger than chance variation would produce (Diez et al., 2019). The
regional means are:
- Region 1 has an average of 1.000 complaints.
- Region 2 has an average of 1.400 complaints.
- Region 3 has an average of 3.600 complaints.
- Region 4 has an average of 4.200 complaints.
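The per-region summary that JASP reports can be reproduced by hand. The following is a minimal Python sketch using only the standard library. The group names (A, B, C) and the complaint counts are made-up illustrative values, not the assignment's dataset, and the helper name `describe` is hypothetical.

```python
from statistics import mean, stdev

# Hypothetical complaint counts per group (illustrative only, NOT the assignment's data).
complaints = {
    "A": [2, 3, 1, 2, 2],
    "B": [4, 6, 5, 3, 7],
    "C": [3, 3, 4, 2, 3],
}

def describe(groups):
    """Return n, mean, sample standard deviation, minimum, and maximum for each group."""
    return {
        name: {
            "n": len(values),
            "mean": mean(values),
            "sd": stdev(values),  # sample SD (n - 1 in the denominator)
            "min": min(values),
            "max": max(values),
        }
        for name, values in groups.items()
    }

summary = describe(complaints)
for name, s in summary.items():
    print(f"{name}: n={s['n']} mean={s['mean']:.3f} sd={s['sd']:.3f} "
          f"min={s['min']} max={s['max']}")

# Identify the group with the highest mean and the group with the largest SD.
highest_mean = max(summary, key=lambda g: summary[g]["mean"])
largest_sd = max(summary, key=lambda g: summary[g]["sd"])
print("Highest mean:", highest_mean)
print("Largest SD:", largest_sd)
```

The two `max` calls answer the same two questions posed above (highest average and largest standard deviation), here for the made-up groups only.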
b. Hypotheses for Evaluating Average Complaints Across Regions:
Null Hypothesis (H₀): The average number of complaints is the same across all four regions.
H₀: μ1 = μ2 = μ3 = μ4
Where:
μ1 = Mean number of complaints in Region 1
μ2 = Mean number of complaints in Region 2
μ3 = Mean number of complaints in Region 3
μ4 = Mean number of complaints in Region 4
Alternative Hypothesis (H₁): The average number of complaints is not the same across all four
regions (at least one region's mean differs from the others).
c. ANOVA Analysis Results:
The ANOVA was run in JASP with "Complaints" as the dependent variable and "Region" as the
fixed factor. Based on the ANOVA output:
1. F-statistic and p-value:
The F-statistic is 10.486, and the p-value is < 0.001.
2. Conclusion for the Hypothesis:
Since the p-value is less than the significance level of 0.05, we reject the null hypothesis. This
indicates a statistically significant difference in the average number of complaints among the
four regions: at least one region's mean differs from the others.
The ANOVA shows that complaint levels are not consistent across the regions, but it does not
identify which regions differ, so the variability in complaint frequency could be investigated
further.
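To illustrate where an F-statistic such as the one reported above comes from, the following is a minimal Python sketch of the one-way ANOVA calculation (mean square between groups divided by mean square within groups). The samples are made-up illustrative values, not the assignment's dataset, and the function name `one_way_anova` is hypothetical. The sketch stops at the F-statistic and its degrees of freedom. Obtaining the p-value requires the F distribution, which JASP reports directly and which a library such as SciPy (`scipy.stats.f_oneway`) could supply.

```python
from statistics import mean

def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of samples, one list per group."""
    k = len(groups)                               # number of groups
    n_total = sum(len(g) for g in groups)         # total number of observations
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Variability of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variability of observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical samples (illustrative only, NOT the assignment's data).
samples = [
    [2, 3, 1, 2, 2],
    [4, 6, 5, 3, 7],
    [3, 3, 4, 2, 3],
]

f_stat, df_between, df_within = one_way_anova(samples)
print(f"F({df_between}, {df_within}) = {f_stat:.3f}")
```

A large F-statistic means the group means differ by more than the within-group variation would suggest, which is the evidence used to reject the null hypothesis.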
Word Count: 445
Reference
Diez, D. M., Barr, C. D., & Çetinkaya-Rundel, M. (2019). OpenIntro Statistics (4th ed.).
OpenIntro. https://www.openintro.org/book/os/