[go: up one dir, main page]

0% found this document useful (0 votes)
17 views3 pages

Applied Statistics Project Guidelines

Uploaded by

trangnths180669
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views3 pages

Applied Statistics Project Guidelines

Uploaded by

trangnths180669
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

MAS202- Applied Statistics for Business (FPTU)

Computer Project

Ngày 31 tháng 10 năm 2025

Time of presentation : Week 10: (12/11/2025 and 14/11/2025)

In this project, you are required to work in a group and present your work to the class.
Each group will need to find data (about housing, finance, health, . . . or any topic of your
choice), then use Excel to perform inferential statistics on the data to obtain useful infor-
mation.
More specifically. Each group are required to formulate your own questions and apply
the following methods in presenting your answers:

1. Summaries of data sample: computing mean, variance, quartiles, IQR, box plot,
summary table, frequency distribution,... and visualization of data via diagrams:
choose among histogram, side by side bar chart, doughnut chart,... ( SEE CHAPTERS
II, III for more details).

2. Create a confidence interval for the difference in means of two populations OR or


the difference in proportions of two populations (CHAPTER X). Based on the data
that you decide to work with:

- Understanding the meaning of a confidence interval: Why do we need to construct


confidence intervals?
- Compute the critical values
- Perform EXCEL computations to reach your confidence interval.
- If you want your confidence interval estimation to be tighter, how do you change
your confidence level?

3. Hypothesis testing for the difference in means of two populations OR for the dif-
ference in proportions of two populations. Based on the data that you decide to work
with:

- You need to clearly explain the setting of your hypothesis testing (decide if two-tail
or one-tail test)

1
Thi Minh Phuong Vu (FE FPTU)
email: phuongvtm11@[Link]/[Link]@[Link]

- Explain explicitly step by step (how many steps for performing a hypothesis test-
ing?) to reach to the final conclusion
- Study two different methods in Hypothesis Testing (critical values and P-values).

4. Hypothesis testing for the ratio of two variances (CHAPTER X). Due to your
dataset, explain:

- Among t-test, z-test, F-test, which is chosen? What is the distribution (shape) of
the test?
- Perform your calculations in 4 (or 5) steps to reach your conclusion
- Explain clearly your conclusion, what are critical values, and the rejection region?

5. ANOVA two way (see CHAPTER XI). Answer the following questions:

- Create an EXCEL table (50-100 observations) to be able to do two-way ANOVA,


create a contingency table and a visulization for this data
- Explain the two factors in your test, precise the null hypothesis and alternative
hypothesis
- Identify all needed informations in EXCEL table: Sums of squares, mean squares,
degree of freedom, test statistic, critical values, rejection region,...
- Test for each of two factors effect and interaction. Explain in details your conclu-
sion according to your data.

6. Simple linear regression (CHAPTER XIII). Choose a dataset in pairs, Using EX-
CEL to do the following requests:

– Create a scatter plot for the dataset


– Construct the sample linear regression line
– Determine the essential computations: sum of squares, coefficient of determina-
tion, residuals
– Verify 2 (out of 4) assumptions on residuals for your dataset
– Test for zero regression slope (use two different methods). Explain your final
conclusion Explain your final conclusion (by determining critical values, the value
of the test, rejection region,...)..

Note:
• For 1, 2, 3, 4, 6 each group chooses only one dataset (performing different tasks on
this dataset, so I suggest choosing a dataset of at least 2 variables). For 5, you need
to choose related samples to be able to perform the two-way ANOVA, see
more details & examples in lecture notes.

• It is required to select a recent dataset, containing data from 2024 or 2025.

• Choose a dataset of 50-150 observations

2
Thi Minh Phuong Vu (FE FPTU)
email: phuongvtm11@[Link]/[Link]@[Link]

• "Clean" your data before performing computations, delete the missing/unnecessary


data cells/colums, each variable must have an unit of measurement

• State clearly your problem, explain why you chose this dataset

• Try to do the "right things", do not say what you do not understand... If you use the
notations that I did not use, try to clearly explain their meanings.

• Encourage creativity: groups with a creative way of choosing, introducing, presenting


the topic will heavily earn extra points.

• ...

Pay attention: DURATION OF PRESENTATION: 15-25 MINUTES (< 5 MINUTES EACH


PERSON).
Total grade = group presentation grade (3 points) + personal interview (3 points)+
bonus (4 points).

You might also like