0% found this document useful (0 votes)

13 views8 pages

Problemset 1 2022 2

The document presents a statistical analysis of math test scores and computer access across districts, revealing a mean math score of 653.34 and a significant difference in scores between computer-intensive and non-intensive districts. It also discusses the challenges in establishing a causal link between computer access and math achievement, and provides a detailed analysis of gender wage differences, concluding that there is evidence of gender discrimination in wages. Additionally, it outlines the steps for constructing confidence intervals and hypothesis testing using Stata software.

Uploaded by

Kumar Vivek

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views8 pages

Problemset 1 2022 2

Uploaded by

Kumar Vivek

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Problem set 1

Basic summary statistics for math test scores and the number of computers per student, by
district, are reported below:

sum math_scr comp_stu

Variable | Obs Mean Std. Dev. Min Max

-------------+--------------------------------------------------------
math_scr | 420 653.3426 18.7542 605.4 709.5
comp_stu | 420 .1359266 .0649558 0 .4208333

1. Using this table, report the following statistics.

a) Mean math test score (! X ) = 653.3426

2
b) Variance of math test score ( S X ) = (18.7542)2 = 351.72
S X2
c) Variance of mean math test score (! ) = 351.72 / 420 = 0.8374
d) Standard error of mean test score (! S X ) = √0.8374 = 0.9150

2. Estimate a 95% confidence interval for the mean math test score in this sample.

Confidence interval = [x̄ ± m]

Where, m = margin of error = t*(SE)
t* for 95% confidence interval = 1.984
Therefore, m = (1.984) (0.9150) = 1.81536
Finally, confidence interval = [653.3426 ± 1.81536] = [651.52, 655.15]

3. Some education analysts have speculated that access to computers can increase math
achievement. Let’s compare the average math test scores of computer-intensive districts
(defined as having a computer per student ratio above the median) with the rest of the
districts.

Computer-intensive districts

Variable | Obs Mean Std. Dev. Min Max

-------------+--------------------------------------------------------
math_scr | 210 657.81 18.91115 605.4 709.5
_____________________________________________________________________________

All other districts

Variable | Obs Mean Std. Dev. Min Max

-------------+--------------------------------------------------------
math_scr | 210 648.8752 17.53241 612.5 703.6

Assuming independent random samples, indicate the value of:

a. Mean difference in test scores between high-spending districts and low-spending

ones
= 657.81 - 648.87 = 8.93
b. Standard error of mean difference =
2 2
sA s
SE ( X A − X B ) = + B
NA NB
!

= √ (18.911152/ 210 + 17.532412/210)

= 1.77

4. Test the null hypothesis that the two means are the same (against the alternative that they
are different). State your conclusions in both statistical terms and policy terms.

HO : µA - µB = 0
H A: µ A - µ B ≠ 0
Degree of freedom = 210 - 1 = 209
t statistic = [(X̄ A - X̄ B) - 0] / SE
= 8.93 / 1.77
= 5.04
P value = 2P (T > t)
= 2 P (T > 5.05) = 0

Since P value is less than any significance level we choose, we reject the null hypothesis
in favour of the alternate hypothesis that the two means are different. It means, there is
enough evidence to conclude that access to computers can increase math achievement.

5. Can we conclude from these results that increasing the number of computers per student
is an effective way to increase math achievement? Explain.

No, it is difficult to establish a causal link between the number of computers per student and
math achievement. It is so because in the computer intensive districts, there may be other
factors like income of family, teacher student ratio, method of instruction etc. that may have
led to increase in math scores.
6. Suppose an NGO donates a large number of computers to 100 schools in India Using the
concept of the counterfactual, how would you define the effect of this program on math
test scores a year after the program was implemented?

It is not possible to observe the counterfactual in this case as we can’t go back in time to see
the impact of having computers and not having them on the math scores at the same time.
One way to mimic the counterfactual is to conduct a randomised controlled experiment.
Under this experiment, the math scores of 100 schools that received the computers (treatment
group) would be compared with the ones that didn’t receive the computers (control group).
However, it is important to make sure that the schools in the control and the treatment groups
are identical on all other parameters.
Part II

Using results from the Stata review in lab, the gender2009.dta dataset, and any additional
commands you need, create a “do-file” (call it ps1part2.do) to give you the information
necessary to answer the following questions. You need to show your work in answering
questions (1)-(3), i.e. write down the formulas you are using, and use the numbers from the
Stata output to compute the relevant confidence interval or test statistic. Submit your
Stata output with your answers.

1. Construct a 95% confidence interval for the proportion of men in the sample.
sum gender

Variable | Obs Mean Std. Dev. Min Max

-------------+--------------------------------------------------------
gender | 950 .5136842 .500076 0 1

95% confidence interval = [x̄ ± m]

Where, m = margin of error = t*(SE)
t* for 95% confidence interval = 1.984
SE = √ (0.5000762 / 950) = 0.0162
Therefore, m = (1.984) (0.0162) = 0.0321
Finally, confidence interval = [0.5136842 ± 0.0321] = [0.481, 0.546]

2. Test the null hypothesis that the average wage in the population is equal to $13/hour
(against the alternative hypothesis that it is not equal to $13/hour) using a 5%
significance level. In doing so, indicate:

a. Null hypothesis

b. Alternative hypothesis

c. Test statistic used

d. p-value of this test and interpretation of the p-value.

generate wage=salary/(hours*weeks)

. sum wage
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
wage | 950 12.41145 8.906282 .1748252 81.73029

HO : µ = 13
HA: µ ≠ 13
α = 0.05
SE = √ (8.906282 2 / 950) = 0.29
t statistic = [(12.41145- 13)] / 0.29
= -2.03
P value = 2P (T > 2.03)
= 0.02 to 0.05
Since P value < α (0.05), we reject the null hypothesis that the average wage of
the population is $13/hour.

3. What is the difference in the average wage between men and women? Is this
difference statistically significant at the 5% significance level? In doing so, indicate:

a. Null hypothesis

b. Alternative hypothesis

c. Test statistic used

d. p-value of this test and interpretation of the p-value.

. sum wage if gender==1

Variable | Obs Mean Std. dev. Min Max

-------------+---------------------------------------------------------

wage | 488 14.01115 10.12239 .5 81.73029

. sum wage if gender==0

Variable | Obs Mean Std. dev. Min Max

-------------+---------------------------------------------------------

wage | 462 10.72173 7.034033 .1748252 42.89773

Difference in average wage = 14.01115 - 10.72173 = 3.29

HO : µA - µB = 0
H A: µ A - µ B ≠ 0
Now,
2 2
sA s
SE ( X A − X B ) = + B
NA NB
%

= √ (10.122392 / 488 + 7.0340332 / 462) = 0.563

t statistic = 6.98/ 0.563 = 5.84

P value (for t = 5.58) = 0 which is less than α = 0.05.

We reject the null hypothesis in favour of the alternate hypothesis. Therefore, the
difference in the average wage between men and women is statistically significant at the 5%
significance level.

4. Based on your results from question 3, would it be correct to say that there is less
than a 5% chance that the average wages of women and men are the same in the
population?

No, the result above only says that we can reject the null hypothesis. It means that there
is a less than 5% probability that the difference being observed between the average wage of
men and women are due to any error in sampling.

5. Does the result of question 3 provide evidence to conclude that there is gender
discrimination in wages, i.e. that that women earn less simply because they are
women? Explain briefly.

There is some evidence in the data to show the prevalence of gender discrimination
when it comes to wages. However, it is difficult to establish a causal link between gender and
wages from the data. For example, the difference could also be a result of the fact that
women take maternity and childcare leave. If we were to conduct an RCT, it would have
provided more conclusive results with respect to causal relationship.
Do File

cd "\\sipaxafsc\users\kv2373\Desktop\PS 1 6501"

log using Lab1_log, replace

use "gender2009"

sum gender

generate wage=salary/(hours*weeks)

sum wage if gender==1

sum wage if gender==0

Log File

--------------------------------------------------------------------------------

name: <unnamed>

log: \\sipaxafsc\users\kv2373\Desktop\PS 1 6501\Lab1_log.smcl

log type: smcl

opened on: 6 Feb 2022, 20:35:53

. use "gender2009"

end of do-file

. do "C:\Users\kv2373\AppData\Local\Temp\126\STD3890_000000.tmp"

. sum gender

Variable | Obs Mean Std. dev. Min Max

-------------+---------------------------------------------------------

gender | 950 .5136842 .500076 0 1

.
end of do-file

. do "C:\Users\kv2373\AppData\Local\Temp\126\STD3890_000000.tmp"

. generate wage=salary/(hours*weeks)

end of do-file

. do "C:\Users\kv2373\AppData\Local\Temp\126\STD3890_000000.tmp"

. sum wage if gender==1

Variable | Obs Mean Std. dev. Min Max

-------------+---------------------------------------------------------

wage | 488 14.01115 10.12239 .5 81.73029

end of do-file

. do "C:\Users\kv2373\AppData\Local\Temp\126\STD3890_000000.tmp"

. sum wage if gender==0

Variable | Obs Mean Std. dev. Min Max

-------------+---------------------------------------------------------

wage | 462 10.72173 7.034033 .1748252 42.89773

end of do-file

BFA TCP Exam Brief 21-22 Final 080322
No ratings yet
BFA TCP Exam Brief 21-22 Final 080322
5 pages
Assignment 1
No ratings yet
Assignment 1
3 pages
Homework 2 With Suggested Answers
No ratings yet
Homework 2 With Suggested Answers
14 pages
Grade: Midterm II (Quantitative Methods I)
No ratings yet
Grade: Midterm II (Quantitative Methods I)
3 pages
Econometrics II Teaching Material
No ratings yet
Econometrics II Teaching Material
88 pages
Practice
No ratings yet
Practice
2 pages
Discussion1 Solution
No ratings yet
Discussion1 Solution
5 pages
Dummy Variable Regression Guide
No ratings yet
Dummy Variable Regression Guide
48 pages
Econ 251 PS5 Solutions
No ratings yet
Econ 251 PS5 Solutions
16 pages
Capitulo1 Exercicios
No ratings yet
Capitulo1 Exercicios
3 pages
CH 1 - Economic Data & Nature of Econometrics
No ratings yet
CH 1 - Economic Data & Nature of Econometrics
3 pages
Eco220y A17
No ratings yet
Eco220y A17
28 pages
Midterm Fall2011
No ratings yet
Midterm Fall2011
13 pages
Assignment - I - Problems
No ratings yet
Assignment - I - Problems
2 pages
Econometrics Solutions for Students
No ratings yet
Econometrics Solutions for Students
8 pages
Stats Mid-Term Exam
No ratings yet
Stats Mid-Term Exam
2 pages
Chapter 1 Dummy Variable Regression
No ratings yet
Chapter 1 Dummy Variable Regression
45 pages
Seminar Questions
No ratings yet
Seminar Questions
5 pages
ch4 Dummy
No ratings yet
ch4 Dummy
54 pages
Assignment A (Hand In)
No ratings yet
Assignment A (Hand In)
6 pages
ECMT1020 - Week 04 Workshop PDF
No ratings yet
ECMT1020 - Week 04 Workshop PDF
4 pages
5103A1
No ratings yet
5103A1
6 pages
Exercise 2 Exam1practice Sa
No ratings yet
Exercise 2 Exam1practice Sa
11 pages
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
100% (98)
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
6 pages
Exam Practice 2
No ratings yet
Exam Practice 2
6 pages
Statistics Homework Analysis
No ratings yet
Statistics Homework Analysis
8 pages
Gender Pay Gap Analysis
No ratings yet
Gender Pay Gap Analysis
43 pages
Problem Set 4 With Solutions
No ratings yet
Problem Set 4 With Solutions
4 pages
Multiple Regression with Dummy Variables
No ratings yet
Multiple Regression with Dummy Variables
50 pages
Homework
No ratings yet
Homework
2 pages
Econometrics Assignment Guide
No ratings yet
Econometrics Assignment Guide
3 pages
T - Test Ass.
No ratings yet
T - Test Ass.
4 pages
ps5 Fall+2015
No ratings yet
ps5 Fall+2015
9 pages
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
100% (72)
Introductory Econometrics A Modern Approach 5th Edition Wooldridge Solutions Manual 1
26 pages
Test of Inequality Between Male Employees and Female Employeesedited
No ratings yet
Test of Inequality Between Male Employees and Female Employeesedited
9 pages
4SSMN902 May 2022
No ratings yet
4SSMN902 May 2022
10 pages
QP
No ratings yet
QP
5 pages
Return To Education: Topic: Probability
No ratings yet
Return To Education: Topic: Probability
3 pages
Categorical Predictor S
No ratings yet
Categorical Predictor S
41 pages
Statistics Homework for Students
No ratings yet
Statistics Homework for Students
16 pages
Chapter 1 Qualitative Variables Final
No ratings yet
Chapter 1 Qualitative Variables Final
74 pages
School of Economics and Business Administration University of Navarra Academic Year: 2024/25 Econometrics I Problem Set III: Ch. 4
No ratings yet
School of Economics and Business Administration University of Navarra Academic Year: 2024/25 Econometrics I Problem Set III: Ch. 4
3 pages
Dummy Variable Regression Models (Lec 8-9) : 1 Nguyen Thu Hang, BMNV, FTU CS2
No ratings yet
Dummy Variable Regression Models (Lec 8-9) : 1 Nguyen Thu Hang, BMNV, FTU CS2
48 pages
Module 1 - QB
No ratings yet
Module 1 - QB
9 pages
ECON1203 Exam 10 S 2
0% (1)
ECON1203 Exam 10 S 2
13 pages
ECN225 Week2 PS
No ratings yet
ECN225 Week2 PS
3 pages
AE Week 3
No ratings yet
AE Week 3
3 pages
Dummy Variable Ques
No ratings yet
Dummy Variable Ques
7 pages
Y F (X, Z) : Regression Statistics
No ratings yet
Y F (X, Z) : Regression Statistics
12 pages
Econ5025 Practice Problems
43% (7)
Econ5025 Practice Problems
33 pages
Multiple Regression Analysis Problem Set
No ratings yet
Multiple Regression Analysis Problem Set
5 pages
Session6 Worksheet Student Information
No ratings yet
Session6 Worksheet Student Information
13 pages
Econ3033 Fall 2024 Professor Loretta Fung Assignment 2: Multiple Regression Analysis Due Date: October 25 (Friday) at 23:59
No ratings yet
Econ3033 Fall 2024 Professor Loretta Fung Assignment 2: Multiple Regression Analysis Due Date: October 25 (Friday) at 23:59
2 pages
PDF
No ratings yet
PDF
9 pages
SW 2e Ex ch05
No ratings yet
SW 2e Ex ch05
5 pages
ps1 Build
No ratings yet
ps1 Build
4 pages
Review Final Ex
100% (2)
Review Final Ex
20 pages
Applied Regression - HW1 - JP, Savio, Leila, Mohan
100% (1)
Applied Regression - HW1 - JP, Savio, Leila, Mohan
18 pages
Chapter 1
No ratings yet
Chapter 1
76 pages
Pami Dua - Macroeconometric Methods - Applications To The Indian Economy-Springer (2023)
100% (1)
Pami Dua - Macroeconometric Methods - Applications To The Indian Economy-Springer (2023)
394 pages
Murali Kallummal (Editor), Santosh Kumar (Editor), P L Beena (Editor) - Indian Economy and Neoliberal Globalization - Finance, Trade, Industry and Employment-Routledge India (2022)
No ratings yet
Murali Kallummal (Editor), Santosh Kumar (Editor), P L Beena (Editor) - Indian Economy and Neoliberal Globalization - Finance, Trade, Industry and Employment-Routledge India (2022)
407 pages
IMF Research Perspectives An Economy For All 1595706350
No ratings yet
IMF Research Perspectives An Economy For All 1595706350
25 pages
The Many Mutinies of Maamla Legal Hai EPW 1711861297
No ratings yet
The Many Mutinies of Maamla Legal Hai EPW 1711861297
3 pages
Simple Linear Regression: Y XI. XI X
No ratings yet
Simple Linear Regression: Y XI. XI X
25 pages
Measures of Dispersion Guide
No ratings yet
Measures of Dispersion Guide
13 pages
12th Business Mathematics Statistics Monthly Test Question Paper 2022 2023 English Medium PDF Download.
No ratings yet
12th Business Mathematics Statistics Monthly Test Question Paper 2022 2023 English Medium PDF Download.
3 pages
3 Sem Core QUANTITATIVE TECHNIQUES FOR BUSINESS I MCQ
No ratings yet
3 Sem Core QUANTITATIVE TECHNIQUES FOR BUSINESS I MCQ
19 pages
Employee Salary Influences
No ratings yet
Employee Salary Influences
8 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
35 pages
Stata Essentials for Analysts
No ratings yet
Stata Essentials for Analysts
22 pages
Statistical Moments Explained
100% (1)
Statistical Moments Explained
6 pages
MScFE 610 ECON - Video - Transcript - Lecture2 - M3 - U3
No ratings yet
MScFE 610 ECON - Video - Transcript - Lecture2 - M3 - U3
6 pages
STAT Q4 Week 2 Enhanced.v1
No ratings yet
STAT Q4 Week 2 Enhanced.v1
11 pages
Types of Errors in Hypothesis Testing
100% (1)
Types of Errors in Hypothesis Testing
18 pages
Statistical Inference-BSA&F-III-Morning
No ratings yet
Statistical Inference-BSA&F-III-Morning
3 pages
Paired-Samples T-Test
No ratings yet
Paired-Samples T-Test
4 pages
Soft Drink Demand Analysis
No ratings yet
Soft Drink Demand Analysis
4 pages
Lean-Six-Sigma-Green-Belt-Certification-Training-Manual-CSSC-2018-06b (1) (351-370)
No ratings yet
Lean-Six-Sigma-Green-Belt-Certification-Training-Manual-CSSC-2018-06b (1) (351-370)
20 pages
Forecasting: "Prediction Is Very Difficult
No ratings yet
Forecasting: "Prediction Is Very Difficult
57 pages
AP Statistics Multiple Choice Practice
No ratings yet
AP Statistics Multiple Choice Practice
10 pages
Basic Sampling Theory May 2017
No ratings yet
Basic Sampling Theory May 2017
1 page
MANOVA
No ratings yet
MANOVA
33 pages
Chapter 3
No ratings yet
Chapter 3
36 pages
Metlit 10 - Besar Sample - 20180824-Slides-1
No ratings yet
Metlit 10 - Besar Sample - 20180824-Slides-1
39 pages
Econometrics Assignment Guide
No ratings yet
Econometrics Assignment Guide
3 pages
16.03.2020 Traffic Engineering and Safety Assignment QP Set No. 1
No ratings yet
16.03.2020 Traffic Engineering and Safety Assignment QP Set No. 1
3 pages
The Reliable Change Index
No ratings yet
The Reliable Change Index
8 pages
Find The Value of and Make A Verbal Interpretation of The Following Scores. (With
No ratings yet
Find The Value of and Make A Verbal Interpretation of The Following Scores. (With
2 pages
2017 Volume 14 Number 2 2017 Volume 14 Number 2: ISSN 1739-4341 ISSN 1739-4341
No ratings yet
2017 Volume 14 Number 2 2017 Volume 14 Number 2: ISSN 1739-4341 ISSN 1739-4341
124 pages
Lecture-6-7-8-Descriptive Statistics-Dispersion
No ratings yet
Lecture-6-7-8-Descriptive Statistics-Dispersion
42 pages
Faculty of Business and Management: Assignment/ Project Declaration Form
No ratings yet
Faculty of Business and Management: Assignment/ Project Declaration Form
8 pages
PLS-SEM or CB-SEM: Updated Guidelines On Which Method To Use
No ratings yet
PLS-SEM or CB-SEM: Updated Guidelines On Which Method To Use
17 pages
Chapter 3 Test of Difference Between Means
No ratings yet
Chapter 3 Test of Difference Between Means
44 pages

Problemset 1 2022 2

Uploaded by

Problemset 1 2022 2

Uploaded by

Problem set 1

sum math_scr comp_stu

Variable | Obs Mean Std. Dev. Min Max

1. Using this table, report the following statistics.

a) Mean math test score (! X ) = 653.3426

Confidence interval = [x̄ ± m]

Variable | Obs Mean Std. Dev. Min Max

All other districts

Variable | Obs Mean Std. Dev. Min Max

Assuming independent random samples, indicate the value of:

a. Mean difference in test scores between high-spending districts and low-spending

= √ (18.911152/ 210 + 17.532412/210)

Variable | Obs Mean Std. Dev. Min Max

95% confidence interval = [x̄ ± m]

c. Test statistic used

d. p-value of this test and interpretation of the p-value.

c. Test statistic used

d. p-value of this test and interpretation of the p-value.

. sum wage if gender==1

Variable | Obs Mean Std. dev. Min Max

wage | 488 14.01115 10.12239 .5 81.73029

. sum wage if gender==0

Variable | Obs Mean Std. dev. Min Max

wage | 462 10.72173 7.034033 .1748252 42.89773

Difference in average wage = 14.01115 - 10.72173 = 3.29

= √ (10.122392 / 488 + 7.0340332 / 462) = 0.563

t statistic = 6.98/ 0.563 = 5.84

P value (for t = 5.58) = 0 which is less than α = 0.05.

log using Lab1_log, replace

sum wage if gender==1

sum wage if gender==0

log: \\sipaxafsc\users\kv2373\Desktop\PS 1 6501\Lab1_log.smcl

log type: smcl

opened on: 6 Feb 2022, 20:35:53

Variable | Obs Mean Std. dev. Min Max

gender | 950 .5136842 .500076 0 1

. sum wage if gender==1

Variable | Obs Mean Std. dev. Min Max

wage | 488 14.01115 10.12239 .5 81.73029

. sum wage if gender==0

Variable | Obs Mean Std. dev. Min Max

wage | 462 10.72173 7.034033 .1748252 42.89773

You might also like