[go: up one dir, main page]

Academia.eduAcademia.edu
NBER WORKING PAPER SERIES THE EFFECT OF EDUCATION ON MORTALITY AND HEALTH: EVIDENCE FROM A SCHOOLING EXPANSION IN ROMANIA Ofer Malamud Andreea Mitrut Cristian Pop-Eleches Working Paper 24341 http://www.nber.org/papers/w24341 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts Avenue Cambridge, MA 02138 February 2018 We would especially like to thank Andreea Balan-Cohen for her work on the schooling reform in Romania for a different project that is still in progress. Andreea Mitrut gratefully acknowledge support from Jan Wallanders and Tom Hedelius Fond. We have benefited from comments by participants at the ERMAS 2017 and the CHERP conference at the Federal Reserve Bank of Chicago. All errors are our own. The views expressed herein are those of the authors and do not necessarily reflect the views of the National Bureau of Economic Research. NBER working papers are circulated for discussion and comment purposes. They have not been peer-reviewed or been subject to the review by the NBER Board of Directors that accompanies official NBER publications. © 2018 by Ofer Malamud, Andreea Mitrut, and Cristian Pop-Eleches. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source. The Effect of Education on Mortality and Health: Evidence from a Schooling Expansion in Romania Ofer Malamud, Andreea Mitrut, and Cristian Pop-Eleches NBER Working Paper No. 24341 February 2018 JEL No. I1,I12,I15,I25,I26 ABSTRACT This paper examines a schooling expansion in Romania which increased educational attainment for successive cohorts born between 1945 and 1950. We use a regression discontinuity design at the day level based on school entry cutoff dates to estimate impacts on mortality with 1994-2016 Vital Statistics data and self-reported health with 2011 Census data. We find that the schooling reform led to significant increases in years of schooling and changes in labor market outcomes but did not affect mortality or self-reported health. These estimates provide new evidence for the causal relationship between education and mortality outside of high-income countries and at lower margins of educational attainment. Ofer Malamud School of Education and Social Policy Northwestern University Annenberg Hall 2120 Campus Drive Evanston, IL 60208 and NBER ofer.malamud@northwestern.edu Andreea Mitrut University of Gothenburg Department of Economics Box 640 405 30 Göteborg, Sweden andreea.mitrut@economics.gu.se Cristian Pop-Eleches The School of International and Public Affairs Columbia University 1401A International Affairs Building, MC 3308 420 West 118th Street New York, NY 10027 and NBER cp2124@columbia.edu 1. Introduction There is substantial evidence showing that more educated people have better health and longer life expectancies. However, whether this correlation reflects a causal relationship remains an open question. A number of recent papers have used changes in compulsory schooling requirements to identify the causal impact of schooling on health and mortality in the United States (Lleras-Muney, 2005; Mazumder, 2008), the United Kingdom (Oreopoulos, 2006; Clark and Royer, 2013), Denmark (Arendt, 2008), France (Albouy and Lequien, 2009), the Netherlands (van Kippersluis, et al., 2011), and Sweden (Meghir, et al., forthcoming). While this empirical approach can be compelling, the findings have been mixed and sometimes contradictory, even when based on the same educational expansions. Moreover, all of these studies are focused on the United States or Western Europe where compulsory schooling laws usually affect students enrolled in secondary school. As a result, we know relatively little about the causal effect of education on health and mortality in developing or middle-income countries, and at lower margins of educational attainment. This paper examines the impact of a schooling expansion in Romania during the late 1950s and early 1960s, which sought to provide all students with at least 7 years of compulsory education. We show that successive cohorts of individuals, born between 1945 and 1950, who were affected by this schooling expansion, experienced rising educational attainment. Then we use a regression discontinuity design at the day level to compare individuals born just before the school entry cutoff of January 1 to those born just after, who were almost identical in age but began school later and therefore had greater opportunities to extend their education. Since students born immediately before and after January 1 were also the oldest and youngest in their respective classes, 2 we also draw on cohorts born after the schooling expansion had concluded to separate the effect of increased education from that of relative age and starting school younger. 1 We demonstrate that the schooling expansion led to significant increases in years of schooling for the affected cohorts born between 1945 and 1950. This increase in educational attainment was accompanied by significant increases in labor force participation and decreases in fertility for women. Nevertheless, using detailed information on deaths from Vital Statistics data between 1994 and 2016, we do not find evidence that the schooling expansion reduced the mortality of affected cohorts up to the age of 71. Nor are there reductions in mortality from more specific causes of death. In addition, there are no significant effects for measures of self-reported health using data from the 2011 Romanian Census. Our findings indicate that more education does not help individuals avoid or postpone deaths during middle and old age. This is consistent with the null results in the most recent papers by Clark and Royer (2013) and Meghir et al. (forthcoming) for the United Kingdom and Sweden. However, to the best of our knowledge, this is the first paper to provide compelling estimates for the causal relationship between education and mortality outside of high-income countries and at lower margins of educational attainment. We do not interpret these estimates as an argument against further educational expansions in the developing world. But they do suggest the need to be more circumspect about the potential for such expansions to improve health and increase life expectancy, at least at lower margins of educational attainment. 1 See Cascio and Schanzenbach (2016) for evidence on the impacts of relative age in Tennessee and Black, Devereux, and Salvanes (2011) for evidence on the effect of starting school younger in Norway. 3 The paper is organized as follows. Section 2 reviews the related literature. Section 3 provides a background of the Romanian educational system and the educational expansion. Section 4 describes the data and the empirical strategy. Section 5 presents the results, alternative explanations and potential mechanisms, while Section 6 concludes. 2. Related Literature This section reviews some of the previous literature estimating the causal impact of education on health and mortality. We begin with a more detailed discussion of the papers that take advantage of changes in compulsory schooling requirements. Then we describe some of the alternative empirical approaches used for identifying the causal effect of education at higher margins of educational attainment. For more detailed reviews of these and other studies, see Mazumder (2012) and Galama et al. (2018). For the United States, Lleras-Muney (2005) uses Census data to examine the impact of changes in compulsory schooling laws between 1915 and 1939 that affected students over 14 years of age. Her instrumental variables (IV) estimates indicate that an additional year of schooling leads to significant declines in the probability of dying in the next 10 years. In a follow-up study, Mazumder (2008) notes that these results are not robust to including state-specific trends but presents evidence from the Survey of Income and Program Participation (SIPP) showing positive impacts of education on selfreported health status. Finally, Black et al. (2016) argue that virtually all of the variation in mortality rates is captured by cohort effects and state effects, making it difficult to 4 reliably estimate the effects of changing educational attainment due to state-level changes in compulsory schooling. 2 For the United Kingdom, Clark and Royer (2013) use changes to British compulsory schooling laws in 1947 and 1972 that increased the minimum school leaving age from 14 to 15 and then from 15 to 16. Their regression discontinuity (RD) design does not provide strong evidence for an impact of education on mortality or other health outcomes. Davies et al. (2016) re-examine the 1972 change in compulsory schooling using UK Biobank data and find a statistically significant decline in mortality but their results are somewhat sensitive to functional form. Other studies are focused on European countries: For Sweden, Meghir et al. (forthcoming) do not find improvements in mortality and other health measures for affected cohorts following an educational reform in Sweden that raised the number of years of compulsory schooling from 7/8 to 9, eliminated early selection based on academic ability, and introduced a national curriculum. Arendt (2005) and Albouy and Lequien (2009) also find no statistically significant impact of compulsory school reforms on health outcomes in Denmark and France, respectively. Yet van Kippersluis et al. (2011) do find that increasing compulsory school beyond grade 6 in the Netherlands leads to significant reduction in mortality in old age. Finally, a different set of studies use draft avoidance behavior in the United States during the Vietnam War to estimate the impact of college education on mortality and health outcomes. Buckles et al. (2016) show that the increased college going among men in cohorts associated with greater draft avoidance also leads to lower mortality in 2 In a paper that considers the effect of school quality on health, Aaronson et al. (2017) find that childhood exposure to Rosenwald schools in the Jim Crow south increased life expectancy, after accounting for the negative effects of migration. 5 subsequent years. Grimard and Parent (2007) and de Walque (2007) use a similar identification strategy to estimate impacts on smoking behavior and find evidence suggesting that more education reduces the take-up of smoking and current smoking. However, the estimates in these papers are based on changes at the margin of college education, which may differ from changes due to compulsory schooling laws. In our own review of the literature, and in those by Mazumder (2012) and Galama et al. (2018), we have not found any papers that provide compelling causal estimates for the impact of education on health or mortality in low and middle-income countries. 3. Background on Education in Romania During the post-war period, the structure and the organization of education in Romania was largely based on the model in the Soviet Union as codified by Decree No. 175 of 1948 (Braham, 1972). 3 There were several different types of schools. First, there were 4-year primary schools that offered grades 1 through 4 and were often located in rural areas. Second, there were 7-year general schools, called gymnasiums, which offered grades 1 through 7 (and later expanded to grade 8), with the first four years covering similar material as in the 4- year primary schools. Third, there were 11-year schools, which offered grades 1 through 11 in one school that provided both primary and secondary education. After a successful campaign to provide basic literacy education targeted towards all ages in the late 1940s and early 1950s, the government focused its attention on increasing enrollment beyond the first four grades. According to Giurescu et al. (1971, p. 3 This section relies heavily on information provided in Barham (1963, 1972). 6 351), the five year plan of 1955-1960 specified that the extension of compulsory schooling to 7 years was to be given special attention by the party and government. Thus, the directives of the Communist Party’s Second Congress of 1955 which outlined the second five year plan, envisioned a “situation under which, by 1960-1961, the fifth grade would enroll 90 percent of the 4-year school graduates, and under which, according to the Third Five Year plan, the 7-year school would be universal and compulsory. At first only the first four grades were made compulsory, but villages and rural communities having 7-year schools were required by virtue of Decision No. 1035/1958 to make the 7 year schooling period universal beginning with the 19581959 academic year” (Braham, 1963). Nevertheless, this process was not immediate and was constrained by a lack of enough schools offering 7 years of compulsory schooling: “Since this governmental action applied only to places where 7-year schools already existed, it appears that the extension of free compulsory education is to a large extent only nominal. Furthermore, with rural communities retaining the 4-year compulsory level, the lack of detailed planning to elevate their schools to the 7-year compulsory level has left an irregular pattern of schooling in the provinces” (Braham, 1963). Filipescu and Oprea (1972) also confirm the gradual process of expanding education at the gymnasium level. They explain that the expansion of 7-year compulsory education began in 1956 within towns and larger villages that already had schools beyond the 4th grade, and that it gradually expanded until it was close to universal by 1961-1962. We can document some of these changes using aggregate data on enrollment from the Annual Statistics of the Socialist Republic of Romania. Figure 1 shows the large increase in the number of students graduating from gymnasium between 1955 and 1965. During this period, graduation from gymnasiums increased sharply from 116,698 7 in 1959 to 329,739 in 1963 and stayed at similar levels through the late 1960s and early 1970s. Further evidence for these dramatic changes can be observed at the cohort level. By law, students entered grade 1 in September of the year following the calendar year in which they reached 6 years of age. Thus, the cohort born in 1945 was 6 years of age in 1951, entered first grade in the fall of 1952, entered fifth grade in the fall of 1956 and would have graduated with 7 years of schooling in the spring of 1959. This cohort should be the first cohort that could have been affected by the policy reform. Similarly, the cohort born in 1947 was the first cohort to have potentially benefited from the 1958 Government Decision that made 7-year of schooling compulsory. Finally, the cohort that entered fifth grade in 1961-1962, which according to Filipescu and Oprea (1972) is the first cohort to have achieved universal 7 year compulsory education, was born in 1950. Figure 2 shows the highest educational attainment by year of birth for cohorts of individuals in the Romanian Census of 1992. There is a sharp decline in the proportion of individuals with primary education between cohorts born in 1944 and 1950. At the same time, we observe a sharp increase in the proportion of individuals who complete secondary education (which includes graduates of gymnasiums). Note that cohorts born between 1935 and 1944 also experienced large increases in educational attainment. This is mainly driven by the early literacy and education campaigns introduced after the Communist government came to power. In Figure 3 we plot the “residual” percent of individuals born between 1943 and 1955 who completed primary education by their month of birth, after accounting for calendar month of birth effects. A number of interesting patterns emerge from this graph. First, and consistent with the results in Figure 2, we observe the large decrease in the proportion of students who have only primary education for those born between 8 1945 and 1950. Secondly, and more importantly for our empirical strategy, the decreases in percent of students with only primary education occur discontinuously, with disproportionately large decreases for those born after January 1st in this period. The discontinuities are especially visible for those born around January 1st of 1945, 1947, 1948 and 1949 and to a smaller extent for those born around January 1st of 1946 and 1950. At the same time, no similar discontinuities are visible for the control cohorts born between 1950 and 1953. The patterns in Figure 3 suggest that we can use detailed information on date of birth to estimate the impact of these educational expansions using a regression discontinuity design. To summarize, the evidence on graduation rates from gymnasiums in the aggregate data coincides with the cohort analysis of educational attainment in the 1992 Census; and both are broadly consistent with the historical record of educational reforms in Romania. Together, they indicate that education levels past the first 4 years of primary schooling started to expand in the 1956-1957 school-year and by 19611962, enrollment in the 5th grade was essentially universal. In other words, the expansion affected cohorts born starting in 1945 and universal gymnasium education was essentially completed for cohorts born in or after 1950. 4. Data and Empirical Strategy 4.1 Data Our main sample consists of individuals born in Romania between 1944 and 1952. Those born from 1945-1949 were enrolled in the affected grades during the period of schooling expansion while those born from 1950-1952 were enrolled after the 9 expansions had already been completed. 4 We put together information on these cohorts from several different datasets. We use the 1992 Romanian Census, when individuals were 40 to 48 years of age, to estimate the impact of the schooling reform on educational attainment; certain labor market outcomes, and conduct specification checks of our empirical strategy. 5 Two features make this dataset especially useful for our analysis: First, with 35,000 to 45,000 observations in each yearly birth cohort, we have sufficient power to employ a regression discontinuity design. Second, there is detailed information about the day, month, and year of birth so we can identify the discontinuity induced by the policy within a narrow window. The 1992 Census provides detailed information about the highest level of educational attainment for each respondent according to the following categories: none, primary, gymnasium, secondary education, post-secondary, and university education. For simplicity, we impute years of schooling by assigning the number of years associated with each level of education. 6 This serves as our main summary measure of education when estimating the impact of the schooling expansion. The Census also has information on socio-economic characteristics of our respondents, such as gender, ethnicity, and region of birth. We use these variables to validate our research design. Finally, it contains information on labor force participation and occupational status (for 4 We use the three subsequent cohorts born immediately after the end of the schooling expansion as our preferred comparison group both because they are most similar in age to the cohorts affected by the schooling expansion and offer sufficiently large samples. However, our results are essentially unchanged when we use four, five, or six subsequent cohorts as our comparison group. 5 This is a 15% random sample taken from the full Romanian Census by the Population Activities Unit (PAU) of the United Nations Economic Commission for Europe (UNECE). 6 We also use data collected by the Romanian National Statistics Institute in 1995 and 1996 with reports of actual years of schooling (rather than educational attainment) in order to validate our imputed measure of years of schooling. These data come from surveys based on the 1994 World Bank’s Living Standards Measurement Studies (LSMS) for Romania. 10 those employed) as well as the fertility of women, which serve as useful auxiliary outcomes. Panel A of Table 1 presents summary statistics for the individuals in cohorts born between 1944 and 1952. The average age at the time of the 1992 census is 42.2 years and the fraction of female respondents is almost exactly half. Almost 90 percent of the sample is ethnic Romanian, with about 7 percent ethnic Hungarians, and about 1.5 percent are Roma. The average imputed years of schooling in our sample is 9.58 years. We use the 1994-2016 Vital Statistics Mortality files (VSM) to estimate the impact of the schooling expansion on mortality. These data cover the universe of deceased persons in Romania with detailed information about their socio-economic characteristics, including the day of birth/death and the main cause of death. Thus, we can observe mortality for the cohorts used in our analysis between the ages of 42 and 71 by day and year of birth. 7 We compute mortality for the cohorts born between 1944 and 1952 by dividing the total deaths of these cohorts during the period 1994-2016 to the population at risk defined here as the 1944-1952 cohorts (alive) at the 1992 census. 8 Our calculation of the mortality rate may differ from the true mortality because of migration in and out of Romania. The number of immigrants (for the cohorts we study here) is close to zero and should not affect our results. Moreover, the VSM files include all people deceased abroad as long as they still have a Romanian residence and/or citizenship. Therefore, our mortality files should account for the majority of the Romanian migrants abroad who are temporary emigrants and do not change their 7 Lleras-Muney (2005) and Clark and Royer (2013) suggest that the largest effects of education on mortality occur before the age of 64. Life expectancy in Romania was 69.5 years in 1994, 74.2 in 2011, and 75.5 years in 2016. 8 We use sample weights to calculate the total population because we only have a 15% census sample. 11 permanent residence. 9 But we also directly examine the potential for bias due to migration by checking whether schooling expansion affects the probably of migration. The VSM file provides detailed information on the main cause of death so we are able to look separately at deaths associated with circulatory diseases and cancer. These are the two most important causes of death in Romania, accounting for 44.6% and 26.5% respectively of all deaths. Similar to Meghir et al. (2017) we also reclassify diseases according to the epidemiological literature as preventable and treatable; preventable causes of death may reflect health behaviors while the treatable causes of death may be related to access to healthcare. 10 Panel B of Table 1 shows the overall mortality rate and the mortality rate by category for our main sample. Approximately 26 percent of our sample died between 1994-2016. The largest category of deaths was due to circulatory diseases which account for 10.5 percentage points, followed by cancer and preventable deaths, at 47.7 and 5.9 percentage points respectively. Treatable diseases only accounted for 3.9 percentage points. Finally, we use the 2011 Romanian Census to compute a self-reported measure of health, which provides the impact of the schooling expansion on individuals who survived until 2011. All respondents are asked whether they have any health related problems that may affect their daily life at work, school, at home, etc. Approximately 7.6 percent of people in our cohorts of interest reported having such problems. Those who answered affirmatively were given a set of six follow-up questions – whether they were (i) visually, (ii) hearing, or (iii) movement impaired, (iv) whether they had any memory 9 According to Statistics Romania these emigrants are the vast majority (over the 95%) of emigrants. We use the ICD 10 codes for defining cancer, circulatory diseases and treatable and preventable causes of death. See the Notes at the end of the tables for more information. 10 12 or concentration problems, (v) self-care or (vi) difficulties in communication with their peers. 4.2 Empirical Strategy As described earlier, the schooling expansions in Romania occurred over a five year period from 1956 to 1961 and affected born between 1945 and 1950. Since the government rapidly expanded access to schooling during this period, a child born just after January 1 would have benefited from the additional schools slots created by the government over the course of a year, as compared to a child born just before January 1 who would have been part of an earlier cohort. Indeed, the discontinuities in the fraction of individuals whose highest level of education was primary school were clearly visible in Figure 3 for the years 1945, 1947, 1948 and 1949. In this section, we estimate these discontinuities more formally using a regression discontinuity (RD) design. We estimate the differences across successive cohorts during the period of educational expansion (i.e. in the “treatment years” of 1945-1950) using the following equation: 𝑦𝑦𝑖𝑖 = 𝛽𝛽 ′ 𝑋𝑋𝑖𝑖 + 𝛼𝛼𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 + 𝑓𝑓(𝑑𝑑𝑑𝑑𝑦𝑦𝑖𝑖 ) + 𝜀𝜀𝑖𝑖 (1) where 𝑦𝑦𝑖𝑖 is an outcome such as education or mortality for individual 𝑖𝑖, 𝑋𝑋𝑖𝑖 is a set of control variables, 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 is an indicator for individuals born just after the school entry cutoff of January 1, and 𝑓𝑓(𝑑𝑑𝑑𝑑𝑦𝑦𝑖𝑖 ) is a parametric or non-parametric function of the day of birth which serves as our running variable. For simplicity, our preferred specifications do not include any control variables except for a constant, although including them does not affect our results. The coefficient on 𝛼𝛼 is an estimate for the effect of being born just after the school entry cutoff on the relevant outcome. When the outcome is a measure of 13 education, such as years of schooling, it represents a “first stage” estimate; when the outcome is a measure of health, such as mortality, it represents the “reduced-form” estimate. If we assume that the exclusion restriction holds (i.e. that being born after the school entry cutoff affects mortality only through years of schooling), the ratio of the reduced-form and first stage coefficients represents an estimate for the impact of education on mortality. However, the exclusion restriction may not hold since those individuals born just after the school entry cutoff are generally the oldest children in their class; that is, if relative age has an independent effect on health or mortality. In order to account for any independent effect of relative age, we also compare individuals who were born just before and after the school entry cutoff in a period without educational expansion (i.e. in the “control years” of 1950-1953). We do this by estimating a regression equation similar to equation (1) above using this set of control years. But we also estimate the following models that directly compare the impact of being born just after the school entry cutoff in treatment years to control years: 𝑦𝑦𝑖𝑖 = 𝛽𝛽 ′ 𝑋𝑋𝑖𝑖 + 𝛼𝛼𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 + 𝛾𝛾𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 + 𝛿𝛿𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 ∗ 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 + 𝑓𝑓(𝑑𝑑𝑑𝑑𝑦𝑦𝑖𝑖 ) + 𝜀𝜀𝑖𝑖 (2) where 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 is an indicator for individuals born during years of educational expansion 1945-1950, and the other variables are defined as before (with some abuse of notation). In this specification, the coefficient on the interaction term, 𝛿𝛿, yields the impact of being born just after the school entry cutoff during treatment years over and above the effect in control years that did not experience educational expansions. A key consideration when implementing a regression discontinuity design is the functional form of the forcing variable, 𝑓𝑓(𝑑𝑑𝑑𝑑𝑦𝑦𝑖𝑖 ). We present estimates using a local linear regression as suggested by Hahn, Todd, and van der Klaauw (2001). The choice of 14 the window is somewhat arbitrary as we need to strike a balance between the advantages of having more precise estimates with larger windows and mitigating the possibility of confounding time effects with more narrow windows. Therefore, for our main tables we present specifications using a 180, 120, 90, 60 and 30 day intervals, as well as the Imbens-Kalyanarman (IK) optimal bandwidth (Imbens and Kalyanarman, 2012). We also confirm that our results are robust to using parametric specifications that include higher order polynomials such as linear, quadratic and cubic trends in day of birth (results available by request). All regressions cluster on day of birth in order to avoid the problems associated with specification error in the case of discrete covariates (Lee and Card, 2008). A common specification check for the regression discontinuity design is to verify that the density of observations is continuous around the cutoff (McCrary, 2008). When we examine the density, we find substantial heaping on January 1 and on some of the days immediately preceding it. 11 We believe that this heaping is due to delays in the reporting of births that occurred during the holiday period between Christmas and New Year’s Day when government offices were closed. 12 Insofar as this type of heaping is similar for our “treatment” and “control” years, we can account for this issue in the regression that uses both sets of years. However, we also attempt to deal with this issue using a “donut-RD” design as suggested by Barreca, Lindo, Waddel (2016). In particular, we present all of our results when dropping individuals born within 7 days of January 1 in order to be symmetric around the cutoff. Results are qualitatively similar when we exclude individuals born more than one week 11 These density tests are shown in Appendix Table 1 and Appendix Figure 1. They are structured in a similar fashion to the main tables as described in the results section. 12 Indeed, it appears the spike in observations occurs on January 2 in years when January 1 is a Sunday. 15 before or after January 1 or when we exclude individuals born only one or several days before January 1 (available by request). 13 5. Results 5.1 Effects on educational attainment We begin by estimating the impact of the schooling expansion on years of completed schooling based on the level of education recorded in the 1992 Census. These “first stage” results are shown in Table 2 which has three panels: Panel A presents estimates for 𝛼𝛼 from equation (1) using the treatment years, 1945-1950; Panel B presents estimates for 𝛼𝛼 from equation (1) using the control years, 1950-1953; Panel C presents estimates for 𝛼𝛼 and 𝛿𝛿 from our preferred specification (2) which includes both treatment and control years. Columns (1) to (6) in each panel show estimates for alternative bandwidths. These include 180, 120, 90, 60 and 30 days of the January 1 cutoff, as well as the optimal bandwidth proposed in Imbens and Kalyanaraman (2012). Columns (7) to (12) show analogous specifications that exclude observations within 7 days of the January 1 cutoff (i.e. 7 day donut-RD regressions). Panel A of Table 2 indicates that each successive cohort during the school expansion period 1945-1950 received an additional 1/5 to 3/5 years of schooling; the point estimates for the impact of being born just after vs. just before the January 1 cutoff in the treatment years range from 0.21 to 0.67 years of schooling using our different bandwidths. In contrast, the estimates in Panel B showing the impact of being born just after vs. just before January 1 in the control years of 1950-1953 are small and 13 We also verify that our available covariates vary smoothly around the discontinuity in Appendix Tables 2 and 3. With a few exceptions, the coefficients are mostly small and insignificant. 16 statistically insignificant in all specifications. Panel C shows estimates from the specification that combines both treatment and control years. In these specifications, the impact of the school expansion is captured by 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 ∗ 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 and shows impacts of 0.23 to 0.59 years of schooling, all highly significant. The results using the donut specifications are about 30% smaller in magnitude but still statistically significant in all specifications. 14 The range of these estimate is not altogether surprising given the large number of different specifications that we consider. However, we take our preferred specification to be the IK bandwidth for the full sample, implying a first stage effect of approximately a 1/2 year of schooling. We also present our “first stage” results graphically in Figure 4. Panels A, C and E plot average years of schooling by day of birth for individuals born six months before and after January 1st of each year; panels B, D and F plot the same data by week of birth, which often makes it easier to discern the patterns. The graphs are normalized so that day 1 corresponds to January 1 and week 1 corresponds to the week of January 1 to January 7. The fitted lines are based on smoothed local linear regressions using the IK bandwidth. Panels A and B show a clear discontinuity after January 1 for the treatment years of 1945-1950. This visual evidence confirms that individuals born merely a couple of days apart received a substantially different amount of schooling as a result of the school expansion. In contrast, panels C and D of Figure 4 reveal no change in average educational attainment before and after January 1st in the control cohort. Nevertheless, each of the first four panels in Figure 4 show some time trends, consistent with the 14 Appendix Table 4 uses the 1994-1996 LSMS datasets to estimate the impact of the schooling expansion on reported years of schooling rather than an imputed measure based on completed educational levels. The results are somewhat less precise but generally similar to those in Table 4. 17 presence of seasonality in the timing of births. Such time effects are not visible in Panels E and F of Figure 4, which use both treatment and control years to estimate a version of equation (2) that differences out the impacts in the control years from those in the treatment years. 5.2 Effects on mortality and self-reported health In this section we analyze whether, in addition to affecting education, the school expansion policy had an impact on mortality and health. We begin by using the Vital Statistics data from 1994 to 2016 to examine mortality. Table 3, which has the same structure as the previous tables, reveals no evidence of a statistically significant effect of being born just after vs. just before the January 1 cutoff on mortality in the treatment years of 1945-1950 (in Panel A) or in the control years of 1950-1953 (in Panel B), except for the smallest bandwidths. Furthermore, all of the significant effects disappear once we consider the donut regression that excludes individuals born 7 days before and after January 1. We see a couple of marginally significant effects on 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 ∗ 𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝐴𝑖𝑖 in Panel C that includes both treatment and control years, although these have positive signs. Still, with 10 out of the 12 point estimates from our preferred specification in Panel C not showing any statistically significant effect, we conclude that there is no evidence for an impact of the schooling expansion on mortality. Given the standard errors for our full sample, we can rule out with 95% confidence that the schooling expansions reduced mortality by more than 0.8 percentage points between 1994-2016 when the average mortality rate was 26 percent. A graphical analysis of the mortality results is presented in Figure 5, structured similarly to the preceding figures. The patterns in Panels A-F provide a visual 18 interpretation of the regression estimates from Table 3. We do not see evidence for large discontinuities in the mortality rate between 1994 and 2016 and, if anything, they point against the finding that education reduces mortality. We also consider the effect of the schooling expansion on specific causes of death. We first focus on mortality from the two most common causes of death in Romania: cancer and circulatory diseases. The regression estimates for these causes of death are shown in Tables 4 and 5 respectively, while the figures are shown in Appendix Figure 2 and 3 respectively. We also classify certain causes of death as preventable or treatable, similar to Meghir et al. (2017). The regression estimates for these causes of death are shown in Appendix Table 5 and 6 respectively. In none of the tables do we observe evidence for a consistent effect of the schooling expansion on mortality. Similarly, none of the corresponding graphs show visible discontinuities around the regression discontinuity cutoffs. Thus, we do not find any more evidence for the impact of the schooling expansion on specific causes of death than on the mortality rate as a whole. In addition to the impact of the schooling expansion on mortality, we also examine its effect on self-reported health using the 2011 Romanian Census. This is shown in Table 6 and Appendix Figure 4, which are again structured in a similar fashion to the previous tables and figures. Overall, we do not find an impact of the schooling expansion on self-reported health among individuals who survived until 2011. It is worth noting that, since we have access to the full 100% sample of the 2011 Census, these results are estimated with substantial precision. Given our standard errors for the full sample, we can rule out with 95% confidence that the schooling expansions reduced the fraction of people with health related problems by 0.06 percent, which corresponds to 0.023 standard deviation units. We have also explored the specific dimensions of 19 health used to create our self-reported health index (i.e. vision, hearing, impaired movement, memory, self-care and communication) and did not find any meaningful impacts for these specific categories. 5.3 Alternative explanations 5.3.1 Quality of education Despite the clear impacts of the schooling reform on educational attainment, one might question the quality of this education (especially in light of the rapid expansion of education during these early years) and ask whether the expansion also had an impact on other important outcomes. Table 7 presents estimates for the impact of the schooling expansions on labor force participation, measured as an indicator for being employed at the time of the 1992 Census. 15 This table is structured similarly to the other tables, with Panels A, B, and C showing impacts for the treatment years, control years, and all the years together. The impact of the schooling expansion is most clearly visible in Panel A of Table 7 which shows that individuals entering school in successive cohorts are 1.2 to 1.6 percent more likely to be employed. 16 These impacts are less robust in Panel C and in the donut regressions, but the patterns are largely consistent. A graphical depiction of these impacts can be seen in Panels A, B, E and F of Appendix Figure 5. We also observe significant impacts of Romania’s schooling expansion on fertility, as shown in Appendix Table 7. Our preferred estimates reported in Panel C using the full sample of women show that exposure to the expansion decreased fertility 15 Unfortunately, the 1992 Census does not contain any information about earnings or income. We also found impacts on occupational composition, such as the likelihood of working in a manual occupation or the skill level associated with one’s occupation. These are available by request. 16 20 by 0.08 and 0.29 children. To summarize, these labor market and fertility effects suggest that the educational expansion had an impact on a range of socio-economic outcomes. 5.3.2 Migration To address concerns about bias due to migration, we consider whether our school expansion directly affected the probability of external migration. The 2011 census contains information on all persons who migrated abroad for a period of at least 12 months (at the time of the census). So the vast majority of the Romanian emigrants are covered; i.e., all individuals working abroad who maintain their houses, identity cards or/and remain registered by the Romanian administrative bodies. 17 Using a similar strategy as before, we show in Appendix Table 8 that there is no impact of the schooling expansion on the likelihood of the individuals (who survived until 2011) to emigrate. The migration results presented above, while reassuring, are not able to capture any possible effects of the schooling expansion on permanent migration. We address this possibility through an indirect test. Using information from the 1992 and 2011 census samples, we calculate the (weighted) number of people born in a given day who are in the 2011 census as a (weighted) fraction of the number in the 1992 census. This ratio should capture a combination of both mortality and migration between 19922011. These results are presented in Appendix Table 9, and confirm that there is no impact of the school expansion on this combined measure of mortality and migration. 17 According to Statistics Romania, about 95% of the Romanian emigrants are temporary migrants, meaning that they keep their Romanian ID’s. Moreover, the death of these individuals is reported in the Romanian Mortality Files. While permanent migrants who do not remain registered by the Romanian administrative bodies are not covered, we believe this is a second-order issue because these are mostly highly educated migrants (university or more) who, most likely, were not affected by our policy. 21 5.4 Mechanisms Our findings indicate that the Romanian schooling expansion did not improve health or reduce mortality. In this section, we attempt to explore some of the mechanisms underlying these findings. However, insofar as education can impact health and mortality through many different channels, our discussion remains largely speculative. First, more education may lead to higher income and perhaps better health care. While Romania has universal access to the public healthcare system independently of the individual income, financial resources may still be important because of informal payments (i.e. bribes). Our main results did suggest that the schooling expansion led to greater labor market opportunities (e.g. higher employment) but the Census data did not include information on income. In Appendix Table 10, we use the LSMS survey to examine whether the impact of the school expansion affected income and found positive but insignificant effects. Note that, using the LSMS data, we find positive and significant impacts of the schooling expansion on employment, similar to the results using Census data. 18 Second, even if more education would lead to higher incomes, the impact of income on health is not obvious. Income could allow individuals to access better health care, but it may also lead to an increased consumption of unhealthy goods, such as alcohol and cigarettes. This seems to be the case in Romania where, using the Romanian Household Budget Survey, we find positive and significant correlations between 18 Education could also affect mortality through changes in the occupation structure. Indeed, we observe some evidence that Romania’s schooling expansion, shifted individuals out of manual jobs and farming and into technicians and professional jobs. However, whether these changes should have led to improved health is not completely clear if more education enables individuals to find work in more skilled occupations, with better working conditions, we might expect to find positive health impacts. However, some skilled occupations may be associated with more stress than certain less skilled occupations. Moreover, it is possible that some relatively skilled manufacturing jobs may have worse working conditions than jobs in the informal sector such as agriculture. 22 education and smoking. However, when we attempted to estimate our regression discontinuity specifications using this data, we find no significant effects of education on smoking behavior (see Appendix Table 11). 19 Using the same data, we also find no effects on the likelihood of having a chronic condition. 20 Thus, our analysis does not yield any strong conclusions about the role of particular mechanisms in explaining our results. 21 However, these results need to be interpreted with care since they are mostly based on imprecise estimates using small auxiliary datasets. 6. Conclusion This paper analyzes a schooling expansion in Romania, which aimed to ensure that all students received at least 7 years of compulsory schooling. The schooling expansion affected five consecutive cohorts born between 1945-1950 and we use a regression discontinuity (RD) design to estimate impacts by comparing the differences across successive cohorts of affected students. We find that beginning school in a (one year) later cohort increases educational attainment by approximately a 1/2 year of schooling. We do not find any consistent significant impacts of the schooling reform on selfreported health or mortality. Moreover, we can rule out that the schooling expansions reduced mortality by more than 0.08 percentage points between 1994 and 2016 or that 19 Specifically, we use the 2001-2009 Romanian Household Budget Survey (RHBS) which is a national representative survey, covering about 30,000 households each year and contains detailed socio-economic information on all household members. Note that the RHBS data does not have the day of birth, but only the month and year and therefore we cannot show the donuts specifications. 20 The RHBS data also showed no effect on the likelihood of being hospitalized or on the number of days hospitalized during the last 30 days (results available by request). 21 Given the findings in Aaronson et al. (2017), we also examined the role of internal migration. However, we did not find significant effects of the schooling expansion on internal migration, measured as an indicator for whether the person lives in the locality of birth in 2011 (results available upon request). 23 they improved self-reported health by more than 0.02 standard deviation units for the full sample of individuals in the affected cohorts. Whether education causally affects health and mortality is an important question for both developed and developing countries alike. However, most of the previous work has focused on the United States and Western Europe. The findings in this literature are mixed and there is not strong evidence that education significantly improves health or decreases mortality. We extend the literature by estimating causal impacts for a population that is substantially poorer and also experienced changes at a lower margin of educational attainment. However, our findings only serve to reinforce the absence of a causal effect of education health and mortality, even in this setting. While we have attempted to examine the underlying mechanisms for these findings, more work needs to be done to better understand why we do not observe a strong relationship between education and health across a variety of different settings. 24 References Daniel A., B. Mazumder, S.G. Sanders, and E. Taylor (2017) “Estimating the Effect of School Quality on Mortality in the Presence of Migration: Evidence from the Jim Crow South”. SSRN working paper. Albouy, V. and l. Lequien (2009) ”Does Compulsory education lower mortality?” Journal of Health Economics, 28(1): 155-168. Arendt, J.N. (2005) “Does education cause better health? A panel data analysis using school reforms for identification.” Economics of Education Review, 24(2):149 –160. Barreca, A., J. Lindo, and G. Waddel (2016) “Heaping- Induced Bias in RegressionDiscontinuity Designs” Economic Inquiry, 54(1): 268-293. Black, D. A., Hsu, Y. C. & Taylor, L. J. (2015). “The effect of early-life education on later-life mortality”. Journal of Health Economics, 44, 1-9. Black, S.E., P.J. Devereux, and K.G. Salvanes (2011) “Too young to leave the nest? The effects of school starting age”. Review of Economics and Statistics 93(2):455–467 Braham, R. L. (1963) Education in the Rumanian People's Republic. Washington, D.C : U.S. Dept. of Health, Education, and Welfare, Office of Education. Braham, R.L., 1972. Education in Romania: A Decade of Change. US Government Printing Press. Buckles, K., A. Hagemann, O. Malamud, M. Morrill, and A. Wozniak (2016) “The effect of college education on mortality” Journal of Health Economics, 50: 99-114. Cascio, E.U. and D. W. Schanzenbach (2016) “First in the Class? Age and the Education Production Function” Education Finance and Policy 11(3): 225-250. Clark, D. and H. Royer (2013) “The effect of education on adult mortality and health: Evidence from Britain” The American Economic Review, 103(6), 2087-2120. Davies, N. M., Dickson, M., Smith, G. D., Van den Berg, G. & Windmeijer, F. (2016) “The causal effects of education on health, mortality, cognition, well-being, and income in the UK Biobank”. bioRxiv 074815. Preprint. De Walque, D. (2007) “Does education affect smoking behaviors? Evidence using the Vietnam draft as an instrument for college education” Journal of Health Economics 26 (5), 877–895. Filipescu, V. and Oprea O. (1972) Invatamantul obligatoriu in Romania si in alte tari. Editura Didactica si Pedagogica, Bucuresti 25 Galama, T.J., A. Lleras-Muney, H. van Kippersluis (2018) “The Effect of Education on Health and Mortality: A Review of Experimental and Quasi-Experimental Evidence”. NBER Working Paper No. 24225 Giurescu, C, I Ivanov and N. Mihaileanu, editors (1971) Istoria învăţământului din România : compendiu. Editura Didactică şi Pedagogică, Bucuresti Grimard, F. and D. Parent (2007) “Education and smoking: Were Vietnam war draft avoiders also more likely to avoid smoking?” Journal of Health Economics, 26 (5), 896– 926. Hahn, J., Todd, P. and van der Klaauw (2001) “Identification and Estimation of Treatment Effects with a Regression-Discontinuity Design” Econometrica, 69 (1): 201-209. Imbens, G. and K. Kalyanaraman (2012) “Optimal Bandwidth Choice for the Regression Discontinuity Estimator” Review of Economic Studies 79, 933–959. Lee, D.S., and D. Card (2008) “Regression discontinuity inference with specification error” Journal of Econometrics, 142, 655-674. Lleras-Muney, A. (2005) “The Relationship Between Education and Adult Mortality in the United States” Review of Economic Studies, 72, 189-221. Mazumder, B. (2008) “Does Education Improve Health: A Reexamination of the Evidence from Compulsory Schooling Laws” Economic Perspectives, 33(2), 2-16. Mazumder, B. (2012) “The effects of education on health and mortality” Nordic Economic Policy Review No. 1, 261-302. Meghir, C., M. Palme and E. Simeonova (2017) “Education, Health and Mortality: Evidence from a Social Experiment” forthcoming, American Economic Journal: Applied Economics Oreopoulos, P. (2006) “Estimating Average and Local Average Treatment Effects of Education when Compulsory School Laws Really Matter” American Economic Review 96(1), 152-175. van Kippersluis, H. O. O’Donnell and E. van Doorslaer (2011) “Long-Run Returns to Education: Does Schooling Lead to an Extended Old Age?” Journal of Human Resources 46(4): 695–721. 26 Thousands 400 350 Number of Graduates 300 250 200 150 100 50 0 1950 1955 1960 1965 1970 Year of Graduation FIGURE 1: Graduates from Gymnasium Schools by Year of Graduation Notes: Figure 1 plots the number of students graduating from gymnasium between 1951 and 1971. Source: Romanian Statistical Yearbook 1975 Figure 2: Educational achievements in Romania by year of birth 1 0.9 Proportion 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 1915 1920 1925 1930 1935 1940 1945 1950 1955 1960 Year of Birth primary secondary tertiary FIGURE 2: Educational achievements in Romania by year of birth Notes: Figure 2 plots the the highest educational attainment by year of birth for cohorts of individuals. Source: 1992 Romanian Census (PAU sample) 1965 1970 0.2 Percent Primary (residuals) 0.15 0.1 0.05 0 -0.05 -0.1 Jan-43 Jan-44 Jan-45 Jan-46 Jan-47 Jan-48 Jan-49 Jan-50 Jan-51 Jan-52 Jan-53 Jan-54 Jan-55 Jan-56 FIGURE 3:Effect of educational expansion for cohorts born 1943-1955 by month of birth Notes: This figure plot the percent of individuals born between 1943 and 1955 who completed primary education by their month of birth, which are based on residuals. Source: 1992 Romanian Census (PAU sample) Years of schooling 8 8 Years of schooling 11 Panel B: Treatment 11 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Years of schooling 9 9 Years of schooling 11 Panel D: Control 11 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth Panel F: Treatment-Control Years of schooling -2 -2 Years of schooling 0 0 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth FIGURE 4: Years of Schooling Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1992 Romanian Census (PAU sample). Panel B: Treatment .2 .2 Mortality Rate Mortality Rate .4 .4 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Panel D: Control .1 .1 Mortality Rate Mortality Rate .3 .3 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth Mortality Rate -.1 -.1 Mortality Rate .2 Panel F: Treatment-Control .2 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth FIGURE 5: Mortality Rate Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1994-2011 Vital Statistics Mortality files Table 1: Summary Statistics Mean S.D. Obs 0.506 42.224 0.500 2.575 375,103 375,103 0.893 0.073 0.015 0.018 9.578 0.076 0.309 0.261 0.122 0.134 3.660 0.265 375,103 375,103 375,103 375,103 373,980 2,058,787 0.260 0.081 3,284 0.077 0.105 0.059 0.039 0.026 0.039 0.018 0.014 3,284 3,284 3,284 3,284 Panel A: Census data Female Age Ethnicity Romanian Hugarian Roma Other Years of schooling Self-reported health index (2011) Panel B: Mortality data Overall mortality Mortality by category Cancer Circulatory Preventable Treatable Source: 1992 Romanian Census (PAU sample) & Romania VSM files Table 2: Effects of the Educational Expansion on Years of Schooling Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) After 0.209*** [0.048] 0.299*** [0.058] 0.395*** [0.067] 0.517*** [0.082] 0.669*** [0.124] 0.644*** [0.115] 0.104*** [0.039] 0.160*** [0.049] 0.242*** [0.059] 0.336*** [0.077] 0.329** [0.125] 0.344*** [0.082] Sample size 232,899 150,135 108,458 68,527 32,013 36,820 224,186 141,422 99,745 59,814 23,300 53,167 0.020 0.019 0.019 0.018 0.018 0.018 0.020 0.019 0.019 0.018 0.016 0.017 R-squared Panel B: Control years After -0.025 [0.049] -0.005 [0.064] 0.030 [0.078] 0.062 [0.102] 0.079 [0.156] 0.073 [0.146] -0.056 [0.045] -0.042 [0.060] -0.004 [0.074] 0.023 [0.100] -0.094 [0.172] 0.016 [0.108] Sample size 135,114 89,330 65,606 41,882 19,599 22,519 130,133 84,349 60,625 36,901 14,618 32,849 0.001 0.001 0.001 0.000 0.000 0.000 0.001 0.001 0.000 0.000 0.000 0.000 -0.025 [0.049] -0.005 [0.064] 0.030 [0.078] 0.062 [0.102] 0.079 [0.156] 0.073 [0.146] -0.056 [0.045] -0.042 [0.060] -0.004 [0.074] 0.023 [0.100] -0.094 [0.172] 0.016 [0.108] After*Treatment 0.234*** [0.056] 0.304*** [0.072] 0.365*** [0.085] 0.455*** [0.103] 0.590*** [0.126] 0.572*** [0.122] 0.160*** [0.057] 0.202*** [0.077] 0.246** [0.096] 0.313** [0.129] 0.423* [0.214] 0.328** [0.139] Sample size R-squared 368,013 0.022 239,465 0.021 174,064 0.020 110,409 0.019 51,612 0.018 59,339 0.018 354,319 0.022 225,771 0.021 160,370 0.020 96,715 0.019 37,918 0.018 86,016 0.019 R-squared Panel C: All years After Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Table 3: Effects of Educational Expansion on Mortality Rate Bandwidth (days) 180 (1) Panel A: Treated years 120 (2) Full sample 90 60 (3) (4) 30 (5) IK (6) 180 (7) Excluding 7 days on each side of cutoff 120 90 60 30 (8) (9) (10) (11) IK (12) 0.010 [0.009] 0.006 [0.011] 0.003 [0.013] 0.003 [0.015] -0.006 [0.016] 0.003 [0.012] 0.011 [0.010] 0.007 [0.014] 0.003 [0.018] 0.004 [0.026] -0.016 [0.053] 0.011 [0.010] Sample size 2,154 1,434 1,074 714 354 1,152 2,070 1,350 990 630 270 2,105 R-squared 0.118 0.112 0.105 0.108 0.130 0.106 0.124 0.119 0.113 0.118 0.147 0.124 -0.011 [0.007] -0.012 [0.008] -0.015 [0.009] -0.020* [0.011] -0.046*** [0.015] -0.014 [0.009] -0.002 [0.007] 0.004 [0.009] 0.006 [0.010] 0.013 [0.012] -0.001 [0.022] -0.002 [0.007] Sample size 1,077 717 537 357 177 576 1,035 675 495 315 135 1,053 R-squared 0.071 0.073 0.079 0.082 0.146 0.078 0.058 0.055 0.055 0.046 0.042 0.059 After -0.011 [0.007] -0.012 [0.008] -0.015 [0.009] -0.020* [0.011] -0.046*** [0.015] -0.014 [0.009] -0.002 [0.007] 0.004 [0.009] 0.006 [0.010] 0.013 [0.012] -0.001 [0.022] -0.002 [0.007] After*Treatment 0.021* [0.012] 0.018 [0.014] 0.018 [0.016] 0.023 [0.019] 0.040* [0.020] 0.017 [0.016] 0.013 [0.014] 0.003 [0.018] -0.003 [0.023] -0.009 [0.032] -0.014 [0.062] 0.013 [0.013] 3,231 0.253 2,151 0.242 1,611 0.232 1,071 0.219 531 0.223 1,728 0.234 3,105 0.261 2,025 0.252 1,485 0.242 945 0.229 405 0.228 3,158 0.261 After Panel B: Control years After Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Table 4: Effects of Educational Expansion on Mortality Rate due to Cancer Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 0.001 [0.003] 0.001 [0.004] -0.000 [0.005] -0.001 [0.005] -0.003 [0.005] 0.000 [0.004] 0.001 [0.004] -0.000 [0.005] -0.001 [0.007] -0.002 [0.010] -0.012 [0.019] 0.001 [0.004] Sample size 2,154 1,434 1,074 714 354 1,248 2,070 1,350 990 630 270 2,105 R-squared 0.062 0.058 0.055 0.063 0.098 0.056 0.067 0.063 0.060 0.066 0.094 0.067 -0.003 [0.002] -0.002 [0.003] -0.003 [0.004] -0.004 [0.004] -0.014** [0.006] -0.002 [0.003] 0.001 [0.003] 0.003 [0.003] 0.004 [0.004] 0.008 [0.005] -0.000 [0.009] 0.000 [0.002] Sample size 1,077 717 537 357 177 624 1,035 675 495 315 135 1,053 R-squared 0.049 0.047 0.049 0.049 0.104 0.048 0.039 0.036 0.035 0.028 0.022 0.040 After -0.003 [0.002] -0.002 [0.003] -0.003 [0.004] -0.004 [0.004] -0.014** [0.006] -0.002 [0.003] 0.001 [0.003] 0.003 [0.003] 0.004 [0.004] 0.008 [0.005] -0.000 [0.009] 0.000 [0.002] After*Treatment 0.004 [0.004] 0.003 [0.006] 0.003 [0.006] 0.004 [0.008] 0.011 [0.008] 0.003 [0.006] 0.000 [0.005] -0.003 [0.007] -0.006 [0.009] -0.010 [0.013] -0.012 [0.024] 0.001 [0.005] 3,231 0.167 2,151 0.160 1,611 0.153 1,071 0.145 531 0.163 1,872 0.157 3,105 0.174 2,025 0.169 1,485 0.162 945 0.152 405 0.158 3,158 0.174 After Panel B: Control years After Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. We use the ICD-10 diseases codes - chapter C for cancer. Table 5: Effects of Educational Expansion on Mortality Rate due to Circulatory Diseases Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 0.007** [0.004] 0.005 [0.004] 0.003 [0.005] 0.002 [0.006] -0.003 [0.007] 0.005 [0.004] 0.009** [0.004] 0.007 [0.006] 0.005 [0.007] 0.006 [0.010] 0.005 [0.019] 0.009** [0.004] Sample size 2,154 1,434 1,074 714 354 1,530 2,070 1,350 990 630 270 1,974 R-squared 0.223 0.216 0.210 0.215 0.238 0.217 0.227 0.220 0.215 0.222 0.255 0.226 -0.003 [0.002] -0.003 [0.003] -0.004 [0.003] -0.005 [0.003] -0.012*** [0.004] -0.003 [0.003] -0.000 [0.003] 0.000 [0.004] 0.001 [0.004] 0.003 [0.005] -0.004 [0.010] -0.000 [0.003] Sample size 1,077 717 537 357 177 765 1,035 675 495 315 135 987 R-squared 0.123 0.128 0.137 0.144 0.201 0.126 0.108 0.105 0.108 0.100 0.095 0.107 After -0.003 [0.002] -0.003 [0.003] -0.004 [0.003] -0.005 [0.003] -0.012*** [0.004] -0.003 [0.003] -0.000 [0.003] 0.000 [0.004] 0.001 [0.004] 0.003 [0.005] -0.004 [0.010] -0.000 [0.003] After*Treatment 0.010** [0.004] 0.008 [0.005] 0.007 [0.006] 0.007 [0.007] 0.009 [0.007] 0.008 [0.005] 0.010* [0.005] 0.006 [0.007] 0.004 [0.008] 0.004 [0.011] 0.009 [0.020] 0.009* [0.005] 3,231 0.410 2,151 0.399 1,611 0.391 1,071 0.383 531 0.388 2,295 0.401 3,105 0.413 2,025 0.402 1,485 0.393 945 0.382 405 0.380 2,961 0.412 After Panel B: Control years After Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. We use the ICD-10 diseases codes- chapter I for the circulatory diseases. Table 6: Effects of Educational Expansion on Self-reported Health Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) -0.001 [0.001] -0.001 [0.002] -0.001 [0.002] -0.001 [0.002] -0.000 [0.002] -0.001 [0.002] -0.000 [0.001] -0.001 [0.002] -0.001 [0.002] -0.001 [0.002] 0.000 [0.004] 0.000 [0.003] 1,247,056 808,952 586,032 370,857 171,204 331,445 1,202,917 764,813 541,893 326,718 127,065 287,306 0.013 0.013 0.012 0.012 0.012 0.012 0.013 0.013 0.013 0.013 0.013 0.013 After Sample size R-squared Panel B: Control years After 0.001 [0.001] -0.000 [0.001] -0.001 [0.002] -0.002 [0.002] -0.001 [0.003] -0.002 [0.002] 0.001 [0.001] -0.000 [0.002] -0.002 [0.002] -0.004 [0.003] -0.007 [0.005] -0.005* [0.003] Sample size 777,000 515,459 379,783 242,941 113,725 217,098 749,024 487,483 351,807 214,965 85,749 189,122 0.011 0.011 0.011 0.011 0.011 0.011 0.011 0.011 0.011 0.011 0.011 0.011 After 0.001 [0.001] -0.000 [0.001] -0.001 [0.002] -0.001 [0.002] -0.000 [0.003] -0.002 [0.002] 0.002 [0.001] -0.000 [0.002] -0.002 [0.002] -0.003 [0.003] -0.006 [0.005] -0.005 [0.003] After*Treatment -0.002 [0.002] -0.001 [0.002] -0.001 [0.002] -0.000 [0.003] -0.000 [0.003] 0.000 [0.003] -0.002 [0.002] -0.001 [0.002] 0.001 [0.003] 0.003 [0.004] 0.006 [0.008] 0.005 [0.005] 2,024,056 0.012 1,324,411 0.012 965,815 0.012 613,798 0.012 284,929 0.012 548,543 0.012 1,951,941 0.012 1,252,296 0.012 893,700 0.012 541,683 0.012 212,814 0.012 476,428 0.012 R-squared Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Table 7: Effect of Educational Expansion on Employment Full sample Bandwidth (days) 180 Excluding 7 days on each side of cutoff 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years 0.010*** After [0.004] (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 0.017*** [0.004] 0.021*** [0.005] 0.027*** [0.005] 0.037*** [0.007] 0.034*** [0.007] 0.003 [0.003] 0.008* [0.004] 0.010* [0.005] 0.015** [0.007] 0.023 [0.015] 0.008* [0.004] 233,402 0.002 150,439 0.002 108,658 0.002 68,721 0.002 32,116 0.003 40,385 0.003 224,659 0.002 141,696 0.002 99,915 0.002 59,978 0.002 23,373 0.002 143,836 0.002 Panel B: Control years -0.005 After [0.005] 0.001 [0.006] 0.007 [0.006] 0.011 [0.008] 0.022** [0.010] 0.022** [0.009] -0.009* [0.005] -0.005 [0.007] 0.001 [0.008] 0.007 [0.012] 0.040* [0.020] -0.005 [0.007] 135,396 0.000 89,500 0.000 65,692 0.000 42,013 0.000 19,661 0.000 24,801 0.000 130,393 0.000 84,497 0.000 60,689 0.000 37,010 0.000 14,658 0.001 85,598 0.000 -0.005 [0.005] 0.001 [0.006] 0.007 [0.006] 0.011 [0.008] 0.022** [0.010] 0.022** [0.009] -0.009* [0.005] -0.005 [0.007] 0.001 [0.008] 0.007 [0.012] 0.040* [0.020] -0.005 [0.007] After*Treatment 0.015*** [0.005] 0.016** [0.007] 0.015* [0.008] 0.016* [0.009] 0.015 [0.012] 0.012 [0.011] 0.012** [0.006] 0.013 [0.008] 0.009 [0.010] 0.008 [0.014] -0.017 [0.026] 0.013 [0.008] Sample size R-squared 368,798 0.002 239,939 0.002 174,350 0.002 110,734 0.002 51,777 0.003 65,186 0.002 355,052 0.002 226,193 0.002 160,604 0.002 96,988 0.002 38,031 0.002 229,434 0.002 Sample size R-squared Sample size R-squared Panel C: All years After Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. density 0 0 density .01 Panel B: Treatment .01 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth density 0 0 density .01 Panel D: Control .01 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth Panel F: Treatment-Control -.001 -.001 density density .001 .001 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth APPENDIX FIGURE 1: Density check Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1992 Romanian Census (PAU sample). Mortality Rate 0 0 Mortality Rate .2 Panel B: Treatment .2 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Mortality Rate -.05 -.05 Mortality Rate .15 Panel D: Control .15 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth Mortality Rate -.05 -.05 Mortality Rate .15 Panel F: Treatment-Control .15 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth APPENDIX FIGURE 2: Mortality Rate due to Cancer Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1994-2011 Vital Statistics Mortality files Mortality Rate 0 0 Mortality Rate .2 Panel B: Treatment .2 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Mortality Rate -.05 -.05 Mortality Rate .15 Panel D: Control .15 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth Mortality Rate -.05 -.05 Mortality Rate .15 Panel F: Treatment-Control .15 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth APPENDIX FIGURE 3: Mortality Rate due to Circulatory Diseases Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1994-2011 Vital Statistics Mortality files Health Index 0 0 Health Index .1 Panel B: Treatment .1 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Health Index 0 0 Health Index .1 Panel D: Control .1 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth -.05 -.05 Health Health Index .05 Panel F: Treatment-Control .05 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth APPENDIX FIGURE 4:Self-Reported Health Index Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 2011 Romanian Census fraction employedloyed .7 .7 fraction employedloyed .9 Panel B: Treatment .9 Panel A: Treatment -180 0 180 -26 day of birth 0 26 week of birth Panel D: Control .7 .7 fraction employedloyed fraction employedloyed .9 .9 Panel C: Control -180 0 180 -26 day of birth 0 26 week of birth fraction employedloyed -.1 -.1 fraction employedloyed .1 Panel F: Treatment-Control .1 Panel E: Treatment-Control -180 0 day of birth 180 -26 0 26 week of birth APPENDIX FIGURE 5: Rate of Employment Notes: Panels A and B are restricted to individuals born in the treatment years (1944-1950). Panels C and D are restricted to individuals born in the control years (1950-1953). Panels E and F are restricted to individuals born in both treatment and control years (1944-1953). The open circles indicate the mean of the outcome by day of birth (panels A, C and E) or week of birth (panels B, D and F). The solid lines are fitted values of residuals from local linear regressions of the dependent variable. Source: 1992 Romanian Census (PAU sample). Appendix Table 1: Density Checks Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) After 0.167** [0.073] 0.224** [0.100] 0.262** [0.122] 0.337** [0.150] 0.519*** [0.179] 1.005*** [0.119] 0.064*** [0.011] 0.072*** [0.013] 0.060*** [0.013] 0.049*** [0.011] 0.076*** [0.018] - Sample size R-squared 233,596 0.139 150,586 0.188 108,797 0.227 68,738 0.305 32,116 0.502 8,743 0.843 224,853 0.210 141,843 0.284 100,054 0.312 59,995 0.352 23,373 0.473 - Panel B: Control years After 0.134*** [0.050] 0.174** [0.070] 0.200** [0.088] 0.249** [0.114] 0.388** [0.148] 0.823*** [0.111] 0.069*** [0.010] 0.078*** [0.011] 0.069*** [0.012] 0.055*** [0.012] 0.067** [0.026] - Sample size R-squared 135,524 0.157 89,614 0.199 65,819 0.223 42,014 0.280 19,661 0.455 5,003 0.828 130,521 0.172 84,611 0.286 60,816 0.337 37,011 0.405 14,658 0.446 - 0.134*** [0.050] 0.174** [0.070] 0.200** [0.088] 0.249** [0.114] 0.388** [0.148] 0.823*** [0.111] 0.069*** [0.010] 0.078*** [0.011] 0.069*** [0.012] 0.055*** [0.012] 0.067** [0.026] - After*Treatment 0.033 [0.024] 0.050 [0.030] 0.063* [0.034] 0.088** [0.037] 0.131*** [0.032] 0.182*** [0.012] -0.005 [0.006] -0.006 [0.007] -0.009 [0.008] -0.006 [0.010] 0.009 [0.018] - Sample size R-squared 369,120 0.144 240,200 0.191 174,616 0.226 110,752 0.299 51,777 0.492 13,746 0.842 355,374 0.199 226,454 0.288 160,870 0.326 97,006 0.378 38,031 0.481 - Panel C: All years After Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Coefficients are multuplied by 100 for clarity. The IK bandwidth when excluding 7 days on each side of the cutoff is less than 7 days so these estimates are missing. Appendix Table 2: Specification Tests for Covariates (1) dependent variable: years of schooling bandwidth (days) 180 (1) 120 (2) Full sample 90 60 (3) (4) 30 (5) IK (6) 180 (1) Excluding 7 days on each side of cutoff 120 90 60 30 (2) (3) (4) (5) IK (6) Female After -0.025*** -0.038*** -0.050*** -0.073*** -0.104*** -0.104*** [0.009] [0.012] [0.014] [0.017] [0.022] [0.022] -0.009 [0.008] -0.016 [0.010] -0.023* [0.012] -0.047*** -0.101*** -0.036** [0.016] [0.025] [0.014] After*Treatment -0.013 [0.008] -0.022** [0.010] -0.028** [0.011] -0.024* [0.014] -0.026 [0.020] -0.026 [0.020] -0.006 [0.009] -0.012 [0.012] -0.016 [0.014] -0.002 [0.019] 0.050 [0.031] -0.012 [0.017] Sample size 368,798 239,939 174,350 110,734 51,777 51,777 355,052 226,193 160,604 96,988 38,031 115,133 0.001 0.001 0.002 0.003 0.006 0.006 0.000 0.001 0.001 0.001 0.002 0.001 0.022 [0.022] 0.032*** [0.010] R-squared Ethnic Romanian After 0.015*** 0.024*** 0.029*** 0.032*** 0.033*** 0.032*** [0.005] [0.006] [0.006] [0.007] [0.011] [0.008] 0.013** [0.005] 0.023*** 0.029*** 0.031*** [0.007] [0.008] [0.011] After*Treatment -0.008 [0.005] -0.011 [0.007] -0.012 [0.008] -0.012 [0.010] -0.007 [0.014] -0.010 [0.011] -0.009 [0.005] -0.014* [0.007] -0.017** [0.008] -0.019* [0.011] -0.007 [0.021] -0.019* [0.010] Sample size 368,798 239,939 174,350 110,734 51,777 90,026 355,052 226,193 160,604 96,988 38,031 109,578 0.001 0.001 0.001 0.001 0.002 0.001 0.001 0.001 0.001 0.001 0.001 0.001 -0.020 [0.015] -0.026*** [0.007] R-squared Ethnic Hungarian After -0.013*** -0.019*** -0.021*** -0.023*** -0.022*** -0.023*** -0.013*** -0.020*** -0.024*** -0.027*** [0.003] [0.004] [0.005] [0.005] [0.007] [0.006] [0.004] [0.005] [0.006] [0.008] After*Treatment 0.006 [0.004] 0.009 [0.006] 0.010 [0.007] 0.012 [0.008] 0.011 [0.012] 0.012 [0.008] 0.005 [0.005] 0.010 [0.006] 0.013 [0.008] 0.018* [0.011] 0.006 [0.019] 0.015* [0.009] Sample size R-squared 368,798 0.001 239,939 0.001 174,350 0.001 110,734 0.001 51,777 0.001 104,061 0.001 355,052 0.001 226,193 0.001 160,604 0.001 96,988 0.002 38,031 0.001 128,744 0.001 Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 3: Specification Tests for Covariates (2) dependent variable: years of schooling bandwidth (days) Full sample 90 60 (3) (4) 180 (1) 120 (2) After -0.000 [0.002] -0.002 [0.002] -0.004 [0.003] After*Treatment 0.001 [0.002] 0.000 [0.003] Sample size 368,798 Excluding 7 days on each side of cutoff 120 90 60 30 (2) (3) (4) (5) 30 (5) IK (6) 180 (1) IK (6) -0.004 [0.003] -0.005 [0.005] -0.004 [0.003] 0.001 [0.002] -0.001 [0.003] -0.003 [0.003] -0.002 [0.004] 0.006 [0.008] -0.002 [0.004] 0.000 [0.003] -0.001 [0.005] -0.005 [0.008] -0.002 [0.005] 0.002 [0.002] 0.001 [0.003] 0.001 [0.003] -0.002 [0.004] -0.012* [0.006] -0.002 [0.004] 239,939 174,350 110,734 51,777 105,028 355,052 226,193 160,604 96,988 38,031 105,506 0.000 0.001 0.001 0.001 0.001 0.001 0.000 0.000 0.000 0.000 0.001 0.000 After -0.002 [0.002] -0.004* [0.002] -0.004* [0.003] -0.004 [0.003] -0.006 [0.004] -0.004* [0.002] -0.001 [0.002] -0.002 [0.003] -0.003 [0.003] -0.002 [0.005] -0.008 [0.011] -0.001 [0.002] After*Treatment 0.001 [0.002] 0.002 [0.003] 0.002 [0.003] 0.001 [0.004] 0.001 [0.006] 0.002 [0.003] 0.002 [0.002] 0.003 [0.003] 0.003 [0.004] 0.003 [0.005] 0.013 [0.010] 0.001 [0.002] Sample size 368,798 239,939 174,350 110,734 51,777 238,075 355,052 226,193 160,604 96,988 38,031 360,792 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 -0.006 [0.006] -0.006 [0.012] -0.009*** [0.003] Ethnic Roma R-squared Ethnic Other R-squared Born in Bucharest After -0.009*** -0.011*** -0.012*** -0.011** [0.003] [0.003] [0.004] [0.005] -0.014** -0.012** -0.008*** -0.010*** -0.010** [0.007] [0.005] [0.003] [0.004] [0.004] After*Treatment 0.002 [0.003] 0.003 [0.004] 0.003 [0.005] 0.001 [0.006] 0.003 [0.009] 0.001 [0.006] 0.004 [0.003] 0.005 [0.004] 0.005 [0.005] 0.002 [0.008] 0.009 [0.016] 0.004 [0.004] Sample size R-squared 368,798 0 239,939 0 174,350 0.001 110,734 0.001 51,777 0.001 115,361 0.001 355,052 0 226,193 0 160,604 0 96,988 0.001 38,031 0.001 279,260 0 Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 4: Effects of Educational Expansion on Actual Years of Schooling 1994 LSMS Bandwidth (months) 6 1995-96 LSMS 5 4 3 2 6 5 4 3 2 (2) (3) (4) (5) (7) (8) (9) (10) (11) 0.550** [0.212] 0.641** [0.197] 0.981*** [0.149] 1.183*** [0.020] 0.175 [0.225] 0.356* [0.189] 0.495*** [0.126] 0.601*** [0.098] 0.837*** [0.014] 4,376 0.018 3,439 0.017 2,526 0.017 1,596 0.019 8,383 0.027 6,909 0.029 5,416 0.032 3,909 0.032 2,421 0.029 0.455 [0.268] 0.471** [0.172] 0.565*** [0.111] 0.724*** [0.025] -0.029 [0.194] -0.110 [0.151] -0.156 [0.250] 0.155** [0.044] 0.129*** [0.004] 2,356 0.001 1,968 0.001 1,550 0.004 1,143 0.007 690 0.006 4,887 0.004 4,068 0.006 3,209 0.003 2,377 0.006 1,498 0.003 After 0.107 [0.321] 0.455 [0.268] 0.471** [0.172] 0.565*** [0.111] 0.724*** [0.025] -0.029 [0.194] -0.110 [0.151] -0.156 [0.250] 0.155** [0.044] 0.129*** [0.004] After*Treatment 0.184 [0.267] 0.095 [0.270] 0.169 [0.197] 0.416 [0.210] 0.458*** [0.033] 0.204 [0.184] 0.466** [0.167] 0.651** [0.220] 0.446*** [0.102] 0.708*** [0.013] 7,650 0.040 6,344 0.042 4,989 0.046 3,669 0.050 2,286 0.044 13,270 0.041 10,977 0.043 8,625 0.046 6,286 0.043 3,919 0.042 (1) Panel A: Treated years 0.291 After [0.275] Sample size R-squared 5,294 0.017 Panel B: Control years 0.107 After [0.321] Sample size R-squared Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 5: Effects of Educational Expansion on Mortality Rate due to Preventable Diseases Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 0.000 [0.002] -0.000 [0.003] -0.001 [0.003] -0.001 [0.003] -0.003 [0.004] -0.001 [0.003] 0.001 [0.002] 0.000 [0.003] -0.000 [0.004] 0.000 [0.006] -0.003 [0.012] 0.000 [0.002] Sample size 2,154 1,434 1,074 714 354 1,236 2,070 1,350 990 630 270 2,105 R-squared 0.023 0.022 0.020 0.024 0.047 0.021 0.025 0.026 0.025 0.031 0.056 0.025 -0.002 [0.002] -0.002 [0.003] -0.002 [0.003] -0.003 [0.004] -0.010* [0.006] -0.002 [0.003] -0.000 [0.002] 0.002 [0.003] 0.003 [0.003] 0.005 [0.003] 0.002 [0.006] -0.001 [0.002] Sample size 1,077 717 537 357 177 618 1,035 675 495 315 135 1,053 R-squared 0.025 0.025 0.026 0.027 0.067 0.026 0.019 0.021 0.021 0.017 0.016 0.021 After -0.002 [0.002] -0.002 [0.003] -0.002 [0.003] -0.003 [0.004] -0.010* [0.006] -0.002 [0.003] -0.000 [0.002] 0.002 [0.003] 0.003 [0.003] 0.005 [0.003] 0.002 [0.006] -0.001 [0.002] After*Treatment 0.002 [0.003] 0.001 [0.004] 0.001 [0.004] 0.002 [0.005] 0.007 [0.006] 0.001 [0.004] 0.001 [0.003] -0.002 [0.005] -0.004 [0.006] -0.005 [0.008] -0.005 [0.014] 0.001 [0.003] 3,231 0.057 2,151 0.056 1,611 0.051 1,071 0.049 531 0.070 1,854 0.053 3,105 0.061 2,025 0.061 1,485 0.058 945 0.055 405 0.070 3,158 0.059 After Panel B: Control years After Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. The preventable causes of death include: Lung cancer (C33C34), Cirrhosis of liver (K70, K74.3-K74.6), External causes of death (V, W, X, Y). Appendix Table 6: Effects of Educational Expansion on Mortality Rate due to Treatable Diseases Full sample Bandwidth (days) Excluding 7 days on each side of cutoff 180 120 90 60 30 IK 180 120 90 60 30 IK (1) Panel A: Treated years (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 0.002 [0.002] 0.001 [0.002] -0.000 [0.002] -0.001 [0.002] -0.003 [0.003] 0.001 [0.002] 0.002 [0.002] 0.002 [0.002] 0.001 [0.003] 0.001 [0.004] -0.002 [0.008] 0.002 [0.002] Sample size 2,154 1,434 1,074 714 354 1,638 2,070 1,350 990 630 270 1,974 R-squared 0.106 0.102 0.095 0.096 0.101 0.103 0.111 0.108 0.103 0.109 0.133 0.110 -0.002 [0.001] -0.002 [0.001] -0.003** [0.002] -0.004** [0.002] -0.007*** [0.002] -0.002 [0.001] -0.000 [0.001] -0.000 [0.002] -0.001 [0.002] 0.001 [0.002] 0.001 [0.005] -0.000 [0.001] Sample size 1,077 717 537 357 177 819 1,035 675 495 315 135 987 R-squared 0.045 0.045 0.051 0.055 0.094 0.043 0.038 0.031 0.031 0.027 0.026 0.037 After -0.002 [0.001] -0.002 [0.001] -0.003** [0.002] -0.004** [0.002] -0.007*** [0.002] -0.002 [0.001] -0.000 [0.001] -0.000 [0.002] -0.001 [0.002] 0.001 [0.002] 0.001 [0.005] -0.000 [0.001] After*Treatment 0.003 [0.002] 0.003 [0.002] 0.003 [0.003] 0.003 [0.003] 0.004 [0.004] 0.003 [0.002] 0.003 [0.002] 0.002 [0.003] 0.002 [0.004] 0.001 [0.005] -0.003 [0.011] 0.002 [0.003] 3,231 0.224 2,151 0.218 1,611 0.211 1,071 0.200 531 0.191 2,457 0.220 3,105 0.228 2,025 0.224 1,485 0.216 945 0.208 405 0.201 2,961 0.228 After Panel B: Control years After Panel C: All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Treatable causes of death (cf. ICD 10) include: Tuberculosis (A15-A19. B90), Malignant neoplasm of cervix uteri (C53); Chronic rheumatic heart disease (I05-I09); All respiratory diseases (J00-J99); Asthma (J45, J46); Appendicitis (K35-K38); Abdominal hernia (K40-K46); Hypertensive and cerebrovascular disease (I10-I15, I60-I69); Chollelthiasis and cholecystitis (K80-K81). Appendix Table 7: Effect of Educational Expansion on Fertility dependent variable: number of children bandwidth (days) 180 (1) 120 (2) Full sample 90 60 (3) (4) 30 (5) IK (6) 180 (1) Excluding 7 days on each side of cutoff 120 90 60 30 (2) (3) (4) (5) IK (6) Treated years After -0.034 [0.023] -0.061** -0.089*** -0.118*** -0.119*** -0.120*** [0.027] [0.030] [0.033] [0.032] [0.034] -0.012 [0.027] -0.035 [0.037] -0.068 [0.047] -0.110 [0.067] -0.132 [0.138] -0.113 [0.071] Sample size R-squared 119,118 0.001 76,505 0.001 55,473 0.001 35,243 0.001 16,431 0.001 28,114 0.001 114,832 0.001 72,219 0.000 51,187 0.000 30,957 0.000 12,145 0.001 29,171 0.000 After 0.048 [0.036] 0.084* [0.046] 0.096* [0.054] 0.123* [0.065] 0.171* [0.088] 0.150** [0.072] 0.027 [0.039] 0.062 [0.054] 0.068 [0.069] 0.091 [0.099] 0.255 [0.212] 0.100 [0.104] Sample size R-squared 67,719 0.001 44,737 0.001 32,800 0.001 20,860 0.001 9,724 0.002 16,608 0.002 65,295 0.001 42,313 0.001 30,376 0.001 18,436 0.001 7,300 0.002 17,400 0.001 0.048 [0.036] 0.084* [0.046] 0.096* [0.054] 0.123* [0.065] 0.171* [0.088] 0.150** [0.072] 0.027 [0.039] 0.062 [0.054] 0.068 [0.069] 0.091 [0.099] 0.255 [0.212] 0.100 [0.104] Control years All years After After*Treatment -0.082** -0.145*** -0.185*** -0.241*** -0.290*** -0.270*** [0.038] [0.047] [0.055] [0.065] [0.088] [0.072] -0.039 [0.042] -0.097* [0.056] -0.136* [0.072] -0.201* [0.103] -0.388* [0.222] -0.214* [0.109] Sample size R-squared 186,837 0.001 180,127 0.001 114,532 0.001 81,563 0.001 49,393 0.001 19,445 0.002 46,571 0.001 121,242 0.001 88,273 0.001 56,103 0.001 26,155 0.001 44,722 0.001 Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 8: Effect of Educational Expansion on International Migration in the 2011 census dependent variable: migration rate bandwidth (days) Full sample 90 60 (3) (4) 180 (1) 120 (2) -0.000 [0.000] -0.000 [0.000] -0.000 [0.000] Sample size 2,170 1,448 R-squared 0.182 Excluding 7 days on each side of cutoff 120 90 60 30 (2) (3) (4) (5) 30 (5) IK (6) 180 (1) IK (6) -0.001 [0.001] 0.000 [0.001] -0.000 [0.000] -0.001* [0.000] -0.001** [0.000] -0.001* [0.000] -0.001** [0.001] -0.001 [0.001] -0.001* [0.000] 1,085 718 354 1,430 2,086 1,364 1,001 634 270 1,672 0.191 0.195 0.197 0.202 0.191 0.176 0.182 0.184 0.181 0.168 0.178 -0.000 [0.001] -0.000 [0.001] -0.001 [0.001] -0.001 [0.001] -0.002** [0.001] -0.000 [0.001] 0.000 [0.001] 0.000 [0.001] 0.000 [0.001] 0.001 [0.001] -0.001 [0.001] 0.000 [0.001] Sample size 1,080 719 538 357 177 710 1,038 677 496 315 135 831 R-squared 0.090 0.096 0.096 0.089 0.119 0.096 0.099 0.105 0.104 0.105 0.133 0.101 After -0.000 [0.001] -0.000 [0.001] -0.001 [0.001] -0.001 [0.001] -0.002** [0.001] -0.000 [0.001] 0.000 [0.001] 0.000 [0.001] 0.000 [0.001] 0.001 [0.001] -0.001 [0.001] 0.000 [0.001] After*Treatment -0.000 [0.001] -0.000 [0.001] 0.000 [0.001] 0.000 [0.001] 0.002** [0.001] -0.000 [0.001] -0.001 [0.001] -0.001 [0.001] -0.001 [0.001] -0.002* [0.001] 0.000 [0.002] -0.001 [0.001] 3,250 0.364 2,167 0.367 1,623 0.371 1,075 0.382 531 0.420 2,140 0.367 3,124 0.355 2,041 0.351 1,497 0.348 949 0.351 405 0.353 2,503 0.352 Treated years After Control years After All years Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 9: Effect of Educational Expansion on Attrition between the 1992 and 2011 census dependent variable: attrition rate bandwidth (days) Full sample 90 60 (3) (4) 180 (1) 120 (2) -0.008 [0.025] -0.017 [0.031] -0.020 [0.035] Sample size 2,155 1,435 R-squared 0.017 Excluding 7 days on each side of cutoff 120 90 60 30 (2) (3) (4) (5) 30 (5) IK (6) 180 (1) IK (6) -0.015 [0.040] -0.026 [0.043] -0.020 [0.036] -0.012 [0.029] -0.027 [0.040] -0.035 [0.051] -0.033 [0.073] -0.102 [0.151] -0.018 [0.034] 1,075 714 354 1,045 2,071 1,351 991 630 270 1,645 0.023 0.030 0.038 0.075 0.031 0.014 0.015 0.017 0.019 0.049 0.014 -0.058** [0.026] -0.060* [0.031] -0.064* [0.035] -0.071* [0.039] -0.156*** [0.054] -0.065* [0.035] -0.031 [0.029] -0.016 [0.036] -0.003 [0.041] 0.034 [0.045] -0.037 [0.084] -0.023 [0.033] Sample size 1,078 718 538 357 177 523 1,036 676 496 315 135 823 R-squared 0.015 0.023 0.027 0.023 0.061 0.027 0.010 0.017 0.021 0.015 0.010 0.014 -0.058** [0.026] -0.060* [0.031] -0.064* [0.035] -0.071* [0.039] -0.156*** [0.054] -0.065* [0.035] -0.031 [0.029] -0.016 [0.036] -0.003 [0.041] 0.034 [0.045] -0.037 [0.084] -0.023 [0.033] 0.050 [0.039] 0.043 [0.047] 0.044 [0.053] 0.057 [0.059] 0.130** [0.060] 0.045 [0.053] 0.019 [0.045] -0.011 [0.060] -0.032 [0.074] -0.067 [0.100] -0.065 [0.198] 0.005 [0.052] 3,233 0.022 2,153 0.028 1,613 0.036 1,071 0.045 531 0.088 1,568 0.037 3,107 0.016 2,027 0.018 1,487 0.022 945 0.023 405 0.043 2,468 0.017 Treated years After Control years After All years After After*Treatment Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. The IK bandwidth is the Imbens and Kalyanarman (2012) optimal bandwidth. Appendix Table 10: Effects of Educational Expansion on Employment and Income in the 1995-2000 LSMS data Employment Bandwidth (months) 6 5 4 3 2 6 5 4 3 2 (2) (3) (4) (5) (7) (8) (9) (10) (11) 0.0127 [0.0096] 0.0188* [0.0094] 0.0227* [0.0096] 0.0341*** [0.0011] -0.0130 [0.0260] 0.0079 [0.0232] 0.0122 [0.0266] 0.0299 [0.0209] 0.0675*** [0.0009] 33,512 0.016 26,211 0.015 19,089 0.014 12,005 0.013 40,234 0.010 33,245 0.010 26,012 0.010 18,955 0.009 11,920 0.010 -0.0132 [0.0112] -0.0224 [0.0143] -0.0188 [0.0130] -0.0131*** [0.0015] -0.0337 [0.0412] -0.0275 [0.0426] -0.0286 [0.0506] 0.0105 [0.0438] 0.0905*** [0.0021] 23,158 0.001 19,268 0.001 15,220 0.001 11,251 0.001 7,129 0.001 22,999 0.001 19,140 0.002 15,119 0.001 11,178 0.002 7,092 0.004 -0.0239* [0.0111] -0.0132 [0.0112] -0.0224 [0.0143] -0.0188 [0.0130] -0.0131*** [0.0015] -0.0337 [0.0412] -0.0275 [0.0426] -0.0286 [0.0506] 0.0105 [0.0438] 0.0905*** [0.0021] 0.0234*** [0.0070] 0.0259** [0.0082] 0.0412*** [0.0064] 0.0415*** [0.0044] 0.0473*** [0.0018] 0.0207 [0.0202] 0.0355 [0.0217] 0.0408 [0.0282] 0.0195 [0.0229] -0.0230*** [0.0013] 63,716 0.021 52,780 0.021 41,431 0.020 30,340 0.020 19,134 0.019 63,233 0.010 52,385 0.010 41,131 0.010 30,133 0.009 19,012 0.011 (1) Panel A: Treated years -0.0006 After [0.0120] Sample size R-squared 40,558 0.016 Panel B: Control years -0.0239* After [0.0111] Sample size R-squared Log Income Panel C: All years After After*Treatment Sample size R-squared Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion. Appendix Table 11: Effect of Educational Expansion on Smoking and Chronic Conditions dependent variable: smoking (columns 1-5) and chronic conditions (columns 7-12) Full sample - smoking Bandwidth (months) Full sample - chronic condition 6 5 4 3 2 6 5 4 3 2 (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) 0.0092* [0.0051] 0.0092* [0.0044] 0.0095** [0.0040] 0.0117** [0.0041] 0.0067 [0.0045] 0.0099** [0.0044] 0.0101* [0.0050] 0.0107* [0.0054] 0.0116* [0.0054] 0.0151** [0.0050] 113,367 94,611 73,720 53,302 32,486 113,367 94,611 73,720 53,302 32,486 0.003 0.003 0.003 0.003 0.003 0.001 0.001 0.001 0.001 0.001 0.0017 [0.0060] -0.0006 [0.0064] 0.0007 [0.0071] 0.0031 [0.0058] -0.0004 [0.0030] 0.0017 [0.0063] 0.0016 [0.0060] 0.0040 [0.0051] 0.0032 [0.0047] 0.0087 [0.0054] 64,852 54,126 42,289 30,730 19,048 64,852 54,126 42,289 30,730 19,048 0.001 0.001 0.001 0.001 0.001 0.001 0.001 0.000 0.001 0.001 After 0.0017 [0.0060] -0.0006 [0.0064] 0.0007 [0.0071] 0.0031 [0.0058] -0.0004 [0.0030] 0.0017 [0.0063] 0.0016 [0.0060] 0.0040 [0.0051] 0.0032 [0.0047] 0.0087 [0.0054] After*Treatment 0.0075 [0.0066] 0.0097 [0.0073] 0.0088 [0.0085] 0.0086 [0.0089] 0.0071 [0.0075] 0.0082 [0.0079] 0.0084 [0.0081] 0.0068 [0.0083] 0.0083 [0.0074] 0.0064 [0.0093] Sample size R-squared 178,219 0.006 148,737 0.006 116,009 0.006 84,032 0.006 51,534 0.006 178,219 0.001 148,737 0.001 116,009 0.001 84,032 0.001 51,534 0.001 Treated years After Sample size R-squared Control years After Sample size R-squared All years Notes: Heteroskedacticity-robust standard errors clustered by day of birth are in parentheses. ***, **, and * indicate statistical significance at the 1, 5, and 10 percent level respectively. After is an indicator for individuals born after January 1. Treatment is an indicator that equals 1 for cohorts who experienced an education expansion.